BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019809
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 261/334 (78%), Positives = 301/334 (90%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
M DIDEKQL+LYAQ+ANLVNLIL+WP+++I EIAENFSKLACNAHTIC+SELRPLGTGLY
Sbjct: 147 MKDIDEKQLVLYAQMANLVNLILRWPDVNIKEIAENFSKLACNAHTICDSELRPLGTGLY 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSCLPNAVLVF+GRLAVV VQH+PKG+EVLISYIETAGSTMTRQKALK+QY
Sbjct: 207 PVVSIINHSCLPNAVLVFDGRLAVVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYF 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRCIK+G DDIQESAILEGYRCKD+ C+GFLLRDSDD+GF CQQCGL+RSKEE+
Sbjct: 267 FTCTCPRCIKMGLLDDIQESAILEGYRCKDNRCNGFLLRDSDDRGFICQQCGLLRSKEEV 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
KK A+E+ S K S GN QE VS YK+IEKLQ+KL HPFS +LMQTREKL+K+LM
Sbjct: 327 KKSAAEIKATSDKASKSISSGNLQEAVSIYKLIEKLQRKLCHPFSTSLMQTREKLLKMLM 386
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
ELEDW EAL+YC+LTIPVYQRVYP+FHPLLGLQY++CGKLEW LGDTE AIKS+T+A+++
Sbjct: 387 ELEDWGEALSYCKLTIPVYQRVYPEFHPLLGLQYFSCGKLEWLLGDTEAAIKSLTKALDV 446
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
LRITHGT +PFMKEL++KLEEA+AE SY +S D
Sbjct: 447 LRITHGTKTPFMKELMMKLEEARAEVSYLRASND 480
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 566 bits (1459), Expect = e-159, Method: Compositional matrix adjust.
Identities = 262/335 (78%), Positives = 305/335 (91%), Gaps = 1/335 (0%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
++DIDEKQL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLY
Sbjct: 147 ITDIDEKQLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLY 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYL
Sbjct: 207 PVISIINHSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYL 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEI
Sbjct: 267 FTCTCPRCRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEI 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K++ASE+ LS K ++S ++ E S YKMIEKLQ KL+HPFS+NLM+TRE ++KILM
Sbjct: 327 KRLASELKPLSDKA-TMSSSSHYVEATSIYKMIEKLQTKLFHPFSINLMRTREAILKILM 385
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
E++DW+ AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +I
Sbjct: 386 EMKDWRAALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADI 445
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
L+ITHGTN+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 446 LQITHGTNTPFMKELLFKLEEARAEASHKRSSKDK 480
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 261/335 (77%), Positives = 302/335 (90%), Gaps = 4/335 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
++DIDEKQL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLY
Sbjct: 147 ITDIDEKQLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLY 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYL
Sbjct: 207 PVISIINHSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYL 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEI
Sbjct: 267 FTCTCPRCRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEI 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K++ASE+ LS K +S + S YKMIEKLQ KL+HPFS+NLM+TRE ++KILM
Sbjct: 327 KRLASELKPLSDKATMSSS----SQATSIYKMIEKLQTKLFHPFSINLMRTREAILKILM 382
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
E++DW+ AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +I
Sbjct: 383 EMKDWRAALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADI 442
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
L+ITHGTN+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 443 LQITHGTNTPFMKELLFKLEEARAEASHKRSSKDK 477
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 546 bits (1407), Expect = e-153, Method: Compositional matrix adjust.
Identities = 256/333 (76%), Positives = 299/333 (89%), Gaps = 2/333 (0%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
D+DEKQL+LYAQ+ANLV+ ILQWPEI++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPV
Sbjct: 128 DLDEKQLVLYAQMANLVHFILQWPEINLKEIAENFSKLACNAHTICDCELRPLGTGLYPV 187
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
+SIINHSC+PNAVL FEG+ +VVRAV+H+P+GAEV I+YI+TAGSTMTRQKALKEQY FT
Sbjct: 188 VSIINHSCMPNAVLTFEGKSSVVRAVEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFT 247
Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
CTCPRCIK+ +DDIQESAILEGYRCKDD C+GFLLRDS+DKGF CQ CGL RSKEE+K+
Sbjct: 248 CTCPRCIKV--YDDIQESAILEGYRCKDDRCNGFLLRDSEDKGFICQTCGLRRSKEEVKR 305
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
I E+ +S K L TS GNH+EV+S YKMIEKLQ +L HPFS++LM+T+E+L+KILMEL
Sbjct: 306 IVCEITAISDKKLKSTSPGNHEEVISLYKMIEKLQMELCHPFSISLMRTQEELLKILMEL 365
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
DW+EALAYC+LTI YQRVYP+ HPLLGLQYYTCGK+EW LG TE+AIKS+T AV+ILR
Sbjct: 366 GDWREALAYCRLTITGYQRVYPEPHPLLGLQYYTCGKIEWLLGYTEDAIKSLTRAVDILR 425
Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
ITHGTNSPFMKEL++KL+EA AEASY LSSKDE
Sbjct: 426 ITHGTNSPFMKELMMKLDEAHAEASYNLSSKDE 458
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 248/334 (74%), Positives = 289/334 (86%), Gaps = 1/334 (0%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MSDI E+QL+LYAQ+ANLVN IL+WP I+I EIAENFSK ACNAHTIC+SELRP+GTGLY
Sbjct: 152 MSDITEEQLVLYAQMANLVNSILEWPGINIKEIAENFSKFACNAHTICDSELRPVGTGLY 211
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSCLPN+VLVFEG A+VRAVQH+P G EVLISYIETA STMTRQKALKEQYL
Sbjct: 212 PVISIINHSCLPNSVLVFEGSSALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYL 271
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC K+GQ+DDIQESAILEGY+CK + C GFLLR +D KGF CQ CGL+R KEEI
Sbjct: 272 FTCTCPRCSKVGQYDDIQESAILEGYKCKSEKCGGFLLRTTDGKGFQCQGCGLIRDKEEI 331
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K+I +E+ +LS+ + S +QE +S YK IEKLQ +L+HP S+NLM TREK++K LM
Sbjct: 332 KRITTEIKLLSEDA-SKPSATYYQEAISIYKRIEKLQTELFHPLSINLMHTREKILKSLM 390
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
ELE W EALAYC+LTIP YQRVYP HPL GLQYYTCGKLEW+LGDTE A+KS+T+AV+I
Sbjct: 391 ELEHWTEALAYCKLTIPFYQRVYPAVHPLPGLQYYTCGKLEWYLGDTEEAVKSLTKAVDI 450
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
LRITHGTN+PFMK+L++KLEEA+ EASYK SSK+
Sbjct: 451 LRITHGTNTPFMKDLLMKLEEARTEASYKFSSKE 484
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 246/335 (73%), Positives = 287/335 (85%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MS IDEKQL+LYAQ+ LVN ILQWP +++ EIA NFSKLACNAHTIC+SELRPLGTGLY
Sbjct: 147 MSGIDEKQLVLYAQMGTLVNFILQWPGMNVKEIAVNFSKLACNAHTICDSELRPLGTGLY 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSCLPNAVLVFEGR AVVRAVQH+P GAEV ISYIETAGSTMTRQK LKE YL
Sbjct: 207 PVISIINHSCLPNAVLVFEGRTAVVRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYL 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTC RC+K+ Q D+I+ESAILEGYRC++D C GFLLR+SDD GFTCQQCGLVRSKE+I
Sbjct: 267 FTCTCSRCVKVAQEDEIKESAILEGYRCRNDQCDGFLLRNSDDTGFTCQQCGLVRSKEDI 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K IAS++ +S + S ++ E + Y+ +EKLQ+ L HP+S++LMQTREKL+KI M
Sbjct: 327 KNIASKIKSISDEASTSLSSQSYAEALFMYEKVEKLQRILCHPYSISLMQTREKLLKISM 386
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
ELE+W +AL YC+LTI VYQ++YP HPLLGLQ+YTCGKLEW LG TE+AIKS T+A +I
Sbjct: 387 ELENWTKALTYCKLTISVYQKLYPGIHPLLGLQFYTCGKLEWLLGHTEDAIKSYTKAFDI 446
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
LRITHGTNS FMKEL+LKLEEA+AEASYKLSS D+
Sbjct: 447 LRITHGTNSSFMKELLLKLEEARAEASYKLSSTDD 481
>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR1-like [Glycine max]
Length = 401
Score = 515 bits (1327), Expect = e-144, Method: Compositional matrix adjust.
Identities = 242/334 (72%), Positives = 285/334 (85%), Gaps = 3/334 (0%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MSDI E+QL+LYA++ANLVN ILQWPEI+I EIAENFSK ACNAHTIC+SELRP+GTGLY
Sbjct: 71 MSDITEEQLVLYAKMANLVNSILQWPEINIKEIAENFSKFACNAHTICDSELRPVGTGLY 130
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSCLPN+VLVFEGR A+V VQH+P G EV ISYIETAGSTMT+Q ALK
Sbjct: 131 PVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNALKSSTX 190
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC K+G++DDIQESAILEGY+CK + C GFLL KGF CQ CGLVR KEEI
Sbjct: 191 FTCTCPRCSKVGEYDDIQESAILEGYKCKSEKCGGFLLY---GKGFQCQGCGLVRDKEEI 247
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K+I +E+ +LS++ + +S N+QE +S YK IEKLQ +LYHP SVNLMQ REK++K LM
Sbjct: 248 KRITTEIKLLSEEKIISSSTCNYQEAISIYKWIEKLQTELYHPLSVNLMQNREKILKSLM 307
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
ELE W EALAYC+LTIP YQRVYP HPLLGLQYYTCGKLEW+LGDT+ A+KS+ +AV+I
Sbjct: 308 ELEHWAEALAYCKLTIPFYQRVYPAVHPLLGLQYYTCGKLEWYLGDTDEAVKSLIKAVDI 367
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
LRITHGTN+PFMK+L++KLEEA+AEASY+LS K+
Sbjct: 368 LRITHGTNTPFMKDLLMKLEEARAEASYRLSPKE 401
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 237/328 (72%), Positives = 286/328 (87%), Gaps = 1/328 (0%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA FSK ACNAHT+C+SELRPLGTGLY
Sbjct: 180 MSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLY 239
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSCLPN+VLVF+GR A VRA+QH+PKG EVLISYIETAGST+TRQKAL+EQYL
Sbjct: 240 PVVSIINHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYL 299
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
F C CP C K+GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+CGLVR KEEI
Sbjct: 300 FQCVCPLCSKVGQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEI 359
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K+IA+E+ L ++ +S +H E +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM
Sbjct: 360 KQIATEIKFLLEEASKPSSNDSH-EAISIHKMIEKLQTKLYHPFSINLMQTRETILKSLM 418
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+LE W+EALAYC+LTIP+YQRVYP HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+I
Sbjct: 419 KLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDI 478
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASY 328
LRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 479 LRITHGTKTPFVKELSMMLVEARAEASF 506
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 218/331 (65%), Positives = 277/331 (83%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MS+IDEKQ+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+
Sbjct: 147 MSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLF 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
P++SIINHSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYL
Sbjct: 207 PLVSIINHSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYL 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
F C C RC +G+ DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+
Sbjct: 267 FHCQCARCSNVGKPHDIEESAILEGYRCSNEKCTGFLLRDPEEKGFVCQKCLLLRSKEEV 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
KK+AS++ +S+K A S N Q + YK IEKLQ KLYH FS+ LM+TREKL+K+LM
Sbjct: 327 KKLASDLKTVSEKAHASPSAENKQAAIELYKTIEKLQVKLYHSFSITLMRTREKLLKMLM 386
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+LE W+EAL YC+L +PVYQRVYP HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +I
Sbjct: 387 DLEIWREALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDI 446
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLS 331
LRI+HGT++PFMKEL KLEEA+AEASYKL+
Sbjct: 447 LRISHGTSTPFMKELSAKLEEARAEASYKLA 477
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 213/329 (64%), Positives = 272/329 (82%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MS+IDEKQ+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+
Sbjct: 147 MSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLF 206
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
P++SIINHSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYL
Sbjct: 207 PLVSIINHSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYL 266
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
F C C RC G+ DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+
Sbjct: 267 FHCQCARCSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEV 326
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
KK+AS++ +S+K S + Q + YK IEKLQ KLYH FS+ LM+TREKL+K+LM
Sbjct: 327 KKLASDLKTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLM 386
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++E W+EAL YC+L +PVYQRVYP HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +I
Sbjct: 387 DVEIWREALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDI 446
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYK 329
LRI+HG ++PFMKEL KLEEA+AEASYK
Sbjct: 447 LRISHGISTPFMKELSAKLEEARAEASYK 475
>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
Length = 477
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 210/328 (64%), Positives = 256/328 (78%), Gaps = 35/328 (10%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA FSK ACNAHT+C+SELRPLGTGLY
Sbjct: 180 MSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLY 239
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSCLPN+VLVF+GR A+VRA+QH+PKG E
Sbjct: 240 PVVSIINHSCLPNSVLVFDGREALVRALQHIPKGTE------------------------ 275
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+CGLVR KEEI
Sbjct: 276 -----------GQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEI 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K+IA+E+ L ++ +S GN E +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM
Sbjct: 325 KQIATEIKFLLEEASKPSSNGNSHEAISIHKMIEKLQTKLYHPFSINLMQTRETILKSLM 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+LE W+EALAYC+LTIP+YQRVYP HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+I
Sbjct: 385 KLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDI 444
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASY 328
LRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 445 LRITHGTKTPFVKELSMMLVEARAEASF 472
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 209/335 (62%), Positives = 262/335 (78%), Gaps = 6/335 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV ISYIETA +TM RQ LK Y
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K D +E A+LEGYRC D C GFLL ++ +KG+TCQ+C R EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
+K+AS+V +LS K +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI M
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYM 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
EL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L TE+A+ S+T A +I
Sbjct: 385 ELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADI 444
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
LRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 LRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 479
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 208/336 (61%), Positives = 262/336 (77%), Gaps = 7/336 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV ISYIETA +TM RQ LK Y
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K D +E A+LEGYRC D C GFLL ++ +KG+TCQ+C R EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324
Query: 181 KKIASEVNILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K+AS+V +LS K +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIY 384
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
MEL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L TE+A+ S+T A +
Sbjct: 385 MELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAAD 444
Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
ILRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 ILRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 480
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/336 (61%), Positives = 261/336 (77%), Gaps = 7/336 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELR LGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRSLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV ISYIETA +TM RQ LK Y
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K D +E A+LEGYRC D C GFLL ++ +KG+TCQ+C R EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324
Query: 181 KKIASEVNILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K+AS+V +LS K +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIY 384
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
MEL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L TE+A+ S+T A +
Sbjct: 385 MELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAAD 444
Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
ILRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 ILRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 480
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 201/335 (60%), Positives = 255/335 (76%), Gaps = 6/335 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+KQL+LYAQ+ANLV LIL E+ + E A FSK +CNAHTIC+ ELRP+GTGL+
Sbjct: 151 ISEVDDKQLVLYAQMANLVQLILPAIELDLKETAHIFSKFSCNAHTICDPELRPVGTGLF 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
P IS INHSC+PNAVL+FEGR A VRA+Q + EV ISYIETA +T+ R LK Y
Sbjct: 211 PAISTINHSCVPNAVLLFEGRTAYVRALQPLSNNTEVSISYIETAATTLKRHNDLK-HYF 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRCIK +E +LEGYRCKD C GFLL DS K +TCQ+C + R EE+
Sbjct: 270 FTCTCPRCIK-----GSEEDPLLEGYRCKDQNCDGFLLPDSGKKAYTCQKCSMCRDGEEV 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
KK++SE+ +LS K + S GN+ E S YK+IE+L++KLYH FS+ L+ TRE L+K+ M
Sbjct: 325 KKLSSEILLLSDKASSFVSSGNNNEAGSVYKIIEQLERKLYHSFSITLLHTRETLLKLYM 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
EL+DW+ AL YC+ TIPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+T A ++
Sbjct: 385 ELQDWRSALMYCRSTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSLTRAADV 444
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
LRITHGTNS FMKEL KLEEA+AE S++LS +E
Sbjct: 445 LRITHGTNSQFMKELFGKLEEARAEVSFRLSPGNE 479
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 204/333 (61%), Positives = 252/333 (75%), Gaps = 6/333 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSC+PNAVL+F+GR A VRA+Q + K EV ISYIETA T R LK QY
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYF 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K FD E A+LEG+RCK+ C GFLL +S K +TCQ+CG R EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEI 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K + SE+ LS K + S GN E S YK+IE+L++ LYH FS L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
EL+DW AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+T A +I
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADI 444
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
L++THG S FMKEL KLEEA+AE S++LSS+
Sbjct: 445 LKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 477
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/330 (60%), Positives = 250/330 (75%), Gaps = 6/330 (1%)
Query: 4 IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+DE L+LYAQ+ANLV+LIL E+ + EIA FSK ACNAHTIC+ ELRPLGTGLYPVI
Sbjct: 154 VDENHLVLYAQMANLVSLILPLIELDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVI 213
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
SIINHSC+PNAVL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTC
Sbjct: 214 SIINHSCVPNAVLIFDGRTAYVRALQPIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTC 272
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
+CPRC+K +E A+LEG+RCK+ C GFLL DS K +TCQ+C + R +EEI+K+
Sbjct: 273 SCPRCVK-----GSEEDALLEGFRCKNQTCDGFLLPDSGKKAYTCQKCSVSRDEEEIQKM 327
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
SE+ LS K + S GN E S YK IE+L++ LYH FS L+ T E L+KI MEL+
Sbjct: 328 RSEILQLSDKASSFLSSGNKAEAGSVYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQ 387
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ ALAYC+LT+PVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+T A +IL++
Sbjct: 388 DWRTALAYCRLTVPVYERVYPPFHPMIGLQFYTCGKLEWLLECTEDALKSLTRATDILKV 447
Query: 304 THGTNSPFMKELILKLEEAQAEASYKLSSK 333
THGT S FMKEL KLEEA+AE S+++SS+
Sbjct: 448 THGTKSQFMKELFGKLEEARAEVSFRISSR 477
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/333 (60%), Positives = 250/333 (75%), Gaps = 6/333 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSC+PNAVL+F+GR A VRA+Q + K EV ISYIETA T R LK QY
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPIDKDEEVSISYIETAAVTKKRNNDLK-QYF 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K FD E +LEG+RCK+ C GFLL S K +TCQ+CG R EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDPLLEGFRCKNQTCDGFLLPSSGKKAYTCQKCGASRDVEEI 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K + SE+ LS K + S GN E S YK+IE+L++ LYH FS L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
EL+DW AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+T A +I
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADI 444
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
L++THG S FMKEL KLEEA+AE S++LSS+
Sbjct: 445 LKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 477
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 175/281 (62%), Positives = 219/281 (77%), Gaps = 6/281 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV ISYIETA +TM RQ LK Y
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K D +E A+LEGYRC D C GFLL ++ +KG+TCQ+C R EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
+K+AS+V +LS K +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI M
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYM 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
EL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLE
Sbjct: 385 ELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLE 425
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 359 bits (922), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 174/283 (61%), Positives = 213/283 (75%), Gaps = 6/283 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISIINHSC+PNAVL+F+GR A VRA+Q + K EV ISYIETA T R LK QY
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYF 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K FD E A+LEG+RCK+ C GFLL +S K +TCQ+CG R EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEI 324
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K + SE+ LS K + S GN E S YK+IE+L++ LYH FS L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
EL+DW AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW+
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWY 427
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 165/290 (56%), Positives = 216/290 (74%), Gaps = 1/290 (0%)
Query: 40 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
+CNAHTIC+SELRP+GTGLYPVISIINHSC PNAVL+FEGR AVVRAV+ + +G+E+ +
Sbjct: 142 FSCNAHTICDSELRPMGTGLYPVISIINHSCFPNAVLLFEGRQAVVRAVEPIREGSELTV 201
Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 159
SYIE A ST +R+K+LKEQY F C C RC+K+ D + E AILEG+RC D C GFLL
Sbjct: 202 SYIEIAASTASRKKSLKEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDHCEGFLLH 261
Query: 160 DSDD-KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
D DD + CQ CG R++EE KK A +V+ L K+ L S GN+ E S Y+ I++LQ
Sbjct: 262 DPDDAQSLVCQLCGCGRNEEETKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQT 321
Query: 219 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
+L+HP+SV L++T + L+KI MEL DWK+AL YC+LTIP Y+R YP HP++GLQYY CG
Sbjct: 322 QLWHPYSVILLRTGDTLLKICMELYDWKQALKYCRLTIPAYERAYPTCHPMMGLQYYACG 381
Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 328
KLEWFL +T A+ +A +IL +THG NS F+ +L +++EA AEA++
Sbjct: 382 KLEWFLENTLEALNFFEKAAKILTVTHGRNSEFLTQLFDRIQEAHAEAAH 431
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 152/331 (45%), Positives = 221/331 (66%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
MS+ E +L++YAQ+A++V ++ E+++ E+ + + ACNAHTIC+ E+RPLGTGLY
Sbjct: 154 MSETSEDRLVMYAQMASIVQQMMAPDEVNVKEVTQMICRFACNAHTICDEEVRPLGTGLY 213
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PVISI+NHSC+PNAVL F+G A +RA++ +G E+ ISY+E A ST TR+KAL++QY
Sbjct: 214 PVISIVNHSCVPNAVLHFDGNRAALRALEDTQEGTEITISYVELAASTNTRRKALRDQYY 273
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
F C C RC +L + +E A LEGY C + C+G L+ D C+ CGL R ++
Sbjct: 274 FDCNCIRCSRLVTREGSREDAFLEGYGCVNSDCNGPLIEDPGSDKVICEVCGLKREVQQT 333
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K A EV + + L + G + Y +E Q+KL+HP+SV L++T + L+KI M
Sbjct: 334 KSAAKEVELDVLEASNLYAAGKLESARRLYSEVEAKQRKLWHPYSVPLLRTHDALLKICM 393
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++EDW AL +CQ TIP Y+R YP F PLLGLQY+T GKL+W LGD+ A+ ++ +A +
Sbjct: 394 DMEDWASALEFCQSTIPAYERAYPPFSPLLGLQYFTLGKLQWLLGDSAKAVNTLRKAYTV 453
Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLS 331
+++THG+ S + L L+EAQAE +YK S
Sbjct: 454 IQVTHGSKSELLHGLTSTLQEAQAEVAYKRS 484
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 166/332 (50%), Positives = 207/332 (62%), Gaps = 55/332 (16%)
Query: 7 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
K+ L ++ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISII
Sbjct: 124 KRKLQNEKMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISII 183
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NHSC+PNAVL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTCTCP
Sbjct: 184 NHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCP 242
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
RC+K FD E A+LEG+RCK+ C GFLL +SD
Sbjct: 243 RCVK--GFD---EDALLEGFRCKNQACDGFLLPNSDK----------------------- 274
Query: 187 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
E S YK+IE+L++ LYH FS L+ T E L+KI +EL+DW
Sbjct: 275 -----------------AEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWW 317
Query: 247 EALAYCQLTIPVYQR-----VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
AL YC+LTIPVY+ P P + Y G LE+ TE+A+KS+T A +IL
Sbjct: 318 TALTYCRLTIPVYESKAWRLKIPFPRPRTVREAYGTGLLEF----TEDALKSLTRAADIL 373
Query: 302 RITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
++THG S FMKEL KLEEA+AE S++LSS+
Sbjct: 374 KVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 405
>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
Length = 453
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 210/331 (63%), Gaps = 7/331 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
S+ +++L++YAQ+A L+ IL ++ E + EI ++ +++CN HTIC+ ELRP+G G
Sbjct: 115 FSETGDERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIG 173
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
L+PV+S+INHSC N++L+F+G+ AVVRA+ + +G EV +SYIE ST +R++AL +Q
Sbjct: 174 LFPVVSVINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQ 233
Query: 119 YLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
Y F C CPRC + Q + + +LE C D C F+ + + C CG R
Sbjct: 234 YYFHCKCPRCTDVSPQAHGLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREP 291
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
E+ K+++E + +K L + G+ ++ EKLQ +L++P SV LM+TR+ L++
Sbjct: 292 NEVNKLSTEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLR 351
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
+ + LEDW AL C+LT+P Y+ YP HPLLGLQYYT GKLE G A+++ +
Sbjct: 352 VYLSLEDWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAK 411
Query: 297 AVEILRITHGTNSPFMKELILKLEEAQAEAS 327
A +IL +THG+ F+++L +L++A+ AS
Sbjct: 412 AFDILSVTHGSRGEFVRKLRNELDQAKMVAS 442
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 133/330 (40%), Positives = 209/330 (63%), Gaps = 7/330 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
S+ +++L++YAQ+A L+ IL ++ E + EI ++ +++CN HTIC+ ELRP+G G
Sbjct: 148 FSETGDERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIG 206
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
L+PV+SIINHSC N++L+F+G+ AVVRA+ + +G EV +SYIE ST +R++AL +Q
Sbjct: 207 LFPVVSIINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQ 266
Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
Y F C CPRC + + + +LE C D C F+ + + C CG R
Sbjct: 267 YYFHCKCPRCTDDSEA-GLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREPN 323
Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
E+ K+++E + +K L + G+ ++ EKLQ +L++P SV LM+TR+ L+++
Sbjct: 324 EVNKLSTEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLRV 383
Query: 239 LMELEDWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ LEDW AL C+LT+P Y+ YP HPLLGLQYYT GKLE G A+++ +A
Sbjct: 384 YLSLEDWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAKA 443
Query: 298 VEILRITHGTNSPFMKELILKLEEAQAEAS 327
+IL +THG+ F+++L +L++A+ AS
Sbjct: 444 FDILSVTHGSRGEFVRKLRNELDQAKMVAS 473
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 122/202 (60%), Positives = 152/202 (75%), Gaps = 6/202 (2%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV ISYIETA +TM RQ LK Y
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTCTCPRC+K D +E A+LEGYRC D C GFLL ++ +KG+TCQ+C R EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324
Query: 181 KKIASEVNILSKKTLALTSCGN 202
+K+AS+V +LS K +L S G+
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGS 346
>gi|326496881|dbj|BAJ98467.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 153
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 86/140 (61%), Positives = 111/140 (79%)
Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
L S ++ E S YK+IE+L++K YH FS+ L+ TRE L+KI MEL+DW+ AL YC+LT
Sbjct: 11 TLISDTDNSEAGSVYKIIEQLEQKHYHSFSITLLHTRETLLKIYMELQDWQTALMYCRLT 70
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
IPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+K +T A ++LRITHGT S FMKEL
Sbjct: 71 IPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKLLTRAADVLRITHGTQSQFMKEL 130
Query: 316 ILKLEEAQAEASYKLSSKDE 335
KLEEA+AE S++LSS +E
Sbjct: 131 FGKLEEARAEVSFRLSSGEE 150
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 166/313 (53%), Gaps = 17/313 (5%)
Query: 11 LYAQIANLVNLILQWPEIS-INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
++AQ+A ++ + + + EI E F ++ CN +IC+ E++ +G G+YP +S+ NHS
Sbjct: 140 MFAQLAVVLRMYVGKDVMDDAREIFELFGRMTCNTFSICDPEMQYIGIGIYPKMSLFNHS 199
Query: 70 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
C PN V VF G VRA+Q++ G E+LISY+E + R++ L +QY FTC CPRC
Sbjct: 200 CEPNCVAVFNGLRMEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRC- 258
Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
D + ++ +C + C +++ D TC+ C E+ K EVN
Sbjct: 259 -----QDQTKDGMMMAVKCGNINCKKVIIQ-VDGAYETCKVC--CHDNEKDPKFWKEVNK 310
Query: 190 LSKKT----LALTSCGNHQEVVSTYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMEL 242
+++ + A+TS QE ++ E+L Q+++ H + +++ +K + + L
Sbjct: 311 VTQFSEEMLAAITSADQRQEPKEGLRLAERLLERQQQVLHNNHLFVLKCLDKALDQAVVL 370
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
W AL TI Y+ +P +HP LG+QY GKL +L A++++ A IL
Sbjct: 371 RKWNRALRCALQTIEPYKVHFPAYHPSLGIQYMRIGKLLLYLEKRLAALEALQMAEHILN 430
Query: 303 ITHGTNSPFMKEL 315
+THG + P KEL
Sbjct: 431 VTHGKDHPINKEL 443
>gi|414864678|tpg|DAA43235.1| TPA: hypothetical protein ZEAMMB73_073593 [Zea mays]
Length = 342
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 104/157 (66%), Gaps = 6/157 (3%)
Query: 69 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
SC+PNAVL+F+GR VRA+Q + K EV ISYIETA T R LK QY FT TCPRC
Sbjct: 182 SCVPNAVLIFDGRTTYVRALQPIDKDEEVSISYIETAAVTKKRNNYLK-QYFFTYTCPRC 240
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+K FDD A+LEG+RCK+ C GFLL +S K +TCQ+CG R EEI K+ SE+
Sbjct: 241 VK--GFDD---DALLEGFRCKNQTCDGFLLPNSGKKAYTCQKCGASRDVEEINKMRSEIL 295
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 225
LS K + S GN E S YK+IE+L++ LYH FS
Sbjct: 296 QLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFS 332
>gi|222625640|gb|EEE59772.1| hypothetical protein OsJ_12271 [Oryza sativa Japonica Group]
Length = 384
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 82/167 (49%), Positives = 100/167 (59%), Gaps = 39/167 (23%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 163 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 222
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K E
Sbjct: 223 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEE------------------------ 258
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
D +E A+LEGYRC D C GFLL ++D G T
Sbjct: 259 ---------------DSEEDALLEGYRCNDQKCDGFLLPNADIYGVT 290
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 53/80 (66%), Positives = 67/80 (83%)
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
PVY+R+YP FHP++GLQ+YTCGKLEW L TE+A+ S+T A +ILRITHGT S FMKEL
Sbjct: 304 FPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKEL 363
Query: 316 ILKLEEAQAEASYKLSSKDE 335
+ KLEE +AEAS++LS+ DE
Sbjct: 364 LGKLEEVRAEASFRLSAGDE 383
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/345 (30%), Positives = 172/345 (49%), Gaps = 51/345 (14%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQW--PEI----------SINEIAENFSKLACNAHTIC 48
++ + E++ +Q+A+++ L LQ P++ S E +K+ CN TI
Sbjct: 133 LTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEMGSALPPSCREPLSLIAKVTCNCFTIS 192
Query: 49 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 108
+ EL+ +G GLYP +S++NH C PN V+VFEG +RAV+ + E+ ISYIET T
Sbjct: 193 DGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDIDPEDELTISYIETLSLT 252
Query: 109 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 168
RQ+ L+EQY FTC C RC R KD G +L + K
Sbjct: 253 EDRQRQLEEQYHFTCHCQRC----------------DSRDKD----GLMLSGEEGKWRLL 292
Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 228
++ L R E +K ++ +L + L+S G + YK+
Sbjct: 293 KE-ALPR-LEGLKAESNWETLLESCSHLLSSVGGEVPEENLYKL---------------- 334
Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
+ + + + L W+EA++Y T+P Y++ YP HP+ G+Q GKL+ +L + E
Sbjct: 335 -RITDMALDASVHLGRWEEAMSYGVTTLPAYRQYYPDPHPVHGIQLMRVGKLQHYLENIE 393
Query: 289 NAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
+A+ + +A EI++ THG + P EL +K++E +E + SS+
Sbjct: 394 DALDTFRQAYEIVKFTHGEDHPLTTELTMKMKECHSEMDHHSSSR 438
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/318 (31%), Positives = 159/318 (50%), Gaps = 39/318 (12%)
Query: 8 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
+L + +I +L + +S E +K+ CN TI + EL+ +G GLYP +S++N
Sbjct: 209 ELYMKTEIPDLKQRMTSELPLSCREALSIIAKVTCNCFTISDGELQEIGVGLYPSLSLLN 268
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
H C PN V+VF G +RAVQ + G E+ ISYIET T RQK L+EQY FTC C R
Sbjct: 269 HDCRPNCVMVFNGTELQLRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQR 328
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
C Q+ +L G ++ + KE + ++ +
Sbjct: 329 C-----SSKEQDGVMLSG-----------------------REAAWRQLKEALPRLEG-L 359
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 247
+ S L SC N + T E + LY ++ + + + L W+E
Sbjct: 360 QLESNWLELLESCSN----LLTSAGEEVPDENLYK------LKITDMALDASVHLGRWEE 409
Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 307
AL++ T+P Y+ Y HP+ G+Q GKL+ LG E+A++S EA +IL++THG
Sbjct: 410 ALSFGLKTLPSYRLYYADPHPVHGVQLMRVGKLQHHLGCIEDALESFKEAFKILKLTHGE 469
Query: 308 NSPFMKELILKLEEAQAE 325
+ P + +L++K+EE ++E
Sbjct: 470 DHPLIVDLMMKMEECRSE 487
>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
Length = 441
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 160/318 (50%), Gaps = 27/318 (8%)
Query: 9 LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY----PVIS 64
++L + + ++ P S+ E+ F ++ N+ +IC+ E+ P+G G+Y S
Sbjct: 131 MVLMTTLKSFSENVMSMP--SVEELFVIFGRVCVNSFSICDPEMNPIGVGVYIRQIKPCS 188
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
+++HSC PNAV VFEG +R V+ V ++ ISYI+T T TR++ L++QY F C
Sbjct: 189 VLDHSCRPNAVAVFEGTTLRIRCVEPVDSEQDLRISYIDTLDDTTTRRRNLQQQYYFNCL 248
Query: 125 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG-------LVRSK 177
C G+ D ++ I + C C G + + D F+C CG L ++
Sbjct: 249 C------GECKDSEKDLIKFSFNCTSVECKGHVTQVPADDRFSCDICGTTVDDVNLNQAA 302
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
E+ +KI +N L K+ H+ + + + HP++V+ ++ + +
Sbjct: 303 EKAQKIIKHLNELKKQR-------EHESIRMKGLSCVSEMRAILHPYNVHFIKLCDLSLD 355
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+E W +AL +LT+ Y+ Y + HP +G+ + GK+ +L D E A++ + +A
Sbjct: 356 ACIETSQWDDALELGRLTVVAYRFYYGELHPSVGILLFKIGKILSYLADRE-ALQFLKQA 414
Query: 298 VEILRITHGTNSPFMKEL 315
+LR+THG + K+L
Sbjct: 415 ATVLRVTHGDDHSLYKDL 432
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 152/293 (51%), Gaps = 39/293 (13%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E+F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG +RA++++
Sbjct: 170 DIFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAIRNI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
P G E+ I Y++ T RQK LKEQY F C CP C + D +L G
Sbjct: 230 PLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDAD-----MLAGEEQAWK 284
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
G L++ D ++S+E+ +++ + L + +CGN + Y+
Sbjct: 285 EIQGSLIKIED-----------LQSQEKWEQVLAMCQTL------INNCGNRLPDRNIYQ 327
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
++ E + + L W++AL Y T+ Y+ YP FHP+ G
Sbjct: 328 -----------------LKMLECAMDACINLSLWEDALLYGSRTLEPYRLYYPGFHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
+Q GKL+ G A++++ +A E++++THG + +++L+L L + +A
Sbjct: 371 VQVMKVGKLQQHQGLYPQALETLKQAFELIKVTHGRDHSLIEDLMLLLGDCEA 423
>gi|413955565|gb|AFW88214.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 529
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)
Query: 75 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
VL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTCTCPRC+K F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
D E A+LEG+RCK C GFLL +SD
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292
Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
E S YK+I++L++ LYH FS L+ T E L+KI +EL+DW AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344
Query: 255 TIPVYQRVY 263
TIPVY++++
Sbjct: 345 TIPVYEKIF 353
>gi|413955566|gb|AFW88215.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 484
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)
Query: 75 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
VL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTCTCPRC+K F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
D E A+LEG+RCK C GFLL +SD
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292
Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
E S YK+I++L++ LYH FS L+ T E L+KI +EL+DW AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344
Query: 255 TIPVYQRVY 263
TIPVY++++
Sbjct: 345 TIPVYEKIF 353
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 149/283 (52%), Gaps = 17/283 (6%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
S++ CN TICNSE++ +G G+YP +S++NHSC PN F G+ +R +++ G E+
Sbjct: 172 SRVTCNTFTICNSEMQTVGIGVYPGLSLVNHSCSPNCSATFRGKQMQLRIIENTKIGDEL 231
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSG 155
LISYI+ +RQ L+ QY F C C RCI D ++S +++ RC C
Sbjct: 232 LISYIDPMQVLSSRQNQLQSQYCFKCICERCI-----DTTKDSCNNLMDSVRCPKKICKA 286
Query: 156 FLLRDSDDKGFTCQQCGLVRSKE---EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
DS C +CG + + EI+ +++N KT++L G Q + K
Sbjct: 287 ASSLDSLLANKLCPECGSIVDQSFFAEIENFQAQIN----KTISL---GYQQAKLEDLKK 339
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
+ K+ ++ ++ E L+ +E + ++EAL Y + Y R+YP+++P+ G+
Sbjct: 340 LFIEGKQRLGECNMLYIRIIENLMDAYIESQRYEEALEYARRLEEPYCRLYPRYYPVTGV 399
Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
GKLE +LG A+KS+ +A EIL ++HG + E+
Sbjct: 400 HLMKQGKLECYLGKFGEAVKSLGKAKEILLVSHGKECGLIHEM 442
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 147/289 (50%), Gaps = 39/289 (13%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+K+ CN TI + EL+ +G GLYP +S++NH C PN V+VFEG ++RAV+ + G E
Sbjct: 178 IAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLLLRAVRGLSPGEE 237
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
+ ISYIET RQ+ L++QY FTC C +C D + S +R
Sbjct: 238 LTISYIETLSLNEDRQQRLEDQYCFTCHC-QCCNSPDNDKLMLSGEESSWR--------- 287
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
+L+++ + + EE+K ++ ++ L++ G + YK
Sbjct: 288 VLKEA------------LPTLEELKAKSNWQTLVESCHRLLSTEGKRVPEENLYK----- 330
Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
++ + + ++L W EA+ Y Q T+PVY YP HP+ G+Q
Sbjct: 331 ------------LRVTDMALDASIQLGLWTEAVEYGQKTLPVYHLYYPDPHPVHGVQLVR 378
Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
GKL+ +L E A+ + +A IL++THG ++ +L++K+EE +AE
Sbjct: 379 VGKLQHYLAHIEEALDTFKQAYRILKVTHGNDNSITTDLLMKMEECRAE 427
>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
guttata]
Length = 289
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 153/303 (50%), Gaps = 50/303 (16%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
P I +I F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG ++
Sbjct: 28 PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPNCVIVFEGYQLLL 84
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
+V+ + G E+ ISY+E+ T RQK L QY F C CP C Q D ++ A E
Sbjct: 85 HSVRDIQIGEELTISYVESLMPTRERQKQLMRQYCFECDCPLC--QNQEKDAEKLAG-EV 141
Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
+ K+ ++D+ + G +SKEE K ++L++ L S H
Sbjct: 142 HAWKE-------VKDA------VNEVGYPKSKEEWK------HVLARCQNLLRSNKGHLP 182
Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LEDWKEALAYCQLTIPVYQR 261
+ Y++ K++ M+ LE W+EAL Y T+ Y+
Sbjct: 183 DTNIYQL---------------------KMLDCAMDACINLESWEEALYYGSRTLEPYRL 221
Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
YP FHPL +Q GKL++ G A++++ +A I+++THGT+ MK L+ EE
Sbjct: 222 YYPGFHPLRAVQLMRVGKLQYSQGMVPQALETLKQAYNIMKVTHGTDHSLMKVLMEMKEE 281
Query: 322 AQA 324
+A
Sbjct: 282 CEA 284
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 165/327 (50%), Gaps = 55/327 (16%)
Query: 8 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
QL L A+I + +L P I +I F+K+ CN TI N E++ +G GLYP +S++N
Sbjct: 152 QLYLRAEIQDASHLP---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 205
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
HSC PN V++FEG ++R+++ + G E+ ISYIE+ T RQK LK QY F C C
Sbjct: 206 HSCDPNCVIIFEGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDC-- 263
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
C+ Q D ++ A E ++ D + +SKEE +K
Sbjct: 264 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEK----- 304
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LE 243
+L+K L+S +H + Y++ KL+ M+ LE
Sbjct: 305 -VLAKCQHLLSSHTSHLPDTNIYQL---------------------KLLDCAMDACINLE 342
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
W++AL Y T+ Y+ YP FHPL +Q GKL++ G A+ ++ +A +I+++
Sbjct: 343 AWEQALCYGSRTLGPYRLYYPDFHPLRAVQLMRVGKLQYSQGMLPQALGTLKQAYDIMKV 402
Query: 304 THGTNSPFMKELILKLEEAQAEASYKL 330
THGT+ M+ L + L+E Q EA KL
Sbjct: 403 THGTDHSLMQAL-MDLKE-QCEAMMKL 427
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 161/322 (50%), Gaps = 55/322 (17%)
Query: 8 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
QL L A+I + +L P I +I F+K+ CN TI N E++ +G GLYP +S++N
Sbjct: 151 QLYLRAEIQDASHLS---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 204
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
HSC PN V++FEG ++R+V+ + G E+ +SYIE+ T RQ+ LK QY F C C
Sbjct: 205 HSCDPNCVIIFEGYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDC-- 262
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
C+ Q D ++ A E ++ D + +SKEE +K+ ++
Sbjct: 263 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEKVLAKC 308
Query: 188 -NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----L 242
++LS T L N+ Q KL+ M+ L
Sbjct: 309 QHLLSSHTSRLP--------------------------DTNIYQL--KLLDCAMDACINL 340
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
E W++AL Y T+ Y+ YP FHPL +Q GKL++ G A++++ +A +I++
Sbjct: 341 EAWEQALCYGSRTLGPYRLYYPNFHPLRAVQLMRVGKLQYSQGMLPQALETLKQAYDIMK 400
Query: 303 ITHGTNSPFMKELILKLEEAQA 324
+THGT+ M+ L+ E+ +A
Sbjct: 401 VTHGTDHSLMQALMDLKEQCEA 422
>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 282
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/98 (66%), Positives = 80/98 (81%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
+S++D+ QL+LYAQ+ANLV LIL E+ + EI FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
PV+SIINHSC+PNAVL+FEGR A VRA+Q + K EV+
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVM 248
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 153/302 (50%), Gaps = 7/302 (2%)
Query: 23 LQWPEISINEIAEN--FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
LQ I+IN+I K +CN+ I N+EL LGTG++ S+ NHSC PN V F G
Sbjct: 152 LQDVNININDIDIYGLMCKASCNSFAITNAELNSLGTGIFSSASLFNHSCDPNCVATFNG 211
Query: 81 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
R +RA++ + +G E+++SYI ++ RQ L+E Y+FTC C C + D + +S
Sbjct: 212 RDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFTCKCTVCSRKEVNDSLMKS 271
Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 200
+C C FL+ + +CQ+ S + I K ++ L +++
Sbjct: 272 VKCSQPQC---LCMKFLITAPESNKCSCQK-NCEASNDYILKANDCMDKLQALYNSISVV 327
Query: 201 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 260
++ S ++I ++ L P ++ L+Q E + +E DWK A Y Y+
Sbjct: 328 PTIEQQKSLTQLIRCGEEILCSP-NIALLQCYEVAMDGCIESGDWKGAFQYGIKLECSYK 386
Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
++HP LGL Y+ GKL + D N I + + ++L ITHG++ F+++L L+
Sbjct: 387 NYLSEYHPTLGLHYFKLGKLALQIEDLRNGITYLEKGYKVLSITHGSSCHFVQKLKSYLD 446
Query: 321 EA 322
EA
Sbjct: 447 EA 448
>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
Length = 453
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 153/310 (49%), Gaps = 17/310 (5%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
E+ + K+ N I ++E+ +GTG+Y SII+HSC PN V+ F+G +R ++
Sbjct: 154 ELLRIYGKMCINTFNILDAEMSTIGTGMYIGASIIDHSCRPNVVVSFDGETLRMRLLEDY 213
Query: 92 PKG----AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
P+ ++ ISYI+ + RQ+ L E+Y F C C RC D QE +
Sbjct: 214 PEQELDFGKLFISYIDLIDTAEVRQEQLAERYYFHCACERC------RDEQEQKRMNAAA 267
Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
C + C L ++ C CG + + + A E++ ++ LA + +V
Sbjct: 268 CPNTTCHEPLDFSDSEQLNQCPACGTAVTHSDREAFA-EISSFTRDHLAQMKSVAYLDVS 326
Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
++ + Q + H ++V+ ++T + ++ + LE W EA Y + +++ Y +H
Sbjct: 327 ---RLCLEKQANVLHRYNVHHIKTLDNAMESALNLEKWTEATGYGLRLLDGFRQYYSTYH 383
Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL--EEAQAE 325
PLLGL Y GKL+ + A+K + +A +ILR+THG K +++ L + AQ +
Sbjct: 384 PLLGLTYLKVGKLQLYQCQFAEALKQLQQAAKILRVTHGEQDDLYKRVLVPLLCDAAQGD 443
Query: 326 ASYKLSSKDE 335
+ L+ DE
Sbjct: 444 LGH-LAIADE 452
>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Ornithorhynchus anatinus]
Length = 415
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 136/274 (49%), Gaps = 39/274 (14%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
I E+F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG +RAV+ +
Sbjct: 145 IFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGTSLFLRAVREIQ 204
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
KG E+ I Y++ + RQK LKEQY F C C RC Q D+ A E +
Sbjct: 205 KGEELTICYLDVLLPSQERQKQLKEQYCFACDCIRCKT--QDKDVDMLAGEEPF------ 256
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
K++ V+ + + L S ++V++T +
Sbjct: 257 ---------------------------WKEVKDAVDTVEE----LQSQKKWEQVLATCQA 285
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
+ K ++ ++ + + + L W++AL Y + T+ Y+ YP FHP+ +
Sbjct: 286 LINNHKDRIPDRNIYQLKMLDCAMDACINLSLWEDALLYGRRTLDPYRLYYPGFHPIKAI 345
Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
Q GKL+ G A++++ EA ++L++THG
Sbjct: 346 QIMKIGKLQQHQGMFREALETLREAFDLLKVTHG 379
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 154/331 (46%), Gaps = 42/331 (12%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----FSKLACNAHTICNSELRP 54
+ D+ E++ +Q+++++ L +Q P I+ A + +KL CN TI + ELR
Sbjct: 131 LCDMGEERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLLAKLTCNCFTISDGELRE 190
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G GLYP +S++NH C P+ V++FEG +RAV+ + E+ ISYI T T R+
Sbjct: 191 IGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEELTISYIGTLAPTRDRRTQ 250
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
L+EQY FTC C RC D + + + + + + Q C
Sbjct: 251 LEEQYHFTCQCQRCTT-ADMDPVMLCGEEKAWTPLREAIPTL-------ENWQTQACW-- 300
Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 234
E + + +S ++ L A+ C +V ++ +
Sbjct: 301 --NELLYECSSLLDPLDGDVPAVPDC------------------------NVYRLRVTDL 334
Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
+ + L W ALAY +T+ Y++ YP HP +Q KL+ FL +A ++
Sbjct: 335 ALDACINLAAWDTALAYGTITLEPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLGDAQHTL 394
Query: 295 TEAVEILRITHGTNSPFMKELILKLEEAQAE 325
A +I+++THG P +LI KLEE +AE
Sbjct: 395 RLAYDIMKVTHGNQHPLTSDLIRKLEECRAE 425
>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
Length = 520
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 56/82 (68%), Positives = 70/82 (85%)
Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
L++I MEL+DW+ AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+
Sbjct: 367 LLQIYMELQDWQTALMYCRLTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSL 426
Query: 295 TEAVEILRITHGTNSPFMKELI 316
T A ++LRITHGT S FMKEL
Sbjct: 427 TRAADVLRITHGTQSQFMKELF 448
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 45/102 (44%), Gaps = 44/102 (43%)
Query: 14 QIANLVNLILQWPEISINEIAENFSK---------------------------------- 39
++ANLV LIL E+ + EIA FSK
Sbjct: 131 KMANLVQLILPSIELDLKEIAHTFSKSFVCTEVGIAYCHVLEILAVIGSLRPNPLGLGSA 190
Query: 40 ----------LACNAHTICNSELRPLGTGLYPVISIINHSCL 71
ACNAHTIC+ ELRPLGTGL+P ISIINH L
Sbjct: 191 VIAVHPTQVQFACNAHTICDPELRPLGTGLFPAISIINHRIL 232
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 39/293 (13%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E+F+K+ CN I N E++ +G GLYP +S++NHSC PN V+VFEG +RAV+ +
Sbjct: 183 DIFESFAKVICNGFAISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAVRDI 242
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
+G E+ I Y++ + RQK LKEQY F C CP C
Sbjct: 243 QQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCET--------------------- 281
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
R DD + ++ ++ +KK+ ++ K L C Q +++ Y
Sbjct: 282 -------RSKDDDMLSGEEQAWKEVQDSLKKV-EDLRAQEKWEQILAIC---QTLINNYG 330
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
++L + ++ ++ + + + L W++AL Y T+ Y+ Y FHP+ G
Sbjct: 331 --DRLPDR-----NIYQLKMLDCAMDACINLCLWEDALLYGSRTLEPYRLYYSGFHPISG 383
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
+Q GKL+ G A++++ +A E++++THG + +++L+L L + +A
Sbjct: 384 VQVMRVGKLQHHQGLYPQALETLKQAFELMKVTHGRDHSLVEDLLLLLGDCEA 436
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 154/307 (50%), Gaps = 24/307 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
DEK++ + + N+++ L+ P S E+ + ++ N+ +I NS++ +G G+Y
Sbjct: 127 DEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNSDMN-IGVGIYLGP 185
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKEQYLF 121
SI++HSC PNAV FEG V+A++ +P +++ I YI+ +T R+ L+ Y F
Sbjct: 186 SILDHSCKPNAVATFEGTTITVKAIEDLPSLDLSQIRIPYIDVIKTTRDRRAELQSSYYF 245
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
C C RC K + E C + C+ ++D C++C + E K
Sbjct: 246 WCDCERCEK--------PEPMAEAAACPNKLCTYPCAPNAD----LCEKCN-TKFPENFK 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+I E++ + L N + + + KM Q+ + HP +V +QT +
Sbjct: 293 EIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTAFDSS 347
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
+ L+ W+EA +Y + I Y Y + HP G+ Y + GKLE +L + AIK++ +A
Sbjct: 348 INLQHWEEAESYAKKLINGYLTYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLRKASS 407
Query: 300 ILRITHG 306
IL ITHG
Sbjct: 408 ILTITHG 414
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E++ K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
+Q GKL+ G A++++ A +I+R+THG ++LIL LEE A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMQNLRLAFDIIRVTHGREHSLTEDLILLLEECDA 423
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPNINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
+Q GKL+ G A+K++ A +I+++THG +++LI+ LEE A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHGREHSLIEDLIVLLEECDA 423
>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
Length = 455
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 159/334 (47%), Gaps = 21/334 (6%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISI-NEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
D K++ + + ++ +L P I +E+ F K+ N+ +C+ E+ +GTG+Y
Sbjct: 126 DVKRIEHFHTLNVVLQRLLDEPAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGA 185
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG----AEVLISYIETAGSTMTRQKALKEQY 119
SI++HSC PNAV F G +R ++ + + ISYI+ + TR++ L E+Y
Sbjct: 186 SILDHSCRPNAVATFVGEQLQLRLLEDFAGPELDFSRIFISYIDLIDPSDTRREQLSERY 245
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C C RC D E ++ C++ C +RD C+ +
Sbjct: 246 YFRCECVRC------RDEAERELMGAAACQNRKCDE-PIRDGQTLCSACE----APFDQS 294
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+ EV ++ LA + +V ++ + Q + HP + ++T +
Sbjct: 295 ARDRFDEVTSFTRDRLAEMKDVAYFDVC---RLCLEKQSGVLHPLNAQHIKTLDYAFDSA 351
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
++LE W+ AL Y + Y+R P +PLLGL GK++ +LGD + A+ S+ EA +
Sbjct: 352 IKLEKWEAALRYGAGAVAGYRRYSPS-NPLLGLMLANIGKIQLYLGDAKTALSSLHEADK 410
Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
ILR+THG K+ ++ L A Y+LS +
Sbjct: 411 ILRVTHGEQHDLYKDQLVPL-LCDAAQQYELSQR 443
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 146/295 (49%), Gaps = 43/295 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TIC++E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C +RC+
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDC--------------------FRCR-- 267
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 ------TQDKDADMLTGNEQAWKEVQESLKKI-EELKAHRKWEQVLAMC---QTILSSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 318 --EQLP-------DTNIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
G+Q GKL+ G A+K++ A +I+R+THG +++LI LEE A
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREHSLIEDLIRLLEECDA 423
>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 378
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 142/285 (49%), Gaps = 30/285 (10%)
Query: 40 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEV 97
+ N I +SE++ +G+GLY SII+HSC PNAV VF+G +R ++ +P ++V
Sbjct: 108 MVINTFNILDSEMQSIGSGLYLASSIIDHSCSPNAVAVFKGTTIFIRTLEDIPIMDWSKV 167
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
ISYIE RQ+ L Y F C C +C D ++ C+++ C FL
Sbjct: 168 FISYIELLNLPEIRQQELLSSYYFLCQCSKCT------DSDNLNFMKSIYCQNEKCKNFL 221
Query: 158 LRDSDDKGFTCQQCGLVRSKE---EIKKI----ASEVNILSKKTLALTSCGNHQEVVSTY 210
+ + + C +CG SKE EIK+I SE+N + + +
Sbjct: 222 MPNEVE----CHKCGQSISKEDEDEIKEIIQYTESELNKME-----------NMAYLDIC 266
Query: 211 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 270
++ + Q+K P ++ ++ + ++ + LE+W+ A+ Y + + + + Y ++HPL
Sbjct: 267 RICLQKQEKYLTPLNIYKVKILDLAMESSIALENWESAVFYGRQLLEPFLKYYGEYHPLR 326
Query: 271 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
G+ GK+ + L + + A K EA IL +THG N F +EL
Sbjct: 327 GIFLMKLGKILFLLNNIDEAKKYFKEANLILSVTHGKNHCFSREL 371
>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
Length = 468
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 159/318 (50%), Gaps = 23/318 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ + P N E+ + +L N I ++E+ + T +Y +SI +HSC PN
Sbjct: 166 AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 225
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG V A++ + +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +ES + C + C + D ++ C +C S + ++ +E L+
Sbjct: 284 ----DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLT 334
Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+ L N ++V + K+ Q ++HP +V ++T + + +E+ W +AL
Sbjct: 335 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 389
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q +P +++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG +
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDH 449
Query: 310 PFMKE-LILKLEEAQAEA 326
+ E L + + +A+ EA
Sbjct: 450 RLLTEQLYMLVLQARQEA 467
>gi|21430850|gb|AAM51103.1| SD20045p [Drosophila melanogaster]
Length = 382
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 159/318 (50%), Gaps = 23/318 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ + P N E+ + +L N I ++E+ + T +Y +SI +HSC PN
Sbjct: 80 AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 139
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG V A++ + +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 140 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 197
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +ES + C + C + D ++ C +C S + ++ +E L+
Sbjct: 198 ----DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLT 248
Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+ L N ++V + K+ Q ++HP +V ++T + + +E+ W +AL
Sbjct: 249 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 303
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q +P +++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG +
Sbjct: 304 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDH 363
Query: 310 PFMKE-LILKLEEAQAEA 326
+ E L + + +A+ EA
Sbjct: 364 RLLTEQLYVLVLQARQEA 381
>gi|224033847|gb|ACN35999.1| unknown [Zea mays]
gi|413955568|gb|AFW88217.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 129
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 56/98 (57%), Positives = 73/98 (74%)
Query: 185 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 244
S++ LS K + S GN E S YK+I++L++ LYH FS L+ T E L+KI +EL+D
Sbjct: 3 SKILQLSDKASSFLSSGNKAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQD 62
Query: 245 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
W AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW
Sbjct: 63 WWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEW 100
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 137/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K LK+QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRCET--------------------- 268
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + + +E +KKI E+ K L C Q ++S+
Sbjct: 269 -------QDKDADMLTGDEQVWRQVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q R + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINVYQLRVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
Length = 391
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 90/340 (26%), Positives = 164/340 (48%), Gaps = 26/340 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + I+ L + + E E + F+++ CN TI + EL LG+ ++P +
Sbjct: 52 NEKKDLIQSDISALHHFYSKHLEFPDQESLVALFAQVNCNGFTIEDEELSHLGSAIFPDV 111
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 112 ALMNHSCCPNVIVTYKGTLAEVRAVQEIGPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTC 171
Query: 124 TCPRC---------IKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
C C +K+ + +D ++ AI + R + F R + + CG
Sbjct: 172 ECQECTTKDKDKAKVKIRKLNDPPKAEAIRDMVRYARNVIEEF--RRAKHYKYILDSCGR 229
Query: 174 --VRSKEEIKKIASEVNILSKKT--------LALTSCGNHQEVVSTYKMIEKLQKKLYHP 223
R E+ + + V++L KT L L H ++ E Q+K+
Sbjct: 230 SPARGAGELGQELNVVDVLLSKTTENGSLKVLILGHYFGHTPPSELLEICELSQEKMSSV 289
Query: 224 F---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
F +V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 290 FEDSNVYMLHMMYQAMGVCLYIQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRL 349
Query: 281 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 350 YMGLENRAAGEKALRKAMAIMEVAHGKDHPYISEIKQEIE 389
>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
Length = 442
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 151/301 (50%), Gaps = 24/301 (7%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
E+ + ++ N+ I + EL+ +GTG+Y S+I+HSC PNAV +F+G + +RA+Q
Sbjct: 154 ELMGMYGRMCINSFNIIDQELQCIGTGMYLGASVIDHSCSPNAVAIFDGPILSIRALQTF 213
Query: 92 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
+++ ISYI+ +T RQ L+ Y F C CP+C++ E + C
Sbjct: 214 QYLDWSQIKISYIDILNTTKDRQSELEAAYYFLCKCPKCLE-------PEPPEINAAACP 266
Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA-----SEVNILSKKTLALTSCGNHQ 204
++ C + + G C +C V S+ +K+ +++++ + K LA
Sbjct: 267 NEKCDNHIDTEIITPGDKCAKCDTVVSETFLKRFKEVIEFTDLHLQNMKQLA-------- 318
Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
++ K Q+ + H F++ ++T + + ++ + W A + + + + Y
Sbjct: 319 -YFDVCEICLKKQEGVLHKFNIKHVKTLDLAFQSSIDFQKWDFARKFALELVDAFYKYYG 377
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK-ELILKLEEAQ 323
HP+ GL + GK+ F + + A++ +T+A +IL+I HG S K EL+ L++A+
Sbjct: 378 HVHPITGLLHLKLGKILLFEENDQLALEHLTKAYQILKIIHGVGSHLFKDELVPLLQQAR 437
Query: 324 A 324
A
Sbjct: 438 A 438
>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
Length = 439
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 159/317 (50%), Gaps = 19/317 (5%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ P N E+ + +L N + ++E+ + T +Y +SI +HSC PN
Sbjct: 137 AVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 196
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG + A++ +P +++ ISYI+ + R++ LK+ Y F C C +C
Sbjct: 197 AVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC--- 253
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +E+ + C + CS + + ++ C++C + S + ++ +E+ L+
Sbjct: 254 ---RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALT 305
Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
K L + +V K+ Q L HP +V ++T + + +E+ W+EAL Y
Sbjct: 306 KHNLEAMKDVAYLDVC---KVCLDKQAGLMHPLNVWHVKTLDAAFEAAIEVGKWEEALDY 362
Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
Q +P +Q + Q++PL+GL + GK++ F + A + EA IL +THG +
Sbjct: 363 GQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRL 422
Query: 312 MKELILKL-EEAQAEAS 327
+ E + L +A+ EA+
Sbjct: 423 LVEQLYPLIFQARQEAN 439
>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 445
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 162/326 (49%), Gaps = 29/326 (8%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
DEK++ + + ++ ++ + N E+ F K+ N+ I + E+ +GTG+Y
Sbjct: 125 DEKRMEHFQSLTVVLRSLIDEAAMPGNQELLRIFGKMCINSFNILDDEMNSIGTGMYLGA 184
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQ 118
SI++HSC PNAV +F+G VR ++ GAE+ ISYI+ T R+ L+++
Sbjct: 185 SIMDHSCRPNAVAIFDGCNLNVRLLEDY-HGAEIDFSKIFISYIDLLNPTDVRRDMLRKR 243
Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSK 177
Y F C C RC D QE ++ G C + C + + D+ C +
Sbjct: 244 YYFECGCERC------RDEQELKLMNGAACANAQCDEPISMTARQDRCPGCSTAIKQTER 297
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL----QKKLYHPFSVNLMQTRE 233
++ ++I++ +T ++V TY I +L Q+K++HPF+V ++T +
Sbjct: 298 DKFREISA---------FTMTQLEQMKDV--TYLDICQLCLSKQEKVFHPFNVWYLKTLD 346
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
+ +E+E W++A+ Y ++R FHPL L GK+E +L + A+ +
Sbjct: 347 LAFESAIEMEKWEDAIDYGSRLKDGFKRFNGAFHPLYALLLLKLGKIEIYLKHGKEALMN 406
Query: 294 MTEAVEILRITHGTNSPFMKELILKL 319
+ E+ +ILRITHG K+ ++ L
Sbjct: 407 VNESEKILRITHGEEHDLYKKQLIPL 432
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 158/313 (50%), Gaps = 27/313 (8%)
Query: 2 SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 56
SDI DEK++ + + ++ L ++SI AE + ++ N+ I + ++ +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 114
G+Y SI++HSC PNAV FEG ++R + +P +++ ISYI+ +T R++
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
L+ Y F C C +C +ES ++E C + C+ D+D C+ C
Sbjct: 240 LQSSYYFWCNCKKC---------EESEPMVEAAACPNKFCTYPCSLDAD----MCENCN- 285
Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
+ E K+ E++ L+ L + +V + M K Q+ + HP ++ +QT +
Sbjct: 286 TKFPENFKETFYEISDLTAYHLQNMKNIAYLDVST---MCLKKQEGVLHPLNIQHVQTLQ 342
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
+ L+ W+EA +Y + I Y Y +FHP G+ Y + GKL+ +L + AI++
Sbjct: 343 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKKLKQAIET 402
Query: 294 MTEAVEILRITHG 306
+ +A IL ITHG
Sbjct: 403 LRKASAILTITHG 415
>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
Length = 352
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 94 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 153
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 154 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 190
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 191 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 241
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 242 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 292
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 293 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 329
>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
Length = 390
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 32/278 (11%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+ K CN I N +LR +G GLYP ++INHSC N V F G +RA+ + G E
Sbjct: 106 YGKTTCNCFAIHNLDLREIGVGLYPQAAMINHSCKSNCVSTFRGPTLQIRALVDIQPGEE 165
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
V SY E T R+ L+ +Y F C CP C+ D AI++ +C C G
Sbjct: 166 VCYSYTEKGNVTHERRDELR-KYFFECQCPHCL------DTDRDAIMKSVKCP--SCQGQ 216
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKK-----IASEVNILSKKTLALTSCGNHQEVVSTYK 211
+ S D+ C CG E + I E ++L ++
Sbjct: 217 VKPTSSDRYEKCSSCGFTDFTTEFYEDLEIYIHVEFDLLFRENC---------------- 260
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
++E K+ HP ++++++ ++LE+W +A+ Y + + P P G
Sbjct: 261 LVE--LDKILHPDNIHVVRILVGAFAASVKLEEWTKAIDYGKRLDRAFGLYLPPNEPDTG 318
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
L YY GK + L D ENA+ S+ +A +L I +G +S
Sbjct: 319 LLYYKMGKAYYHLDDIENAVTSLRKAKTLLSIAYGRDS 356
>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
Length = 429
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 155/338 (45%), Gaps = 48/338 (14%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNAHTICNSELRP 54
+ D+ E++ +Q+++++ L L+ + I + +F +K+ CN TI ++EL+
Sbjct: 131 LCDMGEEKREGLSQLSSMLQLYLKQEQPDIIQKVPSFDPISLLAKVTCNCFTISDAELQE 190
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G GLYP +S++NH C P+ V++F+G+ +RA++ + EV ISYI T RQ
Sbjct: 191 IGVGLYPSMSLLNHDCRPSCVMLFQGKTLQLRAIRDIQPTEEVTISYIGVLLPTRERQTQ 250
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
L EQY F+C C C + D + CG+
Sbjct: 251 LMEQYHFSCQCGLC-STAELDPLMF-------------------------------CGVK 278
Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTR- 232
+ +K+ + IL GN +E++ ++ + + N+ + R
Sbjct: 279 EAWTPMKEAIPRLEILQTN-------GNWEELLQECSSLLAPVGGAVPAVPDSNVYRLRV 331
Query: 233 -EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
+ + L W+ ALAY T+ Y++ YP HP +Q KL+ FL E+A
Sbjct: 332 TDLAFDACINLARWETALAYGLKTLGPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLEDAQ 391
Query: 292 KSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYK 329
++ A +I++ITHG +LI KLEE AE K
Sbjct: 392 HTLRLAYDIMKITHGNEHSLTSDLIRKLEECLAEMDCK 429
>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
Length = 466
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 157/318 (49%), Gaps = 23/318 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ + P N E+ + +L N I ++E+ + T +Y +SI +HSC PN
Sbjct: 164 AVLSDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 223
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG V A++ + +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 224 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 281
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +ES + C + C + D + C +C S + ++ +E L+
Sbjct: 282 ----DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISPK-LRNAFNEAMTLT 332
Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+ L N ++V + K+ Q ++HP +V ++T + + +E+ W +AL
Sbjct: 333 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 387
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q +P +++ + ++PLLGL + GK++ + G + A+ + EA IL +THG +
Sbjct: 388 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHAKEALHHLEEAQRILTVTHGRDH 447
Query: 310 PFMKE-LILKLEEAQAEA 326
+ E L L + +A+ EA
Sbjct: 448 RLLTEQLYLLVLQARQEA 465
>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
[Equus caballus]
Length = 358
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 100 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCEPNCSIVFNGPHLLLRAVRDI 159
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 160 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 196
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 197 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 247
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 248 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 298
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 299 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 335
>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
Length = 287
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 29 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 88
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 89 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQT--------------------- 127
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 128 -------QDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 176
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 177 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 227
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 228 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 264
>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
Length = 469
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 159/320 (49%), Gaps = 23/320 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ P N E+ + +L N I ++E+ + T +Y +SI +HSC PN
Sbjct: 166 AVLTDMMADSPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 225
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG V A++ + +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +ES + C + C + D + C +C S + ++ ++ L+
Sbjct: 284 ----DAKESKEMLSALCPNRNCGVGITVDRTN----CPRCDAGISPK-LRNAFNDAMTLT 334
Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+ L N ++V + K+ Q ++HP +V ++T + + +E+ W +AL
Sbjct: 335 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWTDAL 389
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q +P +++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG +
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGRSKEALHHLEEAQRILTVTHGRDH 449
Query: 310 PFMKE-LILKLEEAQAEASY 328
+ E L + + +A+ EA++
Sbjct: 450 RLLTEQLYVLILQARQEAAH 469
>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
Length = 369
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 257
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 258 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 309
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 182 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 241
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 242 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 278
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 279 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 329
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 330 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 380
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 381 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 417
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
Length = 369
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
leucogenys]
Length = 369
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----------------------- 207
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
terrestris]
Length = 439
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 150/308 (48%), Gaps = 24/308 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
DEK++ + + N+++ L+ P S E+ + ++ N+ +I N ++ +G G+Y
Sbjct: 127 DEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNLDMN-IGVGIYLGP 185
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKEQYLF 121
SI++HSC PNAV FEG V+A++ +P +++ I YI+ + R+ L+ Y F
Sbjct: 186 SILDHSCKPNAVATFEGTTINVKAIEDLPSLDLSQIRIPYIDVIKTAGDRRAELQSSYYF 245
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
C C +C K + E C + C+ D C++C + E K
Sbjct: 246 WCDCEKCEK--------PEPMAEAAACPNKLCT----YPCDPNADLCEKCN-TKFPENFK 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+I E++ + L N + + + KM Q+ + HP +V +QT +
Sbjct: 293 EIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTAFDSS 347
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
+ L+ W+EA +Y + I Y Y + HP G+ Y + GKLE +L + AIK++ +A
Sbjct: 348 INLQHWEEAESYAKKLINGYLAYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLRKASL 407
Query: 300 ILRITHGT 307
IL ITHG
Sbjct: 408 ILTITHGA 415
>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
Length = 373
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 115 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 174
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 175 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 211
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 212 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 262
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 263 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 313
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 314 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 350
>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
aries]
Length = 369
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 170
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 207
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNA 258
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 309
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
Length = 441
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 144/299 (48%), Gaps = 34/299 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+I E+ + ++ N+ I + + +G G+Y S+I+HSC PN ++VFEG +VR +
Sbjct: 148 NIAELMGIYGRICTNSFNILDINMNTIGVGIYLGASVIDHSCKPNVIVVFEGTTIIVRTL 207
Query: 89 QHVPK--------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
+P ++ ISY++ S R++ L Y F C C RC K E
Sbjct: 208 TDLPSLDWSQASIDKDIRISYVDLLNSNKDRREELHSSYYFWCDCERCKK--------EE 259
Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG---LVRSKEEIKKIAS-EVNILSK-KTL 195
++E C + C ++D+ C++C V KE +++ ++ L K KT+
Sbjct: 260 PMVEAAACPNLLCDSPCSIEADE----CEKCNKEISVEFKETFREVVDFTIHHLEKMKTM 315
Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
A + K+ K QK + H F++ ++T E M L+ W++A Y +
Sbjct: 316 AY---------LDVSKICLKKQKGIMHKFNIQHVRTLEMAHIAAMNLKCWEDAEFYGKEL 366
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
+P Y Y + HPL GL Y T GK++ L + A +++T+A +L ITHG ++E
Sbjct: 367 VPGYLLYYGEIHPLTGLLYLTVGKIQLHLEKPKEAFQALTKANTVLTITHGDKHSIVEE 425
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 172 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 231
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 232 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 268
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 269 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 319
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 320 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 370
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 407
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSN- 316
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 231 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 267
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 318
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 369
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 370 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 406
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 206 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 265
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 266 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 302
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 303 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 353
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 354 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 404
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 405 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 441
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
Length = 390
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 147/320 (45%), Gaps = 25/320 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + I+ L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 90 NEKKDLIQSDISALHHFYSKHLEFPDNKSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C K + +R D C +VR + +
Sbjct: 210 ECQECTT------------------KHKDKAKVKIRKLSDPPKAEAICDMVRY---ARNV 248
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILM 240
E L C ++ E Q+K+ F +V ++ + + + +
Sbjct: 249 IEEFRRAKHYKYILYRCPGVAPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCL 308
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +AVEI
Sbjct: 309 YMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALRKAVEI 368
Query: 301 LRITHGTNSPFMKELILKLE 320
+ + HG + P++ E+ ++E
Sbjct: 369 MEVAHGKDHPYISEIKQEIE 388
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALLYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 143/277 (51%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
D++ + Q E +KKI E+ K L C Q ++S+
Sbjct: 279 ----------DEQVWKGVQ-------ESLKKI-EELKTHWKWEQVLAMC---QSIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 138/275 (50%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC + Q+
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC-------ETQD------------ 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
Length = 386
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 151/319 (47%), Gaps = 31/319 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 96 NEKRELIQSDIAALHHFYSKHLEYPDNCSLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 155
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R LK+ Y FTC
Sbjct: 156 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTC 215
Query: 124 TCPRC-IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
C C K ++ + E + +D +RD +V+ + +
Sbjct: 216 ECRECTTKAKDKAKVEIRKLSEPPKAED-------IRD------------MVKYARNVIE 256
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ L C QE +S+ L+ +V ++ + + + + +
Sbjct: 257 EFRRAKHYKSPSELLEICELSQEKMSS----------LFEDSNVYMLHMMYQAMGVCLYM 306
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+DW+ AL Y Q I Y + YP + + + G+L L + +K++ +A+ I+
Sbjct: 307 QDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKASGVKALKKAIVIME 366
Query: 303 ITHGTNSPFMKELILKLEE 321
+ HG + P++ E+ ++E+
Sbjct: 367 VAHGKDHPYISEIKKEIED 385
>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 369
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDI 170
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++T ++ R+K L++QY F C C RC Q D
Sbjct: 171 EVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRC----QTQD--------------- 211
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E++ K EV K L + ++V++ +
Sbjct: 212 -------KDAD----------MLTGDEQVWK---EVQKSLKTIEELKAHWKWEQVLAMCQ 251
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I + ++ ++ + + + L ++AL Y T+ Y+ +P HP+ G
Sbjct: 252 AIISSNSERLPDINIYQLKVLDCAMDACIHLGLLEKALFYGTRTMEPYRIFFPGSHPVRG 311
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 312 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ + Y++ ++ R+K L+ QY F C C RC
Sbjct: 230 ETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSN- 316
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACIHLGLLEEALFYGLRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ T+CN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----------------------- 266
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 141/275 (51%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDV 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
D++ + +E +K I E+ K L C Q ++S+
Sbjct: 279 ----------DERVWK-------EVQESLKHI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
E+L ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 318 --ERLPD-----LNIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
Length = 448
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 142/292 (48%), Gaps = 20/292 (6%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V A++ +
Sbjct: 164 ELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHAIEDL 223
Query: 92 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
P +++ ISYI+ + R+ LKE Y F C C +CI D QE + C
Sbjct: 224 PCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAAVCP 277
Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--V 207
+ C + D CQ C E++ +E+ L++ +L + +EV +
Sbjct: 278 NASCDASVNIDL----AKCQSCD-ASVTPELRSAYNEIMSLTQSSL-----DSMKEVAYL 327
Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
K+ Q+ + HP +V ++T + + + + W +AL Y Q +P + + + ++
Sbjct: 328 DVCKVCLAKQRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALEYGQQLLPGFAKYHGAWN 387
Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
PLLGL + GK++ + + + A+ + EA IL +THG + + E + L
Sbjct: 388 PLLGLLHMKLGKIQLYERNYKQAVHHLQEAQRILNVTHGRDHRLLLEQLRPL 439
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTHD--------------- 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIISSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
Length = 391
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 149/318 (46%), Gaps = 30/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 100 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 159
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 160 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 219
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKK 182
C C KD + +R D +VR ++ I++
Sbjct: 220 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 261
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ S + L C QE +S+ ++ +V ++ + + + + +
Sbjct: 262 FRRAKHYKSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 311
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+
Sbjct: 312 QDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIME 371
Query: 303 ITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 372 VAHGKDHPYISEIKQEIE 389
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC Q+
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC-------PTQD------------ 270
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
Length = 432
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 149/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + +A L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKRDLIQSDVAALHHFYSKHLEFPDT--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C +D D ++ E I+
Sbjct: 261 TCQCQECT----------------------------TKDKDKAKVEIRKLSDPPKAETIR 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
++ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 293 EMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGSVFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L D K++ +A+ I+
Sbjct: 353 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431
>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
[Ailuropoda melanoleuca]
Length = 385
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 146/319 (45%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 95 NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 152
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 153 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 212
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C KD + +R D +VR +
Sbjct: 213 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 254
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE + + ++ +V ++ + + + +
Sbjct: 255 EEFRRAKHYKSPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLY 304
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L D K++ +A+ I+
Sbjct: 305 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIM 364
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 365 EVAHGKDHPYISEIKQEIE 383
>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
Length = 386
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 151/325 (46%), Gaps = 39/325 (12%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 90 NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 147
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 148 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 207
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 177
TC C C KD + +R D +VR
Sbjct: 208 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 249
Query: 178 EEIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
EE ++ IL S + L C QE + + ++ +V ++ +
Sbjct: 250 EEFRRAKHYKYILYNSPPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQA 299
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
+ + + ++DW+ AL Y Q I Y + YP + + + G+L L D K++
Sbjct: 300 MGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALK 359
Query: 296 EAVEILRITHGTNSPFMKELILKLE 320
+A+ I+ + HG + P++ E+ ++E
Sbjct: 360 KAIAIMEVAHGKDHPYISEIKQEIE 384
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 145/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKELIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R +D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETVRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
Length = 468
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 154/318 (48%), Gaps = 23/318 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ P N E+ + +L N + ++++ + T +Y +SI +HSC PN
Sbjct: 166 AVLTDMMADSPSTVPNKTELMSIYGRLITNGFNVLDTDMNSIATAIYLGVSITDHSCQPN 225
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG V A++ + +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +ES + C + C + D + C +C S K+ + N
Sbjct: 284 ----DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISP----KLRNAFN--D 329
Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
L L + N ++V + K+ Q ++HP +V ++T + + +E+ W +AL
Sbjct: 330 AMALTLHNLENMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 389
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q +P +++ + ++PLLGL + GK++ + G + A+ + EA IL +THG +
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLVHMKLGKIQLYTGQCKEALHHLEEAQRILTVTHGRDH 449
Query: 310 PFMKE-LILKLEEAQAEA 326
+ E L L + +A+ +A
Sbjct: 450 RLLTEQLYLLILQARQQA 467
>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
Length = 349
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 59 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 118
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 119 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 178
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 179 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 220
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 221 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 270
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 271 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 330
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 331 AHGKDHPYISEIKQEIE 347
>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
Length = 433
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 148/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ I
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEI 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
Length = 433
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D ++RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEVIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
gorilla]
Length = 384
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 94 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 153
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 214 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 255
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 306 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 365
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382
>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
Length = 380
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 90 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 210 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 251
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 252 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 301
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 302 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 361
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 362 AHGKDHPYISEIKQEIE 378
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVVFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ I
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEI 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
Length = 384
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 94 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 153
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 214 ECQECTTRD-----KDKAKVE-IRKLTDPPKAEAIRD------------MVRYARNVIEE 255
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ +
Sbjct: 306 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEV 365
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382
>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
mutus]
Length = 383
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 150/318 (47%), Gaps = 30/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 92 NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 151
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 152 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 211
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKK 182
C C KD + +R +D +VR ++ I++
Sbjct: 212 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEE 253
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ S + L C QE +S ++ +V ++ + + + + +
Sbjct: 254 FRRAKHYKSPPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYM 303
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+DW+ AL Y Q I Y + YP + + + G+L L + +++ +A+ I+
Sbjct: 304 QDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIME 363
Query: 303 ITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 364 VAHGKDHPYISEIKQEIE 381
>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
Length = 385
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 156/321 (48%), Gaps = 28/321 (8%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ IA L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 88 NEKRELIQNDIAALHHFYSKHLEYPDNAA--LVVLFAQVNCNGFTIEDEELSHLGSAIFP 145
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAV+ + G EV SYI+ T R L++ Y F
Sbjct: 146 DVALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 205
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK-EEI 180
C C C+ + D ++ I R +D S +RD R+ EE
Sbjct: 206 NCDCRECVTKEK--DKEKLEI----RKLNDPPSAETVRD---------MIKYARNVIEEF 250
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
++ IL TLA E++ ++ ++ +V ++ + + + +
Sbjct: 251 RRAKHYKYILCLTTLAC-------ELLEICELSLDKMGAVFEDSNVYMLHMMYQAMGVCL 303
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++DW+ AL Y Q I Y + YP + + + G+L L + +K++ A+ I
Sbjct: 304 YVQDWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRAAGVKALKRAIAI 363
Query: 301 LRITHGTNSPFMKELILKLEE 321
+ + HG + P++ E+ +LE+
Sbjct: 364 MEVAHGKDHPYISEIKKELED 384
>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
mulatta]
Length = 433
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
boliviensis]
Length = 433
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
Length = 433
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
Length = 433
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSRHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
Length = 433
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
Length = 433
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + I+ L + E NE + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKKDLIQSDISALHRFYSKHLEFPDNESLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQAISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R +D +VR + +
Sbjct: 263 ECQECTT------------------KDKDKAKVKIRKLNDPPKADAVRDMVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + +
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYTQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
Length = 380
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 90 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 210 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 251
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 252 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 301
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 302 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 361
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 362 AHGKDHPYISEIKQEIE 378
>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
anubis]
Length = 258
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 134/274 (48%), Gaps = 39/274 (14%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+ E K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 1 MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 61 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 100
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
+D+D ++ E++ K EV KK L + ++V++ +
Sbjct: 101 ------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQA 141
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
I + +V ++ + + + L +EAL Y T+ Y+ +P HP+ G+
Sbjct: 142 IISSNSERLPDINVYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGV 201
Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
Q GKL+ G A+K++ A +I+R+THG
Sbjct: 202 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235
>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
familiaris]
Length = 523
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 74/317 (23%), Positives = 146/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E +E + F+++ CN TI + EL LG+ ++P +
Sbjct: 233 NEKKDLIQSDIAALHHFYSKHLEFPDSESLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 292
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 293 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 352
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C +D D ++ E I+ +
Sbjct: 353 QCQECTT----------------------------KDKDKAKVEIRKLSDPPKAETIRDM 384
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ ++ + E++ ++ ++ ++ +V ++ + + + + ++
Sbjct: 385 VRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQ 444
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L D +++ +A+ I+ +
Sbjct: 445 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGERALKKAIAIMEV 504
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 505 AHGKDHPYISEIKQEIE 521
>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
Length = 384
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 94 NEKKDLIQSDIAALHHFYSKHLEFLDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDV 153
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R D +VR + +
Sbjct: 214 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 255
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 306 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 365
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382
>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
rotundata]
Length = 438
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 83/319 (26%), Positives = 153/319 (47%), Gaps = 23/319 (7%)
Query: 2 SDI--DEKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
SDI DEK++ + + L + P + E+ + ++ N+ I + ++ +G G
Sbjct: 122 SDIKKDEKKMEHFMFLCGVLFGFLGDTPMPNSAELMGIYGRICINSFNIFDLDMNSIGVG 181
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKALK 116
+Y S+++HSC+PNAV FEG +R ++ +P +++ ISYI+ +T R+ L+
Sbjct: 182 IYLAPSVVDHSCVPNAVATFEGITLNIRTIEDLPSLDWSQIRISYIDVLKTTKERRSELQ 241
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAIL-EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
Y F C C +C +E ++ E C + C+ D + +C +C ++
Sbjct: 242 SSYYFWCNCKKC---------EEPELMAEAAACSNKNCTNPCSPDME----SCPECN-IK 287
Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
E K+I EV+ + L + + KM K Q+ + H ++ +QT +
Sbjct: 288 LLENFKEIFDEVSSFTAHHLQNMK---NMAYLDVSKMCLKKQEGVLHSLNIQHVQTLQTA 344
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
+ L+ W+EA Y + I Y Y + HPL G+ GK++ +L + A++++
Sbjct: 345 FDSSISLQHWEEAEFYAKKLIKGYLVYYGEVHPLTGILLLMTGKIQLYLEKPKQALEALR 404
Query: 296 EAVEILRITHGTNSPFMKE 314
+A IL ITHG ++E
Sbjct: 405 KANSILAITHGEQHTLVRE 423
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 43/277 (15%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ CN+ TICN+E + +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEXQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 231 EVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRC----------------------- 267
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 -----QTQDKDADXLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAXC---QAIISSNS 318
Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
E+L +N+ Q + + + L +EAL Y T Y+ +P HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAXDACINLGLLEEALFYGTRTXEPYRIFFPGSHPV 369
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A K++ A +I R+THG
Sbjct: 370 RGVQVXKVGKLQLHQGXFPQAXKNLRLAFDIXRVTHG 406
>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
abelii]
gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
Length = 258
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 133/276 (48%), Gaps = 43/276 (15%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+ E K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 1 MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 61 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 96
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 97 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 147
Query: 213 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 270
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 148 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 199
Query: 271 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 200 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235
>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
Length = 461
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 146/310 (47%), Gaps = 22/310 (7%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L ++++ P N E+ + +L N I + E+ + T +Y +SI +HSC PN
Sbjct: 153 AVLTDMMVDSPSTVPNKTELMSIYGRLITNGFNILDGEMNSIATAIYLGVSITDHSCQPN 212
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG + A++ +P +++ ISYI+ + R+ LKE Y F C C +C
Sbjct: 213 AVATFEGNELHIYAIEDMPCLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 270
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC--GLVRSKEEIKKIASEVNI 189
D QE +E C C + + K C+ C G+ +++ +EV
Sbjct: 271 ----DRQEQREMEAAVCPFRKCGSSI----NVKWTHCRNCQAGIT---PKLRNTYNEVMA 319
Query: 190 LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
++K L + +V K+ Q + HP +V ++T + + +E+ W EAL
Sbjct: 320 MTKHHLESMKDVAYLDVC---KVCLDKQTGVLHPLNVWHVKTLDAAFEAAIEVGKWPEAL 376
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
Y Q + +++ +HPLLGL + GK++ + G + A+ EA IL +THG
Sbjct: 377 NYGQQLLQGFRKYNGPWHPLLGLLHMKLGKIQLYEGQCKEAMHHFEEARRILCVTHGREH 436
Query: 310 PFMKELILKL 319
+ E +L L
Sbjct: 437 RLIGEQLLPL 446
>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
Length = 326
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 27/319 (8%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 30 NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 87
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 88 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 147
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C KD + +R K + Q VR
Sbjct: 148 TCECRECT------------------TKDKDKAKVEIR----KLSSPPQAEAVRDMVRYA 185
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ E +K S E++ ++ ++ ++ +V ++ + + + +
Sbjct: 186 RNVIEEFRRAKHYKYNLSVSPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 245
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 246 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 305
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 306 EVAHGKDHPYISEIKQEIE 324
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 143 NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R +D +RD +VR + +
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLNDPPKAETIRD------------MVRYARNVIEE 304
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S ++ +V ++ + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYMQ 354
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + +++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEV 414
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431
>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
Length = 536
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 246 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 305
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 306 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKTGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 365
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C ++ A +E R D +RD +VR + +
Sbjct: 366 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 407
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S+ ++ +V ++ + + + + ++
Sbjct: 408 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 457
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+ +
Sbjct: 458 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEYKAAGEKALKKAIAIMEV 517
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 518 AHGKDHPYISEIKQEIE 534
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C +D D ++ E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIR 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431
>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
anatinus]
Length = 412
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 151/318 (47%), Gaps = 29/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 122 NEKRELIQSDIAALHHFYSKHIEYPDNASLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 181
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G E+ SYI+ T R L++ Y FTC
Sbjct: 182 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEIFTSYIDLLYPTEDRNDRLRDSYFFTC 241
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C CI ++ A +E R +D +RD +++ + +
Sbjct: 242 ECRECITKE-----KDKAKVE-IRKLNDPPKAEAIRD------------MIKYARNVIEE 283
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE + + ++ +V ++ + + + + ++
Sbjct: 284 FRRAKHYKSPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLYMQ 333
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + +K++ +A+ I+ +
Sbjct: 334 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKSAGVKALKKALTIMEV 393
Query: 304 THGTNSPFMKELILKLEE 321
HG + P++ E+ ++E+
Sbjct: 394 AHGKDHPYISEIKKEIED 411
>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
Length = 395
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 145/317 (45%), Gaps = 29/317 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 105 NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 164
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 165 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 224
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C KD + +R +D +VR + +
Sbjct: 225 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEE 266
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L C QE +S ++ +V ++ + + + + ++
Sbjct: 267 FRRAKHYKSPSELLEICELSQEKMSC----------VFEDTNVYMLHMMYQAMGVCLYMQ 316
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + +++ +A+ I+ +
Sbjct: 317 DWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEV 376
Query: 304 THGTNSPFMKELILKLE 320
HG + P++ E+ ++E
Sbjct: 377 AHGKDHPYISEIKQEIE 393
>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
Length = 433
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C +D D ++ E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEIRKLSNPPQAEAIR 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
I HG + P++ E+ ++E
Sbjct: 413 EIAHGKDHPYISEIKQEIE 431
>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
Length = 386
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 144/314 (45%), Gaps = 33/314 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + +A L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 96 NEKKDLIQSDVAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 153
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 154 DVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 213
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C KD + +R D +VR +
Sbjct: 214 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 255
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S+ ++ +V ++ + + + +
Sbjct: 256 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 305
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + +++ +A+ I+
Sbjct: 306 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLHMGLENKAAGERALKKAIAIM 365
Query: 302 RITHGTNSPFMKEL 315
+ HG + P++ E+
Sbjct: 366 EVAHGKDHPYISEI 379
>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
griseus]
Length = 404
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 114 NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 171
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 172 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 231
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C +D D ++ E ++
Sbjct: 232 TCECRECT----------------------------TKDKDKAKVEIRKLSSPPQAEAVR 263
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 264 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 323
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 324 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 383
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 384 EVAHGKDHPYISEIKQEIE 402
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 147/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAMLHQFYSKHLEFPDH--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ V G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C RD D ++ E I+
Sbjct: 261 TCECQECTT----------------------------RDKDKAKVEIRKLSNPPKAEAIR 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSAVFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
L+DW+ AL Y Q I Y R YP + + + G+L L + +++ +A+ I+
Sbjct: 353 LQDWEGALRYGQKIIKPYSRHYPLYSLNVASMWLKLGRLYLGLENRVAGERALRKALAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKREIE 431
>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
Length = 448
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 150/295 (50%), Gaps = 20/295 (6%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
++NE+ + +L N I ++E+ + TG+Y +SI +HSC PNAV FEG V A+
Sbjct: 161 NMNELMSIYGRLITNGFNILDAEMNSIATGIYLGVSITDHSCQPNAVATFEGNELHVHAI 220
Query: 89 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+ +P +++ ISYI+ + R+ LKE Y F C C +CI D QE +
Sbjct: 221 EDLPCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAA 274
Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
C + C + D C+ CG S E+++ SEV L++ +L + +EV
Sbjct: 275 VCPNGSCDASVNIQLD----KCESCGTPVS-EQLRTDYSEVMALTQSSL-----DSMKEV 324
Query: 207 --VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
+ K+ + + HP +V ++T + + +++ W EAL Y + +P + + +
Sbjct: 325 AYLDVCKVCLAKHRGILHPLNVWHVKTLDAAFEASIQVGKWTEALDYGRQLLPGFAKYHG 384
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
+++PLLGL GK+E + + + A++ + EA IL +THG + + E + L
Sbjct: 385 EWNPLLGLLQLKLGKIELYERNYKQALQHLQEAQRILNVTHGRDHRLLLEQLRPL 439
>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
Length = 402
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 145/318 (45%), Gaps = 29/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + I+ L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 112 NEKRELIQSDISALHHFYSKHIEYPDNASLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 171
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R LK+ Y FTC
Sbjct: 172 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTC 231
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C + D ++ E I+ +
Sbjct: 232 ECRECTT----------------------------KAKDKAKVEIRKLSEPPKAESIRDM 263
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ ++ E++ ++ ++ L+ +V ++ + + + + ++
Sbjct: 264 VKYARNVIEEFRRAKHYKTPSELLEICELSQEKMGSLFEDSNVYMLHMMYQAMGVCLYMQ 323
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + +K++ +A+ I+ +
Sbjct: 324 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKASGVKALKKAIAIMEV 383
Query: 304 THGTNSPFMKELILKLEE 321
HG + P++ E+ ++E+
Sbjct: 384 AHGKDHPYISEIKKEIED 401
>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
Length = 251
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 129/267 (48%), Gaps = 43/267 (16%)
Query: 42 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y
Sbjct: 3 CNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICY 62
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
++ ++ R+K L++QY F C C RC +D
Sbjct: 63 LDMLMTSEERRKQLRDQYCFECDCFRCQT----------------------------QDK 94
Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
D T + +E +KKI E+ K L C Q V+S+ E+L
Sbjct: 95 DADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAVISSNA--ERLP---- 144
Query: 222 HPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q GK
Sbjct: 145 ---DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGK 201
Query: 280 LEWFLGDTENAIKSMTEAVEILRITHG 306
L+ G A+K++ A +I+R+THG
Sbjct: 202 LQLHQGMFPQAMKNLRLAFDIMRVTHG 228
>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=HSKM-B; AltName: Full=Histone methyltransferase
SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
AltName: Full=SET and MYND domain-containing protein 2
gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
Length = 433
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 149/319 (46%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L +P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C ++ A +E R D +RD +VR +
Sbjct: 261 TCECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVI 302
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S+ ++ +V ++ + + + +
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+
Sbjct: 353 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 145/319 (45%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L +P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C KD + +R D +VR +
Sbjct: 261 TCECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVI 302
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S+ ++ +V ++ + + + +
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+
Sbjct: 353 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431
>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
cuniculus]
Length = 394
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 150/319 (47%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + I+ L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 104 NEKKDLIQSDISALHHFYSKHLEFPDH--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 161
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R + L++ Y F
Sbjct: 162 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNERLRDSYFF 221
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C ++ A +E R D +RD +VR +
Sbjct: 222 TCECLECTTKD-----KDKAKVEIRRLSDPP-KAEAIRD------------MVRYARNVI 263
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ L C QE + + ++ +V ++ + + + +
Sbjct: 264 EEFRRAKHYKSPPELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLY 313
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 314 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLENKAAGEKALRKAIAIM 373
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ +LE
Sbjct: 374 EVAHGKDHPYISEIKQELE 392
>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
Length = 433
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 151/319 (47%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKRDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSXIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C ++ A +E R +D +RD +VR +
Sbjct: 261 TCECQECTTKD-----KDKAKVE-IRKLNDPPKAEAIRD------------MVRYARNVI 302
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S ++ +V ++ + + + +
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + +++ +A+ I+
Sbjct: 353 MQDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALRKAIAIM 412
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431
>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
Length = 417
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 145/319 (45%), Gaps = 33/319 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L +P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 127 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 184
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 185 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 244
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C KD + +R D +VR +
Sbjct: 245 TCECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVI 286
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S+ ++ +V ++ + + + +
Sbjct: 287 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 336
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L K++ +A+ I+
Sbjct: 337 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 396
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 397 EVAHGKDHPYISEIKQEME 415
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 147/318 (46%), Gaps = 29/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ IA L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 146 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 205
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAV+ + G EV SYI+ T R L++ Y FTC
Sbjct: 206 ALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 265
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C + D ++ I R +D S +RD ++ K
Sbjct: 266 DCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRDMIKYARN-----VIEEFRRAKHY 314
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
S +L L+L G ++ +V ++ + + + + ++
Sbjct: 315 KSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLYVQ 357
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ +
Sbjct: 358 DWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 417
Query: 304 THGTNSPFMKELILKLEE 321
HG + P++ E+ +LE+
Sbjct: 418 AHGKDHPYISEIKKELED 435
>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
Length = 457
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 142/286 (49%), Gaps = 16/286 (5%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+I E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229
Query: 89 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+ +P +++ ISYI+ + R+ L+E Y F C C +CI D QE +
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283
Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
C + C + + + CQQC +++ +++ ++ +L + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338
Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
+ +EK Q+ + HP +V ++T + + + + W +ALA+ Q +P + + + +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395
Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
+PL+GL Y GK++ + + + A+ + EA IL +THG + +
Sbjct: 396 NPLVGLLYMKLGKIQLYERNYKLAVHHLQEAQCILSVTHGHDHRLL 441
>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
carolinensis]
Length = 437
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 134/285 (47%), Gaps = 30/285 (10%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E
Sbjct: 180 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIEPGDE 239
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCKDDGCSG 155
+ SYI+ T R LK+ Y FTC C CI K+ D ++ R ++
Sbjct: 240 IFTSYIDLLYPTEDRNDRLKDSYFFTCDCRECITKVKDKDKLE-------IRKLNEPPPA 292
Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
+RD ++ K S +L L+L G+
Sbjct: 293 EAVRDMIKYARN-----VIEEFRRAKHYKSPSELLEICELSLEKMGS------------- 334
Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 275
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + + +
Sbjct: 335 ----VFAESNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWL 390
Query: 276 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
G+L L + +K++ +A+ I+ I HG + P++ E+ +LE
Sbjct: 391 KLGRLYMGLENRTAGVKALKKAIAIMEIAHGKDHPYVIEIKKELE 435
>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
Length = 457
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 73/286 (25%), Positives = 142/286 (49%), Gaps = 16/286 (5%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+I E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229
Query: 89 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+ +P +++ ISYI+ + R+ L+E Y F C C +CI D QE +
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283
Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
C + C + + + CQQC +++ +++ ++ +L + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338
Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
+ +EK Q+ + HP +V ++T + + + + W +ALA+ Q +P + + + +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395
Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
+PL+GL + GK++ + + + A+ + EA IL +THG + +
Sbjct: 396 NPLVGLLHMKLGKIQLYERNYKLAVHHLQEAQRILSVTHGHDHRLL 441
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 140/273 (51%), Gaps = 33/273 (12%)
Query: 37 FSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
F ++CN +I + L+ LG ++P +++INH C PN V V G VRA++ + +G
Sbjct: 279 FGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNVVAVSCGPNIFVRAIKPIKEGD 338
Query: 96 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
E+ ISYI+T+ ++ TR+ LK+QY F CTC C + G+ D+++ + ++ + S
Sbjct: 339 ELFISYIDTSATSETRKNILKDQYYFDCTCKMC-ESGEKDELKSAYVMPKEDVSEKR-ST 396
Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN--HQEVVSTYKMI 213
++ +++D +KKI S + + + +A + G QE + +
Sbjct: 397 YIEKNTDIM---------------MKKIESSKKVQAWERVAAQAGGCLLQQENLFDDTHL 441
Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
+KL ++QT ++ IL +D A Y + + Y+++YP++ +G+Q
Sbjct: 442 KKLM----------ILQTCSEVSAILNHYDD---AAQYAERVLAAYEQLYPEYSTQIGMQ 488
Query: 274 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
Y G W L E+AIK + A++ L ITHG
Sbjct: 489 AYRLGVHYWHLQRVEDAIKMLGRALKHLEITHG 521
>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
glaber]
Length = 252
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 138/270 (51%), Gaps = 43/270 (15%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
++ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+
Sbjct: 1 QVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFSGPHLLLRAVRDIEVGEELT 60
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 61 ICYLDMLMTSEERRKQLRDQYCFDCDCFRC----QTQD-KDADMLTG------------- 102
Query: 159 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
D++ + Q E +KKI E+ K L C Q ++S+ E+L
Sbjct: 103 ---DEQVWKGVQ-------ESLKKI-EELKAHWKWEQVLAMC---QSIISSNS--ERLP- 145
Query: 219 KLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q
Sbjct: 146 ------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMK 199
Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHG 306
GKL+ G A+K++ A +I+R+THG
Sbjct: 200 VGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 229
>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
Length = 436
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 145/318 (45%), Gaps = 29/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ IA L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 146 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 205
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAV+ + G EV SYI+ T R L++ Y FTC
Sbjct: 206 ALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 265
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C +++ D + ++ S E ++ +
Sbjct: 266 DCRECT----------------------------MKEKDKEKLKIRKLNDPPSAEAVRDM 297
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ ++ E++ ++ ++ +V ++ + + + + ++
Sbjct: 298 IKYARNVIEEFRRAKHYKPPSELLEICELSLDKMGAVFEDSNVYMLHMMYQAMGVCLYVQ 357
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ +
Sbjct: 358 DWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 417
Query: 304 THGTNSPFMKELILKLEE 321
HG + P++ E+ +LE+
Sbjct: 418 AHGKDHPYISEIKKELED 435
>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
gallopavo]
Length = 575
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 147/318 (46%), Gaps = 29/318 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ IA L + + E N + F+++ CN TI + EL LG+ ++P +
Sbjct: 285 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 344
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAV+ + G EV SYI+ T R L++ Y FTC
Sbjct: 345 ALMNHSCCPNVIVTYKGILAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 404
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C + D ++ I R +D S +RD ++ K
Sbjct: 405 DCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRD-----MIKYARNVIEEFRRAKHY 453
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
S +L L+L G ++ +V ++ + + + + ++
Sbjct: 454 KSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLYVQ 496
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ +
Sbjct: 497 DWEGALRYGQKIIRPYSKYYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 556
Query: 304 THGTNSPFMKELILKLEE 321
HG + P++ E+ +LE+
Sbjct: 557 AHGKDHPYISEIKKELED 574
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 151/319 (47%), Gaps = 16/319 (5%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C ++ A +E R +RD ++ K
Sbjct: 261 TCECRECTTKD-----KDKAKVE-VRKLSSPPQAEAIRD-----MVRYARNVIEEFRRAK 309
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+++ + A E++ ++ ++ ++ +V ++ + + + +
Sbjct: 310 HYKYNLSVWPARGTAPCLTDGVCELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 369
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 370 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 429
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 430 EVAHGKDHPYISEIKQEIE 448
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 150/321 (46%), Gaps = 35/321 (10%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ IA L + L +P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 146 NEKRELIQNDIAALHHFYSKHLDYPDNAA--LVVLFAQVNCNGFTIEDEELSHLGSAIFP 203
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAV+ + G E+ SYI+ T R L++ Y F
Sbjct: 204 DVALMNHSCCPNVIVTYKGTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFF 263
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
+C C C + D ++ I CK +D S ++D ++
Sbjct: 264 SCDCRECTT--KEKDKEKLEI-----CKLNDPPSAETVKDMIKYARN-----VIEEFRRA 311
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
K S +L L+L G ++ +V ++ + + + +
Sbjct: 312 KHYKSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCL 354
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++DW+ AL Y Q I Y + YP + + + G+L L + +K++ A+ I
Sbjct: 355 YVQDWEGALCYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALKNRTAGVKALKRAIAI 414
Query: 301 LRITHGTNSPFMKELILKLEE 321
+ + HG + P++ E+ +LE+
Sbjct: 415 MEVAHGKDHPYISEIKKELED 435
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 151/321 (47%), Gaps = 18/321 (5%)
Query: 18 LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 77
L +L+ + S E+ + F K+ N+ I + E+ +GTG+Y ISII+HSC PNA+
Sbjct: 139 LRSLVEEAAMPSKAELLQIFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNALAT 198
Query: 78 FEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
F+G +R ++ G+EV ISYI+ R+ L+ QY F C C RC
Sbjct: 199 FDGTTIHLRLLEDY-HGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERC---- 253
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
D QE ++ C + C L D K C +C K +++ EV +
Sbjct: 254 --RDEQEQQLMIAGACPNVDCDE-PLSMLDRKVEKCPRCDTA-IKHSAREMFREVTDFTL 309
Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
LA + ++ ++ K Q+ + H ++V ++T + + + ++ W++A+ Y
Sbjct: 310 AQLAKMKDITYMDIC---QLCLKKQENILHYYNVWYLKTLDLAFESAINMQKWEDAIGYG 366
Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
+++ FHPL GL GK++ + A++++ ++ +ILR+THG
Sbjct: 367 LRLKDGFKKYNGPFHPLYGLLLLKIGKIQLYTKHVVEALQNINDSEKILRVTHGEEHDLY 426
Query: 313 KELILKLEEAQAEASYKLSSK 333
K ++ L QA Y +K
Sbjct: 427 KSQLIPL-LCQAAGEYDALNK 446
>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
Length = 459
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 154/317 (48%), Gaps = 19/317 (5%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ + P N E+ + +L N + ++E+ + T +Y +SI +HSC PN
Sbjct: 157 AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 216
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG + A++ + +++ ISYI+ + R+ LKE Y F C C +CI
Sbjct: 217 AVATFEGNELHIHAIEDMECLDWSKIFISYIDLLNTPEERRLDLKEHYYFLCCCSKCI-- 274
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D QE+ + C + C + + + C++C S + ++ + V +
Sbjct: 275 ----DPQETKEMMAALCPNRNCGIGI----NPERTNCKRCDTGISPK-LRNAYNSVMAFT 325
Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
+ L + +V K+ Q + HP +V ++T + + +++ W +AL Y
Sbjct: 326 RTNLEAMKDVAYLDVC---KVCLNKQTGVLHPLNVWHVKTLDAAFEAAIDVGKWTDALDY 382
Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
Q +P Y++ + ++PLLGL + GK++ + A+ + +A+ IL +THG +
Sbjct: 383 GQRLLPGYRKYHGAWNPLLGLLHMKLGKIQLHEQRAKEALPHLEDALRILTVTHGRDHRL 442
Query: 312 MKELILKLE-EAQAEAS 327
+ E + L +A+ EA+
Sbjct: 443 LAEQLYPLLIQARHEAN 459
>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 151/328 (46%), Gaps = 41/328 (12%)
Query: 1 MSDIDEKQLL--LYAQI-----ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 53
+SD+ + Q L L+AQ+ A LI P S+N CN TI + EL
Sbjct: 180 ISDLPDDQALTQLFAQVRSRTGAGGEKLIRGSPPASVN----------CNGFTIEDEELS 229
Query: 54 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E+ SYI+ T R++
Sbjct: 230 HLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGDEIFNSYIDLLYPTEDRKE 289
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
L + Y FTC C C + ++ +S + L + + L
Sbjct: 290 RLLDSYFFTCQCAECTTRSKDEEKMKSTKPKSS-----------LEPEEVQSMVVYARNL 338
Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
+ EE ++ NILS L C +S KM ++ +V ++
Sbjct: 339 I---EEFRRAKHYKNILSFPVSLLEMCE-----LSLDKM-----GSVFADTNVYMLHMMY 385
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
+ + + + ++DW A++Y + I Y YP + + Y G+L L T +K+
Sbjct: 386 QAMGVCLYMQDWDGAMSYGEKIIQPYSVHYPAYSLNVASMYLKLGRLYLGLEKTPQGVKA 445
Query: 294 MTEAVEILRITHGTNSPFMKELILKLEE 321
+ +A+ I+ + HG + ++ E+ ++EE
Sbjct: 446 LKKALSIMEVAHGKDHYYVAEVKREIEE 473
>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
Length = 420
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 154/317 (48%), Gaps = 38/317 (11%)
Query: 16 ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
A L +++ P N E+ + +L N + ++E+ + T +Y +SI +HSC PN
Sbjct: 137 AVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 196
Query: 74 AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
AV FEG + A++ +P +++ ISYI+ + R++ LK+ Y F C C +C
Sbjct: 197 AVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC--- 253
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D +E+ + C + CS + + ++ C++C + S + ++ +E+ L+
Sbjct: 254 ---RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALT 305
Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
K L E ++ Y +++ +T + + +E+ W+EAL Y
Sbjct: 306 KHNL------------------EAMKDVAY----LDVCKTLDAACEAAIEVGKWEEALNY 343
Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
Q +P +Q + Q++PL+GL + GK++ F + A + EA IL +THG +
Sbjct: 344 GQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRL 403
Query: 312 MKELILKL-EEAQAEAS 327
+ E + L +A+ EA+
Sbjct: 404 LVEQLYPLIFQARQEAN 420
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 152/313 (48%), Gaps = 53/313 (16%)
Query: 2 SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 56
SDI DEK++ + + ++ L ++SI AE + ++ N+ I + ++ +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 114
G+Y SI++HSC PNAV FEG ++R + +P +++ ISYI+ +T R++
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
L+ Y F C C +C +ES ++E C +
Sbjct: 240 LQNSYYFWCNCKKC---------EESEPMVEAAACPN----------------------- 267
Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
R+ EI + + ++ + K +A VST M K Q+ + HP ++ +QT +
Sbjct: 268 -RTFYEISDLTA-YHLQNMKNIAYLD-------VST--MCLKKQEGVLHPLNIQHVQTLQ 316
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
+ L+ W+EA +Y + I Y Y +FHP G+ Y + GKL+ +L + + AI++
Sbjct: 317 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKNXKQAIET 376
Query: 294 MTEAVEILRITHG 306
+ +A IL ITHG
Sbjct: 377 LRKASAILTITHG 389
>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
Length = 432
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 147/334 (44%), Gaps = 61/334 (18%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ IA L + E+ S ++ FS++ACN TI + EL +GT +YP +
Sbjct: 142 NEKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIEDDELSHMGTAVYPDV 201
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
++INHSCLP+ ++ F G LA VRAVQ + G EVLISYI+ T R L+E Y FTC
Sbjct: 202 ALINHSCLPSVIVTFNGTLAQVRAVQDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTC 261
Query: 124 TCPRC-------IKL---GQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCG 172
C C +KL Q D I+ I R + F ++ + + C
Sbjct: 262 NCLECQNKQKDKVKLKVRKQSDPIEPQVISNMTRYAKNTIREFRAMKSTKTPSELLEMCE 321
Query: 173 LVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---KLYHPFSVN 227
+S EE+ + S V +L A+ C Q+ + EK+ + K YHP+S+N
Sbjct: 322 --QSLEEMGAVFEDSNVYVLHMMYQAMGVCIYMQDADGALRYGEKVARYYSKRYHPYSLN 379
Query: 228 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 287
+ KL +I M LG
Sbjct: 380 VSSLYLKLGRIYMA------------------------------------------LGRH 397
Query: 288 ENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
I ++T+A+ I+ + HG + ++K++ +L +
Sbjct: 398 SAGINALTKAMAIMEVAHGKDHQYLKDIRKELSQ 431
>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 442
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/307 (26%), Positives = 144/307 (46%), Gaps = 28/307 (9%)
Query: 31 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
+E+ F K+ N TI N +L +G GLY +S+++HSC P+A ++F G AV+R ++
Sbjct: 142 SELLTIFGKVMVNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDAFVLFNGTKAVLRPLKQ 201
Query: 91 --VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
+ + I+Y + T R+ LK+Q+ FTC C C+ D++ RC
Sbjct: 202 YITAYDSSLRIAYCDLLDLTSMRRNQLKQQFFFTCECSACL------DLEREKTARSVRC 255
Query: 149 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEI---KKIASEVNILSKKTLALTSCGN 202
+ G+ D ++ C QCG V E + +++ EV L ++
Sbjct: 256 R-HCVDGYCPLDVNENSLVCWQCGATSEVHVDEAVHLMQQVEWEVKRLQER--------- 305
Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
+ + T K + + ++ L Q +++ ++ ++ A Y + TI ++R
Sbjct: 306 RNDSICTAKDLYNEASIILSSLNIPLCQLADRIASWGIDNGNFSLAADYIEKTISCFKRF 365
Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDT-ENAIKSMTEAVEILRITHGTN---SPFMKELILK 318
YP HP L LQY+ GKL T E A + + +A+E L +HG S EL+ K
Sbjct: 366 YPTAHPSLSLQYFKAGKLHSLNDLTLELAAERLEQAIESLHASHGVQHSLSAEAHELLCK 425
Query: 319 LEEAQAE 325
+++ + +
Sbjct: 426 VQQQRRD 432
>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 154/330 (46%), Gaps = 55/330 (16%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
D+D+++LLL A +A +NL+ + S++E+A S+ N +IC+ L LG G +P+
Sbjct: 138 DMDDQELLLLAALAQKINLVDE--SFSMDEMARMLSRFRNNNFSICDELLLELGAGCFPL 195
Query: 63 ISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
++INHSC PN + F + + RA++ + G E+ +Y++ A RQ+ L+ +Y
Sbjct: 196 GAMINHSCDPNCAVTFVPKTLDMEFRAMKPIKSGEEITQTYVDIALPRRERQQRLQRKYH 255
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
FTC CPRC + +QE L+ Y D+D G ++ R +E
Sbjct: 256 FTCGCPRCSQ-----PLQEPGSLDAY------------LDADIDGVPQERWTEERQQE-- 296
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
+ + + + Q + H SV +QT + M
Sbjct: 297 ------------------------QCIDALQNLADRQSTILHCHSVARLQTLATIFSAEM 332
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL-----EWFLGDTENAIKS-- 293
E +EA+ Y + + Y+RVY HP+ GL +T G L + G ++ KS
Sbjct: 333 ERGSVEEAIGYGERMLEFYRRVYNPNHPMTGLHVFTLGDLYGQQAQAGTGAEDSKQKSAE 392
Query: 294 -MTEAVEILRITHGTNSPFMKELILKLEEA 322
+ EA IL+ITHG + F+K L +LE A
Sbjct: 393 YLAEAQRILQITHGKDHRFVKMLGDRLETA 422
>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
Length = 430
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 122/257 (47%), Gaps = 53/257 (20%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR----LAVVRAVQHVPKG 94
KL N +ICN E+ +G+G+Y S+ NHSC+PN ++F+ + ++ + +G
Sbjct: 182 KLQRNTFSICNEEMNAIGSGIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEG 241
Query: 95 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
+ I+Y++ T RQK LKEQY FTCTCPRC L +++ E+
Sbjct: 242 TPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC--LNSNEEVNENV------------- 286
Query: 155 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMI 213
+++ IA+E +K L N+ Q +VS +
Sbjct: 287 -----------------------DKMISIANE----HRKNQNLQEAVNYFQRIVSKKRSS 319
Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
L ++Y ++N L + ++L D+K+A + ++ +++ YP F+PLLGLQ
Sbjct: 320 PSLFHEIYMGIALN------SLTHLNVDLSDFKKAYEFGIESLEYFEKYYPPFYPLLGLQ 373
Query: 274 YYTCGKLEWFLGDTENA 290
Y TC KL +L + A
Sbjct: 374 YLTCAKLASYLEENGQA 390
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 145/330 (43%), Gaps = 55/330 (16%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
D+++++L+L A +A + L+ S++E+ S+ N +IC+ L G G +P+
Sbjct: 136 DMEDQELMLLAALAQKLELVD--GSYSMDEMLRMLSRFRNNNFSICDELLLEQGAGCFPL 193
Query: 63 ISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
++INHSC PN + F + + RA++ + G E+ +Y++ A R + L+ +Y
Sbjct: 194 GAMINHSCDPNCAITFVPKTLEMEFRAMRPIKAGEEITQTYVDVALPRRERHERLQRKYH 253
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
F C C RC +QES L+ + D+D G +Q R E
Sbjct: 254 FNCACSRCSV-----PLQESGSLDAF------------LDADIDGVPKEQWSQERKDE-- 294
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
+ + + + + Q + H S+ +QT + M
Sbjct: 295 ------------------------QCIDALQKLAERQSNILHRDSIARLQTLATIFSAEM 330
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL-----GDTENAIKS-- 293
E +EA+ Y + + YQRVY HP+ GL +T G L L G + KS
Sbjct: 331 ERGSVEEAVVYGEKMLEFYQRVYNANHPMTGLHLFTLGDLYAQLAQVGTGPENSKAKSSE 390
Query: 294 -MTEAVEILRITHGTNSPFMKELILKLEEA 322
+TEA IL+IT G F+K L +L+ A
Sbjct: 391 YLTEARRILQITQGKEHRFVKMLADRLQAA 420
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 148/324 (45%), Gaps = 42/324 (12%)
Query: 1 MSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRP 54
++D+ E +L L I N ++ WP S +++I+ F + CN ++ + L+
Sbjct: 131 ITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQA 187
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G GL+P + ++NH C PN ++ +R++ + +G E+ ++Y++ + RQ+
Sbjct: 188 VGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDYMNLSEERQRL 247
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
LK QY F CTC C + I++ L G R+ D G+
Sbjct: 248 LKTQYFFDCTCEHCK-----NKIKDDIKLGG-------------REVD---------GVK 280
Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKLY--HPFSVNLMQT 231
S+E++K+ + +K G++ EVV + IEK + L H + + + T
Sbjct: 281 PSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVLAETHIYLLRMWST 340
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
++ L L D A Y + + Y ++YP + LG+ G + W GD E A
Sbjct: 341 MSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAGVIHWQAGDIEVAH 397
Query: 292 KSMTEAVEILRITHGTNSPFMKEL 315
+ +A IL +THG P K+L
Sbjct: 398 GMVCKAYAILMVTHGPTHPITKDL 421
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 144/313 (46%), Gaps = 37/313 (11%)
Query: 13 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
A + + + L +P+ + + E +++ CN TI + EL LG+ L+P ++++NHSC P
Sbjct: 154 AALHHFYSRHLDFPDNAA--LTELIAQVNCNGFTIEDEELSHLGSALFPDVALMNHSCSP 211
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++ ++G +A VRAVQ + E+ SYI+ T R + LK+ Y F C C C
Sbjct: 212 NVIVTYKGTVAEVRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTS-- 269
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
+ KD+ K Q+ + +EEIK++ I ++
Sbjct: 270 --------------KSKDEA-----------KMEIRQKLSIPPEEEEIKQMV----IYAR 300
Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSVNLMQTREKLIKILMELEDWKEA 248
+ H + S I +L + ++ +V ++ + + + + ++DW A
Sbjct: 301 NVIEEFRRAKHYKTPSELLEICELSMEKMGAIFAETNVYMLHMMYQAMGVCLYMQDWDGA 360
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
+ Y + I Y YP + + Y G+L L +K++ +A+ I+ I HG +
Sbjct: 361 MKYGEKIIHPYSVHYPPYSLNVASMYLKLGRLYLGLEKRTQGVKALKKALAIMDIAHGKD 420
Query: 309 SPFMKELILKLEE 321
P++ E+ ++EE
Sbjct: 421 HPYIDEIKKEMEE 433
>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
niloticus]
Length = 440
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 140/312 (44%), Gaps = 29/312 (9%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ + A IA L + ++ + E+ FS++ACN TI + EL LGT +YP +
Sbjct: 150 NEKREMTEADIAGLHRFYSKHLDVPDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPDM 209
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
++INHSCLP+ ++ F+G A VRAVQ + G EVLISYI+ T R L+E Y F C
Sbjct: 210 ALINHSCLPSVIVTFKGTSAEVRAVQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFIC 269
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
C C Q D + + + + ++R + +R E K
Sbjct: 270 DCQECK--SQSKDKAKLKVRKQRDSIEPDVINNMVRYARKA---------IREFREFKHT 318
Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ +L +L G ++ +V ++ + + I M ++
Sbjct: 319 KTPSELLEMCEQSLEEMG-----------------AVFDDSNVYMLHMMYQAMGICMYMQ 361
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
D A+ Y + + Y +YP + + Y G+L L ++ +A+ I+ +
Sbjct: 362 DLDGAIRYGEKLLKPYSHLYPPYSLNVSSVYLKLGRLYLGLERQSACTSALKKAMAIMEV 421
Query: 304 THGTNSPFMKEL 315
HG + +++EL
Sbjct: 422 AHGKDHFYIEEL 433
>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
Length = 425
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 138/302 (45%), Gaps = 30/302 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
SI E+ + K++ N I + L +G G+Y S++ HSC PNAV FEG ++
Sbjct: 148 SIAELISIYGKMSTNRLNIQDEWLNVIGFGIYLGASVVGHSCKPNAVATFEGTTIII--- 204
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
+ ISY++ S R++ L Y F C C RC K E ++ E C
Sbjct: 205 --------IKISYVDLIKSKKDRREELYSLYHFWCDCERCEK--------EESLTEAAAC 248
Query: 149 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
+ C D+D C++C V+ KE +++ + + +++ N +
Sbjct: 249 PNSSCDSPCSIDADK----CEKCNTRISVKFKETFQEVVDLTDYHFEHIISILF-FNDLD 303
Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 265
S K I + QK + H F+V+L+ T K+ L+ WK+ Y + Y Y +
Sbjct: 304 TFSWIKEILEKQKNVMHKFNVHLICTLRKIHCAAWNLKCWKDVEFYGKKLEAGYLFYYGK 363
Query: 266 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
+P +G Y T G LG + A K++ +A +L+ITHG +KE+ LE + +
Sbjct: 364 VYPPIGTLYLTLGIALLHLGKPKEAFKTLNKAYTVLKITHGNKHFLVKEI---LEPLRFD 420
Query: 326 AS 327
AS
Sbjct: 421 AS 422
>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 126/264 (47%), Gaps = 45/264 (17%)
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
+S++NHSC PN V+VFEG ++R V+ +PKG E+ ISYI+ T R+ L+ QY F
Sbjct: 1 MSLLNHSCDPNCVIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60
Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
C C RC LLRD D+ L E ++
Sbjct: 61 CDCQRC----------------------------LLRDKDEDM-------LAGDAEASRE 85
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIKILM 240
+ S V+ L + L S +E ++ + + L + Y P N+ Q + + + +
Sbjct: 86 VESSVSRLEE----LLSQNTAEEALN---LCKTLMNRYYLP-DKNIYQLKILDCAMDASI 137
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+L W+EAL + T+ Y Y +HP+ +Q GKL+ + G A+K++ +A +I
Sbjct: 138 DLGLWEEALHFGLRTLEPYSLYYSNYHPVRAVQLMKVGKLQNYQGLFAEAMKTLKQAFDI 197
Query: 301 LRITHGTNSPFMKELILKLEEAQA 324
+++THG + ++L + + +A
Sbjct: 198 MKVTHGRDHGQTQQLAELMNDCEA 221
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 135/286 (47%), Gaps = 32/286 (11%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEE 233
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
V SYI+ T R LK+ Y F+C C C ++ A LE R D S
Sbjct: 234 VFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECST-----KQKDPAKLE-LRKLSDPPSPQ 287
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
+RD +V K + +L L+L G+
Sbjct: 288 TVRD-----MITYARNVVEEFRRAKHYKTPSELLEICELSLDKMGS-------------- 328
Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
++ +V ++ + + + + ++DW+ AL Y + I Y + YP + + +
Sbjct: 329 ---VFVDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPAYSLNVASMWLK 385
Query: 277 CGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 320
G+L ++G +N I K++ +A+ I+ I HG + ++ E+ +LE
Sbjct: 386 LGRL--YMGLEKNTIGTKALKKALAIMEIAHGPDHYYIAEIKKELE 429
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 142/307 (46%), Gaps = 43/307 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F CTC C K
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-- 268
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
+ KDD FL G+ + + +++ E+ SK
Sbjct: 269 --------------KLKDD---LFL--------------GVKDNPKPSQEVVKEMIQFSK 297
Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
TL S G + EVV + + Q+ ++ ++ +++ + ++L L+ ++EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEA 357
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
Y + + Y ++Y + LG+ G W G+ E + +A IL +THG +
Sbjct: 358 SFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPS 417
Query: 309 SPFMKEL 315
P K+L
Sbjct: 418 HPITKDL 424
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/292 (22%), Positives = 131/292 (44%), Gaps = 28/292 (9%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
E+ E F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +
Sbjct: 172 ELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEI 231
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ SYI+ T R++ L + Y F C C C
Sbjct: 232 NPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT---------------------- 269
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+ D ++ EEIK + + ++ E++ +
Sbjct: 270 ------TKSKDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCE 323
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
+ + ++ +V ++ + + + + ++DW A++Y + + Y YP + +
Sbjct: 324 LSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVA 383
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
Y G+L L +K++ +A+ I+ I HG + ++ E+ ++EE +
Sbjct: 384 SMYLKLGRLYLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 435
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/292 (22%), Positives = 131/292 (44%), Gaps = 28/292 (9%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
E+ E F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +
Sbjct: 161 ELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEI 220
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ SYI+ T R++ L + Y F C C C
Sbjct: 221 NPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT---------------------- 258
Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+ D ++ EEIK + + ++ E++ +
Sbjct: 259 ------TKSKDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCE 312
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
+ + ++ +V ++ + + + + ++DW A++Y + + Y YP + +
Sbjct: 313 LSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVA 372
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
Y G+L L +K++ +A+ I+ I HG + ++ E+ ++EE +
Sbjct: 373 SMYLKLGRLYLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 424
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/325 (25%), Positives = 147/325 (45%), Gaps = 38/325 (11%)
Query: 3 DID-EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLY 60
D+D E++ + A +A L + EI ++ FS++ACN TI + EL LGT +Y
Sbjct: 139 DMDNERRETMEADVAGLHQFFSKHLEIPGHKDLLTLFSQVACNGFTIEDEELSHLGTAVY 198
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
P +++INHSC P+ ++ + G A VRA++ + G EVLISYI+ T R L+E Y
Sbjct: 199 PDVALINHSCRPSVIVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYY 258
Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-E 179
FTC C C ES ++ + K RS+ E
Sbjct: 259 FTCQCEEC----------ESRSMDQAKLK----------------------ARKRSEPIE 286
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLI 236
+ I++ V K N + +M E+ ++ F +V ++ + +
Sbjct: 287 PEAISNMVRYARKCIREFRVFKNTSPASTLLEMCEQSLDEMGAVFDDSNVYMLHMMYQAM 346
Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
+ + ++D A+ Y + Y+++YP + L Y G+L + + I + +
Sbjct: 347 GVCLYMQDPDGAIRYGEKIAKYYRKLYPAYSLNLSSLYLKLGRLYFGMERNAECIGMLKK 406
Query: 297 AVEILRITHGTNSPFMKELILKLEE 321
A I+ +THG ++ EL +++E
Sbjct: 407 AKAIMEVTHGKEHFYLTELDRQMKE 431
>gi|384246814|gb|EIE20303.1| hypothetical protein COCSUDRAFT_48675 [Coccomyxa subellipsoidea
C-169]
Length = 377
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 43/90 (47%), Positives = 59/90 (65%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++A ++ ACN HTIC+ ELRP+G G+YP +++NHSC PN + F G V RAVQH+
Sbjct: 22 DVAYLLARFACNNHTICDDELRPIGVGIYPQGAMLNHSCTPNTMQSFHGSRIVFRAVQHI 81
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLF 121
P GAE I+Y+E A + R AL+ Y F
Sbjct: 82 PAGAEATIAYVELAATRAERSAALRANYFF 111
>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 454
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 139/316 (43%), Gaps = 39/316 (12%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
IA ++++ NA TI L PLG L P + +NHSC PNA ++ +G +R ++ +
Sbjct: 121 IAAMYARVMSNALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIR 180
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
K E+ ISYI+T RQ+ L+ ++ FTC C +C QE A L+
Sbjct: 181 KDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLLPV 231
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
S F+L K Q EE+ K+++E I K + L SC
Sbjct: 232 DSKFVLDAEAKKAMAQTQEQTFAVYEELHKLSTEHVIYGLKQI-LASC------------ 278
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLL 270
+ + Y + + R+ LI L+ + +++A A C + ++YP FHP+
Sbjct: 279 ---YESRFYPIYRQPYAEARDVLIVNLLSMGKFQDAWAQCAKRYKYILPKLYPVPFHPVR 335
Query: 271 GLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILK 318
+Q + L +L TE + + + + +++ ++HG N+ F K + K
Sbjct: 336 VVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKGK 395
Query: 319 LEEAQAEASYKLSSKD 334
EE E + + D
Sbjct: 396 AEEMIEELKRSVGNPD 411
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 138/307 (44%), Gaps = 43/307 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++F +RA+ + G E+ +SYI+ + R++ LK+QY F CTC C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-G 269
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
DD+ FL G+ + +++ E+ SK
Sbjct: 270 LKDDL------------------FL--------------GVKEDPKPSQEVVKEMTQFSK 297
Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
TL S G + EVV + + Q+ ++ ++ ++ + ++L L+ + EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYTLRMLSVVSEVLSYLQAFGEA 357
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
Y + + Y ++Y + LG+ G W G+ E + +A IL +THG +
Sbjct: 358 AHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPS 417
Query: 309 SPFMKEL 315
P K+L
Sbjct: 418 HPITKDL 424
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 128/284 (45%), Gaps = 28/284 (9%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ F+G +A +RAVQ + G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAGDE 233
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
V SYI+ T R L + Y FTC C C ++ A LE + D
Sbjct: 234 VFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECST-----KQKDPAKLEIRKLSDPPSHQT 288
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
+ K +V K + +L L+L G+
Sbjct: 289 V------KDMIKYARNIVEEFRRAKHYKTPSELLEMCELSLDKMGS-------------- 328
Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
++ +V ++ + + + + L++W AL Y + I Y + YP + + +
Sbjct: 329 ---VFVDSNVYMLHMMYQAMGVCLYLQEWDGALKYGEKIIKPYSKHYPAYSLNVASMWLK 385
Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
G+L L T K++ +A+ I++I HG + ++ E+ +LE
Sbjct: 386 LGRLYMGLEKTTIGTKALKKALAIMQIAHGPDHHYIAEIKKELE 429
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/291 (21%), Positives = 131/291 (45%), Gaps = 28/291 (9%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+ E F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAV+ +
Sbjct: 173 LTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVKEIN 232
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
G E+ SYI+ T R++ L + Y FTC C C
Sbjct: 233 PGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECTS---------------------- 270
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
R D ++ EEI+ + + ++ E++ ++
Sbjct: 271 ------RSKDKAKMEIRKLNPPPEPEEIRSMVRYAKNVIEEFRRAKHYKTPSELLEMCEL 324
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
+ ++ +V ++ + + + + ++DW A++Y + + Y YP + +
Sbjct: 325 SLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVQPYSVHYPAYSLNVAS 384
Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
Y G+L L +K++ +A+ ++ + HG + ++ E+ ++EE +
Sbjct: 385 MYLKLGRLYLGLEKKTQGVKALKKALAVMEVAHGKDHHYVAEVKREIEEQK 435
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 138/305 (45%), Gaps = 34/305 (11%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + WP S+ I+ F + CN T+ + L+ +G G++P ++++NH C
Sbjct: 149 ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLALVNHDC 208
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
PN ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F CTC C K
Sbjct: 209 WPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQK 268
Query: 131 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
G DD+ FL + K S+E +K++
Sbjct: 269 -GLKDDL------------------FLGVKDNPKP----------SQEVVKEMTEFCKDT 299
Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
+K S G + EVV + + Q+ ++ ++ ++ + ++L L+ + EA
Sbjct: 300 LEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFGEAAD 359
Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
Y + + Y ++Y + LG+ G W G+ E + +A IL +THG + P
Sbjct: 360 YAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHP 419
Query: 311 FMKEL 315
K+L
Sbjct: 420 ITKDL 424
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 35/303 (11%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F C+C C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-G 269
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
DD+ FL D K S+E +K++ +
Sbjct: 270 LKDDL------------------FLAAKEDPKP----------SQEVVKEMIQFSKDTLE 301
Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
K S G + EVV + + Q+ ++ ++ +++ ++L L+ ++EA Y
Sbjct: 302 KIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLSYLQAYEEASHYA 361
Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
+ + Y ++Y + LG+ G W G E + +A IL +THG + P
Sbjct: 362 RRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPIT 421
Query: 313 KEL 315
K+L
Sbjct: 422 KDL 424
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 35/303 (11%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 146 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F C+C C K G
Sbjct: 206 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-G 264
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
DD+ FL D K S+E +K++ +
Sbjct: 265 LKDDL------------------FLAAKEDPKP----------SQEVVKEMIQFSKDTLE 296
Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
K S G + EVV + + Q+ ++ ++ +++ ++L L+ ++EA Y
Sbjct: 297 KIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLSYLQAYEEASHYA 356
Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
+ + Y ++Y + LG+ G W G E + +A IL +THG + P
Sbjct: 357 RRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPIT 416
Query: 313 KEL 315
K+L
Sbjct: 417 KDL 419
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/307 (26%), Positives = 141/307 (45%), Gaps = 43/307 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
N ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F C+C C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQK-G 269
Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
DD+ FL D K S+E +K E+ SK
Sbjct: 270 LKDDL------------------FLAVKEDPKP----------SQEVVK----EMTQFSK 297
Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
TL S G + EVV + + Q+ ++ ++ +++ + ++L L+ ++EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIVSEVLSYLQAFEEA 357
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
Y + + Y ++Y + LG+ G W G E + +A IL +THG +
Sbjct: 358 SHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPS 417
Query: 309 SPFMKEL 315
P K+L
Sbjct: 418 HPITKDL 424
>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
Length = 241
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 43/260 (16%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F +L CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E
Sbjct: 1 FPQLICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIQAGEE 60
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
+ I Y++ ++ R++ L+ QY F C C RC
Sbjct: 61 LTICYLDMLMTSAERREQLRNQYCFDCDCARC---------------------------- 92
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
L +D D T + +E +KKI E+ K L C Q ++S+ E+L
Sbjct: 93 LTQDKDADMLTGDEKVWKEVQESLKKI-EELKAEWKWEQVLALC---QAIISSNS--ERL 146
Query: 217 QKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q
Sbjct: 147 P-------DINIYQLKVLDCAMDACIRLGLLEEALFYGIRTMEPYRIFFPGCHPVRGVQV 199
Query: 275 YTCGKLEWFLGDTENAIKSM 294
GKL+ G A++++
Sbjct: 200 MKVGKLQLHQGMFPQAMRNL 219
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 151/337 (44%), Gaps = 55/337 (16%)
Query: 1 MSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRP 54
++D+ E +L L I N ++ WP S +++I+ F + CN ++ + L+
Sbjct: 131 ITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQA 187
Query: 55 LGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISY 101
+G GL+P + ++NH C PN V++ G + V R++ + +G E+ ++Y
Sbjct: 188 VGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAY 247
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
++ + RQ+ LK QY F CTC C + I++ L G R+
Sbjct: 248 VDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLGG-------------REV 289
Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKL 220
D G+ S+E++K+ + +K G++ EVV + IEK + L
Sbjct: 290 D---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVL 340
Query: 221 Y--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
H + + + T ++ L L D A Y + + Y ++YP + LG+ G
Sbjct: 341 AETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAG 397
Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ W GD E A + +A IL +THG P K+L
Sbjct: 398 VIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 434
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 153/342 (44%), Gaps = 47/342 (13%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGT 57
D EK LL+ ++ + WP + + I+ FS ++CN T+ + L+ +G
Sbjct: 135 DEAEKGLLM----EDVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGV 190
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
G++P + + NH C PN ++F +RA+ + KG E+ +SY++ T R+ LK+
Sbjct: 191 GIFPNLCLANHDCWPNCTVIFNNGKIELRALGKINKGEELTVSYVDFLNLTEDRKAQLKK 250
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C K + A+L ++D +DK
Sbjct: 251 QYYFDCTCEHCTKKTK------DALL------------LAVKDGEDK------------P 280
Query: 178 EEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
EE ++ EV SK T+ S G + +VV + K Q+ ++ ++ +++
Sbjct: 281 EE--RVVKEVIQYSKDTMEKIEKARSEGLYNDVVKLCRDCLKRQEPIFADTNIYMLRILS 338
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
++L L+ + +A + + Y ++Y Q + LG+ G W G E
Sbjct: 339 IYSEVLSYLQMFDDAAENAKKMVDGYLKIYHQNNAQLGMAVMRAGVTHWHAGMIEVGHGM 398
Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
+ +A IL ITHG P K+ L++ AQ E ++ ++E
Sbjct: 399 ICKAFAILLITHGPLHPITKD--LEVMRAQTEMELRMFKENE 438
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 146/308 (47%), Gaps = 30/308 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
S E+A F+ + CNA + + ++ +G G+YP IS++NH C PN V + G VRA
Sbjct: 182 SDEEMAHLFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNNGPRLEVRA 241
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
++ + G E+ ISYI++ +T R++ LK QY F C C C K + ++++ + + E +
Sbjct: 242 LRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEELENLKHALVSEDIK 301
Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
+ ++ + S D ++ + + +++ ++ + TLA C V+
Sbjct: 302 ---EESVKYINQFSKD---MLKRIHKTKQNQNWERMCNQ----TLGTLAQQDC-----VI 346
Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
+ +++ L H V R+ + AL Y + Y+++ P H
Sbjct: 347 ADTNVLKIAM--LNHAVEVQSFLRRQ------------EPALEYAERVAAAYEKLLPPVH 392
Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 327
P LG+ G ++W + TE A+ ++ A I+ THG + KEL+ + + + EA
Sbjct: 393 PTLGMYLMRLGVIQWQIQKTEAAVATLGRAASIISKTHGDDHGMFKELLGLIHQCKMEAH 452
Query: 328 YKLSSKDE 335
++ E
Sbjct: 453 MSKGAQRE 460
>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 732
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 124/295 (42%), Gaps = 36/295 (12%)
Query: 3 DIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
D+ ++ Y +AN VN +L + ++ +AE SK+ N+H + ++ R LGTG+Y
Sbjct: 342 DVSAEKRATYVGMANAVNSMLPPGPARMDVDALAEVMSKVHVNSHGVVDAAGRALGTGVY 401
Query: 61 PVISIINHSCLPNAVLVF------------------------EGRLAVVRAVQHVPKGAE 96
P ++ NHSC PNAV+ F G VR V V +G E
Sbjct: 402 PPAALFNHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGSTNNGARLTVRCVSPVEEGEE 461
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD-DGCSG 155
+ I+Y E + TR++AL E+ F C C RC + A L G+RC C G
Sbjct: 462 ICIAYAEVYATRETRREALWEKKAFACECRRCADAAS---ARRDAPLGGWRCPRLSACDG 518
Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC--GNHQEVVSTYKMI 213
+ D G +C CG ++ K+ A+E + + ALT+ +H VS +
Sbjct: 519 VV----PDAGDSCLTCGAKCARARSKRDAAERRWIGRHAAALTALRENDHATAVSIACEV 574
Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHP 268
+ V + R L+ L+ W+ A + +R HP
Sbjct: 575 VRESDDALCDAHVIRHECRLVLMDAHAALKQWRHAADVSRDVADAMRRFVNVNHP 629
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 141/320 (44%), Gaps = 43/320 (13%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + + I+ F + CNA T+ + L+ +G G++P + NH C PN ++F
Sbjct: 158 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 217
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+RA+ + G E+ +SY++ + RQK LK+QY F CTC C K
Sbjct: 218 NGKIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCKK--------- 268
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+ KDD +++ D K S E +K EV SK TL +
Sbjct: 269 -------KIKDD--LMLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKIN 305
Query: 200 C----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
G + EVV + K Q+ + ++ L++ ++L L+ ++EA Y +
Sbjct: 306 KARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRM 365
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ Y ++Y + LG+ G W G E + +A IL ITHG + P K+
Sbjct: 366 VDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD- 424
Query: 316 ILKLEEAQAEASYKLSSKDE 335
L++ Q E ++ ++E
Sbjct: 425 -LEVMRVQTEMELRMFQQNE 443
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 77/136 (56%), Gaps = 5/136 (3%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
EIS E E FSK++CN+ IC+ E++ +GTG++P +NHSC PN+V VF G ++
Sbjct: 152 EISPPEGLELFSKISCNSFAICDGEMQAIGTGIFPNAVCLNHSCAPNSVAVFNGTNIYIK 211
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
A++ +P G E+ ISYI+ TRQ+ L+ Q+ F C C RC+ D + +L
Sbjct: 212 ALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQCHRCL-----DASDNNKMLTSL 266
Query: 147 RCKDDGCSGFLLRDSD 162
C + C + + D
Sbjct: 267 ICPNKSCEAIVYQTFD 282
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG-DTENAIKSMTEAVEIL 301
+DW +AL Y + VY YP++HP LG+ Y GKL D E A+K + +A IL
Sbjct: 288 QDWTKALEYATRNLEVYTWFYPKYHPCLGVHLYKIGKLLAVTHQDLELAVKRLEQARRIL 347
Query: 302 RITHGTNSPFMKELILKLEEAQAE 325
+THG + P ++EL L +A E
Sbjct: 348 EVTHGQSHPLVQELCEYLCQASEE 371
>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
mutus]
Length = 221
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/258 (28%), Positives = 123/258 (47%), Gaps = 39/258 (15%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F ++ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ V G E
Sbjct: 1 FLQVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 60
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
+ I Y++ ++ R+K L+ QY F C C RC Q D
Sbjct: 61 LTICYLDMLMTSEERRKQLRGQYCFDCDCFRC----QTQD-------------------- 96
Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
+D+D ++ E++ K EV KK L + ++V++ + I
Sbjct: 97 --KDAD----------MLTGDEQVWK---EVQESLKKIEDLKAHWKWEQVLAMCQSIISS 141
Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
+ ++ ++ + + + L +EAL Y T+ Y+ YP HP+ G+Q
Sbjct: 142 NAERLPDINIYQLKVLDCAMDACLNLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMK 201
Query: 277 CGKLEWFLGDTENAIKSM 294
GKL+ G A+K++
Sbjct: 202 VGKLQLHQGMFPQAMKNL 219
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 67/97 (69%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 141/320 (44%), Gaps = 43/320 (13%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + + I+ F + CNA T+ + L+ +G G++P + NH C PN ++F
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+RA+ + G E+ +SY++ + R+K LK+QY F CTC C K
Sbjct: 213 NGKIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK--------- 263
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+ KDD +++ D K S E +K EV SK TL +
Sbjct: 264 -------KIKDD--LMLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKIN 300
Query: 200 C----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
G + EVV + K Q+ + ++ L++ ++L L+ ++EA Y +
Sbjct: 301 KARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRM 360
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ Y ++Y + LG+ G W G E + +A IL ITHG + P K+
Sbjct: 361 VEGYMKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD- 419
Query: 316 ILKLEEAQAEASYKLSSKDE 335
L++ Q E ++ ++E
Sbjct: 420 -LEVMRVQTEMELRMFQQNE 438
>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
Length = 909
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 23/299 (7%)
Query: 33 IAENFSKLACNAHTIC------------NSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
+ + +L CNA I + ++R L T ++P +++NHSC PN + F+G
Sbjct: 606 LCHHMQQLRCNAQAITTLQEQDSVSLLEDKQVR-LATAVFPTEALLNHSCRPNVFVSFQG 664
Query: 81 RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
+ +VRAV H+ G E+L Y AG + RQ ALKEQY F+C+C C + Q +
Sbjct: 665 KTLIVRAVSHIKPGEELLHCYGPHAGRMVYGERQAALKEQYFFSCSCDACQE--QVGNPN 722
Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ--QCGLVRSKEEIKKIASEVNILSKKTLA 196
+ Y+C + L DK C C ++IK + ++ L ++ A
Sbjct: 723 TVDMFSAYKCPVCNNAAKL----QDKKLVCTSPNCDAQGDTDDIKTTSKKIQDLFVQSSA 778
Query: 197 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 256
G QE VS K LQ K+ HP + ++ +T + L + D K+++ + + ++
Sbjct: 779 FLEEGQIQEAVSGLKQCLVLQWKILHPSNKDIARTHDALARCYATSGDCKKSVEHLKKSL 838
Query: 257 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + L + + +L++ + ++ A+ I +G + P KEL
Sbjct: 839 STVELQFGACSVELAHELHKLAQLQFNGQQVAECLDTIERALTIFGCHYGDHHPKSKEL 897
>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
Length = 374
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 122/254 (48%), Gaps = 18/254 (7%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+A S+++ N+ TI + + LG GLY S ++HSC PNA FEG VR ++ +
Sbjct: 95 MAFYLSQISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANFEGTKVCVRCIEPIE 154
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
++ ISY+ T TR+K L ++YLF CTC C+ DI + + ++C G
Sbjct: 155 DEKDLRISYVSPLDDTATRRKNLLQKYLFECTCEACL------DIDQDQMKFSFKCVRGG 208
Query: 153 CSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVN------ILSKKTLALTSCGNHQE 205
C G + R D++ F C CG + +++EI+ + V +SKK L + G + E
Sbjct: 209 CKGHMTRMQDNR-FRCDYCGEKQVTEDEIQTLNVAVEKTRRMMDISKK---LKAQGRYAE 264
Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL-TIPVYQRVYP 264
++ K H ++ +++ R+ L+ ++W C + + Y+ Y
Sbjct: 265 IMVAGFACLKDAVDFLHSDNLLIVELRDCLLNACRMTQNWSVEAIQCGIANLEPYRSFYG 324
Query: 265 QFHPLLGLQYYTCG 278
+ HP L + Y G
Sbjct: 325 RHHPALAILLYDIG 338
>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
Length = 265
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 73/120 (60%), Gaps = 1/120 (0%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
++D+++ LYAQ+A + + +W +A+ S L+CN HT+C+ ELRPLG LYP
Sbjct: 146 ELDDRRKQLYAQMAVVTWWVARWGTWPGFRTVAQLLSLLSCNCHTVCDEELRPLGVALYP 205
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
+++NHSC P+ V F G +RA++ + G E+ I+YIE A + R++ L + Y F
Sbjct: 206 TGALVNHSCSPSTVQTFHGSTLELRALRQLAPGDEITIAYIELAATRQERRETLADSYFF 265
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 145/316 (45%), Gaps = 35/316 (11%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + + I+ FS ++CN T+ + L+ +G ++P + + NH C PN ++F
Sbjct: 153 WPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVAIFPNLCLTNHDCWPNCTVIFN 212
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+RA+ + KG E+ +SY++ T R + LK+QY F CTC C + D
Sbjct: 213 NGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCT--NKTKDALL 270
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
A+ +G +D +++ S D + E+I+K SE
Sbjct: 271 LAVNDGESKPEDRVVKEVIQYSKD------------TMEKIEKARSE------------- 305
Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
G++ +VV + K Q+ ++ ++ +++ ++L L+ +++A + I Y
Sbjct: 306 -GSYHQVVKLCRECLKRQEPIFADTNIYMLRILSIYSEVLSYLQMFQDAAENAKKMIDGY 364
Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
++Y + LG+ G +W G E + +A IL ITHG + P K+ L++
Sbjct: 365 MKIYHPNNAQLGMAIMRAGVTQWHAGMIEVGHGMICKAFAILLITHGPSHPITKD--LEM 422
Query: 320 EEAQAEASYKLSSKDE 335
+Q E ++ ++E
Sbjct: 423 MRSQTEMELRMFKENE 438
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 131/300 (43%), Gaps = 42/300 (14%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+RA+ + +G E+ +SYI+ + R+K LK+QY F CTC C K G DD+
Sbjct: 218 NGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCEK-GIKDDL-- 274
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL---- 195
FL G+ + + + EV SK L
Sbjct: 275 ----------------FL--------------GVKDEPKPSQDVVKEVIQFSKDALEKID 304
Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
S G + EVV + + Q+ + ++ L++ ++L L+ ++EA Y +
Sbjct: 305 KARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYLQSFEEASDYAKKM 364
Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ Y ++Y + LG+ G W G E + +A IL +THG P K+L
Sbjct: 365 VDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILLVTHGPTHPITKDL 424
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 136/305 (44%), Gaps = 34/305 (11%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S I++I+ F + CN ++ + L+ +G GL+P + ++NH C
Sbjct: 145 DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCW 204
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
PN ++ +RA+ + +G E+ ++Y++ + RQ+ LK QY F CTC C
Sbjct: 205 PNCTVILNHGKIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN- 263
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
R KDD G R+ D G+ S+E++K+ +
Sbjct: 264 ---------------RIKDDIKIGG--REED---------GVKPSEEQVKEATDYCFQML 297
Query: 192 KKTLALTSCGNHQEVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
+K G++ EVV + IEK + L + L++ + ++ L+ + +A
Sbjct: 298 EKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAAD 356
Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
Y + + Y ++Y + LG+ G W G E + +A IL ITHG P
Sbjct: 357 YARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKAYAILMITHGPIHP 416
Query: 311 FMKEL 315
K+L
Sbjct: 417 ITKDL 421
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 145/333 (43%), Gaps = 58/333 (17%)
Query: 5 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 59
+E+Q L + + +N WP + S+ I+ F + CNA T+ + L+ +G G+
Sbjct: 141 EEEQKDLRVDVDSFLNF---WPAQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197
Query: 60 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 106
+P + ++NH C PN ++F G V+++ H + +G E+ +SY++
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYVDFLN 257
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
+ R+K LK+QY F CTC C K G DD+ FL +D K
Sbjct: 258 VSQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKADPK-- 296
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 222
+ + EV SK+TL S G + EVV + + Q+ +
Sbjct: 297 ------------PSQDVVKEVTQFSKETLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344
Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
++++++ ++L L+ + EA Y + + Y ++Y + LG+ G W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFHEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404
Query: 283 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
G E + +A IL +THG P K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437
>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
Length = 532
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 139/316 (43%), Gaps = 39/316 (12%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
IA ++++ NA TI L PLG L P + +NHSC PNA ++ +G +R ++ +
Sbjct: 199 IAAMYARVMSNALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIR 258
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
K E+ ISYI+T RQ+ L+ ++ FTC C +C QE A L+
Sbjct: 259 KDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLVPA 309
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
S F+L + Q E++K+++E I K + L SC
Sbjct: 310 DSKFVLDPEAKQAMAQTQEQTFALYGELQKLSTEHVIHGLKQI-LASC------------ 356
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLL 270
+ + Y + + R+ L+ L+ + +++A A C + ++YP FHP+
Sbjct: 357 ---YESRFYPMYRQPYAEARDVLVVNLLSVGKFQDAWAQCAKRYKYILPKLYPVPFHPVR 413
Query: 271 GLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILK 318
+Q + L +L TE + + + + +++ ++HG N+ F K + K
Sbjct: 414 VVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKEK 473
Query: 319 LEEAQAEASYKLSSKD 334
EE E + + D
Sbjct: 474 AEEMIEELKRSVGNPD 489
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 145/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F ++CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL D K S+E
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEA 301
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+K++ LSK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 302 VKEMIQ----LSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSTV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V++ LQ WP + S+ I+ F + CN T+ + L+ +G G++P ++++NH C P
Sbjct: 151 VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL G+ + +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 144/333 (43%), Gaps = 58/333 (17%)
Query: 5 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 59
+E+Q L + + +N WP + S+ I+ F + CNA T+ + L+ +G G+
Sbjct: 141 EEEQKDLRMDVDSFLNF---WPPQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197
Query: 60 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 106
+P + ++NH C PN ++F G V+++ H + +G E+ +SYI+
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
+ R+K LK+QY F CTC C K G DD+ FL D K
Sbjct: 258 ISQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKEDPK-- 296
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 222
+ + EV SK TL S G + EVV + + Q+ +
Sbjct: 297 ------------PSQDVVKEVTQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344
Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
++++++ ++L L+ ++EA Y + + Y ++Y + LG+ G W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFQEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404
Query: 283 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
G E + +A IL +THG P K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 135/297 (45%), Gaps = 38/297 (12%)
Query: 24 QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
Q + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 161 QSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGK 220
Query: 83 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
+RA+ + +G E+ +SYI+ + R++ LK+QY F C C C K
Sbjct: 221 IELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCACEHCQK------------ 268
Query: 143 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALT 198
+ KDD FL G+ + + +++ E+ SK TL
Sbjct: 269 ----KLKDD---LFL--------------GVKDNPKPSQEVVKEMIRFSKDTLEKIDKAR 307
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
S G + EVV + + Q+ ++ ++ ++ + ++L L+ ++EA Y + +
Sbjct: 308 SEGLYHEVVKLCRECLQKQEPVFADTNIYTLRMLSIVSEVLSYLQAFEEASFYARRMVDG 367
Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
Y ++Y + LG+ G W G+ E + +A IL +THG + P K+L
Sbjct: 368 YMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 424
>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
Length = 452
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 139/321 (43%), Gaps = 69/321 (21%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSELRPLGT 57
D K++ + I ++N L NEI N + ++ N+ I N E++ +GT
Sbjct: 180 DTKRMEYFMTICGVLNEYLS------NEILPNSVELLGIYGRMCINSFNILNGEMQAIGT 233
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKAL 115
G+Y SI++HSC PNAV F+G ++ Q +PK + ISYI+ S R+K L
Sbjct: 234 GIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLMNSKSHRKKEL 293
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
K++Y F C CPRC + DDI C + K TCQ
Sbjct: 294 KDRYYFDCDCPRC----KNDDID--------------CYHYAA-----KCPTCQ------ 324
Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
K + ++V+ L C K+ Q+ L+H + + +
Sbjct: 325 -----KPVIAKVD--------LDVC----------KLCIAKQENLFHDLCLVRSKIMDAA 361
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KS 293
++L+ W+ A+ Y I Y+ Y + HPL + K+ + T +++ K
Sbjct: 362 FDSAIQLQLWEAAIQYGIPLIQAYKLWYGEEHPLTAILLLKLFKITLLISTTNDSVAQKY 421
Query: 294 MTEAVEILRITHGTNSPFMKE 314
EA +IL ITHG +S F ++
Sbjct: 422 YEEAAKILEITHGRDSSFYRQ 442
>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 528
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 79/146 (54%), Gaps = 12/146 (8%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQ 89
+I + F + CN TICN+ PLG GLYP +++NHSC PN ++ F+ + AV+ ++
Sbjct: 202 QITKLFCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIE 261
Query: 90 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY--- 146
+ G E+ ISY++ T R+K L+ Y F CTC RC QE A LEG+
Sbjct: 262 TIHSGQELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCT------SSQEDAFLEGFLCT 315
Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCG 172
+C++ S +L D + F C C
Sbjct: 316 QCRNTSESARIL-DEKKRIFRCCVCA 340
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 142/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F ++CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEDRKKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL D K S+E
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ ++ + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSTVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P ++++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL G+ + +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 142/317 (44%), Gaps = 50/317 (15%)
Query: 19 VNLILQ-WPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP SI I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+A+ H + +G E+ +SYI+ T R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELTVSYIDFLNVTEERRKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSKE 178
F C C C K + KDD FL ++D+ + + SKE
Sbjct: 271 YFDCACEHCQK----------------KLKDD---LFLAVKDNPKPSQEVVKEMIQFSKE 311
Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
+ KIA S G + EVV + + Q+ ++ ++ +++ + ++
Sbjct: 312 TLDKIAK-----------ARSEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIVSEV 360
Query: 239 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
L L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 361 LSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY 420
Query: 299 EILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 421 AILLVTHGPSHPITKDL 437
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 71/322 (22%), Positives = 138/322 (42%), Gaps = 52/322 (16%)
Query: 16 ANLVNLILQWPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ WP+ + + ++ F ++CN T+ + L+ +G GL+P + ++NH C
Sbjct: 149 ADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDC 208
Query: 71 LPNAVLVFE-------------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
PN ++ R +RA++ + G E+ +SY++ + RQ+ L++
Sbjct: 209 WPNCTVILNHGDQSALDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQ 268
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F C C C+ G D++ + + +DG K
Sbjct: 269 QYYFDCKCEHCVN-GTKDEL-----MTAVKPTEDG------------------------K 298
Query: 178 EEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
+ + ++ S + L A + GN EV+ + + Q ++ +V++++
Sbjct: 299 QPSADVVKQLTDFSLQALVKIEAARAQGNFHEVIRICRECLEKQDPVFADTNVHVLRVLS 358
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
++L L+ ++EA Y Q + Y ++Y + LG+ G W G E A
Sbjct: 359 TASEVLSFLQQFQEAAGYAQRMVDGYMKLYHPNNAQLGMAIMRAGVTHWHAGLIEAAHGL 418
Query: 294 MTEAVEILRITHGTNSPFMKEL 315
+ A IL ITHG + P ++L
Sbjct: 419 ICRAYGILMITHGPHHPISRDL 440
>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
Length = 382
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 134/305 (43%), Gaps = 34/305 (11%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S I++I+ F + CN T+ + L+ +G GL+P + ++NH C
Sbjct: 51 DIHNFLDYWPRNSKQHTIDDISHLFGVINCNGFTVSDQRGLQAVGVGLFPNLCLVNHDCW 110
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
PN ++ +R++ + +G E+ ++Y++ + R++ LK QY F C C C
Sbjct: 111 PNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCEHCRN- 169
Query: 132 GQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
R KDD +G R+ D GL S+E++K+ +
Sbjct: 170 ---------------RTKDDLKLAG---REVD---------GLKPSEEQVKEATDYCFQM 202
Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
+K G++ EVV + + + + L++ L ++ L+ + +A
Sbjct: 203 LEKMDKARLDGDYHEVVKICRDAIDRTEPVLADTHIYLLRMWSTLSEVQAYLQYFNDAAT 262
Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
Y + + Y ++YP + LG+ G W G E + +A IL +THG P
Sbjct: 263 YSRKMVEGYMKLYPPNNAALGMAAMRAGVNHWQAGLIEVGHGMVCKAYAILLVTHGPTHP 322
Query: 311 FMKEL 315
K+L
Sbjct: 323 ITKDL 327
>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
Length = 239
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 43/257 (16%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP IS++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+K L++QY F C C RC +D D T +
Sbjct: 61 RKQLRDQYCFECDCFRCQT----------------------------QDKDADMLTGDEQ 92
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
+E +KKI E+ K L C Q ++S+ E+L +N+ Q
Sbjct: 93 VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 139
Query: 232 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+ + + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G
Sbjct: 140 KVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199
Query: 290 AIKSMTEAVEILRITHG 306
A+K++ A +I+R+THG
Sbjct: 200 AMKNLRLAFDIMRVTHG 216
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 141/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P ++++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNISEERKKQLKRQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL D K S+E
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQEV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ ++ + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 362 SYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNWHAGNIEVGHGMICKAYA 421
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYMLRILSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
Length = 374
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 127/283 (44%), Gaps = 41/283 (14%)
Query: 35 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
E F K+ N +TI N E P+GTGLY SI++HSC PNA VF+GR +RA +
Sbjct: 112 EIFGKMVVNTYTITNYEECPIGTGLYIGPSILDHSCTPNAQAVFDGRTLRLRAAADIECD 171
Query: 95 A--EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
+ + +SYI+ TR + L+E+Y F+C CPRC D+ QE
Sbjct: 172 SVDGIRVSYIDLKELRRTRIEELRERYYFSCNCPRC----SADEAQEY------------ 215
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
T + L +E+ + + + +++ L G + ++S++K+
Sbjct: 216 -------------LTEKSPSLTAKAKELLREFMQAEVHAREDLVRLR-GQAEALLSSHKL 261
Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
+ + E L K +E+ D+ AL Y Y+R Y + P+ G+
Sbjct: 262 PATDAAR---------SEALELLSKSCLEMGDFDAALPYYLAKEETYRRCYGPYSPVYGV 312
Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+T KL + A++ +A E++ ++HG N ++L
Sbjct: 313 LLFTIAKLYHHKVLLQEALQYFEKASEVVTVSHGRNHRLYRDL 355
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
Length = 239
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKV 141
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 292 KSMTEAVEILRITHG 306
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 145 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 204
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 205 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 264
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 265 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 291
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 292 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 351
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 352 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 411
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 412 KAYAILLVTHGPSHPITKDL 431
>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
Length = 437
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 146/323 (45%), Gaps = 39/323 (12%)
Query: 1 MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
+ D+D EK+ + A IA L L +P+ + FS++ CN T+ + EL LG
Sbjct: 139 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 196
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T R + L+
Sbjct: 197 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 256
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR- 175
+ Y F+C C C D ++ S R + D ++D +VR
Sbjct: 257 DMYYFSCDCKECTT-KSMDVVKMSV-----RKRSDEIGEKEIKD------------MVRY 298
Query: 176 ---SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
S E ++ + NIL + L C + +ST ++ +V ++
Sbjct: 299 ARNSMENFRRAKQDKNILC-GSFILEMCELSIDKMST----------VFDDSNVYILHMM 347
Query: 233 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 292
+ + I + ED++ A+ Y + I + +YP + + + G+L L I
Sbjct: 348 YQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDRKLAGID 407
Query: 293 SMTEAVEILRITHGTNSPFMKEL 315
+ +A+ I+ + HG + ++ EL
Sbjct: 408 AFQKALTIMEVVHGKDHTYVTEL 430
>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
Length = 239
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKV 141
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 292 KSMTEAVEILRITHG 306
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 142/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH+C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHNCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL D K S+E
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ ++ + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRLLSTVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 120/265 (45%), Gaps = 33/265 (12%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C +D D ++ E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIR 292
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + ++ + E++ ++ ++ ++ +V ++ + + + +
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQF 266
++DW+ AL Y Q I Y + YP +
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVY 377
>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
dendrobatidis JAM81]
Length = 503
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 144/315 (45%), Gaps = 50/315 (15%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+ ++++ + + CN TI +S L P+G G YP+ S+ NH C PN + +F+G V++
Sbjct: 228 NLPVSDLIHHLGRFRCNNFTIIDSNLFPVGEGTYPLASLFNHDCWPNCIAIFDGSRVVIQ 287
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
++ + KG E+ ISYI+ +R+ +L+ +Y F C C C+ + L
Sbjct: 288 TIRDIAKGDELCISYIDPILDHDSRRMSLETKYCFNCQCSVCMS-------ESCTPLHSP 340
Query: 147 RCKDDGCSGFLLRDSDDKGF-------TCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
K D L + D G+ TC L +S +E ++ A + I KT+AL S
Sbjct: 341 TTKKDAL--LSLDEKSDVGWRLLKWFHTC----LKKSDDEFER-AQQDQI--SKTIALVS 391
Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
+ VV+ ++ F +LM+ +DW++A + Y
Sbjct: 392 ---KRAVVAEIGNATQM-------FQSHLMR------------QDWEQAYKSGLHLLGQY 429
Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK--ELIL 317
YP+++PL+ + + W G T+ I + + L IT+G+ MK LI
Sbjct: 430 LLRYPRYYPLVSQHMFFVAQCMWNSGQTDETIFLLGVIKKCLEITYGS---VMKSHHLIG 486
Query: 318 KLEEAQAEASYKLSS 332
+++ A+A Y+ S+
Sbjct: 487 QVDSLLAQALYEQSA 501
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ L+ +G G++P + ++NH+C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRGLQAVGVGIFPNLGLVNHNCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELTVSYIDFLNFSEERKKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL G+ + +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSKGLYHEVVKLCRECLEKQEPVFADTNIYTLRLLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNWHAGNIEVGHSMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
Length = 614
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
GL+P +SI+NHSC PN EG + VRAVQ +PKGAE+ +SYI S R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 170
F CTC RC+ + + LEG C GC G LR + +D+ + C
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412
Query: 171 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 224
C V E ++ +++K + +A + + K++E + HP
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472
Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
V L L+ + D +E L C+L + ++V P L Y+ G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 142/320 (44%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL G+ +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDDPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+ + EV SK TL + G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQDVVKEVVQFSKDTLEKIDKARTEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASYYARKMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 140/320 (43%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL G+ +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKEDPKP 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV + + Q+ ++ ++ ++ +
Sbjct: 298 SQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYTLRMLSVV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ + EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 34/318 (10%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGT 57
DI E L + ++ N + WP S ++ ++ + CN + + L+ +G
Sbjct: 133 DISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGV 190
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
GL+P + ++NH C PN ++ +RA+ + G EV ++Y++ + RQ+ LK+
Sbjct: 191 GLFPNLCLVNHDCWPNCTVILNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQ 250
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C + + D A ++G + +
Sbjct: 251 QYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE--------------------------- 283
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
E++K++ +K N+ EVV + + Q+ + + ++ L +
Sbjct: 284 EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADTHIYYLRVCCTLSE 343
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+L L+ + EA Y + + Y ++Y + LG+ G W G E A + +A
Sbjct: 344 VLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQAGFIEVAHGMICKA 403
Query: 298 VEILRITHGTNSPFMKEL 315
IL ITHG P K+L
Sbjct: 404 FAILMITHGPTHPITKDL 421
>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
Length = 239
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTHD----------------------KDAD--------- 85
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKV 141
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 292 KSMTEAVEILRITHG 306
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
Length = 614
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
GL+P +SI+NHSC PN EG + VRAVQ +PKGAE+ +SYI S R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 170
F CTC RC+ + + LEG C GC G LR + +D+ + C
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412
Query: 171 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 224
C + E ++ +++K + +A + + K++E + HP
Sbjct: 413 CSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472
Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
V L L+ + D +E L C+L + ++V P L Y+ G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528
>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
norvegicus]
Length = 239
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 120/257 (46%), Gaps = 43/257 (16%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+K L++QY F C C RC +D D T +
Sbjct: 61 RKQLRDQYCFECDCIRCPT----------------------------QDKDADMLTGDEQ 92
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
+E +KKI E+ K L C Q ++++ E+L +N+ Q
Sbjct: 93 IWKEVQESLKKI-EELKAHWKWEQVLALC---QAIINSNS--ERLP-------DINIYQL 139
Query: 232 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+ + + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G
Sbjct: 140 KVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199
Query: 290 AIKSMTEAVEILRITHG 306
A+K++ A +I+++THG
Sbjct: 200 AMKNLRLAFDIMKVTHG 216
>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
carolinensis]
Length = 334
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 67/106 (63%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
++ E F+K+ACN +I N E++ +G GLYP +S++N+SC PN +VFEG +R+++ +
Sbjct: 200 DVFETFAKVACNCFSISNGEMQDVGVGLYPSMSLLNNSCDPNCAIVFEGPELHLRSIREI 259
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 137
+G E+ I Y+ET T RQ+ LK QY F C C C + DI
Sbjct: 260 QEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDI 305
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 140/318 (44%), Gaps = 47/318 (14%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + WP S+ I+ F + CN T+ + L+ +G G++P ++++NH C
Sbjct: 149 ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLALVNHDC 208
Query: 71 LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
PN ++F G V+++ H + +G E+ +SYI+ + R++ LK+
Sbjct: 209 WPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERRRQLKK 268
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C K G DD+ FL + K S+
Sbjct: 269 QYYFDCTCEHCQK-GLKDDL------------------FLGVKDNPKP----------SQ 299
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
E +K++ +K S G + EVV + + Q+ ++ ++ ++ + +
Sbjct: 300 EVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSE 359
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+L L+ + EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 360 VLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKA 419
Query: 298 VEILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 420 YAILLVTHGPSHPITKDL 437
>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 41/286 (14%)
Query: 44 AHTIC--NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
+HT+ + EL LGT +YP +++INHSCLP+ ++ + G A VRAV+ + G EVLISY
Sbjct: 109 SHTLFSEDEELSHLGTAVYPDVALINHSCLPSVIVTYNGTSADVRAVRDMNPGDEVLISY 168
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
I+ T R L+E Y FTC C C G + D + LR
Sbjct: 169 IDVLYPTEDRNTRLRESYYFTCQCQEC----------------GSQSSDQ--AKLKLRKR 210
Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL- 220
D E + I + V K N ST ++E ++ L
Sbjct: 211 SDPA-------------EAEVINTMVRYARKCIREFRVFKNSNTPAST--LLEMCEQSLD 255
Query: 221 -----YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 275
+ +V ++ + + + + ++D AL Y + Y+++YP + L Y
Sbjct: 256 EMGAVFDESNVYMLHMMYQAMGVCLYMQDPDGALRYGEKIAKYYRKLYPAYSLNLSSLYL 315
Query: 276 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
G+L + + I + +A I+ +THG N ++ EL +++E
Sbjct: 316 KLGRLYFGMERNSECISVLKKAKAIMEVTHGKNHFYVTELDRQMKE 361
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 146 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 206 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 265
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C+C C K G DD+ FL D K S+E
Sbjct: 266 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 296
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ +++ ++L
Sbjct: 297 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVL 356
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G E + +A
Sbjct: 357 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 416
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 417 ILLVTHGPSHPITKDL 432
>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
Length = 456
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 117 VDTFLQYWPLQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 176
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 177 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 236
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C+C C K G DD+ FL D K S+E
Sbjct: 237 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 267
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ +++ ++L
Sbjct: 268 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFTDTNLYVLRLLSIASEVL 327
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G E + +A
Sbjct: 328 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 387
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 388 ILLVTHGPSHPITKDL 403
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C+C C K G DD+ FL D K S+E
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ +++ ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G E + +A
Sbjct: 362 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 421
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437
>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
Length = 230
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 74/125 (59%), Gaps = 1/125 (0%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
+EK+ L+ + IA L + + E N+ + F+++ CN TI + EL LG+ ++P +
Sbjct: 59 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 118
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC
Sbjct: 119 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 178
Query: 124 TCPRC 128
C C
Sbjct: 179 ECQEC 183
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C+C C K G DD+ FL D K S+E
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FLAVKEDPKP----------SQEV 301
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+K E+ SK TL S G + EVV + + Q+ ++ ++ +++ +
Sbjct: 302 VK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIV 357
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
++L L+ ++EA Y + + Y ++Y + LG+ G W G E +
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMIC 417
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 143/333 (42%), Gaps = 56/333 (16%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + + I+ F + CNA T+ + L+ +G G++P + NH C PN ++F
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212
Query: 80 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
G VR++ H + G E+ +SY++ + RQK LK+QY F CTC
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCE 272
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
C K + KDD +++ D K S E +K E
Sbjct: 273 HCKK----------------KIKDD--LMLAVKEGDKKP----------SAETVK----E 300
Query: 187 VNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
V SK TL + G + EVV + K Q+ + ++ L++ ++L L
Sbjct: 301 VIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYL 360
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+ ++EA Y + + Y ++Y + LG+ G W G E + +A IL
Sbjct: 361 QMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILL 420
Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
ITHG + P K+ L++ Q E ++ ++E
Sbjct: 421 ITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 65/108 (60%)
Query: 23 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
+Q I +E+ + N T+ + E+ +G G YPV S+ NH+C PNAV+VF+G L
Sbjct: 178 VQQTSIDEDELVHYLNVFKSNNFTLDDQEMFAVGEGTYPVASLFNHTCRPNAVIVFDGAL 237
Query: 83 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A +RA+ + E+ ISYI+ A + R++AL+E+Y F C C RC +
Sbjct: 238 AEIRAIDTIEPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTR 285
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 124/283 (43%), Gaps = 19/283 (6%)
Query: 41 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
+ N+ I E LG GLY +S INHSC PN + F G A VR+V +P E+ I+
Sbjct: 191 SANSFGISGVEGNNLGVGLYDTVSYINHSCAPNCSITFSGVYARVRSVHDLPPNQELTIA 250
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 156
YI+ R+ LK Q++F C C RC + + DD + C GC
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRCER--ERDDDPLLTLC----CPGSGCDQIIKYQ 304
Query: 157 LLRDSDDKGF-TCQQC--GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 213
L DD F ++C G + + +V K+ L + N M
Sbjct: 305 LTAPPDDTAFEVTKKCERGHLDDPATFDSWSRKV----KRVLGIIGEANKARRPVQGDMA 360
Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
++L P + + ++ + ++EAL + + I VY + PL G++
Sbjct: 361 SF--ERLLPPTNYLMHIAYTGVLDQAIGTGQFQEALQWARKVILVYDALNSPMDPLRGIE 418
Query: 274 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
KL+ D ++++ +AV+++R+THG +S ++++
Sbjct: 419 LLRVAKLQALARDGRGFLETIQKAVDVIRLTHGEDSVVYQQVL 461
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 147/325 (45%), Gaps = 35/325 (10%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + + WP + + I+ F ++CN T+ + L+ +G G++P + NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
PN +VF +RA+ + G E+ +SY++ + R++ LK+QY F CTC C K
Sbjct: 209 WPNCTVVFNNGKIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK 268
Query: 131 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
+ KDD +S T ++ + SK+ ++KI
Sbjct: 269 ----------------QLKDDLMLAVKAGESKPSADTVKEV-IQLSKDTLEKI------- 304
Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
+K + G++ EVV + + Q+ + ++ L++ ++L L+ ++EA
Sbjct: 305 NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAD 360
Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
Y + + Y ++Y + LG+ G W G E + +A IL ITHG + P
Sbjct: 361 YAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHP 420
Query: 311 FMKELILKLEEAQAEASYKLSSKDE 335
K+ L++ Q E ++ ++E
Sbjct: 421 ITKD--LEVMRVQTEMELRMFQQNE 443
>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 2-like [Xenopus (Silurana) tropicalis]
Length = 430
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 70/293 (23%), Positives = 135/293 (46%), Gaps = 46/293 (15%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDE 233
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC-------IKLGQFDDIQESAILEGYRCK 149
V+ SYI+ T R L + C C C +K+ + + + ++ K
Sbjct: 234 VITSYIDLLYPTEDRNDRLIDSXFCNCDCRECSTKQKGRLKIAEISLKIHPPLDKXWKNK 293
Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 209
+ CS LL L T+A S E++
Sbjct: 294 ERSCSVPLL------------------------------YLFPLTIAPPS-----ELLEI 318
Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
++ + ++ +V ++ + + + + ++DW+ AL Y + I Y + YP +
Sbjct: 319 CELSLEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPPYSLN 378
Query: 270 LGLQYYTCGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 320
+ + G+L ++G + AI K++ +A+ I+ I HG + ++ E+ +LE
Sbjct: 379 VASMWLKLGRL--YMGLEKKAIGTKALKKAIAIMDIAHGPDHHYIAEIKKELE 429
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 143/333 (42%), Gaps = 56/333 (16%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + + I+ F + CNA T+ + L+ +G G++P + NH C PN ++F
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212
Query: 80 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
G VR++ H + G E+ +SY++ + R+K LK+QY F CTC
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCE 272
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
C K + KDD +++ D K S E +K E
Sbjct: 273 HCKK----------------KIKDD--LMLAVKEGDKKP----------SAETVK----E 300
Query: 187 VNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
V SK TL + G + EVV + K Q+ + ++ L++ ++L L
Sbjct: 301 VIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYL 360
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+ ++EA Y + + Y ++Y + LG+ G W G E + +A IL
Sbjct: 361 QMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILL 420
Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
ITHG + P K+ L++ Q E ++ ++E
Sbjct: 421 ITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451
>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
leucogenys]
Length = 512
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 139/310 (44%), Gaps = 51/310 (16%)
Query: 24 QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GR 81
Q + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F G
Sbjct: 183 QSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGN 242
Query: 82 LAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
V+++ H + +G E+ +SYI+ + R++ LK+QY F CTC C
Sbjct: 243 HEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 302
Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
K + KDD FL G+ + + +++ E+
Sbjct: 303 K----------------KLKDD---LFL--------------GVKDNPKPSQEVVKEMIQ 329
Query: 190 LSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
SK TL S G + EVV + + Q+ ++ ++ +++ + ++L L+ +
Sbjct: 330 FSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYMLRMLSIVSEVLSYLQAF 389
Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
+EA Y + + Y ++Y + LG+ G W G+ E + +A IL +TH
Sbjct: 390 EEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTH 449
Query: 306 GTNSPFMKEL 315
G + P K+L
Sbjct: 450 GPSHPITKDL 459
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L++P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRC 128
TC C C
Sbjct: 261 TCECQEC 267
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 80/318 (25%), Positives = 139/318 (43%), Gaps = 47/318 (14%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S I++I+ F + CN ++ + L+ +G GL+P + ++NH C
Sbjct: 145 DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCW 204
Query: 72 PN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
PN V++ G + V RA+ + +G E+ ++Y++ + RQ+ LK Q
Sbjct: 205 PNCTVILNHGNQSAVNTMFHSQRRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQ 264
Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
Y F CTC C R KDD G R+ D G+ S+E
Sbjct: 265 YFFDCTCEHCKN----------------RIKDDIKIGG--REED---------GVKPSEE 297
Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIK 237
++K+ + +K G++ EVV + IEK + L + L++ + +
Sbjct: 298 QVKEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HIYLLRMWSTMSE 356
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ L+ + +A Y + + Y ++Y + LG+ G W G E + +A
Sbjct: 357 VQAYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKA 416
Query: 298 VEILRITHGTNSPFMKEL 315
IL ITHG P K+L
Sbjct: 417 YAILMITHGPIHPITKDL 434
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 132/297 (44%), Gaps = 49/297 (16%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
S++ I+ F + CN T+ + L+ +G GL+P + ++NH C PN ++ +RA
Sbjct: 166 SVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRA 225
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ +P+G E+ +SY++ + RQK LKE++ F CTC C K
Sbjct: 226 LGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK----------------H 269
Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NH 203
KDD + +D K S +++K EV SK++L ++
Sbjct: 270 IKDD----LMTAVADSKP----------SADKVK----EVTAFSKESLEKIEKSRVERDY 311
Query: 204 QEVVS-TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPV 258
EV+ Y+ ++K + L NL + R ++ I E+ ++ + A Y +
Sbjct: 312 NEVLKLCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEG 366
Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
Y+++Y + LG+ G W G E + +A IL +THG N K+L
Sbjct: 367 YRKLYHPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 423
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 81/158 (51%), Gaps = 18/158 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
SI + + S+ NA T+ + L P+G + P+I++ NHSC PNAV VF
Sbjct: 273 SIAGLVDLVSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKAN 332
Query: 82 --LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDI 137
+A V ++ +P EV+ISYI+T + RQKALKE Y FTC+C C K G + D
Sbjct: 333 EPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSCSLCEKPRDGAYVDS 392
Query: 138 QESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGLV 174
+ES C + C G L DD C +CG V
Sbjct: 393 RES-----LYCPKN-CGGLCWLPTEDDPLARCAKCGTV 424
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 146/330 (44%), Gaps = 52/330 (15%)
Query: 5 DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPV 62
+E+Q L + + L Q + S+ I+ F + CN T+ + L+ +G G++P
Sbjct: 141 EEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPN 200
Query: 63 ISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTM 109
+ ++NH C PN ++F G V+++ H + +G E+ +SYI+ +
Sbjct: 201 LGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSE 260
Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 169
R++ LK+QY F CTC C K + KDD FL
Sbjct: 261 ERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------------ 289
Query: 170 QCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 225
G+ + + +++ E+ SK TL S G + EVV + + Q+ ++ +
Sbjct: 290 --GVKDNPKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTN 347
Query: 226 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 285
+ ++ + ++L L+ ++EA Y + + Y ++Y + LG+ G W G
Sbjct: 348 IYTLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAG 407
Query: 286 DTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ E + +A IL +THG + P K+L
Sbjct: 408 NIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 471
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRP 54
++D+ E L +I V++ LQ+ + S++EIA F + CN T+ + L+
Sbjct: 136 VADLPEDHL---KRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNGFTLSDQRGLKA 192
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEG-------------RLAVVRAVQHVPKGAEVLISY 101
+G GL+P + ++NH C PN +V R +RA++ + +G E+ +SY
Sbjct: 193 VGVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIELRALRKICEGEELTVSY 252
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
++ ++ RQ+ LKE + F CTC C + + DD+ +A +G G +
Sbjct: 253 VDFLDTSAERQRKLKEHFYFECTCEHCRQHIK-DDLMTAAAADG-----PGGKPSADQVQ 306
Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
+ F+ Q+C + + K EV +L K L EK QK L
Sbjct: 307 EVTAFS-QECLEKIERSLMDKDFQEVRMLCSKCL------------------EK-QKNLL 346
Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
++ ++ +++L L + EA AY + + Y ++Y LG+ G
Sbjct: 347 ADTHLHHLRVLSAAVEVLSYLRCFSEAAAYARRMVQGYTKLYHPNSAQLGIAVMRAGVTH 406
Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
G E A + + +A IL +THG N ++ L+ Q E KL +DE
Sbjct: 407 LQAGMIEMAHELICKAYRILLVTHGPNHSVTRD--LEAMRRQTEVELKLLKQDE 458
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ L++QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C+C C K G DD+ F D K S E
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FQAVKEDPKP----------SPEV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ +++ + ++L
Sbjct: 302 VKEMVQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNLYVLRLLSIVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 362 SYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437
>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
Length = 500
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 126/299 (42%), Gaps = 45/299 (15%)
Query: 32 EIAENFSKLACN-----AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAV 84
E+ + K ACN A I + + G G+YP+ +I+NHSC PN V+ + E
Sbjct: 236 ELLKVLCKFACNNFSHAARQIWDDLIVSHGMGVYPLGAILNHSCKPNCVIYYHPETHEQE 295
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
R ++ + G ++ SYI+ A + TR++ L+ Y F C C C + D+
Sbjct: 296 FRCIEDIQVGEDICHSYIDLAAVSKTRKEKLQSTYYFDCDCQCCKFPEELDN-------- 347
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
K G + D G S+ EI+ S + L + LA GN
Sbjct: 348 ----KLGARDGKVTEKCDRAAELLAAAG---SRTEIEHALSRLKDLDENVLA--DRGN-- 396
Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
++ + + K+++ +EL A+ C+ + Y+ +YP
Sbjct: 397 -------------------VDLDRLSVKSKMLQASIELGMMDSAIRACKQVVEGYRGIYP 437
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
HPLLGLQ YT G L + G E A + E +IL TH S ++ + L EA+
Sbjct: 438 PLHPLLGLQLYTLGNLLFDDGRGEEAADVLQEGQKILLATHDRRSTMVQGITELLAEAR 496
>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 643
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 80/158 (50%), Gaps = 18/158 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
SI + + S+ NA T+ + L P+G + P+I++ NHSC PNAV VF
Sbjct: 240 SIAGLVDLVSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKAN 299
Query: 82 --LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDI 137
+A V ++ +P EV+ISYI+T + RQKALKE Y FTC C C K G + D
Sbjct: 300 EPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCNCSLCEKPRDGAYVDS 359
Query: 138 QESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGLV 174
+ES C + C G L DD C +CG V
Sbjct: 360 RES-----LYCPKN-CGGLCWLPTEDDPLARCAKCGTV 391
>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 765
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 25 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
WP E+A+ ++LA N HT+C+ ELRPLGT LYP ++ NHSC P+ V VF GR
Sbjct: 94 WP--GFKEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVFRGRTLQ 151
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
+RA++ + G EV + Y+E A + R++AL + Y F
Sbjct: 152 LRALRPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188
>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
Length = 489
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 114/260 (43%), Gaps = 38/260 (14%)
Query: 31 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
E+ + + N+ T+ P+G P+ + INH C+PNAV++F GR+ +RA++
Sbjct: 170 GEVRRLYCAVLVNSMTLVTETFDPIGIAFDPLTASINHDCIPNAVMLFNGRILQIRALED 229
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
+P AEVLISYI+ RQ L E+Y FTCTC RC + ++G C
Sbjct: 230 IPANAEVLISYIDNTVPRERRQLELSEKYFFTCTCSRCTG--------PAGPMDGLLC-- 279
Query: 151 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
C G + + C +C ++V IL + + + ++S
Sbjct: 280 -ACGGVGM--PSREAVLCNRCA------RALPATADVEILEQTAWSALDSDKSEAMISAL 330
Query: 211 KMIEK-----LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV--- 262
+ + K L ++ + L+Q + +W+ AL + L +Y R+
Sbjct: 331 RHLHKSPMWPLPRQPLAALHLALVQGE------YIPSGNWERALLHHLL---IYLRLDPV 381
Query: 263 -YPQ-FHPLLGLQYYTCGKL 280
YPQ FHP+ + +T L
Sbjct: 382 FYPQRFHPMRVVHGFTLANL 401
>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
ND90Pr]
Length = 529
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 141/323 (43%), Gaps = 50/323 (15%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+A + ++ NA T+ L PLG + P + +NHSC PNA ++ +G L +R ++ +
Sbjct: 199 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLISIRTLRPIR 258
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
K E+ ISYI+T RQ L+ ++ FTC C +C Q+ A +
Sbjct: 259 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQ-------- 301
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
+D V SKEE++ +AS + + N + V+ K
Sbjct: 302 ---------EDNWLVPANSDFVASKEELEALASNQEQMFTLYEQVQGTPNAEAVIPVLKE 352
Query: 213 IEKLQKK-----LY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP- 264
K ++ LY P++ R++LI L+ + ++EA A+ + ++YP
Sbjct: 353 SLKACRESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAHSAKRYKYILPKLYPV 408
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFM 312
FHP+ +Q + L +L TE + + + + +++ ++HG ++ F
Sbjct: 409 PFHPVRVVQTWQMAMLAAYLAGTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHGQDNAFT 468
Query: 313 KELILKLEEAQAEASYKLSSKDE 335
K ++ K E E + + D+
Sbjct: 469 KSVMEKTREMVEELKRSVGNPDK 491
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 55/313 (17%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217
Query: 80 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
G V+++ H + +G E+ +SYI+ + R+K LK+QY F CTC
Sbjct: 218 NGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCE 277
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
C K G DD+ FL G+ + + + E
Sbjct: 278 HCEK-GIKDDL------------------FL--------------GVKDEPKPSQDVVKE 304
Query: 187 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
V SK L S G + EVV + + Q+ + ++ L++ ++L L
Sbjct: 305 VIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYL 364
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+ ++EA Y + + Y ++Y + LG+ G W G E + +A IL
Sbjct: 365 QSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILL 424
Query: 303 ITHGTNSPFMKEL 315
+THG P K+L
Sbjct: 425 VTHGPTHPITKDL 437
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 130/293 (44%), Gaps = 32/293 (10%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + VR
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGKIEVRT 225
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ + +G E+ ISY++ + RQK LKEQ+ F C+C C + DD+ +A
Sbjct: 226 LGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS-DDLMMAAA----- 279
Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTS----CGN 202
DG D F L R S E+I+ EV+ SK +L G+
Sbjct: 280 ---DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKDSLEKIEKSRLDGD 319
Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
+ EV+ + + Q+ + ++ ++ ++L L + EA + + Y ++
Sbjct: 320 YSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKL 379
Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
Y + LLG+ G W G E + +A IL +THG N ++L
Sbjct: 380 YHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRDL 432
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRC 128
TC C C
Sbjct: 261 TCECREC 267
>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
Length = 434
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 54/329 (16%)
Query: 1 MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
+ D+D EK+ + A IA L L +P+ + FS++ CN T+ + EL LG
Sbjct: 139 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 196
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T R + L+
Sbjct: 197 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 256
Query: 117 EQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
+ Y F+C C C +K+ + D+I E I + R + F
Sbjct: 257 DMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------- 307
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
R+K++ T L C + +ST ++ +V
Sbjct: 308 --------RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNV 338
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
++ + + I + ED++ A+ Y + I + +YP + + + G+L L
Sbjct: 339 YILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDR 398
Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
I + +A+ I+ + HG + ++ EL
Sbjct: 399 KLAGIDAFQKALTIMEVVHGKDHTYVTEL 427
>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
Length = 527
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/274 (25%), Positives = 120/274 (43%), Gaps = 34/274 (12%)
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
++++NHSC PN ++ + G LA VRAVQ + G EV SYI+ T R L++ Y FT
Sbjct: 270 VALMNHSCCPNVIVTYRGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 329
Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 178
C C C G Q+ A LE R D +RD +VR E
Sbjct: 330 CECRECSTKG-----QDQAKLE-IRKLSDPPRAEAVRD------------MVRYARGVIE 371
Query: 179 EIKKIASEVNILSKKTLALTSCGNH------------QEVVSTYKMIEKLQKKLYHPFSV 226
E ++ IL H E++ ++ ++ ++ +V
Sbjct: 372 EFRRAKHYKYILGCSVQGPMPRSPHCEGREAGPGSPPSELLEICELSQEQMSSVFEDSNV 431
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
++ + + + + L+DW+ AL Y Q I Y + YP + + + G+L L +
Sbjct: 432 YMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLEN 491
Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 492 RAAGEKALKKALAIMEVAHGKDHPYISEIRREIE 525
>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
Length = 423
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 54/329 (16%)
Query: 1 MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
+ D+D EK+ + A IA L L +P+ + FS++ CN T+ + EL LG
Sbjct: 128 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 185
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T R + L+
Sbjct: 186 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 245
Query: 117 EQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
+ Y F+C C C +K+ + D+I E I + R + F
Sbjct: 246 DMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------- 296
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
R+K++ T L C + +ST ++ +V
Sbjct: 297 --------RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNV 327
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
++ + + I + ED++ A+ Y + I + +YP + + + G+L L
Sbjct: 328 YILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDR 387
Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
I + +A+ I+ + HG + ++ EL
Sbjct: 388 KLAGIDAFQKALTIMEVVHGKDHTYVTEL 416
>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
Length = 491
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 148/342 (43%), Gaps = 56/342 (16%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + + WP + + I+ F ++CN T+ + L+ +G G++P + NH C
Sbjct: 149 ADVESFLEFWPPHSQQFGMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208
Query: 71 LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
PN ++F G VR++ H + G E+ +SY++ + R+K LK+
Sbjct: 209 WPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPGDELTVSYVDFLNVSEERRKQLKK 268
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C K + KDD +L + +G S
Sbjct: 269 QYYFDCTCEHCKK----------------QIKDD----LMLAVKEGEGKP--------SA 300
Query: 178 EEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
E +K EV SK TL S G + EVV + K Q+ + ++ L++
Sbjct: 301 ETVK----EVIQFSKDTLEKISKARMEGLYHEVVKLCRECLKKQEPVLGDTNIYLLRILS 356
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
++L L+ ++EA Y + + Y ++Y + LG+ G W G E
Sbjct: 357 IASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHGL 416
Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
+ +A IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 417 ICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456
>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 353
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 138/326 (42%), Gaps = 40/326 (12%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLYPV-ISIINHSCLPNAVLVFEGRLAVVRAV 88
+ + E F K++ N I ++EL P G G+Y I NHSCL N + +F+ + +R +
Sbjct: 19 LKNVIEKFIKVSKNTLQIIDNELEPCGLGIYKKPIPYFNHSCLSNCITIFKNQRLYIRTL 78
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP---------------------- 126
+ G E+ ISY++ A R +QY FTCTC
Sbjct: 79 MDIYPGEELTISYLDVAFDRNARLAICADQYFFTCTCKLCKVNIASECHNIFNNDFICTQ 138
Query: 127 --RCIKLGQFDDIQESAILE---GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 177
C K + +I + LE Y K + +L+ S +K T +C L +++
Sbjct: 139 SENCKKFINYMEIVLMSELERKISYLNKSHFKTFPILKKSTEKNETVWKCMLCKNETNDS 198
Query: 178 ------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLM 229
E+ K+ EV L S N + S K+ K+ YH +L
Sbjct: 199 VIKALIEKEKETIKEVEYLDTLFAEKYSYDNKNVLQSLTKIKSKIDDLTDFYHHSKYSLQ 258
Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+ R K++ + ++L+D+K A + + + Y ++ P+ G + GKL FL
Sbjct: 259 KMRAKILYVSIQLQDFKLAYSIATQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSA 318
Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +A + + T+G SP K+L
Sbjct: 319 GLSLIHKAKKNIIKTYGPESPIYKDL 344
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L +P+ + + F+++ CN TI + EL LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260
Query: 122 TCTCPRC 128
TC C C
Sbjct: 261 TCECQEC 267
>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 497
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 65/110 (59%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+S+ E+ + F KL NA T+ +G + P+ S+ NHSC PNA + F+ +R
Sbjct: 178 ELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLR 237
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
A+QH+ +G ++ +SY+E + + RQ L ++Y F C CPRCI+ + D
Sbjct: 238 ALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEIGD 287
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 145/326 (44%), Gaps = 36/326 (11%)
Query: 5 DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPV 62
+E+Q L + + L Q + S+ I+ F + CN T+ + L+ +G G++P
Sbjct: 141 EEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPN 200
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
+ ++NH C PN ++F +RA+ + +G E+ +SYI+ + R+K LK+QY F
Sbjct: 201 LGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFD 260
Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
CTC C K G DD+ FL D K S++ +K+
Sbjct: 261 CTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQDVVKE 291
Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
I +K S G + EVV + + Q ++ ++ +++ + ++L L
Sbjct: 292 IIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQDSVFADTNIYMLRVLSIVSEVLSYL 351
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+ ++EA Y + + Y ++Y + LG+ G W G+ E A++ TE LR
Sbjct: 352 QAFEEASYYARRMVDGYVKLYHPNNAQLGMAVMRAGLTNWHAGNIE-AMRMQTEME--LR 408
Query: 303 ITHGTNSPF--MKELILKLEEAQAEA 326
+ H + M+E L + Q A
Sbjct: 409 LFHQNEFMYRKMREAALNNQPMQVMA 434
>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 517
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 43/327 (13%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
+ I E F K++ N I ++EL P G G Y PV NHSCL N V +F+ + +R
Sbjct: 184 LRNIIEKFIKISKNTLQIIDNELEPAGLGFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ V G E+ ISY++ A TR +QY FTCTC C K+ + Q E
Sbjct: 243 LMDVYPGEELTISYVDIAFDKNTRLAICMDQYFFTCTCKLC-KVNVVSECQNVFNTEFVC 301
Query: 148 CKDDGCSGF------------------------------LLRDSDDKGFTCQQCGLVRSK 177
+ + C + L + ++ + C C
Sbjct: 302 TQSESCKKYVNHMELILISELERKSCYVNKSQFKNSYPVLKKANEQHVWKCSICKNEVHD 361
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNL 228
IK + + K+T L S N + +++ L K YH +L
Sbjct: 362 NVIKSLLEKEKETVKETAHLESLFNEKYTYDNKSVLQSLNKIKSKIDYLTTFYHHTRYSL 421
Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
+ R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 422 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 481
Query: 289 NAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +A + + T+G +SP K+L
Sbjct: 482 EGLSLIHKAKKNIIKTYGPDSPIYKDL 508
>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
Length = 497
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 65/110 (59%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+S+ E+ + F KL NA T+ +G + P+ S+ NHSC PNA + F+ +R
Sbjct: 178 ELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLR 237
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
A+QH+ +G ++ +SY+E + + RQ L ++Y F C CPRCI+ + D
Sbjct: 238 ALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEVGD 287
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 133/296 (44%), Gaps = 34/296 (11%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + ++ I+ + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 153 WPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFN 212
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+RA+ + G E+ +SY++ R++ LK+QY F CTC C K G DD+
Sbjct: 213 NGKIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQK-GIKDDLM- 270
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
A+ E ++ S +++D + SKE I+KI S
Sbjct: 271 LAVKE-----EEKPSPEVVKDV-----------IQFSKEAIEKIDK-----------ARS 303
Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
G ++EVV + + Q+ + ++ +++ L ++L L+ EA Y + Y
Sbjct: 304 EGLYKEVVKLCRACLEKQEPVLGDTNIYMLRILSILSEVLSYLQHVDEAAGYSKRMADGY 363
Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
++Y + LG+ G W G E + +A IL +THG P K+L
Sbjct: 364 LKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 419
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 2/125 (1%)
Query: 32 EIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
EI + F ++ CNA ++ N +G GLYP ++ NH C PN V+ F+GR VR V+
Sbjct: 173 EILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGREMQVRVVRD 232
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
+ E+ +SY+E ST R++ LKE Y F C C RC K E L+G RC +
Sbjct: 233 IEVDEELTVSYVELLQSTKARRRELKESYFFDCECKRC-KAATNGQSNEDWYLDGLRCSN 291
Query: 151 DGCSG 155
C+
Sbjct: 292 KKCAS 296
>gi|444714585|gb|ELW55465.1| N-lysine methyltransferase SMYD2 [Tupaia chinensis]
Length = 494
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 116/264 (43%), Gaps = 34/264 (12%)
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
++++NHSC PN ++ + G LA VRAV + G EV SYI+ T R L++ Y FT
Sbjct: 257 VALMNHSCCPNVIVTYRGTLAEVRAVHEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 316
Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 178
C C C KD + +R +D +VR E
Sbjct: 317 CECHECT------------------TKDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIE 358
Query: 179 EIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
E ++ IL S + L C QE +S+ ++ +V ++ + +
Sbjct: 359 EFRRAKHYKYILYNSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAM 408
Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
+ + ++DW+ AL Y Q I Y + YP + + + G+L L + +++ +
Sbjct: 409 GVCLYMQDWEGALRYGQRIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKK 468
Query: 297 AVEILRITHGTNSPFMKELILKLE 320
A+ I+ + HG + P++ E+ ++E
Sbjct: 469 AIAIMEVAHGKDHPYISEIKQEIE 492
>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
heterostrophus C5]
Length = 508
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 72/321 (22%), Positives = 139/321 (43%), Gaps = 46/321 (14%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+A + ++ NA T+ L PLG + P + +NHSC PNA ++ +G L +R ++ +
Sbjct: 178 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLVSIRTLRPIR 237
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
K E+ ISYI+T RQ L+ ++ FTC C +C Q+ A + G
Sbjct: 238 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQEDNWLVPG 288
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
S F+ + Q + E+++ + + ++ AL +C
Sbjct: 289 NSDFVASKEQVQALASNQEQMFTLYEQVQGMPNAEAVIPVLKEALKACH----------- 337
Query: 213 IEKLQKKLY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ-----F 266
E LY P++ R++LI L+ + ++EA A+ + Y+ + P+ F
Sbjct: 338 -ESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAH---SAKRYKYILPKLYSVPF 389
Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKE 314
HP+ +Q + L +L TE + + + + +++ ++H ++ F K
Sbjct: 390 HPVRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHSQDNAFTKS 449
Query: 315 LILKLEEAQAEASYKLSSKDE 335
++ K E E + + D+
Sbjct: 450 VVEKTREMVEELKRSVGNPDK 470
>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
Length = 456
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 126/291 (43%), Gaps = 31/291 (10%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQH 90
+ + + CN +I + L P G G+YP +I+NHSC PN V+ + G L V+A++
Sbjct: 183 VMDVLQRADCNNFSIWDELLLPRGAGVYPWGAILNHSCEPNCVMTYRGPLHAQAVKALRD 242
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK--LGQFDDIQESAILEGYRC 148
+ G E+ SYI+ T R L +QY F C C + L + D + E E
Sbjct: 243 IAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDCALYLDGALAELDALPEVTAAEMR-- 300
Query: 149 KDDGCSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
G +L + + + Q+ L R+ E +K A +E+V
Sbjct: 301 ----LPGKMLHELECEAEERQRLLSLYRAAETLKDYAWST--------------PEEELV 342
Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILME---LEDWKEALAYCQLTIPVYQRVYP 264
+ L ++L HP +++L +L + E LED AL Q Y VYP
Sbjct: 343 CLLQGYNIL-RRLAHPANISLTSIMTRLQNVATECGRLEDI--ALPVGQHLALAYDHVYP 399
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ +PL GLQYY G + A+ A ++LR TH + + +L
Sbjct: 400 EHNPLCGLQYYRLGDVANLAQKQALALLWHRRAYQVLRTTHDADHWLLTQL 450
>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
Length = 229
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/255 (21%), Positives = 110/255 (43%), Gaps = 28/255 (10%)
Query: 66 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
+NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC C
Sbjct: 1 MNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 60
Query: 126 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 185
C +D D ++ E I+ +
Sbjct: 61 RECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIRDMVR 92
Query: 186 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
+ ++ + E++ ++ ++ ++ +V ++ + + + + ++DW
Sbjct: 93 YARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDW 152
Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ + H
Sbjct: 153 EGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAH 212
Query: 306 GTNSPFMKELILKLE 320
G + P++ E+ ++E
Sbjct: 213 GKDHPYISEIKQEIE 227
>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
Length = 433
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 73/123 (59%), Gaps = 10/123 (8%)
Query: 13 AQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISII 66
A IA L + ++ IN+I ++ F+++ CN TI + EL LG+ ++P ++++
Sbjct: 151 ADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALM 206
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NHSC PN ++ ++G +A VRAVQ + G E+L SYI+ T R++ L + Y FTC C
Sbjct: 207 NHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCS 266
Query: 127 RCI 129
C
Sbjct: 267 ECF 269
>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
rubripes]
Length = 434
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 73/123 (59%), Gaps = 10/123 (8%)
Query: 13 AQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISII 66
A IA L + ++ IN+I ++ F+++ CN TI + EL LG+ ++P ++++
Sbjct: 151 ADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALM 206
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NHSC PN ++ ++G +A VRAVQ + G E+L SYI+ T R++ L + Y FTC C
Sbjct: 207 NHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCS 266
Query: 127 RCI 129
C
Sbjct: 267 ECF 269
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/99 (21%), Positives = 49/99 (49%)
Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
+V ++ + + + + ++DW+ A+ Y + I Y YP + + Y G L L
Sbjct: 336 NVYMLHMMYQAMGVCLYMQDWEGAMRYGEKIIHPYSFHYPAYSLNVASMYLKLGHLYLGL 395
Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
+K++ +A+ I+ + HG + ++ E+ ++EE +
Sbjct: 396 EKRTQGVKALKKALSIMEVAHGKDHYYVAEVKREIEEQK 434
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 6/123 (4%)
Query: 32 EIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
EI + F +++CN +I N P+G GL+ S+ NH C PN V+ F+G+ V ++
Sbjct: 172 EILKLFGQVSCNGFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKD 231
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
V +G E+ ISY+E ST RQK LK+ Y F C C RC + + L+G +C +
Sbjct: 232 VKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTT-----ETTDDWYLDGLQCGN 286
Query: 151 DGC 153
GC
Sbjct: 287 KGC 289
>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 135/310 (43%), Gaps = 62/310 (20%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV- 85
S++ I+ F + CN T+ + L+ +G GL+P + ++NH C PN V++ G + V
Sbjct: 166 SVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVS 225
Query: 86 -----------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
RA+ +P+G E+ +SY++ + RQK LKE++ F CTC C K
Sbjct: 226 SALHSQRRIELRALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK---- 281
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
KDD + +D K S +++K EV SK++
Sbjct: 282 ------------HIKDD----LMTAVADSKP----------SADKVK----EVTAFSKES 311
Query: 195 LALTSCG----NHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
L ++ EV+ Y+ ++K + L NL + R ++ I E+ ++ +
Sbjct: 312 LEKIEKSRVERDYNEVLKLCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSF 366
Query: 250 A----YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
A Y + Y+++Y + LG+ G W G E + +A IL +TH
Sbjct: 367 AKAANYAHRMVEGYRKLYHPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTH 426
Query: 306 GTNSPFMKEL 315
G N K+L
Sbjct: 427 GPNHAITKDL 436
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/331 (22%), Positives = 137/331 (41%), Gaps = 47/331 (14%)
Query: 3 DIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGT 57
DI E L + ++ N + WP S ++ ++ + CN + + L+ +G
Sbjct: 133 DISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGV 190
Query: 58 GLYPVISIINHSCLPNAVL------------VFEGRLAV-VRAVQHVPKGAEVLISYIET 104
GL+P + ++NH C PN + VF + + +RA+ + G EV ++Y++
Sbjct: 191 GLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQKRIELRALGKISAGEEVTVAYVDY 250
Query: 105 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 164
+ RQ+ LK+QY F CTC C + + D A ++G + +
Sbjct: 251 LNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE-------------- 296
Query: 165 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
E++K++ +K N+ EVV + + Q+ +
Sbjct: 297 -------------EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADT 343
Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
+ ++ L ++L L+ + EA Y + + Y ++Y + LG+ G W
Sbjct: 344 HIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQA 403
Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
G E A + +A IL ITHG P K+L
Sbjct: 404 GFIEVAHGMICKAFAILMITHGPTHPITKDL 434
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/342 (24%), Positives = 148/342 (43%), Gaps = 57/342 (16%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + + WP + + I+ F ++CN T+ + L+ +G G++P + NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208
Query: 71 LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
PN +VF G VR++ H + G E+ +SY++ + R++ LK+
Sbjct: 209 WPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKK 268
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C K + KDD +L D S
Sbjct: 269 QYYFDCTCEHCKK----------------QLKDD----LMLAVKDSP---------TPSA 299
Query: 178 EEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
+ +K EV LSK TL + G++ EVV + + Q+ + ++ L++
Sbjct: 300 DTVK----EVIQLSKDTLEKINKARMEGHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILS 355
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
++L L+ ++EA Y + + Y ++Y + LG+ G W G E
Sbjct: 356 IASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGL 415
Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
+ +A IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 416 ICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 455
>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 489
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 132/306 (43%), Gaps = 45/306 (14%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 86
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + G + V
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225
Query: 87 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
+ H P+ G E+ ISY++ + RQK LKEQ+ F C+C C +
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS- 284
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKK 193
DD+ +A DG D F L R S E+I+ EV+ SK
Sbjct: 285 DDLMMAAA--------DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKD 319
Query: 194 TLALTS----CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+L G++ EV+ + + Q+ + ++ ++ ++L L + EA
Sbjct: 320 SLEKIEKSRLDGDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAA 379
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
+ + Y ++Y + LLG+ G W G E + +A IL +THG N
Sbjct: 380 DCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNH 439
Query: 310 PFMKEL 315
++L
Sbjct: 440 SVTRDL 445
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 133/316 (42%), Gaps = 79/316 (25%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD G ++D+ C++C K+E
Sbjct: 271 YFDCTCEHCQK----------------KLKDDLFLG--VKDNPKVVKLCREC---LQKQE 309
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+ ++ NI + L++ S ++L
Sbjct: 310 --PVFADTNIYMLRMLSIVS-------------------------------------EVL 330
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 331 SYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 390
Query: 300 ILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 391 ILLVTHGPSHPITKDL 406
>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 666
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 15/195 (7%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N + + +S+ +G GLYP +++ NHSC P V F G +VRA + + KG V +Y
Sbjct: 449 NKNILDDSKSLFVGGGLYPTLALFNHSCDPGIVRYFIGTQVIVRAAKPIKKGEIVAENYG 508
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKL-GQFDDIQESAILEGYRC-------KDDG 152
I + + RQ+ LK QY FTCTC C++L FD++ I +RC K +
Sbjct: 509 PIYSQMKKLERQEKLKSQYWFTCTCTPCLELWPTFDELDTKTI--RFRCSGYGTDEKREN 566
Query: 153 CSGFLL--RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
C L+ ++DD C QC + + K + +IL K+ + GN Q + Y
Sbjct: 567 CKNILIVPVNTDDFMIKCSQCNQHTNLFKGLKALQDTDILYKQAIKYAQTGNFQNALDIY 626
Query: 211 -KMIEKLQKKLYHPF 224
++++ L L PF
Sbjct: 627 LEILKTLHGILAPPF 641
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 149/338 (44%), Gaps = 48/338 (14%)
Query: 16 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
A++ + + WP + + I+ F ++CN T+ + L+ +G G++P + NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208
Query: 71 LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
PN +VF G VR++ H + G E+ +SY++ + R++ LK+
Sbjct: 209 WPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKK 268
Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
QY F CTC C K + KDD +S T ++ + SK
Sbjct: 269 QYYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSADTVKEV-IQLSK 311
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
+ ++KI +K + G++ EVV + + Q+ + ++ L++ +
Sbjct: 312 DTLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILSIASE 360
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+L L+ ++EA Y + + Y ++Y + LG+ G W G E + +A
Sbjct: 361 VLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGLICKA 420
Query: 298 VEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 421 YAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 134/302 (44%), Gaps = 48/302 (15%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPKSQQFSMQYISNIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ LR DD S++
Sbjct: 271 YFDCTCEHCQK-GLKDDL-------------------FLRVKDDPK---------PSQDV 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ N +K S G + EVV + + Q+ ++ ++ +++ + ++L
Sbjct: 302 VKEMIQFSNDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNLYMLRMLSIVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421
Query: 300 IL 301
IL
Sbjct: 422 IL 423
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 146/307 (47%), Gaps = 34/307 (11%)
Query: 38 SKLACNAHTICNS--ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPK 93
+K A NA TI +S E +G GLY ++ +NHSC PN + VF +L VR A++ + +
Sbjct: 133 AKFATNALTIQDSLFETDGIGAGLYEEVNYMNHSCTPNVICVF-NKLPQVRVIAIRDIEQ 191
Query: 94 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
G E++ SYI+T R++ LK+ Y F C C RCIK +++ + RC+ C
Sbjct: 192 GEEIMNSYIDTKKDLDFRRRFLKQNYFFLCECKRCIK-------EQNEGVSFVRCQ--KC 242
Query: 154 SGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+ + K C C +++E ++I +++ + K+ TS +++ + +
Sbjct: 243 MKGRI---NSKTLNCSDCNTQGQMTEQEYQRIITQMEEIQKQ----TSQSTYEKTIELTE 295
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----- 266
EKL L P+ ++ +R I + ++ +K+A Y + I + + Y +
Sbjct: 296 TAEKLC-HLEAPYFNDIYNSR---ILATLGVQQYKKANKYLKNFIELSKHWYEEIIGNTL 351
Query: 267 -HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS-PFMKELILKLEEAQA 324
P G + KL + L A K A++IL + S P + EL L+L + ++
Sbjct: 352 EMPFFGWKLNELSKLNFHLDKCLEAEKYSERALKILTQYYTKESCPELNELFLRLNDIKS 411
Query: 325 EASYKLS 331
E K +
Sbjct: 412 EIKMKYN 418
>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
Length = 509
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 132/326 (40%), Gaps = 42/326 (12%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
+ I E F K++ N+ I ++EL P G +Y PV NHSCL N V VF + +R
Sbjct: 177 LKNIIEKFIKISKNSLQIIDNELEPAGLAIYKKPV-PFFNHSCLSNCVTVFRNQKLYIRT 235
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ V G E+ ISYI+ A TR +QY FTC+C C K+ + E
Sbjct: 236 LMDVYPGEELTISYIDIAFDRCTRLSICMDQYFFTCSCKLC-KVNIASECHNIFNTEIVC 294
Query: 148 CKDDGCSGFL----------------------------LRDSD-DKGFTCQQCGLVRSKE 178
+ C FL LR S+ D + C C
Sbjct: 295 TNSENCKKFLGYMEIVLISELERKQCYINKSSFKTYPVLRKSNIDNVWRCMLCKAEVGDN 354
Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNLM 229
IK + + K+T+ L + N + +++ L K YH +L
Sbjct: 355 IIKGLVDKEKETFKETVYLETLFNEKYTYDNKSVLQSLNKIKSKIDYLTNYYHHAKYSLQ 414
Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+ R K++ I ++L ++K A + + Y ++ P+ G + GKL FL
Sbjct: 415 KMRAKILYISIQLHEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKNE 474
Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +A + + T+G +S K+L
Sbjct: 475 GLGLIHKAKKNIIKTYGPDSLIYKDL 500
>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 770
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 113/245 (46%), Gaps = 24/245 (9%)
Query: 36 NFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLVFE-G 80
+ +L CNAH I + E + + T +YP S++NHSC PN F G
Sbjct: 455 HIQQLVCNAHAITSLESKTSQEDDVVMTTEQVRIATAIYPSASLMNHSCDPNIFSSFRCG 514
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
VVRA++ + +G EVL Y S RQ+ L+EQY F C+C C +D +
Sbjct: 515 STLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQLLQEQYFFVCSCTAC---SSGEDAE 571
Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
+ L+ +C + C G L + D TC CG + ++ A ++ L + + L
Sbjct: 572 QR--LQALKC--EYCEGPLKQPDDSGKATCLDCGTTQECLGSEQKAFRMHDLYVQGVQLA 627
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
GNH E + K ++K+ + ++ L++ ++ + K L L D++ A + +
Sbjct: 628 EKGNHVEALQRLNKCLKTREKVMYKHNLKLLEVKDMVAKCLCSLGDFRSASGVLKPAVDA 687
Query: 259 YQRVY 263
+ +Y
Sbjct: 688 VRVIY 692
>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
CCMP2712]
Length = 98
Score = 87.8 bits (216), Expect = 7e-15, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 58/96 (60%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
IA SK N H I + + + G GLYP+ S INHSC PNA++ F+G VVRA++++P
Sbjct: 3 IASMISKAQRNMHGIVDLKGQNFGHGLYPLASFINHSCEPNAIISFDGNKLVVRALENIP 62
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+G E+ I+Y+E R+ AL + F C C RC
Sbjct: 63 RGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98
>gi|312076846|ref|XP_003141043.1| MYND finger family protein [Loa loa]
gi|307763792|gb|EFO23026.1| MYND finger family protein [Loa loa]
Length = 439
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 159/334 (47%), Gaps = 30/334 (8%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
DE++ + I +++ L + ++ N EI + F K+ NA I +S L +G +Y +
Sbjct: 124 DEEKRNGFIAIIHVIKDYLPFDKMPSNTEIFDIFCKILINALVITDSCLNRIGLAVYLGL 183
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
S ++HSC P+A ++F G A++R++ + I Y + R +AL++Q+ F
Sbjct: 184 SALDHSCKPDAFIIFSGAKAILRSLNENITEYNDNLHIPYCDLLDLRSARCEALQKQHNF 243
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL-----VRS 176
C C C FD ++ + + +C D C S D T ++C + V +
Sbjct: 244 VCNCDIC---QDFDLDRQKSSVRCTKCTDGFCPY-----SPDDDHTVKRCKVCHEISVFN 295
Query: 177 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
+ ++K+ ++ T + N E++ Y +E +++ P++V L + E ++
Sbjct: 296 SDHVQKLYQQL------TAPRPAEKNLNELIDLYHELE----EVFSPYNVPLCKLAESIM 345
Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
+ E + EA+ Y + T+ Y+ YP+ HP ++ + KL L ++ + +
Sbjct: 346 ISALNNEKYDEAVEYAEKTLLCYRTYYPKGHPSPSVRMFEYAKL-LMLQHNRESLPVLRK 404
Query: 297 AVEILRITHGTNSPF---MKELILKLEEAQAEAS 327
A++++ ++G+ S F L+ LE+ + AS
Sbjct: 405 ALKMICESYGSESSFAFNAATLLSDLEKCVSTAS 438
>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
africana]
Length = 800
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 130/318 (40%), Gaps = 75/318 (23%)
Query: 26 PEISINEIA--ENFSKLACNAHTIC--------------NSELRPLGTGLYPVISIINHS 69
PE++I + + +L CNA I + E+R L TG++PV+S++NHS
Sbjct: 481 PELNIWGVVMLRHMLQLQCNAQAITTIQHTGSKESIITESREIR-LATGIFPVVSLLNHS 539
Query: 70 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPR 127
C PN + F +A +RA Q + KG E+L Y E+ + RQ+ L+ QY F CTCP
Sbjct: 540 CRPNTSVSFTSTVATIRASQQIAKGQEILHCYGPHESRMAVAERQQKLRSQYFFECTCPA 599
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
C K E G R F C C V E++
Sbjct: 600 CHK--------EPRATAGPRW---------------AAFCCHSCRAVMQGEDV------- 629
Query: 188 NILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
SCGN E VS ++ +LQ +L Q K+L +LE
Sbjct: 630 ----------LSCGNRSCTESVSRSHLVSRLQ---------DLQQQVGMAQKLLRSGKLE 670
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+ L CQ + P H ++G + LGD E + + ++++++ +
Sbjct: 671 QAVQQLLGCQHAAESF---LPAEHTVVGEIDDHLAQAYAALGDWEKSATHLQKSLQVVEV 727
Query: 304 THGTNSPFMKELILKLEE 321
HG +S M + KL +
Sbjct: 728 RHGPSSIEMGRELFKLAQ 745
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/321 (23%), Positives = 142/321 (44%), Gaps = 53/321 (16%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S I++I+ F + CN T+ + L+ +G GL+P + ++NH+C
Sbjct: 145 DIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCW 204
Query: 72 PN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
PN V++ G + V R++ + +G E+ ++Y++ + R++ LK Q
Sbjct: 205 PNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQ 264
Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
Y F C C C K G DD++ L G R+ D G+ S++
Sbjct: 265 YFFDCQCDYC-KNGTKDDLK----LAG-------------REVD---------GVKPSEQ 297
Query: 179 EIKKIAS----EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 234
++K+ ++ + K L G++ +VV + + + + + L++
Sbjct: 298 QVKEATDYCFQKLETMDKARLD----GDYHQVVKICRDVIDRTEPVLADTHIYLLRMWST 353
Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
L ++ L+ + +A Y + + Y ++Y + LG+ G W G E +
Sbjct: 354 LSEVQAYLQYFNDAAEYSRKMVEGYMKLYHPNNAALGMAAMRAGVNHWQAGLIEVGHGMV 413
Query: 295 TEAVEILRITHGTNSPFMKEL 315
+A IL +THG P K+L
Sbjct: 414 CKAYAILLVTHGPTHPITKDL 434
>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 478
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 127/301 (42%), Gaps = 46/301 (15%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 86
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + G + V
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225
Query: 87 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
+ H P+ G E+ ISY++ + RQK LKEQ+ F C+C C +
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQ---- 281
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
DD ++ +D + Q+ SK+ ++KI K
Sbjct: 282 ------------HISDD----LMMAAADGQDSPIQEVSAF-SKDSLEKI-------EKSR 317
Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
L G++ EV+ + + Q+ + ++ ++ ++L L + EA +
Sbjct: 318 LD----GDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARK 373
Query: 255 TIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
+ Y ++Y + LLG+ G W G E + +A IL +THG N ++
Sbjct: 374 MVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRD 433
Query: 315 L 315
L
Sbjct: 434 L 434
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 133/297 (44%), Gaps = 49/297 (16%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEERRKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K G DD+ FL D K S+E
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQEA 301
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
+K++ +K S G + EVV + + Q+ ++ ++ +++ + ++L
Sbjct: 302 VKEMTQFSKDTLEKIEKARSEGLYHEVVKLCRECLQKQEPVFADTNLYVLRMLSIVSEVL 361
Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
L+ ++EA Y + + Y ++Y + LG+ G W G+ E A++ TE
Sbjct: 362 SYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIE-AMRVQTE 417
>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
Length = 511
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 135/327 (41%), Gaps = 42/327 (12%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
+ + E F K++ N I ++EL P G G Y PV NHSCL N + +F+ + +R
Sbjct: 177 LKNVIEKFIKVSKNTLQIIDNELEPCGLGFYKKPV-PYFNHSCLSNCITIFKNQKLYIRT 235
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI------------------ 129
+ + G E+ ISY++ A TR +QY FTCTC C
Sbjct: 236 LMDIYPGEELTISYLDIAFDRNTRLAICTDQYFFTCTCKLCKVNIPSECHNMFNNDFICT 295
Query: 130 ---KLGQFDDIQESAILE------GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK--- 177
+F + E ++ Y K + +L+ S DK +C L + +
Sbjct: 296 RSENCKKFVNYMEMVLISELERKLNYHNKLHFKAFPILKKSTDKNENIWKCMLCKHETND 355
Query: 178 -------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNL 228
E+ K+ EV L S N + S K+ K+ YH +L
Sbjct: 356 SVIKGVIEKEKETVKEVEYLETLFAEKYSYDNKIILQSLNKIKSKIDYLTTFYHHSRYSL 415
Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
+ R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 416 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 475
Query: 289 NAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +A + + T+G SP K+L
Sbjct: 476 AGLNLIHKAKKNIIKTYGPESPIYKDL 502
>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ATCC 18188]
Length = 519
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 60/104 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+I ++ +A F+KL NA T+ N +G P S INHSC PNA + F+G + ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A+Q + ++ ISYI+ TRQ L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284
>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 404
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 12/177 (6%)
Query: 8 QLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNAHTICNSELRPLGTGLYPVI 63
+L Y Q+A +V + P + ++ +L CNA + N E G LYP +
Sbjct: 208 RLAQYEQMAEVVAASISLPRGARKVTKDDVVRLLCIQQCNAFGLSNGEGEMTGVALYPAL 267
Query: 64 SIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
S+ NHSC+PN V +G R+ ++ + VP G E+ ISYI+ + RQ L+E Y
Sbjct: 268 SLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYIDLDLTRELRQDKLEESY 327
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLV 174
F CTC RC D +A L C C G L+ + T C C LV
Sbjct: 328 AFRCTCARCRAPDADDSPAVAAHLRRITCGS--CGGSLIPAPPGQELTVACNSCPLV 382
>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ER-3]
Length = 519
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 60/104 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+I ++ +A F+KL NA T+ N +G P S INHSC PNA + F+G + ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A+Q + ++ ISYI+ TRQ L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284
>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
Length = 517
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 128/283 (45%), Gaps = 54/283 (19%)
Query: 42 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
CN I ++++ +GTGLY +++NHSC PN +++F G+ + + + E+ I Y
Sbjct: 243 CNGFGIADNQMLRIGTGLYYPSNLLNHSCDPNCMVLFRGQTQFIVTCRPIEADEEITICY 302
Query: 102 IETAGS-TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
I+ S + RQ+ L+EQY F C C RC+K I EG K+
Sbjct: 303 IDNGISERIIRQQYLQEQYHFNCMCARCLK----------QIGEGTELKE---------- 342
Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 220
K +I + + E IL+K+ L S GN + +S + +L K +
Sbjct: 343 ---------------QKVQI-QFSEEQQILNKQALHEESLGNFDKSLS---QLSQLYKSV 383
Query: 221 YHPFSVNLMQTR------EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-------- 266
S + Q E+L+ + + +D+K A+ + Q + + + +Y +
Sbjct: 384 ETNKSTQIFQNSMIKELLERLVYLSVMNKDYKFAVKFQQKLVDLCEIMYIKEVKREDELP 443
Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
HPL+ L YY GKL + AI ++ +A+ ++ +G S
Sbjct: 444 HPLIALHYYQLGKLLSQRKKYQEAIDNLQKAMTLVSAYYGLKS 486
>gi|389584498|dbj|GAB67230.1| hypothetical protein PCYB_112510 [Plasmodium cynomolgi strain B]
Length = 501
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 141/332 (42%), Gaps = 52/332 (15%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
+ I E F K++ N+ I ++EL P G Y PV NHSCL N V +F+ + +R
Sbjct: 167 LKNIIEKFVKISKNSLQIIDNELEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 225
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ V G E+ ISYI+ A TR +QY FTC+C C K+ + Q E
Sbjct: 226 LVDVYPGEELTISYIDIAFDKNTRLAICMDQYFFTCSCKLC-KVNVVSECQNIFNTEFVC 284
Query: 148 CKDDGC------------------------SGF-----LLRDSDDKG--FTCQQCG---- 172
+ C S F +L+ S+++ + C C
Sbjct: 285 THSESCKKCVNNMELILISELERKNCYMNKSQFKNSYPILKKSNEQQHMWKCALCKNEVH 344
Query: 173 -------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHP 223
+ + KE +K+ A ++ S+K S N + S K+ K+ YH
Sbjct: 345 DNVIKSLMEKEKETVKETAHLESLFSEKY----SYDNKSVLQSLNKIKSKIDYLTTFYHH 400
Query: 224 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
+L + R K++ I ++L+++K A + + Y ++ P+ G + GKL F
Sbjct: 401 TRYSLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALF 460
Query: 284 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
L + + +A + + T+G +SP K+L
Sbjct: 461 LDLKSEGLSLIHKAKKNIIKTYGPDSPIYKDL 492
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 135/309 (43%), Gaps = 47/309 (15%)
Query: 25 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
WP + ++ I+ + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 153 WPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFN 212
Query: 80 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
G VR++ H + G E+ +SY++ R++ LK+QY F CTC
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCE 272
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
C K G DD+ A+ E ++ S +++D + SKE I+KI
Sbjct: 273 HCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV-----------IQFSKEAIEKIDK- 313
Query: 187 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
S G ++EVV + + Q+ + ++ +++ L ++L L+
Sbjct: 314 ----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIYMLRILSILSEVLSYLQHVD 363
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
EA Y + Y ++Y + LG+ G W G E + +A IL +THG
Sbjct: 364 EAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHGMICKAYAILLVTHG 423
Query: 307 TNSPFMKEL 315
P K+L
Sbjct: 424 PTHPITKDL 432
>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 365
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 55/86 (63%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+I+ EI F ACN T+ + EL+PLG G+YP ++INHSC PN V++FEGR VR
Sbjct: 99 DITPQEIMHLFCLFACNNFTVSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQCTVR 158
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQ 112
+++ + K E+ +Y+E T TR+
Sbjct: 159 SLRDITKDEEITFNYVEVGDPTETRR 184
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 56/85 (65%)
Query: 245 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 304
W+EAL YC ++ V++R+YP PL+GLQY GKL ++L TE A++SM A+ IL IT
Sbjct: 269 WREALLYCSASLAVFERIYPPRWPLVGLQYLIHGKLSFYLKYTEAALESMQRALPILTIT 328
Query: 305 HGTNSPFMKELILKLEEAQAEASYK 329
H + P ++ L L EA AE +Y+
Sbjct: 329 HSSGHPLVRTLHTMLAEATAEWNYE 353
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 131/317 (41%), Gaps = 81/317 (25%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ L++QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCGLVRSKE 178
F C+C C K G DD+ F D K C++C E
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FQAVKEDPKVVKLCREC-----LE 306
Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
+ + + ++ N+ + L++ S ++
Sbjct: 307 KQESVFADTNLYVLRLLSIVS-------------------------------------EV 329
Query: 239 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
L L+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A
Sbjct: 330 LSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY 389
Query: 299 EILRITHGTNSPFMKEL 315
IL +THG + P K+L
Sbjct: 390 AILLVTHGPSHPITKDL 406
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLK 301
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCARC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA F+ GR+AVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 546 GLFPLTAIMNHECTPNASHYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 605
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C CPRC D+ + L C++ GC G ++
Sbjct: 606 MTKNFMCDCPRC-----NDNTENGTYLSALFCREQGCKGLVI 642
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA FE G VV A + +PKG E+ ++Y + ST+ R+ L
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281
Query: 117 EQYLFTCTCPRC 128
F C CPRC
Sbjct: 282 MTKHFICQCPRC 293
>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
SS1]
Length = 531
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 26/273 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF------EGRL 82
SI ++ S+ N+ ++ +L +G + P+ ++I+HSC+PNAV+VF G L
Sbjct: 182 SIRDLTNLLSRFQTNSFSLTTPDLTNVGVAISPLAALISHSCMPNAVVVFPTGLGRRGGL 241
Query: 83 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
V+ A++ + G EVL SY++ A R K LK++YLF C C C K + I
Sbjct: 242 EVI-ALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLFDCECVLCEKHHDHEWIDPR-- 298
Query: 143 LEGYRCKDDGCSGFLLRDSD--DKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTL 195
E RC GC G + + TC CG S +E K+A +V ++
Sbjct: 299 -EALRCSKKGCKGKMGTPTSLGTTEITCSICGTKLSVSTASLQEHVKLAEDVLHKAEAIQ 357
Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED--------WKE 247
S +VS + + Y P L ++ LI + +L ++
Sbjct: 358 FSDSDKARYLLVSILRDFDPFALSAY-PLLQILQLLQQLLITAVFDLPSTGTERDGLLQD 416
Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
A + L + + VYP HP G+QY T KL
Sbjct: 417 AASCSALVLSGMEDVYPYGHPSKGIQYVTSAKL 449
>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
Length = 479
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ ++ + F+KL NA T+ N +G L P + INHSC PNA + F+G++ ++
Sbjct: 141 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 200
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A+Q + E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 201 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 244
>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
Length = 518
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ ++ + F+KL NA T+ N +G L P + INHSC PNA + F+G++ ++
Sbjct: 180 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 239
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A+Q + E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 240 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 59/102 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L +V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLIIVK 238
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280
>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
Length = 518
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ ++ + F+KL NA T+ N +G L P + INHSC PNA + F+G++ ++
Sbjct: 180 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 239
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A+Q + E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 240 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283
>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
Length = 464
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 56/107 (52%), Gaps = 9/107 (8%)
Query: 31 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV----- 85
N +A L NAHTI +SEL LG G +P S +NH C PN V +F G V
Sbjct: 168 NFVASLIRMLEMNAHTIYDSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSVSGKPI 227
Query: 86 ----RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
R ++ + G E++ISY++ S M R + LKE Y F C C RC
Sbjct: 228 SIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRC 274
>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
Length = 390
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 17/149 (11%)
Query: 34 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--- 86
A+ +L C NA T+ ++ L PLG + P ++INHSC PNAV+VF R+ +
Sbjct: 54 ADALMELVCQHETNAFTLADAHLNPLGVCIEPTFALINHSCDPNAVIVFPDRVRGMPSKM 113
Query: 87 ---AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
A++ + G EV ISY++ A RQ L E+Y F C C C ++G D
Sbjct: 114 HLVAIRAISAGEEVRISYVDVASCQAERQATLLERYCFACECRLCKRMGWRDPRM----- 168
Query: 144 EGYRCKDDGCSGFLLRDSDDKGFT-CQQC 171
C C+G+++ + +T C QC
Sbjct: 169 -ALWCPQPKCTGWVVFQNAASSWTACTQC 196
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 133/293 (45%), Gaps = 40/293 (13%)
Query: 26 PEISINE-IAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLA 83
P+++ N+ + E +++ N+ ICN EL+ +GTG+Y S INHSC PN V F R
Sbjct: 162 PQVTDNKMLLELCARIKNNSFAICNEELQSDVGTGVYLNCSFINHSCEPNCVAEFNMRTL 221
Query: 84 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
+RAV+++ G EVLISY++ ++ RQ+ L Y
Sbjct: 222 KIRAVKNITAGEEVLISYVDLFATSFERQRELMSIY------------------------ 257
Query: 144 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
++C C+ K Q K K++S ++LS+ C ++
Sbjct: 258 -HFQCTCHSCNA--------KTDDDQMMQDFDGKITESKLSSVKDMLSQMEELRKQC-DY 307
Query: 204 QEVVSTYKMIEKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
Q++ ++E K+ P ++ + + + + +EL +A + + + Y+
Sbjct: 308 QKIKD---LVEGCVKRKILPHENIYMAKVLDFGMDACIELGVLTQAFEFGSMALFSYKLY 364
Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
HP+LG+Q GK+ ++A++ + EA +IL ITH NS + EL
Sbjct: 365 LHANHPMLGIQLMKLGKILLHEEKNQDAMQFLREAFKILTITHSPNSSVLTEL 417
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 58/102 (56%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280
>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
Length = 600
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 380 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 439
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 440 MTKYFVCNCDRCN-----DNSENGTYLSALFCREQGCKGLVI 476
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
+ L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+
Sbjct: 210 VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKM 269
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQ 138
L F C CPRC Q D +
Sbjct: 270 FLGMTKHFMCQCPRCQDPTQTDAVN 294
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 240 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 299
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 300 MTKYFVCNCDRC-----NDNSENGTYLSALFCREQGCKGLVI 336
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 58/102 (56%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280
>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
Length = 527
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 55/93 (59%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F KL+ N+ T+ ++ + G L+P + NHSC PNA F+ +RA + + KG +
Sbjct: 195 FGKLSANSFTLTSAFGKRRGVYLHPAAARFNHSCNPNASYSFDKGKCYIRATKPIAKGEQ 254
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+ I YI+T S TR+ LKE+Y F C CPRC+
Sbjct: 255 IFIPYIDTTYSVGTRRHELKERYKFDCQCPRCL 287
>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
caballus]
Length = 802
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 17/120 (14%)
Query: 26 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 70
PE SI +A + +L CNA I +S L TG++PV+S++NHSC
Sbjct: 482 PESSIWGVAMLRHMLQLQCNAQAITAIQQTGSKENIITDSRQVRLATGIFPVVSLLNHSC 541
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A VRA QH+ KG E+L Y E+ RQ+ L+ QY F CTCP C
Sbjct: 542 SPNTSMSFVSTVATVRASQHIGKGQEILHCYGPHESRMGVAERQQKLRSQYFFDCTCPAC 601
>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
Length = 611
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 391 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARAIPKGGEITTTYTKILWSNLTRGIFLK 450
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 451 MTKYFVCNCDRCN-----DNSENGTYLAALFCREQGCKGLVI 487
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
+ L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+
Sbjct: 222 VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKM 281
Query: 114 ALKEQYLFTCTCPRC 128
L F C CPRC
Sbjct: 282 FLGMTKHFMCRCPRC 296
>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 477
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 133/313 (42%), Gaps = 23/313 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
D+ + + +I N +++ + I ++ F K + N+H+I S +G L +S
Sbjct: 133 DQAKYEHFEKIFNKMSIFRKDEMIDKDKFYIIFCKTSINSHSIHTSAGTEVGMALDLGVS 192
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE---VLISYIETAGSTMTRQKALKEQYLF 121
+HSC PN LVF+G A +R + ++ ISYI+ S R+K LK ++ F
Sbjct: 193 AYDHSCRPNCSLVFDGYQACLRPLTPQTNASDPRTSFISYIDIGRSRYQRRKELKAKWYF 252
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQQCGLVRSKEE 179
C C RC IL RC D C L+ D++ TC +CG V +E
Sbjct: 253 DCACERCCDPA-------DNILTSIRCSDASCDEPLITAEDAEATAITCPKCGQVADEEY 305
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
++ + L K ++ + QE++ + QK +Y + +QT ++ I
Sbjct: 306 VRAAQQMMLRLPAKFSPESNPDSLQELLDEASNVLH-QKNIY----ITRLQT--AIMHIT 358
Query: 240 MELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
L+D L + Q + Y+ +P +G Q K G + A+ EA+
Sbjct: 359 GTLKD---NLPFIQHQVYENYKMCFPYADRHIGYQLLQIVKAYIEKGQRKEAVSYAFEAM 415
Query: 299 EILRITHGTNSPF 311
I I G P+
Sbjct: 416 NIFEICFGLQHPY 428
>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
Length = 302
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 85 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 144
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 145 MTKHFACDCVRCN-----DNTENGTYLSALFCREQGCRGLVI 181
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 771 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 830
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 831 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 867
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+ L
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287
Query: 117 EQYLFTCTCPRC 128
F C C RC
Sbjct: 288 MTKHFICKCVRC 299
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 275 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 334
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 335 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 371
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 56/93 (60%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F CN+ +IC+++L + G+Y S++NHSC PN V+VF+ R +R V+ V +G
Sbjct: 180 FGATICNSFSICDNDLNGIAVGIYLRASMLNHSCYPNCVVVFDERKLQLRTVRDVKEGDV 239
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
ISY++ RQ L+E+Y F+C C +CI
Sbjct: 240 CTISYVDVINPAKERQTELEERYHFSCNCVKCI 272
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
NA TI N L+ + GLY +S +NHSC PN VL+F G +R ++ V GAE+ ISY+
Sbjct: 188 NAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDGAELFISYV 247
Query: 103 ETAGSTMTRQ-KALKEQYLFTCTCPRCIKLGQF-DDIQ 138
+ S ++ + L++QY F C+C RC + ++ DD+
Sbjct: 248 DVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVD 285
>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 401
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 8/148 (5%)
Query: 26 PEISINEIA-ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
P ++ ++A + +++ N T C+ P G GL+P ++INHSC PN + G
Sbjct: 20 PSAALLDLARKELGRVSLNGITTCDG-TGPTGLGLFPSGAMINHSCSPNCQAWWRGSQLE 78
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
+R + V G E+ +SYI + R+ L+ + F C C RC+ Q+D A L
Sbjct: 79 IRCTKPVATGEELCLSYIPIDQPSTVRRAQLRHSWFFACRCRRCVSR-QWD-----AELV 132
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
G RC GC+G + G C CG
Sbjct: 133 GLRCPTKGCAGAVPPPPWRGGHACSHCG 160
>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 518
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 134/329 (40%), Gaps = 46/329 (13%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
+ I E F K++ N+ I +++L P G Y PV NHSCL N V +F+ + +R
Sbjct: 184 LKNIIEKFVKISKNSLQIIDNDLEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ V G E+ ISY++ A TR +QY FTC+C C K+ + Q E
Sbjct: 243 LVDVYPGEELTISYVDIAFDRNTRLAICMDQYFFTCSCKLC-KVNMVSECQHIFNTELLC 301
Query: 148 CKDDGCSG-------FLLRDSDDKG------------------------FTCQQC----- 171
+ + C L+ + + K + C C
Sbjct: 302 TQSESCKKCVNHMELLLISELERKSCYMNKNQFKSAYPVFKKTNEQQHVWKCSICKGEVH 361
Query: 172 -----GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
LV ++E K AS + L + + Q + I+ L YH
Sbjct: 362 DNVIKSLVEKEKETVKEASHLESLFSEKYTYDNKSVLQALNKIKSKIDYLTT-FYHHTKY 420
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
+L + R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 421 SLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVTYGKYSPIYGYYIFLTGKLALFLDL 480
Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +A + + T+G +SP K+L
Sbjct: 481 KSEGLSLIHKAKKNIIKTYGPDSPIYKDL 509
>gi|412986118|emb|CCO17318.1| SET and MYND domain-containing protein 3 [Bathycoccus prasinos]
Length = 734
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 55/93 (59%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
S+ ++A+ KL N HT+C+ ELRP G G+YPV +++NHS PN F G+ +VR +
Sbjct: 280 SVADLAKLIGKLRLNCHTLCDDELRPYGIGVYPVAAMMNHSENPNCFATFRGKKMIVRCL 339
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
+ V G E+ ISY E R K+LK Y F
Sbjct: 340 RDVLPGEELTISYDELMKPKRERAKSLKSNYGF 372
>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
Length = 799
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 120/312 (38%), Gaps = 54/312 (17%)
Query: 49 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 106
N E +G +Y V S+ NHSC PN F R +RA +HV G + +SY G
Sbjct: 478 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 537
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-------- 158
RQK LK++Y F C C C +L D +L +RC + C G +L
Sbjct: 538 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVLDSCVIKYE 592
Query: 159 ---------------------------RDSDDKGFT----------CQQCGLVRSKEEIK 181
R+ + F C CG R E
Sbjct: 593 NKKFERFQGVPQDCISEPHLQLKNDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASH 652
Query: 182 KIASEVNI-LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
E I +S+ A+ S S L K H ++ + + + + +
Sbjct: 653 ATVGEAGIYISRLQEAIVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFC 712
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+ + + A+ +C+ +I + +++Y H ++G + ++ LGDT A+KS++ I
Sbjct: 713 MIGELQPAMHHCKASIEILEKLYGSNHIVIGYELMKLSSIQLSLGDTA-AMKSISRLAAI 771
Query: 301 LRITHGTNSPFM 312
+G ++ M
Sbjct: 772 FSWYYGPHADMM 783
>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
Length = 495
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 115/261 (44%), Gaps = 42/261 (16%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 85
++ I IAE +K+ N T PLG + P +NHSC PNAV+ F EGR+ V
Sbjct: 176 DLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPNAVVGFDEGRI-TV 234
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
+A++ + +V ISYI+ RQK L E+Y FTC C +C+ Q D +E +
Sbjct: 235 KALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKCL---QGDKAREDQFIPA 291
Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
+D + + +E +K A E+ L S N +
Sbjct: 292 SPPSND----------------------IETLKEAEKQAQEL---------LASAKNARP 320
Query: 206 VVSTYKMIEKLQ----KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQ 260
V + ++ ++ K++ Q R +LI L+++E + AL + + + V
Sbjct: 321 VFAVRQLKAAIKILHDTKMWPITRQPYPQIRSELIVALLQIESFWPALQHSAVRYLFVDP 380
Query: 261 RVYPQ-FHPLLGLQYYTCGKL 280
+YPQ +HP+ Y +L
Sbjct: 381 VLYPQEWHPIRNNHAYILSQL 401
>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
Length = 287
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 54/86 (62%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
N+ I ++ +G G P++ + NHSC PNA L F+GR AV+ A++H+ +G E+ ISYI
Sbjct: 188 NSFRITDASGDEIGIGFDPLLGMANHSCAPNASLKFDGRCAVLTALKHIEEGEEITISYI 247
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRC 128
+T RQ L++ Y FTCTC C
Sbjct: 248 DTTLPRAARQAFLQKHYYFTCTCAAC 273
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 5/117 (4%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S +++I+ F + CN T+ + L +G GL+P + ++NH C
Sbjct: 152 DIHNFLDYWPRNSKQHRVDDISHIFGVINCNGFTVSDQRGLNAVGVGLFPNLCLVNHDCW 211
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
PN ++ +RA+ + G EV +SY++ T RQ+ LK Y F CTC C
Sbjct: 212 PNCTVILNHGKIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHC 268
>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 356
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 4/107 (3%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F CN+ +IC+++L + G+Y S++NHSC PN V +GR + V+ V +G E
Sbjct: 171 FGATICNSISICDNDLIDIAVGIYLRASMLNHSCDPNCAWVCDGRKLQIMTVKDVKEGDE 230
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQE 139
ISY++ RQ LKE Y FTC C +CI+ LG D + E
Sbjct: 231 CTISYVDAMKPAKVRQADLKESYHFTCKCVKCIEEINALGPDDGLGE 277
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%)
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
+ NHSC N ++VF+ R +R ++ V G E ISY++ RQ L+E+Y FTC
Sbjct: 1 MANHSCDYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCK 60
Query: 125 CPRCIK 130
C +C++
Sbjct: 61 CVKCVE 66
>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
Length = 576
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 127/284 (44%), Gaps = 29/284 (10%)
Query: 45 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 102
+ + NS L TG++PV+S++NHSC PN + F +A VRA + + KG E+L Y
Sbjct: 296 NNVTNSRQVRLATGIFPVVSLLNHSCSPNTSMSFVSTVATVRASEKIAKGQEILHCYGPH 355
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
+ + RQ+ L+ QY F C+CP C K + E + C G G LL+ D
Sbjct: 356 HSRMAVAERQQKLRAQYFFDCSCPVCQKEKHRATV--GPRWEAFCC---GRCGALLQGGD 410
Query: 163 DKGFTCQQCGLVRSKEEIKK--IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--- 217
CG E + + + S V L ++ G Q+++ ++ + +Q
Sbjct: 411 -----VLSCGGTSCMESVSRDHLISRVQDLQRQV------GGAQKLLGNGQLEQAIQLLL 459
Query: 218 ------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
+ P + + + L + L DW+++ A+ Q ++ V + + +G
Sbjct: 460 GCRHDAENFLSPEHSVVGEMEDALAQAYAALGDWQKSAAHLQKSLQVVEAQHGPSSVEMG 519
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + +L + A+ ++ A ++L + +G S ++EL
Sbjct: 520 HELFKLAQLFFNGFAIPEALNTIERAEKVLLVHYGPCSDEIQEL 563
>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
glaber]
Length = 793
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 123/274 (44%), Gaps = 29/274 (10%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L TG++PV+S++NHSC PN + F G +A +RA Q + KG E+L Y ++ S RQ
Sbjct: 520 LATGIFPVVSLLNHSCSPNTSISFSGTVATIRAAQQIGKGQEILHCYGPHKSWMSIAERQ 579
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
+ L+ QY F C CP C ++ ++ +A E +RC + C + +G
Sbjct: 580 QKLRSQYFFDCNCPAC----HYEKLRTAAGPRWEAFRC--NNCRALM------QGDDVLS 627
Query: 171 CG-------LVRSK--EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
CG + R+ +++ + +V I SK L G + + + +
Sbjct: 628 CGNRACLEPISRTHLVSQLQDLQQQVGIASK----LLRNGKLDQAIPRLLRCRQDAESFL 683
Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
+ + L + L DW+++ + Q ++ V + + +G + + ++
Sbjct: 684 AAEHAVAGEIEDNLARAYAALGDWQKSATHLQKSLQVVEARHGPSSVEMGHELFKLAQVL 743
Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ A+ ++ +A E+L + G S ++EL
Sbjct: 744 FNGFAVPEALNAIQKAEEVLLVHCGPQSDEVQEL 777
>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
familiaris]
Length = 794
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 21 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
L LQ +I I E SK + +CNS L TG++PV+S++NHSC PN + F
Sbjct: 488 LQLQCNAQAITTIQETGSK----ENNVCNSRQIRLATGIFPVVSLLNHSCSPNTSVSFSS 543
Query: 81 RLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIK 130
+A +RA Q + KG E+L Y + RQ+ L+ QY F C CP C K
Sbjct: 544 TVATIRASQQIGKGQEILHCYGPHYSRMGVADRQQKLRSQYFFDCGCPACQK 595
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 121/266 (45%), Gaps = 41/266 (15%)
Query: 40 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 97
+ CN I N L G+G+YPV +++NH+C+ N VL ++ + +RA++ + G E+
Sbjct: 254 MQCNNFAIHNDILFARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEI 313
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGF 156
++ + A T+ R+ LK Y F C C RC D +E A L R D + F
Sbjct: 314 THAFTDAASPTVVRKAHLKSLYAFDCNCSRC---NDSDAAKELDAELVATRPIDTIPAYF 370
Query: 157 LLRDSDDKGFTCQQCG-----LVRSKEEIKKIAS--EVNILSKKTLALTSCGNHQEVVST 209
K F ++ L R+ ++ K++S +V L + + + G + +
Sbjct: 371 -------KRFRLERLAGLPSLLERATRDVPKVSSPADVEALLQASRSWLQEGMDPRLPTA 423
Query: 210 YKMI-----EKLQKKLYHPFSVNLMQTREKLIKILM-----ELEDWKEALAY-------- 251
++ +++ L +++ L ++ K+ + + L + + LA
Sbjct: 424 RAIVLLVTAWAVRQALLGDYNLELFESNVKIFGMALLWREEALSERQRDLALVSGLRNVL 483
Query: 252 ---CQLTIPVYQRVYPQFHPLLGLQY 274
+ I VYQR+YP HPLLGLQ+
Sbjct: 484 VDSARHVIRVYQRIYPANHPLLGLQW 509
>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 465
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 116/262 (44%), Gaps = 21/262 (8%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--A 87
+ ++ E ++ CN I + +G G YP+ +++NHSC PN V +F A + A
Sbjct: 172 VRDLQEMLARFQCNNFCILDENFFEVGAGCYPLGAMVNHSCDPNCVTIFARGSAQLELWA 231
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
++ + K EV ISY++ A R+K L+++Y F C C RC + +A ++ +
Sbjct: 232 MKSIGKDEEVTISYVDPANCMNKRRKYLQKRYHFDCRCQRC-----YCKASHAATIDTFL 286
Query: 148 CKDDG---CSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
D+ + L ++ +G Q G + +K+ + IL+
Sbjct: 287 VADENGIPSEKWSLNEATKRGTVLSQITGKIEEFRVCEKLTRKCAILADIDSDFV----- 341
Query: 204 QEVVSTYKMIE----KLQKKLYHPFSVNLMQTREKLIKILMELEDWKE-ALAYCQLTIPV 258
+ V + M +LQ +L+ ++ + +++ + ELE ++ AL Y
Sbjct: 342 RNVFTHLHMYNIDAFRLQARLFSDLTIMIAESQSGNSREFAELEFVQQKALVYGYRISKF 401
Query: 259 YQRVYPQFHPLLGLQYYTCGKL 280
Y VYP HPL L G+L
Sbjct: 402 YAFVYPALHPLSNLHNIRMGEL 423
>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
Length = 801
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 19/269 (7%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
L TG++PVIS++NHSC PN + F G +A VRA QH+ +G E+L Y RQ
Sbjct: 525 LATGVFPVISLLNHSCSPNTSVSFVGTVATVRASQHIRRGQEILHCYGPHVSRLGIAARQ 584
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQ 170
+ LK QY F C+C C + A R + CS LL+ D +
Sbjct: 585 QKLKSQYFFNCSCLAC-------QTEAHATAAAPRWEAFCCSSCRALLQGGDVLSCSSGS 637
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSV 226
C S++ +A+ + L ++T E+ + + Q+ L+ SV
Sbjct: 638 CTEAVSRDH---LAARLRDLQRQTGLARELLRSGELEKAIQQLSGCQRDAESFLWAEHSV 694
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
L + + L + L DWK++ A+ Q ++ V + + +G + + ++ + +
Sbjct: 695 -LGEIEDDLAQAYAALGDWKKSAAHLQKSLTVVEARHGPSSVEMGHELFKLAQILFNGFE 753
Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
A+ ++ +A +L + G S ++EL
Sbjct: 754 VPEALSTIQKAETVLAVHCGPCSDEIQEL 782
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
EI + ++ N++ I + ++ +G G+Y S+I+HSC PNAV VFEG ++RA++ +
Sbjct: 150 EILGLYGRICVNSYNILDPDMNSIGVGIYLGPSVIDHSCKPNAVAVFEGTTILIRALEDI 209
Query: 92 PK--GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
P+ +++ ISYI+ +T TR L+ Y F C C RC
Sbjct: 210 PRLDWSQIHISYIDVLNTTSTRCTELQNTYYFLCECERC 248
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
Y + HPL G+ Y GK++ L ++A+ +T+A +ILRITHG KE
Sbjct: 325 YGEIHPLTGILYLMLGKIQLHLDKPKSALDMLTKADKILRITHGEKHSLFKE 376
>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
CBS 8904]
Length = 573
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 100/233 (42%), Gaps = 19/233 (8%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 100
NA T+ + +L+P+G + P++++ NHSC PNA +VF G+ V A + G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
Y++ + TRQ L+ +Y F C CP C D + + C C G
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396
Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 219
K TC V S ++ + +E ++++ + V +EK+
Sbjct: 397 ETVKCDTCGASWTV-SPNALRDLVNEGLSLVEADDQDQLDPDDLPRVAEVIARLEKIM-- 453
Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
P S + + K L EALAY + +VYP+ HP + L
Sbjct: 454 ---PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVAL 503
>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
Length = 463
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 54/99 (54%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F K+ N+ ++ S +GT + P+IS INHSC PN LVF+ +R++Q + E
Sbjct: 181 FCKIMINSFSLMTSSYDTIGTAIDPIISRINHSCYPNTALVFDRNTVALRSLQKILPNQE 240
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
+ +SYI+ + R L +Y F+C C RCI F+
Sbjct: 241 ITVSYIDIYNTQKNRHDELLSRYYFSCKCTRCIVSDGFE 279
>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
Length = 273
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 113/253 (44%), Gaps = 37/253 (14%)
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NH C PN ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F CTC
Sbjct: 1 NHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60
Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
C K + KDD FL G+ + + +++ E
Sbjct: 61 HCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVVKE 87
Query: 187 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ SK TL S G + EVV + + Q+ ++ ++ +++ + ++L L
Sbjct: 88 MIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYL 147
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A IL
Sbjct: 148 QAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILL 207
Query: 303 ITHGTNSPFMKEL 315
+THG + P K+L
Sbjct: 208 VTHGPSHPITKDL 220
>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Nomascus leucogenys]
Length = 703
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 132/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 382 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 441
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 442 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 501
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 502 -------QTEAHRMAAGPRW---------------EAFCCNGCGAPMQGDDVLRCGSR-- 537
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 538 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAI 576
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 577 QRLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQRSLRVVEVRHG 633
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 634 PSSVEMGHELFKLAQ 648
>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
Length = 623
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 54/99 (54%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
+ E+A ++ CN+HTI + EL+PL G++P+ ++ NH C PN + F G V RA
Sbjct: 155 VGPRELALLIARFGCNSHTISDDELQPLAVGIFPLGAMANHDCRPNTLHAFRGGRMVFRA 214
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
V+ + G EV SY E A R+ L + +LF P
Sbjct: 215 VRAIQPGEEVTTSYTELAAPRWERRAVLLQHHLFDIDAP 253
>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
porcellus]
Length = 800
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 15/105 (14%)
Query: 39 KLACNA-------HT------ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
+L CNA HT I +S L TG++PV+S++NHSC PN + F G +A +
Sbjct: 491 QLQCNAQAITAIQHTGSKESIITDSRQVRLATGIFPVVSLLNHSCSPNTSVSFTGTIATI 550
Query: 86 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
RA Q + KG E+L Y E+ S RQ+ L+ QY F C CP C
Sbjct: 551 RAAQQIRKGQEILHCYGPHESHMSVAERQQKLRSQYFFDCCCPAC 595
>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
CBS 2479]
Length = 582
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 103/241 (42%), Gaps = 26/241 (10%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 100
NA T+ + +L+P+G + P++++ NHSC PNA +VF G+ V A + G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 156
Y++ + TRQ L+ +Y F C CP C D + + C C G
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396
Query: 157 --LLRDSDDKGFTCQQCG--LVRSKEEIKKIAS-EVNILSKKTLALTSCGNHQEVVSTYK 211
L + SD + C CG S ++ + + ++++ + V
Sbjct: 397 ADLRQMSDSETVKCDTCGASWTVSPNALRDLVNGGLSLVEADDQDQLDPDDLPRVAEVIA 456
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
+E++ P S + + K L EALAY + +VYP+ HP +
Sbjct: 457 RLEEIM-----PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVA 511
Query: 272 L 272
L
Sbjct: 512 L 512
>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
Length = 409
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 72
Q +VNL+ + PE + E+ +L+ N + N P+G L P+++ NHSC P
Sbjct: 58 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 116
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
NA + F+G+ A +RA+ + KG ++ ISYI+ R++AL + + F C C RC+
Sbjct: 117 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 173
>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
Length = 481
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 119/263 (45%), Gaps = 38/263 (14%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+S+++I E F KL NA T+ +G + P+ ++ NHSC PNA F+ VR
Sbjct: 162 ELSLHKIIEYFCKLDINAFTLTTPFYDHVGAAIEPLAALCNHSCSPNAATDFDKGKIWVR 221
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
A++ + KG +V +SYIET RQ L ++Y F C C +C +I+++A +
Sbjct: 222 ALRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKC-------EIEKNAPDTHF 274
Query: 147 RCKDDGCSGFLLRDSDDKG---FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
++++ + F + G S + IK + S ++ L + TL
Sbjct: 275 LRAITAVDSKTIQNAQQEAMELFEMVKPG-APSTDSIKNLRSAMSALRRTTLW------- 326
Query: 204 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRV 262
L ++ Y ++ R +LI LM ++ A +C + + V V
Sbjct: 327 -----------PLTRQPY-------VRLRGELIASLMGARQFQSAFVHCVIRHLRVNPVV 368
Query: 263 YP-QFHPLLGLQYYTCGKLEWFL 284
YP ++HP+ + + KL +L
Sbjct: 369 YPNRWHPISSMHKWVFVKLMRYL 391
>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 838
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVL 76
N V+ + P+ ++A + ACN HTI + EL P+G G+YP +++NH+C P
Sbjct: 235 NFVDPLAGGPDP--RDVARLLAAFACNVHTIHDDELNPVGMGVYPTAAMMNHACAPTVAQ 292
Query: 77 VF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
F EG A R V+ V KG E+ ISY+E + R+ L Y F
Sbjct: 293 TFGEGTTATFRCVRDVRKGEEIAISYVELLATRAERRAELLNHYRF 338
>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
Length = 612
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 72
Q +VNL+ + PE + E+ +L+ N + N P+G L P+++ NHSC P
Sbjct: 221 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 279
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
NA + F+G+ A +RA+ + KG ++ ISYI+ R++AL + + F C C RC+
Sbjct: 280 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 336
>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
grubii H99]
Length = 447
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVR 86
++E+ S + NA T+ + L P+G + P++++ NHSC PNA+ VF GR +
Sbjct: 110 GVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIFLV 169
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI---- 142
A+ +P G E+L SYI+T+ RQ L +Y F C C C K + ++S +
Sbjct: 170 ALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVDPRW 229
Query: 143 -LEGYRCKDDGCSGFLLRDSDDK-GFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 197
+ C +G + L ++ DK C C KE+ K A +V L ++ ++
Sbjct: 230 CVRHRECSKEGKARMPLENATDKVEVRCIAC-----KEDFKVDADKVRKLVQEGMSF 281
>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 656
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 80/174 (45%), Gaps = 16/174 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S ++ + S+ N+ T+ L P+G + P+I++INHSC PNAV+VF
Sbjct: 239 SARDLVDLTSRFTSNSFTLATPALAPIGVSVSPLIALINHSCDPNAVVVFPRSSTASTPA 298
Query: 79 --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
E ++ VV A++ + E+L SYI+T RQ ALKE Y FTC C C +
Sbjct: 299 KLEPQMQVV-AIKDIEPDTEILTSYIDTTLPQPIRQSALKETYCFTCACTLCARPQDATT 357
Query: 137 IQ-ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
I SA+ RC G + + C CG S + I +N+
Sbjct: 358 IDPRSAVWCAQRC--GGVASVYSAGDELVVGRCNTCGFSASMTTAESILDAINV 409
>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 556
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 64/124 (51%), Gaps = 6/124 (4%)
Query: 37 FSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
KL+CN +I S L +G GLY + NHSC PNA F G+ +R + + KG
Sbjct: 190 LGKLSCNVFSIAESALNGEVGCGLYLEAAAANHSCNPNAAQSFSGKTLSLRCTRPIRKGE 249
Query: 96 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS 154
E+ I + R+++L++ Y F C C RC G+ +D++ LE + C D CS
Sbjct: 250 EITIGITQIQKPGPARRESLRKTYFFECRCERCESPEGRAEDMR----LEAFACPDSECS 305
Query: 155 GFLL 158
GF L
Sbjct: 306 GFCL 309
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 68/132 (51%), Gaps = 9/132 (6%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E ++ E AE KL NA +++L +G L P +++ NHSCLPNA + F GR AV+R
Sbjct: 167 EGAVREAAEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLR 226
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI---QESAIL 143
A Q + G E+ ISY + S RQ AL Y F C C RC + D+ Q SA
Sbjct: 227 AEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFECRCRRCTQ-----DLCVYQASAHY 280
Query: 144 EGYRCKDDGCSG 155
+ DD G
Sbjct: 281 RDFALNDDSVLG 292
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 8/119 (6%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KG 94
+ ++ N+ TI + E+ +GTG+Y S+I+HSC PNAV VF+G+ +RA++ +
Sbjct: 163 YGRMVINSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDW 222
Query: 95 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
++ ISYI+ + RQ L++ Y F C C RC+ D + +C DGC
Sbjct: 223 KKIRISYIDLMKTPYERQMELRQSYYFLCQCDRCL------DENRIKYVHAAKCLKDGC 275
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
+ + IAE +++ N+ T S PLG + P +NHSC PNAV+ F+ L V+A
Sbjct: 180 LDLETIAEYLARIEVNSFTFTTSFGDPLGLCIQPFACYMNHSCDPNAVVGFDEGLITVKA 239
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
++ + +V ISYI+ RQK L E+Y FTC C +C++
Sbjct: 240 LRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA F+ GRLA+VRA + +PKG E+ +Y + +TR LK
Sbjct: 567 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 626
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 627 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 663
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA F +G +V A + + GAE+ +SY + ST+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 117 EQYLFTCTCPRC 128
F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308
>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
AFUA_2G10080) [Aspergillus nidulans FGSC A4]
Length = 497
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I ++L N+ + N + +G L+P ++INHSC NAV+ F+G V+A++ +
Sbjct: 187 KIVAYHARLDLNSFNLTNDD--DIGIYLHPYAALINHSCDYNAVVGFDGSEIFVKAIRPI 244
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
G ++ ISYI+T T RQK L+E+Y FTC C +C+
Sbjct: 245 ATGEQIFISYIDTTYPTRIRQKELQERYFFTCNCAKCL 282
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 65/118 (55%), Gaps = 6/118 (5%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + + I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
N ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F CTC C K
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 268
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA F+ GRLA+VRA + +PKG E+ +Y + +TR LK
Sbjct: 624 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 683
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 684 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 720
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA F +G +V A + + GAE+ +SY + ST+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 117 EQYLFTCTCPRC 128
F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308
>gi|255072485|ref|XP_002499917.1| predicted protein [Micromonas sp. RCC299]
gi|226515179|gb|ACO61175.1| predicted protein [Micromonas sp. RCC299]
Length = 676
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 55/89 (61%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+A +K++CNAHT+C+ EL +G G+YP +++NH P A F+G+ V+RA + +
Sbjct: 228 VARLLAKISCNAHTVCDEELNAIGVGVYPAAAMVNHGDAPTAAQSFKGKKIVLRATRDLK 287
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLF 121
+G EV ++Y+E + R+ AL Y F
Sbjct: 288 RGDEVTMAYVELLATRQERRAALHAGYNF 316
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 226 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
++L++ R + + + + D++ A +P Y+ YP HP LGL K+E
Sbjct: 549 IHLLRVRVLDAMQRAAVTAGDFESARDAAYAVLPAYRLSYPPCHPPLGLHLALIAKIEAH 608
Query: 284 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 328
L D A++ EA+ L ++HG NS ++ + L E++AE Y
Sbjct: 609 LVDLNAAVQFAREAISCLNVSHGRNSDVVRMMETLLGESEAELRY 653
>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
Length = 802
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R K F C C + + +
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632
Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 673 RAVQLLSRCQRDAESF--LWAE-HAMVGAIADGLARACAALGDWQKSATHLQKSLRVVEV 729
Query: 304 THGTNSPFMKELILKLEE 321
HG +S M + KL +
Sbjct: 730 RHGPSSIEMGHELFKLAQ 747
>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
CIRAD86]
Length = 616
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 49/82 (59%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LG G P+++ NHSC PNA +VF V+RA++ + KG E+ + YI+ + RQ
Sbjct: 284 LGIGFDPILNSSNHSCDPNAAVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAE 343
Query: 115 LKEQYLFTCTCPRCIKLGQFDD 136
LKE Y F+C C +C K FD+
Sbjct: 344 LKESYFFSCRCSKCKKGAIFDE 365
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 70/122 (57%), Gaps = 8/122 (6%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKL----ACNAHTICN-SELRPLGTGLYPVISIINH 68
Q N+ L P +++ + E ++L NA T+ + S LRP+G GLY +++NH
Sbjct: 313 QARNVRWLAAHGPGLALPDADETVARLFAVAQANALTLTDPSSLRPIGQGLYASAALLNH 372
Query: 69 SCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
SCLPNA + EGRL VRAV+ + G EV ++Y++ RQ+AL++ + F C C
Sbjct: 373 SCLPNANWSVDGEGRL-CVRAVRPIEAGEEVTVAYVDPTLPYHARQQALQDHFFFACRCL 431
Query: 127 RC 128
+C
Sbjct: 432 QC 433
>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 635
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 84
SI+E+ + SK N++T+ L P+G + P+ ++NHSC PN +VF L+
Sbjct: 261 SISELVDLISKFTLNSYTLTTPSLSPIGVSVSPLAGLLNHSCDPNVSVVFPRILSADKEP 320
Query: 85 ---VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ A+Q +P +E+L SY++ + RQK LKE Y F C+C C
Sbjct: 321 ALHIVAIQDIPADSELLTSYVDVTLPVLQRQKDLKETYSFNCSCHSC 367
>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
Length = 776
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 59/282 (20%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 104
I NS+ L TG++PV+S++NHSC PN + F +A VRA + KG E+L Y E+
Sbjct: 492 ITNSKQVRLATGIFPVVSLLNHSCSPNTSVSFISTVATVRASVQIGKGQEILHCYGPHES 551
Query: 105 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 164
RQ+ L+ QY F C CP C E +R S +
Sbjct: 552 RMGAAERQQKLRSQYFFDCNCPACQN-------------EKHR---------TTTGSKWE 589
Query: 165 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYH 222
F C +C + +++ SCGN E VS +++ +LQ
Sbjct: 590 AFCCNRCRTLMQGDDV-----------------LSCGNTSCTEAVSRDRLVSQLQD---- 628
Query: 223 PFSVNLMQTREKLIKILM---ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
+Q + K+ + L+ +LE + L C+ + P H ++G +
Sbjct: 629 ------LQQQIKMAQKLLKNGKLEQAIQLLLGCRYDAESF---LPAEHSVVGEIEDDLAQ 679
Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
LGD + + + +++ ++ + HG +S M + KL +
Sbjct: 680 AYAALGDWQKSAAHLQKSLHVVEVRHGPSSVEMGHELFKLAQ 721
>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
Length = 170
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
++ +G GLYP IS++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60
Query: 112 QKALKEQYLFTCTCPRC 128
+K L++QY F C C RC
Sbjct: 61 RKQLRDQYCFECDCFRC 77
>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Ailuropoda melanoleuca]
gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
Length = 793
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 128/307 (41%), Gaps = 61/307 (19%)
Query: 21 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
L LQ +I I + SK + I +S L L TG++PV+S++NHSC PN + F
Sbjct: 487 LQLQCNAQAITTIQQTGSK----ENNITDSRLVRLATGIFPVVSLLNHSCSPNTSMSFVS 542
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
+A +RA + + KG E+L Y + RQ+ L+ QY F C CP C K +
Sbjct: 543 TVATIRASEKIGKGQEILHCYGPHHSRMGVADRQQKLRSQYFFDCGCPACQK-----EKH 597
Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
+A+ + + F C +CG V ++ S T
Sbjct: 598 RAAVGPRW-----------------EAFCCSRCGAVLQGGDVLSCGS------------T 628
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
SC E VS ++ +LQ ++ R + L+ ++A+ QL + V
Sbjct: 629 SC---TESVSREHLVSRLQD----------LRQRVGSARNLLSNGQLEQAI---QLLL-V 671
Query: 259 YQRVYPQF----HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
Y+ F H ++G + LGD + + + +++ ++ + HG S M
Sbjct: 672 YRHDAENFLSAEHSMVGEMEDALAQAYAALGDWQKSATHLQKSLRVVEVQHGPASVEMGH 731
Query: 315 LILKLEE 321
+ KL +
Sbjct: 732 ELFKLAQ 738
>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 645
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 6/104 (5%)
Query: 31 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGR-LAV 84
+++ + S+ N TI + L PLG + P +++INHSC PNA +VF EG L
Sbjct: 264 SDLVDLISRFTTNTFTITSPTLAPLGACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQ 323
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
V A++++ E+L +YI+T T RQ+ALKE Y F C CP C
Sbjct: 324 VVALKYIGPDEEILTAYIDTTLPTGLRQQALKETYHFVCECPLC 367
>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
Length = 806
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R K F C C + + +
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636
Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733
Query: 304 THGTNSPFMKELILKLEE 321
HG +S M + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 113/276 (40%), Gaps = 34/276 (12%)
Query: 43 NAHTICNSE--LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
N IC+ E + +G+ LY S+INHSC PNA VF G V++A++ + G E+ I+
Sbjct: 183 NQFAICDEESSICDIGSALYLNHSLINHSCKPNAFPVFNGTNLVIKALEKIAPGEEIKIA 242
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
Y +T R+ L + Y F C C C + D R K G ++R
Sbjct: 243 YTDTKAVIQDRRDYLNDIYRFVCQCQGCTNDDEVD-----------RKKHLDKKGNVIRR 291
Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 220
SD + + +VR EE KK E N++ +E + L +K
Sbjct: 292 SDAIWQSAET--MVRDMEEFKK-NKEWNLM-------------KEAAQGW-----LARKF 330
Query: 221 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
++ ++ E +E ++W L I Y+ +Y HP LG+ K+
Sbjct: 331 LPDENIFWIRLNEFAFDAGIETQEWSLCLETGASLIVNYKEIYGPMHPTLGIHLMKFAKI 390
Query: 281 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
+ A + A I+ + + S K+L+
Sbjct: 391 LLHIEKPNEAEEYFRRAFAIMSLFYEPESAVRKQLL 426
>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
lacrymans S7.3]
gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
lacrymans S7.9]
Length = 530
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 61/114 (53%), Gaps = 9/114 (7%)
Query: 31 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGR 81
++ + SK N T+ +S L LG + P++++INHSC PNAV+V+ E
Sbjct: 180 GDLVDIISKFITNTFTLTSSSLSALGVSVSPLVALINHSCDPNAVIVYPRCSNEPSTEEP 239
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
L V A++ + E+L +YI+T +RQK LKE Y F C CP C K D
Sbjct: 240 LMQVVAIRDIEVDEEILTAYIDTTLPRFSRQKFLKETYNFDCQCPSCTKYSGVD 293
>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Meleagris gallopavo]
Length = 706
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 112/264 (42%), Gaps = 9/264 (3%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 113
L T +PV+S++NHSC PN + F G +A VRA Q +P G E+ Y + ++
Sbjct: 431 LATAFFPVLSLLNHSCSPNISVSFSGTVATVRASQPIPSGQEIFHCYGPHRCRMRVAERQ 490
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC--QQC 171
L QY F C C C L + + ES + C C ++ C + C
Sbjct: 491 QLLSQYFFECRCQAC--LDELESDVESVSMRNSFC----CPSCQASMQGEEMLCCSNEAC 544
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
L S+E + + ++ +K L L N E + + + P + + +
Sbjct: 545 ALSVSRERLSRRLLDLQQQMEKALELLRDSNADEAIRMLLKCQMDARNFLSPEHLLMGEL 604
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
+ L +I L W+EA + + +I + + + +G + + ++ + A+
Sbjct: 605 EDHLAQIYATLGKWQEAARHLERSIQLVEMHHGPSSVEMGHELFKLAQILFNGFAVSEAL 664
Query: 292 KSMTEAVEILRITHGTNSPFMKEL 315
++ A EIL + G S ++EL
Sbjct: 665 STIQRAEEILSVHCGPQSTQIQEL 688
>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
Length = 806
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R K F C C + + +
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636
Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733
Query: 304 THGTNSPFMKELILKLEE 321
HG +S M + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751
>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
Length = 750
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 138/339 (40%), Gaps = 64/339 (18%)
Query: 17 NLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 75
LV IL E +++E I + N + +S P+G +YP +++NHSC PN +
Sbjct: 437 GLVLRILGCKEKTLDEAIRHTLTAFQQNNFGVTDSLYAPIGEAVYPHAALLNHSCSPNCI 496
Query: 76 LVFE-------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
L ++ +L +V A + + G E++ SY++ T TR+ L++ + F C C RC
Sbjct: 497 LRYKIGLESSPPQLEIV-ACKDISSGEELVHSYVDLVLPTETRRNRLRDTHGFICECKRC 555
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASE 186
GQ C+ L RD T ++ G + S ++I E
Sbjct: 556 A--GQ-------------------CTVELPRDRQSWMLWPTKERLGEIPSSTPTQQIDIE 594
Query: 187 VNILSKKTLALTSCGNHQEVVSTYKM-----------------IEKLQKKL--------Y 221
I + A H ++V+T ++ I+ LQ+ + +
Sbjct: 595 EAIGGRSGEA-----EHLQIVNTSRLLQDQATRAMVNGNNDLEIQLLQQAVGVFLLSGGF 649
Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP--QFHPLLGLQYYTCGK 279
PF L R L+ E + C+ + + HPLLGLQ YT G
Sbjct: 650 SPFHNELYTARCAYFSALLANGKIDEGIEQCEHIVSSLSVCLGNVKHHPLLGLQLYTLGD 709
Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 318
L G + A + A EI+ ++HG P ++ LI K
Sbjct: 710 LCSGGGCGQKAANAYRWAYEIMSVSHGAQDPMVQALIRK 748
>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 677
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 18/201 (8%)
Query: 33 IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
I N + NAH I N++ + +G GLYP IS+ NHSC P + F G
Sbjct: 446 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 505
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
VVRA++ + G E+ +Y I T R++ L+ QY F C C C + +
Sbjct: 506 TTMVVRAIRSISSGEEISENYGQIFTTTPESERKRKLRLQYFFDCNCEACREHWPLLEEI 565
Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
+ IL ++C+ G +L DS++ C +CG + + K E + + K
Sbjct: 566 DPTILR-FKCETGKECGNVLPVRTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 624
Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
GNH+E + Y I KL
Sbjct: 625 RYLEQGNHREALKNYLKILKL 645
>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
mulatta]
Length = 802
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R K F C C + + +
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632
Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 673 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 729
Query: 304 THGTNSPFMKELILKLEE 321
HG +S M + KL +
Sbjct: 730 RHGPSSVEMGHELFKLAQ 747
>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
distachyon]
Length = 791
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 70/312 (22%), Positives = 135/312 (43%), Gaps = 52/312 (16%)
Query: 46 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 105
+C+ E + +Y S+ NHSC PN F R +R+ +++ G+ + +SY A
Sbjct: 466 VMCSVEQIRVAQAIYMSGSLFNHSCRPNVHTYFHSRTLFLRSTEYIESGSPLELSYGPQA 525
Query: 106 GST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD 151
G + RQK+L+E Y F+C C C +L D + +S A+ E YR +++
Sbjct: 526 GEMDLLERQKSLQENYKFSCRCSSCSELNLSDLVIDSFCCPRSFCLGAVSESTCYRSEEN 585
Query: 152 ---------------------------GCSGFLLRDSD-----DKGF--TCQ-QCGLVRS 176
R+ D D G+ +C+ Q L
Sbjct: 586 FVHVSLNGSAVCKLSLPDVSKVDEDMEKVGKLFFRNDDADLKIDPGYCMSCRSQLDLSSV 645
Query: 177 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
+ + AS++N K L + + + + + +E++ KKL HP+S L Q +K+
Sbjct: 646 VATLDRAASKINRF--KELIVIDDVSEVLITNALQSLEQI-KKLRHPYSKALAQAEDKIA 702
Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
+ ++ D ++A +C+ +I + +++Y H + + ++ +GD A ++T
Sbjct: 703 EAFAKVGDQEQARKHCEASIKILEKLYHPKHIAIAHELIKLISIKLCMGDKAGAAATVTR 762
Query: 297 AVEILRITHGTN 308
A I + +G +
Sbjct: 763 AEAIFLLYYGPD 774
>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 678
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 91/201 (45%), Gaps = 18/201 (8%)
Query: 33 IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
I N + NAH I N++ + +G GLYP IS+ NHSC P + F G
Sbjct: 447 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 506
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
VVRA++ + G E+ +Y I T R++ L+ QY F C C C + +
Sbjct: 507 TTMVVRAIRSISSGEEISENYGQIFTTTPEKERKRKLRLQYFFDCNCEACREHWPLLEEI 566
Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
+ IL ++C+ G +L DS++ C +CG + + K E + + K
Sbjct: 567 DPTILR-FKCETGKECGNVLPVKTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 625
Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
GNH+E + +Y I KL
Sbjct: 626 RNLEQGNHREALKSYLKILKL 646
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 7/143 (4%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--GTGLYPVISIINHSCLPNA 74
+L+ L+ +W + E+ + ++ NAH + + L G G++P+ ++INH+C PN
Sbjct: 252 DLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACRPNC 311
Query: 75 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
V+ G VR ++ V GAE+ + YI+ ST R++ L F C C RC
Sbjct: 312 TFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENPSSM 371
Query: 135 DDIQESAILEGYRCKDDGCSGFL 157
DD L+G C D G G L
Sbjct: 372 DD-----YLDGVCCTDCGERGCL 389
>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 496
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 15/141 (10%)
Query: 19 VNLILQWPEISINEIAENFSKLACNAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVL 76
+N +LQ I+ + + + CNA TI C S + LGTGLYP ++ INHSC PN +
Sbjct: 206 INQVLQS---EIDVLYDALCSINCNAFTIIDCVSGVL-LGTGLYPGLTFINHSCDPNLQV 261
Query: 77 VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC--IKLGQF 134
G++ ++AV+ + KG E ISYI+ + R + L E + F C C +C ++ +
Sbjct: 262 TQIGKILTLKAVRPIKKGEEFTISYIDRTEGAIQRNEELMETFFFECQCHKCASVRSEKR 321
Query: 135 DDIQESAILEGYRCKDDGCSG 155
+D + CKD C G
Sbjct: 322 NDFY-------FVCKDPSCKG 335
>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
Length = 801
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 104
I NS L TG++PV+S++NHSC PN + F +A +RA Q + KG E+L Y E+
Sbjct: 517 ITNSRQVRLATGVFPVVSLLNHSCSPNTSVSFTSTVATIRAAQQIAKGQEILHCYGPHES 576
Query: 105 AGSTMTRQKALKEQYLFTCTCPRC 128
RQ+ L QY F C+CP C
Sbjct: 577 RMGVAERQQRLSSQYFFDCSCPAC 600
>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
Length = 804
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
Length = 800
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 57/283 (20%)
Query: 45 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 102
+++ NS L TG++PVIS++NHSC PN + F +A VRA Q + KG E+L Y
Sbjct: 506 NSVTNSRQVRLATGIFPVISLLNHSCSPNTSVSFISTVATVRASQQIGKGQEILHCYGPH 565
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
++ RQ+ L+ QY F C+C C ++ + G R
Sbjct: 566 QSRMRAAERQQKLRSQYFFDCSCAACQN-------EKHSAATGPRW-------------- 604
Query: 163 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
+ F C+ CG + +++ S TSC E VS +I +LQ
Sbjct: 605 -EAFCCRSCGALMQGDDVLICGS------------TSC---TESVSRDHLISRLQD---- 644
Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----HPLLGLQYYTCG 278
+Q + + + L+ + + A+ QL + QR F H L+G
Sbjct: 645 ------LQQQVGMARKLLRNDKPERAI---QLLLGC-QRDAQSFLSGEHSLVGEMEDDLA 694
Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
+ LGD + + + ++++++ + HG +S M + KL +
Sbjct: 695 QAYAALGDWQKSATHLQKSLQVVEVRHGPSSVEMGHELFKLAQ 737
>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
Length = 804
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4-like [Ornithorhynchus anatinus]
Length = 815
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 41 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
A H I +S L TGL+PV+S++NHSC PN + F GR+AVV+A Q + +G E+L
Sbjct: 508 ASEDHVIASSRQVRLATGLFPVVSLLNHSCSPNTSVSFSGRVAVVQATQPIGRGQEILHC 567
Query: 101 Y--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
Y R++ LK QY F C C C++
Sbjct: 568 YGPHRCRMDVAERRQKLKAQYFFDCRCQACLE 599
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 6/102 (5%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+P+ +I+NH C PNA F+ G LAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 225 GLFPLTAIMNHECTPNASHYFDNGTLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 284
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
F C C RC D+ + L C++ GC G ++
Sbjct: 285 MTKNFNCDCVRC-----NDNTENGTYLSALFCREQGCRGLVI 321
>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 115/294 (39%), Gaps = 58/294 (19%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+ CN+ I +++L + +Y S+ NHSC N ++VF+ R +R V+ V +G E
Sbjct: 241 YGATNCNSFGIFDNDLIVISDAIYLRASMANHSCDYNCIVVFDERKLQLRTVKDVQEGEE 300
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDG 152
I Y++ R+ L+E+Y FTC C +CI+ LG DDG
Sbjct: 301 CTIGYVDVIHPAKERRAELEEKYHFTCKCVKCIEEINALG----------------PDDG 344
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
+ E+ L K + N + ++
Sbjct: 345 ------------------------------LGEELRDLKKSLEQIVDAENSHDWAKVIQL 374
Query: 213 IEKLQKKLYHPFSVN-----LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
E K + S+ L+ R+ ++ + WK+A QL I Y Y +++
Sbjct: 375 CEPYLKPMDSSSSLPANHQLLVMLRDTAFFACIQSQSWKKAAEMGQLNIESYIYHYGRYN 434
Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM---KELILK 318
P +G+ G++ L A K EA + ++THG M K+L+ K
Sbjct: 435 PNVGMYLLKIGEVLLNLDRLREARKCFKEAESVFKVTHGLQHSLMASVKKLLFK 488
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%)
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
+ NHSC N VF+G +R ++ V +G E ISY++ RQ L+E+Y FTC
Sbjct: 1 MANHSCDYNCAGVFDGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCK 60
Query: 125 CPRCIK 130
C +C++
Sbjct: 61 CVKCVE 66
>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
harrisii]
Length = 802
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 117/276 (42%), Gaps = 32/276 (11%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
L TGL+P IS++NHSC PN L F G + V A Q + +G E+L Y G TRQ
Sbjct: 525 LATGLFPAISLMNHSCRPNTSLSFRGSVGSVHASQLIARGQEILHCYGPHEGRMDAATRQ 584
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILE---GYRCK----------DDGCSGFLLR 159
+ L+ QY F C C C Q +++ + + G+RC GCS R
Sbjct: 585 QKLRSQYFFDCRCQAC----QEEEVHGAGVTPKQGGFRCPTCRAALQGLDPLGCSSGSCR 640
Query: 160 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 219
+ Q G ++ + E+ + ++ L +C E T +
Sbjct: 641 AQVSRAHLLGQLGDLQQRVEMAGELLRGDRTARAVQLLLACRREAEDFLTSEH------- 693
Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
V + + ++L + L DW ++ A+ Q ++ V + + +G + + +
Sbjct: 694 ------VLMGEIEDRLAQAYASLGDWPKSAAHLQNSLQVVKAQHGPASVEMGHELFKLAQ 747
Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ + A+ + +A ++L + +G + ++EL
Sbjct: 748 VLFNGFAVTEAVSVIEKAEKVLSLHYGPENEQVQEL 783
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 70/127 (55%), Gaps = 6/127 (4%)
Query: 17 NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
++ N + WP S I++I+ F + CN T+ + L+ +G GL+P + ++NH+C
Sbjct: 131 DIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCW 190
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
PN ++ +R++ + +G E+ ++Y++ + R++ LK QY F C C C K
Sbjct: 191 PNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYC-KN 249
Query: 132 GQFDDIQ 138
G DD++
Sbjct: 250 GTKDDLK 256
>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 823
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 114/269 (42%), Gaps = 35/269 (13%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+I E+A++ + H +R + +Y S+ NHSC PN VL +EGR+ +RA
Sbjct: 513 AIYEVADD------DEHGGGTKPIRRVAEAMYATGSLFNHSCRPNTVLNYEGRVLTIRAC 566
Query: 89 QHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
+ + KG EVL Y + S RQKAL+ QY FTC C C S++ +G
Sbjct: 567 EDIAKGKEVLNCYGPHVAHIPSRDERQKALRHQYFFTCRCDAC----------SSSVEQG 616
Query: 146 YR------CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+ C +GC+ L + + Q+ + + EI K+A E +++ L
Sbjct: 617 GQDFDYLVCGSEGCTRELQINPEQ---LRQEVDIRVVQREI-KLAREFYERARRALDGGG 672
Query: 200 CGNH-----QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
+ + +L+ K HP L +T + L + L ++ A + +
Sbjct: 673 GDGSGDGDVERAAQSLAHALRLRAKYLHPQHRELGRTHDALAEAHARLGRYEAAATHLER 732
Query: 255 TIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
+ V + + F L QY + WF
Sbjct: 733 AVAVLESRFAPFSFELADQYAKLAQA-WF 760
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E +I E K+ NA + +L +G L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNIRRAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A + + G E+ ISY + TR++AL EQY F CTC RC
Sbjct: 216 AERPIHAGDEIEISYTDYTMPLSTRREAL-EQYSFECTCARC 256
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 91/211 (43%), Gaps = 35/211 (16%)
Query: 15 IANLVNLI--------LQWPEIS-------INEIAENFSKLACNAHTICNSELRPLGTGL 59
IANL NL+ L PE+ I+ I E K+ CN I + + +
Sbjct: 200 IANLFNLVITSSDNHDLPTPELKKEAVEKVISVIRELIHKVRCNQFGIWTKNDKCIAVAI 259
Query: 60 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
P S NHSC+PN + + +G +A+ V KG + ISY++ +R++ LK Y
Sbjct: 260 SPSSSFFNHSCIPNCINIRDGNKMTFKALYPVKKGEPLAISYLDLDLPVESRKEYLKYGY 319
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F C CPRC + D+ ++ I + Y C+ C GL SK +
Sbjct: 320 YFDCGCPRCDEKTNQDECMDNWISKFY-CQRKKCV-----------------GLYYSKTK 361
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
+ +VN K TL + C N + STY
Sbjct: 362 VN--IDQVNKNDKITLNCSDCNNEFIINSTY 390
>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
Length = 799
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
Length = 804
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
Length = 799
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Metaseiulus occidentalis]
Length = 775
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 105/249 (42%), Gaps = 29/249 (11%)
Query: 36 NFSKLACNAHTICN-----------------SELRPLGTGLYPVISIINHSCLPNAVLVF 78
+ +L CNAH I E + T +YP S++NHSC PN + F
Sbjct: 455 HIQQLICNAHAITTIQQPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHSCNPNIISGF 514
Query: 79 E-GRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIKLGQF 134
G VV++V+ + G EV Y MT RQ AL+EQY F C C C K G
Sbjct: 515 RSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQTALQEQYFFRCDCTACQK-GDL 572
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
DD I RC + C G L C QC + E ++ ++ L +
Sbjct: 573 DD----QISMALRC--EYCEGPLSAVQSSGKADCLQCSTTQDCLEKEQKVFRMHDLFVQG 626
Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
L L +H E + + Q+KL + + LM TR+ + K L L +++A+ +
Sbjct: 627 LQLAEMDSHGEALERLQKCLASQEKLLYRHNKQLMITRDMVAKSLCALGRFRDAVGVLKS 686
Query: 255 TIPVYQRVY 263
+ + +Y
Sbjct: 687 AVESVRHMY 695
>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
troglodytes]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
gorilla gorilla]
Length = 804
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P++++ +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTVWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 CPNTSVSFISTVATIRASQRIQKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|405969486|gb|EKC34455.1| SET and MYND domain-containing protein 4 [Crassostrea gigas]
Length = 324
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 78/156 (50%), Gaps = 19/156 (12%)
Query: 31 NEIAENFSKLACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEG 80
+ I L CNAH I +P +G+G Y ++S+INHSC P+ V G
Sbjct: 94 SHILRQIQMLPCNAHEISEILWKPGDPTVTNSIEIGSGAYALLSLINHSCDPSVVRHNYG 153
Query: 81 RLAVVRAVQHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC-IKLGQFDDI 137
+ VVRA++ + KG E+L +Y T+ R+ L+ QY F C C C ++L + DI
Sbjct: 154 NICVVRAIKPIKKGEEILDNYGALYPLTIREERRAKLRPQYFFDCNCDACQLELPLYFDI 213
Query: 138 QESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQC 171
+ + ++CKD CSG F+ +D D C C
Sbjct: 214 PDDVPV--FKCKD--CSGPIFISQDKDLAEAECSSC 245
>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
++ I IAE +++ N+ T + PLG + P +NHSC PNAV+ F+ L V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A+ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
garnettii]
Length = 749
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 17/120 (14%)
Query: 26 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 70
P++ I +A + +L CNA I NS L TG++PV+S++NHSC
Sbjct: 483 PDLDIWGVAMLRHMLQLQCNAQAITTIQQTGSEENIITNSRQVRLATGVFPVVSLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y E+ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSISFCSTVATIRASQQIGKGQEILHCYGPHESRMGVAKRQQKLRSQYFFDCNCPAC 602
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
++ I IAE +++ N+ T + PLG + P +NHSC PNAV+ F+ L V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A+ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 123/320 (38%), Gaps = 104/320 (32%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQY 270
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
F CTC C K + KDD FL G+ + +
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKA 297
Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
+++ E+ SK TL S G + EVV ++KLYHP +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKL------CREKLYHPNNAQ-------- 343
Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
LG+ G W G+ E +
Sbjct: 344 ----------------------------------LGMAVMRAGLTNWHAGNIEVGHGMIC 369
Query: 296 EAVEILRITHGTNSPFMKEL 315
+A IL +THG + P K+L
Sbjct: 370 KAYAILLVTHGPSHPITKDL 389
>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
Length = 451
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 121/285 (42%), Gaps = 26/285 (9%)
Query: 43 NAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 98
N H I + L +G GLY + +HSC PNA+ EG +A ++A+ ++
Sbjct: 168 NRHAIGDQLYLNEIGKGLYLDLCAYDHSCRPNAIYTCEGFVARLKALDDSVNLSDRKRTH 227
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
YI S R++ LKE + F C C RC +L +C + CS +
Sbjct: 228 TCYITPLDSRAERRRLLKESWYFDCFCERCTD-------PNDHLLTALKC--NNCSTAVP 278
Query: 159 RDS--------DDKGFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 209
+ DD FTC +C S+ ++++ ++ + K+ S + + +
Sbjct: 279 IHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIEKQLAKTGSETLSKSELDS 338
Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
++ K + F L+Q + +LI + D K+ L +P + YP+ HP
Sbjct: 339 MLLVCKSVLADENVFFAKLLQRKLQLI----DCNDSKQLLETSLQALPCLRVCYPEMHPT 394
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
L Y T G L D + A++ + +A+ R T G+N K+
Sbjct: 395 LAYHYMTIGIYYKNLADRKTAMQYLNDALRRFRFTLGSNHLLTKQ 439
>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
Length = 828
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 540 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 599
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 600 PHESRMGVAERQQRLSSQYFFDCRCGAC 627
>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
Length = 575
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 59/124 (47%), Gaps = 11/124 (8%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGRLAVVRAVQHV 91
S+L N TI SE PLG G+YP S+INHSC PNAV F + + + V
Sbjct: 257 MSRLIMNGFTISTSEQLPLGVGVYPAASMINHSCRPNAVPTFWFSTPSPPMLQITMCKSV 316
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
G E+ ISY + + R++ L + Y F C C C D + + G +C+
Sbjct: 317 RVGDEIAISYCDVSAPRYVRREGLIKNYKFACDCSHC------GDFETDGDMIGLKCQAS 370
Query: 152 GCSG 155
GC G
Sbjct: 371 GCIG 374
>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
Length = 706
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 116/265 (43%), Gaps = 10/265 (3%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 113
L T +PV+S++NHSC PN + F G A VRA Q +P G E+ Y + ++
Sbjct: 430 LATAFFPVLSLLNHSCCPNTSVSFSGTTATVRASQLIPSGQEIFHCYGPHRCRMRVAERQ 489
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
L QY F C C C + D ++ + C GC G ++ D + + C +
Sbjct: 490 QLLSQYFFECRCQACCDELESDVKSVVSLRNTFCCP--GCRG-PMQGEDMLCCSNEACAI 546
Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ---KKLYHPFSVNLMQ 230
S+E + ++ K+ L L ++ KM+ K Q + P + + +
Sbjct: 547 SVSRESLSCRLQDLQQRIKRALELL---RDRKADQAIKMLLKCQVDAECFLSPEHLLMGE 603
Query: 231 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
+ L ++ L W+EA + + +I + + + +G + + ++ + A
Sbjct: 604 LEDHLAQVYATLGKWQEAARHLERSIEIVEMHHGPSSIEIGHELFKLAQILFNGFAVSEA 663
Query: 291 IKSMTEAVEILRITHGTNSPFMKEL 315
+ ++ A EIL + G SP ++EL
Sbjct: 664 LSTIQRAEEILSVHCGPQSPQIQEL 688
>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
boliviensis boliviensis]
Length = 800
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 32/312 (10%)
Query: 26 PEISINEIA--ENFSKLACNA-------HTIC------NSELRPLGTGLYPVISIINHSC 70
P++++ +A + +L CNA HT C S L TG++PV+S++NHSC
Sbjct: 480 PDVTVWGVAMLRHMLQLQCNAQAITTIQHTGCKEGIVTGSRQVRLATGIFPVVSLLNHSC 539
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E++ Y ++ RQ+ L+ QY F CTCP C
Sbjct: 540 SPNTSVSFISTVATIRASQRIGKGQEIVHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 599
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--IASE 186
G + E + C C + +G CG E +++ +AS
Sbjct: 600 HSEGH--GMAARPRWEAFCCSS--CRAPM------QGDDVLSCGRRGCAESVRRDHLASR 649
Query: 187 VNILSKK---TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
+ L ++ L S G + + ++ + L + + L + L
Sbjct: 650 LQDLRQQVGMAQKLLSDGELEPAIQQLSGCQRDAESFLSAEHAVLGEIEDGLAQACAALG 709
Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
DW+++ + Q ++ V + + +G + + ++ + A+ ++ +A E+L +
Sbjct: 710 DWQKSATHLQNSLRVVEARHGPSSVEMGHELFKLAQIFFNGFAVPEALSTIQKAEEVLSL 769
Query: 304 THGTNSPFMKEL 315
G ++EL
Sbjct: 770 HCGPWDDEIQEL 781
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 59/107 (55%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
+N F KL+ NA ++ +GT + ++ INHSC PN V + EG A + A+Q
Sbjct: 155 VNAFLSLFCKLSINAMSLVTPAFDAIGTCMDSTLARINHSCQPNCVFMVEGATARLVALQ 214
Query: 90 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
+ KG V ISY++T R L+++Y FTCTC +C++ + D
Sbjct: 215 TLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSD 261
>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
Length = 648
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 106/236 (44%), Gaps = 30/236 (12%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINE----------IAENFSKLACNAHTICN-SELR 53
D + L++ +A ++N IL+ + E + N + +AH + R
Sbjct: 375 DPEDYLVWTLMAAMLNTILRHGNYTNTEQPDDGFIGYLLLHNLQIVNYSAHDVAELQRKR 434
Query: 54 P--------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IE 103
P +G LYP++++ NHSC P V F G VR ++++ G + +Y +
Sbjct: 435 PNEAGKSVAIGAALYPLLALFNHSCDPGIVRYFSGTTVHVRTIKNIAAGQIIAENYGPLY 494
Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQESAILEGYRCKDDGCSGFLLRD-- 160
T R+++L Y F C C C + Q + + ES I +RC C G LL D
Sbjct: 495 TKMERTERRQSLAANYKFQCNCDACRQDWQLYTHMDESVIR--FRCTGPACQGALLFDLT 552
Query: 161 SDDKGFTCQQCG-LVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEVVSTYKMI 213
S+ G C CG LV +E I ++ E N+LS+ A L + G + +S Y I
Sbjct: 553 SECYGMQCDDCGELVDIRERI-RLLQEANMLSRFNEAAHLYAIGLFEHALSKYAAI 607
>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
Length = 352
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/74 (43%), Positives = 49/74 (66%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R+K
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQ 63
Query: 115 LKEQYLFTCTCPRC 128
L++QY F C C RC
Sbjct: 64 LRDQYCFECDCSRC 77
>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 396
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI---ETAGSTMT 110
G+GLY + S NHSCLPNA + F + V A + +P G E+ ISY+ + + S +
Sbjct: 296 GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSGQEICISYLSECDLSRSRHS 355
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
RQ LKE YLFTC CP+C+ D+
Sbjct: 356 RQNILKENYLFTCDCPKCLSEADEPDV 382
>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 311
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 64/119 (53%)
Query: 10 LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
+ Y I + N + EI+ +E+ + F+K++ N+ I N + G+ L S I+HS
Sbjct: 132 MFYRYIGIIKNKLGGALEINESEVLDIFTKMSINSVFILNGKFLNFGSCLSLEFSAIDHS 191
Query: 70 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
C PNAV +F GR VV+A+ + +V ++Y + R+ LK+QY F CTC C
Sbjct: 192 CRPNAVYMFIGRTLVVKALCDIANFDDVRVTYTDITQPRRMRRNFLKDQYFFDCTCEEC 250
>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
Length = 716
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 428 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 487
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 488 PHESRMGVAERQQRLSSQYFFDCRCGAC 515
>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
Length = 777
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/299 (23%), Positives = 129/299 (43%), Gaps = 15/299 (5%)
Query: 21 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
L LQ +I I + SK + + +S L TGL+PV+S++NHSC PN + F
Sbjct: 471 LQLQCNAQAITTIQQTGSK----ENMVTDSRQVRLATGLFPVVSLLNHSCSPNTSVSFVS 526
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
+A VRA Q + +G E+L Y E+ RQ+ L+ QY F C C C Q +
Sbjct: 527 TVATVRASQQIGEGQEILHCYGPHESRMGVAERQQKLRSQYFFDCDCLAC----QNGKCR 582
Query: 139 ESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 196
+A E +RC C G L++ D C S++ + ++ K
Sbjct: 583 AAAGPRWEAFRCH---CCGELMQGDDVLSCGSASCTESASRDRLVSRLQDLRQRVGKAQT 639
Query: 197 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 256
L G + + ++ + + + + L + L DW ++ A+ Q ++
Sbjct: 640 LLRNGKVERAIQLLLGCQRDAESFLSAEHSLVGEIEDALAQAYAALGDWHKSAAHLQRSL 699
Query: 257 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
V + + +G + + ++ + A+ ++ +A ++L + +G + ++EL
Sbjct: 700 RVVEVRHGPASVEMGHELFKLAQIFFNGFAIPEALNTIQKAEKVLLVHYGRGNEEIQEL 758
>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
Length = 423
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 6/105 (5%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
PE+ + E + K+ N++ ICN E +GTGLY SI++HSC PNA V+EG +
Sbjct: 151 PEVGL----EIYGKMIINSYCICNDEHTAIGTGLYMGPSILDHSCSPNAHAVYEGHKLHL 206
Query: 86 RAVQHV--PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
RA + + + +SYI+ RQ+ L+ QY F C C +C
Sbjct: 207 RAAEDINCSNFDGIRVSYIDVMAPKKVRQEELRSQYYFECNCMKC 251
>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
Length = 666
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465
>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
Length = 743
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 455 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 514
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 515 PHESRMGVAERQQRLSSQYFFDCRCGAC 542
>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
Length = 666
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465
>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
Length = 799
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N I NS L TG++PV+S++NHSC PN + F G +A +RA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGVFPVVSLLNHSCRPNTSVSFTGTVATIRAAQRIAKGQEILHCYG 570
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
Length = 510
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/314 (23%), Positives = 131/314 (41%), Gaps = 37/314 (11%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V+ + + + ++ +E F + C I E+R + GLYP ++++NHSC+
Sbjct: 170 NIVDYLRK--RLKLDRFSEEFIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVS 227
Query: 73 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
N ++ + R+ + R VP G E+ SY + TM R++ L E F C CPRC
Sbjct: 228 NTSHSISPIDYRIRL-RTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRC- 285
Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEV 187
D + + +C L DS D T C C S + ++KI +
Sbjct: 286 ----SDPTELGTHMSSLKCNKCDNGVILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKII 341
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
A++ ++ +I+K + L HP L R L ++ ++++
Sbjct: 342 QAEVDTVEAISGADGADAILERENVIKKYRSVL-HPHHAFLTILRHSLTQMYGRVDEYLL 400
Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA 290
+ + C+L + V V P + + G+ Y K +W G + A
Sbjct: 401 DDLPDVVLEHKVDMCRLLLQVLDIVEPGYSRIRGMTLYELHAPLLFLAKGQWNAGVIDEA 460
Query: 291 -IKS-MTEAVEILR 302
+KS M EA IL+
Sbjct: 461 GLKSKMIEAATILK 474
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382
Query: 216 LQK 218
LQ+
Sbjct: 383 LQQ 385
>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
Length = 804
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 130/315 (41%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA HT + +S L TG++PV+S++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVVSLLNHSC 542
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A ++A Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSMSFISTVATIQASQRIRKGQEILHCYGPHKSRMGVAERQQELRSQYFFDCACPAC 602
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
E +R + + + F C CG +++ S
Sbjct: 603 QT-------------EAHR---------MAAEPRWEAFCCNSCGAPMQGDDVLHCGSR-- 638
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELERAV 677
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L CQ + ++ + H L+G + LGD + A + ++ ++ + HG
Sbjct: 678 QQLLGCQRDAESF--LWAE-HALVGEIADGLARACAALGDWQKAATHLQRSLRVVEVRHG 734
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 7/116 (6%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
+ +NL +L +W +A F +L N+ + ++ R G +P + NHSC PN
Sbjct: 197 EYSNLTDLDNKW-------VASLFGRLNANSFCLTSAFGRRRGVYFHPGPARFNHSCDPN 249
Query: 74 AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
A F +RA++ + K ++ ISY++T S TR+ L+E+Y F C CP+C+
Sbjct: 250 ASYSFAKGKCYIRAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCL 305
>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 570
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 51/91 (56%)
Query: 40 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
L+ NA I + P G P +++ NHSC PNA++V R +RA++ + K ++ I
Sbjct: 194 LSTNAFRITLPDNTPFGMCFSPTLALANHSCKPNAIIVSNSRSISLRALRPIKKNEQIFI 253
Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
SYI+ +RQ LKE+Y FTC C C K
Sbjct: 254 SYIDPTEDLPSRQSKLKERYFFTCKCDSCEK 284
>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
Length = 338
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 57/94 (60%), Gaps = 1/94 (1%)
Query: 36 NFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
+F+KL N C++ ++ G +Y +S NHSC PNA +V+ G AV+R+++ +P+G
Sbjct: 109 SFNKLMPNTFGWCDTFDMSSFGAVVYAELSRANHSCQPNAAVVYNGAAAVLRSMRDIPEG 168
Query: 95 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
EV ISY++ + R++ L + Y F C C RC
Sbjct: 169 EEVCISYVDPTLARDVRRRELVQSYGFACDCARC 202
>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
Length = 626
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 16/120 (13%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVL----------VFEGRLAVVRAVQHVPKGAEVLI 99
++ R +GTGL+P++S+ +HSC PN + G + +V+ V+ V KG E+ I
Sbjct: 287 ADFRDIGTGLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCI 346
Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS-GFLL 158
SYI+ T R++ L F C C RC+ + +E+ ++ Y C + C+ G+L+
Sbjct: 347 SYIDIMNPTCIRRRELWYSKYFVCRCSRCMS-----ETEENRMVRAYSCGESECTEGYLV 401
>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
Length = 800
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 66/127 (51%), Gaps = 14/127 (11%)
Query: 35 ENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL-----VFEGRLAVVRAV 88
E +++ N + I +S L G G+YP+ S + NHSC+PNAV FE V A+
Sbjct: 531 EAYTRFG-NNNFIIHSHLNTFGHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVAL 589
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------QESA 141
+ + +G E+ I Y++ A TRQ+AL+ Y FTCTCP C G D E A
Sbjct: 590 RDIAEGEEITIPYLDPALPLHTRQEALRLNYGFTCTCPLCSWAGHLDPAPPPARGSAELA 649
Query: 142 ILEGYRC 148
ILE C
Sbjct: 650 ILESQLC 656
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382
Query: 216 LQK 218
LQ+
Sbjct: 383 LQQ 385
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 80/153 (52%), Gaps = 20/153 (13%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
S ++ + SK N+ T+ ++ L +G P+ ++INHSC PNAV+VF
Sbjct: 215 SGKDLVDLMSKFTTNSFTLTSTFLNAIGVATAPIPALINHSCQPNAVVVFPATRKGAPPT 274
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
L V+ A+Q + +G EVL +Y++ RQK+LKE Y F C+C C KL D++E
Sbjct: 275 LDVI-AIQPIRRGEEVLAAYVDITLPREIRQKSLKETYAFECSCTLC-KLPGTLDLRECV 332
Query: 142 ILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCG 172
+C + GFL S KG + C +CG
Sbjct: 333 FCP--KCNE----GFL---SSHKGQSAPCSKCG 356
>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 508
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 1/94 (1%)
Query: 40 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
L NA T+ N +G + P S NHSC PNA + F+G + ++A+Q + ++ I
Sbjct: 187 LETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQIFI 246
Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
SYI+ RQ LK++Y FTC CP+C++ GQ
Sbjct: 247 SYIDNTEPWEKRQSELKKRYFFTCKCPKCVQ-GQ 279
>gi|428180198|gb|EKX49066.1| hypothetical protein GUITHDRAFT_136248 [Guillardia theta CCMP2712]
Length = 719
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 15/112 (13%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
EI +L TIC+ E RP+GTGLYP+ S+ NH C PN ++ F G +RA + +
Sbjct: 230 EILMTICRLYNYGFTICDEEERPIGTGLYPLASLCNHDCAPNCLVTFCGPSLQLRATEEI 289
Query: 92 PKG---------------AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G E+ ISY+++A R+K L ++YL C+C C
Sbjct: 290 EMGQVQMARVDWRLRSGLQEITISYVDSAEPHWLRRKTLHKRYLINCSCSLC 341
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 13/138 (9%)
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
S E++ + + + + + H +++L++ RE + +E + WK AL+ +
Sbjct: 572 SADRANELLQMFILAVQKAESILHDNNLSLLRLREAMQSQAVEAKSWKFALSTSYKVLRR 631
Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFL------------GDT-ENAIKSMTEAVEILRITH 305
++YP+ P +GLQ GKL FL GD + A+K + EA IL TH
Sbjct: 632 LYKIYPRCSPYVGLQEAFLGKLCMFLSGEPDKTLCHGFGDPLQEAVKHLREAKLILSGTH 691
Query: 306 GTNSPFMKELILKLEEAQ 323
G+ ++++ L+EA
Sbjct: 692 GSEHVLVQDINRLLQEAH 709
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 19/131 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V++ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 146 VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 206 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 265
Query: 120 LFTCTCPRCIK 130
F CTC C K
Sbjct: 266 YFDCTCEHCQK 276
>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
Length = 732
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/307 (24%), Positives = 137/307 (44%), Gaps = 43/307 (14%)
Query: 39 KLACNAHTIC--------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
++ NAH I N + T +YP S++NH+C P + F+G +VRA+++
Sbjct: 426 QMVSNAHAITELCLIDENNERQERIATAIYPSASLMNHNCDPTVINSFQGNTLIVRAIRN 485
Query: 91 VPKGAEVLISYIETAGSTMTRQK------ALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
V +G EV Y G R + AL+ QY FTCTC C+ +D Q+ ++
Sbjct: 486 VRQGDEVFNCY----GPHYRRMRRSERVEALEAQYSFTCTCDSCLD-KNTEDFQD--VIY 538
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-------EIKKIASEVNILSKKTLAL 197
+ C C G L+ + + Q L RS + ++K V++ ++ T A+
Sbjct: 539 SFSCP--SCQGSLINPTGNNSSAQNQMALCRSCKTPQSYFTQLKADLEAVSLDARGTEAM 596
Query: 198 TSCGNHQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
+ +++ K++ K L+ K +L ++ +KL + + ++E ++
Sbjct: 597 ----DQGDIMGAIKLLTKCVQLRSKALFKGHPDLGKSADKLAQCYAFIGKYEECEKMLRI 652
Query: 255 TIPVYQRVYPQFHPLLG--LQYYTCGKLEWFLGDTENA----IKSMTEAVEILRITHGTN 308
++ + Y ++ + LQ +T +E G + A + + EA+ I RI +G
Sbjct: 653 SLTAVEHRYGRYSIEMANELQKFTDVLMELASGRNQRARDELVNYLEEAMLIYRIHYGPW 712
Query: 309 SPFMKEL 315
S KEL
Sbjct: 713 STSYKEL 719
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E S+ + E KL NA +++L +G L P +++ NHSCLPNA + F GR AV+R
Sbjct: 166 EESVRQAVEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLR 225
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A + KG E+ ISY + S++ ++KA Y F C C RC +
Sbjct: 226 AESRIQKGDEIEISYTDYT-SSLGKRKAALAPYNFECRCRRCTQ 268
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PN+ F+ + VVRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC CPRC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305
>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 499
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 70/154 (45%), Gaps = 17/154 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S ++ + S+ N T+ ++ L P+G + P ++ NHSC PNAV+VF
Sbjct: 148 SAGDLVDLISRFTTNTFTLTSASLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASK 207
Query: 79 EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
E L +V A++ + G E+ ISY++T RQK L E Y FTC C C K D
Sbjct: 208 EPSLNLV-ALRDIAPGKEIRISYVDTTLPKDLRQKELTEVYNFTCQCKLCTKPPAVDP-- 264
Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
E C L +D F C +CG
Sbjct: 265 ----RESLWCPKKCGGTCPLPTEEDNLFRCAKCG 294
>gi|308499663|ref|XP_003112017.1| CRE-SET-18 protein [Caenorhabditis remanei]
gi|308268498|gb|EFP12451.1| CRE-SET-18 protein [Caenorhabditis remanei]
Length = 507
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/314 (21%), Positives = 132/314 (42%), Gaps = 24/314 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
DE+++ + ++ + L I + + F K + N+H+I + +G L +S
Sbjct: 150 DEEKMKHFEKVFTKMGLFRGDEMIEKGQFFDVFCKASINSHSIHTNAGNEVGMALDLGVS 209
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
+ NHSC P +VF+G +R + VP + ISYI+ S R+K LK ++
Sbjct: 210 MYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAESTEQAFISYIDVGRSKYVRRKDLKSRW 267
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
F C C RC+ D ++ A L RC + C +L ++ C +C + +
Sbjct: 268 YFDCECTRCM------DPEDDA-LTAIRCANPACDAPVLTSETEEPMNIACDKCKTIIEE 320
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
++KK + L Q+++S +++ HP +V + + R L
Sbjct: 321 NDVKKAQEFMKSLPASFDPQCPADILQDLLSKAEVV-------LHPSNVYVARLRTALFH 373
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ +L + + Q+ Y+ +P+ +G Q K + + A+ +A
Sbjct: 374 VTGQLTMDNLSTMHTQI-YNNYKLCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 432
Query: 298 VEILRITHGTNSPF 311
+ I + G + P+
Sbjct: 433 MNIFEVCFGLDHPY 446
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305
>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
Length = 781
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 458 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 517
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 518 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 569
Query: 173 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 203
L V+ +E++ EV ++LSK + +L CG +H
Sbjct: 570 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 629
Query: 204 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 240
EV + + +++ + H ++ ++ +K+ +
Sbjct: 630 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 689
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+ +A +C+ +I + +R+Y H ++G + ++ GD+ A + + +I
Sbjct: 690 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 749
Query: 301 LRITHGTNSPFMKELILKLEEAQAEA 326
+G+++ + + L++ A+A
Sbjct: 750 FSKYYGSHAETLFSYLPCLKQETAKA 775
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 7/123 (5%)
Query: 32 EIAENFSKLACNAHTICNSELRPL-GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
E+ + + KL N+ ++ + R + G LY S+ NHSC PN VFEG VRA++
Sbjct: 178 EVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFVFEGSRLSVRAIKR 237
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
+ G E ISY+ + + R++ L+ Y FTC CPRC+ D ++ +C +
Sbjct: 238 IEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRCL------DSARDNLMLCVKCPN 291
Query: 151 DGC 153
+ C
Sbjct: 292 ESC 294
>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
carolinensis]
Length = 683
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 111/265 (41%), Gaps = 11/265 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L T L+PV+S++NHSC PN + F+GR A RA + +P+G E+L Y R+
Sbjct: 413 LATALFPVLSLLNHSCDPNTSVTFDGRTATARASRAIPRGQEILHCYGPHRCRMKPSERR 472
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDSDDKGFTCQQ 170
+ L QY F C C C + S + C+ G FL S
Sbjct: 473 QRLLAQYFFECRCSACTDETGPESASGSTLFCCPTCRAPMQGEGLFLCCSSAS------- 525
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
C + KE + E+ + + L + G V + ++L P + + +
Sbjct: 526 CKAMLPKEGFQHQLRELRQHMEAAMDLLAQGKADRSVEQLRKCHLEAQQLLSPGHLVVGE 585
Query: 231 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
+ L ++ +W EA A+ + +I V + Y +G + + ++ + A
Sbjct: 586 IEDHLAQVHATQGNWPEAAAHLRRSIEVVEAHYGAGSIEVGQELFKLAQVLFNGRFVPEA 645
Query: 291 IKSMTEAVEILRITHGTNSPFMKEL 315
+++ +A +L + G P ++EL
Sbjct: 646 LQAAEKAEAVLSVQLGPRGPEVQEL 670
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 27/209 (12%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKVNSNSLRTLVQYLNEKSDSFIPNSNYVVIDAKMSVLQR 382
Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELED 244
LQ+ TR + + + +L D
Sbjct: 383 LQQGTRGLLRGAGAHTRLRYSRDITQLMD 411
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305
>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 798
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 586
Query: 173 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 203
L V+ +E++ EV ++LSK + +L CG +H
Sbjct: 587 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 646
Query: 204 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 240
EV + + +++ + H ++ ++ +K+ +
Sbjct: 647 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 706
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
+ +A +C+ +I + +R+Y H ++G + ++ GD+ A + + +I
Sbjct: 707 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 766
Query: 301 LRITHGTNSPFMKELILKLEEAQAEA 326
+G+++ + + L++ A+A
Sbjct: 767 FSKYYGSHAETLFSYLPCLKQETAKA 792
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCAGLVV 305
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PN+ FE + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC CPRC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKNFTCKCPRC-----SDPTEKGAYISGLYCRDTSCNGLVV 305
>gi|16553701|dbj|BAB71564.1| unnamed protein product [Homo sapiens]
Length = 313
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 113/271 (41%), Gaps = 52/271 (19%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y ++ RQ
Sbjct: 36 LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQ 95
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
+ L+ QY F C CP C + + G R + F C CG
Sbjct: 96 QKLRSQYFFDCACPAC-------QTEAHRMAAGPRW---------------EAFCCNSCG 133
Query: 173 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
+++ + S SC + VS ++ +LQ +L Q
Sbjct: 134 APMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ---------DLQQQV 170
Query: 233 EKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
K+L ELE + L+ CQ + ++ + H ++G + LGD + +
Sbjct: 171 RVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKS 227
Query: 291 IKSMTEAVEILRITHGTNSPFMKELILKLEE 321
+ ++ ++ + HG +S M + KL +
Sbjct: 228 ATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 258
>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
Length = 558
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 67/315 (21%)
Query: 26 PEISINEIA--ENFSKLACNAHT-------------ICNSELRPLGTGLYPVISIINHSC 70
P+++I +A + +L CNA + +S L TG++PVIS++NHSC
Sbjct: 287 PDVTIWGVAMLRHMLQLQCNAQAMTTIQRTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 346
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 347 SPNTSVSFISTVATIRASQWIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 406
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
+ + G R + F C CG +++ + S
Sbjct: 407 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 442
Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 443 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 481
Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 482 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 538
Query: 307 TNSPFMKELILKLEE 321
+S M + KL +
Sbjct: 539 PSSVEMGHELFKLAQ 553
>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Monodelphis domestica]
Length = 845
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 12/266 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L TG +PVIS++NHSC PN L F G + +++A + + +G E+L Y E TRQ
Sbjct: 513 LATGFFPVISLLNHSCRPNTSLSFRGSVGIIQASRLIAQGEEILHCYGPHECRMDVTTRQ 572
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC---Q 169
+ L+ QY F C C C + + + + G+RC C L D F C +
Sbjct: 573 QKLRSQYFFDCHCQAC-QNEEVCPVSTAPKPSGFRCPS--CKAALQGDD----FLCCDNR 625
Query: 170 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 229
C S+ +++ ++ K L L + + ++ + + +
Sbjct: 626 TCTASASRAQLEHQVQDLQQQVKAALELLRDDRSAQAIQLLLGCQQDAENFLSSEHMLMG 685
Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+ + L + L DW + A+ Q ++ V + + LG + + ++ +
Sbjct: 686 EIEDHLAQAYASLGDWSRSAAHLQKSLLVVEARHGPASVELGHELFKLAQVLFNGFAVSK 745
Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
A+ ++ +A ++L I + + M+EL
Sbjct: 746 ALCAIEKAEKVLSIHYDPENEQMQEL 771
>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
Length = 544
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 72/132 (54%), Gaps = 2/132 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPLGTG 58
M++I + + +I+ + ++ +++E I +KL N++ + LG
Sbjct: 195 MAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMICHYGAKLELNSYNFHSPLADRLGFY 254
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
L+P ++INHSC N+V+ +G V+A++ + KG E+L+SYI+ TRQ L E+
Sbjct: 255 LHPYAALINHSCDYNSVVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKTRQTELSER 314
Query: 119 YLFTCTCPRCIK 130
Y F C CP+C K
Sbjct: 315 YYFKCRCPKCAK 326
>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
jacchus]
Length = 798
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y ++ RQ
Sbjct: 522 LATGIFPVISLLNHSCSPNTSVSFVSTVATIRASQRIGKGQEILHCYGPHKSRIGVAERQ 581
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F CTCP C
Sbjct: 582 QKLRSQYFFDCTCPAC 597
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ +++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC CPRC+ D ++ + + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRCL-----DPTEKGSFISGLYCRDTNCSGLVV 305
>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
Length = 584
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 64/312 (20%), Positives = 126/312 (40%), Gaps = 59/312 (18%)
Query: 29 SINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
S+++ + F KL + +S E P+G +Y S+ NHSCLPN F R +R
Sbjct: 240 SVDDPPDQFRKLTSVGDALTSSLEQVPVGQAIYKAASLFNHSCLPNIHAYFLSRTLFIRT 299
Query: 88 VQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
++V G + +SY G + R + L ++Y F C C C +L D +L
Sbjct: 300 TEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYSFRCQCRGCSQLNLSD-----LVLNA 354
Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE----------------EIKKIASEVNI 189
+ C + C+G +L + G T + R+ E +I +AS
Sbjct: 355 FCCVNHNCAGVVLESTIINGETRKLNNFPRAPEKQKFDSHLQGHKLNIVDINDVASLALK 414
Query: 190 LSKKTLALTS-----CGNHQEVVSTYKMIEK----------------------------- 215
+ +L + CG+H+++ ++++ I K
Sbjct: 415 FNNSSLHIHPGFCLHCGSHRDLDASHEAINKAWSYIKRLQEAIISKDISGTTLLDASRAL 474
Query: 216 -LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
+ + H ++ ++ + + L + + D++ A +C+ +I + Q +Y H ++G +
Sbjct: 475 GILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHCKESIKILQTLYDPDHIVIGYEL 534
Query: 275 YTCGKLEWFLGD 286
++ L D
Sbjct: 535 VKLASIQLSLDD 546
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E ++ + E K+ NA + +L +G L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNVRKAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A + + G E+ ISY + R++AL EQY F CTC RC
Sbjct: 216 AERPIQAGDEIEISYTDYTMPLSARREAL-EQYSFECTCARC 256
>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
Length = 391
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 47/65 (72%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDV 229
Query: 92 PKGAE 96
G E
Sbjct: 230 EAGEE 234
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%)
Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
+EAL Y T+ Y+ YP HP+ G+Q GKL+ G A+K++ A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367
Query: 306 G 306
G
Sbjct: 368 G 368
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 3/142 (2%)
Query: 7 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
K L A A + +LQ E ++ + E K+ NA +++ G L+P +S++
Sbjct: 171 KDFELQATGAVVYAGLLQSDE-TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMV 229
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NHSC+PNA + FE R A ++A + + G E+LISYI+ RQ++L+ Y F C C
Sbjct: 230 NHSCVPNAYITFEKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLR-LYHFQCNCV 288
Query: 127 RC-IKLGQFDDIQESAILEGYR 147
RC L ++ IQ + G +
Sbjct: 289 RCKDDLNVYEVIQACGVSPGVK 310
>gi|380801673|gb|AFE72712.1| SET and MYND domain-containing protein 4, partial [Macaca mulatta]
Length = 212
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 112
L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y ++ RQ
Sbjct: 26 LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQ 85
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F CTCP C
Sbjct: 86 QKLRSQYFFDCTCPAC 101
>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
Length = 558
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 84
E++ + E K+ NA T + G GL+ ++++NHSC+PNA++ F GR A
Sbjct: 184 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 238
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+RA++ + +G E+ ISYI+ S R+KAL E Y F C C RC
Sbjct: 239 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCTRC 281
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 19/131 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 120 LFTCTCPRCIK 130
F CTC C K
Sbjct: 271 YFDCTCEHCQK 281
>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
castaneum]
Length = 393
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 126/302 (41%), Gaps = 47/302 (15%)
Query: 33 IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAVLVFEGR 81
I ++ ++LACN+ TI C+S L + +G++P +SI+NHSC PN F
Sbjct: 92 ITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVTNYFMSD 151
Query: 82 LAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
VV+A++ + + E+ Y I+ G RQ A KE Y F C C C D E
Sbjct: 152 TIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS-----DPAHE 206
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+L+ Y C C G + + C CG + +KI E ++K + S
Sbjct: 207 LDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQKYMESDS 260
Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI--- 256
+ ++ + K+ EK+ K + F + +L ++L D + Y L +
Sbjct: 261 SNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFHLWLENE 316
Query: 257 -------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+Y+ H L G Q C L+ FL + E IK EA IL +
Sbjct: 317 KARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EAKAILNL 373
Query: 304 TH 305
+
Sbjct: 374 YY 375
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 80/207 (38%), Gaps = 16/207 (7%)
Query: 43 NAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
NAH + NS GLYP +S++NHSCLPN V G VRA++ V G ++ ++
Sbjct: 306 NAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVT 365
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
YI R + L + F C C RC+ + LE +C GC G+LL
Sbjct: 366 YINIMEPRRIRARELMDTKHFACACERCV---SPLETHPDRFLEAVKCGAGGCDGWLLEQ 422
Query: 161 -----SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN------HQEVVST 209
S+ + C C + +I A N H
Sbjct: 423 RADNLSEGDTWKCTVCATAVPSRSVDGRTGPQDITESFRAAWHQAANILRYKGHAAGRPV 482
Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLI 236
++ + + + HP V+ + L+
Sbjct: 483 FEAVLARAEGVLHPMHVSAFDSLMPLV 509
>gi|443926535|gb|ELU45157.1| zf-MYND domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 584
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 46/264 (17%)
Query: 31 NEIAENFSK------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----- 79
N + F K + NAHT+ L P+G L PV ++INHSC+PN V+VF
Sbjct: 189 NSVKAGFGKRYRGSIIEANAHTLSTPSLTPIGVALSPVAALINHSCVPNCVVVFPKASET 248
Query: 80 ---GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-----IKL 131
++ A+ + G EV I + T G K L+E+Y F C CP C +K
Sbjct: 249 KKVPNEMLIIAIGKILPGEEVQIPH--TLGK-FADNKILQERYFFKCHCPNCEMTSLVKT 305
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
D + +RC C GFL S +Q N+ S
Sbjct: 306 PYVDSRR------AFRCASKTCEGFLPMPSMQSQLGQKQ---------------RFNVYS 344
Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV--NLMQTREKLIKILMELEDWKEAL 249
+ AL++ Q + K K ++ S + T + + + L D + A
Sbjct: 345 AISCALSNPRRSQMQSDSEKKGSKGRRNFNSLVSYFRSPGLTYHQSLTLCAGLLDPERAF 404
Query: 250 AYCQLTIPVYQRVY-PQFHPLLGL 272
Y IP+ R + P HPLL L
Sbjct: 405 KYTSNLIPMVSRFFHPSAHPLLAL 428
>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
24927]
Length = 526
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 54/92 (58%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+ KL N+ ++ P+G L ++ NHSC PNAV++F+GR +R+++ + K E
Sbjct: 196 YCKLLINSASVVTQTFDPIGLCLAYQSAMFNHSCDPNAVMMFDGRQLSIRSLKEITKDTE 255
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ ISYI+ S R++ LK +Y F C+C C
Sbjct: 256 ITISYIDNLASRKERKEELKSRYFFDCSCSLC 287
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 2/120 (1%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
++ + E K+ NA +++ G L+P +S++NHSC+PNA + FE R AV++A
Sbjct: 172 TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAVLKAE 231
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 147
+ + G E+LISYI+ RQ++L+ Y F C C RC L ++ IQ + G +
Sbjct: 232 RDLEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 290
>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 26 PEISINEIAENFSKLACNAHTICNSE---LRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
P + E+ + CN+HT+ SE P+GT +Y S NHSCLP+A EG
Sbjct: 198 PGYGVEELVRLMCCVRCNSHTLYASEEWPSEPVGTAVYLKGSAFNHSCLPSAEFYNEGTS 257
Query: 83 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
VR+V+ + G EV ISY+ + R++AL QY F C C R
Sbjct: 258 LRVRSVRDISAGEEVTISYVPVTETLWDRRQALWRQYKFDCECDR 302
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 121/287 (42%), Gaps = 27/287 (9%)
Query: 14 QIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL--RPLGTGLYPVISIINHSC 70
+ A LV +L P +S E + +++ N+H + + R LG GLYP+ S+INHSC
Sbjct: 208 ETAKLVMPLLPKPARLSAREFVKVCARINTNSHRQHHMFVPQRILGVGLYPLASLINHSC 267
Query: 71 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
PN G +R + V +G E+ SYI+ S R+ L E F C C RC
Sbjct: 268 QPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRCSP 327
Query: 131 LGQFDDIQESA--ILEGYRCKDDG---CSGFL-LRDSD----DKGFTCQQ-CGLVRSKEE 179
I +S L G++C + C G L L D + +K +C CG ++ +
Sbjct: 328 -----PITDSVDRYLSGFQCPNKAKTSCDGLLVLPDGEGLASEKPVSCTAGCGETKTAGD 382
Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY------HPFSVNLMQTRE 233
+ K I+ + +L + + + M++KL Y HP+ + Q
Sbjct: 383 LLKAQKRAEIMLTASQSLYY--DQKRTSDSKTMLDKLLTGGYSPDVQLHPYHPTVFQAHV 440
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
I + L ++ +C+ + + V P H YY G L
Sbjct: 441 DSINVSDALGYTHMSIDHCEAVLRCAEAVLPMNHLETSNYYYYQGML 487
>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
africana]
Length = 478
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 44/308 (14%)
Query: 19 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
VN L+ WP + S I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VNTFLEYWPPQNQQFSTQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210
Query: 73 NAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
N ++F G V+++ H +S + G T Q +K L TC C+ L
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQ------MSKAQDPGKT-GPQFVIKLVGLTTCFFS-CLAL 262
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
G A+ EG KDD L DD + +E+ K E+ LS
Sbjct: 263 GV------EALPEGL--KDD----LFLGVKDDP----------KPSQEVVK---EMIQLS 297
Query: 192 KKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 247
K TL S G + EVV + + Q+ ++ ++ ++ + ++L L+ ++E
Sbjct: 298 KDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFEE 357
Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 307
A Y + + Y ++Y + LG+ G W G+ E + +A IL +THG
Sbjct: 358 AADYAKRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGP 417
Query: 308 NSPFMKEL 315
+ P K+L
Sbjct: 418 SHPITKDL 425
>gi|388854233|emb|CCF52152.1| uncharacterized protein [Ustilago hordei]
Length = 938
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 16/118 (13%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S E+ + + A N+ T+ +S+L PLG ++P +++INH+C PNA +VF
Sbjct: 481 SAKELLDLVCQFASNSFTLTDSDLNPLGVCMHPSMAMINHACTPNAAVVFPFGGAAKNGQ 540
Query: 79 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
E R+ + A++ + G E+LISY++ A + R+ LK++Y F C C C K
Sbjct: 541 QKWNDGEDRIMQLVALRAIEPGEELLISYVDIADTYQDRRNYLKKRYCFDCRCELCRK 598
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ +++NH C+PN+ FE + + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC C RC D ++ A + G C+D CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ +++NH C+PN+ FE + + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC C RC D ++ A + G C+D CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|344296426|ref|XP_003419908.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Loxodonta
africana]
Length = 427
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/319 (21%), Positives = 137/319 (42%), Gaps = 39/319 (12%)
Query: 5 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
+EK+ L+ + IA L + L++P+ + F+++ CN TI + EL +G+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDT--ESLVVLFAQVNCNGFTIEDEELSHVGSAIFP 200
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
+ S +P ++ VF+ L + V SYI+ T R L++ Y F
Sbjct: 201 DPVV---SGVPGSLTVFQAHLF---SQMFASLCGMVFTSYIDLLYPTEDRNDRLRDSYFF 254
Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
TC C C + ++A +E R D +RD +VR +
Sbjct: 255 TCECQECTTKDK-----DNAKVE-IRKLTDPPKAEAIRD------------MVRYARNVI 296
Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
+ + L C QE +S+ ++ +V ++ + + + +
Sbjct: 297 EEFRRAKHYKSPSELLEMCALSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 346
Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 347 MQDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 406
Query: 302 RITHGTNSPFMKELILKLE 320
+ HG + P++ E+ ++E
Sbjct: 407 EVAHGKDHPYISEIKHEIE 425
>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
Length = 713
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 126/302 (41%), Gaps = 47/302 (15%)
Query: 33 IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAVLVFEGR 81
I ++ ++LACN+ TI C+S L + +G++P +SI+NHSC PN F
Sbjct: 412 ITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVTNYFMSD 471
Query: 82 LAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
VV+A++ + + E+ Y I+ G RQ A KE Y F C C C D E
Sbjct: 472 TIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS-----DPAHE 526
Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+L+ Y C C G + + C CG + +KI E ++K + S
Sbjct: 527 LDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQKYMESDS 580
Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI--- 256
+ ++ + K+ EK+ K + F + +L ++L D + Y L +
Sbjct: 581 SNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFHLWLENE 636
Query: 257 -------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
+Y+ H L G Q C L+ FL + E IK EA IL +
Sbjct: 637 KARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EAKAILNL 693
Query: 304 TH 305
+
Sbjct: 694 YY 695
>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Taeniopygia guttata]
Length = 797
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 120/294 (40%), Gaps = 22/294 (7%)
Query: 39 KLACNAHTI-CNSELRP------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
+L CNA I EL P L T +PV+S++NHSC PN + F G A V
Sbjct: 491 QLQCNAQAITVMQELGPGDGAVVDKKPVRLATAFFPVLSLLNHSCCPNTSMSFSGTAATV 550
Query: 86 RAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
RA Q + G EVL Y + ++ L QY F C CP C++ + +I
Sbjct: 551 RASQPISSGQEVLHCYGPHWCRMRVAERQQLLRQYFFECRCPACLEELESGVKSVVSIRN 610
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
+ C C + + D + + C S+ + ++ KK L + G
Sbjct: 611 SFCCPK--CQAQMQGEEDTLCCSNEACATSASRNHLSGRLQDLQQQIKKALGMLRVGKAD 668
Query: 205 EVVSTYKMIEKLQ---KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 261
+ + KM+ K Q P + + + + L ++ L W+EA + + +I + +
Sbjct: 669 QAI---KMLLKCQMDAGTFLSPEHLLMGEMEDHLAQVYATLGKWQEAARHLKKSIEIVEM 725
Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
+ G + + ++ + A+ ++ A IL + G + ++EL
Sbjct: 726 HHGPSSVETGHELFKLAQILFNGFAVSEALSTIQRAEGILSVHFGPQNAQIQEL 779
>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 422
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
G +Y S+ NHSC PNA V GR + A++ + E+ ISY+ + R+
Sbjct: 251 FGLAIYLEASLFNHSCYPNAARVQRGRSIDIIAIRDIEPNEEICISYLNITNGSHERKDH 310
Query: 115 LKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
LK YLF C C RC + ++I S I R K CSG+L +G +QC
Sbjct: 311 LKNNYLFDCVCIRCTQTNPDIENIVRSFICRNPRVK---CSGYLYL---PQGSDSRQCNF 364
Query: 174 VRSKE--EIKKI 183
+ KE EIK+I
Sbjct: 365 CQWKESSEIKRI 376
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
++ + E K+ NA +++ G L+P +S++NHSC+PNA + FE R A ++A
Sbjct: 192 TLKQAMEVLCKMQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAE 251
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 147
+ + G E+LISYI+ RQ++L+ Y F C C RC L ++ IQ + G +
Sbjct: 252 RDIEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 310
>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
Length = 666
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 16/139 (11%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQ 112
+G GL+P +++NHSC P A++ + G AVVRA++ + K E+ I+Y T ++R+
Sbjct: 472 IGIGLFPTAALLNHSCNPEAIVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDEELSRR 531
Query: 113 KALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 166
LKE + F CTC C++ + D Q + E C LL D D+K
Sbjct: 532 HQLKETHFFHCTCKACLEGWPMWLEMDQNQPDWLCE-------ACGSILLSDKIEDNKFA 584
Query: 167 TCQQCGLVRSKEE-IKKIA 184
C++C ++ E+ I K+A
Sbjct: 585 KCKKCSHRQNLEDAINKLA 603
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ +++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC CPRC D ++ A + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTKCSGLVV 305
>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 564
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 84
E++ + E K+ NA T + G GL+ ++++NHSC+PNA++ F GR A
Sbjct: 190 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 244
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+RA++ + +G E+ ISYI+ S R+KAL E Y F C C RC
Sbjct: 245 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCMRC 287
>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
Y34]
gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
P131]
Length = 542
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ N+ +++ GT L PV+++ NHSC+PNAV++F R A +RA + +G+E+
Sbjct: 188 KIQTNSFDRFDADTGASGTFLDPVLAMANHSCVPNAVVLFWRRKAYLRAEMPIKQGSEIS 247
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
ISYI+ RQ+ L Y FTC CPRC
Sbjct: 248 ISYIDYTKPVRFRQEDL-WLYHFTCKCPRC 276
>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 593
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)
Query: 31 NEIAENFSK---LACNAHTICNSELR--------PLGTGLYPVISIINHSCLPNAVLVFE 79
+++ +N SK L AH + + R P+G L P+++ NH C PNA + F+
Sbjct: 201 DDLEKNNSKWDDLLLQAHALSTNAFRVESNVGNGPIGLCLDPLLARANHCCYPNAAITFD 260
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
G+ A +RA+ + G ++ ISYI+ RQ AL+E + F C C RC
Sbjct: 261 GKRATLRALFPIKNGEQIFISYIDETQRQEVRQAALEETWFFKCRCSRCTNF 312
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 403 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 458
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 459 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 505
>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
Length = 781
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/364 (19%), Positives = 153/364 (42%), Gaps = 52/364 (14%)
Query: 7 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISI 65
+ ++L +Q+ +++ + + +++ KL+ + + +S E P+G +Y S+
Sbjct: 423 QAIILVSQVRVNAMAVVRMKSVDAHCPSDHLVKLSHSGDALTSSVEQVPVGQAIYTAGSL 482
Query: 66 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTC 123
NHSC PN F R +R +H+ G + +SY G R K L+++Y F C
Sbjct: 483 FNHSCQPNVHAYFLSRTLFIRTTEHLATGCPLELSYGPQVGQRDCKDRLKFLQDKYSFRC 542
Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK-K 182
C C + D +Q + +RC D C G +L S + R+ E +
Sbjct: 543 HCNGCSIVNLSDLVQNA-----FRCIDLNCVGVVLDRSVINSEIKKLKNFPRAPERQRLD 597
Query: 183 IASEVNILSKKTLALT------------SCGN-------HQEVVSTYKMIEKLQ------ 217
+ +V+ L+ L L+ +CG+ H+ + + + ++LQ
Sbjct: 598 LCLQVDDLAHLALELSNGPLHIQPGCCLNCGSYCDLEAVHEGMRTAWIYFKRLQDAIVVK 657
Query: 218 -----------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 260
+ + H ++ ++ + + L + +ED++ A +C+ +I + +
Sbjct: 658 KISTTVITDASRALGALRSILHAYNKHIAEAEDILAQAFCLVEDFQSARDHCRASIKILE 717
Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
+Y H ++G + ++ +GD +A+ S+ I G+++ F+ + L
Sbjct: 718 MLYGPDHIVIGYELIKLSTIQLSMGDL-SAVDSINRLGAIFERYFGSHADFIFPYLQTLR 776
Query: 321 EAQA 324
E A
Sbjct: 777 EKLA 780
>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
aries]
Length = 391
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 46/65 (70%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229
Query: 92 PKGAE 96
G E
Sbjct: 230 EAGEE 234
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%)
Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
+EAL Y T+ Y+ YP HP+ G+Q GKL+ G A+K++ A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367
Query: 306 G 306
G
Sbjct: 368 G 368
>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
scrofa]
Length = 788
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 77/171 (45%), Gaps = 22/171 (12%)
Query: 25 WPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
+PE +I +A + +L CNA I + ++ G ++P++++ HSC PNA + F +
Sbjct: 481 YPEWNIWGVAMLRHMLQLQCNAQAI--TAIQQTGDSIFPIVNVFXHSCSPNASVSFISTV 538
Query: 83 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
A VRA Q + KG EVL Y E+ R + L+ QY F C CP C + Q E
Sbjct: 539 ATVRASQPIRKGQEVLHCYGPHESRMGVAERHQKLRSQYFFDCNCPPCEREKQRPSALEP 598
Query: 141 -----------AILEG---YRCKDDGCSGFLLRDSDDKGFT--CQQCGLVR 175
A+L+G C C + R D QQ GL R
Sbjct: 599 RREAFCCHRCRALLQGDDVLSCGSTSCPASVRRHHLDSRLQDLQQQVGLAR 649
>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
Length = 73
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 46/73 (63%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
GTG++P +NHSC PN+V VF G ++A++ +P G E+ ISYI+ TRQ+ L
Sbjct: 1 GTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEEL 60
Query: 116 KEQYLFTCTCPRC 128
+ Q+ F C C RC
Sbjct: 61 QTQFCFYCQCHRC 73
>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
Length = 403
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 110 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 165
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 166 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 212
>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 675
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 78/168 (46%), Gaps = 8/168 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G GL+P I++ NHSC P + F G VVRA++ + G E+ +Y I T R+
Sbjct: 478 IGGGLFPTIALFNHSCNPGVIRYFIGTTMVVRAIRSIAAGEEISENYGPIFTMSPEAERK 537
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDSDDKGFT--C 168
+ L+ QY F C C C+ D + IL ++C D G C L +D F C
Sbjct: 538 RTLRWQYWFDCNCEACVGHWPLLDDIDPTILR-FKC-DTGRECGNILPVKTDTNEFMIRC 595
Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
+CG + + K + + L + G H E +S+Y I KL
Sbjct: 596 PKCGKNTNILKGLKALQDTDSLFRVASNNLEDGKHVEALSSYLKILKL 643
>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
Length = 675
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 85/179 (47%), Gaps = 6/179 (3%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
A T+ ++ +G G+YP ++++NHSC P V F G +VRA++ + G E+ +Y
Sbjct: 466 GAKTLAKAKSMFIGGGVYPTVAMLNHSCNPGVVRYFIGTTMIVRAIRTISVGEEISENYG 525
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLR 159
I T + R++ L+ QY F C C C D + IL ++C+ C LL
Sbjct: 526 PIFTTTPEIERKRKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLLV 584
Query: 160 DSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
SD K F C +CG + + K+ + + L + G +++ + Y I KL
Sbjct: 585 RSDTKEFMIGCAKCGKSTNILKGLKVLQDTDALFRVASTNLEEGRNEQALKAYLEILKL 643
>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
1558]
Length = 885
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 1/102 (0%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
T ++P +S+INHSCLPN V + G +AVVRA+ + G E+ ISY TR K
Sbjct: 553 TAIWPTVSMINHSCLPNTVHIAWGDVAVVRALNDMSAGTELTISYFSREQPYETRAAKAK 612
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
QYLFTC C C + I+ ++E + LL
Sbjct: 613 -QYLFTCDCNLCAADRMDESIRRGKLMEKFDMAKQASRNILL 653
>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
Length = 306
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 2/87 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LGT L S +NHSC P A + FEGR VRA+ +P G E+ +Y++ +GS +RQ
Sbjct: 9 LGTTLDLAASFMNHSCNPGAFVFFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQAT 68
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESA 141
+ +Y F C C RC +D+Q+ A
Sbjct: 69 TEAEYFFQCHCVRCED--DLEDLQQIA 93
>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 389
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 62/119 (52%)
Query: 12 YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 71
+ + + VN L + + + ++L NA T+ N +G + P S NHSC
Sbjct: 73 FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 132
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
PNA + F+G + ++A+Q + ++ ISYI+ RQ L+++Y FTC CP+C +
Sbjct: 133 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWGKRQSELQKRYFFTCKCPKCAQ 191
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP+ ++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
K + FTC CPRC D ++ A + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
Length = 680
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 28 ISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 85
I++NEI E +S + N H + + R G G++P S +NHSC PNA E + V
Sbjct: 379 ITLNEIIELYSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVF 438
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
R+++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 439 RSLRPLKKGEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQC 481
>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe]
Length = 483
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
F +LA NA + S LG L ++ +NHSC PN ++F+G + + + + + K +
Sbjct: 163 FCRLAVNAMNLVTSSFDSLGMCLDTILCRLNHSCDPNCQIIFDGAIVQLVSKRDIKKDEQ 222
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ ISYI+ RQK L ++Y F+C CPRC
Sbjct: 223 LFISYIDIRLPKSIRQKQLLKKYFFSCYCPRC 254
>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 448
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ CN I + + + LG + P+ S NHSC PN V V L +A+ +PKG+++
Sbjct: 272 KIRCNQFGIWSKKDKCLGVSVTPIASYFNHSCCPNIVDVRGTTLLEFKALHFIPKGSQLC 331
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG-FL 157
ISY++ +T +RQ L Y F C C RC G D ++ I + Y CSG +
Sbjct: 332 ISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKG---DSIDNWISQFYCQGTKKCSGTYF 388
Query: 158 LRDSD 162
L D +
Sbjct: 389 LEDEE 393
>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 377
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 62/119 (52%)
Query: 12 YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 71
+ + + VN L + + + ++L NA T+ N +G + P S NHSC
Sbjct: 62 FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 121
Query: 72 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
PNA + F+G + ++A+Q + ++ ISYI+ RQ L+++Y FTC CP+C +
Sbjct: 122 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELQKRYFFTCKCPKCAQ 180
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 54/96 (56%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
IAE +KL N+ N LG ++P +++NHSC NA + F+ V+A+ +
Sbjct: 233 IAEYAAKLDVNSFNFDNVMDERLGIYMHPYAALMNHSCDYNATVAFDDDRLHVKALHPIK 292
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
KG ++ I+Y++T RQK L ++Y FTC C +C
Sbjct: 293 KGEQIFITYVDTTNPYKIRQKELSDRYYFTCRCSKC 328
>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
Length = 576
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 25 WP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
WP + S+ E K+ N+ + ++ GT L +++ NHSC+PNAV+ F GR
Sbjct: 206 WPATDASLAIAVEAMCKIHTNSFDRSDVDVAYTGTFLDAHLAMANHSCVPNAVVSFAGRK 265
Query: 83 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A +RA Q + G EV ISYI+ RQ+ L + Y FTC C RC
Sbjct: 266 AFLRAEQAIRAGDEVTISYIDYTKPKSVRQRGL-DLYHFTCDCARC 310
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 59/106 (55%), Gaps = 1/106 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+ +A +KL N+ + S +G ++P +INHSC N+ + F+G V+
Sbjct: 175 EMDEGTVASYAAKLDLNSFNLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVK 234
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
A++ + KG ++ ISYI+T R+ LKE+Y F C C +C K+G
Sbjct: 235 AMRPIKKGEQIFISYIDTTTPYDIRRNELKERYFFDCQCTKC-KMG 279
>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
Length = 405
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 45/73 (61%), Gaps = 2/73 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 115
GL+P S +NHSCLPNAV G + VV A + + +G+EVLI+Y+ A RQ L
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVVGGIMVVVAARKIRQGSEVLINYLGRASLRPVGERQGQL 181
Query: 116 KEQYLFTCTCPRC 128
E Y F+C CPRC
Sbjct: 182 AEGYHFSCDCPRC 194
>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 459
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+ SI E K+ NA + + G L PV+++ NHSC+P+A + F+ R AV+R
Sbjct: 89 DFSIPRAREILCKIQTNAFNRLDPDAGMTGIYLDPVLAMANHSCMPSAFVSFDQRNAVLR 148
Query: 87 AVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRC 128
A + + +G E+ I Y++ T + RQ+ALK Y F C CPRC
Sbjct: 149 AWRDIKEGDEITICYVDVTLPNKAARQEALK-LYHFECRCPRC 190
>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
Length = 333
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 40 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 95
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 96 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 142
>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
magnipapillata]
Length = 1044
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 40 LACNAHTIC----------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
L CNAH + +SEL+ +G+ +Y +S++NHSC P+ V G V+RA++
Sbjct: 385 LPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTCVLRAIK 444
Query: 90 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
H+ +G+E++ +Y + S + RQ L EQY F C C C
Sbjct: 445 HIKEGSEIVDNYGFLYAVESKVIRQSHLMEQYYFACQCEAC 485
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 64/121 (52%), Gaps = 12/121 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 79
S ++ + S+ NA T+ L PLG + P ++++NHSC PNAV+VF E
Sbjct: 207 SAGDVVDLISRFTTNAITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQE 266
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
+ V+ A++ + EV +YI+T RQK L++ Y FTC+C C + D +E
Sbjct: 267 PVMQVI-ALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCA--AEDPDPRE 323
Query: 140 S 140
S
Sbjct: 324 S 324
>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
Length = 750
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 118/272 (43%), Gaps = 11/272 (4%)
Query: 51 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
E R + + +YP S++NH+C P+ ++ F + V RA ++ G+ + Y
Sbjct: 468 EQRRIASAIYPTASLLNHACDPDVLVSFVDGVLVARATHNIAPGSGITHCYGPHVNHMPR 527
Query: 111 --RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 166
RQK L +QY FTC C C + ++ + + C C ++ S +
Sbjct: 528 EERQKLLYKQYFFTCQCSACTSDEEMENTR--LCFSAFACPRCKCP---MKTSPLEPSLA 582
Query: 167 TCQ--QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
CQ +C L +S EE + + +L K + QE + ++ + + ++ HP
Sbjct: 583 RCQNKKCTLEKSIEEELSHSRQAELLFFKAVRTMERIGVQEALGLFQECLRTRTQILHPH 642
Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
+L +T + L + + D+K A +C + ++ + + + +L +
Sbjct: 643 HKDLAETHDALARCYAMIGDFKLASQHCLQSSEAVEKAFGSTSVEYAHELHKLSQLLFND 702
Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
+ A+ + +A +L +G N P ++EL+
Sbjct: 703 RQAKKALPMIDKAASLLATYYGRNHPDVQELV 734
>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
Length = 497
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 17/188 (9%)
Query: 58 GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GLYP + ++ HSC+PN V +G + R +P+G ++ Y T T TRQ A
Sbjct: 200 GLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAA 259
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQ 170
LK FTC CPRC+ D + +C+ C L+ S D+ + C
Sbjct: 260 LKSSKYFTCRCPRCL-----DPTELGTHFSSLKCQK--CDNGLIVSSNPLDDEAEWRCSN 312
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
C ++K + ++ + L+ N E ++ ++ + K + + HP
Sbjct: 313 CEFKTRGAAVQKA---IQVMQAEIDQLSCLENGPEKLTAFEKLYKKYRSILHPLHFIKTS 369
Query: 231 TREKLIKI 238
R LI++
Sbjct: 370 IRHSLIEL 377
>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 426
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 67/128 (52%), Gaps = 2/128 (1%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
M+ +K+ L Y I ++ ++ E + +E+ F+KL NA + N +L G L
Sbjct: 125 MNPEHKKRFLTYLDI--IIYMLGGSLETNESEVLNIFTKLMINASFMLNEKLIDFGGCLC 182
Query: 61 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
S I+HSC PNA+ +F G VV+A+ + +V ++Y++ + RQ+ LK Q+
Sbjct: 183 LEFSAIDHSCRPNAIYMFNGHTLVVKALCEIANFDDVRVAYVDMSQPRSIRQELLKNQFF 242
Query: 121 FTCTCPRC 128
F C C C
Sbjct: 243 FDCNCEEC 250
>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
Length = 634
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 78/159 (49%), Gaps = 18/159 (11%)
Query: 36 NFSKLACNAH----------TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
+ L CNAH ++ SE +G G+Y +S+ NHSC PN F G VV
Sbjct: 408 HLQSLPCNAHEISELQLSLKSVATSEAAEIGAGIYGTLSLFNHSCEPNVTRFFYGDKCVV 467
Query: 86 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDI--QES 140
RA +P EV+ +Y + RQ++L+ QY F C C C++ + ++ Q+
Sbjct: 468 RAFSSIPCRGEVVDNYGILSALTPRKQRQESLQSQYYFKCNCHACLEDSPLYSELIKQDV 527
Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
L+ C+ +G +L +D K C++CG+ +S E+
Sbjct: 528 PQLKCANCR-MALAGEIL--TDGKLVKCEKCGVPQSLED 563
>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
Length = 582
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 65/314 (20%), Positives = 131/314 (41%), Gaps = 24/314 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
DE+++ + ++ + L + I E + F K + N+H+I + +G L +S
Sbjct: 211 DEEKMTHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGNEVGMALDLGVS 270
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
NHSC P +VF+G +R + VP + ISYI+ S R++ LK ++
Sbjct: 271 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSKFVRRRDLKSRW 328
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
F C C RC+ D ++ A L RC + C +L ++ C +C + +
Sbjct: 329 YFNCECTRCM------DPEDDA-LTAIRCANPSCDAPILTSETEEPMNIACDKCKTIVEE 381
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
++K + +L + + + + + +++ HP +V + + R L
Sbjct: 382 NDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNVYVARLRTALFH 434
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ L + + Q+ Y+ +P+ +G Q K + + A+ +A
Sbjct: 435 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 493
Query: 298 VEILRITHGTNSPF 311
+ I + G + P+
Sbjct: 494 MNIFEVCFGLDHPY 507
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 120/285 (42%), Gaps = 35/285 (12%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
G +Y +S +NHSC+PNA L F G +A + + + G ++ ISY++ S TRQ+
Sbjct: 183 GVAIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQ 242
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
LKEQY F CTC C + Q+ C S L S
Sbjct: 243 LKEQYYFDCTCRLCTANANGESRQQ--------CISQFDSAVDLYQS------------- 281
Query: 175 RSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQT 231
+K++ ++ +IL++ ++L L C +H ++ +IE L + + + L
Sbjct: 282 -TKKQTNDASASQSILTRALESLELIVCKSHSVLIRLRTDLIEILVGQRNYAKVIELSND 340
Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-WFLGDTENA 290
+ + + WK + + + I YP + P+ + K W + D +
Sbjct: 341 QLDAFRCV-----WKASQSI-EKGISESDNAYPVW-PMESVSSVDIFKASMWTMDDDLDR 393
Query: 291 IKSM-TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
+ + +A LRITHG P +L LE+ E + S D
Sbjct: 394 VAELGKQAAHNLRITHGEKHPMYLDLTEGLEDVLRELEARHSGVD 438
>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 704
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/278 (21%), Positives = 124/278 (44%), Gaps = 34/278 (12%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
+ T +YP +S++NH+C PN + F + VRA++ + +G E+ Y G +T RQ
Sbjct: 436 IATAVYPTVSLMNHACQPNVIASFRKGIISVRAIEKIMRGDEIQHCYGPQVGHMTTSDRQ 495
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---------DGCS-----GFLL 158
+AL QY FTC C C + + D +E ++ +C C+ G L+
Sbjct: 496 QALLNQYCFTCRCRACTRKPRTFDKEEDLCIKCPQCGQPLNIQTSMCGKCAERIDVGVLI 555
Query: 159 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
+ + G T L+ +E ++ ++ +EV+S K + +
Sbjct: 556 HELTNAGTT-----LIGLEEMFSAAVNDDTLM-------------REVISKTKSCIDVLE 597
Query: 219 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
++ P + L + + K +EL+++KEA + P + + + + + Y
Sbjct: 598 RIIIPPDMQLATAYDDMAKCHVELDEFKEAALWLAKATPSIESRFGRDSIEVAHELYKLA 657
Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
++ + + A++++ A+E+ +G ++ + EL+
Sbjct: 658 QIYFNGKEIAPAMETIDRALELFIRHYGNSNEEVLELV 695
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 123/291 (42%), Gaps = 34/291 (11%)
Query: 24 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 83
+W E +EI + L N H + +E P +Y S++ HSC PN F +
Sbjct: 224 KWSE---DEILKIVGILQVNGHEVPLTE--PPSVAIYNNASMLEHSCRPNLSKSFTSKKE 278
Query: 84 VV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
+V A + +G + ISY + T RQ L++ LF CTC RC+ +F +
Sbjct: 279 IVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGTYLSALK 338
Query: 143 LEGYRCKDDGCSGFLLRDSDDK---GFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALT 198
G++ KD CSG LL ++ G+ C +C GLV KE NIL + +
Sbjct: 339 CSGFK-KDSNCSGLLLPENLKNWYGGYICNKCRGLVDGKE-------ITNILDRARVDHE 390
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL----------MELEDWKEA 248
+ E +K I + L P L+ + L +I+ + ED
Sbjct: 391 AMQKDNE-QHCHKYIAHYGRWL-GPNHHLLVDVKISLSQIMGGGDPNAIQKISDEDLMTK 448
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD-TENAIKSMTEAV 298
+ CQ I V+QRV P ++G + +L L + ++SM AV
Sbjct: 449 MKICQELIDVFQRVCPAEARVIGTTRF---ELHAALAEFARRGVESMNSAV 496
>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
Length = 492
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 52/94 (55%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+ ++ N+ T+ S + PLG P+ NHSC PNA ++ +G RA++ + K E
Sbjct: 181 YGRVFTNSLTLIGSAMEPLGICFDPLACSANHSCDPNAFVIMDGAQLSFRALKPIAKDEE 240
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+LISYI+ RQ L +Y FTC+C +C +
Sbjct: 241 ILISYIDGTLPFERRQTDLTRRYYFTCSCTKCAQ 274
>gi|32564211|ref|NP_871849.1| Protein SET-18, isoform b [Caenorhabditis elegans]
gi|25005018|emb|CAD56600.1| Protein SET-18, isoform b [Caenorhabditis elegans]
Length = 507
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 133/319 (41%), Gaps = 28/319 (8%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
DE+++ + +I + I E + F K N+H+I + +G L +S
Sbjct: 146 DEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGNEVGMALDLGVS 205
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
NHSC P +VF+G +R + VP E ISYI+ S R++ L ++
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSKYIRRRDLNSRW 263
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
F C C RC+ D ++ A L RC + C +L ++ C++C + +
Sbjct: 264 YFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNIACEKCKTI-VE 315
Query: 178 EEIKKIASEVNILSKKTLALT---SCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLMQTR 232
E+ K A E KTL + C E + K+ E L K ++ HP +V + + R
Sbjct: 316 EDTVKAAQEY----MKTLPASFDPKCPAEIEALPG-KLKELLAKAEQILHPSNVYVARLR 370
Query: 233 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 292
L + L + + Q+ Y+ +P+ +G Q K + + A+
Sbjct: 371 TALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMP 429
Query: 293 SMTEAVEILRITHGTNSPF 311
+A+ I + G + P+
Sbjct: 430 YAFDAMNIFEVCFGLDHPY 448
>gi|281347298|gb|EFB22882.1| hypothetical protein PANDA_013199 [Ailuropoda melanoleuca]
Length = 1019
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 43/59 (72%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
S++ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E
Sbjct: 130 SQVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEE 188
>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
Length = 802
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
+ L+ QY F C CP C + Q E +RC
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGR--EAFRC 619
>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
Length = 650
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 84/177 (47%), Gaps = 9/177 (5%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G GLYP +++ NHSC P + G VR V+++P + V +Y + T R+
Sbjct: 448 IGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIPADSMVAENYGPLFTQVRRDERR 507
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDS--DDKGFTC 168
L QY FTC C C++ + ++ +RC D G CS LL + +D C
Sbjct: 508 DTLLHQYRFTCQCVPCVENWPLFTEMDPGVIR-FRC-DSGKICSNVLLIPAAVNDFMVKC 565
Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
+CG + + K + ++L K L S G ++ + Y +M+E + + L P+
Sbjct: 566 TECGEHTNIMKGLKSLQDTDMLFKTATRLHSAGEYEAALRKYIEMMETMSEVLVPPY 622
>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
Length = 393
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 90
I E ++K+ A N+E G+GLY + S INHSC+PNA + + V++AV
Sbjct: 267 IDELYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKAVAP 322
Query: 91 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ KG E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 323 IEKGDEICISYLDECQLERSRHSRHKTLRENYIFVCQCPKC 363
>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
Length = 544
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 54/93 (58%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N++ + LG L+P ++INHSC N+V+ +G V+A++ + KG E+
Sbjct: 234 AKLELNSYNFHSPLADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEI 293
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
L+SYI+ RQ L E+Y F C CP+C K
Sbjct: 294 LVSYIDATNPYKIRQTELLERYYFKCRCPKCGK 326
>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 386
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 120/302 (39%), Gaps = 47/302 (15%)
Query: 29 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEG------- 80
S +IA + N T+ + L+ +G GL+P + ++NH C PN +V
Sbjct: 116 SREDIAHVLGIIKTNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHGNQSATN 175
Query: 81 ------RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
R +RA+ V +G E+ +SY++ ++ RQ+ L E + F CTC R
Sbjct: 176 PALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFHCTCQR------- 228
Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI-ASEVNILSKK 193
CS L K E++++ A L K
Sbjct: 229 ------------------CSQHL------KDDAMTAAADGADAAEVQEVTAFSQECLEKI 264
Query: 194 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 253
LT ++QEV++ + Q+ L + ++ ++ L L + EA AY
Sbjct: 265 DRCLTG-RDYQEVLALCSACLQKQRGLLADTHLCHLRVLSAAVEALSYLRCFPEAAAYAL 323
Query: 254 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK 313
+ Y+++YP H LG+ G G E A + + +A L ++HG N +
Sbjct: 324 RLLQGYRKLYPPNHAQLGVAAMRAGVAHLQAGMPEAAHELICQAYRTLLVSHGPNHSTTR 383
Query: 314 EL 315
+L
Sbjct: 384 DL 385
>gi|222613106|gb|EEE51238.1| hypothetical protein OsJ_32096 [Oryza sativa Japonica Group]
Length = 718
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/351 (22%), Positives = 149/351 (42%), Gaps = 58/351 (16%)
Query: 9 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
+LL QI N + ++ + + + FS + + +C+ E + +Y S N
Sbjct: 352 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 409
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 124
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 410 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 469
Query: 125 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 148
CP+ LG ++ +I E + C
Sbjct: 470 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 529
Query: 149 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+D G L SD D GF +C+ Q L + + S++N L + L S
Sbjct: 530 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 585
Query: 200 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 257
N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+ +I
Sbjct: 586 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 644
Query: 258 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
+ +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 645 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 695
>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
Length = 562
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E ++ + E K+ NA +++ G L ++++NHSC+PNA + F+ R AV+R
Sbjct: 189 EETLEKAREVLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGFDRRTAVLR 248
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A + + +G E+ ISYI+ A RQ+AL+ Y F C CPRC
Sbjct: 249 AERPIQEGEEITISYIDNALPKSARQEALR-LYHFRCDCPRC 289
>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
Length = 650
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 86/179 (48%), Gaps = 11/179 (6%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G GLYP + + NHSC P F G +R ++++P G+ V +Y + T R+
Sbjct: 447 IGAGLYPTLVLFNHSCDPGITRYFVGNAVYIRTIKNIPAGSMVAENYGQLYTRAGRRERR 506
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRC-KDDGCSGFLLR--DSDDKGFTC 168
K L E Y F C C C + + + E + ++ ++C ++GC LL +S + C
Sbjct: 507 KLLAENYKFDCCCQACEE--DWPSLHEMNPMIRRFKCGANEGCGNELLFKLNSTENEMEC 564
Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
+CG + + +V+ L++ A L S G+ + +S Y +I L + L P+
Sbjct: 565 NKCGGLTEVNASFETLKQVDFLNRYNDAARLYSQGDFERALSKYAALINSLDEILVPPY 623
>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%)
Query: 32 EIAENFSKLACNAHTIC---NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
E+ + + CNAHT+ S L P+G +Y S NHSC+P+A G VR++
Sbjct: 162 ELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVPSAEFCNVGTSLTVRSL 221
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+ V G E+ +SY+ T + R++ L+ QY F C C RC+
Sbjct: 222 RRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARCV 262
>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
morsitans]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%)
Query: 23 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 80
L PE N++ E + L G+GLY + S INHSCLPNA + F
Sbjct: 253 LPLPETDKNKLDEYIDDIYHKVGEFAGEFLNNEGSGLYLLQSKINHSCLPNAQVTFPYSN 312
Query: 81 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ V++A+Q + G E+ ISY++ S +RQK LKE Y+F C C +C
Sbjct: 313 DIVVLKALQPIQVGEEICISYLDEGQLERSRHSRQKILKENYIFVCECFKC 363
>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
Length = 802
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601
>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
Length = 348
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/317 (23%), Positives = 133/317 (41%), Gaps = 31/317 (9%)
Query: 28 ISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+S +E+ E + N H I + + +G GLY + +HSC PN + +A +R
Sbjct: 37 VSKDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNTIYTCNSFVATLR 96
Query: 87 AVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
+ + SYI+ +T R+K LK+ + F C C RC DD+ S+IL
Sbjct: 97 GLTAGVDLRNLNSTHYSYIDLINTTQQRRKLLKDTWYFECHCTRC---DDPDDVLLSSIL 153
Query: 144 EGYRCKDDGC----SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK--KTLAL 197
K C +D + + TC +C + S E + + + + + K + +
Sbjct: 154 CPNCPKKRECLCIFGNVPYKDRNTQIITCPKCHNIVSPEYVVEAIAAMRFIDKIVENREV 213
Query: 198 TSCGNHQEVVSTYKMIEKLQKKLYHPFS-VNLMQTREKLIKILMELEDWKEALAYCQLTI 256
Q + + + L+K+ S VN+ K+I++L+ L D + CQL +
Sbjct: 214 EKMSREQSI----RFLTDLKKRFSELLSKVNVFLC--KVIQLLIPLIDISD----CQLLL 263
Query: 257 PVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
++ + +P HP +G+ Y + A+ +A E+++ T G+
Sbjct: 264 DLHLETEECVRFCFPYNHPAVGIHYRSIATFFLKCKQPHRALLYYKKAYEVVKFTLGSKH 323
Query: 310 PFMKELILKLEEAQAEA 326
P E L++A E
Sbjct: 324 PMTIETDAMLKDASREV 340
>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
Length = 802
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601
>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
Length = 802
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601
>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 644
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 127/294 (43%), Gaps = 13/294 (4%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+++ + E +K + + ++ + +Y S++NHSC PN + ++G +RA
Sbjct: 346 NVHAVTEVATKTDSSTSFVATTQQIRIAVAVYGTASMLNHSCTPNVIAGYDGNQLTIRAT 405
Query: 89 QHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+ + KG EVL Y R K L++QY FTC C C + Q + I S L
Sbjct: 406 EMIKKGGEVLHCYGPRVSDMFRDERLKVLRDQYYFTCKCMFC-GIPQ-EAITPSGALRCP 463
Query: 147 RCKDDGCSGFLLRDSDDKG----FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 202
RC D SG + + + G F C + +E KI + + +
Sbjct: 464 RCSD--MSG-VYQTPEVTGQCCNFDCHHSWKLGETDE--KIENARQLFRVGKTLIGDKNK 518
Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
+E +ST K+Q+ + + + L +T + L ++ A Y +L+I +
Sbjct: 519 VKECLSTLHKAYKIQQSVLYKHNKELAETEDCLALCYASFGNYHYAAKYLKLSIKSNIIL 578
Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
Y + +G + Y ++ + + + + +AV+I+ +G N +KEL+
Sbjct: 579 YGEDSIEIGHELYKLAQILFNCKNVTEGLGVIDKAVKIISDHYGENHSDVKELL 632
>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 669
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 15/123 (12%)
Query: 40 LACNAHTICNSELR----------PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
+ CNAH I EL +G+GLYP +S++NHSC P G VVRA++
Sbjct: 449 IPCNAHEISEYELWRSDITKCHFVEVGSGLYPTMSLVNHSCDPVVTRNCYGETCVVRAIR 508
Query: 90 HVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGY 146
++ KG E+ Y+ RQ LK QY F C C C+ + DI++ +L Y
Sbjct: 509 NIYKGEEITDNYGYLYPVHDKSERQTRLKWQYFFECKCDACVHDWPLYPDIED--LLPKY 566
Query: 147 RCK 149
+C+
Sbjct: 567 KCE 569
>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 793
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/351 (22%), Positives = 148/351 (42%), Gaps = 58/351 (16%)
Query: 9 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
+LL QI N + ++ + + + FS + +C+ E + +Y S N
Sbjct: 427 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 484
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 124
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 485 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 544
Query: 125 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 148
CP+ LG ++ +I E + C
Sbjct: 545 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 604
Query: 149 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 199
+D G L SD D GF +C+ Q L + + S++N L + L S
Sbjct: 605 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 660
Query: 200 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 257
N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+ +I
Sbjct: 661 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 719
Query: 258 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
+ +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 720 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 770
>gi|71019315|ref|XP_759888.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
gi|46099457|gb|EAK84690.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
Length = 739
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 65/118 (55%), Gaps = 16/118 (13%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S E+ + + +CN+ T+ +S+L LG ++ ++++NH+C+PNA +VF
Sbjct: 423 SATELLDLVCQFSCNSFTLADSDLNALGVCMHASMAMLNHACIPNAAVVFPFGGAAKGGQ 482
Query: 79 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+ ++ + A++ + G E+LISY++ + RQK LK++Y F C C C K
Sbjct: 483 QRWNDGDDKIMQLVALRAIEPGEELLISYVDLCDTVEERQKQLKQRYCFDCRCDLCCK 540
>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 386
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 54/117 (46%), Gaps = 1/117 (0%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K CN I + +G + P S NHSC+PN V +G +++ + KG ++
Sbjct: 231 KTRCNQFGIWTKNDKCIGVAVSPSSSYFNHSCIPNCTDVRDGSNMTFKSLYPIKKGDQLT 290
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
ISYIE R+ LK Y F C CPRC D ++ I + Y C C+G
Sbjct: 291 ISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSIDSMDNWISKFY-CSQKKCTG 346
>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
TFB-10046 SS5]
Length = 531
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 115/296 (38%), Gaps = 58/296 (19%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 84
S ++ + S+ N+ T+ + L P+G + P +++ NHSC PNAV+VF A
Sbjct: 188 SAGDLVDLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVVVFPSAAAPLGDE 247
Query: 85 ----VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
+ A++ + EVL SY++ + R++ LKE Y FTC C C L DD
Sbjct: 248 TLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSACSSL---DDPSVG 304
Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
C GC G G C +CG A S
Sbjct: 305 DPRYAVWCP-KGCGGVCRIPELGSGIVICLRCG-----------------------AEYS 340
Query: 200 CGNHQEVVSTYKM-IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
E + ++ E L+K TR +EL D AL +P+
Sbjct: 341 PAKQDEALDRVRLGTEALEKA-----------TR-------LELSDPDRALRLTTNMLPL 382
Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI-LRITHGTNSPFMK 313
Y HPLLGL L FL + + K++ EA+ + RI G F++
Sbjct: 383 IGMPY-SSHPLLGLLRLHSSLLISFLAASPSN-KTLDEAIAMNFRIQRGMQDVFVE 436
>gi|403414107|emb|CCM00807.1| predicted protein [Fibroporia radiculosa]
Length = 304
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 9/104 (8%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 79
S ++ + S+ NA T+ +S L +G + P ++++NHSC PN V+VF E
Sbjct: 10 SPGDLVDLISRFTTNAFTLTSSSLSAIGVAISPTVALVNHSCDPNVVIVFPRNPSTSHAE 69
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
+ V A++ + G E+L +Y++ RQK LKE Y FTC
Sbjct: 70 EPMMTVVAIKPILPGEEILSAYVDVTQPRELRQKELKETYNFTC 113
>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 572
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 40 LACNAHTICN---SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
L CNAH I S L +G G Y +S+INHSC PN V ++RA++ + KG E
Sbjct: 384 LPCNAHEISEYYCSNLLEIGAGAYATLSLINHSCDPNVVRHSCRNTVILRAIKPIKKGEE 443
Query: 97 VL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+ Y RQKAL +QY FTC C CI
Sbjct: 444 LFDNYGYHYATHELQERQKALLKQYYFTCQCKACI 478
>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
Length = 256
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 10/106 (9%)
Query: 29 SINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVV 85
S++E I E ++K+ T N+E G+ LY + S INHSC PNA +VF + +
Sbjct: 125 SVDELIDELYNKMDEVVGTFLNNE----GSALYAMQSKINHSCTPNAEIVFPKSNHVLAL 180
Query: 86 RAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
RA++ + G E+ ISY++ S +RQK L+E YLF C C RC
Sbjct: 181 RALRDLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERC 226
>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
Length = 849
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 103/233 (44%), Gaps = 19/233 (8%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
AH IC E + L+ S NHSC PN + + R + +P + ISY
Sbjct: 391 GAH-ICGIEHVVVAQALFVRGSKFNHSCSPNVHVSYVKRTLRAHCTEALPAFCPLEISYG 449
Query: 103 ETAGSTMTRQKA--LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR- 159
G + Q+ L++ YLF C+C C +LG D+ A Y C GC G +L
Sbjct: 450 VQVGESEKSQRKIWLRDHYLFECSCEPC-RLGLVSDLLHIA----YYCCQPGCQGIVLDP 504
Query: 160 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA-LTSCG---NHQEVVSTYKMIEK 215
D+ D+ L ++ + NIL +K L L CG N E ++ + +
Sbjct: 505 DASDEDIL-----LPGGSGTVRFLIEHCNILPQKELEDLQICGSRLNGNEAIADGLRLLE 559
Query: 216 LQKKLYHPFSVNLMQTREKLIKILMEL-EDWKEALAYCQLTIPVYQRVYPQFH 267
+K +HPF+ L Q + + ++ E + A+ + + +I + +R+Y + H
Sbjct: 560 NGRKCFHPFNKQLAQMEDIVARLYCEAGQQPAAAIVHARRSIEILERIYGKDH 612
>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
Length = 687
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)
Query: 9 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
+LL QI N + ++ + + + FS + +C+ E + +Y S N
Sbjct: 321 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 378
Query: 68 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTC-- 123
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 379 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 438
Query: 124 ----------------TCPRCIKLGQF---------DDIQESAILEGYRC---------- 148
CP+ LG ++ +I E + C
Sbjct: 439 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 498
Query: 149 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 195
+D G L SD D GF C C RS+ ++ + S++N L +
Sbjct: 499 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 551
Query: 196 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 253
L S N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+
Sbjct: 552 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 609
Query: 254 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
+I + +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 610 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 664
>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 740
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 32 EIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQ 89
EI + + K+ NAH + +G G+YP S INH+CLPN + L + R+ +
Sbjct: 451 EIFDLYPKILFNAHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHGLILYRSSR 510
Query: 90 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI--KLG 132
+ KG E+ SY++ + R+K LK QY F C C RC+ KLG
Sbjct: 511 DILKGEEITTSYLDILKPKLQRRKDLK-QYSFVCQCERCLNEKLG 554
>gi|302838330|ref|XP_002950723.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
nagariensis]
gi|300263840|gb|EFJ48038.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
nagariensis]
Length = 610
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKAL 115
GL+P + +NH+C PN V G VVRAVQ +P+G E+L+SY+ + RQ AL
Sbjct: 246 GLWPYFNQLNHACAPNCVHYVVGSTMVVRAVQVIPEGCELLVSYLGRDDLAPRQVRQAAL 305
Query: 116 KEQYLFTCTCPRC 128
K +Y F C C RC
Sbjct: 306 KARYGFICNCTRC 318
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 53/93 (56%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N+ + LG L+P ++INHSC N+V+ +G V+A++ + KG E+
Sbjct: 234 AKLELNSFNFHSPLADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEI 293
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
L+SYI+ RQ L E+Y F C CP+C K
Sbjct: 294 LVSYIDATNPYKIRQTELLERYYFKCRCPKCGK 326
>gi|17509133|ref|NP_492772.1| Protein SET-18, isoform a [Caenorhabditis elegans]
gi|3924878|emb|CAB03382.1| Protein SET-18, isoform a [Caenorhabditis elegans]
Length = 501
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/314 (21%), Positives = 126/314 (40%), Gaps = 24/314 (7%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
DE+++ + +I + I E + F K N+H+I + +G L +S
Sbjct: 146 DEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGNEVGMALDLGVS 205
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
NHSC P +VF+G +R + VP E ISYI+ S R++ L ++
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSKYIRRRDLNSRW 263
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
F C C RC+ D ++ A L RC + C +L ++ C++C + +
Sbjct: 264 YFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNIACEKCKTIVEE 316
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
+ +K + L +E+++ + I HP +V + + R L
Sbjct: 317 DTVKAAQEYMKTLPASFDPKCPAEILKELLAKAEQI-------LHPSNVYVARLRTALFH 369
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ L + + Q+ Y+ +P+ +G Q K + + A+ +A
Sbjct: 370 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 428
Query: 298 VEILRITHGTNSPF 311
+ I + G + P+
Sbjct: 429 MNIFEVCFGLDHPY 442
>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
Length = 435
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 50/93 (53%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N+ N +G L+P ++INHSC NA + F+ ++A + + KG ++
Sbjct: 122 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 181
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
ISYI+ R+ L+E+Y F C C +C K
Sbjct: 182 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 214
>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
Length = 824
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 1/98 (1%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV-LVFEGRLAVVRAVQH 90
EI E ++ N HT+ N + G G YP+ NHSC+PN V + + RA +
Sbjct: 509 EIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFNHSCVPNVTWYVDDNGMLAFRATRT 568
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ +G E+ ++Y + + R+ L +QY F C CP+C
Sbjct: 569 ISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCPQC 606
>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
Length = 415
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 50/93 (53%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N+ N +G L+P ++INHSC NA + F+ ++A + + KG ++
Sbjct: 102 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 161
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
ISYI+ R+ L+E+Y F C C +C K
Sbjct: 162 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 194
>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
Length = 420
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 310 GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAGEEICISYLDCCQRERSRHS 369
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLFTC+CP+C L Q DD
Sbjct: 370 RNKILRENYLFTCSCPKC--LAQADD 393
>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 676
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)
Query: 21 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
+++Q+ I+E+A + ++ + ++ + +G G+Y +S+ NHSC P + F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
VVRA++ +P G E+ +Y I T R++ L+ QY F C C C +
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564
Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
+ IL ++C+ G +L D+++ C +CG + + K + + + +
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623
Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
G HQE + +Y I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644
>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 676
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)
Query: 21 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
+++Q+ I+E+A + ++ + ++ + +G G+Y +S+ NHSC P + F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504
Query: 81 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
VVRA++ +P G E+ +Y I T R++ L+ QY F C C C +
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564
Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
+ IL ++C+ G +L D+++ C +CG + + K + + + +
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623
Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
G HQE + +Y I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644
>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
Length = 324
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 50/93 (53%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N+ N +G L+P ++INHSC NA + F+ ++A + + KG ++
Sbjct: 11 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 70
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
ISYI+ R+ L+E+Y F C C +C K
Sbjct: 71 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 103
>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
Length = 380
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 48/76 (63%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+++ CN +I + EL+ +G GLYP +S++NH C PN +++FEG+ +RAV+ + E
Sbjct: 175 LARVTCNCFSISDGELQDVGVGLYPSMSLLNHDCQPNCIMMFEGKRLTLRAVRVIRSAEE 234
Query: 97 VLISYIETAGSTMTRQ 112
+ ISY + + R+
Sbjct: 235 LTISYTDILAPSKDRR 250
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 4/96 (4%)
Query: 234 KLIKILME----LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
+L+ + M+ L+D++ AL Y + Y+ + HP ++ GKL+ FLG E
Sbjct: 281 RLLDLAMDACISLDDYETALEYGNRALGPYKLYFSDPHPSRAVELLRVGKLQHFLGRLEE 340
Query: 290 AIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
A S T+A I+ +THG ++ +L E QAE
Sbjct: 341 ARGSFTQAYSIMSVTHGAAHALTNQVQRRLSECQAE 376
>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 323
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+A KL N+ N +G L+P ++INHSC NA + F+ ++A + +
Sbjct: 5 VANVLIKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQ 64
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
KG ++ ISYI+ R+ L+E+Y F C C +C K
Sbjct: 65 KGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAK 102
>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 236
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 61/105 (58%), Gaps = 14/105 (13%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVR 86
I+ + ENF K + N+E G+GLY + S INHSC PNA + F +LAVV
Sbjct: 122 IDAVYENFEKGVGD---FLNNE----GSGLYELQSTINHSCSPNAEVTFPHSNYQLAVV- 173
Query: 87 AVQHVPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 128
A ++ G E+ ISY++ + S +RQK L+E YLFTC C +C
Sbjct: 174 ATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218
>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
Length = 495
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 54/102 (52%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
++ +A +KL N+ + S +G ++P +INHSC N+ + F+G V+
Sbjct: 175 DVDEGTVASYAAKLDLNSFNLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVK 234
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A+ + KG ++ ISYI+T R LKE+Y F C C +C
Sbjct: 235 AMHPIKKGEQIFISYIDTTTPYDIRCNELKERYFFDCQCTKC 276
>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
Length = 383
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 273 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 332
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLFTC+CP+C L Q DD
Sbjct: 333 RNKILRENYLFTCSCPKC--LAQADD 356
>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
Length = 393
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 56 GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S INHSC+PNA + + V++A+ + KG E+ ISY++ S +
Sbjct: 286 GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKALAPIQKGDEICISYLDDCQLERSRHS 345
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
R K L+E Y+F C CP+C DI
Sbjct: 346 RHKTLRENYIFVCQCPKCRAQASDPDI 372
>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
Length = 561
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 49 NSELRP------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV------QHVPKGAE 96
N +++P LG+ L P+ + +NHSC PN + E R V +A+ Q + KG E
Sbjct: 466 NKKMQPKLRQYCLGSALIPIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQE 525
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+L SYI+ + RQ L QY FTC C +C K
Sbjct: 526 LLTSYIDESNPFKDRQSILSSQYGFTCNCNKCSK 559
>gi|343424810|emb|CBQ68348.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 855
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 64/116 (55%), Gaps = 16/116 (13%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S E+ + + +CN+ T+ +S+L PLG ++ ++++NH+C PNA +VF
Sbjct: 415 SATELLDLVCQFSCNSFTLTDSDLNPLGVCMHASMAMLNHACTPNAAVVFPFGGAAKSGQ 474
Query: 79 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
E ++ + A++ + G E+LISY++ A + R+ LK++Y F C C C
Sbjct: 475 QKWRDGEDKIMQLVALRAIEPGEELLISYVDIADTFEERRSYLKKRYCFDCRCELC 530
>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
Length = 716
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 7/112 (6%)
Query: 49 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 106
N E +G +Y V S+ NHSC PN F R +RA +HV G + +SY G
Sbjct: 452 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 511
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
RQK LK++Y F C C C +L D +L +RC + C G +L
Sbjct: 512 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVL 558
>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
Length = 649
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 92/209 (44%), Gaps = 26/209 (12%)
Query: 36 NFSKLACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
N L NAH I SEL+ +G GLYP +++ NHSC P + G
Sbjct: 419 NLQFLQYNAHEI--SELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRYYRGNSVC 476
Query: 85 VRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESA 141
VR V+ +P G+ V +Y + T S R+ L QY FTC C C +F D+ +
Sbjct: 477 VRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACAGDWPKFADMDSNV 536
Query: 142 ILEGYRCKDDG---CSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLA 196
+ R K DG CS L ++ F C +CG + + K + ++L K
Sbjct: 537 L----RFKCDGGKNCSNVLEIPAEINEFMVQCTECGEHTNIMKGLKSLQDTDMLFKSATK 592
Query: 197 LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
L G ++ + Y +M+ L + L P+
Sbjct: 593 LHEAGEYEFALKKYVEMMNTLDEVLVPPY 621
>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
Length = 429
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 98/223 (43%), Gaps = 15/223 (6%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
P I F K++ N+ ++ NS +G L +S NHSC P+ + + GR A++
Sbjct: 136 PHSHPETIKSIFKKVSVNSFSLSNSTGNSIGIALCVKLSAANHSCKPSTRVCYRGRTAML 195
Query: 86 RAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
V +P E SYI+ + TRQ LK++Y F C C C D + + +
Sbjct: 196 VPVDDRLPTSLEGACHSYIDELQTLSTRQATLKKKYKFDCACEGCT------DDERNGRM 249
Query: 144 EGYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCG 201
E + C + C +R+ + G +C CG L R + E+ + A E I S+ L S
Sbjct: 250 EAWAC--EICKTGWIRNVE--GASCNPCGYVLTRDQYELCRTAEEAAIASRPKLENDSIP 305
Query: 202 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 244
K++E Q L H ++V+ + L + + D
Sbjct: 306 LETRRHLCEKLLELFQDTL-HAYNVHRIPVLRCLFVATLAIRD 347
>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
Length = 660
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 99/219 (45%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
IA+LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 IASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y I T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGNTIHINSVRPIEAGLPINENYGPIYTQDERSERQARLKELYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC+ + C+ + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDII--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALTKFIDLIRIMYEVLAPPF 636
>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
Length = 513
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)
Query: 48 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAG 106
C + LR GLYP+ ++ NH C+PN F+ + + V A + G E+ +SY +
Sbjct: 213 CTTSLR----GLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLPIAAGEEITMSYTDLLW 268
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
T +R++ LK F+C C RC +F Q A+L C D CSG LL S+ F
Sbjct: 269 DTSSRRQFLKATKHFSCNCSRCSDPSEFGS-QLGALL----CAKDDCSGHLL-PSNPLNF 322
Query: 167 ----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
C +C + + ++I+ I S++N L + +T E+V K IE KL
Sbjct: 323 ISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTP----HEIV---KFIETALSKLVP 375
Query: 223 PFSVNLMQTREKLI 236
+ L+ + ++I
Sbjct: 376 TSNFILIDVKFRII 389
>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Taeniopygia guttata]
Length = 404
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 294 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 353
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLFTC+CP+C L Q DD
Sbjct: 354 RNKILRENYLFTCSCPKC--LAQADD 377
>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
Length = 729
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 43 NAHTICNSELRPLGT--GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
A + NSE PL + G++P +++INHSC PNAV V VVRA + +P+G EVL++
Sbjct: 601 GAAAVANSE--PLTSRMGVWPEVALINHSCGPNAVAVMLYDRLVVRATRRIPRGREVLLN 658
Query: 101 YIETAGS---TMTRQKALKEQYLFTCTCPRC 128
++ + G+ R+ L + Y F C C RC
Sbjct: 659 WLGSQGALAPARERRAQLSQMYDFACRCARC 689
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 110/260 (42%), Gaps = 75/260 (28%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGR 81
++ + S+ N+ T+ L PLG + P++++INHSC PNAV+V+ E +
Sbjct: 200 DLVDLISRFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRSTSDPSQQEPQ 259
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
+ VV A++++ E+ +YI+T RQ AL Y FTC C C + D
Sbjct: 260 MQVV-AIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDGMD------ 312
Query: 142 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 201
+ E C + C G TC + +E N LS+ CG
Sbjct: 313 LREAIWCPKN-CGG-----------TC-------------PLPTEENTLSR-------CG 340
Query: 202 NHQEVV-STYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-- 255
+ +V ST +++ L Q+ L S+ L +T + L QLT
Sbjct: 341 KCKAIVSSTDSVLDALRVGQEALQKATSLQLTETAKAL-----------------QLTTN 383
Query: 256 -IPVY--QRVYPQFHPLLGL 272
IP+ ++ P HPLL L
Sbjct: 384 IIPILTSSKLTPSSHPLLAL 403
>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 493
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)
Query: 28 ISINEIAENFSKLACNAHT---ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
+S I +++ A NA T + L P GL+P+ ++INHSC+ NAV V + V
Sbjct: 188 LSYTNIEQSWRTTALNAPTPLDLSRLSLPPRLLGLFPLAAMINHSCVGNAVRVLVDDVMV 247
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
V A +P G E++ SY R L+ Y F C CPRC
Sbjct: 248 VHATTDLPAGTELVWSYGPPTTPFAQRNTRLRAHYGFVCDCPRC 291
>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 483
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/314 (21%), Positives = 131/314 (41%), Gaps = 37/314 (11%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V+ + + + ++ +E + + C I E+R GLYP ++++NHSC+
Sbjct: 143 NIVDYLRK--RLKLDRFSEKYIQTICGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVS 200
Query: 73 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
N ++ + R+ + R +P G E+ SY + T+ R++ L E F C CPRC
Sbjct: 201 NTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTILRREHLLEGKHFACACPRC- 258
Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
D + + +C D+G L + + C C + + ++KI +
Sbjct: 259 ----SDPTELGTHMSSLKCNKCDNGIVLPLDSLDSESTWKCTHCDFSTNGQAVRKILRII 314
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
A++ + + + + + K + + HP L R L ++ ++++
Sbjct: 315 QAQVDAAEAISG-ADGADAIYKRETVMKKYRLVLHPHHAFLSMLRHSLTQMYGRVDEYLL 373
Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA 290
+ + C+L + V V P + + G+ Y K +W G + A
Sbjct: 374 DDLPDVVLEHKVDMCRLLLQVLDVVEPGYSRVRGMTLYELHAPLLFLAKDQWNAGVIDEA 433
Query: 291 -IKS-MTEAVEILR 302
+KS M EA IL+
Sbjct: 434 KLKSKMIEAANILK 447
>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
Length = 354
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 55/98 (56%)
Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
QK + H F++ ++T + + LE W++A Y + +P Y Y + HPL+GL Y T
Sbjct: 234 QKDILHSFNIQHIRTLDTAFLAAVNLEYWEDAELYSKQLLPGYLLYYGEVHPLIGLLYLT 293
Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
GK++ L + A++ + +A +L ITHG P M+E
Sbjct: 294 MGKIQLHLRKLKQALEILKKASAVLTITHGDKHPIMRE 331
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 35/56 (62%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+ ++ N+ I + ++ +G G+Y S+I+HSC PNAV+ FEG +VR + +P
Sbjct: 160 YGRICINSFNILDLDMNSIGVGIYLGASVIDHSCKPNAVVTFEGTTIIVRTLIDLP 215
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E+S I+ +KL N+ + N+ LG L+P +I NHSC NA + F+G ++
Sbjct: 168 EMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + K ++ I+YI+ RQ L+ +Y FTC C +C
Sbjct: 228 AIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHCSKC 269
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 87/186 (46%), Gaps = 17/186 (9%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 115
GLYP S++NH C+ N VF+ +R A + +P G ++ +Y + +T+ R+ L
Sbjct: 225 GLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTLDRRLHL 284
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCG 172
K F C C RC D + +L +C GC G +L + + C CG
Sbjct: 285 KSTKHFWCQCSRCA-----DPRELGTLLSAVKCT--GCGGAVLSQNPLYQMSNWECSDCG 337
Query: 173 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
V+ E++K++ + K+ L N E++ + I K + HP S ++++ +
Sbjct: 338 SVQKVEQVKRVHDSARMELKQIAQLAR--NRPELLEDF--IRKYSGAI-HPDSCHVIEAK 392
Query: 233 EKLIKI 238
L+++
Sbjct: 393 YALVQL 398
>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 413
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 6/122 (4%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ CN I + +G + P S NHSC+PN V +G +++ + KG ++
Sbjct: 248 KVRCNQFGIWTKNDKCIGMAVSPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQIN 307
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-----LGQFDDIQESAILEGYRCKDDGC 153
ISY+ ST R+ LK Y F C CPRC G+ +D ++ I + Y C C
Sbjct: 308 ISYLALDKSTKRRRDYLKFGYYFHCQCPRCNSTDIDPTGKLEDSLDNWISKFY-CHQKKC 366
Query: 154 SG 155
+G
Sbjct: 367 TG 368
>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
antarctica T-34]
Length = 882
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 65/119 (54%), Gaps = 17/119 (14%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S E+ + + + N+ T+ +S+L PLG ++ ++++NHSC PNA +VF
Sbjct: 463 SATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAAVVFPFGGAAKSGQ 522
Query: 79 -------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+ ++ + A++ + G E+LISY++ A + R+ ALK++Y F C C C K
Sbjct: 523 QNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKRYCFECRCELCRK 581
>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
Length = 505
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 61/118 (51%), Gaps = 3/118 (2%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
I + F K+ N I + E P G L V S +NHSC PNA + FEG +R+++ +
Sbjct: 193 IQDLFCKVLTNVMCINHQEGSP-GITLDLVGSFVNHSCDPNAFVFFEGSQLRMRSLKPIN 251
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
G E+ ++Y E + R++ L Y F C C RC K + ++ E + EG+ +D
Sbjct: 252 AGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKK--EHKELHELSFREGFSIRD 307
>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
Length = 896
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ NA +++L +G L P +++ NHSC+PNA++ F GR A++RA + + E+
Sbjct: 535 KIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGRKAILRAEKPIKVDEEIE 594
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
ISY + R+ AL Y F C CPRC K
Sbjct: 595 ISYTDYTFPRSKRKHALAP-YFFDCQCPRCEK 625
>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 541
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
+ ++N AE K+ N+ +++ GT L P+++++NHSC+PNAV++F R A +R
Sbjct: 173 DANLNLAAEILCKIQTNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLR 232
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A V G ++ ISYI+ R++ L E Y F C C RC K
Sbjct: 233 AETPVKAGEDITISYIDYTKPLSFRRQDL-ELYHFQCGCLRCEK 275
>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
[Anolis carolinensis]
Length = 163
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 45 GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPGEEICISYLDCCQRERSRHS 104
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L Q DD
Sbjct: 105 RHKVLRENYLFVCSCPKC--LAQADD 128
>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 619
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G G+YP ++++NHSC P + F G +VRAV+ + G E+ +Y I T R+
Sbjct: 422 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMIVRAVRTINAGEEISENYGPIFTTMPESERK 481
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFT--CQ 169
+ L+ QY F C C C D + IL ++C+ C L+ SD F C
Sbjct: 482 RKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLMVKSDTNEFMIGCA 540
Query: 170 QCG 172
+CG
Sbjct: 541 KCG 543
>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
Length = 816
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 9/125 (7%)
Query: 36 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
NFS A A T N E +G +Y V S+ NHSC PN F R +RA +HV G
Sbjct: 62 NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 119
Query: 96 EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
+ +SY G RQK LK++Y F C C C +L D +L +RC + C
Sbjct: 120 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 174
Query: 154 SGFLL 158
G +L
Sbjct: 175 FGTVL 179
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 51/87 (58%)
Query: 42 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
CN+ I +++L + G+Y S++NHSC N ++VF+ R +R V+ V +G E I Y
Sbjct: 181 CNSFGIFDNDLIAISDGIYLRASMVNHSCDYNCIVVFDERKLQLRTVKDVKEGEECTIGY 240
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRC 128
++ R+ L+E+Y FTC C +C
Sbjct: 241 VDVIHPAKERRAELEEKYHFTCKCVKC 267
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 40/67 (59%)
Query: 64 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
S++NHSC PN V +GR + V+ V +G E I+Y++ RQ LKE+Y FTC
Sbjct: 364 SMLNHSCDPNCAWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYHFTC 423
Query: 124 TCPRCIK 130
C +CI+
Sbjct: 424 KCVKCIE 430
>gi|307182513|gb|EFN69720.1| Histone-lysine N-methyltransferase ASHR1 [Camponotus floridanus]
Length = 132
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 55/100 (55%)
Query: 215 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
K QK + H F++ ++T E M L+ W++A Y + IP Y Y + HPL+GL Y
Sbjct: 12 KKQKNILHRFNIQHIRTLEAAHIAAMNLKYWEDAELYGKELIPGYLLYYGEIHPLIGLLY 71
Query: 275 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
GK++ L + A+K + +A E+L ITHG M+E
Sbjct: 72 LMTGKIQLHLNKPKEALKVLEKASEVLMITHGDKHSLMRE 111
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P+ +++NH C+PN+ FE + V+RA +P+G E+ +Y + + R L
Sbjct: 208 ALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHLFL 267
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
K + FTC C RC D ++ A + C+D CSG
Sbjct: 268 KMKKNFTCKCSRC-----SDPTEKGAFISAVYCRDTNCSGL 303
>gi|66800595|ref|XP_629223.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
gi|60462643|gb|EAL60846.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
Length = 508
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVL---VFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 109
G GL+P+ S +NHSC PN + + +G +V V+A +++P G+E+L SY + S
Sbjct: 428 GVGLFPIFSCMNHSCFPNVEISNEIIDGVTSVRMVVKAKKNIPAGSEILHSYCDETLSNK 487
Query: 110 TRQKALKEQYLFTCTCPRCIK 130
R+ L QY F CTC +C K
Sbjct: 488 ERKDILFSQYGFKCTCNKCSK 508
>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 501
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 66/135 (48%), Gaps = 11/135 (8%)
Query: 58 GLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+PV ++ NH C+PN ++ EG L V AV + +G E+ +SY + T R+ L
Sbjct: 208 ALFPVGALTNHQCVPNTRHIVNAEGELLVYAAVP-IAEGQEITMSYADVLWDTQMRRHFL 266
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCG 172
F C CPRC D + ++L C D C G LL ++ + +TC +CG
Sbjct: 267 LATKHFACQCPRCT-----DVTECGSLLGALSCAIDNCQGNLLPNTPLSLNCPWTCNECG 321
Query: 173 LVRSKEEIKKIASEV 187
L+ +I I + +
Sbjct: 322 LIIKGRQISSIRAGI 336
>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 513
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/316 (22%), Positives = 133/316 (42%), Gaps = 41/316 (12%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V+ I + ++ ++ +E AC I E+R LYP ++++NHSC+
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230
Query: 73 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
N ++ + R+ + R +P G E+ SY + T+ R++ L E F C CPRC
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRC- 288
Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
D + + +C D+G L + + C C S ++K+ +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFTTSGSAVRKV---L 341
Query: 188 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
+I+ + A+ S + + + + + K + + HP L R L ++ ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401
Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-T 287
+ + C+L + V + P + + G+ Y K +W G
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFIAKTQWNAGVID 461
Query: 288 ENAIKS-MTEAVEILR 302
E A+KS M EA IL+
Sbjct: 462 EAALKSKMIEAANILK 477
>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 459
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 59/106 (55%), Gaps = 11/106 (10%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF----EGRLAVVRA 87
I E FS+ + N + I +S L +G G++P+ S + NHSCLPNA++ + EG VV+A
Sbjct: 189 IKEIFSRFS-NNNFIVHSHLSAVGHGIFPLASRLFNHSCLPNAIVTYSFTSEGIQMVVKA 247
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC-----PRC 128
+ + +G E+ I Y + A RQ + Y F CTC PRC
Sbjct: 248 LTPIKQGEEITIPYFDPALPYHQRQAICRYSYGFECTCSVCMFPRC 293
>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 697
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 51/202 (25%)
Query: 11 LYAQIANLVNLILQ--WPEISINEIAENFSKLACNAHTICNSELRP-------------- 54
+ Q+ +L+N++ + W I + +L CN H I SELR
Sbjct: 353 FFQQLGDLINVMSKENWSIYCGAVIFRHIGQLVCNGHAI--SELRGSFASENNCLEADSF 410
Query: 55 ---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
+ TG++P IS+ NHSC PN F V A + +P G E+
Sbjct: 411 NIKAGFLHRYFESTRVFTGIFPQISMFNHSCEPNIRNTFNKNTLTVYAAKDIPVGGEIFN 470
Query: 100 SYIETAGSTMT------RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
Y G R+ AL++QY F C C RC D+ ES E Y+C C
Sbjct: 471 CY----GPNFKLMCKDERKSALRQQYGFDCKCTRCAMNN--DEAYES--FEHYKCPFAKC 522
Query: 154 SG-FLLRDSDD---KGFTCQQC 171
S F+L+++ D K C C
Sbjct: 523 SKYFMLKENADPFEKDIKCPMC 544
>gi|302834810|ref|XP_002948967.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
nagariensis]
gi|300265712|gb|EFJ49902.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
nagariensis]
Length = 679
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
++ N E+ A +A ++EL GL+P S+INHSC P A G + VVRA
Sbjct: 218 VTFNAYGESKPDPAVSAVRSLSAELLSGCVGLWPPFSLINHSCAPVASYGLVGDVMVVRA 277
Query: 88 VQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCI 129
+P G +V ISY G R+ L++ Y F C C RC+
Sbjct: 278 AADLPAGQQVTISYFGRRALGPLELRRAYLRQHYGFVCACERCV 321
>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 278
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 40/76 (52%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
+YP +S NH C PN G + AV+ +P G E+ ISYI S RQK L E
Sbjct: 69 AVYPKVSRFNHDCRPNVHYRLSGLIHTTVAVRDIPAGTELTISYIYVRKSRAERQKELSE 128
Query: 118 QYLFTCTCPRCIKLGQ 133
+ FTCTC +C K +
Sbjct: 129 SWNFTCTCEQCSKSAE 144
>gi|452846662|gb|EME48594.1| hypothetical protein DOTSEDRAFT_19117 [Dothistroma septosporum
NZE10]
Length = 199
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 16/92 (17%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
S L CN+ + PLG + +IS +NHSC+PNA +VFE E
Sbjct: 38 ISLLLCNSLILMTPANDPLGIVVELLISNVNHSCVPNACIVFE----------------E 81
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ +SY++ G RQ AL +Y FTCTCP+C
Sbjct: 82 IFVSYVDQHGLFHARQAALWHRYYFTCTCPKC 113
>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
Length = 1064
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 6/123 (4%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ CN I + + +G P S NHSC PN V + V +A+ + KG +
Sbjct: 910 KIRCNQFGIWSKRDKCIGVSCTPAASYFNHSCCPNLADVRGTTVVVFKALHFISKGTPIS 969
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFL 157
ISY++ T RQ LK Y FTC C RC D ES + + C CSG
Sbjct: 970 ISYLDLDQPTHERQSYLKTFYYFTCQCLRCK-----DQTDESDNWISRFYCDRFKCSGTY 1024
Query: 158 LRD 160
D
Sbjct: 1025 FLD 1027
>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 514
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 126/308 (40%), Gaps = 43/308 (13%)
Query: 27 EISINEIAENFSKLACNAHTICNSELR-PLGTG---LYPVISIINHSCLPN---AVLVFE 79
++ ++ +E + AC I E+R G G LYP ++++NHSC+ N ++ +
Sbjct: 182 QLKLDRFSEELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTD 241
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
R+ + R HVP E+ SY + TM R++ L E F C C RC D +
Sbjct: 242 YRIRL-RTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC-----SDPTEL 295
Query: 140 SAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK----IASEVNILSKK 193
+ +C D+G L D + C C S ++K I +EV+ + +
Sbjct: 296 GTHMSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQ- 354
Query: 194 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-------- 245
S + + + + + K + + HP L R L ++ ++++
Sbjct: 355 ----ISGADGADAIQERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDV 410
Query: 246 --KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT--------ENAIKS-M 294
+ + C+ + V + P + + G+ Y F+ T E A+KS M
Sbjct: 411 VLEHKIDMCRFLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTLWNAGVIDEAALKSKM 470
Query: 295 TEAVEILR 302
EA IL+
Sbjct: 471 IEASNILK 478
>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
Length = 559
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
++ NA +++ G L P ++ +NHSC+PNA + F+ R A +RA + + +G E+
Sbjct: 199 QIQTNAFNRLDADTGMSGIFLDPALARVNHSCVPNAFIGFDKRTATLRAERPIKEGEEIT 258
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
ISYI R++ L+ Y F C CPRC+
Sbjct: 259 ISYIANDKPRSIRREGLR-LYYFECDCPRCV 288
>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 662
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TGL+ S +NHSC+PN+V F G L VVRA+QH+ G E+ +Y + G RQ L+
Sbjct: 515 TGLWARASYLNHSCMPNSVKDFAGDLIVVRALQHIRTGDEITHAYQDN-GDYDARQALLQ 573
Query: 117 EQYLFTCTCPRCI 129
+ FTC C C+
Sbjct: 574 TTWGFTCRCKLCV 586
>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 490
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGS 107
S+L G+ L+P+++ NH C PNAV+ + V R+++ +P EV ISYI+ S
Sbjct: 406 SDLCISGSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVS 465
Query: 108 TMTRQKALKEQYLFTCTCPRCI 129
R+K L+ +YLF C C RC+
Sbjct: 466 CSQRRKELQHKYLFNCKCTRCL 487
>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
Length = 392
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 94
++K+A A N+E G+GLY + S INHSC+PNA F + V++A+ + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326
Query: 95 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 128
E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 668
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 89/183 (48%), Gaps = 23/183 (12%)
Query: 33 IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRL 82
I + + CNA+ I + E R +G +YP +S+INHSC PN V + +
Sbjct: 442 ILHHLQAINCNAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI 501
Query: 83 AVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
VVR ++ V KG E+L Y + + ++R + L ++Y F CTC CI+ Q+ +
Sbjct: 502 VVVRTLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACIQNWQYPLPE-- 559
Query: 141 AILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 196
+ Y+CK CS + L + D + +QC ++ +KKI +++ KK L
Sbjct: 560 --IMKYKCK--TCSEIIGIIALNEKDMQNVLNKQCNCNKTI-NLKKIKNQLQNSIKKRLN 614
Query: 197 LTS 199
S
Sbjct: 615 AIS 617
>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
Length = 391
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 94
++K+A A N+E G+GLY + S INHSC+PNA F + V++A+ + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326
Query: 95 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 128
E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 305 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 364
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
RQK L+E YLF C+CP+C L Q DD
Sbjct: 365 RQKILRENYLFVCSCPKC--LAQADD 388
>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 420
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
RQK L+E YLF C+CP+C L Q DD
Sbjct: 363 RQKILRENYLFVCSCPKC--LAQADD 386
>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 387
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYI---ETAGST 108
G+ L+ S INHSC PNA VF+G RL +R ++ G E+ ISYI E S
Sbjct: 289 GSALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIE---PGDEINISYIAPCELDHSR 345
Query: 109 MTRQKALKEQYLFTCTCPRC 128
TRQK L+E Y+FTC C +C
Sbjct: 346 HTRQKYLQENYVFTCRCIKC 365
>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
NZE10]
Length = 549
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 40/74 (54%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LG G P++ NHSC PNA +F ++RA++ + KG E+ + Y + RQ
Sbjct: 223 LGMGFDPIMCSANHSCDPNAAALFNQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAE 282
Query: 115 LKEQYLFTCTCPRC 128
LK Y FTC C RC
Sbjct: 283 LKGYYFFTCHCARC 296
>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
Length = 816
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 147/359 (40%), Gaps = 77/359 (21%)
Query: 17 NLVNLILQWPEISINEIA--------------ENFSKLACNAHTICNSELRPLGTGLYPV 62
+L L+L +I +N IA + FS + +C+ E + +Y
Sbjct: 445 SLSQLVLLICQIKVNSIAIVHMKSMDGGKVLTKGFSGFS--GDVMCSVEQVRVAQAIYMS 502
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYL 120
S NHSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y
Sbjct: 503 GSFFNHSCQPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYY 562
Query: 121 FTCT------------------CPRCIKLGQF---------DDIQESAILEGYRC----- 148
F+C CP+ LG ++ +I E + C
Sbjct: 563 FSCGCSSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRRKENFVHVSIGESHVCTLSLP 622
Query: 149 ------KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKT 194
+D G L +SD D GF +C+ Q L + + S++N L++
Sbjct: 623 DVSKFDEDIVKVGKLFFNSDTMLNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLNE-- 680
Query: 195 LALTSCGNHQEVVSTYKMIEKLQ-----KKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
L S N EV+ + E LQ +KL HP+S L Q + + + ++ D ++A
Sbjct: 681 --LPSPDNVPEVL----IAEALQSLEHIEKLRHPYSKALAQAHDTIAEAFAKVGDQEQAR 734
Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
+C+ + + +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 735 KHCEASFKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 793
>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
C-169]
Length = 329
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 57/128 (44%), Gaps = 16/128 (12%)
Query: 16 ANLVNLILQWPEISINEIAENFSKLACNAHTIC-----NSELRPLGTGLYPVISIINHSC 70
A L L P +S E+A+ +K N + I E R GT LYP S++NH C
Sbjct: 99 AQLPQLAGSRP-LSPEEVADALAKEELNGYGIMAPSGPEGERRIRGTALYPKASLLNHEC 157
Query: 71 LPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
LPN V E +A+ ++P G E SY S RQ+ ++QY
Sbjct: 158 LPNVARVDNFDEADVNAPENTAVHFKALHNLPAGEEFTQSYFPMHTSYHVRQQRCQDQYG 217
Query: 121 FTCTCPRC 128
F C CPRC
Sbjct: 218 FACNCPRC 225
>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
Length = 646
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 123/298 (41%), Gaps = 53/298 (17%)
Query: 7 KQLLLYAQIANLVNLILQWP----------------EISINEIAENFSKLACNAHTICNS 50
K L YA A+L+ L L+ EI + + ++ +L CN H I +
Sbjct: 308 KHLFQYALTASLLALYLKTHTSYFSNESNDDDDNNLEIVASTLFKHILQLICNGHAITDV 367
Query: 51 EL-------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+ +GT +YP S++NHSC P+ + F +VRA + + G E+
Sbjct: 368 ITDDSNDSNAVDLIEKRIGTAIYPSASMMNHSCEPSIITSFSNNHLIVRASKPIKAGEEI 427
Query: 98 LISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
Y A RQ+ L++QY F C C C + Q+D ++ Y C + C G
Sbjct: 428 FNCYGPHAKRMKKEERQEILRKQYFFNCHCMACHQ--QYDIMER---FMAYMCFE--CKG 480
Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL--ALTSCGNHQEVVSTYKMI 213
L+ +D C C ++ +I+S+ + + G + + +
Sbjct: 481 PLIIINDSNNGYCLDC-------------NKNSIISQDVIRESFKEQGKILDALKKFLNF 527
Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
+ +K+ + F+ +L + ++ +LED+ + L + TI ++ + F LG
Sbjct: 528 FIIGRKILYKFNKDLCYSADQAASCFSKLEDYDKCLVILKCTILSEEKRFGSFSIELG 585
>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
Length = 448
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 6/105 (5%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
L L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+
Sbjct: 225 LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKI 284
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
L F C C RC D + L C++ GC G ++
Sbjct: 285 FLGMTKHFICKCVRCQ-----DPTENGTYLSALFCREQGCRGLVI 324
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 57/104 (54%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
++S I+ +KL N+ + N+ LG L+P +I NHSC NA + F+G ++
Sbjct: 168 DMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A++ + K ++ I+YI+ RQ L+ +Y FTC C +C +
Sbjct: 228 ALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCTR 271
>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
Length = 415
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 52/93 (55%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+KL N+ + N+ LG L+P +I NHSC NA + F+G ++A++ + K ++
Sbjct: 147 AKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQI 206
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
I+YI+ RQ L+ +Y FTC C +C +
Sbjct: 207 FITYIDVTDPYPIRQANLQSRYYFTCHCSKCSR 239
>gi|402590196|gb|EJW84127.1| hypothetical protein WUBG_04964 [Wuchereria bancrofti]
Length = 283
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 132/288 (45%), Gaps = 32/288 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
S E+ + F K+ N I +S L G G+Y +S ++HSC P+A ++F G A++R++
Sbjct: 3 SSTEMFDIFCKIVINTVVITDSCLSRTGIGVYLGLSALDHSCKPDAFIIFNGTKAILRSL 62
Query: 89 QH--VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+ I Y + T TR K ++ Q+ F C C C + + D ++ L
Sbjct: 63 SKNITEYNNNLRIPYCDLLELTSTRCKYMQLQHNFICNCEICQNV-ELD--RQKCSLRCT 119
Query: 147 RCKDDGCSGFLLRDSDDKGFT-CQQCGLVR--SKEEIKKIASEVNILSKKTLALTSCGNH 203
+C D C D++ T C+ CG + + + ++K+ ++ T ++ N
Sbjct: 120 KCTDGFCP---YSPEDEQAETRCKVCGEISLFNFDHLQKLYQQL------TTDDSTEKNL 170
Query: 204 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 263
E++ Y E +++ P++V L + EK++ M+ + EA + Y
Sbjct: 171 NELIDLYCESE----EVFSPYNVPLCKFAEKIMISAMKHHKYDEAA----------KTFY 216
Query: 264 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
P+ P L + KL L E + + EA++++R ++G+ S F
Sbjct: 217 PKGLPPLPTRMLEYAKL-LMLQHDEASFPILREALKMIRESYGSESNF 263
>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 513
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/316 (21%), Positives = 130/316 (41%), Gaps = 41/316 (12%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V+ I + ++ ++ +E AC I E+R LYP ++++NHSC+
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230
Query: 73 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
N ++ + R+ + R +P G E+ SY + T+ R++ L E F C C RC
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRC- 288
Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
D + + +C D+G L D + C C S ++K+ +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFTTSGSAVRKV---L 341
Query: 188 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
+I+ + A+ S + + + + + K + + HP L R L ++ ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401
Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTE 288
+ + C+L + V + P + + G+ Y K +W G +
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTQWNAGVID 461
Query: 289 NAI--KSMTEAVEILR 302
A+ M EA IL+
Sbjct: 462 EAVLKSKMIEAANILK 477
>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 423
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 314 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 372
Query: 110 TRQKALKEQYLFTCTCPRC 128
TR+K L YLF C CP+C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391
>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
Length = 510
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/314 (21%), Positives = 128/314 (40%), Gaps = 37/314 (11%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V+ + + + ++ +E + C I E+R GLYP ++++NHSC+
Sbjct: 170 NIVDYLRK--RLKLDRFSEEHIQTVCGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVS 227
Query: 73 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
N ++ + R+ + R +P G E+ SY + TM R++ L E F C CPRC
Sbjct: 228 NTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTMLRREHLLEGKHFACACPRC- 285
Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
D + + +C D+G L + + C C + + ++K+ +
Sbjct: 286 ----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDSESKWKCTHCDFSTTGQAVRKVFRII 341
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
A++ + + + + I K + + HP L R L ++ ++++
Sbjct: 342 QAEVDAAEAISG-ADGADAIHERETIMKKYRSVLHPQHAFLSMLRHSLTQMYGRVDEYLL 400
Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-TEN 289
+ + C+ + V V P + + G+ Y K +W G E
Sbjct: 401 DDLPDVVLEHKVDLCRSLLQVLNVVEPGYSRVRGMTLYELHAPLLFLAKGQWNAGVIDET 460
Query: 290 AIKS-MTEAVEILR 302
+KS M EA IL+
Sbjct: 461 GLKSKMIEAANILK 474
>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
Length = 409
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 59/104 (56%), Gaps = 8/104 (7%)
Query: 34 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVV 85
A+ F+ L C N+++I + + +G LY +IS NHSC PNA ++FE R A +
Sbjct: 300 AQMFNYLVCILLTNSYSIEDKNGQEIGAALYSLISCCNHSCAPNAQVIFEDSEDAREATL 359
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
++ +G E+ ISYI G +++ ++ Q+ FTC C RC+
Sbjct: 360 VLLRPCVQGEELYISYITDLGRSVSERRRELAQWCFTCQCTRCL 403
>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
Length = 697
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 2/93 (2%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAE 96
+ L CN+ + N++ +G LYP S NHSC PN V + G A A++ +PKG
Sbjct: 550 AALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEP 609
Query: 97 VLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 128
+ I Y++ ST R++ L Y F C C RC
Sbjct: 610 LTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
max]
Length = 419
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 52/119 (43%), Gaps = 13/119 (10%)
Query: 58 GLYPVISIINHSCLPNAVLV----------FEGRLAVVRAVQHVPKGAEVLISYIETAGS 107
G+YP S NH CLPNA ++R + VP+G E+ +SY
Sbjct: 231 GIYPYASFFNHDCLPNACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEK 290
Query: 108 TMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 163
+RQK L E Y FTC C RC D ++++A E +D C D+DD
Sbjct: 291 YSSRQKRLIEDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDD 349
>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
Length = 421
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
RQK L+E YLF C+CP+C+ DI
Sbjct: 363 RQKILRENYLFMCSCPKCLAQADEPDI 389
>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 673
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 22/180 (12%)
Query: 36 NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
N + CNA+ I + E R +G +YP IS+INHSC PN V + V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRNIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509
Query: 86 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
R ++ + KG E+L Y + ++R + L ++Y F CTC C + QF + E+
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYCFLCTCEACTQNWQF-PLPETM-- 566
Query: 144 EGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
Y+CK CS + L + + + +QC K ++KKI ++ +K L S
Sbjct: 567 -NYKCK--MCSEIIGTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623
>gi|307187924|gb|EFN72837.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 651
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 8/124 (6%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G G+YP ++++NHSC P + F G VVRA + + G E+ +Y I T R+
Sbjct: 454 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMVVRAARTIRAGEEISENYGPIFTTTPENERK 513
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDD-GCSGFLLRDSDDKGFT--C 168
+ L+ QY F C C C G + ++E + ++C+ C LL SD F C
Sbjct: 514 RRLRVQYWFDCNCEACS--GHWPLLEELDPTILRFKCETGLSCGNVLLVRSDTNEFMIGC 571
Query: 169 QQCG 172
+CG
Sbjct: 572 AKCG 575
>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
B]
Length = 604
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 10/136 (7%)
Query: 9 LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINH 68
L+ Y IAN L + S + + S+ A N T+ + L P+G + P I++ NH
Sbjct: 232 LVRYLGIANPAELA-PFGLNSAGNLVDLISRFATNTFTLTSFSLTPIGICICPSIALANH 290
Query: 69 SCLPNAVLVF---------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
SC PNA +VF + L + A++ + +V+ +Y++ RQKAL+E Y
Sbjct: 291 SCEPNAAIVFPRSSSLPQAQEPLMHLMAIRDIAPNEQVVAAYVDVTLPRELRQKALQETY 350
Query: 120 LFTCTCPRCIKLGQFD 135
FTC C C K D
Sbjct: 351 SFTCKCKLCTKRMAVD 366
>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 429
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 320 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 378
Query: 110 TRQKALKEQYLFTCTCPRC 128
TR+K L YLF C CP+C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397
>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
Length = 428
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 124/293 (42%), Gaps = 16/293 (5%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ--HVPKG 94
F K++ N ++ NS +G GL +S NHSC P + + R A++ + P
Sbjct: 147 FKKISINGFSVSNSMGSAIGHGLCIKLSAANHSCKPLTRVCYRNRTAMLVPIDPYKPPTS 206
Query: 95 AE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
E SYI+ S RQ LK++Y F C C C+ D + + +E + C D C
Sbjct: 207 LEGACHSYIDELQSVSARQSELKKKYNFICICEGCM------DDERNESMEAWACGD--C 258
Query: 154 SGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
+ + + +D C C + R E+ + A E I +K + S K
Sbjct: 259 ANGKVPNVEDG--RCASCSWKMTRDHYELCRTAEESAIAAKPKIENESISLETRKALCEK 316
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
++E L H F+V+ + + L + ++D A + + + + R + P++
Sbjct: 317 LMELFDDTL-HEFNVHRVVPQRLLFFASLAVQDVNGASKHGKALLAMMMRYQGKHDPIVM 375
Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
L+ + ++ G+ +K + E E L+ + +P + +E A++
Sbjct: 376 LKKWQIAQICHANGEYNKVVKLLEEIKEPLKKIYTAGAPISVNIRRLMEMARS 428
>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 433
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 324 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 382
Query: 110 TRQKALKEQYLFTCTCPRC 128
TR+K L YLF C CP+C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 39 KLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLAVVRA 87
+L+C N+ T+ N+ +G L+P +++NHSC PNA + F+ G ++V A
Sbjct: 212 RLSCAVIVNSFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSISV-HA 270
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
++ + K E+ ISYI+T RQ+ L E+Y FTC C C
Sbjct: 271 LRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLC 311
>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
Length = 481
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 57/104 (54%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
++S I+ +KL N+ + N+ LG L+P +I NHSC NA + F+G ++
Sbjct: 168 DMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
A++ + K ++ I+YI+ RQ L+ +Y FTC C +C +
Sbjct: 228 ALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCSR 271
>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 697
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G G++P +++ NHSC P V F G VVRA++ +P GAE+ +Y I T R+
Sbjct: 499 IGGGVFPTVALFNHSCNPGVVRYFIGNTMVVRAIKTIPAGAEISENYGPIFTEEEENDRK 558
Query: 113 KALKEQYLFTCTCPRC 128
+ L+ QY F C C C
Sbjct: 559 RKLRLQYWFDCDCEAC 574
>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
Length = 393
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQGEEICISYLDECMLERSRHS 345
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363
>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
Length = 425
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 110/262 (41%), Gaps = 50/262 (19%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
PE +I+ +KL N+ + + +G L+P ++INHSC NAV+ F+G
Sbjct: 140 PEDTISAFG---AKLDVNSFNMTTALADRIGLYLHPYAALINHSCAYNAVIGFDGAELFA 196
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
A++ + + ++ ISY++ R+ L+E+Y F C C +C + D
Sbjct: 197 TALRPITRDEQIFISYVDATNPVAVRRNELRERYFFDCRCAKCA--AEMD---------- 244
Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGN 202
DG D+ K +T + V E ++ + S + L + A
Sbjct: 245 ---APDGGRAPAAADAQRKAYTLMEVATGAGVDPAEAVRALESAMRTLRESEGA------ 295
Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ-- 260
++++ E+ L+ R++LI L+ ++ A+ Q I V +
Sbjct: 296 ------SWRITEQ-----------PLVSLRDELIASLLGAGRFQ--TAFVQAAIRVVRVD 336
Query: 261 -RVYP-QFHPLLGLQYYTCGKL 280
VYP + HP+ L +T +L
Sbjct: 337 CVVYPVRGHPIRQLHAWTLARL 358
>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
Length = 661
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 84/174 (48%), Gaps = 14/174 (8%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G +YP +++ NHSC P V F G VV AV+++ KG EV +Y I T RQ
Sbjct: 464 IGGAVYPTLALFNHSCEPGIVRYFCGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQ 523
Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQ 170
LKEQY F C C C + +++D+ E+ + ++C D CS + D F Q
Sbjct: 524 SQLKEQYWFDCKCLPCEQNWPKYEDMTENYLR--FKCDSDQPCSNVIPVPYDCMEFMV-Q 580
Query: 171 CGLVRSKEEI-KKIAS--EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
CGL + I K + S + L K A G + E + K IE L KLY
Sbjct: 581 CGLCQQYTNILKGLKSLQDTETLYKLGRAAMGEGKYGEAIK--KFIETL--KLY 630
>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
Length = 633
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 67/131 (51%), Gaps = 12/131 (9%)
Query: 13 AQIANLVNLILQW----PEISINEIAENFSKLAC-NAH---TICNSE---LRPLGTGLYP 61
+QI +++L W E+ I E A + C NA + E +R TGL+
Sbjct: 430 SQIEKVMDLFGDWRSIGKEVLIGEDAFQVHDVVCRNAFGPGAVIAGEAENIRKASTGLWL 489
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
+ + INHSCLPNA F G L VVRA +++ G+E+ SY +++ RQ+AL + + F
Sbjct: 490 MAAYINHSCLPNAEKSFLGDLMVVRATRNITAGSEITHSY-DSSSDYDARQEALMKTWGF 548
Query: 122 TCTCPRCIKLG 132
C C C G
Sbjct: 549 RCRCELCKADG 559
>gi|332025390|gb|EGI65557.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 401
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 7/124 (5%)
Query: 53 RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 108
RP LG +YP ++ NH C P F GR V+RA++ + G + +Y I T +
Sbjct: 200 RPVYLGVAIYPSVARFNHDCYPAVTRYFIGRHIVIRAIRGLGPGDVIAENYGPIFTKRTL 259
Query: 109 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
RQ+ L +Y F CTC C + F+++ ++ RC GC G LR K
Sbjct: 260 AERQRTLTGRYWFQCTCKACQEDWPCFENLTNDSV--KLRCPTVGCGGLHLRSRQGKPIK 317
Query: 168 CQQC 171
C C
Sbjct: 318 CPDC 321
>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
TFB-10046 SS5]
Length = 451
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 3/75 (4%)
Query: 57 TGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETA-GSTMTRQK 113
+G++P S ++HSC PNA+ ++ G + + +RA +H+ +GAEV ISY+ T RQ+
Sbjct: 244 SGVFPTFSRVSHSCCPNAIPRWDSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQE 303
Query: 114 ALKEQYLFTCTCPRC 128
L++ Y F CTCP C
Sbjct: 304 FLRDGYHFECTCPAC 318
>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
[Megachile rotundata]
Length = 523
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 87/195 (44%), Gaps = 17/195 (8%)
Query: 4 IDEKQLLLYAQIANLVNLILQW---PEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
I K+ ++ + +NLVN I ++ +I +I + L N H + ++ P +Y
Sbjct: 177 IKNKETFIFEESSNLVNFIKRFFKSEDIPEEDIRKIIGVLQVNGHEVPLTD--PPYVAVY 234
Query: 61 PVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
+ S++ H+C N F G L ++ A + KG + I Y + TM R+ L E
Sbjct: 235 ELASLLEHNCKANCSKSFTDTGGL-IIHAATPITKGDHISICYTDPLWGTMNRRHHLLET 293
Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQCGLV 174
F CTC RC +F A+ RC + CSG++L + + + C C V
Sbjct: 294 KYFECTCDRCRDPTEF-----GAMFNAIRCSEIDCSGYVLPKTFLGDHREDYVCTNCTTV 348
Query: 175 RSKEEIKKIASEVNI 189
E I++ ++ I
Sbjct: 349 VPLEIIEETLEDIGI 363
>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
commune H4-8]
Length = 360
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 57 TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
G++ S INHSC PNAV F+ VVRA++H+ KG E+ ISY ET TR++
Sbjct: 174 VGVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAATRRQN 233
Query: 115 LKEQYLFTCTCPRC 128
+ Y F C CP C
Sbjct: 234 SLQDYGFRCACPAC 247
>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
Shintoku]
Length = 150
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%)
Query: 46 TICNSELRP--LGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYI 102
TI EL P +G GL+ +S +NHSC PN + + + +A + + VP+G E ISYI
Sbjct: 56 TIQKKELYPETMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYI 115
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
+ S RQ+ L + Y F C C +CI G
Sbjct: 116 DEKDSVENRQEKLYKNYGFKCDCNKCILEG 145
>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
Length = 660
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARLKELYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAIIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYLKALAKFIDLIRIMYEVLAPPF 636
>gi|330796724|ref|XP_003286415.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
gi|325083610|gb|EGC37058.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
Length = 499
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 6/79 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE-----GRLA-VVRAVQHVPKGAEVLISYIETAGSTM 109
G GL+P+ S +NHSCLPN + E R+ VV+A + +P G+E+L SY + T
Sbjct: 419 GVGLFPIFSCMNHSCLPNIEISNEIQDGVDRVKMVVKAKKLIPAGSEILHSYCDENLPTK 478
Query: 110 TRQKALKEQYLFTCTCPRC 128
R++ L QY F C CP+C
Sbjct: 479 ERKQLLLSQYGFKCLCPKC 497
>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
Length = 739
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 64/295 (21%)
Query: 49 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAG 106
N +R L T ++PV+S++NHSC PN + F GR VRA + + + EV Y +
Sbjct: 465 NKSIR-LATAVFPVLSLLNHSCDPNTTVSFTGRFVTVRANRPIRRDEEVTHCYGPHKLRM 523
Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
RQ+ LK+QY F C C C + L+ GF
Sbjct: 524 DVAERQQLLKDQYFFVCQCKACTE--------------------------ELKGKKTHGF 557
Query: 167 TCQQCGLVRSKEEI--------KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
C C EE AS+ + S+ L GNH ++ +LQ
Sbjct: 558 FCPLCKAQLEGEEALYCTGARCTYTASQTQLTSR----LNQLGNHIQIAKV-----QLQD 608
Query: 219 KLYHPFSVNLM-----------QTREKLIKILMELE-------DWKEALAYCQLTIPVYQ 260
+ LM Q L +I+ L +WK A + + +I + +
Sbjct: 609 NKTDNAKMTLMSCLSEAECFLSQDHLLLGEIMDHLAQAEASDGNWKAAAGHLRKSISIVK 668
Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
Y LG + + ++ + + +A++++ EA ++L + +G + ++EL
Sbjct: 669 VHYGSSSMELGHELFKLAQILFNGFEVPDAMRTIMEAQKVLSMHYGPDHNLVQEL 723
>gi|341883792|gb|EGT39727.1| CBN-SET-18 protein [Caenorhabditis brenneri]
Length = 499
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/314 (20%), Positives = 127/314 (40%), Gaps = 26/314 (8%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
DE+++ + ++ + L + I E + F K + N+H+I + +G L +S
Sbjct: 146 DEEKMKHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGNEVGMALDLGVS 205
Query: 65 IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
NHSC P +VF+G +R + VP + ISYI+ S R++ LK +
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSKFVRRRDLK--W 261
Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
F C C RC+ E L RC + C +L ++ C +C + +
Sbjct: 262 YFNCECTRCMD-------PEDDALTAIRCANPSCDAPILTSETEEPMNIACDKCKTIVEE 314
Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
++K + +L + + + + + +++ HP +V + + R L
Sbjct: 315 NDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNVYVARLRTALFH 367
Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
+ L + + Q+ Y+ +P+ +G Q K + + A+ +A
Sbjct: 368 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 426
Query: 298 VEILRITHGTNSPF 311
+ I + G + P+
Sbjct: 427 MNIFEVCFGLDHPY 440
>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
Length = 392
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 9/101 (8%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
I E ++K+ A N+E G+GLY + S INHSC+PN+ F + V++A+
Sbjct: 267 IDEIYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNSCSTFPYSNDIVVLKALTP 322
Query: 91 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ +G E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 323 IQEGDEICISYLDECQLERSRHSRHKVLRENYIFVCQCPKC 363
>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
1558]
Length = 660
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 11/129 (8%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EG----RLA 83
S I + S N + + L +G L P +++ NHSC PNAV+VF EG R+
Sbjct: 355 STGSILQVISSFMINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEGADSMRIV 414
Query: 84 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQES 140
++A++ G EVL Y++ A RQ L+ Y F C CP C K G +
Sbjct: 415 AIKAIE---AGEEVLTHYVDLALPYAQRQAELRRTYHFECKCPTCDNRTKDGDKPVLDPR 471
Query: 141 AILEGYRCK 149
+IL CK
Sbjct: 472 SILIHRECK 480
>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
Length = 660
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + + + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALPKFIDLIRIMYEVLAPPF 636
>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
Length = 660
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G V +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHVNSVRPIEAGLPINENYGPMYTQDGRPERQARLKELYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC + C+ + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 636
>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
Length = 701
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 24/135 (17%)
Query: 18 LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 77
L + + + +E+ + K N+H + + +LR LGTG+YP S+ NHSC PNA +
Sbjct: 246 LASAVRHAAALDPSEVEDMLGKTRENSHGVVDWKLRQLGTGIYPEASMFNHSCAPNAAVS 305
Query: 78 F----------------EGRLAVVRAVQHVP--------KGAEVLISYIETAGSTMTRQK 113
F E R V +VP KG EV I+Y E +R+
Sbjct: 306 FGAGGTLNTCCIGVDDYEARAMEVGPKGYVPLGRPRAIRKGEEVCIAYTELYRPAASRRL 365
Query: 114 ALKEQYLFTCTCPRC 128
L++ F C C RC
Sbjct: 366 TLEKSKGFVCECVRC 380
>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
Length = 391
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)
Query: 53 RPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
+ +G+GLY S NHSC PN V + G V+ A++ + +G E+ +YI+ + S
Sbjct: 268 KSIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEA 327
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
R K LKE Y F C CP C + L+ Y CK C+G +
Sbjct: 328 RIKKLKESYFFNCQCPGCK-----NAPSHKQFLDKYLCKVKQCNGVI 369
>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
Length = 431
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 46/80 (57%), Gaps = 9/80 (11%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-----GRLA----VVRAVQHVPKGAEVLISYIETAGST 108
G Y ++ NHSC PNA + RL V+RAV + +G EV ISYIE +T
Sbjct: 174 GFYLRMAFCNHSCRPNACQYIDPNSTRARLNSPSIVLRAVSDIAEGEEVCISYIELMDTT 233
Query: 109 MTRQKALKEQYLFTCTCPRC 128
R++AL+E Y FTC CPRC
Sbjct: 234 PERREALQELYYFTCQCPRC 253
>gi|324507181|gb|ADY43048.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 614
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 127/296 (42%), Gaps = 21/296 (7%)
Query: 43 NAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 98
N H I + L +G GLY + +HSC PN + +G +A +RA+ ++
Sbjct: 231 NRHAISDDGYLEEIGKGLYLDLCAYDHSCRPNTIYTCKGFVATLRALDLTVNLMDRTTTF 290
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF-- 156
SYI+ +T R+K L++ + F C C RC+ D + S R K + F
Sbjct: 291 YSYIDLLSTTQERKKLLRDTWYFDCQCVRCV--DNDDHVLSSMFCPSCREKPERLCIFGE 348
Query: 157 -LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS--TYKMI 213
++ + TC +C + KE + + + + + + + Q++ ++ +
Sbjct: 349 SPYKNPVTQILTCPKCHIEVPKERVLEAVDAMRFIDR----IVAKNEIQQMAKEQAFRFL 404
Query: 214 EKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV---YPQFHPL 269
+ ++ + SVN+ K+I++L+ L D ++ L + + V +P HP
Sbjct: 405 KGIKDRFSKILPSVNVYMC--KIIQLLIPLIDPTDSKTLLSLHLESEECVRFCFPPNHPA 462
Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
+ + + G +G A + A +IL+ T G P + + LEEA E
Sbjct: 463 VAVHLSSIGLFYLRIGHPHRAELYLEMACDILKFTLGHTHPLTVDKLSILEEAHRE 518
>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
206040]
Length = 742
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TG++ S NHSCLPNA F G + +VRA++ +P G E+L+ Y+ RQ+ ++
Sbjct: 501 TGIWLQASYANHSCLPNATRAFIGDMMIVRALREIPVGGEILMQYVPQDKPFDKRQEVVE 560
Query: 117 EQYLFTCTCPRC 128
Y F C C C
Sbjct: 561 NHYGFKCDCDLC 572
>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
Length = 402
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Query: 12 YAQIANLVNLILQ---WPEISINEIAENFSKLACNAHTI------CNSELRPLGTGLYPV 62
Y+ I + LI P ISI + + + NA I + E G G+YP
Sbjct: 271 YSNIYKFIRLIAPEEYLPFISIQAVRDIIGRNLTNAFGIWSPVTSSDEEREYFGFGVYPS 330
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLF 121
S NHSC PN + G A + +P GAE+ ISY I + RQ AL+E + F
Sbjct: 331 ASFFNHSCKPNVTKIRRGASYEYIAKEDIPVGAELCISYGIRESDGLKQRQDALRE-WFF 389
Query: 122 TCTCPRCIKLGQ 133
C C RC G+
Sbjct: 390 ECGCSRCTTEGK 401
>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
Length = 598
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 127/300 (42%), Gaps = 28/300 (9%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
P +S + A S L N+H + G+GL+ + ++ HSC PN G V
Sbjct: 257 PTVSDEDAARMLSILNTNSHELGEWG----GSGLFLLGCLMEHSCAPNCNFSTYGTQLWV 312
Query: 86 RAVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
A+ + KG + I Y + T T R+K LK+ Y F C CPRC L D+ + I
Sbjct: 313 TALNSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTL---PDVMRAFICP 369
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
+CK C L S + C++C K ++ K + + + +K L + H
Sbjct: 370 VKQCKGKVCP--LGEGSKPSDWVCEKC-----KSQLDKKSRKRFLEIEKAL-MEEGPEHD 421
Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----KEALAYCQLTIPVYQ 260
T+ ++ + ++L ++ + E+ + E W K A + IP+
Sbjct: 422 VFHETHHIVLEALRELGSVYAKD-----EEAVADGSAEEIWLRVLKGADQARSILIPL-- 474
Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
V P+FHP + + +L + A +++ + E+ R G +P ++ + +E
Sbjct: 475 -VLPKFHPTKAIWWDNVAQLRLSRKNQAGAQEALKRSYEVTRRCCGDATPSTQDALRLVE 533
>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 413
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 8/106 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
G GLYP+ S+ HSC PNA F + L++V A++ + G EV +SYI+ S
Sbjct: 309 GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLV-ALRDIRPGEEVTVSYIDECSLSRSRH 367
Query: 110 TRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCS 154
+R K L+E +LFTCTCPRC + Q D+ +E ++ CS
Sbjct: 368 SRIKMLRESHLFTCTCPRCEEEATQQPDVTSDEEMEDSSDAEEACS 413
>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
Length = 597
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAV 88
S+ A N++++ +L P+G + P++S++NHSC PNA VF + L V A+
Sbjct: 252 SRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTPSTDEPLISVVAI 311
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ + ++ SYI+T R++ L+E Y F C C C
Sbjct: 312 RDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLC 351
>gi|159472132|ref|XP_001694205.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276868|gb|EDP02638.1| predicted protein [Chlamydomonas reinhardtii]
Length = 468
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 115
GL+ S++NHSC PNAV G V A HV +G+E+LI+Y+ A RQ L
Sbjct: 139 GLWHSFSMLNHSCCPNAVNYVVGDSMCVFAAGHVEQGSELLINYLGAASLRPVEERQAGL 198
Query: 116 KEQYLFTCTCPRC 128
E Y F+C CPRC
Sbjct: 199 AEAYAFSCGCPRC 211
>gi|452988674|gb|EME88429.1| hypothetical protein MYCFIDRAFT_43814 [Pseudocercospora fijiensis
CIRAD86]
Length = 334
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 8/100 (8%)
Query: 36 NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA--VVRAV 88
NF I S PLG G++P++S +NHSCLPNA + RL +V +
Sbjct: 89 NFPGSPTPFSNIVRSNGYPLGPSSSIGGIFPLVSRLNHSCLPNAQHAYNERLGKMLVHII 148
Query: 89 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ + E+ +SYI G + RQ LK +LFTCTC C
Sbjct: 149 RPILPNEEITLSYI-PGGPSPQRQTELKSNFLFTCTCTLC 187
>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
Length = 636
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 3/78 (3%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
G GLYP ++ NHSC PN G L VV AV+ VP G+++ ISY++ RQ
Sbjct: 228 FGVGLYPAGAMFNHSCRPNCSWRTNRSGELCVV-AVEDVPAGSQLFISYVDILQPWPVRQ 286
Query: 113 KALKEQYLFTCTCPRCIK 130
L+ + F C CPRC +
Sbjct: 287 DLLRCHFFFECACPRCCR 304
>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
Length = 517
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 11/163 (6%)
Query: 13 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
A I + + + + +I L N H I S+ P G++ S +SCLP
Sbjct: 189 ASIGQFIPKFFKTQKFTEEQIMRAVGALQINGHEIPTSD--PPHVGVFYTASFTENSCLP 246
Query: 73 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
N F + ++ A Q + K A + I Y + T RQ+ L + LF C C RC +
Sbjct: 247 NLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCADV 306
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQC 171
+ D +C+D C G +L D+ + C++C
Sbjct: 307 TELD-----TFYSALKCEDRQCGGLMLPSKSDEWNGSWRCREC 344
>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
Length = 393
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQGDEICISYLDECMLERSRHS 345
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363
>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
Length = 397
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 62/137 (45%), Gaps = 9/137 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYPV SI+NH+C+PN F G L + V+A + + G+E+ Y T R+ L
Sbjct: 209 GLYPVASILNHNCVPNTRNCFNGDLQMTVKATKTINAGSEIFTCYSGMLWGTPARRLYLY 268
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGL 173
+ F C C RC D + +L +C C G LL K + C +CG+
Sbjct: 269 KSKHFLCDCERCA-----DPTERGTLLAALKCFSTECQGSLLPIQPLKTTTAWRCLECGM 323
Query: 174 VRSKEEIKKIASEVNIL 190
+ I I S + L
Sbjct: 324 RVPNDNICVIQSALGSL 340
>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
Length = 646
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 94/216 (43%), Gaps = 26/216 (12%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
+FDD+ I +RC D+ + C CG + + + K+ + +
Sbjct: 540 NWPKFDDLPRDVI--RFRC-----------DAPNNCSACVTCGEITNILKGLKVMQDTEM 586
Query: 190 LSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
+++ L G + + ++ + +I + + L PF
Sbjct: 587 MTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 622
>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 519
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 12/167 (7%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV- 85
+ S +EIA + N H + SE P+ +Y S + HSC N F +V
Sbjct: 198 DFSEDEIARAAGIMQINGHEVPISE--PVLIAVYDESSYVEHSCRANCSKSFTSSGGIVI 255
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
RA + KG ++I Y + T R+ L F CTCPRC +F ++
Sbjct: 256 RAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFFECTCPRCSDPAEF-----GSMFNA 310
Query: 146 YRCKDDGCSGFLLRDS----DDKGFTCQQCGLVRSKEEIKKIASEVN 188
C CSG +L S + + C +C RS + ++ ++
Sbjct: 311 IECSKSDCSGCMLPKSFTVQEIPDYRCNKCDNSRSSNSVDEMLEQIG 357
>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 673
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 83/178 (46%), Gaps = 18/178 (10%)
Query: 36 NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
N + CNA+ I + E R +G +YP IS+INHSC PN V + V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRHIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509
Query: 86 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
R ++ + KG E+L Y + ++R + L ++Y F C C C + QF + E+
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYRFLCACEACTQNWQF-PLPETM-- 566
Query: 144 EGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
Y+CK + S L + + + +QC K ++KKI ++ +K L S
Sbjct: 567 -NYKCKMCSEIISTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623
>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 681
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 17/144 (11%)
Query: 43 NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
NAH + + L RPL G +YP ++ NH C P F GR V+RA++ +
Sbjct: 454 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 513
Query: 92 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 148
G V +Y I T S R++ L +Y F C C C + +F+ + + RC
Sbjct: 514 RAGDAVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 571
Query: 149 KDDGCSGFLLRDSD-DKGFTCQQC 171
+GCS R D D+ C C
Sbjct: 572 PTEGCSKLHPRPRDPDRSIECSSC 595
>gi|149238794|ref|XP_001525273.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450766|gb|EDK45022.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 619
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 35 ENFSKLA-CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP- 92
ENF L N +T+ + + R +G L P S+INHSC+PN+ V E R + +P
Sbjct: 213 ENFVNLIFINCNTLVDYKFRIVGYNLDPQFSMINHSCMPNSCQV-ETEYGCYRLINTLPI 271
Query: 93 -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G E+ ++YI + T RQ L +Y F C CP C
Sbjct: 272 NAGDEITVNYIASGIPTELRQVQLFSRYFFRCKCPLC 308
>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
Length = 775
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/311 (21%), Positives = 120/311 (38%), Gaps = 46/311 (14%)
Query: 48 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG- 106
CN E +G +Y S+ NHSC PN F R +RA + G + +SY G
Sbjct: 464 CNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRATVFMAVGCPLELSYGPQVGQ 523
Query: 107 -STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 165
R + LK++Y F C C C + D ++ + C + C G +L D
Sbjct: 524 LDCKDRLQLLKDEYSFNCQCSGCSTVHISD-----LVINAFCCINPNCRGVVL---DRSI 575
Query: 166 FTCQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK----- 215
F+C+ V + ++ + L CG++ ++ S+ ++K
Sbjct: 576 FSCENTKTKDFLTVNDQMILEPFMQTDSFLHAGPSHCLKCGSYCDIKSSRLTVDKAGIHF 635
Query: 216 --LQ-----------------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
LQ K H ++ + + + L + L + A
Sbjct: 636 TRLQQEINLNRVSETTVSDALGALISLKSTLHEYNRRIAEAEDNLSQAFSLLGKLELAAE 695
Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
+C+ +I + +++Y + H +G + + +GD NA+ + +I R +G+N
Sbjct: 696 HCKASIRILEKLYGENHIAIGNELSKLSSILISVGD-HNAVDCIKRLSKIFRCYYGSNVN 754
Query: 311 FMKELILKLEE 321
M + LEE
Sbjct: 755 TMFPFLNILEE 765
>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 652
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 21/133 (15%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISY 101
L P+G + P +++ NHSC PNA LVF E L++V A+++V G E+ ISY
Sbjct: 314 LTPIGICVSPTVALANHSCDPNAALVFPRADGGSRVKEPLLSLV-ALRNVAPGKEIRISY 372
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG---FLL 158
++T RQ LKE Y F+C C C + D E C C G +
Sbjct: 373 VDTTLPNRLRQNELKEVYSFSCQCKLCCRTTAADP------REALWCPKS-CGGICPYPT 425
Query: 159 RDSDDKGFTCQQC 171
+S D+ C +C
Sbjct: 426 EESGDRATHCVKC 438
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLY + S INHSC PNA + F A +R ++ + G E+ ISY +T+ + R+ L
Sbjct: 260 GLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLF 319
Query: 117 EQYLFTCTCPRCI 129
E Y F C CP+C+
Sbjct: 320 ENYGFNCECPKCL 332
>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
rotundata]
Length = 388
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 52/86 (60%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSAVNHSCIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
RQKAL YLF C C +C L Q +D
Sbjct: 347 RQKALSSLYLFICCCSKC--LSQTND 370
>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 735
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
+ TG+YP S++NHSC PN + +F G+ +VRA + + + + Y +T RQ
Sbjct: 456 VATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIDQSEGIFNCYGPHYRHMTTEDRQ 515
Query: 113 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
K LK QY FTC C C KL F I+ ++ +C C+ ++DS C
Sbjct: 516 KILKSQYCFTCKCKACTLPKLQYF--IERFNAVKCLKCNGPVCN---IKDS----IYCLD 566
Query: 171 CG 172
CG
Sbjct: 567 CG 568
>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
Length = 338
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA + F V+A+ ++ G E+ ISY++ S +
Sbjct: 234 GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTGEEICISYLDECALERSRHS 293
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E YLF C C +C
Sbjct: 294 RHKILRENYLFNCNCSKC 311
>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Oreochromis niloticus]
Length = 725
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 128/322 (39%), Gaps = 57/322 (17%)
Query: 41 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 78
A N+ N E+R + T ++P +S++NHSC PN LVF
Sbjct: 405 ATNSLVQSNREIR-IATAVFPALSLLNHSCCPNTSLVFNTGGIIEPSGSALSADFSEGEA 463
Query: 79 ------EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 130
+G VRA + + G E+L Y + +T R++ L+EQY F C C C +
Sbjct: 464 EWRHKAQGVTVTVRAAKLITPGQEILHCYGPHSSRMATQERRRLLQEQYYFLCQCEACSQ 523
Query: 131 LGQFDDIQESAILE-GYRCKDDG-----CSGFLLRDSDDK--GFTC--QQCGLVRSKEEI 180
Q D++E + G +D G C G L + S+D+ GF C C S ++
Sbjct: 524 HQQ--DVEEPRQQQPGAEGRDLGLLCIKCKGSLKKRSEDREAGFVCSLSSCSHCMSLSDV 581
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR-------E 233
E+ + +K + L E + +L KK V L +T +
Sbjct: 582 NNRLQEIRVNLEKAVELMETDRPGEAL-------RLLKKTLCQSGVILAETHPLQGELAD 634
Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
+ + DW A ++ + + + + LG Q + +L + G A+
Sbjct: 635 ATARAYASMGDWNNAASHLERSTVAIASQFGKESVELGRQLFKLAQLHFNSGSRGPALSV 694
Query: 294 MTEAVEILRITHGTNSPFMKEL 315
+ + +L + G + ++EL
Sbjct: 695 IPKVRRLLCLHCGPHCHELQEL 716
>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 516
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 46/87 (52%), Gaps = 14/87 (16%)
Query: 58 GLYPVISIINHSCLPNAVLVF-------------EGR-LAVVRAVQHVPKGAEVLISYIE 103
GLYP+ ++INHSC PNAV VF +GR + +V A + KG E+ SY+
Sbjct: 244 GLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWSYLP 303
Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIK 130
R+ L ++ FTC C RCIK
Sbjct: 304 PCSPFNERRAVLSSKFGFTCQCVRCIK 330
>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 628
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/300 (21%), Positives = 124/300 (41%), Gaps = 28/300 (9%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
+K+A + +C + + T +YP S++NHSC PN + F + +VRA++ + G E
Sbjct: 326 LNKIASDKDKLCVEQQDRIATAIYPSASMMNHSCDPNIITSFVDQYLIVRAMKDIQAGEE 385
Query: 97 VLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
V Y S RQ+ LK QY F C C C+ + ++++ E +C + C
Sbjct: 386 VSNCYGPNFRRMSREHRQEILKNQYGFECRCDPCV-MPEYENFMER--FHAIKCPE--CG 440
Query: 155 GFLLRDSDDKGFTCQQCG---LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
G L + C CG + + E+K + + +E V K
Sbjct: 441 G-ALEEFHSYSMHCLDCGATPYINCQIELKNAVTAFDAAQ----IYIELEREEEAVVKLK 495
Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY-------- 263
+++ L + + +L T + L K+ + W E++++ + ++ + Y
Sbjct: 496 ECLSIRRNLLYKYHQDLTATLDILGKVYAIMGRWLESISHIEHSMAAIEERYGCSSMEVA 555
Query: 264 PQFHPLLG-----LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 318
+ + L LQ +W+ + + + A EI+ T+G +E+I K
Sbjct: 556 NELNKLTDICIRYLQEEPNRTTKWYKNVLKKTRRYLDRAEEIVNFTYGPWHEIHQEIIEK 615
>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
Length = 567
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 334 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 386
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y + T RQ LK+ Y F C+C CI
Sbjct: 387 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 446
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 447 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 504
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + ++ + +I + + L PF
Sbjct: 505 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 543
>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 2/101 (1%)
Query: 37 FSKLACNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
F+ + N IC+ E +G+ +Y +++NHSC PNA VF + +A++ + G
Sbjct: 30 FAAIKNNQFAICDEESGDYDVGSAVYIDHALVNHSCRPNAYPVFNKTNMIFKALRKIEPG 89
Query: 95 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
E+ +Y +T R++ L + + F C CP C K + D
Sbjct: 90 EEITHAYTDTISPIQERREYLNDVWRFMCNCPGCTKSNEID 130
>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
familiaris]
Length = 419
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
Length = 660
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y + T RQ LK+ Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 636
>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 524
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
G+GLY + NHSC PNA + E A +R++ +P G ++ ISYIE + TRQ
Sbjct: 444 GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQIEISYIEEDQPSQTRQS 503
Query: 114 ALKEQYLFTCTCPRC 128
L ++Y F C C +C
Sbjct: 504 ELIDKYKFKCHCQKC 518
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)
Query: 13 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
A I + + + S EI L N H + S+ P ++ S +SC+P
Sbjct: 187 ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTASFTENSCVP 244
Query: 73 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
N F + ++ A + + KGA + I Y + T RQ+ L + LF CTC RC+ +
Sbjct: 245 NLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERCVDV 304
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
+ +C+D C+G LL D C +C
Sbjct: 305 TEL-----GTYYSALKCEDRKCTGLLLPIKADDWNGCWRC 339
>gi|328770867|gb|EGF80908.1| hypothetical protein BATDEDRAFT_88247 [Batrachochytrium
dendrobatidis JAM81]
Length = 428
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 47/265 (17%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQK 113
G+G+YP S+INH RA +++ KG + ISY E+AG ++ TRQ+
Sbjct: 153 GSGIYPTASLINH-----------------RASKNIRKGNAITISYGESAGRSLRATRQQ 195
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
LK + F C C CI D Q L G +C GC + L + + + C +C
Sbjct: 196 NLKAGWGFECRCSACISEDDTKDRQ----LAGLKCP--GCP-YPLYEGEPR---CPKC-- 243
Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ------KKLYHPFSVN 227
I I+ + +++K C +H++ + +I+ LQ + + HP S
Sbjct: 244 ----HSIVDISRQQKVMAKIDAVWHECQDHRKHDNVASLIQTLQGIQTDLESIMHPSSSR 299
Query: 228 LMQTREKLIKILMELE---DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-CGKLEWF 283
L + L + ++L D + L++C+ + R+ L ++ K+ +F
Sbjct: 300 LGCMYDNLASLNVQLAKQGDNRILLSWCEKLLESSIRIVEGIFGLTSIEAANEHNKMAYF 359
Query: 284 LG--DTENAIKSMTEAVEILRITHG 306
+ D E IK A + R +G
Sbjct: 360 MMAVDAEKGIKIARHAEGLFRTLYG 384
>gi|50552992|ref|XP_503906.1| YALI0E13552p [Yarrowia lipolytica]
gi|49649775|emb|CAG79499.1| YALI0E13552p [Yarrowia lipolytica CLIB122]
Length = 637
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 50/101 (49%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
+ +I E L N +T C+ R G P +++INHSC+PNA L+F GR + +
Sbjct: 179 LEQILEYAGMLFVNQYTRCDDLGRQGGYIFDPTLALINHSCVPNAYLLFRGRKVHLVCWK 238
Query: 90 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+ G EV +SY T R+ L + F C CP C+
Sbjct: 239 PINDGDEVFLSYTRFMHPTPERRTLLYMHFRFWCECPGCVS 279
>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
Length = 697
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)
Query: 18 LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
LIL W E ++ A F++L CN+ + N++ +G LYP S N
Sbjct: 520 FTKLILPWLGEGGSTEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579
Query: 68 HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
HSC PN V + G LA A++ + KG + I Y++ ST R++ L Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639
Query: 126 PRC 128
RC
Sbjct: 640 ARC 642
>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
Length = 518
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 103/242 (42%), Gaps = 35/242 (14%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-KGAEVL-ISYIETAGS 107
S++ L GLYP +++ HSC+PN +L +G + VR P K E+L ISY +
Sbjct: 190 SQMGNLARGLYPKTALMMHSCMPNTLLSVDG-YSNVRVFTSAPVKMGEILNISYTRSLFG 248
Query: 108 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKG 165
T RQ L++ F C+C RC D + L RC DDG F D+
Sbjct: 249 TYDRQTHLRKGKYFICSCRRCK-----DPTELGTHLSSIRCSECDDGLCSFY---PDEPR 300
Query: 166 FTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL--Y 221
+ C +C + +E + +I A+ +I++ EV ++I K K L +
Sbjct: 301 WECNKCHKLLKREYVNEILCAARDDIMACPL----------EVRDLERVISKHSKILNPH 350
Query: 222 HPFSVNLMQTREKLIKILMELED--------WKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
H + QT +K + D + C+ +P+ + + P L+G+
Sbjct: 351 HSLVLEAKQTLMGELKAICMSTDPTAVPKPVLRRKFELCEEILPILRTLEPGLSRLVGIA 410
Query: 274 YY 275
Y
Sbjct: 411 MY 412
>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 654
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
LG +Y + NHSC PNA F G L V+RA + +G E+ I Y G+T R+
Sbjct: 439 LGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTERR 498
Query: 113 KALKEQYLFTCTCPRCI 129
+ L +QY F C C CI
Sbjct: 499 QHLYKQYRFNCDCRACI 515
>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
Length = 392
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S +NHSC+PNA + + V++AV + KG E+ ISY++ S +
Sbjct: 286 GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHS 345
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E Y+F C C +C
Sbjct: 346 RHKILRENYIFVCQCSKC 363
>gi|414870819|tpg|DAA49376.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
Length = 489
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 129/308 (41%), Gaps = 56/308 (18%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
S++R + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229
Query: 110 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD---- 151
RQK+L+E Y F+C C C +L D + S AI E YR K++
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSDLVMNSFCCPQSNCLGAISESTYYRSKENFVNV 289
Query: 152 -----------------------GCSGFLLRDSD-----DKG--FTCQQC-GLVRSKEEI 180
+ LL++S D G +C+ C L +
Sbjct: 290 SLGGSYVCKLSLPDVSKVDMDMENVAKSLLQNSGVSLNIDHGCCMSCRSCIDLSSALATS 349
Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
+ S ++ L K T + + + +++L KKL HP+S L Q + + +
Sbjct: 350 HREESTIDRLKKHTFL-----DKTLITEALQSLKQL-KKLRHPYSKALAQAEDTIAEAFA 403
Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
++ D + A +C+ +I + +++Y H ++ + + LGD +A + +A I
Sbjct: 404 KVGDQELARKHCEASIQILEKLYHPKHIIIAHELIKLVSILLSLGDGASAAATFAQAEAI 463
Query: 301 LRITHGTN 308
+ +G++
Sbjct: 464 FSLYYGSH 471
>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
Length = 785
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 7/118 (5%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
+ H N E +G +Y V S+ NHSC PN F R +R Q V G ++ +SY
Sbjct: 462 SVHLTSNVEQVRVGKAIYKVGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYG 521
Query: 103 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
G RQ LK++Y F C C C ++ D +L + C + CSG +L
Sbjct: 522 PQVGLWDCKDRQSFLKDEYAFHCQCTGCSEVNLSD-----IVLNAFHCVNPNCSGAVL 574
>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 481
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 12/156 (7%)
Query: 9 LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL---GTGLYPVISI 65
L++ A + +LQ + ++EI E +L N ++ R G+ LY ++
Sbjct: 288 LIMEALSGRATSDVLQG--LDVDEIVEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAM 345
Query: 66 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
NHSC P+ V F+G+ +RA++ + G E+ ++YI T RQ L++ Y F C C
Sbjct: 346 FNHSCFPSVVRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRC 405
Query: 126 PRCIKLG-QFDDIQESAILEGYRCKDDG--CSGFLL 158
RC G + + E +L C +D C G L+
Sbjct: 406 QRCEARGAEVERYMEECLL----CPNDKRECDGLLV 437
>gi|116179288|ref|XP_001219493.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
gi|88184569|gb|EAQ92037.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
Length = 588
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 1/70 (1%)
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
L+P ++INHSC+PN+ F G L V+RA +++ KG E++ SY E+ G RQ+AL
Sbjct: 425 LWPWAALINHSCIPNSEREFVGDLMVIRATKNIAKGEEIVHSYDES-GVYDDRQRALMTT 483
Query: 119 YLFTCTCPRC 128
+ F C+C C
Sbjct: 484 WGFECSCALC 493
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
K+ NA ++++ +G L P +++ NHSC+PNA + F GR A++RA + + E+
Sbjct: 169 KIQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADEEIE 228
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
ISY + R++AL Y FTC C RC
Sbjct: 229 ISYTDYNYPLSKRKEALAP-YFFTCECTRC 257
>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
Length = 159
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 46 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 105
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 106 RHKILRENYLFVCSCPKCL 124
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 15/148 (10%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
+Y S++ H C+ NA+ F + +V RA +PKG ++ + Y E TM RQ+ L
Sbjct: 231 AVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLS 290
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDS---DDKGFTCQQC 171
+ F C C RC + D G+ C+ +G LL ++ D + C+QC
Sbjct: 291 QTKFFQCVCERCKDPTELD-----TFTSGFYCQKCPQQSAGILLTENPLDDASDWICRQC 345
Query: 172 GLVR----SKEEIKKIASEVNILSKKTL 195
G + + E I+ + E+ L K ++
Sbjct: 346 GARQPANYAAEIIESVGKEIVALKKGSV 373
>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)
Query: 18 LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
LIL W E ++ A F++L CN+ + N++ +G LYP S N
Sbjct: 520 FTKLILPWLGEGGSTEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579
Query: 68 HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
HSC PN V + G LA A++ + KG + I Y++ ST R++ L Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639
Query: 126 PRC 128
RC
Sbjct: 640 ARC 642
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GL+ + S NHSC PN + F + AV+ A++ + KG E+ ISY ++ RQ L
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338
Query: 117 EQYLFTCTCPRCIK 130
+ Y F C CP+C +
Sbjct: 339 DNYGFNCDCPKCTQ 352
>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
Length = 763
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G+GL+ + S INHSC N+ F G L +VRA + + +G E+ I+Y+E RQKA
Sbjct: 515 GSGLWIMPSYINHSCWQNSTRSFLGDLLIVRAARDLSEGEEITINYMENESGVQKRQKAF 574
Query: 116 KEQYLFTCTCPRC 128
++ F C C C
Sbjct: 575 LSEWGFECKCTMC 587
>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 662
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
+ TG+YP S++NHSC PN + +F G+ +VRA + + + E+ Y +T RQ
Sbjct: 377 IATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIGQSEEIFNCYGPHYRHMTTEDRQ 436
Query: 113 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
K LK Y FTC C C KL F + +A + +C C+ ++DS C
Sbjct: 437 KILKSHYCFTCKCKACTLPKLQYFTERFNAA--KCLKCNGPVCN---IKDS----IYCLD 487
Query: 171 CG 172
CG
Sbjct: 488 CG 489
>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 490
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ S+ NH C+PN F+ + + VRA + G E+ +SY T R++ L
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
F+C C RC +F+ + SA+L C D CSG LL + + C +C +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNS-KLSALL----CASDKCSGELLPKNPLNMRTSWICNKCSI 328
Query: 174 VRSKEEIKKIAS 185
+ +I I S
Sbjct: 329 TVNHRQICSIRS 340
>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 576
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEG--RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
+G GL+ +S+ NHSC PNA + + + A V A++ + + +VL+SYI+ +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478
Query: 113 KALKEQYLFTCTCPRC 128
+ LK Y FTC C RC
Sbjct: 479 RLLKRHYKFTCACVRC 494
>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
Length = 400
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 50/96 (52%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
+ E + K+A N+ I + + +G G+YP S+INHSC N + +F G +RA + +
Sbjct: 170 MEEIYGKIASNSFAILDENMCSIGIGVYPQASMINHSCKSNCIGMFYGPQIQIRANEFIR 229
Query: 93 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G ++ YI T RQ+ L + Y F C C C
Sbjct: 230 PGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADC 265
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 52/104 (50%)
Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
H +V +++ + L+++ +EL W++A+ Y + P Y+ ++ GL Y
Sbjct: 281 HSDNVYVIRILKDLVEVCVELGLWEKAVVYGKRIEPGYKMFLKRYDLDQGLLYQKMALAY 340
Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
+ LGD A+ + +A L IT G +S ++E+ L E A+
Sbjct: 341 YRLGDQSAALPYLRQAKTTLTITSGEDSGLVQEVSDMLAECVAK 384
>gi|428180005|gb|EKX48874.1| hypothetical protein GUITHDRAFT_105498 [Guillardia theta CCMP2712]
Length = 197
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 22/158 (13%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
NAH I N + +G G++P ++NH+C NAV F VP GAEV + I
Sbjct: 7 NAHPITNGSFKRIGLGIFPSACMLNHACNYNAVYSF------------VPGGAEVHANAI 54
Query: 103 E----TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY-RCKDDGCSGFL 157
+ T+ + RQK ++E Y F C C C + S + + C C +
Sbjct: 55 DFGLNTSSQLIRRQKLIREGYFFQCACEWCAGTSGPERYFNSLVCNPWPDCDGHKCEAAI 114
Query: 158 LRDSD-DKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
L D++ D+G C + + I +E+N L+K
Sbjct: 115 LPDANGDEGQAVVICDACKRRHPI----TEMNALNKSA 148
>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
Length = 530
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 22/181 (12%)
Query: 10 LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
+L A + +N +L WPE+ I IA + L N + R LYP ++I
Sbjct: 199 VLRANLITFINTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPEAAMI 255
Query: 67 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
+H C+PN F+ + +V A + + KG + ISY + ST+ R+ L++ F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315
Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
RC +LG F Q +CK +G ++ + + + CQ C RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISQNPLLNSAPWKCQLCNFKRSAKE 366
Query: 180 I 180
+
Sbjct: 367 V 367
>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
Length = 392
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
I E +SK+ A N+E G+GLY + S INHSC+PNA F + V++A+
Sbjct: 267 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 322
Query: 91 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ +G E+ ISY++ S +R K L+E Y+F C C +C
Sbjct: 323 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 363
>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
vitripennis]
Length = 386
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 7/88 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
G+GLY + S INHSC PNA + F + RL VVRA++ + E+ I+Y++ S
Sbjct: 288 GSGLYVLQSAINHSCAPNASVEFPHSDSRL-VVRALRDIKPDEEICIAYLDECHLERSRH 346
Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDI 137
+RQKAL YLF C C +C + D+
Sbjct: 347 SRQKALSSLYLFVCKCDKCQQQASDPDV 374
>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
Length = 399
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 46/98 (46%), Gaps = 17/98 (17%)
Query: 58 GLYPVISIINHSCLPNAV-----------------LVFEGRLAVVRAVQHVPKGAEVLIS 100
GL+P + INHSC PNA L EG ++R+V + +G + IS
Sbjct: 132 GLWPQAAAINHSCRPNATHYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITIS 191
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
Y+E RQ+AL+ Y F CTC RC + D Q
Sbjct: 192 YVELGDPWPVRQEALRTGYGFACTCIRCTEEAALDSEQ 229
>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
Length = 300
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 187 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 246
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 247 RHKILRENYLFNCSCPKC--LAEADD 270
>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
Length = 625
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 11/106 (10%)
Query: 26 PEISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFEGRLAV 84
PE+ E+ F++LA N + S +R + G+YP S +NHSC PNA + G
Sbjct: 282 PELPPQEV---FARLAAN--VVKTSAVRNANSVGVYPFPSFLNHSCAPNACKLMVGHTMF 336
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTR--QKALKEQYLFTCTCPRC 128
+RA + + G EV + Y + TM + + A+ +++ F C CPRC
Sbjct: 337 IRAARDLVAGEEVFMKYFDV---TMPKPERSAVAKRWGFECACPRC 379
>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
Length = 680
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 5/111 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
LG G+Y ++++NHSC P + F G + VVRA++ G V +Y I T + RQ
Sbjct: 476 LGAGVYSTVALLNHSCEPGVIRHFIGDVMVVRAIKSFQPGEMVNENYGPIFTQKRRVDRQ 535
Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
++LK++Y F C C C + ++ E A+ +RC D C L+ SD
Sbjct: 536 RSLKDRYWFDCRCNPCTENWPLIGEMTEEALC--FRCADRRCRKPLVVQSD 584
>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
Length = 515
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 72/173 (41%), Gaps = 11/173 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
I + + + S EI L N H + S+ P ++ V S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMRTVGALQINGHEVPTSD--PPHVAVFYVASFTENSCLPNL 246
Query: 75 VLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
F ++ A Q + K + + I Y + T RQ+ L + LF C C RC
Sbjct: 247 AKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRCQ---- 302
Query: 134 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
D + +C+D CSG +L D G+ C++CG K+ ++ I
Sbjct: 303 -DPTELGTNYSAIKCEDRKCSGLMLPTKADDWHGGWQCKECGKQVPKQYVEGI 354
>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
mellifera]
Length = 387
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
RQKAL YLF C C +C L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371
>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
Length = 497
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 9/130 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ S+ NHSC PN VF+ + VVRA + +P+G+E+ SY ST R+ L
Sbjct: 207 GLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLY 266
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLV 174
F C C RC +F S + CK + +S DK + C+ CG +
Sbjct: 267 RTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSL 320
Query: 175 RSKEEIKKIA 184
KEE+ I
Sbjct: 321 IKKEEVANIT 330
>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
Length = 419
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 42 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
C C+ ++ +G L V ++INHSC PNA + EG VR+++ + G E+ +SY
Sbjct: 103 CKIARYCSKDV--IGYALDMVTAMINHSCAPNAFVTLEGCQLRVRSLKPIAAGEEITVSY 160
Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRC 128
+ RQK LKE Y F C C C
Sbjct: 161 ADPTLPVFNRQKFLKETYFFDCRCKGC 187
>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
Length = 372
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 46/79 (58%), Gaps = 7/79 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYI---ETAGSTM 109
G GLY + S NHSC PNA + F RLAV +A + + G E+ ISY+ + A
Sbjct: 281 GAGLYRIQSKCNHSCEPNAEVCFPNNNHRLAV-KACRDIAAGEEITISYLSQCQIARGCR 339
Query: 110 TRQKALKEQYLFTCTCPRC 128
+RQ+ LKE YLF C C +C
Sbjct: 340 SRQQYLKENYLFHCCCSKC 358
>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 679
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/267 (22%), Positives = 118/267 (44%), Gaps = 21/267 (7%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--S 107
S+ R + T +YP +S++NH+C+PN + F + VRA + + G E+ Y G +
Sbjct: 410 SQFR-VATAVYPTVSMMNHACIPNIIPSFRKGILCVRATKKIMHGDEIQHCYGPQVGHMT 468
Query: 108 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
T RQ+AL QY FTC C C + + D + ++ +C + +
Sbjct: 469 TSDRQQALLNQYCFTCRCRACTRDPKSRD---NLCMKCPKCG---------QPLNAMTSI 516
Query: 168 CQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
C +C LV KE + + ++ A+ ++V+S K + ++
Sbjct: 517 CGKCAVWIDVLVLLKEVDDTMITFAELVEMYPAAVKDVTLMRDVISKTKTCHDVLDRILI 576
Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
P ++L + K + L+++KEA + IP + + + + + Y ++ +
Sbjct: 577 PPHMHLAVAYNFMAKCHIALQEFKEAAPWLAKAIPTIECRFGRDSIEVANELYKLAQIYF 636
Query: 283 FLGDTENAIKSMTEAVEILRITHGTNS 309
+ A++++ A+E+ I H NS
Sbjct: 637 NGKEIAPAMEAIDRALELF-IRHYGNS 662
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 2/122 (1%)
Query: 7 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
K + L A A + +LQ E+ + + E K+ NA +++ G L ++++
Sbjct: 181 KDMELQATAAVVYAGLLQGEEV-LEKAREILCKIQTNAFNRLDADTGMAGIFLDVGLAMV 239
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
NHSC+PNA + F+ R A++RA + + +G E+ ISYI+ R +AL+ Y F C C
Sbjct: 240 NHSCVPNAFIGFDKRTAILRAERPIQEGEEITISYIDNTLPKAARYEALR-LYHFQCDCV 298
Query: 127 RC 128
RC
Sbjct: 299 RC 300
>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 303
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
paniscus]
Length = 304
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
florea]
Length = 387
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
RQKAL YLF C C +C L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371
>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
Length = 604
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 10/117 (8%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 169
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCE 583
>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 647
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 59/124 (47%), Gaps = 8/124 (6%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G GLYP + + NHSC P F G VR V+++P G+ V +Y + R+
Sbjct: 446 IGAGLYPTLVLFNHSCDPGITRYFVGSAVFVRTVRNIPAGSVVAENYGQLFVRAPRHERR 505
Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC-KDDGCSGFLLRDS--DDKGFTC 168
K+LK+ Y F C C C + F D+ S + +RC +GC L+ D C
Sbjct: 506 KSLKKLYKFDCYCQACYEDWPTFFDMNPSVV--RFRCAATEGCENGLIYTERLDQNVMKC 563
Query: 169 QQCG 172
++C
Sbjct: 564 EKCN 567
>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
africana]
Length = 417
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387
>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
Length = 513
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 41/316 (12%)
Query: 17 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
N+V I + ++ ++ +E + C I E+R +YP ++++NHSC+
Sbjct: 173 NIVEYIRK--QLKLDRFSEKEIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCIS 230
Query: 73 N-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
N + ++ RL R +P G E+ SY + TM R++ L E F C C R
Sbjct: 231 NTCHSISPTDYKIRL---RTTLKIPPGGELYGSYTHSLLPTMLRREHLLEGKHFACACAR 287
Query: 128 CIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 185
C D + + +C D+G L + + C C S ++K+
Sbjct: 288 C-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQ 342
Query: 186 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
+++ A+ S + + + + + K + + HP L R L ++ +ED+
Sbjct: 343 IIHMEVDAVEAI-SGADGADAIQERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDY 401
Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE------- 288
+ + C+L + V + P + + G+ Y F T+
Sbjct: 402 LLEDLPDVVLEHKIDMCKLLLQVLDVIEPGYSRIRGMTLYELHAPLLFTAKTQWNAEVID 461
Query: 289 -NAIKS-MTEAVEILR 302
A+KS M EA +IL+
Sbjct: 462 KAALKSKMIEASKILK 477
>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
Length = 391
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 104
+ S L G+ LY S INHSC PNA VF + +RA + + G E+ ISY++
Sbjct: 279 VVGSFLNNEGSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDE 338
Query: 105 AG---STMTRQKALKEQYLFTCTCPRC 128
S +RQK LK+ YLF C C +C
Sbjct: 339 CNLQRSRHSRQKTLKDYYLFICQCEKC 365
>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
Length = 400
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 287 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 347 RHKILRENYLFNCSCPKC--LAEADD 370
>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
VEG]
Length = 402
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
LG GL+ +S+ NHSC PNA + F A V A++ + + E+L+SYI+ + RQ
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQVTALRDIAEKEEILLSYIDESLPLAERQ 322
Query: 113 KALKEQYLFTCTCPRC 128
+ LK Y FTC C RC
Sbjct: 323 RLLKSHYKFTCGCQRC 338
>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
Length = 379
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
Query: 58 GLYPVISIINHSCLPNAV----LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
G+YP ++ NH CLPNA + + ++R + VP+G E+ +SY TRQK
Sbjct: 209 GIYPKAALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQK 268
Query: 114 ALKEQYLFTCTCPRC 128
L+E Y F C C RC
Sbjct: 269 RLREDYGFICDCDRC 283
>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
Length = 348
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
G++P +S NH C PN G AV+ +P G E+ ISYI S TRQ L+E
Sbjct: 173 GVFPEVSKFNHDCRPNVHYRLNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE 232
Query: 118 QYLFTCTCPRC 128
+ FTCTCP+C
Sbjct: 233 -WGFTCTCPQC 242
>gi|294886191|ref|XP_002771602.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239875308|gb|EER03418.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 265
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 9/98 (9%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
GT +YPV S+ NHSC PN G L R V +G E+ ISYI + KA+
Sbjct: 17 GTAVYPVASMFNHSCTPNVTRASLGDLTWFRTCTDVKRGQELTISYI--GSDLLCEPKAV 74
Query: 116 KEQYL---FTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
++++L F+C CP C K +D +S ++ + +D
Sbjct: 75 RQKHLARDFSCNCPACTK----EDDDDSVVIMPFTLQD 108
>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
Length = 565
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 38/199 (19%)
Query: 58 GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GLYP+ ++ H C+PN F +G +A +++ KG + +Y + T R+
Sbjct: 263 GLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKGEHLTTTYTHSLWGTQQRRDH 322
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC---KDDGCSGFL-----LRDSDDKGF 166
LK F+C+C RC D + L +C + C+G+ L+++ D +
Sbjct: 323 LKANKYFSCSCARCA-----DPTELGTYLSALKCMGIDGNACAGYQLPIDPLKETSD--W 375
Query: 167 TCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL-- 220
C QC + +++ KI EV+ + E S+ K +E L KL
Sbjct: 376 KCNQCPITIESDQVNFLLSKIGEEVD-------------DAMERKSSVKQLEDLISKLLT 422
Query: 221 -YHPFSVNLMQTREKLIKI 238
HP L+Q + LI++
Sbjct: 423 FLHPNHHFLLQLKHSLIQM 441
>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
troglodytes]
Length = 299
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
Length = 302
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
Length = 416
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 362
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386
>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
angustifolius]
Length = 357
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 24/143 (16%)
Query: 58 GLYPVISIINHSCLPNAVLV-----------FEGRLAVVRAVQHVPKGAEVLISYIETAG 106
+YP S+ NH C+PNA V+R +Q +P+G EV ISY +
Sbjct: 212 AIYPKASMFNHDCIPNACRFDYVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISR 271
Query: 107 STMTRQKALKEQYLFTCTCPRCI-----------KLGQFDDIQESAILEGYRCKDDGCSG 155
TR++ L + Y F+C C RC + ++ D+ + Y C C+G
Sbjct: 272 DYCTRKRILMDDYGFSCECDRCKIEANWPHDCQNYVEEYSDLPHVRFIAKYVCDRKNCNG 331
Query: 156 FLL--RDSDDKGFTCQQCGLVRS 176
L D+ C CG ++S
Sbjct: 332 TLAPKDDAHTNVLECNFCGNLKS 354
>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
Length = 507
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
LYP I+ H+C+PN ++ EG +RA+ + G + SY T T RQ+ L
Sbjct: 207 LYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLKDGQPLHHSYTYTLDGTAQRQRHL 266
Query: 116 KEQYLFTCTCPRC---IKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ F+CTC RC +LG F ++ EGY+ D + C C
Sbjct: 267 KQGKYFSCTCERCQDPTELGTHFSSLKCGQCTEGYQVPRQP-------TDSDSSWNCNNC 319
Query: 172 GLVRSKEEI----KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S+ E+ + + SEVN + + + ++ Y K L HP
Sbjct: 320 GASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEAERLLRKY-------KSLLHPLHFI 372
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 419
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 47/98 (47%), Gaps = 1/98 (1%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQH 90
E+ E SK+ N + + +L +G LY S NHSC PN EG+ A
Sbjct: 278 ELMEYISKIESNTFGMWSDKLVSMGMVLYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHD 337
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+P G EV I YI+ T +R+ L Y FTC CP C
Sbjct: 338 IPAGDEVCIRYIDVDKPTTSRRSELLSHYHFTCMCPLC 375
>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
impatiens]
Length = 386
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY S INHSC+PNAV+ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
RQKAL YLF C C +C L Q +D
Sbjct: 347 RQKALSSLYLFHCYCNKC--LSQVND 370
>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
Length = 417
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387
>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)
Query: 18 LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
LIL W E ++ A F++L CN+ + N++ +G LYP S N
Sbjct: 520 FTKLILPWLGEGGSIEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579
Query: 68 HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
HSC PN V + G LA A++ + KG + I Y++ ST R++ L Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639
Query: 126 PRC 128
RC
Sbjct: 640 ARC 642
>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 428
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
N+ + N E + +G GL+ S INHSC PN V + R ++++ + G E+ ISYI
Sbjct: 154 NSFCLTNGEEQVIGIGLFDYASFINHSCCPNCVPLQNKREMSIKSLTSIKSGEEIFISYI 213
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRC 128
+ S R+K L+ ++ F+C C C
Sbjct: 214 DITESFERREKELR-KWHFSCGCSLC 238
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GL+P+ +INHSC PN G +A VRA++ + +G E+ +SYI+ R+K
Sbjct: 208 AVGLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKE 267
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCS------GFLLRDSDDKGF 166
L+E F C C RC + ES LEG+ C CS +LL +DK
Sbjct: 268 LRETKHFDCQCGRCAA-----PLSESVDRYLEGFCCP--RCSVMASGKDYLLAQVEDK-L 319
Query: 167 TCQQCGL 173
C C L
Sbjct: 320 VCSDCQL 326
>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Cricetulus griseus]
Length = 435
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 322 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 381
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 382 RHKILRENYLFICSCPKC--LAEADD 405
>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
Length = 391
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S +NHSC+PNAV+ F V++A++++ G E+ SY++ S +
Sbjct: 295 GSGLYVLQSSVNHSCIPNAVVEFPYSNNTLVLKAIRNIKVGEEICTSYLDECQLERSRHS 354
Query: 111 RQKALKEQYLFTCTCPRC 128
RQ+AL YLF C C +C
Sbjct: 355 RQQALSSLYLFVCHCDKC 372
>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 749
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/217 (22%), Positives = 90/217 (41%), Gaps = 16/217 (7%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 112
+ T +YP S++NHSC PN + F G+ + +A + + G EV Y + RQ
Sbjct: 460 IATAIYPSASMMNHSCDPNIINSFLGQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQ 519
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK------DDGCSGFLLRDSDDKGF 166
+ ++ QY F C C C +++DI + +C +D CS L R S
Sbjct: 520 EKMESQYCFKCNCAAC-SAPEYEDILKK--FTAKKCPECSGPLNDNCS-VLHRSS----I 571
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
C CG + + +K N+ E + K +L+K + +
Sbjct: 572 HCMDCGTAVYENIYDFTLRQAQHYFEKAEICIENENYDEALDKLKKCLRLRKDALYKYHD 631
Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 263
++ T + + K+ + W +++Y + +I + Y
Sbjct: 632 DIADTMDFIAKVYAIIGQWLNSISYLEHSIVAIEEKY 668
>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
Length = 391
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 9/79 (11%)
Query: 58 GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
G+YP S+ NH CLPNA F+ VVR + VP+G E+ +SY +
Sbjct: 220 GIYPKASLFNHDCLPNACR-FDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYS 278
Query: 110 TRQKALKEQYLFTCTCPRC 128
TR+K L E Y FTC C RC
Sbjct: 279 TRRKRLLEDYGFTCDCDRC 297
>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 687
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G G+YP ++++NHSC P V F G ++RAV+ + G E+ +Y I T R+
Sbjct: 490 IGGGVYPTVAMLNHSCNPGVVRYFVGTTMILRAVRTISAGEEISENYGPIFTTTPENERK 549
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLL---RDSDDKGFTC 168
+ L+ QY F C C C G + ++E + ++C+ G +L D+++ C
Sbjct: 550 RRLRVQYWFDCNCEACT--GHWPLLEELDPTVLRFKCETGPSCGNVLPIRSDTNEFMIGC 607
Query: 169 QQCG 172
+CG
Sbjct: 608 VKCG 611
>gi|392558999|gb|EIW52184.1| hypothetical protein TRAVEDRAFT_75273 [Trametes versicolor
FP-101664 SS1]
Length = 721
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 63 ISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
IS INHSC PNAV FE VRA+ +P GA+V ISYI+ A RQ+AL Y
Sbjct: 179 ISRINHSCSPNAVYRFEPATLTFEVRALSPIPSGAQVFISYIDPALPRAKRQEALS-SYG 237
Query: 121 FTCTCPRCIKLG 132
FTC C C G
Sbjct: 238 FTCACTACALTG 249
>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
Length = 746
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TG++ + NH+C+PNA F G + +VRA + +P GAE+ + Y A +R+ K
Sbjct: 503 TGMWLHAAYANHTCIPNATRAFIGDMMIVRAARDIPAGAEIFMGYASLAEPFESRRSKFK 562
Query: 117 EQYLFTCTCPRC 128
Y F C C C
Sbjct: 563 TSYGFECDCEMC 574
>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
Length = 188
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 33 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
I E +SK+ A N+E G+GLY + S INHSC+PNA F + V++A+
Sbjct: 62 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 117
Query: 91 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ +G E+ ISY++ S +R K L+E Y+F C C +C
Sbjct: 118 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 158
>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
Length = 416
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIE---TAGSTMT 110
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERSRHS 362
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386
>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
Length = 420
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
Length = 656
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TGL+ S +NH+C+ NAV F G L V RA +H+ G E+ +Y E A RQ AL+
Sbjct: 509 TGLWVRASYVNHACVANAVKDFAGDLMVFRAARHIRAGEEIFHAYDENADYD-ARQAALR 567
Query: 117 EQYLFTCTCPRC 128
+ + F CTC C
Sbjct: 568 KTWGFACTCALC 579
>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
Length = 395
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYI---ETAGSTMT 110
G+ LY + S NHSC+PNA + F A V A+Q + + E+ ISY+ + S +
Sbjct: 295 GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQENEEICISYLGECDIGRSRHS 354
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
RQK L+E YLF C C +C Q DD
Sbjct: 355 RQKILRENYLFNCNCMKC--QSQCDD 378
>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
Length = 128
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA FE G VV A + +PKG E+ ++Y + ST+ R+ L
Sbjct: 51 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110
Query: 117 EQYLFTCTCPRC 128
F C CPRC
Sbjct: 111 MTKHFICQCPRC 122
>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
Length = 648
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G+ +PV+S++NHSC PN V + G + VV+ + + +G E+L +Y + R++
Sbjct: 479 IGSAAFPVVSLMNHSCNPNVVHLCYGDVMVVKVIHRIARGEEILDNYGYHYATHEKRERQ 538
Query: 115 LK--EQYLFTCTCPRCIK 130
LK +QY F C C C++
Sbjct: 539 LKLCQQYYFRCRCQSCVE 556
>gi|307183932|gb|EFN70520.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 421
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 59/126 (46%), Gaps = 9/126 (7%)
Query: 53 RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 108
RP LG +YP ++ NH C P F GR V+RA++ + G + +Y I T +
Sbjct: 218 RPVYLGVAIYPTVARFNHDCYPAVTRYFVGRHIVIRAIRGLRPGDVIAENYGPIFTKRTL 277
Query: 109 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLR--DSDDKG 165
RQ+ L +Y F C+C C + +F+ + ++ RC GC+G R DK
Sbjct: 278 AERQRTLAGRYWFRCSCRACQEDWPRFETLTNDSVR--LRCPTTGCNGLHSRPQQRPDKP 335
Query: 166 FTCQQC 171
C C
Sbjct: 336 IKCSAC 341
>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 419
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
porcellus]
Length = 416
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1240
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 117/277 (42%), Gaps = 30/277 (10%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 113
+ ++PV S++NHSC N +L + G +++++++ K E+ Y A R++
Sbjct: 893 IAYAIFPVSSLLNHSCDNNTILQYNGSSITIKSLKNISKNDEITGCYGPHAFHLELRERL 952
Query: 114 -ALKEQYLFTCTCPRC-IKLGQFDDIQESAILE--GYRCK-----------------DDG 152
LKE+Y F C C C K+G + LE G +CK D
Sbjct: 953 ECLKEEYFFICRCHACNAKIGPNLLRCPTQSLESGGEQCKGTLLERVDTNSLGQEEDDSY 1012
Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV----NILSKKTLALTSCGNHQEVVS 208
G D + + F C C L K + + S+ I S T L +HQ
Sbjct: 1013 PEGASYDDYNGRKFVCSICSLELGKMDCFMLTSQSVMSGAIFSNATKLLIDTEHHQPPRE 1072
Query: 209 TYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYP 264
+M+ K ++K++YH S + + + L + + + +KEA+ Y + I ++ R+
Sbjct: 1073 VEEMLFKCLEIRKQIYHVKSRKIGEVYDALSRYYIAKDKYKEAVTYAERAINNIFLRLGH 1132
Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
L ++ + G+ AIK++ + +L
Sbjct: 1133 SHSTELAREWAKLANIYINAGEPAKAIKAINTSESLL 1169
>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
Length = 415
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 449
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 13/135 (9%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVR 86
I+I+EI E FS + NAH + + +G G+ P NHSC+PN + + + +
Sbjct: 178 ITIHEILECFSSVLTNAHQFSYATSKEIGRGVCPT-GYFNHSCMPNTTWSLDDQGMLLFS 236
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
+V KG E+ + Y+ R++ L + Y F C CP C + + L GY
Sbjct: 237 TSSNVKKGDELSLGYLANEYPLKNRRRELLDGYYFFCQCPLC---------EFQSNLSGY 287
Query: 147 RCKDDGCSGFLLRDS 161
C + C LL DS
Sbjct: 288 LC--EKCKEPLLNDS 300
>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
Length = 387
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 9/130 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ S+ NHSC PN VF+ + VVRA + +P+G+E+ SY ST R+ L
Sbjct: 207 GLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLY 266
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLV 174
F C C RC +F S + CK + +S DK + C+ CG +
Sbjct: 267 RTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSL 320
Query: 175 RSKEEIKKIA 184
KEE+ I
Sbjct: 321 IKKEEVANIT 330
>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 5 [Callithrix jacchus]
Length = 431
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 320 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 379
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 380 RHKILRENYLFVCSCPKCL 398
>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
[Strongylocentrotus purpuratus]
Length = 396
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 7/79 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETA---GSTM 109
G+ LY + S NHSC+PNA + F L++V A+ + +G EVLISY++ S
Sbjct: 288 GSALYKLQSCCNHSCVPNAEISFLHNNSTLSLV-ALTDITEGQEVLISYLDECCKERSRH 346
Query: 110 TRQKALKEQYLFTCTCPRC 128
+RQK L+E YLF+C C +C
Sbjct: 347 SRQKELRENYLFSCDCSKC 365
>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 419
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
Length = 415
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
paniscus]
Length = 420
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Nomascus leucogenys]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
troglodytes]
Length = 415
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
Length = 419
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Ailuropoda melanoleuca]
gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 501
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 10/91 (10%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVL----VFEGRLAVVRAVQHVPKGAEVLISYI 102
+ ++ P ++P+ + INHSC+PNA + + R+ VV A++ + G E+ ISYI
Sbjct: 408 VVEDKVAPEIVAVFPLTARINHSCVPNAQVQSQEFVDARIDVV-ALRDIAAGEEITISYI 466
Query: 103 ---ETAGSTMT--RQKALKEQYLFTCTCPRC 128
T+GS T R++ L +YLFTC CPRC
Sbjct: 467 GCGRTSGSKSTSRRRRELLAKYLFTCECPRC 497
>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
Length = 416
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
terrestris]
Length = 386
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 5/87 (5%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY S INHSC+PNAV+ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
RQK L YLF C C RC+ DI
Sbjct: 347 RQKVLSSLYLFHCYCNRCLSQVNNPDI 373
>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
Length = 421
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
HHB-10118-sp]
Length = 606
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 14/118 (11%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
S ++ S+ N T+ + L P+G + P +++ NHSC PN V+VF
Sbjct: 248 SAGDLVGLISRFTTNTFTLTSYTLSPVGICISPAMALTNHSCDPNVVIVFPRSSKKVTKE 307
Query: 79 -EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
+ +L +R++ H E++ +Y++ RQ ALKE Y FTC C C G D
Sbjct: 308 PQMQLIALRSILHE---EEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLCKNTGPTD 362
>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
Length = 419
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
caballus]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
mulatta]
gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
Length = 407
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 293 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 352
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 353 RHKILRENYLFVCSCPKCL 371
>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
Length = 302
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+ L
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285
Query: 117 EQYLFTCTCPRC 128
F C CPRC
Sbjct: 286 MTKHFMCQCPRC 297
>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 438
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 48/86 (55%), Gaps = 10/86 (11%)
Query: 49 NSELRPLGTGLYPVISIINHSCLPNA-VLVFEGRLAV----VRAVQHVPKGAEVLISYIE 103
N E R G G+Y S NHSC PN V E L V +RAV+ KG E+ ISYI+
Sbjct: 324 NGESR--GCGVYVRNSFFNHSCNPNVNYWVVENTLEVECSLMRAVR---KGEELCISYID 378
Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCI 129
TA S R++ L E YLF C C +CI
Sbjct: 379 TAASLRDRREKLSEGYLFHCRCEKCI 404
>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
Length = 514
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
I + + + S EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 75 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCACERCVDVTE 306
Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 LDTFY-SAI----KCEDRQCGGLMLPTKTDDWNGNWRCRECHKQVQKHYVERI 354
>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 442
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
G G+Y V + NHSC PN + + L VV A++ V G E+ ISYI+++ R
Sbjct: 357 GQGIYEVGCLFNHSCDPNLSVQYSSLNDETLTVV-ALRDVKAGEELTISYIDSSLPFAVR 415
Query: 112 QKALKEQYLFTCTCPRCIKLGQFD 135
Q+ L + YLF C CPRC+ G D
Sbjct: 416 QQQLLDHYLFECRCPRCVAEGTTD 439
>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 718
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 63/144 (43%), Gaps = 17/144 (11%)
Query: 43 NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
NAH + + L RPL G +YP ++ NH C P F GR V+RA++ +
Sbjct: 491 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 550
Query: 92 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 148
G V +Y I T S R++ L +Y F C C C + +F+ + + RC
Sbjct: 551 RPGDTVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 608
Query: 149 KDDGCSGFLLRDSD-DKGFTCQQC 171
+GCS R D + C C
Sbjct: 609 PTEGCSKLHSRPRDPSRSIECSSC 632
>gi|449550207|gb|EMD41172.1| hypothetical protein CERSUDRAFT_91931 [Ceriporiopsis subvermispora
B]
Length = 329
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 57 TGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G +P++S INH C P+A FE + A + +P G E+ I+Y + RQ
Sbjct: 138 SGCFPILSRINHDCSPSANYFFAFEPYCGQLWAARDIPDGGEITIAYTTLPAPRVDRQAH 197
Query: 115 LKEQYLFTCTCPRC 128
L E+Y FTCTC C
Sbjct: 198 LSERYFFTCTCVAC 211
>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
Length = 1624
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 64/329 (19%)
Query: 7 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
KQ L AQ+ + +L +I E F + N I N + G LY S+I
Sbjct: 1332 KQFLTAAQLEGVSDL----------DILEIFGIMCVNTIHISNDD-DSFGCALYLAPSLI 1380
Query: 67 NHSCLPNAVLVFEGRLAVVRAVQHV-PKG-AEVLISYIETAGSTMTRQKALKEQYLFTCT 124
+HSC PN F+G+ V++ ++ PK A++ ++Y+ + R+K L+E+Y FTC
Sbjct: 1381 DHSCYPNLTATFKGQKIVLKVLRPCEPKTVADLSLAYMPVCTTKERRRKTLREEYYFTCE 1440
Query: 125 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
C C G +L ++D K+
Sbjct: 1441 CEMC----------------------SGKVPEVLSEAD------------------PKLT 1460
Query: 185 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL-- 242
EV L K +L L+S NH++ + + E L KL ++ + R L+ +
Sbjct: 1461 DEVLELEKLSLDLSSPENHRKALKGVE--ELLSTKLKDLDDSDVAKFRAILVAADASVCA 1518
Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA-------IKSMT 295
+ A Y ++PV +RV+ + + +L + T I +
Sbjct: 1519 SSYDRAYNYYSRSLPVMKRVFTENKAEYAYKLVRLARLSTIVVKTSQDPEVFGPLIALLQ 1578
Query: 296 EAVEILRITHGTNSPFMKELILKLEEAQA 324
EAV + RI G + K++ L EE +A
Sbjct: 1579 EAVRVTRIALGEDHSDTKDVALLYEELRA 1607
>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 421
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 97/205 (47%), Gaps = 18/205 (8%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ ++ NH C+PN F+ + + V AV + G E+ +SY + T +R++ LK
Sbjct: 147 GLYPMGALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLK 206
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RD--SDDKGFTCQQCGL 173
F+C C RC D ++ + L C D CSG LL R+ + + + C +C +
Sbjct: 207 VTKRFSCNCNRC-----SDPLEFGSQLSTLLCAKDNCSGCLLPRNPLNHESSWICDKCQI 261
Query: 174 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
V ++ I+ I S +N + T +E++ + E + +L + +M +
Sbjct: 262 SVNYRQVIECIHSGLNTFVSDAMYKTP----REIL---RFTEAILSRLVPATNYIMMDMK 314
Query: 233 EKLIKILMELEDWK-EALAYCQLTI 256
++I + D K E L +L I
Sbjct: 315 YRIISYFGRVPDLKWENLTDTELRI 339
>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
Length = 412
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 298 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 357
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 358 RHKILRENYLFVCSCPKCL 376
>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
Length = 386
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI-- 102
+ S L G+GLY + S +NHSC PNA+ F V++A++ + G E+ SY+
Sbjct: 278 VVGSFLNNEGSGLYILQSSVNHSCTPNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDE 337
Query: 103 -ETAGSTMTRQKALKEQYLFTCTCPRC 128
E S +RQKAL YLF C C +C
Sbjct: 338 CELERSRYSRQKALSSLYLFVCHCDKC 364
>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
Length = 675
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 80/175 (45%), Gaps = 16/175 (9%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G GLYP +++ NHSC P + G VR V+++ G T R++
Sbjct: 463 IGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIAAG---------TRVRREERRET 513
Query: 115 LKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG-CSGFLLRDSDDKGF--TCQQ 170
L QY FTC C C++ F D+ S I +RC+ CS LL S+ F C +
Sbjct: 514 LLNQYRFTCYCEPCMQNWPLFSDMDPSII--RFRCEGGKICSNVLLIPSEINDFMVKCTE 571
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
CG + + K + ++L + L S G ++ + Y +M+ + + L P+
Sbjct: 572 CGEHTNIMKGLKSLQDTDMLFQTATRLHSAGEYEAALMKYVEMMAIMSEVLVPPY 626
>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
Length = 633
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 520 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 579
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+CP+C L + DD
Sbjct: 580 RHKILRENYLFICSCPKC--LAEADD 603
>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
P131]
Length = 582
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299
Query: 115 LKEQYLFTCTCPRC 128
L++ F C C C
Sbjct: 300 LRQTQFFHCECTTC 313
>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
Length = 385
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S +NHSC+PNA + F V+RA++ + E+ SY+E S +
Sbjct: 287 GSGLYILQSAVNHSCMPNASVEFPYSNNTLVLRAIRDIQPEEEICTSYLEECELERSRYS 346
Query: 111 RQKALKEQYLFTCTCPRC 128
RQKAL YLF C C +C
Sbjct: 347 RQKALSSLYLFVCHCDKC 364
>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 140
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 4/99 (4%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEV 97
+ CN I + + G+YP ++ NHSC PN + F G RA++ + G E+
Sbjct: 41 RFPCNNFAIVDDLWSGIAAGVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEI 100
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
SY+E + R+ L Y F C C RC G+ DD
Sbjct: 101 CHSYVELTLPSWKRRDVLLRDYEFLCECERC---GKRDD 136
>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 359
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 19/131 (14%)
Query: 16 ANLVNLILQWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 61
A L+N + P + + + E +L C T+ +EL+ L G G+Y
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVRRLPCGESTMDAFEQEVQMAGNELQHLRQSSRGQGVYT 262
Query: 62 VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
V + NHSC PN +V+ + L+VV A++ + G E+ ISYI+ + S RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPERQQELYE 321
Query: 118 QYLFTCTCPRC 128
YLF C CP+C
Sbjct: 322 HYLFFCQCPKC 332
>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
Y34]
Length = 581
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299
Query: 115 LKEQYLFTCTCPRC 128
L++ F C C C
Sbjct: 300 LRQTQFFHCECTTC 313
>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
Length = 359
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 19/131 (14%)
Query: 16 ANLVNLILQWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 61
A L+N + P + + + E +L C T+ EL+ L G G+Y
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVQRLPCGESTMDAFEQEVQMAGKELQHLLQSSRGQGVYT 262
Query: 62 VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
V + NHSC PN +V+ + L+VV A++++ G E+ ISYI+ + S RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRNIELGEELCISYIDESLSYPERQQELYE 321
Query: 118 QYLFTCTCPRC 128
YLF C CP+C
Sbjct: 322 HYLFVCQCPKC 332
>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 440
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 80/185 (43%), Gaps = 21/185 (11%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG----TGLYPVISIINHS 69
IA+ + + +IS EIA L N H + PL +Y + S+I H+
Sbjct: 184 NIAHFIKRFFKADDISEEEIATIIGILQVNGHEV------PLTDSPYVAVYEMASLIEHN 237
Query: 70 CLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
C N F G L ++RA + KG + I Y + T R+ L + F C C R
Sbjct: 238 CRANCSKSFTDMGGL-IIRAALPITKGDHISICYTDPLWGTANRRHHLFKTKFFECICNR 296
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEIKKIA 184
C +F + +C + CSG++L + ++ + C+ C V S EI+K+
Sbjct: 297 CQDPTEF-----QTMFNALKCNNINCSGYILPKTFLEQEQDYICKICESVVSCTEIEKVL 351
Query: 185 SEVNI 189
++ I
Sbjct: 352 EDIGI 356
>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
Length = 369
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 66/152 (43%), Gaps = 16/152 (10%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP---- 92
F K + N+H+I + +G L +S NHSC P +VF+G +R + VP
Sbjct: 180 FCKASINSHSIHTNAGNEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDA 237
Query: 93 -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
+ ISYI+ S R++ LK ++ F C C RC+ E L +C +
Sbjct: 238 ENTEQAFISYIDVGRSKFVRRRDLKSRWYFDCECTRCMD-------PEDDSLTAIKCANP 290
Query: 152 GCSGFLLRDSDDK--GFTCQQCGLVRSKEEIK 181
C +L ++ C +C V + +K
Sbjct: 291 ACDAPILTSETEEPMNIACDKCKTVVEESTVK 322
>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 322
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 53 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTR 111
RP+G G+YP ++ NH C PNA F+ V V + V KG E+ I Y++ R
Sbjct: 241 RPIGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREER 300
Query: 112 QKALKEQYLFTCTCPRC 128
+ L++ + F C C RC
Sbjct: 301 RGKLRKNFAFECACARC 317
>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
Length = 125
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 43/66 (65%)
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
+S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R+K L++QY F
Sbjct: 16 MSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75
Query: 123 CTCPRC 128
C C RC
Sbjct: 76 CDCFRC 81
>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
Length = 389
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S +NHSC+PNAV+ F V++A++ + G E+ SY++ S +
Sbjct: 294 GSGLYVLQSSVNHSCVPNAVVEFPYSNNTLVLKAIRDIKVGEEICTSYLDECQLERSRHS 353
Query: 111 RQKALKEQYLFTCTCPRC 128
RQ+AL YLF C C +C
Sbjct: 354 RQQALSSLYLFVCHCDKC 371
>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Otolemur garnettii]
Length = 300
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFICSCPKCL 266
>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
[Oreochromis niloticus]
Length = 417
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 7/87 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L + A+ + +G E+ ISY++ S +
Sbjct: 303 GSGLFLLQSSCNHSCIPNAEASFPDNNFLLHLSALCDINQGEEICISYLDCCQRDRSRHS 362
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
R K L+E YLF C+CP+C+ Q D++
Sbjct: 363 RHKILRENYLFVCSCPKCV--SQMDEL 387
>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
Length = 392
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQEGDEICISYLDECMLERSRHS 345
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E Y+F C C +C
Sbjct: 346 RHKVLRENYVFICQCSKC 363
>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
Length = 513
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 59/271 (21%), Positives = 109/271 (40%), Gaps = 35/271 (12%)
Query: 58 GLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
+YP ++++NHSC+ N + ++ RL R +P G E+ SY + TM R+
Sbjct: 216 AIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPVGGELYGSYTHSLLPTMLRR 272
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQ 170
+ L E F C C RC D + + +C D+G L + + C
Sbjct: 273 EHLLEGKNFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTH 327
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
C S ++K+ + + A++ + + + + + K + + HP L
Sbjct: 328 CEFSTSGSAVRKVLQIIQMEVDAVEAISG-ADGADAIQERETVAKKYRSVLHPRHAFLTM 386
Query: 231 TREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
R L ++ +ED+ + + C+L + V + P + + G+ Y
Sbjct: 387 LRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCRLLLQVLDVIEPGYSRIRGMTLYELHAP 446
Query: 281 EWFLGDTE--------NAIKS-MTEAVEILR 302
F T+ A+KS M EA +IL+
Sbjct: 447 LLFTAKTQWNAEVIDKAALKSKMIEASKILK 477
>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
Length = 575
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)
Query: 59 LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LYP ++ H CL N V F+ G VRA + KG + Y T R++
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 168
LKE F+CTC RC D + + L RC D C G+ L D+ + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366
Query: 169 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
+C + + E I +I EV+ + + + N ++S KM+ L YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421
Query: 225 SV 226
SV
Sbjct: 422 SV 423
>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
Length = 391
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G GLY + S NHSC PNA + F V++A+ + G E+ ISY+E S +
Sbjct: 293 GAGLYALQSSCNHSCAPNAEVTFPKNNSTLVLKALHPIKNGEEICISYLEECQRERSRHS 352
Query: 111 RQKALKEQYLFTCTCPRC 128
R K L+E Y+F CTC +C
Sbjct: 353 RLKYLRENYIFDCTCTKC 370
>gi|281206858|gb|EFA81042.1| hypothetical protein PPL_05877 [Polysphondylium pallidum PN500]
Length = 425
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG------RLAVVRAVQHVPKGAEVLISYIETAGSTM 109
G GLYP+ S +NHSC PN + E V RA +++ G E+L +Y + T
Sbjct: 346 GIGLYPIFSCMNHSCQPNVEICNERTDGVTFNKVVFRAKKNIKAGQELLNNYCDVTLPTK 405
Query: 110 TRQKALKEQYLFTCTCPRC 128
RQ LK QY F C C +C
Sbjct: 406 ERQSQLKSQYDFICKCNKC 424
>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Length = 563
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)
Query: 59 LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LYP ++ H CL N V F+ G VRA + KG + Y T R++
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311
Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 168
LKE F+CTC RC D + + L RC D C G+ L D+ + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366
Query: 169 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
+C + + E I +I EV+ + + + N ++S KM+ L YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421
Query: 225 SV 226
SV
Sbjct: 422 SV 423
>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
Length = 359
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 5/77 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
G G+Y V + NHSC PN +V+ + L+VV A++ + G E+ ISYI+ + S R
Sbjct: 257 GQGVYTVGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPER 315
Query: 112 QKALKEQYLFTCTCPRC 128
Q+ L E YLF C CP+C
Sbjct: 316 QQELYEHYLFVCHCPKC 332
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 61/127 (48%), Gaps = 16/127 (12%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GL+ + +INHSC PN G + VRA++ + +G E+ +SYI+ RQK
Sbjct: 162 AVGLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKE 221
Query: 115 LKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS-------GFLLRDSDDKGF 166
L++ F C C RC L + D +L+G+RC CS +LL +DK
Sbjct: 222 LRDTKHFDCQCERCSTPLSESVD----RVLDGFRCP--RCSVKASEEENYLLAQVEDK-L 274
Query: 167 TCQQCGL 173
C C L
Sbjct: 275 VCPDCQL 281
>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 643
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TGL+ + INHSC+PNA F G L V+R+++ + KG E+ SY E+ G RQ AL
Sbjct: 496 TGLWVRAAYINHSCVPNAKREFVGDLMVIRSLRKIKKGEEIFHSYDES-GDYEARQAALM 554
Query: 117 EQYLFTCTCPRC 128
+ F C C C
Sbjct: 555 TTWGFECGCALC 566
>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
dendrobatidis JAM81]
Length = 227
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 37/75 (49%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
+G ++P S NHSC PN + R ++ V KG + ISYI+T R+
Sbjct: 96 MGRAVFPAASYFNHSCFPNCQSIKHDHKMAFRTLKDVSKGEMLTISYIDTNMPVSARRAR 155
Query: 115 LKEQYLFTCTCPRCI 129
L + Y F C C RCI
Sbjct: 156 LMDDYFFECMCERCI 170
>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
B]
Length = 454
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 6/93 (6%)
Query: 43 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAE 96
N + I +S L P G++P+ S NHSC+PNAV+ + E V A++ + +G E
Sbjct: 184 NNNFIVHSHLYPYAHGIFPLASRTFNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEE 243
Query: 97 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
++I Y++ A S R+ AL+ Y F C+C C+
Sbjct: 244 IVIPYLDPALSYAARRDALQTNYGFICSCALCV 276
>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
harrisii]
Length = 609
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIE---TAGSTMT 110
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 496 GSGLFVLQSCCNHSCVPNAETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERSRYS 555
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 556 RHKILRENYLFVCSCPKCL 574
>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
Length = 1033
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 11/156 (7%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQH 90
EI + L N H + ++ P ++ S +SCLPN F + ++ A +
Sbjct: 725 EIMKAVGALQINGHEVPTTD--PPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPRE 782
Query: 91 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
+ K A + I Y + T RQ+ L + LF C C RC+ + + D SAI +C+D
Sbjct: 783 IKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTEL-DTNYSAI----KCED 837
Query: 151 DGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
C G +L D + + C++C K +++I
Sbjct: 838 RQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 873
>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 181
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 1/75 (1%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
LG+GLY +++ NHSC PNA F G R V++++ V G EV SYI+ + RQ
Sbjct: 91 LGSGLYRGVAVTNHSCSPNAEASFGGSRCLRVKSLRPVHAGEEVFQSYIDENLPLVERQS 150
Query: 114 ALKEQYLFTCTCPRC 128
L++ Y F C C +C
Sbjct: 151 KLRQAYGFACRCGKC 165
>gi|395329994|gb|EJF62379.1| hypothetical protein DICSQDRAFT_104880 [Dichomitus squalens
LYAD-421 SS1]
Length = 810
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
TGLYP+ S+ NHSC+ NA+ G + ++RA + VP G E+ I Y S RQ LK
Sbjct: 556 TGLYPLASLFNHSCVANAIWYCIGDVMIIRAAEPVPAGTEITIPY-SVEESYFARQSVLK 614
Query: 117 EQYLFTCTCPRC 128
+ L CTC C
Sbjct: 615 KHMLEHCTCWLC 626
>gi|344303810|gb|EGW34059.1| hypothetical protein SPAPADRAFT_65242 [Spathaspora passalidarum
NRRL Y-27907]
Length = 619
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 2/90 (2%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLIS 100
N + + + + P+G GL P S+INHSC PN +L + + V VP K +E+ ++
Sbjct: 226 NHNILLDQDNEPIGIGLDPDFSLINHSCAPNCILKHKQNWSGFELVNTVPIVKNSELTVT 285
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
YI+T R L+ +Y F C C C K
Sbjct: 286 YIDTCFPKELRMLDLRSRYFFNCKCELCSK 315
>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
Length = 660
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y I T RQ +KE Y F C C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREDRQAKMKELYWFECCCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
F+D+ I +RC+ + C+ + +D C CG V + + K+ +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G++ + + + +I + L PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636
>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
Length = 666
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 8/194 (4%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G +YP +++ NHSC P V F G + V+ V G + +Y I T RQ
Sbjct: 468 IGGAIYPTLALFNHSCDPGVVRYFRGNTIHINTVRPVEAGLPINENYGPIYTQDKREDRQ 527
Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTC 168
LKE Y F C C C++ FDD+ I +RC+ + C+ + +D C
Sbjct: 528 ARLKELYWFECNCDACLENWPLFDDLPRDII--RFRCEAPNNCTAVIEVPPSCNDFMIKC 585
Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 228
CG + + + K+ + ++++ L G++ + ++ + + ++ ++ P +
Sbjct: 586 VTCGELTNILKGLKVMQDTEMMTRTAKRLYDTGDYAKALNKFVDLLRIMYEVLAPPFPDF 645
Query: 229 MQTREKLIKILMEL 242
+ ++ L + L
Sbjct: 646 CECQQHLKDCFLNL 659
>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
Length = 666
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 9/235 (3%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
IA+L+ LQ + + +EI+E N ++ C+ + +G GLYP +++ NHSC P
Sbjct: 423 IASLILRHLQILQFNSHEISELRNLNEEMVTNGIQCHYKSEYIGAGLYPTLALFNHSCDP 482
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+ V G +VR ++ + G + +Y + T+ R+ L+ +Y F C C C +
Sbjct: 483 SIVRYNIGNRMIVRTIKPIKAGEIIYENYGPLYTSMDADERRVTLQNRYWFECYCTPCQQ 542
Query: 131 -LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD--KGFTCQQCGLVRSKEEIKKIASEV 187
F+ + + I G C+ + C DD F C C V K S++
Sbjct: 543 EWPLFEYMDPNQIKIG--CQKENCPFEFTLYKDDLCPYFQCDYCDGVTKIFPSLKGLSQL 600
Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
I+ K L S G +E + + + K P ++++ +++L + + L
Sbjct: 601 AIMLPKAEDLYSAGETREAMKLFMQSLDILYKYSRPPCPDMIKVQQRLKTLFVHL 655
>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
Length = 268
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 41/84 (48%), Gaps = 7/84 (8%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIET 104
L+ +YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 72 LKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAA 131
Query: 105 AGSTMTRQKALKEQYLFTCTCPRC 128
RQ+ L E Y F C C RC
Sbjct: 132 NWRYADRQRRLLEDYGFRCECDRC 155
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 13/121 (10%)
Query: 32 EIAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQ 89
+I + + K+ NAH + + G+YP + +NHSC PN V + + R+++
Sbjct: 908 QILDLYPKMLFNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIR 967
Query: 90 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
+ G E+ +YI+ TRQ L QY F C C RC Q+ A G++C
Sbjct: 968 DIKAGEEITTTYIDITKYKSTRQLNLLSQYAFLCQCARC---------QDRAT--GFKCL 1016
Query: 150 D 150
D
Sbjct: 1017 D 1017
>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
Length = 418
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+G + + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGFFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
Length = 429
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 14/164 (8%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV-QHVPKGA 95
F K++ N+ + NS P+G GL +S NHSC P + + R A++ V +P
Sbjct: 147 FQKVSINSFVVGNSTGNPIGVGLCIKLSAANHSCKPLTRVCYRNRTAMLVPVSSELPSTL 206
Query: 96 E-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
E SYI+ R+ LK++Y F C C C+ D +A +E + C C
Sbjct: 207 EGACHSYIDELMPRDMRRDTLKKKYKFLCQCDGCL------DEDRNARMEAWTC--GICV 258
Query: 155 GFLLRDSDDKGFTCQQCGLVRSKE--EIKKIASEVNILSKKTLA 196
+R+ ++ C+ CG SK+ E+ + A E I ++ LA
Sbjct: 259 KGWMRNKENG--QCELCGWTMSKDHFELCRTAEEAGIAARSRLA 300
>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
Length = 374
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 7/113 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
G LYP + NHSCLPN +G+ +V A + + KG E +I+Y + T +RQ
Sbjct: 261 GMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQ 320
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 165
+EQ+ F CTC RC+K ++ +E+ + K + C SD++G
Sbjct: 321 NHTQEQFQFKCTCERCLK----EEAEENIEFQELELKLEPCPISSATTSDNEG 369
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 12/101 (11%)
Query: 40 LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
CNAH + L P LG G+Y +S+ NHSC P F G VVRA++
Sbjct: 377 FPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFYGDTCVVRAIK 436
Query: 90 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
+ KG +V +Y + + R L+ QY F+C C C
Sbjct: 437 TIRKGHQVSDNYGALYATNTLKERHDKLQPQYFFSCRCEPC 477
>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 737
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/288 (21%), Positives = 108/288 (37%), Gaps = 66/288 (22%)
Query: 39 KLACNAHTICNSELRP-------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
+L CN H I + + T +YP S++NHSC PN + F+ + +V
Sbjct: 422 QLICNGHAITRLNISDSESGNVVTEYQCRIATAIYPSASMMNHSCDPNIINSFKDQYLIV 481
Query: 86 RAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
+A + + EV Y RQ AL+ QY FTC C
Sbjct: 482 KATKDIAAKEEVFNCYGPHYRRMRKKDRQIALQNQYCFTCEC------------------ 523
Query: 144 EGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 201
+ C+ L++ DK F C++C V I+S ++ C
Sbjct: 524 -------EACTQRALQNFSDKFQRFNCEECN------------GPVEIISHSSMRCLDCE 564
Query: 202 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 261
++V + + + KL+ +NL + KEAL + + + +R
Sbjct: 565 TTFDLVKSQLLELEEANKLFEAAKINLKSQKV------------KEALENAKQCLEIRKR 612
Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
+ ++H + L Y GK+ G ++I + ++ + G +S
Sbjct: 613 ILYEYHESVTLTYDLIGKIFAVTGRWLDSISHLEHSLAAVEERFGPDS 660
>gi|384252143|gb|EIE25620.1| hypothetical protein COCSUDRAFT_83627 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 100/226 (44%), Gaps = 20/226 (8%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 112
+G +YP +++NH+C PN F G + A +P G + Y AG ++ R
Sbjct: 171 MGLAIYPRAAMLNHACSPNVAAAFAGTRLHICATSDLPPGTTLRFCYGPQAGESIREVRL 230
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
+ LKEQY F C C C + +QE+A++ G +C+ C+G ++ CG
Sbjct: 231 RQLKEQYHFWCRCSAC--EASYSGVQEAAMI-GMKCRL--CNGPVVPSIG--------CG 277
Query: 173 L-VRSKEEIKKIASEVNILSKKTLALTSCGNHQ----EVVSTYKMIEKLQKKLYHPFSVN 227
+ S+E+I+++A + A S H E + + ++Q+ L +
Sbjct: 278 AEMSSEEQIERMARLSQADAAFASACLSLAQHSAVSAETLQSMHFCLEVQEVLLVTSNQV 337
Query: 228 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
L +T + L + D + A+ + Q + + + Y H ++ Q
Sbjct: 338 LGRTHDALARAYGPRGDTENAMLHAQKALDIVRSNYGDGHLVVAHQ 383
>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
Length = 660
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)
Query: 15 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 73 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
V F G + +V+ + G + +Y I T RQ +KE Y F C C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREERQAKMKELYWFECCCDACID 539
Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
F+D+ I +RC+ + C+ + +D C CG V + + K+ +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597
Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
++++ L G++ + + + +I + L PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636
>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
intestinalis]
Length = 773
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 72/154 (46%), Gaps = 8/154 (5%)
Query: 1 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF-SKLACNAHTICNSELRPLGTGL 59
M+ I+ QL ++ + L + ++ IN + +F ++ C+ T+ ++ + +
Sbjct: 440 MTKIEPGQLTSMGEVIEYL-LHRHYLQVPINGQSISFVTEELCDNVTVTRRDI--VASAF 496
Query: 60 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA--GSTMTRQKALKE 117
+P +S++NHSC N +F G R+ Q +P GAE+ Y + S RQK LKE
Sbjct: 497 FPTMSMMNHSCDCNTDALFNGSTVTFRSNQFIPVGAEITHCYGPSVFHASFEERQKTLKE 556
Query: 118 QYLFTCTCPRCI--KLGQFDDIQESAILEGYRCK 149
Y F C C C K G D L +CK
Sbjct: 557 NYSFDCDCTPCAMHKSGNTDKEALHTQLRAMKCK 590
>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
Length = 439
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 26/149 (17%)
Query: 40 LACNAH----TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHV 91
L CN H T+ +G+ +Y S+ NHSC PN + G L ++ ++ +
Sbjct: 261 LECNTHEIGITVDEYNYCSIGSAVYEKASLFNHSCQPNVCRINRAGEWGALEMI-SLTDI 319
Query: 92 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------------- 137
G E++ +YI+ + T RQ L E Y F C C C+
Sbjct: 320 AAGTELVYNYIQISLPTEDRQSKLSENYFFECKCNGCVNNTSISSRQKQQKQQQKQRQQK 379
Query: 138 ---QESAILEGYRCKDDGCSGFLLRDSDD 163
+ + L+ Y C D C+G L+ D++D
Sbjct: 380 QSNSQKSFLKKYLCTRDTCNGVLITDAND 408
>gi|159465319|ref|XP_001690870.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279556|gb|EDP05316.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1470
Score = 61.6 bits (148), Expect = 6e-07, Method: Composition-based stats.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 19/126 (15%)
Query: 28 ISINEIAENFSKLACNAHTICNSELR---PLG-TGLYPVISIINHSCLPNAVLVFEGRLA 83
+++N + E+F LA +C ELR P G GL+P + HSC P A G
Sbjct: 1174 VNLNCMGEDFQDLA-----LC--ELRGEAPRGHIGLWPEAAFAAHSCAPTATAYSIGDRL 1226
Query: 84 VVRAVQHVPKGAEVLISYIETAGSTMT-----RQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
++RA +PKG EV ++++ GS +T R+ L+ QY FTC C RC +
Sbjct: 1227 LIRAAAEIPKGGEVSLNFL---GSLLTSPLSVRRAELRSQYGFTCGCSRCAAEARHTGTP 1283
Query: 139 ESAILE 144
+A++E
Sbjct: 1284 LAALVE 1289
>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 627
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 5/100 (5%)
Query: 36 NFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVP 92
N KL NA +I + + PL LYP IS+ NHSC N V R+ V++AVQ +P
Sbjct: 441 NARKLNINAPSISHQQFTFPLALTLYPTISLFNHSCDGNIKRSGVISDRIRVMKAVQPIP 500
Query: 93 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
KG ++ +Y + +RQ A +++ F C C CIK
Sbjct: 501 KGTQLCCNYGIMFKEHDKESRQSACNDRFNFNCYCDPCIK 540
>gi|159110558|ref|XP_001705534.1| Hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
gi|157433620|gb|EDO77860.1| hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
Length = 409
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 8/104 (7%)
Query: 34 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAVV 85
A+ F+ C N+++I + + +G GLY +IS NHSC PNA ++F R A +
Sbjct: 300 AQMFNYFVCILLTNSYSIEDKTGQEIGAGLYSLISCCNHSCTPNAQVIFGDSENAREATL 359
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
++ + E+ ISYI G ++ ++ Q+ FTC C RC+
Sbjct: 360 VLLRPCAQKEELYISYITDLGRSVVERRRELAQWCFTCQCTRCL 403
>gi|361129398|gb|EHL01305.1| putative SET domain-containing protein 5 [Glarea lozoyensis 74030]
Length = 286
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 11/103 (10%)
Query: 36 NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA-----VV 85
NF +I S PLG G++ IS INH CLPNAV + G L V
Sbjct: 49 NFPGQGNPLTSIIRSNGYPLGADAGCGGVFENISRINHGCLPNAVQNWNGLLGEEGEETV 108
Query: 86 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + +G E+ SY+ + G++ R+ LK+ + F CTC C
Sbjct: 109 YAIKDIKEGEEITTSYL-SGGTSKERRAVLKQSFGFDCTCKLC 150
>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
Length = 527
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 21/192 (10%)
Query: 16 ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
ANL++ I +WP++ I IA + L NA + + R GL+P ++ +H
Sbjct: 200 ANLISFIKTVLGLREWPDLDILRIA---AILDTNAFEVRQAGDRIKVRGLFPGGAMFSHD 256
Query: 70 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
C+PN F+ + ++ A + + KG + ISY + ST+ R+ LK+ F C C RC
Sbjct: 257 CVPNMRHRFDDDMNIMFLAKRPIAKGEILSISYTQQLRSTIQRRLHLKQVKCFECACARC 316
Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCGLVRSKEEI----KK 182
D + + C F+ D + + C+ C L+R +E K
Sbjct: 317 A-----DPTELGTYAGAHMCAKCKVGKFISMDPLQNAANWRCEVCNLIRPAKEYLINDAK 371
Query: 183 IASEVNILSKKT 194
I +E+ L K T
Sbjct: 372 IEAELESLDKTT 383
>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
Length = 394
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 9/102 (8%)
Query: 32 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQ 89
EI ++K+ A N+E G+GLY + S INHSC+PNA F + V++A+
Sbjct: 266 EIDNLYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALS 321
Query: 90 HVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
+ +G E+ ISY++ S +R K L Y+F C CP+C
Sbjct: 322 PIQEGDEICISYLDECQLERSRHSRHKILCGNYIFICQCPKC 363
>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
Length = 387
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 104
+ S L G+ LY + S +NHSC PNA F + + VQ + G E+ ISY++
Sbjct: 276 VVGSFLNNEGSALYSLQSKVNHSCAPNAECQFPHSNNVLALTTVQDIAAGEEICISYLDE 335
Query: 105 AG---STMTRQKALKEQYLFTCTCPRC 128
S +RQK L+E YLF C C +C
Sbjct: 336 CALERSRHSRQKMLRENYLFQCQCEKC 362
>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Otolemur garnettii]
Length = 416
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFICSCPKCL 382
>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
Length = 430
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 8/117 (6%)
Query: 42 CNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
CN+ +I N + + P+G+GLY ++ NHSC + +VFEG +R Q E+ IS
Sbjct: 159 CNSFSIINEKRVEPIGSGLYVGVAKHNHSCASTSHVVFEGNQVFLRTNQE-EYSKELTIS 217
Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
Y+ T R+K ++ + TC C C K + D I S+ +C+ C+GF+
Sbjct: 218 YVSRMLPTSERRKTIRGVHFLTCQCEMC-KNEELDLIGLSS-----KCRTKNCTGFV 268
>gi|320168174|gb|EFW45073.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 291
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 46/76 (60%), Gaps = 4/76 (5%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR---LAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
G+GL+ + + +NHSC PN V + L+VV AV +P EV ISYI+T RQ
Sbjct: 213 GSGLFEIGNTMNHSCQPNVVSMTRATDFTLSVV-AVATIPVNTEVCISYIDTDLPKAKRQ 271
Query: 113 KALKEQYLFTCTCPRC 128
AL+E Y F+C+C +C
Sbjct: 272 AALEELYYFSCSCAKC 287
>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 712
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)
Query: 42 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLIS 100
CN + +S+ +G ++P S NHSCLPN V + G +A A+Q + KG + I
Sbjct: 559 CNNFGLFSSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIAAFYALQSIRKGEPLTIC 618
Query: 101 YIETA-GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
Y++ ST R++ L Y F C C RC D ++ + E C G+L
Sbjct: 619 YVDVQEASTAERRRTLLTSYRFFCECRRCHGYSDEDGKGDTVMSEIRFCDTCDARGYL 676
>gi|406863858|gb|EKD16905.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 739
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 6/91 (6%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
+GL+ + S INHSC PN F G + +VRA + +P E+ +SYIET Q+ L
Sbjct: 508 SGLWILASYINHSCEPNCRRAFIGDVQIVRAARDMPADTEITLSYIETDDPAKMNQR-LF 566
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ + F CTC C+ DD + A ++ R
Sbjct: 567 DGWGFDCTCAMCV-----DDRETPAAVKNRR 592
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 11/173 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
I + + + + EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 75 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
F + ++ A + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRCVDVTE 306
Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354
>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
Length = 513
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
I + + + + EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 75 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306
Query: 134 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
D SAI +C+D C G +L D+ + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADEWNGSWRCRECHKQVQKHYVERI 354
>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
vinifera]
Length = 405
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294
>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
Length = 417
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 58 GLYPVISIINHSCLPNAVLVFE-GRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
++P S +NH+C PNA V + L+ VR + + +G E+ ISY T TRQ L
Sbjct: 234 AVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHL 293
Query: 116 KEQYLFTCTCPRC------IKLGQFDDIQ 138
+ + FTCTCPRC I L +D+Q
Sbjct: 294 AQGFHFTCTCPRCTSPITDITLSIINDLQ 322
>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
Length = 405
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294
>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
Length = 382
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAG---STMT 110
G+GLY + S NHSC PNA F G + ++A++++ G E+ ISY++ S +
Sbjct: 283 GSGLYQLQSACNHSCAPNAESSFPYGNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHS 342
Query: 111 RQKALKEQYLFTCTCPRCI 129
RQ+ L E YLF C C RC+
Sbjct: 343 RQRELSENYLFVCCCERCM 361
>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 691
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 51 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGST 108
++ + TG+YP S++NHSC PN + ++ + +VRAV+ + E+ SY+ T T
Sbjct: 403 QINIVATGIYPSASMMNHSCNPNIIKIYMDQYLIVRAVEDIFPTEEIFNSYVATYRYKKT 462
Query: 109 MTRQKALKEQYLFTCTCPRC 128
RQK L E Y F+C C C
Sbjct: 463 KARQKLL-ELYYFSCKCEAC 481
>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 514
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 61/303 (20%), Positives = 122/303 (40%), Gaps = 35/303 (11%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 80
+ ++ +E ++AC I E+R LYP ++++NHSC+ N + +
Sbjct: 183 LKLDRFSEECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDY 242
Query: 81 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
R+ + R +P E+ SY + T+ R++ L E F C CPRC D + +
Sbjct: 243 RIRL-RTTLKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRC-----SDPTELA 296
Query: 141 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
+ +C D+G L + + C C + + ++K+ + A++
Sbjct: 297 THMSSLKCNKCDNGIVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAIS 356
Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 248
+ + + + + + + + HP L R L ++ ++++ +
Sbjct: 357 G-ADGADAIHERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 415
Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD--------TENAIKS-MTEAVE 299
+ C+L + + V P + + G+ Y FL E +KS MTEA
Sbjct: 416 VDMCRLLLQILDVVEPGYSRIRGMTLYELHAPLLFLAKGQWNADVIDEAGLKSKMTEAAN 475
Query: 300 ILR 302
IL+
Sbjct: 476 ILK 478
>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
Length = 507
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ FTC C RC+ +LG F ++ EGY+ D ++C C
Sbjct: 267 KQGKFFTCQCDRCLDPTELGTHFSSLKCGQCAEGYQVPRQPTEA-------DTSWSCASC 319
Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S E ++ + SEVN + + + ++ Y K L HP
Sbjct: 320 GSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEEVERLLRKY-------KSLLHPLHFI 372
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|145347416|ref|XP_001418163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578392|gb|ABO96456.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 639
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 42/67 (62%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
S+ N TI + +++ +G G+YP S+ NHS PNA ++F+G+ VV+ ++ + G E
Sbjct: 218 LSRFEINGFTIADDDMQRVGFGIYPEASLFNHSSTPNAQVMFKGKTLVVKTLREIAVGEE 277
Query: 97 VLISYIE 103
+ ISY E
Sbjct: 278 ITISYGE 284
>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 976
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 761 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKAGEEITFAYFDVL-SPLEKRKEM 819
Query: 116 KEQYLFTCTCPRC 128
E + F C C RC
Sbjct: 820 AESWGFRCGCSRC 832
>gi|393233731|gb|EJD41300.1| hypothetical protein AURDEDRAFT_186645 [Auricularia delicata
TFB-10046 SS5]
Length = 293
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)
Query: 43 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 95
N + + +S L PL +YP S +NHSC NAV +F R+ VV V+ V G
Sbjct: 83 NNNFVLHSHLTPLAAAVYPAASRSLNHSCASNAVPLFVFAPATPPRMEVV-LVRDVAPGD 141
Query: 96 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
E+ I YI+ A + R++ L+ Y F C C RCI
Sbjct: 142 EITIPYIDPALAPSARRERLRASYGFECACARCI 175
>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
Length = 529
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 91/196 (46%), Gaps = 29/196 (14%)
Query: 16 ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
ANL+ I WPE+ I IA + L NA + + R ++P ++I H
Sbjct: 200 ANLITFIKTVLGQRDWPELEILRIA---AILDTNAFEVRQNGERRKVRAIFPGGAMIAHD 256
Query: 70 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
C+PN F+ + ++ A + +PKGA + ISY + ST+ R+ LK+ F C C RC
Sbjct: 257 CVPNLRHRFDDNMRIIFLAKRPIPKGAILSISYTQPLRSTVQRRVHLKQVKCFDCACERC 316
Query: 129 ---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEI-- 180
+LG F A L G +CK G ++ + + + CQ C + +S E+
Sbjct: 317 SDPTELGTF----AGAHLCG-KCK----VGKVISQNPLENAANWHCQLCNVKKSAREVLT 367
Query: 181 --KKIASEVNILSKKT 194
++ E+ L K T
Sbjct: 368 QDARLQQEIESLDKTT 383
>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
Length = 398
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETA-GSTMTRQ 112
G+GLY + S NH C PN + F A V+A++++ +G E+ ISYI++ + RQ
Sbjct: 298 GSGLYLLQSCCNHDCSPNVEINFLDNNATLTVKAIRNISEGQELCISYIDSDIKNWKKRQ 357
Query: 113 KALKEQYLFTCTCPRCI 129
L E YLF CTC RC+
Sbjct: 358 AILMENYLFECTCNRCM 374
>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
Length = 548
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 1/90 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
KL NA + G L +++INHSC+PNA++ F GR A +R+ + G+E+
Sbjct: 183 KLQTNAFSRSEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRSASFIHPGSEIE 242
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
ISYI+ RQ L Y F C C +C
Sbjct: 243 ISYIDQTQPKSRRQHELS-LYHFECHCSKC 271
>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
TFB-10046 SS5]
Length = 373
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)
Query: 43 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 95
N + + +S L PL +YP S +NHSC NAV +F R+ VV V+ V G
Sbjct: 163 NNNFVLHSHLTPLAAAVYPAASRALNHSCATNAVPLFVFAPATPPRMEVV-LVRDVAPGD 221
Query: 96 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
E+ I YI+ A + R++ L+ Y F C C RCI
Sbjct: 222 EITIPYIDPALAPSARRERLRASYGFECACARCI 255
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 214 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGREVCLSYFPVNMNYSS 273
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y F C C RC
Sbjct: 274 RQKRLLEDYGFKCDCDRC 291
>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
Length = 404
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)
Query: 58 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273
Query: 111 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 134
RQ+ L E Y F C C RC Q+
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEG 333
Query: 135 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 180
DD + Y C ++GC G L L S + F C CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393
>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
Length = 445
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 104/231 (45%), Gaps = 29/231 (12%)
Query: 32 EIAENFSKLACNAHTICNS-------ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
E +N +A +CNS + P+G+G+Y ++ NHSC +VF+G AV
Sbjct: 163 ESMDNNKMIAIICSVLCNSFGIIAEKRVEPIGSGMYVGLATHNHSCASTCHVVFDGNQAV 222
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
+R+ + ISY+ T RQK+++ + TC C C+ +D+ + +
Sbjct: 223 LRS-RDRQYCKNTTISYVSRMLPTFERQKSIRNVHFITCRCEMCLN----EDLDLTGL-- 275
Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLV---RSKEEIKKIASEVNILSKKTLALTSCG 201
+C+ C GF ++ ++ TC + ++ +S + K+ + L K
Sbjct: 276 ASKCQTSKCQGF-VKGANGSCTTCGKPAMIPFEQSTQSTSKLLDTLENLHK--------S 326
Query: 202 NHQEVVSTYKMIEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
+ V Y+ ++ LQ+ ++ +V ++Q E+L E+++ E +
Sbjct: 327 QQLDTVQEYRHLQNLQEEYGRILADCNVAILQLDEQLAYCASEMKNVPEDM 377
>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 647
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 11/102 (10%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAV-------VR 86
S+ N+ T+ + L +G P + INHSC PNAV+VF EG + V
Sbjct: 277 SRFIDNSFTLTSIILDQIGVVFVPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVI 336
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++ + G E++ SYI++AG+ R+ L ++Y F C C C
Sbjct: 337 AIKPIEPGEEIVTSYIDSAGTRQERRNELVKRYKFVCDCQAC 378
>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 499
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 19/167 (11%)
Query: 24 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 81
+WP ++EI + N H + ++ P +Y S++ HSC+PN F +G
Sbjct: 168 EWP---VDEILRVCGIVQVNGHEVPLTD--PPYVAIYDAGSMLEHSCVPNCSKSFTRDGH 222
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
L + A V G + ISY + T R L + F C CPRC D +
Sbjct: 223 LLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHLADTKFFVCKCPRCS-----DPTELGT 277
Query: 142 ILEGYRCKDDGCSGFLL-------RDSDDKGFTCQQCGLVRSKEEIK 181
G +C + C G+ L D D + C C V I+
Sbjct: 278 YFSGVKCATEDCMGYSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIE 324
>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
Length = 383
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
G+ Y + S++NHSC P+ + F +A A +H+ +G ++ +SY++ R++
Sbjct: 300 GSAAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQ 359
Query: 114 ALKEQYLFTCTCPRCIK 130
AL Y FTC CPRC++
Sbjct: 360 ALAWAYGFTCRCPRCVE 376
>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
Length = 490
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 9/145 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ ++ NH C+PN+ F+ + + V A + G E+ +SY + T TR++ L+
Sbjct: 216 GLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPIAAGEEITMSYTDLLWDTSTRRQFLR 275
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
F+C C RC D ++ + L C D C G LL + + + C +C
Sbjct: 276 ITKRFSCNCNRC-----SDPLEFGSRLSALLCAKDECLGHLLPRNPLNYESSWICDKCQT 330
Query: 174 VRSKEEIKKIASEVNILSKKTLALT 198
+ +I+ I S +N + T
Sbjct: 331 SVNHRQIECIHSGLNTFVSNVMYKT 355
>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
Length = 404
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)
Query: 58 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273
Query: 111 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 134
RQ+ L E Y F C C RC Q+
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEDEDDAGDRGDDGIEEEEG 333
Query: 135 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 180
DD + Y C ++GC G L L S + F C CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393
>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
Length = 786
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 7/117 (5%)
Query: 44 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 103
AH+ N E +G +Y S+ NHSC PN F R +R V G+++ +SY
Sbjct: 462 AHSTSNVEQVRVGKAIYKAGSLFNHSCQPNVHAYFLSRALYLRTTNVVAAGSQLELSYGP 521
Query: 104 TAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
G R LK +Y F C C C ++ + D +L + C + CSG +L
Sbjct: 522 QVGLWDCKDRLNFLKNEYAFHCLCTGCSEVNRSD-----LVLNAFHCVNPNCSGAVL 573
>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
Full=ASH1-related protein 2; AltName: Full=Protein SET
DOMAIN GROUP 39
gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 398
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296
>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Apis mellifera]
Length = 444
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
+ TG+YP SI+NHSC PN + +F + +VRA + + +G E+ Y +T RQ
Sbjct: 373 VATGIYPSASIMNHSCDPNIINIFVNQYLIVRASRDISQGEEIFNCYGPHYRHMTTENRQ 432
Query: 113 KALKEQYLFTC 123
K LK QY F C
Sbjct: 433 KILKNQYCFIC 443
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)
Query: 13 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
A I + + + S E+ L N H + S+ P ++ S HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244
Query: 73 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
N F + ++ A + + K + + I Y + T RQ+ L + LF C C RC +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
+ SAI +C+D C G +L D + C++C K I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)
Query: 13 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
A I + + + S E+ L N H + S+ P ++ S HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244
Query: 73 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
N F + ++ A + + K + + I Y + T RQ+ L + LF C C RC +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304
Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
+ SAI +C+D C G +L D + C++C K I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 1/92 (1%)
Query: 39 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
KL NA + G L +++INHSC+PNA++ F GR A +RA + G E+
Sbjct: 183 KLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIE 242
Query: 99 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
ISYI+ R L + Y F C+C +C K
Sbjct: 243 ISYIDQTQPRGKRHGEL-DLYHFECSCYKCQK 273
>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 10/183 (5%)
Query: 24 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 83
+W E +EI + N H + +E P +Y + S++ HSC PN F R
Sbjct: 200 KWDE---DEILRVVGIIQVNGHEVPMTE--PSSVAIYNMASMLEHSCRPNLAKSFTNRGE 254
Query: 84 VVR-AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
VV A + +G + I Y + +T R + L++ +F C C RC +++ +
Sbjct: 255 VVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCSDRTEYETYFSAVR 314
Query: 143 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 202
G++ KD C G+LL + + QC +R +E+ A+ IL + + L +
Sbjct: 315 CSGFQ-KDSKCKGYLLPVDNAQWTGEWQC--LRCSKEVSG-ATVCQILERARMDLEAMEK 370
Query: 203 HQE 205
H+E
Sbjct: 371 HRE 373
>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 779
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 41 ACNAHTICNSELRPLGT-------GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 93
+C T+ N+ RP G G++P +I+NH+C+ N+ F G L + RA + +P
Sbjct: 459 SCPPLTVDNTRPRPEGVPEPPQTRGIWPHAAIMNHACVANSTRAFCGDLFITRASRDIPA 518
Query: 94 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G E+ Y+ RQ L+ + F C C C
Sbjct: 519 GDEITQQYVPVRADWGERQAQLRHWWGFECACALC 553
>gi|224029645|gb|ACN33898.1| unknown [Zea mays]
Length = 369
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 51/111 (45%), Gaps = 7/111 (6%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 106
+C+ E + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 122 MCSVEQVRVAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVG 181
Query: 107 STMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
RQK+L+E Y F+C C C +L D ++ + C C G
Sbjct: 182 EMHISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCLG 227
>gi|302836969|ref|XP_002950044.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
nagariensis]
gi|300264517|gb|EFJ48712.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
nagariensis]
Length = 430
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
GL+P ++ NHSC PNAV V GR +R V G EV I+Y G R+ L
Sbjct: 286 GLWPAHAMFNHSCAPNAVAVVAGRELHIRCSGPVSAGEEVCITYSGALGLGPLPLRRALL 345
Query: 116 KEQYLFTCTCPRC 128
++ + F CTCPRC
Sbjct: 346 EKNHRFRCTCPRC 358
>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
Length = 327
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 148 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 207
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y FTC C RC
Sbjct: 208 RQKRLLEDYGFTCYCDRC 225
>gi|256071345|ref|XP_002572001.1| set and mynd domain containing [Schistosoma mansoni]
gi|350645143|emb|CCD60151.1| set and mynd domain containing, putative [Schistosoma mansoni]
Length = 527
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 47 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY---IE 103
IC + + TGL+P +S++NHSC PN FE ++R ++ + G EV Y
Sbjct: 384 ICKFQQIRIATGLFPCVSLLNHSCDPNTAHNFEESFLILRCLKPILPGTEVFHCYGPHYL 443
Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIK 130
S++ R L++QY F C C C K
Sbjct: 444 HYPSSVQRVTLLQQQYFFICDCEHCSK 470
>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 466
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 20/129 (15%)
Query: 28 ISINEIAENFSKLACNAHTICNSELRPLGT-------------GLYPVISIINHSCLPN- 73
+S + + ++ NAH I + L P+G GLY +++ NHSC+PN
Sbjct: 329 LSFDTLRTLVGQMVLNAHAINDYVLLPVGAPSPSAFDWVLKGAGLYSLLACFNHSCVPNV 388
Query: 74 AVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCI 129
AV +G +V + + + G + I+YI + M+ RQ+ L+ Y FTC CPRCI
Sbjct: 389 AVSNVDGTHEIVLKTTRPIQAGEPLTITYIPLSAGAMSRAERQRQLR-NYFFTCHCPRCI 447
Query: 130 K-LGQFDDI 137
K G DI
Sbjct: 448 KEAGVAGDI 456
>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
Length = 786
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 84/309 (27%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
GTGLY + S NHSC+PNA F L V+RA Q + + E+ ISY A S +R+K L
Sbjct: 522 GTGLYLLASYFNHSCIPNAGHQFFSDLMVIRATQSIKQDEEITISYCSHA-SYASREKNL 580
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
K + C C CI DD +G R
Sbjct: 581 KP-WFDQCDCQLCI--------------------DDRLAGSNRRS--------------- 604
Query: 176 SKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQT 231
++E + K + + L + AL Q++VSTY E+++ +L YH FS L Q
Sbjct: 605 ARESLSKAVRNAATPLPQARDAL------QKMVSTYSSQERVRPELSTAYHAFSHRLQQM 658
Query: 232 REKLIKILMELE---------DW----------KEALAYCQL-TIPVYQRVYP------- 264
+ + E +W E + L T+P+ R P
Sbjct: 659 AHEFVSKAAATEAIEKEIQSLEWLGVEVTEKGITEVSDHSHLPTLPISTRRIPNTFGEPD 718
Query: 265 -QFHPLLGLQYYTCG----KLEWFLGDTENAIKSMTEAVEILRITHG----TNSPFMKEL 315
QF + G+ + G + WF + K++ + + + TH P + E
Sbjct: 719 MQFLMIFGM-FVLLGCPNRAMSWFRAAIWMSEKTIGGGIPMFKRTHSDIIRVLGPVLAEF 777
Query: 316 ILKLEEAQA 324
+ K+E +A
Sbjct: 778 LSKIESERA 786
>gi|440800514|gb|ELR21550.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 333
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
G GL+P+ +++NHSC PN + F R AVV A+ + +G E+ SYIE RQ
Sbjct: 254 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEGEELTHSYIENDRPLAERQAD 313
Query: 115 LKEQYLFTCTCPRCIK 130
L E Y F C C RC+K
Sbjct: 314 LLE-YNFVCQCVRCLK 328
>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
Length = 530
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 83/181 (45%), Gaps = 22/181 (12%)
Query: 10 LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
+L A + + +L WPE+ I IA + L N + R LYP ++I
Sbjct: 199 VLRANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMI 255
Query: 67 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
+H C+PN F+ + +V A + + KG + ISY + ST+ R+ L++ F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315
Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
RC +LG F Q +CK +G ++ + + + CQ C RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKE 366
Query: 180 I 180
+
Sbjct: 367 V 367
>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
Length = 690
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 5/128 (3%)
Query: 6 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC----NSELRPLGTGLYP 61
E+ L L + +++ E+ + + + + CNA + + ++ TGL+
Sbjct: 452 ERILGLQGDYKGIGQKLIELDEVPVIDTFQIHDIVQCNAFGLGQQTEDEDISNASTGLWV 511
Query: 62 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
S INHSC+PNA G L + RA + + G E+ +Y E+ S RQ A + + F
Sbjct: 512 RASYINHSCIPNAKKDLIGDLILFRATRRIASGEEITHAYDEST-SYEARQAAFRRTWNF 570
Query: 122 TCTCPRCI 129
C CP C+
Sbjct: 571 ECRCPLCL 578
>gi|169606119|ref|XP_001796480.1| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
gi|160706916|gb|EAT87158.2| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
Length = 323
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 21/155 (13%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
GL+P I+ INHSC PNA + L +V A + + KG E+ +SYI + RQK L
Sbjct: 117 GLFPKIARINHSCRPNASYYWSQTLNKRIVYATRRIAKGDEIFVSYIPLLLTQEQRQKHL 176
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
++Y F CTC C QE A +E DD R + K F + L
Sbjct: 177 -DRYGFKCTCEAC--------AQEHAAME---ASDD------RRTTISKAFVDFEPQLTL 218
Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
+ + KK + ++ +L L H+E ++ Y
Sbjct: 219 TSPKTKKATQQARKNARASLKLAELV-HEEGLADY 252
>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 393
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 63 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
IS NHSC PN V V E + + ++ + KG E+ ISYI+ S R LKE Y F
Sbjct: 273 ISYFNHSCFPNCVRVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332
Query: 123 CTCPRCI--KLGQFDDIQESAILEGYRCKDDG--CSGFL 157
C C RC L ++ +E Y CK+ C+G L
Sbjct: 333 CKCKRCTLPPLSNL-SLEIEKTIENYTCKNQSIKCTGIL 370
>gi|325188967|emb|CCA23495.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 497
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GT L+ +I +NHSCLPN ++V+ A V A+Q + E+ I YI+T S RQ
Sbjct: 415 GTALFSLICTMNHSCLPNCIVVYGHNGEAHVHAIQAIQPKDELCIEYIDTDRSYDERQYE 474
Query: 115 LKEQYLFTCTCPRCIK 130
L+E Y F C C +C++
Sbjct: 475 LRE-YHFRCQCMKCMR 489
>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
Length = 501
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 85/195 (43%), Gaps = 15/195 (7%)
Query: 59 LYPVISIINHSCLPNAVLVFE----GRL----AVVRAVQHVPKGAEVLISYIETAGSTMT 110
L+P ++I++H+C+ N + G L VRA +PKG E+ SY + T+
Sbjct: 198 LFPKLAILSHNCVSNIHHAVDCNGDGELKDCVVTVRAAVDIPKGGELFSSYTFSLWPTLV 257
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
R++ L+E F CTCPRC + D S L+ RC D+G D + C
Sbjct: 258 RREFLRESKFFDCTCPRCSSPTELDTHMSS--LKCQRC-DNGVIISTNTADDACEWKCTH 314
Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
C + ++K+ + + + + G Q + + K + L HP + +
Sbjct: 315 CDYKTNGRAVRKVFAAIQ---NEIDQVEYIGGPQGIQQRETIFRKYRSVL-HPNNSYMTT 370
Query: 231 TREKLIKILMELEDW 245
R LI++ + E +
Sbjct: 371 LRSALIQLYGKTEGY 385
>gi|281212379|gb|EFA86539.1| hypothetical protein PPL_00338 [Polysphondylium pallidum PN500]
Length = 468
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
G G++ + ++INHSC+PN ++VFE LA ++A++ + G E+ SYIE + R +
Sbjct: 373 GFGVFGLQAMINHSCVPNCLVVFEQGSSLAYIKALRDIVPGEELFHSYIEESAPFEERSQ 432
Query: 114 ALKEQYLFTCTCPRC 128
L Y F C CP+C
Sbjct: 433 ELV-TYGFNCDCPKC 446
>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
Length = 383
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 7/88 (7%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
G L+ + S NHSC+PNA + +L++V AV+ + +G E+ ISY++ S
Sbjct: 284 GVALFTLQSACNHSCVPNAEPTYLHNNNKLSLV-AVRDIQEGEEICISYLDECNLHRSRH 342
Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDI 137
+R+K L E YLF C CP+C + D+
Sbjct: 343 SRRKELMENYLFACNCPKCEEQADEPDV 370
>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
Length = 427
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 128 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 187
Query: 115 LKEQYLFTCTC 125
L++ F C C
Sbjct: 188 LRQTQFFHCEC 198
>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 341
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278
Query: 111 RQKALKEQYLFTCTCPRC 128
RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296
>gi|328855353|gb|EGG04480.1| hypothetical protein MELLADRAFT_108480 [Melampsora larici-populina
98AG31]
Length = 398
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 9/108 (8%)
Query: 25 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR--- 81
W S++ I+ L NA +I +++L ++P +S INH C + V+ R
Sbjct: 171 WQNQSLSNISNYIRILQTNAISIGSNQL-----AIFPSLSRINHGCAGASNSVYNWREKE 225
Query: 82 -LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ VV A + + G E+LISY ++ S RQ LK Y F CTC C
Sbjct: 226 GVEVVHATKLIEVGEEILISYWDSKRSRSDRQDYLKSNYGFQCTCQTC 273
>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 438
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
G G+Y V S+ NHSC PN ++ VV A++ G E+ ISYI+ + RQ
Sbjct: 350 GQGVYAVGSLFNHSCEPNLQVLNSATGDETLVVEALRDCEPGEELYISYIDESLPYAIRQ 409
Query: 113 KALKEQYLFTCTCPRCIK 130
+ L+E YLF C C +C++
Sbjct: 410 QQLREHYLFECRCSKCVR 427
>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 558
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 38/76 (50%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
LG P++ NHSC PN ++F ++RA + K E+ + Y++ RQ
Sbjct: 221 LGIAFDPILCSANHSCEPNTAVIFNQPQLILRAQTKIKKDEEIFMKYVDITNPFSVRQAE 280
Query: 115 LKEQYLFTCTCPRCIK 130
L++ Y F C C +C K
Sbjct: 281 LRQSYFFGCRCTKCRK 296
>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
Length = 338
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G++P +S NH C PN++ + L AVV V+ V +G E+ +SY+E + RQ+ L
Sbjct: 119 GIFPKVSRFNHGCRPNSMRSYHPVLDQAVVHVVKDVSEGEEITVSYVEPGLAFYLRQEQL 178
Query: 116 KEQYLFTCTCPRCI 129
KE++ F C C C+
Sbjct: 179 KEKFGFICGCNLCL 192
>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
Length = 662
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
GL+ + +INHSC+PNA + G + VRA + + G EVL+SY+E A R+K +
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 504
Query: 116 KEQYLFTCTCPRC 128
+QY F C+C RC
Sbjct: 505 -QQYNFECSCERC 516
>gi|426226456|ref|XP_004007359.1| PREDICTED: SET and MYND domain-containing protein 5, partial [Ovis
aries]
Length = 391
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L V A++ + G + ISY++ S +
Sbjct: 278 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGERLGISYLDCCQRERSRHS 337
Query: 111 RQKALKEQYLFTCTCPRCI 129
R K L+E YLF C+CP+C+
Sbjct: 338 RHKILRENYLFVCSCPKCL 356
>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
Length = 507
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ FTC C RC+ +LG F ++ EG++ D + C C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 319
Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 320 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 551
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 28 ISINEIAENFS---KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
I++ + E+F KL NA + G L +++INHSC+PNA++ F GR A
Sbjct: 171 ITLKGLEESFQVLCKLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNAMVQFGGRTAT 230
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+R+ + G E+ ISY + RQ L + Y F C+C +C
Sbjct: 231 LRSATFINPGDEIEISYADHTQPKGKRQGEL-DLYHFECSCYKC 273
>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
Length = 969
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 754 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 812
Query: 116 KEQYLFTCTCPRC 128
E + F C C RC
Sbjct: 813 AESWGFCCGCSRC 825
>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 9/91 (9%)
Query: 52 LRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYI---ETA 105
L G GLY S NHSC PNA + F RL+++ A++ + G E+ ISY+ +
Sbjct: 280 LNSEGVGLYARQSCANHSCDPNAEISFVHNNYRLSLI-ALKDIQPGEEICISYLGDCDNE 338
Query: 106 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
S +R+K L E YLF C C +C L + DD
Sbjct: 339 RSRHSRRKTLMENYLFACECSKC--LAEIDD 367
>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
Length = 514
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
I + + + + +I + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEQIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 75 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306
Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354
>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
Length = 664
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
GL+ + +INHSC+PNA + G + VRA + + G EVL+SY+E A R+K +
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 506
Query: 116 KEQYLFTCTCPRC 128
+QY F C+C RC
Sbjct: 507 -QQYNFECSCERC 518
>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 494
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 9/142 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ ++ NH+C PN F E V A + G E+ +SY T R++ L
Sbjct: 214 GLYPLGALQNHNCAPNTRHHFDENYRLYVSATVPISAGEELTMSYTSLFWDTTLRRQFLS 273
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RDSDDKG--FTCQQCGL 173
F+CTC RC D + ++L C D CSG LL RD G + C CG
Sbjct: 274 ITKHFSCTCKRC-----SDPTESGSMLCALLCAFDNCSGNLLPRDPLRIGTPWVCDMCGK 328
Query: 174 VRSKEEIKKIASEVNILSKKTL 195
+ +I I S + ++++ L
Sbjct: 329 SINYRQICSIRSGLAAITEEAL 350
>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
Length = 504
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 33/234 (14%)
Query: 32 EIAENFSK---------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---E 79
++AE FS+ L NA S+ PL LYP I+ H+C+PN E
Sbjct: 169 KLAERFSEELIMQVVGVLEVNAFEARTSQGYPLRC-LYPYTGILAHNCVPNTARTIHPSE 227
Query: 80 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
G +RA+ + G ++ SY T T RQ L+E FTC+C RC+ +
Sbjct: 228 GYKIRLRAMVALEAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFS 287
Query: 140 SAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGLVRSKEEIKKIA----SEVNILSK 192
S ++ +C + G+LL + + C+ C S EE+++I +EVN +
Sbjct: 288 S--MKCGQCTE----GWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQA 341
Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
+ + ++ Y K L HP R+ LI++ ++ ++
Sbjct: 342 LDMGPKRLEESERLLRKY-------KSLLHPSHYIATSLRQLLIEMYGRVQGYE 388
>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 467
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 78/185 (42%), Gaps = 21/185 (11%)
Query: 14 QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG----TGLYPVISIINHS 69
IA+ + + +IS EI L N H + PL +Y + S+I H+
Sbjct: 184 NIAHFIKRFFKADDISEEEIVTIIGILQVNGHEV------PLTDSPYVAVYEMASLIEHN 237
Query: 70 CLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
C N F G L ++RA + KG + I Y + T R+ L + F C C R
Sbjct: 238 CRANCSKSFTDMGGL-IIRAALPITKGDHISICYTDPLWGTANRRHHLLKTKFFECICNR 296
Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEIKKIA 184
C +F + +C CSG++L + ++ + C+ C + S EI+K+
Sbjct: 297 CQDPTEF-----QTMFNALKCNKINCSGYVLPKTFLEQEQDYVCKTCESIVSCTEIEKML 351
Query: 185 SEVNI 189
++ I
Sbjct: 352 EDIGI 356
>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
Length = 378
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 20/132 (15%)
Query: 40 LACNAHTIC-----NSELR-----PLGTGLYPVISIINHSCLPNAVLVFE------GRLA 83
L CN+H I NS + +G G+Y S+ NHSC PN V E G +
Sbjct: 226 LECNSHDISFPFRTNSSITTFNYYSIGIGIYFNSSLFNHSCQPNICKVIESCKNNFGCHS 285
Query: 84 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
+V A++ + + E+ +YI+ + R + L+ Y FTC CP C + L
Sbjct: 286 MV-AIRDIKENEEISFNYIQLNQNKQERIEKLQSSYYFTCNCPSCCTN---NPNPHKEFL 341
Query: 144 EGYRCKDDGCSG 155
Y+CK CSG
Sbjct: 342 NKYKCKKLNCSG 353
>gi|345570043|gb|EGX52868.1| hypothetical protein AOL_s00007g204 [Arthrobotrys oligospora ATCC
24927]
Length = 772
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%)
Query: 57 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
+GL+ + + NHSC+ NA F G + + RAV +PK EVLISY + RQ L+
Sbjct: 519 SGLWILPAYTNHSCISNAHRTFLGDMMIFRAVVDMPKDTEVLISYTDHNLPYEERQGILQ 578
Query: 117 EQYLFTCTCPRCI 129
+ FTC C C+
Sbjct: 579 TSWKFTCQCKLCV 591
>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
Length = 545
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 330 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 388
Query: 116 KEQYLFTCTCPRC 128
E + F C C RC
Sbjct: 389 AESWGFCCGCSRC 401
>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 450
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 60/107 (56%), Gaps = 10/107 (9%)
Query: 29 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGR 81
+I ++ + FS+ + N + +S L + G++P+ S + NHSCLPNA + R
Sbjct: 167 TIPDVNDLFSRFSNNNFAV-HSHLTTVAHGVFPLASRLFNHSCLPNAAARYILSEDAAPR 225
Query: 82 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ +V A+Q + G E+ + Y++ A +RQ+ + Y FTCTCP C
Sbjct: 226 MEIV-ALQDIGAGEEICVPYLDPA-LLQSRQQGFQLTYGFTCTCPSC 270
>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 650
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 79/171 (46%), Gaps = 14/171 (8%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
+G GLYP +++ NHSC P ++G VR+V+ + G+ + +Y + T R+
Sbjct: 448 IGGGLYPTLALFNHSCEPGVTRYYKGNSVCVRSVRSIAAGSMIGENYGPLFTQTPRDERR 507
Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG---CSGFLLRDSDDKGF-- 166
L QY F+C C C + F ++ ++ + R K DG CS LL S+ F
Sbjct: 508 ATLLNQYRFSCNCRACSENWPLFSEMDDTVL----RFKCDGGKICSNVLLIPSEINEFMI 563
Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 217
C CG + + K+ + + K + S G + + YK +E++
Sbjct: 564 KCTDCGEHTNIMKGLKLVQDTETMFKLATKMHSAGEIEAAL--YKYVEEMN 612
>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
Length = 486
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 14/158 (8%)
Query: 54 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 112
P +Y S++ HSC PN F R +V A + +G + I Y + +T R
Sbjct: 189 PSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRL 248
Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD---KGFTC 168
+ L++ +F C C RC +++ + G++ KD C G++L DSD+ ++C
Sbjct: 249 EHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQ-KDSKCKGYILPVDSDNWTTGDWSC 307
Query: 169 QQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
Q+C G+V A+ IL + + L + H+E
Sbjct: 308 QKCRGVVTG-------AAVGQILERARMDLEAMQKHRE 338
>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
Length = 257
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 53 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
R TGL+ + INHSC+ NA + G L V+RA+ + KG E+ SY E+A RQ
Sbjct: 104 RNASTGLWIWAAYINHSCIANAKKEYVGDLMVLRALGAIKKGEEIFHSYDESA-DYEARQ 162
Query: 113 KALKEQYLFTCTCPRCIKLGQFDD 136
KAL + F C C C DD
Sbjct: 163 KALLTTWGFECGCALCAAERTDDD 186
>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
Length = 404
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 58 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273
Query: 111 RQKALKEQYLFTCTCPRC 128
RQ+ L E Y F C C RC
Sbjct: 274 RQRRLLEDYGFRCECDRC 291
>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
Length = 589
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 24/121 (19%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLG-------TGLYPVISIINHSCLPNAVLVFE 79
E+S EI +L N HT+ + PLG G++P S++NHSC PNA F
Sbjct: 299 EVSAEEIQTVHRRLRRNFHTV---QPEPLGRLLPPTVVGVFPAASLVNHSCEPNA--CFH 353
Query: 80 GRLA---------VVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPR 127
R A +R V G EV +SY+ T R++ L+ + F+C CPR
Sbjct: 354 SRRAGPEGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPR 413
Query: 128 C 128
C
Sbjct: 414 C 414
>gi|51859341|gb|AAH81479.1| SMYD family member 5 [Danio rerio]
Length = 300
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+GL+ + S NHSC+PNA F L + A+ + G E+ ISY++ S +
Sbjct: 188 GSGLFLLQSSCNHSCVPNAEASFPDNNFLLHLTALGDIGPGEEICISYLDCCQRDRSRHS 247
Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
R K L+E YLF C+C +C L Q DD
Sbjct: 248 RHKILRENYLFICSCQKC--LSQMDD 271
>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
Length = 325
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 25 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 84
Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ FTC C RC+ +LG F ++ EG++ D ++C C
Sbjct: 85 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCAGC 137
Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 138 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 190
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 191 ATGLRQLLIEMYGRVQGYE 209
>gi|351737851|gb|AEQ60886.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
Length = 263
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+Y++ +T TR+ L QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203
>gi|311978082|ref|YP_003987202.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|81999695|sp|Q5UNT8.1|YL678_MIMIV RecName: Full=Putative SET domain-containing protein L678
gi|55417289|gb|AAV50939.1| unknown [Acanthamoeba polyphaga mimivirus]
gi|308204535|gb|ADO18336.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|339061617|gb|AEJ34921.1| hypothetical protein MIMI_L678 [Acanthamoeba polyphaga mimivirus]
Length = 255
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+Y++ +T TR+ L QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203
>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
Length = 507
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ FTC C RC+ +LG F ++ EG++ D ++C C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCASC 319
Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 320 GSETSNVDALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|146263138|gb|ABQ15327.1| Mcg1p [Magnaporthe grisea]
Length = 456
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 57 TGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
G + +S +NH C PN A V + + A + + G E+ ISY+++ S RQ+ L
Sbjct: 258 AGTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERL 317
Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
K + FTCTCP C G + +S + Y+
Sbjct: 318 KRNWGFTCTCPHCSLSGAMGKLSDSRLYAIYQ 349
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
NA +++L +G L P +++ NHSC+PNA++ F GR A++RA + + E+ ISY
Sbjct: 163 NAFHRYDTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEISYT 222
Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIK 130
+ R+ AL Y F C C RC K
Sbjct: 223 DYTFPLSKRKLALAP-YFFDCMCLRCEK 249
>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 420
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 49/82 (59%), Gaps = 4/82 (4%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG 106
S+ G GLY ++ +NHSC PN VL + +L+++ A++ + G E+ ISYI+ +
Sbjct: 337 SKFSVEGEGLYKYLNTLNHSCDPNCVLACTTDDFKLSLI-ALKDIKAGEELTISYIDNSM 395
Query: 107 STMTRQKALKEQYLFTCTCPRC 128
+ TR K L +QY F C C +C
Sbjct: 396 NKETRLKTLMDQYNFDCKCKKC 417
>gi|348668850|gb|EGZ08673.1| hypothetical protein PHYSODRAFT_352669 [Phytophthora sojae]
Length = 1354
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 70/140 (50%), Gaps = 20/140 (14%)
Query: 5 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--------- 55
DE Q ++ + V +L W E + ++ + +L N + +SE PL
Sbjct: 1167 DEDQTIIAS-----VRRVLGWEEERL-QVLQTQERLERNDGEL-DSEFDPLMPLGYPSIE 1219
Query: 56 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
GT L+P+I +NHSC PN +++ G VV A++ + KG E+ I YI+ R+
Sbjct: 1220 GTALFPIICTMNHSCDPNCTVLYTKNGDGHVV-AIRDIHKGEELCICYIDVDMDVQMREA 1278
Query: 114 ALKEQYLFTCTCPRCIKLGQ 133
L+E Y F C C RC++ Q
Sbjct: 1279 NLRE-YKFKCFCSRCVQERQ 1297
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 56 GTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
GT L+ I +NHSC PN VL + A V AVQ + +G E+ ISYI+ R +
Sbjct: 390 GTALFSGICTMNHSCDPNCTVLYTKDGAAHVFAVQDIQEGEELCISYIDVDQEVEERAEC 449
Query: 115 LK 116
L+
Sbjct: 450 LR 451
>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
Length = 510
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 19/121 (15%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE-----------------GRLAV-VRAVQHVPKGAEV 97
G+ +Y +S+ NH C PN + E RL++ + ++ VP+ +E+
Sbjct: 361 GSAVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSEL 420
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
LI+YI R LK ++F C C RC ++ E I CK+ CSG L
Sbjct: 421 LITYIPLNQKGHERYHNLKSNWMFPCDCLRCFHEKTNPELTE-KIFNDSSCKNSKCSGGL 479
Query: 158 L 158
L
Sbjct: 480 L 480
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 34 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 93
AE K+ N+ +SEL G L P +++ NHSC+PNA + F GR A++ A +
Sbjct: 168 AEMLCKIQNNSFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRA 227
Query: 94 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
G E+ I+Y R++AL Y FTC C RC
Sbjct: 228 GDEIEIAYTFYTDPLPKRREALA-HYKFTCQCLRC 261
>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
Length = 657
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 53 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
R TGL+ + INHSC+ NA + G L V+RA++ + +G E+ SY E+A TRQ
Sbjct: 503 RNASTGLWVWAAYINHSCVANAKKEYVGDLMVLRALRPIKEGEEIFHSYDESA-DYETRQ 561
Query: 113 KALKEQYLFTCTCPRC 128
+AL + F C+C C
Sbjct: 562 RALMTTWGFECSCALC 577
>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
Length = 584
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 40/234 (17%)
Query: 42 CNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
CNA +C++ P+G L+ V S NHSCLPN + G + RA++ G E+ I
Sbjct: 327 CNALAVCDASGLPVGQALHAASVTSYFNHSCLPNCA-IEAGAIVTTRAIR---PGEELTI 382
Query: 100 SYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFL 157
SY+ + R++ L E+Y F C RC++ G D ESA+ R
Sbjct: 383 SYLPQLYWPAWLRREELAERYFFDC---RCVRCGDGDRSPFESALSATLRLGGS------ 433
Query: 158 LRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
R+ + + + Q CG VR+K+ +V++ + L ++++
Sbjct: 434 -REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDAL--------------LHLLQE 473
Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
++ L+ PF + L + L D + L C + +++ + P HP+
Sbjct: 474 CRQHLF-PFHYLCHELHNTLSFVYAVLGDTRACLCGCLRELVMWEAIVPGAHPV 526
>gi|302832792|ref|XP_002947960.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
nagariensis]
gi|300266762|gb|EFJ50948.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
nagariensis]
Length = 1170
Score = 59.3 bits (142), Expect = 3e-06, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 13/76 (17%)
Query: 53 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
RP TGL+ +++NHSC+PN G +VR + V G EV R+
Sbjct: 703 RPGHTGLWTEHALMNHSCVPNVCTYVLGEAMLVRVSRPVQGGEEV-------------RR 749
Query: 113 KALKEQYLFTCTCPRC 128
KAL +QY FTC CPRC
Sbjct: 750 KALYDQYGFTCCCPRC 765
>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 478
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 61/139 (43%), Gaps = 9/139 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYPV + +NH C+PN + F +L ++ +A + KG E+ SY + T RQ L
Sbjct: 216 GLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQEITTSYTYSIWPTSLRQNHLL 275
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKG-FTCQQCGL 173
F CTC RC +F L C C G +L D K + C+ C
Sbjct: 276 TSKQFICTCSRCCDTEEF-----GTKLAALNCIVKNCDGRILPINPLDKKSIWQCKMCTK 330
Query: 174 VRSKEEIKKIASEVNILSK 192
+ S E+ +V L K
Sbjct: 331 LVSSTEMICFYKDVETLMK 349
>gi|453080542|gb|EMF08593.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 350
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 115
++ I+ INHSCLPNA + L +R +++ + G E+ +SYI G + TR+K L
Sbjct: 122 AIFLTIARINHSCLPNAQHSWNAPLQRMRIHSIKTLQPGEEITLSYILPCGPSSTRKKLL 181
Query: 116 KEQYLFTCTCPRC 128
KE + F C CP C
Sbjct: 182 KENFCFNCLCPLC 194
>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 480
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 34 AENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVR 86
+E +S+ N + + +S L P+ G++PV S + NHSC PNA + G VV
Sbjct: 189 SELYSRFGRN-NFVVHSHLSPIAHGIFPVASRLFNHSCTPNAAAKYIFGRDKPGVRMVVV 247
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A++++ E+ I Y++ A + RQ +L++ Y F CTC C
Sbjct: 248 ALRNISHNEEITIPYLDPALPFLERQLSLEQNYGFRCTCHVC 289
>gi|321478979|gb|EFX89935.1| SET and MYND domain-containing 5 protein [Daphnia pulex]
Length = 387
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 56 GTGLYPVISIINHSCLPNAV--LVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
G+ L+ + S NHSC PN + F + A + + +G E+ ISY++ S +
Sbjct: 290 GSALFAIQSACNHSCEPNCISTFPFSNHTVALVASKDLEEGEEIFISYLDECAQSRSRHS 349
Query: 111 RQKALKEQYLFTCTCPRC 128
R+K LKE YLF C C RC
Sbjct: 350 RRKILKENYLFHCNCSRC 367
>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
Length = 388
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 113
G G +P S NHSC PN +GR RA + + KG E+ I+Y+ ++R K
Sbjct: 309 FGYGCWPAASYFNHSCHPNVEKQRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKR 368
Query: 114 --ALKEQYLFTCTCPRC 128
LK+ + F C+C RC
Sbjct: 369 MLRLKKTWGFDCSCERC 385
>gi|159471331|ref|XP_001693810.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283313|gb|EDP09064.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 484
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 49/103 (47%), Gaps = 10/103 (9%)
Query: 30 INEIAENFSKLACNAHTICNSELRPLG-TGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
+ AE+ + AC P G GL+P +++NHSC PN V G VVRAV
Sbjct: 185 FGDDAEDLAACACRGE-------EPRGHIGLWPEFALLNHSCAPNTVNYVVGGSMVVRAV 237
Query: 89 QHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCI 129
+ +G EV I Y+ R L + Y F C+CPRC+
Sbjct: 238 APISQGEEVTICYLGRPQLLPFNKRIAILADDYGFECSCPRCV 280
>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
[Acyrthosiphon pisum]
gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
[Acyrthosiphon pisum]
Length = 645
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 19/125 (15%)
Query: 23 LQWPEISINEIAENFSKLAC----NAHTI-----CNSELR--------PLGTGLYPVISI 65
LQ P + + E+ + C NAH I C ++L+ P+ + L PV+S+
Sbjct: 423 LQIPNHQLGTVGESLVHIICAVSSNAHGITQPSDCKTQLKLSLDNRFVPVASLLMPVLSL 482
Query: 66 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTC 123
+NH C PN V V+ A+Q + KG+++ +Y + +R + LK QY F+C
Sbjct: 483 LNHHCDPNVVRHNYNGTIVLTAIQPISKGSQLFDNYGLLYATHPKESRLQILKNQYYFSC 542
Query: 124 TCPRC 128
C C
Sbjct: 543 ECSSC 547
>gi|170029520|ref|XP_001842640.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863224|gb|EDS26607.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 498
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 81/187 (43%), Gaps = 15/187 (8%)
Query: 58 GLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
GLYP ++I+ HSC+PN V + RL A+ V +G+++ +Y T TM RQ
Sbjct: 198 GLYPKLAIMAHSCVPNVVHSIHPSKDYRLTARTAID-VEEGSKLYTTYTYTLSGTMVRQA 256
Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQC 171
ALK FTC C RC+ D + +C+ D+G ++ + C C
Sbjct: 257 ALKSTKYFTCQCKRCL-----DPTELGTHFSSLKCQKCDNGVIVSSKPTDEEAEWHCTHC 311
Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
K + +A + ++ + L E + +++ + K + + HP
Sbjct: 312 ---EYKLKGAAMAKAIQVMQAEIDELAYMEYGPERLESFERVFKKYRSVLHPLHFINTSI 368
Query: 232 REKLIKI 238
R LI++
Sbjct: 369 RNSLIEL 375
>gi|167525114|ref|XP_001746892.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774672|gb|EDQ88299.1| predicted protein [Monosiga brevicollis MX1]
Length = 500
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
Query: 53 RPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
R G +PVI+++NHSC PN AV+ G VR + V GAE+ +SY++ R
Sbjct: 233 RETAWGHFPVIALVNHSCHPNCAVVSRPGGELEVRTLGTVRAGAELFVSYVDLTLPRAER 292
Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
Q L FTCTC RC F E+++ +C
Sbjct: 293 QAHLLASKEFTCTCYRCQHPDAFPHEHEASMPSCPQC 329
>gi|398257480|gb|EJN41088.1| hypothetical protein lvs_L585 [Acanthamoeba polyphaga
lentillevirus]
Length = 237
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 38 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 100 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 153
Query: 98 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
+Y++ +T TR+ L QY F C C RCI
Sbjct: 154 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 185
>gi|339236741|ref|XP_003379925.1| MYND finger protein [Trichinella spiralis]
gi|316977375|gb|EFV60485.1| MYND finger protein [Trichinella spiralis]
Length = 476
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 114/285 (40%), Gaps = 25/285 (8%)
Query: 37 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH---VPK 93
F + N+H+I ++ +G L IS +HSC PN +VF G AV+R + +
Sbjct: 164 FCRSWINSHSIHSNTGIEVGMALDLGISKYDHSCRPNTAMVFNGFRAVLRPLVNGIDTAD 223
Query: 94 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
++ I+Y++ S R+K L+ ++ F C C RC DD L RC + C
Sbjct: 224 PSQCFIAYVDVGRSRYQRRKELQSKWYFWCECERCRDPS--DD-----RLTSIRCVNVDC 276
Query: 154 SGFLLRDSDD---KGFTCQQCGLVRSKEEIKKIASEVNILSK-KTLALTS-CGN--HQEV 206
S + D K C+ CG KI V I ++ LAL GN E
Sbjct: 277 SEPVCITEDQTNTKNVQCRGCG--------SKIPENVIIEAQCFMLALPQHFGNVKSAEE 328
Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
V K+ ++L H ++ + +++ ++ + L + Y+R P
Sbjct: 329 VRRLKIYLNTAERLLHKENIYFCRLLTAYLQLTEGVDSFANNLELQKSVYTNYRRCLPAA 388
Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
+G Q + +I EA+ I I G + P+
Sbjct: 389 DRHVGFQLLHIVRTLITQNRRAESIPYAHEAMRIFEICFGLDHPY 433
>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 12/114 (10%)
Query: 27 EISINEIAENFSKLACNAHTI----CNSELRPL-GTGLYPVISIINHSCLPNAV------ 75
E+S+ + ++ ACN + C E R + G ++ S+ NH CLPNA
Sbjct: 165 ELSLELTGKLLARDACNTFGLMAPSCAGEERKVRGYAMFAQASMFNHDCLPNACRFEYVD 224
Query: 76 LVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
+ +G V VRA+ + +G EV +SY RQ+ L+E+Y F CTC RC
Sbjct: 225 IDGDGNTDVIVRALHDMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARC 278
>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
Length = 503
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)
Query: 59 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 203 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 262
Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
K+ FTC C RC+ +LG F ++ EG++ D + C C
Sbjct: 263 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 315
Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 316 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 368
Query: 228 LMQTREKLIKILMELEDWK 246
R+ LI++ ++ ++
Sbjct: 369 ATGLRQLLIEMYGRVQGYE 387
>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
Length = 532
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 88/199 (44%), Gaps = 26/199 (13%)
Query: 10 LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
+L A + + IL W E+ I IA + L N + R L+P ++I
Sbjct: 198 VLRANLITFIKTILGMRDWSEVDILRIA---AILDTNTFELRQPRERRKVRALFPGAAMI 254
Query: 67 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
+H C PN F+ + ++ A + + KG + ISY + ST+ R+ L++ F C C
Sbjct: 255 SHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILTISYTQPLRSTIQRRLHLRQAKCFDCAC 314
Query: 126 PRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
RC +LG F Q +CK +G ++ + + + CQ C L RS +E
Sbjct: 315 DRCQDPTELGTFAGAQTCV-----KCK----AGKIISVNPLQNTANWKCQLCNLKRSAKE 365
Query: 180 I----KKIASEVNILSKKT 194
+ K+ E+ L K T
Sbjct: 366 VLLSDAKLQQEIEALDKTT 384
>gi|341876422|gb|EGT32357.1| CBN-SET-10 protein [Caenorhabditis brenneri]
Length = 428
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 69/145 (47%), Gaps = 19/145 (13%)
Query: 35 ENFSKLACNAHTICNS-----ELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
EN +A +CNS E R P+G+G+Y ++ NHSC + +VF+ ++RA
Sbjct: 145 ENNRMIAMICSVLCNSFGIIDEKRGEPIGSGMYIGLAKHNHSCASTSHVVFDKNQVLLRA 204
Query: 88 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
+ ISY+ T RQK+++ + TC C C Q +D+ I R
Sbjct: 205 RKE-EYCRNTTISYVSRMLPTAERQKSIRSVHFITCRCEMC----QNEDLD--LIGLASR 257
Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCG 172
C+ DGC+GF+ S +C CG
Sbjct: 258 CQTDGCNGFVKGSS-----SCGSCG 277
>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
206040]
Length = 543
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 1/102 (0%)
Query: 27 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
E I + AE K+ N+ +S+L +G L P +++ NHSC+PNA + F GR A++
Sbjct: 166 EEDIRKAAEMLCKIQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLI 225
Query: 87 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
A + G E+ ++Y R++AL Y F C C RC
Sbjct: 226 AENPIRAGDEMELAYTFYTDPLPKRKEALA-HYKFVCQCLRC 266
>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 631
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 12/107 (11%)
Query: 40 LACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVVRAVQ 89
+ CNA+ I + E R +G +Y +S++NHSC PN V + + VVRA++
Sbjct: 411 INCNAYEIVENVHEEVTRVWEPRNIGGAIYTTVSLVNHSCYPNVVRHSYPSGIVVVRALR 470
Query: 90 HVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
+ KG E+L Y + S M R++ L ++Y F C C C + +F
Sbjct: 471 FIGKGCEILDCYGPQFLSESRMARRELLWKKYRFLCECDACTQDWKF 517
>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
Length = 311
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQK 113
GT +Y S+ NHSC PNA + +E G A+ +R ++ V G E I+Y++ + +R+
Sbjct: 232 GTAVYFTASLFNHSCAPNAHVSWENGDAAITIRTLRPVRAGEEFNITYVDANERSASRRA 291
Query: 114 ALKEQYLFTCTCPRC 128
LKE Y F C C RC
Sbjct: 292 RLKEWYGFDCACERC 306
>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
Length = 535
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 132/330 (40%), Gaps = 35/330 (10%)
Query: 16 ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNA 74
A+L+ + +E+ E + N H I + + +G GLY + +HSC PN
Sbjct: 198 ADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNT 257
Query: 75 VLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
+ +G +A +R + + + SYI+ +T R+K LK+ + F C C RC
Sbjct: 258 IYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTWYFECHCTRC--- 314
Query: 132 GQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
DD S+IL Y C +D + + TC +C S E + +
Sbjct: 315 SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCHNKVSPEYVVEAI 371
Query: 185 SEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ + K + + Q + + E+ K L VN+ K+I++L+ L
Sbjct: 372 GAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFLC--KIIQLLIPL 426
Query: 243 EDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
+ ++ CQL + ++ + +P HP +G+ Y + A+
Sbjct: 427 IN----MSDCQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLKCKQPHRALLYCK 482
Query: 296 EAVEILRITHGTNSPFMKELILKLEEAQAE 325
+A EI+ T G E L++A E
Sbjct: 483 KAYEIISFTLGPKHLMTIETDAMLKDASNE 512
>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
Length = 776
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/291 (22%), Positives = 123/291 (42%), Gaps = 34/291 (11%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
N TI E+R T +YP +++NHSC PN + F +VR + +P G EV Y
Sbjct: 469 NGKTITEKEVR-RATAIYPSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYG 527
Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
R+KALK QY+FTC C C + D + ++ Y C+ C G +
Sbjct: 528 PHRARAPAAARRKALKAQYMFTCHCADCNDTERKDFV---SLFSAYLCQ--SCKGPVWAH 582
Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMIEKLQKK 219
C QC E + + L+ + + S +++ ++Y +L+++
Sbjct: 583 CVRP--LCTQCRSALHLERAHTLLDRADDLATQAEQVVSLEERCEKMAASY----RLKQQ 636
Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-----HPLLGLQY 274
++H +L ++L ++ + D+ +++ + I + + F H L L
Sbjct: 637 VWHRHHASLRMAADRLARLYADTGDFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSD 696
Query: 275 YTCGKL----------EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
++ EW L A K + +A++++ + +G+ P + L
Sbjct: 697 VMLERILNSPQHLEYREWCL----EAHKVVKKAIQLMELNYGSWEPLVSRL 743
>gi|226508960|ref|NP_001146688.1| uncharacterized protein LOC100280288 [Zea mays]
gi|219888317|gb|ACL54533.1| unknown [Zea mays]
Length = 318
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 50 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
S++R + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229
Query: 110 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
RQK+L+E Y F+C C C +L D ++ + C C G
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCHG 272
>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
Length = 704
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 12/106 (11%)
Query: 36 NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
N + CNA+ I + E R +G +Y +S+ NHSC PN V F VV
Sbjct: 488 NMQAIKCNAYEIVENVRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRHSFPNGTVVV 547
Query: 86 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCI 129
+++++P+G+E+L Y + +R++ L E+Y F C C C+
Sbjct: 548 TSLRYIPEGSEILDCYGQHFLENKRDSRRRLLAEKYYFDCQCEPCL 593
>gi|389750876|gb|EIM91949.1| hypothetical protein STEHIDRAFT_106624 [Stereum hirsutum FP-91666
SS1]
Length = 820
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 39/70 (55%)
Query: 59 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
L+ + S+ NHSC PNA G V+R+ + +PK E+ ++Y+ AG+ TR K L +
Sbjct: 546 LHTLPSLFNHSCAPNATWRLFGDTMVIRSNRDIPKNTEITLAYVSPAGNARTRSKDLFTK 605
Query: 119 YLFTCTCPRC 128
L C C C
Sbjct: 606 LLGPCNCDLC 615
>gi|157115758|ref|XP_001652682.1| hypothetical protein AaeL_AAEL007346 [Aedes aegypti]
gi|108876750|gb|EAT40975.1| AAEL007346-PA [Aedes aegypti]
Length = 354
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 52/135 (38%), Gaps = 33/135 (24%)
Query: 25 WPEISINEIAENFSKLACNAHTICNSELR-----------------------------PL 55
W S I + +L CN H I SELR +
Sbjct: 42 WFLYSSAVILRHIGQLVCNGHAI--SELRVCTASENNCLEADSFNIKAGFLHRCFESTRV 99
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQK 113
TG++P IS+ NHSC PN F V A + V G E+ Y S RQ
Sbjct: 100 FTGIFPQISMFNHSCDPNIRNCFSKSTLTVYATRDVEAGGEIFNCYGPNFKLMSREDRQS 159
Query: 114 ALKEQYLFTCTCPRC 128
ALK+QY F C C RC
Sbjct: 160 ALKQQYCFDCDCIRC 174
>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 485
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 9/132 (6%)
Query: 58 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
GLYP+ S+ NH C+PN F+ + VRA + G E+ +SY T R++ L
Sbjct: 212 GLYPLGSLQNHCCIPNTRHYFDEEFQLYVRAALPISTGEEITMSYTSLFWDTTLRRQFLN 271
Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
F+C C RC +F+ + L C D C G LL + + C +C +
Sbjct: 272 VTKNFSCMCKRCSDPTEFN-----SKLGALLCASDKCCGELLPKNPLNMRTSWICNKCTI 326
Query: 174 VRSKEEIKKIAS 185
+ +I I S
Sbjct: 327 SVNHRQICSIRS 338
>gi|300121578|emb|CBK22096.2| unnamed protein product [Blastocystis hominis]
Length = 223
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 41/62 (66%)
Query: 43 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
N TI ++E+ P+G GLYP I+ INH C PNA LVF+G ++ +++ + E+ I Y+
Sbjct: 149 NRVTIRDAEMNPIGYGLYPYITAINHHCEPNAALVFQGNQLILTSLRPIEASEEIRIRYL 208
Query: 103 ET 104
++
Sbjct: 209 DS 210
>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
Length = 311
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 8/115 (6%)
Query: 15 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
+ L L L E++ +E E + H RP+G G++P S NHSC PN
Sbjct: 199 VRTLGRLQLNGFEMTASEPEEGADEAEGGGH-------RPVGVGVFPSASYTNHSCAPNC 251
Query: 75 VLVFEGRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
F+G VV + V G E+ I Y++ R++ L++ + F C C RC
Sbjct: 252 AQRFDGHGCIVVETARDVRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306
>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
B]
Length = 410
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 55 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-STMTRQK 113
LG +YP +S NH C PNA +GR + V +G E+ ISY G + R+K
Sbjct: 321 LGFAVYPRLSFFNHDCAPNARKERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRK 380
Query: 114 ALKEQYLFTCTCPRCI 129
L + + F C C RC+
Sbjct: 381 ELSDGWFFDCACGRCV 396
>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 510
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 68/137 (49%), Gaps = 26/137 (18%)
Query: 10 LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP-------------LG 56
LL+A+ LV +I S + + ++ NAH I + L P G
Sbjct: 366 LLWAE--ALVQVIF-----STDTLRTLVGQMVLNAHAINDYVLPPGEAPSSGVFDWVLKG 418
Query: 57 TGLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHVPKGAEVLISYIE-TAGST--MTR 111
GLY ++S NHSC+PN AV +G +V + + + G + I+YI TAG+T R
Sbjct: 419 AGLYSLLSCFNHSCVPNVAVSAVDGTHEIVLKTTRSIRAGEPLAITYIPLTAGATSRAER 478
Query: 112 QKALKEQYLFTCTCPRC 128
Q+ LK Y FTC CPRC
Sbjct: 479 QRQLK-NYFFTCHCPRC 494
>gi|308501687|ref|XP_003113028.1| CRE-SET-3 protein [Caenorhabditis remanei]
gi|308265329|gb|EFP09282.1| CRE-SET-3 protein [Caenorhabditis remanei]
Length = 464
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 61/134 (45%), Gaps = 10/134 (7%)
Query: 39 KLACNAHTICN-------SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
+ A NAHTI + E P+GTGL+P+ SI NHSC PN V F R + + V
Sbjct: 196 RCAKNAHTIYSIEQIEKKDEDVPIGTGLFPISSIFNHSCTPN-VFGFFVRNTFIFVSRGV 254
Query: 92 PKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
G E++ SY T S R++ L F C C C++ +D E + + R
Sbjct: 255 KSGEELVDSYGVTYNQHSLKQREEFLANVSGFKCHCDSCVEQKSLEDYLEKSFKDVDRSA 314
Query: 150 DDGCSGFLLRDSDD 163
+ S + D D
Sbjct: 315 REASSFLDISDYID 328
>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
Length = 530
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 83/181 (45%), Gaps = 22/181 (12%)
Query: 10 LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
+L A + + +L WPE+ I IA + L N + R LYP ++I
Sbjct: 199 VLRANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMI 255
Query: 67 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
+H C+PN F+ + +V A + + KG + ISY + ST+ R+ L++ F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315
Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
RC +LG F Q +CK +G ++ + + + CQ C RS ++
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKD 366
Query: 180 I 180
+
Sbjct: 367 V 367
>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
Length = 583
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 3/104 (2%)
Query: 27 EISINEIAENFSKLACNAHTICNSELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
E+SI E+ + + CN+ + + R PLG L +++NH C PN V+VF
Sbjct: 258 ELSIGEVLMLYGIVRCNSLPV-DQTFRNAPLGIALDMGGALLNHCCDPNVVIVFNSTQVQ 316
Query: 85 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
VRA++ + G E+L Y + A R + +Y F C C RC
Sbjct: 317 VRALRKIKDGEELLHCYRDIAYDFTFRNPRITARYQFNCQCDRC 360
>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
Length = 545
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 132/330 (40%), Gaps = 35/330 (10%)
Query: 16 ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNA 74
A+L+ + +E+ E + N H I + + +G GLY + +HSC PN
Sbjct: 198 ADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNT 257
Query: 75 VLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
+ +G +A +R + + + SYI+ +T R+K LK+ + F C C RC
Sbjct: 258 IYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTWYFECHCTRC--- 314
Query: 132 GQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
DD S+IL Y C +D + + TC +C S E + +
Sbjct: 315 SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCHNKVSPEYVVEAI 371
Query: 185 SEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
+ + K + + Q + + E+ K L VN+ K+I++L+ L
Sbjct: 372 GAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFLC--KIIQLLIPL 426
Query: 243 EDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
+ ++ CQL + ++ + +P HP +G+ Y + A+
Sbjct: 427 IN----MSDCQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLKCKQPHRALLYCK 482
Query: 296 EAVEILRITHGTNSPFMKELILKLEEAQAE 325
+A EI+ T G E L++A E
Sbjct: 483 KAYEIISFTLGPKHLMTIETDAMLKDASNE 512
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 32 EIAENFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLV 77
E A ++ L CN H I + + T +YP S++NHSC PN +
Sbjct: 187 EDAFSYGILVCNGHAITKINVTADDHENKLLIEEQIRIATAIYPSASMMNHSCDPNIINS 246
Query: 78 FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRC 128
F + +V+A+Q + +G E+L Y RQ++L+ QY FTC C C
Sbjct: 247 FLDQTLIVKAIQDIKEGEEILHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299
>gi|46115820|ref|XP_383928.1| hypothetical protein FG03752.1 [Gibberella zeae PH-1]
Length = 358
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 56 GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
G LYP + NHSCLPN +G+ +V A + + KG E +I+Y + T +RQ
Sbjct: 264 GFALYPRAAQFNHSCLPNVSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQ 323
Query: 113 KALKEQYLFTCTCPRCIK 130
K + Q+ F CTC RC++
Sbjct: 324 KYAQTQFQFKCTCNRCLE 341
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,800,127,029
Number of Sequences: 23463169
Number of extensions: 187367448
Number of successful extensions: 500268
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1278
Number of HSP's successfully gapped in prelim test: 1456
Number of HSP's that attempted gapping in prelim test: 495840
Number of HSP's gapped (non-prelim): 3856
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)