BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019809
         (335 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 261/334 (78%), Positives = 301/334 (90%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           M DIDEKQL+LYAQ+ANLVNLIL+WP+++I EIAENFSKLACNAHTIC+SELRPLGTGLY
Sbjct: 147 MKDIDEKQLVLYAQMANLVNLILRWPDVNIKEIAENFSKLACNAHTICDSELRPLGTGLY 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSCLPNAVLVF+GRLAVV  VQH+PKG+EVLISYIETAGSTMTRQKALK+QY 
Sbjct: 207 PVVSIINHSCLPNAVLVFDGRLAVVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYF 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRCIK+G  DDIQESAILEGYRCKD+ C+GFLLRDSDD+GF CQQCGL+RSKEE+
Sbjct: 267 FTCTCPRCIKMGLLDDIQESAILEGYRCKDNRCNGFLLRDSDDRGFICQQCGLLRSKEEV 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           KK A+E+   S K     S GN QE VS YK+IEKLQ+KL HPFS +LMQTREKL+K+LM
Sbjct: 327 KKSAAEIKATSDKASKSISSGNLQEAVSIYKLIEKLQRKLCHPFSTSLMQTREKLLKMLM 386

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ELEDW EAL+YC+LTIPVYQRVYP+FHPLLGLQY++CGKLEW LGDTE AIKS+T+A+++
Sbjct: 387 ELEDWGEALSYCKLTIPVYQRVYPEFHPLLGLQYFSCGKLEWLLGDTEAAIKSLTKALDV 446

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
           LRITHGT +PFMKEL++KLEEA+AE SY  +S D
Sbjct: 447 LRITHGTKTPFMKELMMKLEEARAEVSYLRASND 480


>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  566 bits (1459), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 262/335 (78%), Positives = 305/335 (91%), Gaps = 1/335 (0%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           ++DIDEKQL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLY
Sbjct: 147 ITDIDEKQLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLY 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYL
Sbjct: 207 PVISIINHSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYL 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEI
Sbjct: 267 FTCTCPRCRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEI 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K++ASE+  LS K   ++S  ++ E  S YKMIEKLQ KL+HPFS+NLM+TRE ++KILM
Sbjct: 327 KRLASELKPLSDKA-TMSSSSHYVEATSIYKMIEKLQTKLFHPFSINLMRTREAILKILM 385

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           E++DW+ AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +I
Sbjct: 386 EMKDWRAALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADI 445

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           L+ITHGTN+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 446 LQITHGTNTPFMKELLFKLEEARAEASHKRSSKDK 480


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 261/335 (77%), Positives = 302/335 (90%), Gaps = 4/335 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           ++DIDEKQL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLY
Sbjct: 147 ITDIDEKQLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLY 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYL
Sbjct: 207 PVISIINHSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYL 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEI
Sbjct: 267 FTCTCPRCRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEI 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K++ASE+  LS K    +S     +  S YKMIEKLQ KL+HPFS+NLM+TRE ++KILM
Sbjct: 327 KRLASELKPLSDKATMSSS----SQATSIYKMIEKLQTKLFHPFSINLMRTREAILKILM 382

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           E++DW+ AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +I
Sbjct: 383 EMKDWRAALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADI 442

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           L+ITHGTN+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 443 LQITHGTNTPFMKELLFKLEEARAEASHKRSSKDK 477


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score =  546 bits (1407), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 256/333 (76%), Positives = 299/333 (89%), Gaps = 2/333 (0%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
           D+DEKQL+LYAQ+ANLV+ ILQWPEI++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPV
Sbjct: 128 DLDEKQLVLYAQMANLVHFILQWPEINLKEIAENFSKLACNAHTICDCELRPLGTGLYPV 187

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           +SIINHSC+PNAVL FEG+ +VVRAV+H+P+GAEV I+YI+TAGSTMTRQKALKEQY FT
Sbjct: 188 VSIINHSCMPNAVLTFEGKSSVVRAVEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFT 247

Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
           CTCPRCIK+  +DDIQESAILEGYRCKDD C+GFLLRDS+DKGF CQ CGL RSKEE+K+
Sbjct: 248 CTCPRCIKV--YDDIQESAILEGYRCKDDRCNGFLLRDSEDKGFICQTCGLRRSKEEVKR 305

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           I  E+  +S K L  TS GNH+EV+S YKMIEKLQ +L HPFS++LM+T+E+L+KILMEL
Sbjct: 306 IVCEITAISDKKLKSTSPGNHEEVISLYKMIEKLQMELCHPFSISLMRTQEELLKILMEL 365

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
            DW+EALAYC+LTI  YQRVYP+ HPLLGLQYYTCGK+EW LG TE+AIKS+T AV+ILR
Sbjct: 366 GDWREALAYCRLTITGYQRVYPEPHPLLGLQYYTCGKIEWLLGYTEDAIKSLTRAVDILR 425

Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           ITHGTNSPFMKEL++KL+EA AEASY LSSKDE
Sbjct: 426 ITHGTNSPFMKELMMKLDEAHAEASYNLSSKDE 458


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 248/334 (74%), Positives = 289/334 (86%), Gaps = 1/334 (0%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MSDI E+QL+LYAQ+ANLVN IL+WP I+I EIAENFSK ACNAHTIC+SELRP+GTGLY
Sbjct: 152 MSDITEEQLVLYAQMANLVNSILEWPGINIKEIAENFSKFACNAHTICDSELRPVGTGLY 211

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSCLPN+VLVFEG  A+VRAVQH+P G EVLISYIETA STMTRQKALKEQYL
Sbjct: 212 PVISIINHSCLPNSVLVFEGSSALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYL 271

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC K+GQ+DDIQESAILEGY+CK + C GFLLR +D KGF CQ CGL+R KEEI
Sbjct: 272 FTCTCPRCSKVGQYDDIQESAILEGYKCKSEKCGGFLLRTTDGKGFQCQGCGLIRDKEEI 331

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K+I +E+ +LS+   +  S   +QE +S YK IEKLQ +L+HP S+NLM TREK++K LM
Sbjct: 332 KRITTEIKLLSEDA-SKPSATYYQEAISIYKRIEKLQTELFHPLSINLMHTREKILKSLM 390

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ELE W EALAYC+LTIP YQRVYP  HPL GLQYYTCGKLEW+LGDTE A+KS+T+AV+I
Sbjct: 391 ELEHWTEALAYCKLTIPFYQRVYPAVHPLPGLQYYTCGKLEWYLGDTEEAVKSLTKAVDI 450

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
           LRITHGTN+PFMK+L++KLEEA+ EASYK SSK+
Sbjct: 451 LRITHGTNTPFMKDLLMKLEEARTEASYKFSSKE 484


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 246/335 (73%), Positives = 287/335 (85%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MS IDEKQL+LYAQ+  LVN ILQWP +++ EIA NFSKLACNAHTIC+SELRPLGTGLY
Sbjct: 147 MSGIDEKQLVLYAQMGTLVNFILQWPGMNVKEIAVNFSKLACNAHTICDSELRPLGTGLY 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSCLPNAVLVFEGR AVVRAVQH+P GAEV ISYIETAGSTMTRQK LKE YL
Sbjct: 207 PVISIINHSCLPNAVLVFEGRTAVVRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYL 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTC RC+K+ Q D+I+ESAILEGYRC++D C GFLLR+SDD GFTCQQCGLVRSKE+I
Sbjct: 267 FTCTCSRCVKVAQEDEIKESAILEGYRCRNDQCDGFLLRNSDDTGFTCQQCGLVRSKEDI 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K IAS++  +S +     S  ++ E +  Y+ +EKLQ+ L HP+S++LMQTREKL+KI M
Sbjct: 327 KNIASKIKSISDEASTSLSSQSYAEALFMYEKVEKLQRILCHPYSISLMQTREKLLKISM 386

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ELE+W +AL YC+LTI VYQ++YP  HPLLGLQ+YTCGKLEW LG TE+AIKS T+A +I
Sbjct: 387 ELENWTKALTYCKLTISVYQKLYPGIHPLLGLQFYTCGKLEWLLGHTEDAIKSYTKAFDI 446

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           LRITHGTNS FMKEL+LKLEEA+AEASYKLSS D+
Sbjct: 447 LRITHGTNSSFMKELLLKLEEARAEASYKLSSTDD 481


>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR1-like [Glycine max]
          Length = 401

 Score =  515 bits (1327), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 242/334 (72%), Positives = 285/334 (85%), Gaps = 3/334 (0%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MSDI E+QL+LYA++ANLVN ILQWPEI+I EIAENFSK ACNAHTIC+SELRP+GTGLY
Sbjct: 71  MSDITEEQLVLYAKMANLVNSILQWPEINIKEIAENFSKFACNAHTICDSELRPVGTGLY 130

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSCLPN+VLVFEGR A+V  VQH+P G EV ISYIETAGSTMT+Q ALK    
Sbjct: 131 PVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNALKSSTX 190

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC K+G++DDIQESAILEGY+CK + C GFLL     KGF CQ CGLVR KEEI
Sbjct: 191 FTCTCPRCSKVGEYDDIQESAILEGYKCKSEKCGGFLLY---GKGFQCQGCGLVRDKEEI 247

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K+I +E+ +LS++ +  +S  N+QE +S YK IEKLQ +LYHP SVNLMQ REK++K LM
Sbjct: 248 KRITTEIKLLSEEKIISSSTCNYQEAISIYKWIEKLQTELYHPLSVNLMQNREKILKSLM 307

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ELE W EALAYC+LTIP YQRVYP  HPLLGLQYYTCGKLEW+LGDT+ A+KS+ +AV+I
Sbjct: 308 ELEHWAEALAYCKLTIPFYQRVYPAVHPLLGLQYYTCGKLEWYLGDTDEAVKSLIKAVDI 367

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
           LRITHGTN+PFMK+L++KLEEA+AEASY+LS K+
Sbjct: 368 LRITHGTNTPFMKDLLMKLEEARAEASYRLSPKE 401


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 237/328 (72%), Positives = 286/328 (87%), Gaps = 1/328 (0%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA  FSK ACNAHT+C+SELRPLGTGLY
Sbjct: 180 MSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLY 239

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSCLPN+VLVF+GR A VRA+QH+PKG EVLISYIETAGST+TRQKAL+EQYL
Sbjct: 240 PVVSIINHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYL 299

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           F C CP C K+GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+CGLVR KEEI
Sbjct: 300 FQCVCPLCSKVGQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEI 359

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K+IA+E+  L ++    +S  +H E +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM
Sbjct: 360 KQIATEIKFLLEEASKPSSNDSH-EAISIHKMIEKLQTKLYHPFSINLMQTRETILKSLM 418

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           +LE W+EALAYC+LTIP+YQRVYP  HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+I
Sbjct: 419 KLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDI 478

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASY 328
           LRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 479 LRITHGTKTPFVKELSMMLVEARAEASF 506


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 218/331 (65%), Positives = 277/331 (83%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MS+IDEKQ+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+
Sbjct: 147 MSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLF 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           P++SIINHSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYL
Sbjct: 207 PLVSIINHSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYL 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           F C C RC  +G+  DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+
Sbjct: 267 FHCQCARCSNVGKPHDIEESAILEGYRCSNEKCTGFLLRDPEEKGFVCQKCLLLRSKEEV 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           KK+AS++  +S+K  A  S  N Q  +  YK IEKLQ KLYH FS+ LM+TREKL+K+LM
Sbjct: 327 KKLASDLKTVSEKAHASPSAENKQAAIELYKTIEKLQVKLYHSFSITLMRTREKLLKMLM 386

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           +LE W+EAL YC+L +PVYQRVYP  HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +I
Sbjct: 387 DLEIWREALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDI 446

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLS 331
           LRI+HGT++PFMKEL  KLEEA+AEASYKL+
Sbjct: 447 LRISHGTSTPFMKELSAKLEEARAEASYKLA 477


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 213/329 (64%), Positives = 272/329 (82%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MS+IDEKQ+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+
Sbjct: 147 MSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLF 206

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           P++SIINHSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYL
Sbjct: 207 PLVSIINHSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYL 266

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           F C C RC   G+  DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+
Sbjct: 267 FHCQCARCSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEV 326

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           KK+AS++  +S+K     S  + Q  +  YK IEKLQ KLYH FS+ LM+TREKL+K+LM
Sbjct: 327 KKLASDLKTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLM 386

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ++E W+EAL YC+L +PVYQRVYP  HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +I
Sbjct: 387 DVEIWREALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDI 446

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYK 329
           LRI+HG ++PFMKEL  KLEEA+AEASYK
Sbjct: 447 LRISHGISTPFMKELSAKLEEARAEASYK 475


>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
          Length = 477

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 210/328 (64%), Positives = 256/328 (78%), Gaps = 35/328 (10%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA  FSK ACNAHT+C+SELRPLGTGLY
Sbjct: 180 MSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLY 239

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSCLPN+VLVF+GR A+VRA+QH+PKG E                        
Sbjct: 240 PVVSIINHSCLPNSVLVFDGREALVRALQHIPKGTE------------------------ 275

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
                      GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+CGLVR KEEI
Sbjct: 276 -----------GQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEI 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K+IA+E+  L ++    +S GN  E +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM
Sbjct: 325 KQIATEIKFLLEEASKPSSNGNSHEAISIHKMIEKLQTKLYHPFSINLMQTRETILKSLM 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           +LE W+EALAYC+LTIP+YQRVYP  HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+I
Sbjct: 385 KLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDI 444

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASY 328
           LRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 445 LRITHGTKTPFVKELSMMLVEARAEASF 472


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 209/335 (62%), Positives = 262/335 (78%), Gaps = 6/335 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y 
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           +K+AS+V +LS K  +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI M
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYM 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           EL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +I
Sbjct: 385 ELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADI 444

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           LRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 LRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 479


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 208/336 (61%), Positives = 262/336 (77%), Gaps = 7/336 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y 
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324

Query: 181 KKIASEVNILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K+AS+V +LS K  +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI 
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIY 384

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
           MEL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +
Sbjct: 385 MELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAAD 444

Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           ILRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 ILRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 480


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 207/336 (61%), Positives = 261/336 (77%), Gaps = 7/336 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELR LGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRSLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y 
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324

Query: 181 KKIASEVNILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K+AS+V +LS K  +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI 
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIY 384

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
           MEL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +
Sbjct: 385 MELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAAD 444

Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           ILRITHGT S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 445 ILRITHGTKSEFMKELLGKLEEVRAEASFRLSAGDE 480


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 201/335 (60%), Positives = 255/335 (76%), Gaps = 6/335 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+KQL+LYAQ+ANLV LIL   E+ + E A  FSK +CNAHTIC+ ELRP+GTGL+
Sbjct: 151 ISEVDDKQLVLYAQMANLVQLILPAIELDLKETAHIFSKFSCNAHTICDPELRPVGTGLF 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           P IS INHSC+PNAVL+FEGR A VRA+Q +    EV ISYIETA +T+ R   LK  Y 
Sbjct: 211 PAISTINHSCVPNAVLLFEGRTAYVRALQPLSNNTEVSISYIETAATTLKRHNDLK-HYF 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRCIK       +E  +LEGYRCKD  C GFLL DS  K +TCQ+C + R  EE+
Sbjct: 270 FTCTCPRCIK-----GSEEDPLLEGYRCKDQNCDGFLLPDSGKKAYTCQKCSMCRDGEEV 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           KK++SE+ +LS K  +  S GN+ E  S YK+IE+L++KLYH FS+ L+ TRE L+K+ M
Sbjct: 325 KKLSSEILLLSDKASSFVSSGNNNEAGSVYKIIEQLERKLYHSFSITLLHTRETLLKLYM 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           EL+DW+ AL YC+ TIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A ++
Sbjct: 385 ELQDWRSALMYCRSTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSLTRAADV 444

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           LRITHGTNS FMKEL  KLEEA+AE S++LS  +E
Sbjct: 445 LRITHGTNSQFMKELFGKLEEARAEVSFRLSPGNE 479


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 204/333 (61%), Positives = 252/333 (75%), Gaps = 6/333 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY 
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYF 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K   FD   E A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEI 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K + SE+  LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           EL+DW  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +I
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADI 444

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
           L++THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 445 LKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 477


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/330 (60%), Positives = 250/330 (75%), Gaps = 6/330 (1%)

Query: 4   IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +DE  L+LYAQ+ANLV+LIL   E+ + EIA  FSK ACNAHTIC+ ELRPLGTGLYPVI
Sbjct: 154 VDENHLVLYAQMANLVSLILPLIELDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVI 213

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           SIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTC
Sbjct: 214 SIINHSCVPNAVLIFDGRTAYVRALQPIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTC 272

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
           +CPRC+K       +E A+LEG+RCK+  C GFLL DS  K +TCQ+C + R +EEI+K+
Sbjct: 273 SCPRCVK-----GSEEDALLEGFRCKNQTCDGFLLPDSGKKAYTCQKCSVSRDEEEIQKM 327

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
            SE+  LS K  +  S GN  E  S YK IE+L++ LYH FS  L+ T E L+KI MEL+
Sbjct: 328 RSEILQLSDKASSFLSSGNKAEAGSVYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQ 387

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ ALAYC+LT+PVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++
Sbjct: 388 DWRTALAYCRLTVPVYERVYPPFHPMIGLQFYTCGKLEWLLECTEDALKSLTRATDILKV 447

Query: 304 THGTNSPFMKELILKLEEAQAEASYKLSSK 333
           THGT S FMKEL  KLEEA+AE S+++SS+
Sbjct: 448 THGTKSQFMKELFGKLEEARAEVSFRISSR 477


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/333 (60%), Positives = 250/333 (75%), Gaps = 6/333 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY 
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPIDKDEEVSISYIETAAVTKKRNNDLK-QYF 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K   FD   E  +LEG+RCK+  C GFLL  S  K +TCQ+CG  R  EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDPLLEGFRCKNQTCDGFLLPSSGKKAYTCQKCGASRDVEEI 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K + SE+  LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           EL+DW  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +I
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADI 444

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
           L++THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 445 LKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 477


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score =  373 bits (957), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 175/281 (62%), Positives = 219/281 (77%), Gaps = 6/281 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y 
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           +K+AS+V +LS K  +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI M
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYM 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
           EL+DW+ AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLE
Sbjct: 385 ELQDWQTALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLE 425


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score =  359 bits (922), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 174/283 (61%), Positives = 213/283 (75%), Gaps = 6/283 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY 
Sbjct: 211 PVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYF 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K   FD   E A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEI
Sbjct: 270 FTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEI 324

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K + SE+  LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +
Sbjct: 325 KNMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYL 384

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
           EL+DW  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW+
Sbjct: 385 ELQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWY 427


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 165/290 (56%), Positives = 216/290 (74%), Gaps = 1/290 (0%)

Query: 40  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
            +CNAHTIC+SELRP+GTGLYPVISIINHSC PNAVL+FEGR AVVRAV+ + +G+E+ +
Sbjct: 142 FSCNAHTICDSELRPMGTGLYPVISIINHSCFPNAVLLFEGRQAVVRAVEPIREGSELTV 201

Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 159
           SYIE A ST +R+K+LKEQY F C C RC+K+   D + E AILEG+RC  D C GFLL 
Sbjct: 202 SYIEIAASTASRKKSLKEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDHCEGFLLH 261

Query: 160 DSDD-KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
           D DD +   CQ CG  R++EE KK A +V+ L K+   L S GN+ E  S Y+ I++LQ 
Sbjct: 262 DPDDAQSLVCQLCGCGRNEEETKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQT 321

Query: 219 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
           +L+HP+SV L++T + L+KI MEL DWK+AL YC+LTIP Y+R YP  HP++GLQYY CG
Sbjct: 322 QLWHPYSVILLRTGDTLLKICMELYDWKQALKYCRLTIPAYERAYPTCHPMMGLQYYACG 381

Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 328
           KLEWFL +T  A+    +A +IL +THG NS F+ +L  +++EA AEA++
Sbjct: 382 KLEWFLENTLEALNFFEKAAKILTVTHGRNSEFLTQLFDRIQEAHAEAAH 431


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 152/331 (45%), Positives = 221/331 (66%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           MS+  E +L++YAQ+A++V  ++   E+++ E+ +   + ACNAHTIC+ E+RPLGTGLY
Sbjct: 154 MSETSEDRLVMYAQMASIVQQMMAPDEVNVKEVTQMICRFACNAHTICDEEVRPLGTGLY 213

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PVISI+NHSC+PNAVL F+G  A +RA++   +G E+ ISY+E A ST TR+KAL++QY 
Sbjct: 214 PVISIVNHSCVPNAVLHFDGNRAALRALEDTQEGTEITISYVELAASTNTRRKALRDQYY 273

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           F C C RC +L   +  +E A LEGY C +  C+G L+ D       C+ CGL R  ++ 
Sbjct: 274 FDCNCIRCSRLVTREGSREDAFLEGYGCVNSDCNGPLIEDPGSDKVICEVCGLKREVQQT 333

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K  A EV +   +   L + G  +     Y  +E  Q+KL+HP+SV L++T + L+KI M
Sbjct: 334 KSAAKEVELDVLEASNLYAAGKLESARRLYSEVEAKQRKLWHPYSVPLLRTHDALLKICM 393

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ++EDW  AL +CQ TIP Y+R YP F PLLGLQY+T GKL+W LGD+  A+ ++ +A  +
Sbjct: 394 DMEDWASALEFCQSTIPAYERAYPPFSPLLGLQYFTLGKLQWLLGDSAKAVNTLRKAYTV 453

Query: 301 LRITHGTNSPFMKELILKLEEAQAEASYKLS 331
           +++THG+ S  +  L   L+EAQAE +YK S
Sbjct: 454 IQVTHGSKSELLHGLTSTLQEAQAEVAYKRS 484


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 166/332 (50%), Positives = 207/332 (62%), Gaps = 55/332 (16%)

Query: 7   KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           K+ L   ++ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISII
Sbjct: 124 KRKLQNEKMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISII 183

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCP
Sbjct: 184 NHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCP 242

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
           RC+K   FD   E A+LEG+RCK+  C GFLL +SD                        
Sbjct: 243 RCVK--GFD---EDALLEGFRCKNQACDGFLLPNSDK----------------------- 274

Query: 187 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
                             E  S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW 
Sbjct: 275 -----------------AEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWW 317

Query: 247 EALAYCQLTIPVYQR-----VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
            AL YC+LTIPVY+        P   P    + Y  G LE+    TE+A+KS+T A +IL
Sbjct: 318 TALTYCRLTIPVYESKAWRLKIPFPRPRTVREAYGTGLLEF----TEDALKSLTRAADIL 373

Query: 302 RITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
           ++THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 374 KVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 405


>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
 gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
          Length = 453

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 133/331 (40%), Positives = 210/331 (63%), Gaps = 7/331 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
            S+  +++L++YAQ+A L+  IL  ++ E  + EI ++  +++CN HTIC+ ELRP+G G
Sbjct: 115 FSETGDERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIG 173

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           L+PV+S+INHSC  N++L+F+G+ AVVRA+  + +G EV +SYIE   ST +R++AL +Q
Sbjct: 174 LFPVVSVINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQ 233

Query: 119 YLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           Y F C CPRC  +  Q   + +  +LE   C D  C  F+     +  + C  CG  R  
Sbjct: 234 YYFHCKCPRCTDVSPQAHGLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREP 291

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
            E+ K+++E   + +K   L + G+       ++  EKLQ +L++P SV LM+TR+ L++
Sbjct: 292 NEVNKLSTEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLR 351

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
           + + LEDW  AL  C+LT+P Y+  YP   HPLLGLQYYT GKLE   G    A+++  +
Sbjct: 352 VYLSLEDWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAK 411

Query: 297 AVEILRITHGTNSPFMKELILKLEEAQAEAS 327
           A +IL +THG+   F+++L  +L++A+  AS
Sbjct: 412 AFDILSVTHGSRGEFVRKLRNELDQAKMVAS 442


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 133/330 (40%), Positives = 209/330 (63%), Gaps = 7/330 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
            S+  +++L++YAQ+A L+  IL  ++ E  + EI ++  +++CN HTIC+ ELRP+G G
Sbjct: 148 FSETGDERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIG 206

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           L+PV+SIINHSC  N++L+F+G+ AVVRA+  + +G EV +SYIE   ST +R++AL +Q
Sbjct: 207 LFPVVSIINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQ 266

Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
           Y F C CPRC    +   + +  +LE   C D  C  F+     +  + C  CG  R   
Sbjct: 267 YYFHCKCPRCTDDSEA-GLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREPN 323

Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
           E+ K+++E   + +K   L + G+       ++  EKLQ +L++P SV LM+TR+ L+++
Sbjct: 324 EVNKLSTEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLRV 383

Query: 239 LMELEDWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
            + LEDW  AL  C+LT+P Y+  YP   HPLLGLQYYT GKLE   G    A+++  +A
Sbjct: 384 YLSLEDWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAKA 443

Query: 298 VEILRITHGTNSPFMKELILKLEEAQAEAS 327
            +IL +THG+   F+++L  +L++A+  AS
Sbjct: 444 FDILSVTHGSRGEFVRKLRNELDQAKMVAS 473


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 122/202 (60%), Positives = 152/202 (75%), Gaps = 6/202 (2%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y 
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYY 269

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTCTCPRC+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE+
Sbjct: 270 FTCTCPRCVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEEL 324

Query: 181 KKIASEVNILSKKTLALTSCGN 202
           +K+AS+V +LS K  +L S G+
Sbjct: 325 QKMASDVLLLSDKVSSLVSSGS 346


>gi|326496881|dbj|BAJ98467.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 153

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 86/140 (61%), Positives = 111/140 (79%)

Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
            L S  ++ E  S YK+IE+L++K YH FS+ L+ TRE L+KI MEL+DW+ AL YC+LT
Sbjct: 11  TLISDTDNSEAGSVYKIIEQLEQKHYHSFSITLLHTRETLLKIYMELQDWQTALMYCRLT 70

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           IPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+K +T A ++LRITHGT S FMKEL
Sbjct: 71  IPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKLLTRAADVLRITHGTQSQFMKEL 130

Query: 316 ILKLEEAQAEASYKLSSKDE 335
             KLEEA+AE S++LSS +E
Sbjct: 131 FGKLEEARAEVSFRLSSGEE 150


>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
 gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
          Length = 463

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 166/313 (53%), Gaps = 17/313 (5%)

Query: 11  LYAQIANLVNLILQWPEIS-INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
           ++AQ+A ++ + +    +    EI E F ++ CN  +IC+ E++ +G G+YP +S+ NHS
Sbjct: 140 MFAQLAVVLRMYVGKDVMDDAREIFELFGRMTCNTFSICDPEMQYIGIGIYPKMSLFNHS 199

Query: 70  CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           C PN V VF G    VRA+Q++  G E+LISY+E    +  R++ L +QY FTC CPRC 
Sbjct: 200 CEPNCVAVFNGLRMEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRC- 258

Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
                 D  +  ++   +C +  C   +++  D    TC+ C      E+  K   EVN 
Sbjct: 259 -----QDQTKDGMMMAVKCGNINCKKVIIQ-VDGAYETCKVC--CHDNEKDPKFWKEVNK 310

Query: 190 LSKKT----LALTSCGNHQEVVSTYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMEL 242
           +++ +     A+TS    QE     ++ E+L   Q+++ H   + +++  +K +   + L
Sbjct: 311 VTQFSEEMLAAITSADQRQEPKEGLRLAERLLERQQQVLHNNHLFVLKCLDKALDQAVVL 370

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
             W  AL     TI  Y+  +P +HP LG+QY   GKL  +L     A++++  A  IL 
Sbjct: 371 RKWNRALRCALQTIEPYKVHFPAYHPSLGIQYMRIGKLLLYLEKRLAALEALQMAEHILN 430

Query: 303 ITHGTNSPFMKEL 315
           +THG + P  KEL
Sbjct: 431 VTHGKDHPINKEL 443


>gi|414864678|tpg|DAA43235.1| TPA: hypothetical protein ZEAMMB73_073593 [Zea mays]
          Length = 342

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 104/157 (66%), Gaps = 6/157 (3%)

Query: 69  SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           SC+PNAVL+F+GR   VRA+Q + K  EV ISYIETA  T  R   LK QY FT TCPRC
Sbjct: 182 SCVPNAVLIFDGRTTYVRALQPIDKDEEVSISYIETAAVTKKRNNYLK-QYFFTYTCPRC 240

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
           +K   FDD    A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEI K+ SE+ 
Sbjct: 241 VK--GFDD---DALLEGFRCKNQTCDGFLLPNSGKKAYTCQKCGASRDVEEINKMRSEIL 295

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 225
            LS K  +  S GN  E  S YK+IE+L++ LYH FS
Sbjct: 296 QLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFS 332


>gi|222625640|gb|EEE59772.1| hypothetical protein OsJ_12271 [Oryza sativa Japonica Group]
          Length = 384

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 82/167 (49%), Positives = 100/167 (59%), Gaps = 39/167 (23%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 163 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 222

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  E                        
Sbjct: 223 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEE------------------------ 258

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
                          D +E A+LEGYRC D  C GFLL ++D  G T
Sbjct: 259 ---------------DSEEDALLEGYRCNDQKCDGFLLPNADIYGVT 290



 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 53/80 (66%), Positives = 67/80 (83%)

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            PVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL
Sbjct: 304 FPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKEL 363

Query: 316 ILKLEEAQAEASYKLSSKDE 335
           + KLEE +AEAS++LS+ DE
Sbjct: 364 LGKLEEVRAEASFRLSAGDE 383


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 172/345 (49%), Gaps = 51/345 (14%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQW--PEI----------SINEIAENFSKLACNAHTIC 48
           ++ + E++    +Q+A+++ L LQ   P++          S  E     +K+ CN  TI 
Sbjct: 133 LTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEMGSALPPSCREPLSLIAKVTCNCFTIS 192

Query: 49  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 108
           + EL+ +G GLYP +S++NH C PN V+VFEG    +RAV+ +    E+ ISYIET   T
Sbjct: 193 DGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDIDPEDELTISYIETLSLT 252

Query: 109 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 168
             RQ+ L+EQY FTC C RC                  R KD    G +L   + K    
Sbjct: 253 EDRQRQLEEQYHFTCHCQRC----------------DSRDKD----GLMLSGEEGKWRLL 292

Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 228
           ++  L R  E +K  ++   +L   +  L+S G      + YK+                
Sbjct: 293 KE-ALPR-LEGLKAESNWETLLESCSHLLSSVGGEVPEENLYKL---------------- 334

Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
            +  +  +   + L  W+EA++Y   T+P Y++ YP  HP+ G+Q    GKL+ +L + E
Sbjct: 335 -RITDMALDASVHLGRWEEAMSYGVTTLPAYRQYYPDPHPVHGIQLMRVGKLQHYLENIE 393

Query: 289 NAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
           +A+ +  +A EI++ THG + P   EL +K++E  +E  +  SS+
Sbjct: 394 DALDTFRQAYEIVKFTHGEDHPLTTELTMKMKECHSEMDHHSSSR 438


>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
           latipes]
          Length = 495

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 159/318 (50%), Gaps = 39/318 (12%)

Query: 8   QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           +L +  +I +L   +     +S  E     +K+ CN  TI + EL+ +G GLYP +S++N
Sbjct: 209 ELYMKTEIPDLKQRMTSELPLSCREALSIIAKVTCNCFTISDGELQEIGVGLYPSLSLLN 268

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           H C PN V+VF G    +RAVQ +  G E+ ISYIET   T  RQK L+EQY FTC C R
Sbjct: 269 HDCRPNCVMVFNGTELQLRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQR 328

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
           C         Q+  +L G                       ++    + KE + ++   +
Sbjct: 329 C-----SSKEQDGVMLSG-----------------------REAAWRQLKEALPRLEG-L 359

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 247
            + S     L SC N    + T    E   + LY       ++  +  +   + L  W+E
Sbjct: 360 QLESNWLELLESCSN----LLTSAGEEVPDENLYK------LKITDMALDASVHLGRWEE 409

Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 307
           AL++   T+P Y+  Y   HP+ G+Q    GKL+  LG  E+A++S  EA +IL++THG 
Sbjct: 410 ALSFGLKTLPSYRLYYADPHPVHGVQLMRVGKLQHHLGCIEDALESFKEAFKILKLTHGE 469

Query: 308 NSPFMKELILKLEEAQAE 325
           + P + +L++K+EE ++E
Sbjct: 470 DHPLIVDLMMKMEECRSE 487


>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
          Length = 441

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 89/318 (27%), Positives = 160/318 (50%), Gaps = 27/318 (8%)

Query: 9   LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY----PVIS 64
           ++L   + +    ++  P  S+ E+   F ++  N+ +IC+ E+ P+G G+Y       S
Sbjct: 131 MVLMTTLKSFSENVMSMP--SVEELFVIFGRVCVNSFSICDPEMNPIGVGVYIRQIKPCS 188

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
           +++HSC PNAV VFEG    +R V+ V    ++ ISYI+T   T TR++ L++QY F C 
Sbjct: 189 VLDHSCRPNAVAVFEGTTLRIRCVEPVDSEQDLRISYIDTLDDTTTRRRNLQQQYYFNCL 248

Query: 125 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG-------LVRSK 177
           C      G+  D ++  I   + C    C G + +   D  F+C  CG       L ++ 
Sbjct: 249 C------GECKDSEKDLIKFSFNCTSVECKGHVTQVPADDRFSCDICGTTVDDVNLNQAA 302

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
           E+ +KI   +N L K+         H+ +           + + HP++V+ ++  +  + 
Sbjct: 303 EKAQKIIKHLNELKKQR-------EHESIRMKGLSCVSEMRAILHPYNVHFIKLCDLSLD 355

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
             +E   W +AL   +LT+  Y+  Y + HP +G+  +  GK+  +L D E A++ + +A
Sbjct: 356 ACIETSQWDDALELGRLTVVAYRFYYGELHPSVGILLFKIGKILSYLADRE-ALQFLKQA 414

Query: 298 VEILRITHGTNSPFMKEL 315
             +LR+THG +    K+L
Sbjct: 415 ATVLRVTHGDDHSLYKDL 432


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
           harrisii]
          Length = 428

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 152/293 (51%), Gaps = 39/293 (13%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E+F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG    +RA++++
Sbjct: 170 DIFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAIRNI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
           P G E+ I Y++    T  RQK LKEQY F C CP C    +  D     +L G      
Sbjct: 230 PLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDAD-----MLAGEEQAWK 284

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
              G L++  D           ++S+E+ +++ +    L      + +CGN     + Y+
Sbjct: 285 EIQGSLIKIED-----------LQSQEKWEQVLAMCQTL------INNCGNRLPDRNIYQ 327

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
                            ++  E  +   + L  W++AL Y   T+  Y+  YP FHP+ G
Sbjct: 328 -----------------LKMLECAMDACINLSLWEDALLYGSRTLEPYRLYYPGFHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
           +Q    GKL+   G    A++++ +A E++++THG +   +++L+L L + +A
Sbjct: 371 VQVMKVGKLQQHQGLYPQALETLKQAFELIKVTHGRDHSLIEDLMLLLGDCEA 423


>gi|413955565|gb|AFW88214.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 529

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 75  VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 255 TIPVYQRVY 263
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|413955566|gb|AFW88215.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 484

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 75  VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 255 TIPVYQRVY 263
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 149/283 (52%), Gaps = 17/283 (6%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           S++ CN  TICNSE++ +G G+YP +S++NHSC PN    F G+   +R +++   G E+
Sbjct: 172 SRVTCNTFTICNSEMQTVGIGVYPGLSLVNHSCSPNCSATFRGKQMQLRIIENTKIGDEL 231

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSG 155
           LISYI+      +RQ  L+ QY F C C RCI     D  ++S   +++  RC    C  
Sbjct: 232 LISYIDPMQVLSSRQNQLQSQYCFKCICERCI-----DTTKDSCNNLMDSVRCPKKICKA 286

Query: 156 FLLRDSDDKGFTCQQCGLVRSKE---EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
               DS      C +CG +  +    EI+   +++N    KT++L   G  Q  +   K 
Sbjct: 287 ASSLDSLLANKLCPECGSIVDQSFFAEIENFQAQIN----KTISL---GYQQAKLEDLKK 339

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
           +    K+     ++  ++  E L+   +E + ++EAL Y +     Y R+YP+++P+ G+
Sbjct: 340 LFIEGKQRLGECNMLYIRIIENLMDAYIESQRYEEALEYARRLEEPYCRLYPRYYPVTGV 399

Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
                GKLE +LG    A+KS+ +A EIL ++HG     + E+
Sbjct: 400 HLMKQGKLECYLGKFGEAVKSLGKAKEILLVSHGKECGLIHEM 442


>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 434

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 147/289 (50%), Gaps = 39/289 (13%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
            +K+ CN  TI + EL+ +G GLYP +S++NH C PN V+VFEG   ++RAV+ +  G E
Sbjct: 178 IAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLLLRAVRGLSPGEE 237

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           + ISYIET      RQ+ L++QY FTC C +C      D +  S     +R         
Sbjct: 238 LTISYIETLSLNEDRQQRLEDQYCFTCHC-QCCNSPDNDKLMLSGEESSWR--------- 287

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
           +L+++            + + EE+K  ++   ++      L++ G      + YK     
Sbjct: 288 VLKEA------------LPTLEELKAKSNWQTLVESCHRLLSTEGKRVPEENLYK----- 330

Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
                       ++  +  +   ++L  W EA+ Y Q T+PVY   YP  HP+ G+Q   
Sbjct: 331 ------------LRVTDMALDASIQLGLWTEAVEYGQKTLPVYHLYYPDPHPVHGVQLVR 378

Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
            GKL+ +L   E A+ +  +A  IL++THG ++    +L++K+EE +AE
Sbjct: 379 VGKLQHYLAHIEEALDTFKQAYRILKVTHGNDNSITTDLLMKMEECRAE 427


>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
           guttata]
          Length = 289

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 153/303 (50%), Gaps = 50/303 (16%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           P I   +I   F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG   ++
Sbjct: 28  PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPNCVIVFEGYQLLL 84

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
            +V+ +  G E+ ISY+E+   T  RQK L  QY F C CP C    Q  D ++ A  E 
Sbjct: 85  HSVRDIQIGEELTISYVESLMPTRERQKQLMRQYCFECDCPLC--QNQEKDAEKLAG-EV 141

Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
           +  K+       ++D+        + G  +SKEE K      ++L++    L S   H  
Sbjct: 142 HAWKE-------VKDA------VNEVGYPKSKEEWK------HVLARCQNLLRSNKGHLP 182

Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LEDWKEALAYCQLTIPVYQR 261
             + Y++                     K++   M+    LE W+EAL Y   T+  Y+ 
Sbjct: 183 DTNIYQL---------------------KMLDCAMDACINLESWEEALYYGSRTLEPYRL 221

Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
            YP FHPL  +Q    GKL++  G    A++++ +A  I+++THGT+   MK L+   EE
Sbjct: 222 YYPGFHPLRAVQLMRVGKLQYSQGMVPQALETLKQAYNIMKVTHGTDHSLMKVLMEMKEE 281

Query: 322 AQA 324
            +A
Sbjct: 282 CEA 284


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
          Length = 428

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 165/327 (50%), Gaps = 55/327 (16%)

Query: 8   QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLYP +S++N
Sbjct: 152 QLYLRAEIQDASHLP---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 205

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           HSC PN V++FEG   ++R+++ +  G E+ ISYIE+   T  RQK LK QY F C C  
Sbjct: 206 HSCDPNCVIIFEGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDC-- 263

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
           C+   Q  D ++ A  E              ++  D      +    +SKEE +K     
Sbjct: 264 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEK----- 304

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LE 243
            +L+K    L+S  +H    + Y++                     KL+   M+    LE
Sbjct: 305 -VLAKCQHLLSSHTSHLPDTNIYQL---------------------KLLDCAMDACINLE 342

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
            W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A+ ++ +A +I+++
Sbjct: 343 AWEQALCYGSRTLGPYRLYYPDFHPLRAVQLMRVGKLQYSQGMLPQALGTLKQAYDIMKV 402

Query: 304 THGTNSPFMKELILKLEEAQAEASYKL 330
           THGT+   M+ L + L+E Q EA  KL
Sbjct: 403 THGTDHSLMQAL-MDLKE-QCEAMMKL 427


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
           gallopavo]
          Length = 427

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 161/322 (50%), Gaps = 55/322 (17%)

Query: 8   QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLYP +S++N
Sbjct: 151 QLYLRAEIQDASHLS---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 204

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           HSC PN V++FEG   ++R+V+ +  G E+ +SYIE+   T  RQ+ LK QY F C C  
Sbjct: 205 HSCDPNCVIIFEGYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDC-- 262

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
           C+   Q  D ++ A  E              ++  D      +    +SKEE +K+ ++ 
Sbjct: 263 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEKVLAKC 308

Query: 188 -NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----L 242
            ++LS  T  L                             N+ Q   KL+   M+    L
Sbjct: 309 QHLLSSHTSRLP--------------------------DTNIYQL--KLLDCAMDACINL 340

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           E W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A++++ +A +I++
Sbjct: 341 EAWEQALCYGSRTLGPYRLYYPNFHPLRAVQLMRVGKLQYSQGMLPQALETLKQAYDIMK 400

Query: 303 ITHGTNSPFMKELILKLEEAQA 324
           +THGT+   M+ L+   E+ +A
Sbjct: 401 VTHGTDHSLMQALMDLKEQCEA 422


>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 282

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/98 (66%), Positives = 80/98 (81%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           +S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLY
Sbjct: 151 ISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLY 210

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           PV+SIINHSC+PNAVL+FEGR A VRA+Q + K  EV+
Sbjct: 211 PVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEEVM 248


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
           magnipapillata]
          Length = 1037

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 93/302 (30%), Positives = 153/302 (50%), Gaps = 7/302 (2%)

Query: 23  LQWPEISINEIAEN--FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           LQ   I+IN+I       K +CN+  I N+EL  LGTG++   S+ NHSC PN V  F G
Sbjct: 152 LQDVNININDIDIYGLMCKASCNSFAITNAELNSLGTGIFSSASLFNHSCDPNCVATFNG 211

Query: 81  RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
           R   +RA++ + +G E+++SYI    ++  RQ  L+E Y+FTC C  C +    D + +S
Sbjct: 212 RDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFTCKCTVCSRKEVNDSLMKS 271

Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 200
                 +C    C  FL+   +    +CQ+     S + I K    ++ L     +++  
Sbjct: 272 VKCSQPQC---LCMKFLITAPESNKCSCQK-NCEASNDYILKANDCMDKLQALYNSISVV 327

Query: 201 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 260
              ++  S  ++I   ++ L  P ++ L+Q  E  +   +E  DWK A  Y       Y+
Sbjct: 328 PTIEQQKSLTQLIRCGEEILCSP-NIALLQCYEVAMDGCIESGDWKGAFQYGIKLECSYK 386

Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
               ++HP LGL Y+  GKL   + D  N I  + +  ++L ITHG++  F+++L   L+
Sbjct: 387 NYLSEYHPTLGLHYFKLGKLALQIEDLRNGITYLEKGYKVLSITHGSSCHFVQKLKSYLD 446

Query: 321 EA 322
           EA
Sbjct: 447 EA 448


>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
 gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
          Length = 453

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 87/310 (28%), Positives = 153/310 (49%), Gaps = 17/310 (5%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           E+   + K+  N   I ++E+  +GTG+Y   SII+HSC PN V+ F+G    +R ++  
Sbjct: 154 ELLRIYGKMCINTFNILDAEMSTIGTGMYIGASIIDHSCRPNVVVSFDGETLRMRLLEDY 213

Query: 92  PKG----AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           P+      ++ ISYI+   +   RQ+ L E+Y F C C RC       D QE   +    
Sbjct: 214 PEQELDFGKLFISYIDLIDTAEVRQEQLAERYYFHCACERC------RDEQEQKRMNAAA 267

Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
           C +  C   L     ++   C  CG   +  + +  A E++  ++  LA      + +V 
Sbjct: 268 CPNTTCHEPLDFSDSEQLNQCPACGTAVTHSDREAFA-EISSFTRDHLAQMKSVAYLDVS 326

Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
              ++  + Q  + H ++V+ ++T +  ++  + LE W EA  Y    +  +++ Y  +H
Sbjct: 327 ---RLCLEKQANVLHRYNVHHIKTLDNAMESALNLEKWTEATGYGLRLLDGFRQYYSTYH 383

Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL--EEAQAE 325
           PLLGL Y   GKL+ +      A+K + +A +ILR+THG      K +++ L  + AQ +
Sbjct: 384 PLLGLTYLKVGKLQLYQCQFAEALKQLQQAAKILRVTHGEQDDLYKRVLVPLLCDAAQGD 443

Query: 326 ASYKLSSKDE 335
             + L+  DE
Sbjct: 444 LGH-LAIADE 452


>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ornithorhynchus anatinus]
          Length = 415

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 136/274 (49%), Gaps = 39/274 (14%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           I E+F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG    +RAV+ + 
Sbjct: 145 IFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGTSLFLRAVREIQ 204

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           KG E+ I Y++    +  RQK LKEQY F C C RC    Q  D+   A  E +      
Sbjct: 205 KGEELTICYLDVLLPSQERQKQLKEQYCFACDCIRCKT--QDKDVDMLAGEEPF------ 256

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                                       K++   V+ + +    L S    ++V++T + 
Sbjct: 257 ---------------------------WKEVKDAVDTVEE----LQSQKKWEQVLATCQA 285

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
           +    K      ++  ++  +  +   + L  W++AL Y + T+  Y+  YP FHP+  +
Sbjct: 286 LINNHKDRIPDRNIYQLKMLDCAMDACINLSLWEDALLYGRRTLDPYRLYYPGFHPIKAI 345

Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           Q    GKL+   G    A++++ EA ++L++THG
Sbjct: 346 QIMKIGKLQQHQGMFREALETLREAFDLLKVTHG 379


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 154/331 (46%), Gaps = 42/331 (12%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----FSKLACNAHTICNSELRP 54
           + D+ E++    +Q+++++ L +Q   P I+    A +     +KL CN  TI + ELR 
Sbjct: 131 LCDMGEERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLLAKLTCNCFTISDGELRE 190

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G GLYP +S++NH C P+ V++FEG    +RAV+ +    E+ ISYI T   T  R+  
Sbjct: 191 IGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEELTISYIGTLAPTRDRRTQ 250

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
           L+EQY FTC C RC      D +      + +    +            + +  Q C   
Sbjct: 251 LEEQYHFTCQCQRCTT-ADMDPVMLCGEEKAWTPLREAIPTL-------ENWQTQACW-- 300

Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 234
              E + + +S ++ L     A+  C                        +V  ++  + 
Sbjct: 301 --NELLYECSSLLDPLDGDVPAVPDC------------------------NVYRLRVTDL 334

Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
            +   + L  W  ALAY  +T+  Y++ YP  HP   +Q     KL+ FL    +A  ++
Sbjct: 335 ALDACINLAAWDTALAYGTITLEPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLGDAQHTL 394

Query: 295 TEAVEILRITHGTNSPFMKELILKLEEAQAE 325
             A +I+++THG   P   +LI KLEE +AE
Sbjct: 395 RLAYDIMKVTHGNQHPLTSDLIRKLEECRAE 425


>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
          Length = 520

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 56/82 (68%), Positives = 70/82 (85%)

Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
           L++I MEL+DW+ AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+
Sbjct: 367 LLQIYMELQDWQTALMYCRLTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSL 426

Query: 295 TEAVEILRITHGTNSPFMKELI 316
           T A ++LRITHGT S FMKEL 
Sbjct: 427 TRAADVLRITHGTQSQFMKELF 448



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 45/102 (44%), Gaps = 44/102 (43%)

Query: 14  QIANLVNLILQWPEISINEIAENFSK---------------------------------- 39
           ++ANLV LIL   E+ + EIA  FSK                                  
Sbjct: 131 KMANLVQLILPSIELDLKEIAHTFSKSFVCTEVGIAYCHVLEILAVIGSLRPNPLGLGSA 190

Query: 40  ----------LACNAHTICNSELRPLGTGLYPVISIINHSCL 71
                      ACNAHTIC+ ELRPLGTGL+P ISIINH  L
Sbjct: 191 VIAVHPTQVQFACNAHTICDPELRPLGTGLFPAISIINHRIL 232


>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Monodelphis domestica]
          Length = 441

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 39/293 (13%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E+F+K+ CN   I N E++ +G GLYP +S++NHSC PN V+VFEG    +RAV+ +
Sbjct: 183 DIFESFAKVICNGFAISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAVRDI 242

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
            +G E+ I Y++    +  RQK LKEQY F C CP C                       
Sbjct: 243 QQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCET--------------------- 281

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  R  DD   + ++      ++ +KK+  ++    K    L  C   Q +++ Y 
Sbjct: 282 -------RSKDDDMLSGEEQAWKEVQDSLKKV-EDLRAQEKWEQILAIC---QTLINNYG 330

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
             ++L  +     ++  ++  +  +   + L  W++AL Y   T+  Y+  Y  FHP+ G
Sbjct: 331 --DRLPDR-----NIYQLKMLDCAMDACINLCLWEDALLYGSRTLEPYRLYYSGFHPISG 383

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
           +Q    GKL+   G    A++++ +A E++++THG +   +++L+L L + +A
Sbjct: 384 VQVMRVGKLQHHQGLYPQALETLKQAFELMKVTHGRDHSLVEDLLLLLGDCEA 436


>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           impatiens]
          Length = 439

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 154/307 (50%), Gaps = 24/307 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +I NS++  +G G+Y   
Sbjct: 127 DEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNSDMN-IGVGIYLGP 185

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKEQYLF 121
           SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+   +T  R+  L+  Y F
Sbjct: 186 SILDHSCKPNAVATFEGTTITVKAIEDLPSLDLSQIRIPYIDVIKTTRDRRAELQSSYYF 245

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
            C C RC K           + E   C +  C+     ++D     C++C   +  E  K
Sbjct: 246 WCDCERCEK--------PEPMAEAAACPNKLCTYPCAPNAD----LCEKCN-TKFPENFK 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +I  E++  +   L      N + +  +   KM    Q+ + HP +V  +QT +      
Sbjct: 293 EIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTAFDSS 347

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
           + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE +L   + AIK++ +A  
Sbjct: 348 INLQHWEEAESYAKKLINGYLTYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLRKASS 407

Query: 300 ILRITHG 306
           IL ITHG
Sbjct: 408 ILTITHG 414


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E++ K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
           +Q    GKL+   G    A++++  A +I+R+THG      ++LIL LEE  A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMQNLRLAFDIIRVTHGREHSLTEDLILLLEECDA 423


>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPNINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
           +Q    GKL+   G    A+K++  A +I+++THG     +++LI+ LEE  A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHGREHSLIEDLIVLLEECDA 423


>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
          Length = 455

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/334 (27%), Positives = 159/334 (47%), Gaps = 21/334 (6%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISI-NEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           D K++  +  +  ++  +L  P I   +E+   F K+  N+  +C+ E+  +GTG+Y   
Sbjct: 126 DVKRIEHFHTLNVVLQRLLDEPAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGA 185

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG----AEVLISYIETAGSTMTRQKALKEQY 119
           SI++HSC PNAV  F G    +R ++         + + ISYI+    + TR++ L E+Y
Sbjct: 186 SILDHSCRPNAVATFVGEQLQLRLLEDFAGPELDFSRIFISYIDLIDPSDTRREQLSERY 245

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C C RC       D  E  ++    C++  C    +RD       C+        + 
Sbjct: 246 YFRCECVRC------RDEAERELMGAAACQNRKCDE-PIRDGQTLCSACE----APFDQS 294

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
            +    EV   ++  LA      + +V    ++  + Q  + HP +   ++T +      
Sbjct: 295 ARDRFDEVTSFTRDRLAEMKDVAYFDVC---RLCLEKQSGVLHPLNAQHIKTLDYAFDSA 351

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
           ++LE W+ AL Y    +  Y+R  P  +PLLGL     GK++ +LGD + A+ S+ EA +
Sbjct: 352 IKLEKWEAALRYGAGAVAGYRRYSPS-NPLLGLMLANIGKIQLYLGDAKTALSSLHEADK 410

Query: 300 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 333
           ILR+THG      K+ ++ L    A   Y+LS +
Sbjct: 411 ILRVTHGEQHDLYKDQLVPL-LCDAAQQYELSQR 443


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 146/295 (49%), Gaps = 43/295 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TIC++E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C                    +RC+  
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDC--------------------FRCR-- 267

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 ------TQDKDADMLTGNEQAWKEVQESLKKI-EELKAHRKWEQVLAMC---QTILSSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L          N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 318 --EQLP-------DTNIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
            G+Q    GKL+   G    A+K++  A +I+R+THG     +++LI  LEE  A
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREHSLIEDLIRLLEECDA 423


>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 378

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 142/285 (49%), Gaps = 30/285 (10%)

Query: 40  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEV 97
           +  N   I +SE++ +G+GLY   SII+HSC PNAV VF+G    +R ++ +P    ++V
Sbjct: 108 MVINTFNILDSEMQSIGSGLYLASSIIDHSCSPNAVAVFKGTTIFIRTLEDIPIMDWSKV 167

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
            ISYIE       RQ+ L   Y F C C +C       D      ++   C+++ C  FL
Sbjct: 168 FISYIELLNLPEIRQQELLSSYYFLCQCSKCT------DSDNLNFMKSIYCQNEKCKNFL 221

Query: 158 LRDSDDKGFTCQQCGLVRSKE---EIKKI----ASEVNILSKKTLALTSCGNHQEVVSTY 210
           + +  +    C +CG   SKE   EIK+I     SE+N +            +   +   
Sbjct: 222 MPNEVE----CHKCGQSISKEDEDEIKEIIQYTESELNKME-----------NMAYLDIC 266

Query: 211 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 270
           ++  + Q+K   P ++  ++  +  ++  + LE+W+ A+ Y +  +  + + Y ++HPL 
Sbjct: 267 RICLQKQEKYLTPLNIYKVKILDLAMESSIALENWESAVFYGRQLLEPFLKYYGEYHPLR 326

Query: 271 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           G+     GK+ + L + + A K   EA  IL +THG N  F +EL
Sbjct: 327 GIFLMKLGKILFLLNNIDEAKKYFKEANLILSVTHGKNHCFSREL 371


>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
 gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
 gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
 gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
          Length = 468

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 159/318 (50%), Gaps = 23/318 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++ + P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PN
Sbjct: 166 AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 225

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +ES  +    C +  C   +  D ++    C +C    S + ++   +E   L+
Sbjct: 284 ----DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLT 334

Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           +  L      N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL
Sbjct: 335 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 389

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG + 
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDH 449

Query: 310 PFMKE-LILKLEEAQAEA 326
             + E L + + +A+ EA
Sbjct: 450 RLLTEQLYMLVLQARQEA 467


>gi|21430850|gb|AAM51103.1| SD20045p [Drosophila melanogaster]
          Length = 382

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 159/318 (50%), Gaps = 23/318 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++ + P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PN
Sbjct: 80  AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 139

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 140 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 197

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +ES  +    C +  C   +  D ++    C +C    S + ++   +E   L+
Sbjct: 198 ----DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLT 248

Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           +  L      N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL
Sbjct: 249 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 303

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG + 
Sbjct: 304 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDH 363

Query: 310 PFMKE-LILKLEEAQAEA 326
             + E L + + +A+ EA
Sbjct: 364 RLLTEQLYVLVLQARQEA 381


>gi|224033847|gb|ACN35999.1| unknown [Zea mays]
 gi|413955568|gb|AFW88217.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 129

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 56/98 (57%), Positives = 73/98 (74%)

Query: 185 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 244
           S++  LS K  +  S GN  E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+D
Sbjct: 3   SKILQLSDKASSFLSSGNKAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQD 62

Query: 245 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
           W  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW
Sbjct: 63  WWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEW 100


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 137/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K LK+QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRCET--------------------- 268

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +    + +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 269 -------QDKDADMLTGDEQVWRQVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q R  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINVYQLRVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
          Length = 391

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 90/340 (26%), Positives = 164/340 (48%), Gaps = 26/340 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + I+ L +   +  E    E +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 52  NEKKDLIQSDISALHHFYSKHLEFPDQESLVALFAQVNCNGFTIEDEELSHLGSAIFPDV 111

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 112 ALMNHSCCPNVIVTYKGTLAEVRAVQEIGPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTC 171

Query: 124 TCPRC---------IKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
            C  C         +K+ + +D  ++ AI +  R   +    F  R +    +    CG 
Sbjct: 172 ECQECTTKDKDKAKVKIRKLNDPPKAEAIRDMVRYARNVIEEF--RRAKHYKYILDSCGR 229

Query: 174 --VRSKEEIKKIASEVNILSKKT--------LALTSCGNHQEVVSTYKMIEKLQKKLYHP 223
              R   E+ +  + V++L  KT        L L     H       ++ E  Q+K+   
Sbjct: 230 SPARGAGELGQELNVVDVLLSKTTENGSLKVLILGHYFGHTPPSELLEICELSQEKMSSV 289

Query: 224 F---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
           F   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L
Sbjct: 290 FEDSNVYMLHMMYQAMGVCLYIQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRL 349

Query: 281 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
              L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 350 YMGLENRAAGEKALRKAMAIMEVAHGKDHPYISEIKQEIE 389


>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
 gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
          Length = 442

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 151/301 (50%), Gaps = 24/301 (7%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           E+   + ++  N+  I + EL+ +GTG+Y   S+I+HSC PNAV +F+G +  +RA+Q  
Sbjct: 154 ELMGMYGRMCINSFNIIDQELQCIGTGMYLGASVIDHSCSPNAVAIFDGPILSIRALQTF 213

Query: 92  P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
                +++ ISYI+   +T  RQ  L+  Y F C CP+C++        E   +    C 
Sbjct: 214 QYLDWSQIKISYIDILNTTKDRQSELEAAYYFLCKCPKCLE-------PEPPEINAAACP 266

Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA-----SEVNILSKKTLALTSCGNHQ 204
           ++ C   +  +    G  C +C  V S+  +K+       +++++ + K LA        
Sbjct: 267 NEKCDNHIDTEIITPGDKCAKCDTVVSETFLKRFKEVIEFTDLHLQNMKQLA-------- 318

Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
                 ++  K Q+ + H F++  ++T +   +  ++ + W  A  +    +  + + Y 
Sbjct: 319 -YFDVCEICLKKQEGVLHKFNIKHVKTLDLAFQSSIDFQKWDFARKFALELVDAFYKYYG 377

Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK-ELILKLEEAQ 323
             HP+ GL +   GK+  F  + + A++ +T+A +IL+I HG  S   K EL+  L++A+
Sbjct: 378 HVHPITGLLHLKLGKILLFEENDQLALEHLTKAYQILKIIHGVGSHLFKDELVPLLQQAR 437

Query: 324 A 324
           A
Sbjct: 438 A 438


>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
 gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
          Length = 439

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 159/317 (50%), Gaps = 19/317 (5%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++   P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PN
Sbjct: 137 AVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 196

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    + A++ +P    +++ ISYI+   +   R++ LK+ Y F C C +C   
Sbjct: 197 AVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC--- 253

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +E+  +    C +  CS  +  + ++    C++C +  S + ++   +E+  L+
Sbjct: 254 ---RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALT 305

Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
           K  L       + +V    K+    Q  L HP +V  ++T +   +  +E+  W+EAL Y
Sbjct: 306 KHNLEAMKDVAYLDVC---KVCLDKQAGLMHPLNVWHVKTLDAAFEAAIEVGKWEEALDY 362

Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
            Q  +P +Q  + Q++PL+GL +   GK++ F    + A   + EA  IL +THG +   
Sbjct: 363 GQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRL 422

Query: 312 MKELILKL-EEAQAEAS 327
           + E +  L  +A+ EA+
Sbjct: 423 LVEQLYPLIFQARQEAN 439


>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 445

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/326 (27%), Positives = 162/326 (49%), Gaps = 29/326 (8%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           DEK++  +  +  ++  ++    +  N E+   F K+  N+  I + E+  +GTG+Y   
Sbjct: 125 DEKRMEHFQSLTVVLRSLIDEAAMPGNQELLRIFGKMCINSFNILDDEMNSIGTGMYLGA 184

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQ 118
           SI++HSC PNAV +F+G    VR ++    GAE+      ISYI+    T  R+  L+++
Sbjct: 185 SIMDHSCRPNAVAIFDGCNLNVRLLEDY-HGAEIDFSKIFISYIDLLNPTDVRRDMLRKR 243

Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSK 177
           Y F C C RC       D QE  ++ G  C +  C   + +    D+   C        +
Sbjct: 244 YYFECGCERC------RDEQELKLMNGAACANAQCDEPISMTARQDRCPGCSTAIKQTER 297

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL----QKKLYHPFSVNLMQTRE 233
           ++ ++I++           +T     ++V  TY  I +L    Q+K++HPF+V  ++T +
Sbjct: 298 DKFREISA---------FTMTQLEQMKDV--TYLDICQLCLSKQEKVFHPFNVWYLKTLD 346

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              +  +E+E W++A+ Y       ++R    FHPL  L     GK+E +L   + A+ +
Sbjct: 347 LAFESAIEMEKWEDAIDYGSRLKDGFKRFNGAFHPLYALLLLKLGKIEIYLKHGKEALMN 406

Query: 294 MTEAVEILRITHGTNSPFMKELILKL 319
           + E+ +ILRITHG      K+ ++ L
Sbjct: 407 VNESEKILRITHGEEHDLYKKQLIPL 432


>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
           mellifera]
          Length = 440

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 158/313 (50%), Gaps = 27/313 (8%)

Query: 2   SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 56
           SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+  I + ++  +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 114
            G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+   +T  R++ 
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
           L+  Y F C C +C         +ES  ++E   C +  C+     D+D     C+ C  
Sbjct: 240 LQSSYYFWCNCKKC---------EESEPMVEAAACPNKFCTYPCSLDAD----MCENCN- 285

Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
            +  E  K+   E++ L+   L       + +V +   M  K Q+ + HP ++  +QT +
Sbjct: 286 TKFPENFKETFYEISDLTAYHLQNMKNIAYLDVST---MCLKKQEGVLHPLNIQHVQTLQ 342

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
                 + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ +L   + AI++
Sbjct: 343 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKKLKQAIET 402

Query: 294 MTEAVEILRITHG 306
           + +A  IL ITHG
Sbjct: 403 LRKASAILTITHG 415


>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
          Length = 352

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 94  DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 153

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 154 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 190

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 191 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 241

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 242 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 292

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 293 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 329


>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
 gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
          Length = 390

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 32/278 (11%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           + K  CN   I N +LR +G GLYP  ++INHSC  N V  F G    +RA+  +  G E
Sbjct: 106 YGKTTCNCFAIHNLDLREIGVGLYPQAAMINHSCKSNCVSTFRGPTLQIRALVDIQPGEE 165

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           V  SY E    T  R+  L+ +Y F C CP C+      D    AI++  +C    C G 
Sbjct: 166 VCYSYTEKGNVTHERRDELR-KYFFECQCPHCL------DTDRDAIMKSVKCP--SCQGQ 216

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKK-----IASEVNILSKKTLALTSCGNHQEVVSTYK 211
           +   S D+   C  CG      E  +     I  E ++L ++                  
Sbjct: 217 VKPTSSDRYEKCSSCGFTDFTTEFYEDLEIYIHVEFDLLFRENC---------------- 260

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
           ++E    K+ HP ++++++         ++LE+W +A+ Y +     +    P   P  G
Sbjct: 261 LVE--LDKILHPDNIHVVRILVGAFAASVKLEEWTKAIDYGKRLDRAFGLYLPPNEPDTG 318

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
           L YY  GK  + L D ENA+ S+ +A  +L I +G +S
Sbjct: 319 LLYYKMGKAYYHLDDIENAVTSLRKAKTLLSIAYGRDS 356


>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
          Length = 429

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/338 (26%), Positives = 155/338 (45%), Gaps = 48/338 (14%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNAHTICNSELRP 54
           + D+ E++    +Q+++++ L L+  +  I +   +F      +K+ CN  TI ++EL+ 
Sbjct: 131 LCDMGEEKREGLSQLSSMLQLYLKQEQPDIIQKVPSFDPISLLAKVTCNCFTISDAELQE 190

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G GLYP +S++NH C P+ V++F+G+   +RA++ +    EV ISYI     T  RQ  
Sbjct: 191 IGVGLYPSMSLLNHDCRPSCVMLFQGKTLQLRAIRDIQPTEEVTISYIGVLLPTRERQTQ 250

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
           L EQY F+C C  C    + D +                                 CG+ 
Sbjct: 251 LMEQYHFSCQCGLC-STAELDPLMF-------------------------------CGVK 278

Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTR- 232
            +   +K+    + IL          GN +E++     ++  +   +      N+ + R 
Sbjct: 279 EAWTPMKEAIPRLEILQTN-------GNWEELLQECSSLLAPVGGAVPAVPDSNVYRLRV 331

Query: 233 -EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
            +      + L  W+ ALAY   T+  Y++ YP  HP   +Q     KL+ FL   E+A 
Sbjct: 332 TDLAFDACINLARWETALAYGLKTLGPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLEDAQ 391

Query: 292 KSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYK 329
            ++  A +I++ITHG       +LI KLEE  AE   K
Sbjct: 392 HTLRLAYDIMKITHGNEHSLTSDLIRKLEECLAEMDCK 429


>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
 gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
          Length = 466

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/318 (27%), Positives = 157/318 (49%), Gaps = 23/318 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++ + P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PN
Sbjct: 164 AVLSDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 223

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 224 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 281

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +ES  +    C +  C   +  D  +    C +C    S + ++   +E   L+
Sbjct: 282 ----DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISPK-LRNAFNEAMTLT 332

Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           +  L      N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL
Sbjct: 333 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 387

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +P +++ +  ++PLLGL +   GK++ + G  + A+  + EA  IL +THG + 
Sbjct: 388 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHAKEALHHLEEAQRILTVTHGRDH 447

Query: 310 PFMKE-LILKLEEAQAEA 326
             + E L L + +A+ EA
Sbjct: 448 RLLTEQLYLLVLQARQEA 465


>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
           [Equus caballus]
          Length = 358

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 100 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCEPNCSIVFNGPHLLLRAVRDI 159

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 160 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 196

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 197 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 247

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 248 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 298

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 299 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 335


>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
          Length = 287

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 29  DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 88

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 89  EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQT--------------------- 127

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 128 -------QDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 176

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 177 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 227

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 228 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 264


>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
 gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
          Length = 469

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 159/320 (49%), Gaps = 23/320 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++   P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PN
Sbjct: 166 AVLTDMMADSPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPN 225

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +ES  +    C +  C   +  D  +    C +C    S + ++   ++   L+
Sbjct: 284 ----DAKESKEMLSALCPNRNCGVGITVDRTN----CPRCDAGISPK-LRNAFNDAMTLT 334

Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           +  L      N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL
Sbjct: 335 RHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWTDAL 389

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG + 
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGRSKEALHHLEEAQRILTVTHGRDH 449

Query: 310 PFMKE-LILKLEEAQAEASY 328
             + E L + + +A+ EA++
Sbjct: 450 RLLTEQLYVLILQARQEAAH 469


>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
          Length = 369

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 257

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 258 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 309

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3 [Equus caballus]
          Length = 440

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 182 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 241

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 242 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 278

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 279 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 329

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 330 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 380

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 381 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 417


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
 gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
 gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
          Length = 369

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
           leucogenys]
          Length = 369

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----------------------- 207

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           terrestris]
          Length = 439

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 150/308 (48%), Gaps = 24/308 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +I N ++  +G G+Y   
Sbjct: 127 DEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNLDMN-IGVGIYLGP 185

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKEQYLF 121
           SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+   +   R+  L+  Y F
Sbjct: 186 SILDHSCKPNAVATFEGTTINVKAIEDLPSLDLSQIRIPYIDVIKTAGDRRAELQSSYYF 245

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
            C C +C K           + E   C +  C+       D     C++C   +  E  K
Sbjct: 246 WCDCEKCEK--------PEPMAEAAACPNKLCT----YPCDPNADLCEKCN-TKFPENFK 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +I  E++  +   L      N + +  +   KM    Q+ + HP +V  +QT +      
Sbjct: 293 EIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTAFDSS 347

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
           + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE +L   + AIK++ +A  
Sbjct: 348 INLQHWEEAESYAKKLINGYLAYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLRKASL 407

Query: 300 ILRITHGT 307
           IL ITHG 
Sbjct: 408 ILTITHGA 415


>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
          Length = 373

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 115 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 174

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 175 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 211

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 212 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 262

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 263 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 313

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 314 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 350


>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
           aries]
          Length = 369

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 170

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 207

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNA 258

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 309

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
          Length = 441

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 144/299 (48%), Gaps = 34/299 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           +I E+   + ++  N+  I +  +  +G G+Y   S+I+HSC PN ++VFEG   +VR +
Sbjct: 148 NIAELMGIYGRICTNSFNILDINMNTIGVGIYLGASVIDHSCKPNVIVVFEGTTIIVRTL 207

Query: 89  QHVPK--------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
             +P           ++ ISY++   S   R++ L   Y F C C RC K        E 
Sbjct: 208 TDLPSLDWSQASIDKDIRISYVDLLNSNKDRREELHSSYYFWCDCERCKK--------EE 259

Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG---LVRSKEEIKKIAS-EVNILSK-KTL 195
            ++E   C +  C      ++D+    C++C     V  KE  +++    ++ L K KT+
Sbjct: 260 PMVEAAACPNLLCDSPCSIEADE----CEKCNKEISVEFKETFREVVDFTIHHLEKMKTM 315

Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
           A          +   K+  K QK + H F++  ++T E      M L+ W++A  Y +  
Sbjct: 316 AY---------LDVSKICLKKQKGIMHKFNIQHVRTLEMAHIAAMNLKCWEDAEFYGKEL 366

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
           +P Y   Y + HPL GL Y T GK++  L   + A +++T+A  +L ITHG     ++E
Sbjct: 367 VPGYLLYYGEIHPLTGLLYLTVGKIQLHLEKPKEAFQALTKANTVLTITHGDKHSIVEE 425


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 172 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 231

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 232 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 268

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 269 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 319

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 320 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 370

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 407


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSN- 316

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 231 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 267

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 318

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 369

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 370 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 406


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 206 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 265

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 266 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 302

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 303 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 353

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 354 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 404

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 405 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 441


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
          Length = 390

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 147/320 (45%), Gaps = 25/320 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + I+ L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 90  NEKKDLIQSDISALHHFYSKHLEFPDNKSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    K    +   +R   D       C +VR     + +
Sbjct: 210 ECQECTT------------------KHKDKAKVKIRKLSDPPKAEAICDMVRY---ARNV 248

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILM 240
             E          L  C          ++ E  Q+K+   F   +V ++    + + + +
Sbjct: 249 IEEFRRAKHYKYILYRCPGVAPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCL 308

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
            ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +AVEI
Sbjct: 309 YMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALRKAVEI 368

Query: 301 LRITHGTNSPFMKELILKLE 320
           + + HG + P++ E+  ++E
Sbjct: 369 MEVAHGKDHPYISEIKQEIE 388


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALLYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 143/277 (51%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G      
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                     D++ +   Q       E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 279 ----------DEQVWKGVQ-------ESLKKI-EELKTHWKWEQVLAMC---QSIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 138/275 (50%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC       + Q+            
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC-------ETQD------------ 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
          Length = 386

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 151/319 (47%), Gaps = 31/319 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 96  NEKRELIQSDIAALHHFYSKHLEYPDNCSLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 155

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   LK+ Y FTC
Sbjct: 156 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTC 215

Query: 124 TCPRC-IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
            C  C  K      ++   + E  + +D       +RD            +V+    + +
Sbjct: 216 ECRECTTKAKDKAKVEIRKLSEPPKAED-------IRD------------MVKYARNVIE 256

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
                      +  L  C   QE +S+          L+   +V ++    + + + + +
Sbjct: 257 EFRRAKHYKSPSELLEICELSQEKMSS----------LFEDSNVYMLHMMYQAMGVCLYM 306

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L    + +K++ +A+ I+ 
Sbjct: 307 QDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKASGVKALKKAIVIME 366

Query: 303 ITHGTNSPFMKELILKLEE 321
           + HG + P++ E+  ++E+
Sbjct: 367 VAHGKDHPYISEIKKEIED 385


>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
           boliviensis boliviensis]
          Length = 369

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDI 170

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++T  ++  R+K L++QY F C C RC    Q  D               
Sbjct: 171 EVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRC----QTQD--------------- 211

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E++ K   EV    K    L +    ++V++  +
Sbjct: 212 -------KDAD----------MLTGDEQVWK---EVQKSLKTIEELKAHWKWEQVLAMCQ 251

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I     +     ++  ++  +  +   + L   ++AL Y   T+  Y+  +P  HP+ G
Sbjct: 252 AIISSNSERLPDINIYQLKVLDCAMDACIHLGLLEKALFYGTRTMEPYRIFFPGSHPVRG 311

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 312 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ + Y++   ++  R+K L+ QY F C C RC                       
Sbjct: 230 ETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSN- 316

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACIHLGLLEEALFYGLRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 83/277 (29%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ T+CN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----------------------- 266

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 141/275 (51%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDV 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G      
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                     D++ +          +E +K I  E+    K    L  C   Q ++S+  
Sbjct: 279 ----------DERVWK-------EVQESLKHI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
             E+L        ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 318 --ERLPD-----LNIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
 gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
          Length = 448

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 142/292 (48%), Gaps = 20/292 (6%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V A++ +
Sbjct: 164 ELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHAIEDL 223

Query: 92  P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
           P    +++ ISYI+   +   R+  LKE Y F C C +CI      D QE   +    C 
Sbjct: 224 PCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAAVCP 277

Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--V 207
           +  C   +  D       CQ C       E++   +E+  L++ +L      + +EV  +
Sbjct: 278 NASCDASVNIDL----AKCQSCD-ASVTPELRSAYNEIMSLTQSSL-----DSMKEVAYL 327

Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
              K+    Q+ + HP +V  ++T +   +  + +  W +AL Y Q  +P + + +  ++
Sbjct: 328 DVCKVCLAKQRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALEYGQQLLPGFAKYHGAWN 387

Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
           PLLGL +   GK++ +  + + A+  + EA  IL +THG +   + E +  L
Sbjct: 388 PLLGLLHMKLGKIQLYERNYKQAVHHLQEAQRILNVTHGRDHRLLLEQLRPL 439


>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
           griseus]
          Length = 428

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTHD--------------- 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIISSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 391

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 149/318 (46%), Gaps = 30/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 100 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 159

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 160 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 219

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKK 182
            C  C                    KD   +   +R   D         +VR ++  I++
Sbjct: 220 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 261

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
                +  S  +  L  C   QE +S+          ++   +V ++    + + + + +
Sbjct: 262 FRRAKHYKSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 311

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ 
Sbjct: 312 QDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIME 371

Query: 303 ITHGTNSPFMKELILKLE 320
           + HG + P++ E+  ++E
Sbjct: 372 VAHGKDHPYISEIKQEIE 389


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC         Q+            
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC-------PTQD------------ 270

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
          Length = 432

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 74/319 (23%), Positives = 149/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + +A L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKRDLIQSDVAALHHFYSKHLEFPDT--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              +D D      ++       E I+
Sbjct: 261 TCQCQECT----------------------------TKDKDKAKVEIRKLSDPPKAETIR 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           ++      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 293 EMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGSVFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L D     K++ +A+ I+
Sbjct: 353 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431


>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
           [Ailuropoda melanoleuca]
          Length = 385

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 146/319 (45%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 95  NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 152

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 153 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 212

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                    KD   +   +R   D         +VR    + 
Sbjct: 213 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 254

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE + +          ++   +V ++    + + + + 
Sbjct: 255 EEFRRAKHYKSPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLY 304

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L D     K++ +A+ I+
Sbjct: 305 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIM 364

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 365 EVAHGKDHPYISEIKQEIE 383


>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
          Length = 386

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 151/325 (46%), Gaps = 39/325 (12%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 90  NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 147

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 148 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 207

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 177
           TC C  C                    KD   +   +R   D         +VR      
Sbjct: 208 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 249

Query: 178 EEIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
           EE ++      IL  S  +  L  C   QE + +          ++   +V ++    + 
Sbjct: 250 EEFRRAKHYKYILYNSPPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQA 299

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
           + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L D     K++ 
Sbjct: 300 MGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALK 359

Query: 296 EAVEILRITHGTNSPFMKELILKLE 320
           +A+ I+ + HG + P++ E+  ++E
Sbjct: 360 KAIAIMEVAHGKDHPYISEIKQEIE 384


>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
          Length = 433

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 145/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKELIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R  +D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETVRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
 gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
          Length = 468

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 154/318 (48%), Gaps = 23/318 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++   P    N  E+   + +L  N   + ++++  + T +Y  +SI +HSC PN
Sbjct: 166 AVLTDMMADSPSTVPNKTELMSIYGRLITNGFNVLDTDMNSIATAIYLGVSITDHSCQPN 225

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 226 AVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 283

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +ES  +    C +  C   +  D  +    C +C    S     K+ +  N   
Sbjct: 284 ----DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISP----KLRNAFN--D 329

Query: 192 KKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
              L L +  N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL
Sbjct: 330 AMALTLHNLENMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDAL 389

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +P +++ +  ++PLLGL +   GK++ + G  + A+  + EA  IL +THG + 
Sbjct: 390 DYGQRLLPGFRKYHGPWNPLLGLVHMKLGKIQLYTGQCKEALHHLEEAQRILTVTHGRDH 449

Query: 310 PFMKE-LILKLEEAQAEA 326
             + E L L + +A+ +A
Sbjct: 450 RLLTEQLYLLILQARQQA 467


>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
          Length = 349

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 59  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 118

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 119 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 178

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 179 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 220

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 221 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 270

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 271 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 330

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 331 AHGKDHPYISEIKQEIE 347


>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
          Length = 433

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 148/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ I
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEI 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
          Length = 433

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D     ++RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEVIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
           gorilla]
          Length = 384

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 94  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 153

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 214 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 255

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 306 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 365

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382


>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
          Length = 380

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 90  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 210 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 251

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 252 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 301

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 302 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 361

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 362 AHGKDHPYISEIKQEIE 378


>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
          Length = 433

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVVFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ I
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEI 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
          Length = 384

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 94  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 153

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 214 ECQECTTRD-----KDKAKVE-IRKLTDPPKAEAIRD------------MVRYARNVIEE 255

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ +
Sbjct: 306 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEV 365

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382


>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
           mutus]
          Length = 383

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 81/318 (25%), Positives = 150/318 (47%), Gaps = 30/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 92  NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 151

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 152 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 211

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKK 182
            C  C                    KD   +   +R  +D         +VR ++  I++
Sbjct: 212 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEE 253

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
                +  S  +  L  C   QE +S           ++   +V ++    + + + + +
Sbjct: 254 FRRAKHYKSPPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYM 303

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+ 
Sbjct: 304 QDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIME 363

Query: 303 ITHGTNSPFMKELILKLE 320
           + HG + P++ E+  ++E
Sbjct: 364 VAHGKDHPYISEIKQEIE 381


>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
          Length = 385

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 156/321 (48%), Gaps = 28/321 (8%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+   IA L +     L++P+ +   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 88  NEKRELIQNDIAALHHFYSKHLEYPDNAA--LVVLFAQVNCNGFTIEDEELSHLGSAIFP 145

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 146 DVALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 205

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK-EEI 180
            C C  C+   +  D ++  I    R  +D  S   +RD              R+  EE 
Sbjct: 206 NCDCRECVTKEK--DKEKLEI----RKLNDPPSAETVRD---------MIKYARNVIEEF 250

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           ++      IL   TLA        E++   ++       ++   +V ++    + + + +
Sbjct: 251 RRAKHYKYILCLTTLAC-------ELLEICELSLDKMGAVFEDSNVYMLHMMYQAMGVCL 303

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
            ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +K++  A+ I
Sbjct: 304 YVQDWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRAAGVKALKRAIAI 363

Query: 301 LRITHGTNSPFMKELILKLEE 321
           + + HG + P++ E+  +LE+
Sbjct: 364 MEVAHGKDHPYISEIKKELED 384


>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
           mulatta]
          Length = 433

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
           boliviensis]
          Length = 433

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
          Length = 433

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
          Length = 433

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
          Length = 433

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSRHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
 gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
 gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
          Length = 433

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 355 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 433

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + I+ L     +  E   NE +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKKDLIQSDISALHRFYSKHLEFPDNESLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQAISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R  +D         +VR    + + 
Sbjct: 263 ECQECTT------------------KDKDKAKVKIRKLNDPPKADAVRDMVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + +  +
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYTQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
          Length = 380

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 90  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVLFAQVNCNGFTIEDEELSHLGSAIFPDV 149

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 150 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 209

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 210 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 251

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 252 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 301

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 302 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 361

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 362 AHGKDHPYISEIKQEIE 378


>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
           anubis]
          Length = 258

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 134/274 (48%), Gaps = 39/274 (14%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
            G E+ I Y++   ++  R+K L++QY F C C RC    Q  D                
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 100

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                 +D+D          ++   E++ K   EV    KK   L +    ++V++  + 
Sbjct: 101 ------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQA 141

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
           I     +     +V  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+
Sbjct: 142 IISSNSERLPDINVYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGV 201

Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 202 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
           familiaris]
          Length = 523

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 146/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   +E +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 233 NEKKDLIQSDIAALHHFYSKHLEFPDSESLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 292

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 293 ALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 352

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                              +D D      ++       E I+ +
Sbjct: 353 QCQECTT----------------------------KDKDKAKVEIRKLSDPPKAETIRDM 384

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                 + ++        +  E++   ++ ++    ++   +V ++    + + + + ++
Sbjct: 385 VRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQ 444

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L D     +++ +A+ I+ +
Sbjct: 445 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGERALKKAIAIMEV 504

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 505 AHGKDHPYISEIKQEIE 521


>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
          Length = 384

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 144/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 94  NEKKDLIQSDIAALHHFYSKHLEFLDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDV 153

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 154 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 213

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R   D         +VR    + + 
Sbjct: 214 ECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEE 255

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 256 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 305

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 306 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEV 365

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 366 AHGKDHPYISEIKQEIE 382


>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
           rotundata]
          Length = 438

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 83/319 (26%), Positives = 153/319 (47%), Gaps = 23/319 (7%)

Query: 2   SDI--DEKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 58
           SDI  DEK++  +  +   L   +   P  +  E+   + ++  N+  I + ++  +G G
Sbjct: 122 SDIKKDEKKMEHFMFLCGVLFGFLGDTPMPNSAELMGIYGRICINSFNIFDLDMNSIGVG 181

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKALK 116
           +Y   S+++HSC+PNAV  FEG    +R ++ +P    +++ ISYI+   +T  R+  L+
Sbjct: 182 IYLAPSVVDHSCVPNAVATFEGITLNIRTIEDLPSLDWSQIRISYIDVLKTTKERRSELQ 241

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAIL-EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
             Y F C C +C         +E  ++ E   C +  C+     D +    +C +C  ++
Sbjct: 242 SSYYFWCNCKKC---------EEPELMAEAAACSNKNCTNPCSPDME----SCPECN-IK 287

Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
             E  K+I  EV+  +   L       +   +   KM  K Q+ + H  ++  +QT +  
Sbjct: 288 LLENFKEIFDEVSSFTAHHLQNMK---NMAYLDVSKMCLKKQEGVLHSLNIQHVQTLQTA 344

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
               + L+ W+EA  Y +  I  Y   Y + HPL G+     GK++ +L   + A++++ 
Sbjct: 345 FDSSISLQHWEEAEFYAKKLIKGYLVYYGEVHPLTGILLLMTGKIQLYLEKPKQALEALR 404

Query: 296 EAVEILRITHGTNSPFMKE 314
           +A  IL ITHG     ++E
Sbjct: 405 KANSILAITHGEQHTLVRE 423


>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
           Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
           Methionine
          Length = 429

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 43/277 (15%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ CN+ TICN+E + +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEXQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 231 EVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRC----------------------- 267

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 -----QTQDKDADXLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAXC---QAIISSNS 318

Query: 212 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             E+L         +N+ Q +  +      + L   +EAL Y   T   Y+  +P  HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAXDACINLGLLEEALFYGTRTXEPYRIFFPGSHPV 369

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            G+Q    GKL+   G    A K++  A +I R+THG
Sbjct: 370 RGVQVXKVGKLQLHQGXFPQAXKNLRLAFDIXRVTHG 406


>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
           abelii]
 gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
           gorilla gorilla]
 gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
 gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
          Length = 258

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/276 (30%), Positives = 133/276 (48%), Gaps = 43/276 (15%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 96

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 97  ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 147

Query: 213 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 270
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 148 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 199

Query: 271 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
 gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
          Length = 461

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 146/310 (47%), Gaps = 22/310 (7%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L ++++  P    N  E+   + +L  N   I + E+  + T +Y  +SI +HSC PN
Sbjct: 153 AVLTDMMVDSPSTVPNKTELMSIYGRLITNGFNILDGEMNSIATAIYLGVSITDHSCQPN 212

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    + A++ +P    +++ ISYI+   +   R+  LKE Y F C C +C   
Sbjct: 213 AVATFEGNELHIYAIEDMPCLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT-- 270

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC--GLVRSKEEIKKIASEVNI 189
               D QE   +E   C    C   +    + K   C+ C  G+     +++   +EV  
Sbjct: 271 ----DRQEQREMEAAVCPFRKCGSSI----NVKWTHCRNCQAGIT---PKLRNTYNEVMA 319

Query: 190 LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           ++K  L       + +V    K+    Q  + HP +V  ++T +   +  +E+  W EAL
Sbjct: 320 MTKHHLESMKDVAYLDVC---KVCLDKQTGVLHPLNVWHVKTLDAAFEAAIEVGKWPEAL 376

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            Y Q  +  +++    +HPLLGL +   GK++ + G  + A+    EA  IL +THG   
Sbjct: 377 NYGQQLLQGFRKYNGPWHPLLGLLHMKLGKIQLYEGQCKEAMHHFEEARRILCVTHGREH 436

Query: 310 PFMKELILKL 319
             + E +L L
Sbjct: 437 RLIGEQLLPL 446


>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
          Length = 326

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 27/319 (8%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 30  NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 87

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 88  DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 147

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                    KD   +   +R    K  +  Q   VR      
Sbjct: 148 TCECRECT------------------TKDKDKAKVEIR----KLSSPPQAEAVRDMVRYA 185

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +   E    +K      S     E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 186 RNVIEEFRRAKHYKYNLSVSPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 245

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 246 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 305

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 306 EVAHGKDHPYISEIKQEIE 324


>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
 gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
 gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
          Length = 433

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 149/317 (47%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 143 NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 202

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 203 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 262

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R  +D      +RD            +VR    + + 
Sbjct: 263 ECQECTTKD-----KDKAKVE-IRKLNDPPKAETIRD------------MVRYARNVIEE 304

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S           ++   +V ++    + + + + ++
Sbjct: 305 FRRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYMQ 354

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+ +
Sbjct: 355 DWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEV 414

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 415 AHGKDHPYISEIKQEIE 431


>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
          Length = 536

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 148/317 (46%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 246 NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 305

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 306 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKTGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 365

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C         ++ A +E  R   D      +RD            +VR    + + 
Sbjct: 366 ECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVIEE 407

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S+          ++   +V ++    + + + + ++
Sbjct: 408 FRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQ 457

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ +
Sbjct: 458 DWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEYKAAGEKALKKAIAIMEV 517

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 518 AHGKDHPYISEIKQEIE 534


>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
 gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Cofactor Product Adohcy
 gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Methyltransferase Inhibitor
           Sinefungin
 gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 433

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              +D D      ++       E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIR 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
            +      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431


>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
           anatinus]
          Length = 412

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 151/318 (47%), Gaps = 29/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 122 NEKRELIQSDIAALHHFYSKHIEYPDNASLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 181

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G E+  SYI+    T  R   L++ Y FTC
Sbjct: 182 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEIFTSYIDLLYPTEDRNDRLRDSYFFTC 241

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  CI        ++ A +E  R  +D      +RD            +++    + + 
Sbjct: 242 ECRECITKE-----KDKAKVE-IRKLNDPPKAEAIRD------------MIKYARNVIEE 283

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE + +          ++   +V ++    + + + + ++
Sbjct: 284 FRRAKHYKSPSELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLYMQ 333

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +K++ +A+ I+ +
Sbjct: 334 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKSAGVKALKKALTIMEV 393

Query: 304 THGTNSPFMKELILKLEE 321
            HG + P++ E+  ++E+
Sbjct: 394 AHGKDHPYISEIKKEIED 411


>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
          Length = 395

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 145/317 (45%), Gaps = 29/317 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 105 NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 164

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 165 ALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 224

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                    KD   +   +R  +D         +VR    + + 
Sbjct: 225 ECQECTT------------------KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEE 266

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                     +  L  C   QE +S           ++   +V ++    + + + + ++
Sbjct: 267 FRRAKHYKSPSELLEICELSQEKMSC----------VFEDTNVYMLHMMYQAMGVCLYMQ 316

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+ +
Sbjct: 317 DWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEV 376

Query: 304 THGTNSPFMKELILKLE 320
            HG + P++ E+  ++E
Sbjct: 377 AHGKDHPYISEIKQEIE 393


>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
 gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
          Length = 433

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              +D D      ++       E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEIRKLSNPPQAEAIR 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
            +      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            I HG + P++ E+  ++E
Sbjct: 413 EIAHGKDHPYISEIKQEIE 431


>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
          Length = 386

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 144/314 (45%), Gaps = 33/314 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + +A L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 96  NEKKDLIQSDVAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 153

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 154 DVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 213

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                    KD   +   +R   D         +VR    + 
Sbjct: 214 TCQCQECT------------------TKDKDKAKVEIRKLSDPPKAETIRDMVRYARNVI 255

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S+          ++   +V ++    + + + + 
Sbjct: 256 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 305

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+
Sbjct: 306 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLHMGLENKAAGERALKKAIAIM 365

Query: 302 RITHGTNSPFMKEL 315
            + HG + P++ E+
Sbjct: 366 EVAHGKDHPYISEI 379


>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
           griseus]
          Length = 404

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/319 (23%), Positives = 147/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 114 NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 171

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 172 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 231

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              +D D      ++       E ++
Sbjct: 232 TCECRECT----------------------------TKDKDKAKVEIRKLSSPPQAEAVR 263

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
            +      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 264 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 323

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 324 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 383

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 384 EVAHGKDHPYISEIKQEIE 402


>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
          Length = 433

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 147/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAMLHQFYSKHLEFPDH--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ V  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              RD D      ++       E I+
Sbjct: 261 TCECQECTT----------------------------RDKDKAKVEIRKLSNPPKAEAIR 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
            +      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSAVFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           L+DW+ AL Y Q  I  Y R YP +   +   +   G+L   L +     +++ +A+ I+
Sbjct: 353 LQDWEGALRYGQKIIKPYSRHYPLYSLNVASMWLKLGRLYLGLENRVAGERALRKALAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKREIE 431


>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
 gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
          Length = 448

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 150/295 (50%), Gaps = 20/295 (6%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           ++NE+   + +L  N   I ++E+  + TG+Y  +SI +HSC PNAV  FEG    V A+
Sbjct: 161 NMNELMSIYGRLITNGFNILDAEMNSIATGIYLGVSITDHSCQPNAVATFEGNELHVHAI 220

Query: 89  QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           + +P    +++ ISYI+   +   R+  LKE Y F C C +CI      D QE   +   
Sbjct: 221 EDLPCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAA 274

Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
            C +  C   +    D     C+ CG   S E+++   SEV  L++ +L      + +EV
Sbjct: 275 VCPNGSCDASVNIQLD----KCESCGTPVS-EQLRTDYSEVMALTQSSL-----DSMKEV 324

Query: 207 --VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
             +   K+     + + HP +V  ++T +   +  +++  W EAL Y +  +P + + + 
Sbjct: 325 AYLDVCKVCLAKHRGILHPLNVWHVKTLDAAFEASIQVGKWTEALDYGRQLLPGFAKYHG 384

Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
           +++PLLGL     GK+E +  + + A++ + EA  IL +THG +   + E +  L
Sbjct: 385 EWNPLLGLLQLKLGKIELYERNYKQALQHLQEAQRILNVTHGRDHRLLLEQLRPL 439


>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
          Length = 402

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 145/318 (45%), Gaps = 29/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + I+ L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 112 NEKRELIQSDISALHHFYSKHIEYPDNASLVTLFAQVNCNGFTIEDEELSHLGSAIFPDV 171

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   LK+ Y FTC
Sbjct: 172 ALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTC 231

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                              +  D      ++       E I+ +
Sbjct: 232 ECRECTT----------------------------KAKDKAKVEIRKLSEPPKAESIRDM 263

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                 + ++           E++   ++ ++    L+   +V ++    + + + + ++
Sbjct: 264 VKYARNVIEEFRRAKHYKTPSELLEICELSQEKMGSLFEDSNVYMLHMMYQAMGVCLYMQ 323

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L    + +K++ +A+ I+ +
Sbjct: 324 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKASGVKALKKAIAIMEV 383

Query: 304 THGTNSPFMKELILKLEE 321
            HG + P++ E+  ++E+
Sbjct: 384 AHGKDHPYISEIKKEIED 401


>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
          Length = 251

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 82/267 (30%), Positives = 129/267 (48%), Gaps = 43/267 (16%)

Query: 42  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
           CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y
Sbjct: 3   CNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICY 62

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
           ++   ++  R+K L++QY F C C RC                              +D 
Sbjct: 63  LDMLMTSEERRKQLRDQYCFECDCFRCQT----------------------------QDK 94

Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
           D    T  +      +E +KKI  E+    K    L  C   Q V+S+    E+L     
Sbjct: 95  DADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAVISSNA--ERLP---- 144

Query: 222 HPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
               +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GK
Sbjct: 145 ---DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGK 201

Query: 280 LEWFLGDTENAIKSMTEAVEILRITHG 306
           L+   G    A+K++  A +I+R+THG
Sbjct: 202 LQLHQGMFPQAMKNLRLAFDIMRVTHG 228


>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
 gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=HSKM-B; AltName: Full=Histone methyltransferase
           SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
           AltName: Full=SET and MYND domain-containing protein 2
 gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
 gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
 gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
          Length = 433

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 149/319 (46%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C         ++ A +E  R   D      +RD            +VR    + 
Sbjct: 261 TCECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARNVI 302

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S+          ++   +V ++    + + + + 
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+
Sbjct: 353 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 145/319 (45%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                    KD   +   +R   D         +VR    + 
Sbjct: 261 TCECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVI 302

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S+          ++   +V ++    + + + + 
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+
Sbjct: 353 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431


>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
           cuniculus]
          Length = 394

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 150/319 (47%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + I+ L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 104 NEKKDLIQSDISALHHFYSKHLEFPDH--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 161

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R + L++ Y F
Sbjct: 162 DVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNERLRDSYFF 221

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C         ++ A +E  R  D       +RD            +VR    + 
Sbjct: 222 TCECLECTTKD-----KDKAKVEIRRLSDPP-KAEAIRD------------MVRYARNVI 263

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +              L  C   QE + +          ++   +V ++    + + + + 
Sbjct: 264 EEFRRAKHYKSPPELLEICELSQEKMGS----------VFEDSNVYMLHMMYQAMGVCLY 313

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 314 MQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLENKAAGEKALRKAIAIM 373

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  +LE
Sbjct: 374 EVAHGKDHPYISEIKQELE 392


>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
 gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
          Length = 433

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 151/319 (47%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKRDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSXIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C         ++ A +E  R  +D      +RD            +VR    + 
Sbjct: 261 TCECQECTTKD-----KDKAKVE-IRKLNDPPKAEAIRD------------MVRYARNVI 302

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S           ++   +V ++    + + + + 
Sbjct: 303 EEFRRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+
Sbjct: 353 MQDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALRKAIAIM 412

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 413 EVAHGKDHPYISEIKQEIE 431


>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
          Length = 417

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 145/319 (45%), Gaps = 33/319 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 127 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 184

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 185 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 244

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                    KD   +   +R   D         +VR    + 
Sbjct: 245 TCECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVI 286

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S+          ++   +V ++    + + + + 
Sbjct: 287 EEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 336

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+
Sbjct: 337 MQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIM 396

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 397 EVAHGKDHPYISEIKQEME 415


>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
          Length = 436

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 147/318 (46%), Gaps = 29/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+   IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 146 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 205

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 206 ALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 265

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C    +  D ++  I    R  +D  S   +RD            ++      K  
Sbjct: 266 DCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRDMIKYARN-----VIEEFRRAKHY 314

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
            S   +L    L+L   G                  ++   +V ++    + + + + ++
Sbjct: 315 KSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLYVQ 357

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ +
Sbjct: 358 DWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 417

Query: 304 THGTNSPFMKELILKLEE 321
            HG + P++ E+  +LE+
Sbjct: 418 AHGKDHPYISEIKKELED 435


>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
 gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
          Length = 457

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 142/286 (49%), Gaps = 16/286 (5%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           +I E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V  +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229

Query: 89  QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           + +P    +++ ISYI+   +   R+  L+E Y F C C +CI      D QE   +   
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283

Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
            C +  C   +    + +   CQQC       +++   +++   ++ +L       + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338

Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
            +    +EK Q+ + HP +V  ++T +   +  + +  W +ALA+ Q  +P + + +  +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395

Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
           +PL+GL Y   GK++ +  + + A+  + EA  IL +THG +   +
Sbjct: 396 NPLVGLLYMKLGKIQLYERNYKLAVHHLQEAQCILSVTHGHDHRLL 441


>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
           carolinensis]
          Length = 437

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 134/285 (47%), Gaps = 30/285 (10%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E
Sbjct: 180 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIEPGDE 239

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCKDDGCSG 155
           +  SYI+    T  R   LK+ Y FTC C  CI K+   D ++        R  ++    
Sbjct: 240 IFTSYIDLLYPTEDRNDRLKDSYFFTCDCRECITKVKDKDKLE-------IRKLNEPPPA 292

Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
             +RD            ++      K   S   +L    L+L   G+             
Sbjct: 293 EAVRDMIKYARN-----VIEEFRRAKHYKSPSELLEICELSLEKMGS------------- 334

Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 275
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 335 ----VFAESNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWL 390

Query: 276 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
             G+L   L +    +K++ +A+ I+ I HG + P++ E+  +LE
Sbjct: 391 KLGRLYMGLENRTAGVKALKKAIAIMEIAHGKDHPYVIEIKKELE 435


>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
 gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
          Length = 457

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 73/286 (25%), Positives = 142/286 (49%), Gaps = 16/286 (5%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           +I E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V  +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229

Query: 89  QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           + +P    +++ ISYI+   +   R+  L+E Y F C C +CI      D QE   +   
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283

Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 206
            C +  C   +    + +   CQQC       +++   +++   ++ +L       + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338

Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
            +    +EK Q+ + HP +V  ++T +   +  + +  W +ALA+ Q  +P + + +  +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395

Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
           +PL+GL +   GK++ +  + + A+  + EA  IL +THG +   +
Sbjct: 396 NPLVGLLHMKLGKIQLYERNYKLAVHHLQEAQRILSVTHGHDHRLL 441


>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 140/273 (51%), Gaps = 33/273 (12%)

Query: 37  FSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
           F  ++CN  +I +   L+ LG  ++P +++INH C PN V V  G    VRA++ + +G 
Sbjct: 279 FGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNVVAVSCGPNIFVRAIKPIKEGD 338

Query: 96  EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
           E+ ISYI+T+ ++ TR+  LK+QY F CTC  C + G+ D+++ + ++      +   S 
Sbjct: 339 ELFISYIDTSATSETRKNILKDQYYFDCTCKMC-ESGEKDELKSAYVMPKEDVSEKR-ST 396

Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN--HQEVVSTYKMI 213
           ++ +++D                 +KKI S   + + + +A  + G    QE +     +
Sbjct: 397 YIEKNTDIM---------------MKKIESSKKVQAWERVAAQAGGCLLQQENLFDDTHL 441

Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
           +KL           ++QT  ++  IL   +D   A  Y +  +  Y+++YP++   +G+Q
Sbjct: 442 KKLM----------ILQTCSEVSAILNHYDD---AAQYAERVLAAYEQLYPEYSTQIGMQ 488

Query: 274 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            Y  G   W L   E+AIK +  A++ L ITHG
Sbjct: 489 AYRLGVHYWHLQRVEDAIKMLGRALKHLEITHG 521


>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
           glaber]
          Length = 252

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 83/270 (30%), Positives = 138/270 (51%), Gaps = 43/270 (15%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ 
Sbjct: 1   QVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFSGPHLLLRAVRDIEVGEELT 60

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G             
Sbjct: 61  ICYLDMLMTSEERRKQLRDQYCFDCDCFRC----QTQD-KDADMLTG------------- 102

Query: 159 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
              D++ +   Q       E +KKI  E+    K    L  C   Q ++S+    E+L  
Sbjct: 103 ---DEQVWKGVQ-------ESLKKI-EELKAHWKWEQVLAMC---QSIISSNS--ERLP- 145

Query: 219 KLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
                  +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q   
Sbjct: 146 ------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMK 199

Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHG 306
            GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 VGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 229


>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
          Length = 436

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 145/318 (45%), Gaps = 29/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+   IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 146 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 205

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 206 ALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 265

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C                             +++ D +    ++     S E ++ +
Sbjct: 266 DCRECT----------------------------MKEKDKEKLKIRKLNDPPSAEAVRDM 297

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
                 + ++           E++   ++       ++   +V ++    + + + + ++
Sbjct: 298 IKYARNVIEEFRRAKHYKPPSELLEICELSLDKMGAVFEDSNVYMLHMMYQAMGVCLYVQ 357

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ +
Sbjct: 358 DWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 417

Query: 304 THGTNSPFMKELILKLEE 321
            HG + P++ E+  +LE+
Sbjct: 418 AHGKDHPYISEIKKELED 435


>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
           gallopavo]
          Length = 575

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 147/318 (46%), Gaps = 29/318 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+   IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 285 NEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 344

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 345 ALMNHSCCPNVIVTYKGILAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 404

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C    +  D ++  I    R  +D  S   +RD            ++      K  
Sbjct: 405 DCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRD-----MIKYARNVIEEFRRAKHY 453

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
            S   +L    L+L   G                  ++   +V ++    + + + + ++
Sbjct: 454 KSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLYVQ 496

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ +
Sbjct: 497 DWEGALRYGQKIIRPYSKYYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEV 556

Query: 304 THGTNSPFMKELILKLEE 321
            HG + P++ E+  +LE+
Sbjct: 557 AHGKDHPYISEIKKELED 574


>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 450

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/319 (24%), Positives = 151/319 (47%), Gaps = 16/319 (5%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C         ++ A +E  R          +RD            ++      K
Sbjct: 261 TCECRECTTKD-----KDKAKVE-VRKLSSPPQAEAIRD-----MVRYARNVIEEFRRAK 309

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
                +++   +  A        E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 310 HYKYNLSVWPARGTAPCLTDGVCELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 369

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 370 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 429

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 430 EVAHGKDHPYISEIKQEIE 448


>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
          Length = 436

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/321 (24%), Positives = 150/321 (46%), Gaps = 35/321 (10%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+   IA L +     L +P+ +   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 146 NEKRELIQNDIAALHHFYSKHLDYPDNAA--LVVLFAQVNCNGFTIEDEELSHLGSAIFP 203

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAV+ +  G E+  SYI+    T  R   L++ Y F
Sbjct: 204 DVALMNHSCCPNVIVTYKGTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFF 263

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           +C C  C    +  D ++  I     CK +D  S   ++D            ++      
Sbjct: 264 SCDCRECTT--KEKDKEKLEI-----CKLNDPPSAETVKDMIKYARN-----VIEEFRRA 311

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
           K   S   +L    L+L   G                  ++   +V ++    + + + +
Sbjct: 312 KHYKSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCL 354

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
            ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +K++  A+ I
Sbjct: 355 YVQDWEGALCYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALKNRTAGVKALKRAIAI 414

Query: 301 LRITHGTNSPFMKELILKLEE 321
           + + HG + P++ E+  +LE+
Sbjct: 415 MEVAHGKDHPYISEIKKELED 435


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 151/321 (47%), Gaps = 18/321 (5%)

Query: 18  LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 77
           L +L+ +    S  E+ + F K+  N+  I + E+  +GTG+Y  ISII+HSC PNA+  
Sbjct: 139 LRSLVEEAAMPSKAELLQIFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNALAT 198

Query: 78  FEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           F+G    +R ++    G+EV      ISYI+       R+  L+ QY F C C RC    
Sbjct: 199 FDGTTIHLRLLEDY-HGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERC---- 253

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
              D QE  ++    C +  C    L   D K   C +C     K   +++  EV   + 
Sbjct: 254 --RDEQEQQLMIAGACPNVDCDE-PLSMLDRKVEKCPRCDTA-IKHSAREMFREVTDFTL 309

Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
             LA      + ++    ++  K Q+ + H ++V  ++T +   +  + ++ W++A+ Y 
Sbjct: 310 AQLAKMKDITYMDIC---QLCLKKQENILHYYNVWYLKTLDLAFESAINMQKWEDAIGYG 366

Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
                 +++    FHPL GL     GK++ +      A++++ ++ +ILR+THG      
Sbjct: 367 LRLKDGFKKYNGPFHPLYGLLLLKIGKIQLYTKHVVEALQNINDSEKILRVTHGEEHDLY 426

Query: 313 KELILKLEEAQAEASYKLSSK 333
           K  ++ L   QA   Y   +K
Sbjct: 427 KSQLIPL-LCQAAGEYDALNK 446


>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
 gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
          Length = 459

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 154/317 (48%), Gaps = 19/317 (5%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++ + P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PN
Sbjct: 157 AVLTDMMAESPSTVPNKTELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 216

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    + A++ +     +++ ISYI+   +   R+  LKE Y F C C +CI  
Sbjct: 217 AVATFEGNELHIHAIEDMECLDWSKIFISYIDLLNTPEERRLDLKEHYYFLCCCSKCI-- 274

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D QE+  +    C +  C   +    + +   C++C    S + ++   + V   +
Sbjct: 275 ----DPQETKEMMAALCPNRNCGIGI----NPERTNCKRCDTGISPK-LRNAYNSVMAFT 325

Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
           +  L       + +V    K+    Q  + HP +V  ++T +   +  +++  W +AL Y
Sbjct: 326 RTNLEAMKDVAYLDVC---KVCLNKQTGVLHPLNVWHVKTLDAAFEAAIDVGKWTDALDY 382

Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
            Q  +P Y++ +  ++PLLGL +   GK++      + A+  + +A+ IL +THG +   
Sbjct: 383 GQRLLPGYRKYHGAWNPLLGLLHMKLGKIQLHEQRAKEALPHLEDALRILTVTHGRDHRL 442

Query: 312 MKELILKLE-EAQAEAS 327
           + E +  L  +A+ EA+
Sbjct: 443 LAEQLYPLLIQARHEAN 459


>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 151/328 (46%), Gaps = 41/328 (12%)

Query: 1   MSDIDEKQLL--LYAQI-----ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 53
           +SD+ + Q L  L+AQ+     A    LI   P  S+N          CN  TI + EL 
Sbjct: 180 ISDLPDDQALTQLFAQVRSRTGAGGEKLIRGSPPASVN----------CNGFTIEDEELS 229

Query: 54  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
            LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R++
Sbjct: 230 HLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGDEIFNSYIDLLYPTEDRKE 289

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
            L + Y FTC C  C    + ++  +S   +             L   + +        L
Sbjct: 290 RLLDSYFFTCQCAECTTRSKDEEKMKSTKPKSS-----------LEPEEVQSMVVYARNL 338

Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
           +   EE ++     NILS     L  C      +S  KM       ++   +V ++    
Sbjct: 339 I---EEFRRAKHYKNILSFPVSLLEMCE-----LSLDKM-----GSVFADTNVYMLHMMY 385

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
           + + + + ++DW  A++Y +  I  Y   YP +   +   Y   G+L   L  T   +K+
Sbjct: 386 QAMGVCLYMQDWDGAMSYGEKIIQPYSVHYPAYSLNVASMYLKLGRLYLGLEKTPQGVKA 445

Query: 294 MTEAVEILRITHGTNSPFMKELILKLEE 321
           + +A+ I+ + HG +  ++ E+  ++EE
Sbjct: 446 LKKALSIMEVAHGKDHYYVAEVKREIEE 473


>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
 gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
          Length = 420

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 154/317 (48%), Gaps = 38/317 (11%)

Query: 16  ANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           A L +++   P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PN
Sbjct: 137 AVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPN 196

Query: 74  AVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           AV  FEG    + A++ +P    +++ ISYI+   +   R++ LK+ Y F C C +C   
Sbjct: 197 AVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC--- 253

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
               D +E+  +    C +  CS  +  + ++    C++C +  S + ++   +E+  L+
Sbjct: 254 ---RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALT 305

Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 251
           K  L                  E ++   Y    +++ +T +   +  +E+  W+EAL Y
Sbjct: 306 KHNL------------------EAMKDVAY----LDVCKTLDAACEAAIEVGKWEEALNY 343

Query: 252 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
            Q  +P +Q  + Q++PL+GL +   GK++ F    + A   + EA  IL +THG +   
Sbjct: 344 GQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRL 403

Query: 312 MKELILKL-EEAQAEAS 327
           + E +  L  +A+ EA+
Sbjct: 404 LVEQLYPLIFQARQEAN 420


>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3-like [Apis florea]
          Length = 414

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 152/313 (48%), Gaps = 53/313 (16%)

Query: 2   SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 56
           SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+  I + ++  +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 114
            G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+   +T  R++ 
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
           L+  Y F C C +C         +ES  ++E   C +                       
Sbjct: 240 LQNSYYFWCNCKKC---------EESEPMVEAAACPN----------------------- 267

Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
            R+  EI  + +  ++ + K +A          VST  M  K Q+ + HP ++  +QT +
Sbjct: 268 -RTFYEISDLTA-YHLQNMKNIAYLD-------VST--MCLKKQEGVLHPLNIQHVQTLQ 316

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
                 + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ +L + + AI++
Sbjct: 317 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKNXKQAIET 376

Query: 294 MTEAVEILRITHG 306
           + +A  IL ITHG
Sbjct: 377 LRKASAILTITHG 389


>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
          Length = 432

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/334 (28%), Positives = 147/334 (44%), Gaps = 61/334 (18%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+      IA L     +  E+ S  ++   FS++ACN  TI + EL  +GT +YP +
Sbjct: 142 NEKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIEDDELSHMGTAVYPDV 201

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           ++INHSCLP+ ++ F G LA VRAVQ +  G EVLISYI+    T  R   L+E Y FTC
Sbjct: 202 ALINHSCLPSVIVTFNGTLAQVRAVQDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTC 261

Query: 124 TCPRC-------IKL---GQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCG 172
            C  C       +KL    Q D I+   I    R   +    F  ++ +       + C 
Sbjct: 262 NCLECQNKQKDKVKLKVRKQSDPIEPQVISNMTRYAKNTIREFRAMKSTKTPSELLEMCE 321

Query: 173 LVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---KLYHPFSVN 227
             +S EE+  +   S V +L     A+  C   Q+     +  EK+ +   K YHP+S+N
Sbjct: 322 --QSLEEMGAVFEDSNVYVLHMMYQAMGVCIYMQDADGALRYGEKVARYYSKRYHPYSLN 379

Query: 228 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 287
           +     KL +I M                                           LG  
Sbjct: 380 VSSLYLKLGRIYMA------------------------------------------LGRH 397

Query: 288 ENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
              I ++T+A+ I+ + HG +  ++K++  +L +
Sbjct: 398 SAGINALTKAMAIMEVAHGKDHQYLKDIRKELSQ 431


>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 442

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 81/307 (26%), Positives = 144/307 (46%), Gaps = 28/307 (9%)

Query: 31  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
           +E+   F K+  N  TI N +L  +G GLY  +S+++HSC P+A ++F G  AV+R ++ 
Sbjct: 142 SELLTIFGKVMVNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDAFVLFNGTKAVLRPLKQ 201

Query: 91  --VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
                 + + I+Y +    T  R+  LK+Q+ FTC C  C+      D++        RC
Sbjct: 202 YITAYDSSLRIAYCDLLDLTSMRRNQLKQQFFFTCECSACL------DLEREKTARSVRC 255

Query: 149 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEI---KKIASEVNILSKKTLALTSCGN 202
           +     G+   D ++    C QCG    V   E +   +++  EV  L ++         
Sbjct: 256 R-HCVDGYCPLDVNENSLVCWQCGATSEVHVDEAVHLMQQVEWEVKRLQER--------- 305

Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
             + + T K +      +    ++ L Q  +++    ++  ++  A  Y + TI  ++R 
Sbjct: 306 RNDSICTAKDLYNEASIILSSLNIPLCQLADRIASWGIDNGNFSLAADYIEKTISCFKRF 365

Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDT-ENAIKSMTEAVEILRITHGTN---SPFMKELILK 318
           YP  HP L LQY+  GKL      T E A + + +A+E L  +HG     S    EL+ K
Sbjct: 366 YPTAHPSLSLQYFKAGKLHSLNDLTLELAAERLEQAIESLHASHGVQHSLSAEAHELLCK 425

Query: 319 LEEAQAE 325
           +++ + +
Sbjct: 426 VQQQRRD 432


>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 154/330 (46%), Gaps = 55/330 (16%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
           D+D+++LLL A +A  +NL+ +    S++E+A   S+   N  +IC+  L  LG G +P+
Sbjct: 138 DMDDQELLLLAALAQKINLVDE--SFSMDEMARMLSRFRNNNFSICDELLLELGAGCFPL 195

Query: 63  ISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
            ++INHSC PN  + F  +   +  RA++ +  G E+  +Y++ A     RQ+ L+ +Y 
Sbjct: 196 GAMINHSCDPNCAVTFVPKTLDMEFRAMKPIKSGEEITQTYVDIALPRRERQQRLQRKYH 255

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           FTC CPRC +      +QE   L+ Y             D+D  G   ++    R +E  
Sbjct: 256 FTCGCPRCSQ-----PLQEPGSLDAY------------LDADIDGVPQERWTEERQQE-- 296

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
                                   + +   + +   Q  + H  SV  +QT   +    M
Sbjct: 297 ------------------------QCIDALQNLADRQSTILHCHSVARLQTLATIFSAEM 332

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL-----EWFLGDTENAIKS-- 293
           E    +EA+ Y +  +  Y+RVY   HP+ GL  +T G L     +   G  ++  KS  
Sbjct: 333 ERGSVEEAIGYGERMLEFYRRVYNPNHPMTGLHVFTLGDLYGQQAQAGTGAEDSKQKSAE 392

Query: 294 -MTEAVEILRITHGTNSPFMKELILKLEEA 322
            + EA  IL+ITHG +  F+K L  +LE A
Sbjct: 393 YLAEAQRILQITHGKDHRFVKMLGDRLETA 422


>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
 gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
          Length = 430

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 122/257 (47%), Gaps = 53/257 (20%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR----LAVVRAVQHVPKG 94
           KL  N  +ICN E+  +G+G+Y   S+ NHSC+PN  ++F+      + ++     + +G
Sbjct: 182 KLQRNTFSICNEEMNAIGSGIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEG 241

Query: 95  AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
             + I+Y++    T  RQK LKEQY FTCTCPRC  L   +++ E+              
Sbjct: 242 TPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC--LNSNEEVNENV------------- 286

Query: 155 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMI 213
                                  +++  IA+E     +K   L    N+ Q +VS  +  
Sbjct: 287 -----------------------DKMISIANE----HRKNQNLQEAVNYFQRIVSKKRSS 319

Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
             L  ++Y   ++N       L  + ++L D+K+A  +   ++  +++ YP F+PLLGLQ
Sbjct: 320 PSLFHEIYMGIALN------SLTHLNVDLSDFKKAYEFGIESLEYFEKYYPPFYPLLGLQ 373

Query: 274 YYTCGKLEWFLGDTENA 290
           Y TC KL  +L +   A
Sbjct: 374 YLTCAKLASYLEENGQA 390


>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
          Length = 421

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 145/330 (43%), Gaps = 55/330 (16%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 62
           D+++++L+L A +A  + L+      S++E+    S+   N  +IC+  L   G G +P+
Sbjct: 136 DMEDQELMLLAALAQKLELVD--GSYSMDEMLRMLSRFRNNNFSICDELLLEQGAGCFPL 193

Query: 63  ISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
            ++INHSC PN  + F  +   +  RA++ +  G E+  +Y++ A     R + L+ +Y 
Sbjct: 194 GAMINHSCDPNCAITFVPKTLEMEFRAMRPIKAGEEITQTYVDVALPRRERHERLQRKYH 253

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 180
           F C C RC        +QES  L+ +             D+D  G   +Q    R  E  
Sbjct: 254 FNCACSRCSV-----PLQESGSLDAF------------LDADIDGVPKEQWSQERKDE-- 294

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
                                   + +   + + + Q  + H  S+  +QT   +    M
Sbjct: 295 ------------------------QCIDALQKLAERQSNILHRDSIARLQTLATIFSAEM 330

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL-----GDTENAIKS-- 293
           E    +EA+ Y +  +  YQRVY   HP+ GL  +T G L   L     G   +  KS  
Sbjct: 331 ERGSVEEAVVYGEKMLEFYQRVYNANHPMTGLHLFTLGDLYAQLAQVGTGPENSKAKSSE 390

Query: 294 -MTEAVEILRITHGTNSPFMKELILKLEEA 322
            +TEA  IL+IT G    F+K L  +L+ A
Sbjct: 391 YLTEARRILQITQGKEHRFVKMLADRLQAA 420


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 148/324 (45%), Gaps = 42/324 (12%)

Query: 1   MSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRP 54
           ++D+ E +L  L   I N ++    WP  S    +++I+  F  + CN  ++ +   L+ 
Sbjct: 131 ITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQA 187

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G GL+P + ++NH C PN  ++       +R++  + +G E+ ++Y++    +  RQ+ 
Sbjct: 188 VGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDYMNLSEERQRL 247

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
           LK QY F CTC  C      + I++   L G             R+ D         G+ 
Sbjct: 248 LKTQYFFDCTCEHCK-----NKIKDDIKLGG-------------REVD---------GVK 280

Query: 175 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKLY--HPFSVNLMQT 231
            S+E++K+       + +K       G++ EVV   +  IEK +  L   H + + +  T
Sbjct: 281 PSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVLAETHIYLLRMWST 340

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
             ++   L  L D   A  Y +  +  Y ++YP  +  LG+     G + W  GD E A 
Sbjct: 341 MSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAGVIHWQAGDIEVAH 397

Query: 292 KSMTEAVEILRITHGTNSPFMKEL 315
             + +A  IL +THG   P  K+L
Sbjct: 398 GMVCKAYAILMVTHGPTHPITKDL 421


>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
 gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 435

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 144/313 (46%), Gaps = 37/313 (11%)

Query: 13  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           A + +  +  L +P+ +   + E  +++ CN  TI + EL  LG+ L+P ++++NHSC P
Sbjct: 154 AALHHFYSRHLDFPDNAA--LTELIAQVNCNGFTIEDEELSHLGSALFPDVALMNHSCSP 211

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N ++ ++G +A VRAVQ +    E+  SYI+    T  R + LK+ Y F C C  C    
Sbjct: 212 NVIVTYKGTVAEVRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTS-- 269

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
                         + KD+            K    Q+  +   +EEIK++     I ++
Sbjct: 270 --------------KSKDEA-----------KMEIRQKLSIPPEEEEIKQMV----IYAR 300

Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSVNLMQTREKLIKILMELEDWKEA 248
             +       H +  S    I +L  +    ++   +V ++    + + + + ++DW  A
Sbjct: 301 NVIEEFRRAKHYKTPSELLEICELSMEKMGAIFAETNVYMLHMMYQAMGVCLYMQDWDGA 360

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
           + Y +  I  Y   YP +   +   Y   G+L   L      +K++ +A+ I+ I HG +
Sbjct: 361 MKYGEKIIHPYSVHYPPYSLNVASMYLKLGRLYLGLEKRTQGVKALKKALAIMDIAHGKD 420

Query: 309 SPFMKELILKLEE 321
            P++ E+  ++EE
Sbjct: 421 HPYIDEIKKEMEE 433


>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
           niloticus]
          Length = 440

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 140/312 (44%), Gaps = 29/312 (9%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ +  A IA L     +  ++  + E+   FS++ACN  TI + EL  LGT +YP +
Sbjct: 150 NEKREMTEADIAGLHRFYSKHLDVPDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPDM 209

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           ++INHSCLP+ ++ F+G  A VRAVQ +  G EVLISYI+    T  R   L+E Y F C
Sbjct: 210 ALINHSCLPSVIVTFKGTSAEVRAVQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFIC 269

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 183
            C  C    Q  D  +  + +     +      ++R +            +R   E K  
Sbjct: 270 DCQECK--SQSKDKAKLKVRKQRDSIEPDVINNMVRYARKA---------IREFREFKHT 318

Query: 184 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
            +   +L     +L   G                  ++   +V ++    + + I M ++
Sbjct: 319 KTPSELLEMCEQSLEEMG-----------------AVFDDSNVYMLHMMYQAMGICMYMQ 361

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           D   A+ Y +  +  Y  +YP +   +   Y   G+L   L        ++ +A+ I+ +
Sbjct: 362 DLDGAIRYGEKLLKPYSHLYPPYSLNVSSVYLKLGRLYLGLERQSACTSALKKAMAIMEV 421

Query: 304 THGTNSPFMKEL 315
            HG +  +++EL
Sbjct: 422 AHGKDHFYIEEL 433


>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
          Length = 425

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 138/302 (45%), Gaps = 30/302 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           SI E+   + K++ N   I +  L  +G G+Y   S++ HSC PNAV  FEG   ++   
Sbjct: 148 SIAELISIYGKMSTNRLNIQDEWLNVIGFGIYLGASVVGHSCKPNAVATFEGTTIII--- 204

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
                   + ISY++   S   R++ L   Y F C C RC K        E ++ E   C
Sbjct: 205 --------IKISYVDLIKSKKDRREELYSLYHFWCDCERCEK--------EESLTEAAAC 248

Query: 149 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
            +  C      D+D     C++C     V+ KE  +++    +   +  +++    N  +
Sbjct: 249 PNSSCDSPCSIDADK----CEKCNTRISVKFKETFQEVVDLTDYHFEHIISILF-FNDLD 303

Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 265
             S  K I + QK + H F+V+L+ T  K+      L+ WK+   Y +     Y   Y +
Sbjct: 304 TFSWIKEILEKQKNVMHKFNVHLICTLRKIHCAAWNLKCWKDVEFYGKKLEAGYLFYYGK 363

Query: 266 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
            +P +G  Y T G     LG  + A K++ +A  +L+ITHG     +KE+   LE  + +
Sbjct: 364 VYPPIGTLYLTLGIALLHLGKPKEAFKTLNKAYTVLKITHGNKHFLVKEI---LEPLRFD 420

Query: 326 AS 327
           AS
Sbjct: 421 AS 422


>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 226

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 126/264 (47%), Gaps = 45/264 (17%)

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           +S++NHSC PN V+VFEG   ++R V+ +PKG E+ ISYI+    T  R+  L+ QY F 
Sbjct: 1   MSLLNHSCDPNCVIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60

Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
           C C RC                            LLRD D+         L    E  ++
Sbjct: 61  CDCQRC----------------------------LLRDKDEDM-------LAGDAEASRE 85

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIKILM 240
           + S V+ L +    L S    +E ++   + + L  + Y P   N+ Q +  +  +   +
Sbjct: 86  VESSVSRLEE----LLSQNTAEEALN---LCKTLMNRYYLP-DKNIYQLKILDCAMDASI 137

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           +L  W+EAL +   T+  Y   Y  +HP+  +Q    GKL+ + G    A+K++ +A +I
Sbjct: 138 DLGLWEEALHFGLRTLEPYSLYYSNYHPVRAVQLMKVGKLQNYQGLFAEAMKTLKQAFDI 197

Query: 301 LRITHGTNSPFMKELILKLEEAQA 324
           +++THG +    ++L   + + +A
Sbjct: 198 MKVTHGRDHGQTQQLAELMNDCEA 221


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 135/286 (47%), Gaps = 32/286 (11%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEE 233

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           V  SYI+    T  R   LK+ Y F+C C  C         ++ A LE  R   D  S  
Sbjct: 234 VFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECST-----KQKDPAKLE-LRKLSDPPSPQ 287

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
            +RD            +V      K   +   +L    L+L   G+              
Sbjct: 288 TVRD-----MITYARNVVEEFRRAKHYKTPSELLEICELSLDKMGS-------------- 328

Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
              ++   +V ++    + + + + ++DW+ AL Y +  I  Y + YP +   +   +  
Sbjct: 329 ---VFVDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPAYSLNVASMWLK 385

Query: 277 CGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 320
            G+L  ++G  +N I  K++ +A+ I+ I HG +  ++ E+  +LE
Sbjct: 386 LGRL--YMGLEKNTIGTKALKKALAIMEIAHGPDHYYIAEIKKELE 429


>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
          Length = 497

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 142/307 (46%), Gaps = 43/307 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC  C K  
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-- 268

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
                         + KDD    FL              G+  + +  +++  E+   SK
Sbjct: 269 --------------KLKDD---LFL--------------GVKDNPKPSQEVVKEMIQFSK 297

Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
            TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L+ ++EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEA 357

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
             Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG +
Sbjct: 358 SFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPS 417

Query: 309 SPFMKEL 315
            P  K+L
Sbjct: 418 HPITKDL 424


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 65/292 (22%), Positives = 131/292 (44%), Gaps = 28/292 (9%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           E+ E F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +
Sbjct: 172 ELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEI 231

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+  SYI+    T  R++ L + Y F C C  C                       
Sbjct: 232 NPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT---------------------- 269

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +  D      ++       EEIK +      + ++           E++   +
Sbjct: 270 ------TKSKDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCE 323

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
           +  +    ++   +V ++    + + + + ++DW  A++Y +  +  Y   YP +   + 
Sbjct: 324 LSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVA 383

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
             Y   G+L   L      +K++ +A+ I+ I HG +  ++ E+  ++EE +
Sbjct: 384 SMYLKLGRLYLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 435


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 65/292 (22%), Positives = 131/292 (44%), Gaps = 28/292 (9%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           E+ E F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +
Sbjct: 161 ELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEI 220

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+  SYI+    T  R++ L + Y F C C  C                       
Sbjct: 221 NPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT---------------------- 258

Query: 152 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
                  +  D      ++       EEIK +      + ++           E++   +
Sbjct: 259 ------TKSKDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCE 312

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
           +  +    ++   +V ++    + + + + ++DW  A++Y +  +  Y   YP +   + 
Sbjct: 313 LSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVA 372

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
             Y   G+L   L      +K++ +A+ I+ I HG +  ++ E+  ++EE +
Sbjct: 373 SMYLKLGRLYLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 424


>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
           rubripes]
          Length = 432

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 84/325 (25%), Positives = 147/325 (45%), Gaps = 38/325 (11%)

Query: 3   DID-EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           D+D E++  + A +A L     +  EI    ++   FS++ACN  TI + EL  LGT +Y
Sbjct: 139 DMDNERRETMEADVAGLHQFFSKHLEIPGHKDLLTLFSQVACNGFTIEDEELSHLGTAVY 198

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           P +++INHSC P+ ++ + G  A VRA++ +  G EVLISYI+    T  R   L+E Y 
Sbjct: 199 PDVALINHSCRPSVIVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYY 258

Query: 121 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-E 179
           FTC C  C          ES  ++  + K                         RS+  E
Sbjct: 259 FTCQCEEC----------ESRSMDQAKLK----------------------ARKRSEPIE 286

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLI 236
            + I++ V    K         N     +  +M E+   ++   F   +V ++    + +
Sbjct: 287 PEAISNMVRYARKCIREFRVFKNTSPASTLLEMCEQSLDEMGAVFDDSNVYMLHMMYQAM 346

Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
            + + ++D   A+ Y +     Y+++YP +   L   Y   G+L + +      I  + +
Sbjct: 347 GVCLYMQDPDGAIRYGEKIAKYYRKLYPAYSLNLSSLYLKLGRLYFGMERNAECIGMLKK 406

Query: 297 AVEILRITHGTNSPFMKELILKLEE 321
           A  I+ +THG    ++ EL  +++E
Sbjct: 407 AKAIMEVTHGKEHFYLTELDRQMKE 431


>gi|384246814|gb|EIE20303.1| hypothetical protein COCSUDRAFT_48675 [Coccomyxa subellipsoidea
           C-169]
          Length = 377

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 59/90 (65%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++A   ++ ACN HTIC+ ELRP+G G+YP  +++NHSC PN +  F G   V RAVQH+
Sbjct: 22  DVAYLLARFACNNHTICDDELRPIGVGIYPQGAMLNHSCTPNTMQSFHGSRIVFRAVQHI 81

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLF 121
           P GAE  I+Y+E A +   R  AL+  Y F
Sbjct: 82  PAGAEATIAYVELAATRAERSAALRANYFF 111


>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 454

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 82/316 (25%), Positives = 139/316 (43%), Gaps = 39/316 (12%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           IA  ++++  NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ + 
Sbjct: 121 IAAMYARVMSNALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIR 180

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           K  E+ ISYI+T      RQ+ L+ ++ FTC C +C         QE A L+        
Sbjct: 181 KDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLLPV 231

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
            S F+L     K     Q       EE+ K+++E  I   K + L SC            
Sbjct: 232 DSKFVLDAEAKKAMAQTQEQTFAVYEELHKLSTEHVIYGLKQI-LASC------------ 278

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLL 270
               + + Y  +     + R+ LI  L+ +  +++A A C      +  ++YP  FHP+ 
Sbjct: 279 ---YESRFYPIYRQPYAEARDVLIVNLLSMGKFQDAWAQCAKRYKYILPKLYPVPFHPVR 335

Query: 271 GLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILK 318
            +Q +    L  +L  TE  + +            + + +++  ++HG N+ F K +  K
Sbjct: 336 VVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKGK 395

Query: 319 LEEAQAEASYKLSSKD 334
            EE   E    + + D
Sbjct: 396 AEEMIEELKRSVGNPD 411


>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Cavia porcellus]
          Length = 477

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 138/307 (44%), Gaps = 43/307 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N  ++F      +RA+  +  G E+ +SYI+    +  R++ LK+QY F CTC  C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-G 269

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
             DD+                  FL              G+    +  +++  E+   SK
Sbjct: 270 LKDDL------------------FL--------------GVKEDPKPSQEVVKEMTQFSK 297

Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
            TL       S G + EVV   +   + Q+ ++   ++  ++    + ++L  L+ + EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYTLRMLSVVSEVLSYLQAFGEA 357

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
             Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG +
Sbjct: 358 AHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPS 417

Query: 309 SPFMKEL 315
            P  K+L
Sbjct: 418 HPITKDL 424


>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
 gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
          Length = 430

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 128/284 (45%), Gaps = 28/284 (9%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ F+G +A +RAVQ +  G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAGDE 233

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           V  SYI+    T  R   L + Y FTC C  C         ++ A LE  +  D      
Sbjct: 234 VFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECST-----KQKDPAKLEIRKLSDPPSHQT 288

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
           +      K        +V      K   +   +L    L+L   G+              
Sbjct: 289 V------KDMIKYARNIVEEFRRAKHYKTPSELLEMCELSLDKMGS-------------- 328

Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
              ++   +V ++    + + + + L++W  AL Y +  I  Y + YP +   +   +  
Sbjct: 329 ---VFVDSNVYMLHMMYQAMGVCLYLQEWDGALKYGEKIIKPYSKHYPAYSLNVASMWLK 385

Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
            G+L   L  T    K++ +A+ I++I HG +  ++ E+  +LE
Sbjct: 386 LGRLYMGLEKTTIGTKALKKALAIMQIAHGPDHHYIAEIKKELE 429


>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
           niloticus]
          Length = 435

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/291 (21%), Positives = 131/291 (45%), Gaps = 28/291 (9%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           + E F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAV+ + 
Sbjct: 173 LTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVKEIN 232

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
            G E+  SYI+    T  R++ L + Y FTC C  C                        
Sbjct: 233 PGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECTS---------------------- 270

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                 R  D      ++       EEI+ +      + ++           E++   ++
Sbjct: 271 ------RSKDKAKMEIRKLNPPPEPEEIRSMVRYAKNVIEEFRRAKHYKTPSELLEMCEL 324

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
             +    ++   +V ++    + + + + ++DW  A++Y +  +  Y   YP +   +  
Sbjct: 325 SLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVQPYSVHYPAYSLNVAS 384

Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
            Y   G+L   L      +K++ +A+ ++ + HG +  ++ E+  ++EE +
Sbjct: 385 MYLKLGRLYLGLEKKTQGVKALKKALAVMEVAHGKDHHYVAEVKREIEEQK 435


>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
           cuniculus]
          Length = 477

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 138/305 (45%), Gaps = 34/305 (11%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++   +  WP      S+  I+  F  + CN  T+ +   L+ +G G++P ++++NH C
Sbjct: 149 ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLALVNHDC 208

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC  C K
Sbjct: 209 WPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQK 268

Query: 131 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
            G  DD+                  FL    + K           S+E +K++       
Sbjct: 269 -GLKDDL------------------FLGVKDNPKP----------SQEVVKEMTEFCKDT 299

Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
            +K     S G + EVV   +   + Q+ ++   ++  ++    + ++L  L+ + EA  
Sbjct: 300 LEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFGEAAD 359

Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
           Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG + P
Sbjct: 360 YAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHP 419

Query: 311 FMKEL 315
             K+L
Sbjct: 420 ITKDL 424


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 35/303 (11%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F C+C  C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-G 269

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
             DD+                  FL    D K           S+E +K++        +
Sbjct: 270 LKDDL------------------FLAAKEDPKP----------SQEVVKEMIQFSKDTLE 301

Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
           K     S G + EVV   +   + Q+ ++   ++ +++      ++L  L+ ++EA  Y 
Sbjct: 302 KIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLSYLQAYEEASHYA 361

Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
           +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG + P  
Sbjct: 362 RRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPIT 421

Query: 313 KEL 315
           K+L
Sbjct: 422 KDL 424


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 35/303 (11%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 146 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F C+C  C K G
Sbjct: 206 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-G 264

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
             DD+                  FL    D K           S+E +K++        +
Sbjct: 265 LKDDL------------------FLAAKEDPKP----------SQEVVKEMIQFSKDTLE 296

Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYC 252
           K     S G + EVV   +   + Q+ ++   ++ +++      ++L  L+ ++EA  Y 
Sbjct: 297 KIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLSYLQAYEEASHYA 356

Query: 253 QLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM 312
           +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG + P  
Sbjct: 357 RRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPIT 416

Query: 313 KEL 315
           K+L
Sbjct: 417 KDL 419


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 81/307 (26%), Positives = 141/307 (45%), Gaps = 43/307 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           N  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F C+C  C K G
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQK-G 269

Query: 133 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 192
             DD+                  FL    D K           S+E +K    E+   SK
Sbjct: 270 LKDDL------------------FLAVKEDPKP----------SQEVVK----EMTQFSK 297

Query: 193 KTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEA 248
            TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L+ ++EA
Sbjct: 298 DTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIVSEVLSYLQAFEEA 357

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
             Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG +
Sbjct: 358 SHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPS 417

Query: 309 SPFMKEL 315
            P  K+L
Sbjct: 418 HPITKDL 424


>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
          Length = 241

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 43/260 (16%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F +L CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 1   FPQLICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIQAGEE 60

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           + I Y++   ++  R++ L+ QY F C C RC                            
Sbjct: 61  LTICYLDMLMTSAERREQLRNQYCFDCDCARC---------------------------- 92

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
           L +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L
Sbjct: 93  LTQDKDADMLTGDEKVWKEVQESLKKI-EELKAEWKWEQVLALC---QAIISSNS--ERL 146

Query: 217 QKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
                    +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q 
Sbjct: 147 P-------DINIYQLKVLDCAMDACIRLGLLEEALFYGIRTMEPYRIFFPGCHPVRGVQV 199

Query: 275 YTCGKLEWFLGDTENAIKSM 294
              GKL+   G    A++++
Sbjct: 200 MKVGKLQLHQGMFPQAMRNL 219


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 151/337 (44%), Gaps = 55/337 (16%)

Query: 1   MSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRP 54
           ++D+ E +L  L   I N ++    WP  S    +++I+  F  + CN  ++ +   L+ 
Sbjct: 131 ITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQA 187

Query: 55  LGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISY 101
           +G GL+P + ++NH C PN  V++  G  + V            R++  + +G E+ ++Y
Sbjct: 188 VGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAY 247

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
           ++    +  RQ+ LK QY F CTC  C      + I++   L G             R+ 
Sbjct: 248 VDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLGG-------------REV 289

Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKL 220
           D         G+  S+E++K+       + +K       G++ EVV   +  IEK +  L
Sbjct: 290 D---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVL 340

Query: 221 Y--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
              H + + +  T  ++   L  L D   A  Y +  +  Y ++YP  +  LG+     G
Sbjct: 341 AETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAG 397

Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            + W  GD E A   + +A  IL +THG   P  K+L
Sbjct: 398 VIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 434


>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
 gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
          Length = 478

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 153/342 (44%), Gaps = 47/342 (13%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGT 57
           D  EK LL+     ++   +  WP    +  +  I+  FS ++CN  T+ +   L+ +G 
Sbjct: 135 DEAEKGLLM----EDVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGV 190

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           G++P + + NH C PN  ++F      +RA+  + KG E+ +SY++    T  R+  LK+
Sbjct: 191 GIFPNLCLANHDCWPNCTVIFNNGKIELRALGKINKGEELTVSYVDFLNLTEDRKAQLKK 250

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C K  +       A+L              ++D +DK             
Sbjct: 251 QYYFDCTCEHCTKKTK------DALL------------LAVKDGEDK------------P 280

Query: 178 EEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
           EE  ++  EV   SK T+       S G + +VV   +   K Q+ ++   ++ +++   
Sbjct: 281 EE--RVVKEVIQYSKDTMEKIEKARSEGLYNDVVKLCRDCLKRQEPIFADTNIYMLRILS 338

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              ++L  L+ + +A    +  +  Y ++Y Q +  LG+     G   W  G  E     
Sbjct: 339 IYSEVLSYLQMFDDAAENAKKMVDGYLKIYHQNNAQLGMAVMRAGVTHWHAGMIEVGHGM 398

Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           + +A  IL ITHG   P  K+  L++  AQ E   ++  ++E
Sbjct: 399 ICKAFAILLITHGPLHPITKD--LEVMRAQTEMELRMFKENE 438


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 146/308 (47%), Gaps = 30/308 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           S  E+A  F+ + CNA  + +   ++ +G G+YP IS++NH C PN V +  G    VRA
Sbjct: 182 SDEEMAHLFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNNGPRLEVRA 241

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           ++ +  G E+ ISYI++  +T  R++ LK QY F C C  C K  + ++++ + + E  +
Sbjct: 242 LRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEELENLKHALVSEDIK 301

Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
              +    ++ + S D     ++    +  +  +++ ++    +  TLA   C     V+
Sbjct: 302 ---EESVKYINQFSKD---MLKRIHKTKQNQNWERMCNQ----TLGTLAQQDC-----VI 346

Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
           +   +++     L H   V     R+            + AL Y +     Y+++ P  H
Sbjct: 347 ADTNVLKIAM--LNHAVEVQSFLRRQ------------EPALEYAERVAAAYEKLLPPVH 392

Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 327
           P LG+     G ++W +  TE A+ ++  A  I+  THG +    KEL+  + + + EA 
Sbjct: 393 PTLGMYLMRLGVIQWQIQKTEAAVATLGRAASIISKTHGDDHGMFKELLGLIHQCKMEAH 452

Query: 328 YKLSSKDE 335
               ++ E
Sbjct: 453 MSKGAQRE 460


>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 732

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 124/295 (42%), Gaps = 36/295 (12%)

Query: 3   DIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           D+  ++   Y  +AN VN +L      + ++ +AE  SK+  N+H + ++  R LGTG+Y
Sbjct: 342 DVSAEKRATYVGMANAVNSMLPPGPARMDVDALAEVMSKVHVNSHGVVDAAGRALGTGVY 401

Query: 61  PVISIINHSCLPNAVLVF------------------------EGRLAVVRAVQHVPKGAE 96
           P  ++ NHSC PNAV+ F                         G    VR V  V +G E
Sbjct: 402 PPAALFNHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGSTNNGARLTVRCVSPVEEGEE 461

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD-DGCSG 155
           + I+Y E   +  TR++AL E+  F C C RC         +  A L G+RC     C G
Sbjct: 462 ICIAYAEVYATRETRREALWEKKAFACECRRCADAAS---ARRDAPLGGWRCPRLSACDG 518

Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC--GNHQEVVSTYKMI 213
            +     D G +C  CG   ++   K+ A+E   + +   ALT+    +H   VS    +
Sbjct: 519 VV----PDAGDSCLTCGAKCARARSKRDAAERRWIGRHAAALTALRENDHATAVSIACEV 574

Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHP 268
            +          V   + R  L+     L+ W+ A    +      +R     HP
Sbjct: 575 VRESDDALCDAHVIRHECRLVLMDAHAALKQWRHAADVSRDVADAMRRFVNVNHP 629


>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Meleagris gallopavo]
          Length = 478

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 84/320 (26%), Positives = 141/320 (44%), Gaps = 43/320 (13%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    +  +  I+  F  + CNA T+ +   L+ +G G++P +   NH C PN  ++F 
Sbjct: 158 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 217

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
                +RA+  +  G E+ +SY++    +  RQK LK+QY F CTC  C K         
Sbjct: 218 NGKIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCKK--------- 268

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
                  + KDD      +++ D K           S E +K    EV   SK TL   +
Sbjct: 269 -------KIKDD--LMLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKIN 305

Query: 200 C----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
                G + EVV   +   K Q+ +    ++ L++      ++L  L+ ++EA  Y +  
Sbjct: 306 KARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRM 365

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           +  Y ++Y   +  LG+     G   W  G  E     + +A  IL ITHG + P  K+ 
Sbjct: 366 VDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD- 424

Query: 316 ILKLEEAQAEASYKLSSKDE 335
            L++   Q E   ++  ++E
Sbjct: 425 -LEVMRVQTEMELRMFQQNE 443


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 77/136 (56%), Gaps = 5/136 (3%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           EIS  E  E FSK++CN+  IC+ E++ +GTG++P    +NHSC PN+V VF G    ++
Sbjct: 152 EISPPEGLELFSKISCNSFAICDGEMQAIGTGIFPNAVCLNHSCAPNSVAVFNGTNIYIK 211

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           A++ +P G E+ ISYI+      TRQ+ L+ Q+ F C C RC+     D    + +L   
Sbjct: 212 ALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQCHRCL-----DASDNNKMLTSL 266

Query: 147 RCKDDGCSGFLLRDSD 162
            C +  C   + +  D
Sbjct: 267 ICPNKSCEAIVYQTFD 282



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG-DTENAIKSMTEAVEIL 301
           +DW +AL Y    + VY   YP++HP LG+  Y  GKL      D E A+K + +A  IL
Sbjct: 288 QDWTKALEYATRNLEVYTWFYPKYHPCLGVHLYKIGKLLAVTHQDLELAVKRLEQARRIL 347

Query: 302 RITHGTNSPFMKELILKLEEAQAE 325
            +THG + P ++EL   L +A  E
Sbjct: 348 EVTHGQSHPLVQELCEYLCQASEE 371


>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
           mutus]
          Length = 221

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 73/258 (28%), Positives = 123/258 (47%), Gaps = 39/258 (15%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ V  G E
Sbjct: 1   FLQVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 60

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           + I Y++   ++  R+K L+ QY F C C RC    Q  D                    
Sbjct: 61  LTICYLDMLMTSEERRKQLRGQYCFDCDCFRC----QTQD-------------------- 96

Query: 157 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
             +D+D          ++   E++ K   EV    KK   L +    ++V++  + I   
Sbjct: 97  --KDAD----------MLTGDEQVWK---EVQESLKKIEDLKAHWKWEQVLAMCQSIISS 141

Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
             +     ++  ++  +  +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q   
Sbjct: 142 NAERLPDINIYQLKVLDCAMDACLNLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMK 201

Query: 277 CGKLEWFLGDTENAIKSM 294
            GKL+   G    A+K++
Sbjct: 202 VGKLQLHQGMFPQAMKNL 219


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 67/97 (69%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
             G E+ I Y++   ++  R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
          Length = 473

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 141/320 (44%), Gaps = 43/320 (13%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    +  +  I+  F  + CNA T+ +   L+ +G G++P +   NH C PN  ++F 
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
                +RA+  +  G E+ +SY++    +  R+K LK+QY F CTC  C K         
Sbjct: 213 NGKIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK--------- 263

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
                  + KDD      +++ D K           S E +K    EV   SK TL   +
Sbjct: 264 -------KIKDD--LMLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKIN 300

Query: 200 C----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
                G + EVV   +   K Q+ +    ++ L++      ++L  L+ ++EA  Y +  
Sbjct: 301 KARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRM 360

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           +  Y ++Y   +  LG+     G   W  G  E     + +A  IL ITHG + P  K+ 
Sbjct: 361 VEGYMKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD- 419

Query: 316 ILKLEEAQAEASYKLSSKDE 335
            L++   Q E   ++  ++E
Sbjct: 420 -LEVMRVQTEMELRMFQQNE 438


>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
 gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
          Length = 909

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 23/299 (7%)

Query: 33  IAENFSKLACNAHTIC------------NSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           +  +  +L CNA  I             + ++R L T ++P  +++NHSC PN  + F+G
Sbjct: 606 LCHHMQQLRCNAQAITTLQEQDSVSLLEDKQVR-LATAVFPTEALLNHSCRPNVFVSFQG 664

Query: 81  RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
           +  +VRAV H+  G E+L  Y   AG  +   RQ ALKEQY F+C+C  C +  Q  +  
Sbjct: 665 KTLIVRAVSHIKPGEELLHCYGPHAGRMVYGERQAALKEQYFFSCSCDACQE--QVGNPN 722

Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ--QCGLVRSKEEIKKIASEVNILSKKTLA 196
              +   Y+C     +  L     DK   C    C      ++IK  + ++  L  ++ A
Sbjct: 723 TVDMFSAYKCPVCNNAAKL----QDKKLVCTSPNCDAQGDTDDIKTTSKKIQDLFVQSSA 778

Query: 197 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 256
               G  QE VS  K    LQ K+ HP + ++ +T + L +      D K+++ + + ++
Sbjct: 779 FLEEGQIQEAVSGLKQCLVLQWKILHPSNKDIARTHDALARCYATSGDCKKSVEHLKKSL 838

Query: 257 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
              +  +      L  + +   +L++        + ++  A+ I    +G + P  KEL
Sbjct: 839 STVELQFGACSVELAHELHKLAQLQFNGQQVAECLDTIERALTIFGCHYGDHHPKSKEL 897


>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
          Length = 374

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 122/254 (48%), Gaps = 18/254 (7%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           +A   S+++ N+ TI + +   LG GLY   S ++HSC PNA   FEG    VR ++ + 
Sbjct: 95  MAFYLSQISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANFEGTKVCVRCIEPIE 154

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
              ++ ISY+     T TR+K L ++YLF CTC  C+      DI +  +   ++C   G
Sbjct: 155 DEKDLRISYVSPLDDTATRRKNLLQKYLFECTCEACL------DIDQDQMKFSFKCVRGG 208

Query: 153 CSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVN------ILSKKTLALTSCGNHQE 205
           C G + R  D++ F C  CG  + +++EI+ +   V        +SKK   L + G + E
Sbjct: 209 CKGHMTRMQDNR-FRCDYCGEKQVTEDEIQTLNVAVEKTRRMMDISKK---LKAQGRYAE 264

Query: 206 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL-TIPVYQRVYP 264
           ++       K      H  ++ +++ R+ L+      ++W      C +  +  Y+  Y 
Sbjct: 265 IMVAGFACLKDAVDFLHSDNLLIVELRDCLLNACRMTQNWSVEAIQCGIANLEPYRSFYG 324

Query: 265 QFHPLLGLQYYTCG 278
           + HP L +  Y  G
Sbjct: 325 RHHPALAILLYDIG 338


>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
 gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
          Length = 265

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 73/120 (60%), Gaps = 1/120 (0%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           ++D+++  LYAQ+A +   + +W        +A+  S L+CN HT+C+ ELRPLG  LYP
Sbjct: 146 ELDDRRKQLYAQMAVVTWWVARWGTWPGFRTVAQLLSLLSCNCHTVCDEELRPLGVALYP 205

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
             +++NHSC P+ V  F G    +RA++ +  G E+ I+YIE A +   R++ L + Y F
Sbjct: 206 TGALVNHSCSPSTVQTFHGSTLELRALRQLAPGDEITIAYIELAATRQERRETLADSYFF 265


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 145/316 (45%), Gaps = 35/316 (11%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    +  +  I+  FS ++CN  T+ +   L+ +G  ++P + + NH C PN  ++F 
Sbjct: 153 WPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVAIFPNLCLTNHDCWPNCTVIFN 212

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
                +RA+  + KG E+ +SY++    T  R + LK+QY F CTC  C    +  D   
Sbjct: 213 NGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCT--NKTKDALL 270

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
            A+ +G    +D     +++ S D            + E+I+K  SE             
Sbjct: 271 LAVNDGESKPEDRVVKEVIQYSKD------------TMEKIEKARSE------------- 305

Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
            G++ +VV   +   K Q+ ++   ++ +++      ++L  L+ +++A    +  I  Y
Sbjct: 306 -GSYHQVVKLCRECLKRQEPIFADTNIYMLRILSIYSEVLSYLQMFQDAAENAKKMIDGY 364

Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 319
            ++Y   +  LG+     G  +W  G  E     + +A  IL ITHG + P  K+  L++
Sbjct: 365 MKIYHPNNAQLGMAIMRAGVTQWHAGMIEVGHGMICKAFAILLITHGPSHPITKD--LEM 422

Query: 320 EEAQAEASYKLSSKDE 335
             +Q E   ++  ++E
Sbjct: 423 MRSQTEMELRMFKENE 438


>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Ornithorhynchus anatinus]
          Length = 477

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 131/300 (43%), Gaps = 42/300 (14%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
                +RA+  + +G E+ +SYI+    +  R+K LK+QY F CTC  C K G  DD+  
Sbjct: 218 NGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCEK-GIKDDL-- 274

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL---- 195
                           FL              G+    +  + +  EV   SK  L    
Sbjct: 275 ----------------FL--------------GVKDEPKPSQDVVKEVIQFSKDALEKID 304

Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 255
              S G + EVV   +   + Q+ +    ++ L++      ++L  L+ ++EA  Y +  
Sbjct: 305 KARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYLQSFEEASDYAKKM 364

Query: 256 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 365 VDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILLVTHGPTHPITKDL 424


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 77/305 (25%), Positives = 136/305 (44%), Gaps = 34/305 (11%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    I++I+  F  + CN  ++ +   L+ +G GL+P + ++NH C 
Sbjct: 145 DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCW 204

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           PN  ++       +RA+  + +G E+ ++Y++    +  RQ+ LK QY F CTC  C   
Sbjct: 205 PNCTVILNHGKIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN- 263

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
                          R KDD   G   R+ D         G+  S+E++K+       + 
Sbjct: 264 ---------------RIKDDIKIGG--REED---------GVKPSEEQVKEATDYCFQML 297

Query: 192 KKTLALTSCGNHQEVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
           +K       G++ EVV   +  IEK +  L     + L++    + ++   L+ + +A  
Sbjct: 298 EKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAAD 356

Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
           Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL ITHG   P
Sbjct: 357 YARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKAYAILMITHGPIHP 416

Query: 311 FMKEL 315
             K+L
Sbjct: 417 ITKDL 421


>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
           domestica]
          Length = 490

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 145/333 (43%), Gaps = 58/333 (17%)

Query: 5   DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 59
           +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ +   L+ +G G+
Sbjct: 141 EEEQKDLRVDVDSFLNF---WPAQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197

Query: 60  YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 106
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SY++   
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYVDFLN 257

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
            +  R+K LK+QY F CTC  C K G  DD+                  FL   +D K  
Sbjct: 258 VSQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKADPK-- 296

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 222
                         + +  EV   SK+TL       S G + EVV   +   + Q+ +  
Sbjct: 297 ------------PSQDVVKEVTQFSKETLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344

Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
             ++++++      ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFHEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404

Query: 283 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
             G  E     + +A  IL +THG   P  K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
 gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
          Length = 532

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 139/316 (43%), Gaps = 39/316 (12%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           IA  ++++  NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ + 
Sbjct: 199 IAAMYARVMSNALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIR 258

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           K  E+ ISYI+T      RQ+ L+ ++ FTC C +C         QE A L+        
Sbjct: 259 KDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLVPA 309

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
            S F+L     +     Q        E++K+++E  I   K + L SC            
Sbjct: 310 DSKFVLDPEAKQAMAQTQEQTFALYGELQKLSTEHVIHGLKQI-LASC------------ 356

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLL 270
               + + Y  +     + R+ L+  L+ +  +++A A C      +  ++YP  FHP+ 
Sbjct: 357 ---YESRFYPMYRQPYAEARDVLVVNLLSVGKFQDAWAQCAKRYKYILPKLYPVPFHPVR 413

Query: 271 GLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILK 318
            +Q +    L  +L  TE  + +            + + +++  ++HG N+ F K +  K
Sbjct: 414 VVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKEK 473

Query: 319 LEEAQAEASYKLSSKD 334
            EE   E    + + D
Sbjct: 474 AEEMIEELKRSVGNPD 489


>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
          Length = 490

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 145/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  ++CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL    D K           S+E 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEA 301

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
           +K++      LSK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 302 VKEMIQ----LSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSTV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
          Length = 490

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V++ LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P ++++NH C P
Sbjct: 151 VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
           harrisii]
          Length = 490

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 144/333 (43%), Gaps = 58/333 (17%)

Query: 5   DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 59
           +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ +   L+ +G G+
Sbjct: 141 EEEQKDLRMDVDSFLNF---WPPQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197

Query: 60  YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 106
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
            +  R+K LK+QY F CTC  C K G  DD+                  FL    D K  
Sbjct: 258 ISQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKEDPK-- 296

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 222
                         + +  EV   SK TL       S G + EVV   +   + Q+ +  
Sbjct: 297 ------------PSQDVVKEVTQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344

Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
             ++++++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFQEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404

Query: 283 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
             G  E     + +A  IL +THG   P  K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
 gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
          Length = 477

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/297 (24%), Positives = 135/297 (45%), Gaps = 38/297 (12%)

Query: 24  QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
           Q  + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F    
Sbjct: 161 QSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGK 220

Query: 83  AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
             +RA+  + +G E+ +SYI+    +  R++ LK+QY F C C  C K            
Sbjct: 221 IELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCACEHCQK------------ 268

Query: 143 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALT 198
               + KDD    FL              G+  + +  +++  E+   SK TL       
Sbjct: 269 ----KLKDD---LFL--------------GVKDNPKPSQEVVKEMIRFSKDTLEKIDKAR 307

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
           S G + EVV   +   + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  
Sbjct: 308 SEGLYHEVVKLCRECLQKQEPVFADTNIYTLRMLSIVSEVLSYLQAFEEASFYARRMVDG 367

Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 368 YMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 424


>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
          Length = 452

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 139/321 (43%), Gaps = 69/321 (21%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSELRPLGT 57
           D K++  +  I  ++N  L       NEI  N       + ++  N+  I N E++ +GT
Sbjct: 180 DTKRMEYFMTICGVLNEYLS------NEILPNSVELLGIYGRMCINSFNILNGEMQAIGT 233

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKAL 115
           G+Y   SI++HSC PNAV  F+G    ++  Q +PK     + ISYI+   S   R+K L
Sbjct: 234 GIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLMNSKSHRKKEL 293

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
           K++Y F C CPRC    + DDI               C  +       K  TCQ      
Sbjct: 294 KDRYYFDCDCPRC----KNDDID--------------CYHYAA-----KCPTCQ------ 324

Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
                K + ++V+        L  C          K+    Q+ L+H   +   +  +  
Sbjct: 325 -----KPVIAKVD--------LDVC----------KLCIAKQENLFHDLCLVRSKIMDAA 361

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KS 293
               ++L+ W+ A+ Y    I  Y+  Y + HPL  +      K+   +  T +++  K 
Sbjct: 362 FDSAIQLQLWEAAIQYGIPLIQAYKLWYGEEHPLTAILLLKLFKITLLISTTNDSVAQKY 421

Query: 294 MTEAVEILRITHGTNSPFMKE 314
             EA +IL ITHG +S F ++
Sbjct: 422 YEEAAKILEITHGRDSSFYRQ 442


>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 528

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 79/146 (54%), Gaps = 12/146 (8%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQ 89
           +I + F  + CN  TICN+   PLG GLYP  +++NHSC PN ++ F+   + AV+  ++
Sbjct: 202 QITKLFCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIE 261

Query: 90  HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY--- 146
            +  G E+ ISY++    T  R+K L+  Y F CTC RC         QE A LEG+   
Sbjct: 262 TIHSGQELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCT------SSQEDAFLEGFLCT 315

Query: 147 RCKDDGCSGFLLRDSDDKGFTCQQCG 172
           +C++   S  +L D   + F C  C 
Sbjct: 316 QCRNTSESARIL-DEKKRIFRCCVCA 340


>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Canis lupus familiaris]
          Length = 490

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 142/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  ++CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEDRKKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL    D K           S+E 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++  ++    + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSTVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437


>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
 gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
          Length = 490

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P ++++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
           garnettii]
          Length = 490

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 142/317 (44%), Gaps = 50/317 (15%)

Query: 19  VNLILQ-WPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP      SI  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+A+ H            + +G E+ +SYI+    T  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELTVSYIDFLNVTEERRKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSKE 178
            F C C  C K                + KDD    FL ++D+        +  +  SKE
Sbjct: 271 YFDCACEHCQK----------------KLKDD---LFLAVKDNPKPSQEVVKEMIQFSKE 311

Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
            + KIA              S G + EVV   +   + Q+ ++   ++ +++    + ++
Sbjct: 312 TLDKIAK-----------ARSEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIVSEV 360

Query: 239 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
           L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A 
Sbjct: 361 LSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY 420

Query: 299 EILRITHGTNSPFMKEL 315
            IL +THG + P  K+L
Sbjct: 421 AILLVTHGPSHPITKDL 437


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 71/322 (22%), Positives = 138/322 (42%), Gaps = 52/322 (16%)

Query: 16  ANLVNLILQWPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++      WP+    +  + ++  F  ++CN  T+ +   L+ +G GL+P + ++NH C
Sbjct: 149 ADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDC 208

Query: 71  LPNAVLVFE-------------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
            PN  ++                R   +RA++ +  G E+ +SY++    +  RQ+ L++
Sbjct: 209 WPNCTVILNHGDQSALDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQ 268

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F C C  C+  G  D++     +   +  +DG                        K
Sbjct: 269 QYYFDCKCEHCVN-GTKDEL-----MTAVKPTEDG------------------------K 298

Query: 178 EEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
           +    +  ++   S + L    A  + GN  EV+   +   + Q  ++   +V++++   
Sbjct: 299 QPSADVVKQLTDFSLQALVKIEAARAQGNFHEVIRICRECLEKQDPVFADTNVHVLRVLS 358

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              ++L  L+ ++EA  Y Q  +  Y ++Y   +  LG+     G   W  G  E A   
Sbjct: 359 TASEVLSFLQQFQEAAGYAQRMVDGYMKLYHPNNAQLGMAIMRAGVTHWHAGLIEAAHGL 418

Query: 294 MTEAVEILRITHGTNSPFMKEL 315
           +  A  IL ITHG + P  ++L
Sbjct: 419 ICRAYGILMITHGPHHPISRDL 440


>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 382

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 134/305 (43%), Gaps = 34/305 (11%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    I++I+  F  + CN  T+ +   L+ +G GL+P + ++NH C 
Sbjct: 51  DIHNFLDYWPRNSKQHTIDDISHLFGVINCNGFTVSDQRGLQAVGVGLFPNLCLVNHDCW 110

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           PN  ++       +R++  + +G E+ ++Y++    +  R++ LK QY F C C  C   
Sbjct: 111 PNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCEHCRN- 169

Query: 132 GQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
                          R KDD   +G   R+ D         GL  S+E++K+       +
Sbjct: 170 ---------------RTKDDLKLAG---REVD---------GLKPSEEQVKEATDYCFQM 202

Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
            +K       G++ EVV   +      + +     + L++    L ++   L+ + +A  
Sbjct: 203 LEKMDKARLDGDYHEVVKICRDAIDRTEPVLADTHIYLLRMWSTLSEVQAYLQYFNDAAT 262

Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
           Y +  +  Y ++YP  +  LG+     G   W  G  E     + +A  IL +THG   P
Sbjct: 263 YSRKMVEGYMKLYPPNNAALGMAAMRAGVNHWQAGLIEVGHGMVCKAYAILLVTHGPTHP 322

Query: 311 FMKEL 315
             K+L
Sbjct: 323 ITKDL 327


>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
          Length = 239

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 43/257 (16%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 61  RKQLRDQYCFECDCFRCQT----------------------------QDKDADMLTGDEQ 92

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
                +E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q 
Sbjct: 93  VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 139

Query: 232 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 140 KVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199

Query: 290 AIKSMTEAVEILRITHG 306
           A+K++  A +I+R+THG
Sbjct: 200 AMKNLRLAFDIMRVTHG 216


>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
          Length = 490

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 141/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P ++++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLALVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNISEERKKQLKRQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL    D K           S+E 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQEV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++  ++    + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRLLSIVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 362 SYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNWHAGNIEVGHGMICKAYA 421

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437


>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
 gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
 gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
          Length = 490

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYMLRILSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
 gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
          Length = 374

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 127/283 (44%), Gaps = 41/283 (14%)

Query: 35  ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
           E F K+  N +TI N E  P+GTGLY   SI++HSC PNA  VF+GR   +RA   +   
Sbjct: 112 EIFGKMVVNTYTITNYEECPIGTGLYIGPSILDHSCTPNAQAVFDGRTLRLRAAADIECD 171

Query: 95  A--EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           +   + +SYI+      TR + L+E+Y F+C CPRC      D+ QE             
Sbjct: 172 SVDGIRVSYIDLKELRRTRIEELRERYYFSCNCPRC----SADEAQEY------------ 215

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                         T +   L    +E+ +   +  + +++ L     G  + ++S++K+
Sbjct: 216 -------------LTEKSPSLTAKAKELLREFMQAEVHAREDLVRLR-GQAEALLSSHKL 261

Query: 213 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
                 +          +  E L K  +E+ D+  AL Y       Y+R Y  + P+ G+
Sbjct: 262 PATDAAR---------SEALELLSKSCLEMGDFDAALPYYLAKEETYRRCYGPYSPVYGV 312

Query: 273 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
             +T  KL       + A++   +A E++ ++HG N    ++L
Sbjct: 313 LLFTIAKLYHHKVLLQEALQYFEKASEVVTVSHGRNHRLYRDL 355


>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
          Length = 490

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
 gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
 gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
 gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
 gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
 gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
 gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
 gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
          Length = 490

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
           boliviensis boliviensis]
          Length = 490

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
          Length = 493

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
 gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
          Length = 239

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKV 141

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 292 KSMTEAVEILRITHG 306
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
          Length = 484

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 145 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 204

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 205 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 264

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 265 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 291

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 292 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 351

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 352 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 411

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 412 KAYAILLVTHGPSHPITKDL 431


>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
          Length = 437

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 146/323 (45%), Gaps = 39/323 (12%)

Query: 1   MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
           + D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG
Sbjct: 139 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 196

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
             ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L+
Sbjct: 197 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 256

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR- 175
           + Y F+C C  C      D ++ S      R + D      ++D            +VR 
Sbjct: 257 DMYYFSCDCKECTT-KSMDVVKMSV-----RKRSDEIGEKEIKD------------MVRY 298

Query: 176 ---SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
              S E  ++   + NIL   +  L  C    + +ST          ++   +V ++   
Sbjct: 299 ARNSMENFRRAKQDKNILC-GSFILEMCELSIDKMST----------VFDDSNVYILHMM 347

Query: 233 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 292
            + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L      I 
Sbjct: 348 YQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDRKLAGID 407

Query: 293 SMTEAVEILRITHGTNSPFMKEL 315
           +  +A+ I+ + HG +  ++ EL
Sbjct: 408 AFQKALTIMEVVHGKDHTYVTEL 430


>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
          Length = 239

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKV 141

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 292 KSMTEAVEILRITHG 306
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
          Length = 490

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
          Length = 510

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 144/320 (45%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Ailuropoda melanoleuca]
          Length = 490

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 142/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH+C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHNCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL    D K           S+E 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP----------SQEV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++  ++    + ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRLLSTVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437


>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
          Length = 420

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 120/265 (45%), Gaps = 33/265 (12%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C                              +D D      ++       E I+
Sbjct: 261 TCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIR 292

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
            +      + ++        +  E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 293 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 352

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQF 266
           ++DW+ AL Y Q  I  Y + YP +
Sbjct: 353 MQDWEGALKYGQKIIKPYSKHYPVY 377


>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 503

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 144/315 (45%), Gaps = 50/315 (15%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
            + ++++  +  +  CN  TI +S L P+G G YP+ S+ NH C PN + +F+G   V++
Sbjct: 228 NLPVSDLIHHLGRFRCNNFTIIDSNLFPVGEGTYPLASLFNHDCWPNCIAIFDGSRVVIQ 287

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
            ++ + KG E+ ISYI+      +R+ +L+ +Y F C C  C+        +    L   
Sbjct: 288 TIRDIAKGDELCISYIDPILDHDSRRMSLETKYCFNCQCSVCMS-------ESCTPLHSP 340

Query: 147 RCKDDGCSGFLLRDSDDKGF-------TCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
             K D      L +  D G+       TC    L +S +E ++ A +  I   KT+AL S
Sbjct: 341 TTKKDAL--LSLDEKSDVGWRLLKWFHTC----LKKSDDEFER-AQQDQI--SKTIALVS 391

Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
               + VV+      ++       F  +LM+            +DW++A       +  Y
Sbjct: 392 ---KRAVVAEIGNATQM-------FQSHLMR------------QDWEQAYKSGLHLLGQY 429

Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK--ELIL 317
              YP+++PL+    +   +  W  G T+  I  +    + L IT+G+    MK   LI 
Sbjct: 430 LLRYPRYYPLVSQHMFFVAQCMWNSGQTDETIFLLGVIKKCLEITYGS---VMKSHHLIG 486

Query: 318 KLEEAQAEASYKLSS 332
           +++   A+A Y+ S+
Sbjct: 487 QVDSLLAQALYEQSA 501


>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
 gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 490

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+     L+ +G G++P + ++NH+C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRGLQAVGVGIFPNLGLVNHNCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELTVSYIDFLNFSEERKKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDNPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSKGLYHEVVKLCRECLEKQEPVFADTNIYTLRLLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNWHAGNIEVGHSMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
 gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
          Length = 614

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           GL+P +SI+NHSC PN     EG +  VRAVQ +PKGAE+ +SYI    S   R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 170
              F CTC RC+   +  +      LEG  C   GC G  LR +   +D+     + C  
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412

Query: 171 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 224
           C  V   E ++       +++K   + +A     + +      K++E    +     HP 
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472

Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
            V L      L+     + D +E L  C+L +   ++V P     L   Y+  G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Equus caballus]
          Length = 490

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 142/320 (44%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL              G+    + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDDPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            + +  EV   SK TL       + G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQDVVKEVVQFSKDTLEKIDKARTEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASYYARKMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Cavia porcellus]
          Length = 490

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 140/320 (43%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            +  G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL              G+    + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKEDPKP 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    +
Sbjct: 298 SQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYTLRMLSVV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 34/318 (10%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGT 57
           DI E  L  +    ++ N +  WP  S    ++ ++     + CN   + +   L+ +G 
Sbjct: 133 DISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGV 190

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           GL+P + ++NH C PN  ++       +RA+  +  G EV ++Y++    +  RQ+ LK+
Sbjct: 191 GLFPNLCLVNHDCWPNCTVILNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQ 250

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C +  + D     A ++G +  +                           
Sbjct: 251 QYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE--------------------------- 283

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
           E++K++        +K        N+ EVV   +   + Q+ +     +  ++    L +
Sbjct: 284 EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADTHIYYLRVCCTLSE 343

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G  E A   + +A
Sbjct: 344 VLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQAGFIEVAHGMICKA 403

Query: 298 VEILRITHGTNSPFMKEL 315
             IL ITHG   P  K+L
Sbjct: 404 FAILMITHGPTHPITKDL 421


>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
          Length = 239

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTHD----------------------KDAD--------- 85

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKV 141

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 292 KSMTEAVEILRITHG 306
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
 gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
          Length = 614

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           GL+P +SI+NHSC PN     EG +  VRAVQ +PKGAE+ +SYI    S   R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 170
              F CTC RC+   +  +      LEG  C   GC G  LR +   +D+     + C  
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412

Query: 171 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 224
           C  +   E ++       +++K   + +A     + +      K++E    +     HP 
Sbjct: 413 CSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472

Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
            V L      L+     + D +E L  C+L +   ++V P     L   Y+  G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
           norvegicus]
          Length = 239

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/257 (27%), Positives = 120/257 (46%), Gaps = 43/257 (16%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 61  RKQLRDQYCFECDCIRCPT----------------------------QDKDADMLTGDEQ 92

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
                +E +KKI  E+    K    L  C   Q ++++    E+L         +N+ Q 
Sbjct: 93  IWKEVQESLKKI-EELKAHWKWEQVLALC---QAIINSNS--ERLP-------DINIYQL 139

Query: 232 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 140 KVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199

Query: 290 AIKSMTEAVEILRITHG 306
           A+K++  A +I+++THG
Sbjct: 200 AMKNLRLAFDIMKVTHG 216


>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 334

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 67/106 (63%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           ++ E F+K+ACN  +I N E++ +G GLYP +S++N+SC PN  +VFEG    +R+++ +
Sbjct: 200 DVFETFAKVACNCFSISNGEMQDVGVGLYPSMSLLNNSCDPNCAIVFEGPELHLRSIREI 259

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 137
            +G E+ I Y+ET   T  RQ+ LK QY F C C  C    +  DI
Sbjct: 260 QEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDI 305


>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
           cuniculus]
          Length = 490

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 140/318 (44%), Gaps = 47/318 (14%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++   +  WP      S+  I+  F  + CN  T+ +   L+ +G G++P ++++NH C
Sbjct: 149 ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLALVNHDC 208

Query: 71  LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
            PN  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+
Sbjct: 209 WPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERRRQLKK 268

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C K G  DD+                  FL    + K           S+
Sbjct: 269 QYYFDCTCEHCQK-GLKDDL------------------FLGVKDNPKP----------SQ 299

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
           E +K++        +K     S G + EVV   +   + Q+ ++   ++  ++    + +
Sbjct: 300 EVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSE 359

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A
Sbjct: 360 VLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKA 419

Query: 298 VEILRITHGTNSPFMKEL 315
             IL +THG + P  K+L
Sbjct: 420 YAILLVTHGPSHPITKDL 437


>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 361

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 41/286 (14%)

Query: 44  AHTIC--NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
           +HT+   + EL  LGT +YP +++INHSCLP+ ++ + G  A VRAV+ +  G EVLISY
Sbjct: 109 SHTLFSEDEELSHLGTAVYPDVALINHSCLPSVIVTYNGTSADVRAVRDMNPGDEVLISY 168

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
           I+    T  R   L+E Y FTC C  C                G +  D   +   LR  
Sbjct: 169 IDVLYPTEDRNTRLRESYYFTCQCQEC----------------GSQSSDQ--AKLKLRKR 210

Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL- 220
            D               E + I + V    K         N     ST  ++E  ++ L 
Sbjct: 211 SDPA-------------EAEVINTMVRYARKCIREFRVFKNSNTPAST--LLEMCEQSLD 255

Query: 221 -----YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 275
                +   +V ++    + + + + ++D   AL Y +     Y+++YP +   L   Y 
Sbjct: 256 EMGAVFDESNVYMLHMMYQAMGVCLYMQDPDGALRYGEKIAKYYRKLYPAYSLNLSSLYL 315

Query: 276 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
             G+L + +      I  + +A  I+ +THG N  ++ EL  +++E
Sbjct: 316 KLGRLYFGMERNSECISVLKKAKAIMEVTHGKNHFYVTELDRQMKE 361


>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
          Length = 485

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 146 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 206 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 265

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C+C  C K G  DD+                  FL    D K           S+E 
Sbjct: 266 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 296

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++ +++      ++L
Sbjct: 297 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVL 356

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  
Sbjct: 357 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 416

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 417 ILLVTHGPSHPITKDL 432


>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
          Length = 456

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 117 VDTFLQYWPLQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 176

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 177 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 236

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C+C  C K G  DD+                  FL    D K           S+E 
Sbjct: 237 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 267

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++ +++      ++L
Sbjct: 268 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFTDTNLYVLRLLSIASEVL 327

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  
Sbjct: 328 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 387

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 388 ILLVTHGPSHPITKDL 403


>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
 gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
           Full=CD8b-opposite; AltName: Full=Zinc finger protein
           BOP; Short=m-BOP
 gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
           Methyltransferase Smyd1
 gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
 gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
 gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
          Length = 490

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C+C  C K G  DD+                  FL    D K           S+E 
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++ +++      ++L
Sbjct: 302 VKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  
Sbjct: 362 SYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYA 421

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437


>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
          Length = 230

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 74/125 (59%), Gaps = 1/125 (0%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           +EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P +
Sbjct: 59  NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDV 118

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           +++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC
Sbjct: 119 ALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTC 178

Query: 124 TCPRC 128
            C  C
Sbjct: 179 ECQEC 183


>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
 gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
          Length = 490

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 143/320 (44%), Gaps = 56/320 (17%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C+C  C K G  DD+                  FL    D K           S+E 
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FLAVKEDPKP----------SQEV 301

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
           +K    E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 302 VK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIV 357

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + 
Sbjct: 358 SEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMIC 417

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Meleagris gallopavo]
          Length = 486

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/333 (25%), Positives = 143/333 (42%), Gaps = 56/333 (16%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    +  +  I+  F  + CNA T+ +   L+ +G G++P +   NH C PN  ++F 
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212

Query: 80  -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
            G    VR++ H            +  G E+ +SY++    +  RQK LK+QY F CTC 
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCE 272

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
            C K                + KDD      +++ D K           S E +K    E
Sbjct: 273 HCKK----------------KIKDD--LMLAVKEGDKKP----------SAETVK----E 300

Query: 187 VNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           V   SK TL   +     G + EVV   +   K Q+ +    ++ L++      ++L  L
Sbjct: 301 VIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYL 360

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL 
Sbjct: 361 QMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILL 420

Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 421 ITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451


>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
          Length = 553

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 65/108 (60%)

Query: 23  LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
           +Q   I  +E+    +    N  T+ + E+  +G G YPV S+ NH+C PNAV+VF+G L
Sbjct: 178 VQQTSIDEDELVHYLNVFKSNNFTLDDQEMFAVGEGTYPVASLFNHTCRPNAVIVFDGAL 237

Query: 83  AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A +RA+  +    E+ ISYI+ A +   R++AL+E+Y F C C RC +
Sbjct: 238 AEIRAIDTIEPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTR 285


>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
          Length = 464

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 124/283 (43%), Gaps = 19/283 (6%)

Query: 41  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
           + N+  I   E   LG GLY  +S INHSC PN  + F G  A VR+V  +P   E+ I+
Sbjct: 191 SANSFGISGVEGNNLGVGLYDTVSYINHSCAPNCSITFSGVYARVRSVHDLPPNQELTIA 250

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 156
           YI+       R+  LK Q++F C C RC +  + DD     +     C   GC       
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRCER--ERDDDPLLTLC----CPGSGCDQIIKYQ 304

Query: 157 LLRDSDDKGF-TCQQC--GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 213
           L    DD  F   ++C  G +         + +V    K+ L +    N         M 
Sbjct: 305 LTAPPDDTAFEVTKKCERGHLDDPATFDSWSRKV----KRVLGIIGEANKARRPVQGDMA 360

Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
               ++L  P +  +      ++   +    ++EAL + +  I VY  +     PL G++
Sbjct: 361 SF--ERLLPPTNYLMHIAYTGVLDQAIGTGQFQEALQWARKVILVYDALNSPMDPLRGIE 418

Query: 274 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
                KL+    D    ++++ +AV+++R+THG +S   ++++
Sbjct: 419 LLRVAKLQALARDGRGFLETIQKAVDVIRLTHGEDSVVYQQVL 461


>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Taeniopygia guttata]
          Length = 478

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 147/325 (45%), Gaps = 35/325 (10%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++ + +  WP    +  +  I+  F  ++CN  T+ +   L+ +G G++P +   NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            PN  +VF      +RA+  +  G E+ +SY++    +  R++ LK+QY F CTC  C K
Sbjct: 209 WPNCTVVFNNGKIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK 268

Query: 131 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 190
                           + KDD        +S     T ++  +  SK+ ++KI       
Sbjct: 269 ----------------QLKDDLMLAVKAGESKPSADTVKEV-IQLSKDTLEKI------- 304

Query: 191 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
           +K  +     G++ EVV   +   + Q+ +    ++ L++      ++L  L+ ++EA  
Sbjct: 305 NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAD 360

Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
           Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL ITHG + P
Sbjct: 361 YAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHP 420

Query: 311 FMKELILKLEEAQAEASYKLSSKDE 335
             K+  L++   Q E   ++  ++E
Sbjct: 421 ITKD--LEVMRVQTEMELRMFQQNE 443


>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 2-like [Xenopus (Silurana) tropicalis]
          Length = 430

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 70/293 (23%), Positives = 135/293 (46%), Gaps = 46/293 (15%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDE 233

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRC-------IKLGQFDDIQESAILEGYRCK 149
           V+ SYI+    T  R   L +     C C  C       +K+ +        + + ++ K
Sbjct: 234 VITSYIDLLYPTEDRNDRLIDSXFCNCDCRECSTKQKGRLKIAEISLKIHPPLDKXWKNK 293

Query: 150 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 209
           +  CS  LL                               L   T+A  S     E++  
Sbjct: 294 ERSCSVPLL------------------------------YLFPLTIAPPS-----ELLEI 318

Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
            ++  +    ++   +V ++    + + + + ++DW+ AL Y +  I  Y + YP +   
Sbjct: 319 CELSLEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPPYSLN 378

Query: 270 LGLQYYTCGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 320
           +   +   G+L  ++G  + AI  K++ +A+ I+ I HG +  ++ E+  +LE
Sbjct: 379 VASMWLKLGRL--YMGLEKKAIGTKALKKAIAIMDIAHGPDHHYIAEIKKELE 429


>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
 gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
          Length = 486

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 143/333 (42%), Gaps = 56/333 (16%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    +  +  I+  F  + CNA T+ +   L+ +G G++P +   NH C PN  ++F 
Sbjct: 153 WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVIFN 212

Query: 80  -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
            G    VR++ H            +  G E+ +SY++    +  R+K LK+QY F CTC 
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCE 272

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
            C K                + KDD      +++ D K           S E +K    E
Sbjct: 273 HCKK----------------KIKDD--LMLAVKEGDKKP----------SAETVK----E 300

Query: 187 VNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           V   SK TL   +     G + EVV   +   K Q+ +    ++ L++      ++L  L
Sbjct: 301 VIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSIASEVLSYL 360

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL 
Sbjct: 361 QMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMICKAYAILL 420

Query: 303 ITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 421 ITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451


>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
           leucogenys]
          Length = 512

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 139/310 (44%), Gaps = 51/310 (16%)

Query: 24  QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GR 81
           Q  + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F  G 
Sbjct: 183 QSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGN 242

Query: 82  LAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
              V+++ H            + +G E+ +SYI+    +  R++ LK+QY F CTC  C 
Sbjct: 243 HEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 302

Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
           K                + KDD    FL              G+  + +  +++  E+  
Sbjct: 303 K----------------KLKDD---LFL--------------GVKDNPKPSQEVVKEMIQ 329

Query: 190 LSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
            SK TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L+ +
Sbjct: 330 FSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYMLRMLSIVSEVLSYLQAF 389

Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
           +EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +TH
Sbjct: 390 EEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTH 449

Query: 306 GTNSPFMKEL 315
           G + P  K+L
Sbjct: 450 GPSHPITKDL 459


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 76/127 (59%), Gaps = 5/127 (3%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRC 128
           TC C  C
Sbjct: 261 TCECQEC 267


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 139/318 (43%), Gaps = 47/318 (14%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    I++I+  F  + CN  ++ +   L+ +G GL+P + ++NH C 
Sbjct: 145 DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCW 204

Query: 72  PN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           PN  V++  G  + V            RA+  + +G E+ ++Y++    +  RQ+ LK Q
Sbjct: 205 PNCTVILNHGNQSAVNTMFHSQRRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQ 264

Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
           Y F CTC  C                  R KDD   G   R+ D         G+  S+E
Sbjct: 265 YFFDCTCEHCKN----------------RIKDDIKIGG--REED---------GVKPSEE 297

Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIK 237
           ++K+       + +K       G++ EVV   +  IEK +  L     + L++    + +
Sbjct: 298 QVKEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HIYLLRMWSTMSE 356

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +   L+ + +A  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A
Sbjct: 357 VQAYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKA 416

Query: 298 VEILRITHGTNSPFMKEL 315
             IL ITHG   P  K+L
Sbjct: 417 YAILMITHGPIHPITKDL 434


>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 454

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 76/297 (25%), Positives = 132/297 (44%), Gaps = 49/297 (16%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           S++ I+  F  + CN  T+ +   L+ +G GL+P + ++NH C PN  ++       +RA
Sbjct: 166 SVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRA 225

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  +P+G E+ +SY++    +  RQK LKE++ F CTC  C K                 
Sbjct: 226 LGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK----------------H 269

Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NH 203
            KDD     +   +D K           S +++K    EV   SK++L          ++
Sbjct: 270 IKDD----LMTAVADSKP----------SADKVK----EVTAFSKESLEKIEKSRVERDY 311

Query: 204 QEVVS-TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPV 258
            EV+   Y+ ++K +  L      NL + R  ++ I  E+  ++ + A    Y    +  
Sbjct: 312 NEVLKLCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEG 366

Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           Y+++Y   +  LG+     G   W  G  E     + +A  IL +THG N    K+L
Sbjct: 367 YRKLYHPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 423


>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
           bisporus H97]
          Length = 674

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 81/158 (51%), Gaps = 18/158 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
           SI  + +  S+   NA T+ +  L P+G  + P+I++ NHSC PNAV VF          
Sbjct: 273 SIAGLVDLVSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKAN 332

Query: 82  --LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDI 137
             +A V  ++ +P   EV+ISYI+T  +   RQKALKE Y FTC+C  C K   G + D 
Sbjct: 333 EPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSCSLCEKPRDGAYVDS 392

Query: 138 QESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGLV 174
           +ES       C  + C G   L   DD    C +CG V
Sbjct: 393 RES-----LYCPKN-CGGLCWLPTEDDPLARCAKCGTV 424


>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 1-like [Pongo abelii]
          Length = 490

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 146/330 (44%), Gaps = 52/330 (15%)

Query: 5   DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPV 62
           +E+Q  L   +   +   L Q  + S+  I+  F  + CN  T+ +   L+ +G G++P 
Sbjct: 141 EEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPN 200

Query: 63  ISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTM 109
           + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    + 
Sbjct: 201 LGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSE 260

Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 169
            R++ LK+QY F CTC  C K                + KDD    FL            
Sbjct: 261 ERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------------ 289

Query: 170 QCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 225
             G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++   +
Sbjct: 290 --GVKDNPKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTN 347

Query: 226 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 285
           +  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G
Sbjct: 348 IYTLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAG 407

Query: 286 DTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           + E     + +A  IL +THG + P  K+L
Sbjct: 408 NIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 471

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 90/354 (25%), Positives = 156/354 (44%), Gaps = 50/354 (14%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRP 54
           ++D+ E  L    +I   V++ LQ+      + S++EIA  F  + CN  T+ +   L+ 
Sbjct: 136 VADLPEDHL---KRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNGFTLSDQRGLKA 192

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEG-------------RLAVVRAVQHVPKGAEVLISY 101
           +G GL+P + ++NH C PN  +V                R   +RA++ + +G E+ +SY
Sbjct: 193 VGVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIELRALRKICEGEELTVSY 252

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 161
           ++   ++  RQ+ LKE + F CTC  C +  + DD+  +A  +G      G      +  
Sbjct: 253 VDFLDTSAERQRKLKEHFYFECTCEHCRQHIK-DDLMTAAAADG-----PGGKPSADQVQ 306

Query: 162 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
           +   F+ Q+C     +  + K   EV +L  K L                  EK QK L 
Sbjct: 307 EVTAFS-QECLEKIERSLMDKDFQEVRMLCSKCL------------------EK-QKNLL 346

Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
               ++ ++     +++L  L  + EA AY +  +  Y ++Y      LG+     G   
Sbjct: 347 ADTHLHHLRVLSAAVEVLSYLRCFSEAAAYARRMVQGYTKLYHPNSAQLGIAVMRAGVTH 406

Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
              G  E A + + +A  IL +THG N    ++  L+    Q E   KL  +DE
Sbjct: 407 LQAGMIEMAHELICKAYRILLVTHGPNHSVTRD--LEAMRRQTEVELKLLKQDE 458


>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
           griseus]
          Length = 498

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 140/316 (44%), Gaps = 48/316 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ L++QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C+C  C K G  DD+                  F     D K           S E 
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FQAVKEDPKP----------SPEV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++ +++    + ++L
Sbjct: 302 VKEMVQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNLYVLRLLSIVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 362 SYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 422 ILLVTHGPSHPITKDL 437


>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
          Length = 500

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 79/299 (26%), Positives = 126/299 (42%), Gaps = 45/299 (15%)

Query: 32  EIAENFSKLACN-----AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAV 84
           E+ +   K ACN     A  I +  +   G G+YP+ +I+NHSC PN V+ +  E     
Sbjct: 236 ELLKVLCKFACNNFSHAARQIWDDLIVSHGMGVYPLGAILNHSCKPNCVIYYHPETHEQE 295

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
            R ++ +  G ++  SYI+ A  + TR++ L+  Y F C C  C    + D+        
Sbjct: 296 FRCIEDIQVGEDICHSYIDLAAVSKTRKEKLQSTYYFDCDCQCCKFPEELDN-------- 347

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
               K     G +    D         G   S+ EI+   S +  L +  LA    GN  
Sbjct: 348 ----KLGARDGKVTEKCDRAAELLAAAG---SRTEIEHALSRLKDLDENVLA--DRGN-- 396

Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 264
                                ++ +  + K+++  +EL     A+  C+  +  Y+ +YP
Sbjct: 397 -------------------VDLDRLSVKSKMLQASIELGMMDSAIRACKQVVEGYRGIYP 437

Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
             HPLLGLQ YT G L +  G  E A   + E  +IL  TH   S  ++ +   L EA+
Sbjct: 438 PLHPLLGLQLYTLGNLLFDDGRGEEAADVLQEGQKILLATHDRRSTMVQGITELLAEAR 496


>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 643

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 80/158 (50%), Gaps = 18/158 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
           SI  + +  S+   NA T+ +  L P+G  + P+I++ NHSC PNAV VF          
Sbjct: 240 SIAGLVDLVSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKAN 299

Query: 82  --LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDI 137
             +A V  ++ +P   EV+ISYI+T  +   RQKALKE Y FTC C  C K   G + D 
Sbjct: 300 EPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCNCSLCEKPRDGAYVDS 359

Query: 138 QESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGLV 174
           +ES       C  + C G   L   DD    C +CG V
Sbjct: 360 RES-----LYCPKN-CGGLCWLPTEDDPLARCAKCGTV 391


>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 765

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 25  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           WP     E+A+  ++LA N HT+C+ ELRPLGT LYP  ++ NHSC P+ V VF GR   
Sbjct: 94  WP--GFKEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVFRGRTLQ 151

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
           +RA++ +  G EV + Y+E A +   R++AL + Y F
Sbjct: 152 LRALRPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188


>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
          Length = 489

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 114/260 (43%), Gaps = 38/260 (14%)

Query: 31  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
            E+   +  +  N+ T+      P+G    P+ + INH C+PNAV++F GR+  +RA++ 
Sbjct: 170 GEVRRLYCAVLVNSMTLVTETFDPIGIAFDPLTASINHDCIPNAVMLFNGRILQIRALED 229

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           +P  AEVLISYI+       RQ  L E+Y FTCTC RC           +  ++G  C  
Sbjct: 230 IPANAEVLISYIDNTVPRERRQLELSEKYFFTCTCSRCTG--------PAGPMDGLLC-- 279

Query: 151 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
             C G  +     +   C +C             ++V IL +   +       + ++S  
Sbjct: 280 -ACGGVGM--PSREAVLCNRCA------RALPATADVEILEQTAWSALDSDKSEAMISAL 330

Query: 211 KMIEK-----LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV--- 262
           + + K     L ++      + L+Q         +   +W+ AL +  L   +Y R+   
Sbjct: 331 RHLHKSPMWPLPRQPLAALHLALVQGE------YIPSGNWERALLHHLL---IYLRLDPV 381

Query: 263 -YPQ-FHPLLGLQYYTCGKL 280
            YPQ FHP+  +  +T   L
Sbjct: 382 FYPQRFHPMRVVHGFTLANL 401


>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
           ND90Pr]
          Length = 529

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 141/323 (43%), Gaps = 50/323 (15%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R ++ + 
Sbjct: 199 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLISIRTLRPIR 258

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +        
Sbjct: 259 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQ-------- 301

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                    +D          V SKEE++ +AS    +      +    N + V+   K 
Sbjct: 302 ---------EDNWLVPANSDFVASKEELEALASNQEQMFTLYEQVQGTPNAEAVIPVLKE 352

Query: 213 IEKLQKK-----LY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP- 264
             K  ++     LY  P++      R++LI  L+ +  ++EA A+       +  ++YP 
Sbjct: 353 SLKACRESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAHSAKRYKYILPKLYPV 408

Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFM 312
            FHP+  +Q +    L  +L  TE  + +            + + +++  ++HG ++ F 
Sbjct: 409 PFHPVRVVQTWQMAMLAAYLAGTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHGQDNAFT 468

Query: 313 KELILKLEEAQAEASYKLSSKDE 335
           K ++ K  E   E    + + D+
Sbjct: 469 KSVMEKTREMVEELKRSVGNPDK 491


>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Ornithorhynchus anatinus]
          Length = 490

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 55/313 (17%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217

Query: 80  -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
            G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY F CTC 
Sbjct: 218 NGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCE 277

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
            C K G  DD+                  FL              G+    +  + +  E
Sbjct: 278 HCEK-GIKDDL------------------FL--------------GVKDEPKPSQDVVKE 304

Query: 187 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           V   SK  L       S G + EVV   +   + Q+ +    ++ L++      ++L  L
Sbjct: 305 VIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYL 364

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL 
Sbjct: 365 QSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILL 424

Query: 303 ITHGTNSPFMKEL 315
           +THG   P  K+L
Sbjct: 425 VTHGPTHPITKDL 437


>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
           [Oryzias latipes]
          Length = 476

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 130/293 (44%), Gaps = 32/293 (10%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +       VR 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGKIEVRT 225

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  + +G E+ ISY++    +  RQK LKEQ+ F C+C  C +    DD+  +A      
Sbjct: 226 LGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS-DDLMMAAA----- 279

Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTS----CGN 202
              DG         D   F      L R S E+I+    EV+  SK +L         G+
Sbjct: 280 ---DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKDSLEKIEKSRLDGD 319

Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
           + EV+   +   + Q+ +     ++ ++      ++L  L  + EA    +  +  Y ++
Sbjct: 320 YSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKL 379

Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           Y   + LLG+     G   W  G  E     + +A  IL +THG N    ++L
Sbjct: 380 YHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRDL 432


>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
          Length = 349

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 75/127 (59%), Gaps = 5/127 (3%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRC 128
           TC C  C
Sbjct: 261 TCECREC 267


>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
          Length = 434

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 54/329 (16%)

Query: 1   MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
           + D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG
Sbjct: 139 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 196

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
             ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L+
Sbjct: 197 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 256

Query: 117 EQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
           + Y F+C C  C       +K+    + D+I E  I +  R   +    F          
Sbjct: 257 DMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------- 307

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
                   R+K++              T  L  C    + +ST          ++   +V
Sbjct: 308 --------RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNV 338

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
            ++    + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L  
Sbjct: 339 YILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDR 398

Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
               I +  +A+ I+ + HG +  ++ EL
Sbjct: 399 KLAGIDAFQKALTIMEVVHGKDHTYVTEL 427


>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
          Length = 527

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/274 (25%), Positives = 120/274 (43%), Gaps = 34/274 (12%)

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           ++++NHSC PN ++ + G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 270 VALMNHSCCPNVIVTYRGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 329

Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 178
           C C  C   G     Q+ A LE  R   D      +RD            +VR      E
Sbjct: 330 CECRECSTKG-----QDQAKLE-IRKLSDPPRAEAVRD------------MVRYARGVIE 371

Query: 179 EIKKIASEVNILSKKTLALTSCGNH------------QEVVSTYKMIEKLQKKLYHPFSV 226
           E ++      IL            H             E++   ++ ++    ++   +V
Sbjct: 372 EFRRAKHYKYILGCSVQGPMPRSPHCEGREAGPGSPPSELLEICELSQEQMSSVFEDSNV 431

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
            ++    + + + + L+DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +
Sbjct: 432 YMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLEN 491

Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
                K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 492 RAAGEKALKKALAIMEVAHGKDHPYISEIRREIE 525


>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
 gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
          Length = 423

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 54/329 (16%)

Query: 1   MSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLG 56
           + D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG
Sbjct: 128 LEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLG 185

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
             ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L+
Sbjct: 186 LAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLR 245

Query: 117 EQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
           + Y F+C C  C       +K+    + D+I E  I +  R   +    F          
Sbjct: 246 DMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------- 296

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
                   R+K++              T  L  C    + +ST          ++   +V
Sbjct: 297 --------RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNV 327

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
            ++    + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L  
Sbjct: 328 YILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDR 387

Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
               I +  +A+ I+ + HG +  ++ EL
Sbjct: 388 KLAGIDAFQKALTIMEVVHGKDHTYVTEL 416


>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
          Length = 491

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 148/342 (43%), Gaps = 56/342 (16%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++ + +  WP    +  +  I+  F  ++CN  T+ +   L+ +G G++P +   NH C
Sbjct: 149 ADVESFLEFWPPHSQQFGMQYISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208

Query: 71  LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
            PN  ++F  G    VR++ H            +  G E+ +SY++    +  R+K LK+
Sbjct: 209 WPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPGDELTVSYVDFLNVSEERRKQLKK 268

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C K                + KDD     +L   + +G          S 
Sbjct: 269 QYYFDCTCEHCKK----------------QIKDD----LMLAVKEGEGKP--------SA 300

Query: 178 EEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
           E +K    EV   SK TL   S     G + EVV   +   K Q+ +    ++ L++   
Sbjct: 301 ETVK----EVIQFSKDTLEKISKARMEGLYHEVVKLCRECLKKQEPVLGDTNIYLLRILS 356

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     
Sbjct: 357 IASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHGL 416

Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 417 ICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456


>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 353

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 138/326 (42%), Gaps = 40/326 (12%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLYPV-ISIINHSCLPNAVLVFEGRLAVVRAV 88
           +  + E F K++ N   I ++EL P G G+Y   I   NHSCL N + +F+ +   +R +
Sbjct: 19  LKNVIEKFIKVSKNTLQIIDNELEPCGLGIYKKPIPYFNHSCLSNCITIFKNQRLYIRTL 78

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP---------------------- 126
             +  G E+ ISY++ A     R     +QY FTCTC                       
Sbjct: 79  MDIYPGEELTISYLDVAFDRNARLAICADQYFFTCTCKLCKVNIASECHNIFNNDFICTQ 138

Query: 127 --RCIKLGQFDDIQESAILE---GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 177
              C K   + +I   + LE    Y  K    +  +L+ S +K  T  +C L +++    
Sbjct: 139 SENCKKFINYMEIVLMSELERKISYLNKSHFKTFPILKKSTEKNETVWKCMLCKNETNDS 198

Query: 178 ------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLM 229
                 E+ K+   EV  L        S  N   + S  K+  K+      YH    +L 
Sbjct: 199 VIKALIEKEKETIKEVEYLDTLFAEKYSYDNKNVLQSLTKIKSKIDDLTDFYHHSKYSLQ 258

Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           + R K++ + ++L+D+K A +     +   +  Y ++ P+ G   +  GKL  FL     
Sbjct: 259 KMRAKILYVSIQLQDFKLAYSIATQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSA 318

Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
            +  + +A + +  T+G  SP  K+L
Sbjct: 319 GLSLIHKAKKNIIKTYGPESPIYKDL 344


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 75/127 (59%), Gaps = 5/127 (3%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 201 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 260

Query: 122 TCTCPRC 128
           TC C  C
Sbjct: 261 TCECQEC 267


>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 497

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 65/110 (59%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+S+ E+ + F KL  NA T+       +G  + P+ S+ NHSC PNA + F+     +R
Sbjct: 178 ELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLR 237

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
           A+QH+ +G ++ +SY+E   + + RQ  L ++Y F C CPRCI+  +  D
Sbjct: 238 ALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEIGD 287


>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
          Length = 449

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 145/326 (44%), Gaps = 36/326 (11%)

Query: 5   DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPV 62
           +E+Q  L   +   +   L Q  + S+  I+  F  + CN  T+ +   L+ +G G++P 
Sbjct: 141 EEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPN 200

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R+K LK+QY F 
Sbjct: 201 LGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFD 260

Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 182
           CTC  C K G  DD+                  FL    D K           S++ +K+
Sbjct: 261 CTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQDVVKE 291

Query: 183 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           I        +K     S G + EVV   +   + Q  ++   ++ +++    + ++L  L
Sbjct: 292 IIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQDSVFADTNIYMLRVLSIVSEVLSYL 351

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E A++  TE    LR
Sbjct: 352 QAFEEASYYARRMVDGYVKLYHPNNAQLGMAVMRAGLTNWHAGNIE-AMRMQTEME--LR 408

Query: 303 ITHGTNSPF--MKELILKLEEAQAEA 326
           + H     +  M+E  L  +  Q  A
Sbjct: 409 LFHQNEFMYRKMREAALNNQPMQVMA 434


>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 517

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 43/327 (13%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
           +  I E F K++ N   I ++EL P G G Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 184 LRNIIEKFIKISKNTLQIIDNELEPAGLGFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  V  G E+ ISY++ A    TR     +QY FTCTC  C K+    + Q     E   
Sbjct: 243 LMDVYPGEELTISYVDIAFDKNTRLAICMDQYFFTCTCKLC-KVNVVSECQNVFNTEFVC 301

Query: 148 CKDDGCSGF------------------------------LLRDSDDKGFTCQQCGLVRSK 177
            + + C  +                              L + ++   + C  C      
Sbjct: 302 TQSESCKKYVNHMELILISELERKSCYVNKSQFKNSYPVLKKANEQHVWKCSICKNEVHD 361

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNL 228
             IK +  +     K+T  L S  N +       +++ L K           YH    +L
Sbjct: 362 NVIKSLLEKEKETVKETAHLESLFNEKYTYDNKSVLQSLNKIKSKIDYLTTFYHHTRYSL 421

Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
            + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL    
Sbjct: 422 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 481

Query: 289 NAIKSMTEAVEILRITHGTNSPFMKEL 315
             +  + +A + +  T+G +SP  K+L
Sbjct: 482 EGLSLIHKAKKNIIKTYGPDSPIYKDL 508


>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
          Length = 497

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 65/110 (59%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+S+ E+ + F KL  NA T+       +G  + P+ S+ NHSC PNA + F+     +R
Sbjct: 178 ELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLR 237

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
           A+QH+ +G ++ +SY+E   + + RQ  L ++Y F C CPRCI+  +  D
Sbjct: 238 ALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEVGD 287


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 133/296 (44%), Gaps = 34/296 (11%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    + ++  I+     + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 153 WPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFN 212

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
                +RA+  +  G E+ +SY++       R++ LK+QY F CTC  C K G  DD+  
Sbjct: 213 NGKIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQK-GIKDDLM- 270

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
            A+ E     ++  S  +++D            +  SKE I+KI               S
Sbjct: 271 LAVKE-----EEKPSPEVVKDV-----------IQFSKEAIEKIDK-----------ARS 303

Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 259
            G ++EVV   +   + Q+ +    ++ +++    L ++L  L+   EA  Y +     Y
Sbjct: 304 EGLYKEVVKLCRACLEKQEPVLGDTNIYMLRILSILSEVLSYLQHVDEAAGYSKRMADGY 363

Query: 260 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            ++Y   +  LG+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 364 LKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 419


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 2/125 (1%)

Query: 32  EIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
           EI + F ++ CNA ++ N      +G GLYP  ++ NH C PN V+ F+GR   VR V+ 
Sbjct: 173 EILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGREMQVRVVRD 232

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           +    E+ +SY+E   ST  R++ LKE Y F C C RC K        E   L+G RC +
Sbjct: 233 IEVDEELTVSYVELLQSTKARRRELKESYFFDCECKRC-KAATNGQSNEDWYLDGLRCSN 291

Query: 151 DGCSG 155
             C+ 
Sbjct: 292 KKCAS 296


>gi|444714585|gb|ELW55465.1| N-lysine methyltransferase SMYD2 [Tupaia chinensis]
          Length = 494

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 116/264 (43%), Gaps = 34/264 (12%)

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           ++++NHSC PN ++ + G LA VRAV  +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 257 VALMNHSCCPNVIVTYRGTLAEVRAVHEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 316

Query: 123 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 178
           C C  C                    KD   +   +R  +D         +VR      E
Sbjct: 317 CECHECT------------------TKDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIE 358

Query: 179 EIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
           E ++      IL  S  +  L  C   QE +S+          ++   +V ++    + +
Sbjct: 359 EFRRAKHYKYILYNSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAM 408

Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
            + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +
Sbjct: 409 GVCLYMQDWEGALRYGQRIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKK 468

Query: 297 AVEILRITHGTNSPFMKELILKLE 320
           A+ I+ + HG + P++ E+  ++E
Sbjct: 469 AIAIMEVAHGKDHPYISEIKQEIE 492


>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
           heterostrophus C5]
          Length = 508

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 72/321 (22%), Positives = 139/321 (43%), Gaps = 46/321 (14%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R ++ + 
Sbjct: 178 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLVSIRTLRPIR 237

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 152
           K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +       G
Sbjct: 238 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQEDNWLVPG 288

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
            S F+      +     Q  +    E+++ + +   ++     AL +C            
Sbjct: 289 NSDFVASKEQVQALASNQEQMFTLYEQVQGMPNAEAVIPVLKEALKACH----------- 337

Query: 213 IEKLQKKLY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ-----F 266
            E     LY  P++      R++LI  L+ +  ++EA A+   +   Y+ + P+     F
Sbjct: 338 -ESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAH---SAKRYKYILPKLYSVPF 389

Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKE 314
           HP+  +Q +    L  +L  TE  + +            + + +++  ++H  ++ F K 
Sbjct: 390 HPVRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHSQDNAFTKS 449

Query: 315 LILKLEEAQAEASYKLSSKDE 335
           ++ K  E   E    + + D+
Sbjct: 450 VVEKTREMVEELKRSVGNPDK 470


>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 456

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 126/291 (43%), Gaps = 31/291 (10%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQH 90
           + +   +  CN  +I +  L P G G+YP  +I+NHSC PN V+ + G L    V+A++ 
Sbjct: 183 VMDVLQRADCNNFSIWDELLLPRGAGVYPWGAILNHSCEPNCVMTYRGPLHAQAVKALRD 242

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK--LGQFDDIQESAILEGYRC 148
           +  G E+  SYI+    T  R   L +QY F C C   +   L + D + E    E    
Sbjct: 243 IAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDCALYLDGALAELDALPEVTAAEMR-- 300

Query: 149 KDDGCSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 207
                 G +L + + +    Q+   L R+ E +K  A                   +E+V
Sbjct: 301 ----LPGKMLHELECEAEERQRLLSLYRAAETLKDYAWST--------------PEEELV 342

Query: 208 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILME---LEDWKEALAYCQLTIPVYQRVYP 264
              +    L ++L HP +++L     +L  +  E   LED   AL   Q     Y  VYP
Sbjct: 343 CLLQGYNIL-RRLAHPANISLTSIMTRLQNVATECGRLEDI--ALPVGQHLALAYDHVYP 399

Query: 265 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           + +PL GLQYY  G +         A+     A ++LR TH  +   + +L
Sbjct: 400 EHNPLCGLQYYRLGDVANLAQKQALALLWHRRAYQVLRTTHDADHWLLTQL 450


>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
          Length = 229

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/255 (21%), Positives = 110/255 (43%), Gaps = 28/255 (10%)

Query: 66  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
           +NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC C
Sbjct: 1   MNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 60

Query: 126 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 185
             C                              +D D      ++       E I+ +  
Sbjct: 61  RECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIRDMVR 92

Query: 186 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
               + ++        +  E++   ++ ++    ++   +V ++    + + + + ++DW
Sbjct: 93  YARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDW 152

Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
           + AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + H
Sbjct: 153 EGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAH 212

Query: 306 GTNSPFMKELILKLE 320
           G + P++ E+  ++E
Sbjct: 213 GKDHPYISEIKQEIE 227


>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
          Length = 433

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 73/123 (59%), Gaps = 10/123 (8%)

Query: 13  AQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISII 66
           A IA L +   ++    IN+I ++      F+++ CN  TI + EL  LG+ ++P ++++
Sbjct: 151 ADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALM 206

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NHSC PN ++ ++G +A VRAVQ +  G E+L SYI+    T  R++ L + Y FTC C 
Sbjct: 207 NHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCS 266

Query: 127 RCI 129
            C 
Sbjct: 267 ECF 269


>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
           rubripes]
          Length = 434

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 73/123 (59%), Gaps = 10/123 (8%)

Query: 13  AQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISII 66
           A IA L +   ++    IN+I ++      F+++ CN  TI + EL  LG+ ++P ++++
Sbjct: 151 ADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALM 206

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NHSC PN ++ ++G +A VRAVQ +  G E+L SYI+    T  R++ L + Y FTC C 
Sbjct: 207 NHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCS 266

Query: 127 RCI 129
            C 
Sbjct: 267 ECF 269



 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/99 (21%), Positives = 49/99 (49%)

Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
           +V ++    + + + + ++DW+ A+ Y +  I  Y   YP +   +   Y   G L   L
Sbjct: 336 NVYMLHMMYQAMGVCLYMQDWEGAMRYGEKIIHPYSFHYPAYSLNVASMYLKLGHLYLGL 395

Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 323
                 +K++ +A+ I+ + HG +  ++ E+  ++EE +
Sbjct: 396 EKRTQGVKALKKALSIMEVAHGKDHYYVAEVKREIEEQK 434


>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 425

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 6/123 (4%)

Query: 32  EIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
           EI + F +++CN  +I N     P+G GL+   S+ NH C PN V+ F+G+   V  ++ 
Sbjct: 172 EILKLFGQVSCNGFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKD 231

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           V +G E+ ISY+E   ST  RQK LK+ Y F C C RC       +  +   L+G +C +
Sbjct: 232 VKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTT-----ETTDDWYLDGLQCGN 286

Query: 151 DGC 153
            GC
Sbjct: 287 KGC 289


>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/310 (25%), Positives = 135/310 (43%), Gaps = 62/310 (20%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV- 85
           S++ I+  F  + CN  T+ +   L+ +G GL+P + ++NH C PN  V++  G  + V 
Sbjct: 166 SVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVS 225

Query: 86  -----------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
                      RA+  +P+G E+ +SY++    +  RQK LKE++ F CTC  C K    
Sbjct: 226 SALHSQRRIELRALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK---- 281

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
                         KDD     +   +D K           S +++K    EV   SK++
Sbjct: 282 ------------HIKDD----LMTAVADSKP----------SADKVK----EVTAFSKES 311

Query: 195 LALTSCG----NHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           L          ++ EV+   Y+ ++K +  L      NL + R  ++ I  E+  ++ + 
Sbjct: 312 LEKIEKSRVERDYNEVLKLCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSF 366

Query: 250 A----YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
           A    Y    +  Y+++Y   +  LG+     G   W  G  E     + +A  IL +TH
Sbjct: 367 AKAANYAHRMVEGYRKLYHPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTH 426

Query: 306 GTNSPFMKEL 315
           G N    K+L
Sbjct: 427 GPNHAITKDL 436


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 75/331 (22%), Positives = 137/331 (41%), Gaps = 47/331 (14%)

Query: 3   DIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGT 57
           DI E  L  +    ++ N +  WP  S    ++ ++     + CN   + +   L+ +G 
Sbjct: 133 DISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGV 190

Query: 58  GLYPVISIINHSCLPNAVL------------VFEGRLAV-VRAVQHVPKGAEVLISYIET 104
           GL+P + ++NH C PN  +            VF  +  + +RA+  +  G EV ++Y++ 
Sbjct: 191 GLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQKRIELRALGKISAGEEVTVAYVDY 250

Query: 105 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 164
              +  RQ+ LK+QY F CTC  C +  + D     A ++G +  +              
Sbjct: 251 LNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE-------------- 296

Query: 165 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
                        E++K++        +K        N+ EVV   +   + Q+ +    
Sbjct: 297 -------------EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADT 343

Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
            +  ++    L ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  
Sbjct: 344 HIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQA 403

Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           G  E A   + +A  IL ITHG   P  K+L
Sbjct: 404 GFIEVAHGMICKAFAILMITHGPTHPITKDL 434


>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
           [Taeniopygia guttata]
          Length = 490

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 84/342 (24%), Positives = 148/342 (43%), Gaps = 57/342 (16%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++ + +  WP    +  +  I+  F  ++CN  T+ +   L+ +G G++P +   NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208

Query: 71  LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
            PN  +VF  G    VR++ H            +  G E+ +SY++    +  R++ LK+
Sbjct: 209 WPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKK 268

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C K                + KDD     +L   D             S 
Sbjct: 269 QYYFDCTCEHCKK----------------QLKDD----LMLAVKDSP---------TPSA 299

Query: 178 EEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 233
           + +K    EV  LSK TL   +     G++ EVV   +   + Q+ +    ++ L++   
Sbjct: 300 DTVK----EVIQLSKDTLEKINKARMEGHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILS 355

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     
Sbjct: 356 IASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGL 415

Query: 294 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
           + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 416 ICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 455


>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 489

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 132/306 (43%), Gaps = 45/306 (14%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 86
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +   G  + V 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225

Query: 87  AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
           +  H P+            G E+ ISY++    +  RQK LKEQ+ F C+C  C +    
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS- 284

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKK 193
           DD+  +A         DG         D   F      L R S E+I+    EV+  SK 
Sbjct: 285 DDLMMAAA--------DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKD 319

Query: 194 TLALTS----CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
           +L         G++ EV+   +   + Q+ +     ++ ++      ++L  L  + EA 
Sbjct: 320 SLEKIEKSRLDGDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAA 379

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
              +  +  Y ++Y   + LLG+     G   W  G  E     + +A  IL +THG N 
Sbjct: 380 DCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNH 439

Query: 310 PFMKEL 315
              ++L
Sbjct: 440 SVTRDL 445


>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
           gorilla gorilla]
          Length = 459

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/316 (23%), Positives = 133/316 (42%), Gaps = 79/316 (25%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD   G  ++D+      C++C     K+E
Sbjct: 271 YFDCTCEHCQK----------------KLKDDLFLG--VKDNPKVVKLCREC---LQKQE 309

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
              + ++ NI   + L++ S                                     ++L
Sbjct: 310 --PVFADTNIYMLRMLSIVS-------------------------------------EVL 330

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 331 SYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 390

Query: 300 ILRITHGTNSPFMKEL 315
           IL +THG + P  K+L
Sbjct: 391 ILLVTHGPSHPITKDL 406


>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 666

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 15/195 (7%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N + + +S+   +G GLYP +++ NHSC P  V  F G   +VRA + + KG  V  +Y 
Sbjct: 449 NKNILDDSKSLFVGGGLYPTLALFNHSCDPGIVRYFIGTQVIVRAAKPIKKGEIVAENYG 508

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKL-GQFDDIQESAILEGYRC-------KDDG 152
            I +    + RQ+ LK QY FTCTC  C++L   FD++    I   +RC       K + 
Sbjct: 509 PIYSQMKKLERQEKLKSQYWFTCTCTPCLELWPTFDELDTKTI--RFRCSGYGTDEKREN 566

Query: 153 CSGFLL--RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
           C   L+   ++DD    C QC    +  +  K   + +IL K+ +     GN Q  +  Y
Sbjct: 567 CKNILIVPVNTDDFMIKCSQCNQHTNLFKGLKALQDTDILYKQAIKYAQTGNFQNALDIY 626

Query: 211 -KMIEKLQKKLYHPF 224
            ++++ L   L  PF
Sbjct: 627 LEILKTLHGILAPPF 641


>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Taeniopygia guttata]
          Length = 491

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 80/338 (23%), Positives = 149/338 (44%), Gaps = 48/338 (14%)

Query: 16  ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSC 70
           A++ + +  WP    +  +  I+  F  ++CN  T+ +   L+ +G G++P +   NH C
Sbjct: 149 ADVESFLEFWPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDC 208

Query: 71  LPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKE 117
            PN  +VF  G    VR++ H            +  G E+ +SY++    +  R++ LK+
Sbjct: 209 WPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKK 268

Query: 118 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 177
           QY F CTC  C K                + KDD        +S     T ++  +  SK
Sbjct: 269 QYYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSADTVKEV-IQLSK 311

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
           + ++KI       +K  +     G++ EVV   +   + Q+ +    ++ L++      +
Sbjct: 312 DTLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILSIASE 360

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A
Sbjct: 361 VLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGLICKA 420

Query: 298 VEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 335
             IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 421 YAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456


>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
          Length = 476

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 74/302 (24%), Positives = 134/302 (44%), Gaps = 48/302 (15%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPKSQQFSMQYISNIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                    LR  DD            S++ 
Sbjct: 271 YFDCTCEHCQK-GLKDDL-------------------FLRVKDDPK---------PSQDV 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++    N   +K     S G + EVV   +   + Q+ ++   ++ +++    + ++L
Sbjct: 302 VKEMIQFSNDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNLYMLRMLSIVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 299
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  
Sbjct: 362 SYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYA 421

Query: 300 IL 301
           IL
Sbjct: 422 IL 423


>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 418

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 82/307 (26%), Positives = 146/307 (47%), Gaps = 34/307 (11%)

Query: 38  SKLACNAHTICNS--ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPK 93
           +K A NA TI +S  E   +G GLY  ++ +NHSC PN + VF  +L  VR  A++ + +
Sbjct: 133 AKFATNALTIQDSLFETDGIGAGLYEEVNYMNHSCTPNVICVF-NKLPQVRVIAIRDIEQ 191

Query: 94  GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
           G E++ SYI+T      R++ LK+ Y F C C RCIK       +++  +   RC+   C
Sbjct: 192 GEEIMNSYIDTKKDLDFRRRFLKQNYFFLCECKRCIK-------EQNEGVSFVRCQ--KC 242

Query: 154 SGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
               +   + K   C  C      +++E ++I +++  + K+    TS   +++ +   +
Sbjct: 243 MKGRI---NSKTLNCSDCNTQGQMTEQEYQRIITQMEEIQKQ----TSQSTYEKTIELTE 295

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----- 266
             EKL   L  P+  ++  +R   I   + ++ +K+A  Y +  I + +  Y +      
Sbjct: 296 TAEKLC-HLEAPYFNDIYNSR---ILATLGVQQYKKANKYLKNFIELSKHWYEEIIGNTL 351

Query: 267 -HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS-PFMKELILKLEEAQA 324
             P  G +     KL + L     A K    A++IL   +   S P + EL L+L + ++
Sbjct: 352 EMPFFGWKLNELSKLNFHLDKCLEAEKYSERALKILTQYYTKESCPELNELFLRLNDIKS 411

Query: 325 EASYKLS 331
           E   K +
Sbjct: 412 EIKMKYN 418


>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
 gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
          Length = 509

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 132/326 (40%), Gaps = 42/326 (12%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
           +  I E F K++ N+  I ++EL P G  +Y  PV    NHSCL N V VF  +   +R 
Sbjct: 177 LKNIIEKFIKISKNSLQIIDNELEPAGLAIYKKPV-PFFNHSCLSNCVTVFRNQKLYIRT 235

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  V  G E+ ISYI+ A    TR     +QY FTC+C  C K+    +       E   
Sbjct: 236 LMDVYPGEELTISYIDIAFDRCTRLSICMDQYFFTCSCKLC-KVNIASECHNIFNTEIVC 294

Query: 148 CKDDGCSGFL----------------------------LRDSD-DKGFTCQQCGLVRSKE 178
              + C  FL                            LR S+ D  + C  C       
Sbjct: 295 TNSENCKKFLGYMEIVLISELERKQCYINKSSFKTYPVLRKSNIDNVWRCMLCKAEVGDN 354

Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNLM 229
            IK +  +     K+T+ L +  N +       +++ L K           YH    +L 
Sbjct: 355 IIKGLVDKEKETFKETVYLETLFNEKYTYDNKSVLQSLNKIKSKIDYLTNYYHHAKYSLQ 414

Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           + R K++ I ++L ++K A       +   +  Y ++ P+ G   +  GKL  FL     
Sbjct: 415 KMRAKILYISIQLHEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKNE 474

Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
            +  + +A + +  T+G +S   K+L
Sbjct: 475 GLGLIHKAKKNIIKTYGPDSLIYKDL 500


>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 770

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 113/245 (46%), Gaps = 24/245 (9%)

Query: 36  NFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLVFE-G 80
           +  +L CNAH I + E +               + T +YP  S++NHSC PN    F  G
Sbjct: 455 HIQQLVCNAHAITSLESKTSQEDDVVMTTEQVRIATAIYPSASLMNHSCDPNIFSSFRCG 514

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
              VVRA++ + +G EVL  Y       S   RQ+ L+EQY F C+C  C      +D +
Sbjct: 515 STLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQLLQEQYFFVCSCTAC---SSGEDAE 571

Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
           +   L+  +C  + C G L +  D    TC  CG  +     ++ A  ++ L  + + L 
Sbjct: 572 QR--LQALKC--EYCEGPLKQPDDSGKATCLDCGTTQECLGSEQKAFRMHDLYVQGVQLA 627

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
             GNH E +       K ++K+ +  ++ L++ ++ + K L  L D++ A    +  +  
Sbjct: 628 EKGNHVEALQRLNKCLKTREKVMYKHNLKLLEVKDMVAKCLCSLGDFRSASGVLKPAVDA 687

Query: 259 YQRVY 263
            + +Y
Sbjct: 688 VRVIY 692


>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
           CCMP2712]
          Length = 98

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 58/96 (60%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           IA   SK   N H I + + +  G GLYP+ S INHSC PNA++ F+G   VVRA++++P
Sbjct: 3   IASMISKAQRNMHGIVDLKGQNFGHGLYPLASFINHSCEPNAIISFDGNKLVVRALENIP 62

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +G E+ I+Y+E       R+ AL  +  F C C RC
Sbjct: 63  RGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98


>gi|312076846|ref|XP_003141043.1| MYND finger family protein [Loa loa]
 gi|307763792|gb|EFO23026.1| MYND finger family protein [Loa loa]
          Length = 439

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 78/334 (23%), Positives = 159/334 (47%), Gaps = 30/334 (8%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVI 63
           DE++   +  I +++   L + ++  N EI + F K+  NA  I +S L  +G  +Y  +
Sbjct: 124 DEEKRNGFIAIIHVIKDYLPFDKMPSNTEIFDIFCKILINALVITDSCLNRIGLAVYLGL 183

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
           S ++HSC P+A ++F G  A++R++          + I Y +       R +AL++Q+ F
Sbjct: 184 SALDHSCKPDAFIIFSGAKAILRSLNENITEYNDNLHIPYCDLLDLRSARCEALQKQHNF 243

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL-----VRS 176
            C C  C     FD  ++ + +   +C D  C       S D   T ++C +     V +
Sbjct: 244 VCNCDIC---QDFDLDRQKSSVRCTKCTDGFCPY-----SPDDDHTVKRCKVCHEISVFN 295

Query: 177 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
            + ++K+  ++      T    +  N  E++  Y  +E    +++ P++V L +  E ++
Sbjct: 296 SDHVQKLYQQL------TAPRPAEKNLNELIDLYHELE----EVFSPYNVPLCKLAESIM 345

Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
              +  E + EA+ Y + T+  Y+  YP+ HP   ++ +   KL   L     ++  + +
Sbjct: 346 ISALNNEKYDEAVEYAEKTLLCYRTYYPKGHPSPSVRMFEYAKL-LMLQHNRESLPVLRK 404

Query: 297 AVEILRITHGTNSPF---MKELILKLEEAQAEAS 327
           A++++  ++G+ S F      L+  LE+  + AS
Sbjct: 405 ALKMICESYGSESSFAFNAATLLSDLEKCVSTAS 438


>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
           africana]
          Length = 800

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 130/318 (40%), Gaps = 75/318 (23%)

Query: 26  PEISINEIA--ENFSKLACNAHTIC--------------NSELRPLGTGLYPVISIINHS 69
           PE++I  +    +  +L CNA  I               + E+R L TG++PV+S++NHS
Sbjct: 481 PELNIWGVVMLRHMLQLQCNAQAITTIQHTGSKESIITESREIR-LATGIFPVVSLLNHS 539

Query: 70  CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPR 127
           C PN  + F   +A +RA Q + KG E+L  Y   E+  +   RQ+ L+ QY F CTCP 
Sbjct: 540 CRPNTSVSFTSTVATIRASQQIAKGQEILHCYGPHESRMAVAERQQKLRSQYFFECTCPA 599

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
           C K        E     G R                  F C  C  V   E++       
Sbjct: 600 CHK--------EPRATAGPRW---------------AAFCCHSCRAVMQGEDV------- 629

Query: 188 NILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
                      SCGN    E VS   ++ +LQ         +L Q      K+L   +LE
Sbjct: 630 ----------LSCGNRSCTESVSRSHLVSRLQ---------DLQQQVGMAQKLLRSGKLE 670

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
              + L  CQ     +    P  H ++G       +    LGD E +   + ++++++ +
Sbjct: 671 QAVQQLLGCQHAAESF---LPAEHTVVGEIDDHLAQAYAALGDWEKSATHLQKSLQVVEV 727

Query: 304 THGTNSPFMKELILKLEE 321
            HG +S  M   + KL +
Sbjct: 728 RHGPSSIEMGRELFKLAQ 745


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 74/321 (23%), Positives = 142/321 (44%), Gaps = 53/321 (16%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    I++I+  F  + CN  T+ +   L+ +G GL+P + ++NH+C 
Sbjct: 145 DIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCW 204

Query: 72  PN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           PN  V++  G  + V            R++  + +G E+ ++Y++    +  R++ LK Q
Sbjct: 205 PNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQ 264

Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 178
           Y F C C  C K G  DD++    L G             R+ D         G+  S++
Sbjct: 265 YFFDCQCDYC-KNGTKDDLK----LAG-------------REVD---------GVKPSEQ 297

Query: 179 EIKKIAS----EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 234
           ++K+       ++  + K  L     G++ +VV   + +    + +     + L++    
Sbjct: 298 QVKEATDYCFQKLETMDKARLD----GDYHQVVKICRDVIDRTEPVLADTHIYLLRMWST 353

Query: 235 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 294
           L ++   L+ + +A  Y +  +  Y ++Y   +  LG+     G   W  G  E     +
Sbjct: 354 LSEVQAYLQYFNDAAEYSRKMVEGYMKLYHPNNAALGMAAMRAGVNHWQAGLIEVGHGMV 413

Query: 295 TEAVEILRITHGTNSPFMKEL 315
            +A  IL +THG   P  K+L
Sbjct: 414 CKAYAILLVTHGPTHPITKDL 434


>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 478

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 127/301 (42%), Gaps = 46/301 (15%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 86
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +   G  + V 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225

Query: 87  AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
           +  H P+            G E+ ISY++    +  RQK LKEQ+ F C+C  C +    
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQ---- 281

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
                          DD     ++  +D +    Q+     SK+ ++KI        K  
Sbjct: 282 ------------HISDD----LMMAAADGQDSPIQEVSAF-SKDSLEKI-------EKSR 317

Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
           L     G++ EV+   +   + Q+ +     ++ ++      ++L  L  + EA    + 
Sbjct: 318 LD----GDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARK 373

Query: 255 TIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
            +  Y ++Y   + LLG+     G   W  G  E     + +A  IL +THG N    ++
Sbjct: 374 MVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRD 433

Query: 315 L 315
           L
Sbjct: 434 L 434


>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
          Length = 462

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/297 (24%), Positives = 133/297 (44%), Gaps = 49/297 (16%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEERRKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K G  DD+                  FL    D K           S+E 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQEA 301

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           +K++        +K     S G + EVV   +   + Q+ ++   ++ +++    + ++L
Sbjct: 302 VKEMTQFSKDTLEKIEKARSEGLYHEVVKLCRECLQKQEPVFADTNLYVLRMLSIVSEVL 361

Query: 240 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
             L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E A++  TE
Sbjct: 362 SYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIE-AMRVQTE 417


>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
          Length = 511

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 135/327 (41%), Gaps = 42/327 (12%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
           +  + E F K++ N   I ++EL P G G Y  PV    NHSCL N + +F+ +   +R 
Sbjct: 177 LKNVIEKFIKVSKNTLQIIDNELEPCGLGFYKKPV-PYFNHSCLSNCITIFKNQKLYIRT 235

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI------------------ 129
           +  +  G E+ ISY++ A    TR     +QY FTCTC  C                   
Sbjct: 236 LMDIYPGEELTISYLDIAFDRNTRLAICTDQYFFTCTCKLCKVNIPSECHNMFNNDFICT 295

Query: 130 ---KLGQFDDIQESAILE------GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK--- 177
                 +F +  E  ++        Y  K    +  +L+ S DK     +C L + +   
Sbjct: 296 RSENCKKFVNYMEMVLISELERKLNYHNKLHFKAFPILKKSTDKNENIWKCMLCKHETND 355

Query: 178 -------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNL 228
                  E+ K+   EV  L        S  N   + S  K+  K+      YH    +L
Sbjct: 356 SVIKGVIEKEKETVKEVEYLETLFAEKYSYDNKIILQSLNKIKSKIDYLTTFYHHSRYSL 415

Query: 229 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 288
            + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL    
Sbjct: 416 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 475

Query: 289 NAIKSMTEAVEILRITHGTNSPFMKEL 315
             +  + +A + +  T+G  SP  K+L
Sbjct: 476 AGLNLIHKAKKNIIKTYGPESPIYKDL 502


>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 519

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 60/104 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA + F+G +  ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 404

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 12/177 (6%)

Query: 8   QLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNAHTICNSELRPLGTGLYPVI 63
           +L  Y Q+A +V   +  P  +     ++  +L     CNA  + N E    G  LYP +
Sbjct: 208 RLAQYEQMAEVVAASISLPRGARKVTKDDVVRLLCIQQCNAFGLSNGEGEMTGVALYPAL 267

Query: 64  SIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
           S+ NHSC+PN   V +G    R+  ++ +  VP G E+ ISYI+   +   RQ  L+E Y
Sbjct: 268 SLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYIDLDLTRELRQDKLEESY 327

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLV 174
            F CTC RC      D    +A L    C    C G L+     +  T  C  C LV
Sbjct: 328 AFRCTCARCRAPDADDSPAVAAHLRRITCGS--CGGSLIPAPPGQELTVACNSCPLV 382


>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ER-3]
          Length = 519

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 60/104 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA + F+G +  ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
          Length = 517

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 128/283 (45%), Gaps = 54/283 (19%)

Query: 42  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
           CN   I ++++  +GTGLY   +++NHSC PN +++F G+   +   + +    E+ I Y
Sbjct: 243 CNGFGIADNQMLRIGTGLYYPSNLLNHSCDPNCMVLFRGQTQFIVTCRPIEADEEITICY 302

Query: 102 IETAGS-TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
           I+   S  + RQ+ L+EQY F C C RC+K           I EG   K+          
Sbjct: 303 IDNGISERIIRQQYLQEQYHFNCMCARCLK----------QIGEGTELKE---------- 342

Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 220
                           K +I + + E  IL+K+ L   S GN  + +S    + +L K +
Sbjct: 343 ---------------QKVQI-QFSEEQQILNKQALHEESLGNFDKSLS---QLSQLYKSV 383

Query: 221 YHPFSVNLMQTR------EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-------- 266
               S  + Q        E+L+ + +  +D+K A+ + Q  + + + +Y +         
Sbjct: 384 ETNKSTQIFQNSMIKELLERLVYLSVMNKDYKFAVKFQQKLVDLCEIMYIKEVKREDELP 443

Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
           HPL+ L YY  GKL       + AI ++ +A+ ++   +G  S
Sbjct: 444 HPLIALHYYQLGKLLSQRKKYQEAIDNLQKAMTLVSAYYGLKS 486


>gi|389584498|dbj|GAB67230.1| hypothetical protein PCYB_112510 [Plasmodium cynomolgi strain B]
          Length = 501

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 141/332 (42%), Gaps = 52/332 (15%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
           +  I E F K++ N+  I ++EL P G   Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 167 LKNIIEKFVKISKNSLQIIDNELEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 225

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  V  G E+ ISYI+ A    TR     +QY FTC+C  C K+    + Q     E   
Sbjct: 226 LVDVYPGEELTISYIDIAFDKNTRLAICMDQYFFTCSCKLC-KVNVVSECQNIFNTEFVC 284

Query: 148 CKDDGC------------------------SGF-----LLRDSDDKG--FTCQQCG---- 172
              + C                        S F     +L+ S+++   + C  C     
Sbjct: 285 THSESCKKCVNNMELILISELERKNCYMNKSQFKNSYPILKKSNEQQHMWKCALCKNEVH 344

Query: 173 -------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHP 223
                  + + KE +K+ A   ++ S+K     S  N   + S  K+  K+      YH 
Sbjct: 345 DNVIKSLMEKEKETVKETAHLESLFSEKY----SYDNKSVLQSLNKIKSKIDYLTTFYHH 400

Query: 224 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
              +L + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  F
Sbjct: 401 TRYSLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALF 460

Query: 284 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           L      +  + +A + +  T+G +SP  K+L
Sbjct: 461 LDLKSEGLSLIHKAKKNIIKTYGPDSPIYKDL 492


>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Anolis carolinensis]
          Length = 485

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 135/309 (43%), Gaps = 47/309 (15%)

Query: 25  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 79
           WP    + ++  I+     + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 153 WPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFN 212

Query: 80  -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
            G    VR++ H            +  G E+ +SY++       R++ LK+QY F CTC 
Sbjct: 213 NGNHEAVRSMFHTQMRIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCE 272

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
            C K G  DD+   A+ E     ++  S  +++D            +  SKE I+KI   
Sbjct: 273 HCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV-----------IQFSKEAIEKIDK- 313

Query: 187 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
                       S G ++EVV   +   + Q+ +    ++ +++    L ++L  L+   
Sbjct: 314 ----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIYMLRILSILSEVLSYLQHVD 363

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           EA  Y +     Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG
Sbjct: 364 EAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHGMICKAYAILLVTHG 423

Query: 307 TNSPFMKEL 315
              P  K+L
Sbjct: 424 PTHPITKDL 432


>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
           castellanii str. Neff]
          Length = 365

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 55/86 (63%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           +I+  EI   F   ACN  T+ + EL+PLG G+YP  ++INHSC PN V++FEGR   VR
Sbjct: 99  DITPQEIMHLFCLFACNNFTVSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQCTVR 158

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQ 112
           +++ + K  E+  +Y+E    T TR+
Sbjct: 159 SLRDITKDEEITFNYVEVGDPTETRR 184



 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 56/85 (65%)

Query: 245 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 304
           W+EAL YC  ++ V++R+YP   PL+GLQY   GKL ++L  TE A++SM  A+ IL IT
Sbjct: 269 WREALLYCSASLAVFERIYPPRWPLVGLQYLIHGKLSFYLKYTEAALESMQRALPILTIT 328

Query: 305 HGTNSPFMKELILKLEEAQAEASYK 329
           H +  P ++ L   L EA AE +Y+
Sbjct: 329 HSSGHPLVRTLHTMLAEATAEWNYE 353


>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
          Length = 467

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 72/317 (22%), Positives = 131/317 (41%), Gaps = 81/317 (25%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ L++QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCGLVRSKE 178
            F C+C  C K G  DD+                  F     D K    C++C      E
Sbjct: 271 YFDCSCEHCQK-GLKDDL------------------FQAVKEDPKVVKLCREC-----LE 306

Query: 179 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 238
           + + + ++ N+   + L++ S                                     ++
Sbjct: 307 KQESVFADTNLYVLRLLSIVS-------------------------------------EV 329

Query: 239 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
           L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A 
Sbjct: 330 LSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAY 389

Query: 299 EILRITHGTNSPFMKEL 315
            IL +THG + P  K+L
Sbjct: 390 AILLVTHGPSHPITKDL 406


>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
 gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
          Length = 462

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLK 301

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCARC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
 gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
          Length = 766

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   F+ GR+AVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 546 GLFPLTAIMNHECTPNASHYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 605

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C CPRC      D+ +    L    C++ GC G ++
Sbjct: 606 MTKNFMCDCPRC-----NDNTENGTYLSALFCREQGCKGLVI 642



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281

Query: 117 EQYLFTCTCPRC 128
               F C CPRC
Sbjct: 282 MTKHFICQCPRC 293


>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
           SS1]
          Length = 531

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 26/273 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF------EGRL 82
           SI ++    S+   N+ ++   +L  +G  + P+ ++I+HSC+PNAV+VF       G L
Sbjct: 182 SIRDLTNLLSRFQTNSFSLTTPDLTNVGVAISPLAALISHSCMPNAVVVFPTGLGRRGGL 241

Query: 83  AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
            V+ A++ +  G EVL SY++ A     R K LK++YLF C C  C K    + I     
Sbjct: 242 EVI-ALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLFDCECVLCEKHHDHEWIDPR-- 298

Query: 143 LEGYRCKDDGCSGFLLRDSD--DKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTL 195
            E  RC   GC G +   +       TC  CG        S +E  K+A +V   ++   
Sbjct: 299 -EALRCSKKGCKGKMGTPTSLGTTEITCSICGTKLSVSTASLQEHVKLAEDVLHKAEAIQ 357

Query: 196 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED--------WKE 247
              S      +VS  +  +      Y P    L   ++ LI  + +L           ++
Sbjct: 358 FSDSDKARYLLVSILRDFDPFALSAY-PLLQILQLLQQLLITAVFDLPSTGTERDGLLQD 416

Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
           A +   L +   + VYP  HP  G+QY T  KL
Sbjct: 417 AASCSALVLSGMEDVYPYGHPSKGIQYVTSAKL 449


>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
          Length = 479

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  ++
Sbjct: 141 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 200

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 201 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 244


>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
          Length = 518

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  ++
Sbjct: 180 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 239

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 240 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
 gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
          Length = 498

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 59/102 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L +V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLIIVK 238

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280


>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
          Length = 518

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  ++
Sbjct: 180 EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLK 239

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 240 ALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
          Length = 464

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 56/107 (52%), Gaps = 9/107 (8%)

Query: 31  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV----- 85
           N +A     L  NAHTI +SEL  LG G +P  S +NH C PN V +F G    V     
Sbjct: 168 NFVASLIRMLEMNAHTIYDSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSVSGKPI 227

Query: 86  ----RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
               R ++ +  G E++ISY++   S M R + LKE Y F C C RC
Sbjct: 228 SIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRC 274


>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
 gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
          Length = 390

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 17/149 (11%)

Query: 34  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--- 86
           A+   +L C    NA T+ ++ L PLG  + P  ++INHSC PNAV+VF  R+  +    
Sbjct: 54  ADALMELVCQHETNAFTLADAHLNPLGVCIEPTFALINHSCDPNAVIVFPDRVRGMPSKM 113

Query: 87  ---AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
              A++ +  G EV ISY++ A     RQ  L E+Y F C C  C ++G  D        
Sbjct: 114 HLVAIRAISAGEEVRISYVDVASCQAERQATLLERYCFACECRLCKRMGWRDPRM----- 168

Query: 144 EGYRCKDDGCSGFLLRDSDDKGFT-CQQC 171
               C    C+G+++  +    +T C QC
Sbjct: 169 -ALWCPQPKCTGWVVFQNAASSWTACTQC 196


>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
           intestinalis]
          Length = 430

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 133/293 (45%), Gaps = 40/293 (13%)

Query: 26  PEISINE-IAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLA 83
           P+++ N+ + E  +++  N+  ICN EL+  +GTG+Y   S INHSC PN V  F  R  
Sbjct: 162 PQVTDNKMLLELCARIKNNSFAICNEELQSDVGTGVYLNCSFINHSCEPNCVAEFNMRTL 221

Query: 84  VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
            +RAV+++  G EVLISY++   ++  RQ+ L   Y                        
Sbjct: 222 KIRAVKNITAGEEVLISYVDLFATSFERQRELMSIY------------------------ 257

Query: 144 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
             ++C    C+         K    Q       K    K++S  ++LS+       C ++
Sbjct: 258 -HFQCTCHSCNA--------KTDDDQMMQDFDGKITESKLSSVKDMLSQMEELRKQC-DY 307

Query: 204 QEVVSTYKMIEKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
           Q++     ++E   K+   P  ++ + +  +  +   +EL    +A  +  + +  Y+  
Sbjct: 308 QKIKD---LVEGCVKRKILPHENIYMAKVLDFGMDACIELGVLTQAFEFGSMALFSYKLY 364

Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
               HP+LG+Q    GK+       ++A++ + EA +IL ITH  NS  + EL
Sbjct: 365 LHANHPMLGIQLMKLGKILLHEEKNQDAMQFLREAFKILTITHSPNSSVLTEL 417


>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
 gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
          Length = 498

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 58/102 (56%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L  V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280


>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
 gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
          Length = 600

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 380 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 439

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 440 MTKYFVCNCDRCN-----DNSENGTYLSALFCREQGCKGLVI 476



 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 1/85 (1%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 210 VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKM 269

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQ 138
            L     F C CPRC    Q D + 
Sbjct: 270 FLGMTKHFMCQCPRCQDPTQTDAVN 294


>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
 gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
          Length = 460

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 240 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 299

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 300 MTKYFVCNCDRC-----NDNSENGTYLSALFCREQGCKGLVI 336


>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
 gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
          Length = 498

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 58/102 (56%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L  V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280


>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
          Length = 527

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 55/93 (59%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F KL+ N+ T+ ++  +  G  L+P  +  NHSC PNA   F+     +RA + + KG +
Sbjct: 195 FGKLSANSFTLTSAFGKRRGVYLHPAAARFNHSCNPNASYSFDKGKCYIRATKPIAKGEQ 254

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           + I YI+T  S  TR+  LKE+Y F C CPRC+
Sbjct: 255 IFIPYIDTTYSVGTRRHELKERYKFDCQCPRCL 287


>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
           caballus]
          Length = 802

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 17/120 (14%)

Query: 26  PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 70
           PE SI  +A   +  +L CNA  I              +S    L TG++PV+S++NHSC
Sbjct: 482 PESSIWGVAMLRHMLQLQCNAQAITAIQQTGSKENIITDSRQVRLATGIFPVVSLLNHSC 541

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A VRA QH+ KG E+L  Y   E+      RQ+ L+ QY F CTCP C
Sbjct: 542 SPNTSMSFVSTVATVRASQHIGKGQEILHCYGPHESRMGVAERQQKLRSQYFFDCTCPAC 601


>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
 gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
          Length = 611

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 391 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARAIPKGGEITTTYTKILWSNLTRGIFLK 450

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 451 MTKYFVCNCDRCN-----DNSENGTYLAALFCREQGCKGLVI 487



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 222 VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKM 281

Query: 114 ALKEQYLFTCTCPRC 128
            L     F C CPRC
Sbjct: 282 FLGMTKHFMCRCPRC 296


>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 477

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 133/313 (42%), Gaps = 23/313 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           D+ +   + +I N +++  +   I  ++    F K + N+H+I  S    +G  L   +S
Sbjct: 133 DQAKYEHFEKIFNKMSIFRKDEMIDKDKFYIIFCKTSINSHSIHTSAGTEVGMALDLGVS 192

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE---VLISYIETAGSTMTRQKALKEQYLF 121
             +HSC PN  LVF+G  A +R +      ++     ISYI+   S   R+K LK ++ F
Sbjct: 193 AYDHSCRPNCSLVFDGYQACLRPLTPQTNASDPRTSFISYIDIGRSRYQRRKELKAKWYF 252

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQQCGLVRSKEE 179
            C C RC             IL   RC D  C   L+   D++    TC +CG V  +E 
Sbjct: 253 DCACERCCDPA-------DNILTSIRCSDASCDEPLITAEDAEATAITCPKCGQVADEEY 305

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 239
           ++     +  L  K    ++  + QE++     +   QK +Y    +  +QT   ++ I 
Sbjct: 306 VRAAQQMMLRLPAKFSPESNPDSLQELLDEASNVLH-QKNIY----ITRLQT--AIMHIT 358

Query: 240 MELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 298
             L+D    L + Q  +   Y+  +P     +G Q     K     G  + A+    EA+
Sbjct: 359 GTLKD---NLPFIQHQVYENYKMCFPYADRHIGYQLLQIVKAYIEKGQRKEAVSYAFEAM 415

Query: 299 EILRITHGTNSPF 311
            I  I  G   P+
Sbjct: 416 NIFEICFGLQHPY 428


>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
 gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
          Length = 302

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 85  GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 144

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 145 MTKHFACDCVRCN-----DNTENGTYLSALFCREQGCRGLVI 181


>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
 gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
 gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
          Length = 991

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 771 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 830

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 831 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 867



 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287

Query: 117 EQYLFTCTCPRC 128
               F C C RC
Sbjct: 288 MTKHFICKCVRC 299


>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
          Length = 495

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 275 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 334

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 335 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 371


>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 315

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 56/93 (60%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F    CN+ +IC+++L  +  G+Y   S++NHSC PN V+VF+ R   +R V+ V +G  
Sbjct: 180 FGATICNSFSICDNDLNGIAVGIYLRASMLNHSCYPNCVVVFDERKLQLRTVRDVKEGDV 239

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             ISY++       RQ  L+E+Y F+C C +CI
Sbjct: 240 CTISYVDVINPAKERQTELEERYHFSCNCVKCI 272


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 2/98 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           NA TI N  L+ +  GLY  +S +NHSC PN VL+F G    +R ++ V  GAE+ ISY+
Sbjct: 188 NAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDGAELFISYV 247

Query: 103 ETAGSTMTRQ-KALKEQYLFTCTCPRCIKLGQF-DDIQ 138
           +   S   ++ + L++QY F C+C RC +  ++ DD+ 
Sbjct: 248 DVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVD 285


>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 401

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 8/148 (5%)

Query: 26  PEISINEIA-ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           P  ++ ++A +   +++ N  T C+    P G GL+P  ++INHSC PN    + G    
Sbjct: 20  PSAALLDLARKELGRVSLNGITTCDG-TGPTGLGLFPSGAMINHSCSPNCQAWWRGSQLE 78

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
           +R  + V  G E+ +SYI     +  R+  L+  + F C C RC+   Q+D     A L 
Sbjct: 79  IRCTKPVATGEELCLSYIPIDQPSTVRRAQLRHSWFFACRCRRCVSR-QWD-----AELV 132

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
           G RC   GC+G +       G  C  CG
Sbjct: 133 GLRCPTKGCAGAVPPPPWRGGHACSHCG 160


>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 518

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 134/329 (40%), Gaps = 46/329 (13%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 87
           +  I E F K++ N+  I +++L P G   Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 184 LKNIIEKFVKISKNSLQIIDNDLEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           +  V  G E+ ISY++ A    TR     +QY FTC+C  C K+    + Q     E   
Sbjct: 243 LVDVYPGEELTISYVDIAFDRNTRLAICMDQYFFTCSCKLC-KVNMVSECQHIFNTELLC 301

Query: 148 CKDDGCSG-------FLLRDSDDKG------------------------FTCQQC----- 171
            + + C          L+ + + K                         + C  C     
Sbjct: 302 TQSESCKKCVNHMELLLISELERKSCYMNKNQFKSAYPVFKKTNEQQHVWKCSICKGEVH 361

Query: 172 -----GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
                 LV  ++E  K AS +  L  +     +    Q +      I+ L    YH    
Sbjct: 362 DNVIKSLVEKEKETVKEASHLESLFSEKYTYDNKSVLQALNKIKSKIDYLTT-FYHHTKY 420

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
           +L + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL  
Sbjct: 421 SLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVTYGKYSPIYGYYIFLTGKLALFLDL 480

Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
               +  + +A + +  T+G +SP  K+L
Sbjct: 481 KSEGLSLIHKAKKNIIKTYGPDSPIYKDL 509


>gi|412986118|emb|CCO17318.1| SET and MYND domain-containing protein 3 [Bathycoccus prasinos]
          Length = 734

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 55/93 (59%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           S+ ++A+   KL  N HT+C+ ELRP G G+YPV +++NHS  PN    F G+  +VR +
Sbjct: 280 SVADLAKLIGKLRLNCHTLCDDELRPYGIGVYPVAAMMNHSENPNCFATFRGKKMIVRCL 339

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
           + V  G E+ ISY E       R K+LK  Y F
Sbjct: 340 RDVLPGEELTISYDELMKPKRERAKSLKSNYGF 372


>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
          Length = 799

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 120/312 (38%), Gaps = 54/312 (17%)

Query: 49  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 106
           N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G  + +SY    G  
Sbjct: 478 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 537

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-------- 158
               RQK LK++Y F C C  C +L   D      +L  +RC +  C G +L        
Sbjct: 538 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVLDSCVIKYE 592

Query: 159 ---------------------------RDSDDKGFT----------CQQCGLVRSKEEIK 181
                                      R+   + F           C  CG  R  E   
Sbjct: 593 NKKFERFQGVPQDCISEPHLQLKNDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASH 652

Query: 182 KIASEVNI-LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
               E  I +S+   A+ S        S       L K   H ++  + +  + + +   
Sbjct: 653 ATVGEAGIYISRLQEAIVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFC 712

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
            + + + A+ +C+ +I + +++Y   H ++G +      ++  LGDT  A+KS++    I
Sbjct: 713 MIGELQPAMHHCKASIEILEKLYGSNHIVIGYELMKLSSIQLSLGDTA-AMKSISRLAAI 771

Query: 301 LRITHGTNSPFM 312
               +G ++  M
Sbjct: 772 FSWYYGPHADMM 783


>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
 gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
          Length = 495

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 70/261 (26%), Positives = 115/261 (44%), Gaps = 42/261 (16%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 85
           ++ I  IAE  +K+  N  T       PLG  + P    +NHSC PNAV+ F EGR+  V
Sbjct: 176 DLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPNAVVGFDEGRI-TV 234

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
           +A++ +    +V ISYI+       RQK L E+Y FTC C +C+   Q D  +E   +  
Sbjct: 235 KALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKCL---QGDKAREDQFIPA 291

Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
               +D                      + + +E +K A E+         L S  N + 
Sbjct: 292 SPPSND----------------------IETLKEAEKQAQEL---------LASAKNARP 320

Query: 206 VVSTYKMIEKLQ----KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQ 260
           V +  ++   ++     K++        Q R +LI  L+++E +  AL +  +  + V  
Sbjct: 321 VFAVRQLKAAIKILHDTKMWPITRQPYPQIRSELIVALLQIESFWPALQHSAVRYLFVDP 380

Query: 261 RVYPQ-FHPLLGLQYYTCGKL 280
            +YPQ +HP+     Y   +L
Sbjct: 381 VLYPQEWHPIRNNHAYILSQL 401


>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
          Length = 287

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 54/86 (62%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           N+  I ++    +G G  P++ + NHSC PNA L F+GR AV+ A++H+ +G E+ ISYI
Sbjct: 188 NSFRITDASGDEIGIGFDPLLGMANHSCAPNASLKFDGRCAVLTALKHIEEGEEITISYI 247

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRC 128
           +T      RQ  L++ Y FTCTC  C
Sbjct: 248 DTTLPRAARQAFLQKHYYFTCTCAAC 273


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 5/117 (4%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    +++I+  F  + CN  T+ +   L  +G GL+P + ++NH C 
Sbjct: 152 DIHNFLDYWPRNSKQHRVDDISHIFGVINCNGFTVSDQRGLNAVGVGLFPNLCLVNHDCW 211

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           PN  ++       +RA+  +  G EV +SY++    T  RQ+ LK  Y F CTC  C
Sbjct: 212 PNCTVILNHGKIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHC 268


>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 356

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 4/107 (3%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F    CN+ +IC+++L  +  G+Y   S++NHSC PN   V +GR   +  V+ V +G E
Sbjct: 171 FGATICNSISICDNDLIDIAVGIYLRASMLNHSCDPNCAWVCDGRKLQIMTVKDVKEGDE 230

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQE 139
             ISY++       RQ  LKE Y FTC C +CI+    LG  D + E
Sbjct: 231 CTISYVDAMKPAKVRQADLKESYHFTCKCVKCIEEINALGPDDGLGE 277



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%)

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
           + NHSC  N ++VF+ R   +R ++ V  G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 125 CPRCIK 130
           C +C++
Sbjct: 61  CVKCVE 66


>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
          Length = 576

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 127/284 (44%), Gaps = 29/284 (10%)

Query: 45  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 102
           + + NS    L TG++PV+S++NHSC PN  + F   +A VRA + + KG E+L  Y   
Sbjct: 296 NNVTNSRQVRLATGIFPVVSLLNHSCSPNTSMSFVSTVATVRASEKIAKGQEILHCYGPH 355

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
            +  +   RQ+ L+ QY F C+CP C K      +      E + C   G  G LL+  D
Sbjct: 356 HSRMAVAERQQKLRAQYFFDCSCPVCQKEKHRATV--GPRWEAFCC---GRCGALLQGGD 410

Query: 163 DKGFTCQQCGLVRSKEEIKK--IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--- 217
                   CG     E + +  + S V  L ++       G  Q+++   ++ + +Q   
Sbjct: 411 -----VLSCGGTSCMESVSRDHLISRVQDLQRQV------GGAQKLLGNGQLEQAIQLLL 459

Query: 218 ------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
                 +    P    + +  + L +    L DW+++ A+ Q ++ V +  +      +G
Sbjct: 460 GCRHDAENFLSPEHSVVGEMEDALAQAYAALGDWQKSAAHLQKSLQVVEAQHGPSSVEMG 519

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            + +   +L +       A+ ++  A ++L + +G  S  ++EL
Sbjct: 520 HELFKLAQLFFNGFAIPEALNTIERAEKVLLVHYGPCSDEIQEL 563


>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
           glaber]
          Length = 793

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 123/274 (44%), Gaps = 29/274 (10%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y   ++  S   RQ
Sbjct: 520 LATGIFPVVSLLNHSCSPNTSISFSGTVATIRAAQQIGKGQEILHCYGPHKSWMSIAERQ 579

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
           + L+ QY F C CP C     ++ ++ +A    E +RC  + C   +      +G     
Sbjct: 580 QKLRSQYFFDCNCPAC----HYEKLRTAAGPRWEAFRC--NNCRALM------QGDDVLS 627

Query: 171 CG-------LVRSK--EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
           CG       + R+    +++ +  +V I SK    L   G   + +       +  +   
Sbjct: 628 CGNRACLEPISRTHLVSQLQDLQQQVGIASK----LLRNGKLDQAIPRLLRCRQDAESFL 683

Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
                   +  + L +    L DW+++  + Q ++ V +  +      +G + +   ++ 
Sbjct: 684 AAEHAVAGEIEDNLARAYAALGDWQKSATHLQKSLQVVEARHGPSSVEMGHELFKLAQVL 743

Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           +       A+ ++ +A E+L +  G  S  ++EL
Sbjct: 744 FNGFAVPEALNAIQKAEEVLLVHCGPQSDEVQEL 777


>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
           familiaris]
          Length = 794

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 21  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           L LQ    +I  I E  SK     + +CNS    L TG++PV+S++NHSC PN  + F  
Sbjct: 488 LQLQCNAQAITTIQETGSK----ENNVCNSRQIRLATGIFPVVSLLNHSCSPNTSVSFSS 543

Query: 81  RLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIK 130
            +A +RA Q + KG E+L  Y    +      RQ+ L+ QY F C CP C K
Sbjct: 544 TVATIRASQQIGKGQEILHCYGPHYSRMGVADRQQKLRSQYFFDCGCPACQK 595


>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 509

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 67/266 (25%), Positives = 121/266 (45%), Gaps = 41/266 (15%)

Query: 40  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 97
           + CN   I N  L   G+G+YPV +++NH+C+ N VL ++   +   +RA++ +  G E+
Sbjct: 254 MQCNNFAIHNDILFARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEI 313

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGF 156
             ++ + A  T+ R+  LK  Y F C C RC      D  +E  A L   R  D   + F
Sbjct: 314 THAFTDAASPTVVRKAHLKSLYAFDCNCSRC---NDSDAAKELDAELVATRPIDTIPAYF 370

Query: 157 LLRDSDDKGFTCQQCG-----LVRSKEEIKKIAS--EVNILSKKTLALTSCGNHQEVVST 209
                  K F  ++       L R+  ++ K++S  +V  L + + +    G    + + 
Sbjct: 371 -------KRFRLERLAGLPSLLERATRDVPKVSSPADVEALLQASRSWLQEGMDPRLPTA 423

Query: 210 YKMI-----EKLQKKLYHPFSVNLMQTREKLIKILM-----ELEDWKEALAY-------- 251
             ++       +++ L   +++ L ++  K+  + +      L + +  LA         
Sbjct: 424 RAIVLLVTAWAVRQALLGDYNLELFESNVKIFGMALLWREEALSERQRDLALVSGLRNVL 483

Query: 252 ---CQLTIPVYQRVYPQFHPLLGLQY 274
               +  I VYQR+YP  HPLLGLQ+
Sbjct: 484 VDSARHVIRVYQRIYPANHPLLGLQW 509


>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 465

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 116/262 (44%), Gaps = 21/262 (8%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--A 87
           + ++ E  ++  CN   I +     +G G YP+ +++NHSC PN V +F    A +   A
Sbjct: 172 VRDLQEMLARFQCNNFCILDENFFEVGAGCYPLGAMVNHSCDPNCVTIFARGSAQLELWA 231

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           ++ + K  EV ISY++ A     R+K L+++Y F C C RC     +     +A ++ + 
Sbjct: 232 MKSIGKDEEVTISYVDPANCMNKRRKYLQKRYHFDCRCQRC-----YCKASHAATIDTFL 286

Query: 148 CKDDG---CSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
             D+       + L ++  +G    Q  G +      +K+  +  IL+            
Sbjct: 287 VADENGIPSEKWSLNEATKRGTVLSQITGKIEEFRVCEKLTRKCAILADIDSDFV----- 341

Query: 204 QEVVSTYKMIE----KLQKKLYHPFSVNLMQTREKLIKILMELEDWKE-ALAYCQLTIPV 258
           + V +   M      +LQ +L+   ++ + +++    +   ELE  ++ AL Y       
Sbjct: 342 RNVFTHLHMYNIDAFRLQARLFSDLTIMIAESQSGNSREFAELEFVQQKALVYGYRISKF 401

Query: 259 YQRVYPQFHPLLGLQYYTCGKL 280
           Y  VYP  HPL  L     G+L
Sbjct: 402 YAFVYPALHPLSNLHNIRMGEL 423


>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
          Length = 801

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 19/269 (7%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           L TG++PVIS++NHSC PN  + F G +A VRA QH+ +G E+L  Y           RQ
Sbjct: 525 LATGVFPVISLLNHSCSPNTSVSFVGTVATVRASQHIRRGQEILHCYGPHVSRLGIAARQ 584

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQ 170
           + LK QY F C+C  C         +  A     R +   CS    LL+  D    +   
Sbjct: 585 QKLKSQYFFNCSCLAC-------QTEAHATAAAPRWEAFCCSSCRALLQGGDVLSCSSGS 637

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSV 226
           C    S++    +A+ +  L ++T          E+    + +   Q+     L+   SV
Sbjct: 638 CTEAVSRDH---LAARLRDLQRQTGLARELLRSGELEKAIQQLSGCQRDAESFLWAEHSV 694

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 286
            L +  + L +    L DWK++ A+ Q ++ V +  +      +G + +   ++ +   +
Sbjct: 695 -LGEIEDDLAQAYAALGDWKKSAAHLQKSLTVVEARHGPSSVEMGHELFKLAQILFNGFE 753

Query: 287 TENAIKSMTEAVEILRITHGTNSPFMKEL 315
              A+ ++ +A  +L +  G  S  ++EL
Sbjct: 754 VPEALSTIQKAETVLAVHCGPCSDEIQEL 782


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 61/99 (61%), Gaps = 2/99 (2%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           EI   + ++  N++ I + ++  +G G+Y   S+I+HSC PNAV VFEG   ++RA++ +
Sbjct: 150 EILGLYGRICVNSYNILDPDMNSIGVGIYLGPSVIDHSCKPNAVAVFEGTTILIRALEDI 209

Query: 92  PK--GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           P+   +++ ISYI+   +T TR   L+  Y F C C RC
Sbjct: 210 PRLDWSQIHISYIDVLNTTSTRCTELQNTYYFLCECERC 248



 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
           Y + HPL G+ Y   GK++  L   ++A+  +T+A +ILRITHG      KE
Sbjct: 325 YGEIHPLTGILYLMLGKIQLHLDKPKSALDMLTKADKILRITHGEKHSLFKE 376


>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 573

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/233 (26%), Positives = 100/233 (42%), Gaps = 19/233 (8%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 100
           NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+  V  A   +  G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
           Y++ +    TRQ  L+ +Y F C CP C      D +      +   C    C G     
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396

Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 219
              K  TC     V S   ++ + +E ++++           +   V      +EK+   
Sbjct: 397 ETVKCDTCGASWTV-SPNALRDLVNEGLSLVEADDQDQLDPDDLPRVAEVIARLEKIM-- 453

Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 272
              P S   +     + K L       EALAY    +    +VYP+ HP + L
Sbjct: 454 ---PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVAL 503


>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 463

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 54/99 (54%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F K+  N+ ++  S    +GT + P+IS INHSC PN  LVF+     +R++Q +    E
Sbjct: 181 FCKIMINSFSLMTSSYDTIGTAIDPIISRINHSCYPNTALVFDRNTVALRSLQKILPNQE 240

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
           + +SYI+   +   R   L  +Y F+C C RCI    F+
Sbjct: 241 ITVSYIDIYNTQKNRHDELLSRYYFSCKCTRCIVSDGFE 279


>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
          Length = 273

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 113/253 (44%), Gaps = 37/253 (14%)

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC 
Sbjct: 1   NHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60

Query: 127 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
            C K                + KDD    FL              G+  + +  +++  E
Sbjct: 61  HCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVVKE 87

Query: 187 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
           +   SK TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L
Sbjct: 88  MIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYL 147

Query: 243 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 302
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL 
Sbjct: 148 QAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILL 207

Query: 303 ITHGTNSPFMKEL 315
           +THG + P  K+L
Sbjct: 208 VTHGPSHPITKDL 220


>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Nomascus leucogenys]
          Length = 703

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 132/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 382 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 441

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 442 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 501

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 502 -------QTEAHRMAAGPRW---------------EAFCCNGCGAPMQGDDVLRCGSR-- 537

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 538 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAI 576

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 577 QRLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQRSLRVVEVRHG 633

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 634 PSSVEMGHELFKLAQ 648


>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
          Length = 623

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 54/99 (54%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           +   E+A   ++  CN+HTI + EL+PL  G++P+ ++ NH C PN +  F G   V RA
Sbjct: 155 VGPRELALLIARFGCNSHTISDDELQPLAVGIFPLGAMANHDCRPNTLHAFRGGRMVFRA 214

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           V+ +  G EV  SY E A     R+  L + +LF    P
Sbjct: 215 VRAIQPGEEVTTSYTELAAPRWERRAVLLQHHLFDIDAP 253


>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
           porcellus]
          Length = 800

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 15/105 (14%)

Query: 39  KLACNA-------HT------ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           +L CNA       HT      I +S    L TG++PV+S++NHSC PN  + F G +A +
Sbjct: 491 QLQCNAQAITAIQHTGSKESIITDSRQVRLATGIFPVVSLLNHSCSPNTSVSFTGTIATI 550

Query: 86  RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
           RA Q + KG E+L  Y   E+  S   RQ+ L+ QY F C CP C
Sbjct: 551 RAAQQIRKGQEILHCYGPHESHMSVAERQQKLRSQYFFDCCCPAC 595


>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 582

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/241 (25%), Positives = 103/241 (42%), Gaps = 26/241 (10%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 100
           NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+  V  A   +  G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 156
           Y++ +    TRQ  L+ +Y F C CP C      D +      +   C    C G     
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396

Query: 157 --LLRDSDDKGFTCQQCG--LVRSKEEIKKIAS-EVNILSKKTLALTSCGNHQEVVSTYK 211
             L + SD +   C  CG     S   ++ + +  ++++           +   V     
Sbjct: 397 ADLRQMSDSETVKCDTCGASWTVSPNALRDLVNGGLSLVEADDQDQLDPDDLPRVAEVIA 456

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
            +E++      P S   +     + K L       EALAY    +    +VYP+ HP + 
Sbjct: 457 RLEEIM-----PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVA 511

Query: 272 L 272
           L
Sbjct: 512 L 512


>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
          Length = 409

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 72
           Q   +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC P
Sbjct: 58  QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 116

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           NA + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 117 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 173


>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 481

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/263 (25%), Positives = 119/263 (45%), Gaps = 38/263 (14%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+S+++I E F KL  NA T+       +G  + P+ ++ NHSC PNA   F+     VR
Sbjct: 162 ELSLHKIIEYFCKLDINAFTLTTPFYDHVGAAIEPLAALCNHSCSPNAATDFDKGKIWVR 221

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           A++ + KG +V +SYIET      RQ  L ++Y F C C +C       +I+++A    +
Sbjct: 222 ALRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKC-------EIEKNAPDTHF 274

Query: 147 RCKDDGCSGFLLRDSDDKG---FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 203
                      ++++  +    F   + G   S + IK + S ++ L + TL        
Sbjct: 275 LRAITAVDSKTIQNAQQEAMELFEMVKPG-APSTDSIKNLRSAMSALRRTTLW------- 326

Query: 204 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRV 262
                       L ++ Y       ++ R +LI  LM    ++ A  +C +  + V   V
Sbjct: 327 -----------PLTRQPY-------VRLRGELIASLMGARQFQSAFVHCVIRHLRVNPVV 368

Query: 263 YP-QFHPLLGLQYYTCGKLEWFL 284
           YP ++HP+  +  +   KL  +L
Sbjct: 369 YPNRWHPISSMHKWVFVKLMRYL 391


>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 838

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVL 76
           N V+ +   P+    ++A   +  ACN HTI + EL P+G G+YP  +++NH+C P    
Sbjct: 235 NFVDPLAGGPDP--RDVARLLAAFACNVHTIHDDELNPVGMGVYPTAAMMNHACAPTVAQ 292

Query: 77  VF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
            F EG  A  R V+ V KG E+ ISY+E   +   R+  L   Y F
Sbjct: 293 TFGEGTTATFRCVRDVRKGEEIAISYVELLATRAERRAELLNHYRF 338


>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 612

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 72
           Q   +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC P
Sbjct: 221 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 279

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           NA + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 280 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 336


>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
           grubii H99]
          Length = 447

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVR 86
            ++E+    S +  NA T+ +  L P+G  + P++++ NHSC PNA+ VF   GR   + 
Sbjct: 110 GVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIFLV 169

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI---- 142
           A+  +P G E+L SYI+T+     RQ  L  +Y F C C  C K     + ++S +    
Sbjct: 170 ALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVDPRW 229

Query: 143 -LEGYRCKDDGCSGFLLRDSDDK-GFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 197
            +    C  +G +   L ++ DK    C  C     KE+ K  A +V  L ++ ++ 
Sbjct: 230 CVRHRECSKEGKARMPLENATDKVEVRCIAC-----KEDFKVDADKVRKLVQEGMSF 281


>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 656

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 80/174 (45%), Gaps = 16/174 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  ++ +  S+   N+ T+    L P+G  + P+I++INHSC PNAV+VF          
Sbjct: 239 SARDLVDLTSRFTSNSFTLATPALAPIGVSVSPLIALINHSCDPNAVVVFPRSSTASTPA 298

Query: 79  --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
             E ++ VV A++ +    E+L SYI+T      RQ ALKE Y FTC C  C +      
Sbjct: 299 KLEPQMQVV-AIKDIEPDTEILTSYIDTTLPQPIRQSALKETYCFTCACTLCARPQDATT 357

Query: 137 IQ-ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
           I   SA+    RC   G +       +     C  CG   S    + I   +N+
Sbjct: 358 IDPRSAVWCAQRC--GGVASVYSAGDELVVGRCNTCGFSASMTTAESILDAINV 409


>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 556

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 64/124 (51%), Gaps = 6/124 (4%)

Query: 37  FSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
             KL+CN  +I  S L   +G GLY   +  NHSC PNA   F G+   +R  + + KG 
Sbjct: 190 LGKLSCNVFSIAESALNGEVGCGLYLEAAAANHSCNPNAAQSFSGKTLSLRCTRPIRKGE 249

Query: 96  EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS 154
           E+ I   +       R+++L++ Y F C C RC    G+ +D++    LE + C D  CS
Sbjct: 250 EITIGITQIQKPGPARRESLRKTYFFECRCERCESPEGRAEDMR----LEAFACPDSECS 305

Query: 155 GFLL 158
           GF L
Sbjct: 306 GFCL 309


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/132 (38%), Positives = 68/132 (51%), Gaps = 9/132 (6%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E ++ E AE   KL  NA    +++L  +G  L P +++ NHSCLPNA + F GR AV+R
Sbjct: 167 EGAVREAAEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLR 226

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI---QESAIL 143
           A Q +  G E+ ISY +   S   RQ AL   Y F C C RC +     D+   Q SA  
Sbjct: 227 AEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFECRCRRCTQ-----DLCVYQASAHY 280

Query: 144 EGYRCKDDGCSG 155
             +   DD   G
Sbjct: 281 RDFALNDDSVLG 292


>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
          Length = 370

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 8/119 (6%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KG 94
           + ++  N+ TI + E+  +GTG+Y   S+I+HSC PNAV VF+G+   +RA++ +     
Sbjct: 163 YGRMVINSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDW 222

Query: 95  AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
            ++ ISYI+   +   RQ  L++ Y F C C RC+      D      +   +C  DGC
Sbjct: 223 KKIRISYIDLMKTPYERQMELRQSYYFLCQCDRCL------DENRIKYVHAAKCLKDGC 275


>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
 gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
          Length = 498

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 58/103 (56%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           + +  IAE  +++  N+ T   S   PLG  + P    +NHSC PNAV+ F+  L  V+A
Sbjct: 180 LDLETIAEYLARIEVNSFTFTTSFGDPLGLCIQPFACYMNHSCDPNAVVGFDEGLITVKA 239

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           ++ +    +V ISYI+       RQK L E+Y FTC C +C++
Sbjct: 240 LRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282


>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
 gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
          Length = 787

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 567 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 626

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 627 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 663



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 117 EQYLFTCTCPRC 128
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
           AFUA_2G10080) [Aspergillus nidulans FGSC A4]
          Length = 497

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 60/98 (61%), Gaps = 2/98 (2%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I    ++L  N+  + N +   +G  L+P  ++INHSC  NAV+ F+G    V+A++ +
Sbjct: 187 KIVAYHARLDLNSFNLTNDD--DIGIYLHPYAALINHSCDYNAVVGFDGSEIFVKAIRPI 244

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             G ++ ISYI+T   T  RQK L+E+Y FTC C +C+
Sbjct: 245 ATGEQIFISYIDTTYPTRIRQKELQERYFFTCNCAKCL 282


>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
           mulatta]
          Length = 399

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 65/118 (55%), Gaps = 6/118 (5%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    +  +  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           N  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC  C K
Sbjct: 211 NCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 268


>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
 gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
          Length = 844

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 624 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 683

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 684 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 720



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 117 EQYLFTCTCPRC 128
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|255072485|ref|XP_002499917.1| predicted protein [Micromonas sp. RCC299]
 gi|226515179|gb|ACO61175.1| predicted protein [Micromonas sp. RCC299]
          Length = 676

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 55/89 (61%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           +A   +K++CNAHT+C+ EL  +G G+YP  +++NH   P A   F+G+  V+RA + + 
Sbjct: 228 VARLLAKISCNAHTVCDEELNAIGVGVYPAAAMVNHGDAPTAAQSFKGKKIVLRATRDLK 287

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLF 121
           +G EV ++Y+E   +   R+ AL   Y F
Sbjct: 288 RGDEVTMAYVELLATRQERRAALHAGYNF 316



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 2/105 (1%)

Query: 226 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
           ++L++ R  + + +  +   D++ A       +P Y+  YP  HP LGL      K+E  
Sbjct: 549 IHLLRVRVLDAMQRAAVTAGDFESARDAAYAVLPAYRLSYPPCHPPLGLHLALIAKIEAH 608

Query: 284 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 328
           L D   A++   EA+  L ++HG NS  ++ +   L E++AE  Y
Sbjct: 609 LVDLNAAVQFAREAISCLNVSHGRNSDVVRMMETLLGESEAELRY 653


>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
          Length = 802

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSRCQRDAESF--LWAE-HAMVGAIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 304 THGTNSPFMKELILKLEE 321
            HG +S  M   + KL +
Sbjct: 730 RHGPSSIEMGHELFKLAQ 747


>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 616

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 49/82 (59%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LG G  P+++  NHSC PNA +VF     V+RA++ + KG E+ + YI+ +     RQ  
Sbjct: 284 LGIGFDPILNSSNHSCDPNAAVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAE 343

Query: 115 LKEQYLFTCTCPRCIKLGQFDD 136
           LKE Y F+C C +C K   FD+
Sbjct: 344 LKESYFFSCRCSKCKKGAIFDE 365


>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 764

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 70/122 (57%), Gaps = 8/122 (6%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKL----ACNAHTICN-SELRPLGTGLYPVISIINH 68
           Q  N+  L    P +++ +  E  ++L      NA T+ + S LRP+G GLY   +++NH
Sbjct: 313 QARNVRWLAAHGPGLALPDADETVARLFAVAQANALTLTDPSSLRPIGQGLYASAALLNH 372

Query: 69  SCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           SCLPNA   +  EGRL  VRAV+ +  G EV ++Y++       RQ+AL++ + F C C 
Sbjct: 373 SCLPNANWSVDGEGRL-CVRAVRPIEAGEEVTVAYVDPTLPYHARQQALQDHFFFACRCL 431

Query: 127 RC 128
           +C
Sbjct: 432 QC 433


>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 635

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 60/107 (56%), Gaps = 7/107 (6%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 84
           SI+E+ +  SK   N++T+    L P+G  + P+  ++NHSC PN  +VF   L+     
Sbjct: 261 SISELVDLISKFTLNSYTLTTPSLSPIGVSVSPLAGLLNHSCDPNVSVVFPRILSADKEP 320

Query: 85  ---VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
              + A+Q +P  +E+L SY++     + RQK LKE Y F C+C  C
Sbjct: 321 ALHIVAIQDIPADSELLTSYVDVTLPVLQRQKDLKETYSFNCSCHSC 367


>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
          Length = 776

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 59/282 (20%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 104
           I NS+   L TG++PV+S++NHSC PN  + F   +A VRA   + KG E+L  Y   E+
Sbjct: 492 ITNSKQVRLATGIFPVVSLLNHSCSPNTSVSFISTVATVRASVQIGKGQEILHCYGPHES 551

Query: 105 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 164
                 RQ+ L+ QY F C CP C               E +R             S  +
Sbjct: 552 RMGAAERQQKLRSQYFFDCNCPACQN-------------EKHR---------TTTGSKWE 589

Query: 165 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYH 222
            F C +C  +   +++                  SCGN    E VS  +++ +LQ     
Sbjct: 590 AFCCNRCRTLMQGDDV-----------------LSCGNTSCTEAVSRDRLVSQLQD---- 628

Query: 223 PFSVNLMQTREKLIKILM---ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
                 +Q + K+ + L+   +LE   + L  C+     +    P  H ++G       +
Sbjct: 629 ------LQQQIKMAQKLLKNGKLEQAIQLLLGCRYDAESF---LPAEHSVVGEIEDDLAQ 679

Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
               LGD + +   + +++ ++ + HG +S  M   + KL +
Sbjct: 680 AYAALGDWQKSAAHLQKSLHVVEVRHGPSSVEMGHELFKLAQ 721


>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
          Length = 170

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 51/77 (66%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 112 QKALKEQYLFTCTCPRC 128
           +K L++QY F C C RC
Sbjct: 61  RKQLRDQYCFECDCFRC 77


>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Ailuropoda melanoleuca]
 gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
          Length = 793

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 128/307 (41%), Gaps = 61/307 (19%)

Query: 21  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           L LQ    +I  I +  SK     + I +S L  L TG++PV+S++NHSC PN  + F  
Sbjct: 487 LQLQCNAQAITTIQQTGSK----ENNITDSRLVRLATGIFPVVSLLNHSCSPNTSMSFVS 542

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
            +A +RA + + KG E+L  Y    +      RQ+ L+ QY F C CP C K     +  
Sbjct: 543 TVATIRASEKIGKGQEILHCYGPHHSRMGVADRQQKLRSQYFFDCGCPACQK-----EKH 597

Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
            +A+   +                 + F C +CG V    ++    S            T
Sbjct: 598 RAAVGPRW-----------------EAFCCSRCGAVLQGGDVLSCGS------------T 628

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
           SC    E VS   ++ +LQ           ++ R    + L+     ++A+   QL + V
Sbjct: 629 SC---TESVSREHLVSRLQD----------LRQRVGSARNLLSNGQLEQAI---QLLL-V 671

Query: 259 YQRVYPQF----HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
           Y+     F    H ++G       +    LGD + +   + +++ ++ + HG  S  M  
Sbjct: 672 YRHDAENFLSAEHSMVGEMEDALAQAYAALGDWQKSATHLQKSLRVVEVQHGPASVEMGH 731

Query: 315 LILKLEE 321
            + KL +
Sbjct: 732 ELFKLAQ 738


>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 645

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 6/104 (5%)

Query: 31  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGR-LAV 84
           +++ +  S+   N  TI +  L PLG  + P +++INHSC PNA +VF     EG  L  
Sbjct: 264 SDLVDLISRFTTNTFTITSPTLAPLGACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQ 323

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           V A++++    E+L +YI+T   T  RQ+ALKE Y F C CP C
Sbjct: 324 VVALKYIGPDEEILTAYIDTTLPTGLRQQALKETYHFVCECPLC 367


>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
          Length = 806

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 304 THGTNSPFMKELILKLEE 321
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
          Length = 475

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 113/276 (40%), Gaps = 34/276 (12%)

Query: 43  NAHTICNSE--LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
           N   IC+ E  +  +G+ LY   S+INHSC PNA  VF G   V++A++ +  G E+ I+
Sbjct: 183 NQFAICDEESSICDIGSALYLNHSLINHSCKPNAFPVFNGTNLVIKALEKIAPGEEIKIA 242

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
           Y +T      R+  L + Y F C C  C    + D           R K     G ++R 
Sbjct: 243 YTDTKAVIQDRRDYLNDIYRFVCQCQGCTNDDEVD-----------RKKHLDKKGNVIRR 291

Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 220
           SD    + +   +VR  EE KK   E N++             +E    +     L +K 
Sbjct: 292 SDAIWQSAET--MVRDMEEFKK-NKEWNLM-------------KEAAQGW-----LARKF 330

Query: 221 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
               ++  ++  E      +E ++W   L      I  Y+ +Y   HP LG+      K+
Sbjct: 331 LPDENIFWIRLNEFAFDAGIETQEWSLCLETGASLIVNYKEIYGPMHPTLGIHLMKFAKI 390

Query: 281 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
              +     A +    A  I+ + +   S   K+L+
Sbjct: 391 LLHIEKPNEAEEYFRRAFAIMSLFYEPESAVRKQLL 426


>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 530

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 61/114 (53%), Gaps = 9/114 (7%)

Query: 31  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGR 81
            ++ +  SK   N  T+ +S L  LG  + P++++INHSC PNAV+V+         E  
Sbjct: 180 GDLVDIISKFITNTFTLTSSSLSALGVSVSPLVALINHSCDPNAVIVYPRCSNEPSTEEP 239

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
           L  V A++ +    E+L +YI+T     +RQK LKE Y F C CP C K    D
Sbjct: 240 LMQVVAIRDIEVDEEILTAYIDTTLPRFSRQKFLKETYNFDCQCPSCTKYSGVD 293


>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Meleagris gallopavo]
          Length = 706

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 62/264 (23%), Positives = 112/264 (42%), Gaps = 9/264 (3%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 113
           L T  +PV+S++NHSC PN  + F G +A VRA Q +P G E+   Y        +  ++
Sbjct: 431 LATAFFPVLSLLNHSCSPNISVSFSGTVATVRASQPIPSGQEIFHCYGPHRCRMRVAERQ 490

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC--QQC 171
            L  QY F C C  C  L + +   ES  +    C    C         ++   C  + C
Sbjct: 491 QLLSQYFFECRCQAC--LDELESDVESVSMRNSFC----CPSCQASMQGEEMLCCSNEAC 544

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
            L  S+E + +   ++    +K L L    N  E +      +   +    P  + + + 
Sbjct: 545 ALSVSRERLSRRLLDLQQQMEKALELLRDSNADEAIRMLLKCQMDARNFLSPEHLLMGEL 604

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 291
            + L +I   L  W+EA  + + +I + +  +      +G + +   ++ +       A+
Sbjct: 605 EDHLAQIYATLGKWQEAARHLERSIQLVEMHHGPSSVEMGHELFKLAQILFNGFAVSEAL 664

Query: 292 KSMTEAVEILRITHGTNSPFMKEL 315
            ++  A EIL +  G  S  ++EL
Sbjct: 665 STIQRAEEILSVHCGPQSTQIQEL 688


>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
          Length = 806

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 304 THGTNSPFMKELILKLEE 321
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
          Length = 750

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 138/339 (40%), Gaps = 64/339 (18%)

Query: 17  NLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 75
            LV  IL   E +++E I    +    N   + +S   P+G  +YP  +++NHSC PN +
Sbjct: 437 GLVLRILGCKEKTLDEAIRHTLTAFQQNNFGVTDSLYAPIGEAVYPHAALLNHSCSPNCI 496

Query: 76  LVFE-------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           L ++        +L +V A + +  G E++ SY++    T TR+  L++ + F C C RC
Sbjct: 497 LRYKIGLESSPPQLEIV-ACKDISSGEELVHSYVDLVLPTETRRNRLRDTHGFICECKRC 555

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASE 186
              GQ                   C+  L RD        T ++ G + S    ++I  E
Sbjct: 556 A--GQ-------------------CTVELPRDRQSWMLWPTKERLGEIPSSTPTQQIDIE 594

Query: 187 VNILSKKTLALTSCGNHQEVVSTYKM-----------------IEKLQKKL--------Y 221
             I  +   A      H ++V+T ++                 I+ LQ+ +        +
Sbjct: 595 EAIGGRSGEA-----EHLQIVNTSRLLQDQATRAMVNGNNDLEIQLLQQAVGVFLLSGGF 649

Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP--QFHPLLGLQYYTCGK 279
            PF   L   R      L+      E +  C+  +          + HPLLGLQ YT G 
Sbjct: 650 SPFHNELYTARCAYFSALLANGKIDEGIEQCEHIVSSLSVCLGNVKHHPLLGLQLYTLGD 709

Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 318
           L    G  + A  +   A EI+ ++HG   P ++ LI K
Sbjct: 710 LCSGGGCGQKAANAYRWAYEIMSVSHGAQDPMVQALIRK 748


>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 677

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 18/201 (8%)

Query: 33  IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F G
Sbjct: 446 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 505

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
              VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +  
Sbjct: 506 TTMVVRAIRSISSGEEISENYGQIFTTTPESERKRKLRLQYFFDCNCEACREHWPLLEEI 565

Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
           +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K   
Sbjct: 566 DPTILR-FKCETGKECGNVLPVRTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 624

Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
                GNH+E +  Y  I KL
Sbjct: 625 RYLEQGNHREALKNYLKILKL 645


>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
           mulatta]
          Length = 802

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 189 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 243
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 304 THGTNSPFMKELILKLEE 321
            HG +S  M   + KL +
Sbjct: 730 RHGPSSVEMGHELFKLAQ 747


>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
           distachyon]
          Length = 791

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 70/312 (22%), Positives = 135/312 (43%), Gaps = 52/312 (16%)

Query: 46  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 105
            +C+ E   +   +Y   S+ NHSC PN    F  R   +R+ +++  G+ + +SY   A
Sbjct: 466 VMCSVEQIRVAQAIYMSGSLFNHSCRPNVHTYFHSRTLFLRSTEYIESGSPLELSYGPQA 525

Query: 106 GST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD 151
           G    + RQK+L+E Y F+C C  C +L   D + +S          A+ E   YR +++
Sbjct: 526 GEMDLLERQKSLQENYKFSCRCSSCSELNLSDLVIDSFCCPRSFCLGAVSESTCYRSEEN 585

Query: 152 ---------------------------GCSGFLLRDSD-----DKGF--TCQ-QCGLVRS 176
                                             R+ D     D G+  +C+ Q  L   
Sbjct: 586 FVHVSLNGSAVCKLSLPDVSKVDEDMEKVGKLFFRNDDADLKIDPGYCMSCRSQLDLSSV 645

Query: 177 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 236
              + + AS++N    K L +    +   + +  + +E++ KKL HP+S  L Q  +K+ 
Sbjct: 646 VATLDRAASKINRF--KELIVIDDVSEVLITNALQSLEQI-KKLRHPYSKALAQAEDKIA 702

Query: 237 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 296
           +   ++ D ++A  +C+ +I + +++Y   H  +  +      ++  +GD   A  ++T 
Sbjct: 703 EAFAKVGDQEQARKHCEASIKILEKLYHPKHIAIAHELIKLISIKLCMGDKAGAAATVTR 762

Query: 297 AVEILRITHGTN 308
           A  I  + +G +
Sbjct: 763 AEAIFLLYYGPD 774


>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 678

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 91/201 (45%), Gaps = 18/201 (8%)

Query: 33  IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F G
Sbjct: 447 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 506

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
              VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +  
Sbjct: 507 TTMVVRAIRSISSGEEISENYGQIFTTTPEKERKRKLRLQYFFDCNCEACREHWPLLEEI 566

Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
           +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K   
Sbjct: 567 DPTILR-FKCETGKECGNVLPVKTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 625

Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
                GNH+E + +Y  I KL
Sbjct: 626 RNLEQGNHREALKSYLKILKL 646


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 7/143 (4%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--GTGLYPVISIINHSCLPNA 74
           +L+ L+ +W    + E+ +   ++  NAH + +     L  G G++P+ ++INH+C PN 
Sbjct: 252 DLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACRPNC 311

Query: 75  VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
             V+ G    VR ++ V  GAE+ + YI+   ST  R++ L     F C C RC      
Sbjct: 312 TFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENPSSM 371

Query: 135 DDIQESAILEGYRCKDDGCSGFL 157
           DD      L+G  C D G  G L
Sbjct: 372 DD-----YLDGVCCTDCGERGCL 389


>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 496

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 15/141 (10%)

Query: 19  VNLILQWPEISINEIAENFSKLACNAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVL 76
           +N +LQ     I+ + +    + CNA TI  C S +  LGTGLYP ++ INHSC PN  +
Sbjct: 206 INQVLQS---EIDVLYDALCSINCNAFTIIDCVSGVL-LGTGLYPGLTFINHSCDPNLQV 261

Query: 77  VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC--IKLGQF 134
              G++  ++AV+ + KG E  ISYI+     + R + L E + F C C +C  ++  + 
Sbjct: 262 TQIGKILTLKAVRPIKKGEEFTISYIDRTEGAIQRNEELMETFFFECQCHKCASVRSEKR 321

Query: 135 DDIQESAILEGYRCKDDGCSG 155
           +D         + CKD  C G
Sbjct: 322 NDFY-------FVCKDPSCKG 335


>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
 gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
          Length = 801

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 104
           I NS    L TG++PV+S++NHSC PN  + F   +A +RA Q + KG E+L  Y   E+
Sbjct: 517 ITNSRQVRLATGVFPVVSLLNHSCSPNTSVSFTSTVATIRAAQQIAKGQEILHCYGPHES 576

Query: 105 AGSTMTRQKALKEQYLFTCTCPRC 128
                 RQ+ L  QY F C+CP C
Sbjct: 577 RMGVAERQQRLSSQYFFDCSCPAC 600


>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
          Length = 804

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
          Length = 800

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 57/283 (20%)

Query: 45  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 102
           +++ NS    L TG++PVIS++NHSC PN  + F   +A VRA Q + KG E+L  Y   
Sbjct: 506 NSVTNSRQVRLATGIFPVISLLNHSCSPNTSVSFISTVATVRASQQIGKGQEILHCYGPH 565

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
           ++      RQ+ L+ QY F C+C  C         ++ +   G R               
Sbjct: 566 QSRMRAAERQQKLRSQYFFDCSCAACQN-------EKHSAATGPRW-------------- 604

Query: 163 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
            + F C+ CG +   +++    S            TSC    E VS   +I +LQ     
Sbjct: 605 -EAFCCRSCGALMQGDDVLICGS------------TSC---TESVSRDHLISRLQD---- 644

Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----HPLLGLQYYTCG 278
                 +Q +  + + L+  +  + A+   QL +   QR    F    H L+G       
Sbjct: 645 ------LQQQVGMARKLLRNDKPERAI---QLLLGC-QRDAQSFLSGEHSLVGEMEDDLA 694

Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 321
           +    LGD + +   + ++++++ + HG +S  M   + KL +
Sbjct: 695 QAYAALGDWQKSATHLQKSLQVVEVRHGPSSVEMGHELFKLAQ 737


>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
 gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
          Length = 804

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4-like [Ornithorhynchus anatinus]
          Length = 815

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 41  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
           A   H I +S    L TGL+PV+S++NHSC PN  + F GR+AVV+A Q + +G E+L  
Sbjct: 508 ASEDHVIASSRQVRLATGLFPVVSLLNHSCSPNTSVSFSGRVAVVQATQPIGRGQEILHC 567

Query: 101 Y--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           Y           R++ LK QY F C C  C++
Sbjct: 568 YGPHRCRMDVAERRQKLKAQYFFDCRCQACLE 599


>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
 gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
          Length = 445

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 6/102 (5%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+P+ +I+NH C PNA   F+ G LAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 225 GLFPLTAIMNHECTPNASHYFDNGTLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 284

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 285 MTKNFNCDCVRC-----NDNTENGTYLSALFCREQGCRGLVI 321


>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 115/294 (39%), Gaps = 58/294 (19%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           +    CN+  I +++L  +   +Y   S+ NHSC  N ++VF+ R   +R V+ V +G E
Sbjct: 241 YGATNCNSFGIFDNDLIVISDAIYLRASMANHSCDYNCIVVFDERKLQLRTVKDVQEGEE 300

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDG 152
             I Y++       R+  L+E+Y FTC C +CI+    LG                 DDG
Sbjct: 301 CTIGYVDVIHPAKERRAELEEKYHFTCKCVKCIEEINALG----------------PDDG 344

Query: 153 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 212
                                         +  E+  L K    +    N  +     ++
Sbjct: 345 ------------------------------LGEELRDLKKSLEQIVDAENSHDWAKVIQL 374

Query: 213 IEKLQKKLYHPFSVN-----LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 267
            E   K +    S+      L+  R+      ++ + WK+A    QL I  Y   Y +++
Sbjct: 375 CEPYLKPMDSSSSLPANHQLLVMLRDTAFFACIQSQSWKKAAEMGQLNIESYIYHYGRYN 434

Query: 268 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM---KELILK 318
           P +G+     G++   L     A K   EA  + ++THG     M   K+L+ K
Sbjct: 435 PNVGMYLLKIGEVLLNLDRLREARKCFKEAESVFKVTHGLQHSLMASVKKLLFK 488



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 38/66 (57%)

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 124
           + NHSC  N   VF+G    +R ++ V +G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCAGVFDGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 125 CPRCIK 130
           C +C++
Sbjct: 61  CVKCVE 66


>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
           harrisii]
          Length = 802

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 117/276 (42%), Gaps = 32/276 (11%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           L TGL+P IS++NHSC PN  L F G +  V A Q + +G E+L  Y    G     TRQ
Sbjct: 525 LATGLFPAISLMNHSCRPNTSLSFRGSVGSVHASQLIARGQEILHCYGPHEGRMDAATRQ 584

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILE---GYRCK----------DDGCSGFLLR 159
           + L+ QY F C C  C    Q +++  + +     G+RC             GCS    R
Sbjct: 585 QKLRSQYFFDCRCQAC----QEEEVHGAGVTPKQGGFRCPTCRAALQGLDPLGCSSGSCR 640

Query: 160 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 219
               +     Q G ++ + E+       +  ++    L +C    E   T +        
Sbjct: 641 AQVSRAHLLGQLGDLQQRVEMAGELLRGDRTARAVQLLLACRREAEDFLTSEH------- 693

Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 279
                 V + +  ++L +    L DW ++ A+ Q ++ V +  +      +G + +   +
Sbjct: 694 ------VLMGEIEDRLAQAYASLGDWPKSAAHLQNSLQVVKAQHGPASVEMGHELFKLAQ 747

Query: 280 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
           + +       A+  + +A ++L + +G  +  ++EL
Sbjct: 748 VLFNGFAVTEAVSVIEKAEKVLSLHYGPENEQVQEL 783


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 70/127 (55%), Gaps = 6/127 (4%)

Query: 17  NLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 71
           ++ N +  WP  S    I++I+  F  + CN  T+ +   L+ +G GL+P + ++NH+C 
Sbjct: 131 DIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCW 190

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           PN  ++       +R++  + +G E+ ++Y++    +  R++ LK QY F C C  C K 
Sbjct: 191 PNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYC-KN 249

Query: 132 GQFDDIQ 138
           G  DD++
Sbjct: 250 GTKDDLK 256


>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 823

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 114/269 (42%), Gaps = 35/269 (13%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           +I E+A++      + H      +R +   +Y   S+ NHSC PN VL +EGR+  +RA 
Sbjct: 513 AIYEVADD------DEHGGGTKPIRRVAEAMYATGSLFNHSCRPNTVLNYEGRVLTIRAC 566

Query: 89  QHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
           + + KG EVL  Y   +    S   RQKAL+ QY FTC C  C           S++ +G
Sbjct: 567 EDIAKGKEVLNCYGPHVAHIPSRDERQKALRHQYFFTCRCDAC----------SSSVEQG 616

Query: 146 YR------CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
            +      C  +GC+  L  + +      Q+  +   + EI K+A E    +++ L    
Sbjct: 617 GQDFDYLVCGSEGCTRELQINPEQ---LRQEVDIRVVQREI-KLAREFYERARRALDGGG 672

Query: 200 CGNH-----QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
                    +    +     +L+ K  HP    L +T + L +    L  ++ A  + + 
Sbjct: 673 GDGSGDGDVERAAQSLAHALRLRAKYLHPQHRELGRTHDALAEAHARLGRYEAAATHLER 732

Query: 255 TIPVYQRVYPQFHPLLGLQYYTCGKLEWF 283
            + V +  +  F   L  QY    +  WF
Sbjct: 733 AVAVLESRFAPFSFELADQYAKLAQA-WF 760


>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
          Length = 526

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E +I    E   K+  NA    + +L  +G  L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNIRRAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A + +  G E+ ISY +      TR++AL EQY F CTC RC
Sbjct: 216 AERPIHAGDEIEISYTDYTMPLSTRREAL-EQYSFECTCARC 256


>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
 gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
          Length = 400

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 91/211 (43%), Gaps = 35/211 (16%)

Query: 15  IANLVNLI--------LQWPEIS-------INEIAENFSKLACNAHTICNSELRPLGTGL 59
           IANL NL+        L  PE+        I+ I E   K+ CN   I     + +   +
Sbjct: 200 IANLFNLVITSSDNHDLPTPELKKEAVEKVISVIRELIHKVRCNQFGIWTKNDKCIAVAI 259

Query: 60  YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
            P  S  NHSC+PN + + +G     +A+  V KG  + ISY++      +R++ LK  Y
Sbjct: 260 SPSSSFFNHSCIPNCINIRDGNKMTFKALYPVKKGEPLAISYLDLDLPVESRKEYLKYGY 319

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F C CPRC +    D+  ++ I + Y C+   C                  GL  SK +
Sbjct: 320 YFDCGCPRCDEKTNQDECMDNWISKFY-CQRKKCV-----------------GLYYSKTK 361

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
           +     +VN   K TL  + C N   + STY
Sbjct: 362 VN--IDQVNKNDKITLNCSDCNNEFIINSTY 390


>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
          Length = 799

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
          Length = 804

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
 gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
 gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
          Length = 799

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Metaseiulus occidentalis]
          Length = 775

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 105/249 (42%), Gaps = 29/249 (11%)

Query: 36  NFSKLACNAHTICN-----------------SELRPLGTGLYPVISIINHSCLPNAVLVF 78
           +  +L CNAH I                    E   + T +YP  S++NHSC PN +  F
Sbjct: 455 HIQQLICNAHAITTIQQPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHSCNPNIISGF 514

Query: 79  E-GRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIKLGQF 134
             G   VV++V+ +  G EV   Y       MT   RQ AL+EQY F C C  C K G  
Sbjct: 515 RSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQTALQEQYFFRCDCTACQK-GDL 572

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
           DD     I    RC  + C G L          C QC   +   E ++    ++ L  + 
Sbjct: 573 DD----QISMALRC--EYCEGPLSAVQSSGKADCLQCSTTQDCLEKEQKVFRMHDLFVQG 626

Query: 195 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
           L L    +H E +   +     Q+KL +  +  LM TR+ + K L  L  +++A+   + 
Sbjct: 627 LQLAEMDSHGEALERLQKCLASQEKLLYRHNKQLMITRDMVAKSLCALGRFRDAVGVLKS 686

Query: 255 TIPVYQRVY 263
            +   + +Y
Sbjct: 687 AVESVRHMY 695


>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
           troglodytes]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
           gorilla gorilla]
          Length = 804

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P++++  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTVWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 CPNTSVSFISTVATIRASQRIQKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|405969486|gb|EKC34455.1| SET and MYND domain-containing protein 4 [Crassostrea gigas]
          Length = 324

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 78/156 (50%), Gaps = 19/156 (12%)

Query: 31  NEIAENFSKLACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEG 80
           + I      L CNAH I     +P          +G+G Y ++S+INHSC P+ V    G
Sbjct: 94  SHILRQIQMLPCNAHEISEILWKPGDPTVTNSIEIGSGAYALLSLINHSCDPSVVRHNYG 153

Query: 81  RLAVVRAVQHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC-IKLGQFDDI 137
            + VVRA++ + KG E+L +Y      T+   R+  L+ QY F C C  C ++L  + DI
Sbjct: 154 NICVVRAIKPIKKGEEILDNYGALYPLTIREERRAKLRPQYFFDCNCDACQLELPLYFDI 213

Query: 138 QESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQC 171
            +   +  ++CKD  CSG  F+ +D D     C  C
Sbjct: 214 PDDVPV--FKCKD--CSGPIFISQDKDLAEAECSSC 245


>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 NPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARAFAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
          Length = 499

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+  L  V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
           garnettii]
          Length = 749

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 17/120 (14%)

Query: 26  PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 70
           P++ I  +A   +  +L CNA  I              NS    L TG++PV+S++NHSC
Sbjct: 483 PDLDIWGVAMLRHMLQLQCNAQAITTIQQTGSEENIITNSRQVRLATGVFPVVSLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   E+      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSISFCSTVATIRASQQIGKGQEILHCYGPHESRMGVAKRQQKLRSQYFFDCNCPAC 602


>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
           127.97]
          Length = 499

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+  L  V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
           jacchus]
          Length = 442

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/320 (24%), Positives = 123/320 (38%), Gaps = 104/320 (32%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQY 270

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
            F CTC  C K                + KDD    FL              G+  + + 
Sbjct: 271 YFDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKA 297

Query: 180 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 235
            +++  E+   SK TL       S G + EVV         ++KLYHP +          
Sbjct: 298 SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKL------CREKLYHPNNAQ-------- 343

Query: 236 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
                                             LG+     G   W  G+ E     + 
Sbjct: 344 ----------------------------------LGMAVMRAGLTNWHAGNIEVGHGMIC 369

Query: 296 EAVEILRITHGTNSPFMKEL 315
           +A  IL +THG + P  K+L
Sbjct: 370 KAYAILLVTHGPSHPITKDL 389


>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
 gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
          Length = 451

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 121/285 (42%), Gaps = 26/285 (9%)

Query: 43  NAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 98
           N H I +   L  +G GLY  +   +HSC PNA+   EG +A ++A+    ++       
Sbjct: 168 NRHAIGDQLYLNEIGKGLYLDLCAYDHSCRPNAIYTCEGFVARLKALDDSVNLSDRKRTH 227

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
             YI    S   R++ LKE + F C C RC             +L   +C  + CS  + 
Sbjct: 228 TCYITPLDSRAERRRLLKESWYFDCFCERCTD-------PNDHLLTALKC--NNCSTAVP 278

Query: 159 RDS--------DDKGFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 209
             +        DD  FTC +C     S+ ++++   ++  + K+     S    +  + +
Sbjct: 279 IHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIEKQLAKTGSETLSKSELDS 338

Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
             ++ K      + F   L+Q + +LI    +  D K+ L      +P  +  YP+ HP 
Sbjct: 339 MLLVCKSVLADENVFFAKLLQRKLQLI----DCNDSKQLLETSLQALPCLRVCYPEMHPT 394

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
           L   Y T G     L D + A++ + +A+   R T G+N    K+
Sbjct: 395 LAYHYMTIGIYYKNLADRKTAMQYLNDALRRFRFTLGSNHLLTKQ 439


>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
          Length = 828

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 540 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 599

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 600 PHESRMGVAERQQRLSSQYFFDCRCGAC 627


>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
 gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
          Length = 575

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 59/124 (47%), Gaps = 11/124 (8%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGRLAVVRAVQHV 91
            S+L  N  TI  SE  PLG G+YP  S+INHSC PNAV  F        +  +   + V
Sbjct: 257 MSRLIMNGFTISTSEQLPLGVGVYPAASMINHSCRPNAVPTFWFSTPSPPMLQITMCKSV 316

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
             G E+ ISY + +     R++ L + Y F C C  C       D +    + G +C+  
Sbjct: 317 RVGDEIAISYCDVSAPRYVRREGLIKNYKFACDCSHC------GDFETDGDMIGLKCQAS 370

Query: 152 GCSG 155
           GC G
Sbjct: 371 GCIG 374


>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
          Length = 706

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 116/265 (43%), Gaps = 10/265 (3%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 113
           L T  +PV+S++NHSC PN  + F G  A VRA Q +P G E+   Y        +  ++
Sbjct: 430 LATAFFPVLSLLNHSCCPNTSVSFSGTTATVRASQLIPSGQEIFHCYGPHRCRMRVAERQ 489

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
            L  QY F C C  C    + D     ++   + C   GC G  ++  D    + + C +
Sbjct: 490 QLLSQYFFECRCQACCDELESDVKSVVSLRNTFCCP--GCRG-PMQGEDMLCCSNEACAI 546

Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ---KKLYHPFSVNLMQ 230
             S+E +     ++    K+ L L      ++     KM+ K Q   +    P  + + +
Sbjct: 547 SVSRESLSCRLQDLQQRIKRALELL---RDRKADQAIKMLLKCQVDAECFLSPEHLLMGE 603

Query: 231 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
             + L ++   L  W+EA  + + +I + +  +      +G + +   ++ +       A
Sbjct: 604 LEDHLAQVYATLGKWQEAARHLERSIEIVEMHHGPSSIEIGHELFKLAQILFNGFAVSEA 663

Query: 291 IKSMTEAVEILRITHGTNSPFMKEL 315
           + ++  A EIL +  G  SP ++EL
Sbjct: 664 LSTIQRAEEILSVHCGPQSPQIQEL 688


>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
           boliviensis boliviensis]
          Length = 800

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 32/312 (10%)

Query: 26  PEISINEIA--ENFSKLACNA-------HTIC------NSELRPLGTGLYPVISIINHSC 70
           P++++  +A   +  +L CNA       HT C       S    L TG++PV+S++NHSC
Sbjct: 480 PDVTVWGVAMLRHMLQLQCNAQAITTIQHTGCKEGIVTGSRQVRLATGIFPVVSLLNHSC 539

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E++  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 540 SPNTSVSFISTVATIRASQRIGKGQEIVHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 599

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--IASE 186
              G    +      E + C    C   +      +G     CG     E +++  +AS 
Sbjct: 600 HSEGH--GMAARPRWEAFCCSS--CRAPM------QGDDVLSCGRRGCAESVRRDHLASR 649

Query: 187 VNILSKK---TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 243
           +  L ++      L S G  +  +      ++  +         L +  + L +    L 
Sbjct: 650 LQDLRQQVGMAQKLLSDGELEPAIQQLSGCQRDAESFLSAEHAVLGEIEDGLAQACAALG 709

Query: 244 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
           DW+++  + Q ++ V +  +      +G + +   ++ +       A+ ++ +A E+L +
Sbjct: 710 DWQKSATHLQNSLRVVEARHGPSSVEMGHELFKLAQIFFNGFAVPEALSTIQKAEEVLSL 769

Query: 304 THGTNSPFMKEL 315
             G     ++EL
Sbjct: 770 HCGPWDDEIQEL 781


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 449

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 59/107 (55%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
           +N     F KL+ NA ++       +GT +   ++ INHSC PN V + EG  A + A+Q
Sbjct: 155 VNAFLSLFCKLSINAMSLVTPAFDAIGTCMDSTLARINHSCQPNCVFMVEGATARLVALQ 214

Query: 90  HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
            + KG  V ISY++T      R   L+++Y FTCTC +C++  +  D
Sbjct: 215 TLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSD 261


>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
          Length = 648

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/236 (27%), Positives = 106/236 (44%), Gaps = 30/236 (12%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINE----------IAENFSKLACNAHTICN-SELR 53
           D +  L++  +A ++N IL+    +  E          +  N   +  +AH +      R
Sbjct: 375 DPEDYLVWTLMAAMLNTILRHGNYTNTEQPDDGFIGYLLLHNLQIVNYSAHDVAELQRKR 434

Query: 54  P--------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IE 103
           P        +G  LYP++++ NHSC P  V  F G    VR ++++  G  +  +Y  + 
Sbjct: 435 PNEAGKSVAIGAALYPLLALFNHSCDPGIVRYFSGTTVHVRTIKNIAAGQIIAENYGPLY 494

Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQESAILEGYRCKDDGCSGFLLRD-- 160
           T      R+++L   Y F C C  C +  Q +  + ES I   +RC    C G LL D  
Sbjct: 495 TKMERTERRQSLAANYKFQCNCDACRQDWQLYTHMDESVIR--FRCTGPACQGALLFDLT 552

Query: 161 SDDKGFTCQQCG-LVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEVVSTYKMI 213
           S+  G  C  CG LV  +E I ++  E N+LS+   A  L + G  +  +S Y  I
Sbjct: 553 SECYGMQCDDCGELVDIRERI-RLLQEANMLSRFNEAAHLYAIGLFEHALSKYAAI 607


>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
          Length = 352

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 49/74 (66%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K 
Sbjct: 4   VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQ 63

Query: 115 LKEQYLFTCTCPRC 128
           L++QY F C C RC
Sbjct: 64  LRDQYCFECDCSRC 77


>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 396

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI---ETAGSTMT 110
           G+GLY + S  NHSCLPNA + F     +  V A + +P G E+ ISY+   + + S  +
Sbjct: 296 GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSGQEICISYLSECDLSRSRHS 355

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           RQ  LKE YLFTC CP+C+      D+
Sbjct: 356 RQNILKENYLFTCDCPKCLSEADEPDV 382


>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 311

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 64/119 (53%)

Query: 10  LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
           + Y  I  + N +    EI+ +E+ + F+K++ N+  I N +    G+ L    S I+HS
Sbjct: 132 MFYRYIGIIKNKLGGALEINESEVLDIFTKMSINSVFILNGKFLNFGSCLSLEFSAIDHS 191

Query: 70  CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           C PNAV +F GR  VV+A+  +    +V ++Y +       R+  LK+QY F CTC  C
Sbjct: 192 CRPNAVYMFIGRTLVVKALCDIANFDDVRVTYTDITQPRRMRRNFLKDQYFFDCTCEEC 250


>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
          Length = 716

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 428 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 487

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 488 PHESRMGVAERQQRLSSQYFFDCRCGAC 515


>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
          Length = 777

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/299 (23%), Positives = 129/299 (43%), Gaps = 15/299 (5%)

Query: 21  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           L LQ    +I  I +  SK     + + +S    L TGL+PV+S++NHSC PN  + F  
Sbjct: 471 LQLQCNAQAITTIQQTGSK----ENMVTDSRQVRLATGLFPVVSLLNHSCSPNTSVSFVS 526

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
            +A VRA Q + +G E+L  Y   E+      RQ+ L+ QY F C C  C    Q    +
Sbjct: 527 TVATVRASQQIGEGQEILHCYGPHESRMGVAERQQKLRSQYFFDCDCLAC----QNGKCR 582

Query: 139 ESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 196
            +A    E +RC    C G L++  D        C    S++ +     ++     K   
Sbjct: 583 AAAGPRWEAFRCH---CCGELMQGDDVLSCGSASCTESASRDRLVSRLQDLRQRVGKAQT 639

Query: 197 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 256
           L   G  +  +      ++  +         + +  + L +    L DW ++ A+ Q ++
Sbjct: 640 LLRNGKVERAIQLLLGCQRDAESFLSAEHSLVGEIEDALAQAYAALGDWHKSAAHLQRSL 699

Query: 257 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            V +  +      +G + +   ++ +       A+ ++ +A ++L + +G  +  ++EL
Sbjct: 700 RVVEVRHGPASVEMGHELFKLAQIFFNGFAIPEALNTIQKAEKVLLVHYGRGNEEIQEL 758


>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
          Length = 423

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 6/105 (5%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           PE+ +    E + K+  N++ ICN E   +GTGLY   SI++HSC PNA  V+EG    +
Sbjct: 151 PEVGL----EIYGKMIINSYCICNDEHTAIGTGLYMGPSILDHSCSPNAHAVYEGHKLHL 206

Query: 86  RAVQHV--PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           RA + +       + +SYI+       RQ+ L+ QY F C C +C
Sbjct: 207 RAAEDINCSNFDGIRVSYIDVMAPKKVRQEELRSQYYFECNCMKC 251


>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
 gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
          Length = 666

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
          Length = 743

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 455 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 514

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 515 PHESRMGVAERQQRLSSQYFFDCRCGAC 542


>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
          Length = 666

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
          Length = 799

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N   I NS    L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGVFPVVSLLNHSCRPNTSVSFTGTVATIRAAQRIAKGQEILHCYG 570

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRC 128
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 510

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/314 (23%), Positives = 131/314 (41%), Gaps = 37/314 (11%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V+ + +   + ++  +E F +  C    I   E+R +      GLYP ++++NHSC+ 
Sbjct: 170 NIVDYLRK--RLKLDRFSEEFIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVS 227

Query: 73  N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           N   ++   + R+ + R    VP G E+  SY  +   TM R++ L E   F C CPRC 
Sbjct: 228 NTSHSISPIDYRIRL-RTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRC- 285

Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEV 187
                D  +    +   +C        L  DS D   T  C  C    S + ++KI   +
Sbjct: 286 ----SDPTELGTHMSSLKCNKCDNGVILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKII 341

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
                   A++       ++    +I+K +  L HP    L   R  L ++   ++++  
Sbjct: 342 QAEVDTVEAISGADGADAILERENVIKKYRSVL-HPHHAFLTILRHSLTQMYGRVDEYLL 400

Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA 290
                   +  +  C+L + V   V P +  + G+  Y          K +W  G  + A
Sbjct: 401 DDLPDVVLEHKVDMCRLLLQVLDIVEPGYSRIRGMTLYELHAPLLFLAKGQWNAGVIDEA 460

Query: 291 -IKS-MTEAVEILR 302
            +KS M EA  IL+
Sbjct: 461 GLKSKMIEAATILK 474


>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
 gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
          Length = 503

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 216 LQK 218
           LQ+
Sbjct: 383 LQQ 385


>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
 gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
 gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
          Length = 804

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 130/315 (41%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PV+S++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVVSLLNHSC 542

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A ++A Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSMSFISTVATIQASQRIRKGQEILHCYGPHKSRMGVAERQQELRSQYFFDCACPAC 602

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                          E +R         +  +   + F C  CG     +++    S   
Sbjct: 603 QT-------------EAHR---------MAAEPRWEAFCCNSCGAPMQGDDVLHCGSR-- 638

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELERAV 677

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L  CQ     +  ++ + H L+G       +    LGD + A   +  ++ ++ + HG
Sbjct: 678 QQLLGCQRDAESF--LWAE-HALVGEIADGLARACAALGDWQKAATHLQRSLRVVEVRHG 734

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 547

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 7/116 (6%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 73
           + +NL +L  +W       +A  F +L  N+  + ++  R  G   +P  +  NHSC PN
Sbjct: 197 EYSNLTDLDNKW-------VASLFGRLNANSFCLTSAFGRRRGVYFHPGPARFNHSCDPN 249

Query: 74  AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           A   F      +RA++ + K  ++ ISY++T  S  TR+  L+E+Y F C CP+C+
Sbjct: 250 ASYSFAKGKCYIRAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCL 305


>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 570

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 51/91 (56%)

Query: 40  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
           L+ NA  I   +  P G    P +++ NHSC PNA++V   R   +RA++ + K  ++ I
Sbjct: 194 LSTNAFRITLPDNTPFGMCFSPTLALANHSCKPNAIIVSNSRSISLRALRPIKKNEQIFI 253

Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           SYI+      +RQ  LKE+Y FTC C  C K
Sbjct: 254 SYIDPTEDLPSRQSKLKERYFFTCKCDSCEK 284


>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
          Length = 338

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 57/94 (60%), Gaps = 1/94 (1%)

Query: 36  NFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
           +F+KL  N    C++ ++   G  +Y  +S  NHSC PNA +V+ G  AV+R+++ +P+G
Sbjct: 109 SFNKLMPNTFGWCDTFDMSSFGAVVYAELSRANHSCQPNAAVVYNGAAAVLRSMRDIPEG 168

Query: 95  AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
            EV ISY++   +   R++ L + Y F C C RC
Sbjct: 169 EEVCISYVDPTLARDVRRRELVQSYGFACDCARC 202


>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
 gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
          Length = 626

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 16/120 (13%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVL----------VFEGRLAVVRAVQHVPKGAEVLI 99
           ++ R +GTGL+P++S+ +HSC PN              + G + +V+ V+ V KG E+ I
Sbjct: 287 ADFRDIGTGLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCI 346

Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS-GFLL 158
           SYI+    T  R++ L     F C C RC+      + +E+ ++  Y C +  C+ G+L+
Sbjct: 347 SYIDIMNPTCIRRRELWYSKYFVCRCSRCMS-----ETEENRMVRAYSCGESECTEGYLV 401


>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
          Length = 800

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 66/127 (51%), Gaps = 14/127 (11%)

Query: 35  ENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL-----VFEGRLAVVRAV 88
           E +++   N + I +S L   G G+YP+ S + NHSC+PNAV       FE     V A+
Sbjct: 531 EAYTRFG-NNNFIIHSHLNTFGHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVAL 589

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------QESA 141
           + + +G E+ I Y++ A    TRQ+AL+  Y FTCTCP C   G  D          E A
Sbjct: 590 RDIAEGEEITIPYLDPALPLHTRQEALRLNYGFTCTCPLCSWAGHLDPAPPPARGSAELA 649

Query: 142 ILEGYRC 148
           ILE   C
Sbjct: 650 ILESQLC 656


>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
 gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
          Length = 497

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 216 LQK 218
           LQ+
Sbjct: 383 LQQ 385


>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
          Length = 565

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 80/153 (52%), Gaps = 20/153 (13%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 81
           S  ++ +  SK   N+ T+ ++ L  +G    P+ ++INHSC PNAV+VF          
Sbjct: 215 SGKDLVDLMSKFTTNSFTLTSTFLNAIGVATAPIPALINHSCQPNAVVVFPATRKGAPPT 274

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
           L V+ A+Q + +G EVL +Y++       RQK+LKE Y F C+C  C KL    D++E  
Sbjct: 275 LDVI-AIQPIRRGEEVLAAYVDITLPREIRQKSLKETYAFECSCTLC-KLPGTLDLRECV 332

Query: 142 ILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCG 172
                +C +    GFL   S  KG +  C +CG
Sbjct: 333 FCP--KCNE----GFL---SSHKGQSAPCSKCG 356


>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 508

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 1/94 (1%)

Query: 40  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
           L  NA T+ N     +G  + P  S  NHSC PNA + F+G +  ++A+Q +    ++ I
Sbjct: 187 LETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQIFI 246

Query: 100 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
           SYI+       RQ  LK++Y FTC CP+C++ GQ
Sbjct: 247 SYIDNTEPWEKRQSELKKRYFFTCKCPKCVQ-GQ 279


>gi|428180198|gb|EKX49066.1| hypothetical protein GUITHDRAFT_136248 [Guillardia theta CCMP2712]
          Length = 719

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 15/112 (13%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           EI     +L     TIC+ E RP+GTGLYP+ S+ NH C PN ++ F G    +RA + +
Sbjct: 230 EILMTICRLYNYGFTICDEEERPIGTGLYPLASLCNHDCAPNCLVTFCGPSLQLRATEEI 289

Query: 92  PKG---------------AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
             G                E+ ISY+++A     R+K L ++YL  C+C  C
Sbjct: 290 EMGQVQMARVDWRLRSGLQEITISYVDSAEPHWLRRKTLHKRYLINCSCSLC 341



 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 13/138 (9%)

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
           S     E++  + +  +  + + H  +++L++ RE +    +E + WK AL+     +  
Sbjct: 572 SADRANELLQMFILAVQKAESILHDNNLSLLRLREAMQSQAVEAKSWKFALSTSYKVLRR 631

Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFL------------GDT-ENAIKSMTEAVEILRITH 305
             ++YP+  P +GLQ    GKL  FL            GD  + A+K + EA  IL  TH
Sbjct: 632 LYKIYPRCSPYVGLQEAFLGKLCMFLSGEPDKTLCHGFGDPLQEAVKHLREAKLILSGTH 691

Query: 306 GTNSPFMKELILKLEEAQ 323
           G+    ++++   L+EA 
Sbjct: 692 GSEHVLVQDINRLLQEAH 709


>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
          Length = 483

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 19/131 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V++ LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 146 VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 205

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 206 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 265

Query: 120 LFTCTCPRCIK 130
            F CTC  C K
Sbjct: 266 YFDCTCEHCQK 276


>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
          Length = 732

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 74/307 (24%), Positives = 137/307 (44%), Gaps = 43/307 (14%)

Query: 39  KLACNAHTIC--------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
           ++  NAH I         N     + T +YP  S++NH+C P  +  F+G   +VRA+++
Sbjct: 426 QMVSNAHAITELCLIDENNERQERIATAIYPSASLMNHNCDPTVINSFQGNTLIVRAIRN 485

Query: 91  VPKGAEVLISYIETAGSTMTRQK------ALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
           V +G EV   Y    G    R +      AL+ QY FTCTC  C+     +D Q+  ++ 
Sbjct: 486 VRQGDEVFNCY----GPHYRRMRRSERVEALEAQYSFTCTCDSCLD-KNTEDFQD--VIY 538

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-------EIKKIASEVNILSKKTLAL 197
            + C    C G L+  + +      Q  L RS +       ++K     V++ ++ T A+
Sbjct: 539 SFSCP--SCQGSLINPTGNNSSAQNQMALCRSCKTPQSYFTQLKADLEAVSLDARGTEAM 596

Query: 198 TSCGNHQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 254
               +  +++   K++ K   L+ K       +L ++ +KL +    +  ++E     ++
Sbjct: 597 ----DQGDIMGAIKLLTKCVQLRSKALFKGHPDLGKSADKLAQCYAFIGKYEECEKMLRI 652

Query: 255 TIPVYQRVYPQFHPLLG--LQYYTCGKLEWFLGDTENA----IKSMTEAVEILRITHGTN 308
           ++   +  Y ++   +   LQ +T   +E   G  + A    +  + EA+ I RI +G  
Sbjct: 653 SLTAVEHRYGRYSIEMANELQKFTDVLMELASGRNQRARDELVNYLEEAMLIYRIHYGPW 712

Query: 309 SPFMKEL 315
           S   KEL
Sbjct: 713 STSYKEL 719


>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
          Length = 542

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E S+ +  E   KL  NA    +++L  +G  L P +++ NHSCLPNA + F GR AV+R
Sbjct: 166 EESVRQAVEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLR 225

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A   + KG E+ ISY +   S++ ++KA    Y F C C RC +
Sbjct: 226 AESRIQKGDEIEISYTDYT-SSLGKRKAALAPYNFECRCRRCTQ 268


>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
 gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
          Length = 499

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PN+   F+ +    VVRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305


>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 499

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 70/154 (45%), Gaps = 17/154 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  ++ +  S+   N  T+ ++ L P+G  + P ++  NHSC PNAV+VF          
Sbjct: 148 SAGDLVDLISRFTTNTFTLTSASLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASK 207

Query: 79  EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
           E  L +V A++ +  G E+ ISY++T      RQK L E Y FTC C  C K    D   
Sbjct: 208 EPSLNLV-ALRDIAPGKEIRISYVDTTLPKDLRQKELTEVYNFTCQCKLCTKPPAVDP-- 264

Query: 139 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
                E   C         L   +D  F C +CG
Sbjct: 265 ----RESLWCPKKCGGTCPLPTEEDNLFRCAKCG 294


>gi|308499663|ref|XP_003112017.1| CRE-SET-18 protein [Caenorhabditis remanei]
 gi|308268498|gb|EFP12451.1| CRE-SET-18 protein [Caenorhabditis remanei]
          Length = 507

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 68/314 (21%), Positives = 132/314 (42%), Gaps = 24/314 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           DE+++  + ++   + L      I   +  + F K + N+H+I  +    +G  L   +S
Sbjct: 150 DEEKMKHFEKVFTKMGLFRGDEMIEKGQFFDVFCKASINSHSIHTNAGNEVGMALDLGVS 209

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
           + NHSC P   +VF+G    +R +  VP        +  ISYI+   S   R+K LK ++
Sbjct: 210 MYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAESTEQAFISYIDVGRSKYVRRKDLKSRW 267

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
            F C C RC+      D ++ A L   RC +  C   +L    ++     C +C  +  +
Sbjct: 268 YFDCECTRCM------DPEDDA-LTAIRCANPACDAPVLTSETEEPMNIACDKCKTIIEE 320

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
            ++KK    +  L             Q+++S  +++        HP +V + + R  L  
Sbjct: 321 NDVKKAQEFMKSLPASFDPQCPADILQDLLSKAEVV-------LHPSNVYVARLRTALFH 373

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +  +L     +  + Q+    Y+  +P+    +G Q     K      + + A+    +A
Sbjct: 374 VTGQLTMDNLSTMHTQI-YNNYKLCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 432

Query: 298 VEILRITHGTNSPF 311
           + I  +  G + P+
Sbjct: 433 MNIFEVCFGLDHPY 446


>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
 gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
          Length = 497

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305


>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
          Length = 781

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 458 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 517

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 518 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 569

Query: 173 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 203
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 570 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 629

Query: 204 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 240
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 630 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 689

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 690 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 749

Query: 301 LRITHGTNSPFMKELILKLEEAQAEA 326
               +G+++  +   +  L++  A+A
Sbjct: 750 FSKYYGSHAETLFSYLPCLKQETAKA 775


>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 319

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 7/123 (5%)

Query: 32  EIAENFSKLACNAHTICNSELRPL-GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 90
           E+ + + KL  N+ ++ +   R + G  LY   S+ NHSC PN   VFEG    VRA++ 
Sbjct: 178 EVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFVFEGSRLSVRAIKR 237

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           +  G E  ISY+ +   +  R++ L+  Y FTC CPRC+      D     ++   +C +
Sbjct: 238 IEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRCL------DSARDNLMLCVKCPN 291

Query: 151 DGC 153
           + C
Sbjct: 292 ESC 294


>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 683

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 62/265 (23%), Positives = 111/265 (41%), Gaps = 11/265 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L T L+PV+S++NHSC PN  + F+GR A  RA + +P+G E+L  Y           R+
Sbjct: 413 LATALFPVLSLLNHSCDPNTSVTFDGRTATARASRAIPRGQEILHCYGPHRCRMKPSERR 472

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDSDDKGFTCQQ 170
           + L  QY F C C  C      +    S +     C+    G   FL   S         
Sbjct: 473 QRLLAQYFFECRCSACTDETGPESASGSTLFCCPTCRAPMQGEGLFLCCSSAS------- 525

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
           C  +  KE  +    E+    +  + L + G     V   +      ++L  P  + + +
Sbjct: 526 CKAMLPKEGFQHQLRELRQHMEAAMDLLAQGKADRSVEQLRKCHLEAQQLLSPGHLVVGE 585

Query: 231 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
             + L ++     +W EA A+ + +I V +  Y      +G + +   ++ +       A
Sbjct: 586 IEDHLAQVHATQGNWPEAAAHLRRSIEVVEAHYGAGSIEVGQELFKLAQVLFNGRFVPEA 645

Query: 291 IKSMTEAVEILRITHGTNSPFMKEL 315
           +++  +A  +L +  G   P ++EL
Sbjct: 646 LQAAEKAEAVLSVQLGPRGPEVQEL 670


>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
 gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
          Length = 496

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 27/209 (12%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 171
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 172 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKVNSNSLRTLVQYLNEKSDSFIPNSNYVVIDAKMSVLQR 382

Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELED 244
           LQ+            TR +  + + +L D
Sbjct: 383 LQQGTRGLLRGAGAHTRLRYSRDITQLMD 411


>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
 gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305


>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 798

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 586

Query: 173 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 203
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 587 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 646

Query: 204 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 240
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 647 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 706

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 707 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 766

Query: 301 LRITHGTNSPFMKELILKLEEAQAEA 326
               +G+++  +   +  L++  A+A
Sbjct: 767 FSKYYGSHAETLFSYLPCLKQETAKA 792


>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
          Length = 500

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCAGLVV 305


>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
 gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
          Length = 503

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PN+   FE +    +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKNFTCKCPRC-----SDPTEKGAYISGLYCRDTSCNGLVV 305


>gi|16553701|dbj|BAB71564.1| unnamed protein product [Homo sapiens]
          Length = 313

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 113/271 (41%), Gaps = 52/271 (19%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 36  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQ 95

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
           + L+ QY F C CP C         +   +  G R                + F C  CG
Sbjct: 96  QKLRSQYFFDCACPAC-------QTEAHRMAAGPRW---------------EAFCCNSCG 133

Query: 173 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
                +++ +  S             SC   +  VS   ++ +LQ         +L Q  
Sbjct: 134 APMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ---------DLQQQV 170

Query: 233 EKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 290
               K+L   ELE   + L+ CQ     +  ++ + H ++G       +    LGD + +
Sbjct: 171 RVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKS 227

Query: 291 IKSMTEAVEILRITHGTNSPFMKELILKLEE 321
              +  ++ ++ + HG +S  M   + KL +
Sbjct: 228 ATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 258


>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
          Length = 558

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 67/315 (21%)

Query: 26  PEISINEIA--ENFSKLACNAHT-------------ICNSELRPLGTGLYPVISIINHSC 70
           P+++I  +A   +  +L CNA               + +S    L TG++PVIS++NHSC
Sbjct: 287 PDVTIWGVAMLRHMLQLQCNAQAMTTIQRTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 346

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 347 SPNTSVSFISTVATIRASQWIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 406

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 188
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 407 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 442

Query: 189 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 246
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 443 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 481

Query: 247 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 306
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 482 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 538

Query: 307 TNSPFMKELILKLEE 321
            +S  M   + KL +
Sbjct: 539 PSSVEMGHELFKLAQ 553


>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Monodelphis domestica]
          Length = 845

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 12/266 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L TG +PVIS++NHSC PN  L F G + +++A + + +G E+L  Y   E      TRQ
Sbjct: 513 LATGFFPVISLLNHSCRPNTSLSFRGSVGIIQASRLIAQGEEILHCYGPHECRMDVTTRQ 572

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC---Q 169
           + L+ QY F C C  C +  +   +  +    G+RC    C   L  D     F C   +
Sbjct: 573 QKLRSQYFFDCHCQAC-QNEEVCPVSTAPKPSGFRCPS--CKAALQGDD----FLCCDNR 625

Query: 170 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 229
            C    S+ +++    ++    K  L L       + +      ++  +       + + 
Sbjct: 626 TCTASASRAQLEHQVQDLQQQVKAALELLRDDRSAQAIQLLLGCQQDAENFLSSEHMLMG 685

Query: 230 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           +  + L +    L DW  + A+ Q ++ V +  +      LG + +   ++ +       
Sbjct: 686 EIEDHLAQAYASLGDWSRSAAHLQKSLLVVEARHGPASVELGHELFKLAQVLFNGFAVSK 745

Query: 290 AIKSMTEAVEILRITHGTNSPFMKEL 315
           A+ ++ +A ++L I +   +  M+EL
Sbjct: 746 ALCAIEKAEKVLSIHYDPENEQMQEL 771


>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
 gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
          Length = 544

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 72/132 (54%), Gaps = 2/132 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPLGTG 58
           M++I  +    + +I+     + ++   +++E  I    +KL  N++   +     LG  
Sbjct: 195 MAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMICHYGAKLELNSYNFHSPLADRLGFY 254

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+      TRQ  L E+
Sbjct: 255 LHPYAALINHSCDYNSVVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKTRQTELSER 314

Query: 119 YLFTCTCPRCIK 130
           Y F C CP+C K
Sbjct: 315 YYFKCRCPKCAK 326


>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
           jacchus]
          Length = 798

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 522 LATGIFPVISLLNHSCSPNTSVSFVSTVATIRASQRIGKGQEILHCYGPHKSRIGVAERQ 581

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F CTCP C
Sbjct: 582 QKLRSQYFFDCTCPAC 597


>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
 gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
          Length = 492

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC CPRC+     D  ++ + + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRCL-----DPTEKGSFISGLYCRDTNCSGLVV 305


>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
 gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
          Length = 584

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 64/312 (20%), Positives = 126/312 (40%), Gaps = 59/312 (18%)

Query: 29  SINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           S+++  + F KL      + +S E  P+G  +Y   S+ NHSCLPN    F  R   +R 
Sbjct: 240 SVDDPPDQFRKLTSVGDALTSSLEQVPVGQAIYKAASLFNHSCLPNIHAYFLSRTLFIRT 299

Query: 88  VQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
            ++V  G  + +SY    G +    R + L ++Y F C C  C +L   D      +L  
Sbjct: 300 TEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYSFRCQCRGCSQLNLSD-----LVLNA 354

Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE----------------EIKKIASEVNI 189
           + C +  C+G +L  +   G T +     R+ E                +I  +AS    
Sbjct: 355 FCCVNHNCAGVVLESTIINGETRKLNNFPRAPEKQKFDSHLQGHKLNIVDINDVASLALK 414

Query: 190 LSKKTLALTS-----CGNHQEVVSTYKMIEK----------------------------- 215
            +  +L +       CG+H+++ ++++ I K                             
Sbjct: 415 FNNSSLHIHPGFCLHCGSHRDLDASHEAINKAWSYIKRLQEAIISKDISGTTLLDASRAL 474

Query: 216 -LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
            + +   H ++ ++ +  + L +    + D++ A  +C+ +I + Q +Y   H ++G + 
Sbjct: 475 GILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHCKESIKILQTLYDPDHIVIGYEL 534

Query: 275 YTCGKLEWFLGD 286
                ++  L D
Sbjct: 535 VKLASIQLSLDD 546


>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
          Length = 513

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E ++ +  E   K+  NA    + +L  +G  L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNVRKAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A + +  G E+ ISY +       R++AL EQY F CTC RC
Sbjct: 216 AERPIQAGDEIEISYTDYTMPLSARREAL-EQYSFECTCARC 256


>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
 gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
 gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
          Length = 391

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 32/65 (49%), Positives = 47/65 (72%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDV 229

Query: 92  PKGAE 96
             G E
Sbjct: 230 EAGEE 234



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 306 G 306
           G
Sbjct: 368 G 368


>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
 gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 537

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 3/142 (2%)

Query: 7   KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           K   L A  A +   +LQ  E ++ +  E   K+  NA    +++    G  L+P +S++
Sbjct: 171 KDFELQATGAVVYAGLLQSDE-TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMV 229

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NHSC+PNA + FE R A ++A + +  G E+LISYI+       RQ++L+  Y F C C 
Sbjct: 230 NHSCVPNAYITFEKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLR-LYHFQCNCV 288

Query: 127 RC-IKLGQFDDIQESAILEGYR 147
           RC   L  ++ IQ   +  G +
Sbjct: 289 RCKDDLNVYEVIQACGVSPGVK 310


>gi|380801673|gb|AFE72712.1| SET and MYND domain-containing protein 4, partial [Macaca mulatta]
          Length = 212

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 112
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 26  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQ 85

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F CTCP C
Sbjct: 86  QKLRSQYFFDCTCPAC 101


>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
          Length = 558

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 84
           E++   + E   K+  NA T   +     G GL+    ++++NHSC+PNA++ F GR A 
Sbjct: 184 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 238

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +RA++ + +G E+ ISYI+   S   R+KAL E Y F C C RC
Sbjct: 239 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCTRC 281


>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
           troglodytes]
          Length = 452

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 19/131 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 119
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 120 LFTCTCPRCIK 130
            F CTC  C K
Sbjct: 271 YFDCTCEHCQK 281


>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
           castaneum]
          Length = 393

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 79/302 (26%), Positives = 126/302 (41%), Gaps = 47/302 (15%)

Query: 33  IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAVLVFEGR 81
           I ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN    F   
Sbjct: 92  ITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVTNYFMSD 151

Query: 82  LAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
             VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C      D   E
Sbjct: 152 TIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS-----DPAHE 206

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
             +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K +   S
Sbjct: 207 LDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQKYMESDS 260

Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI--- 256
               + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  L +   
Sbjct: 261 SNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFHLWLENE 316

Query: 257 -------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
                         +Y+      H L G Q   C  L+ FL + E  IK   EA  IL +
Sbjct: 317 KARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EAKAILNL 373

Query: 304 TH 305
            +
Sbjct: 374 YY 375


>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 606

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 52/207 (25%), Positives = 80/207 (38%), Gaps = 16/207 (7%)

Query: 43  NAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
           NAH +   NS       GLYP +S++NHSCLPN V    G    VRA++ V  G ++ ++
Sbjct: 306 NAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVT 365

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
           YI        R + L +   F C C RC+      +      LE  +C   GC G+LL  
Sbjct: 366 YINIMEPRRIRARELMDTKHFACACERCV---SPLETHPDRFLEAVKCGAGGCDGWLLEQ 422

Query: 161 -----SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN------HQEVVST 209
                S+   + C  C        +       +I      A     N      H      
Sbjct: 423 RADNLSEGDTWKCTVCATAVPSRSVDGRTGPQDITESFRAAWHQAANILRYKGHAAGRPV 482

Query: 210 YKMIEKLQKKLYHPFSVNLMQTREKLI 236
           ++ +    + + HP  V+   +   L+
Sbjct: 483 FEAVLARAEGVLHPMHVSAFDSLMPLV 509


>gi|443926535|gb|ELU45157.1| zf-MYND domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 584

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 46/264 (17%)

Query: 31  NEIAENFSK------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----- 79
           N +   F K      +  NAHT+    L P+G  L PV ++INHSC+PN V+VF      
Sbjct: 189 NSVKAGFGKRYRGSIIEANAHTLSTPSLTPIGVALSPVAALINHSCVPNCVVVFPKASET 248

Query: 80  ---GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-----IKL 131
                  ++ A+  +  G EV I +  T G      K L+E+Y F C CP C     +K 
Sbjct: 249 KKVPNEMLIIAIGKILPGEEVQIPH--TLGK-FADNKILQERYFFKCHCPNCEMTSLVKT 305

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
              D  +       +RC    C GFL   S       +Q                 N+ S
Sbjct: 306 PYVDSRR------AFRCASKTCEGFLPMPSMQSQLGQKQ---------------RFNVYS 344

Query: 192 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV--NLMQTREKLIKILMELEDWKEAL 249
             + AL++    Q    + K   K ++      S   +   T  + + +   L D + A 
Sbjct: 345 AISCALSNPRRSQMQSDSEKKGSKGRRNFNSLVSYFRSPGLTYHQSLTLCAGLLDPERAF 404

Query: 250 AYCQLTIPVYQRVY-PQFHPLLGL 272
            Y    IP+  R + P  HPLL L
Sbjct: 405 KYTSNLIPMVSRFFHPSAHPLLAL 428


>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
           24927]
          Length = 526

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 54/92 (58%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           + KL  N+ ++      P+G  L    ++ NHSC PNAV++F+GR   +R+++ + K  E
Sbjct: 196 YCKLLINSASVVTQTFDPIGLCLAYQSAMFNHSCDPNAVMMFDGRQLSIRSLKEITKDTE 255

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + ISYI+   S   R++ LK +Y F C+C  C
Sbjct: 256 ITISYIDNLASRKERKEELKSRYFFDCSCSLC 287


>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
           2509]
          Length = 547

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 2/120 (1%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           ++ +  E   K+  NA    +++    G  L+P +S++NHSC+PNA + FE R AV++A 
Sbjct: 172 TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAVLKAE 231

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 147
           + +  G E+LISYI+       RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 232 RDLEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 290


>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 302

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 26  PEISINEIAENFSKLACNAHTICNSE---LRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
           P   + E+      + CN+HT+  SE     P+GT +Y   S  NHSCLP+A    EG  
Sbjct: 198 PGYGVEELVRLMCCVRCNSHTLYASEEWPSEPVGTAVYLKGSAFNHSCLPSAEFYNEGTS 257

Query: 83  AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
             VR+V+ +  G EV ISY+    +   R++AL  QY F C C R
Sbjct: 258 LRVRSVRDISAGEEVTISYVPVTETLWDRRQALWRQYKFDCECDR 302


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/287 (26%), Positives = 121/287 (42%), Gaps = 27/287 (9%)

Query: 14  QIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL--RPLGTGLYPVISIINHSC 70
           + A LV  +L  P  +S  E  +  +++  N+H   +  +  R LG GLYP+ S+INHSC
Sbjct: 208 ETAKLVMPLLPKPARLSAREFVKVCARINTNSHRQHHMFVPQRILGVGLYPLASLINHSC 267

Query: 71  LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            PN      G    +R +  V +G E+  SYI+   S   R+  L E   F C C RC  
Sbjct: 268 QPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRCSP 327

Query: 131 LGQFDDIQESA--ILEGYRCKDDG---CSGFL-LRDSD----DKGFTCQQ-CGLVRSKEE 179
                 I +S    L G++C +     C G L L D +    +K  +C   CG  ++  +
Sbjct: 328 -----PITDSVDRYLSGFQCPNKAKTSCDGLLVLPDGEGLASEKPVSCTAGCGETKTAGD 382

Query: 180 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY------HPFSVNLMQTRE 233
           + K      I+   + +L    + +    +  M++KL    Y      HP+   + Q   
Sbjct: 383 LLKAQKRAEIMLTASQSLYY--DQKRTSDSKTMLDKLLTGGYSPDVQLHPYHPTVFQAHV 440

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
             I +   L     ++ +C+  +   + V P  H      YY  G L
Sbjct: 441 DSINVSDALGYTHMSIDHCEAVLRCAEAVLPMNHLETSNYYYYQGML 487


>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
           africana]
          Length = 478

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 44/308 (14%)

Query: 19  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 72
           VN  L+ WP    + S   I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VNTFLEYWPPQNQQFSTQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210

Query: 73  NAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           N  ++F  G    V+++ H        +S  +  G T   Q  +K   L TC    C+ L
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQ------MSKAQDPGKT-GPQFVIKLVGLTTCFFS-CLAL 262

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 191
           G        A+ EG   KDD      L   DD           +  +E+ K   E+  LS
Sbjct: 263 GV------EALPEGL--KDD----LFLGVKDDP----------KPSQEVVK---EMIQLS 297

Query: 192 KKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 247
           K TL       S G + EVV   +   + Q+ ++   ++  ++    + ++L  L+ ++E
Sbjct: 298 KDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFEE 357

Query: 248 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 307
           A  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG 
Sbjct: 358 AADYAKRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGP 417

Query: 308 NSPFMKEL 315
           + P  K+L
Sbjct: 418 SHPITKDL 425


>gi|388854233|emb|CCF52152.1| uncharacterized protein [Ustilago hordei]
          Length = 938

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  E+ +   + A N+ T+ +S+L PLG  ++P +++INH+C PNA +VF          
Sbjct: 481 SAKELLDLVCQFASNSFTLTDSDLNPLGVCMHPSMAMINHACTPNAAVVFPFGGAAKNGQ 540

Query: 79  ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
                 E R+  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C K
Sbjct: 541 QKWNDGEDRIMQLVALRAIEPGEELLISYVDIADTYQDRRNYLKKRYCFDCRCELCRK 598


>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
 gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
          Length = 498

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
 gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
          Length = 496

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|344296426|ref|XP_003419908.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Loxodonta
           africana]
          Length = 427

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/319 (21%), Positives = 137/319 (42%), Gaps = 39/319 (12%)

Query: 5   DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 61
           +EK+ L+ + IA L +     L++P+     +   F+++ CN  TI + EL  +G+ ++P
Sbjct: 143 NEKKDLIQSDIAALHHFYSKHLEFPDT--ESLVVLFAQVNCNGFTIEDEELSHVGSAIFP 200

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
              +   S +P ++ VF+  L    +         V  SYI+    T  R   L++ Y F
Sbjct: 201 DPVV---SGVPGSLTVFQAHLF---SQMFASLCGMVFTSYIDLLYPTEDRNDRLRDSYFF 254

Query: 122 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 181
           TC C  C    +     ++A +E  R   D      +RD            +VR    + 
Sbjct: 255 TCECQECTTKDK-----DNAKVE-IRKLTDPPKAEAIRD------------MVRYARNVI 296

Query: 182 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 241
           +           +  L  C   QE +S+          ++   +V ++    + + + + 
Sbjct: 297 EEFRRAKHYKSPSELLEMCALSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLY 346

Query: 242 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 347 MQDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIM 406

Query: 302 RITHGTNSPFMKELILKLE 320
            + HG + P++ E+  ++E
Sbjct: 407 EVAHGKDHPYISEIKHEIE 425


>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
          Length = 713

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/302 (26%), Positives = 126/302 (41%), Gaps = 47/302 (15%)

Query: 33  IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAVLVFEGR 81
           I ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN    F   
Sbjct: 412 ITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVTNYFMSD 471

Query: 82  LAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
             VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C      D   E
Sbjct: 472 TIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS-----DPAHE 526

Query: 140 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
             +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K +   S
Sbjct: 527 LDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQKYMESDS 580

Query: 200 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI--- 256
               + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  L +   
Sbjct: 581 SNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFHLWLENE 636

Query: 257 -------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 303
                         +Y+      H L G Q   C  L+ FL + E  IK   EA  IL +
Sbjct: 637 KARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EAKAILNL 693

Query: 304 TH 305
            +
Sbjct: 694 YY 695


>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Taeniopygia guttata]
          Length = 797

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 120/294 (40%), Gaps = 22/294 (7%)

Query: 39  KLACNAHTI-CNSELRP------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           +L CNA  I    EL P            L T  +PV+S++NHSC PN  + F G  A V
Sbjct: 491 QLQCNAQAITVMQELGPGDGAVVDKKPVRLATAFFPVLSLLNHSCCPNTSMSFSGTAATV 550

Query: 86  RAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
           RA Q +  G EVL  Y        +  ++ L  QY F C CP C++  +       +I  
Sbjct: 551 RASQPISSGQEVLHCYGPHWCRMRVAERQQLLRQYFFECRCPACLEELESGVKSVVSIRN 610

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
            + C    C   +  + D    + + C    S+  +     ++    KK L +   G   
Sbjct: 611 SFCCPK--CQAQMQGEEDTLCCSNEACATSASRNHLSGRLQDLQQQIKKALGMLRVGKAD 668

Query: 205 EVVSTYKMIEKLQ---KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 261
           + +   KM+ K Q        P  + + +  + L ++   L  W+EA  + + +I + + 
Sbjct: 669 QAI---KMLLKCQMDAGTFLSPEHLLMGEMEDHLAQVYATLGKWQEAARHLKKSIEIVEM 725

Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
            +       G + +   ++ +       A+ ++  A  IL +  G  +  ++EL
Sbjct: 726 HHGPSSVETGHELFKLAQILFNGFAVSEALSTIQRAEGILSVHFGPQNAQIQEL 779


>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 422

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
            G  +Y   S+ NHSC PNA  V  GR   + A++ +    E+ ISY+     +  R+  
Sbjct: 251 FGLAIYLEASLFNHSCYPNAARVQRGRSIDIIAIRDIEPNEEICISYLNITNGSHERKDH 310

Query: 115 LKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
           LK  YLF C C RC +     ++I  S I    R K   CSG+L      +G   +QC  
Sbjct: 311 LKNNYLFDCVCIRCTQTNPDIENIVRSFICRNPRVK---CSGYLYL---PQGSDSRQCNF 364

Query: 174 VRSKE--EIKKI 183
            + KE  EIK+I
Sbjct: 365 CQWKESSEIKRI 376


>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
           FGSC 2508]
          Length = 668

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 2/120 (1%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           ++ +  E   K+  NA    +++    G  L+P +S++NHSC+PNA + FE R A ++A 
Sbjct: 192 TLKQAMEVLCKMQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAE 251

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 147
           + +  G E+LISYI+       RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 252 RDIEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 310


>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
          Length = 666

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 16/139 (11%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQ 112
           +G GL+P  +++NHSC P A++ + G  AVVRA++ + K  E+ I+Y  T      ++R+
Sbjct: 472 IGIGLFPTAALLNHSCNPEAIVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDEELSRR 531

Query: 113 KALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 166
             LKE + F CTC  C++      + D  Q   + E        C   LL D   D+K  
Sbjct: 532 HQLKETHFFHCTCKACLEGWPMWLEMDQNQPDWLCE-------ACGSILLSDKIEDNKFA 584

Query: 167 TCQQCGLVRSKEE-IKKIA 184
            C++C   ++ E+ I K+A
Sbjct: 585 KCKKCSHRQNLEDAINKLA 603


>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
 gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
          Length = 506

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTKCSGLVV 305


>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 564

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 84
           E++   + E   K+  NA T   +     G GL+    ++++NHSC+PNA++ F GR A 
Sbjct: 190 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 244

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +RA++ + +G E+ ISYI+   S   R+KAL E Y F C C RC
Sbjct: 245 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCMRC 287


>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
           Y34]
 gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
           P131]
          Length = 542

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 55/90 (61%), Gaps = 1/90 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K+  N+    +++    GT L PV+++ NHSC+PNAV++F  R A +RA   + +G+E+ 
Sbjct: 188 KIQTNSFDRFDADTGASGTFLDPVLAMANHSCVPNAVVLFWRRKAYLRAEMPIKQGSEIS 247

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           ISYI+       RQ+ L   Y FTC CPRC
Sbjct: 248 ISYIDYTKPVRFRQEDL-WLYHFTCKCPRC 276


>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
 gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 593

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)

Query: 31  NEIAENFSK---LACNAHTICNSELR--------PLGTGLYPVISIINHSCLPNAVLVFE 79
           +++ +N SK   L   AH +  +  R        P+G  L P+++  NH C PNA + F+
Sbjct: 201 DDLEKNNSKWDDLLLQAHALSTNAFRVESNVGNGPIGLCLDPLLARANHCCYPNAAITFD 260

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           G+ A +RA+  +  G ++ ISYI+       RQ AL+E + F C C RC   
Sbjct: 261 GKRATLRALFPIKNGEQIFISYIDETQRQEVRQAALEETWFFKCRCSRCTNF 312


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 403 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 458

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 459 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 505


>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
 gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
          Length = 781

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/364 (19%), Positives = 153/364 (42%), Gaps = 52/364 (14%)

Query: 7   KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISI 65
           + ++L +Q+      +++   +  +  +++  KL+ +   + +S E  P+G  +Y   S+
Sbjct: 423 QAIILVSQVRVNAMAVVRMKSVDAHCPSDHLVKLSHSGDALTSSVEQVPVGQAIYTAGSL 482

Query: 66  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTC 123
            NHSC PN    F  R   +R  +H+  G  + +SY    G      R K L+++Y F C
Sbjct: 483 FNHSCQPNVHAYFLSRTLFIRTTEHLATGCPLELSYGPQVGQRDCKDRLKFLQDKYSFRC 542

Query: 124 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK-K 182
            C  C  +   D +Q +     +RC D  C G +L  S       +     R+ E  +  
Sbjct: 543 HCNGCSIVNLSDLVQNA-----FRCIDLNCVGVVLDRSVINSEIKKLKNFPRAPERQRLD 597

Query: 183 IASEVNILSKKTLALT------------SCGN-------HQEVVSTYKMIEKLQ------ 217
           +  +V+ L+   L L+            +CG+       H+ + + +   ++LQ      
Sbjct: 598 LCLQVDDLAHLALELSNGPLHIQPGCCLNCGSYCDLEAVHEGMRTAWIYFKRLQDAIVVK 657

Query: 218 -----------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 260
                            + + H ++ ++ +  + L +    +ED++ A  +C+ +I + +
Sbjct: 658 KISTTVITDASRALGALRSILHAYNKHIAEAEDILAQAFCLVEDFQSARDHCRASIKILE 717

Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
            +Y   H ++G +      ++  +GD  +A+ S+     I     G+++ F+   +  L 
Sbjct: 718 MLYGPDHIVIGYELIKLSTIQLSMGDL-SAVDSINRLGAIFERYFGSHADFIFPYLQTLR 776

Query: 321 EAQA 324
           E  A
Sbjct: 777 EKLA 780


>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
           aries]
          Length = 391

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/65 (47%), Positives = 46/65 (70%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229

Query: 92  PKGAE 96
             G E
Sbjct: 230 EAGEE 234



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 246 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 305
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 306 G 306
           G
Sbjct: 368 G 368


>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
           scrofa]
          Length = 788

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 77/171 (45%), Gaps = 22/171 (12%)

Query: 25  WPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
           +PE +I  +A   +  +L CNA  I  + ++  G  ++P++++  HSC PNA + F   +
Sbjct: 481 YPEWNIWGVAMLRHMLQLQCNAQAI--TAIQQTGDSIFPIVNVFXHSCSPNASVSFISTV 538

Query: 83  AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
           A VRA Q + KG EVL  Y   E+      R + L+ QY F C CP C +  Q     E 
Sbjct: 539 ATVRASQPIRKGQEVLHCYGPHESRMGVAERHQKLRSQYFFDCNCPPCEREKQRPSALEP 598

Query: 141 -----------AILEG---YRCKDDGCSGFLLRDSDDKGFT--CQQCGLVR 175
                      A+L+G     C    C   + R   D       QQ GL R
Sbjct: 599 RREAFCCHRCRALLQGDDVLSCGSTSCPASVRRHHLDSRLQDLQQQVGLAR 649


>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
 gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
          Length = 73

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 46/73 (63%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           GTG++P    +NHSC PN+V VF G    ++A++ +P G E+ ISYI+      TRQ+ L
Sbjct: 1   GTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEEL 60

Query: 116 KEQYLFTCTCPRC 128
           + Q+ F C C RC
Sbjct: 61  QTQFCFYCQCHRC 73


>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
          Length = 403

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 110 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 165

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 166 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 212


>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 675

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 78/168 (46%), Gaps = 8/168 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G GL+P I++ NHSC P  +  F G   VVRA++ +  G E+  +Y  I T      R+
Sbjct: 478 IGGGLFPTIALFNHSCNPGVIRYFIGTTMVVRAIRSIAAGEEISENYGPIFTMSPEAERK 537

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDSDDKGFT--C 168
           + L+ QY F C C  C+      D  +  IL  ++C D G  C   L   +D   F   C
Sbjct: 538 RTLRWQYWFDCNCEACVGHWPLLDDIDPTILR-FKC-DTGRECGNILPVKTDTNEFMIRC 595

Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
            +CG   +  +  K   + + L +        G H E +S+Y  I KL
Sbjct: 596 PKCGKNTNILKGLKALQDTDSLFRVASNNLEDGKHVEALSSYLKILKL 643


>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
          Length = 675

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 85/179 (47%), Gaps = 6/179 (3%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
            A T+  ++   +G G+YP ++++NHSC P  V  F G   +VRA++ +  G E+  +Y 
Sbjct: 466 GAKTLAKAKSMFIGGGVYPTVAMLNHSCNPGVVRYFIGTTMIVRAIRTISVGEEISENYG 525

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLR 159
            I T    + R++ L+ QY F C C  C       D  +  IL  ++C+    C   LL 
Sbjct: 526 PIFTTTPEIERKRKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLLV 584

Query: 160 DSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 216
            SD K F   C +CG   +  +  K+  + + L +        G +++ +  Y  I KL
Sbjct: 585 RSDTKEFMIGCAKCGKSTNILKGLKVLQDTDALFRVASTNLEEGRNEQALKAYLEILKL 643


>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
           1558]
          Length = 885

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 1/102 (0%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           T ++P +S+INHSCLPN V +  G +AVVRA+  +  G E+ ISY        TR    K
Sbjct: 553 TAIWPTVSMINHSCLPNTVHIAWGDVAVVRALNDMSAGTELTISYFSREQPYETRAAKAK 612

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
            QYLFTC C  C      + I+   ++E +          LL
Sbjct: 613 -QYLFTCDCNLCAADRMDESIRRGKLMEKFDMAKQASRNILL 653


>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 306

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 2/87 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LGT L    S +NHSC P A + FEGR   VRA+  +P G E+  +Y++ +GS  +RQ  
Sbjct: 9   LGTTLDLAASFMNHSCNPGAFVFFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQAT 68

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESA 141
            + +Y F C C RC      +D+Q+ A
Sbjct: 69  TEAEYFFQCHCVRCED--DLEDLQQIA 93


>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 389

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 62/119 (52%)

Query: 12  YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 71
           +  + + VN  L     +   + +  ++L  NA T+ N     +G  + P  S  NHSC 
Sbjct: 73  FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 132

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           PNA + F+G +  ++A+Q +    ++ ISYI+       RQ  L+++Y FTC CP+C +
Sbjct: 133 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWGKRQSELQKRYFFTCKCPKCAQ 191


>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
 gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
          Length = 499

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            LYP+  ++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
 gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
          Length = 680

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 2/103 (1%)

Query: 28  ISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 85
           I++NEI E +S +  N H + +    R  G G++P  S +NHSC PNA     E  + V 
Sbjct: 379 ITLNEIIELYSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVF 438

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           R+++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 439 RSLRPLKKGEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQC 481


>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
 gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe]
          Length = 483

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 52/92 (56%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           F +LA NA  +  S    LG  L  ++  +NHSC PN  ++F+G +  + + + + K  +
Sbjct: 163 FCRLAVNAMNLVTSSFDSLGMCLDTILCRLNHSCDPNCQIIFDGAIVQLVSKRDIKKDEQ 222

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + ISYI+       RQK L ++Y F+C CPRC
Sbjct: 223 LFISYIDIRLPKSIRQKQLLKKYFFSCYCPRC 254


>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 448

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 4/125 (3%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K+ CN   I + + + LG  + P+ S  NHSC PN V V    L   +A+  +PKG+++ 
Sbjct: 272 KIRCNQFGIWSKKDKCLGVSVTPIASYFNHSCCPNIVDVRGTTLLEFKALHFIPKGSQLC 331

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG-FL 157
           ISY++   +T +RQ  L   Y F C C RC   G   D  ++ I + Y      CSG + 
Sbjct: 332 ISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKG---DSIDNWISQFYCQGTKKCSGTYF 388

Query: 158 LRDSD 162
           L D +
Sbjct: 389 LEDEE 393


>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 377

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 62/119 (52%)

Query: 12  YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 71
           +  + + VN  L     +   + +  ++L  NA T+ N     +G  + P  S  NHSC 
Sbjct: 62  FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 121

Query: 72  PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           PNA + F+G +  ++A+Q +    ++ ISYI+       RQ  L+++Y FTC CP+C +
Sbjct: 122 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELQKRYFFTCKCPKCAQ 180


>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 555

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 54/96 (56%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           IAE  +KL  N+    N     LG  ++P  +++NHSC  NA + F+     V+A+  + 
Sbjct: 233 IAEYAAKLDVNSFNFDNVMDERLGIYMHPYAALMNHSCDYNATVAFDDDRLHVKALHPIK 292

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           KG ++ I+Y++T      RQK L ++Y FTC C +C
Sbjct: 293 KGEQIFITYVDTTNPYKIRQKELSDRYYFTCRCSKC 328


>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
          Length = 576

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 25  WP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 82
           WP  + S+    E   K+  N+    + ++   GT L   +++ NHSC+PNAV+ F GR 
Sbjct: 206 WPATDASLAIAVEAMCKIHTNSFDRSDVDVAYTGTFLDAHLAMANHSCVPNAVVSFAGRK 265

Query: 83  AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A +RA Q +  G EV ISYI+       RQ+ L + Y FTC C RC
Sbjct: 266 AFLRAEQAIRAGDEVTISYIDYTKPKSVRQRGL-DLYHFTCDCARC 310


>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 495

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 59/106 (55%), Gaps = 1/106 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+    +A   +KL  N+  +  S    +G  ++P   +INHSC  N+ + F+G    V+
Sbjct: 175 EMDEGTVASYAAKLDLNSFNLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVK 234

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           A++ + KG ++ ISYI+T      R+  LKE+Y F C C +C K+G
Sbjct: 235 AMRPIKKGEQIFISYIDTTTPYDIRRNELKERYFFDCQCTKC-KMG 279


>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
 gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
          Length = 405

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 45/73 (61%), Gaps = 2/73 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 115
           GL+P  S +NHSCLPNAV    G + VV A + + +G+EVLI+Y+  A       RQ  L
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVVGGIMVVVAARKIRQGSEVLINYLGRASLRPVGERQGQL 181

Query: 116 KEQYLFTCTCPRC 128
            E Y F+C CPRC
Sbjct: 182 AEGYHFSCDCPRC 194


>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 459

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           + SI    E   K+  NA    + +    G  L PV+++ NHSC+P+A + F+ R AV+R
Sbjct: 89  DFSIPRAREILCKIQTNAFNRLDPDAGMTGIYLDPVLAMANHSCMPSAFVSFDQRNAVLR 148

Query: 87  AVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRC 128
           A + + +G E+ I Y++ T  +   RQ+ALK  Y F C CPRC
Sbjct: 149 AWRDIKEGDEITICYVDVTLPNKAARQEALK-LYHFECRCPRC 190


>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
          Length = 333

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 81
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 40  VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 95

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 96  MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 142


>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
           magnipapillata]
          Length = 1044

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)

Query: 40  LACNAHTIC----------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
           L CNAH +           +SEL+ +G+ +Y  +S++NHSC P+ V    G   V+RA++
Sbjct: 385 LPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTCVLRAIK 444

Query: 90  HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
           H+ +G+E++ +Y  +    S + RQ  L EQY F C C  C
Sbjct: 445 HIKEGSEIVDNYGFLYAVESKVIRQSHLMEQYYFACQCEAC 485


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 64/121 (52%), Gaps = 12/121 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 79
           S  ++ +  S+   NA T+    L PLG  + P ++++NHSC PNAV+VF         E
Sbjct: 207 SAGDVVDLISRFTTNAITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQE 266

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
             + V+ A++ +    EV  +YI+T      RQK L++ Y FTC+C  C    +  D +E
Sbjct: 267 PVMQVI-ALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCA--AEDPDPRE 323

Query: 140 S 140
           S
Sbjct: 324 S 324


>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
 gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
          Length = 750

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 118/272 (43%), Gaps = 11/272 (4%)

Query: 51  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
           E R + + +YP  S++NH+C P+ ++ F   + V RA  ++  G+ +   Y         
Sbjct: 468 EQRRIASAIYPTASLLNHACDPDVLVSFVDGVLVARATHNIAPGSGITHCYGPHVNHMPR 527

Query: 111 --RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 166
             RQK L +QY FTC C  C    + ++ +       + C    C    ++ S  +    
Sbjct: 528 EERQKLLYKQYFFTCQCSACTSDEEMENTR--LCFSAFACPRCKCP---MKTSPLEPSLA 582

Query: 167 TCQ--QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
            CQ  +C L +S EE    + +  +L  K +        QE +  ++   + + ++ HP 
Sbjct: 583 RCQNKKCTLEKSIEEELSHSRQAELLFFKAVRTMERIGVQEALGLFQECLRTRTQILHPH 642

Query: 225 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 284
             +L +T + L +    + D+K A  +C  +    ++ +         + +   +L +  
Sbjct: 643 HKDLAETHDALARCYAMIGDFKLASQHCLQSSEAVEKAFGSTSVEYAHELHKLSQLLFND 702

Query: 285 GDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
              + A+  + +A  +L   +G N P ++EL+
Sbjct: 703 RQAKKALPMIDKAASLLATYYGRNHPDVQELV 734


>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
 gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
          Length = 497

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 17/188 (9%)

Query: 58  GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           GLYP + ++ HSC+PN V      +G   + R    +P+G ++   Y  T   T TRQ A
Sbjct: 200 GLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAA 259

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQ 170
           LK    FTC CPRC+     D  +        +C+   C   L+  S    D+  + C  
Sbjct: 260 LKSSKYFTCRCPRCL-----DPTELGTHFSSLKCQK--CDNGLIVSSNPLDDEAEWRCSN 312

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
           C        ++K    + ++  +   L+   N  E ++ ++ + K  + + HP       
Sbjct: 313 CEFKTRGAAVQKA---IQVMQAEIDQLSCLENGPEKLTAFEKLYKKYRSILHPLHFIKTS 369

Query: 231 TREKLIKI 238
            R  LI++
Sbjct: 370 IRHSLIEL 377


>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 426

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 67/128 (52%), Gaps = 2/128 (1%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           M+   +K+ L Y  I  ++ ++    E + +E+   F+KL  NA  + N +L   G  L 
Sbjct: 125 MNPEHKKRFLTYLDI--IIYMLGGSLETNESEVLNIFTKLMINASFMLNEKLIDFGGCLC 182

Query: 61  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
              S I+HSC PNA+ +F G   VV+A+  +    +V ++Y++ +     RQ+ LK Q+ 
Sbjct: 183 LEFSAIDHSCRPNAIYMFNGHTLVVKALCEIANFDDVRVAYVDMSQPRSIRQELLKNQFF 242

Query: 121 FTCTCPRC 128
           F C C  C
Sbjct: 243 FDCNCEEC 250


>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
 gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
          Length = 634

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 78/159 (49%), Gaps = 18/159 (11%)

Query: 36  NFSKLACNAH----------TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           +   L CNAH          ++  SE   +G G+Y  +S+ NHSC PN    F G   VV
Sbjct: 408 HLQSLPCNAHEISELQLSLKSVATSEAAEIGAGIYGTLSLFNHSCEPNVTRFFYGDKCVV 467

Query: 86  RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDI--QES 140
           RA   +P   EV+ +Y  +        RQ++L+ QY F C C  C++    + ++  Q+ 
Sbjct: 468 RAFSSIPCRGEVVDNYGILSALTPRKQRQESLQSQYYFKCNCHACLEDSPLYSELIKQDV 527

Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 179
             L+   C+    +G +L  +D K   C++CG+ +S E+
Sbjct: 528 PQLKCANCR-MALAGEIL--TDGKLVKCEKCGVPQSLED 563


>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
          Length = 582

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 65/314 (20%), Positives = 131/314 (41%), Gaps = 24/314 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           DE+++  + ++   + L  +   I   E  + F K + N+H+I  +    +G  L   +S
Sbjct: 211 DEEKMTHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGNEVGMALDLGVS 270

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
             NHSC P   +VF+G    +R +  VP        +  ISYI+   S   R++ LK ++
Sbjct: 271 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSKFVRRRDLKSRW 328

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
            F C C RC+      D ++ A L   RC +  C   +L    ++     C +C  +  +
Sbjct: 329 YFNCECTRCM------DPEDDA-LTAIRCANPSCDAPILTSETEEPMNIACDKCKTIVEE 381

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
            ++K     +        +L +  + +      + +    +++ HP +V + + R  L  
Sbjct: 382 NDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNVYVARLRTALFH 434

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +   L     +  + Q+    Y+  +P+    +G Q     K      + + A+    +A
Sbjct: 435 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 493

Query: 298 VEILRITHGTNSPF 311
           + I  +  G + P+
Sbjct: 494 MNIFEVCFGLDHPY 507


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 120/285 (42%), Gaps = 35/285 (12%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           G  +Y  +S +NHSC+PNA L F  G +A +  +  +  G ++ ISY++   S  TRQ+ 
Sbjct: 183 GVAIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQ 242

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 174
           LKEQY F CTC  C      +  Q+        C     S   L  S             
Sbjct: 243 LKEQYYFDCTCRLCTANANGESRQQ--------CISQFDSAVDLYQS------------- 281

Query: 175 RSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQT 231
            +K++    ++  +IL++  ++L L  C +H  ++     +IE L  +  +   + L   
Sbjct: 282 -TKKQTNDASASQSILTRALESLELIVCKSHSVLIRLRTDLIEILVGQRNYAKVIELSND 340

Query: 232 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-WFLGDTENA 290
           +    + +     WK + +  +  I      YP + P+  +      K   W + D  + 
Sbjct: 341 QLDAFRCV-----WKASQSI-EKGISESDNAYPVW-PMESVSSVDIFKASMWTMDDDLDR 393

Query: 291 IKSM-TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 334
           +  +  +A   LRITHG   P   +L   LE+   E   + S  D
Sbjct: 394 VAELGKQAAHNLRITHGEKHPMYLDLTEGLEDVLRELEARHSGVD 438


>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 704

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 59/278 (21%), Positives = 124/278 (44%), Gaps = 34/278 (12%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           + T +YP +S++NH+C PN +  F   +  VRA++ + +G E+   Y    G  +T  RQ
Sbjct: 436 IATAVYPTVSLMNHACQPNVIASFRKGIISVRAIEKIMRGDEIQHCYGPQVGHMTTSDRQ 495

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---------DGCS-----GFLL 158
           +AL  QY FTC C  C +  +  D +E   ++  +C             C+     G L+
Sbjct: 496 QALLNQYCFTCRCRACTRKPRTFDKEEDLCIKCPQCGQPLNIQTSMCGKCAERIDVGVLI 555

Query: 159 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
            +  + G T     L+  +E      ++  ++             +EV+S  K    + +
Sbjct: 556 HELTNAGTT-----LIGLEEMFSAAVNDDTLM-------------REVISKTKSCIDVLE 597

Query: 219 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 278
           ++  P  + L    + + K  +EL+++KEA  +     P  +  + +    +  + Y   
Sbjct: 598 RIIIPPDMQLATAYDDMAKCHVELDEFKEAALWLAKATPSIESRFGRDSIEVAHELYKLA 657

Query: 279 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
           ++ +   +   A++++  A+E+    +G ++  + EL+
Sbjct: 658 QIYFNGKEIAPAMETIDRALELFIRHYGNSNEEVLELV 695


>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
 gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
          Length = 546

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 123/291 (42%), Gaps = 34/291 (11%)

Query: 24  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 83
           +W E   +EI +    L  N H +  +E  P    +Y   S++ HSC PN    F  +  
Sbjct: 224 KWSE---DEILKIVGILQVNGHEVPLTE--PPSVAIYNNASMLEHSCRPNLSKSFTSKKE 278

Query: 84  VV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
           +V  A   + +G  + ISY +    T  RQ  L++  LF CTC RC+   +F     +  
Sbjct: 279 IVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGTYLSALK 338

Query: 143 LEGYRCKDDGCSGFLLRDSDDK---GFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALT 198
             G++ KD  CSG LL ++      G+ C +C GLV  KE         NIL +  +   
Sbjct: 339 CSGFK-KDSNCSGLLLPENLKNWYGGYICNKCRGLVDGKE-------ITNILDRARVDHE 390

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL----------MELEDWKEA 248
           +     E    +K I    + L  P    L+  +  L +I+          +  ED    
Sbjct: 391 AMQKDNE-QHCHKYIAHYGRWL-GPNHHLLVDVKISLSQIMGGGDPNAIQKISDEDLMTK 448

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD-TENAIKSMTEAV 298
           +  CQ  I V+QRV P    ++G   +   +L   L +     ++SM  AV
Sbjct: 449 MKICQELIDVFQRVCPAEARVIGTTRF---ELHAALAEFARRGVESMNSAV 496


>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
          Length = 492

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 52/94 (55%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           + ++  N+ T+  S + PLG    P+    NHSC PNA ++ +G     RA++ + K  E
Sbjct: 181 YGRVFTNSLTLIGSAMEPLGICFDPLACSANHSCDPNAFVIMDGAQLSFRALKPIAKDEE 240

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           +LISYI+       RQ  L  +Y FTC+C +C +
Sbjct: 241 ILISYIDGTLPFERRQTDLTRRYYFTCSCTKCAQ 274


>gi|32564211|ref|NP_871849.1| Protein SET-18, isoform b [Caenorhabditis elegans]
 gi|25005018|emb|CAD56600.1| Protein SET-18, isoform b [Caenorhabditis elegans]
          Length = 507

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 133/319 (41%), Gaps = 28/319 (8%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           DE+++  + +I   +        I   E  + F K   N+H+I  +    +G  L   +S
Sbjct: 146 DEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGNEVGMALDLGVS 205

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
             NHSC P   +VF+G    +R +  VP        E  ISYI+   S   R++ L  ++
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSKYIRRRDLNSRW 263

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
            F C C RC+      D ++ A L   RC +  C   +L    ++     C++C  +  +
Sbjct: 264 YFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNIACEKCKTI-VE 315

Query: 178 EEIKKIASEVNILSKKTLALT---SCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLMQTR 232
           E+  K A E      KTL  +    C    E +   K+ E L K  ++ HP +V + + R
Sbjct: 316 EDTVKAAQEY----MKTLPASFDPKCPAEIEALPG-KLKELLAKAEQILHPSNVYVARLR 370

Query: 233 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 292
             L  +   L     +  + Q+    Y+  +P+    +G Q     K      + + A+ 
Sbjct: 371 TALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMP 429

Query: 293 SMTEAVEILRITHGTNSPF 311
              +A+ I  +  G + P+
Sbjct: 430 YAFDAMNIFEVCFGLDHPY 448


>gi|281347298|gb|EFB22882.1| hypothetical protein PANDA_013199 [Ailuropoda melanoleuca]
          Length = 1019

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 29/59 (49%), Positives = 43/59 (72%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           S++ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 130 SQVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEE 188


>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
          Length = 802

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
           + L+ QY F C CP C +  Q          E +RC
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGR--EAFRC 619


>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
 gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
          Length = 650

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 84/177 (47%), Gaps = 9/177 (5%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G GLYP +++ NHSC P     + G    VR V+++P  + V  +Y  + T      R+
Sbjct: 448 IGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIPADSMVAENYGPLFTQVRRDERR 507

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDS--DDKGFTC 168
             L  QY FTC C  C++        +  ++  +RC D G  CS  LL  +  +D    C
Sbjct: 508 DTLLHQYRFTCQCVPCVENWPLFTEMDPGVIR-FRC-DSGKICSNVLLIPAAVNDFMVKC 565

Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
            +CG   +  +  K   + ++L K    L S G ++  +  Y +M+E + + L  P+
Sbjct: 566 TECGEHTNIMKGLKSLQDTDMLFKTATRLHSAGEYEAALRKYIEMMETMSEVLVPPY 622


>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
 gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
          Length = 393

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 90
           I E ++K+   A    N+E    G+GLY + S INHSC+PNA     +   + V++AV  
Sbjct: 267 IDELYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKAVAP 322

Query: 91  VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
           + KG E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IEKGDEICISYLDECQLERSRHSRHKTLRENYIFVCQCPKC 363


>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
          Length = 544

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 54/93 (58%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N++   +     LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+
Sbjct: 234 AKLELNSYNFHSPLADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEI 293

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           L+SYI+       RQ  L E+Y F C CP+C K
Sbjct: 294 LVSYIDATNPYKIRQTELLERYYFKCRCPKCGK 326


>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 386

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 120/302 (39%), Gaps = 47/302 (15%)

Query: 29  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEG------- 80
           S  +IA     +  N  T+ +   L+ +G GL+P + ++NH C PN  +V          
Sbjct: 116 SREDIAHVLGIIKTNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHGNQSATN 175

Query: 81  ------RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
                 R   +RA+  V +G E+ +SY++   ++  RQ+ L E + F CTC R       
Sbjct: 176 PALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFHCTCQR------- 228

Query: 135 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI-ASEVNILSKK 193
                             CS  L      K              E++++ A     L K 
Sbjct: 229 ------------------CSQHL------KDDAMTAAADGADAAEVQEVTAFSQECLEKI 264

Query: 194 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 253
              LT   ++QEV++      + Q+ L     +  ++     ++ L  L  + EA AY  
Sbjct: 265 DRCLTG-RDYQEVLALCSACLQKQRGLLADTHLCHLRVLSAAVEALSYLRCFPEAAAYAL 323

Query: 254 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK 313
             +  Y+++YP  H  LG+     G      G  E A + + +A   L ++HG N    +
Sbjct: 324 RLLQGYRKLYPPNHAQLGVAAMRAGVAHLQAGMPEAAHELICQAYRTLLVSHGPNHSTTR 383

Query: 314 EL 315
           +L
Sbjct: 384 DL 385


>gi|222613106|gb|EEE51238.1| hypothetical protein OsJ_32096 [Oryza sativa Japonica Group]
          Length = 718

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/351 (22%), Positives = 149/351 (42%), Gaps = 58/351 (16%)

Query: 9   LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           +LL  QI  N + ++       +  + + FS  + +   +C+ E   +   +Y   S  N
Sbjct: 352 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 409

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 124
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 410 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 469

Query: 125 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 148
                            CP+   LG           ++    +I E + C          
Sbjct: 470 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 529

Query: 149 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 199
            +D    G L   SD     D GF  +C+ Q  L  +     +  S++N L +    L S
Sbjct: 530 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 585

Query: 200 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 257
             N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+ +I 
Sbjct: 586 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 644

Query: 258 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
           + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 645 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 695


>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
 gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
          Length = 562

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E ++ +  E   K+  NA    +++    G  L   ++++NHSC+PNA + F+ R AV+R
Sbjct: 189 EETLEKAREVLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGFDRRTAVLR 248

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A + + +G E+ ISYI+ A     RQ+AL+  Y F C CPRC
Sbjct: 249 AERPIQEGEEITISYIDNALPKSARQEALR-LYHFRCDCPRC 289


>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
 gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
          Length = 650

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 86/179 (48%), Gaps = 11/179 (6%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G GLYP + + NHSC P     F G    +R ++++P G+ V  +Y  + T      R+
Sbjct: 447 IGAGLYPTLVLFNHSCDPGITRYFVGNAVYIRTIKNIPAGSMVAENYGQLYTRAGRRERR 506

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRC-KDDGCSGFLLR--DSDDKGFTC 168
           K L E Y F C C  C +   +  + E + ++  ++C  ++GC   LL   +S +    C
Sbjct: 507 KLLAENYKFDCCCQACEE--DWPSLHEMNPMIRRFKCGANEGCGNELLFKLNSTENEMEC 564

Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
            +CG +       +   +V+ L++   A  L S G+ +  +S Y  +I  L + L  P+
Sbjct: 565 NKCGGLTEVNASFETLKQVDFLNRYNDAARLYSQGDFERALSKYAALINSLDEILVPPY 623


>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
 gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%)

Query: 32  EIAENFSKLACNAHTIC---NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           E+ +    + CNAHT+     S L P+G  +Y   S  NHSC+P+A     G    VR++
Sbjct: 162 ELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVPSAEFCNVGTSLTVRSL 221

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           + V  G E+ +SY+ T  +   R++ L+ QY F C C RC+
Sbjct: 222 RRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARCV 262


>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
           morsitans]
          Length = 395

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%)

Query: 23  LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 80
           L  PE   N++ E    +           L   G+GLY + S INHSCLPNA + F    
Sbjct: 253 LPLPETDKNKLDEYIDDIYHKVGEFAGEFLNNEGSGLYLLQSKINHSCLPNAQVTFPYSN 312

Query: 81  RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
            + V++A+Q +  G E+ ISY++      S  +RQK LKE Y+F C C +C
Sbjct: 313 DIVVLKALQPIQVGEEICISYLDEGQLERSRHSRQKILKENYIFVCECFKC 363


>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
          Length = 802

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
          Length = 348

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/317 (23%), Positives = 133/317 (41%), Gaps = 31/317 (9%)

Query: 28  ISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           +S +E+ E   +   N H I +   +  +G GLY  +   +HSC PN +      +A +R
Sbjct: 37  VSKDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNTIYTCNSFVATLR 96

Query: 87  AVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
            +     +        SYI+   +T  R+K LK+ + F C C RC      DD+  S+IL
Sbjct: 97  GLTAGVDLRNLNSTHYSYIDLINTTQQRRKLLKDTWYFECHCTRC---DDPDDVLLSSIL 153

Query: 144 EGYRCKDDGC----SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK--KTLAL 197
                K   C         +D + +  TC +C  + S E + +  + +  + K  +   +
Sbjct: 154 CPNCPKKRECLCIFGNVPYKDRNTQIITCPKCHNIVSPEYVVEAIAAMRFIDKIVENREV 213

Query: 198 TSCGNHQEVVSTYKMIEKLQKKLYHPFS-VNLMQTREKLIKILMELEDWKEALAYCQLTI 256
                 Q +    + +  L+K+     S VN+     K+I++L+ L D  +    CQL +
Sbjct: 214 EKMSREQSI----RFLTDLKKRFSELLSKVNVFLC--KVIQLLIPLIDISD----CQLLL 263

Query: 257 PVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
            ++       +  +P  HP +G+ Y +             A+    +A E+++ T G+  
Sbjct: 264 DLHLETEECVRFCFPYNHPAVGIHYRSIATFFLKCKQPHRALLYYKKAYEVVKFTLGSKH 323

Query: 310 PFMKELILKLEEAQAEA 326
           P   E    L++A  E 
Sbjct: 324 PMTIETDAMLKDASREV 340


>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
          Length = 802

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
          Length = 802

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 644

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 127/294 (43%), Gaps = 13/294 (4%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           +++ + E  +K   +   +  ++   +   +Y   S++NHSC PN +  ++G    +RA 
Sbjct: 346 NVHAVTEVATKTDSSTSFVATTQQIRIAVAVYGTASMLNHSCTPNVIAGYDGNQLTIRAT 405

Query: 89  QHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
           + + KG EVL  Y           R K L++QY FTC C  C  + Q + I  S  L   
Sbjct: 406 EMIKKGGEVLHCYGPRVSDMFRDERLKVLRDQYYFTCKCMFC-GIPQ-EAITPSGALRCP 463

Query: 147 RCKDDGCSGFLLRDSDDKG----FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 202
           RC D   SG + +  +  G    F C     +   +E  KI +   +       +     
Sbjct: 464 RCSD--MSG-VYQTPEVTGQCCNFDCHHSWKLGETDE--KIENARQLFRVGKTLIGDKNK 518

Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 262
            +E +ST     K+Q+ + +  +  L +T + L        ++  A  Y +L+I     +
Sbjct: 519 VKECLSTLHKAYKIQQSVLYKHNKELAETEDCLALCYASFGNYHYAAKYLKLSIKSNIIL 578

Query: 263 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 316
           Y +    +G + Y   ++ +   +    +  + +AV+I+   +G N   +KEL+
Sbjct: 579 YGEDSIEIGHELYKLAQILFNCKNVTEGLGVIDKAVKIISDHYGENHSDVKELL 632


>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 669

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 15/123 (12%)

Query: 40  LACNAHTICNSELR----------PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
           + CNAH I   EL            +G+GLYP +S++NHSC P       G   VVRA++
Sbjct: 449 IPCNAHEISEYELWRSDITKCHFVEVGSGLYPTMSLVNHSCDPVVTRNCYGETCVVRAIR 508

Query: 90  HVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGY 146
           ++ KG E+     Y+        RQ  LK QY F C C  C+     + DI++  +L  Y
Sbjct: 509 NIYKGEEITDNYGYLYPVHDKSERQTRLKWQYFFECKCDACVHDWPLYPDIED--LLPKY 566

Query: 147 RCK 149
           +C+
Sbjct: 567 KCE 569


>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
 gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 793

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/351 (22%), Positives = 148/351 (42%), Gaps = 58/351 (16%)

Query: 9   LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           +LL  QI  N + ++       +  + + FS  +     +C+ E   +   +Y   S  N
Sbjct: 427 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 484

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 124
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 485 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 544

Query: 125 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 148
                            CP+   LG           ++    +I E + C          
Sbjct: 545 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 604

Query: 149 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 199
            +D    G L   SD     D GF  +C+ Q  L  +     +  S++N L +    L S
Sbjct: 605 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 660

Query: 200 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 257
             N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+ +I 
Sbjct: 661 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 719

Query: 258 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
           + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 720 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 770


>gi|71019315|ref|XP_759888.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
 gi|46099457|gb|EAK84690.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
          Length = 739

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  E+ +   + +CN+ T+ +S+L  LG  ++  ++++NH+C+PNA +VF          
Sbjct: 423 SATELLDLVCQFSCNSFTLADSDLNALGVCMHASMAMLNHACIPNAAVVFPFGGAAKGGQ 482

Query: 79  ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
                 + ++  + A++ +  G E+LISY++   +   RQK LK++Y F C C  C K
Sbjct: 483 QRWNDGDDKIMQLVALRAIEPGEELLISYVDLCDTVEERQKQLKQRYCFDCRCDLCCK 540


>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
 gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 386

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 54/117 (46%), Gaps = 1/117 (0%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K  CN   I     + +G  + P  S  NHSC+PN   V +G     +++  + KG ++ 
Sbjct: 231 KTRCNQFGIWTKNDKCIGVAVSPSSSYFNHSCIPNCTDVRDGSNMTFKSLYPIKKGDQLT 290

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
           ISYIE       R+  LK  Y F C CPRC       D  ++ I + Y C    C+G
Sbjct: 291 ISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSIDSMDNWISKFY-CSQKKCTG 346


>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
           TFB-10046 SS5]
          Length = 531

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 115/296 (38%), Gaps = 58/296 (19%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 84
           S  ++ +  S+   N+ T+ +  L P+G  + P +++ NHSC PNAV+VF    A     
Sbjct: 188 SAGDLVDLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVVVFPSAAAPLGDE 247

Query: 85  ----VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
               + A++ +    EVL SY++ +     R++ LKE Y FTC C  C  L   DD    
Sbjct: 248 TLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSACSSL---DDPSVG 304

Query: 141 AILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
                  C   GC G         G   C +CG                       A  S
Sbjct: 305 DPRYAVWCP-KGCGGVCRIPELGSGIVICLRCG-----------------------AEYS 340

Query: 200 CGNHQEVVSTYKM-IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 258
                E +   ++  E L+K            TR       +EL D   AL      +P+
Sbjct: 341 PAKQDEALDRVRLGTEALEKA-----------TR-------LELSDPDRALRLTTNMLPL 382

Query: 259 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI-LRITHGTNSPFMK 313
               Y   HPLLGL       L  FL  + +  K++ EA+ +  RI  G    F++
Sbjct: 383 IGMPY-SSHPLLGLLRLHSSLLISFLAASPSN-KTLDEAIAMNFRIQRGMQDVFVE 436


>gi|403414107|emb|CCM00807.1| predicted protein [Fibroporia radiculosa]
          Length = 304

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 9/104 (8%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 79
           S  ++ +  S+   NA T+ +S L  +G  + P ++++NHSC PN V+VF         E
Sbjct: 10  SPGDLVDLISRFTTNAFTLTSSSLSAIGVAISPTVALVNHSCDPNVVIVFPRNPSTSHAE 69

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
             +  V A++ +  G E+L +Y++       RQK LKE Y FTC
Sbjct: 70  EPMMTVVAIKPILPGEEILSAYVDVTQPRELRQKELKETYNFTC 113


>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 572

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 40  LACNAHTICN---SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
           L CNAH I     S L  +G G Y  +S+INHSC PN V        ++RA++ + KG E
Sbjct: 384 LPCNAHEISEYYCSNLLEIGAGAYATLSLINHSCDPNVVRHSCRNTVILRAIKPIKKGEE 443

Query: 97  VL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           +     Y         RQKAL +QY FTC C  CI
Sbjct: 444 LFDNYGYHYATHELQERQKALLKQYYFTCQCKACI 478


>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
          Length = 256

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 10/106 (9%)

Query: 29  SINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVV 85
           S++E I E ++K+     T  N+E    G+ LY + S INHSC PNA +VF     +  +
Sbjct: 125 SVDELIDELYNKMDEVVGTFLNNE----GSALYAMQSKINHSCTPNAEIVFPKSNHVLAL 180

Query: 86  RAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
           RA++ +  G E+ ISY++      S  +RQK L+E YLF C C RC
Sbjct: 181 RALRDLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERC 226


>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
 gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
          Length = 849

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 62/233 (26%), Positives = 103/233 (44%), Gaps = 19/233 (8%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
            AH IC  E   +   L+   S  NHSC PN  + +  R       + +P    + ISY 
Sbjct: 391 GAH-ICGIEHVVVAQALFVRGSKFNHSCSPNVHVSYVKRTLRAHCTEALPAFCPLEISYG 449

Query: 103 ETAGSTMTRQKA--LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR- 159
              G +   Q+   L++ YLF C+C  C +LG   D+   A    Y C   GC G +L  
Sbjct: 450 VQVGESEKSQRKIWLRDHYLFECSCEPC-RLGLVSDLLHIA----YYCCQPGCQGIVLDP 504

Query: 160 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA-LTSCG---NHQEVVSTYKMIEK 215
           D+ D+        L      ++ +    NIL +K L  L  CG   N  E ++    + +
Sbjct: 505 DASDEDIL-----LPGGSGTVRFLIEHCNILPQKELEDLQICGSRLNGNEAIADGLRLLE 559

Query: 216 LQKKLYHPFSVNLMQTREKLIKILMEL-EDWKEALAYCQLTIPVYQRVYPQFH 267
             +K +HPF+  L Q  + + ++  E  +    A+ + + +I + +R+Y + H
Sbjct: 560 NGRKCFHPFNKQLAQMEDIVARLYCEAGQQPAAAIVHARRSIEILERIYGKDH 612


>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
          Length = 687

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)

Query: 9   LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 67
           +LL  QI  N + ++       +  + + FS  +     +C+ E   +   +Y   S  N
Sbjct: 321 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 378

Query: 68  HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTC-- 123
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 379 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 438

Query: 124 ----------------TCPRCIKLGQF---------DDIQESAILEGYRC---------- 148
                            CP+   LG           ++    +I E + C          
Sbjct: 439 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 498

Query: 149 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 195
            +D    G L   SD     D GF C  C   RS+ ++        +  S++N L +   
Sbjct: 499 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 551

Query: 196 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 253
            L S  N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+
Sbjct: 552 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 609

Query: 254 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
            +I + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 610 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 664


>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 740

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 32  EIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQ 89
           EI + + K+  NAH   +      +G G+YP  S INH+CLPN     +   L + R+ +
Sbjct: 451 EIFDLYPKILFNAHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHGLILYRSSR 510

Query: 90  HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI--KLG 132
            + KG E+  SY++     + R+K LK QY F C C RC+  KLG
Sbjct: 511 DILKGEEITTSYLDILKPKLQRRKDLK-QYSFVCQCERCLNEKLG 554


>gi|302838330|ref|XP_002950723.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
 gi|300263840|gb|EFJ48038.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
          Length = 610

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKAL 115
           GL+P  + +NH+C PN V    G   VVRAVQ +P+G E+L+SY+  +       RQ AL
Sbjct: 246 GLWPYFNQLNHACAPNCVHYVVGSTMVVRAVQVIPEGCELLVSYLGRDDLAPRQVRQAAL 305

Query: 116 KEQYLFTCTCPRC 128
           K +Y F C C RC
Sbjct: 306 KARYGFICNCTRC 318


>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
 gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
          Length = 544

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 53/93 (56%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N+    +     LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+
Sbjct: 234 AKLELNSFNFHSPLADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEI 293

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           L+SYI+       RQ  L E+Y F C CP+C K
Sbjct: 294 LVSYIDATNPYKIRQTELLERYYFKCRCPKCGK 326


>gi|17509133|ref|NP_492772.1| Protein SET-18, isoform a [Caenorhabditis elegans]
 gi|3924878|emb|CAB03382.1| Protein SET-18, isoform a [Caenorhabditis elegans]
          Length = 501

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/314 (21%), Positives = 126/314 (40%), Gaps = 24/314 (7%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           DE+++  + +I   +        I   E  + F K   N+H+I  +    +G  L   +S
Sbjct: 146 DEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGNEVGMALDLGVS 205

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
             NHSC P   +VF+G    +R +  VP        E  ISYI+   S   R++ L  ++
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSKYIRRRDLNSRW 263

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
            F C C RC+      D ++ A L   RC +  C   +L    ++     C++C  +  +
Sbjct: 264 YFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNIACEKCKTIVEE 316

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
           + +K     +  L             +E+++  + I        HP +V + + R  L  
Sbjct: 317 DTVKAAQEYMKTLPASFDPKCPAEILKELLAKAEQI-------LHPSNVYVARLRTALFH 369

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +   L     +  + Q+    Y+  +P+    +G Q     K      + + A+    +A
Sbjct: 370 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 428

Query: 298 VEILRITHGTNSPF 311
           + I  +  G + P+
Sbjct: 429 MNIFEVCFGLDHPY 442


>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
          Length = 435

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 50/93 (53%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++
Sbjct: 122 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 181

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            ISYI+       R+  L+E+Y F C C +C K
Sbjct: 182 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 214


>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
 gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
          Length = 824

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 1/98 (1%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV-LVFEGRLAVVRAVQH 90
           EI E ++    N HT+ N   +  G G YP+    NHSC+PN    V +  +   RA + 
Sbjct: 509 EIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFNHSCVPNVTWYVDDNGMLAFRATRT 568

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + +G E+ ++Y +   +   R+  L +QY F C CP+C
Sbjct: 569 ISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCPQC 606


>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 415

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 50/93 (53%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++
Sbjct: 102 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 161

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            ISYI+       R+  L+E+Y F C C +C K
Sbjct: 162 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 194


>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
 gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
 gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
          Length = 420

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 310 GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAGEEICISYLDCCQRERSRHS 369

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 370 RNKILRENYLFTCSCPKC--LAQADD 393


>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 676

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)

Query: 21  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           +++Q+    I+E+A     +   ++ +  ++ + +G G+Y  +S+ NHSC P  +  F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
              VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +  
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564

Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
           +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + +   
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623

Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
                G HQE + +Y  I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644


>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 676

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)

Query: 21  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 80
           +++Q+    I+E+A     +   ++ +  ++ + +G G+Y  +S+ NHSC P  +  F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504

Query: 81  RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
              VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +  
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564

Query: 139 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 195
           +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + +   
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623

Query: 196 ALTSCGNHQEVVSTYKMIEKL 216
                G HQE + +Y  I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644


>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
          Length = 324

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 50/93 (53%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++
Sbjct: 11  AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 70

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            ISYI+       R+  L+E+Y F C C +C K
Sbjct: 71  FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 103


>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
 gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 380

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 48/76 (63%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
            +++ CN  +I + EL+ +G GLYP +S++NH C PN +++FEG+   +RAV+ +    E
Sbjct: 175 LARVTCNCFSISDGELQDVGVGLYPSMSLLNHDCQPNCIMMFEGKRLTLRAVRVIRSAEE 234

Query: 97  VLISYIETAGSTMTRQ 112
           + ISY +    +  R+
Sbjct: 235 LTISYTDILAPSKDRR 250



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 4/96 (4%)

Query: 234 KLIKILME----LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 289
           +L+ + M+    L+D++ AL Y    +  Y+  +   HP   ++    GKL+ FLG  E 
Sbjct: 281 RLLDLAMDACISLDDYETALEYGNRALGPYKLYFSDPHPSRAVELLRVGKLQHFLGRLEE 340

Query: 290 AIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
           A  S T+A  I+ +THG       ++  +L E QAE
Sbjct: 341 ARGSFTQAYSIMSVTHGAAHALTNQVQRRLSECQAE 376


>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 323

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           +A    KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + 
Sbjct: 5   VANVLIKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQ 64

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           KG ++ ISYI+       R+  L+E+Y F C C +C K
Sbjct: 65  KGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAK 102


>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 236

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 61/105 (58%), Gaps = 14/105 (13%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVR 86
           I+ + ENF K   +     N+E    G+GLY + S INHSC PNA + F     +LAVV 
Sbjct: 122 IDAVYENFEKGVGD---FLNNE----GSGLYELQSTINHSCSPNAEVTFPHSNYQLAVV- 173

Query: 87  AVQHVPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 128
           A  ++  G E+ ISY++    + S  +RQK L+E YLFTC C +C
Sbjct: 174 ATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218


>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
 gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
          Length = 495

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 54/102 (52%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           ++    +A   +KL  N+  +  S    +G  ++P   +INHSC  N+ + F+G    V+
Sbjct: 175 DVDEGTVASYAAKLDLNSFNLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVK 234

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A+  + KG ++ ISYI+T      R   LKE+Y F C C +C
Sbjct: 235 AMHPIKKGEQIFISYIDTTTPYDIRCNELKERYFFDCQCTKC 276


>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
          Length = 383

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 273 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 332

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 333 RNKILRENYLFTCSCPKC--LAQADD 356


>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
 gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
          Length = 393

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 56  GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S INHSC+PNA     +   + V++A+  + KG E+ ISY++      S  +
Sbjct: 286 GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKALAPIQKGDEICISYLDDCQLERSRHS 345

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           R K L+E Y+F C CP+C       DI
Sbjct: 346 RHKTLRENYIFVCQCPKCRAQASDPDI 372


>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
          Length = 561

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 49  NSELRP------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV------QHVPKGAE 96
           N +++P      LG+ L P+ + +NHSC PN  +  E R  V +A+      Q + KG E
Sbjct: 466 NKKMQPKLRQYCLGSALIPIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQE 525

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           +L SYI+ +     RQ  L  QY FTC C +C K
Sbjct: 526 LLTSYIDESNPFKDRQSILSSQYGFTCNCNKCSK 559


>gi|343424810|emb|CBQ68348.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 855

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 64/116 (55%), Gaps = 16/116 (13%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  E+ +   + +CN+ T+ +S+L PLG  ++  ++++NH+C PNA +VF          
Sbjct: 415 SATELLDLVCQFSCNSFTLTDSDLNPLGVCMHASMAMLNHACTPNAAVVFPFGGAAKSGQ 474

Query: 79  ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
                 E ++  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C
Sbjct: 475 QKWRDGEDKIMQLVALRAIEPGEELLISYVDIADTFEERRSYLKKRYCFDCRCELC 530


>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 7/112 (6%)

Query: 49  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 106
           N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G  + +SY    G  
Sbjct: 452 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 511

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
               RQK LK++Y F C C  C +L   D      +L  +RC +  C G +L
Sbjct: 512 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVL 558


>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
 gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
          Length = 649

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 92/209 (44%), Gaps = 26/209 (12%)

Query: 36  NFSKLACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           N   L  NAH I  SEL+            +G GLYP +++ NHSC P     + G    
Sbjct: 419 NLQFLQYNAHEI--SELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRYYRGNSVC 476

Query: 85  VRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESA 141
           VR V+ +P G+ V  +Y  + T  S   R+  L  QY FTC C  C     +F D+  + 
Sbjct: 477 VRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACAGDWPKFADMDSNV 536

Query: 142 ILEGYRCKDDG---CSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLA 196
           +    R K DG   CS  L   ++   F   C +CG   +  +  K   + ++L K    
Sbjct: 537 L----RFKCDGGKNCSNVLEIPAEINEFMVQCTECGEHTNIMKGLKSLQDTDMLFKSATK 592

Query: 197 LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
           L   G ++  +  Y +M+  L + L  P+
Sbjct: 593 LHEAGEYEFALKKYVEMMNTLDEVLVPPY 621


>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
 gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
          Length = 429

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 60/223 (26%), Positives = 98/223 (43%), Gaps = 15/223 (6%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           P      I   F K++ N+ ++ NS    +G  L   +S  NHSC P+  + + GR A++
Sbjct: 136 PHSHPETIKSIFKKVSVNSFSLSNSTGNSIGIALCVKLSAANHSCKPSTRVCYRGRTAML 195

Query: 86  RAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
             V   +P   E    SYI+   +  TRQ  LK++Y F C C  C       D + +  +
Sbjct: 196 VPVDDRLPTSLEGACHSYIDELQTLSTRQATLKKKYKFDCACEGCT------DDERNGRM 249

Query: 144 EGYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCG 201
           E + C  + C    +R+ +  G +C  CG  L R + E+ + A E  I S+  L   S  
Sbjct: 250 EAWAC--EICKTGWIRNVE--GASCNPCGYVLTRDQYELCRTAEEAAIASRPKLENDSIP 305

Query: 202 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 244
                    K++E  Q  L H ++V+ +     L    + + D
Sbjct: 306 LETRRHLCEKLLELFQDTL-HAYNVHRIPVLRCLFVATLAIRD 347


>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
 gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
          Length = 660

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 99/219 (45%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           IA+LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 IASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  I T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGNTIHINSVRPIEAGLPINENYGPIYTQDERSERQARLKELYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC+  + C+  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDII--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALTKFIDLIRIMYEVLAPPF 636


>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 513

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 48  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAG 106
           C + LR    GLYP+ ++ NH C+PN    F+ +  + V A   +  G E+ +SY +   
Sbjct: 213 CTTSLR----GLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLPIAAGEEITMSYTDLLW 268

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
            T +R++ LK    F+C C RC    +F   Q  A+L    C  D CSG LL  S+   F
Sbjct: 269 DTSSRRQFLKATKHFSCNCSRCSDPSEFGS-QLGALL----CAKDDCSGHLL-PSNPLNF 322

Query: 167 ----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
                C +C +  + ++I+ I S++N L    + +T      E+V   K IE    KL  
Sbjct: 323 ISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTP----HEIV---KFIETALSKLVP 375

Query: 223 PFSVNLMQTREKLI 236
             +  L+  + ++I
Sbjct: 376 TSNFILIDVKFRII 389


>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Taeniopygia guttata]
          Length = 404

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 294 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 353

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 354 RNKILRENYLFTCSCPKC--LAQADD 377


>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
 gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
          Length = 729

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 54/91 (59%), Gaps = 7/91 (7%)

Query: 43  NAHTICNSELRPLGT--GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
            A  + NSE  PL +  G++P +++INHSC PNAV V      VVRA + +P+G EVL++
Sbjct: 601 GAAAVANSE--PLTSRMGVWPEVALINHSCGPNAVAVMLYDRLVVRATRRIPRGREVLLN 658

Query: 101 YIETAGS---TMTRQKALKEQYLFTCTCPRC 128
           ++ + G+      R+  L + Y F C C RC
Sbjct: 659 WLGSQGALAPARERRAQLSQMYDFACRCARC 689


>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 552

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 110/260 (42%), Gaps = 75/260 (28%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGR 81
           ++ +  S+   N+ T+    L PLG  + P++++INHSC PNAV+V+          E +
Sbjct: 200 DLVDLISRFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRSTSDPSQQEPQ 259

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
           + VV A++++    E+  +YI+T      RQ AL   Y FTC C  C +    D      
Sbjct: 260 MQVV-AIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDGMD------ 312

Query: 142 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 201
           + E   C  + C G           TC              + +E N LS+       CG
Sbjct: 313 LREAIWCPKN-CGG-----------TC-------------PLPTEENTLSR-------CG 340

Query: 202 NHQEVV-STYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-- 255
             + +V ST  +++ L   Q+ L    S+ L +T + L                 QLT  
Sbjct: 341 KCKAIVSSTDSVLDALRVGQEALQKATSLQLTETAKAL-----------------QLTTN 383

Query: 256 -IPVY--QRVYPQFHPLLGL 272
            IP+    ++ P  HPLL L
Sbjct: 384 IIPILTSSKLTPSSHPLLAL 403


>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 493

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)

Query: 28  ISINEIAENFSKLACNAHT---ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           +S   I +++   A NA T   +    L P   GL+P+ ++INHSC+ NAV V    + V
Sbjct: 188 LSYTNIEQSWRTTALNAPTPLDLSRLSLPPRLLGLFPLAAMINHSCVGNAVRVLVDDVMV 247

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           V A   +P G E++ SY         R   L+  Y F C CPRC
Sbjct: 248 VHATTDLPAGTELVWSYGPPTTPFAQRNTRLRAHYGFVCDCPRC 291


>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 483

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/314 (21%), Positives = 131/314 (41%), Gaps = 37/314 (11%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V+ + +   + ++  +E + +  C    I   E+R        GLYP ++++NHSC+ 
Sbjct: 143 NIVDYLRK--RLKLDRFSEKYIQTICGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVS 200

Query: 73  N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           N   ++   + R+ + R    +P G E+  SY  +   T+ R++ L E   F C CPRC 
Sbjct: 201 NTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTILRREHLLEGKHFACACPRC- 258

Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
                D  +    +   +C   D+G    L     +  + C  C    + + ++KI   +
Sbjct: 259 ----SDPTELGTHMSSLKCNKCDNGIVLPLDSLDSESTWKCTHCDFSTNGQAVRKILRII 314

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
                   A++   +  + +   + + K  + + HP    L   R  L ++   ++++  
Sbjct: 315 QAQVDAAEAISG-ADGADAIYKRETVMKKYRLVLHPHHAFLSMLRHSLTQMYGRVDEYLL 373

Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA 290
                   +  +  C+L + V   V P +  + G+  Y          K +W  G  + A
Sbjct: 374 DDLPDVVLEHKVDMCRLLLQVLDVVEPGYSRVRGMTLYELHAPLLFLAKDQWNAGVIDEA 433

Query: 291 -IKS-MTEAVEILR 302
            +KS M EA  IL+
Sbjct: 434 KLKSKMIEAANILK 447


>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
          Length = 354

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 55/98 (56%)

Query: 217 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 276
           QK + H F++  ++T +      + LE W++A  Y +  +P Y   Y + HPL+GL Y T
Sbjct: 234 QKDILHSFNIQHIRTLDTAFLAAVNLEYWEDAELYSKQLLPGYLLYYGEVHPLIGLLYLT 293

Query: 277 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
            GK++  L   + A++ + +A  +L ITHG   P M+E
Sbjct: 294 MGKIQLHLRKLKQALEILKKASAVLTITHGDKHPIMRE 331



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 35/56 (62%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           + ++  N+  I + ++  +G G+Y   S+I+HSC PNAV+ FEG   +VR +  +P
Sbjct: 160 YGRICINSFNILDLDMNSIGVGIYLGASVIDHSCKPNAVVTFEGTTIIVRTLIDLP 215


>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
          Length = 484

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 56/102 (54%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E+S   I+   +KL  N+  + N+    LG  L+P  +I NHSC  NA + F+G    ++
Sbjct: 168 EMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++ + K  ++ I+YI+       RQ  L+ +Y FTC C +C
Sbjct: 228 AIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHCSKC 269


>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
          Length = 458

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 87/186 (46%), Gaps = 17/186 (9%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           GLYP  S++NH C+ N   VF+     +R  A + +P G ++  +Y  +  +T+ R+  L
Sbjct: 225 GLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTLDRRLHL 284

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCG 172
           K    F C C RC      D  +   +L   +C   GC G +L  +       + C  CG
Sbjct: 285 KSTKHFWCQCSRCA-----DPRELGTLLSAVKCT--GCGGAVLSQNPLYQMSNWECSDCG 337

Query: 173 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
            V+  E++K++     +  K+   L    N  E++  +  I K    + HP S ++++ +
Sbjct: 338 SVQKVEQVKRVHDSARMELKQIAQLAR--NRPELLEDF--IRKYSGAI-HPDSCHVIEAK 392

Query: 233 EKLIKI 238
             L+++
Sbjct: 393 YALVQL 398


>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
 gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 413

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 6/122 (4%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K+ CN   I     + +G  + P  S  NHSC+PN   V +G     +++  + KG ++ 
Sbjct: 248 KVRCNQFGIWTKNDKCIGMAVSPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQIN 307

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-----LGQFDDIQESAILEGYRCKDDGC 153
           ISY+    ST  R+  LK  Y F C CPRC        G+ +D  ++ I + Y C    C
Sbjct: 308 ISYLALDKSTKRRRDYLKFGYYFHCQCPRCNSTDIDPTGKLEDSLDNWISKFY-CHQKKC 366

Query: 154 SG 155
           +G
Sbjct: 367 TG 368


>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
           antarctica T-34]
          Length = 882

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 65/119 (54%), Gaps = 17/119 (14%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  E+ +   + + N+ T+ +S+L PLG  ++  ++++NHSC PNA +VF          
Sbjct: 463 SATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAAVVFPFGGAAKSGQ 522

Query: 79  -------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
                  + ++  + A++ +  G E+LISY++ A +   R+ ALK++Y F C C  C K
Sbjct: 523 QNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKRYCFECRCELCRK 581


>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
          Length = 505

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 61/118 (51%), Gaps = 3/118 (2%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           I + F K+  N   I + E  P G  L  V S +NHSC PNA + FEG    +R+++ + 
Sbjct: 193 IQDLFCKVLTNVMCINHQEGSP-GITLDLVGSFVNHSCDPNAFVFFEGSQLRMRSLKPIN 251

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
            G E+ ++Y E     + R++ L   Y F C C RC K  +  ++ E +  EG+  +D
Sbjct: 252 AGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKK--EHKELHELSFREGFSIRD 307


>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
          Length = 896

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 1/92 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K+  NA    +++L  +G  L P +++ NHSC+PNA++ F GR A++RA + +    E+ 
Sbjct: 535 KIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGRKAILRAEKPIKVDEEIE 594

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           ISY +       R+ AL   Y F C CPRC K
Sbjct: 595 ISYTDYTFPRSKRKHALAP-YFFDCQCPRCEK 625


>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 541

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           + ++N  AE   K+  N+    +++    GT L P+++++NHSC+PNAV++F  R A +R
Sbjct: 173 DANLNLAAEILCKIQTNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLR 232

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A   V  G ++ ISYI+       R++ L E Y F C C RC K
Sbjct: 233 AETPVKAGEDITISYIDYTKPLSFRRQDL-ELYHFQCGCLRCEK 275


>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
           [Anolis carolinensis]
          Length = 163

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 45  GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPGEEICISYLDCCQRERSRHS 104

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L Q DD
Sbjct: 105 RHKVLRENYLFVCSCPKC--LAQADD 128


>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 619

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 6/123 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G G+YP ++++NHSC P  +  F G   +VRAV+ +  G E+  +Y  I T      R+
Sbjct: 422 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMIVRAVRTINAGEEISENYGPIFTTMPESERK 481

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFT--CQ 169
           + L+ QY F C C  C       D  +  IL  ++C+    C   L+  SD   F   C 
Sbjct: 482 RKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLMVKSDTNEFMIGCA 540

Query: 170 QCG 172
           +CG
Sbjct: 541 KCG 543


>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
          Length = 816

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 9/125 (7%)

Query: 36  NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 95
           NFS  A  A T  N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G 
Sbjct: 62  NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 119

Query: 96  EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
            + +SY    G      RQK LK++Y F C C  C +L   D      +L  +RC +  C
Sbjct: 120 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 174

Query: 154 SGFLL 158
            G +L
Sbjct: 175 FGTVL 179


>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 487

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 51/87 (58%)

Query: 42  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
           CN+  I +++L  +  G+Y   S++NHSC  N ++VF+ R   +R V+ V +G E  I Y
Sbjct: 181 CNSFGIFDNDLIAISDGIYLRASMVNHSCDYNCIVVFDERKLQLRTVKDVKEGEECTIGY 240

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRC 128
           ++       R+  L+E+Y FTC C +C
Sbjct: 241 VDVIHPAKERRAELEEKYHFTCKCVKC 267



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 40/67 (59%)

Query: 64  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 123
           S++NHSC PN   V +GR   +  V+ V +G E  I+Y++       RQ  LKE+Y FTC
Sbjct: 364 SMLNHSCDPNCAWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYHFTC 423

Query: 124 TCPRCIK 130
            C +CI+
Sbjct: 424 KCVKCIE 430


>gi|307182513|gb|EFN69720.1| Histone-lysine N-methyltransferase ASHR1 [Camponotus floridanus]
          Length = 132

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 55/100 (55%)

Query: 215 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 274
           K QK + H F++  ++T E      M L+ W++A  Y +  IP Y   Y + HPL+GL Y
Sbjct: 12  KKQKNILHRFNIQHIRTLEAAHIAAMNLKYWEDAELYGKELIPGYLLYYGEIHPLIGLLY 71

Query: 275 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 314
              GK++  L   + A+K + +A E+L ITHG     M+E
Sbjct: 72  LMTGKIQLHLNKPKEALKVLEKASEVLMITHGDKHSLMRE 111


>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
          Length = 429

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            L+P+ +++NH C+PN+   FE +    V+RA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHLFL 267

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 156
           K +  FTC C RC      D  ++ A +    C+D  CSG 
Sbjct: 268 KMKKNFTCKCSRC-----SDPTEKGAFISAVYCRDTNCSGL 303


>gi|66800595|ref|XP_629223.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
 gi|60462643|gb|EAL60846.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
          Length = 508

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVL---VFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 109
           G GL+P+ S +NHSC PN  +   + +G  +V   V+A +++P G+E+L SY +   S  
Sbjct: 428 GVGLFPIFSCMNHSCFPNVEISNEIIDGVTSVRMVVKAKKNIPAGSEILHSYCDETLSNK 487

Query: 110 TRQKALKEQYLFTCTCPRCIK 130
            R+  L  QY F CTC +C K
Sbjct: 488 ERKDILFSQYGFKCTCNKCSK 508


>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 501

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 66/135 (48%), Gaps = 11/135 (8%)

Query: 58  GLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            L+PV ++ NH C+PN   ++  EG L V  AV  + +G E+ +SY +    T  R+  L
Sbjct: 208 ALFPVGALTNHQCVPNTRHIVNAEGELLVYAAVP-IAEGQEITMSYADVLWDTQMRRHFL 266

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCG 172
                F C CPRC      D  +  ++L    C  D C G LL ++    +  +TC +CG
Sbjct: 267 LATKHFACQCPRCT-----DVTECGSLLGALSCAIDNCQGNLLPNTPLSLNCPWTCNECG 321

Query: 173 LVRSKEEIKKIASEV 187
           L+    +I  I + +
Sbjct: 322 LIIKGRQISSIRAGI 336


>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 513

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/316 (22%), Positives = 133/316 (42%), Gaps = 41/316 (12%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V+ I +  ++ ++  +E     AC    I   E+R         LYP ++++NHSC+ 
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230

Query: 73  N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           N   ++   + R+ + R    +P G E+  SY  +   T+ R++ L E   F C CPRC 
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRC- 288

Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
                D  +    +   +C   D+G    L     +  + C  C    S   ++K+   +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFTTSGSAVRKV---L 341

Query: 188 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
           +I+  +  A+   S  +  + +   + + K  + + HP    L   R  L ++   ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401

Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-T 287
                     +  +  C+L + V   + P +  + G+  Y          K +W  G   
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFIAKTQWNAGVID 461

Query: 288 ENAIKS-MTEAVEILR 302
           E A+KS M EA  IL+
Sbjct: 462 EAALKSKMIEAANILK 477


>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 459

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 59/106 (55%), Gaps = 11/106 (10%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF----EGRLAVVRA 87
           I E FS+ + N + I +S L  +G G++P+ S + NHSCLPNA++ +    EG   VV+A
Sbjct: 189 IKEIFSRFS-NNNFIVHSHLSAVGHGIFPLASRLFNHSCLPNAIVTYSFTSEGIQMVVKA 247

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC-----PRC 128
           +  + +G E+ I Y + A     RQ   +  Y F CTC     PRC
Sbjct: 248 LTPIKQGEEITIPYFDPALPYHQRQAICRYSYGFECTCSVCMFPRC 293


>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 697

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 51/202 (25%)

Query: 11  LYAQIANLVNLILQ--WPEISINEIAENFSKLACNAHTICNSELRP-------------- 54
            + Q+ +L+N++ +  W       I  +  +L CN H I  SELR               
Sbjct: 353 FFQQLGDLINVMSKENWSIYCGAVIFRHIGQLVCNGHAI--SELRGSFASENNCLEADSF 410

Query: 55  ---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
                          + TG++P IS+ NHSC PN    F      V A + +P G E+  
Sbjct: 411 NIKAGFLHRYFESTRVFTGIFPQISMFNHSCEPNIRNTFNKNTLTVYAAKDIPVGGEIFN 470

Query: 100 SYIETAGSTMT------RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
            Y    G          R+ AL++QY F C C RC      D+  ES   E Y+C    C
Sbjct: 471 CY----GPNFKLMCKDERKSALRQQYGFDCKCTRCAMNN--DEAYES--FEHYKCPFAKC 522

Query: 154 SG-FLLRDSDD---KGFTCQQC 171
           S  F+L+++ D   K   C  C
Sbjct: 523 SKYFMLKENADPFEKDIKCPMC 544


>gi|302834810|ref|XP_002948967.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
 gi|300265712|gb|EFJ49902.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
          Length = 679

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           ++ N   E+    A +A    ++EL     GL+P  S+INHSC P A     G + VVRA
Sbjct: 218 VTFNAYGESKPDPAVSAVRSLSAELLSGCVGLWPPFSLINHSCAPVASYGLVGDVMVVRA 277

Query: 88  VQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCI 129
              +P G +V ISY      G    R+  L++ Y F C C RC+
Sbjct: 278 AADLPAGQQVTISYFGRRALGPLELRRAYLRQHYGFVCACERCV 321


>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 278

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 40/76 (52%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
            +YP +S  NH C PN      G +    AV+ +P G E+ ISYI    S   RQK L E
Sbjct: 69  AVYPKVSRFNHDCRPNVHYRLSGLIHTTVAVRDIPAGTELTISYIYVRKSRAERQKELSE 128

Query: 118 QYLFTCTCPRCIKLGQ 133
            + FTCTC +C K  +
Sbjct: 129 SWNFTCTCEQCSKSAE 144


>gi|452846662|gb|EME48594.1| hypothetical protein DOTSEDRAFT_19117 [Dothistroma septosporum
           NZE10]
          Length = 199

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 16/92 (17%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
            S L CN+  +      PLG  +  +IS +NHSC+PNA +VFE                E
Sbjct: 38  ISLLLCNSLILMTPANDPLGIVVELLISNVNHSCVPNACIVFE----------------E 81

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + +SY++  G    RQ AL  +Y FTCTCP+C
Sbjct: 82  IFVSYVDQHGLFHARQAALWHRYYFTCTCPKC 113


>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
          Length = 1064

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 39   KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
            K+ CN   I +   + +G    P  S  NHSC PN   V    + V +A+  + KG  + 
Sbjct: 910  KIRCNQFGIWSKRDKCIGVSCTPAASYFNHSCCPNLADVRGTTVVVFKALHFISKGTPIS 969

Query: 99   ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFL 157
            ISY++    T  RQ  LK  Y FTC C RC      D   ES   +  + C    CSG  
Sbjct: 970  ISYLDLDQPTHERQSYLKTFYYFTCQCLRCK-----DQTDESDNWISRFYCDRFKCSGTY 1024

Query: 158  LRD 160
              D
Sbjct: 1025 FLD 1027


>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 514

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/308 (22%), Positives = 126/308 (40%), Gaps = 43/308 (13%)

Query: 27  EISINEIAENFSKLACNAHTICNSELR-PLGTG---LYPVISIINHSCLPN---AVLVFE 79
           ++ ++  +E   + AC    I   E+R   G G   LYP ++++NHSC+ N   ++   +
Sbjct: 182 QLKLDRFSEELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTD 241

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
            R+ + R   HVP   E+  SY  +   TM R++ L E   F C C RC      D  + 
Sbjct: 242 YRIRL-RTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC-----SDPTEL 295

Query: 140 SAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK----IASEVNILSKK 193
              +   +C   D+G    L     D  + C  C    S   ++K    I +EV+ + + 
Sbjct: 296 GTHMSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQ- 354

Query: 194 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-------- 245
                S  +  + +   + + K  + + HP    L   R  L ++   ++++        
Sbjct: 355 ----ISGADGADAIQERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDV 410

Query: 246 --KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT--------ENAIKS-M 294
             +  +  C+  + V   + P +  + G+  Y       F+  T        E A+KS M
Sbjct: 411 VLEHKIDMCRFLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTLWNAGVIDEAALKSKM 470

Query: 295 TEAVEILR 302
            EA  IL+
Sbjct: 471 IEASNILK 478


>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
          Length = 559

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 52/91 (57%), Gaps = 1/91 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           ++  NA    +++    G  L P ++ +NHSC+PNA + F+ R A +RA + + +G E+ 
Sbjct: 199 QIQTNAFNRLDADTGMSGIFLDPALARVNHSCVPNAFIGFDKRTATLRAERPIKEGEEIT 258

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           ISYI        R++ L+  Y F C CPRC+
Sbjct: 259 ISYIANDKPRSIRREGLR-LYYFECDCPRCV 288


>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 662

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TGL+   S +NHSC+PN+V  F G L VVRA+QH+  G E+  +Y +  G    RQ  L+
Sbjct: 515 TGLWARASYLNHSCMPNSVKDFAGDLIVVRALQHIRTGDEITHAYQDN-GDYDARQALLQ 573

Query: 117 EQYLFTCTCPRCI 129
             + FTC C  C+
Sbjct: 574 TTWGFTCRCKLCV 586


>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 490

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGS 107
           S+L   G+ L+P+++  NH C PNAV+ +      V  R+++ +P   EV ISYI+   S
Sbjct: 406 SDLCISGSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVS 465

Query: 108 TMTRQKALKEQYLFTCTCPRCI 129
              R+K L+ +YLF C C RC+
Sbjct: 466 CSQRRKELQHKYLFNCKCTRCL 487


>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
 gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
          Length = 392

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 94
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 95  AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 128
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 668

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 89/183 (48%), Gaps = 23/183 (12%)

Query: 33  IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRL 82
           I  +   + CNA+ I  +         E R +G  +YP +S+INHSC PN V   +   +
Sbjct: 442 ILHHLQAINCNAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI 501

Query: 83  AVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
            VVR ++ V KG E+L  Y     + + ++R + L ++Y F CTC  CI+  Q+   +  
Sbjct: 502 VVVRTLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACIQNWQYPLPE-- 559

Query: 141 AILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 196
             +  Y+CK   CS  +    L + D +    +QC   ++   +KKI +++    KK L 
Sbjct: 560 --IMKYKCK--TCSEIIGIIALNEKDMQNVLNKQCNCNKTI-NLKKIKNQLQNSIKKRLN 614

Query: 197 LTS 199
             S
Sbjct: 615 AIS 617


>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
 gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
          Length = 391

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 94
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 95  AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 128
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
 gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
          Length = 421

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 305 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 364

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 365 RQKILRENYLFVCSCPKC--LAQADD 388


>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
          Length = 420

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 363 RQKILRENYLFVCSCPKC--LAQADD 386


>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Acyrthosiphon pisum]
          Length = 387

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 47/80 (58%), Gaps = 10/80 (12%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYI---ETAGST 108
           G+ L+   S INHSC PNA  VF+G    RL  +R ++    G E+ ISYI   E   S 
Sbjct: 289 GSALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIE---PGDEINISYIAPCELDHSR 345

Query: 109 MTRQKALKEQYLFTCTCPRC 128
            TRQK L+E Y+FTC C +C
Sbjct: 346 HTRQKYLQENYVFTCRCIKC 365


>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
           NZE10]
          Length = 549

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 40/74 (54%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LG G  P++   NHSC PNA  +F     ++RA++ + KG E+ + Y +       RQ  
Sbjct: 223 LGMGFDPIMCSANHSCDPNAAALFNQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAE 282

Query: 115 LKEQYLFTCTCPRC 128
           LK  Y FTC C RC
Sbjct: 283 LKGYYFFTCHCARC 296


>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
          Length = 816

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 147/359 (40%), Gaps = 77/359 (21%)

Query: 17  NLVNLILQWPEISINEIA--------------ENFSKLACNAHTICNSELRPLGTGLYPV 62
           +L  L+L   +I +N IA              + FS  +     +C+ E   +   +Y  
Sbjct: 445 SLSQLVLLICQIKVNSIAIVHMKSMDGGKVLTKGFSGFS--GDVMCSVEQVRVAQAIYMS 502

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYL 120
            S  NHSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y 
Sbjct: 503 GSFFNHSCQPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYY 562

Query: 121 FTCT------------------CPRCIKLGQF---------DDIQESAILEGYRC----- 148
           F+C                   CP+   LG           ++    +I E + C     
Sbjct: 563 FSCGCSSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRRKENFVHVSIGESHVCTLSLP 622

Query: 149 ------KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKT 194
                 +D    G L  +SD     D GF  +C+ Q  L  +     +  S++N L++  
Sbjct: 623 DVSKFDEDIVKVGKLFFNSDTMLNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLNE-- 680

Query: 195 LALTSCGNHQEVVSTYKMIEKLQ-----KKLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
             L S  N  EV+    + E LQ     +KL HP+S  L Q  + + +   ++ D ++A 
Sbjct: 681 --LPSPDNVPEVL----IAEALQSLEHIEKLRHPYSKALAQAHDTIAEAFAKVGDQEQAR 734

Query: 250 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 308
            +C+ +  + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 735 KHCEASFKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 793


>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
           C-169]
          Length = 329

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 57/128 (44%), Gaps = 16/128 (12%)

Query: 16  ANLVNLILQWPEISINEIAENFSKLACNAHTIC-----NSELRPLGTGLYPVISIINHSC 70
           A L  L    P +S  E+A+  +K   N + I        E R  GT LYP  S++NH C
Sbjct: 99  AQLPQLAGSRP-LSPEEVADALAKEELNGYGIMAPSGPEGERRIRGTALYPKASLLNHEC 157

Query: 71  LPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           LPN   V           E      +A+ ++P G E   SY     S   RQ+  ++QY 
Sbjct: 158 LPNVARVDNFDEADVNAPENTAVHFKALHNLPAGEEFTQSYFPMHTSYHVRQQRCQDQYG 217

Query: 121 FTCTCPRC 128
           F C CPRC
Sbjct: 218 FACNCPRC 225


>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
 gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
          Length = 646

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 123/298 (41%), Gaps = 53/298 (17%)

Query: 7   KQLLLYAQIANLVNLILQWP----------------EISINEIAENFSKLACNAHTICNS 50
           K L  YA  A+L+ L L+                  EI  + + ++  +L CN H I + 
Sbjct: 308 KHLFQYALTASLLALYLKTHTSYFSNESNDDDDNNLEIVASTLFKHILQLICNGHAITDV 367

Query: 51  EL-------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
                          + +GT +YP  S++NHSC P+ +  F     +VRA + +  G E+
Sbjct: 368 ITDDSNDSNAVDLIEKRIGTAIYPSASMMNHSCEPSIITSFSNNHLIVRASKPIKAGEEI 427

Query: 98  LISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
              Y   A       RQ+ L++QY F C C  C +  Q+D ++       Y C +  C G
Sbjct: 428 FNCYGPHAKRMKKEERQEILRKQYFFNCHCMACHQ--QYDIMER---FMAYMCFE--CKG 480

Query: 156 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL--ALTSCGNHQEVVSTYKMI 213
            L+  +D     C  C             ++ +I+S+  +  +    G   + +  +   
Sbjct: 481 PLIIINDSNNGYCLDC-------------NKNSIISQDVIRESFKEQGKILDALKKFLNF 527

Query: 214 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
             + +K+ + F+ +L  + ++      +LED+ + L   + TI   ++ +  F   LG
Sbjct: 528 FIIGRKILYKFNKDLCYSADQAASCFSKLEDYDKCLVILKCTILSEEKRFGSFSIELG 585


>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
 gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
 gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
          Length = 448

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 6/105 (5%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           L   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 225 LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKI 284

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
            L     F C C RC      D  +    L    C++ GC G ++
Sbjct: 285 FLGMTKHFICKCVRCQ-----DPTENGTYLSALFCREQGCRGLVI 324


>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
           1015]
          Length = 386

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 57/104 (54%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           ++S   I+   +KL  N+  + N+    LG  L+P  +I NHSC  NA + F+G    ++
Sbjct: 168 DMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A++ + K  ++ I+YI+       RQ  L+ +Y FTC C +C +
Sbjct: 228 ALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCTR 271


>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
          Length = 415

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 52/93 (55%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +KL  N+  + N+    LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++
Sbjct: 147 AKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQI 206

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            I+YI+       RQ  L+ +Y FTC C +C +
Sbjct: 207 FITYIDVTDPYPIRQANLQSRYYFTCHCSKCSR 239


>gi|402590196|gb|EJW84127.1| hypothetical protein WUBG_04964 [Wuchereria bancrofti]
          Length = 283

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 132/288 (45%), Gaps = 32/288 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
           S  E+ + F K+  N   I +S L   G G+Y  +S ++HSC P+A ++F G  A++R++
Sbjct: 3   SSTEMFDIFCKIVINTVVITDSCLSRTGIGVYLGLSALDHSCKPDAFIIFNGTKAILRSL 62

Query: 89  QH--VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
                     + I Y +    T TR K ++ Q+ F C C  C  + + D  ++   L   
Sbjct: 63  SKNITEYNNNLRIPYCDLLELTSTRCKYMQLQHNFICNCEICQNV-ELD--RQKCSLRCT 119

Query: 147 RCKDDGCSGFLLRDSDDKGFT-CQQCGLVR--SKEEIKKIASEVNILSKKTLALTSCGNH 203
           +C D  C        D++  T C+ CG +   + + ++K+  ++      T   ++  N 
Sbjct: 120 KCTDGFCP---YSPEDEQAETRCKVCGEISLFNFDHLQKLYQQL------TTDDSTEKNL 170

Query: 204 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 263
            E++  Y   E    +++ P++V L +  EK++   M+   + EA           +  Y
Sbjct: 171 NELIDLYCESE----EVFSPYNVPLCKFAEKIMISAMKHHKYDEAA----------KTFY 216

Query: 264 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
           P+  P L  +     KL   L   E +   + EA++++R ++G+ S F
Sbjct: 217 PKGLPPLPTRMLEYAKL-LMLQHDEASFPILREALKMIRESYGSESNF 263


>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
          Length = 513

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/316 (21%), Positives = 130/316 (41%), Gaps = 41/316 (12%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V+ I +  ++ ++  +E     AC    I   E+R         LYP ++++NHSC+ 
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230

Query: 73  N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           N   ++   + R+ + R    +P G E+  SY  +   T+ R++ L E   F C C RC 
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRC- 288

Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
                D  +    +   +C   D+G    L     D  + C  C    S   ++K+   +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFTTSGSAVRKV---L 341

Query: 188 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
           +I+  +  A+   S  +  + +   + + K  + + HP    L   R  L ++   ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401

Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTE 288
                     +  +  C+L + V   + P +  + G+  Y          K +W  G  +
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTQWNAGVID 461

Query: 289 NAI--KSMTEAVEILR 302
            A+    M EA  IL+
Sbjct: 462 EAVLKSKMIEAANILK 477


>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 423

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 314 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 372

Query: 110 TRQKALKEQYLFTCTCPRC 128
           TR+K L   YLF C CP+C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391


>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
          Length = 510

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/314 (21%), Positives = 128/314 (40%), Gaps = 37/314 (11%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V+ + +   + ++  +E   +  C    I   E+R        GLYP ++++NHSC+ 
Sbjct: 170 NIVDYLRK--RLKLDRFSEEHIQTVCGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVS 227

Query: 73  N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           N   ++   + R+ + R    +P G E+  SY  +   TM R++ L E   F C CPRC 
Sbjct: 228 NTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTMLRREHLLEGKHFACACPRC- 285

Query: 130 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 187
                D  +    +   +C   D+G    L     +  + C  C    + + ++K+   +
Sbjct: 286 ----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDSESKWKCTHCDFSTTGQAVRKVFRII 341

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-- 245
                   A++   +  + +   + I K  + + HP    L   R  L ++   ++++  
Sbjct: 342 QAEVDAAEAISG-ADGADAIHERETIMKKYRSVLHPQHAFLSMLRHSLTQMYGRVDEYLL 400

Query: 246 --------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-TEN 289
                   +  +  C+  + V   V P +  + G+  Y          K +W  G   E 
Sbjct: 401 DDLPDVVLEHKVDLCRSLLQVLNVVEPGYSRVRGMTLYELHAPLLFLAKGQWNAGVIDET 460

Query: 290 AIKS-MTEAVEILR 302
            +KS M EA  IL+
Sbjct: 461 GLKSKMIEAANILK 474


>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
          Length = 409

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 59/104 (56%), Gaps = 8/104 (7%)

Query: 34  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVV 85
           A+ F+ L C    N+++I +   + +G  LY +IS  NHSC PNA ++FE     R A +
Sbjct: 300 AQMFNYLVCILLTNSYSIEDKNGQEIGAALYSLISCCNHSCAPNAQVIFEDSEDAREATL 359

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             ++   +G E+ ISYI   G +++ ++    Q+ FTC C RC+
Sbjct: 360 VLLRPCVQGEELYISYITDLGRSVSERRRELAQWCFTCQCTRCL 403


>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
          Length = 697

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 2/93 (2%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAE 96
           + L CN+  + N++   +G  LYP  S  NHSC PN   V + G  A   A++ +PKG  
Sbjct: 550 AALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEP 609

Query: 97  VLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 128
           + I Y++    ST  R++ L   Y F C C RC
Sbjct: 610 LTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
           max]
          Length = 419

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 52/119 (43%), Gaps = 13/119 (10%)

Query: 58  GLYPVISIINHSCLPNAVLV----------FEGRLAVVRAVQHVPKGAEVLISYIETAGS 107
           G+YP  S  NH CLPNA                   ++R +  VP+G E+ +SY      
Sbjct: 231 GIYPYASFFNHDCLPNACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEK 290

Query: 108 TMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 163
             +RQK L E Y FTC C RC         D ++++A  E     +D C      D+DD
Sbjct: 291 YSSRQKRLIEDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDD 349


>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
 gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
 gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
          Length = 421

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           RQK L+E YLF C+CP+C+      DI
Sbjct: 363 RQKILRENYLFMCSCPKCLAQADEPDI 389


>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 673

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 22/180 (12%)

Query: 36  NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
           N   + CNA+ I  +         E R +G  +YP IS+INHSC PN V   +     V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRNIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509

Query: 86  RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
           R ++ + KG E+L  Y     +   ++R + L ++Y F CTC  C +  QF  + E+   
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYCFLCTCEACTQNWQF-PLPETM-- 566

Query: 144 EGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
             Y+CK   CS  +    L +   +  + +QC     K ++KKI ++     +K L   S
Sbjct: 567 -NYKCK--MCSEIIGTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623


>gi|307187924|gb|EFN72837.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 651

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 8/124 (6%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G G+YP ++++NHSC P  +  F G   VVRA + +  G E+  +Y  I T      R+
Sbjct: 454 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMVVRAARTIRAGEEISENYGPIFTTTPENERK 513

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDD-GCSGFLLRDSDDKGFT--C 168
           + L+ QY F C C  C   G +  ++E    +  ++C+    C   LL  SD   F   C
Sbjct: 514 RRLRVQYWFDCNCEACS--GHWPLLEELDPTILRFKCETGLSCGNVLLVRSDTNEFMIGC 571

Query: 169 QQCG 172
            +CG
Sbjct: 572 AKCG 575


>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
           B]
          Length = 604

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 10/136 (7%)

Query: 9   LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINH 68
           L+ Y  IAN   L   +   S   + +  S+ A N  T+ +  L P+G  + P I++ NH
Sbjct: 232 LVRYLGIANPAELA-PFGLNSAGNLVDLISRFATNTFTLTSFSLTPIGICICPSIALANH 290

Query: 69  SCLPNAVLVF---------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 119
           SC PNA +VF         +  L  + A++ +    +V+ +Y++       RQKAL+E Y
Sbjct: 291 SCEPNAAIVFPRSSSLPQAQEPLMHLMAIRDIAPNEQVVAAYVDVTLPRELRQKALQETY 350

Query: 120 LFTCTCPRCIKLGQFD 135
            FTC C  C K    D
Sbjct: 351 SFTCKCKLCTKRMAVD 366


>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 429

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 320 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 378

Query: 110 TRQKALKEQYLFTCTCPRC 128
           TR+K L   YLF C CP+C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397


>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
          Length = 428

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 66/293 (22%), Positives = 124/293 (42%), Gaps = 16/293 (5%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ--HVPKG 94
           F K++ N  ++ NS    +G GL   +S  NHSC P   + +  R A++  +     P  
Sbjct: 147 FKKISINGFSVSNSMGSAIGHGLCIKLSAANHSCKPLTRVCYRNRTAMLVPIDPYKPPTS 206

Query: 95  AE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
            E    SYI+   S   RQ  LK++Y F C C  C+      D + +  +E + C D  C
Sbjct: 207 LEGACHSYIDELQSVSARQSELKKKYNFICICEGCM------DDERNESMEAWACGD--C 258

Query: 154 SGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
           +   + + +D    C  C   + R   E+ + A E  I +K  +   S           K
Sbjct: 259 ANGKVPNVEDG--RCASCSWKMTRDHYELCRTAEESAIAAKPKIENESISLETRKALCEK 316

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 271
           ++E     L H F+V+ +  +  L    + ++D   A  + +  + +  R   +  P++ 
Sbjct: 317 LMELFDDTL-HEFNVHRVVPQRLLFFASLAVQDVNGASKHGKALLAMMMRYQGKHDPIVM 375

Query: 272 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 324
           L+ +   ++    G+    +K + E  E L+  +   +P    +   +E A++
Sbjct: 376 LKKWQIAQICHANGEYNKVVKLLEEIKEPLKKIYTAGAPISVNIRRLMEMARS 428


>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 433

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 109
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 324 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 382

Query: 110 TRQKALKEQYLFTCTCPRC 128
           TR+K L   YLF C CP+C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401


>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
           NIH/UT8656]
          Length = 550

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)

Query: 39  KLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLAVVRA 87
           +L+C    N+ T+ N+    +G  L+P  +++NHSC PNA + F+       G ++V  A
Sbjct: 212 RLSCAVIVNSFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSISV-HA 270

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           ++ + K  E+ ISYI+T      RQ+ L E+Y FTC C  C
Sbjct: 271 LRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLC 311


>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
          Length = 481

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 57/104 (54%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           ++S   I+   +KL  N+  + N+    LG  L+P  +I NHSC  NA + F+G    ++
Sbjct: 168 DMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIK 227

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           A++ + K  ++ I+YI+       RQ  L+ +Y FTC C +C +
Sbjct: 228 ALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCSR 271


>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 697

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G G++P +++ NHSC P  V  F G   VVRA++ +P GAE+  +Y  I T      R+
Sbjct: 499 IGGGVFPTVALFNHSCNPGVVRYFIGNTMVVRAIKTIPAGAEISENYGPIFTEEEENDRK 558

Query: 113 KALKEQYLFTCTCPRC 128
           + L+ QY F C C  C
Sbjct: 559 RKLRLQYWFDCDCEAC 574


>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
 gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
 gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
          Length = 393

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQGEEICISYLDECMLERSRHS 345

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363


>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 425

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 110/262 (41%), Gaps = 50/262 (19%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           PE +I+      +KL  N+  +  +    +G  L+P  ++INHSC  NAV+ F+G     
Sbjct: 140 PEDTISAFG---AKLDVNSFNMTTALADRIGLYLHPYAALINHSCAYNAVIGFDGAELFA 196

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
            A++ + +  ++ ISY++       R+  L+E+Y F C C +C    + D          
Sbjct: 197 TALRPITRDEQIFISYVDATNPVAVRRNELRERYFFDCRCAKCA--AEMD---------- 244

Query: 146 YRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGN 202
                DG       D+  K +T  +      V   E ++ + S +  L +   A      
Sbjct: 245 ---APDGGRAPAAADAQRKAYTLMEVATGAGVDPAEAVRALESAMRTLRESEGA------ 295

Query: 203 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ-- 260
                 ++++ E+            L+  R++LI  L+    ++   A+ Q  I V +  
Sbjct: 296 ------SWRITEQ-----------PLVSLRDELIASLLGAGRFQ--TAFVQAAIRVVRVD 336

Query: 261 -RVYP-QFHPLLGLQYYTCGKL 280
             VYP + HP+  L  +T  +L
Sbjct: 337 CVVYPVRGHPIRQLHAWTLARL 358


>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
          Length = 661

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 84/174 (48%), Gaps = 14/174 (8%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G  +YP +++ NHSC P  V  F G   VV AV+++ KG EV  +Y  I T      RQ
Sbjct: 464 IGGAVYPTLALFNHSCEPGIVRYFCGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQ 523

Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQ 170
             LKEQY F C C  C +   +++D+ E+ +   ++C  D  CS  +    D   F   Q
Sbjct: 524 SQLKEQYWFDCKCLPCEQNWPKYEDMTENYLR--FKCDSDQPCSNVIPVPYDCMEFMV-Q 580

Query: 171 CGLVRSKEEI-KKIAS--EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 221
           CGL +    I K + S  +   L K   A    G + E +   K IE L  KLY
Sbjct: 581 CGLCQQYTNILKGLKSLQDTETLYKLGRAAMGEGKYGEAIK--KFIETL--KLY 630


>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
 gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
          Length = 633

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 67/131 (51%), Gaps = 12/131 (9%)

Query: 13  AQIANLVNLILQW----PEISINEIAENFSKLAC-NAH---TICNSE---LRPLGTGLYP 61
           +QI  +++L   W     E+ I E A     + C NA     +   E   +R   TGL+ 
Sbjct: 430 SQIEKVMDLFGDWRSIGKEVLIGEDAFQVHDVVCRNAFGPGAVIAGEAENIRKASTGLWL 489

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
           + + INHSCLPNA   F G L VVRA +++  G+E+  SY +++     RQ+AL + + F
Sbjct: 490 MAAYINHSCLPNAEKSFLGDLMVVRATRNITAGSEITHSY-DSSSDYDARQEALMKTWGF 548

Query: 122 TCTCPRCIKLG 132
            C C  C   G
Sbjct: 549 RCRCELCKADG 559


>gi|332025390|gb|EGI65557.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 401

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 7/124 (5%)

Query: 53  RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 108
           RP  LG  +YP ++  NH C P     F GR  V+RA++ +  G  +  +Y  I T  + 
Sbjct: 200 RPVYLGVAIYPSVARFNHDCYPAVTRYFIGRHIVIRAIRGLGPGDVIAENYGPIFTKRTL 259

Query: 109 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
             RQ+ L  +Y F CTC  C +    F+++   ++    RC   GC G  LR    K   
Sbjct: 260 AERQRTLTGRYWFQCTCKACQEDWPCFENLTNDSV--KLRCPTVGCGGLHLRSRQGKPIK 317

Query: 168 CQQC 171
           C  C
Sbjct: 318 CPDC 321


>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
           TFB-10046 SS5]
          Length = 451

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 3/75 (4%)

Query: 57  TGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETA-GSTMTRQK 113
           +G++P  S ++HSC PNA+  ++ G + + +RA +H+ +GAEV ISY+      T  RQ+
Sbjct: 244 SGVFPTFSRVSHSCCPNAIPRWDSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQE 303

Query: 114 ALKEQYLFTCTCPRC 128
            L++ Y F CTCP C
Sbjct: 304 FLRDGYHFECTCPAC 318


>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
           [Megachile rotundata]
          Length = 523

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 87/195 (44%), Gaps = 17/195 (8%)

Query: 4   IDEKQLLLYAQIANLVNLILQW---PEISINEIAENFSKLACNAHTICNSELRPLGTGLY 60
           I  K+  ++ + +NLVN I ++    +I   +I +    L  N H +  ++  P    +Y
Sbjct: 177 IKNKETFIFEESSNLVNFIKRFFKSEDIPEEDIRKIIGVLQVNGHEVPLTD--PPYVAVY 234

Query: 61  PVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
            + S++ H+C  N    F   G L ++ A   + KG  + I Y +    TM R+  L E 
Sbjct: 235 ELASLLEHNCKANCSKSFTDTGGL-IIHAATPITKGDHISICYTDPLWGTMNRRHHLLET 293

Query: 119 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQCGLV 174
             F CTC RC    +F      A+    RC +  CSG++L  +      + + C  C  V
Sbjct: 294 KYFECTCDRCRDPTEF-----GAMFNAIRCSEIDCSGYVLPKTFLGDHREDYVCTNCTTV 348

Query: 175 RSKEEIKKIASEVNI 189
              E I++   ++ I
Sbjct: 349 VPLEIIEETLEDIGI 363


>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
 gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
           commune H4-8]
          Length = 360

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 57  TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
            G++   S INHSC PNAV  F+      VVRA++H+ KG E+ ISY ET     TR++ 
Sbjct: 174 VGVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAATRRQN 233

Query: 115 LKEQYLFTCTCPRC 128
             + Y F C CP C
Sbjct: 234 SLQDYGFRCACPAC 247


>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
           Shintoku]
          Length = 150

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 46  TICNSELRP--LGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYI 102
           TI   EL P  +G GL+  +S +NHSC PN  + + +  +A +  +  VP+G E  ISYI
Sbjct: 56  TIQKKELYPETMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYI 115

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 132
           +   S   RQ+ L + Y F C C +CI  G
Sbjct: 116 DEKDSVENRQEKLYKNYGFKCDCNKCILEG 145


>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
 gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
          Length = 660

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARLKELYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAIIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYLKALAKFIDLIRIMYEVLAPPF 636


>gi|330796724|ref|XP_003286415.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
 gi|325083610|gb|EGC37058.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
          Length = 499

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 6/79 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE-----GRLA-VVRAVQHVPKGAEVLISYIETAGSTM 109
           G GL+P+ S +NHSCLPN  +  E      R+  VV+A + +P G+E+L SY +    T 
Sbjct: 419 GVGLFPIFSCMNHSCLPNIEISNEIQDGVDRVKMVVKAKKLIPAGSEILHSYCDENLPTK 478

Query: 110 TRQKALKEQYLFTCTCPRC 128
            R++ L  QY F C CP+C
Sbjct: 479 ERKQLLLSQYGFKCLCPKC 497


>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
          Length = 739

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 64/295 (21%)

Query: 49  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAG 106
           N  +R L T ++PV+S++NHSC PN  + F GR   VRA + + +  EV   Y   +   
Sbjct: 465 NKSIR-LATAVFPVLSLLNHSCDPNTTVSFTGRFVTVRANRPIRRDEEVTHCYGPHKLRM 523

Query: 107 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 166
               RQ+ LK+QY F C C  C +                           L+     GF
Sbjct: 524 DVAERQQLLKDQYFFVCQCKACTE--------------------------ELKGKKTHGF 557

Query: 167 TCQQCGLVRSKEEI--------KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 218
            C  C      EE            AS+  + S+    L   GNH ++        +LQ 
Sbjct: 558 FCPLCKAQLEGEEALYCTGARCTYTASQTQLTSR----LNQLGNHIQIAKV-----QLQD 608

Query: 219 KLYHPFSVNLM-----------QTREKLIKILMELE-------DWKEALAYCQLTIPVYQ 260
                  + LM           Q    L +I+  L        +WK A  + + +I + +
Sbjct: 609 NKTDNAKMTLMSCLSEAECFLSQDHLLLGEIMDHLAQAEASDGNWKAAAGHLRKSISIVK 668

Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
             Y      LG + +   ++ +   +  +A++++ EA ++L + +G +   ++EL
Sbjct: 669 VHYGSSSMELGHELFKLAQILFNGFEVPDAMRTIMEAQKVLSMHYGPDHNLVQEL 723


>gi|341883792|gb|EGT39727.1| CBN-SET-18 protein [Caenorhabditis brenneri]
          Length = 499

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 64/314 (20%), Positives = 127/314 (40%), Gaps = 26/314 (8%)

Query: 5   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 64
           DE+++  + ++   + L  +   I   E  + F K + N+H+I  +    +G  L   +S
Sbjct: 146 DEEKMKHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGNEVGMALDLGVS 205

Query: 65  IINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQY 119
             NHSC P   +VF+G    +R +  VP        +  ISYI+   S   R++ LK  +
Sbjct: 206 KYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSKFVRRRDLK--W 261

Query: 120 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSK 177
            F C C RC+         E   L   RC +  C   +L    ++     C +C  +  +
Sbjct: 262 YFNCECTRCMD-------PEDDALTAIRCANPSCDAPILTSETEEPMNIACDKCKTIVEE 314

Query: 178 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 237
            ++K     +        +L +  + +      + +    +++ HP +V + + R  L  
Sbjct: 315 NDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNVYVARLRTALFH 367

Query: 238 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 297
           +   L     +  + Q+    Y+  +P+    +G Q     K      + + A+    +A
Sbjct: 368 VTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDA 426

Query: 298 VEILRITHGTNSPF 311
           + I  +  G + P+
Sbjct: 427 MNIFEVCFGLDHPY 440


>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
 gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
          Length = 392

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
           I E ++K+   A    N+E    G+GLY + S INHSC+PN+   F     + V++A+  
Sbjct: 267 IDEIYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNSCSTFPYSNDIVVLKALTP 322

Query: 91  VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
           + +G E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IQEGDEICISYLDECQLERSRHSRHKVLRENYIFVCQCPKC 363


>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
           1558]
          Length = 660

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 11/129 (8%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EG----RLA 83
           S   I +  S    N  +  +  L  +G  L P +++ NHSC PNAV+VF EG    R+ 
Sbjct: 355 STGSILQVISSFMINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEGADSMRIV 414

Query: 84  VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQES 140
            ++A++    G EVL  Y++ A     RQ  L+  Y F C CP C    K G    +   
Sbjct: 415 AIKAIE---AGEEVLTHYVDLALPYAQRQAELRRTYHFECKCPTCDNRTKDGDKPVLDPR 471

Query: 141 AILEGYRCK 149
           +IL    CK
Sbjct: 472 SILIHRECK 480


>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
 gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
          Length = 660

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + +  +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALPKFIDLIRIMYEVLAPPF 636


>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
 gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
          Length = 660

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    V +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHVNSVRPIEAGLPINENYGPMYTQDGRPERQARLKELYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC   + C+  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 636


>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
 gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
          Length = 701

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 24/135 (17%)

Query: 18  LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 77
           L + +     +  +E+ +   K   N+H + + +LR LGTG+YP  S+ NHSC PNA + 
Sbjct: 246 LASAVRHAAALDPSEVEDMLGKTRENSHGVVDWKLRQLGTGIYPEASMFNHSCAPNAAVS 305

Query: 78  F----------------EGRLAVVRAVQHVP--------KGAEVLISYIETAGSTMTRQK 113
           F                E R   V    +VP        KG EV I+Y E      +R+ 
Sbjct: 306 FGAGGTLNTCCIGVDDYEARAMEVGPKGYVPLGRPRAIRKGEEVCIAYTELYRPAASRRL 365

Query: 114 ALKEQYLFTCTCPRC 128
            L++   F C C RC
Sbjct: 366 TLEKSKGFVCECVRC 380


>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
          Length = 391

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)

Query: 53  RPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
           + +G+GLY   S  NHSC PN   V +  G   V+ A++ + +G E+  +YI+ + S   
Sbjct: 268 KSIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEA 327

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
           R K LKE Y F C CP C      +       L+ Y CK   C+G +
Sbjct: 328 RIKKLKESYFFNCQCPGCK-----NAPSHKQFLDKYLCKVKQCNGVI 369


>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
          Length = 431

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 46/80 (57%), Gaps = 9/80 (11%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-----GRLA----VVRAVQHVPKGAEVLISYIETAGST 108
           G Y  ++  NHSC PNA    +      RL     V+RAV  + +G EV ISYIE   +T
Sbjct: 174 GFYLRMAFCNHSCRPNACQYIDPNSTRARLNSPSIVLRAVSDIAEGEEVCISYIELMDTT 233

Query: 109 MTRQKALKEQYLFTCTCPRC 128
             R++AL+E Y FTC CPRC
Sbjct: 234 PERREALQELYYFTCQCPRC 253


>gi|324507181|gb|ADY43048.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 614

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 67/296 (22%), Positives = 127/296 (42%), Gaps = 21/296 (7%)

Query: 43  NAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 98
           N H I +   L  +G GLY  +   +HSC PN +   +G +A +RA+    ++       
Sbjct: 231 NRHAISDDGYLEEIGKGLYLDLCAYDHSCRPNTIYTCKGFVATLRALDLTVNLMDRTTTF 290

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF-- 156
            SYI+   +T  R+K L++ + F C C RC+     D +  S      R K +    F  
Sbjct: 291 YSYIDLLSTTQERKKLLRDTWYFDCQCVRCV--DNDDHVLSSMFCPSCREKPERLCIFGE 348

Query: 157 -LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS--TYKMI 213
              ++   +  TC +C +   KE + +    +  + +    + +    Q++     ++ +
Sbjct: 349 SPYKNPVTQILTCPKCHIEVPKERVLEAVDAMRFIDR----IVAKNEIQQMAKEQAFRFL 404

Query: 214 EKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV---YPQFHPL 269
           + ++ +      SVN+     K+I++L+ L D  ++     L +   + V   +P  HP 
Sbjct: 405 KGIKDRFSKILPSVNVYMC--KIIQLLIPLIDPTDSKTLLSLHLESEECVRFCFPPNHPA 462

Query: 270 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
           + +   + G     +G    A   +  A +IL+ T G   P   + +  LEEA  E
Sbjct: 463 VAVHLSSIGLFYLRIGHPHRAELYLEMACDILKFTLGHTHPLTVDKLSILEEAHRE 518


>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
           206040]
          Length = 742

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TG++   S  NHSCLPNA   F G + +VRA++ +P G E+L+ Y+        RQ+ ++
Sbjct: 501 TGIWLQASYANHSCLPNATRAFIGDMMIVRALREIPVGGEILMQYVPQDKPFDKRQEVVE 560

Query: 117 EQYLFTCTCPRC 128
             Y F C C  C
Sbjct: 561 NHYGFKCDCDLC 572


>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
 gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
          Length = 402

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 12  YAQIANLVNLILQ---WPEISINEIAENFSKLACNAHTI------CNSELRPLGTGLYPV 62
           Y+ I   + LI      P ISI  + +   +   NA  I       + E    G G+YP 
Sbjct: 271 YSNIYKFIRLIAPEEYLPFISIQAVRDIIGRNLTNAFGIWSPVTSSDEEREYFGFGVYPS 330

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLF 121
            S  NHSC PN   +  G      A + +P GAE+ ISY I  +     RQ AL+E + F
Sbjct: 331 ASFFNHSCKPNVTKIRRGASYEYIAKEDIPVGAELCISYGIRESDGLKQRQDALRE-WFF 389

Query: 122 TCTCPRCIKLGQ 133
            C C RC   G+
Sbjct: 390 ECGCSRCTTEGK 401


>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
          Length = 598

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 127/300 (42%), Gaps = 28/300 (9%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           P +S  + A   S L  N+H +        G+GL+ +  ++ HSC PN      G    V
Sbjct: 257 PTVSDEDAARMLSILNTNSHELGEWG----GSGLFLLGCLMEHSCAPNCNFSTYGTQLWV 312

Query: 86  RAVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
            A+  + KG  + I Y + T   T  R+K LK+ Y F C CPRC  L    D+  + I  
Sbjct: 313 TALNSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTL---PDVMRAFICP 369

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 204
             +CK   C   L   S    + C++C     K ++ K + +  +  +K L +     H 
Sbjct: 370 VKQCKGKVCP--LGEGSKPSDWVCEKC-----KSQLDKKSRKRFLEIEKAL-MEEGPEHD 421

Query: 205 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----KEALAYCQLTIPVYQ 260
               T+ ++ +  ++L   ++ +     E+ +      E W    K A     + IP+  
Sbjct: 422 VFHETHHIVLEALRELGSVYAKD-----EEAVADGSAEEIWLRVLKGADQARSILIPL-- 474

Query: 261 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 320
            V P+FHP   + +    +L     +   A +++  + E+ R   G  +P  ++ +  +E
Sbjct: 475 -VLPKFHPTKAIWWDNVAQLRLSRKNQAGAQEALKRSYEVTRRCCGDATPSTQDALRLVE 533


>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 413

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 8/106 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
           G GLYP+ S+  HSC PNA   F   +  L++V A++ +  G EV +SYI+      S  
Sbjct: 309 GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLV-ALRDIRPGEEVTVSYIDECSLSRSRH 367

Query: 110 TRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCS 154
           +R K L+E +LFTCTCPRC +   Q  D+     +E     ++ CS
Sbjct: 368 SRIKMLRESHLFTCTCPRCEEEATQQPDVTSDEEMEDSSDAEEACS 413


>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
 gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
          Length = 597

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAV 88
           S+ A N++++   +L P+G  + P++S++NHSC PNA  VF         +  L  V A+
Sbjct: 252 SRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTPSTDEPLISVVAI 311

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + +    ++  SYI+T      R++ L+E Y F C C  C
Sbjct: 312 RDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLC 351


>gi|159472132|ref|XP_001694205.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276868|gb|EDP02638.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 468

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 115
           GL+   S++NHSC PNAV    G    V A  HV +G+E+LI+Y+  A       RQ  L
Sbjct: 139 GLWHSFSMLNHSCCPNAVNYVVGDSMCVFAAGHVEQGSELLINYLGAASLRPVEERQAGL 198

Query: 116 KEQYLFTCTCPRC 128
            E Y F+C CPRC
Sbjct: 199 AEAYAFSCGCPRC 211


>gi|452988674|gb|EME88429.1| hypothetical protein MYCFIDRAFT_43814 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 334

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 8/100 (8%)

Query: 36  NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA--VVRAV 88
           NF         I  S   PLG      G++P++S +NHSCLPNA   +  RL   +V  +
Sbjct: 89  NFPGSPTPFSNIVRSNGYPLGPSSSIGGIFPLVSRLNHSCLPNAQHAYNERLGKMLVHII 148

Query: 89  QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + +    E+ +SYI   G +  RQ  LK  +LFTCTC  C
Sbjct: 149 RPILPNEEITLSYI-PGGPSPQRQTELKSNFLFTCTCTLC 187


>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
          Length = 636

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
            G GLYP  ++ NHSC PN        G L VV AV+ VP G+++ ISY++       RQ
Sbjct: 228 FGVGLYPAGAMFNHSCRPNCSWRTNRSGELCVV-AVEDVPAGSQLFISYVDILQPWPVRQ 286

Query: 113 KALKEQYLFTCTCPRCIK 130
             L+  + F C CPRC +
Sbjct: 287 DLLRCHFFFECACPRCCR 304


>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
 gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
          Length = 517

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 11/163 (6%)

Query: 13  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           A I   +    +  + +  +I      L  N H I  S+  P   G++   S   +SCLP
Sbjct: 189 ASIGQFIPKFFKTQKFTEEQIMRAVGALQINGHEIPTSD--PPHVGVFYTASFTENSCLP 246

Query: 73  NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           N    F +    ++ A Q + K A + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 247 NLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCADV 306

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQC 171
            + D           +C+D  C G +L    D+    + C++C
Sbjct: 307 TELD-----TFYSALKCEDRQCGGLMLPSKSDEWNGSWRCREC 344


>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
 gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
          Length = 393

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQGDEICISYLDECMLERSRHS 345

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363


>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
          Length = 397

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 62/137 (45%), Gaps = 9/137 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYPV SI+NH+C+PN    F G L + V+A + +  G+E+   Y      T  R+  L 
Sbjct: 209 GLYPVASILNHNCVPNTRNCFNGDLQMTVKATKTINAGSEIFTCYSGMLWGTPARRLYLY 268

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGL 173
           +   F C C RC      D  +   +L   +C    C G LL     K    + C +CG+
Sbjct: 269 KSKHFLCDCERCA-----DPTERGTLLAALKCFSTECQGSLLPIQPLKTTTAWRCLECGM 323

Query: 174 VRSKEEIKKIASEVNIL 190
               + I  I S +  L
Sbjct: 324 RVPNDNICVIQSALGSL 340


>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
 gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
          Length = 646

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 94/216 (43%), Gaps = 26/216 (12%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 189
              +FDD+    I   +RC           D+ +    C  CG + +  +  K+  +  +
Sbjct: 540 NWPKFDDLPRDVI--RFRC-----------DAPNNCSACVTCGEITNILKGLKVMQDTEM 586

Query: 190 LSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
           +++    L   G + + ++ +  +I  + + L  PF
Sbjct: 587 MTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 622


>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 519

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 12/167 (7%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV- 85
           + S +EIA     +  N H +  SE  P+   +Y   S + HSC  N    F     +V 
Sbjct: 198 DFSEDEIARAAGIMQINGHEVPISE--PVLIAVYDESSYVEHSCRANCSKSFTSSGGIVI 255

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 145
           RA   + KG  ++I Y +    T  R+  L     F CTCPRC    +F      ++   
Sbjct: 256 RAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFFECTCPRCSDPAEF-----GSMFNA 310

Query: 146 YRCKDDGCSGFLLRDS----DDKGFTCQQCGLVRSKEEIKKIASEVN 188
             C    CSG +L  S    +   + C +C   RS   + ++  ++ 
Sbjct: 311 IECSKSDCSGCMLPKSFTVQEIPDYRCNKCDNSRSSNSVDEMLEQIG 357


>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 673

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 83/178 (46%), Gaps = 18/178 (10%)

Query: 36  NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
           N   + CNA+ I  +         E R +G  +YP IS+INHSC PN V   +     V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRHIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509

Query: 86  RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
           R ++ + KG E+L  Y     +   ++R + L ++Y F C C  C +  QF  + E+   
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYRFLCACEACTQNWQF-PLPETM-- 566

Query: 144 EGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 199
             Y+CK   +  S   L +   +  + +QC     K ++KKI ++     +K L   S
Sbjct: 567 -NYKCKMCSEIISTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623


>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 681

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 17/144 (11%)

Query: 43  NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           NAH +  + L         RPL  G  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 454 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 513

Query: 92  PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 148
             G  V  +Y  I T  S   R++ L  +Y F C C  C +   +F+ +    +    RC
Sbjct: 514 RAGDAVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 571

Query: 149 KDDGCSGFLLRDSD-DKGFTCQQC 171
             +GCS    R  D D+   C  C
Sbjct: 572 PTEGCSKLHPRPRDPDRSIECSSC 595


>gi|149238794|ref|XP_001525273.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450766|gb|EDK45022.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 619

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 35  ENFSKLA-CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP- 92
           ENF  L   N +T+ + + R +G  L P  S+INHSC+PN+  V E      R +  +P 
Sbjct: 213 ENFVNLIFINCNTLVDYKFRIVGYNLDPQFSMINHSCMPNSCQV-ETEYGCYRLINTLPI 271

Query: 93  -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
             G E+ ++YI +   T  RQ  L  +Y F C CP C
Sbjct: 272 NAGDEITVNYIASGIPTELRQVQLFSRYFFRCKCPLC 308


>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
 gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
          Length = 775

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 66/311 (21%), Positives = 120/311 (38%), Gaps = 46/311 (14%)

Query: 48  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG- 106
           CN E   +G  +Y   S+ NHSC PN    F  R   +RA   +  G  + +SY    G 
Sbjct: 464 CNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRATVFMAVGCPLELSYGPQVGQ 523

Query: 107 -STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 165
                R + LK++Y F C C  C  +   D      ++  + C +  C G +L   D   
Sbjct: 524 LDCKDRLQLLKDEYSFNCQCSGCSTVHISD-----LVINAFCCINPNCRGVVL---DRSI 575

Query: 166 FTCQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK----- 215
           F+C+         V  +  ++      + L         CG++ ++ S+   ++K     
Sbjct: 576 FSCENTKTKDFLTVNDQMILEPFMQTDSFLHAGPSHCLKCGSYCDIKSSRLTVDKAGIHF 635

Query: 216 --LQ-----------------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALA 250
             LQ                       K   H ++  + +  + L +    L   + A  
Sbjct: 636 TRLQQEINLNRVSETTVSDALGALISLKSTLHEYNRRIAEAEDNLSQAFSLLGKLELAAE 695

Query: 251 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 310
           +C+ +I + +++Y + H  +G +      +   +GD  NA+  +    +I R  +G+N  
Sbjct: 696 HCKASIRILEKLYGENHIAIGNELSKLSSILISVGD-HNAVDCIKRLSKIFRCYYGSNVN 754

Query: 311 FMKELILKLEE 321
            M   +  LEE
Sbjct: 755 TMFPFLNILEE 765


>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 652

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 21/133 (15%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISY 101
           L P+G  + P +++ NHSC PNA LVF          E  L++V A+++V  G E+ ISY
Sbjct: 314 LTPIGICVSPTVALANHSCDPNAALVFPRADGGSRVKEPLLSLV-ALRNVAPGKEIRISY 372

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG---FLL 158
           ++T      RQ  LKE Y F+C C  C +    D        E   C    C G   +  
Sbjct: 373 VDTTLPNRLRQNELKEVYSFSCQCKLCCRTTAADP------REALWCPKS-CGGICPYPT 425

Query: 159 RDSDDKGFTCQQC 171
            +S D+   C +C
Sbjct: 426 EESGDRATHCVKC 438


>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
 gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
          Length = 343

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLY + S INHSC PNA + F     A +R ++ +  G E+ ISY +T+   + R+  L 
Sbjct: 260 GLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLF 319

Query: 117 EQYLFTCTCPRCI 129
           E Y F C CP+C+
Sbjct: 320 ENYGFNCECPKCL 332


>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
           rotundata]
          Length = 388

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 52/86 (60%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSAVNHSCIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           RQKAL   YLF C C +C  L Q +D
Sbjct: 347 RQKALSSLYLFICCCSKC--LSQTND 370


>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 735

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
           + TG+YP  S++NHSC PN + +F G+  +VRA + + +   +   Y       +T  RQ
Sbjct: 456 VATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIDQSEGIFNCYGPHYRHMTTEDRQ 515

Query: 113 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
           K LK QY FTC C  C   KL  F  I+    ++  +C    C+   ++DS      C  
Sbjct: 516 KILKSQYCFTCKCKACTLPKLQYF--IERFNAVKCLKCNGPVCN---IKDS----IYCLD 566

Query: 171 CG 172
           CG
Sbjct: 567 CG 568


>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
          Length = 338

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA + F        V+A+ ++  G E+ ISY++      S  +
Sbjct: 234 GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTGEEICISYLDECALERSRHS 293

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E YLF C C +C
Sbjct: 294 RHKILRENYLFNCNCSKC 311


>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Oreochromis niloticus]
          Length = 725

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 74/322 (22%), Positives = 128/322 (39%), Gaps = 57/322 (17%)

Query: 41  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 78
           A N+    N E+R + T ++P +S++NHSC PN  LVF                      
Sbjct: 405 ATNSLVQSNREIR-IATAVFPALSLLNHSCCPNTSLVFNTGGIIEPSGSALSADFSEGEA 463

Query: 79  ------EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 130
                 +G    VRA + +  G E+L  Y   +   +T  R++ L+EQY F C C  C +
Sbjct: 464 EWRHKAQGVTVTVRAAKLITPGQEILHCYGPHSSRMATQERRRLLQEQYYFLCQCEACSQ 523

Query: 131 LGQFDDIQESAILE-GYRCKDDG-----CSGFLLRDSDDK--GFTC--QQCGLVRSKEEI 180
             Q  D++E    + G   +D G     C G L + S+D+  GF C    C    S  ++
Sbjct: 524 HQQ--DVEEPRQQQPGAEGRDLGLLCIKCKGSLKKRSEDREAGFVCSLSSCSHCMSLSDV 581

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR-------E 233
                E+ +  +K + L       E +       +L KK      V L +T        +
Sbjct: 582 NNRLQEIRVNLEKAVELMETDRPGEAL-------RLLKKTLCQSGVILAETHPLQGELAD 634

Query: 234 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 293
              +    + DW  A ++ + +       + +    LG Q +   +L +  G    A+  
Sbjct: 635 ATARAYASMGDWNNAASHLERSTVAIASQFGKESVELGRQLFKLAQLHFNSGSRGPALSV 694

Query: 294 MTEAVEILRITHGTNSPFMKEL 315
           + +   +L +  G +   ++EL
Sbjct: 695 IPKVRRLLCLHCGPHCHELQEL 716


>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 46/87 (52%), Gaps = 14/87 (16%)

Query: 58  GLYPVISIINHSCLPNAVLVF-------------EGR-LAVVRAVQHVPKGAEVLISYIE 103
           GLYP+ ++INHSC PNAV VF             +GR + +V A   + KG E+  SY+ 
Sbjct: 244 GLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWSYLP 303

Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIK 130
                  R+  L  ++ FTC C RCIK
Sbjct: 304 PCSPFNERRAVLSSKFGFTCQCVRCIK 330


>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 628

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 65/300 (21%), Positives = 124/300 (41%), Gaps = 28/300 (9%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
            +K+A +   +C  +   + T +YP  S++NHSC PN +  F  +  +VRA++ +  G E
Sbjct: 326 LNKIASDKDKLCVEQQDRIATAIYPSASMMNHSCDPNIITSFVDQYLIVRAMKDIQAGEE 385

Query: 97  VLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
           V   Y       S   RQ+ LK QY F C C  C+ + ++++  E       +C +  C 
Sbjct: 386 VSNCYGPNFRRMSREHRQEILKNQYGFECRCDPCV-MPEYENFMER--FHAIKCPE--CG 440

Query: 155 GFLLRDSDDKGFTCQQCG---LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 211
           G  L +       C  CG    +  + E+K   +  +               +E V   K
Sbjct: 441 G-ALEEFHSYSMHCLDCGATPYINCQIELKNAVTAFDAAQ----IYIELEREEEAVVKLK 495

Query: 212 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY-------- 263
               +++ L + +  +L  T + L K+   +  W E++++ + ++   +  Y        
Sbjct: 496 ECLSIRRNLLYKYHQDLTATLDILGKVYAIMGRWLESISHIEHSMAAIEERYGCSSMEVA 555

Query: 264 PQFHPLLG-----LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 318
            + + L       LQ       +W+    +   + +  A EI+  T+G      +E+I K
Sbjct: 556 NELNKLTDICIRYLQEEPNRTTKWYKNVLKKTRRYLDRAEEIVNFTYGPWHEIHQEIIEK 615


>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
 gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
 gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
          Length = 567

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 334 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 386

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LK+ Y F C+C  CI 
Sbjct: 387 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 446

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 447 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 504

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 505 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 543


>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 2/101 (1%)

Query: 37  FSKLACNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 94
           F+ +  N   IC+ E     +G+ +Y   +++NHSC PNA  VF     + +A++ +  G
Sbjct: 30  FAAIKNNQFAICDEESGDYDVGSAVYIDHALVNHSCRPNAYPVFNKTNMIFKALRKIEPG 89

Query: 95  AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
            E+  +Y +T      R++ L + + F C CP C K  + D
Sbjct: 90  EEITHAYTDTISPIQERREYLNDVWRFMCNCPGCTKSNEID 130


>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 419

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
 gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
          Length = 660

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LK+ Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 636


>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 524

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           G+GLY   +  NHSC PNA +  E     A +R++  +P G ++ ISYIE    + TRQ 
Sbjct: 444 GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQIEISYIEEDQPSQTRQS 503

Query: 114 ALKEQYLFTCTCPRC 128
            L ++Y F C C +C
Sbjct: 504 ELIDKYKFKCHCQKC 518


>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
 gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
          Length = 515

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)

Query: 13  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           A I   +    +  + S  EI      L  N H +  S+  P    ++   S   +SC+P
Sbjct: 187 ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTASFTENSCVP 244

Query: 73  NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           N    F +    ++ A + + KGA + I Y +    T  RQ+ L +  LF CTC RC+ +
Sbjct: 245 NLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERCVDV 304

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
            +             +C+D  C+G LL    D    C +C
Sbjct: 305 TEL-----GTYYSALKCEDRKCTGLLLPIKADDWNGCWRC 339


>gi|328770867|gb|EGF80908.1| hypothetical protein BATDEDRAFT_88247 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 428

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 47/265 (17%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQK 113
           G+G+YP  S+INH                 RA +++ KG  + ISY E+AG ++  TRQ+
Sbjct: 153 GSGIYPTASLINH-----------------RASKNIRKGNAITISYGESAGRSLRATRQQ 195

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 173
            LK  + F C C  CI      D Q    L G +C   GC  + L + + +   C +C  
Sbjct: 196 NLKAGWGFECRCSACISEDDTKDRQ----LAGLKCP--GCP-YPLYEGEPR---CPKC-- 243

Query: 174 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ------KKLYHPFSVN 227
                 I  I+ +  +++K       C +H++  +   +I+ LQ      + + HP S  
Sbjct: 244 ----HSIVDISRQQKVMAKIDAVWHECQDHRKHDNVASLIQTLQGIQTDLESIMHPSSSR 299

Query: 228 LMQTREKLIKILMELE---DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-CGKLEWF 283
           L    + L  + ++L    D +  L++C+  +    R+      L  ++      K+ +F
Sbjct: 300 LGCMYDNLASLNVQLAKQGDNRILLSWCEKLLESSIRIVEGIFGLTSIEAANEHNKMAYF 359

Query: 284 LG--DTENAIKSMTEAVEILRITHG 306
           +   D E  IK    A  + R  +G
Sbjct: 360 MMAVDAEKGIKIARHAEGLFRTLYG 384


>gi|50552992|ref|XP_503906.1| YALI0E13552p [Yarrowia lipolytica]
 gi|49649775|emb|CAG79499.1| YALI0E13552p [Yarrowia lipolytica CLIB122]
          Length = 637

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 50/101 (49%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
           + +I E    L  N +T C+   R  G    P +++INHSC+PNA L+F GR   +   +
Sbjct: 179 LEQILEYAGMLFVNQYTRCDDLGRQGGYIFDPTLALINHSCVPNAYLLFRGRKVHLVCWK 238

Query: 90  HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
            +  G EV +SY      T  R+  L   + F C CP C+ 
Sbjct: 239 PINDGDEVFLSYTRFMHPTPERRTLLYMHFRFWCECPGCVS 279


>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
          Length = 697

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)

Query: 18  LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
              LIL W       E ++   A  F++L     CN+  + N++   +G  LYP  S  N
Sbjct: 520 FTKLILPWLGEGGSTEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579

Query: 68  HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
           HSC PN   V + G LA   A++ + KG  + I Y++    ST  R++ L   Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639

Query: 126 PRC 128
            RC
Sbjct: 640 ARC 642


>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
 gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
          Length = 518

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 103/242 (42%), Gaps = 35/242 (14%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-KGAEVL-ISYIETAGS 107
           S++  L  GLYP  +++ HSC+PN +L  +G  + VR     P K  E+L ISY  +   
Sbjct: 190 SQMGNLARGLYPKTALMMHSCMPNTLLSVDG-YSNVRVFTSAPVKMGEILNISYTRSLFG 248

Query: 108 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKG 165
           T  RQ  L++   F C+C RC      D  +    L   RC   DDG   F     D+  
Sbjct: 249 TYDRQTHLRKGKYFICSCRRCK-----DPTELGTHLSSIRCSECDDGLCSFY---PDEPR 300

Query: 166 FTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL--Y 221
           + C +C  +  +E + +I  A+  +I++             EV    ++I K  K L  +
Sbjct: 301 WECNKCHKLLKREYVNEILCAARDDIMACPL----------EVRDLERVISKHSKILNPH 350

Query: 222 HPFSVNLMQTREKLIKILMELED--------WKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
           H   +   QT    +K +    D         +     C+  +P+ + + P    L+G+ 
Sbjct: 351 HSLVLEAKQTLMGELKAICMSTDPTAVPKPVLRRKFELCEEILPILRTLEPGLSRLVGIA 410

Query: 274 YY 275
            Y
Sbjct: 411 MY 412


>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 654

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           LG  +Y +    NHSC PNA   F   G L V+RA   + +G E+ I Y    G+T  R+
Sbjct: 439 LGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTERR 498

Query: 113 KALKEQYLFTCTCPRCI 129
           + L +QY F C C  CI
Sbjct: 499 QHLYKQYRFNCDCRACI 515


>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
 gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
          Length = 392

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S +NHSC+PNA     +   + V++AV  + KG E+ ISY++      S  +
Sbjct: 286 GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHS 345

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E Y+F C C +C
Sbjct: 346 RHKILRENYIFVCQCSKC 363


>gi|414870819|tpg|DAA49376.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
          Length = 489

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 129/308 (41%), Gaps = 56/308 (18%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
           S++R +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G   
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229

Query: 110 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD---- 151
              RQK+L+E Y F+C C  C +L   D +  S          AI E   YR K++    
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSDLVMNSFCCPQSNCLGAISESTYYRSKENFVNV 289

Query: 152 -----------------------GCSGFLLRDSD-----DKG--FTCQQC-GLVRSKEEI 180
                                    +  LL++S      D G   +C+ C  L  +    
Sbjct: 290 SLGGSYVCKLSLPDVSKVDMDMENVAKSLLQNSGVSLNIDHGCCMSCRSCIDLSSALATS 349

Query: 181 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 240
            +  S ++ L K T       +   +    + +++L KKL HP+S  L Q  + + +   
Sbjct: 350 HREESTIDRLKKHTFL-----DKTLITEALQSLKQL-KKLRHPYSKALAQAEDTIAEAFA 403

Query: 241 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 300
           ++ D + A  +C+ +I + +++Y   H ++  +      +   LGD  +A  +  +A  I
Sbjct: 404 KVGDQELARKHCEASIQILEKLYHPKHIIIAHELIKLVSILLSLGDGASAAATFAQAEAI 463

Query: 301 LRITHGTN 308
             + +G++
Sbjct: 464 FSLYYGSH 471


>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
 gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
          Length = 785

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 7/118 (5%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           + H   N E   +G  +Y V S+ NHSC PN    F  R   +R  Q V  G ++ +SY 
Sbjct: 462 SVHLTSNVEQVRVGKAIYKVGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYG 521

Query: 103 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
              G      RQ  LK++Y F C C  C ++   D      +L  + C +  CSG +L
Sbjct: 522 PQVGLWDCKDRQSFLKDEYAFHCQCTGCSEVNLSD-----IVLNAFHCVNPNCSGAVL 574


>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 481

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 12/156 (7%)

Query: 9   LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL---GTGLYPVISI 65
           L++ A      + +LQ   + ++EI E   +L  N     ++  R     G+ LY   ++
Sbjct: 288 LIMEALSGRATSDVLQG--LDVDEIVEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAM 345

Query: 66  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
            NHSC P+ V  F+G+   +RA++ +  G E+ ++YI     T  RQ  L++ Y F C C
Sbjct: 346 FNHSCFPSVVRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRC 405

Query: 126 PRCIKLG-QFDDIQESAILEGYRCKDDG--CSGFLL 158
            RC   G + +   E  +L    C +D   C G L+
Sbjct: 406 QRCEARGAEVERYMEECLL----CPNDKRECDGLLV 437


>gi|116179288|ref|XP_001219493.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
 gi|88184569|gb|EAQ92037.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
          Length = 588

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 1/70 (1%)

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           L+P  ++INHSC+PN+   F G L V+RA +++ KG E++ SY E+ G    RQ+AL   
Sbjct: 425 LWPWAALINHSCIPNSEREFVGDLMVIRATKNIAKGEEIVHSYDES-GVYDDRQRALMTT 483

Query: 119 YLFTCTCPRC 128
           + F C+C  C
Sbjct: 484 WGFECSCALC 493


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           K+  NA    ++++  +G  L P +++ NHSC+PNA + F GR A++RA + +    E+ 
Sbjct: 169 KIQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADEEIE 228

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           ISY +       R++AL   Y FTC C RC
Sbjct: 229 ISYTDYNYPLSKRKEALAP-YFFTCECTRC 257


>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
          Length = 159

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 46  GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 105

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 106 RHKILRENYLFVCSCPKCL 124


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 15/148 (10%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            +Y   S++ H C+ NA+  F  +  +V RA   +PKG ++ + Y E    TM RQ+ L 
Sbjct: 231 AVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLS 290

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDS---DDKGFTCQQC 171
           +   F C C RC    + D         G+ C+      +G LL ++   D   + C+QC
Sbjct: 291 QTKFFQCVCERCKDPTELD-----TFTSGFYCQKCPQQSAGILLTENPLDDASDWICRQC 345

Query: 172 GLVR----SKEEIKKIASEVNILSKKTL 195
           G  +    + E I+ +  E+  L K ++
Sbjct: 346 GARQPANYAAEIIESVGKEIVALKKGSV 373


>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)

Query: 18  LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
              LIL W       E ++   A  F++L     CN+  + N++   +G  LYP  S  N
Sbjct: 520 FTKLILPWLGEGGSTEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579

Query: 68  HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
           HSC PN   V + G LA   A++ + KG  + I Y++    ST  R++ L   Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639

Query: 126 PRC 128
            RC
Sbjct: 640 ARC 642


>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 363

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GL+ + S  NHSC PN  + F   + AV+ A++ + KG E+ ISY ++      RQ  L 
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338

Query: 117 EQYLFTCTCPRCIK 130
           + Y F C CP+C +
Sbjct: 339 DNYGFNCDCPKCTQ 352


>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
          Length = 763

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 42/73 (57%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           G+GL+ + S INHSC  N+   F G L +VRA + + +G E+ I+Y+E       RQKA 
Sbjct: 515 GSGLWIMPSYINHSCWQNSTRSFLGDLLIVRAARDLSEGEEITINYMENESGVQKRQKAF 574

Query: 116 KEQYLFTCTCPRC 128
             ++ F C C  C
Sbjct: 575 LSEWGFECKCTMC 587


>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 662

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
           + TG+YP  S++NHSC PN + +F G+  +VRA + + +  E+   Y       +T  RQ
Sbjct: 377 IATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIGQSEEIFNCYGPHYRHMTTEDRQ 436

Query: 113 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
           K LK  Y FTC C  C   KL  F +   +A  +  +C    C+   ++DS      C  
Sbjct: 437 KILKSHYCFTCKCKACTLPKLQYFTERFNAA--KCLKCNGPVCN---IKDS----IYCLD 487

Query: 171 CG 172
           CG
Sbjct: 488 CG 489


>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 490

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ S+ NH C+PN    F+ +  + VRA   +  G E+ +SY      T  R++ L 
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
               F+C C RC    +F+  + SA+L    C  D CSG LL  +       + C +C +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNS-KLSALL----CASDKCSGELLPKNPLNMRTSWICNKCSI 328

Query: 174 VRSKEEIKKIAS 185
             +  +I  I S
Sbjct: 329 TVNHRQICSIRS 340


>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 576

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEG--RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           +G GL+  +S+ NHSC PNA + +    + A V A++ + +  +VL+SYI+      +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478

Query: 113 KALKEQYLFTCTCPRC 128
           + LK  Y FTC C RC
Sbjct: 479 RLLKRHYKFTCACVRC 494


>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
 gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
          Length = 400

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 50/96 (52%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 92
           + E + K+A N+  I +  +  +G G+YP  S+INHSC  N + +F G    +RA + + 
Sbjct: 170 MEEIYGKIASNSFAILDENMCSIGIGVYPQASMINHSCKSNCIGMFYGPQIQIRANEFIR 229

Query: 93  KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
            G ++   YI     T  RQ+ L + Y F C C  C
Sbjct: 230 PGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADC 265



 Score = 40.8 bits (94), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 52/104 (50%)

Query: 222 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 281
           H  +V +++  + L+++ +EL  W++A+ Y +   P Y+    ++    GL Y       
Sbjct: 281 HSDNVYVIRILKDLVEVCVELGLWEKAVVYGKRIEPGYKMFLKRYDLDQGLLYQKMALAY 340

Query: 282 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 325
           + LGD   A+  + +A   L IT G +S  ++E+   L E  A+
Sbjct: 341 YRLGDQSAALPYLRQAKTTLTITSGEDSGLVQEVSDMLAECVAK 384


>gi|428180005|gb|EKX48874.1| hypothetical protein GUITHDRAFT_105498 [Guillardia theta CCMP2712]
          Length = 197

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 22/158 (13%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           NAH I N   + +G G++P   ++NH+C  NAV  F            VP GAEV  + I
Sbjct: 7   NAHPITNGSFKRIGLGIFPSACMLNHACNYNAVYSF------------VPGGAEVHANAI 54

Query: 103 E----TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY-RCKDDGCSGFL 157
           +    T+   + RQK ++E Y F C C  C      +    S +   +  C    C   +
Sbjct: 55  DFGLNTSSQLIRRQKLIREGYFFQCACEWCAGTSGPERYFNSLVCNPWPDCDGHKCEAAI 114

Query: 158 LRDSD-DKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 194
           L D++ D+G     C   + +  I    +E+N L+K  
Sbjct: 115 LPDANGDEGQAVVICDACKRRHPI----TEMNALNKSA 148


>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
 gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
          Length = 530

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 22/181 (12%)

Query: 10  LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           +L A +   +N +L    WPE+ I  IA   + L  N   +     R     LYP  ++I
Sbjct: 199 VLRANLITFINTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPEAAMI 255

Query: 67  NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
           +H C+PN    F+  + +V  A + + KG  + ISY +   ST+ R+  L++   F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315

Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
            RC    +LG F   Q        +CK    +G ++  +   +   + CQ C   RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISQNPLLNSAPWKCQLCNFKRSAKE 366

Query: 180 I 180
           +
Sbjct: 367 V 367


>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
 gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
          Length = 392

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 267 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 322

Query: 91  VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 323 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 363


>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
           vitripennis]
          Length = 386

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 7/88 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
           G+GLY + S INHSC PNA + F   + RL VVRA++ +    E+ I+Y++      S  
Sbjct: 288 GSGLYVLQSAINHSCAPNASVEFPHSDSRL-VVRALRDIKPDEEICIAYLDECHLERSRH 346

Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDI 137
           +RQKAL   YLF C C +C +     D+
Sbjct: 347 SRQKALSSLYLFVCKCDKCQQQASDPDV 374


>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
          Length = 399

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 46/98 (46%), Gaps = 17/98 (17%)

Query: 58  GLYPVISIINHSCLPNAV-----------------LVFEGRLAVVRAVQHVPKGAEVLIS 100
           GL+P  + INHSC PNA                  L  EG   ++R+V  + +G  + IS
Sbjct: 132 GLWPQAAAINHSCRPNATHYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITIS 191

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
           Y+E       RQ+AL+  Y F CTC RC +    D  Q
Sbjct: 192 YVELGDPWPVRQEALRTGYGFACTCIRCTEEAALDSEQ 229


>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 300

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 187 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 246

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 247 RHKILRENYLFNCSCPKC--LAEADD 270


>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
 gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
          Length = 625

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 11/106 (10%)

Query: 26  PEISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFEGRLAV 84
           PE+   E+   F++LA N   +  S +R   + G+YP  S +NHSC PNA  +  G    
Sbjct: 282 PELPPQEV---FARLAAN--VVKTSAVRNANSVGVYPFPSFLNHSCAPNACKLMVGHTMF 336

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTR--QKALKEQYLFTCTCPRC 128
           +RA + +  G EV + Y +    TM +  + A+ +++ F C CPRC
Sbjct: 337 IRAARDLVAGEEVFMKYFDV---TMPKPERSAVAKRWGFECACPRC 379


>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
          Length = 680

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 5/111 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           LG G+Y  ++++NHSC P  +  F G + VVRA++    G  V  +Y  I T    + RQ
Sbjct: 476 LGAGVYSTVALLNHSCEPGVIRHFIGDVMVVRAIKSFQPGEMVNENYGPIFTQKRRVDRQ 535

Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 162
           ++LK++Y F C C  C +      ++ E A+   +RC D  C   L+  SD
Sbjct: 536 RSLKDRYWFDCRCNPCTENWPLIGEMTEEALC--FRCADRRCRKPLVVQSD 584


>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
 gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
          Length = 515

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 72/173 (41%), Gaps = 11/173 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           I   +    +  + S  EI      L  N H +  S+  P    ++ V S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMRTVGALQINGHEVPTSD--PPHVAVFYVASFTENSCLPNL 246

Query: 75  VLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
              F      ++ A Q + K + + I Y +    T  RQ+ L +  LF C C RC     
Sbjct: 247 AKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRCQ---- 302

Query: 134 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
            D  +        +C+D  CSG +L    D    G+ C++CG    K+ ++ I
Sbjct: 303 -DPTELGTNYSAIKCEDRKCSGLMLPTKADDWHGGWQCKECGKQVPKQYVEGI 354


>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
           mellifera]
          Length = 387

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
          Length = 497

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 9/130 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  SY     ST  R+  L 
Sbjct: 207 GLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLY 266

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLV 174
               F C C RC    +F     S +     CK       +  +S   DK + C+ CG +
Sbjct: 267 RTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSL 320

Query: 175 RSKEEIKKIA 184
             KEE+  I 
Sbjct: 321 IKKEEVANIT 330


>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 419

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 42  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 101
           C     C+ ++  +G  L  V ++INHSC PNA +  EG    VR+++ +  G E+ +SY
Sbjct: 103 CKIARYCSKDV--IGYALDMVTAMINHSCAPNAFVTLEGCQLRVRSLKPIAAGEEITVSY 160

Query: 102 IETAGSTMTRQKALKEQYLFTCTCPRC 128
            +       RQK LKE Y F C C  C
Sbjct: 161 ADPTLPVFNRQKFLKETYFFDCRCKGC 187


>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
          Length = 372

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 46/79 (58%), Gaps = 7/79 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYI---ETAGSTM 109
           G GLY + S  NHSC PNA + F     RLAV +A + +  G E+ ISY+   + A    
Sbjct: 281 GAGLYRIQSKCNHSCEPNAEVCFPNNNHRLAV-KACRDIAAGEEITISYLSQCQIARGCR 339

Query: 110 TRQKALKEQYLFTCTCPRC 128
           +RQ+ LKE YLF C C +C
Sbjct: 340 SRQQYLKENYLFHCCCSKC 358


>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 679

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/267 (22%), Positives = 118/267 (44%), Gaps = 21/267 (7%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--S 107
           S+ R + T +YP +S++NH+C+PN +  F   +  VRA + +  G E+   Y    G  +
Sbjct: 410 SQFR-VATAVYPTVSMMNHACIPNIIPSFRKGILCVRATKKIMHGDEIQHCYGPQVGHMT 468

Query: 108 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 167
           T  RQ+AL  QY FTC C  C +  +  D   +  ++  +C          +  +     
Sbjct: 469 TSDRQQALLNQYCFTCRCRACTRDPKSRD---NLCMKCPKCG---------QPLNAMTSI 516

Query: 168 CQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 222
           C +C      LV  KE    + +   ++     A+      ++V+S  K    +  ++  
Sbjct: 517 CGKCAVWIDVLVLLKEVDDTMITFAELVEMYPAAVKDVTLMRDVISKTKTCHDVLDRILI 576

Query: 223 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 282
           P  ++L      + K  + L+++KEA  +    IP  +  + +    +  + Y   ++ +
Sbjct: 577 PPHMHLAVAYNFMAKCHIALQEFKEAAPWLAKAIPTIECRFGRDSIEVANELYKLAQIYF 636

Query: 283 FLGDTENAIKSMTEAVEILRITHGTNS 309
              +   A++++  A+E+  I H  NS
Sbjct: 637 NGKEIAPAMEAIDRALELF-IRHYGNS 662


>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
 gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
          Length = 573

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 2/122 (1%)

Query: 7   KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           K + L A  A +   +LQ  E+ + +  E   K+  NA    +++    G  L   ++++
Sbjct: 181 KDMELQATAAVVYAGLLQGEEV-LEKAREILCKIQTNAFNRLDADTGMAGIFLDVGLAMV 239

Query: 67  NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 126
           NHSC+PNA + F+ R A++RA + + +G E+ ISYI+       R +AL+  Y F C C 
Sbjct: 240 NHSCVPNAFIGFDKRTAILRAERPIQEGEEITISYIDNTLPKAARYEALR-LYHFQCDCV 298

Query: 127 RC 128
           RC
Sbjct: 299 RC 300


>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 303

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
           paniscus]
          Length = 304

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
           florea]
          Length = 387

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
          Length = 604

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 10/117 (8%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 112
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 169
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCE 583


>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 647

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 59/124 (47%), Gaps = 8/124 (6%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G GLYP + + NHSC P     F G    VR V+++P G+ V  +Y  +        R+
Sbjct: 446 IGAGLYPTLVLFNHSCDPGITRYFVGSAVFVRTVRNIPAGSVVAENYGQLFVRAPRHERR 505

Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC-KDDGCSGFLLRDS--DDKGFTC 168
           K+LK+ Y F C C  C +    F D+  S +   +RC   +GC   L+     D     C
Sbjct: 506 KSLKKLYKFDCYCQACYEDWPTFFDMNPSVV--RFRCAATEGCENGLIYTERLDQNVMKC 563

Query: 169 QQCG 172
           ++C 
Sbjct: 564 EKCN 567


>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
           africana]
          Length = 417

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
          Length = 513

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 41/316 (12%)

Query: 17  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 72
           N+V  I +  ++ ++  +E   +  C    I   E+R         +YP ++++NHSC+ 
Sbjct: 173 NIVEYIRK--QLKLDRFSEKEIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCIS 230

Query: 73  N-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           N     +   ++ RL   R    +P G E+  SY  +   TM R++ L E   F C C R
Sbjct: 231 NTCHSISPTDYKIRL---RTTLKIPPGGELYGSYTHSLLPTMLRREHLLEGKHFACACAR 287

Query: 128 CIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 185
           C      D  +    +   +C   D+G    L     +  + C  C    S   ++K+  
Sbjct: 288 C-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQ 342

Query: 186 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 245
            +++      A+ S  +  + +   + + K  + + HP    L   R  L ++   +ED+
Sbjct: 343 IIHMEVDAVEAI-SGADGADAIQERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDY 401

Query: 246 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE------- 288
                     +  +  C+L + V   + P +  + G+  Y       F   T+       
Sbjct: 402 LLEDLPDVVLEHKIDMCKLLLQVLDVIEPGYSRIRGMTLYELHAPLLFTAKTQWNAEVID 461

Query: 289 -NAIKS-MTEAVEILR 302
             A+KS M EA +IL+
Sbjct: 462 KAALKSKMIEASKILK 477


>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
 gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
          Length = 391

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 104
           +  S L   G+ LY   S INHSC PNA  VF     +  +RA + +  G E+ ISY++ 
Sbjct: 279 VVGSFLNNEGSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDE 338

Query: 105 AG---STMTRQKALKEQYLFTCTCPRC 128
                S  +RQK LK+ YLF C C +C
Sbjct: 339 CNLQRSRHSRQKTLKDYYLFICQCEKC 365


>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
          Length = 400

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 287 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 347 RHKILRENYLFNCSCPKC--LAEADD 370


>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 402

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           LG GL+  +S+ NHSC PNA +   F    A V A++ + +  E+L+SYI+ +     RQ
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQVTALRDIAEKEEILLSYIDESLPLAERQ 322

Query: 113 KALKEQYLFTCTCPRC 128
           + LK  Y FTC C RC
Sbjct: 323 RLLKSHYKFTCGCQRC 338


>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
 gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
          Length = 379

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 4/75 (5%)

Query: 58  GLYPVISIINHSCLPNAV----LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           G+YP  ++ NH CLPNA     +  +    ++R +  VP+G E+ +SY        TRQK
Sbjct: 209 GIYPKAALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQK 268

Query: 114 ALKEQYLFTCTCPRC 128
            L+E Y F C C RC
Sbjct: 269 RLREDYGFICDCDRC 283


>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
 gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
          Length = 348

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           G++P +S  NH C PN      G      AV+ +P G E+ ISYI    S  TRQ  L+E
Sbjct: 173 GVFPEVSKFNHDCRPNVHYRLNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE 232

Query: 118 QYLFTCTCPRC 128
            + FTCTCP+C
Sbjct: 233 -WGFTCTCPQC 242


>gi|294886191|ref|XP_002771602.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239875308|gb|EER03418.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 265

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 9/98 (9%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           GT +YPV S+ NHSC PN      G L   R    V +G E+ ISYI      +   KA+
Sbjct: 17  GTAVYPVASMFNHSCTPNVTRASLGDLTWFRTCTDVKRGQELTISYI--GSDLLCEPKAV 74

Query: 116 KEQYL---FTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           ++++L   F+C CP C K    +D  +S ++  +  +D
Sbjct: 75  RQKHLARDFSCNCPACTK----EDDDDSVVIMPFTLQD 108


>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
 gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
          Length = 565

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 38/199 (19%)

Query: 58  GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           GLYP+  ++ H C+PN    F   +G     +A +++ KG  +  +Y  +   T  R+  
Sbjct: 263 GLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKGEHLTTTYTHSLWGTQQRRDH 322

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC---KDDGCSGFL-----LRDSDDKGF 166
           LK    F+C+C RC      D  +    L   +C     + C+G+      L+++ D  +
Sbjct: 323 LKANKYFSCSCARCA-----DPTELGTYLSALKCMGIDGNACAGYQLPIDPLKETSD--W 375

Query: 167 TCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL-- 220
            C QC +    +++     KI  EV+             +  E  S+ K +E L  KL  
Sbjct: 376 KCNQCPITIESDQVNFLLSKIGEEVD-------------DAMERKSSVKQLEDLISKLLT 422

Query: 221 -YHPFSVNLMQTREKLIKI 238
             HP    L+Q +  LI++
Sbjct: 423 FLHPNHHFLLQLKHSLIQM 441


>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
           troglodytes]
          Length = 299

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
 gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
 gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
          Length = 302

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
 gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 416

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 362

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
           angustifolius]
          Length = 357

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 24/143 (16%)

Query: 58  GLYPVISIINHSCLPNAVLV-----------FEGRLAVVRAVQHVPKGAEVLISYIETAG 106
            +YP  S+ NH C+PNA                    V+R +Q +P+G EV ISY   + 
Sbjct: 212 AIYPKASMFNHDCIPNACRFDYVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISR 271

Query: 107 STMTRQKALKEQYLFTCTCPRCI-----------KLGQFDDIQESAILEGYRCKDDGCSG 155
              TR++ L + Y F+C C RC             + ++ D+     +  Y C    C+G
Sbjct: 272 DYCTRKRILMDDYGFSCECDRCKIEANWPHDCQNYVEEYSDLPHVRFIAKYVCDRKNCNG 331

Query: 156 FLL--RDSDDKGFTCQQCGLVRS 176
            L    D+      C  CG ++S
Sbjct: 332 TLAPKDDAHTNVLECNFCGNLKS 354


>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
 gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
          Length = 507

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           LYP   I+ H+C+PN   ++   EG    +RA+  +  G  +  SY  T   T  RQ+ L
Sbjct: 207 LYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLKDGQPLHHSYTYTLDGTAQRQRHL 266

Query: 116 KEQYLFTCTCPRC---IKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   F+CTC RC    +LG  F  ++     EGY+               D  + C  C
Sbjct: 267 KQGKYFSCTCERCQDPTELGTHFSSLKCGQCTEGYQVPRQP-------TDSDSSWNCNNC 319

Query: 172 GLVRSKEEI----KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S+ E+    + + SEVN +    +        + ++  Y       K L HP    
Sbjct: 320 GASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEAERLLRKY-------KSLLHPLHFI 372

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 419

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 47/98 (47%), Gaps = 1/98 (1%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQH 90
           E+ E  SK+  N   + + +L  +G  LY   S  NHSC PN      EG+     A   
Sbjct: 278 ELMEYISKIESNTFGMWSDKLVSMGMVLYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHD 337

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +P G EV I YI+    T +R+  L   Y FTC CP C
Sbjct: 338 IPAGDEVCIRYIDVDKPTTSRRSELLSHYHFTCMCPLC 375


>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           impatiens]
          Length = 386

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY   S INHSC+PNAV+ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           RQKAL   YLF C C +C  L Q +D
Sbjct: 347 RQKALSSLYLFHCYCNKC--LSQVND 370


>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
 gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
          Length = 417

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 12/123 (9%)

Query: 18  LVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIIN 67
              LIL W       E ++   A  F++L     CN+  + N++   +G  LYP  S  N
Sbjct: 520 FTKLILPWLGEGGSIEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFN 579

Query: 68  HSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTC 125
           HSC PN   V + G LA   A++ + KG  + I Y++    ST  R++ L   Y F C C
Sbjct: 580 HSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCEC 639

Query: 126 PRC 128
            RC
Sbjct: 640 ARC 642


>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 428

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           N+  + N E + +G GL+   S INHSC PN V +   R   ++++  +  G E+ ISYI
Sbjct: 154 NSFCLTNGEEQVIGIGLFDYASFINHSCCPNCVPLQNKREMSIKSLTSIKSGEEIFISYI 213

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRC 128
           +   S   R+K L+ ++ F+C C  C
Sbjct: 214 DITESFERREKELR-KWHFSCGCSLC 238


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 61/127 (48%), Gaps = 17/127 (13%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
             GL+P+  +INHSC PN      G  +A VRA++ + +G E+ +SYI+       R+K 
Sbjct: 208 AVGLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKE 267

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCS------GFLLRDSDDKGF 166
           L+E   F C C RC        + ES    LEG+ C    CS       +LL   +DK  
Sbjct: 268 LRETKHFDCQCGRCAA-----PLSESVDRYLEGFCCP--RCSVMASGKDYLLAQVEDK-L 319

Query: 167 TCQQCGL 173
            C  C L
Sbjct: 320 VCSDCQL 326


>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 435

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 322 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 381

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 382 RHKILRENYLFICSCPKC--LAEADD 405


>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
          Length = 391

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S +NHSC+PNAV+ F       V++A++++  G E+  SY++      S  +
Sbjct: 295 GSGLYVLQSSVNHSCIPNAVVEFPYSNNTLVLKAIRNIKVGEEICTSYLDECQLERSRHS 354

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQ+AL   YLF C C +C
Sbjct: 355 RQQALSSLYLFVCHCDKC 372


>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 749

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/217 (22%), Positives = 90/217 (41%), Gaps = 16/217 (7%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 112
           + T +YP  S++NHSC PN +  F G+  + +A + +  G EV   Y       +   RQ
Sbjct: 460 IATAIYPSASMMNHSCDPNIINSFLGQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQ 519

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK------DDGCSGFLLRDSDDKGF 166
           + ++ QY F C C  C    +++DI +       +C       +D CS  L R S     
Sbjct: 520 EKMESQYCFKCNCAAC-SAPEYEDILKK--FTAKKCPECSGPLNDNCS-VLHRSS----I 571

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 226
            C  CG    +        +     +K        N+ E +   K   +L+K   + +  
Sbjct: 572 HCMDCGTAVYENIYDFTLRQAQHYFEKAEICIENENYDEALDKLKKCLRLRKDALYKYHD 631

Query: 227 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 263
           ++  T + + K+   +  W  +++Y + +I   +  Y
Sbjct: 632 DIADTMDFIAKVYAIIGQWLNSISYLEHSIVAIEEKY 668


>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
 gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
          Length = 391

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 58  GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
           G+YP  S+ NH CLPNA   F+            VVR +  VP+G E+ +SY     +  
Sbjct: 220 GIYPKASLFNHDCLPNACR-FDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYS 278

Query: 110 TRQKALKEQYLFTCTCPRC 128
           TR+K L E Y FTC C RC
Sbjct: 279 TRRKRLLEDYGFTCDCDRC 297


>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 687

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G G+YP ++++NHSC P  V  F G   ++RAV+ +  G E+  +Y  I T      R+
Sbjct: 490 IGGGVYPTVAMLNHSCNPGVVRYFVGTTMILRAVRTISAGEEISENYGPIFTTTPENERK 549

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLL---RDSDDKGFTC 168
           + L+ QY F C C  C   G +  ++E    +  ++C+     G +L    D+++    C
Sbjct: 550 RRLRVQYWFDCNCEACT--GHWPLLEELDPTVLRFKCETGPSCGNVLPIRSDTNEFMIGC 607

Query: 169 QQCG 172
            +CG
Sbjct: 608 VKCG 611


>gi|392558999|gb|EIW52184.1| hypothetical protein TRAVEDRAFT_75273 [Trametes versicolor
           FP-101664 SS1]
          Length = 721

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/72 (50%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 63  ISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 120
           IS INHSC PNAV  FE       VRA+  +P GA+V ISYI+ A     RQ+AL   Y 
Sbjct: 179 ISRINHSCSPNAVYRFEPATLTFEVRALSPIPSGAQVFISYIDPALPRAKRQEALS-SYG 237

Query: 121 FTCTCPRCIKLG 132
           FTC C  C   G
Sbjct: 238 FTCACTACALTG 249


>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
 gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
          Length = 746

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TG++   +  NH+C+PNA   F G + +VRA + +P GAE+ + Y   A    +R+   K
Sbjct: 503 TGMWLHAAYANHTCIPNATRAFIGDMMIVRAARDIPAGAEIFMGYASLAEPFESRRSKFK 562

Query: 117 EQYLFTCTCPRC 128
             Y F C C  C
Sbjct: 563 TSYGFECDCEMC 574


>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
 gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
          Length = 188

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 33  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 90
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 62  IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 117

Query: 91  VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 118 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 158


>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
          Length = 416

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIE---TAGSTMT 110
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERSRHS 362

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
          Length = 420

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
          Length = 656

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TGL+   S +NH+C+ NAV  F G L V RA +H+  G E+  +Y E A     RQ AL+
Sbjct: 509 TGLWVRASYVNHACVANAVKDFAGDLMVFRAARHIRAGEEIFHAYDENADYD-ARQAALR 567

Query: 117 EQYLFTCTCPRC 128
           + + F CTC  C
Sbjct: 568 KTWGFACTCALC 579


>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
          Length = 395

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYI---ETAGSTMT 110
           G+ LY + S  NHSC+PNA + F    A V   A+Q + +  E+ ISY+   +   S  +
Sbjct: 295 GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQENEEICISYLGECDIGRSRHS 354

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           RQK L+E YLF C C +C    Q DD
Sbjct: 355 RQKILRENYLFNCNCMKC--QSQCDD 378


>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
 gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
          Length = 128

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 51  ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110

Query: 117 EQYLFTCTCPRC 128
               F C CPRC
Sbjct: 111 MTKHFICQCPRC 122


>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
          Length = 648

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G+  +PV+S++NHSC PN V +  G + VV+ +  + +G E+L +Y     +   R++ 
Sbjct: 479 IGSAAFPVVSLMNHSCNPNVVHLCYGDVMVVKVIHRIARGEEILDNYGYHYATHEKRERQ 538

Query: 115 LK--EQYLFTCTCPRCIK 130
           LK  +QY F C C  C++
Sbjct: 539 LKLCQQYYFRCRCQSCVE 556


>gi|307183932|gb|EFN70520.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 421

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 59/126 (46%), Gaps = 9/126 (7%)

Query: 53  RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 108
           RP  LG  +YP ++  NH C P     F GR  V+RA++ +  G  +  +Y  I T  + 
Sbjct: 218 RPVYLGVAIYPTVARFNHDCYPAVTRYFVGRHIVIRAIRGLRPGDVIAENYGPIFTKRTL 277

Query: 109 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLR--DSDDKG 165
             RQ+ L  +Y F C+C  C +   +F+ +   ++    RC   GC+G   R     DK 
Sbjct: 278 AERQRTLAGRYWFRCSCRACQEDWPRFETLTNDSVR--LRCPTTGCNGLHSRPQQRPDKP 335

Query: 166 FTCQQC 171
             C  C
Sbjct: 336 IKCSAC 341


>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 419

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
           porcellus]
          Length = 416

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1240

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 117/277 (42%), Gaps = 30/277 (10%)

Query: 55   LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 113
            +   ++PV S++NHSC  N +L + G    +++++++ K  E+   Y   A     R++ 
Sbjct: 893  IAYAIFPVSSLLNHSCDNNTILQYNGSSITIKSLKNISKNDEITGCYGPHAFHLELRERL 952

Query: 114  -ALKEQYLFTCTCPRC-IKLGQFDDIQESAILE--GYRCK-----------------DDG 152
              LKE+Y F C C  C  K+G       +  LE  G +CK                 D  
Sbjct: 953  ECLKEEYFFICRCHACNAKIGPNLLRCPTQSLESGGEQCKGTLLERVDTNSLGQEEDDSY 1012

Query: 153  CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV----NILSKKTLALTSCGNHQEVVS 208
              G    D + + F C  C L   K +   + S+      I S  T  L    +HQ    
Sbjct: 1013 PEGASYDDYNGRKFVCSICSLELGKMDCFMLTSQSVMSGAIFSNATKLLIDTEHHQPPRE 1072

Query: 209  TYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYP 264
              +M+ K   ++K++YH  S  + +  + L +  +  + +KEA+ Y +  I  ++ R+  
Sbjct: 1073 VEEMLFKCLEIRKQIYHVKSRKIGEVYDALSRYYIAKDKYKEAVTYAERAINNIFLRLGH 1132

Query: 265  QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 301
                 L  ++     +    G+   AIK++  +  +L
Sbjct: 1133 SHSTELAREWAKLANIYINAGEPAKAIKAINTSESLL 1169


>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 415

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 449

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 13/135 (9%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVR 86
           I+I+EI E FS +  NAH    +  + +G G+ P     NHSC+PN     + + + +  
Sbjct: 178 ITIHEILECFSSVLTNAHQFSYATSKEIGRGVCPT-GYFNHSCMPNTTWSLDDQGMLLFS 236

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 146
              +V KG E+ + Y+        R++ L + Y F C CP C         +  + L GY
Sbjct: 237 TSSNVKKGDELSLGYLANEYPLKNRRRELLDGYYFFCQCPLC---------EFQSNLSGY 287

Query: 147 RCKDDGCSGFLLRDS 161
            C  + C   LL DS
Sbjct: 288 LC--EKCKEPLLNDS 300


>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
          Length = 387

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 63/130 (48%), Gaps = 9/130 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  SY     ST  R+  L 
Sbjct: 207 GLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLY 266

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLV 174
               F C C RC    +F     S +     CK       +  +S   DK + C+ CG +
Sbjct: 267 RTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSL 320

Query: 175 RSKEEIKKIA 184
             KEE+  I 
Sbjct: 321 IKKEEVANIT 330


>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 5 [Callithrix jacchus]
          Length = 431

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 320 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 379

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 380 RHKILRENYLFVCSCPKCL 398


>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
           [Strongylocentrotus purpuratus]
          Length = 396

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 7/79 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETA---GSTM 109
           G+ LY + S  NHSC+PNA + F      L++V A+  + +G EVLISY++      S  
Sbjct: 288 GSALYKLQSCCNHSCVPNAEISFLHNNSTLSLV-ALTDITEGQEVLISYLDECCKERSRH 346

Query: 110 TRQKALKEQYLFTCTCPRC 128
           +RQK L+E YLF+C C +C
Sbjct: 347 SRQKELRENYLFSCDCSKC 365


>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
           boliviensis boliviensis]
          Length = 419

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
          Length = 415

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           paniscus]
          Length = 420

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Nomascus leucogenys]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           troglodytes]
          Length = 415

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
          Length = 419

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Ailuropoda melanoleuca]
 gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
          Length = 417

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 501

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 10/91 (10%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVL----VFEGRLAVVRAVQHVPKGAEVLISYI 102
           +   ++ P    ++P+ + INHSC+PNA +      + R+ VV A++ +  G E+ ISYI
Sbjct: 408 VVEDKVAPEIVAVFPLTARINHSCVPNAQVQSQEFVDARIDVV-ALRDIAAGEEITISYI 466

Query: 103 ---ETAGSTMT--RQKALKEQYLFTCTCPRC 128
               T+GS  T  R++ L  +YLFTC CPRC
Sbjct: 467 GCGRTSGSKSTSRRRRELLAKYLFTCECPRC 497


>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
          Length = 416

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           terrestris]
          Length = 386

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 5/87 (5%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY   S INHSC+PNAV+ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           RQK L   YLF C C RC+      DI
Sbjct: 347 RQKVLSSLYLFHCYCNRCLSQVNNPDI 373


>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
          Length = 421

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
 gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 606

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 14/118 (11%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 78
           S  ++    S+   N  T+ +  L P+G  + P +++ NHSC PN V+VF          
Sbjct: 248 SAGDLVGLISRFTTNTFTLTSYTLSPVGICISPAMALTNHSCDPNVVIVFPRSSKKVTKE 307

Query: 79  -EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 135
            + +L  +R++ H     E++ +Y++       RQ ALKE Y FTC C  C   G  D
Sbjct: 308 PQMQLIALRSILHE---EEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLCKNTGPTD 362


>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
          Length = 419

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
           caballus]
          Length = 417

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
           mulatta]
 gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
 gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
          Length = 407

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 293 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 352

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 353 RHKILRENYLFVCSCPKCL 371


>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
          Length = 417

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
 gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
          Length = 302

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285

Query: 117 EQYLFTCTCPRC 128
               F C CPRC
Sbjct: 286 MTKHFMCQCPRC 297


>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
 gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
 gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
 gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 438

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 48/86 (55%), Gaps = 10/86 (11%)

Query: 49  NSELRPLGTGLYPVISIINHSCLPNA-VLVFEGRLAV----VRAVQHVPKGAEVLISYIE 103
           N E R  G G+Y   S  NHSC PN    V E  L V    +RAV+   KG E+ ISYI+
Sbjct: 324 NGESR--GCGVYVRNSFFNHSCNPNVNYWVVENTLEVECSLMRAVR---KGEELCISYID 378

Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCI 129
           TA S   R++ L E YLF C C +CI
Sbjct: 379 TAASLRDRREKLSEGYLFHCRCEKCI 404


>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
 gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
          Length = 514

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           I   +    +  + S  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 75  VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCACERCVDVTE 306

Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
            D    SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 LDTFY-SAI----KCEDRQCGGLMLPTKTDDWNGNWRCRECHKQVQKHYVERI 354


>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 442

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           G G+Y V  + NHSC PN  + +       L VV A++ V  G E+ ISYI+++     R
Sbjct: 357 GQGIYEVGCLFNHSCDPNLSVQYSSLNDETLTVV-ALRDVKAGEELTISYIDSSLPFAVR 415

Query: 112 QKALKEQYLFTCTCPRCIKLGQFD 135
           Q+ L + YLF C CPRC+  G  D
Sbjct: 416 QQQLLDHYLFECRCPRCVAEGTTD 439


>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 718

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 63/144 (43%), Gaps = 17/144 (11%)

Query: 43  NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           NAH +  + L         RPL  G  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 491 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 550

Query: 92  PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 148
             G  V  +Y  I T  S   R++ L  +Y F C C  C +   +F+ +    +    RC
Sbjct: 551 RPGDTVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 608

Query: 149 KDDGCSGFLLRDSD-DKGFTCQQC 171
             +GCS    R  D  +   C  C
Sbjct: 609 PTEGCSKLHSRPRDPSRSIECSSC 632


>gi|449550207|gb|EMD41172.1| hypothetical protein CERSUDRAFT_91931 [Ceriporiopsis subvermispora
           B]
          Length = 329

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 57  TGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G +P++S INH C P+A     FE     + A + +P G E+ I+Y       + RQ  
Sbjct: 138 SGCFPILSRINHDCSPSANYFFAFEPYCGQLWAARDIPDGGEITIAYTTLPAPRVDRQAH 197

Query: 115 LKEQYLFTCTCPRC 128
           L E+Y FTCTC  C
Sbjct: 198 LSERYFFTCTCVAC 211


>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
          Length = 1624

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 64/329 (19%)

Query: 7    KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
            KQ L  AQ+  + +L          +I E F  +  N   I N +    G  LY   S+I
Sbjct: 1332 KQFLTAAQLEGVSDL----------DILEIFGIMCVNTIHISNDD-DSFGCALYLAPSLI 1380

Query: 67   NHSCLPNAVLVFEGRLAVVRAVQHV-PKG-AEVLISYIETAGSTMTRQKALKEQYLFTCT 124
            +HSC PN    F+G+  V++ ++   PK  A++ ++Y+    +   R+K L+E+Y FTC 
Sbjct: 1381 DHSCYPNLTATFKGQKIVLKVLRPCEPKTVADLSLAYMPVCTTKERRRKTLREEYYFTCE 1440

Query: 125  CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
            C  C                       G    +L ++D                   K+ 
Sbjct: 1441 CEMC----------------------SGKVPEVLSEAD------------------PKLT 1460

Query: 185  SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL-- 242
             EV  L K +L L+S  NH++ +   +  E L  KL      ++ + R  L+     +  
Sbjct: 1461 DEVLELEKLSLDLSSPENHRKALKGVE--ELLSTKLKDLDDSDVAKFRAILVAADASVCA 1518

Query: 243  EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA-------IKSMT 295
              +  A  Y   ++PV +RV+ +       +     +L   +  T          I  + 
Sbjct: 1519 SSYDRAYNYYSRSLPVMKRVFTENKAEYAYKLVRLARLSTIVVKTSQDPEVFGPLIALLQ 1578

Query: 296  EAVEILRITHGTNSPFMKELILKLEEAQA 324
            EAV + RI  G +    K++ L  EE +A
Sbjct: 1579 EAVRVTRIALGEDHSDTKDVALLYEELRA 1607


>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 421

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 97/205 (47%), Gaps = 18/205 (8%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ ++ NH C+PN    F+ +  + V AV  +  G E+ +SY +    T +R++ LK
Sbjct: 147 GLYPMGALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLK 206

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RD--SDDKGFTCQQCGL 173
               F+C C RC      D ++  + L    C  D CSG LL R+  + +  + C +C +
Sbjct: 207 VTKRFSCNCNRC-----SDPLEFGSQLSTLLCAKDNCSGCLLPRNPLNHESSWICDKCQI 261

Query: 174 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 232
            V  ++ I+ I S +N      +  T     +E++   +  E +  +L    +  +M  +
Sbjct: 262 SVNYRQVIECIHSGLNTFVSDAMYKTP----REIL---RFTEAILSRLVPATNYIMMDMK 314

Query: 233 EKLIKILMELEDWK-EALAYCQLTI 256
            ++I     + D K E L   +L I
Sbjct: 315 YRIISYFGRVPDLKWENLTDTELRI 339


>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
          Length = 412

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 298 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 357

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 358 RHKILRENYLFVCSCPKCL 376


>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
          Length = 386

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI-- 102
           +  S L   G+GLY + S +NHSC PNA+  F       V++A++ +  G E+  SY+  
Sbjct: 278 VVGSFLNNEGSGLYILQSSVNHSCTPNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDE 337

Query: 103 -ETAGSTMTRQKALKEQYLFTCTCPRC 128
            E   S  +RQKAL   YLF C C +C
Sbjct: 338 CELERSRYSRQKALSSLYLFVCHCDKC 364


>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
          Length = 675

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 80/175 (45%), Gaps = 16/175 (9%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G GLYP +++ NHSC P     + G    VR V+++  G         T      R++ 
Sbjct: 463 IGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIAAG---------TRVRREERRET 513

Query: 115 LKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG-CSGFLLRDSDDKGF--TCQQ 170
           L  QY FTC C  C++    F D+  S I   +RC+    CS  LL  S+   F   C +
Sbjct: 514 LLNQYRFTCYCEPCMQNWPLFSDMDPSII--RFRCEGGKICSNVLLIPSEINDFMVKCTE 571

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
           CG   +  +  K   + ++L +    L S G ++  +  Y +M+  + + L  P+
Sbjct: 572 CGEHTNIMKGLKSLQDTDMLFQTATRLHSAGEYEAALMKYVEMMAIMSEVLVPPY 626


>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
          Length = 633

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 520 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 579

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+CP+C  L + DD
Sbjct: 580 RHKILRENYLFICSCPKC--LAEADD 603


>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
           P131]
          Length = 582

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299

Query: 115 LKEQYLFTCTCPRC 128
           L++   F C C  C
Sbjct: 300 LRQTQFFHCECTTC 313


>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
          Length = 385

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S +NHSC+PNA + F       V+RA++ +    E+  SY+E      S  +
Sbjct: 287 GSGLYILQSAVNHSCMPNASVEFPYSNNTLVLRAIRDIQPEEEICTSYLEECELERSRYS 346

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQKAL   YLF C C +C
Sbjct: 347 RQKALSSLYLFVCHCDKC 364


>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 140

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 4/99 (4%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEV 97
           +  CN   I +     +  G+YP  ++ NHSC PN +  F  G     RA++ +  G E+
Sbjct: 41  RFPCNNFAIVDDLWSGIAAGVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEI 100

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
             SY+E    +  R+  L   Y F C C RC   G+ DD
Sbjct: 101 CHSYVELTLPSWKRRDVLLRDYEFLCECERC---GKRDD 136


>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 359

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 19/131 (14%)

Query: 16  ANLVNLILQWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 61
           A L+N   + P  + + + E   +L C   T+           +EL+ L     G G+Y 
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVRRLPCGESTMDAFEQEVQMAGNELQHLRQSSRGQGVYT 262

Query: 62  VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           V  + NHSC PN  +V+    +  L+VV A++ +  G E+ ISYI+ + S   RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPERQQELYE 321

Query: 118 QYLFTCTCPRC 128
            YLF C CP+C
Sbjct: 322 HYLFFCQCPKC 332


>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
           Y34]
          Length = 581

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299

Query: 115 LKEQYLFTCTCPRC 128
           L++   F C C  C
Sbjct: 300 LRQTQFFHCECTTC 313


>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
          Length = 359

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 19/131 (14%)

Query: 16  ANLVNLILQWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 61
           A L+N   + P  + + + E   +L C   T+            EL+ L     G G+Y 
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVQRLPCGESTMDAFEQEVQMAGKELQHLLQSSRGQGVYT 262

Query: 62  VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 117
           V  + NHSC PN  +V+    +  L+VV A++++  G E+ ISYI+ + S   RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRNIELGEELCISYIDESLSYPERQQELYE 321

Query: 118 QYLFTCTCPRC 128
            YLF C CP+C
Sbjct: 322 HYLFVCQCPKC 332


>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 440

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 80/185 (43%), Gaps = 21/185 (11%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG----TGLYPVISIINHS 69
            IA+ +    +  +IS  EIA     L  N H +      PL       +Y + S+I H+
Sbjct: 184 NIAHFIKRFFKADDISEEEIATIIGILQVNGHEV------PLTDSPYVAVYEMASLIEHN 237

Query: 70  CLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           C  N    F   G L ++RA   + KG  + I Y +    T  R+  L +   F C C R
Sbjct: 238 CRANCSKSFTDMGGL-IIRAALPITKGDHISICYTDPLWGTANRRHHLFKTKFFECICNR 296

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEIKKIA 184
           C    +F       +    +C +  CSG++L  +    ++ + C+ C  V S  EI+K+ 
Sbjct: 297 CQDPTEF-----QTMFNALKCNNINCSGYILPKTFLEQEQDYICKICESVVSCTEIEKVL 351

Query: 185 SEVNI 189
            ++ I
Sbjct: 352 EDIGI 356


>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
          Length = 369

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 66/152 (43%), Gaps = 16/152 (10%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP---- 92
           F K + N+H+I  +    +G  L   +S  NHSC P   +VF+G    +R +  VP    
Sbjct: 180 FCKASINSHSIHTNAGNEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDA 237

Query: 93  -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 151
               +  ISYI+   S   R++ LK ++ F C C RC+         E   L   +C + 
Sbjct: 238 ENTEQAFISYIDVGRSKFVRRRDLKSRWYFDCECTRCMD-------PEDDSLTAIKCANP 290

Query: 152 GCSGFLLRDSDDK--GFTCQQCGLVRSKEEIK 181
            C   +L    ++     C +C  V  +  +K
Sbjct: 291 ACDAPILTSETEEPMNIACDKCKTVVEESTVK 322


>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 322

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 53  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTR 111
           RP+G G+YP  ++ NH C PNA   F+    V V   + V KG E+ I Y++       R
Sbjct: 241 RPIGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREER 300

Query: 112 QKALKEQYLFTCTCPRC 128
           +  L++ + F C C RC
Sbjct: 301 RGKLRKNFAFECACARC 317


>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
          Length = 125

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 43/66 (65%)

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY F 
Sbjct: 16  MSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75

Query: 123 CTCPRC 128
           C C RC
Sbjct: 76  CDCFRC 81


>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
          Length = 389

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S +NHSC+PNAV+ F       V++A++ +  G E+  SY++      S  +
Sbjct: 294 GSGLYVLQSSVNHSCVPNAVVEFPYSNNTLVLKAIRDIKVGEEICTSYLDECQLERSRHS 353

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQ+AL   YLF C C +C
Sbjct: 354 RQQALSSLYLFVCHCDKC 371


>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Otolemur garnettii]
          Length = 300

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F        V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFICSCPKCL 266


>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
           [Oreochromis niloticus]
          Length = 417

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 7/87 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  + A+  + +G E+ ISY++      S  +
Sbjct: 303 GSGLFLLQSSCNHSCIPNAEASFPDNNFLLHLSALCDINQGEEICISYLDCCQRDRSRHS 362

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDI 137
           R K L+E YLF C+CP+C+   Q D++
Sbjct: 363 RHKILRENYLFVCSCPKCV--SQMDEL 387


>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
 gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
          Length = 392

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 110
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQEGDEICISYLDECMLERSRHS 345

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E Y+F C C +C
Sbjct: 346 RHKVLRENYVFICQCSKC 363


>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
          Length = 513

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 59/271 (21%), Positives = 109/271 (40%), Gaps = 35/271 (12%)

Query: 58  GLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
            +YP ++++NHSC+ N     +   ++ RL   R    +P G E+  SY  +   TM R+
Sbjct: 216 AIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPVGGELYGSYTHSLLPTMLRR 272

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQ 170
           + L E   F C C RC      D  +    +   +C   D+G    L     +  + C  
Sbjct: 273 EHLLEGKNFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTH 327

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
           C    S   ++K+   + +      A++   +  + +   + + K  + + HP    L  
Sbjct: 328 CEFSTSGSAVRKVLQIIQMEVDAVEAISG-ADGADAIQERETVAKKYRSVLHPRHAFLTM 386

Query: 231 TREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 280
            R  L ++   +ED+          +  +  C+L + V   + P +  + G+  Y     
Sbjct: 387 LRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCRLLLQVLDVIEPGYSRIRGMTLYELHAP 446

Query: 281 EWFLGDTE--------NAIKS-MTEAVEILR 302
             F   T+         A+KS M EA +IL+
Sbjct: 447 LLFTAKTQWNAEVIDKAALKSKMIEASKILK 477


>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
          Length = 575

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)

Query: 59  LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LYP   ++ H CL N V  F+    G    VRA   + KG  +   Y      T  R++ 
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 168
           LKE   F+CTC RC      D  +  + L   RC     D C G+ L      D+  + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366

Query: 169 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
            +C +  +  E    I +I  EV+ +   +  +    N   ++S  KM+  L    YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421

Query: 225 SV 226
           SV
Sbjct: 422 SV 423


>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
 gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
          Length = 391

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G GLY + S  NHSC PNA + F       V++A+  +  G E+ ISY+E      S  +
Sbjct: 293 GAGLYALQSSCNHSCAPNAEVTFPKNNSTLVLKALHPIKNGEEICISYLEECQRERSRHS 352

Query: 111 RQKALKEQYLFTCTCPRC 128
           R K L+E Y+F CTC +C
Sbjct: 353 RLKYLRENYIFDCTCTKC 370


>gi|281206858|gb|EFA81042.1| hypothetical protein PPL_05877 [Polysphondylium pallidum PN500]
          Length = 425

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG------RLAVVRAVQHVPKGAEVLISYIETAGSTM 109
           G GLYP+ S +NHSC PN  +  E          V RA +++  G E+L +Y +    T 
Sbjct: 346 GIGLYPIFSCMNHSCQPNVEICNERTDGVTFNKVVFRAKKNIKAGQELLNNYCDVTLPTK 405

Query: 110 TRQKALKEQYLFTCTCPRC 128
            RQ  LK QY F C C +C
Sbjct: 406 ERQSQLKSQYDFICKCNKC 424


>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
          Length = 563

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)

Query: 59  LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LYP   ++ H CL N V  F+    G    VRA   + KG  +   Y      T  R++ 
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311

Query: 115 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 168
           LKE   F+CTC RC      D  +  + L   RC     D C G+ L      D+  + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366

Query: 169 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 224
            +C +  +  E    I +I  EV+ +   +  +    N   ++S  KM+  L    YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421

Query: 225 SV 226
           SV
Sbjct: 422 SV 423


>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
          Length = 359

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 5/77 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           G G+Y V  + NHSC PN  +V+    +  L+VV A++ +  G E+ ISYI+ + S   R
Sbjct: 257 GQGVYTVGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPER 315

Query: 112 QKALKEQYLFTCTCPRC 128
           Q+ L E YLF C CP+C
Sbjct: 316 QQELYEHYLFVCHCPKC 332


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 61/127 (48%), Gaps = 16/127 (12%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
             GL+ +  +INHSC PN      G   + VRA++ + +G E+ +SYI+       RQK 
Sbjct: 162 AVGLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKE 221

Query: 115 LKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS-------GFLLRDSDDKGF 166
           L++   F C C RC   L +  D     +L+G+RC    CS        +LL   +DK  
Sbjct: 222 LRDTKHFDCQCERCSTPLSESVD----RVLDGFRCP--RCSVKASEEENYLLAQVEDK-L 274

Query: 167 TCQQCGL 173
            C  C L
Sbjct: 275 VCPDCQL 281


>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 643

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TGL+   + INHSC+PNA   F G L V+R+++ + KG E+  SY E+ G    RQ AL 
Sbjct: 496 TGLWVRAAYINHSCVPNAKREFVGDLMVIRSLRKIKKGEEIFHSYDES-GDYEARQAALM 554

Query: 117 EQYLFTCTCPRC 128
             + F C C  C
Sbjct: 555 TTWGFECGCALC 566


>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 227

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 37/75 (49%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           +G  ++P  S  NHSC PN   +        R ++ V KG  + ISYI+T      R+  
Sbjct: 96  MGRAVFPAASYFNHSCFPNCQSIKHDHKMAFRTLKDVSKGEMLTISYIDTNMPVSARRAR 155

Query: 115 LKEQYLFTCTCPRCI 129
           L + Y F C C RCI
Sbjct: 156 LMDDYFFECMCERCI 170


>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
           B]
          Length = 454

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 6/93 (6%)

Query: 43  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAE 96
           N + I +S L P   G++P+ S   NHSC+PNAV+ +     E     V A++ + +G E
Sbjct: 184 NNNFIVHSHLYPYAHGIFPLASRTFNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEE 243

Query: 97  VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           ++I Y++ A S   R+ AL+  Y F C+C  C+
Sbjct: 244 IVIPYLDPALSYAARRDALQTNYGFICSCALCV 276


>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
           harrisii]
          Length = 609

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIE---TAGSTMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 496 GSGLFVLQSCCNHSCVPNAETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERSRYS 555

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 556 RHKILRENYLFVCSCPKCL 574


>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
 gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
          Length = 1033

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 11/156 (7%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQH 90
           EI +    L  N H +  ++  P    ++   S   +SCLPN    F +    ++ A + 
Sbjct: 725 EIMKAVGALQINGHEVPTTD--PPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPRE 782

Query: 91  VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 150
           + K A + I Y +    T  RQ+ L +  LF C C RC+ + +  D   SAI    +C+D
Sbjct: 783 IKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTEL-DTNYSAI----KCED 837

Query: 151 DGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
             C G +L    D  +  + C++C     K  +++I
Sbjct: 838 RQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 873


>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 181

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 1/75 (1%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           LG+GLY  +++ NHSC PNA   F G R   V++++ V  G EV  SYI+     + RQ 
Sbjct: 91  LGSGLYRGVAVTNHSCSPNAEASFGGSRCLRVKSLRPVHAGEEVFQSYIDENLPLVERQS 150

Query: 114 ALKEQYLFTCTCPRC 128
            L++ Y F C C +C
Sbjct: 151 KLRQAYGFACRCGKC 165


>gi|395329994|gb|EJF62379.1| hypothetical protein DICSQDRAFT_104880 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 810

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           TGLYP+ S+ NHSC+ NA+    G + ++RA + VP G E+ I Y     S   RQ  LK
Sbjct: 556 TGLYPLASLFNHSCVANAIWYCIGDVMIIRAAEPVPAGTEITIPY-SVEESYFARQSVLK 614

Query: 117 EQYLFTCTCPRC 128
           +  L  CTC  C
Sbjct: 615 KHMLEHCTCWLC 626


>gi|344303810|gb|EGW34059.1| hypothetical protein SPAPADRAFT_65242 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 619

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 2/90 (2%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLIS 100
           N + + + +  P+G GL P  S+INHSC PN +L  +   +    V  VP  K +E+ ++
Sbjct: 226 NHNILLDQDNEPIGIGLDPDFSLINHSCAPNCILKHKQNWSGFELVNTVPIVKNSELTVT 285

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           YI+T      R   L+ +Y F C C  C K
Sbjct: 286 YIDTCFPKELRMLDLRSRYFFNCKCELCSK 315


>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
 gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
          Length = 660

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  I T      RQ  +KE Y F C C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREDRQAKMKELYWFECCCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
               F+D+    I   +RC+  + C+  +      +D    C  CG V +  +  K+  +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G++ + +  +  +I  +   L  PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636


>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
 gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
          Length = 666

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 8/194 (4%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G  +YP +++ NHSC P  V  F G    +  V+ V  G  +  +Y  I T      RQ
Sbjct: 468 IGGAIYPTLALFNHSCDPGVVRYFRGNTIHINTVRPVEAGLPINENYGPIYTQDKREDRQ 527

Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTC 168
             LKE Y F C C  C++    FDD+    I   +RC+  + C+  +      +D    C
Sbjct: 528 ARLKELYWFECNCDACLENWPLFDDLPRDII--RFRCEAPNNCTAVIEVPPSCNDFMIKC 585

Query: 169 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 228
             CG + +  +  K+  +  ++++    L   G++ + ++ +  + ++  ++  P   + 
Sbjct: 586 VTCGELTNILKGLKVMQDTEMMTRTAKRLYDTGDYAKALNKFVDLLRIMYEVLAPPFPDF 645

Query: 229 MQTREKLIKILMEL 242
            + ++ L    + L
Sbjct: 646 CECQQHLKDCFLNL 659


>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
          Length = 666

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 9/235 (3%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           IA+L+   LQ  + + +EI+E  N ++        C+ +   +G GLYP +++ NHSC P
Sbjct: 423 IASLILRHLQILQFNSHEISELRNLNEEMVTNGIQCHYKSEYIGAGLYPTLALFNHSCDP 482

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           + V    G   +VR ++ +  G  +  +Y  + T+     R+  L+ +Y F C C  C +
Sbjct: 483 SIVRYNIGNRMIVRTIKPIKAGEIIYENYGPLYTSMDADERRVTLQNRYWFECYCTPCQQ 542

Query: 131 -LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD--KGFTCQQCGLVRSKEEIKKIASEV 187
               F+ +  + I  G  C+ + C        DD    F C  C  V       K  S++
Sbjct: 543 EWPLFEYMDPNQIKIG--CQKENCPFEFTLYKDDLCPYFQCDYCDGVTKIFPSLKGLSQL 600

Query: 188 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
            I+  K   L S G  +E +  +     +  K   P   ++++ +++L  + + L
Sbjct: 601 AIMLPKAEDLYSAGETREAMKLFMQSLDILYKYSRPPCPDMIKVQQRLKTLFVHL 655


>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
          Length = 268

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIET 104
           L+     +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY   
Sbjct: 72  LKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAA 131

Query: 105 AGSTMTRQKALKEQYLFTCTCPRC 128
                 RQ+ L E Y F C C RC
Sbjct: 132 NWRYADRQRRLLEDYGFRCECDRC 155


>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
          Length = 1187

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 13/121 (10%)

Query: 32   EIAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQ 89
            +I + + K+  NAH   +      +  G+YP  + +NHSC PN V   +   +   R+++
Sbjct: 908  QILDLYPKMLFNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIR 967

Query: 90   HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
             +  G E+  +YI+      TRQ  L  QY F C C RC         Q+ A   G++C 
Sbjct: 968  DIKAGEEITTTYIDITKYKSTRQLNLLSQYAFLCQCARC---------QDRAT--GFKCL 1016

Query: 150  D 150
            D
Sbjct: 1017 D 1017


>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
          Length = 418

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+G + + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGFFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
 gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
 gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
          Length = 429

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 14/164 (8%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV-QHVPKGA 95
           F K++ N+  + NS   P+G GL   +S  NHSC P   + +  R A++  V   +P   
Sbjct: 147 FQKVSINSFVVGNSTGNPIGVGLCIKLSAANHSCKPLTRVCYRNRTAMLVPVSSELPSTL 206

Query: 96  E-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 154
           E    SYI+       R+  LK++Y F C C  C+      D   +A +E + C    C 
Sbjct: 207 EGACHSYIDELMPRDMRRDTLKKKYKFLCQCDGCL------DEDRNARMEAWTC--GICV 258

Query: 155 GFLLRDSDDKGFTCQQCGLVRSKE--EIKKIASEVNILSKKTLA 196
              +R+ ++    C+ CG   SK+  E+ + A E  I ++  LA
Sbjct: 259 KGWMRNKENG--QCELCGWTMSKDHFELCRTAEEAGIAARSRLA 300


>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
          Length = 374

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
           G  LYP  +  NHSCLPN     +G+  +V  A + + KG E +I+Y +  T     +RQ
Sbjct: 261 GMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQ 320

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 165
              +EQ+ F CTC RC+K    ++ +E+   +    K + C       SD++G
Sbjct: 321 NHTQEQFQFKCTCERCLK----EEAEENIEFQELELKLEPCPISSATTSDNEG 369


>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
          Length = 596

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 12/101 (11%)

Query: 40  LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 89
             CNAH +    L P          LG G+Y  +S+ NHSC P     F G   VVRA++
Sbjct: 377 FPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFYGDTCVVRAIK 436

Query: 90  HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 128
            + KG +V  +Y  +    +   R   L+ QY F+C C  C
Sbjct: 437 TIRKGHQVSDNYGALYATNTLKERHDKLQPQYFFSCRCEPC 477


>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 737

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/288 (21%), Positives = 108/288 (37%), Gaps = 66/288 (22%)

Query: 39  KLACNAHTICNSELRP-------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 85
           +L CN H I    +               + T +YP  S++NHSC PN +  F+ +  +V
Sbjct: 422 QLICNGHAITRLNISDSESGNVVTEYQCRIATAIYPSASMMNHSCDPNIINSFKDQYLIV 481

Query: 86  RAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
           +A + +    EV   Y           RQ AL+ QY FTC C                  
Sbjct: 482 KATKDIAAKEEVFNCYGPHYRRMRKKDRQIALQNQYCFTCEC------------------ 523

Query: 144 EGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 201
                  + C+   L++  DK   F C++C               V I+S  ++    C 
Sbjct: 524 -------EACTQRALQNFSDKFQRFNCEECN------------GPVEIISHSSMRCLDCE 564

Query: 202 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 261
              ++V +  +  +   KL+    +NL   +             KEAL   +  + + +R
Sbjct: 565 TTFDLVKSQLLELEEANKLFEAAKINLKSQKV------------KEALENAKQCLEIRKR 612

Query: 262 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 309
           +  ++H  + L Y   GK+    G   ++I  +  ++  +    G +S
Sbjct: 613 ILYEYHESVTLTYDLIGKIFAVTGRWLDSISHLEHSLAAVEERFGPDS 660


>gi|384252143|gb|EIE25620.1| hypothetical protein COCSUDRAFT_83627 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/226 (23%), Positives = 100/226 (44%), Gaps = 20/226 (8%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 112
           +G  +YP  +++NH+C PN    F G    + A   +P G  +   Y   AG ++   R 
Sbjct: 171 MGLAIYPRAAMLNHACSPNVAAAFAGTRLHICATSDLPPGTTLRFCYGPQAGESIREVRL 230

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 172
           + LKEQY F C C  C     +  +QE+A++ G +C+   C+G ++            CG
Sbjct: 231 RQLKEQYHFWCRCSAC--EASYSGVQEAAMI-GMKCRL--CNGPVVPSIG--------CG 277

Query: 173 L-VRSKEEIKKIASEVNILSKKTLALTSCGNHQ----EVVSTYKMIEKLQKKLYHPFSVN 227
             + S+E+I+++A      +    A  S   H     E + +     ++Q+ L    +  
Sbjct: 278 AEMSSEEQIERMARLSQADAAFASACLSLAQHSAVSAETLQSMHFCLEVQEVLLVTSNQV 337

Query: 228 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 273
           L +T + L +      D + A+ + Q  + + +  Y   H ++  Q
Sbjct: 338 LGRTHDALARAYGPRGDTENAMLHAQKALDIVRSNYGDGHLVVAHQ 383


>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
 gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
          Length = 660

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)

Query: 15  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 73  NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 129
             V  F G    + +V+ +  G  +  +Y  I T      RQ  +KE Y F C C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREERQAKMKELYWFECCCDACID 539

Query: 130 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 186
               F+D+    I   +RC+  + C+  +      +D    C  CG V +  +  K+  +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597

Query: 187 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 224
             ++++    L   G++ + +  +  +I  +   L  PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636


>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
           intestinalis]
          Length = 773

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 72/154 (46%), Gaps = 8/154 (5%)

Query: 1   MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF-SKLACNAHTICNSELRPLGTGL 59
           M+ I+  QL    ++   + L   + ++ IN  + +F ++  C+  T+   ++  + +  
Sbjct: 440 MTKIEPGQLTSMGEVIEYL-LHRHYLQVPINGQSISFVTEELCDNVTVTRRDI--VASAF 496

Query: 60  YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA--GSTMTRQKALKE 117
           +P +S++NHSC  N   +F G     R+ Q +P GAE+   Y  +    S   RQK LKE
Sbjct: 497 FPTMSMMNHSCDCNTDALFNGSTVTFRSNQFIPVGAEITHCYGPSVFHASFEERQKTLKE 556

Query: 118 QYLFTCTCPRCI--KLGQFDDIQESAILEGYRCK 149
            Y F C C  C   K G  D       L   +CK
Sbjct: 557 NYSFDCDCTPCAMHKSGNTDKEALHTQLRAMKCK 590


>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
          Length = 439

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 26/149 (17%)

Query: 40  LACNAH----TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHV 91
           L CN H    T+       +G+ +Y   S+ NHSC PN   +      G L ++ ++  +
Sbjct: 261 LECNTHEIGITVDEYNYCSIGSAVYEKASLFNHSCQPNVCRINRAGEWGALEMI-SLTDI 319

Query: 92  PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------------- 137
             G E++ +YI+ +  T  RQ  L E Y F C C  C+                      
Sbjct: 320 AAGTELVYNYIQISLPTEDRQSKLSENYFFECKCNGCVNNTSISSRQKQQKQQQKQRQQK 379

Query: 138 ---QESAILEGYRCKDDGCSGFLLRDSDD 163
               + + L+ Y C  D C+G L+ D++D
Sbjct: 380 QSNSQKSFLKKYLCTRDTCNGVLITDAND 408


>gi|159465319|ref|XP_001690870.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158279556|gb|EDP05316.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1470

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 19/126 (15%)

Query: 28   ISINEIAENFSKLACNAHTICNSELR---PLG-TGLYPVISIINHSCLPNAVLVFEGRLA 83
            +++N + E+F  LA     +C  ELR   P G  GL+P  +   HSC P A     G   
Sbjct: 1174 VNLNCMGEDFQDLA-----LC--ELRGEAPRGHIGLWPEAAFAAHSCAPTATAYSIGDRL 1226

Query: 84   VVRAVQHVPKGAEVLISYIETAGSTMT-----RQKALKEQYLFTCTCPRCIKLGQFDDIQ 138
            ++RA   +PKG EV ++++   GS +T     R+  L+ QY FTC C RC    +     
Sbjct: 1227 LIRAAAEIPKGGEVSLNFL---GSLLTSPLSVRRAELRSQYGFTCGCSRCAAEARHTGTP 1283

Query: 139  ESAILE 144
             +A++E
Sbjct: 1284 LAALVE 1289


>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 627

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 5/100 (5%)

Query: 36  NFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVP 92
           N  KL  NA +I + +   PL   LYP IS+ NHSC  N     V   R+ V++AVQ +P
Sbjct: 441 NARKLNINAPSISHQQFTFPLALTLYPTISLFNHSCDGNIKRSGVISDRIRVMKAVQPIP 500

Query: 93  KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           KG ++  +Y  +       +RQ A  +++ F C C  CIK
Sbjct: 501 KGTQLCCNYGIMFKEHDKESRQSACNDRFNFNCYCDPCIK 540


>gi|159110558|ref|XP_001705534.1| Hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
 gi|157433620|gb|EDO77860.1| hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
          Length = 409

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 8/104 (7%)

Query: 34  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAVV 85
           A+ F+   C    N+++I +   + +G GLY +IS  NHSC PNA ++F      R A +
Sbjct: 300 AQMFNYFVCILLTNSYSIEDKTGQEIGAGLYSLISCCNHSCTPNAQVIFGDSENAREATL 359

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             ++   +  E+ ISYI   G ++  ++    Q+ FTC C RC+
Sbjct: 360 VLLRPCAQKEELYISYITDLGRSVVERRRELAQWCFTCQCTRCL 403


>gi|361129398|gb|EHL01305.1| putative SET domain-containing protein 5 [Glarea lozoyensis 74030]
          Length = 286

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 11/103 (10%)

Query: 36  NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA-----VV 85
           NF        +I  S   PLG      G++  IS INH CLPNAV  + G L       V
Sbjct: 49  NFPGQGNPLTSIIRSNGYPLGADAGCGGVFENISRINHGCLPNAVQNWNGLLGEEGEETV 108

Query: 86  RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
            A++ + +G E+  SY+ + G++  R+  LK+ + F CTC  C
Sbjct: 109 YAIKDIKEGEEITTSYL-SGGTSKERRAVLKQSFGFDCTCKLC 150


>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
 gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
          Length = 527

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 21/192 (10%)

Query: 16  ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
           ANL++ I       +WP++ I  IA   + L  NA  +  +  R    GL+P  ++ +H 
Sbjct: 200 ANLISFIKTVLGLREWPDLDILRIA---AILDTNAFEVRQAGDRIKVRGLFPGGAMFSHD 256

Query: 70  CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           C+PN    F+  + ++  A + + KG  + ISY +   ST+ R+  LK+   F C C RC
Sbjct: 257 CVPNMRHRFDDDMNIMFLAKRPIAKGEILSISYTQQLRSTIQRRLHLKQVKCFECACARC 316

Query: 129 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCGLVRSKEEI----KK 182
                 D  +       + C       F+  D   +   + C+ C L+R  +E      K
Sbjct: 317 A-----DPTELGTYAGAHMCAKCKVGKFISMDPLQNAANWRCEVCNLIRPAKEYLINDAK 371

Query: 183 IASEVNILSKKT 194
           I +E+  L K T
Sbjct: 372 IEAELESLDKTT 383


>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
 gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
          Length = 394

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 9/102 (8%)

Query: 32  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQ 89
           EI   ++K+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+ 
Sbjct: 266 EIDNLYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALS 321

Query: 90  HVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 128
            + +G E+ ISY++      S  +R K L   Y+F C CP+C
Sbjct: 322 PIQEGDEICISYLDECQLERSRHSRHKILCGNYIFICQCPKC 363


>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
 gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
          Length = 387

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 104
           +  S L   G+ LY + S +NHSC PNA   F     +  +  VQ +  G E+ ISY++ 
Sbjct: 276 VVGSFLNNEGSALYSLQSKVNHSCAPNAECQFPHSNNVLALTTVQDIAAGEEICISYLDE 335

Query: 105 AG---STMTRQKALKEQYLFTCTCPRC 128
                S  +RQK L+E YLF C C +C
Sbjct: 336 CALERSRHSRQKMLRENYLFQCQCEKC 362


>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Otolemur garnettii]
          Length = 416

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F        V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFICSCPKCL 382


>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
 gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
          Length = 430

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 8/117 (6%)

Query: 42  CNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 100
           CN+ +I N + + P+G+GLY  ++  NHSC   + +VFEG    +R  Q      E+ IS
Sbjct: 159 CNSFSIINEKRVEPIGSGLYVGVAKHNHSCASTSHVVFEGNQVFLRTNQE-EYSKELTIS 217

Query: 101 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
           Y+     T  R+K ++  +  TC C  C K  + D I  S+     +C+   C+GF+
Sbjct: 218 YVSRMLPTSERRKTIRGVHFLTCQCEMC-KNEELDLIGLSS-----KCRTKNCTGFV 268


>gi|320168174|gb|EFW45073.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 291

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 46/76 (60%), Gaps = 4/76 (5%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR---LAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           G+GL+ + + +NHSC PN V +       L+VV AV  +P   EV ISYI+T      RQ
Sbjct: 213 GSGLFEIGNTMNHSCQPNVVSMTRATDFTLSVV-AVATIPVNTEVCISYIDTDLPKAKRQ 271

Query: 113 KALKEQYLFTCTCPRC 128
            AL+E Y F+C+C +C
Sbjct: 272 AALEELYYFSCSCAKC 287


>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 712

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)

Query: 42  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLIS 100
           CN   + +S+   +G  ++P  S  NHSCLPN   V + G +A   A+Q + KG  + I 
Sbjct: 559 CNNFGLFSSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIAAFYALQSIRKGEPLTIC 618

Query: 101 YIETA-GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
           Y++    ST  R++ L   Y F C C RC      D   ++ + E   C      G+L
Sbjct: 619 YVDVQEASTAERRRTLLTSYRFFCECRRCHGYSDEDGKGDTVMSEIRFCDTCDARGYL 676


>gi|406863858|gb|EKD16905.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 739

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 6/91 (6%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           +GL+ + S INHSC PN    F G + +VRA + +P   E+ +SYIET       Q+ L 
Sbjct: 508 SGLWILASYINHSCEPNCRRAFIGDVQIVRAARDMPADTEITLSYIETDDPAKMNQR-LF 566

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           + + F CTC  C+     DD +  A ++  R
Sbjct: 567 DGWGFDCTCAMCV-----DDRETPAAVKNRR 592


>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
 gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
          Length = 515

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 11/173 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           I   +    +  + +  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 75  VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
              F +    ++ A   + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRCVDVTE 306

Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
             D   SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354


>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
 gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
 gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
 gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
 gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
          Length = 513

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           I   +    +  + +  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 75  VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306

Query: 134 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
             D   SAI    +C+D  C G +L    D+    + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADEWNGSWRCRECHKQVQKHYVERI 354


>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
           vinifera]
          Length = 405

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294


>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
          Length = 417

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 8/89 (8%)

Query: 58  GLYPVISIINHSCLPNAVLVFE-GRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            ++P  S +NH+C PNA  V +   L+  VR  + + +G E+ ISY      T TRQ  L
Sbjct: 234 AVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHL 293

Query: 116 KEQYLFTCTCPRC------IKLGQFDDIQ 138
            + + FTCTCPRC      I L   +D+Q
Sbjct: 294 AQGFHFTCTCPRCTSPITDITLSIINDLQ 322


>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294


>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
          Length = 382

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GLY + S  NHSC PNA   F  G   + ++A++++  G E+ ISY++      S  +
Sbjct: 283 GSGLYQLQSACNHSCAPNAESSFPYGNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHS 342

Query: 111 RQKALKEQYLFTCTCPRCI 129
           RQ+ L E YLF C C RC+
Sbjct: 343 RQRELSENYLFVCCCERCM 361


>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 691

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 3/80 (3%)

Query: 51  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGST 108
           ++  + TG+YP  S++NHSC PN + ++  +  +VRAV+ +    E+  SY+ T     T
Sbjct: 403 QINIVATGIYPSASMMNHSCNPNIIKIYMDQYLIVRAVEDIFPTEEIFNSYVATYRYKKT 462

Query: 109 MTRQKALKEQYLFTCTCPRC 128
             RQK L E Y F+C C  C
Sbjct: 463 KARQKLL-ELYYFSCKCEAC 481


>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 514

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 61/303 (20%), Positives = 122/303 (40%), Gaps = 35/303 (11%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 80
           + ++  +E   ++AC    I   E+R         LYP ++++NHSC+ N    +   + 
Sbjct: 183 LKLDRFSEECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDY 242

Query: 81  RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 140
           R+ + R    +P   E+  SY  +   T+ R++ L E   F C CPRC      D  + +
Sbjct: 243 RIRL-RTTLKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRC-----SDPTELA 296

Query: 141 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 198
             +   +C   D+G    L     +  + C  C    + + ++K+   +        A++
Sbjct: 297 THMSSLKCNKCDNGIVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAIS 356

Query: 199 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 248
              +  + +   + + +  + + HP    L   R  L ++   ++++          +  
Sbjct: 357 G-ADGADAIHERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 415

Query: 249 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD--------TENAIKS-MTEAVE 299
           +  C+L + +   V P +  + G+  Y       FL           E  +KS MTEA  
Sbjct: 416 VDMCRLLLQILDVVEPGYSRIRGMTLYELHAPLLFLAKGQWNADVIDEAGLKSKMTEAAN 475

Query: 300 ILR 302
           IL+
Sbjct: 476 ILK 478


>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
 gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
          Length = 507

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   FTC C RC+   +LG  F  ++     EGY+               D  ++C  C
Sbjct: 267 KQGKFFTCQCDRCLDPTELGTHFSSLKCGQCAEGYQVPRQPTEA-------DTSWSCASC 319

Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S  E    ++ + SEVN +    +        + ++  Y       K L HP    
Sbjct: 320 GSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEEVERLLRKY-------KSLLHPLHFI 372

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|145347416|ref|XP_001418163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578392|gb|ABO96456.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 639

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 42/67 (62%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 96
            S+   N  TI + +++ +G G+YP  S+ NHS  PNA ++F+G+  VV+ ++ +  G E
Sbjct: 218 LSRFEINGFTIADDDMQRVGFGIYPEASLFNHSSTPNAQVMFKGKTLVVKTLREIAVGEE 277

Query: 97  VLISYIE 103
           + ISY E
Sbjct: 278 ITISYGE 284


>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 976

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 761 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKAGEEITFAYFDVL-SPLEKRKEM 819

Query: 116 KEQYLFTCTCPRC 128
            E + F C C RC
Sbjct: 820 AESWGFRCGCSRC 832


>gi|393233731|gb|EJD41300.1| hypothetical protein AURDEDRAFT_186645 [Auricularia delicata
           TFB-10046 SS5]
          Length = 293

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)

Query: 43  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 95
           N + + +S L PL   +YP  S  +NHSC  NAV +F        R+ VV  V+ V  G 
Sbjct: 83  NNNFVLHSHLTPLAAAVYPAASRSLNHSCASNAVPLFVFAPATPPRMEVV-LVRDVAPGD 141

Query: 96  EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           E+ I YI+ A +   R++ L+  Y F C C RCI
Sbjct: 142 EITIPYIDPALAPSARRERLRASYGFECACARCI 175


>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
 gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
          Length = 529

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 91/196 (46%), Gaps = 29/196 (14%)

Query: 16  ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 69
           ANL+  I        WPE+ I  IA   + L  NA  +  +  R     ++P  ++I H 
Sbjct: 200 ANLITFIKTVLGQRDWPELEILRIA---AILDTNAFEVRQNGERRKVRAIFPGGAMIAHD 256

Query: 70  CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           C+PN    F+  + ++  A + +PKGA + ISY +   ST+ R+  LK+   F C C RC
Sbjct: 257 CVPNLRHRFDDNMRIIFLAKRPIPKGAILSISYTQPLRSTVQRRVHLKQVKCFDCACERC 316

Query: 129 ---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEI-- 180
               +LG F      A L G +CK     G ++  +   +   + CQ C + +S  E+  
Sbjct: 317 SDPTELGTF----AGAHLCG-KCK----VGKVISQNPLENAANWHCQLCNVKKSAREVLT 367

Query: 181 --KKIASEVNILSKKT 194
              ++  E+  L K T
Sbjct: 368 QDARLQQEIESLDKTT 383


>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
 gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
          Length = 398

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 3/77 (3%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETA-GSTMTRQ 112
           G+GLY + S  NH C PN  + F    A   V+A++++ +G E+ ISYI++   +   RQ
Sbjct: 298 GSGLYLLQSCCNHDCSPNVEINFLDNNATLTVKAIRNISEGQELCISYIDSDIKNWKKRQ 357

Query: 113 KALKEQYLFTCTCPRCI 129
             L E YLF CTC RC+
Sbjct: 358 AILMENYLFECTCNRCM 374


>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
          Length = 548

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 1/90 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           KL  NA +         G  L   +++INHSC+PNA++ F GR A +R+   +  G+E+ 
Sbjct: 183 KLQTNAFSRSEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRSASFIHPGSEIE 242

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           ISYI+       RQ  L   Y F C C +C
Sbjct: 243 ISYIDQTQPKSRRQHELS-LYHFECHCSKC 271


>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
           TFB-10046 SS5]
          Length = 373

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)

Query: 43  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 95
           N + + +S L PL   +YP  S  +NHSC  NAV +F        R+ VV  V+ V  G 
Sbjct: 163 NNNFVLHSHLTPLAAAVYPAASRALNHSCATNAVPLFVFAPATPPRMEVV-LVRDVAPGD 221

Query: 96  EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
           E+ I YI+ A +   R++ L+  Y F C C RCI
Sbjct: 222 EITIPYIDPALAPSARRERLRASYGFECACARCI 255


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 214 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGREVCLSYFPVNMNYSS 273

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y F C C RC
Sbjct: 274 RQKRLLEDYGFKCDCDRC 291


>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
          Length = 404

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)

Query: 58  GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 111 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 134
           RQ+ L E Y F C C RC    Q+                                    
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEG 333

Query: 135 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 180
                   DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
          Length = 445

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 104/231 (45%), Gaps = 29/231 (12%)

Query: 32  EIAENFSKLACNAHTICNS-------ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           E  +N   +A     +CNS        + P+G+G+Y  ++  NHSC     +VF+G  AV
Sbjct: 163 ESMDNNKMIAIICSVLCNSFGIIAEKRVEPIGSGMYVGLATHNHSCASTCHVVFDGNQAV 222

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 144
           +R+ +         ISY+     T  RQK+++  +  TC C  C+     +D+  + +  
Sbjct: 223 LRS-RDRQYCKNTTISYVSRMLPTFERQKSIRNVHFITCRCEMCLN----EDLDLTGL-- 275

Query: 145 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLV---RSKEEIKKIASEVNILSKKTLALTSCG 201
             +C+   C GF ++ ++    TC +  ++   +S +   K+   +  L K         
Sbjct: 276 ASKCQTSKCQGF-VKGANGSCTTCGKPAMIPFEQSTQSTSKLLDTLENLHK--------S 326

Query: 202 NHQEVVSTYKMIEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEAL 249
              + V  Y+ ++ LQ+   ++    +V ++Q  E+L     E+++  E +
Sbjct: 327 QQLDTVQEYRHLQNLQEEYGRILADCNVAILQLDEQLAYCASEMKNVPEDM 377


>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 647

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 11/102 (10%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAV-------VR 86
           S+   N+ T+ +  L  +G    P  + INHSC PNAV+VF    EG  +        V 
Sbjct: 277 SRFIDNSFTLTSIILDQIGVVFVPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVI 336

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++ +  G E++ SYI++AG+   R+  L ++Y F C C  C
Sbjct: 337 AIKPIEPGEEIVTSYIDSAGTRQERRNELVKRYKFVCDCQAC 378


>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 499

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 19/167 (11%)

Query: 24  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 81
           +WP   ++EI      +  N H +  ++  P    +Y   S++ HSC+PN    F  +G 
Sbjct: 168 EWP---VDEILRVCGIVQVNGHEVPLTD--PPYVAIYDAGSMLEHSCVPNCSKSFTRDGH 222

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 141
           L +  A   V  G  + ISY +    T  R   L +   F C CPRC      D  +   
Sbjct: 223 LLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHLADTKFFVCKCPRCS-----DPTELGT 277

Query: 142 ILEGYRCKDDGCSGFLL-------RDSDDKGFTCQQCGLVRSKEEIK 181
              G +C  + C G+ L        D  D  + C  C  V     I+
Sbjct: 278 YFSGVKCATEDCMGYSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIE 324


>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
          Length = 383

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           G+  Y + S++NHSC P+  + F     +A   A +H+ +G ++ +SY++       R++
Sbjct: 300 GSAAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQ 359

Query: 114 ALKEQYLFTCTCPRCIK 130
           AL   Y FTC CPRC++
Sbjct: 360 ALAWAYGFTCRCPRCVE 376


>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
          Length = 490

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 9/145 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ ++ NH C+PN+   F+ +  + V A   +  G E+ +SY +    T TR++ L+
Sbjct: 216 GLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPIAAGEEITMSYTDLLWDTSTRRQFLR 275

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
               F+C C RC      D ++  + L    C  D C G LL  +    +  + C +C  
Sbjct: 276 ITKRFSCNCNRC-----SDPLEFGSRLSALLCAKDECLGHLLPRNPLNYESSWICDKCQT 330

Query: 174 VRSKEEIKKIASEVNILSKKTLALT 198
             +  +I+ I S +N      +  T
Sbjct: 331 SVNHRQIECIHSGLNTFVSNVMYKT 355


>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
 gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
          Length = 404

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)

Query: 58  GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 111 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 134
           RQ+ L E Y F C C RC    Q+                                    
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEDEDDAGDRGDDGIEEEEG 333

Query: 135 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 180
                   DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
          Length = 786

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 7/117 (5%)

Query: 44  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 103
           AH+  N E   +G  +Y   S+ NHSC PN    F  R   +R    V  G+++ +SY  
Sbjct: 462 AHSTSNVEQVRVGKAIYKAGSLFNHSCQPNVHAYFLSRALYLRTTNVVAAGSQLELSYGP 521

Query: 104 TAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 158
             G      R   LK +Y F C C  C ++ + D      +L  + C +  CSG +L
Sbjct: 522 QVGLWDCKDRLNFLKNEYAFHCLCTGCSEVNRSD-----LVLNAFHCVNPNCSGAVL 573


>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
           Full=ASH1-related protein 2; AltName: Full=Protein SET
           DOMAIN GROUP 39
 gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 398

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296


>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Apis mellifera]
          Length = 444

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
           + TG+YP  SI+NHSC PN + +F  +  +VRA + + +G E+   Y       +T  RQ
Sbjct: 373 VATGIYPSASIMNHSCDPNIINIFVNQYLIVRASRDISQGEEIFNCYGPHYRHMTTENRQ 432

Query: 113 KALKEQYLFTC 123
           K LK QY F C
Sbjct: 433 KILKNQYCFIC 443


>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
 gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
          Length = 515

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)

Query: 13  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           A I   +    +  + S  E+      L  N H +  S+  P    ++   S   HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244

Query: 73  NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           N    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
            +      SAI    +C+D  C G +L    D     + C++C     K  I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354


>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
 gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
          Length = 515

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)

Query: 13  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 72
           A I   +    +  + S  E+      L  N H +  S+  P    ++   S   HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244

Query: 73  NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           N    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304

Query: 132 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 183
            +      SAI    +C+D  C G +L    D     + C++C     K  I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 1/92 (1%)

Query: 39  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 98
           KL  NA +         G  L   +++INHSC+PNA++ F GR A +RA   +  G E+ 
Sbjct: 183 KLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIE 242

Query: 99  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           ISYI+       R   L + Y F C+C +C K
Sbjct: 243 ISYIDQTQPRGKRHGEL-DLYHFECSCYKCQK 273


>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
 gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
          Length = 522

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 10/183 (5%)

Query: 24  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 83
           +W E   +EI      +  N H +  +E  P    +Y + S++ HSC PN    F  R  
Sbjct: 200 KWDE---DEILRVVGIIQVNGHEVPMTE--PSSVAIYNMASMLEHSCRPNLAKSFTNRGE 254

Query: 84  VVR-AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 142
           VV  A   + +G  + I Y +   +T  R + L++  +F C C RC    +++    +  
Sbjct: 255 VVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCSDRTEYETYFSAVR 314

Query: 143 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 202
             G++ KD  C G+LL   + +     QC  +R  +E+   A+   IL +  + L +   
Sbjct: 315 CSGFQ-KDSKCKGYLLPVDNAQWTGEWQC--LRCSKEVSG-ATVCQILERARMDLEAMEK 370

Query: 203 HQE 205
           H+E
Sbjct: 371 HRE 373


>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 779

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 41  ACNAHTICNSELRPLGT-------GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 93
           +C   T+ N+  RP G        G++P  +I+NH+C+ N+   F G L + RA + +P 
Sbjct: 459 SCPPLTVDNTRPRPEGVPEPPQTRGIWPHAAIMNHACVANSTRAFCGDLFITRASRDIPA 518

Query: 94  GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           G E+   Y+        RQ  L+  + F C C  C
Sbjct: 519 GDEITQQYVPVRADWGERQAQLRHWWGFECACALC 553


>gi|224029645|gb|ACN33898.1| unknown [Zea mays]
          Length = 369

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 51/111 (45%), Gaps = 7/111 (6%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 106
           +C+ E   +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G
Sbjct: 122 MCSVEQVRVAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVG 181

Query: 107 STMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
                 RQK+L+E Y F+C C  C +L   D      ++  + C    C G
Sbjct: 182 EMHISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCLG 227


>gi|302836969|ref|XP_002950044.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
           nagariensis]
 gi|300264517|gb|EFJ48712.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
           nagariensis]
          Length = 430

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
           GL+P  ++ NHSC PNAV V  GR   +R    V  G EV I+Y      G    R+  L
Sbjct: 286 GLWPAHAMFNHSCAPNAVAVVAGRELHIRCSGPVSAGEEVCITYSGALGLGPLPLRRALL 345

Query: 116 KEQYLFTCTCPRC 128
           ++ + F CTCPRC
Sbjct: 346 EKNHRFRCTCPRC 358


>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
          Length = 327

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 148 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 207

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y FTC C RC
Sbjct: 208 RQKRLLEDYGFTCYCDRC 225


>gi|256071345|ref|XP_002572001.1| set and mynd domain containing [Schistosoma mansoni]
 gi|350645143|emb|CCD60151.1| set and mynd domain containing, putative [Schistosoma mansoni]
          Length = 527

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 3/87 (3%)

Query: 47  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY---IE 103
           IC  +   + TGL+P +S++NHSC PN    FE    ++R ++ +  G EV   Y     
Sbjct: 384 ICKFQQIRIATGLFPCVSLLNHSCDPNTAHNFEESFLILRCLKPILPGTEVFHCYGPHYL 443

Query: 104 TAGSTMTRQKALKEQYLFTCTCPRCIK 130
              S++ R   L++QY F C C  C K
Sbjct: 444 HYPSSVQRVTLLQQQYFFICDCEHCSK 470


>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 466

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 20/129 (15%)

Query: 28  ISINEIAENFSKLACNAHTICNSELRPLGT-------------GLYPVISIINHSCLPN- 73
           +S + +     ++  NAH I +  L P+G              GLY +++  NHSC+PN 
Sbjct: 329 LSFDTLRTLVGQMVLNAHAINDYVLLPVGAPSPSAFDWVLKGAGLYSLLACFNHSCVPNV 388

Query: 74  AVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCI 129
           AV   +G   +V +  + +  G  + I+YI  +   M+   RQ+ L+  Y FTC CPRCI
Sbjct: 389 AVSNVDGTHEIVLKTTRPIQAGEPLTITYIPLSAGAMSRAERQRQLR-NYFFTCHCPRCI 447

Query: 130 K-LGQFDDI 137
           K  G   DI
Sbjct: 448 KEAGVAGDI 456


>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
          Length = 786

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 84/309 (27%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           GTGLY + S  NHSC+PNA   F   L V+RA Q + +  E+ ISY   A S  +R+K L
Sbjct: 522 GTGLYLLASYFNHSCIPNAGHQFFSDLMVIRATQSIKQDEEITISYCSHA-SYASREKNL 580

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
           K  +   C C  CI                    DD  +G   R                
Sbjct: 581 KP-WFDQCDCQLCI--------------------DDRLAGSNRRS--------------- 604

Query: 176 SKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQT 231
           ++E + K + +    L +   AL      Q++VSTY   E+++ +L   YH FS  L Q 
Sbjct: 605 ARESLSKAVRNAATPLPQARDAL------QKMVSTYSSQERVRPELSTAYHAFSHRLQQM 658

Query: 232 REKLIKILMELE---------DW----------KEALAYCQL-TIPVYQRVYP------- 264
             + +      E         +W           E   +  L T+P+  R  P       
Sbjct: 659 AHEFVSKAAATEAIEKEIQSLEWLGVEVTEKGITEVSDHSHLPTLPISTRRIPNTFGEPD 718

Query: 265 -QFHPLLGLQYYTCG----KLEWFLGDTENAIKSMTEAVEILRITHG----TNSPFMKEL 315
            QF  + G+ +   G     + WF      + K++   + + + TH        P + E 
Sbjct: 719 MQFLMIFGM-FVLLGCPNRAMSWFRAAIWMSEKTIGGGIPMFKRTHSDIIRVLGPVLAEF 777

Query: 316 ILKLEEAQA 324
           + K+E  +A
Sbjct: 778 LSKIESERA 786


>gi|440800514|gb|ELR21550.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 333

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           G GL+P+ +++NHSC PN  + F   R AVV A+  + +G E+  SYIE       RQ  
Sbjct: 254 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEGEELTHSYIENDRPLAERQAD 313

Query: 115 LKEQYLFTCTCPRCIK 130
           L E Y F C C RC+K
Sbjct: 314 LLE-YNFVCQCVRCLK 328


>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
 gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
          Length = 530

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 83/181 (45%), Gaps = 22/181 (12%)

Query: 10  LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           +L A +   +  +L    WPE+ I  IA   + L  N   +     R     LYP  ++I
Sbjct: 199 VLRANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMI 255

Query: 67  NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
           +H C+PN    F+  + +V  A + + KG  + ISY +   ST+ R+  L++   F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315

Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
            RC    +LG F   Q        +CK    +G ++  +   +   + CQ C   RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKE 366

Query: 180 I 180
           +
Sbjct: 367 V 367


>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
          Length = 690

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 5/128 (3%)

Query: 6   EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC----NSELRPLGTGLYP 61
           E+ L L      +   +++  E+ + +  +    + CNA  +     + ++    TGL+ 
Sbjct: 452 ERILGLQGDYKGIGQKLIELDEVPVIDTFQIHDIVQCNAFGLGQQTEDEDISNASTGLWV 511

Query: 62  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 121
             S INHSC+PNA     G L + RA + +  G E+  +Y E+  S   RQ A +  + F
Sbjct: 512 RASYINHSCIPNAKKDLIGDLILFRATRRIASGEEITHAYDEST-SYEARQAAFRRTWNF 570

Query: 122 TCTCPRCI 129
            C CP C+
Sbjct: 571 ECRCPLCL 578


>gi|169606119|ref|XP_001796480.1| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
 gi|160706916|gb|EAT87158.2| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
          Length = 323

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 21/155 (13%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           GL+P I+ INHSC PNA   +   L   +V A + + KG E+ +SYI    +   RQK L
Sbjct: 117 GLFPKIARINHSCRPNASYYWSQTLNKRIVYATRRIAKGDEIFVSYIPLLLTQEQRQKHL 176

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 175
            ++Y F CTC  C         QE A +E     DD       R +  K F   +  L  
Sbjct: 177 -DRYGFKCTCEAC--------AQEHAAME---ASDD------RRTTISKAFVDFEPQLTL 218

Query: 176 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 210
           +  + KK   +    ++ +L L     H+E ++ Y
Sbjct: 219 TSPKTKKATQQARKNARASLKLAELV-HEEGLADY 252


>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
 gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 393

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 63  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 122
           IS  NHSC PN V V E +   + ++  + KG E+ ISYI+   S   R   LKE Y F 
Sbjct: 273 ISYFNHSCFPNCVRVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332

Query: 123 CTCPRCI--KLGQFDDIQESAILEGYRCKDDG--CSGFL 157
           C C RC    L     ++    +E Y CK+    C+G L
Sbjct: 333 CKCKRCTLPPLSNL-SLEIEKTIENYTCKNQSIKCTGIL 370


>gi|325188967|emb|CCA23495.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 497

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           GT L+ +I  +NHSCLPN ++V+     A V A+Q +    E+ I YI+T  S   RQ  
Sbjct: 415 GTALFSLICTMNHSCLPNCIVVYGHNGEAHVHAIQAIQPKDELCIEYIDTDRSYDERQYE 474

Query: 115 LKEQYLFTCTCPRCIK 130
           L+E Y F C C +C++
Sbjct: 475 LRE-YHFRCQCMKCMR 489


>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
          Length = 501

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 85/195 (43%), Gaps = 15/195 (7%)

Query: 59  LYPVISIINHSCLPNAVLVFE----GRL----AVVRAVQHVPKGAEVLISYIETAGSTMT 110
           L+P ++I++H+C+ N     +    G L      VRA   +PKG E+  SY  +   T+ 
Sbjct: 198 LFPKLAILSHNCVSNIHHAVDCNGDGELKDCVVTVRAAVDIPKGGELFSSYTFSLWPTLV 257

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 170
           R++ L+E   F CTCPRC    + D    S  L+  RC D+G         D   + C  
Sbjct: 258 RREFLRESKFFDCTCPRCSSPTELDTHMSS--LKCQRC-DNGVIISTNTADDACEWKCTH 314

Query: 171 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 230
           C    +   ++K+ + +     +   +   G  Q +     +  K +  L HP +  +  
Sbjct: 315 CDYKTNGRAVRKVFAAIQ---NEIDQVEYIGGPQGIQQRETIFRKYRSVL-HPNNSYMTT 370

Query: 231 TREKLIKILMELEDW 245
            R  LI++  + E +
Sbjct: 371 LRSALIQLYGKTEGY 385


>gi|281212379|gb|EFA86539.1| hypothetical protein PPL_00338 [Polysphondylium pallidum PN500]
          Length = 468

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           G G++ + ++INHSC+PN ++VFE    LA ++A++ +  G E+  SYIE +     R +
Sbjct: 373 GFGVFGLQAMINHSCVPNCLVVFEQGSSLAYIKALRDIVPGEELFHSYIEESAPFEERSQ 432

Query: 114 ALKEQYLFTCTCPRC 128
            L   Y F C CP+C
Sbjct: 433 ELV-TYGFNCDCPKC 446


>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
 gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
          Length = 383

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 7/88 (7%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 109
           G  L+ + S  NHSC+PNA   +     +L++V AV+ + +G E+ ISY++      S  
Sbjct: 284 GVALFTLQSACNHSCVPNAEPTYLHNNNKLSLV-AVRDIQEGEEICISYLDECNLHRSRH 342

Query: 110 TRQKALKEQYLFTCTCPRCIKLGQFDDI 137
           +R+K L E YLF C CP+C +     D+
Sbjct: 343 SRRKELMENYLFACNCPKCEEQADEPDV 370


>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
 gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
          Length = 427

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 41/71 (57%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 128 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 187

Query: 115 LKEQYLFTCTC 125
           L++   F C C
Sbjct: 188 LRQTQFFHCEC 198


>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 341

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 110
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296


>gi|328855353|gb|EGG04480.1| hypothetical protein MELLADRAFT_108480 [Melampsora larici-populina
           98AG31]
          Length = 398

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 9/108 (8%)

Query: 25  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR--- 81
           W   S++ I+     L  NA +I +++L      ++P +S INH C   +  V+  R   
Sbjct: 171 WQNQSLSNISNYIRILQTNAISIGSNQL-----AIFPSLSRINHGCAGASNSVYNWREKE 225

Query: 82  -LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
            + VV A + +  G E+LISY ++  S   RQ  LK  Y F CTC  C
Sbjct: 226 GVEVVHATKLIEVGEEILISYWDSKRSRSDRQDYLKSNYGFQCTCQTC 273


>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 438

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           G G+Y V S+ NHSC PN  ++         VV A++    G E+ ISYI+ +     RQ
Sbjct: 350 GQGVYAVGSLFNHSCEPNLQVLNSATGDETLVVEALRDCEPGEELYISYIDESLPYAIRQ 409

Query: 113 KALKEQYLFTCTCPRCIK 130
           + L+E YLF C C +C++
Sbjct: 410 QQLREHYLFECRCSKCVR 427


>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 558

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 38/76 (50%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           LG    P++   NHSC PN  ++F     ++RA   + K  E+ + Y++       RQ  
Sbjct: 221 LGIAFDPILCSANHSCEPNTAVIFNQPQLILRAQTKIKKDEEIFMKYVDITNPFSVRQAE 280

Query: 115 LKEQYLFTCTCPRCIK 130
           L++ Y F C C +C K
Sbjct: 281 LRQSYFFGCRCTKCRK 296


>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
 gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
 gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
          Length = 338

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           G++P +S  NH C PN++  +   L  AVV  V+ V +G E+ +SY+E   +   RQ+ L
Sbjct: 119 GIFPKVSRFNHGCRPNSMRSYHPVLDQAVVHVVKDVSEGEEITVSYVEPGLAFYLRQEQL 178

Query: 116 KEQYLFTCTCPRCI 129
           KE++ F C C  C+
Sbjct: 179 KEKFGFICGCNLCL 192


>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
 gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
          Length = 662

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
           GL+  + +INHSC+PNA  +  G +  VRA + +  G EVL+SY+E   A     R+K +
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 504

Query: 116 KEQYLFTCTCPRC 128
            +QY F C+C RC
Sbjct: 505 -QQYNFECSCERC 516


>gi|426226456|ref|XP_004007359.1| PREDICTED: SET and MYND domain-containing protein 5, partial [Ovis
           aries]
          Length = 391

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 56  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G  + ISY++      S  +
Sbjct: 278 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGERLGISYLDCCQRERSRHS 337

Query: 111 RQKALKEQYLFTCTCPRCI 129
           R K L+E YLF C+CP+C+
Sbjct: 338 RHKILRENYLFVCSCPKCL 356


>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
 gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
 gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
 gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
 gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
          Length = 507

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   FTC C RC+   +LG  F  ++     EG++               D  + C  C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 319

Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 320 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 551

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 28  ISINEIAENFS---KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           I++  + E+F    KL  NA +         G  L   +++INHSC+PNA++ F GR A 
Sbjct: 171 ITLKGLEESFQVLCKLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNAMVQFGGRTAT 230

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +R+   +  G E+ ISY +       RQ  L + Y F C+C +C
Sbjct: 231 LRSATFINPGDEIEISYADHTQPKGKRQGEL-DLYHFECSCYKC 273


>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
          Length = 969

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 754 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 812

Query: 116 KEQYLFTCTCPRC 128
            E + F C C RC
Sbjct: 813 AESWGFCCGCSRC 825


>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
          Length = 382

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 9/91 (9%)

Query: 52  LRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYI---ETA 105
           L   G GLY   S  NHSC PNA + F     RL+++ A++ +  G E+ ISY+   +  
Sbjct: 280 LNSEGVGLYARQSCANHSCDPNAEISFVHNNYRLSLI-ALKDIQPGEEICISYLGDCDNE 338

Query: 106 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 136
            S  +R+K L E YLF C C +C  L + DD
Sbjct: 339 RSRHSRRKTLMENYLFACECSKC--LAEIDD 367


>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
 gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
          Length = 514

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           I   +    +  + +  +I +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEQIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 75  VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 133
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306

Query: 134 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 183
             D   SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354


>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
 gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
          Length = 664

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 115
           GL+  + +INHSC+PNA  +  G +  VRA + +  G EVL+SY+E   A     R+K +
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 506

Query: 116 KEQYLFTCTCPRC 128
            +QY F C+C RC
Sbjct: 507 -QQYNFECSCERC 518


>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 494

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 9/142 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ ++ NH+C PN    F E     V A   +  G E+ +SY      T  R++ L 
Sbjct: 214 GLYPLGALQNHNCAPNTRHHFDENYRLYVSATVPISAGEELTMSYTSLFWDTTLRRQFLS 273

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RDSDDKG--FTCQQCGL 173
               F+CTC RC      D  +  ++L    C  D CSG LL RD    G  + C  CG 
Sbjct: 274 ITKHFSCTCKRC-----SDPTESGSMLCALLCAFDNCSGNLLPRDPLRIGTPWVCDMCGK 328

Query: 174 VRSKEEIKKIASEVNILSKKTL 195
             +  +I  I S +  ++++ L
Sbjct: 329 SINYRQICSIRSGLAAITEEAL 350


>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
 gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
          Length = 504

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 33/234 (14%)

Query: 32  EIAENFSK---------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---E 79
           ++AE FS+         L  NA     S+  PL   LYP   I+ H+C+PN        E
Sbjct: 169 KLAERFSEELIMQVVGVLEVNAFEARTSQGYPLRC-LYPYTGILAHNCVPNTARTIHPSE 227

Query: 80  GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 139
           G    +RA+  +  G ++  SY  T   T  RQ  L+E   FTC+C RC+   +      
Sbjct: 228 GYKIRLRAMVALEAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFS 287

Query: 140 SAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGLVRSKEEIKKIA----SEVNILSK 192
           S  ++  +C +    G+LL       +  + C+ C    S EE+++I     +EVN +  
Sbjct: 288 S--MKCGQCTE----GWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQA 341

Query: 193 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 246
             +        + ++  Y       K L HP        R+ LI++   ++ ++
Sbjct: 342 LDMGPKRLEESERLLRKY-------KSLLHPSHYIATSLRQLLIEMYGRVQGYE 388


>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 467

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/185 (25%), Positives = 78/185 (42%), Gaps = 21/185 (11%)

Query: 14  QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG----TGLYPVISIINHS 69
            IA+ +    +  +IS  EI      L  N H +      PL       +Y + S+I H+
Sbjct: 184 NIAHFIKRFFKADDISEEEIVTIIGILQVNGHEV------PLTDSPYVAVYEMASLIEHN 237

Query: 70  CLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 127
           C  N    F   G L ++RA   + KG  + I Y +    T  R+  L +   F C C R
Sbjct: 238 CRANCSKSFTDMGGL-IIRAALPITKGDHISICYTDPLWGTANRRHHLLKTKFFECICNR 296

Query: 128 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEIKKIA 184
           C    +F       +    +C    CSG++L  +    ++ + C+ C  + S  EI+K+ 
Sbjct: 297 CQDPTEF-----QTMFNALKCNKINCSGYVLPKTFLEQEQDYVCKTCESIVSCTEIEKML 351

Query: 185 SEVNI 189
            ++ I
Sbjct: 352 EDIGI 356


>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
 gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
          Length = 378

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 20/132 (15%)

Query: 40  LACNAHTIC-----NSELR-----PLGTGLYPVISIINHSCLPNAVLVFE------GRLA 83
           L CN+H I      NS +       +G G+Y   S+ NHSC PN   V E      G  +
Sbjct: 226 LECNSHDISFPFRTNSSITTFNYYSIGIGIYFNSSLFNHSCQPNICKVIESCKNNFGCHS 285

Query: 84  VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 143
           +V A++ + +  E+  +YI+   +   R + L+  Y FTC CP C      +       L
Sbjct: 286 MV-AIRDIKENEEISFNYIQLNQNKQERIEKLQSSYYFTCNCPSCCTN---NPNPHKEFL 341

Query: 144 EGYRCKDDGCSG 155
             Y+CK   CSG
Sbjct: 342 NKYKCKKLNCSG 353


>gi|345570043|gb|EGX52868.1| hypothetical protein AOL_s00007g204 [Arthrobotrys oligospora ATCC
           24927]
          Length = 772

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%)

Query: 57  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           +GL+ + +  NHSC+ NA   F G + + RAV  +PK  EVLISY +       RQ  L+
Sbjct: 519 SGLWILPAYTNHSCISNAHRTFLGDMMIFRAVVDMPKDTEVLISYTDHNLPYEERQGILQ 578

Query: 117 EQYLFTCTCPRCI 129
             + FTC C  C+
Sbjct: 579 TSWKFTCQCKLCV 591


>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
 gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
          Length = 545

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 330 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 388

Query: 116 KEQYLFTCTCPRC 128
            E + F C C RC
Sbjct: 389 AESWGFCCGCSRC 401


>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 450

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 60/107 (56%), Gaps = 10/107 (9%)

Query: 29  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGR 81
           +I ++ + FS+ + N   + +S L  +  G++P+ S + NHSCLPNA   +        R
Sbjct: 167 TIPDVNDLFSRFSNNNFAV-HSHLTTVAHGVFPLASRLFNHSCLPNAAARYILSEDAAPR 225

Query: 82  LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           + +V A+Q +  G E+ + Y++ A    +RQ+  +  Y FTCTCP C
Sbjct: 226 MEIV-ALQDIGAGEEICVPYLDPA-LLQSRQQGFQLTYGFTCTCPSC 270


>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 650

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 79/171 (46%), Gaps = 14/171 (8%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 112
           +G GLYP +++ NHSC P     ++G    VR+V+ +  G+ +  +Y  + T      R+
Sbjct: 448 IGGGLYPTLALFNHSCEPGVTRYYKGNSVCVRSVRSIAAGSMIGENYGPLFTQTPRDERR 507

Query: 113 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG---CSGFLLRDSDDKGF-- 166
             L  QY F+C C  C +    F ++ ++ +    R K DG   CS  LL  S+   F  
Sbjct: 508 ATLLNQYRFSCNCRACSENWPLFSEMDDTVL----RFKCDGGKICSNVLLIPSEINEFMI 563

Query: 167 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 217
            C  CG   +  +  K+  +   + K    + S G  +  +  YK +E++ 
Sbjct: 564 KCTDCGEHTNIMKGLKLVQDTETMFKLATKMHSAGEIEAAL--YKYVEEMN 612


>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
          Length = 486

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 14/158 (8%)

Query: 54  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 112
           P    +Y   S++ HSC PN    F  R  +V  A   + +G  + I Y +   +T  R 
Sbjct: 189 PSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRL 248

Query: 113 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD---KGFTC 168
           + L++  +F C C RC    +++    +    G++ KD  C G++L  DSD+     ++C
Sbjct: 249 EHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQ-KDSKCKGYILPVDSDNWTTGDWSC 307

Query: 169 QQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 205
           Q+C G+V         A+   IL +  + L +   H+E
Sbjct: 308 QKCRGVVTG-------AAVGQILERARMDLEAMQKHRE 338


>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
          Length = 257

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 53  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           R   TGL+   + INHSC+ NA   + G L V+RA+  + KG E+  SY E+A     RQ
Sbjct: 104 RNASTGLWIWAAYINHSCIANAKKEYVGDLMVLRALGAIKKGEEIFHSYDESA-DYEARQ 162

Query: 113 KALKEQYLFTCTCPRCIKLGQFDD 136
           KAL   + F C C  C      DD
Sbjct: 163 KALLTTWGFECGCALCAAERTDDD 186


>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
 gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
          Length = 404

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 58  GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 110
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 111 RQKALKEQYLFTCTCPRC 128
           RQ+ L E Y F C C RC
Sbjct: 274 RQRRLLEDYGFRCECDRC 291


>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
          Length = 589

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 24/121 (19%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLG-------TGLYPVISIINHSCLPNAVLVFE 79
           E+S  EI     +L  N HT+   +  PLG        G++P  S++NHSC PNA   F 
Sbjct: 299 EVSAEEIQTVHRRLRRNFHTV---QPEPLGRLLPPTVVGVFPAASLVNHSCEPNA--CFH 353

Query: 80  GRLA---------VVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPR 127
            R A          +R    V  G EV +SY+       T   R++ L+  + F+C CPR
Sbjct: 354 SRRAGPEGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPR 413

Query: 128 C 128
           C
Sbjct: 414 C 414


>gi|51859341|gb|AAH81479.1| SMYD family member 5 [Danio rerio]
          Length = 300

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+GL+ + S  NHSC+PNA   F     L  + A+  +  G E+ ISY++      S  +
Sbjct: 188 GSGLFLLQSSCNHSCVPNAEASFPDNNFLLHLTALGDIGPGEEICISYLDCCQRDRSRHS 247

Query: 111 RQKALKEQYLFTCTCPRCIKLGQFDD 136
           R K L+E YLF C+C +C  L Q DD
Sbjct: 248 RHKILRENYLFICSCQKC--LSQMDD 271


>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
 gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
          Length = 325

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 25  LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 84

Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   FTC C RC+   +LG  F  ++     EG++               D  ++C  C
Sbjct: 85  KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCAGC 137

Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 138 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 190

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 191 ATGLRQLLIEMYGRVQGYE 209


>gi|351737851|gb|AEQ60886.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
          Length = 263

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203


>gi|311978082|ref|YP_003987202.1| putative SET domain-containing protein [Acanthamoeba polyphaga
           mimivirus]
 gi|81999695|sp|Q5UNT8.1|YL678_MIMIV RecName: Full=Putative SET domain-containing protein L678
 gi|55417289|gb|AAV50939.1| unknown [Acanthamoeba polyphaga mimivirus]
 gi|308204535|gb|ADO18336.1| putative SET domain-containing protein [Acanthamoeba polyphaga
           mimivirus]
 gi|339061617|gb|AEJ34921.1| hypothetical protein MIMI_L678 [Acanthamoeba polyphaga mimivirus]
          Length = 255

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203


>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
 gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
          Length = 507

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   FTC C RC+   +LG  F  ++     EG++               D  ++C  C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCASC 319

Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 320 GSETSNVDALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|146263138|gb|ABQ15327.1| Mcg1p [Magnaporthe grisea]
          Length = 456

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 57  TGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            G +  +S +NH C PN A  V +  +    A + +  G E+ ISY+++  S   RQ+ L
Sbjct: 258 AGTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERL 317

Query: 116 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
           K  + FTCTCP C   G    + +S +   Y+
Sbjct: 318 KRNWGFTCTCPHCSLSGAMGKLSDSRLYAIYQ 349


>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
          Length = 520

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           NA    +++L  +G  L P +++ NHSC+PNA++ F GR A++RA + +    E+ ISY 
Sbjct: 163 NAFHRYDTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEISYT 222

Query: 103 ETAGSTMTRQKALKEQYLFTCTCPRCIK 130
           +       R+ AL   Y F C C RC K
Sbjct: 223 DYTFPLSKRKLALAP-YFFDCMCLRCEK 249


>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 420

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%), Gaps = 4/82 (4%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG 106
           S+    G GLY  ++ +NHSC PN VL     + +L+++ A++ +  G E+ ISYI+ + 
Sbjct: 337 SKFSVEGEGLYKYLNTLNHSCDPNCVLACTTDDFKLSLI-ALKDIKAGEELTISYIDNSM 395

Query: 107 STMTRQKALKEQYLFTCTCPRC 128
           +  TR K L +QY F C C +C
Sbjct: 396 NKETRLKTLMDQYNFDCKCKKC 417


>gi|348668850|gb|EGZ08673.1| hypothetical protein PHYSODRAFT_352669 [Phytophthora sojae]
          Length = 1354

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 70/140 (50%), Gaps = 20/140 (14%)

Query: 5    DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--------- 55
            DE Q ++ +     V  +L W E  + ++ +   +L  N   + +SE  PL         
Sbjct: 1167 DEDQTIIAS-----VRRVLGWEEERL-QVLQTQERLERNDGEL-DSEFDPLMPLGYPSIE 1219

Query: 56   GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
            GT L+P+I  +NHSC PN  +++   G   VV A++ + KG E+ I YI+       R+ 
Sbjct: 1220 GTALFPIICTMNHSCDPNCTVLYTKNGDGHVV-AIRDIHKGEELCICYIDVDMDVQMREA 1278

Query: 114  ALKEQYLFTCTCPRCIKLGQ 133
             L+E Y F C C RC++  Q
Sbjct: 1279 NLRE-YKFKCFCSRCVQERQ 1297



 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 56  GTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 114
           GT L+  I  +NHSC PN  VL  +   A V AVQ + +G E+ ISYI+       R + 
Sbjct: 390 GTALFSGICTMNHSCDPNCTVLYTKDGAAHVFAVQDIQEGEELCISYIDVDQEVEERAEC 449

Query: 115 LK 116
           L+
Sbjct: 450 LR 451


>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
 gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
          Length = 510

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 19/121 (15%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE-----------------GRLAV-VRAVQHVPKGAEV 97
           G+ +Y  +S+ NH C PN   + E                  RL++ +  ++ VP+ +E+
Sbjct: 361 GSAVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSEL 420

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 157
           LI+YI        R   LK  ++F C C RC       ++ E  I     CK+  CSG L
Sbjct: 421 LITYIPLNQKGHERYHNLKSNWMFPCDCLRCFHEKTNPELTE-KIFNDSSCKNSKCSGGL 479

Query: 158 L 158
           L
Sbjct: 480 L 480


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 34  AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 93
           AE   K+  N+    +SEL   G  L P +++ NHSC+PNA + F GR A++ A   +  
Sbjct: 168 AEMLCKIQNNSFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRA 227

Query: 94  GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           G E+ I+Y         R++AL   Y FTC C RC
Sbjct: 228 GDEIEIAYTFYTDPLPKRREALA-HYKFTCQCLRC 261


>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
          Length = 657

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 53  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           R   TGL+   + INHSC+ NA   + G L V+RA++ + +G E+  SY E+A    TRQ
Sbjct: 503 RNASTGLWVWAAYINHSCVANAKKEYVGDLMVLRALRPIKEGEEIFHSYDESA-DYETRQ 561

Query: 113 KALKEQYLFTCTCPRC 128
           +AL   + F C+C  C
Sbjct: 562 RALMTTWGFECSCALC 577


>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
          Length = 584

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 40/234 (17%)

Query: 42  CNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 99
           CNA  +C++   P+G  L+   V S  NHSCLPN   +  G +   RA++    G E+ I
Sbjct: 327 CNALAVCDASGLPVGQALHAASVTSYFNHSCLPNCA-IEAGAIVTTRAIR---PGEELTI 382

Query: 100 SYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFL 157
           SY+ +       R++ L E+Y F C   RC++ G  D    ESA+    R          
Sbjct: 383 SYLPQLYWPAWLRREELAERYFFDC---RCVRCGDGDRSPFESALSATLRLGGS------ 433

Query: 158 LRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 215
            R+ + +  +  Q  CG VR+K+       +V++  +  L                ++++
Sbjct: 434 -REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDAL--------------LHLLQE 473

Query: 216 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 269
            ++ L+ PF     +    L  +   L D +  L  C   + +++ + P  HP+
Sbjct: 474 CRQHLF-PFHYLCHELHNTLSFVYAVLGDTRACLCGCLRELVMWEAIVPGAHPV 526


>gi|302832792|ref|XP_002947960.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
           nagariensis]
 gi|300266762|gb|EFJ50948.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
           nagariensis]
          Length = 1170

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 13/76 (17%)

Query: 53  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 112
           RP  TGL+   +++NHSC+PN      G   +VR  + V  G EV             R+
Sbjct: 703 RPGHTGLWTEHALMNHSCVPNVCTYVLGEAMLVRVSRPVQGGEEV-------------RR 749

Query: 113 KALKEQYLFTCTCPRC 128
           KAL +QY FTC CPRC
Sbjct: 750 KALYDQYGFTCCCPRC 765


>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 478

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 61/139 (43%), Gaps = 9/139 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYPV + +NH C+PN +  F  +L ++ +A   + KG E+  SY  +   T  RQ  L 
Sbjct: 216 GLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQEITTSYTYSIWPTSLRQNHLL 275

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKG-FTCQQCGL 173
               F CTC RC    +F        L    C    C G +L     D K  + C+ C  
Sbjct: 276 TSKQFICTCSRCCDTEEF-----GTKLAALNCIVKNCDGRILPINPLDKKSIWQCKMCTK 330

Query: 174 VRSKEEIKKIASEVNILSK 192
           + S  E+     +V  L K
Sbjct: 331 LVSSTEMICFYKDVETLMK 349


>gi|453080542|gb|EMF08593.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 350

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 115
            ++  I+ INHSCLPNA   +   L  +R  +++ +  G E+ +SYI   G + TR+K L
Sbjct: 122 AIFLTIARINHSCLPNAQHSWNAPLQRMRIHSIKTLQPGEEITLSYILPCGPSSTRKKLL 181

Query: 116 KEQYLFTCTCPRC 128
           KE + F C CP C
Sbjct: 182 KENFCFNCLCPLC 194


>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 480

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 8/102 (7%)

Query: 34  AENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVR 86
           +E +S+   N + + +S L P+  G++PV S + NHSC PNA   +       G   VV 
Sbjct: 189 SELYSRFGRN-NFVVHSHLSPIAHGIFPVASRLFNHSCTPNAAAKYIFGRDKPGVRMVVV 247

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A++++    E+ I Y++ A   + RQ +L++ Y F CTC  C
Sbjct: 248 ALRNISHNEEITIPYLDPALPFLERQLSLEQNYGFRCTCHVC 289


>gi|321478979|gb|EFX89935.1| SET and MYND domain-containing 5 protein [Daphnia pulex]
          Length = 387

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 56  GTGLYPVISIINHSCLPNAV--LVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 110
           G+ L+ + S  NHSC PN +    F      + A + + +G E+ ISY++      S  +
Sbjct: 290 GSALFAIQSACNHSCEPNCISTFPFSNHTVALVASKDLEEGEEIFISYLDECAQSRSRHS 349

Query: 111 RQKALKEQYLFTCTCPRC 128
           R+K LKE YLF C C RC
Sbjct: 350 RRKILKENYLFHCNCSRC 367


>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
 gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
          Length = 388

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 113
            G G +P  S  NHSC PN     +GR    RA + + KG E+ I+Y+      ++R K 
Sbjct: 309 FGYGCWPAASYFNHSCHPNVEKQRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKR 368

Query: 114 --ALKEQYLFTCTCPRC 128
              LK+ + F C+C RC
Sbjct: 369 MLRLKKTWGFDCSCERC 385


>gi|159471331|ref|XP_001693810.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158283313|gb|EDP09064.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 484

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 49/103 (47%), Gaps = 10/103 (9%)

Query: 30  INEIAENFSKLACNAHTICNSELRPLG-TGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 88
             + AE+ +  AC           P G  GL+P  +++NHSC PN V    G   VVRAV
Sbjct: 185 FGDDAEDLAACACRGE-------EPRGHIGLWPEFALLNHSCAPNTVNYVVGGSMVVRAV 237

Query: 89  QHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCI 129
             + +G EV I Y+          R   L + Y F C+CPRC+
Sbjct: 238 APISQGEEVTICYLGRPQLLPFNKRIAILADDYGFECSCPRCV 280


>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
           [Acyrthosiphon pisum]
          Length = 645

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 19/125 (15%)

Query: 23  LQWPEISINEIAENFSKLAC----NAHTI-----CNSELR--------PLGTGLYPVISI 65
           LQ P   +  + E+   + C    NAH I     C ++L+        P+ + L PV+S+
Sbjct: 423 LQIPNHQLGTVGESLVHIICAVSSNAHGITQPSDCKTQLKLSLDNRFVPVASLLMPVLSL 482

Query: 66  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTC 123
           +NH C PN V        V+ A+Q + KG+++  +Y  +       +R + LK QY F+C
Sbjct: 483 LNHHCDPNVVRHNYNGTIVLTAIQPISKGSQLFDNYGLLYATHPKESRLQILKNQYYFSC 542

Query: 124 TCPRC 128
            C  C
Sbjct: 543 ECSSC 547


>gi|170029520|ref|XP_001842640.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863224|gb|EDS26607.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 498

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 81/187 (43%), Gaps = 15/187 (8%)

Query: 58  GLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           GLYP ++I+ HSC+PN V       + RL    A+  V +G+++  +Y  T   TM RQ 
Sbjct: 198 GLYPKLAIMAHSCVPNVVHSIHPSKDYRLTARTAID-VEEGSKLYTTYTYTLSGTMVRQA 256

Query: 114 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQC 171
           ALK    FTC C RC+     D  +        +C+  D+G         ++  + C  C
Sbjct: 257 ALKSTKYFTCQCKRCL-----DPTELGTHFSSLKCQKCDNGVIVSSKPTDEEAEWHCTHC 311

Query: 172 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 231
                K +   +A  + ++  +   L       E + +++ + K  + + HP        
Sbjct: 312 ---EYKLKGAAMAKAIQVMQAEIDELAYMEYGPERLESFERVFKKYRSVLHPLHFINTSI 368

Query: 232 REKLIKI 238
           R  LI++
Sbjct: 369 RNSLIEL 375


>gi|167525114|ref|XP_001746892.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774672|gb|EDQ88299.1| predicted protein [Monosiga brevicollis MX1]
          Length = 500

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 1/97 (1%)

Query: 53  RPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 111
           R    G +PVI+++NHSC PN AV+   G    VR +  V  GAE+ +SY++       R
Sbjct: 233 RETAWGHFPVIALVNHSCHPNCAVVSRPGGELEVRTLGTVRAGAELFVSYVDLTLPRAER 292

Query: 112 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 148
           Q  L     FTCTC RC     F    E+++    +C
Sbjct: 293 QAHLLASKEFTCTCYRCQHPDAFPHEHEASMPSCPQC 329


>gi|398257480|gb|EJN41088.1| hypothetical protein lvs_L585 [Acanthamoeba polyphaga
           lentillevirus]
          Length = 237

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 38  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 97
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 100 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 153

Query: 98  LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 129
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 154 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 185


>gi|339236741|ref|XP_003379925.1| MYND finger protein [Trichinella spiralis]
 gi|316977375|gb|EFV60485.1| MYND finger protein [Trichinella spiralis]
          Length = 476

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 114/285 (40%), Gaps = 25/285 (8%)

Query: 37  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH---VPK 93
           F +   N+H+I ++    +G  L   IS  +HSC PN  +VF G  AV+R + +      
Sbjct: 164 FCRSWINSHSIHSNTGIEVGMALDLGISKYDHSCRPNTAMVFNGFRAVLRPLVNGIDTAD 223

Query: 94  GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 153
            ++  I+Y++   S   R+K L+ ++ F C C RC      DD      L   RC +  C
Sbjct: 224 PSQCFIAYVDVGRSRYQRRKELQSKWYFWCECERCRDPS--DD-----RLTSIRCVNVDC 276

Query: 154 SGFLLRDSDD---KGFTCQQCGLVRSKEEIKKIASEVNILSK-KTLALTS-CGN--HQEV 206
           S  +    D    K   C+ CG         KI   V I ++   LAL    GN    E 
Sbjct: 277 SEPVCITEDQTNTKNVQCRGCG--------SKIPENVIIEAQCFMLALPQHFGNVKSAEE 328

Query: 207 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 266
           V   K+     ++L H  ++   +     +++   ++ +   L   +     Y+R  P  
Sbjct: 329 VRRLKIYLNTAERLLHKENIYFCRLLTAYLQLTEGVDSFANNLELQKSVYTNYRRCLPAA 388

Query: 267 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 311
              +G Q     +          +I    EA+ I  I  G + P+
Sbjct: 389 DRHVGFQLLHIVRTLITQNRRAESIPYAHEAMRIFEICFGLDHPY 433


>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 377

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 12/114 (10%)

Query: 27  EISINEIAENFSKLACNAHTI----CNSELRPL-GTGLYPVISIINHSCLPNAV------ 75
           E+S+    +  ++ ACN   +    C  E R + G  ++   S+ NH CLPNA       
Sbjct: 165 ELSLELTGKLLARDACNTFGLMAPSCAGEERKVRGYAMFAQASMFNHDCLPNACRFEYVD 224

Query: 76  LVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           +  +G   V VRA+  + +G EV +SY         RQ+ L+E+Y F CTC RC
Sbjct: 225 IDGDGNTDVIVRALHDMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARC 278


>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
 gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
          Length = 503

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)

Query: 59  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 115
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 203 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 262

Query: 116 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 171
           K+   FTC C RC+   +LG  F  ++     EG++               D  + C  C
Sbjct: 263 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 315

Query: 172 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 227
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 316 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 368

Query: 228 LMQTREKLIKILMELEDWK 246
               R+ LI++   ++ ++
Sbjct: 369 ATGLRQLLIEMYGRVQGYE 387


>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
 gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
          Length = 532

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 88/199 (44%), Gaps = 26/199 (13%)

Query: 10  LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           +L A +   +  IL    W E+ I  IA   + L  N   +     R     L+P  ++I
Sbjct: 198 VLRANLITFIKTILGMRDWSEVDILRIA---AILDTNTFELRQPRERRKVRALFPGAAMI 254

Query: 67  NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
           +H C PN    F+  + ++  A + + KG  + ISY +   ST+ R+  L++   F C C
Sbjct: 255 SHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILTISYTQPLRSTIQRRLHLRQAKCFDCAC 314

Query: 126 PRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
            RC    +LG F   Q        +CK    +G ++  +   +   + CQ C L RS +E
Sbjct: 315 DRCQDPTELGTFAGAQTCV-----KCK----AGKIISVNPLQNTANWKCQLCNLKRSAKE 365

Query: 180 I----KKIASEVNILSKKT 194
           +     K+  E+  L K T
Sbjct: 366 VLLSDAKLQQEIEALDKTT 384


>gi|341876422|gb|EGT32357.1| CBN-SET-10 protein [Caenorhabditis brenneri]
          Length = 428

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 69/145 (47%), Gaps = 19/145 (13%)

Query: 35  ENFSKLACNAHTICNS-----ELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 87
           EN   +A     +CNS     E R  P+G+G+Y  ++  NHSC   + +VF+    ++RA
Sbjct: 145 ENNRMIAMICSVLCNSFGIIDEKRGEPIGSGMYIGLAKHNHSCASTSHVVFDKNQVLLRA 204

Query: 88  VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 147
            +         ISY+     T  RQK+++  +  TC C  C    Q +D+    I    R
Sbjct: 205 RKE-EYCRNTTISYVSRMLPTAERQKSIRSVHFITCRCEMC----QNEDLD--LIGLASR 257

Query: 148 CKDDGCSGFLLRDSDDKGFTCQQCG 172
           C+ DGC+GF+   S     +C  CG
Sbjct: 258 CQTDGCNGFVKGSS-----SCGSCG 277


>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
           206040]
          Length = 543

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 1/102 (0%)

Query: 27  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 86
           E  I + AE   K+  N+    +S+L  +G  L P +++ NHSC+PNA + F GR A++ 
Sbjct: 166 EEDIRKAAEMLCKIQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLI 225

Query: 87  AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           A   +  G E+ ++Y         R++AL   Y F C C RC
Sbjct: 226 AENPIRAGDEMELAYTFYTDPLPKRKEALA-HYKFVCQCLRC 266


>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 631

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 12/107 (11%)

Query: 40  LACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVVRAVQ 89
           + CNA+ I  +         E R +G  +Y  +S++NHSC PN V   +   + VVRA++
Sbjct: 411 INCNAYEIVENVHEEVTRVWEPRNIGGAIYTTVSLVNHSCYPNVVRHSYPSGIVVVRALR 470

Query: 90  HVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 134
            + KG E+L  Y     + S M R++ L ++Y F C C  C +  +F
Sbjct: 471 FIGKGCEILDCYGPQFLSESRMARRELLWKKYRFLCECDACTQDWKF 517


>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
 gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
          Length = 311

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQK 113
           GT +Y   S+ NHSC PNA + +E G  A+ +R ++ V  G E  I+Y++    + +R+ 
Sbjct: 232 GTAVYFTASLFNHSCAPNAHVSWENGDAAITIRTLRPVRAGEEFNITYVDANERSASRRA 291

Query: 114 ALKEQYLFTCTCPRC 128
            LKE Y F C C RC
Sbjct: 292 RLKEWYGFDCACERC 306


>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
          Length = 535

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/330 (22%), Positives = 132/330 (40%), Gaps = 35/330 (10%)

Query: 16  ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNA 74
           A+L+        +  +E+ E   +   N H I +   +  +G GLY  +   +HSC PN 
Sbjct: 198 ADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNT 257

Query: 75  VLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           +   +G +A +R +     +   +    SYI+   +T  R+K LK+ + F C C RC   
Sbjct: 258 IYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTWYFECHCTRC--- 314

Query: 132 GQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
              DD   S+IL         Y C          +D + +  TC +C    S E + +  
Sbjct: 315 SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCHNKVSPEYVVEAI 371

Query: 185 SEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
             +  + K  +   +      Q +     + E+  K L     VN+     K+I++L+ L
Sbjct: 372 GAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFLC--KIIQLLIPL 426

Query: 243 EDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            +    ++ CQL + ++       +  +P  HP +G+ Y +             A+    
Sbjct: 427 IN----MSDCQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLKCKQPHRALLYCK 482

Query: 296 EAVEILRITHGTNSPFMKELILKLEEAQAE 325
           +A EI+  T G       E    L++A  E
Sbjct: 483 KAYEIISFTLGPKHLMTIETDAMLKDASNE 512


>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
          Length = 776

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/291 (22%), Positives = 123/291 (42%), Gaps = 34/291 (11%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 101
           N  TI   E+R   T +YP  +++NHSC PN +  F     +VR  + +P G EV   Y 
Sbjct: 469 NGKTITEKEVR-RATAIYPSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYG 527

Query: 102 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 160
                     R+KALK QY+FTC C  C    + D +   ++   Y C+   C G +   
Sbjct: 528 PHRARAPAAARRKALKAQYMFTCHCADCNDTERKDFV---SLFSAYLCQ--SCKGPVWAH 582

Query: 161 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMIEKLQKK 219
                  C QC      E    +    + L+ +   + S     +++ ++Y    +L+++
Sbjct: 583 CVRP--LCTQCRSALHLERAHTLLDRADDLATQAEQVVSLEERCEKMAASY----RLKQQ 636

Query: 220 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-----HPLLGLQY 274
           ++H    +L    ++L ++  +  D+ +++   +  I   +  +  F     H L  L  
Sbjct: 637 VWHRHHASLRMAADRLARLYADTGDFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSD 696

Query: 275 YTCGKL----------EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 315
               ++          EW L     A K + +A++++ + +G+  P +  L
Sbjct: 697 VMLERILNSPQHLEYREWCL----EAHKVVKKAIQLMELNYGSWEPLVSRL 743


>gi|226508960|ref|NP_001146688.1| uncharacterized protein LOC100280288 [Zea mays]
 gi|219888317|gb|ACL54533.1| unknown [Zea mays]
          Length = 318

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 50  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 109
           S++R +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G   
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229

Query: 110 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 155
              RQK+L+E Y F+C C  C +L   D      ++  + C    C G
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCHG 272


>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
          Length = 704

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 12/106 (11%)

Query: 36  NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 85
           N   + CNA+ I  +         E R +G  +Y  +S+ NHSC PN V   F     VV
Sbjct: 488 NMQAIKCNAYEIVENVRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRHSFPNGTVVV 547

Query: 86  RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCI 129
            +++++P+G+E+L  Y +        +R++ L E+Y F C C  C+
Sbjct: 548 TSLRYIPEGSEILDCYGQHFLENKRDSRRRLLAEKYYFDCQCEPCL 593


>gi|389750876|gb|EIM91949.1| hypothetical protein STEHIDRAFT_106624 [Stereum hirsutum FP-91666
           SS1]
          Length = 820

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 39/70 (55%)

Query: 59  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 118
           L+ + S+ NHSC PNA     G   V+R+ + +PK  E+ ++Y+  AG+  TR K L  +
Sbjct: 546 LHTLPSLFNHSCAPNATWRLFGDTMVIRSNRDIPKNTEITLAYVSPAGNARTRSKDLFTK 605

Query: 119 YLFTCTCPRC 128
            L  C C  C
Sbjct: 606 LLGPCNCDLC 615


>gi|157115758|ref|XP_001652682.1| hypothetical protein AaeL_AAEL007346 [Aedes aegypti]
 gi|108876750|gb|EAT40975.1| AAEL007346-PA [Aedes aegypti]
          Length = 354

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 52/135 (38%), Gaps = 33/135 (24%)

Query: 25  WPEISINEIAENFSKLACNAHTICNSELR-----------------------------PL 55
           W   S   I  +  +L CN H I  SELR                              +
Sbjct: 42  WFLYSSAVILRHIGQLVCNGHAI--SELRVCTASENNCLEADSFNIKAGFLHRCFESTRV 99

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQK 113
            TG++P IS+ NHSC PN    F      V A + V  G E+   Y       S   RQ 
Sbjct: 100 FTGIFPQISMFNHSCDPNIRNCFSKSTLTVYATRDVEAGGEIFNCYGPNFKLMSREDRQS 159

Query: 114 ALKEQYLFTCTCPRC 128
           ALK+QY F C C RC
Sbjct: 160 ALKQQYCFDCDCIRC 174


>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 485

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 9/132 (6%)

Query: 58  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 116
           GLYP+ S+ NH C+PN    F+    + VRA   +  G E+ +SY      T  R++ L 
Sbjct: 212 GLYPLGSLQNHCCIPNTRHYFDEEFQLYVRAALPISTGEEITMSYTSLFWDTTLRRQFLN 271

Query: 117 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 173
               F+C C RC    +F+     + L    C  D C G LL  +       + C +C +
Sbjct: 272 VTKNFSCMCKRCSDPTEFN-----SKLGALLCASDKCCGELLPKNPLNMRTSWICNKCTI 326

Query: 174 VRSKEEIKKIAS 185
             +  +I  I S
Sbjct: 327 SVNHRQICSIRS 338


>gi|300121578|emb|CBK22096.2| unnamed protein product [Blastocystis hominis]
          Length = 223

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 41/62 (66%)

Query: 43  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 102
           N  TI ++E+ P+G GLYP I+ INH C PNA LVF+G   ++ +++ +    E+ I Y+
Sbjct: 149 NRVTIRDAEMNPIGYGLYPYITAINHHCEPNAALVFQGNQLILTSLRPIEASEEIRIRYL 208

Query: 103 ET 104
           ++
Sbjct: 209 DS 210


>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
 gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
          Length = 311

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 8/115 (6%)

Query: 15  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 74
           +  L  L L   E++ +E  E   +     H       RP+G G++P  S  NHSC PN 
Sbjct: 199 VRTLGRLQLNGFEMTASEPEEGADEAEGGGH-------RPVGVGVFPSASYTNHSCAPNC 251

Query: 75  VLVFEGRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
              F+G    VV   + V  G E+ I Y++       R++ L++ + F C C RC
Sbjct: 252 AQRFDGHGCIVVETARDVRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306


>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
           B]
          Length = 410

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 55  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-STMTRQK 113
           LG  +YP +S  NH C PNA    +GR       + V +G E+ ISY    G +   R+K
Sbjct: 321 LGFAVYPRLSFFNHDCAPNARKERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRK 380

Query: 114 ALKEQYLFTCTCPRCI 129
            L + + F C C RC+
Sbjct: 381 ELSDGWFFDCACGRCV 396


>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 510

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 68/137 (49%), Gaps = 26/137 (18%)

Query: 10  LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP-------------LG 56
           LL+A+   LV +I      S + +     ++  NAH I +  L P              G
Sbjct: 366 LLWAE--ALVQVIF-----STDTLRTLVGQMVLNAHAINDYVLPPGEAPSSGVFDWVLKG 418

Query: 57  TGLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHVPKGAEVLISYIE-TAGST--MTR 111
            GLY ++S  NHSC+PN AV   +G   +V +  + +  G  + I+YI  TAG+T    R
Sbjct: 419 AGLYSLLSCFNHSCVPNVAVSAVDGTHEIVLKTTRSIRAGEPLAITYIPLTAGATSRAER 478

Query: 112 QKALKEQYLFTCTCPRC 128
           Q+ LK  Y FTC CPRC
Sbjct: 479 QRQLK-NYFFTCHCPRC 494


>gi|308501687|ref|XP_003113028.1| CRE-SET-3 protein [Caenorhabditis remanei]
 gi|308265329|gb|EFP09282.1| CRE-SET-3 protein [Caenorhabditis remanei]
          Length = 464

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 61/134 (45%), Gaps = 10/134 (7%)

Query: 39  KLACNAHTICN-------SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 91
           + A NAHTI +        E  P+GTGL+P+ SI NHSC PN V  F  R   +   + V
Sbjct: 196 RCAKNAHTIYSIEQIEKKDEDVPIGTGLFPISSIFNHSCTPN-VFGFFVRNTFIFVSRGV 254

Query: 92  PKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 149
             G E++ SY  T    S   R++ L     F C C  C++    +D  E +  +  R  
Sbjct: 255 KSGEELVDSYGVTYNQHSLKQREEFLANVSGFKCHCDSCVEQKSLEDYLEKSFKDVDRSA 314

Query: 150 DDGCSGFLLRDSDD 163
            +  S   + D  D
Sbjct: 315 REASSFLDISDYID 328


>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
 gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
          Length = 530

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/181 (27%), Positives = 83/181 (45%), Gaps = 22/181 (12%)

Query: 10  LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 66
           +L A +   +  +L    WPE+ I  IA   + L  N   +     R     LYP  ++I
Sbjct: 199 VLRANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMI 255

Query: 67  NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 125
           +H C+PN    F+  + +V  A + + KG  + ISY +   ST+ R+  L++   F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315

Query: 126 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 179
            RC    +LG F   Q        +CK    +G ++  +   +   + CQ C   RS ++
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKD 366

Query: 180 I 180
           +
Sbjct: 367 V 367


>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
          Length = 583

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 3/104 (2%)

Query: 27  EISINEIAENFSKLACNAHTICNSELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAV 84
           E+SI E+   +  + CN+  + +   R  PLG  L    +++NH C PN V+VF      
Sbjct: 258 ELSIGEVLMLYGIVRCNSLPV-DQTFRNAPLGIALDMGGALLNHCCDPNVVIVFNSTQVQ 316

Query: 85  VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 128
           VRA++ +  G E+L  Y + A     R   +  +Y F C C RC
Sbjct: 317 VRALRKIKDGEELLHCYRDIAYDFTFRNPRITARYQFNCQCDRC 360


>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
          Length = 545

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/330 (22%), Positives = 132/330 (40%), Gaps = 35/330 (10%)

Query: 16  ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNA 74
           A+L+        +  +E+ E   +   N H I +   +  +G GLY  +   +HSC PN 
Sbjct: 198 ADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNT 257

Query: 75  VLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 131
           +   +G +A +R +     +   +    SYI+   +T  R+K LK+ + F C C RC   
Sbjct: 258 IYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTWYFECHCTRC--- 314

Query: 132 GQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 184
              DD   S+IL         Y C          +D + +  TC +C    S E + +  
Sbjct: 315 SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCHNKVSPEYVVEAI 371

Query: 185 SEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 242
             +  + K  +   +      Q +     + E+  K L     VN+     K+I++L+ L
Sbjct: 372 GAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFLC--KIIQLLIPL 426

Query: 243 EDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 295
            +    ++ CQL + ++       +  +P  HP +G+ Y +             A+    
Sbjct: 427 IN----MSDCQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLKCKQPHRALLYCK 482

Query: 296 EAVEILRITHGTNSPFMKELILKLEEAQAE 325
           +A EI+  T G       E    L++A  E
Sbjct: 483 KAYEIISFTLGPKHLMTIETDAMLKDASNE 512


>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 333

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 32  EIAENFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLV 77
           E A ++  L CN H I    +                + T +YP  S++NHSC PN +  
Sbjct: 187 EDAFSYGILVCNGHAITKINVTADDHENKLLIEEQIRIATAIYPSASMMNHSCDPNIINS 246

Query: 78  FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRC 128
           F  +  +V+A+Q + +G E+L  Y           RQ++L+ QY FTC C  C
Sbjct: 247 FLDQTLIVKAIQDIKEGEEILHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299


>gi|46115820|ref|XP_383928.1| hypothetical protein FG03752.1 [Gibberella zeae PH-1]
          Length = 358

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 56  GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 112
           G  LYP  +  NHSCLPN     +G+  +V  A + + KG E +I+Y +  T     +RQ
Sbjct: 264 GFALYPRAAQFNHSCLPNVSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQ 323

Query: 113 KALKEQYLFTCTCPRCIK 130
           K  + Q+ F CTC RC++
Sbjct: 324 KYAQTQFQFKCTCNRCLE 341


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.135    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,800,127,029
Number of Sequences: 23463169
Number of extensions: 187367448
Number of successful extensions: 500268
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1278
Number of HSP's successfully gapped in prelim test: 1456
Number of HSP's that attempted gapping in prelim test: 495840
Number of HSP's gapped (non-prelim): 3856
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)