BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017649
         (368 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  615 bits (1587), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 283/368 (76%), Positives = 335/368 (91%), Gaps = 1/368 (0%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M+KLY+RRKLQ++ ++P+T  DNY+LVEALV+H++DIDEKQL+LYAQ+ANLVNLILQWP+
Sbjct: 114 MVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPD 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIINHSCLPN+VLVFE RLAVVRA
Sbjct: 174 INVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVLVFEERLAVVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           VQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPRC ++GQ+DDIQESAILEGYR
Sbjct: 234 VQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+  LS K   ++S  ++ E  
Sbjct: 294 CKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASELKPLSDKA-TMSSSSHYVEAT 352

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ AL YC+LTIPVYQRVYP FH
Sbjct: 353 SIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRAALTYCKLTIPVYQRVYPGFH 412

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGTN+PFMKEL+ KLEEA+AEAS
Sbjct: 413 PLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGTNTPFMKELLFKLEEARAEAS 472

Query: 361 YKLSSKDE 368
           +K SSKD+
Sbjct: 473 HKRSSKDK 480


>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 283/367 (77%), Positives = 329/367 (89%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L +RRKLQ++ +I ST TDNY LVEALVAHM DIDEKQL+LYAQ+ANLVNLIL+WP+
Sbjct: 114 MVRLLIRRKLQSEKIISSTATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWPD 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           ++I EIAENFSKLACNAHTIC+SELRPLGTGLYPV+SIINHSCLPNAVLVF+GRLAVV  
Sbjct: 174 VNIKEIAENFSKLACNAHTICDSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLAVVHT 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           VQH+PKG+EVLISYIETAGSTMTRQKALK+QY FTCTCPRCIK+G  DDIQESAILEGYR
Sbjct: 234 VQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRCIKMGLLDDIQESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CKD+ C+GFLLRDSDD+GF CQQCGL+RSKEE+KK A+E+   S K     S GN QE V
Sbjct: 294 CKDNRCNGFLLRDSDDRGFICQQCGLLRSKEEVKKSAAEIKATSDKASKSISSGNLQEAV 353

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK+IEKLQ+KL HPFS +LMQTREKL+K+LMELEDW EAL+YC+LTIPVYQRVYP+FH
Sbjct: 354 SIYKLIEKLQRKLCHPFSTSLMQTREKLLKMLMELEDWGEALSYCKLTIPVYQRVYPEFH 413

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQY++CGKLEW LGDTE AIKS+T+A+++LRITHGT +PFMKEL++KLEEA+AE S
Sbjct: 414 PLLGLQYFSCGKLEWLLGDTEAAIKSLTKALDVLRITHGTKTPFMKELMMKLEEARAEVS 473

Query: 361 YKLSSKD 367
           Y  +S D
Sbjct: 474 YLRASND 480


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score =  611 bits (1575), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 282/368 (76%), Positives = 332/368 (90%), Gaps = 4/368 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M+KLY+RRKLQ++ ++P+T  DNY+LVEALV+H++DIDEKQL+LYAQ+ANLVNLILQWP+
Sbjct: 114 MVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPD 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIINHSCLPN+VLVFE RLAVVRA
Sbjct: 174 INVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVLVFEERLAVVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           VQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPRC ++GQ+DDIQESAILEGYR
Sbjct: 234 VQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+  LS K    +S     +  
Sbjct: 294 CKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSS----SQAT 349

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ AL YC+LTIPVYQRVYP FH
Sbjct: 350 SIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRAALTYCKLTIPVYQRVYPGFH 409

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGTN+PFMKEL+ KLEEA+AEAS
Sbjct: 410 PLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGTNTPFMKELLFKLEEARAEAS 469

Query: 361 YKLSSKDE 368
           +K SSKD+
Sbjct: 470 HKRSSKDK 477


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score =  570 bits (1469), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 266/357 (74%), Positives = 309/357 (86%), Gaps = 1/357 (0%)

Query: 11  QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF 70
           QND +IPST  DNY+LVEALVAHMSDI E+QL+LYAQ+ANLVN IL+WP I+I EIAENF
Sbjct: 129 QNDKIIPSTAMDNYNLVEALVAHMSDITEEQLVLYAQMANLVNSILEWPGINIKEIAENF 188

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           SK ACNAHTIC+SELRP+GTGLYPVISIINHSCLPN+VLVFEG  A+VRAVQH+P G EV
Sbjct: 189 SKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGSSALVRAVQHIPSGTEV 248

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
           LISYIETA STMTRQKALKEQYLFTCTCPRC K+GQ+DDIQESAILEGY+CK + C GFL
Sbjct: 249 LISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDDIQESAILEGYKCKSEKCGGFL 308

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
           LR +D KGF CQ CGL+R KEEIK+I +E+ +LS+   +  S   +QE +S YK IEKLQ
Sbjct: 309 LRTTDGKGFQCQGCGLIRDKEEIKRITTEIKLLSEDA-SKPSATYYQEAISIYKRIEKLQ 367

Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
            +L+HP S+NLM TREK++K LMELE W EALAYC+LTIP YQRVYP  HPL GLQYYTC
Sbjct: 368 TELFHPLSINLMHTREKILKSLMELEHWTEALAYCKLTIPFYQRVYPAVHPLPGLQYYTC 427

Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
           GKLEW+LGDTE A+KS+T+AV+ILRITHGTN+PFMK+L++KLEEA+ EASYK SSK+
Sbjct: 428 GKLEWYLGDTEEAVKSLTKAVDILRITHGTNTPFMKDLLMKLEEARTEASYKFSSKE 484


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score =  566 bits (1458), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 266/368 (72%), Positives = 313/368 (85%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M+KL++RRKLQ++ VIP   TDNY LVE L+ HMS IDEKQL+LYAQ+  LVN ILQWP 
Sbjct: 114 MIKLFIRRKLQSEKVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLVNFILQWPG 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           +++ EIA NFSKLACNAHTIC+SELRPLGTGLYPVISIINHSCLPNAVLVFEGR AVVRA
Sbjct: 174 MNVKEIAVNFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLPNAVLVFEGRTAVVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           VQH+P GAEV ISYIETAGSTMTRQK LKE YLFTCTC RC+K+ Q D+I+ESAILEGYR
Sbjct: 234 VQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRCVKVAQEDEIKESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C++D C GFLLR+SDD GFTCQQCGLVRSKE+IK IAS++  +S +     S  ++ E +
Sbjct: 294 CRNDQCDGFLLRNSDDTGFTCQQCGLVRSKEDIKNIASKIKSISDEASTSLSSQSYAEAL 353

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
             Y+ +EKLQ+ L HP+S++LMQTREKL+KI MELE+W +AL YC+LTI VYQ++YP  H
Sbjct: 354 FMYEKVEKLQRILCHPYSISLMQTREKLLKISMELENWTKALTYCKLTISVYQKLYPGIH 413

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQ+YTCGKLEW LG TE+AIKS T+A +ILRITHGTNS FMKEL+LKLEEA+AEAS
Sbjct: 414 PLLGLQFYTCGKLEWLLGHTEDAIKSYTKAFDILRITHGTNSSFMKELLLKLEEARAEAS 473

Query: 361 YKLSSKDE 368
           YKLSS D+
Sbjct: 474 YKLSSTDD 481


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 266/368 (72%), Positives = 312/368 (84%), Gaps = 23/368 (6%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++LYLRRKLQN+                      D+DEKQL+LYAQ+ANLV+ ILQWPE
Sbjct: 114 MVRLYLRRKLQNE---------------------MDLDEKQLVLYAQMANLVHFILQWPE 152

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL FEG+ +VVRA
Sbjct: 153 INLKEIAENFSKLACNAHTICDCELRPLGTGLYPVVSIINHSCMPNAVLTFEGKSSVVRA 212

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           V+H+P+GAEV I+YI+TAGSTMTRQKALKEQY FTCTCPRCIK+  +DDIQESAILEGYR
Sbjct: 213 VEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRCIKV--YDDIQESAILEGYR 270

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CKDD C+GFLLRDS+DKGF CQ CGL RSKEE+K+I  E+  +S K L  TS GNH+EV+
Sbjct: 271 CKDDRCNGFLLRDSEDKGFICQTCGLRRSKEEVKRIVCEITAISDKKLKSTSPGNHEEVI 330

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YKMIEKLQ +L HPFS++LM+T+E+L+KILMEL DW+EALAYC+LTI  YQRVYP+ H
Sbjct: 331 SLYKMIEKLQMELCHPFSISLMRTQEELLKILMELGDWREALAYCRLTITGYQRVYPEPH 390

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQYYTCGK+EW LG TE+AIKS+T AV+ILRITHGTNSPFMKEL++KL+EA AEAS
Sbjct: 391 PLLGLQYYTCGKIEWLLGYTEDAIKSLTRAVDILRITHGTNSPFMKELMMKLDEAHAEAS 450

Query: 361 YKLSSKDE 368
           Y LSSKDE
Sbjct: 451 YNLSSKDE 458


>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR1-like [Glycine max]
          Length = 401

 Score =  551 bits (1421), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 258/357 (72%), Positives = 305/357 (85%), Gaps = 3/357 (0%)

Query: 11  QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF 70
           QND ++PST  DNY+LVEALVA+MSDI E+QL+LYA++ANLVN ILQWPEI+I EIAENF
Sbjct: 48  QNDKIVPSTAMDNYNLVEALVAYMSDITEEQLVLYAKMANLVNSILQWPEINIKEIAENF 107

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           SK ACNAHTIC+SELRP+GTGLYPVISIINHSCLPN+VLVFEGR A+V  VQH+P G EV
Sbjct: 108 SKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEV 167

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
            ISYIETAGSTMT+Q ALK    FTCTCPRC K+G++DDIQESAILEGY+CK + C GFL
Sbjct: 168 PISYIETAGSTMTQQNALKSSTXFTCTCPRCSKVGEYDDIQESAILEGYKCKSEKCGGFL 227

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
           L     KGF CQ CGLVR KEEIK+I +E+ +LS++ +  +S  N+QE +S YK IEKLQ
Sbjct: 228 LY---GKGFQCQGCGLVRDKEEIKRITTEIKLLSEEKIISSSTCNYQEAISIYKWIEKLQ 284

Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
            +LYHP SVNLMQ REK++K LMELE W EALAYC+LTIP YQRVYP  HPLLGLQYYTC
Sbjct: 285 TELYHPLSVNLMQNREKILKSLMELEHWAEALAYCKLTIPFYQRVYPAVHPLLGLQYYTC 344

Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
           GKLEW+LGDT+ A+KS+ +AV+ILRITHGTN+PFMK+L++KLEEA+AEASY+LS K+
Sbjct: 345 GKLEWYLGDTDEAVKSLIKAVDILRITHGTNTPFMKDLLMKLEEARAEASYRLSPKE 401


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score =  543 bits (1400), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 262/398 (65%), Positives = 313/398 (78%), Gaps = 38/398 (9%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--------------------------- 33
           MLKLYLRRKLQ+  +IPST  DNY LVEALVA                            
Sbjct: 110 MLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVI 169

Query: 34  ----------MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
                     MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA  FSK ACNAHT+C+S
Sbjct: 170 VLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDS 229

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           ELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A VRA+QH+PKG EVLISYIETAGST+T
Sbjct: 230 ELRPLGTGLYPVVSIINHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVT 289

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           RQKAL+EQYLF C CP C K+GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+
Sbjct: 290 RQKALREQYLFQCVCPLCSKVGQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQE 349

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           CGLVR KEEIK+IA+E+  L ++    +S  +H E +S +KMIEKLQ KLYHPFS+NLMQ
Sbjct: 350 CGLVRDKEEIKQIATEIKFLLEEASKPSSNDSH-EAISIHKMIEKLQTKLYHPFSINLMQ 408

Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
           TRE ++K LM+LE W+EALAYC+LTIP+YQRVYP  HPLLGLQYYTCGKLEW+LGDTE A
Sbjct: 409 TRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEA 468

Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
           IKS+T+AV+ILRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 469 IKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEASF 506


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 241/364 (66%), Positives = 307/364 (84%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M+KLY++R LQN+ V+P T+TDNYSLVEALV+HMS+IDEKQ+LLYAQ+ANLVNLILQ+P 
Sbjct: 114 MVKLYIKRNLQNEKVLPITSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPS 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIAENFSK +CNAH+IC+SELRP G GL+P++SIINHSC PNAVLVFE ++AVVRA
Sbjct: 174 VDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           + ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C RC  +G+  DI+ESAILEGYR
Sbjct: 234 MDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNVGKPHDIEESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++  +S+K  A  S  N Q  +
Sbjct: 294 CSNEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDLKTVSEKAHASPSAENKQAAI 353

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
             YK IEKLQ KLYH FS+ LM+TREKL+K+LM+LE W+EAL YC+L +PVYQRVYP  H
Sbjct: 354 ELYKTIEKLQVKLYHSFSITLMRTREKLLKMLMDLEIWREALNYCRLIVPVYQRVYPATH 413

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HGT++PFMKEL  KLEEA+AEAS
Sbjct: 414 PLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGTSTPFMKELSAKLEEARAEAS 473

Query: 361 YKLS 364
           YKL+
Sbjct: 474 YKLA 477


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score =  513 bits (1320), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 236/362 (65%), Positives = 302/362 (83%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++LY++R LQN+ V+P TTTDNYSLVEALV+HMS+IDEKQ+LLYAQ+ANLVNLILQ+P 
Sbjct: 114 MVRLYIKRNLQNEKVLPITTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPS 173

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIAENFSK +CNAH+IC+SELRP G GL+P++SIINHSC PNAVLVFE ++AVVRA
Sbjct: 174 VDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           + ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C RC   G+  DI+ESAILEGYR
Sbjct: 234 MDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNFGKPHDIEESAILEGYR 293

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++  +S+K     S  + Q  +
Sbjct: 294 CANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDLKTVSEKAPTSPSAEDKQAAI 353

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
             YK IEKLQ KLYH FS+ LM+TREKL+K+LM++E W+EAL YC+L +PVYQRVYP  H
Sbjct: 354 ELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLMDVEIWREALNYCRLIVPVYQRVYPATH 413

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HG ++PFMKEL  KLEEA+AEAS
Sbjct: 414 PLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGISTPFMKELSAKLEEARAEAS 473

Query: 361 YK 362
           YK
Sbjct: 474 YK 475


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 230/368 (62%), Positives = 290/368 (78%), Gaps = 6/368 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPRC+K     D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V +LS K  +L S GN+ EV 
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP FH
Sbjct: 352 SMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPFH 411

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEAS
Sbjct: 412 PMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEAS 471

Query: 361 YKLSSKDE 368
           ++LS+ DE
Sbjct: 472 FRLSAGDE 479


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 229/369 (62%), Positives = 290/369 (78%), Gaps = 7/369 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPRC+K     D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG-NHQEV 239
           C D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V +LS K  +L S G ++ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEV 351

Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
            S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP F
Sbjct: 352 GSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPF 411

Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 359
           HP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEA
Sbjct: 412 HPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEA 471

Query: 360 SYKLSSKDE 368
           S++LS+ DE
Sbjct: 472 SFRLSAGDE 480


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score =  476 bits (1225), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 228/369 (61%), Positives = 289/369 (78%), Gaps = 7/369 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELR LGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRSLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPRC+K     D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG-NHQEV 239
           C D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V +LS K  +L S G ++ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEV 351

Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
            S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP F
Sbjct: 352 GSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPF 411

Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 359
           HP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEA
Sbjct: 412 HPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEA 471

Query: 360 SYKLSSKDE 368
           S++LS+ DE
Sbjct: 472 SFRLSAGDE 480


>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
          Length = 477

 Score =  475 bits (1222), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 235/398 (59%), Positives = 283/398 (71%), Gaps = 72/398 (18%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--------------------------- 33
           MLKLYLRRKLQ+  +IPST  DNY LVEALVA                            
Sbjct: 110 MLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVI 169

Query: 34  ----------MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
                     MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA  FSK ACNAHT+C+S
Sbjct: 170 VLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDS 229

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           ELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A+VRA+QH+PKG E              
Sbjct: 230 ELRPLGTGLYPVVSIINHSCLPNSVLVFDGREALVRALQHIPKGTE-------------- 275

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
                                GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+
Sbjct: 276 ---------------------GQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQE 314

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           CGLVR KEEIK+IA+E+  L ++    +S GN  E +S +KMIEKLQ KLYHPFS+NLMQ
Sbjct: 315 CGLVRDKEEIKQIATEIKFLLEEASKPSSNGNSHEAISIHKMIEKLQTKLYHPFSINLMQ 374

Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
           TRE ++K LM+LE W+EALAYC+LTIP+YQRVYP  HPLLGLQYYTCGKLEW+LGDTE A
Sbjct: 375 TRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEA 434

Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
           IKS+T+AV+ILRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 435 IKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEASF 472


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 224/368 (60%), Positives = 283/368 (76%), Gaps = 6/368 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQN+ VIPST TDNY LV+AL +H+S++D+KQL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLILRRKLQNEKVIPSTGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLILPAIE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + E A  FSK +CNAHTIC+ ELRP+GTGL+P IS INHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTINHSCVPNAVLLFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q +    EV ISYIETA +T+ R   LK  Y FTCTCPRCIK       +E  +LEGYR
Sbjct: 238 LQPLSNNTEVSISYIETAATTLKRHNDLK-HYFFTCTCPRCIK-----GSEEDPLLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CKD  C GFLL DS  K +TCQ+C + R  EE+KK++SE+ +LS K  +  S GN+ E  
Sbjct: 292 CKDQNCDGFLLPDSGKKAYTCQKCSMCRDGEEVKKLSSEILLLSDKASSFVSSGNNNEAG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK+IE+L++KLYH FS+ L+ TRE L+K+ MEL+DW+ AL YC+ TIPVY+RVYP FH
Sbjct: 352 SVYKIIEQLERKLYHSFSITLLHTRETLLKLYMELQDWRSALMYCRSTIPVYERVYPPFH 411

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P++GLQ+YTCGKLEW L  TE+A+KS+T A ++LRITHGTNS FMKEL  KLEEA+AE S
Sbjct: 412 PMVGLQFYTCGKLEWLLEYTEDALKSLTRAADVLRITHGTNSQFMKELFGKLEEARAEVS 471

Query: 361 YKLSSKDE 368
           ++LS  +E
Sbjct: 472 FRLSPGNE 479


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 220/366 (60%), Positives = 280/366 (76%), Gaps = 6/366 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L L+RKLQN+ VIPS++ DNY+LV+AL +H+  +DE  L+LYAQ+ANLV+LIL   E
Sbjct: 118 MVRLVLKRKLQNEKVIPSSSIDNYNLVDALESHIWKVDENHLVLYAQMANLVSLILPLIE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIA  FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA  T  R   LK QY FTC+CPRC+K       +E A+LEG+R
Sbjct: 238 LQPIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTCSCPRCVK-----GSEEDALLEGFR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CK+  C GFLL DS  K +TCQ+C + R +EEI+K+ SE+  LS K  +  S GN  E  
Sbjct: 292 CKNQTCDGFLLPDSGKKAYTCQKCSVSRDEEEIQKMRSEILQLSDKASSFLSSGNKAEAG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK IE+L++ LYH FS  L+ T E L+KI MEL+DW+ ALAYC+LT+PVY+RVYP FH
Sbjct: 352 SVYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQDWRTALAYCRLTVPVYERVYPPFH 411

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++THGT S FMKEL  KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLECTEDALKSLTRATDILKVTHGTKSQFMKELFGKLEEARAEVS 471

Query: 361 YKLSSK 366
           +++SS+
Sbjct: 472 FRISSR 477


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  460 bits (1183), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 223/366 (60%), Positives = 280/366 (76%), Gaps = 6/366 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   FD   E A+LEG+R
Sbjct: 238 LQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CK+  C GFLL +S  K +TCQ+CG  R  EEIK + SE+  LS K  +  S GN  E  
Sbjct: 292 CKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++THG  S FMKEL  KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLEFTEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVS 471

Query: 361 YKLSSK 366
           ++LSS+
Sbjct: 472 FRLSSR 477


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  459 bits (1181), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 223/366 (60%), Positives = 279/366 (76%), Gaps = 6/366 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L L+RKLQN+ VIPS++ DNY+LV+AL +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDALESHISKVDENQLVLYAQMANLVSLILPFIE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   FD   E  +LEG+R
Sbjct: 238 LQPIDKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDPLLEGFR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CK+  C GFLL  S  K +TCQ+CG  R  EEIK + SE+  LS K  +  S GN  E  
Sbjct: 292 CKNQTCDGFLLPSSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++THG  S FMKEL  KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLEFTEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVS 471

Query: 361 YKLSSK 366
           ++LSS+
Sbjct: 472 FRLSSR 477


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 196/314 (62%), Positives = 247/314 (78%), Gaps = 6/314 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPRC+K     D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V +LS K  +L S GN+ EV 
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP FH
Sbjct: 352 SMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPFH 411

Query: 301 PLLGLQYYTCGKLE 314
           P++GLQ+YTCGKLE
Sbjct: 412 PMIGLQFYTCGKLE 425


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 193/316 (61%), Positives = 241/316 (76%), Gaps = 6/316 (1%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   FD   E A+LEG+R
Sbjct: 238 LQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           CK+  C GFLL +S  K +TCQ+CG  R  EEIK + SE+  LS K  +  S GN  E  
Sbjct: 292 CKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411

Query: 301 PLLGLQYYTCGKLEWF 316
           P++GLQ+YTCGKLEW+
Sbjct: 412 PMIGLQFYTCGKLEWY 427


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 165/290 (56%), Positives = 216/290 (74%), Gaps = 1/290 (0%)

Query: 73  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
            +CNAHTIC+SELRP+GTGLYPVISIINHSC PNAVL+FEGR AVVRAV+ + +G+E+ +
Sbjct: 142 FSCNAHTICDSELRPMGTGLYPVISIINHSCFPNAVLLFEGRQAVVRAVEPIREGSELTV 201

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
           SYIE A ST +R+K+LKEQY F C C RC+K+   D + E AILEG+RC  D C GFLL 
Sbjct: 202 SYIEIAASTASRKKSLKEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDHCEGFLLH 261

Query: 193 DSDD-KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
           D DD +   CQ CG  R++EE KK A +V+ L K+   L S GN+ E  S Y+ I++LQ 
Sbjct: 262 DPDDAQSLVCQLCGCGRNEEETKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQT 321

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
           +L+HP+SV L++T + L+KI MEL DWK+AL YC+LTIP Y+R YP  HP++GLQYY CG
Sbjct: 322 QLWHPYSVILLRTGDTLLKICMELYDWKQALKYCRLTIPAYERAYPTCHPMMGLQYYACG 381

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
           KLEWFL +T  A+    +A +IL +THG NS F+ +L  +++EA AEA++
Sbjct: 382 KLEWFLENTLEALNFFEKAAKILTVTHGRNSEFLTQLFDRIQEAHAEAAH 431


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score =  354 bits (908), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 162/364 (44%), Positives = 239/364 (65%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           +++  ++R+LQ + V      DNY + E L  HMS+  E +L++YAQ+A++V  ++   E
Sbjct: 121 VMRFLIKRRLQAERVQLKAAVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMAPDE 180

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           +++ E+ +   + ACNAHTIC+ E+RPLGTGLYPVISI+NHSC+PNAVL F+G  A +RA
Sbjct: 181 VNVKEVTQMICRFACNAHTICDEEVRPLGTGLYPVISIVNHSCVPNAVLHFDGNRAALRA 240

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           ++   +G E+ ISY+E A ST TR+KAL++QY F C C RC +L   +  +E A LEGY 
Sbjct: 241 LEDTQEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRCSRLVTREGSREDAFLEGYG 300

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C +  C+G L+ D       C+ CGL R  ++ K  A EV +   +   L + G  +   
Sbjct: 301 CVNSDCNGPLIEDPGSDKVICEVCGLKREVQQTKSAAKEVELDVLEASNLYAAGKLESAR 360

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
             Y  +E  Q+KL+HP+SV L++T + L+KI M++EDW  AL +CQ TIP Y+R YP F 
Sbjct: 361 RLYSEVEAKQRKLWHPYSVPLLRTHDALLKICMDMEDWASALEFCQSTIPAYERAYPPFS 420

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           PLLGLQY+T GKL+W LGD+  A+ ++ +A  ++++THG+ S  +  L   L+EAQAE +
Sbjct: 421 PLLGLQYFTLGKLQWLLGDSAKAVNTLRKAYTVIQVTHGSKSELLHGLTSTLQEAQAEVA 480

Query: 361 YKLS 364
           YK S
Sbjct: 481 YKRS 484


>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
 gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
          Length = 453

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 146/359 (40%), Positives = 228/359 (63%), Gaps = 7/359 (1%)

Query: 6   LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL--QWPEISI 63
           ++R+LQ   V+P T  DNY +V AL  H S+  +++L++YAQ+A L+  IL  ++ E  +
Sbjct: 87  VKRELQATGVLPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAE-DV 145

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
            EI ++  +++CN HTIC+ ELRP+G GL+PV+S+INHSC  N++L+F+G+ AVVRA+  
Sbjct: 146 KEITKDICRISCNGHTICDDELRPVGIGLFPVVSVINHSCSSNSLLLFDGKHAVVRALGT 205

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCK 182
           + +G EV +SYIE   ST +R++AL +QY F C CPRC  +  Q   + +  +LE   C 
Sbjct: 206 ISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTDVSPQAHGLYKDDVLEAVACL 265

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
           D  C  F+     +  + C  CG  R   E+ K+++E   + +K   L + G+       
Sbjct: 266 DPACESFM--RLSNGSWRCMSCGSSREPNEVNKLSTEAEGMIEKANRLQAAGDLHGARMA 323

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP-QFHP 301
           ++  EKLQ +L++P SV LM+TR+ L+++ + LEDW  AL  C+LT+P Y+  YP   HP
Sbjct: 324 FQQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLEDWDLALHVCRLTLPAYETAYPGSKHP 383

Query: 302 LLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           LLGLQYYT GKLE   G    A+++  +A +IL +THG+   F+++L  +L++A+  AS
Sbjct: 384 LLGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILSVTHGSRGEFVRKLRNELDQAKMVAS 442


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 169/357 (47%), Positives = 216/357 (60%), Gaps = 55/357 (15%)

Query: 15  VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 74
            + + T D   ++   +  M  +  K+ L   ++ANLV+LIL + E+ + EIA+ FSK A
Sbjct: 99  AMAALTEDRKKMLTPTIRLMVRLALKRKLQNEKMANLVSLILPFIELDLKEIAQTFSKFA 158

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISY
Sbjct: 159 CNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISY 218

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           IETA  T  R   LK QY FTCTCPRC+K   FD   E A+LEG+RCK+  C GFLL +S
Sbjct: 219 IETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNS 272

Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
           D                                          E  S YK+IE+L++ LY
Sbjct: 273 DK----------------------------------------AEAGSIYKIIEQLERNLY 292

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR-----VYPQFHPLLGLQYYT 309
           H FS  L+ T E L+KI +EL+DW  AL YC+LTIPVY+        P   P    + Y 
Sbjct: 293 HAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYESKAWRLKIPFPRPRTVREAYG 352

Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 366
            G LE+    TE+A+KS+T A +IL++THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 353 TGLLEF----TEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 405


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 146/358 (40%), Positives = 227/358 (63%), Gaps = 7/358 (1%)

Query: 6   LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL--QWPEISI 63
           ++R+LQ   V+P T  DNY +V AL  H S+  +++L++YAQ+A L+  IL  ++ E  +
Sbjct: 120 VKRELQATGVLPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAE-DV 178

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
            EI ++  +++CN HTIC+ ELRP+G GL+PV+SIINHSC  N++L+F+G+ AVVRA+  
Sbjct: 179 KEITKDICRISCNGHTICDDELRPVGIGLFPVVSIINHSCSSNSLLLFDGKHAVVRALGT 238

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           + +G EV +SYIE   ST +R++AL +QY F C CPRC    +   + +  +LE   C D
Sbjct: 239 ISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTDDSEA-GLYKDDVLEAVACLD 297

Query: 184 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
             C  F+     +  + C  CG  R   E+ K+++E   + +K   L + G+       +
Sbjct: 298 PACESFM--RLSNGSWRCMSCGSSREPNEVNKLSTEAEGMIEKANRLQAAGDLHGARMAF 355

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP-QFHPL 302
           +  EKLQ +L++P SV LM+TR+ L+++ + LEDW  AL  C+LT+P Y+  YP   HPL
Sbjct: 356 QQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLEDWDLALHVCRLTLPAYETAYPGSKHPL 415

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           LGLQYYT GKLE   G    A+++  +A +IL +THG+   F+++L  +L++A+  AS
Sbjct: 416 LGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILSVTHGSRGEFVRKLRNELDQAKMVAS 473


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 143/235 (60%), Positives = 180/235 (76%), Gaps = 6/235 (2%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPRC+K     D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
           C D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V +LS K  +L S G+
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGS 346


>gi|222625640|gb|EEE59772.1| hypothetical protein OsJ_12271 [Oryza sativa Japonica Group]
          Length = 384

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 103/200 (51%), Positives = 128/200 (64%), Gaps = 39/200 (19%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 130 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 189

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 190 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 249

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +Q + K  E                                       D +E A+LEGYR
Sbjct: 250 LQPISKNEE---------------------------------------DSEEDALLEGYR 270

Query: 181 CKDDGCSGFLLRDSDDKGFT 200
           C D  C GFLL ++D  G T
Sbjct: 271 CNDQKCDGFLLPNADIYGVT 290



 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 53/80 (66%), Positives = 67/80 (83%)

Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            PVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL
Sbjct: 304 FPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKEL 363

Query: 349 ILKLEEAQAEASYKLSSKDE 368
           + KLEE +AEAS++LS+ DE
Sbjct: 364 LGKLEEVRAEASFRLSAGDE 383


>gi|326496881|dbj|BAJ98467.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 153

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 86/140 (61%), Positives = 111/140 (79%)

Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 288
            L S  ++ E  S YK+IE+L++K YH FS+ L+ TRE L+KI MEL+DW+ AL YC+LT
Sbjct: 11  TLISDTDNSEAGSVYKIIEQLEQKHYHSFSITLLHTRETLLKIYMELQDWQTALMYCRLT 70

Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           IPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+K +T A ++LRITHGT S FMKEL
Sbjct: 71  IPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKLLTRAADVLRITHGTQSQFMKEL 130

Query: 349 ILKLEEAQAEASYKLSSKDE 368
             KLEEA+AE S++LSS +E
Sbjct: 131 FGKLEEARAEVSFRLSSGEE 150


>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 282

 Score =  184 bits (466), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 86/131 (65%), Positives = 108/131 (82%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237

Query: 121 VQHVPKGAEVL 131
           +Q + K  EV+
Sbjct: 238 LQPISKNEEVM 248


>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
 gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
          Length = 463

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 182/346 (52%), Gaps = 21/346 (6%)

Query: 15  VIPSTTTDNYSL----VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS-INEIAEN 69
           +I  T TD+  +    ++ L +++ ++ E    ++AQ+A ++ + +    +    EI E 
Sbjct: 107 IINKTKTDSPGVPGNSIDELQSNLREMPENVKEMFAQLAVVLRMYVGKDVMDDAREIFEL 166

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F ++ CN  +IC+ E++ +G G+YP +S+ NHSC PN V VF G    VRA+Q++  G E
Sbjct: 167 FGRMTCNTFSICDPEMQYIGIGIYPKMSLFNHSCEPNCVAVFNGLRMEVRAIQNIQPGEE 226

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
           +LISY+E    +  R++ L +QY FTC CPRC       D  +  ++   +C +  C   
Sbjct: 227 LLISYVEMLAMSSVRKQQLLQQYYFTCKCPRC------QDQTKDGMMMAVKCGNINCKKV 280

Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT----LALTSCGNHQEVVSTYKM 245
           +++  D    TC+ C      E+  K   EVN +++ +     A+TS    QE     ++
Sbjct: 281 IIQ-VDGAYETCKVC--CHDNEKDPKFWKEVNKVTQFSEEMLAAITSADQRQEPKEGLRL 337

Query: 246 IEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            E+L   Q+++ H   + +++  +K +   + L  W  AL     TI  Y+  +P +HP 
Sbjct: 338 AERLLERQQQVLHNNHLFVLKCLDKALDQAVVLRKWNRALRCALQTIEPYKVHFPAYHPS 397

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           LG+QY   GKL  +L     A++++  A  IL +THG + P  KEL
Sbjct: 398 LGIQYMRIGKLLLYLEKRLAALEALQMAEHILNVTHGKDHPINKEL 443


>gi|414864678|tpg|DAA43235.1| TPA: hypothetical protein ZEAMMB73_073593 [Zea mays]
          Length = 342

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 104/157 (66%), Gaps = 6/157 (3%)

Query: 102 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           SC+PNAVL+F+GR   VRA+Q + K  EV ISYIETA  T  R   LK QY FT TCPRC
Sbjct: 182 SCVPNAVLIFDGRTTYVRALQPIDKDEEVSISYIETAAVTKKRNNYLK-QYFFTYTCPRC 240

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
           +K   FDD    A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEI K+ SE+ 
Sbjct: 241 VK--GFDD---DALLEGFRCKNQTCDGFLLPNSGKKAYTCQKCGASRDVEEINKMRSEIL 295

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 258
            LS K  +  S GN  E  S YK+IE+L++ LYH FS
Sbjct: 296 QLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFS 332


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 182/361 (50%), Gaps = 52/361 (14%)

Query: 18  STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEI----------SINE 65
           S++ + Y+L E   +H++ + E++    +Q+A+++ L LQ   P++          S  E
Sbjct: 118 SSSEELYTLEEH-ESHLTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEMGSALPPSCRE 176

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
                +K+ CN  TI + EL+ +G GLYP +S++NH C PN V+VFEG    +RAV+ + 
Sbjct: 177 PLSLIAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDID 236

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
              E+ ISYIET   T  RQ+ L+EQY FTC C RC                  R KD  
Sbjct: 237 PEDELTISYIETLSLTEDRQRQLEEQYHFTCHCQRC----------------DSRDKD-- 278

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
             G +L   + K    ++  L R  E +K  ++   +L   +  L+S G      + YK+
Sbjct: 279 --GLMLSGEEGKWRLLKE-ALPR-LEGLKAESNWETLLESCSHLLSSVGGEVPEENLYKL 334

Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
                            +  +  +   + L  W+EA++Y   T+P Y++ YP  HP+ G+
Sbjct: 335 -----------------RITDMALDASVHLGRWEEAMSYGVTTLPAYRQYYPDPHPVHGI 377

Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSS 365
           Q    GKL+ +L + E+A+ +  +A EI++ THG + P   EL +K++E  +E  +  SS
Sbjct: 378 QLMRVGKLQHYLENIEDALDTFRQAYEIVKFTHGEDHPLTTELTMKMKECHSEMDHHSSS 437

Query: 366 K 366
           +
Sbjct: 438 R 438


>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
          Length = 441

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 93/336 (27%), Positives = 169/336 (50%), Gaps = 33/336 (9%)

Query: 30  LVAHMSDIDEKQ------LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
           L++H+ ++ E Q      ++L   + +    ++  P  S+ E+   F ++  N+ +IC+ 
Sbjct: 113 LISHVDEVQEDQIRLQQFMVLMTTLKSFSENVMSMP--SVEELFVIFGRVCVNSFSICDP 170

Query: 84  ELRPLGTGLY----PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
           E+ P+G G+Y       S+++HSC PNAV VFEG    +R V+ V    ++ ISYI+T  
Sbjct: 171 EMNPIGVGVYIRQIKPCSVLDHSCRPNAVAVFEGTTLRIRCVEPVDSEQDLRISYIDTLD 230

Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
            T TR++ L++QY F C C      G+  D ++  I   + C    C G + +   D  F
Sbjct: 231 DTTTRRRNLQQQYYFNCLC------GECKDSEKDLIKFSFNCTSVECKGHVTQVPADDRF 284

Query: 200 TCQQCG-------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
           +C  CG       L ++ E+ +KI   +N L K+         H+ +           + 
Sbjct: 285 SCDICGTTVDDVNLNQAAEKAQKIIKHLNELKKQR-------EHESIRMKGLSCVSEMRA 337

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           + HP++V+ ++  +  +   +E   W +AL   +LT+  Y+  Y + HP +G+  +  GK
Sbjct: 338 ILHPYNVHFIKLCDLSLDACIETSQWDDALELGRLTVVAYRFYYGELHPSVGILLFKIGK 397

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +  +L D E A++ + +A  +LR+THG +    K+L
Sbjct: 398 ILSYLADRE-ALQFLKQAATVLRVTHGDDHSLYKDL 432


>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
           latipes]
          Length = 495

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 159/318 (50%), Gaps = 39/318 (12%)

Query: 41  QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           +L +  +I +L   +     +S  E     +K+ CN  TI + EL+ +G GLYP +S++N
Sbjct: 209 ELYMKTEIPDLKQRMTSELPLSCREALSIIAKVTCNCFTISDGELQEIGVGLYPSLSLLN 268

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
           H C PN V+VF G    +RAVQ +  G E+ ISYIET   T  RQK L+EQY FTC C R
Sbjct: 269 HDCRPNCVMVFNGTELQLRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQR 328

Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
           C         Q+  +L G                       ++    + KE + ++   +
Sbjct: 329 C-----SSKEQDGVMLSG-----------------------REAAWRQLKEALPRLEG-L 359

Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 280
            + S     L SC N    + T    E   + LY       ++  +  +   + L  W+E
Sbjct: 360 QLESNWLELLESCSN----LLTSAGEEVPDENLYK------LKITDMALDASVHLGRWEE 409

Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 340
           AL++   T+P Y+  Y   HP+ G+Q    GKL+  LG  E+A++S  EA +IL++THG 
Sbjct: 410 ALSFGLKTLPSYRLYYADPHPVHGVQLMRVGKLQHHLGCIEDALESFKEAFKILKLTHGE 469

Query: 341 NSPFMKELILKLEEAQAE 358
           + P + +L++K+EE ++E
Sbjct: 470 DHPLIVDLMMKMEECRSE 487


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
           harrisii]
          Length = 428

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 152/293 (51%), Gaps = 39/293 (13%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E+F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG    +RA++++
Sbjct: 170 DIFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAIRNI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
           P G E+ I Y++    T  RQK LKEQY F C CP C    +  D     +L G      
Sbjct: 230 PLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDAD-----MLAGEEQAWK 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
              G L++  D           ++S+E+ +++ +    L      + +CGN     + Y+
Sbjct: 285 EIQGSLIKIED-----------LQSQEKWEQVLAMCQTL------INNCGNRLPDRNIYQ 327

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                            ++  E  +   + L  W++AL Y   T+  Y+  YP FHP+ G
Sbjct: 328 -----------------LKMLECAMDACINLSLWEDALLYGSRTLEPYRLYYPGFHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
           +Q    GKL+   G    A++++ +A E++++THG +   +++L+L L + +A
Sbjct: 371 VQVMKVGKLQQHQGLYPQALETLKQAFELIKVTHGRDHSLIEDLMLLLGDCEA 423


>gi|413955565|gb|AFW88214.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 529

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 288 TIPVYQRVY 296
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|413955566|gb|AFW88215.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 484

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 288 TIPVYQRVY 296
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 434

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 169/337 (50%), Gaps = 49/337 (14%)

Query: 32  AHMSDIDEKQLLLYAQIANLVNLIL--QWPEI--------SINEIAENFSKLACNAHTIC 81
           +H++ + E++    +Q+A ++ L L  + P++        S  +     +K+ CN  TI 
Sbjct: 130 SHLTSLSEQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSCRDALSLIAKVTCNCFTIS 189

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
           + EL+ +G GLYP +S++NH C PN V+VFEG   ++RAV+ +  G E+ ISYIET    
Sbjct: 190 DGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLLLRAVRGLSPGEELTISYIETLSLN 249

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
             RQ+ L++QY FTC C +C      D +  S     +R         +L+++       
Sbjct: 250 EDRQQRLEDQYCFTCHC-QCCNSPDNDKLMLSGEESSWR---------VLKEA------- 292

Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 261
                + + EE+K  ++   ++      L++ G      + YK                 
Sbjct: 293 -----LPTLEELKAKSNWQTLVESCHRLLSTEGKRVPEENLYK----------------- 330

Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
           ++  +  +   ++L  W EA+ Y Q T+PVY   YP  HP+ G+Q    GKL+ +L   E
Sbjct: 331 LRVTDMALDASIQLGLWTEAVEYGQKTLPVYHLYYPDPHPVHGVQLVRVGKLQHYLAHIE 390

Query: 322 NAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
            A+ +  +A  IL++THG ++    +L++K+EE +AE
Sbjct: 391 EALDTFKQAYRILKVTHGNDNSITTDLLMKMEECRAE 427


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 149/283 (52%), Gaps = 17/283 (6%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           S++ CN  TICNSE++ +G G+YP +S++NHSC PN    F G+   +R +++   G E+
Sbjct: 172 SRVTCNTFTICNSEMQTVGIGVYPGLSLVNHSCSPNCSATFRGKQMQLRIIENTKIGDEL 231

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSG 188
           LISYI+      +RQ  L+ QY F C C RCI     D  ++S   +++  RC    C  
Sbjct: 232 LISYIDPMQVLSSRQNQLQSQYCFKCICERCI-----DTTKDSCNNLMDSVRCPKKICKA 286

Query: 189 FLLRDSDDKGFTCQQCGLVRSKE---EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
               DS      C +CG +  +    EI+   +++N    KT++L   G  Q  +   K 
Sbjct: 287 ASSLDSLLANKLCPECGSIVDQSFFAEIENFQAQIN----KTISL---GYQQAKLEDLKK 339

Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
           +    K+     ++  ++  E L+   +E + ++EAL Y +     Y R+YP+++P+ G+
Sbjct: 340 LFIEGKQRLGECNMLYIRIIENLMDAYIESQRYEEALEYARRLEEPYCRLYPRYYPVTGV 399

Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
                GKLE +LG    A+KS+ +A EIL ++HG     + E+
Sbjct: 400 HLMKQGKLECYLGKFGEAVKSLGKAKEILLVSHGKECGLIHEM 442


>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
           guttata]
          Length = 289

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 154/303 (50%), Gaps = 50/303 (16%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           P I   +I   F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG   ++
Sbjct: 28  PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPNCVIVFEGYQLLL 84

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
            +V+ +  G E+ ISY+E+   T  RQK L  QY F C CP C    Q  D ++ A  E 
Sbjct: 85  HSVRDIQIGEELTISYVESLMPTRERQKQLMRQYCFECDCPLC--QNQEKDAEKLAG-EV 141

Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
           +  K+       ++D+ +      + G  +SKEE K      ++L++    L S   H  
Sbjct: 142 HAWKE-------VKDAVN------EVGYPKSKEEWK------HVLARCQNLLRSNKGHLP 182

Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LEDWKEALAYCQLTIPVYQR 294
             + Y++                     K++   M+    LE W+EAL Y   T+  Y+ 
Sbjct: 183 DTNIYQL---------------------KMLDCAMDACINLESWEEALYYGSRTLEPYRL 221

Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
            YP FHPL  +Q    GKL++  G    A++++ +A  I+++THGT+   MK L+   EE
Sbjct: 222 YYPGFHPLRAVQLMRVGKLQYSQGMVPQALETLKQAYNIMKVTHGTDHSLMKVLMEMKEE 281

Query: 355 AQA 357
            +A
Sbjct: 282 CEA 284


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
          Length = 428

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 165/327 (50%), Gaps = 55/327 (16%)

Query: 41  QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLYP +S++N
Sbjct: 152 QLYLRAEIQDASHLP---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 205

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
           HSC PN V++FEG   ++R+++ +  G E+ ISYIE+   T  RQK LK QY F C C  
Sbjct: 206 HSCDPNCVIIFEGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDC-- 263

Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
           C+   Q  D ++ A  E              ++  D      +    +SKEE +K     
Sbjct: 264 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEK----- 304

Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LE 276
            +L+K    L+S  +H    + Y++                     KL+   M+    LE
Sbjct: 305 -VLAKCQHLLSSHTSHLPDTNIYQL---------------------KLLDCAMDACINLE 342

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
            W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A+ ++ +A +I+++
Sbjct: 343 AWEQALCYGSRTLGPYRLYYPDFHPLRAVQLMRVGKLQYSQGMLPQALGTLKQAYDIMKV 402

Query: 337 THGTNSPFMKELILKLEEAQAEASYKL 363
           THGT+   M+ L + L+E Q EA  KL
Sbjct: 403 THGTDHSLMQAL-MDLKE-QCEAMMKL 427


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
           magnipapillata]
          Length = 1037

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 166/333 (49%), Gaps = 7/333 (2%)

Query: 25  SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNAHTICN 82
           +L+  L A+  +I   +   +   A ++   LQ   I+IN+I       K +CN+  I N
Sbjct: 121 TLINNLYANKGNISNARKEAFFTFAAVLVEYLQDVNININDIDIYGLMCKASCNSFAITN 180

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
           +EL  LGTG++   S+ NHSC PN V  F GR   +RA++ + +G E+++SYI    ++ 
Sbjct: 181 AELNSLGTGIFSSASLFNHSCDPNCVATFNGRDISIRAIKPIAEGEELMLSYISILATSD 240

Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 202
            RQ  L+E Y+FTC C  C +    D + +S      +C    C  FL+   +    +CQ
Sbjct: 241 VRQLELRESYMFTCKCTVCSRKEVNDSLMKSVKCSQPQC---LCMKFLITAPESNKCSCQ 297

Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
           +     S + I K    ++ L     +++     ++  S  ++I   ++ L  P ++ L+
Sbjct: 298 K-NCEASNDYILKANDCMDKLQALYNSISVVPTIEQQKSLTQLIRCGEEILCSP-NIALL 355

Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
           Q  E  +   +E  DWK A  Y       Y+    ++HP LGL Y+  GKL   + D  N
Sbjct: 356 QCYEVAMDGCIESGDWKGAFQYGIKLECSYKNYLSEYHPTLGLHYFKLGKLALQIEDLRN 415

Query: 323 AIKSMTEAVEILRITHGTNSPFMKELILKLEEA 355
            I  + +  ++L ITHG++  F+++L   L+EA
Sbjct: 416 GITYLEKGYKVLSITHGSSCHFVQKLKSYLDEA 448


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
           gallopavo]
          Length = 427

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 161/322 (50%), Gaps = 55/322 (17%)

Query: 41  QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLYP +S++N
Sbjct: 151 QLYLRAEIQDASHLS---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 204

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
           HSC PN V++FEG   ++R+V+ +  G E+ +SYIE+   T  RQ+ LK QY F C C  
Sbjct: 205 HSCDPNCVIIFEGYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDC-- 262

Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
           C+   Q  D ++ A  E              ++  D      +    +SKEE +K+ ++ 
Sbjct: 263 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEKVLAKC 308

Query: 221 -NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----L 275
            ++LS  T  L                             N+ Q   KL+   M+    L
Sbjct: 309 QHLLSSHTSRLP--------------------------DTNIYQL--KLLDCAMDACINL 340

Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
           E W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A++++ +A +I++
Sbjct: 341 EAWEQALCYGSRTLGPYRLYYPNFHPLRAVQLMRVGKLQYSQGMLPQALETLKQAYDIMK 400

Query: 336 ITHGTNSPFMKELILKLEEAQA 357
           +THGT+   M+ L+   E+ +A
Sbjct: 401 VTHGTDHSLMQALMDLKEQCEA 422


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 164/350 (46%), Gaps = 46/350 (13%)

Query: 17  PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----F 70
           PS T+     +E   +H+ D+ E++    +Q+++++ L +Q   P I+    A +     
Sbjct: 114 PSQTSPGLCSLEEHESHLCDMGEERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLL 173

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           +KL CN  TI + ELR +G GLYP +S++NH C P+ V++FEG    +RAV+ +    E+
Sbjct: 174 AKLTCNCFTISDGELREIGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEEL 233

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
            ISYI T   T  R+  L+EQY FTC C                     RC        +
Sbjct: 234 TISYIGTLAPTRDRRTQLEEQYHFTCQC--------------------QRCTTADMDPVM 273

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
           L            CG  ++   +++      I + +     +C N + +     +++ L 
Sbjct: 274 L------------CGEEKAWTPLREA-----IPTLENWQTQACWN-ELLYECSSLLDPLD 315

Query: 251 KKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 308
             +      N+ + R  +  +   + L  W  ALAY  +T+  Y++ YP  HP   +Q  
Sbjct: 316 GDVPAVPDCNVYRLRVTDLALDACINLAAWDTALAYGTITLEPYRQYYPDPHPAHAIQLM 375

Query: 309 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
              KL+ FL    +A  ++  A +I+++THG   P   +LI KLEE +AE
Sbjct: 376 RVAKLQHFLVHLGDAQHTLRLAYDIMKVTHGNQHPLTSDLIRKLEECRAE 425


>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
          Length = 455

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 175/375 (46%), Gaps = 26/375 (6%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ 57
           ML   +RR  +  +      TD  Y     L+ H  DI  D K++  +  +  ++  +L 
Sbjct: 86  MLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIRKDVKRIEHFHTLNVVLQRLLD 145

Query: 58  WPEISI-NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
            P I   +E+   F K+  N+  +C+ E+  +GTG+Y   SI++HSC PNAV  F G   
Sbjct: 146 EPAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGASILDHSCRPNAVATFVGEQL 205

Query: 117 VVRAVQHVPKGAE-----VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
            +R ++    G E     + ISYI+    + TR++ L E+Y F C C RC       D  
Sbjct: 206 QLRLLEDF-AGPELDFSRIFISYIDLIDPSDTRREQLSERYYFRCECVRC------RDEA 258

Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
           E  ++    C++  C    +RD       C+        +  +    EV   ++  LA  
Sbjct: 259 ERELMGAAACQNRKCDE-PIRDGQTLCSACE----APFDQSARDRFDEVTSFTRDRLAEM 313

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
               + +V    ++  + Q  + HP +   ++T +      ++LE W+ AL Y    +  
Sbjct: 314 KDVAYFDVC---RLCLEKQSGVLHPLNAQHIKTLDYAFDSAIKLEKWEAALRYGAGAVAG 370

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
           Y+R  P  +PLLGL     GK++ +LGD + A+ S+ EA +ILR+THG      K+ ++ 
Sbjct: 371 YRRYSPS-NPLLGLMLANIGKIQLYLGDAKTALSSLHEADKILRVTHGEQHDLYKDQLVP 429

Query: 352 LEEAQAEASYKLSSK 366
           L    A   Y+LS +
Sbjct: 430 L-LCDAAQQYELSQR 443


>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
 gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
          Length = 453

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 180/378 (47%), Gaps = 21/378 (5%)

Query: 1   MLKLYLRRKLQN-DNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQI-ANLVNLIL 56
           M+   +RR L+  D      T+  Y     L+ H  +I  D K++  +  +   L  L+ 
Sbjct: 86  MIARIVRRLLKGGDTHKGYYTSKQYRKFCDLMPHEENIRADSKRMEHFGTLYVVLQRLLD 145

Query: 57  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
           +    +  E+   + K+  N   I ++E+  +GTG+Y   SII+HSC PN V+ F+G   
Sbjct: 146 EASRPTKAELLRIYGKMCINTFNILDAEMSTIGTGMYIGASIIDHSCRPNVVVSFDGETL 205

Query: 117 VVRAVQHVPKG----AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
            +R ++  P+      ++ ISYI+   +   RQ+ L E+Y F C C RC       D QE
Sbjct: 206 RMRLLEDYPEQELDFGKLFISYIDLIDTAEVRQEQLAERYYFHCACERC------RDEQE 259

Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
              +    C +  C   L     ++   C  CG   +  + +  A E++  ++  LA   
Sbjct: 260 QKRMNAAACPNTTCHEPLDFSDSEQLNQCPACGTAVTHSDREAFA-EISSFTRDHLAQMK 318

Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
              + +V    ++  + Q  + H ++V+ ++T +  ++  + LE W EA  Y    +  +
Sbjct: 319 SVAYLDVS---RLCLEKQANVLHRYNVHHIKTLDNAMESALNLEKWTEATGYGLRLLDGF 375

Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
           ++ Y  +HPLLGL Y   GKL+ +      A+K + +A +ILR+THG      K +++ L
Sbjct: 376 RQYYSTYHPLLGLTYLKVGKLQLYQCQFAEALKQLQQAAKILRVTHGEQDDLYKRVLVPL 435

Query: 353 --EEAQAEASYKLSSKDE 368
             + AQ +  + L+  DE
Sbjct: 436 LCDAAQGDLGH-LAIADE 452


>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           impatiens]
          Length = 439

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 164/324 (50%), Gaps = 26/324 (8%)

Query: 23  NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHT 79
           +Y   + L++H S+I  DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +
Sbjct: 110 SYRKFKDLMSHCSEIEKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFS 169

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIET 137
           I NS++  +G G+Y   SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+ 
Sbjct: 170 IFNSDMN-IGVGIYLGPSILDHSCKPNAVATFEGTTITVKAIEDLPSLDLSQIRIPYIDV 228

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
             +T  R+  L+  Y F C C RC K           + E   C +  C+     ++D  
Sbjct: 229 IKTTRDRRAELQSSYYFWCDCERCEK--------PEPMAEAAACPNKLCTYPCAPNAD-- 278

Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYH 255
              C++C   +  E  K+I  E++  +   L      N + +  +   KM    Q+ + H
Sbjct: 279 --LCEKCN-TKFPENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILH 330

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           P +V  +QT +      + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE 
Sbjct: 331 PLNVQYVQTLQTAFDSSINLQHWEEAESYAKKLINGYLTYYGEIHPSTGILYLSIGKLEV 390

Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
           +L   + AIK++ +A  IL ITHG
Sbjct: 391 YLKKLKQAIKTLRKASSILTITHG 414


>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ornithorhynchus anatinus]
          Length = 415

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 136/274 (49%), Gaps = 39/274 (14%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           I E+F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG    +RAV+ + 
Sbjct: 145 IFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGTSLFLRAVREIQ 204

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
           KG E+ I Y++    +  RQK LKEQY F C C RC    Q  D+   A  E +      
Sbjct: 205 KGEELTICYLDVLLPSQERQKQLKEQYCFACDCIRCKT--QDKDVDMLAGEEPF------ 256

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
                                       K++   V+ + +    L S    ++V++T + 
Sbjct: 257 ---------------------------WKEVKDAVDTVEE----LQSQKKWEQVLATCQA 285

Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
           +    K      ++  ++  +  +   + L  W++AL Y + T+  Y+  YP FHP+  +
Sbjct: 286 LINNHKDRIPDRNIYQLKMLDCAMDACINLSLWEDALLYGRRTLDPYRLYYPGFHPIKAI 345

Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           Q    GKL+   G    A++++ EA ++L++THG
Sbjct: 346 QIMKIGKLQQHQGMFREALETLREAFDLLKVTHG 379


>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 445

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 185/380 (48%), Gaps = 33/380 (8%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ 57
           M+   +R+ L+  +V+    TD  Y     L+ H  DI  DEK++  +  +  ++  ++ 
Sbjct: 85  MMSRIIRKLLKGGDVMKGYYTDKCYRRFWDLMPHEEDIKKDEKRMEHFQSLTVVLRSLID 144

Query: 58  WPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
              +  N E+   F K+  N+  I + E+  +GTG+Y   SI++HSC PNAV +F+G   
Sbjct: 145 EAAMPGNQELLRIFGKMCINSFNILDDEMNSIGTGMYLGASIMDHSCRPNAVAIFDGCNL 204

Query: 117 VVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
            VR ++    GAE+      ISYI+    T  R+  L+++Y F C C RC       D Q
Sbjct: 205 NVRLLEDY-HGAEIDFSKIFISYIDLLNPTDVRRDMLRKRYYFECGCERC------RDEQ 257

Query: 172 ESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 230
           E  ++ G  C +  C   + +    D+   C        +++ ++I++           +
Sbjct: 258 ELKLMNGAACANAQCDEPISMTARQDRCPGCSTAIKQTERDKFREISA---------FTM 308

Query: 231 TSCGNHQEVVSTYKMIEKL----QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
           T     ++V  TY  I +L    Q+K++HPF+V  ++T +   +  +E+E W++A+ Y  
Sbjct: 309 TQLEQMKDV--TYLDICQLCLSKQEKVFHPFNVWYLKTLDLAFESAIEMEKWEDAIDYGS 366

Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK 346
                ++R    FHPL  L     GK+E +L   + A+ ++ E+ +ILRITHG      K
Sbjct: 367 RLKDGFKRFNGAFHPLYALLLLKLGKIEIYLKHGKEALMNVNESEKILRITHGEEHDLYK 426

Query: 347 ELILKLEEAQAEASYKLSSK 366
           + ++ L   QA   Y+   K
Sbjct: 427 KQLIPL-LCQAATEYEQMDK 445


>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
          Length = 520

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 56/82 (68%), Positives = 70/82 (85%)

Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 327
           L++I MEL+DW+ AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+
Sbjct: 367 LLQIYMELQDWQTALMYCRLTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSL 426

Query: 328 TEAVEILRITHGTNSPFMKELI 349
           T A ++LRITHGT S FMKEL 
Sbjct: 427 TRAADVLRITHGTQSQFMKELF 448



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 47/118 (39%), Positives = 64/118 (54%), Gaps = 17/118 (14%)

Query: 1   MLKLYLRRKLQNDN-------VIPSTTTD----NYSLVEALVAHMSDIDEKQLL-LYAQI 48
           M+KL LRRKLQ++        ++PS   D     ++  ++ V     I    +L + A I
Sbjct: 118 MVKLILRRKLQSEKMANLVQLILPSIELDLKEIAHTFSKSFVCTEVGIAYCHVLEILAVI 177

Query: 49  ANLVN--LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
            +L    L L    I+++     F   ACNAHTIC+ ELRPLGTGL+P ISIINH  L
Sbjct: 178 GSLRPNPLGLGSAVIAVHPTQVQF---ACNAHTICDPELRPLGTGLFPAISIINHRIL 232


>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Monodelphis domestica]
          Length = 441

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 39/293 (13%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E+F+K+ CN   I N E++ +G GLYP +S++NHSC PN V+VFEG    +RAV+ +
Sbjct: 183 DIFESFAKVICNGFAISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAVRDI 242

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
            +G E+ I Y++    +  RQK LKEQY F C CP C                       
Sbjct: 243 QQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCET--------------------- 281

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  R  DD   + ++      ++ +KK+  ++    K    L  C   Q +++ Y 
Sbjct: 282 -------RSKDDDMLSGEEQAWKEVQDSLKKV-EDLRAQEKWEQILAIC---QTLINNYG 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
             ++L  +     ++  ++  +  +   + L  W++AL Y   T+  Y+  Y  FHP+ G
Sbjct: 331 --DRLPDR-----NIYQLKMLDCAMDACINLCLWEDALLYGSRTLEPYRLYYSGFHPISG 383

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
           +Q    GKL+   G    A++++ +A E++++THG +   +++L+L L + +A
Sbjct: 384 VQVMRVGKLQHHQGLYPQALETLKQAFELMKVTHGRDHSLVEDLLLLLGDCEA 436


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E++ K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
           +Q    GKL+   G    A++++  A +I+R+THG      ++LIL LEE  A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMQNLRLAFDIIRVTHGREHSLTEDLILLLEECDA 423


>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPNINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
           +Q    GKL+   G    A+K++  A +I+++THG     +++LI+ LEE  A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHGREHSLIEDLIVLLEECDA 423


>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
           mellifera]
          Length = 440

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 164/324 (50%), Gaps = 27/324 (8%)

Query: 24  YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAH 78
           Y   + L++H SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+ 
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSF 168

Query: 79  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIE 136
            I + ++  +G G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+
Sbjct: 169 NISDLDMNNIGAGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYID 228

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSD 195
              +T  R++ L+  Y F C C +C         +ES  ++E   C +  C+     D+D
Sbjct: 229 VIKTTKDRREELQSSYYFWCNCKKC---------EESEPMVEAAACPNKFCTYPCSLDAD 279

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
                C+ C   +  E  K+   E++ L+   L       + +V +   M  K Q+ + H
Sbjct: 280 ----MCENCN-TKFPENFKETFYEISDLTAYHLQNMKNIAYLDVST---MCLKKQEGVLH 331

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           P ++  +QT +      + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ 
Sbjct: 332 PLNIQHVQTLQSAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQV 391

Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
           +L   + AI+++ +A  IL ITHG
Sbjct: 392 YLKKLKQAIETLRKASAILTITHG 415


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 146/295 (49%), Gaps = 43/295 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TIC++E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C                    +RC+  
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDC--------------------FRCR-- 267

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 ------TQDKDADMLTGNEQAWKEVQESLKKI-EELKAHRKWEQVLAMC---QTILSSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L          N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 318 --EQLP-------DTNIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
            G+Q    GKL+   G    A+K++  A +I+R+THG     +++LI  LEE  A
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREHSLIEDLIRLLEECDA 423


>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
          Length = 429

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 162/357 (45%), Gaps = 48/357 (13%)

Query: 15  VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---- 70
           + PS  T      E   +H+ D+ E++    +Q+++++ L L+  +  I +   +F    
Sbjct: 112 LCPSQMTPQLYSFEEHESHLCDMGEEKREGLSQLSSMLQLYLKQEQPDIIQKVPSFDPIS 171

Query: 71  --SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
             +K+ CN  TI ++EL+ +G GLYP +S++NH C P+ V++F+G+   +RA++ +    
Sbjct: 172 LLAKVTCNCFTISDAELQEIGVGLYPSMSLLNHDCRPSCVMLFQGKTLQLRAIRDIQPTE 231

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
           EV ISYI     T  RQ  L EQY F+C C  C    + D +                  
Sbjct: 232 EVTISYIGVLLPTRERQTQLMEQYHFSCQCGLC-STAELDPLMF---------------- 274

Query: 189 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIE 247
                          CG+  +   +K+    + IL          GN +E++     ++ 
Sbjct: 275 ---------------CGVKEAWTPMKEAIPRLEILQTN-------GNWEELLQECSSLLA 312

Query: 248 KLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
            +   +      N+ + R  +      + L  W+ ALAY   T+  Y++ YP  HP   +
Sbjct: 313 PVGGAVPAVPDSNVYRLRVTDLAFDACINLARWETALAYGLKTLGPYRQYYPDPHPAHAI 372

Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYK 362
           Q     KL+ FL   E+A  ++  A +I++ITHG       +LI KLEE  AE   K
Sbjct: 373 QLMRVAKLQHFLVHLEDAQHTLRLAYDIMKITHGNEHSLTSDLIRKLEECLAEMDCK 429


>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
          Length = 391

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 95/372 (25%), Positives = 178/372 (47%), Gaps = 27/372 (7%)

Query: 7   RRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R +++  +    T ++    V+   +H+  +D EK+ L+ + I+ L +   +  E    E
Sbjct: 20  RSRMRKKSHPERTPSEKLLTVKEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDQE 79

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 80  SLVALFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 139

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---------IKLGQFDDIQES-A 174
             G EV  SYI+    T  R   L++ Y FTC C  C         +K+ + +D  ++ A
Sbjct: 140 GPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAKVKIRKLNDPPKAEA 199

Query: 175 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKT----- 227
           I +  R   +    F  R +    +    CG    R   E+ +  + V++L  KT     
Sbjct: 200 IRDMVRYARNVIEEF--RRAKHYKYILDSCGRSPARGAGELGQELNVVDVLLSKTTENGS 257

Query: 228 ---LALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEA 281
              L L     H       ++ E  Q+K+   F   +V ++    + + + + ++DW+ A
Sbjct: 258 LKVLILGHYFGHTPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYIQDWEGA 317

Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
           L Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + HG +
Sbjct: 318 LRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEVAHGKD 377

Query: 342 SPFMKELILKLE 353
            P++ E+  ++E
Sbjct: 378 HPYISEIKQEIE 389


>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           terrestris]
          Length = 439

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 93/325 (28%), Positives = 160/325 (49%), Gaps = 26/325 (8%)

Query: 23  NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHT 79
           NY   + L++H S+I  DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +
Sbjct: 110 NYRKFKDLMSHCSEIKKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFS 169

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIET 137
           I N ++  +G G+Y   SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+ 
Sbjct: 170 IFNLDMN-IGVGIYLGPSILDHSCKPNAVATFEGTTINVKAIEDLPSLDLSQIRIPYIDV 228

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
             +   R+  L+  Y F C C +C K           + E   C +  C+       D  
Sbjct: 229 IKTAGDRRAELQSSYYFWCDCEKCEK--------PEPMAEAAACPNKLCT----YPCDPN 276

Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYH 255
              C++C   +  E  K+I  E++  +   L      N + +  +   KM    Q+ + H
Sbjct: 277 ADLCEKCN-TKFPENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILH 330

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           P +V  +QT +      + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE 
Sbjct: 331 PLNVQYVQTLQTAFDSSINLQHWEEAESYAKKLINGYLAYYGEIHPSTGILYLSIGKLEV 390

Query: 316 FLGDTENAIKSMTEAVEILRITHGT 340
           +L   + AIK++ +A  IL ITHG 
Sbjct: 391 YLKKLKQAIKTLRKASLILTITHGA 415


>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
          Length = 441

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 166/344 (48%), Gaps = 37/344 (10%)

Query: 20  TTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACN 76
           T  N+   + L++H SDI  D K++  +  +  +++  L    + +I E+   + ++  N
Sbjct: 103 TEKNFRRFKDLMSHYSDIKVDVKRMEHFTMLCGVLSQFLDETFMPNIAELMGIYGRICTN 162

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--------GA 128
           +  I +  +  +G G+Y   S+I+HSC PN ++VFEG   +VR +  +P           
Sbjct: 163 SFNILDINMNTIGVGIYLGASVIDHSCKPNVIVVFEGTTIIVRTLTDLPSLDWSQASIDK 222

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
           ++ ISY++   S   R++ L   Y F C C RC K        E  ++E   C +  C  
Sbjct: 223 DIRISYVDLLNSNKDRREELHSSYYFWCDCERCKK--------EEPMVEAAACPNLLCDS 274

Query: 189 FLLRDSDDKGFTCQQCG---LVRSKEEIKKIAS-EVNILSK-KTLALTSCGNHQEVVSTY 243
               ++D+    C++C     V  KE  +++    ++ L K KT+A          +   
Sbjct: 275 PCSIEADE----CEKCNKEISVEFKETFREVVDFTIHHLEKMKTMAY---------LDVS 321

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           K+  K QK + H F++  ++T E      M L+ W++A  Y +  +P Y   Y + HPL 
Sbjct: 322 KICLKKQKGIMHKFNIQHVRTLEMAHIAAMNLKCWEDAEFYGKELVPGYLLYYGEIHPLT 381

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
           GL Y T GK++  L   + A +++T+A  +L ITHG     ++E
Sbjct: 382 GLLYLTVGKIQLHLEKPKEAFQALTKANTVLTITHGDKHSIVEE 425


>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 378

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 142/285 (49%), Gaps = 30/285 (10%)

Query: 73  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEV 130
           +  N   I +SE++ +G+GLY   SII+HSC PNAV VF+G    +R ++ +P    ++V
Sbjct: 108 MVINTFNILDSEMQSIGSGLYLASSIIDHSCSPNAVAVFKGTTIFIRTLEDIPIMDWSKV 167

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
            ISYIE       RQ+ L   Y F C C +C       D      ++   C+++ C  FL
Sbjct: 168 FISYIELLNLPEIRQQELLSSYYFLCQCSKCT------DSDNLNFMKSIYCQNEKCKNFL 221

Query: 191 LRDSDDKGFTCQQCGLVRSKE---EIKKI----ASEVNILSKKTLALTSCGNHQEVVSTY 243
           + +  +    C +CG   SKE   EIK+I     SE+N +            +   +   
Sbjct: 222 MPNEVE----CHKCGQSISKEDEDEIKEIIQYTESELNKME-----------NMAYLDIC 266

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           ++  + Q+K   P ++  ++  +  ++  + LE+W+ A+ Y +  +  + + Y ++HPL 
Sbjct: 267 RICLQKQEKYLTPLNIYKVKILDLAMESSIALENWESAVFYGRQLLEPFLKYYGEYHPLR 326

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G+     GK+ + L + + A K   EA  IL +THG N  F +EL
Sbjct: 327 GIFLMKLGKILFLLNNIDEAKKYFKEANLILSVTHGKNHCFSREL 371


>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
 gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
 gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
 gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
          Length = 468

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 178/369 (48%), Gaps = 26/369 (7%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++ +
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 174

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C       D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +  D ++    C +C    S + ++   +E   L++  L     
Sbjct: 289 KEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 339

Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
            N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL Y Q  +P 
Sbjct: 340 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 398

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
           +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG +   + E L +
Sbjct: 399 FRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDHRLLTEQLYM 458

Query: 351 KLEEAQAEA 359
            + +A+ EA
Sbjct: 459 LVLQARQEA 467


>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
 gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
          Length = 439

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 181/368 (49%), Gaps = 22/368 (5%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I  D K+L  L +  A L +++  
Sbjct: 86  LCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPKRLEHLDSLHAVLTDMMAD 145

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 146 SPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 205

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             + A++ +P    +++ ISYI+   +   R++ LK+ Y F C C +C       D +E+
Sbjct: 206 LHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC------RDPKEA 259

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  CS  +  + ++    C++C +  S + ++   +E+  L+K  L     
Sbjct: 260 RQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALTKHNLEAMKD 314

Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
             + +V    K+    Q  L HP +V  ++T +   +  +E+  W+EAL Y Q  +P +Q
Sbjct: 315 VAYLDVC---KVCLDKQAGLMHPLNVWHVKTLDAAFEAAIEVGKWEEALDYGQQLLPGFQ 371

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL- 352
             + Q++PL+GL +   GK++ F    + A   + EA  IL +THG +   + E +  L 
Sbjct: 372 MYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRLLVEQLYPLI 431

Query: 353 EEAQAEAS 360
            +A+ EA+
Sbjct: 432 FQARQEAN 439


>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
           rotundata]
          Length = 438

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 88/353 (24%), Positives = 168/353 (47%), Gaps = 23/353 (6%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQIAN-LVNLILQ 57
           M ++ ++     D+ +   T   +   + L++H SDI  DEK++  +  +   L   +  
Sbjct: 88  MARIIIKLSQGGDDEVEYYTKTKFRRFKDLMSHYSDIKKDEKKMEHFMFLCGVLFGFLGD 147

Query: 58  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
            P  +  E+   + ++  N+  I + ++  +G G+Y   S+++HSC+PNAV  FEG    
Sbjct: 148 TPMPNSAELMGIYGRICINSFNIFDLDMNSIGVGIYLAPSVVDHSCVPNAVATFEGITLN 207

Query: 118 VRAVQHVPK--GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
           +R ++ +P    +++ ISYI+   +T  R+  L+  Y F C C +C         +E  +
Sbjct: 208 IRTIEDLPSLDWSQIRISYIDVLKTTKERRSELQSSYYFWCNCKKC---------EEPEL 258

Query: 176 L-EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
           + E   C +  C+     D +    +C +C  ++  E  K+I  EV+  +   L      
Sbjct: 259 MAEAAACSNKNCTNPCSPDME----SCPECN-IKLLENFKEIFDEVSSFTAHHLQNMK-- 311

Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
            +   +   KM  K Q+ + H  ++  +QT +      + L+ W+EA  Y +  I  Y  
Sbjct: 312 -NMAYLDVSKMCLKKQEGVLHSLNIQHVQTLQTAFDSSISLQHWEEAEFYAKKLIKGYLV 370

Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
            Y + HPL G+     GK++ +L   + A++++ +A  IL ITHG     ++E
Sbjct: 371 YYGEVHPLTGILLLMTGKIQLYLEKPKQALEALRKANSILAITHGEQHTLVRE 423


>gi|21430850|gb|AAM51103.1| SD20045p [Drosophila melanogaster]
          Length = 382

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 178/369 (48%), Gaps = 26/369 (7%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++ +
Sbjct: 29  LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 88

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 89  SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 148

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C       D +ES
Sbjct: 149 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 202

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +  D ++    C +C    S + ++   +E   L++  L     
Sbjct: 203 KEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 253

Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
            N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL Y Q  +P 
Sbjct: 254 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 312

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
           +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG +   + E L +
Sbjct: 313 FRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDHRLLTEQLYV 372

Query: 351 KLEEAQAEA 359
            + +A+ EA
Sbjct: 373 LVLQARQEA 381


>gi|224033847|gb|ACN35999.1| unknown [Zea mays]
 gi|413955568|gb|AFW88217.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 129

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 56/98 (57%), Positives = 73/98 (74%)

Query: 218 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 277
           S++  LS K  +  S GN  E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+D
Sbjct: 3   SKILQLSDKASSFLSSGNKAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQD 62

Query: 278 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           W  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW
Sbjct: 63  WWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEW 100


>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
          Length = 390

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 90/357 (25%), Positives = 163/357 (45%), Gaps = 26/357 (7%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPE 60
           ++L  R   +  N   +T ++    V+   +H+  +D EK+ L+ + I+ L +   +  E
Sbjct: 53  VRLTARILAKQKNHPETTPSEKLLAVKDFESHLDKLDNEKKDLIQSDISALHHFYSKHLE 112

Query: 61  ISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
              N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VR
Sbjct: 113 FPDNKSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVR 172

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           AVQ +  G EV  SYI+    T  R   L++ Y FTC C  C                  
Sbjct: 173 AVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT---------------- 216

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
             K    +   +R   D       C +VR     + +  E          L  C      
Sbjct: 217 --KHKDKAKVKIRKLSDPPKAEAICDMVRY---ARNVIEEFRRAKHYKYILYRCPGVAPP 271

Query: 240 VSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
               ++ E  Q+K+   F   +V ++    + + + + ++DW+ AL Y Q  I  Y + Y
Sbjct: 272 SELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHY 331

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           P +   +   +   G+L   L +     K++ +AVEI+ + HG + P++ E+  ++E
Sbjct: 332 PLYSLNVASMWLKLGRLYMGLENKAAGEKALRKAVEIMEVAHGKDHPYISEIKQEIE 388


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 137/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K LK+QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +    + +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----ETQDKDADMLTGDEQVWRQVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q R  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINVYQLRVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
 gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
          Length = 442

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 151/301 (50%), Gaps = 24/301 (7%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           E+   + ++  N+  I + EL+ +GTG+Y   S+I+HSC PNAV +F+G +  +RA+Q  
Sbjct: 154 ELMGMYGRMCINSFNIIDQELQCIGTGMYLGASVIDHSCSPNAVAIFDGPILSIRALQTF 213

Query: 125 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
                +++ ISYI+   +T  RQ  L+  Y F C CP+C++        E   +    C 
Sbjct: 214 QYLDWSQIKISYIDILNTTKDRQSELEAAYYFLCKCPKCLE-------PEPPEINAAACP 266

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA-----SEVNILSKKTLALTSCGNHQ 237
           ++ C   +  +    G  C +C  V S+  +K+       +++++ + K LA        
Sbjct: 267 NEKCDNHIDTEIITPGDKCAKCDTVVSETFLKRFKEVIEFTDLHLQNMKQLA-------- 318

Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
                 ++  K Q+ + H F++  ++T +   +  ++ + W  A  +    +  + + Y 
Sbjct: 319 -YFDVCEICLKKQEGVLHKFNIKHVKTLDLAFQSSIDFQKWDFARKFALELVDAFYKYYG 377

Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK-ELILKLEEAQ 356
             HP+ GL +   GK+  F  + + A++ +T+A +IL+I HG  S   K EL+  L++A+
Sbjct: 378 HVHPITGLLHLKLGKILLFEENDQLALEHLTKAYQILKIIHGVGSHLFKDELVPLLQQAR 437

Query: 357 A 357
           A
Sbjct: 438 A 438


>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
          Length = 386

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 161/339 (47%), Gaps = 32/339 (9%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
           T+++    V+   +H+  +D EK+ L+ + IA L +   +  E   N  +   F+++ CN
Sbjct: 76  TSSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHLEYPDNCSLVTLFAQVNCN 135

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+
Sbjct: 136 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYID 195

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
               T  R   LK+ Y FTC C  C  K      ++   + E  + +D       +RD  
Sbjct: 196 LLYPTEDRNDRLKDSYFFTCECRECTTKAKDKAKVEIRKLSEPPKAED-------IRD-- 246

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
                     +V+    + +           +  L  C   QE +S+          L+ 
Sbjct: 247 ----------MVKYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------LFE 286

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
             +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L  
Sbjct: 287 DSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYM 346

Query: 316 FLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
            L    + +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 347 GLEHKASGVKALKKAIVIMEVAHGKDHPYISEIKKEIED 385


>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
          Length = 352

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 94  DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 153

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 154 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 190

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 191 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 241

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 242 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 292

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 293 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 329


>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
 gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
          Length = 466

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 176/369 (47%), Gaps = 26/369 (7%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++ +
Sbjct: 113 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLSDMMAE 172

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 173 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 232

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C       D +ES
Sbjct: 233 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 286

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +  D  +    C +C    S + ++   +E   L++  L     
Sbjct: 287 KEMLAALCPNRNCGVGISVDRTN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 337

Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
            N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL Y Q  +P 
Sbjct: 338 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 396

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
           +++ +  ++PLLGL +   GK++ + G  + A+  + EA  IL +THG +   + E L L
Sbjct: 397 FRKYHGPWNPLLGLLHMKLGKIQLYEGHAKEALHHLEEAQRILTVTHGRDHRLLTEQLYL 456

Query: 351 KLEEAQAEA 359
            + +A+ EA
Sbjct: 457 LVLQARQEA 465


>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
 gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
          Length = 390

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 32/278 (11%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           + K  CN   I N +LR +G GLYP  ++INHSC  N V  F G    +RA+  +  G E
Sbjct: 106 YGKTTCNCFAIHNLDLREIGVGLYPQAAMINHSCKSNCVSTFRGPTLQIRALVDIQPGEE 165

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
           V  SY E    T  R+  L+ +Y F C CP C+      D    AI++  +C    C G 
Sbjct: 166 VCYSYTEKGNVTHERRDELR-KYFFECQCPHCL------DTDRDAIMKSVKCP--SCQGQ 216

Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKK-----IASEVNILSKKTLALTSCGNHQEVVSTYK 244
           +   S D+   C  CG      E  +     I  E ++L ++                  
Sbjct: 217 VKPTSSDRYEKCSSCGFTDFTTEFYEDLEIYIHVEFDLLFRENC---------------- 260

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
           ++E    K+ HP ++++++         ++LE+W +A+ Y +     +    P   P  G
Sbjct: 261 LVE--LDKILHPDNIHVVRILVGAFAASVKLEEWTKAIDYGKRLDRAFGLYLPPNEPDTG 318

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
           L YY  GK  + L D ENA+ S+ +A  +L I +G +S
Sbjct: 319 LLYYKMGKAYYHLDDIENAVTSLRKAKTLLSIAYGRDS 356


>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
           [Equus caballus]
          Length = 358

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 100 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCEPNCSIVFNGPHLLLRAVRDI 159

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 160 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 196

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 197 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 247

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 248 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 298

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 299 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 335


>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
          Length = 287

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 29  DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 88

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 89  EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQT--------------------- 127

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 128 -------QDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 176

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 177 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 227

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 228 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 264


>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 391

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/350 (25%), Positives = 161/350 (46%), Gaps = 32/350 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 68  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 127

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 128 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 187

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 188 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------KDK 229

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
             +   +R   D         +VR ++  I++     +  S  +  L  C   QE +S+ 
Sbjct: 230 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPPSELLEICELSQEKMSS- 288

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
                    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +
Sbjct: 289 ---------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNV 339

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
              +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 340 ASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 389


>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
          Length = 386

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 93/357 (26%), Positives = 163/357 (45%), Gaps = 41/357 (11%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L++P+  
Sbjct: 58  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 116

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 117 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 175

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                    K
Sbjct: 176 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT------------------TK 217

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSK----EEIKKIASEVNIL--SKKTLALTSCGNH 236
           D   +   +R   D         +VR      EE ++      IL  S  +  L  C   
Sbjct: 218 DKDKAKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKYILYNSPPSELLEICELS 277

Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
           QE + +          ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + Y
Sbjct: 278 QEKMGS----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHY 327

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           P +   +   +   G+L   L D     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 328 PLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 384


>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
 gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
          Length = 469

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 94/371 (25%), Positives = 178/371 (47%), Gaps = 26/371 (7%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++  
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAD 174

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C       D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +  D  +    C +C    S + ++   ++   L++  L     
Sbjct: 289 KEMLSALCPNRNCGVGITVDRTN----CPRCDAGISPK-LRNAFNDAMTLTRHNLE---- 339

Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
            N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL Y Q  +P 
Sbjct: 340 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWTDALDYGQRLLPG 398

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
           +++ +  ++PLLGL +   GK++ + G ++ A+  + EA  IL +THG +   + E L +
Sbjct: 399 FRKYHGPWNPLLGLLHMKLGKIQLYEGRSKEALHHLEEAQRILTVTHGRDHRLLTEQLYV 458

Query: 351 KLEEAQAEASY 361
            + +A+ EA++
Sbjct: 459 LILQARQEAAH 469


>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
           [Ailuropoda melanoleuca]
          Length = 385

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 162/351 (46%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L++P+  
Sbjct: 63  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 121

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 122 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 180

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 181 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTTKD-----KDKAKVE-IRKL 234

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
            D      +RD            +VR    + +           +  L  C   QE + +
Sbjct: 235 SDPPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS 282

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 283 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 332

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L D     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 333 VASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 383


>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
          Length = 369

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 257

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 258 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 309

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
          Length = 432

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/351 (23%), Positives = 161/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + +A L +     L++P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLEKLDNEKRDLIQSDVAALHHFYSKHLEFPDT- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                     
Sbjct: 229 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT-------------------- 268

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                    +D D      ++       E I+++      + ++        +  E++  
Sbjct: 269 --------TKDKDKAKVEIRKLSDPPKAETIREMVRYARNVIEEFRRAKHYKSPSELLEI 320

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 321 CELSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L D     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
           leucogenys]
          Length = 369

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----------------------- 207

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3 [Equus caballus]
          Length = 440

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 182 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 241

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 242 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 278

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 279 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 329

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 330 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 381 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 417


>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
          Length = 349

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 27  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 86

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 87  SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 146

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 147 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 188

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R   D         +VR    + +           +  L  C   QE +S+  
Sbjct: 189 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 246

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 247 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 298

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 299 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 347


>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
          Length = 433

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ I HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431


>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
          Length = 373

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 115 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 174

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 175 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 211

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 212 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 262

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 263 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 313

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 314 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 350


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
          Length = 433

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 157/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKELIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 231 NPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------KDK 272

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R  +D         +VR    + +           +  L  C   QE +S+  
Sbjct: 273 DKAKVEIRKLNDPPKAETVRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
           aries]
          Length = 369

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 170

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 207

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNA 258

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 309

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
 gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
 gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
          Length = 369

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
          Length = 433

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLIVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ I HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431


>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
           gorilla]
          Length = 384

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 62  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 121

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 122 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 181

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 182 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 223

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R   D         +VR    + +           +  L  C   QE +S+  
Sbjct: 224 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 281

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 282 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 333

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 334 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
          Length = 380

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 58  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 117

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 118 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 177

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 178 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 219

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R   D         +VR    + +           +  L  C   QE +S+  
Sbjct: 220 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 277

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 278 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 329

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 330 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSN- 316

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
          Length = 433

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLIVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                ++RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEVIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 172 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 231

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 232 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 268

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 269 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 319

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 320 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 370

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 407


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 231 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 267

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 318

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 369

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 370 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 406


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 206 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 265

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 266 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 302

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 303 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 353

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 354 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 404

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 405 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 441


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
          Length = 384

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 158/337 (46%), Gaps = 30/337 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
           T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N+ +   F+++ CN
Sbjct: 74  TPSEKLLAVKEFESHLEKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCN 133

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+
Sbjct: 134 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYID 193

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   L++ Y FTC C  C         ++ A +E  R   D      +RD   
Sbjct: 194 LLYPTEDRNDRLRDSYFFTCECQECTTRD-----KDKAKVE-IRKLTDPPKAEAIRD--- 244

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    +VR    + +           +  L  C   QE +S+          ++  
Sbjct: 245 ---------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFED 285

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   
Sbjct: 286 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 345

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 346 LENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
           boliviensis]
          Length = 433

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSERLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
 gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
 gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
          Length = 433

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
          Length = 433

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
          Length = 380

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 58  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 117

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 118 SLIVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 177

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 178 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 219

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R   D         +VR    + +           +  L  C   QE +S+  
Sbjct: 220 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 277

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 278 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 329

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 330 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378


>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
          Length = 433

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSRHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALLYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
           mutus]
          Length = 383

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 166/350 (47%), Gaps = 32/350 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 60  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 119

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 120 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 179

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  +D
Sbjct: 180 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 233

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
                 +RD            +VR ++  I++     +  S  +  L  C   QE +S  
Sbjct: 234 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPPSELLEICELSQEKMSC- 280

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
                    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +
Sbjct: 281 ---------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNV 331

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
              +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 332 ASMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 381


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
           mulatta]
          Length = 433

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
          Length = 433

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 231 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
          Length = 385

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 165/341 (48%), Gaps = 29/341 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
           T ++    V+   +H+  +D EK+ L+   IA L +     L++P+ +   +   F+++ 
Sbjct: 68  TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHLEYPDNAA--LVVLFAQVN 125

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +  G EV  SY
Sbjct: 126 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSY 185

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           I+    T  R   L++ Y F C C  C+   +  D ++  I    R  +D  S   +RD 
Sbjct: 186 IDLLYPTEDRNDRLRDSYFFNCDCRECVTKEK--DKEKLEI----RKLNDPPSAETVRD- 238

Query: 195 DDKGFTCQQCGLVRSK-EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
                        R+  EE ++      IL   TLA        E++   ++       +
Sbjct: 239 --------MIKYARNVIEEFRRAKHYKYILCLTTLAC-------ELLEICELSLDKMGAV 283

Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
           +   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L
Sbjct: 284 FEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRL 343

Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
              L +    +K++  A+ I+ + HG + P++ E+  +LE+
Sbjct: 344 YMALENRAAGVKALKRAIAIMEVAHGKDHPYISEIKKELED 384


>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
          Length = 384

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 62  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFLDND 121

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 122 SLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 181

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                    KD 
Sbjct: 182 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 223

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             +   +R   D         +VR    + +           +  L  C   QE +S+  
Sbjct: 224 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 281

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 282 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 333

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 334 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 143/277 (51%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G      
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                     D++ +   Q       E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 279 ----------DEQVWKGVQ-------ESLKKI-EELKTHWKWEQVLAMC---QSIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
           familiaris]
          Length = 523

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/349 (23%), Positives = 158/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   +E
Sbjct: 201 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDSE 260

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 261 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 320

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                       
Sbjct: 321 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTT--------------------- 359

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D      ++       E I+ +      + ++        +  E++   +
Sbjct: 360 -------KDKDKAKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICE 412

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
           + ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 413 LSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVA 472

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L D     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 473 SMWLKLGRLYMGLEDKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 521


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 83/277 (29%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ T+CN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
           boliviensis boliviensis]
          Length = 369

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDI 170

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++T  ++  R+K L++QY F C C RC    Q  D               
Sbjct: 171 EVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRC----QTQD--------------- 211

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E++ K   EV    K    L +    ++V++  +
Sbjct: 212 -------KDAD----------MLTGDEQVWK---EVQKSLKTIEELKAHWKWEQVLAMCQ 251

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I     +     ++  ++  +  +   + L   ++AL Y   T+  Y+  +P  HP+ G
Sbjct: 252 AIISSNSERLPDINIYQLKVLDCAMDACIHLGLLEKALFYGTRTMEPYRIFFPGSHPVRG 311

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 312 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 433

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 153/337 (45%), Gaps = 30/337 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
           T ++    V+   +H+  +D EK+ L+ + I+ L     +  E   NE +   F+++ CN
Sbjct: 123 TPSEKLLAVKEFESHLDKLDNEKKDLIQSDISALHRFYSKHLEFPDNESLVTLFAQVNCN 182

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+
Sbjct: 183 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQAISPGEEVFTSYID 242

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   L++ Y FTC C  C                    KD   +   +R  +D
Sbjct: 243 LLYPTEDRNDRLRDSYFFTCECQECTT------------------KDKDKAKVKIRKLND 284

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    +VR    + +           +  L  C   QE +S+          ++  
Sbjct: 285 PPKADAVRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFED 334

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + +  +DW+ AL Y Q  I  Y + YP +   +   +   G+L   
Sbjct: 335 SNVYMLHMMYQAMGVCLYTQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 394

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 395 LENRAAGEKALRKAMAIMEVAHGKDHPYISEIKQEIE 431


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 138/275 (50%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC       + Q+            
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC-------ETQD------------ 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ + Y++   ++  R+K L+ QY F C C RC                       
Sbjct: 230 ETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC----------------------- 266

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSN- 316

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACIHLGLLEEALFYGLRTMEPYRIFFPGSHPV 368

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
 gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
          Length = 461

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 168/361 (46%), Gaps = 25/361 (6%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I  D K+L  L +  A L ++++ 
Sbjct: 102 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPKRLEHLESLHAVLTDMMVD 161

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   I + E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 162 SPSTVPNKTELMSIYGRLITNGFNILDGEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 221

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             + A++ +P    +++ ISYI+   +   R+  LKE Y F C C +C       D QE 
Sbjct: 222 LHIYAIEDMPCLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DRQEQ 275

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQC--GLVRSKEEIKKIASEVNILSKKTLALT 231
             +E   C    C   +    + K   C+ C  G+     +++   +EV  ++K  L   
Sbjct: 276 REMEAAVCPFRKCGSSI----NVKWTHCRNCQAGIT---PKLRNTYNEVMAMTKHHLESM 328

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
               + +V    K+    Q  + HP +V  ++T +   +  +E+  W EAL Y Q  +  
Sbjct: 329 KDVAYLDVC---KVCLDKQTGVLHPLNVWHVKTLDAAFEAAIEVGKWPEALNYGQQLLQG 385

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
           +++    +HPLLGL +   GK++ + G  + A+    EA  IL +THG     + E +L 
Sbjct: 386 FRKYNGPWHPLLGLLHMKLGKIQLYEGQCKEAMHHFEEARRILCVTHGREHRLIGEQLLP 445

Query: 352 L 352
           L
Sbjct: 446 L 446


>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
           cuniculus]
          Length = 394

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 159/339 (46%), Gaps = 34/339 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
           T ++    VE   +H+  +D EK+ L+ + I+ L +     L++P+   + +   F+++ 
Sbjct: 84  TPSEKLLAVEEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDH--DSLVVLFAQVN 141

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SY
Sbjct: 142 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSY 201

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           I+    T  R + L++ Y FTC C  C         ++ A +E  R  D       +RD 
Sbjct: 202 IDLLYPTEDRNERLRDSYFFTCECLECTTKD-----KDKAKVEIRRLSDPP-KAEAIRD- 254

Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
                      +VR    + +              L  C   QE + +          ++
Sbjct: 255 -----------MVRYARNVIEEFRRAKHYKSPPELLEICELSQEKMGS----------VF 293

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
              +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L 
Sbjct: 294 EDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLY 353

Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             L +     K++ +A+ I+ + HG + P++ E+  +LE
Sbjct: 354 LGLENKAAGEKALRKAIAIMEVAHGKDHPYISEIKQELE 392


>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
          Length = 536

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 214 RILAKQKIYPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 273

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 274 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 333

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R   D
Sbjct: 334 KTGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 387

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S+  
Sbjct: 388 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 433

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 434 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 485

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 486 SMWLKLGRLYMGLEYKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 534


>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
          Length = 433

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/351 (24%), Positives = 159/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L       L++P+  
Sbjct: 111 RILAKQKIHPERTPSEMLLAVKEFESHLDKLDNEKKDLIQSDIAMLHQFYSKHLEFPDH- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            V  G EV  SYI+    T  R   L++ Y FTC C  C                     
Sbjct: 229 EVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------- 269

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                    RD D      ++       E I+ +      + ++        +  E++  
Sbjct: 270 ---------RDKDKAKVEIRKLSNPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ ++    ++   +V ++    + + + + L+DW+ AL Y Q  I  Y R YP +   
Sbjct: 321 CELSQEKMSAVFEDSNVYMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSRHYPLYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYLGLENRVAGERALRKALAIMEVAHGKDHPYISEIKREIE 431


>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
 gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
 gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
          Length = 433

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 170

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  +D
Sbjct: 231 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 284

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S   
Sbjct: 285 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC-- 330

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNVA 382

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 383 SMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 141/275 (51%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDV 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G      
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                     D++ +          +E +K I  E+    K    L  C   Q ++S+  
Sbjct: 279 ----------DERVWK-------EVQESLKHI-EELKAHWKWEQVLAMC---QAIISSNS 317

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
             E+L        ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 318 --ERLPD-----LNIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405


>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
 gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
          Length = 448

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 142/292 (48%), Gaps = 20/292 (6%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V A++ +
Sbjct: 164 ELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHAIEDL 223

Query: 125 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
           P    +++ ISYI+   +   R+  LKE Y F C C +CI      D QE   +    C 
Sbjct: 224 PCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAAVCP 277

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--V 240
           +  C   +  D       CQ C       E++   +E+  L++ +L      + +EV  +
Sbjct: 278 NASCDASVNIDL----AKCQSCD-ASVTPELRSAYNEIMSLTQSSL-----DSMKEVAYL 327

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
              K+    Q+ + HP +V  ++T +   +  + +  W +AL Y Q  +P + + +  ++
Sbjct: 328 DVCKVCLAKQRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALEYGQQLLPGFAKYHGAWN 387

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
           PLLGL +   GK++ +  + + A+  + EA  IL +THG +   + E +  L
Sbjct: 388 PLLGLLHMKLGKIQLYERNYKQAVHHLQEAQRILNVTHGRDHRLLLEQLRPL 439


>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
 gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Cofactor Product Adohcy
 gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Methyltransferase Inhibitor
           Sinefungin
 gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 433

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 158/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V    +H+  +D EK+ L+ + IA L       L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
              +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                     
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 268

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                    +D D      ++       E I+ +      + ++        +  E++  
Sbjct: 269 --------TKDKDKAKVEVRKLSSPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 321 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
          Length = 395

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 89/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +   +  E   N+
Sbjct: 73  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 132

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 133 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 192

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  +D
Sbjct: 193 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 246

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +RD            +VR    + +           +  L  C   QE +S   
Sbjct: 247 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC-- 292

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 293 --------VFEDTNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNVA 344

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 345 SMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 393


>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
          Length = 402

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 155/338 (45%), Gaps = 30/338 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
           T+++    V+   +H+  +D EK+ L+ + I+ L +   +  E   N  +   F+++ CN
Sbjct: 92  TSSEKLLAVKEFESHLDKLDNEKRELIQSDISALHHFYSKHIEYPDNASLVTLFAQVNCN 151

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+
Sbjct: 152 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYID 211

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   LK+ Y FTC C  C                              +  D 
Sbjct: 212 LLYPTEDRNDRLKDSYFFTCECRECT----------------------------TKAKDK 243

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                ++       E I+ +      + ++           E++   ++ ++    L+  
Sbjct: 244 AKVEIRKLSEPPKAESIRDMVKYARNVIEEFRRAKHYKTPSELLEICELSQEKMGSLFED 303

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   
Sbjct: 304 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 363

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           L    + +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 364 LEHKASGVKALKKAIAIMEVAHGKDHPYISEIKKEIED 401


>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
           anatinus]
          Length = 412

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 160/338 (47%), Gaps = 30/338 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
           T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N  +   F+++ CN
Sbjct: 102 TPSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHIEYPDNASLVVLFAQVNCN 161

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G E+  SYI+
Sbjct: 162 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEIFTSYID 221

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   L++ Y FTC C  CI        ++ A +E  R  +D      +RD   
Sbjct: 222 LLYPTEDRNDRLRDSYFFTCECRECITKE-----KDKAKVE-IRKLNDPPKAEAIRD--- 272

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    +++    + +           +  L  C   QE + +          ++  
Sbjct: 273 ---------MIKYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS----------VFED 313

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   
Sbjct: 314 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 373

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           L +    +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 374 LENKSAGVKALKKALTIMEVAHGKDHPYISEIKKEIED 411


>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
          Length = 386

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/346 (25%), Positives = 160/346 (46%), Gaps = 35/346 (10%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + +A L +     L++P+  
Sbjct: 64  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDVAALHHFYSKHLEFPDS- 122

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 123 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 181

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 182 EINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTTKD-----KDKAKVE-IRKL 235

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
            D      +RD            +VR    + +           +  L  C   QE +S+
Sbjct: 236 SDPPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 283

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 284 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 333

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +   +   G+L   L +     +++ +A+ I+ + HG + P++ E+
Sbjct: 334 VASMWLKLGRLHMGLENKAAGERALKKAIAIMEVAHGKDHPYISEI 379


>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
           griseus]
          Length = 404

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/351 (23%), Positives = 159/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L       L++P+ S
Sbjct: 82  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHQFYSRHLEFPDHS 141

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
              +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 142 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 199

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                     
Sbjct: 200 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 239

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                    +D D      ++       E ++ +      + ++        +  E++  
Sbjct: 240 --------TKDKDKAKVEIRKLSSPPQAEAVRDMVRYARNVIEEFRRAKHYKSPSELLEI 291

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 292 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 351

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 352 VASMWLKLGRLYMGLENKAAGQKALKKAIAIMEVAHGKDHPYISEIKQEIE 402


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC         Q+            
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC-------PTQD------------ 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I     +     ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
           griseus]
          Length = 428

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC    Q  D               
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTHD--------------- 270

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D+D          ++   E+I K   EV    KK   L +    ++V++  +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            I           ++  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G
Sbjct: 311 AIISSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRG 370

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           +Q    GKL+   G    A+K++  A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405


>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
 gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
          Length = 433

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 80/339 (23%), Positives = 155/339 (45%), Gaps = 34/339 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
           T ++    V    +H+  +D EK+ L+ + IA L       L++P+ S   +   F+++ 
Sbjct: 123 TPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVN 180

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SY
Sbjct: 181 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSY 240

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           I+    T  R   L++ Y FTC C  C                              +D 
Sbjct: 241 IDLLYPTEDRNDRLRDSYFFTCECRECT----------------------------TKDK 272

Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
           D      ++       E I+ +      + ++        +  E++   ++ ++    ++
Sbjct: 273 DKAKVEIRKLSNPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVF 332

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
              +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L 
Sbjct: 333 EDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLY 392

Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             L +     K++ +A+ I+ I HG + P++ E+  ++E
Sbjct: 393 MGLENKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431


>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
 gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
          Length = 468

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 173/369 (46%), Gaps = 26/369 (7%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++  
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAD 174

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   + ++++  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNVLDTDMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V A++ +     +++ ISYI+   +   R+  LKE Y F C C +C       D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +  D  +    C +C    S     K+ +  N      L L + 
Sbjct: 289 KEMLAALCPNRNCGVGISVDRTN----CPRCDAGISP----KLRNAFN--DAMALTLHNL 338

Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
            N ++V  +   K+    Q  ++HP +V  ++T +   +  +E+  W +AL Y Q  +P 
Sbjct: 339 ENMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 398

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
           +++ +  ++PLLGL +   GK++ + G  + A+  + EA  IL +THG +   + E L L
Sbjct: 399 FRKYHGPWNPLLGLVHMKLGKIQLYTGQCKEALHHLEEAQRILTVTHGRDHRLLTEQLYL 458

Query: 351 KLEEAQAEA 359
            + +A+ +A
Sbjct: 459 LILQARQQA 467


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 161/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L +P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
            D      +RD            +VR    + +           +  L  C   QE +S+
Sbjct: 283 SDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 330

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
 gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=HSKM-B; AltName: Full=Histone methyltransferase
           SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
           AltName: Full=SET and MYND domain-containing protein 2
 gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
 gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
 gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
          Length = 433

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 161/351 (45%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L +P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
            D      +RD            +VR    + +           +  L  C   QE +S+
Sbjct: 283 SDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 330

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
          Length = 417

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 88/351 (25%), Positives = 157/351 (44%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L +P+  
Sbjct: 95  RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 153

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 154 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 212

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                    K
Sbjct: 213 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TK 254

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
           D   +   +R   D         +VR    + +           +  L  C   QE +S+
Sbjct: 255 DKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 314

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 315 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 364

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 365 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEME 415


>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
 gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
          Length = 433

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/351 (25%), Positives = 163/351 (46%), Gaps = 35/351 (9%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L++P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDS- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSXIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 229 EIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
           +D      +RD            +VR    + +           +  L  C   QE +S 
Sbjct: 283 NDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC 330

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLN 380

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 381 VASMWLKLGRLYMGLENKAAGERALRKAIAIMEVAHGKDHPYISEIKQEIE 431


>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
           anubis]
          Length = 258

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 134/274 (48%), Gaps = 39/274 (14%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
            G E+ I Y++   ++  R+K L++QY F C C RC    Q  D                
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 100

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
                 +D+D          ++   E++ K   EV    KK   L +    ++V++  + 
Sbjct: 101 ------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQA 141

Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
           I     +     +V  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+
Sbjct: 142 IISSNSERLPDINVYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGV 201

Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 202 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3-like [Apis florea]
          Length = 414

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/324 (27%), Positives = 158/324 (48%), Gaps = 53/324 (16%)

Query: 24  YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAH 78
           Y   + L++H SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+ 
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSF 168

Query: 79  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIE 136
            I + ++  +G G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+
Sbjct: 169 NISDLDMNNIGAGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYID 228

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSD 195
              +T  R++ L+  Y F C C +C         +ES  ++E   C +            
Sbjct: 229 VIKTTKDRREELQNSYYFWCNCKKC---------EESEPMVEAAACPN------------ 267

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
                       R+  EI  + +  ++ + K +A          VST  M  K Q+ + H
Sbjct: 268 ------------RTFYEISDLTA-YHLQNMKNIAYLD-------VST--MCLKKQEGVLH 305

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           P ++  +QT +      + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ 
Sbjct: 306 PLNIQHVQTLQSAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQV 365

Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
           +L + + AI+++ +A  IL ITHG
Sbjct: 366 YLKNXKQAIETLRKASAILTITHG 389


>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
           abelii]
 gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
           gorilla gorilla]
 gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
 gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
          Length = 258

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/276 (30%), Positives = 133/276 (48%), Gaps = 43/276 (15%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 96

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 97  ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 147

Query: 246 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 148 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 199

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
           Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
           Methionine
          Length = 429

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 43/277 (15%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++ E F+K+ CN+ TICN+E + +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEXQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 231 EVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRC----------------------- 267

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 268 -----QTQDKDADXLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAXC---QAIISSNS 318

Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             E+L         +N+ Q +  +      + L   +EAL Y   T   Y+  +P  HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAXDACINLGLLEEALFYGTRTXEPYRIFFPGSHPV 369

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            G+Q    GKL+   G    A K++  A +I R+THG
Sbjct: 370 RGVQVXKVGKLQLHQGXFPQAXKNLRLAFDIXRVTHG 406


>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
          Length = 326

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 27/319 (8%)

Query: 38  DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 94
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 30  NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 87

Query: 95  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 88  DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 147

Query: 155 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 214
           TC C  C                    KD   +   +R    K  +  Q   VR      
Sbjct: 148 TCECRECT------------------TKDKDKAKVEIR----KLSSPPQAEAVRDMVRYA 185

Query: 215 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 274
           +   E    +K      S     E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 186 RNVIEEFRRAKHYKYNLSVSPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 245

Query: 275 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 334
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 246 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 305

Query: 335 RITHGTNSPFMKELILKLE 353
            + HG + P++ E+  ++E
Sbjct: 306 EVAHGKDHPYISEIKQEIE 324


>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
          Length = 436

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/350 (25%), Positives = 159/350 (45%), Gaps = 31/350 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+   IA L +   +  E   N 
Sbjct: 114 RILAKQKIHPERTQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNA 173

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +
Sbjct: 174 ALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEI 233

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C    +  D ++  I    R  +D
Sbjct: 234 EPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTM--KEKDKEKLKI----RKLND 287

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
             S   +RD            ++      K   S   +L    L+L   G          
Sbjct: 288 PPSAEAVRDMIKYARN-----VIEEFRRAKHYKSPSELLEICELSLDKMG---------- 332

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                   ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 333 -------AVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVA 385

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
             +   G+L   L +     K++ +A+ I+ + HG + P++ E+  +LE+
Sbjct: 386 SMWLKLGRLYMALENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 435


>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
 gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
          Length = 448

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 150/295 (50%), Gaps = 20/295 (6%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           ++NE+   + +L  N   I ++E+  + TG+Y  +SI +HSC PNAV  FEG    V A+
Sbjct: 161 NMNELMSIYGRLITNGFNILDAEMNSIATGIYLGVSITDHSCQPNAVATFEGNELHVHAI 220

Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           + +P    +++ ISYI+   +   R+  LKE Y F C C +CI      D QE   +   
Sbjct: 221 EDLPCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAA 274

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
            C +  C   +    D     C+ CG   S E+++   SEV  L++ +L      + +EV
Sbjct: 275 VCPNGSCDASVNIQLD----KCESCGTPVS-EQLRTDYSEVMALTQSSL-----DSMKEV 324

Query: 240 --VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
             +   K+     + + HP +V  ++T +   +  +++  W EAL Y +  +P + + + 
Sbjct: 325 AYLDVCKVCLAKHRGILHPLNVWHVKTLDAAFEASIQVGKWTEALDYGRQLLPGFAKYHG 384

Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
           +++PLLGL     GK+E +  + + A++ + EA  IL +THG +   + E +  L
Sbjct: 385 EWNPLLGLLQLKLGKIELYERNYKQALQHLQEAQRILNVTHGRDHRLLLEQLRPL 439


>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
          Length = 436

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/350 (23%), Positives = 157/350 (44%), Gaps = 31/350 (8%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
           R L    + P  T     L V+   +H+  +D EK+ L+   IA L +   +  E   N 
Sbjct: 114 RILAKQKIHPERTQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNA 173

Query: 66  -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +
Sbjct: 174 ALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEI 233

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G EV  SYI+    T  R   L++ Y FTC C  C                       
Sbjct: 234 EPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECT---------------------- 271

Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
                 +++ D +    ++     S E ++ +      + ++           E++   +
Sbjct: 272 ------MKEKDKEKLKIRKLNDPPSAEAVRDMIKYARNVIEEFRRAKHYKPPSELLEICE 325

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
           +       ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   + 
Sbjct: 326 LSLDKMGAVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVA 385

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
             +   G+L   L +     K++ +A+ I+ + HG + P++ E+  +LE+
Sbjct: 386 SMWLKLGRLYMALENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 435


>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
           carolinensis]
          Length = 437

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 152/329 (46%), Gaps = 30/329 (9%)

Query: 27  VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSE 84
           V+   +H+  +D EK  L+ + I+ L +   +  E   N  +   F+++ CN  TI + E
Sbjct: 135 VKEFESHLDKLDNEKMELIQSDISALHHFYSKHIEYPDNAALVVLFAQVNCNGFTIEDEE 194

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R
Sbjct: 195 LSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIEPGDEIFTSYIDLLYPTEDR 254

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
              LK+ Y FTC C  CI   +  D  E   L      ++      +RD           
Sbjct: 255 NDRLKDSYFFTCDCRECITKVKDKDKLEIRKL------NEPPPAEAVRDMIKYARN---- 304

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            ++      K   S   +L    L+L   G+                 ++   +V ++  
Sbjct: 305 -VIEEFRRAKHYKSPSELLEICELSLEKMGS-----------------VFAESNVYMLHM 346

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
             + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +
Sbjct: 347 MYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRTAGV 406

Query: 325 KSMTEAVEILRITHGTNSPFMKELILKLE 353
           K++ +A+ I+ I HG + P++ E+  +LE
Sbjct: 407 KALKKAIAIMEIAHGKDHPYVIEIKKELE 435


>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
          Length = 251

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 82/267 (30%), Positives = 129/267 (48%), Gaps = 43/267 (16%)

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y
Sbjct: 3   CNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICY 62

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           ++   ++  R+K L++QY F C C RC                              +D 
Sbjct: 63  LDMLMTSEERRKQLRDQYCFECDCFRCQT----------------------------QDK 94

Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
           D    T  +      +E +KKI  E+    K    L  C   Q V+S+    E+L     
Sbjct: 95  DADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAVISSNA--ERLP---- 144

Query: 255 HPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
               +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GK
Sbjct: 145 ---DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGK 201

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHG 339
           L+   G    A+K++  A +I+R+THG
Sbjct: 202 LQLHQGMFPQAMKNLRLAFDIMRVTHG 228


>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
           gallopavo]
          Length = 575

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 86/338 (25%), Positives = 156/338 (46%), Gaps = 30/338 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
           T ++    V+   +H+  +D EK+ L+   IA L +   +  E   N  +   F+++ CN
Sbjct: 265 TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCN 324

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+
Sbjct: 325 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGILAEVRAVKEIEPGEEVFTSYID 384

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   L++ Y FTC C  C    +  D ++  I    R  +D  S   +RD   
Sbjct: 385 LLYPTEDRNDRLRDSYFFTCDCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRD--- 435

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    ++      K   S   +L    L+L   G                  ++  
Sbjct: 436 --MIKYARNVIEEFRRAKHYKSPSELLEICELSLDKMG-----------------AVFED 476

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   
Sbjct: 477 SNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKYYPSYSLNVASMWLKLGRLYMA 536

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           L +     K++ +A+ I+ + HG + P++ E+  +LE+
Sbjct: 537 LENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 574


>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 450

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 162/351 (46%), Gaps = 18/351 (5%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V    +H+  +D EK+ L+ + IA L       L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
              +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD-----KDKAKVE-VRKL 282

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                   +RD            ++      K     +++   +  A        E++  
Sbjct: 283 SSPPQAEAIRDMVRYARN-----VIEEFRRAKHYKYNLSVWPARGTAPCLTDGVCELLEI 337

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 338 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 397

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 398 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 448


>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 152/329 (46%), Gaps = 41/329 (12%)

Query: 33  HMSDIDEKQLL--LYAQI-----ANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 85
           H+SD+ + Q L  L+AQ+     A    LI   P  S+N          CN  TI + EL
Sbjct: 179 HISDLPDDQALTQLFAQVRSRTGAGGEKLIRGSPPASVN----------CNGFTIEDEEL 228

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
             LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R+
Sbjct: 229 SHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGDEIFNSYIDLLYPTEDRK 288

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + L + Y FTC C  C    + ++  +S   +             L   + +        
Sbjct: 289 ERLLDSYFFTCQCAECTTRSKDEEKMKSTKPKSS-----------LEPEEVQSMVVYARN 337

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
           L+   EE ++     NILS     L  C      +S  KM       ++   +V ++   
Sbjct: 338 LI---EEFRRAKHYKNILSFPVSLLEMCE-----LSLDKM-----GSVFADTNVYMLHMM 384

Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
            + + + + ++DW  A++Y +  I  Y   YP +   +   Y   G+L   L  T   +K
Sbjct: 385 YQAMGVCLYMQDWDGAMSYGEKIIQPYSVHYPAYSLNVASMYLKLGRLYLGLEKTPQGVK 444

Query: 326 SMTEAVEILRITHGTNSPFMKELILKLEE 354
           ++ +A+ I+ + HG +  ++ E+  ++EE
Sbjct: 445 ALKKALSIMEVAHGKDHYYVAEVKREIEE 473


>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
          Length = 436

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 159/341 (46%), Gaps = 36/341 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
           T ++    V+   +H+  +D EK+ L+   IA L +     L +P+ +   +   F+++ 
Sbjct: 126 TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHLDYPDNAA--LVVLFAQVN 183

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +  G E+  SY
Sbjct: 184 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVREIEPGEEIFSSY 243

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLRD 193
           I+    T  R   L++ Y F+C C  C    +  D ++  I     CK +D  S   ++D
Sbjct: 244 IDLLYPTEDRNDRLRDSYFFSCDCRECTT--KEKDKEKLEI-----CKLNDPPSAETVKD 296

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
                       ++      K   S   +L    L+L   G                  +
Sbjct: 297 MIKYARN-----VIEEFRRAKHYKSPSELLEICELSLDKMG-----------------AV 334

Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
           +   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L
Sbjct: 335 FEDSNVYMLHMMYQAMGVCLYVQDWEGALCYGQKIIRPYSKHYPSYSLNVASMWLKLGRL 394

Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
              L +    +K++  A+ I+ + HG + P++ E+  +LE+
Sbjct: 395 YMALKNRTAGVKALKRAIAIMEVAHGKDHPYISEIKKELED 435


>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
 gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
          Length = 457

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 141/282 (50%), Gaps = 16/282 (5%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           +I E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V  +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229

Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           + +P    +++ ISYI+   +   R+  L+E Y F C C +CI      D QE   +   
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
            C +  C   +    + +   CQQC       +++   +++   ++ +L       + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338

Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
            +    +EK Q+ + HP +V  ++T +   +  + +  W +ALA+ Q  +P + + +  +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395

Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
           +PL+GL Y   GK++ +  + + A+  + EA  IL +THG +
Sbjct: 396 NPLVGLLYMKLGKIQLYERNYKLAVHHLQEAQCILSVTHGHD 437


>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
 gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
          Length = 420

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 176/368 (47%), Gaps = 41/368 (11%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I  D K+L  L +  A L +++  
Sbjct: 86  LCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDRKRLEHLDSLHAVLTDMMAD 145

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 146 SPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 205

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             + A++ +P    +++ ISYI+   +   R++ LK+ Y F C C +C       D +E+
Sbjct: 206 LHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC------RDPKEA 259

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  CS  +  + ++    C++C +  S + ++   +E+  L+K  L     
Sbjct: 260 RQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALTKHNL----- 309

Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
                        E ++   Y    +++ +T +   +  +E+  W+EAL Y Q  +P +Q
Sbjct: 310 -------------EAMKDVAY----LDVCKTLDAACEAAIEVGKWEEALNYGQQLLPGFQ 352

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL- 352
             + Q++PL+GL +   GK++ F    + A   + EA  IL +THG +   + E +  L 
Sbjct: 353 MYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRLLVEQLYPLI 412

Query: 353 EEAQAEAS 360
            +A+ EA+
Sbjct: 413 FQARQEAN 420


>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
          Length = 432

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/347 (28%), Positives = 154/347 (44%), Gaps = 62/347 (17%)

Query: 26  LVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNS 83
           L+  + +H+ D D EK+      IA L     +  E+ S  ++   FS++ACN  TI + 
Sbjct: 129 LIGEMQSHLEDEDNEKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIEDD 188

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           EL  +GT +YP +++INHSCLP+ ++ F G LA VRAVQ +  G EVLISYI+    T  
Sbjct: 189 ELSHMGTAVYPDVALINHSCLPSVIVTFNGTLAQVRAVQDMKPGDEVLISYIDLLYPTDD 248

Query: 144 RQKALKEQYLFTCTCPRC-------IKL---GQFDDIQESAILEGYRCKDDGCSGF-LLR 192
           R   L+E Y FTC C  C       +KL    Q D I+   I    R   +    F  ++
Sbjct: 249 RNSRLRESYYFTCNCLECQNKQKDKVKLKVRKQSDPIEPQVISNMTRYAKNTIREFRAMK 308

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
            +       + C   +S EE+  +   S V +L     A+  C   Q+     +  EK+ 
Sbjct: 309 STKTPSELLEMC--EQSLEEMGAVFEDSNVYVLHMMYQAMGVCIYMQDADGALRYGEKVA 366

Query: 251 K---KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           +   K YHP+S+N+     KL +I M                                  
Sbjct: 367 RYYSKRYHPYSLNVSSLYLKLGRIYMA--------------------------------- 393

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
                    LG     I ++T+A+ I+ + HG +  ++K++  +L +
Sbjct: 394 ---------LGRHSAGINALTKAMAIMEVAHGKDHQYLKDIRKELSQ 431


>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
 gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
          Length = 457

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 141/282 (50%), Gaps = 16/282 (5%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           +I E+   + +L  N   I ++E+  + T +Y  +SI +HSC PNAV  FEG    V  +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229

Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           + +P    +++ ISYI+   +   R+  L+E Y F C C +CI      D QE   +   
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
            C +  C   +    + +   CQQC       +++   +++   ++ +L       + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338

Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
            +    +EK Q+ + HP +V  ++T +   +  + +  W +ALA+ Q  +P + + +  +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395

Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
           +PL+GL +   GK++ +  + + A+  + EA  IL +THG +
Sbjct: 396 NPLVGLLHMKLGKIQLYERNYKLAVHHLQEAQRILSVTHGHD 437


>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
           glaber]
          Length = 252

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/270 (30%), Positives = 138/270 (51%), Gaps = 43/270 (15%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ 
Sbjct: 1   QVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFSGPHLLLRAVRDIEVGEELT 60

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           I Y++   ++  R+K L++QY F C C RC    Q  D +++ +L G             
Sbjct: 61  ICYLDMLMTSEERRKQLRDQYCFDCDCFRC----QTQD-KDADMLTG------------- 102

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D++ +   Q       E +KKI  E+    K    L  C   Q ++S+    E+L  
Sbjct: 103 ---DEQVWKGVQ-------ESLKKI-EELKAHWKWEQVLAMC---QSIISSNS--ERLP- 145

Query: 252 KLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
                  +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q   
Sbjct: 146 ------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMK 199

Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 VGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 229


>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 140/273 (51%), Gaps = 33/273 (12%)

Query: 70  FSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
           F  ++CN  +I +   L+ LG  ++P +++INH C PN V V  G    VRA++ + +G 
Sbjct: 279 FGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNVVAVSCGPNIFVRAIKPIKEGD 338

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
           E+ ISYI+T+ ++ TR+  LK+QY F CTC  C + G+ D+++ + ++      +   S 
Sbjct: 339 ELFISYIDTSATSETRKNILKDQYYFDCTCKMC-ESGEKDELKSAYVMPKEDVSEKR-ST 396

Query: 189 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN--HQEVVSTYKMI 246
           ++ +++D                 +KKI S   + + + +A  + G    QE +     +
Sbjct: 397 YIEKNTDIM---------------MKKIESSKKVQAWERVAAQAGGCLLQQENLFDDTHL 441

Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
           +KL           ++QT  ++  IL   +D   A  Y +  +  Y+++YP++   +G+Q
Sbjct: 442 KKLM----------ILQTCSEVSAILNHYDD---AAQYAERVLAAYEQLYPEYSTQIGMQ 488

Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
            Y  G   W L   E+AIK +  A++ L ITHG
Sbjct: 489 AYRLGVHYWHLQRVEDAIKMLGRALKHLEITHG 521


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 168/369 (45%), Gaps = 23/369 (6%)

Query: 8   RKLQ--NDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQI-ANLVNLILQWPEIS 62
           RKLQ   D      T   Y     L+ H  DI  D  ++  +  +   L +L+ +    S
Sbjct: 91  RKLQKGGDYQKGYYTAKFYRRFHDLMTHEDDIKNDAPRMEHFQSLLVVLRSLVEEAAMPS 150

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
             E+ + F K+  N+  I + E+  +GTG+Y  ISII+HSC PNA+  F+G    +R ++
Sbjct: 151 KAELLQIFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNALATFDGTTIHLRLLE 210

Query: 123 HVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
               G+EV      ISYI+       R+  L+ QY F C C RC       D QE  ++ 
Sbjct: 211 DY-HGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERC------RDEQEQQLMI 263

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
              C +  C    L   D K   C +C     K   +++  EV   +   LA      + 
Sbjct: 264 AGACPNVDCDE-PLSMLDRKVEKCPRCDTA-IKHSAREMFREVTDFTLAQLAKMKDITYM 321

Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
           ++    ++  K Q+ + H ++V  ++T +   +  + ++ W++A+ Y       +++   
Sbjct: 322 DIC---QLCLKKQENILHYYNVWYLKTLDLAFESAINMQKWEDAIGYGLRLKDGFKKYNG 378

Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
            FHPL GL     GK++ +      A++++ ++ +ILR+THG      K  ++ L   QA
Sbjct: 379 PFHPLYGLLLLKIGKIQLYTKHVVEALQNINDSEKILRVTHGEEHDLYKSQLIPL-LCQA 437

Query: 358 EASYKLSSK 366
              Y   +K
Sbjct: 438 AGEYDALNK 446


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 157/341 (46%), Gaps = 43/341 (12%)

Query: 18  STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
           ST +D   + ++ L  H++D+ E +L  L   I N ++    WP  S    +++I+  F 
Sbjct: 114 STVSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFG 170

Query: 72  KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
            + CN  ++ +   L+ +G GL+P + ++NH C PN  ++       +R++  + +G E+
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEEL 230

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
            ++Y++    +  RQ+ LK QY F CTC  C      + I++   L G            
Sbjct: 231 TVAYVDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLGG------------ 273

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKL 249
            R+ D         G+  S+E++K+       + +K       G++ EVV   +  IEK 
Sbjct: 274 -REVD---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKT 323

Query: 250 QKKLY--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           +  L   H + + +  T  ++   L  L D   A  Y +  +  Y ++YP  +  LG+  
Sbjct: 324 EPVLAETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAA 380

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G + W  GD E A   + +A  IL +THG   P  K+L
Sbjct: 381 MRAGVIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 421


>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
          Length = 425

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 160/350 (45%), Gaps = 39/350 (11%)

Query: 20  TTDNYSLVEALVAHMSDI--DEKQL----LLYAQIANLVNLILQWPEISINEIAENFSKL 73
           T +NY     L++H SD+  D K +     LY  +A  ++  L  P  SI E+   + K+
Sbjct: 103 TKENYRKFTDLMSHYSDMRADAKHMEPFIKLYRFLAEFLDETLM-P--SIAELISIYGKM 159

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
           + N   I +  L  +G G+Y   S++ HSC PNAV  FEG   ++           + IS
Sbjct: 160 STNRLNIQDEWLNVIGFGIYLGASVVGHSCKPNAVATFEGTTIII-----------IKIS 208

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           Y++   S   R++ L   Y F C C RC K        E ++ E   C +  C      D
Sbjct: 209 YVDLIKSKKDRREELYSLYHFWCDCERCEK--------EESLTEAAACPNSSCDSPCSID 260

Query: 194 SDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
           +D     C++C     V+ KE  +++    +   +  +++    N  +  S  K I + Q
Sbjct: 261 ADK----CEKCNTRISVKFKETFQEVVDLTDYHFEHIISILF-FNDLDTFSWIKEILEKQ 315

Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
           K + H F+V+L+ T  K+      L+ WK+   Y +     Y   Y + +P +G  Y T 
Sbjct: 316 KNVMHKFNVHLICTLRKIHCAAWNLKCWKDVEFYGKKLEAGYLFYYGKVYPPIGTLYLTL 375

Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           G     LG  + A K++ +A  +L+ITHG     +KE+   LE  + +AS
Sbjct: 376 GIALLHLGKPKEAFKTLNKAYTVLKITHGNKHFLVKEI---LEPLRFDAS 422


>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
 gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
          Length = 459

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 173/368 (47%), Gaps = 22/368 (5%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
           + +L LR +   D +    T         L++H ++I    + L    +  A L +++ +
Sbjct: 106 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 165

Query: 58  WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
            P    N  E+   + +L  N   + ++E+  + T +Y  +SI +HSC PNAV  FEG  
Sbjct: 166 SPSTVPNKTELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 225

Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             + A++ +     +++ ISYI+   +   R+  LKE Y F C C +CI      D QE+
Sbjct: 226 LHIHAIEDMECLDWSKIFISYIDLLNTPEERRLDLKEHYYFLCCCSKCI------DPQET 279

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
             +    C +  C   +    + +   C++C    S + ++   + V   ++  L     
Sbjct: 280 KEMMAALCPNRNCGIGI----NPERTNCKRCDTGISPK-LRNAYNSVMAFTRTNLEAMKD 334

Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
             + +V    K+    Q  + HP +V  ++T +   +  +++  W +AL Y Q  +P Y+
Sbjct: 335 VAYLDVC---KVCLNKQTGVLHPLNVWHVKTLDAAFEAAIDVGKWTDALDYGQRLLPGYR 391

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           + +  ++PLLGL +   GK++      + A+  + +A+ IL +THG +   + E +  L 
Sbjct: 392 KYHGAWNPLLGLLHMKLGKIQLHEQRAKEALPHLEDALRILTVTHGRDHRLLAEQLYPLL 451

Query: 354 -EAQAEAS 360
            +A+ EA+
Sbjct: 452 IQARHEAN 459


>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 157/336 (46%), Gaps = 55/336 (16%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
           LV +  D+D+++LLL A +A  +NL+ +    S++E+A   S+   N  +IC+  L  LG
Sbjct: 132 LVWYEEDMDDQELLLLAALAQKINLVDE--SFSMDEMARMLSRFRNNNFSICDELLLELG 189

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            G +P+ ++INHSC PN  + F  +   +  RA++ +  G E+  +Y++ A     RQ+ 
Sbjct: 190 AGCFPLGAMINHSCDPNCAVTFVPKTLDMEFRAMKPIKSGEEITQTYVDIALPRRERQQR 249

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
           L+ +Y FTC CPRC +      +QE   L+ Y             D+D  G   ++    
Sbjct: 250 LQRKYHFTCGCPRCSQ-----PLQEPGSLDAY------------LDADIDGVPQERWTEE 292

Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
           R +E                          + +   + +   Q  + H  SV  +QT   
Sbjct: 293 RQQE--------------------------QCIDALQNLADRQSTILHCHSVARLQTLAT 326

Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL-----EWFLGDTEN 322
           +    ME    +EA+ Y +  +  Y+RVY   HP+ GL  +T G L     +   G  ++
Sbjct: 327 IFSAEMERGSVEEAIGYGERMLEFYRRVYNPNHPMTGLHVFTLGDLYGQQAQAGTGAEDS 386

Query: 323 AIKS---MTEAVEILRITHGTNSPFMKELILKLEEA 355
             KS   + EA  IL+ITHG +  F+K L  +LE A
Sbjct: 387 KQKSAEYLAEAQRILQITHGKDHRFVKMLGDRLETA 422


>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
 gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 435

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 164/361 (45%), Gaps = 41/361 (11%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQ 57
           ++L  R  L+  +    T ++    +  L AH+  +D EK  +    IA L +     L 
Sbjct: 106 VRLVARIILKQKHQTERTPSERVLTLRELEAHLDKLDNEKNEMNDTDIAALHHFYSRHLD 165

Query: 58  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           +P+ +   + E  +++ CN  TI + EL  LG+ L+P ++++NHSC PN ++ ++G +A 
Sbjct: 166 FPDNAA--LTELIAQVNCNGFTIEDEELSHLGSALFPDVALMNHSCSPNVIVTYKGTVAE 223

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
           VRAVQ +    E+  SYI+    T  R + LK+ Y F C C  C                
Sbjct: 224 VRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTS-------------- 269

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
             + KD+            K    Q+  +   +EEIK++     I ++  +       H 
Sbjct: 270 --KSKDEA-----------KMEIRQKLSIPPEEEEIKQMV----IYARNVIEEFRRAKHY 312

Query: 238 EVVSTYKMIEKLQKK----LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
           +  S    I +L  +    ++   +V ++    + + + + ++DW  A+ Y +  I  Y 
Sbjct: 313 KTPSELLEICELSMEKMGAIFAETNVYMLHMMYQAMGVCLYMQDWDGAMKYGEKIIHPYS 372

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
             YP +   +   Y   G+L   L      +K++ +A+ I+ I HG + P++ E+  ++E
Sbjct: 373 VHYPPYSLNVASMYLKLGRLYLGLEKRTQGVKALKKALAIMDIAHGKDHPYIDEIKKEME 432

Query: 354 E 354
           E
Sbjct: 433 E 433


>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
          Length = 421

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 148/336 (44%), Gaps = 55/336 (16%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
           LV +  D+++++L+L A +A  + L+      S++E+    S+   N  +IC+  L   G
Sbjct: 130 LVWYEEDMEDQELMLLAALAQKLELVD--GSYSMDEMLRMLSRFRNNNFSICDELLLEQG 187

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            G +P+ ++INHSC PN  + F  +   +  RA++ +  G E+  +Y++ A     R + 
Sbjct: 188 AGCFPLGAMINHSCDPNCAITFVPKTLEMEFRAMRPIKAGEEITQTYVDVALPRRERHER 247

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
           L+ +Y F C C RC        +QES  L+ +             D+D  G   +Q    
Sbjct: 248 LQRKYHFNCACSRCSV-----PLQESGSLDAF------------LDADIDGVPKEQWSQE 290

Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
           R  E                          + +   + + + Q  + H  S+  +QT   
Sbjct: 291 RKDE--------------------------QCIDALQKLAERQSNILHRDSIARLQTLAT 324

Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL-----GDTEN 322
           +    ME    +EA+ Y +  +  YQRVY   HP+ GL  +T G L   L     G   +
Sbjct: 325 IFSAEMERGSVEEAVVYGEKMLEFYQRVYNANHPMTGLHLFTLGDLYAQLAQVGTGPENS 384

Query: 323 AIKS---MTEAVEILRITHGTNSPFMKELILKLEEA 355
             KS   +TEA  IL+IT G    F+K L  +L+ A
Sbjct: 385 KAKSSEYLTEARRILQITQGKEHRFVKMLADRLQAA 420


>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 442

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 81/307 (26%), Positives = 144/307 (46%), Gaps = 28/307 (9%)

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
           +E+   F K+  N  TI N +L  +G GLY  +S+++HSC P+A ++F G  AV+R ++ 
Sbjct: 142 SELLTIFGKVMVNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDAFVLFNGTKAVLRPLKQ 201

Query: 124 --VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
                 + + I+Y +    T  R+  LK+Q+ FTC C  C+      D++        RC
Sbjct: 202 YITAYDSSLRIAYCDLLDLTSMRRNQLKQQFFFTCECSACL------DLEREKTARSVRC 255

Query: 182 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEI---KKIASEVNILSKKTLALTSCGN 235
           +     G+   D ++    C QCG    V   E +   +++  EV  L ++         
Sbjct: 256 R-HCVDGYCPLDVNENSLVCWQCGATSEVHVDEAVHLMQQVEWEVKRLQER--------- 305

Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
             + + T K +      +    ++ L Q  +++    ++  ++  A  Y + TI  ++R 
Sbjct: 306 RNDSICTAKDLYNEASIILSSLNIPLCQLADRIASWGIDNGNFSLAADYIEKTISCFKRF 365

Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDT-ENAIKSMTEAVEILRITHGTN---SPFMKELILK 351
           YP  HP L LQY+  GKL      T E A + + +A+E L  +HG     S    EL+ K
Sbjct: 366 YPTAHPSLSLQYFKAGKLHSLNDLTLELAAERLEQAIESLHASHGVQHSLSAEAHELLCK 425

Query: 352 LEEAQAE 358
           +++ + +
Sbjct: 426 VQQQRRD 432


>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
 gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
          Length = 430

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 122/257 (47%), Gaps = 53/257 (20%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR----LAVVRAVQHVPKG 127
           KL  N  +ICN E+  +G+G+Y   S+ NHSC+PN  ++F+      + ++     + +G
Sbjct: 182 KLQRNTFSICNEEMNAIGSGIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEG 241

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
             + I+Y++    T  RQK LKEQY FTCTCPRC  L   +++ E+              
Sbjct: 242 TPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC--LNSNEEVNENV------------- 286

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMI 246
                                  +++  IA+E     +K   L    N+ Q +VS  +  
Sbjct: 287 -----------------------DKMISIANE----HRKNQNLQEAVNYFQRIVSKKRSS 319

Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
             L  ++Y   ++N       L  + ++L D+K+A  +   ++  +++ YP F+PLLGLQ
Sbjct: 320 PSLFHEIYMGIALN------SLTHLNVDLSDFKKAYEFGIESLEYFEKYYPPFYPLLGLQ 373

Query: 307 YYTCGKLEWFLGDTENA 323
           Y TC KL  +L +   A
Sbjct: 374 YLTCAKLASYLEENGQA 390


>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
           rubripes]
          Length = 432

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 162/358 (45%), Gaps = 42/358 (11%)

Query: 3   KLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI 61
           ++  ++K+Q D       ++   L+  + +H  D+D E++  + A +A L     +  EI
Sbjct: 110 RILTKKKMQKDRC----ASEKLLLLGEMQSHTEDMDNERRETMEADVAGLHQFFSKHLEI 165

Query: 62  -SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
               ++   FS++ACN  TI + EL  LGT +YP +++INHSC P+ ++ + G  A VRA
Sbjct: 166 PGHKDLLTLFSQVACNGFTIEDEELSHLGTAVYPDVALINHSCRPSVIVTYSGTSAHVRA 225

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           ++ +  G EVLISYI+    T  R   L+E Y FTC C  C          ES       
Sbjct: 226 LRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEEC----------ES------- 268

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-EIKKIASEVNILSKKTLALTSCGNHQEV 239
                      R  D      ++    RS+  E + I++ V    K         N    
Sbjct: 269 -----------RSMDQAKLKARK----RSEPIEPEAISNMVRYARKCIREFRVFKNTSPA 313

Query: 240 VSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
            +  +M E+   ++   F   +V ++    + + + + ++D   A+ Y +     Y+++Y
Sbjct: 314 STLLEMCEQSLDEMGAVFDDSNVYMLHMMYQAMGVCLYMQDPDGAIRYGEKIAKYYRKLY 373

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           P +   L   Y   G+L + +      I  + +A  I+ +THG    ++ EL  +++E
Sbjct: 374 PAYSLNLSSLYLKLGRLYFGMERNAECIGMLKKAKAIMEVTHGKEHFYLTELDRQMKE 431


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 158/339 (46%), Gaps = 34/339 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINEIAEN-FSKLACN 76
           T ++ +  V+   +H+S +D EK+ L+   I+ L     +      N   E  F+++ CN
Sbjct: 121 TASERFMSVKEFESHLSKLDNEKKELIENDISALHRFYSKNVHNCDNAALEFLFAQVNCN 180

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G EV  SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEEVFTSYID 240

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   LK+ Y F+C C  C         ++ A LE  R   D  S   +RD   
Sbjct: 241 LLYPTEDRNDRLKDSYFFSCDCRECST-----KQKDPAKLE-LRKLSDPPSPQTVRD--- 291

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    +V      K   +   +L    L+L   G+                 ++  
Sbjct: 292 --MITYARNVVEEFRRAKHYKTPSELLEICELSLDKMGS-----------------VFVD 332

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + ++DW+ AL Y +  I  Y + YP +   +   +   G+L  +
Sbjct: 333 SNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRL--Y 390

Query: 317 LGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 353
           +G  +N I  K++ +A+ I+ I HG +  ++ E+  +LE
Sbjct: 391 MGLEKNTIGTKALKKALAIMEIAHGPDHYYIAEIKKELE 429


>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
           niloticus]
          Length = 440

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 88/356 (24%), Positives = 160/356 (44%), Gaps = 50/356 (14%)

Query: 3   KLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI 61
           ++  ++K+Q +  I     +   L+  + +H  D D EK+ +  A IA L     +  ++
Sbjct: 118 RILAKKKMQKERCI----CEKMLLIGEMQSHTEDEDNEKREMTEADIAGLHRFYSKHLDV 173

Query: 62  SIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
             + E+   FS++ACN  TI + EL  LGT +YP +++INHSCLP+ ++ F+G  A VRA
Sbjct: 174 PDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPDMALINHSCLPSVIVTFKGTSAEVRA 233

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           VQ +  G EVLISYI+    T  R   L+E Y F C C  C                   
Sbjct: 234 VQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKS----------------- 276

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV--NILSKKTLALTSCGNHQE 238
                        S DK         ++ +++   I  +V  N++     A+      + 
Sbjct: 277 ------------QSKDK-------AKLKVRKQRDSIEPDVINNMVRYARKAIREFREFKH 317

Query: 239 VVSTYKMIEKLQKKL------YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
             +  +++E  ++ L      +   +V ++    + + I M ++D   A+ Y +  +  Y
Sbjct: 318 TKTPSELLEMCEQSLEEMGAVFDDSNVYMLHMMYQAMGICMYMQDLDGAIRYGEKLLKPY 377

Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             +YP +   +   Y   G+L   L        ++ +A+ I+ + HG +  +++EL
Sbjct: 378 SHLYPPYSLNVSSVYLKLGRLYLGLERQSACTSALKKAMAIMEVAHGKDHFYIEEL 433


>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
 gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
          Length = 430

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 155/337 (45%), Gaps = 30/337 (8%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
           T ++ +  V+   +H+S +D EK+ L+ + IA L     +    + N  +   F+++ CN
Sbjct: 121 TPSETFLSVKEFESHLSKLDNEKKELIESDIAALHRFYSKNLHYTDNAALVFLFAQVNCN 180

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ F+G +A +RAVQ +  G EV  SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAGDEVFTSYID 240

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               T  R   L + Y FTC C  C         ++ A LE  R   D  S   ++D   
Sbjct: 241 LLYPTEDRNDRLMDSYFFTCDCRECST-----KQKDPAKLE-IRKLSDPPSHQTVKDMIK 294

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
                    +V      K   +   +L    L+L   G+                 ++  
Sbjct: 295 YARN-----IVEEFRRAKHYKTPSELLEMCELSLDKMGS-----------------VFVD 332

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            +V ++    + + + + L++W  AL Y +  I  Y + YP +   +   +   G+L   
Sbjct: 333 SNVYMLHMMYQAMGVCLYLQEWDGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRLYMG 392

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           L  T    K++ +A+ I++I HG +  ++ E+  +LE
Sbjct: 393 LEKTTIGTKALKKALAIMQIAHGPDHHYIAEIKKELE 429


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 160/354 (45%), Gaps = 56/354 (15%)

Query: 18  STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
           ST +D   + ++ L  H++D+ E +L  L   I N ++    WP  S    +++I+  F 
Sbjct: 114 STVSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFG 170

Query: 72  KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV----------- 118
            + CN  ++ +   L+ +G GL+P + ++NH C PN  V++  G  + V           
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIE 230

Query: 119 -RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
            R++  + +G E+ ++Y++    +  RQ+ LK QY F CTC  C      + I++   L 
Sbjct: 231 LRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLG 285

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
           G             R+ D         G+  S+E++K+       + +K       G++ 
Sbjct: 286 G-------------REVD---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYH 323

Query: 238 EVVSTYK-MIEKLQKKLY--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
           EVV   +  IEK +  L   H + + +  T  ++   L  L D   A  Y +  +  Y +
Sbjct: 324 EVVKICRECIEKTEPVLAETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTK 380

Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +YP  +  LG+     G + W  GD E A   + +A  IL +THG   P  K+L
Sbjct: 381 LYPPNNATLGMAAMRAGVIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 434


>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
           niloticus]
          Length = 435

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/360 (21%), Positives = 159/360 (44%), Gaps = 35/360 (9%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----IL 56
           ++L  R  ++       T ++   L++   AH+  +D EK+ +    IA L +     I 
Sbjct: 106 VRLVARIIMKQRATTERTPSERLLLLKEFEAHLDKMDSEKEEMNQTDIAALHHFYSRHIS 165

Query: 57  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
             P+     + E F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A
Sbjct: 166 NLPDEQA--LTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVA 223

Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
            VRAV+ +  G E+  SYI+    T  R++ L + Y FTC C  C               
Sbjct: 224 EVRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECTS------------- 270

Query: 177 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 236
                          R  D      ++       EEI+ +      + ++          
Sbjct: 271 ---------------RSKDKAKMEIRKLNPPPEPEEIRSMVRYAKNVIEEFRRAKHYKTP 315

Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
            E++   ++  +    ++   +V ++    + + + + ++DW  A++Y +  +  Y   Y
Sbjct: 316 SELLEMCELSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVQPYSVHY 375

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
           P +   +   Y   G+L   L      +K++ +A+ ++ + HG +  ++ E+  ++EE +
Sbjct: 376 PAYSLNVASMYLKLGRLYLGLEKKTQGVKALKKALAVMEVAHGKDHHYVAEVKREIEEQK 435


>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 732

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 133/315 (42%), Gaps = 36/315 (11%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKL 73
           + +T    +  V+AL +   D+  ++   Y  +AN VN +L      + ++ +AE  SK+
Sbjct: 322 VRATAPGLFDAVKALDSPFRDVSAEKRATYVGMANAVNSMLPPGPARMDVDALAEVMSKV 381

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 111
             N+H + ++  R LGTG+YP  ++ NHSC PNAV+ F                      
Sbjct: 382 HVNSHGVVDAAGRALGTGVYPPAALFNHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGS 441

Query: 112 --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
              G    VR V  V +G E+ I+Y E   +  TR++AL E+  F C C RC        
Sbjct: 442 TNNGARLTVRCVSPVEEGEEICIAYAEVYATRETRREALWEKKAFACECRRCADAAS--- 498

Query: 170 IQESAILEGYRCKD-DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
            +  A L G+RC     C G +     D G +C  CG   ++   K+ A+E   + +   
Sbjct: 499 ARRDAPLGGWRCPRLSACDGVV----PDAGDSCLTCGAKCARARSKRDAAERRWIGRHAA 554

Query: 229 ALTSC--GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
           ALT+    +H   VS    + +          V   + R  L+     L+ W+ A    +
Sbjct: 555 ALTALRENDHATAVSIACEVVRESDDALCDAHVIRHECRLVLMDAHAALKQWRHAADVSR 614

Query: 287 LTIPVYQRVYPQFHP 301
                 +R     HP
Sbjct: 615 DVADAMRRFVNVNHP 629


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/343 (22%), Positives = 152/343 (44%), Gaps = 35/343 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKL 73
           T ++   L++   +H+  +D EK  +  A IA L       I   P+    E+ E F+++
Sbjct: 123 TPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQV 180

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
            CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  S
Sbjct: 181 NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNS 240

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           YI+    T  R++ L + Y F C C  C                              + 
Sbjct: 241 YIDLLYPTEDRKERLLDSYFFGCQCTECT----------------------------TKS 272

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
            D      ++       EEIK +      + ++           E++   ++  +    +
Sbjct: 273 KDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAI 332

Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
           +   +V ++    + + + + ++DW  A++Y +  +  Y   YP +   +   Y   G+L
Sbjct: 333 FADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRL 392

Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
              L      +K++ +A+ I+ I HG +  ++ E+  ++EE +
Sbjct: 393 YLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 435


>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 226

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 126/264 (47%), Gaps = 45/264 (17%)

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
           +S++NHSC PN V+VFEG   ++R V+ +PKG E+ ISYI+    T  R+  L+ QY F 
Sbjct: 1   MSLLNHSCDPNCVIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60

Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 215
           C C RC                            LLRD D+         L    E  ++
Sbjct: 61  CDCQRC----------------------------LLRDKDEDM-------LAGDAEASRE 85

Query: 216 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIKILM 273
           + S V+ L +    L S    +E ++   + + L  + Y P   N+ Q +  +  +   +
Sbjct: 86  VESSVSRLEE----LLSQNTAEEALN---LCKTLMNRYYLP-DKNIYQLKILDCAMDASI 137

Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
           +L  W+EAL +   T+  Y   Y  +HP+  +Q    GKL+ + G    A+K++ +A +I
Sbjct: 138 DLGLWEEALHFGLRTLEPYSLYYSNYHPVRAVQLMKVGKLQNYQGLFAEAMKTLKQAFDI 197

Query: 334 LRITHGTNSPFMKELILKLEEAQA 357
           +++THG +    ++L   + + +A
Sbjct: 198 MKVTHGRDHGQTQQLAELMNDCEA 221


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/343 (22%), Positives = 152/343 (44%), Gaps = 35/343 (10%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKL 73
           T ++   L++   +H+  +D EK  +  A IA L       I   P+    E+ E F+++
Sbjct: 112 TPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQV 169

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
            CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  S
Sbjct: 170 NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNS 229

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           YI+    T  R++ L + Y F C C  C                              + 
Sbjct: 230 YIDLLYPTEDRKERLLDSYFFGCQCTECT----------------------------TKS 261

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
            D      ++       EEIK +      + ++           E++   ++  +    +
Sbjct: 262 KDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAI 321

Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
           +   +V ++    + + + + ++DW  A++Y +  +  Y   YP +   +   Y   G+L
Sbjct: 322 FADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRL 381

Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
              L      +K++ +A+ I+ I HG +  ++ E+  ++EE +
Sbjct: 382 YLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 424


>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 454

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 158/366 (43%), Gaps = 51/366 (13%)

Query: 23  NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-------NEIAENFSKLAC 75
           ++ +V  L +H+ D   K         N+  + +  P+ ++       + IA  ++++  
Sbjct: 76  DWEMVCRLPSHVDDFKRK-----GTYGNIEMMAMGAPQFALPPTMFDKDFIAAMYARVMS 130

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ + K  E+ ISYI
Sbjct: 131 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIRKDKEIFISYI 190

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
           +T      RQ+ L+ ++ FTC C +C         QE A L+         S F+L    
Sbjct: 191 DTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLLPVDSKFVLDAEA 241

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
            K     Q       EE+ K+++E  I   K + L SC                + + Y 
Sbjct: 242 KKAMAQTQEQTFAVYEELHKLSTEHVIYGLKQI-LASC---------------YESRFYP 285

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLLGLQYYTCGKL 313
            +     + R+ LI  L+ +  +++A A C      +  ++YP  FHP+  +Q +    L
Sbjct: 286 IYRQPYAEARDVLIVNLLSMGKFQDAWAQCAKRYKYILPKLYPVPFHPVRVVQTWQMAML 345

Query: 314 EWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
             +L  TE  + +            + + +++  ++HG N+ F K +  K EE   E   
Sbjct: 346 AAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKGKAEEMIEELKR 405

Query: 362 KLSSKD 367
            + + D
Sbjct: 406 SVGNPD 411


>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
          Length = 497

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 82/328 (25%), Positives = 150/328 (45%), Gaps = 49/328 (14%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEER 249

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           ++ LK+QY F CTC  C K                + KDD    FL              
Sbjct: 250 KRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL-------------- 276

Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ 
Sbjct: 277 GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIY 336

Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
           +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 337 MLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 396

Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
           E     + +A  IL +THG + P  K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424


>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
           cuniculus]
          Length = 477

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 79/323 (24%), Positives = 146/323 (45%), Gaps = 39/323 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
           V H  + ++K+L      A++   +  WP      S+  I+  F  + CN  T+ +   L
Sbjct: 136 VDHFGEEEQKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           + +G G++P ++++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R+
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERR 250

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + LK+QY F CTC  C K G  DD+                  FL    + K        
Sbjct: 251 RQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDNPKP------- 284

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
              S+E +K++        +K     S G + EVV   +   + Q+ ++   ++  ++  
Sbjct: 285 ---SQEVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRML 341

Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
             + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E    
Sbjct: 342 SIVSEVLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHG 401

Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
            + +A  IL +THG + P  K+L
Sbjct: 402 MICKAYAILLVTHGPSHPITKDL 424


>gi|384246814|gb|EIE20303.1| hypothetical protein COCSUDRAFT_48675 [Coccomyxa subellipsoidea
           C-169]
          Length = 377

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 59/90 (65%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           ++A   ++ ACN HTIC+ ELRP+G G+YP  +++NHSC PN +  F G   V RAVQH+
Sbjct: 22  DVAYLLARFACNNHTICDDELRPIGVGIYPQGAMLNHSCTPNTMQSFHGSRIVFRAVQHI 81

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLF 154
           P GAE  I+Y+E A +   R  AL+  Y F
Sbjct: 82  PAGAEATIAYVELAATRAERSAALRANYFF 111


>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Cavia porcellus]
          Length = 477

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 82/328 (25%), Positives = 147/328 (44%), Gaps = 49/328 (14%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  +  G E+ +SYI+    +  R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISVGEELTVSYIDFLNVSEER 249

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           ++ LK+QY F CTC  C K G  DD+                  FL              
Sbjct: 250 KRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL-------------- 276

Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G+    +  +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ 
Sbjct: 277 GVKEDPKPSQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 336

Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
            ++    + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 337 TLRMLSVVSEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 396

Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
           E     + +A  IL +THG + P  K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 92/168 (54%), Gaps = 6/168 (3%)

Query: 28  EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 87
           ++LV+++  ID      +  +  ++N  L   EIS  E  E FSK++CN+  IC+ E++ 
Sbjct: 121 DSLVSNVEKIDSDAKEDFVSVLMVLNEYLG-SEISPPEGLELFSKISCNSFAICDGEMQA 179

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           +GTG++P    +NHSC PN+V VF G    ++A++ +P G E+ ISYI+      TRQ+ 
Sbjct: 180 IGTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEE 239

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
           L+ Q+ F C C RC+     D    + +L    C +  C   + +  D
Sbjct: 240 LQTQFCFYCQCHRCL-----DASDNNKMLTSLICPNKSCEAIVYQTFD 282



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG-DTENAIKSMTEAVEIL 334
           +DW +AL Y    + VY   YP++HP LG+  Y  GKL      D E A+K + +A  IL
Sbjct: 288 QDWTKALEYATRNLEVYTWFYPKYHPCLGVHLYKIGKLLAVTHQDLELAVKRLEQARRIL 347

Query: 335 RITHGTNSPFMKELILKLEEAQAE 358
            +THG + P ++EL   L +A  E
Sbjct: 348 EVTHGQSHPLVQELCEYLCQASEE 371


>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 382

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 81/330 (24%), Positives = 145/330 (43%), Gaps = 36/330 (10%)

Query: 25  SLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHT 79
           + VE L  H++D+ E +L  L   I N ++    WP  S    I++I+  F  + CN  T
Sbjct: 28  TTVEELEDHIADLQEDELKELKVDIHNFLDY---WPRNSKQHTIDDISHLFGVINCNGFT 84

Query: 80  ICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
           + +   L+ +G GL+P + ++NH C PN  ++       +R++  + +G E+ ++Y++  
Sbjct: 85  VSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFL 144

Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
             +  R++ LK QY F C C  C                  R KDD       R+ D   
Sbjct: 145 NLSEERRRLLKTQYFFDCQCEHCRN----------------RTKDD--LKLAGREVD--- 183

Query: 199 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 258
                 GL  S+E++K+       + +K       G++ EVV   +      + +     
Sbjct: 184 ------GLKPSEEQVKEATDYCFQMLEKMDKARLDGDYHEVVKICRDAIDRTEPVLADTH 237

Query: 259 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 318
           + L++    L ++   L+ + +A  Y +  +  Y ++YP  +  LG+     G   W  G
Sbjct: 238 IYLLRMWSTLSEVQAYLQYFNDAATYSRKMVEGYMKLYPPNNAALGMAAMRAGVNHWQAG 297

Query: 319 DTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             E     + +A  IL +THG   P  K+L
Sbjct: 298 LIEVGHGMVCKAYAILLVTHGPTHPITKDL 327


>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
 gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
          Length = 265

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 74/123 (60%), Gaps = 1/123 (0%)

Query: 33  HMSDIDEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTG 91
           H  ++D+++  LYAQ+A +   + +W        +A+  S L+CN HT+C+ ELRPLG  
Sbjct: 143 HWDELDDRRKQLYAQMAVVTWWVARWGTWPGFRTVAQLLSLLSCNCHTVCDEELRPLGVA 202

Query: 92  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 151
           LYP  +++NHSC P+ V  F G    +RA++ +  G E+ I+YIE A +   R++ L + 
Sbjct: 203 LYPTGALVNHSCSPSTVQTFHGSTLELRALRQLAPGDEITIAYIELAATRQERRETLADS 262

Query: 152 YLF 154
           Y F
Sbjct: 263 YFF 265


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 147/324 (45%), Gaps = 41/324 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 131 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 244

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           ++ LK+QY F C+C  C K G  DD+                  FL    D K       
Sbjct: 245 RRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP------ 279

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
               S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ +++ 
Sbjct: 280 ----SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRL 335

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
                ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E   
Sbjct: 336 LSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGH 395

Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
             + +A  IL +THG + P  K+L
Sbjct: 396 GMICKAYAILLVTHGPSHPITKDL 419


>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
 gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
          Length = 478

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 157/351 (44%), Gaps = 45/351 (12%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICN 82
           ++ L  H+   DE +  L  +  ++   +  WP    +  +  I+  FS ++CN  T+ +
Sbjct: 124 IDDLQNHIDKFDEAEKGLLME--DVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSD 181

Query: 83  SE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
              L+ +G G++P + + NH C PN  ++F      +RA+  + KG E+ +SY++    T
Sbjct: 182 QRGLQAVGVGIFPNLCLANHDCWPNCTVIFNNGKIELRALGKINKGEELTVSYVDFLNLT 241

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
             R+  LK+QY F CTC  C K  +       A+L              ++D +DK    
Sbjct: 242 EDRKAQLKKQYYFDCTCEHCTKKTK------DALL------------LAVKDGEDK---- 279

Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
                    EE  ++  EV   SK T+       S G + +VV   +   K Q+ ++   
Sbjct: 280 --------PEE--RVVKEVIQYSKDTMEKIEKARSEGLYNDVVKLCRDCLKRQEPIFADT 329

Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
           ++ +++      ++L  L+ + +A    +  +  Y ++Y Q +  LG+     G   W  
Sbjct: 330 NIYMLRILSIYSEVLSYLQMFDDAAENAKKMVDGYLKIYHQNNAQLGMAVMRAGVTHWHA 389

Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
           G  E     + +A  IL ITHG   P  K+  L++  AQ E   ++  ++E
Sbjct: 390 GMIEVGHGMICKAFAILLITHGPLHPITKD--LEVMRAQTEMELRMFKENE 438


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 150/328 (45%), Gaps = 49/328 (14%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 249

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           ++ LK+QY F C+C  C K G  DD+                  FL    D K       
Sbjct: 250 RQQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAVKEDPKP------ 284

Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
               S+E +K    E+   SK TL       S G + EVV   +   + Q+ ++   ++ 
Sbjct: 285 ----SQEVVK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 336

Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
           +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 337 VLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHI 396

Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
           E     + +A  IL +THG + P  K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 147/324 (45%), Gaps = 41/324 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 249

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           ++ LK+QY F C+C  C K G  DD+                  FL    D K       
Sbjct: 250 RRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP------ 284

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
               S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ +++ 
Sbjct: 285 ----SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRL 340

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
                ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E   
Sbjct: 341 LSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGH 400

Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
             + +A  IL +THG + P  K+L
Sbjct: 401 GMICKAYAILLVTHGPSHPITKDL 424


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 152/339 (44%), Gaps = 39/339 (11%)

Query: 18  STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
           ST +D   + ++ L  H++D+ E  L  L   I N ++    WP  S    I++I+  F 
Sbjct: 114 STVSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDY---WPHNSKQHTIDDISHIFG 170

Query: 72  KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
            + CN  ++ +   L+ +G GL+P + ++NH C PN  ++       +RA+  + +G E+
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIAEGEEL 230

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
            ++Y++    +  RQ+ LK QY F CTC  C                  R KDD   G  
Sbjct: 231 TVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN----------------RIKDDIKIGG- 273

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKL 249
            R+ D         G+  S+E++K+       + +K       G++ EVV   +  IEK 
Sbjct: 274 -REED---------GVKPSEEQVKEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKT 323

Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
           +  L     + L++    + ++   L+ + +A  Y +  +  Y ++Y   +  LG+    
Sbjct: 324 EPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMR 382

Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            G   W  G  E     + +A  IL ITHG   P  K+L
Sbjct: 383 AGVTHWQAGQIEVGHGMICKAYAILMITHGPIHPITKDL 421


>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
          Length = 241

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 43/260 (16%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F +L CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 1   FPQLICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIQAGEE 60

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
           + I Y++   ++  R++ L+ QY F C C RC                            
Sbjct: 61  LTICYLDMLMTSAERREQLRNQYCFDCDCARC---------------------------- 92

Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
           L +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L
Sbjct: 93  LTQDKDADMLTGDEKVWKEVQESLKKI-EELKAEWKWEQVLALC---QAIISSNS--ERL 146

Query: 250 QKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
                    +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q 
Sbjct: 147 P-------DINIYQLKVLDCAMDACIRLGLLEEALFYGIRTMEPYRIFFPGCHPVRGVQV 199

Query: 308 YTCGKLEWFLGDTENAIKSM 327
              GKL+   G    A++++
Sbjct: 200 MKVGKLQLHQGMFPQAMRNL 219


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 79/337 (23%), Positives = 144/337 (42%), Gaps = 35/337 (10%)

Query: 18  STTTDN-YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSK 72
           S  +DN  + +E L  H+ DI E  L  +    ++ N +  WP  S    ++ ++     
Sbjct: 114 SVVSDNQLTTLEDLEDHICDISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGV 171

Query: 73  LACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           + CN   + +   L+ +G GL+P + ++NH C PN  ++       +RA+  +  G EV 
Sbjct: 172 INCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGKIELRALGKISAGEEVT 231

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           ++Y++    +  RQ+ LK+QY F CTC  C +  + D     A ++G +  +        
Sbjct: 232 VAYVDYLNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE-------- 283

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
                              E++K++        +K        N+ EVV   +   + Q+
Sbjct: 284 -------------------EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQE 324

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            +     +  ++    L ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 325 NVLADTHIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAG 384

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G  E A   + +A  IL ITHG   P  K+L
Sbjct: 385 VTHWQAGFIEVAHGMICKAFAILMITHGPTHPITKDL 421


>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Meleagris gallopavo]
          Length = 478

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/352 (25%), Positives = 155/352 (44%), Gaps = 47/352 (13%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           ++ L  H+ + DE++   L   + + +     WP    +  +  I+  F  + CNA T+ 
Sbjct: 129 IDDLQNHVENFDEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
           +   L+ +G G++P +   NH C PN  ++F      +RA+  +  G E+ +SY++    
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNV 245

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
           +  RQK LK+QY F CTC  C K                + KDD      +++ D K   
Sbjct: 246 SEERQKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP-- 285

Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHP 256
                   S E +K    EV   SK TL   +     G + EVV   +   K Q+ +   
Sbjct: 286 --------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGD 333

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 334 TNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWH 393

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
            G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 394 AGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443


>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
          Length = 437

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 151/331 (45%), Gaps = 39/331 (11%)

Query: 26  LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
           L+  + AH+ D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ 
Sbjct: 131 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 188

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
           + EL  LG  ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T
Sbjct: 189 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 248

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
             R + L++ Y F+C C  C      D ++ S      R + D      ++D        
Sbjct: 249 ADRLERLRDMYYFSCDCKECTT-KSMDVVKMSV-----RKRSDEIGEKEIKD-------- 294

Query: 202 QQCGLVR----SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
               +VR    S E  ++   + NIL   +  L  C    + +ST          ++   
Sbjct: 295 ----MVRYARNSMENFRRAKQDKNILC-GSFILEMCELSIDKMST----------VFDDS 339

Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
           +V ++    + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L
Sbjct: 340 NVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIAL 399

Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
                 I +  +A+ I+ + HG +  ++ EL
Sbjct: 400 DRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 430


>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
           mutus]
          Length = 221

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 73/258 (28%), Positives = 123/258 (47%), Gaps = 39/258 (15%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ V  G E
Sbjct: 1   FLQVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 60

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
           + I Y++   ++  R+K L+ QY F C C RC    Q  D                    
Sbjct: 61  LTICYLDMLMTSEERRKQLRGQYCFDCDCFRC----QTQD-------------------- 96

Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
             +D+D          ++   E++ K   EV    KK   L +    ++V++  + I   
Sbjct: 97  --KDAD----------MLTGDEQVWK---EVQESLKKIEDLKAHWKWEQVLAMCQSIISS 141

Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
             +     ++  ++  +  +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q   
Sbjct: 142 NAERLPDINIYQLKVLDCAMDACLNLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMK 201

Query: 310 CGKLEWFLGDTENAIKSM 327
            GKL+   G    A+K++
Sbjct: 202 VGKLQLHQGMFPQAMKNL 219


>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
           domestica]
          Length = 490

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 151/345 (43%), Gaps = 59/345 (17%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           V+ L  H+ +  +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ 
Sbjct: 129 VDDLQNHVDNFGEEEQKDLRVDVDSFLNF---WPAQSQQFSMQYISHIFGVINCNAFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G
Sbjct: 186 DQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  R+K LK+QY F CTC  C K G  DD+                 
Sbjct: 246 EELTVSYVDFLNVSQERKKQLKKQYYFDCTCEHCEK-GIKDDL----------------- 287

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTY 243
            FL   +D K                + +  EV   SK+TL       S G + EVV   
Sbjct: 288 -FLAVKADPK--------------PSQDVVKEVTQFSKETLEKIDKARSEGLYHEVVKLC 332

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   + Q+ +    ++++++      ++L  L+ + EA  Y +  +  Y ++Y   +  L
Sbjct: 333 RECLQKQEPVLADTNIHVLRILSIASEVLSYLQSFHEASDYAKRMVDGYMKLYHPNNAQL 392

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 393 GMAVMRAGLTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
          Length = 490

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 85/341 (24%), Positives = 153/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L L       V++ LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRLD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P ++++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNVSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 QKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAI 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 67/97 (69%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
             G E+ I Y++   ++  R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
          Length = 473

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 47/352 (13%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           ++ L  H+   DE++   L   + + +     WP    +  +  I+  F  + CNA T+ 
Sbjct: 124 IDDLQNHVESFDEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
           +   L+ +G G++P +   NH C PN  ++F      +RA+  +  G E+ +SY++    
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNV 240

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
           +  R+K LK+QY F CTC  C K                + KDD      +++ D K   
Sbjct: 241 SEERRKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP-- 280

Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHP 256
                   S E +K    EV   SK TL   +     G + EVV   +   K Q+ +   
Sbjct: 281 --------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGD 328

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 329 TNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTHWH 388

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
            G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 389 AGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 438


>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
           harrisii]
          Length = 490

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 150/345 (43%), Gaps = 59/345 (17%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           V+ L  H+ +  +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ 
Sbjct: 129 VDDLQNHVDNFGEEEQKDLRMDVDSFLNF---WPPQSQQFSMQYISHIFGVINCNAFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G
Sbjct: 186 DQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SYI+    +  R+K LK+QY F CTC  C K G  DD+                 
Sbjct: 246 EELTVSYIDFLNISQERKKQLKKQYYFDCTCEHCEK-GIKDDL----------------- 287

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTY 243
            FL    D K                + +  EV   SK TL       S G + EVV   
Sbjct: 288 -FLAVKEDPK--------------PSQDVVKEVTQFSKDTLEKIDKARSEGLYHEVVKLC 332

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   + Q+ +    ++++++      ++L  L+ ++EA  Y +  +  Y ++Y   +  L
Sbjct: 333 RECLQKQEPVLADTNIHVLRILSIASEVLSYLQSFQEASDYAKRMVDGYMKLYHPNNAQL 392

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 393 GMAVMRAGLTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
 gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
          Length = 909

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 23/299 (7%)

Query: 66  IAENFSKLACNAHTIC------------NSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           +  +  +L CNA  I             + ++R L T ++P  +++NHSC PN  + F+G
Sbjct: 606 LCHHMQQLRCNAQAITTLQEQDSVSLLEDKQVR-LATAVFPTEALLNHSCRPNVFVSFQG 664

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
           +  +VRAV H+  G E+L  Y   AG  +   RQ ALKEQY F+C+C  C +  Q  +  
Sbjct: 665 KTLIVRAVSHIKPGEELLHCYGPHAGRMVYGERQAALKEQYFFSCSCDACQE--QVGNPN 722

Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ--QCGLVRSKEEIKKIASEVNILSKKTLA 229
              +   Y+C     +  L     DK   C    C      ++IK  + ++  L  ++ A
Sbjct: 723 TVDMFSAYKCPVCNNAAKL----QDKKLVCTSPNCDAQGDTDDIKTTSKKIQDLFVQSSA 778

Query: 230 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 289
               G  QE VS  K    LQ K+ HP + ++ +T + L +      D K+++ + + ++
Sbjct: 779 FLEEGQIQEAVSGLKQCLVLQWKILHPSNKDIARTHDALARCYATSGDCKKSVEHLKKSL 838

Query: 290 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              +  +      L  + +   +L++        + ++  A+ I    +G + P  KEL
Sbjct: 839 STVELQFGACSVELAHELHKLAQLQFNGQQVAECLDTIERALTIFGCHYGDHHPKSKEL 897


>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
          Length = 374

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 122/254 (48%), Gaps = 18/254 (7%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           +A   S+++ N+ TI + +   LG GLY   S ++HSC PNA   FEG    VR ++ + 
Sbjct: 95  MAFYLSQISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANFEGTKVCVRCIEPIE 154

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
              ++ ISY+     T TR+K L ++YLF CTC  C+      DI +  +   ++C   G
Sbjct: 155 DEKDLRISYVSPLDDTATRRKNLLQKYLFECTCEACL------DIDQDQMKFSFKCVRGG 208

Query: 186 CSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNI------LSKKTLALTSCGNHQE 238
           C G + R  D++ F C  CG  + +++EI+ +   V        +SKK   L + G + E
Sbjct: 209 CKGHMTRMQDNR-FRCDYCGEKQVTEDEIQTLNVAVEKTRRMMDISKK---LKAQGRYAE 264

Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRVYP 297
           ++       K      H  ++ +++ R+ L+      ++W      C +  +  Y+  Y 
Sbjct: 265 IMVAGFACLKDAVDFLHSDNLLIVELRDCLLNACRMTQNWSVEAIQCGIANLEPYRSFYG 324

Query: 298 QFHPLLGLQYYTCG 311
           + HP L +  Y  G
Sbjct: 325 RHHPALAILLYDIG 338


>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
          Length = 490

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 154/341 (45%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  ++CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
              D K           S+E +K++      LSK TL       S G + EVV   +   
Sbjct: 291 VKDDPKP----------SQEAVKEMIQ----LSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 QKQEPVFADTNLYTLRMLSTVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Ornithorhynchus anatinus]
          Length = 477

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 131/300 (43%), Gaps = 42/300 (14%)

Query: 58  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
           WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
                +RA+  + +G E+ +SYI+    +  R+K LK+QY F CTC  C K G  DD+  
Sbjct: 218 NGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCEK-GIKDDL-- 274

Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL---- 228
                           FL              G+    +  + +  EV   SK  L    
Sbjct: 275 ----------------FL--------------GVKDEPKPSQDVVKEVIQFSKDALEKID 304

Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 288
              S G + EVV   +   + Q+ +    ++ L++      ++L  L+ ++EA  Y +  
Sbjct: 305 KARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYLQSFEEASDYAKKM 364

Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 365 VDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILLVTHGPTHPITKDL 424


>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Canis lupus familiaris]
          Length = 490

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 151/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L L       V+  LQ WP    + S+  I+  F  ++CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNLSEDRKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 PVFADTNLYTLRMLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
 gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
          Length = 490

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 85/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L L       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P ++++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNVSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 QKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAI 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
 gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
          Length = 374

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/324 (25%), Positives = 144/324 (44%), Gaps = 44/324 (13%)

Query: 30  LVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIA-ENFSKLACNAHTICNSELR 86
           LVAH+ +I  D K++  + ++       L    +   E   E F K+  N +TI N E  
Sbjct: 71  LVAHVEEIKRDAKRVAQFQRLWMTAGAFLGDKYLPPAETGLEIFGKMVVNTYTITNYEEC 130

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA--EVLISYIETAGSTMTR 144
           P+GTGLY   SI++HSC PNA  VF+GR   +RA   +   +   + +SYI+      TR
Sbjct: 131 PIGTGLYIGPSILDHSCTPNAQAVFDGRTLRLRAAADIECDSVDGIRVSYIDLKELRRTR 190

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
            + L+E+Y F+C CPRC      D+ QE                           T +  
Sbjct: 191 IEELRERYYFSCNCPRC----SADEAQEY-------------------------LTEKSP 221

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            L    +E+ +   +  + +++ L     G  + ++S++K+      +          + 
Sbjct: 222 SLTAKAKELLREFMQAEVHAREDLVRLR-GQAEALLSSHKLPATDAAR---------SEA 271

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            E L K  +E+ D+  AL Y       Y+R Y  + P+ G+  +T  KL       + A+
Sbjct: 272 LELLSKSCLEMGDFDAALPYYLAKEETYRRCYGPYSPVYGVLLFTIAKLYHHKVLLQEAL 331

Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
           +   +A E++ ++HG N    ++L
Sbjct: 332 QYFEKASEVVTVSHGRNHRLYRDL 355


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 145/316 (45%), Gaps = 35/316 (11%)

Query: 58  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
           WP    +  +  I+  FS ++CN  T+ +   L+ +G  ++P + + NH C PN  ++F 
Sbjct: 153 WPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVAIFPNLCLTNHDCWPNCTVIFN 212

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
                +RA+  + KG E+ +SY++    T  R + LK+QY F CTC  C    +  D   
Sbjct: 213 NGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCT--NKTKDALL 270

Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
            A+ +G    +D     +++ S D            + E+I+K  SE             
Sbjct: 271 LAVNDGESKPEDRVVKEVIQYSKD------------TMEKIEKARSE------------- 305

Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
            G++ +VV   +   K Q+ ++   ++ +++      ++L  L+ +++A    +  I  Y
Sbjct: 306 -GSYHQVVKLCRECLKRQEPIFADTNIYMLRILSIYSEVLSYLQMFQDAAENAKKMIDGY 364

Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
            ++Y   +  LG+     G  +W  G  E     + +A  IL ITHG + P  K+  L++
Sbjct: 365 MKIYHPNNAQLGMAIMRAGVTQWHAGMIEVGHGMICKAFAILLITHGPSHPITKD--LEM 422

Query: 353 EEAQAEASYKLSSKDE 368
             +Q E   ++  ++E
Sbjct: 423 MRSQTEMELRMFKENE 438


>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
 gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
          Length = 477

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/323 (24%), Positives = 146/323 (45%), Gaps = 39/323 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
           V H  + ++K+L +      L   + Q  + S+  I+  F  + CN  T+ +   L+ +G
Sbjct: 136 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 194

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK
Sbjct: 195 VGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLK 254

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 209
           +QY F C C  C K                + KDD    FL              G+  +
Sbjct: 255 KQYYFDCACEHCQK----------------KLKDD---LFL--------------GVKDN 281

Query: 210 KEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
            +  +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++  
Sbjct: 282 PKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRML 341

Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
             + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E    
Sbjct: 342 SIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHG 401

Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
            + +A  IL +THG + P  K+L
Sbjct: 402 MICKAYAILLVTHGPSHPITKDL 424


>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
 gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
          Length = 532

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/366 (23%), Positives = 157/366 (42%), Gaps = 51/366 (13%)

Query: 23  NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-------NEIAENFSKLAC 75
           ++ +V  L +H+ D             N+  + +  P+ ++       + IA  ++++  
Sbjct: 154 DWEMVCRLPSHVDDFKRN-----GTYGNIEMMAMGAPQFALPPNMFDRDFIAAMYARVMS 208

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ + K  E+ ISYI
Sbjct: 209 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIRKDKEIFISYI 268

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
           +T      RQ+ L+ ++ FTC C +C         QE A L+         S F+L    
Sbjct: 269 DTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLVPADSKFVLDPEA 319

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
            +     Q        E++K+++E  I   K + L SC                + + Y 
Sbjct: 320 KQAMAQTQEQTFALYGELQKLSTEHVIHGLKQI-LASC---------------YESRFYP 363

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLLGLQYYTCGKL 313
            +     + R+ L+  L+ +  +++A A C      +  ++YP  FHP+  +Q +    L
Sbjct: 364 MYRQPYAEARDVLVVNLLSVGKFQDAWAQCAKRYKYILPKLYPVPFHPVRVVQTWQMAML 423

Query: 314 EWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
             +L  TE  + +            + + +++  ++HG N+ F K +  K EE   E   
Sbjct: 424 AAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKEKAEEMIEELKR 483

Query: 362 KLSSKD 367
            + + D
Sbjct: 484 SVGNPD 489


>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
          Length = 490

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 150/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L L       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P ++++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNISEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FLG 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 PVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAG 400

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
          Length = 420

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 72/297 (24%), Positives = 131/297 (44%), Gaps = 35/297 (11%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V    +H+  +D EK+ L+ + IA L       L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
              +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
            +  G EV  SYI+    T  R   L++ Y FTC C  C                     
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 268

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
                    +D D      ++       E I+ +      + ++        +  E++  
Sbjct: 269 --------TKDKDKAKVEVRKLSSPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
            ++ ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +
Sbjct: 321 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVY 377


>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 528

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 79/146 (54%), Gaps = 12/146 (8%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQ 122
           +I + F  + CN  TICN+   PLG GLYP  +++NHSC PN ++ F+   + AV+  ++
Sbjct: 202 QITKLFCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIE 261

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY--- 179
            +  G E+ ISY++    T  R+K L+  Y F CTC RC         QE A LEG+   
Sbjct: 262 TIHSGQELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCT------SSQEDAFLEGFLCT 315

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCG 205
           +C++   S  +L D   + F C  C 
Sbjct: 316 QCRNTSESARIL-DEKKRIFRCCVCA 340


>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
          Length = 452

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 138/321 (42%), Gaps = 69/321 (21%)

Query: 38  DEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSELRPLGT 90
           D K++  +  I  ++N  L       NEI  N       + ++  N+  I N E++ +GT
Sbjct: 180 DTKRMEYFMTICGVLNEYLS------NEILPNSVELLGIYGRMCINSFNILNGEMQAIGT 233

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKAL 148
           G+Y   SI++HSC PNAV  F+G    ++  Q +PK     + ISYI+   S   R+K L
Sbjct: 234 GIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLMNSKSHRKKEL 293

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
           K++Y F C CPRC    + DDI               C  +       K  TCQ      
Sbjct: 294 KDRYYFDCDCPRC----KNDDID--------------CYHYAA-----KCPTCQ------ 324

Query: 209 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 268
                K + ++V++                     K+    Q+ L+H   +   +  +  
Sbjct: 325 -----KPVIAKVDL------------------DVCKLCIAKQENLFHDLCLVRSKIMDAA 361

Query: 269 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KS 326
               ++L+ W+ A+ Y    I  Y+  Y + HPL  +      K+   +  T +++  K 
Sbjct: 362 FDSAIQLQLWEAAIQYGIPLIQAYKLWYGEEHPLTAILLLKLFKITLLISTTNDSVAQKY 421

Query: 327 MTEAVEILRITHGTNSPFMKE 347
             EA +IL ITHG +S F ++
Sbjct: 422 YEEAAKILEITHGRDSSFYRQ 442


>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
           garnettii]
          Length = 490

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 151/338 (44%), Gaps = 56/338 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WPE----ISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP      SI  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+A+ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL- 190
           +SYI+    T  R+K LK+QY F C C  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVTEERRKQLKKQYYFDCACEHCQK----------------KLKDD---LFLA 290

Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
           ++D+        +  +  SKE + KIA              S G + EVV   +   + Q
Sbjct: 291 VKDNPKPSQEVVKEMIQFSKETLDKIAK-----------ARSEGLYHEVVKLCRECLQKQ 339

Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
           + ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     
Sbjct: 340 EPVFADTNLYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRA 399

Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 400 GLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 156/365 (42%), Gaps = 57/365 (15%)

Query: 6   LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE----I 61
           L R  ++  ++  +      ++E  ++ M+  D K+L      A++      WP+    +
Sbjct: 111 LWRIQKHTGLVSDSQLTTLDMLEDHLSRMTPEDLKELK-----ADVKTFYTYWPKKSKAV 165

Query: 62  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF--------- 111
             + ++  F  ++CN  T+ +   L+ +G GL+P + ++NH C PN  ++          
Sbjct: 166 GEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDCWPNCTVILNHGDQSALD 225

Query: 112 ----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
                 R   +RA++ +  G E+ +SY++    +  RQ+ L++QY F C C  C+  G  
Sbjct: 226 ASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHCVN-GTK 284

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
           D++     +   +  +DG                        K+    +  ++   S + 
Sbjct: 285 DEL-----MTAVKPTEDG------------------------KQPSADVVKQLTDFSLQA 315

Query: 228 L----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 283
           L    A  + GN  EV+   +   + Q  ++   +V++++      ++L  L+ ++EA  
Sbjct: 316 LVKIEAARAQGNFHEVIRICRECLEKQDPVFADTNVHVLRVLSTASEVLSFLQQFQEAAG 375

Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
           Y Q  +  Y ++Y   +  LG+     G   W  G  E A   +  A  IL ITHG + P
Sbjct: 376 YAQRMVDGYMKLYHPNNAQLGMAIMRAGVTHWHAGLIEAAHGLICRAYGILMITHGPHHP 435

Query: 344 FMKEL 348
             ++L
Sbjct: 436 ISRDL 440


>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 471

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 168/362 (46%), Gaps = 52/362 (14%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTIC 81
           VE L  H++D+ E  L    +I   V++ LQ+      + S++EIA  F  + CN  T+ 
Sbjct: 129 VEELEDHVADLPEDHL---KRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNGFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVF-------------EGRLAVVRAVQHVPKG 127
           +   L+ +G GL+P + ++NH C PN  +V                R   +RA++ + +G
Sbjct: 186 DQRGLKAVGVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIELRALRKICEG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++   ++  RQ+ LKE + F CTC  C +  + DD+  +A         DG  
Sbjct: 246 EELTVSYVDFLDTSAERQRKLKEHFYFECTCEHCRQHIK-DDLMTAAAA-------DGPG 297

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV-VSTYKMI 246
           G   + S D+     Q     S+E ++KI  E +++ K         + QEV +   K +
Sbjct: 298 G---KPSADQ----VQEVTAFSQECLEKI--ERSLMDK---------DFQEVRMLCSKCL 339

Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
           EK QK L     ++ ++     +++L  L  + EA AY +  +  Y ++Y      LG+ 
Sbjct: 340 EK-QKNLLADTHLHHLRVLSAAVEVLSYLRCFSEAAAYARRMVQGYTKLYHPNSAQLGIA 398

Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 366
               G      G  E A + + +A  IL +THG N    ++  L+    Q E   KL  +
Sbjct: 399 VMRAGVTHLQAGMIEMAHELICKAYRILLVTHGPNHSVTRD--LEAMRRQTEVELKLLKQ 456

Query: 367 DE 368
           DE
Sbjct: 457 DE 458


>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
 gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
 gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
          Length = 490

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 153/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 QKQEPVFADTNIYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
          Length = 230

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 83/145 (57%), Gaps = 2/145 (1%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
           T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N+ +   F+++ CN
Sbjct: 39  TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCN 98

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+
Sbjct: 99  GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYID 158

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC 161
               T  R   L++ Y FTC C  C
Sbjct: 159 LLYPTEDRNDRLRDSYFFTCECQEC 183


>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
 gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 490

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+     
Sbjct: 136 VEHFGEEEQKELRMD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH+C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNFSEERKKQLKKQYYFDCTCEHCQK-GLKDDL------------------FL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSKGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAI 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHSMICKAYAILLVTHGPSHPITKDL 437


>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 2-like [Xenopus (Silurana) tropicalis]
          Length = 430

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 82/346 (23%), Positives = 158/346 (45%), Gaps = 48/346 (13%)

Query: 19  TTTDNYSLVEALVAHMSDIDEKQL-LLYAQIANLVNLILQWPEISINEIAEN-FSKLACN 76
           T ++ +  V+   +H+S +D ++L L+   IA L     +    S N      F+++ CN
Sbjct: 121 TASERFLSVKDFESHLSKLDNEKLELIQNDIAALHRFYSKNLHYSDNAAQVFLFAQVNCN 180

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
             TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G EV+ SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDEVITSYID 240

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC-------IKLGQFDDIQESAILEGYRCKDDGCSGF 189
               T  R   L +     C C  C       +K+ +        + + ++ K+  CS  
Sbjct: 241 LLYPTEDRNDRLIDSXFCNCDCRECSTKQKGRLKIAEISLKIHPPLDKXWKNKERSCSVP 300

Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
           LL                               L   T+A  S     E++   ++  + 
Sbjct: 301 LL------------------------------YLFPLTIAPPS-----ELLEICELSLEK 325

Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
              ++   +V ++    + + + + ++DW+ AL Y +  I  Y + YP +   +   +  
Sbjct: 326 MGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPPYSLNVASMWLK 385

Query: 310 CGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 353
            G+L  ++G  + AI  K++ +A+ I+ I HG +  ++ E+  +LE
Sbjct: 386 LGRL--YMGLEKKAIGTKALKKAIAIMDIAHGPDHHYIAEIKKELE 429


>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
          Length = 239

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 43/257 (16%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 61  RKQLRDQYCFECDCFRCQT----------------------------QDKDADMLTGDEQ 92

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
                +E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q 
Sbjct: 93  VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 139

Query: 265 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 140 KVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199

Query: 323 AIKSMTEAVEILRITHG 339
           A+K++  A +I+R+THG
Sbjct: 200 AMKNLRLAFDIMRVTHG 216


>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Ailuropoda melanoleuca]
          Length = 490

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 151/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH+C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 PVFADTNLYTLRLLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
           boliviensis boliviensis]
          Length = 490

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 QKQEPVFADTNIYTLRMLSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 88/159 (55%), Gaps = 7/159 (4%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L++P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            +  G EV  SYI+    T  R   L++ Y FTC C  C
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267


>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
 gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
          Length = 239

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKV 141

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 325 KSMTEAVEILRITHG 339
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
          Length = 484

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 130 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 183

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 184 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 243

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 244 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 283

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 284 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 330

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 331 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 390

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 391 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 431


>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
          Length = 510

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 90/352 (25%), Positives = 155/352 (44%), Gaps = 52/352 (14%)

Query: 18  STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
           ST +D   + ++ L  H++D+ E  L  L   I N ++    WP  S    I++I+  F 
Sbjct: 114 STVSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDY---WPHNSKQHTIDDISHIFG 170

Query: 72  KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV----------- 118
            + CN  ++ +   L+ +G GL+P + ++NH C PN  V++  G  + V           
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIE 230

Query: 119 -RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
            RA+  + +G E+ ++Y++    +  RQ+ LK QY F CTC  C                
Sbjct: 231 LRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN-------------- 276

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
             R KDD   G   R+ D         G+  S+E++K+       + +K       G++ 
Sbjct: 277 --RIKDDIKIGG--REED---------GVKPSEEQVKEATDYCFQMLEKMEKARLNGDYH 323

Query: 238 EVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
           EVV   +  IEK +  L     + L++    + ++   L+ + +A  Y +  +  Y ++Y
Sbjct: 324 EVVKICRECIEKTEPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAADYARKMVEGYMKLY 382

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              +  LG+     G   W  G  E     + +A  IL ITHG   P  K+L
Sbjct: 383 HPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKAYAILMITHGPIHPITKDL 434


>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
          Length = 493

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
          Length = 490

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
          Length = 239

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKV 141

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 325 KSMTEAVEILRITHG 339
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
 gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
 gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
 gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
 gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
 gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
 gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
 gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
          Length = 490

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
          Length = 490

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
           cuniculus]
          Length = 490

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 80/336 (23%), Positives = 148/336 (44%), Gaps = 52/336 (15%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
           V H  + ++K+L      A++   +  WP      S+  I+  F  + CN  T+ +   L
Sbjct: 136 VDHFGEEEQKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
           + +G G++P ++++NH C PN  ++F  G    V+++ H            + +G E+ +
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
           SYI+    +  R++ LK+QY F CTC  C K G  DD+                  FL  
Sbjct: 251 SYIDFLNVSEERRRQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGV 291

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
             + K           S+E +K++        +K     S G + EVV   +   + Q+ 
Sbjct: 292 KDNPKP----------SQEVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 341

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           ++   ++  ++    + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G 
Sbjct: 342 VFADTNLYTLRMLSIVSEVLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGL 401

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 402 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 503

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 144/315 (45%), Gaps = 50/315 (15%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
            + ++++  +  +  CN  TI +S L P+G G YP+ S+ NH C PN + +F+G   V++
Sbjct: 228 NLPVSDLIHHLGRFRCNNFTIIDSNLFPVGEGTYPLASLFNHDCWPNCIAIFDGSRVVIQ 287

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
            ++ + KG E+ ISYI+      +R+ +L+ +Y F C C  C+        +    L   
Sbjct: 288 TIRDIAKGDELCISYIDPILDHDSRRMSLETKYCFNCQCSVCMS-------ESCTPLHSP 340

Query: 180 RCKDDGCSGFLLRDSDDKGF-------TCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
             K D      L +  D G+       TC    L +S +E ++ A +  I   KT+AL S
Sbjct: 341 TTKKDAL--LSLDEKSDVGWRLLKWFHTC----LKKSDDEFER-AQQDQI--SKTIALVS 391

Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
               + VV+      ++       F  +LM+            +DW++A       +  Y
Sbjct: 392 ---KRAVVAEIGNATQM-------FQSHLMR------------QDWEQAYKSGLHLLGQY 429

Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK--ELIL 350
              YP+++PL+    +   +  W  G T+  I  +    + L IT+G+    MK   LI 
Sbjct: 430 LLRYPRYYPLVSQHMFFVAQCMWNSGQTDETIFLLGVIKKCLEITYGS---VMKSHHLIG 486

Query: 351 KLEEAQAEASYKLSS 365
           +++   A+A Y+ S+
Sbjct: 487 QVDSLLAQALYEQSA 501


>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
          Length = 434

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 148/337 (43%), Gaps = 54/337 (16%)

Query: 26  LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
           L+  + AH+ D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ 
Sbjct: 131 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 188

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
           + EL  LG  ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T
Sbjct: 189 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 248

Query: 142 MTRQKALKEQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLL 191
             R + L++ Y F+C C  C       +K+    + D+I E  I +  R   +    F  
Sbjct: 249 ADRLERLRDMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR- 307

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
                           R+K++              T  L  C    + +ST         
Sbjct: 308 ----------------RAKQD-----------KSPTELLEMCELSIDKMST--------- 331

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   +V ++    + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G
Sbjct: 332 -VFDDSNVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLG 390

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +L   L      I +  +A+ I+ + HG +  ++ EL
Sbjct: 391 RLYIALDRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 427


>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Cavia porcellus]
          Length = 490

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 149/341 (43%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            +  G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+    +  +++  E+   SK TL       S G + EVV   +   
Sbjct: 290 -------------GVKEDPKPSQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++  ++    + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNLYTLRMLSVVSEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
 gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
          Length = 423

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 148/337 (43%), Gaps = 54/337 (16%)

Query: 26  LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
           L+  + AH+ D+D EK+ +  A IA L       L +P+     +   FS++ CN  T+ 
Sbjct: 120 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 177

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
           + EL  LG  ++P I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T
Sbjct: 178 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 237

Query: 142 MTRQKALKEQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLL 191
             R + L++ Y F+C C  C       +K+    + D+I E  I +  R   +    F  
Sbjct: 238 ADRLERLRDMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR- 296

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
                           R+K++              T  L  C    + +ST         
Sbjct: 297 ----------------RAKQD-----------KSPTELLEMCELSIDKMST--------- 320

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   +V ++    + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G
Sbjct: 321 -VFDDSNVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLG 379

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +L   L      I +  +A+ I+ + HG +  ++ EL
Sbjct: 380 RLYIALDRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 416


>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 334

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 94/169 (55%), Gaps = 11/169 (6%)

Query: 9   KLQNDNVIPSTTTDNYSLVEALVAHMSD-IDE------KQLLLYAQIANLVNLILQWPEI 61
           KL   +  PS    ++S +++ V  +S+ + E      K L LY ++   +  + Q P  
Sbjct: 141 KLLRQSTCPSEELYSFSDLQSNVEKLSEEMKEGLGHLAKTLQLYLKVE--IQDVSQLP-- 196

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
              ++ E F+K+ACN  +I N E++ +G GLYP +S++N+SC PN  +VFEG    +R++
Sbjct: 197 PALDVFETFAKVACNCFSISNGEMQDVGVGLYPSMSLLNNSCDPNCAIVFEGPELHLRSI 256

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 170
           + + +G E+ I Y+ET   T  RQ+ LK QY F C C  C    +  DI
Sbjct: 257 REIQEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDI 305


>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 454

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 150/339 (44%), Gaps = 61/339 (17%)

Query: 27  VEALVAHMSDI---DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHT 79
           V+ L  H++D+   D KQL      A++   +  W     + S++ I+  F  + CN  T
Sbjct: 129 VDQLQEHVADLAAEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFT 183

Query: 80  ICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
           + +   L+ +G GL+P + ++NH C PN  ++       +RA+  +P+G E+ +SY++  
Sbjct: 184 LSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIPEGEELTVSYVDFL 243

Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
             +  RQK LKE++ F CTC  C K                  KDD     +   +D K 
Sbjct: 244 NLSADRQKKLKERFHFDCTCEHCSK----------------HIKDD----LMTAVADSKP 283

Query: 199 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NHQEVVS-TYKMIEKLQKKL 253
                     S +++K    EV   SK++L          ++ EV+   Y+ ++K +  L
Sbjct: 284 ----------SADKVK----EVTAFSKESLEKIEKSRVERDYNEVLKLCYECLQKQENVL 329

Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPVYQRVYPQFHPLLGLQYYT 309
                 NL + R  ++ I  E+  ++ + A    Y    +  Y+++Y   +  LG+    
Sbjct: 330 A---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLYHPNNAQLGMAIMR 384

Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            G   W  G  E     + +A  IL +THG N    K+L
Sbjct: 385 AGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 423


>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
 gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
          Length = 614

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
           GL+P +SI+NHSC PN     EG +  VRAVQ +PKGAE+ +SYI    S   R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 151 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 203
              F CTC RC+   +  +      LEG  C   GC G  LR +   +D+     + C  
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412

Query: 204 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 257
           C  V   E ++       +++K   + +A     + +      K++E    +     HP 
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472

Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
            V L      L+     + D +E L  C+L +   ++V P     L   Y+  G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Equus caballus]
          Length = 490

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 142/320 (44%), Gaps = 56/320 (17%)

Query: 52  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 105
           V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210

Query: 106 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 152
           N  ++F  G    V+++ H            + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270

Query: 153 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 212
            F CTC  C K G  DD+                  FL              G+    + 
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDDPKP 297

Query: 213 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 268
            + +  EV   SK TL       + G + EVV   +   + Q+ ++   ++ +++    +
Sbjct: 298 SQDVVKEVVQFSKDTLEKIDKARTEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIV 357

Query: 269 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 328
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + 
Sbjct: 358 SEVLSYLQAFEEASYYARKMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417

Query: 329 EAVEILRITHGTNSPFMKEL 348
           +A  IL +THG + P  K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437


>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
          Length = 239

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTHD----------------------KDAD--------- 85

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKV 141

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 325 KSMTEAVEILRITHG 339
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
 gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
           Full=CD8b-opposite; AltName: Full=Zinc finger protein
           BOP; Short=m-BOP
 gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
           Methyltransferase Smyd1
 gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
 gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
 gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
          Length = 490

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/336 (23%), Positives = 148/336 (44%), Gaps = 52/336 (15%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
           V H  + ++K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L
Sbjct: 136 VEHFGEEEQKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
           + +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ +
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
           SYI+    +  R++ LK+QY F C+C  C K G  DD+                  FL  
Sbjct: 251 SYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAA 291

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
             D K           S+E +K++        +K     S G + EVV   +   + Q+ 
Sbjct: 292 KEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEP 341

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           ++   ++ +++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G 
Sbjct: 342 VFADTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGL 401

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             W  G  E     + +A  IL +THG + P  K+L
Sbjct: 402 TNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
 gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
          Length = 490

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 152/341 (44%), Gaps = 62/341 (18%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F C+C  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLHLSEERRQQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
              D K           S+E +K    E+   SK TL       S G + EVV   +   
Sbjct: 291 VKEDPKP----------SQEVVK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 337 EKQEPVFADTNLYVLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAV 396

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G  E     + +A  IL +THG + P  K+L
Sbjct: 397 MRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
 gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
          Length = 614

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 106/237 (44%), Gaps = 18/237 (7%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
           GL+P +SI+NHSC PN     EG +  VRAVQ +PKGAE+ +SYI    S   R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 151 QYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQ 202
              F CTC RC++ L    D      LEG  C   GC G  LR +   +D+     + C 
Sbjct: 356 TKHFDCTCDRCVEPLNTSID----RFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECD 411

Query: 203 QCGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHP 256
            C  +   E ++       +++K   + +A     + +      K++E    +     HP
Sbjct: 412 LCSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHP 471

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
             V L      L+     + D +E L  C+L +   ++V P     L   Y+  G++
Sbjct: 472 LHVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
          Length = 485

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 149/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 131 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 244

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F C+C  C K G  DD+                  FL 
Sbjct: 245 VSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 285

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 286 AKEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 335

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++ +++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 336 PVFADTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAG 395

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G  E     + +A  IL +THG + P  K+L
Sbjct: 396 LTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 432


>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
           norvegicus]
          Length = 239

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 67/255 (26%), Positives = 119/255 (46%), Gaps = 39/255 (15%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           +K L++QY F C C RC                              +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRCPTQD--------------------------KDAD--------- 85

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            ++   E+I K   EV    KK   L +    ++V++  + I     +     ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSERLPDINIYQLKV 141

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 325 KSMTEAVEILRITHG 339
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
          Length = 456

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 149/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 102 VEHFGEEEQKELRVD------VDTFLQYWPLQSQQFSMQYISHIFGVINCNGFTLSDQRG 155

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 156 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 215

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F C+C  C K G  DD+                  FL 
Sbjct: 216 VSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 256

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 257 AKEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 306

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++ +++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 307 PVFTDTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAG 366

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G  E     + +A  IL +THG + P  K+L
Sbjct: 367 LTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 403


>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
          Length = 500

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/373 (24%), Positives = 156/373 (41%), Gaps = 64/373 (17%)

Query: 10  LQNDNVIPSTTTDNY----SLVEALVAHMSDIDEKQLLLYAQIANLVNLI---------- 55
           L +   + + ++D+Y      V+A++ HM+ + +     +A    +V +           
Sbjct: 162 LMSGKAVNAMSSDDYVQDHDDVKAMIDHMAQMRKSNAQEFAGNQEIVRIAEHLLDMLQAR 221

Query: 56  ----LQWP-EISINEIAENFSKLACN-----AHTICNSELRPLGTGLYPVISIINHSCLP 105
               L W  + +  E+ +   K ACN     A  I +  +   G G+YP+ +I+NHSC P
Sbjct: 222 SPKGLDWSLKPTEEELLKVLCKFACNNFSHAARQIWDDLIVSHGMGVYPLGAILNHSCKP 281

Query: 106 NAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           N V+ +  E      R ++ +  G ++  SYI+ A  + TR++ L+  Y F C C +C K
Sbjct: 282 NCVIYYHPETHEQEFRCIEDIQVGEDICHSYIDLAAVSKTRKEKLQSTYYFDCDC-QCCK 340

Query: 164 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 223
             +  D +  A             G +    D         G   S+ EI+   S +  L
Sbjct: 341 FPEELDNKLGAR-----------DGKVTEKCDRAAELLAAAG---SRTEIEHALSRLKDL 386

Query: 224 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 283
            +  LA    GN                       ++ +  + K+++  +EL     A+ 
Sbjct: 387 DENVLA--DRGN---------------------VDLDRLSVKSKMLQASIELGMMDSAIR 423

Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
            C+  +  Y+ +YP  HPLLGLQ YT G L +  G  E A   + E  +IL  TH   S 
Sbjct: 424 ACKQVVEGYRGIYPPLHPLLGLQLYTLGNLLFDDGRGEEAADVLQEGQKILLATHDRRST 483

Query: 344 FMKELILKLEEAQ 356
            ++ +   L EA+
Sbjct: 484 MVQGITELLAEAR 496


>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Meleagris gallopavo]
          Length = 486

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 157/365 (43%), Gaps = 60/365 (16%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           ++ L  H+ + DE++   L   + + +     WP    +  +  I+  F  + CNA T+ 
Sbjct: 124 IDDLQNHVENFDEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P +   NH C PN  ++F  G    VR++ H            +  G
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPG 240

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  RQK LK+QY F CTC  C K                + KDD   
Sbjct: 241 DELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCKK----------------KIKDD--L 282

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
              +++ D K           S E +K    EV   SK TL   +     G + EVV   
Sbjct: 283 MLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLC 328

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   K Q+ +    ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  L
Sbjct: 329 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQL 388

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
           G+     G   W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++
Sbjct: 389 GMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 446

Query: 364 SSKDE 368
             ++E
Sbjct: 447 FQQNE 451


>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
           bisporus H97]
          Length = 674

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 94/196 (47%), Gaps = 35/196 (17%)

Query: 26  LVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
           +V ALV ++    +DE        IA LV+L+               S+   NA T+ + 
Sbjct: 250 MVHALVRYLGLESLDEFSSYGIDSIAGLVDLV---------------SRFTTNAFTVASP 294

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGR---------LAVVRAVQHVPKGAEVLISY 134
            L P+G  + P+I++ NHSC PNAV VF            +A V  ++ +P   EV+ISY
Sbjct: 295 TLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISY 354

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDIQESAILEGYRCKDDGCSGF-LL 191
           I+T  +   RQKALKE Y FTC+C  C K   G + D +ES       C  + C G   L
Sbjct: 355 IDTTLTKRERQKALKETYYFTCSCSLCEKPRDGAYVDSRES-----LYCPKN-CGGLCWL 408

Query: 192 RDSDDKGFTCQQCGLV 207
              DD    C +CG V
Sbjct: 409 PTEDDPLARCAKCGTV 424


>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
          Length = 349

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 83/147 (56%), Gaps = 6/147 (4%)

Query: 19  TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
           T ++    V    +H+  +D EK+ L+ + IA L       L++P+ S   +   F+++ 
Sbjct: 123 TPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVN 180

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SY
Sbjct: 181 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSY 240

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
           I+    T  R   L++ Y FTC C  C
Sbjct: 241 IDLLYPTEDRNDRLRDSYFFTCECREC 267


>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
          Length = 553

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 65/108 (60%)

Query: 56  LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
           +Q   I  +E+    +    N  T+ + E+  +G G YPV S+ NH+C PNAV+VF+G L
Sbjct: 178 VQQTSIDEDELVHYLNVFKSNNFTLDDQEMFAVGEGTYPVASLFNHTCRPNAVIVFDGAL 237

Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A +RA+  +    E+ ISYI+ A +   R++AL+E+Y F C C RC +
Sbjct: 238 AEIRAIDTIEPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTR 285


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 81/350 (23%), Positives = 147/350 (42%), Gaps = 48/350 (13%)

Query: 18  STTTDN-YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSK 72
           S  +DN  + +E L  H+ DI E  L  +    ++ N +  WP  S    ++ ++     
Sbjct: 114 SVVSDNQLTTLEDLEDHICDISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGV 171

Query: 73  LACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVL------------VFEGRLAV-V 118
           + CN   + +   L+ +G GL+P + ++NH C PN  +            VF  +  + +
Sbjct: 172 INCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQKRIEL 231

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
           RA+  +  G EV ++Y++    +  RQ+ LK+QY F CTC  C +  + D     A ++G
Sbjct: 232 RALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDG 291

Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
            +  +                           E++K++        +K        N+ E
Sbjct: 292 VKVPE---------------------------EQVKEVTEFSRQKLEKMEKARIEANYNE 324

Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
           VV   +   + Q+ +     +  ++    L ++L  L+ + EA  Y +  +  Y ++Y  
Sbjct: 325 VVKICRECVEKQENVLADTHIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHP 384

Query: 299 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            +  LG+     G   W  G  E A   + +A  IL ITHG   P  K+L
Sbjct: 385 NNAQLGMATMRAGVTHWQAGFIEVAHGMICKAFAILMITHGPTHPITKDL 434


>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
           leucogenys]
          Length = 512

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 80/336 (23%), Positives = 150/336 (44%), Gaps = 52/336 (15%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
           V H  + ++K+L +      L   + Q  + S+  I+  F  + CN  T+ +   L+ +G
Sbjct: 158 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 216

Query: 90  TGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIE 136
            G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+
Sbjct: 217 VGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYID 276

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               +  R++ LK+QY F CTC  C K                + KDD    FL      
Sbjct: 277 FLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------ 311

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKK 252
                   G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ 
Sbjct: 312 --------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 363

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G 
Sbjct: 364 VFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGL 423

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 424 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 459


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 87/159 (54%), Gaps = 7/159 (4%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L +P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
            + +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            +  G EV  SYI+    T  R   L++ Y FTC C  C
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267


>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 361

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 41/286 (14%)

Query: 77  AHTIC--NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           +HT+   + EL  LGT +YP +++INHSCLP+ ++ + G  A VRAV+ +  G EVLISY
Sbjct: 109 SHTLFSEDEELSHLGTAVYPDVALINHSCLPSVIVTYNGTSADVRAVRDMNPGDEVLISY 168

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
           I+    T  R   L+E Y FTC C  C                G +  D   +   LR  
Sbjct: 169 IDVLYPTEDRNTRLRESYYFTCQCQEC----------------GSQSSDQ--AKLKLRKR 210

Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL- 253
            D               E + I + V    K         N     ST  ++E  ++ L 
Sbjct: 211 SDPA-------------EAEVINTMVRYARKCIREFRVFKNSNTPAST--LLEMCEQSLD 255

Query: 254 -----YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 308
                +   +V ++    + + + + ++D   AL Y +     Y+++YP +   L   Y 
Sbjct: 256 EMGAVFDESNVYMLHMMYQAMGVCLYMQDPDGALRYGEKIAKYYRKLYPAYSLNLSSLYL 315

Query: 309 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
             G+L + +      I  + +A  I+ +THG N  ++ EL  +++E
Sbjct: 316 KLGRLYFGMERNSECISVLKKAKAIMEVTHGKNHFYVTELDRQMKE 361


>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 643

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 93/196 (47%), Gaps = 35/196 (17%)

Query: 26  LVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
           +V ALV ++    +DE        IA LV+L+               S+   NA T+ + 
Sbjct: 217 MVHALVRYLGLESLDELSSYGIDSIAGLVDLV---------------SRFTTNAFTVASP 261

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGR---------LAVVRAVQHVPKGAEVLISY 134
            L P+G  + P+I++ NHSC PNAV VF            +A V  ++ +P   EV+ISY
Sbjct: 262 TLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISY 321

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDIQESAILEGYRCKDDGCSGF-LL 191
           I+T  +   RQKALKE Y FTC C  C K   G + D +ES       C  + C G   L
Sbjct: 322 IDTTLTKRERQKALKETYYFTCNCSLCEKPRDGAYVDSRES-----LYCPKN-CGGLCWL 375

Query: 192 RDSDDKGFTCQQCGLV 207
              DD    C +CG V
Sbjct: 376 PTEDDPLARCAKCGTV 391


>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Taeniopygia guttata]
          Length = 478

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 156/348 (44%), Gaps = 39/348 (11%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           +E L  H+    +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ 
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
           +   L+ +G G++P +   NH C PN  +VF      +RA+  +  G E+ +SY++    
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGKIELRALNKISPGDELTVSYVDFLSL 245

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
           +  R++ LK+QY F CTC  C K                + KDD        +S     T
Sbjct: 246 SEERRRQLKKQYYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSADT 289

Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
            ++  +  SK+ ++KI       +K  +     G++ EVV   +   + Q+ +    ++ 
Sbjct: 290 VKEV-IQLSKDTLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIY 337

Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
           L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 338 LLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLI 397

Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
           E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 398 EVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443


>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
          Length = 464

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 124/283 (43%), Gaps = 19/283 (6%)

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
           + N+  I   E   LG GLY  +S INHSC PN  + F G  A VR+V  +P   E+ I+
Sbjct: 191 SANSFGISGVEGNNLGVGLYDTVSYINHSCAPNCSITFSGVYARVRSVHDLPPNQELTIA 250

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 189
           YI+       R+  LK Q++F C C RC +  + DD     +     C   GC       
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRCER--ERDDDPLLTLC----CPGSGCDQIIKYQ 304

Query: 190 LLRDSDDKGF-TCQQC--GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 246
           L    DD  F   ++C  G +         + +V    K+ L +    N         M 
Sbjct: 305 LTAPPDDTAFEVTKKCERGHLDDPATFDSWSRKV----KRVLGIIGEANKARRPVQGDMA 360

Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
               ++L  P +  +      ++   +    ++EAL + +  I VY  +     PL G++
Sbjct: 361 SF--ERLLPPTNYLMHIAYTGVLDQAIGTGQFQEALQWARKVILVYDALNSPMDPLRGIE 418

Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
                KL+    D    ++++ +AV+++R+THG +S   ++++
Sbjct: 419 LLRVAKLQALARDGRGFLETIQKAVDVIRLTHGEDSVVYQQVL 461


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 146/323 (45%), Gaps = 39/323 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
           V H  + ++K+L L     +L + +  WP    + ++  I+     + CN  T+ +   L
Sbjct: 131 VEHFGEEEKKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           + +G G++P + ++NH C PN  ++F      +RA+  +  G E+ +SY++       R+
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIFNNGKIELRALGKISIGEELTVSYVDFLNVCEDRR 245

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + LK+QY F CTC  C K G  DD+   A+ E     ++  S  +++D            
Sbjct: 246 QQLKKQYYFDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV----------- 287

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
           +  SKE I+KI               S G ++EVV   +   + Q+ +    ++ +++  
Sbjct: 288 IQFSKEAIEKIDK-----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIYMLRIL 336

Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
             L ++L  L+   EA  Y +     Y ++Y   +  LG+     G   W  G  E    
Sbjct: 337 SILSEVLSYLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHG 396

Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
            + +A  IL +THG   P  K+L
Sbjct: 397 MICKAYAILLVTHGPTHPITKDL 419


>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
 gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
          Length = 486

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 156/365 (42%), Gaps = 60/365 (16%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           ++ L  H+   DE++   L   + + +     WP    +  +  I+  F  + CNA T+ 
Sbjct: 124 IDDLQNHVESFDEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P +   NH C PN  ++F  G    VR++ H            +  G
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPG 240

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  R+K LK+QY F CTC  C K                + KDD   
Sbjct: 241 DELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK----------------KIKDD--L 282

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
              +++ D K           S E +K    EV   SK TL   +     G + EVV   
Sbjct: 283 MLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLC 328

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   K Q+ +    ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  L
Sbjct: 329 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQL 388

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
           G+     G   W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++
Sbjct: 389 GMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 446

Query: 364 SSKDE 368
             ++E
Sbjct: 447 FQQNE 451


>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
           griseus]
          Length = 498

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 80/337 (23%), Positives = 149/337 (44%), Gaps = 54/337 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ L++QY F C+C  C K G  DD+                  F  
Sbjct: 250 VSYIDFLHLSEERRRQLRKQYYFDCSCEHCQK-GLKDDL------------------FQA 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S E +K++        +K     S G + EVV   +   + Q+
Sbjct: 291 VKEDPKP----------SPEVVKEMVQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 SVFADTNLYVLRLLSIVSEVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAG 400

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 1-like [Pongo abelii]
          Length = 490

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/336 (23%), Positives = 149/336 (44%), Gaps = 52/336 (15%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
           V H  + ++K+L +      L   + Q  + S+  I+  F  + CN  T+ +   L+ +G
Sbjct: 136 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 194

Query: 90  TGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIE 136
            G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+
Sbjct: 195 VGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYID 254

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
               +  R++ LK+QY F CTC  C K                + KDD    FL      
Sbjct: 255 FLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------ 289

Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKK 252
                   G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ 
Sbjct: 290 --------GVKDNPKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 341

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           ++   ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G 
Sbjct: 342 VFADTNIYTLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGL 401

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 402 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 765

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 58  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           WP     E+A+  ++LA N HT+C+ ELRPLGT LYP  ++ NHSC P+ V VF GR   
Sbjct: 94  WP--GFKEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVFRGRTLQ 151

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
           +RA++ +  G EV + Y+E A +   R++AL + Y F
Sbjct: 152 LRALRPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 146/308 (47%), Gaps = 30/308 (9%)

Query: 62  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           S  E+A  F+ + CNA  + +   ++ +G G+YP IS++NH C PN V +  G    VRA
Sbjct: 182 SDEEMAHLFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNNGPRLEVRA 241

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           ++ +  G E+ ISYI++  +T  R++ LK QY F C C  C K  + ++++ + + E  +
Sbjct: 242 LRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEELENLKHALVSEDIK 301

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
              +    ++ + S D     ++    +  +  +++ ++    +  TLA   C     V+
Sbjct: 302 ---EESVKYINQFSKD---MLKRIHKTKQNQNWERMCNQ----TLGTLAQQDC-----VI 346

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
           +   +++     L H   V     R++      E    + A AY +L  PV        H
Sbjct: 347 ADTNVLKIAM--LNHAVEVQSFLRRQEPALEYAE----RVAAAYEKLLPPV--------H 392

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
           P LG+     G ++W +  TE A+ ++  A  I+  THG +    KEL+  + + + EA 
Sbjct: 393 PTLGMYLMRLGVIQWQIQKTEAAVATLGRAASIISKTHGDDHGMFKELLGLIHQCKMEAH 452

Query: 361 YKLSSKDE 368
               ++ E
Sbjct: 453 MSKGAQRE 460


>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
          Length = 491

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 157/365 (43%), Gaps = 60/365 (16%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           ++ L  H+   DE++   L A + + +     WP    +  +  I+  F  ++CN  T+ 
Sbjct: 129 IDDLQNHVDSFDEEEKKELRADVESFLEF---WPPHSQQFGMQYISHIFGVISCNGFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P +   NH C PN  ++F  G    VR++ H            +  G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  R+K LK+QY F CTC  C K                + KDD   
Sbjct: 246 DELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK----------------QIKDD--- 286

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
             +L   + +G          S E +K    EV   SK TL   S     G + EVV   
Sbjct: 287 -LMLAVKEGEGKP--------SAETVK----EVIQFSKDTLEKISKARMEGLYHEVVKLC 333

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   K Q+ +    ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  L
Sbjct: 334 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQL 393

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
           G+     G   W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++
Sbjct: 394 GMAVMRAGVTHWHAGLIEAGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 451

Query: 364 SSKDE 368
             ++E
Sbjct: 452 FQQNE 456


>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 497

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 89/171 (52%), Gaps = 8/171 (4%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP- 59
           +L++ L +K Q D        D     ++L +H+++    +   +  +      I ++  
Sbjct: 123 VLRILLLKKHQEDQ------GDRLQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSK 176

Query: 60  -EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
            E+S+ E+ + F KL  NA T+       +G  + P+ S+ NHSC PNA + F+     +
Sbjct: 177 TELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWL 236

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
           RA+QH+ +G ++ +SY+E   + + RQ  L ++Y F C CPRCI+  +  D
Sbjct: 237 RALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEIGD 287


>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
          Length = 497

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 89/171 (52%), Gaps = 8/171 (4%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP- 59
           +L++ L +K Q D        D     ++L +H+++    +   +  +      I ++  
Sbjct: 123 VLRILLLKKHQEDQ------GDRMQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSK 176

Query: 60  -EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
            E+S+ E+ + F KL  NA T+       +G  + P+ S+ NHSC PNA + F+     +
Sbjct: 177 TELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWL 236

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
           RA+QH+ +G ++ +SY+E   + + RQ  L ++Y F C CPRCI+  +  D
Sbjct: 237 RALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEVGD 287


>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
          Length = 433

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 92/168 (54%), Gaps = 11/168 (6%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYAQIANLVNLILQWPE 60
           ++L  R  L+       T ++   L++   +H+  +D  K  +  A IA L +   ++  
Sbjct: 106 VRLVARIILKQKVTTEQTPSERLLLLKDFESHLDKMDSIKDEMNQADIAALHHFYSRY-- 163

Query: 61  ISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 114
             IN+I ++      F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G 
Sbjct: 164 --INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCNPNVIVTYKGT 221

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           +A VRAVQ +  G E+L SYI+    T  R++ L + Y FTC C  C 
Sbjct: 222 VAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCSECF 269


>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
          Length = 449

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 143/315 (45%), Gaps = 35/315 (11%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE 84
           V+ L  H+ +  +E+Q  L   +   +   L Q  + S+  I+  F  + CN  T+ +  
Sbjct: 129 VDDLQNHVENFGEEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQR 188

Query: 85  -LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
            L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  
Sbjct: 189 GLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEE 248

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           R+K LK+QY F CTC  C K G  DD+                  FL    D K      
Sbjct: 249 RKKQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----- 284

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
                S++ +K+I        +K     S G + EVV   +   + Q  ++   ++ +++
Sbjct: 285 -----SQDVVKEIIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQDSVFADTNIYMLR 339

Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
               + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E A
Sbjct: 340 VLSIVSEVLSYLQAFEEASYYARRMVDGYVKLYHPNNAQLGMAVMRAGLTNWHAGNIE-A 398

Query: 324 IKSMTEAVEILRITH 338
           ++  TE    LR+ H
Sbjct: 399 MRMQTEME--LRLFH 411


>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Ornithorhynchus anatinus]
          Length = 490

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 55/313 (17%)

Query: 58  WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
           WP    + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F 
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217

Query: 113 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
            G    V+++ H            + +G E+ +SYI+    +  R+K LK+QY F CTC 
Sbjct: 218 NGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCE 277

Query: 160 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
            C K G  DD+                  FL              G+    +  + +  E
Sbjct: 278 HCEK-GIKDDL------------------FL--------------GVKDEPKPSQDVVKE 304

Query: 220 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
           V   SK  L       S G + EVV   +   + Q+ +    ++ L++      ++L  L
Sbjct: 305 VIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYL 364

Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL 
Sbjct: 365 QSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILL 424

Query: 336 ITHGTNSPFMKEL 348
           +THG   P  K+L
Sbjct: 425 VTHGPTHPITKDL 437


>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
           [Oryzias latipes]
          Length = 476

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 130/293 (44%), Gaps = 32/293 (10%)

Query: 62  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +       VR 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGKIEVRT 225

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +  + +G E+ ISY++    +  RQK LKEQ+ F C+C  C +    DD+  +A      
Sbjct: 226 LGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS-DDLMMAAA----- 279

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSC----GN 235
              DG         D   F      L R S E+I+    EV+  SK +L         G+
Sbjct: 280 ---DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKDSLEKIEKSRLDGD 319

Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
           + EV+   +   + Q+ +     ++ ++      ++L  L  + EA    +  +  Y ++
Sbjct: 320 YSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKL 379

Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           Y   + LLG+     G   W  G  E     + +A  IL +THG N    ++L
Sbjct: 380 YHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRDL 432


>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
          Length = 489

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 114/260 (43%), Gaps = 38/260 (14%)

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
            E+   +  +  N+ T+      P+G    P+ + INH C+PNAV++F GR+  +RA++ 
Sbjct: 170 GEVRRLYCAVLVNSMTLVTETFDPIGIAFDPLTASINHDCIPNAVMLFNGRILQIRALED 229

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           +P  AEVLISYI+       RQ  L E+Y FTCTC RC           +  ++G  C  
Sbjct: 230 IPANAEVLISYIDNTVPRERRQLELSEKYFFTCTCSRCTG--------PAGPMDGLLC-- 279

Query: 184 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
             C G  +     +   C +C             ++V IL +   +       + ++S  
Sbjct: 280 -ACGGVGM--PSREAVLCNRCA------RALPATADVEILEQTAWSALDSDKSEAMISAL 330

Query: 244 KMIEK-----LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV--- 295
           + + K     L ++      + L+Q         +   +W+ AL +  L   +Y R+   
Sbjct: 331 RHLHKSPMWPLPRQPLAALHLALVQGE------YIPSGNWERALLHHLL---IYLRLDPV 381

Query: 296 -YPQ-FHPLLGLQYYTCGKL 313
            YPQ FHP+  +  +T   L
Sbjct: 382 FYPQRFHPMRVVHGFTLANL 401


>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
           rubripes]
          Length = 434

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 92/168 (54%), Gaps = 11/168 (6%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYAQIANLVNLILQWPE 60
           ++L  R  L+       T ++   L++   +H+  +D  K  +  A IA L +   ++  
Sbjct: 106 VRLVARIILKQKVTTEQTPSERLLLLKDFESHLDKMDSIKDEMNQADIAALHHFYSRY-- 163

Query: 61  ISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 114
             IN+I ++      F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G 
Sbjct: 164 --INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCNPNVIVTYKGT 221

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           +A VRAVQ +  G E+L SYI+    T  R++ L + Y FTC C  C 
Sbjct: 222 VAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCSECF 269



 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 21/99 (21%), Positives = 49/99 (49%)

Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
           +V ++    + + + + ++DW+ A+ Y +  I  Y   YP +   +   Y   G L   L
Sbjct: 336 NVYMLHMMYQAMGVCLYMQDWEGAMRYGEKIIHPYSFHYPAYSLNVASMYLKLGHLYLGL 395

Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
                 +K++ +A+ I+ + HG +  ++ E+  ++EE +
Sbjct: 396 EKRTQGVKALKKALSIMEVAHGKDHYYVAEVKREIEEQK 434


>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 353

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 138/326 (42%), Gaps = 40/326 (12%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPV-ISIINHSCLPNAVLVFEGRLAVVRAV 121
           +  + E F K++ N   I ++EL P G G+Y   I   NHSCL N + +F+ +   +R +
Sbjct: 19  LKNVIEKFIKVSKNTLQIIDNELEPCGLGIYKKPIPYFNHSCLSNCITIFKNQRLYIRTL 78

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP---------------------- 159
             +  G E+ ISY++ A     R     +QY FTCTC                       
Sbjct: 79  MDIYPGEELTISYLDVAFDRNARLAICADQYFFTCTCKLCKVNIASECHNIFNNDFICTQ 138

Query: 160 --RCIKLGQFDDIQESAILE---GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 210
              C K   + +I   + LE    Y  K    +  +L+ S +K  T  +C L +++    
Sbjct: 139 SENCKKFINYMEIVLMSELERKISYLNKSHFKTFPILKKSTEKNETVWKCMLCKNETNDS 198

Query: 211 ------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLM 262
                 E+ K+   EV  L        S  N   + S  K+  K+      YH    +L 
Sbjct: 199 VIKALIEKEKETIKEVEYLDTLFAEKYSYDNKNVLQSLTKIKSKIDDLTDFYHHSKYSLQ 258

Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
           + R K++ + ++L+D+K A +     +   +  Y ++ P+ G   +  GKL  FL     
Sbjct: 259 KMRAKILYVSIQLQDFKLAYSIATQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSA 318

Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
            +  + +A + +  T+G  SP  K+L
Sbjct: 319 GLSLIHKAKKNIIKTYGPESPIYKDL 344


>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
           ND90Pr]
          Length = 529

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 141/323 (43%), Gaps = 50/323 (15%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R ++ + 
Sbjct: 199 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLISIRTLRPIR 258

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
           K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +        
Sbjct: 259 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQ-------- 301

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
                    +D          V SKEE++ +AS    +      +    N + V+   K 
Sbjct: 302 ---------EDNWLVPANSDFVASKEELEALASNQEQMFTLYEQVQGTPNAEAVIPVLKE 352

Query: 246 IEKLQKK-----LY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP- 297
             K  ++     LY  P++      R++LI  L+ +  ++EA A+       +  ++YP 
Sbjct: 353 SLKACRESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAHSAKRYKYILPKLYPV 408

Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFM 345
            FHP+  +Q +    L  +L  TE  + +            + + +++  ++HG ++ F 
Sbjct: 409 PFHPVRVVQTWQMAMLAAYLAGTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHGQDNAFT 468

Query: 346 KELILKLEEAQAEASYKLSSKDE 368
           K ++ K  E   E    + + D+
Sbjct: 469 KSVMEKTREMVEELKRSVGNPDK 491


>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
          Length = 527

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 69/274 (25%), Positives = 120/274 (43%), Gaps = 34/274 (12%)

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
           ++++NHSC PN ++ + G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 270 VALMNHSCCPNVIVTYRGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 329

Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 211
           C C  C   G     Q+ A LE  R   D      +RD            +VR      E
Sbjct: 330 CECRECSTKG-----QDQAKLE-IRKLSDPPRAEAVRD------------MVRYARGVIE 371

Query: 212 EIKKIASEVNILSKKTLALTSCGNH------------QEVVSTYKMIEKLQKKLYHPFSV 259
           E ++      IL            H             E++   ++ ++    ++   +V
Sbjct: 372 EFRRAKHYKYILGCSVQGPMPRSPHCEGREAGPGSPPSELLEICELSQEQMSSVFEDSNV 431

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            ++    + + + + L+DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +
Sbjct: 432 YMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLEN 491

Query: 320 TENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
                K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 492 RAAGEKALKKALAIMEVAHGKDHPYISEIRREIE 525


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 82/345 (23%), Positives = 154/345 (44%), Gaps = 57/345 (16%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
           VE L  H++D+ E ++  L   I N ++    WP  S    I++I+  F  + CN  T+ 
Sbjct: 124 VEELEDHVADMQEDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVS 180

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKG 127
           +   L+ +G GL+P + ++NH+C PN  V++  G  + V            R++  + +G
Sbjct: 181 DQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEG 240

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ ++Y++    +  R++ LK QY F C C  C K G  DD++    L G         
Sbjct: 241 EELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYC-KNGTKDDLK----LAG--------- 286

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS----EVNILSKKTLALTSCGNHQEVVSTY 243
               R+ D         G+  S++++K+       ++  + K  L     G++ +VV   
Sbjct: 287 ----REVD---------GVKPSEQQVKEATDYCFQKLETMDKARLD----GDYHQVVKIC 329

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           + +    + +     + L++    L ++   L+ + +A  Y +  +  Y ++Y   +  L
Sbjct: 330 RDVIDRTEPVLADTHIYLLRMWSTLSEVQAYLQYFNDAAEYSRKMVEGYMKLYHPNNAAL 389

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G+     G   W  G  E     + +A  IL +THG   P  K+L
Sbjct: 390 GMAAMRAGVNHWQAGLIEVGHGMVCKAYAILLVTHGPTHPITKDL 434


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 2/125 (1%)

Query: 65  EIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
           EI + F ++ CNA ++ N      +G GLYP  ++ NH C PN V+ F+GR   VR V+ 
Sbjct: 173 EILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGREMQVRVVRD 232

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           +    E+ +SY+E   ST  R++ LKE Y F C C RC K        E   L+G RC +
Sbjct: 233 IEVDEELTVSYVELLQSTKARRRELKESYFFDCECKRC-KAATNGQSNEDWYLDGLRCSN 291

Query: 184 DGCSG 188
             C+ 
Sbjct: 292 KKCAS 296


>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 517

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 43/327 (13%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
           +  I E F K++ N   I ++EL P G G Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 184 LRNIIEKFIKISKNTLQIIDNELEPAGLGFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +  V  G E+ ISY++ A    TR     +QY FTCTC  C K+    + Q     E   
Sbjct: 243 LMDVYPGEELTISYVDIAFDKNTRLAICMDQYFFTCTCKLC-KVNVVSECQNVFNTEFVC 301

Query: 181 CKDDGCSGF------------------------------LLRDSDDKGFTCQQCGLVRSK 210
            + + C  +                              L + ++   + C  C      
Sbjct: 302 TQSESCKKYVNHMELILISELERKSCYVNKSQFKNSYPVLKKANEQHVWKCSICKNEVHD 361

Query: 211 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNL 261
             IK +  +     K+T  L S  N +       +++ L K           YH    +L
Sbjct: 362 NVIKSLLEKEKETVKETAHLESLFNEKYTYDNKSVLQSLNKIKSKIDYLTTFYHHTRYSL 421

Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
            + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL    
Sbjct: 422 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 481

Query: 322 NAIKSMTEAVEILRITHGTNSPFMKEL 348
             +  + +A + +  T+G +SP  K+L
Sbjct: 482 EGLSLIHKAKKNIIKTYGPDSPIYKDL 508


>gi|413933332|gb|AFW67883.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 190

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 42/73 (57%), Positives = 60/73 (82%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177

Query: 61  ISINEIAENFSKL 73
           + + EIA+ FSK 
Sbjct: 178 LDLKEIAQTFSKF 190


>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 89/352 (25%), Positives = 153/352 (43%), Gaps = 74/352 (21%)

Query: 27  VEALVAHMSDI---DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHT 79
           V+ L  H++D+   D KQL      A++   +  W     + S++ I+  F  + CN  T
Sbjct: 129 VDQLQEHVADLAAEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFT 183

Query: 80  ICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVP 125
           + +   L+ +G GL+P + ++NH C PN  V++  G  + V            RA+  +P
Sbjct: 184 LSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVSSALHSQRRIELRALGKIP 243

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
           +G E+ +SY++    +  RQK LKE++ F CTC  C K                  KDD 
Sbjct: 244 EGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK----------------HIKDD- 286

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NHQEVVS 241
               +   +D K           S +++K    EV   SK++L          ++ EV+ 
Sbjct: 287 ---LMTAVADSKP----------SADKVK----EVTAFSKESLEKIEKSRVERDYNEVLK 329

Query: 242 -TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPVYQRVY 296
             Y+ ++K +  L      NL + R  ++ I  E+  ++ + A    Y    +  Y+++Y
Sbjct: 330 LCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLY 384

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              +  LG+     G   W  G  E     + +A  IL +THG N    K+L
Sbjct: 385 HPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 436


>gi|312076846|ref|XP_003141043.1| MYND finger family protein [Loa loa]
 gi|307763792|gb|EFO23026.1| MYND finger family protein [Loa loa]
          Length = 439

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 167/351 (47%), Gaps = 32/351 (9%)

Query: 23  NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHT 79
           N  +  +L +H  DI  DE++   +  I +++   L + ++  N EI + F K+  NA  
Sbjct: 107 NGRVFASLKSHTVDIEKDEEKRNGFIAIIHVIKDYLPFDKMPSNTEIFDIFCKILINALV 166

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIET 137
           I +S L  +G  +Y  +S ++HSC P+A ++F G  A++R++          + I Y + 
Sbjct: 167 ITDSCLNRIGLAVYLGLSALDHSCKPDAFIIFSGAKAILRSLNENITEYNDNLHIPYCDL 226

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
                 R +AL++Q+ F C C  C     FD  ++ + +   +C D  C       S D 
Sbjct: 227 LDLRSARCEALQKQHNFVCNCDIC---QDFDLDRQKSSVRCTKCTDGFCPY-----SPDD 278

Query: 198 GFTCQQCGL-----VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
             T ++C +     V + + ++K+  ++      T    +  N  E++  Y  +E    +
Sbjct: 279 DHTVKRCKVCHEISVFNSDHVQKLYQQL------TAPRPAEKNLNELIDLYHELE----E 328

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           ++ P++V L +  E ++   +  E + EA+ Y + T+  Y+  YP+ HP   ++ +   K
Sbjct: 329 VFSPYNVPLCKLAESIMISALNNEKYDEAVEYAEKTLLCYRTYYPKGHPSPSVRMFEYAK 388

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPF---MKELILKLEEAQAEAS 360
           L   L     ++  + +A++++  ++G+ S F      L+  LE+  + AS
Sbjct: 389 L-LMLQHNRESLPVLRKALKMICESYGSESSFAFNAATLLSDLEKCVSTAS 438


>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
           heterostrophus C5]
          Length = 508

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/387 (21%), Positives = 164/387 (42%), Gaps = 54/387 (13%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI 61
           ++L  RRK     +IP     ++ L+  L +H+ D          ++  +       P+ 
Sbjct: 118 MELLTRRK---HGLIPDK---DWELLCQLPSHIDDFKRNATYQNIEMMAMGAAQFSLPQN 171

Query: 62  SINE--IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
             ++  +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R
Sbjct: 172 VFDKDFVAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLVSIR 231

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
            ++ + K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +  
Sbjct: 232 TLRPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQED 282

Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
                G S F+      +     Q  +    E+++ + +   ++     AL +C      
Sbjct: 283 NWLVPGNSDFVASKEQVQALASNQEQMFTLYEQVQGMPNAEAVIPVLKEALKACH----- 337

Query: 240 VSTYKMIEKLQKKLY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
                  E     LY  P++      R++LI  L+ +  ++EA A+   +   Y+ + P+
Sbjct: 338 -------ESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAH---SAKRYKYILPK 383

Query: 299 -----FHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTN 341
                FHP+  +Q +    L  +L  TE  + +            + + +++  ++H  +
Sbjct: 384 LYSVPFHPVRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHSQD 443

Query: 342 SPFMKELILKLEEAQAEASYKLSSKDE 368
           + F K ++ K  E   E    + + D+
Sbjct: 444 NAFTKSVVEKTREMVEELKRSVGNPDK 470


>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
          Length = 476

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 144/323 (44%), Gaps = 54/323 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++KQL    Q+   V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VMHFGEEEQKQL----QVD--VDTFLQYWPPKSQQFSMQYISNIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K G  DD+                    L
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK-GLKDDL-------------------FL 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
           R  DD            S++ +K++    N   +K     S G + EVV   +   + Q+
Sbjct: 290 RVKDDPK---------PSQDVVKEMIQFSNDTLEKIDKARSEGLYHEVVKLCRECLEKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 SVFADTNLYMLRMLSIVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400

Query: 312 KLEWFLGDTENAIKSMTEAVEIL 334
              W  G+ E     + +A  IL
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAIL 423


>gi|444714585|gb|ELW55465.1| N-lysine methyltransferase SMYD2 [Tupaia chinensis]
          Length = 494

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 116/264 (43%), Gaps = 34/264 (12%)

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
           ++++NHSC PN ++ + G LA VRAV  +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 257 VALMNHSCCPNVIVTYRGTLAEVRAVHEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 316

Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 211
           C C  C                    KD   +   +R  +D         +VR      E
Sbjct: 317 CECHECTT------------------KDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIE 358

Query: 212 EIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 269
           E ++      IL  S  +  L  C   QE +S+          ++   +V ++    + +
Sbjct: 359 EFRRAKHYKYILYNSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAM 408

Query: 270 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 329
            + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +
Sbjct: 409 GVCLYMQDWEGALRYGQRIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKK 468

Query: 330 AVEILRITHGTNSPFMKELILKLE 353
           A+ I+ + HG + P++ E+  ++E
Sbjct: 469 AIAIMEVAHGKDHPYISEIKQEIE 492


>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 456

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 126/291 (43%), Gaps = 31/291 (10%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQH 123
           + +   +  CN  +I +  L P G G+YP  +I+NHSC PN V+ + G L    V+A++ 
Sbjct: 183 VMDVLQRADCNNFSIWDELLLPRGAGVYPWGAILNHSCEPNCVMTYRGPLHAQAVKALRD 242

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK--LGQFDDIQESAILEGYRC 181
           +  G E+  SYI+    T  R   L +QY F C C   +   L + D + E    E    
Sbjct: 243 IAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDCALYLDGALAELDALPEVTAAEMR-- 300

Query: 182 KDDGCSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
                 G +L + + +    Q+   L R+ E +K  A                   +E+V
Sbjct: 301 ----LPGKMLHELECEAEERQRLLSLYRAAETLKDYAWST--------------PEEELV 342

Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILME---LEDWKEALAYCQLTIPVYQRVYP 297
              +    L ++L HP +++L     +L  +  E   LED   AL   Q     Y  VYP
Sbjct: 343 CLLQGYNIL-RRLAHPANISLTSIMTRLQNVATECGRLEDI--ALPVGQHLALAYDHVYP 399

Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           + +PL GLQYY  G +         A+     A ++LR TH  +   + +L
Sbjct: 400 EHNPLCGLQYYRLGDVANLAQKQALALLWHRRAYQVLRTTHDADHWLLTQL 450


>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 425

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 6/123 (4%)

Query: 65  EIAENFSKLACNAHTICNSEL-RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
           EI + F +++CN  +I N     P+G GL+   S+ NH C PN V+ F+G+   V  ++ 
Sbjct: 172 EILKLFGQVSCNGFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKD 231

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           V +G E+ ISY+E   ST  RQK LK+ Y F C C RC       +  +   L+G +C +
Sbjct: 232 VKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTT-----ETTDDWYLDGLQCGN 286

Query: 184 DGC 186
            GC
Sbjct: 287 KGC 289


>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
          Length = 229

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/255 (21%), Positives = 110/255 (43%), Gaps = 28/255 (10%)

Query: 99  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
           +NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC C
Sbjct: 1   MNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 60

Query: 159 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 218
             C                              +D D      ++       E I+ +  
Sbjct: 61  RECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIRDMVR 92

Query: 219 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
               + ++        +  E++   ++ ++    ++   +V ++    + + + + ++DW
Sbjct: 93  YARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDW 152

Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
           + AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + H
Sbjct: 153 EGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAH 212

Query: 339 GTNSPFMKELILKLE 353
           G + P++ E+  ++E
Sbjct: 213 GKDHPYISEIKQEIE 227


>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
           castellanii str. Neff]
          Length = 365

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 72/126 (57%), Gaps = 3/126 (2%)

Query: 23  NYSLVEALVAHMSDIDEKQLLLYAQIANLV--NLILQWP-EISINEIAENFSKLACNAHT 79
           N+ L+  L  H+     ++   +  +A L+  +L    P +I+  EI   F   ACN  T
Sbjct: 59  NFDLIRYLTTHIDAFPSEKKEHFGVMAALIKKSLPATLPTDITPQEIMHLFCLFACNNFT 118

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
           + + EL+PLG G+YP  ++INHSC PN V++FEGR   VR+++ + K  E+  +Y+E   
Sbjct: 119 VSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQCTVRSLRDITKDEEITFNYVEVGD 178

Query: 140 STMTRQ 145
            T TR+
Sbjct: 179 PTETRR 184



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 56/85 (65%)

Query: 278 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 337
           W+EAL YC  ++ V++R+YP   PL+GLQY   GKL ++L  TE A++SM  A+ IL IT
Sbjct: 269 WREALLYCSASLAVFERIYPPRWPLVGLQYLIHGKLSFYLKYTEAALESMQRALPILTIT 328

Query: 338 HGTNSPFMKELILKLEEAQAEASYK 362
           H +  P ++ L   L EA AE +Y+
Sbjct: 329 HSSGHPLVRTLHTMLAEATAEWNYE 353


>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
           [Taeniopygia guttata]
          Length = 490

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 157/365 (43%), Gaps = 61/365 (16%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           +E L  H+    +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ 
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P +   NH C PN  +VF  G    VR++ H            +  G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  R++ LK+QY F CTC  C K                + KDD   
Sbjct: 246 DELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDD--- 286

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
             +L   D             S + +K    EV  LSK TL   +     G++ EVV   
Sbjct: 287 -LMLAVKDSP---------TPSADTVK----EVIQLSKDTLEKINKARMEGHYHEVVKLC 332

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
           +   + Q+ +    ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  L
Sbjct: 333 RDCLQKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQL 392

Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
           G+     G   W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++
Sbjct: 393 GMAVMRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 450

Query: 364 SSKDE 368
             ++E
Sbjct: 451 FQQNE 455


>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 666

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 15/195 (7%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N + + +S+   +G GLYP +++ NHSC P  V  F G   +VRA + + KG  V  +Y 
Sbjct: 449 NKNILDDSKSLFVGGGLYPTLALFNHSCDPGIVRYFIGTQVIVRAAKPIKKGEIVAENYG 508

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKL-GQFDDIQESAILEGYRC-------KDDG 185
            I +    + RQ+ LK QY FTCTC  C++L   FD++    I   +RC       K + 
Sbjct: 509 PIYSQMKKLERQEKLKSQYWFTCTCTPCLELWPTFDELDTKTI--RFRCSGYGTDEKREN 566

Query: 186 CSGFLL--RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
           C   L+   ++DD    C QC    +  +  K   + +IL K+ +     GN Q  +  Y
Sbjct: 567 CKNILIVPVNTDDFMIKCSQCNQHTNLFKGLKALQDTDILYKQAIKYAQTGNFQNALDIY 626

Query: 244 -KMIEKLQKKLYHPF 257
            ++++ L   L  PF
Sbjct: 627 LEILKTLHGILAPPF 641


>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
           gorilla gorilla]
          Length = 459

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 79/337 (23%), Positives = 141/337 (41%), Gaps = 85/337 (25%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ LK+QY F CTC  C K                + KDD   G  +
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDDLFLG--V 291

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
           +D+      C++C     K+E   + ++ NI   + L++ S                   
Sbjct: 292 KDNPKVVKLCREC---LQKQE--PVFADTNIYMLRMLSIVS------------------- 327

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
                             ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 328 ------------------EVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAG 369

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 370 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 406


>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 489

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 132/306 (43%), Gaps = 45/306 (14%)

Query: 62  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 119
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +   G  + V 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225

Query: 120 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
           +  H P+            G E+ ISY++    +  RQK LKEQ+ F C+C  C +    
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS- 284

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKK 226
           DD+  +A         DG         D   F      L R S E+I+    EV+  SK 
Sbjct: 285 DDLMMAAA--------DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKD 319

Query: 227 TLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 282
           +L         G++ EV+   +   + Q+ +     ++ ++      ++L  L  + EA 
Sbjct: 320 SLEKIEKSRLDGDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAA 379

Query: 283 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
              +  +  Y ++Y   + LLG+     G   W  G  E     + +A  IL +THG N 
Sbjct: 380 DCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNH 439

Query: 343 PFMKEL 348
              ++L
Sbjct: 440 SVTRDL 445


>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Anolis carolinensis]
          Length = 485

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 83/336 (24%), Positives = 148/336 (44%), Gaps = 52/336 (15%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
           V H  + ++K+L L     +L + +  WP    + ++  I+     + CN  T+ +   L
Sbjct: 131 VEHFGEEEKKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
           + +G G++P + ++NH C PN  ++F  G    VR++ H            +  G E+ +
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALGKISIGEELTV 245

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
           SY++       R++ LK+QY F CTC  C K G  DD+   A+ E     ++  S  +++
Sbjct: 246 SYVDFLNVCEDRRQQLKKQYYFDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVK 298

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
           D            +  SKE I+KI               S G ++EVV   +   + Q+ 
Sbjct: 299 DV-----------IQFSKEAIEKIDK-----------ARSEGLYKEVVKLCRACLEKQEP 336

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
           +    ++ +++    L ++L  L+   EA  Y +     Y ++Y   +  LG+     G 
Sbjct: 337 VLGDTNIYMLRILSILSEVLSYLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGV 396

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             W  G  E     + +A  IL +THG   P  K+L
Sbjct: 397 THWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 432


>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 770

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 66/247 (26%), Positives = 114/247 (46%), Gaps = 24/247 (9%)

Query: 69  NFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLVFE-G 113
           +  +L CNAH I + E +               + T +YP  S++NHSC PN    F  G
Sbjct: 455 HIQQLVCNAHAITSLESKTSQEDDVVMTTEQVRIATAIYPSASLMNHSCDPNIFSSFRCG 514

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
              VVRA++ + +G EVL  Y       S   RQ+ L+EQY F C+C  C      +D +
Sbjct: 515 STLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQLLQEQYFFVCSCTAC---SSGEDAE 571

Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
           +   L+  +C  + C G L +  D    TC  CG  +     ++ A  ++ L  + + L 
Sbjct: 572 QR--LQALKC--EYCEGPLKQPDDSGKATCLDCGTTQECLGSEQKAFRMHDLYVQGVQLA 627

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
             GNH E +       K ++K+ +  ++ L++ ++ + K L  L D++ A    +  +  
Sbjct: 628 EKGNHVEALQRLNKCLKTREKVMYKHNLKLLEVKDMVAKCLCSLGDFRSASGVLKPAVDA 687

Query: 292 YQRVYPQ 298
            + +Y +
Sbjct: 688 VRVIYGE 694


>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 418

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 82/307 (26%), Positives = 146/307 (47%), Gaps = 34/307 (11%)

Query: 71  SKLACNAHTICNS--ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPK 126
           +K A NA TI +S  E   +G GLY  ++ +NHSC PN + VF  +L  VR  A++ + +
Sbjct: 133 AKFATNALTIQDSLFETDGIGAGLYEEVNYMNHSCTPNVICVF-NKLPQVRVIAIRDIEQ 191

Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
           G E++ SYI+T      R++ LK+ Y F C C RCIK       +++  +   RC+   C
Sbjct: 192 GEEIMNSYIDTKKDLDFRRRFLKQNYFFLCECKRCIK-------EQNEGVSFVRCQK--C 242

Query: 187 SGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
               +   + K   C  C      +++E ++I +++  + K+    TS   +++ +   +
Sbjct: 243 MKGRI---NSKTLNCSDCNTQGQMTEQEYQRIITQMEEIQKQ----TSQSTYEKTIELTE 295

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----- 299
             EKL   L  P+  ++  +R   I   + ++ +K+A  Y +  I + +  Y +      
Sbjct: 296 TAEKLC-HLEAPYFNDIYNSR---ILATLGVQQYKKANKYLKNFIELSKHWYEEIIGNTL 351

Query: 300 -HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS-PFMKELILKLEEAQA 357
             P  G +     KL + L     A K    A++IL   +   S P + EL L+L + ++
Sbjct: 352 EMPFFGWKLNELSKLNFHLDKCLEAEKYSERALKILTQYYTKESCPELNELFLRLNDIKS 411

Query: 358 EASYKLS 364
           E   K +
Sbjct: 412 EIKMKYN 418


>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Taeniopygia guttata]
          Length = 491

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 85/361 (23%), Positives = 158/361 (43%), Gaps = 52/361 (14%)

Query: 27  VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           +E L  H+    +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ 
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
           +   L+ +G G++P +   NH C PN  +VF  G    VR++ H            +  G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPG 245

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++    +  R++ LK+QY F CTC  C K                + KDD   
Sbjct: 246 DELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDDLML 289

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 247
                +S     T ++  +  SK+ ++KI       +K  +     G++ EVV   +   
Sbjct: 290 AVKAGESKPSADTVKEV-IQLSKDTLEKI-------NKARME----GHYHEVVKLCRDCL 337

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ +    ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+  
Sbjct: 338 QKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAV 397

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
              G   W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++
Sbjct: 398 MRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQN 455

Query: 368 E 368
           E
Sbjct: 456 E 456


>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
 gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
          Length = 509

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 84/326 (25%), Positives = 137/326 (42%), Gaps = 42/326 (12%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
           +  I E F K++ N+  I ++EL P G  +Y  PV    NHSCL N V VF  +   +R 
Sbjct: 177 LKNIIEKFIKISKNSLQIIDNELEPAGLAIYKKPV-PFFNHSCLSNCVTVFRNQKLYIRT 235

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP--------------------- 159
           +  V  G E+ ISYI+ A    TR     +QY FTC+C                      
Sbjct: 236 LMDVYPGEELTISYIDIAFDRCTRLSICMDQYFFTCSCKLCKVNIASECHNIFNTEIVCT 295

Query: 160 ---RCIKLGQFDDIQESAILEGYRCKDDGCSGF----LLRDSD-DKGFTCQQCGLVRSKE 211
               C K   + +I   + LE  +C  +  S F    +LR S+ D  + C  C       
Sbjct: 296 NSENCKKFLGYMEIVLISELERKQCYINK-SSFKTYPVLRKSNIDNVWRCMLCKAEVGDN 354

Query: 212 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNLM 262
            IK +  +     K+T+ L +  N +       +++ L K           YH    +L 
Sbjct: 355 IIKGLVDKEKETFKETVYLETLFNEKYTYDNKSVLQSLNKIKSKIDYLTNYYHHAKYSLQ 414

Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
           + R K++ I ++L ++K A       +   +  Y ++ P+ G   +  GKL  FL     
Sbjct: 415 KMRAKILYISIQLHEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKNE 474

Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
            +  + +A + +  T+G +S   K+L
Sbjct: 475 GLGLIHKAKKNIIKTYGPDSLIYKDL 500


>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
           CCMP2712]
          Length = 98

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 58/96 (60%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           IA   SK   N H I + + +  G GLYP+ S INHSC PNA++ F+G   VVRA++++P
Sbjct: 3   IASMISKAQRNMHGIVDLKGQNFGHGLYPLASFINHSCEPNAIISFDGNKLVVRALENIP 62

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +G E+ I+Y+E       R+ AL  +  F C C RC
Sbjct: 63  RGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98


>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
           africana]
          Length = 800

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 130/318 (40%), Gaps = 75/318 (23%)

Query: 59  PEISINEIA--ENFSKLACNAHTIC--------------NSELRPLGTGLYPVISIINHS 102
           PE++I  +    +  +L CNA  I               + E+R L TG++PV+S++NHS
Sbjct: 481 PELNIWGVVMLRHMLQLQCNAQAITTIQHTGSKESIITESREIR-LATGIFPVVSLLNHS 539

Query: 103 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPR 160
           C PN  + F   +A +RA Q + KG E+L  Y   E+  +   RQ+ L+ QY F CTCP 
Sbjct: 540 CRPNTSVSFTSTVATIRASQQIAKGQEILHCYGPHESRMAVAERQQKLRSQYFFECTCPA 599

Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
           C K        E     G R                  F C  C  V   E++       
Sbjct: 600 CHK--------EPRATAGPRW---------------AAFCCHSCRAVMQGEDV------- 629

Query: 221 NILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
                      SCGN    E VS   ++ +LQ         +L Q      K+L   +LE
Sbjct: 630 ----------LSCGNRSCTESVSRSHLVSRLQ---------DLQQQVGMAQKLLRSGKLE 670

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
              + L  CQ     +    P  H ++G       +    LGD E +   + ++++++ +
Sbjct: 671 QAVQQLLGCQHAAESF---LPAEHTVVGEIDDHLAQAYAALGDWEKSATHLQKSLQVVEV 727

Query: 337 THGTNSPFMKELILKLEE 354
            HG +S  M   + KL +
Sbjct: 728 RHGPSSIEMGRELFKLAQ 745


>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
          Length = 462

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 142/318 (44%), Gaps = 55/318 (17%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K G  DD+                  FL 
Sbjct: 250 VSYIDFLNLSEERRKQLKKQYYFDCTCEHCQK-GLKDDL------------------FLG 290

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
              D K           S+E +K++        +K     S G + EVV   +   + Q+
Sbjct: 291 VKDDPKP----------SQEAVKEMTQFSKDTLEKIEKARSEGLYHEVVKLCRECLQKQE 340

Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
            ++   ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G
Sbjct: 341 PVFADTNLYVLRMLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAG 400

Query: 312 KLEWFLGDTENAIKSMTE 329
              W  G+ E A++  TE
Sbjct: 401 LTNWHAGNIE-AMRVQTE 417


>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 404

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 87/193 (45%), Gaps = 12/193 (6%)

Query: 25  SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNAHTI 80
           +L  AL  +   +   +L  Y Q+A +V   +  P  +     ++  +L     CNA  +
Sbjct: 192 ALTFALEPNTHALSPTRLAQYEQMAEVVAASISLPRGARKVTKDDVVRLLCIQQCNAFGL 251

Query: 81  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIE 136
            N E    G  LYP +S+ NHSC+PN   V +G    R+  ++ +  VP G E+ ISYI+
Sbjct: 252 SNGEGEMTGVALYPALSLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYID 311

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
              +   RQ  L+E Y F CTC RC      D    +A L    C    C G L+     
Sbjct: 312 LDLTRELRQDKLEESYAFRCTCARCRAPDADDSPAVAAHLRRITCGS--CGGSLIPAPPG 369

Query: 197 KGFT--CQQCGLV 207
           +  T  C  C LV
Sbjct: 370 QELTVACNSCPLV 382


>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
          Length = 648

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 116/260 (44%), Gaps = 30/260 (11%)

Query: 14  NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-------- 65
           N     ++D+Y  V  LV H    D +  L++  +A ++N IL+    +  E        
Sbjct: 351 NFTDKLSSDDYRKVFNLVTHSDKRDPEDYLVWTLMAAMLNTILRHGNYTNTEQPDDGFIG 410

Query: 66  --IAENFSKLACNAHTICN-SELRP--------LGTGLYPVISIINHSCLPNAVLVFEGR 114
             +  N   +  +AH +      RP        +G  LYP++++ NHSC P  V  F G 
Sbjct: 411 YLLLHNLQIVNYSAHDVAELQRKRPNEAGKSVAIGAALYPLLALFNHSCDPGIVRYFSGT 470

Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQ 171
              VR ++++  G  +  +Y  + T      R+++L   Y F C C  C +  Q +  + 
Sbjct: 471 TVHVRTIKNIAAGQIIAENYGPLYTKMERTERRQSLAANYKFQCNCDACRQDWQLYTHMD 530

Query: 172 ESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCG-LVRSKEEIKKIASEVNILSKKTL 228
           ES I   +RC    C G LL D  S+  G  C  CG LV  +E I ++  E N+LS+   
Sbjct: 531 ESVIR--FRCTGPACQGALLFDLTSECYGMQCDDCGELVDIRERI-RLLQEANMLSRFNE 587

Query: 229 A--LTSCGNHQEVVSTYKMI 246
           A  L + G  +  +S Y  I
Sbjct: 588 AAHLYAIGLFEHALSKYAAI 607


>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 478

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 127/301 (42%), Gaps = 46/301 (15%)

Query: 62  SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 119
           S+  I+  F  + CN  T+ +   L+ +G GL+P ++++NH C PN   +   G  + V 
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225

Query: 120 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
           +  H P+            G E+ ISY++    +  RQK LKEQ+ F C+C  C +    
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQ---- 281

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
                          DD     ++  +D +    Q+     SK+ ++KI        K  
Sbjct: 282 ------------HISDD----LMMAAADGQDSPIQEVSAF-SKDSLEKI-------EKSR 317

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
           L     G++ EV+   +   + Q+ +     ++ ++      ++L  L  + EA    + 
Sbjct: 318 LD----GDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARK 373

Query: 288 TIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
            +  Y ++Y   + LLG+     G   W  G  E     + +A  IL +THG N    ++
Sbjct: 374 MVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRD 433

Query: 348 L 348
           L
Sbjct: 434 L 434


>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ER-3]
          Length = 519

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 60/104 (57%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA + F+G +  ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 477

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 138/323 (42%), Gaps = 25/323 (7%)

Query: 30  LVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 87
           L+ H  DI  D+ +   + +I N +++  +   I  ++    F K + N+H+I  S    
Sbjct: 123 LLGHEEDIKADQAKYEHFEKIFNKMSIFRKDEMIDKDKFYIIFCKTSINSHSIHTSAGTE 182

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE---VLISYIETAGSTMTR 144
           +G  L   +S  +HSC PN  LVF+G  A +R +      ++     ISYI+   S   R
Sbjct: 183 VGMALDLGVSAYDHSCRPNCSLVFDGYQACLRPLTPQTNASDPRTSFISYIDIGRSRYQR 242

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQ 202
           +K LK ++ F C C RC             IL   RC D  C   L+   D++    TC 
Sbjct: 243 RKELKAKWYFDCACERCCDPA-------DNILTSIRCSDASCDEPLITAEDAEATAITCP 295

Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
           +CG V  +E ++     +  L  K    ++  + QE++     +   QK +Y    +  +
Sbjct: 296 KCGQVADEEYVRAAQQMMLRLPAKFSPESNPDSLQELLDEASNVLH-QKNIY----ITRL 350

Query: 263 QTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
           QT   ++ I   L+D    L + Q  +   Y+  +P     +G Q     K     G  +
Sbjct: 351 QT--AIMHITGTLKD---NLPFIQHQVYENYKMCFPYADRHIGYQLLQIVKAYIEKGQRK 405

Query: 322 NAIKSMTEAVEILRITHGTNSPF 344
            A+    EA+ I  I  G   P+
Sbjct: 406 EAVSYAFEAMNIFEICFGLQHPY 428


>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 519

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 60/104 (57%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA + F+G +  ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
          Length = 518

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 85/165 (51%), Gaps = 11/165 (6%)

Query: 6   LRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQW 58
           LR   Q DN +     D ++++   L  H S +DE+      ++ + A+    ++     
Sbjct: 123 LRIISQPDNKVSKEIYDTHHAVFRTLGHHFSKMDERGGEQAHRITVSAEALKAISNT--- 179

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
            E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  +
Sbjct: 180 -EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYL 238

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 239 KALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
          Length = 517

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 128/283 (45%), Gaps = 54/283 (19%)

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN   I ++++  +GTGLY   +++NHSC PN +++F G+   +   + +    E+ I Y
Sbjct: 243 CNGFGIADNQMLRIGTGLYYPSNLLNHSCDPNCMVLFRGQTQFIVTCRPIEADEEITICY 302

Query: 135 IETAGS-TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           I+   S  + RQ+ L+EQY F C C RC+K           I EG   K+          
Sbjct: 303 IDNGISERIIRQQYLQEQYHFNCMCARCLK----------QIGEGTELKE---------- 342

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
                           K +I + + E  IL+K+ L   S GN  + +S    + +L K +
Sbjct: 343 ---------------QKVQI-QFSEEQQILNKQALHEESLGNFDKSLS---QLSQLYKSV 383

Query: 254 YHPFSVNLMQTR------EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-------- 299
               S  + Q        E+L+ + +  +D+K A+ + Q  + + + +Y +         
Sbjct: 384 ETNKSTQIFQNSMIKELLERLVYLSVMNKDYKFAVKFQQKLVDLCEIMYIKEVKREDELP 443

Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
           HPL+ L YY  GKL       + AI ++ +A+ ++   +G  S
Sbjct: 444 HPLIALHYYQLGKLLSQRKKYQEAIDNLQKAMTLVSAYYGLKS 486


>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
          Length = 511

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 135/327 (41%), Gaps = 42/327 (12%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
           +  + E F K++ N   I ++EL P G G Y  PV    NHSCL N + +F+ +   +R 
Sbjct: 177 LKNVIEKFIKVSKNTLQIIDNELEPCGLGFYKKPV-PYFNHSCLSNCITIFKNQKLYIRT 235

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI------------------ 162
           +  +  G E+ ISY++ A    TR     +QY FTCTC  C                   
Sbjct: 236 LMDIYPGEELTISYLDIAFDRNTRLAICTDQYFFTCTCKLCKVNIPSECHNMFNNDFICT 295

Query: 163 ---KLGQFDDIQESAILE------GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK--- 210
                 +F +  E  ++        Y  K    +  +L+ S DK     +C L + +   
Sbjct: 296 RSENCKKFVNYMEMVLISELERKLNYHNKLHFKAFPILKKSTDKNENIWKCMLCKHETND 355

Query: 211 -------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNL 261
                  E+ K+   EV  L        S  N   + S  K+  K+      YH    +L
Sbjct: 356 SVIKGVIEKEKETVKEVEYLETLFAEKYSYDNKIILQSLNKIKSKIDYLTTFYHHSRYSL 415

Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
            + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL    
Sbjct: 416 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 475

Query: 322 NAIKSMTEAVEILRITHGTNSPFMKEL 348
             +  + +A + +  T+G  SP  K+L
Sbjct: 476 AGLNLIHKAKKNIIKTYGPESPIYKDL 502


>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
          Length = 467

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 87/338 (25%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R++ L++QY F C+C  C K G  DD+                  F  
Sbjct: 250 VSYIDFLHLSEERRRQLRKQYYFDCSCEHCQK-GLKDDL------------------FQA 290

Query: 192 RDSDDKGFT-CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
              D K    C++C      E+ + + ++ N+   + L++ S                  
Sbjct: 291 VKEDPKVVKLCREC-----LEKQESVFADTNLYVLRLLSIVS------------------ 327

Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
                              ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     
Sbjct: 328 -------------------EVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRA 368

Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 369 GLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 406


>gi|389584498|dbj|GAB67230.1| hypothetical protein PCYB_112510 [Plasmodium cynomolgi strain B]
          Length = 501

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 141/332 (42%), Gaps = 52/332 (15%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
           +  I E F K++ N+  I ++EL P G   Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 167 LKNIIEKFVKISKNSLQIIDNELEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 225

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +  V  G E+ ISYI+ A    TR     +QY FTC+C  C K+    + Q     E   
Sbjct: 226 LVDVYPGEELTISYIDIAFDKNTRLAICMDQYFFTCSCKLC-KVNVVSECQNIFNTEFVC 284

Query: 181 CKDDGC------------------------SGF-----LLRDSDDKG--FTCQQCG---- 205
              + C                        S F     +L+ S+++   + C  C     
Sbjct: 285 THSESCKKCVNNMELILISELERKNCYMNKSQFKNSYPILKKSNEQQHMWKCALCKNEVH 344

Query: 206 -------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHP 256
                  + + KE +K+ A   ++ S+K     S  N   + S  K+  K+      YH 
Sbjct: 345 DNVIKSLMEKEKETVKETAHLESLFSEKY----SYDNKSVLQSLNKIKSKIDYLTTFYHH 400

Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
              +L + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  F
Sbjct: 401 TRYSLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALF 460

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           L      +  + +A + +  T+G +SP  K+L
Sbjct: 461 LDLKSEGLSLIHKAKKNIIKTYGPDSPIYKDL 492


>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
           AFUA_2G10080) [Aspergillus nidulans FGSC A4]
          Length = 497

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 49/164 (29%), Positives = 88/164 (53%), Gaps = 8/164 (4%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
           +L++ LR + + +    + T +   L + L  H+ DI  +      +IA     + ++ +
Sbjct: 125 LLRMVLRTEARKN----AYTEEELVLFQTLETHIDDILNRNAPQAERIALTSRAVKEYSK 180

Query: 61  ISINE--IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
             + E  I    ++L  N+  + N +   +G  L+P  ++INHSC  NAV+ F+G    V
Sbjct: 181 TDMEEEKIVAYHARLDLNSFNLTNDD--DIGIYLHPYAALINHSCDYNAVVGFDGSEIFV 238

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           +A++ +  G ++ ISYI+T   T  RQK L+E+Y FTC C +C+
Sbjct: 239 KAIRPIATGEQIFISYIDTTYPTRIRQKELQERYFFTCNCAKCL 282


>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
          Length = 518

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 85/165 (51%), Gaps = 11/165 (6%)

Query: 6   LRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQW 58
           LR   Q DN +     D ++++   L  H S +DE+      ++ + A+    ++     
Sbjct: 123 LRIISQPDNKVLKEIYDTHHAVFRTLGHHFSRMDERGGEQAHRITVSAEALKAISNT--- 179

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
            E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  +
Sbjct: 180 -EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYL 238

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 239 KALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
          Length = 479

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 82/158 (51%), Gaps = 10/158 (6%)

Query: 12  NDNVIPSTTTDNYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQWPEISINE 65
           ++ V+      ++++   L  H S +DE+      ++ + A+    ++      E+ ++ 
Sbjct: 91  DNKVLKEIYDTHHAVFRTLGHHFSRMDERGGEQAHRITVSAEALKAISNT----EVELST 146

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+G++  ++A+Q + 
Sbjct: 147 LVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLKALQDIA 206

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
              E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 207 PDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 244


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 45/141 (31%), Positives = 72/141 (51%), Gaps = 9/141 (6%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
           ++ L  H+SD+ E  L  L   I N ++    WP  S    +++I+  F  + CN  T+ 
Sbjct: 131 LDDLENHISDMPEDDLKELKVDIHNFLDY---WPRNSKQHRVDDISHIFGVINCNGFTVS 187

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
           +   L  +G GL+P + ++NH C PN  ++       +RA+  +  G EV +SY++    
Sbjct: 188 DQRGLNAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIEPGEEVTVSYVDFLNV 247

Query: 141 TMTRQKALKEQYLFTCTCPRC 161
           T  RQ+ LK  Y F CTC  C
Sbjct: 248 TEDRQRQLKMHYFFDCTCEHC 268


>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
 gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
          Length = 462

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLK 301

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCARC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 82/143 (57%), Gaps = 5/143 (3%)

Query: 24  YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTI 80
           Y   + L++H +DI  D K++  +  +  ++   ++   + +  EI   + ++  N++ I
Sbjct: 106 YRKFKDLMSHYTDIKNDPKRIEHFVSLCQVLEDFMEGTTLPNSAEILGLYGRICVNSYNI 165

Query: 81  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETA 138
            + ++  +G G+Y   S+I+HSC PNAV VFEG   ++RA++ +P+   +++ ISYI+  
Sbjct: 166 LDPDMNSIGVGIYLGPSVIDHSCKPNAVAVFEGTTILIRALEDIPRLDWSQIHISYIDVL 225

Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
            +T TR   L+  Y F C C RC
Sbjct: 226 NTTSTRCTELQNTYYFLCECERC 248



 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
           Y + HPL G+ Y   GK++  L   ++A+  +T+A +ILRITHG      KE
Sbjct: 325 YGEIHPLTGILYLMLGKIQLHLDKPKSALDMLTKADKILRITHGEKHSLFKE 376


>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
 gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
          Length = 766

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   F+ GR+AVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 546 GLFPLTAIMNHECTPNASHYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 605

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C CPRC      D+ +    L    C++ GC G ++
Sbjct: 606 MTKNFMCDCPRC-----NDNTENGTYLSALFCREQGCKGLVI 642



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281

Query: 150 EQYLFTCTCPRC 161
               F C CPRC
Sbjct: 282 MTKHFICQCPRC 293


>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
          Length = 370

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 96/174 (55%), Gaps = 15/174 (8%)

Query: 20  TTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAEN---FSKLA 74
           T  ++ + + L++H SD+  D+K++  ++ ++ ++   L+  +IS+   A+    + ++ 
Sbjct: 110 TPTSFRVWKDLMSHYSDLKSDKKRMDHFSTLSMVLFEYLK--DISLPNTADLMGLYGRMV 167

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLI 132
            N+ TI + E+  +GTG+Y   S+I+HSC PNAV VF+G+   +RA++ +      ++ I
Sbjct: 168 INSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDWKKIRI 227

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
           SYI+   +   RQ  L++ Y F C C RC+      D      +   +C  DGC
Sbjct: 228 SYIDLMKTPYERQMELRQSYYFLCQCDRCL------DENRIKYVHAAKCLKDGC 275


>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
 gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
          Length = 498

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 59/102 (57%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L +V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLIIVK 238

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280


>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
          Length = 464

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 56/107 (52%), Gaps = 9/107 (8%)

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV----- 118
           N +A     L  NAHTI +SEL  LG G +P  S +NH C PN V +F G    V     
Sbjct: 168 NFVASLIRMLEMNAHTIYDSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSVSGKPI 227

Query: 119 ----RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
               R ++ +  G E++ISY++   S M R + LKE Y F C C RC
Sbjct: 228 SIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRC 274


>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
           SS1]
          Length = 531

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 26/273 (9%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF------EGRL 115
           SI ++    S+   N+ ++   +L  +G  + P+ ++I+HSC+PNAV+VF       G L
Sbjct: 182 SIRDLTNLLSRFQTNSFSLTTPDLTNVGVAISPLAALISHSCMPNAVVVFPTGLGRRGGL 241

Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
            V+ A++ +  G EVL SY++ A     R K LK++YLF C C  C K    + I     
Sbjct: 242 EVI-ALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLFDCECVLCEKHHDHEWIDPR-- 298

Query: 176 LEGYRCKDDGCSGFLLRDSD--DKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTL 228
            E  RC   GC G +   +       TC  CG        S +E  K+A +V   ++   
Sbjct: 299 -EALRCSKKGCKGKMGTPTSLGTTEITCSICGTKLSVSTASLQEHVKLAEDVLHKAEAIQ 357

Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED--------WKE 280
              S      +VS  +  +      Y P    L   ++ LI  + +L           ++
Sbjct: 358 FSDSDKARYLLVSILRDFDPFALSAY-PLLQILQLLQQLLITAVFDLPSTGTERDGLLQD 416

Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
           A +   L +   + VYP  HP  G+QY T  KL
Sbjct: 417 AASCSALVLSGMEDVYPYGHPSKGIQYVTSAKL 449


>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
 gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
          Length = 390

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 17/149 (11%)

Query: 67  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--- 119
           A+   +L C    NA T+ ++ L PLG  + P  ++INHSC PNAV+VF  R+  +    
Sbjct: 54  ADALMELVCQHETNAFTLADAHLNPLGVCIEPTFALINHSCDPNAVIVFPDRVRGMPSKM 113

Query: 120 ---AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
              A++ +  G EV ISY++ A     RQ  L E+Y F C C  C ++G  D        
Sbjct: 114 HLVAIRAISAGEEVRISYVDVASCQAERQATLLERYCFACECRLCKRMGWRDPRM----- 168

Query: 177 EGYRCKDDGCSGFLLRDSDDKGFT-CQQC 204
               C    C+G+++  +    +T C QC
Sbjct: 169 -ALWCPQPKCTGWVVFQNAASSWTACTQC 196


>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
 gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
          Length = 498

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 58/102 (56%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L  V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280


>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 556

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 83/187 (44%), Gaps = 26/187 (13%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLI-------------LQWPEIS-------INEI 66
           V+ LV H  D   +Q   Y  +AN V  +             L WP          ++  
Sbjct: 127 VDELVHHNDDRSPEQKEEYMLMANFVARLCLAGCGDSKKGSALLWPSAQGRGLPGLVDAA 186

Query: 67  AENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
                KL+CN  +I  S L   +G GLY   +  NHSC PNA   F G+   +R  + + 
Sbjct: 187 YAVLGKLSCNVFSIAESALNGEVGCGLYLEAAAANHSCNPNAAQSFSGKTLSLRCTRPIR 246

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDD 184
           KG E+ I   +       R+++L++ Y F C C RC    G+ +D++    LE + C D 
Sbjct: 247 KGEEITIGITQIQKPGPARRESLRKTYFFECRCERCESPEGRAEDMR----LEAFACPDS 302

Query: 185 GCSGFLL 191
            CSGF L
Sbjct: 303 ECSGFCL 309


>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
 gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
          Length = 498

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 58/102 (56%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+G L  V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280


>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
 gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
          Length = 460

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 240 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 299

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 300 MTKYFVCNCDRC-----NDNSENGTYLSALFCREQGCKGLVI 336


>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
          Length = 527

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 55/93 (59%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F KL+ N+ T+ ++  +  G  L+P  +  NHSC PNA   F+     +RA + + KG +
Sbjct: 195 FGKLSANSFTLTSAFGKRRGVYLHPAAARFNHSCNPNASYSFDKGKCYIRATKPIAKGEQ 254

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           + I YI+T  S  TR+  LKE+Y F C CPRC+
Sbjct: 255 IFIPYIDTTYSVGTRRHELKERYKFDCQCPRCL 287


>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
           intestinalis]
          Length = 430

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 2/105 (1%)

Query: 59  PEISINE-IAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLA 116
           P+++ N+ + E  +++  N+  ICN EL+  +GTG+Y   S INHSC PN V  F  R  
Sbjct: 162 PQVTDNKMLLELCARIKNNSFAICNEELQSDVGTGVYLNCSFINHSCEPNCVAEFNMRTL 221

Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            +RAV+++  G EVLISY++   ++  RQ+ L   Y F CTC  C
Sbjct: 222 KIRAVKNITAGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSC 266


>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
 gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
          Length = 600

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 380 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 439

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 440 MTKYFVCNCDRCN-----DNSENGTYLSALFCREQGCKGLVI 476



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 210 VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKM 269

Query: 147 ALKEQYLFTCTCPRCIKLGQFDDI 170
            L     F C CPRC    Q D +
Sbjct: 270 FLGMTKHFMCQCPRCQDPTQTDAV 293


>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
           caballus]
          Length = 802

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 17/120 (14%)

Query: 59  PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 103
           PE SI  +A   +  +L CNA  I              +S    L TG++PV+S++NHSC
Sbjct: 482 PESSIWGVAMLRHMLQLQCNAQAITAIQQTGSKENIITDSRQVRLATGIFPVVSLLNHSC 541

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A VRA QH+ KG E+L  Y   E+      RQ+ L+ QY F CTCP C
Sbjct: 542 SPNTSMSFVSTVATVRASQHIGKGQEILHCYGPHESRMGVAERQQKLRSQYFFDCTCPAC 601


>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
 gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
          Length = 611

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 391 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARAIPKGGEITTTYTKILWSNLTRGIFLK 450

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 451 MTKYFVCNCDRCN-----DNSENGTYLAALFCREQGCKGLVI 487



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 222 VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKM 281

Query: 147 ALKEQYLFTCTCPRC 161
            L     F C CPRC
Sbjct: 282 FLGMTKHFMCRCPRC 296


>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
 gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
          Length = 302

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 85  GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 144

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 145 MTKHFACDCVRCN-----DNTENGTYLSALFCREQGCRGLVI 181


>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
 gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
 gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
          Length = 495

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 275 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 334

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 335 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 371


>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
          Length = 991

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 771 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 830

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 831 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 867



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287

Query: 150 EQYLFTCTCPRC 161
               F C C RC
Sbjct: 288 MTKHFICKCVRC 299


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 2/98 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           NA TI N  L+ +  GLY  +S +NHSC PN VL+F G    +R ++ V  GAE+ ISY+
Sbjct: 188 NAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDGAELFISYV 247

Query: 136 ETAGSTMTRQ-KALKEQYLFTCTCPRCIKLGQF-DDIQ 171
           +   S   ++ + L++QY F C+C RC +  ++ DD+ 
Sbjct: 248 DVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVD 285


>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 315

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 56/93 (60%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F    CN+ +IC+++L  +  G+Y   S++NHSC PN V+VF+ R   +R V+ V +G  
Sbjct: 180 FGATICNSFSICDNDLNGIAVGIYLRASMLNHSCYPNCVVVFDERKLQLRTVRDVKEGDV 239

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             ISY++       RQ  L+E+Y F+C C +CI
Sbjct: 240 CTISYVDVINPAKERQTELEERYHFSCNCVKCI 272


>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 401

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 8/148 (5%)

Query: 59  PEISINEIA-ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           P  ++ ++A +   +++ N  T C+    P G GL+P  ++INHSC PN    + G    
Sbjct: 20  PSAALLDLARKELGRVSLNGITTCDG-TGPTGLGLFPSGAMINHSCSPNCQAWWRGSQLE 78

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
           +R  + V  G E+ +SYI     +  R+  L+  + F C C RC+   Q+D     A L 
Sbjct: 79  IRCTKPVATGEELCLSYIPIDQPSTVRRAQLRHSWFFACRCRRCVSR-QWD-----AELV 132

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           G RC   GC+G +       G  C  CG
Sbjct: 133 GLRCPTKGCAGAVPPPPWRGGHACSHCG 160


>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
 gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
          Length = 400

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 105/243 (43%), Gaps = 37/243 (15%)

Query: 18  STTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLI--------LQWPEIS----- 62
            T  D + LVE  +   S+   K+L+  +   IANL NL+        L  PE+      
Sbjct: 168 DTLDDIFDLVENTIDDGSNKAAKELIDIITDYIANLFNLVITSSDNHDLPTPELKKEAVE 227

Query: 63  --INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
             I+ I E   K+ CN   I     + +   + P  S  NHSC+PN + + +G     +A
Sbjct: 228 KVISVIRELIHKVRCNQFGIWTKNDKCIAVAISPSSSFFNHSCIPNCINIRDGNKMTFKA 287

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           +  V KG  + ISY++      +R++ LK  Y F C CPRC +    D+  ++ I + Y 
Sbjct: 288 LYPVKKGEPLAISYLDLDLPVESRKEYLKYGYYFDCGCPRCDEKTNQDECMDNWISKFY- 346

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
           C+   C                  GL  SK ++     +VN   K TL  + C N   + 
Sbjct: 347 CQRKKCV-----------------GLYYSKTKVN--IDQVNKNDKITLNCSDCNNEFIIN 387

Query: 241 STY 243
           STY
Sbjct: 388 STY 390


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 82/151 (54%), Gaps = 10/151 (6%)

Query: 27  VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
           VE L  H++D+ E ++  L   I N ++    WP  S    I++I+  F  + CN  T+ 
Sbjct: 110 VEELEDHVADMQEDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVS 166

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
           +   L+ +G GL+P + ++NH+C PN  ++       +R++  + +G E+ ++Y++    
Sbjct: 167 DQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNL 226

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
           +  R++ LK QY F C C  C K G  DD++
Sbjct: 227 SEERRRLLKTQYFFDCQCDYC-KNGTKDDLK 256


>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 465

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/317 (23%), Positives = 134/317 (42%), Gaps = 28/317 (8%)

Query: 10  LQNDNVIPSTTTDNYSLVEALVAHMSDI-DEKQLL-LYAQIANLVNLILQWPEISINEIA 67
           L  D ++P             + H  D+ +E  L+ L AQ   LV          + ++ 
Sbjct: 122 LAKDGILPDKANATSVFPMDSMWHSVDLTNETHLISLLAQKLGLVR-----ESWHVRDLQ 176

Query: 68  ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVP 125
           E  ++  CN   I +     +G G YP+ +++NHSC PN V +F    A +   A++ + 
Sbjct: 177 EMLARFQCNNFCILDENFFEVGAGCYPLGAMVNHSCDPNCVTIFARGSAQLELWAMKSIG 236

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
           K  EV ISY++ A     R+K L+++Y F C C RC     +     +A ++ +   D+ 
Sbjct: 237 KDEEVTISYVDPANCMNKRRKYLQKRYHFDCRCQRC-----YCKASHAATIDTFLVADEN 291

Query: 186 ---CSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS 241
                 + L ++  +G    Q  G +      +K+  +  IL+            + V +
Sbjct: 292 GIPSEKWSLNEATKRGTVLSQITGKIEEFRVCEKLTRKCAILADIDSDFV-----RNVFT 346

Query: 242 TYKMIE----KLQKKLYHPFSVNLMQTREKLIKILMELEDWKE-ALAYCQLTIPVYQRVY 296
              M      +LQ +L+   ++ + +++    +   ELE  ++ AL Y       Y  VY
Sbjct: 347 HLHMYNIDAFRLQARLFSDLTIMIAESQSGNSREFAELEFVQQKALVYGYRISKFYAFVY 406

Query: 297 PQFHPLLGLQYYTCGKL 313
           P  HPL  L     G+L
Sbjct: 407 PALHPLSNLHNIRMGEL 423


>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
          Length = 799

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 120/312 (38%), Gaps = 54/312 (17%)

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 139
           N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G  + +SY    G  
Sbjct: 478 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 537

Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-------- 191
               RQK LK++Y F C C  C +L   D      +L  +RC +  C G +L        
Sbjct: 538 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVLDSCVIKYE 592

Query: 192 ---------------------------RDSDDKGFT----------CQQCGLVRSKEEIK 214
                                      R+   + F           C  CG  R  E   
Sbjct: 593 NKKFERFQGVPQDCISEPHLQLKNDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASH 652

Query: 215 KIASEVNI-LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 273
               E  I +S+   A+ S        S       L K   H ++  + +  + + +   
Sbjct: 653 ATVGEAGIYISRLQEAIVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFC 712

Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
            + + + A+ +C+ +I + +++Y   H ++G +      ++  LGDT  A+KS++    I
Sbjct: 713 MIGELQPAMHHCKASIEILEKLYGSNHIVIGYELMKLSSIQLSLGDTA-AMKSISRLAAI 771

Query: 334 LRITHGTNSPFM 345
               +G ++  M
Sbjct: 772 FSWYYGPHADMM 783


>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
           glaber]
          Length = 793

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 65/274 (23%), Positives = 120/274 (43%), Gaps = 29/274 (10%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y   ++  S   RQ
Sbjct: 520 LATGIFPVVSLLNHSCSPNTSISFSGTVATIRAAQQIGKGQEILHCYGPHKSWMSIAERQ 579

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           + L+ QY F C CP C     ++ ++ +A    E +RC +  C   +      +G     
Sbjct: 580 QKLRSQYFFDCNCPAC----HYEKLRTAAGPRWEAFRCNN--CRALM------QGDDVLS 627

Query: 204 CGLVRSKEEIKK---------IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
           CG     E I +         +  +V I SK    L   G   + +       +  +   
Sbjct: 628 CGNRACLEPISRTHLVSQLQDLQQQVGIASK----LLRNGKLDQAIPRLLRCRQDAESFL 683

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
                   +  + L +    L DW+++  + Q ++ V +  +      +G + +   ++ 
Sbjct: 684 AAEHAVAGEIEDNLARAYAALGDWQKSATHLQKSLQVVEARHGPSSVEMGHELFKLAQVL 743

Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           +       A+ ++ +A E+L +  G  S  ++EL
Sbjct: 744 FNGFAVPEALNAIQKAEEVLLVHCGPQSDEVQEL 777


>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
          Length = 576

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 127/284 (44%), Gaps = 29/284 (10%)

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 135
           + + NS    L TG++PV+S++NHSC PN  + F   +A VRA + + KG E+L  Y   
Sbjct: 296 NNVTNSRQVRLATGIFPVVSLLNHSCSPNTSMSFVSTVATVRASEKIAKGQEILHCYGPH 355

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
            +  +   RQ+ L+ QY F C+CP C K      +      E + C   G  G LL+  D
Sbjct: 356 HSRMAVAERQQKLRAQYFFDCSCPVCQKEKHRATV--GPRWEAFCC---GRCGALLQGGD 410

Query: 196 DKGFTCQQCGLVRSKEEIKK--IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--- 250
                   CG     E + +  + S V  L ++       G  Q+++   ++ + +Q   
Sbjct: 411 -----VLSCGGTSCMESVSRDHLISRVQDLQRQV------GGAQKLLGNGQLEQAIQLLL 459

Query: 251 ------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
                 +    P    + +  + L +    L DW+++ A+ Q ++ V +  +      +G
Sbjct: 460 GCRHDAENFLSPEHSVVGEMEDALAQAYAALGDWQKSAAHLQKSLQVVEAQHGPSSVEMG 519

Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            + +   +L +       A+ ++  A ++L + +G  S  ++EL
Sbjct: 520 HELFKLAQLFFNGFAIPEALNTIERAEKVLLVHYGPCSDEIQEL 563


>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 356

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 4/107 (3%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F    CN+ +IC+++L  +  G+Y   S++NHSC PN   V +GR   +  V+ V +G E
Sbjct: 171 FGATICNSISICDNDLIDIAVGIYLRASMLNHSCDPNCAWVCDGRKLQIMTVKDVKEGDE 230

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQE 172
             ISY++       RQ  LKE Y FTC C +CI+    LG  D + E
Sbjct: 231 CTISYVDAMKPAKVRQADLKESYHFTCKCVKCIEEINALGPDDGLGE 277



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%)

Query: 98  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 157
           + NHSC  N ++VF+ R   +R ++ V  G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 158 CPRCIK 163
           C +C++
Sbjct: 61  CVKCVE 66


>gi|412986118|emb|CCO17318.1| SET and MYND domain-containing protein 3 [Bathycoccus prasinos]
          Length = 734

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 55/93 (59%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           S+ ++A+   KL  N HT+C+ ELRP G G+YPV +++NHS  PN    F G+  +VR +
Sbjct: 280 SVADLAKLIGKLRLNCHTLCDDELRPYGIGVYPVAAMMNHSENPNCFATFRGKKMIVRCL 339

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
           + V  G E+ ISY E       R K+LK  Y F
Sbjct: 340 RDVLPGEELTISYDELMKPKRERAKSLKSNYGF 372


>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
          Length = 287

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 54/86 (62%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           N+  I ++    +G G  P++ + NHSC PNA L F+GR AV+ A++H+ +G E+ ISYI
Sbjct: 188 NSFRITDASGDEIGIGFDPLLGMANHSCAPNASLKFDGRCAVLTALKHIEEGEEITISYI 247

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRC 161
           +T      RQ  L++ Y FTCTC  C
Sbjct: 248 DTTLPRAARQAFLQKHYYFTCTCAAC 273


>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
 gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
          Length = 495

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 70/261 (26%), Positives = 115/261 (44%), Gaps = 42/261 (16%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 118
           ++ I  IAE  +K+  N  T       PLG  + P    +NHSC PNAV+ F EGR+  V
Sbjct: 176 DLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPNAVVGFDEGRI-TV 234

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
           +A++ +    +V ISYI+       RQK L E+Y FTC C +C+   Q D  +E   +  
Sbjct: 235 KALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKCL---QGDKAREDQFIPA 291

Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
               +D                      + + +E +K A E+         L S  N + 
Sbjct: 292 SPPSND----------------------IETLKEAEKQAQEL---------LASAKNARP 320

Query: 239 VVSTYKMIEKLQ----KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQ 293
           V +  ++   ++     K++        Q R +LI  L+++E +  AL +  +  + V  
Sbjct: 321 VFAVRQLKAAIKILHDTKMWPITRQPYPQIRSELIVALLQIESFWPALQHSAVRYLFVDP 380

Query: 294 RVYPQ-FHPLLGLQYYTCGKL 313
            +YPQ +HP+     Y   +L
Sbjct: 381 VLYPQEWHPIRNNHAYILSQL 401


>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
           familiaris]
          Length = 794

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 54  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           L LQ    +I  I E  SK     + +CNS    L TG++PV+S++NHSC PN  + F  
Sbjct: 488 LQLQCNAQAITTIQETGSK----ENNVCNSRQIRLATGIFPVVSLLNHSCSPNTSVSFSS 543

Query: 114 RLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIK 163
            +A +RA Q + KG E+L  Y    +      RQ+ L+ QY F C CP C K
Sbjct: 544 TVATIRASQQIGKGQEILHCYGPHYSRMGVADRQQKLRSQYFFDCGCPACQK 595


>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 518

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 137/328 (41%), Gaps = 44/328 (13%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
           +  I E F K++ N+  I +++L P G   Y  PV    NHSCL N V +F+ +   +R 
Sbjct: 184 LKNIIEKFVKISKNSLQIIDNDLEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP--------------------- 159
           +  V  G E+ ISY++ A    TR     +QY FTC+C                      
Sbjct: 243 LVDVYPGEELTISYVDIAFDRNTRLAICMDQYFFTCSCKLCKVNMVSECQHIFNTELLCT 302

Query: 160 ------RCIKLGQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKG--FTCQQC------ 204
                 +C+   +   I E      Y  K+   S + + + ++++   + C  C      
Sbjct: 303 QSESCKKCVNHMELLLISELERKSCYMNKNQFKSAYPVFKKTNEQQHVWKCSICKGEVHD 362

Query: 205 ----GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
                LV  ++E  K AS +  L  +     +    Q +      I+ L    YH    +
Sbjct: 363 NVIKSLVEKEKETVKEASHLESLFSEKYTYDNKSVLQALNKIKSKIDYLTT-FYHHTKYS 421

Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
           L + R K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL   
Sbjct: 422 LQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVTYGKYSPIYGYYIFLTGKLALFLDLK 481

Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
              +  + +A + +  T+G +SP  K+L
Sbjct: 482 SEGLSLIHKAKKNIIKTYGPDSPIYKDL 509


>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 509

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 67/266 (25%), Positives = 121/266 (45%), Gaps = 41/266 (15%)

Query: 73  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 130
           + CN   I N  L   G+G+YPV +++NH+C+ N VL ++   +   +RA++ +  G E+
Sbjct: 254 MQCNNFAIHNDILFARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEI 313

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGF 189
             ++ + A  T+ R+  LK  Y F C C RC      D  +E  A L   R  D   + F
Sbjct: 314 THAFTDAASPTVVRKAHLKSLYAFDCNCSRC---NDSDAAKELDAELVATRPIDTIPAYF 370

Query: 190 LLRDSDDKGFTCQQCG-----LVRSKEEIKKIAS--EVNILSKKTLALTSCGNHQEVVST 242
                  K F  ++       L R+  ++ K++S  +V  L + + +    G    + + 
Sbjct: 371 -------KRFRLERLAGLPSLLERATRDVPKVSSPADVEALLQASRSWLQEGMDPRLPTA 423

Query: 243 YKMI-----EKLQKKLYHPFSVNLMQTREKLIKILM-----ELEDWKEALAY-------- 284
             ++       +++ L   +++ L ++  K+  + +      L + +  LA         
Sbjct: 424 RAIVLLVTAWAVRQALLGDYNLELFESNVKIFGMALLWREEALSERQRDLALVSGLRNVL 483

Query: 285 ---CQLTIPVYQRVYPQFHPLLGLQY 307
               +  I VYQR+YP  HPLLGLQ+
Sbjct: 484 VDSARHVIRVYQRIYPANHPLLGLQW 509


>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
          Length = 801

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 19/269 (7%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
           L TG++PVIS++NHSC PN  + F G +A VRA QH+ +G E+L  Y           RQ
Sbjct: 525 LATGVFPVISLLNHSCSPNTSVSFVGTVATVRASQHIRRGQEILHCYGPHVSRLGIAARQ 584

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQ 203
           + LK QY F C+C  C         +  A     R +   CS    LL+  D    +   
Sbjct: 585 QKLKSQYFFNCSCLAC-------QTEAHATAAAPRWEAFCCSSCRALLQGGDVLSCSSGS 637

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSV 259
           C    S++    +A+ +  L ++T          E+    + +   Q+     L+   SV
Sbjct: 638 CTEAVSRDH---LAARLRDLQRQTGLARELLRSGELEKAIQQLSGCQRDAESFLWAEHSV 694

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            L +  + L +    L DWK++ A+ Q ++ V +  +      +G + +   ++ +   +
Sbjct: 695 -LGEIEDDLAQAYAALGDWKKSAAHLQKSLTVVEARHGPSSVEMGHELFKLAQILFNGFE 753

Query: 320 TENAIKSMTEAVEILRITHGTNSPFMKEL 348
              A+ ++ +A  +L +  G  S  ++EL
Sbjct: 754 VPEALSTIQKAETVLAVHCGPCSDEIQEL 782


>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Nomascus leucogenys]
          Length = 703

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 132/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 382 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 441

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 442 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 501

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 502 -------QTEAHRMAAGPRW---------------EAFCCNGCGAPMQGDDVLRCGSR-- 537

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 538 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAI 576

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 577 QRLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQRSLRVVEVRHG 633

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 634 PSSVEMGHELFKLAQ 648


>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
          Length = 273

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 113/253 (44%), Gaps = 37/253 (14%)

Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
           NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC 
Sbjct: 1   NHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60

Query: 160 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
            C K                + KDD    FL              G+  + +  +++  E
Sbjct: 61  HCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVVKE 87

Query: 220 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
           +   SK TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L
Sbjct: 88  MIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYL 147

Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL 
Sbjct: 148 QAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILL 207

Query: 336 ITHGTNSPFMKEL 348
           +THG + P  K+L
Sbjct: 208 VTHGPSHPITKDL 220


>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
           mulatta]
          Length = 399

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 12/139 (8%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    +  +  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L+ +G G++P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEER 249

Query: 145 QKALKEQYLFTCTCPRCIK 163
           ++ LK+QY F CTC  C K
Sbjct: 250 KRQLKKQYYFDCTCEHCQK 268


>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
           porcellus]
          Length = 800

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 15/105 (14%)

Query: 72  KLACNA-------HT------ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           +L CNA       HT      I +S    L TG++PV+S++NHSC PN  + F G +A +
Sbjct: 491 QLQCNAQAITAIQHTGSKESIITDSRQVRLATGIFPVVSLLNHSCSPNTSVSFTGTIATI 550

Query: 119 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
           RA Q + KG E+L  Y   E+  S   RQ+ L+ QY F C CP C
Sbjct: 551 RAAQQIRKGQEILHCYGPHESHMSVAERQQKLRSQYFFDCCCPAC 595


>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 463

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 11/150 (7%)

Query: 24  YSLVEALVAHMSDIDEKQ-----LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
           ++ +  L +H S+I   +      +L   I N  N      E++   + + F K+  N+ 
Sbjct: 136 WNTIMDLESHRSEIMSSEKGDVIWMLSKGIQNFTN------EMNETFVLDLFCKIMINSF 189

Query: 79  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
           ++  S    +GT + P+IS INHSC PN  LVF+     +R++Q +    E+ +SYI+  
Sbjct: 190 SLMTSSYDTIGTAIDPIISRINHSCYPNTALVFDRNTVALRSLQKILPNQEITVSYIDIY 249

Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
            +   R   L  +Y F+C C RCI    F+
Sbjct: 250 NTQKNRHDELLSRYYFSCKCTRCIVSDGFE 279


>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
          Length = 409

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 47  QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 105
           Q   +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC P
Sbjct: 58  QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 116

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           NA + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 117 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 173


>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
          Length = 623

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 54/99 (54%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           +   E+A   ++  CN+HTI + EL+PL  G++P+ ++ NH C PN +  F G   V RA
Sbjct: 155 VGPRELALLIARFGCNSHTISDDELQPLAVGIFPLGAMANHDCRPNTLHAFRGGRMVFRA 214

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
           V+ +  G EV  SY E A     R+  L + +LF    P
Sbjct: 215 VRAIQPGEEVTTSYTELAAPRWERRAVLLQHHLFDIDAP 253


>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 573

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/233 (26%), Positives = 100/233 (42%), Gaps = 19/233 (8%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 133
           NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+  V  A   +  G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           Y++ +    TRQ  L+ +Y F C CP C      D +      +   C    C G     
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
              K  TC     V S   ++ + +E ++++           +   V      +EK+   
Sbjct: 397 ETVKCDTCGASWTV-SPNALRDLVNEGLSLVEADDQDQLDPDDLPRVAEVIARLEKIM-- 453

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
              P S   +     + K L       EALAY    +    +VYP+ HP + L
Sbjct: 454 ---PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVAL 503


>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 838

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 3/106 (2%)

Query: 50  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVL 109
           N V+ +   P+    ++A   +  ACN HTI + EL P+G G+YP  +++NH+C P    
Sbjct: 235 NFVDPLAGGPDP--RDVARLLAAFACNVHTIHDDELNPVGMGVYPTAAMMNHACAPTVAQ 292

Query: 110 VF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
            F EG  A  R V+ V KG E+ ISY+E   +   R+  L   Y F
Sbjct: 293 TFGEGTTATFRCVRDVRKGEEIAISYVELLATRAERRAELLNHYRF 338


>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
 gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
          Length = 544

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 76/140 (54%), Gaps = 2/140 (1%)

Query: 26  LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
           L   L +HM++I  +    + +I+     + ++   +++E  I    +KL  N++   + 
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMICHYGAKLELNSYNFHSP 246

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
               LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+      T
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKT 306

Query: 144 RQKALKEQYLFTCTCPRCIK 163
           RQ  L E+Y F C CP+C K
Sbjct: 307 RQTELSERYYFKCRCPKCAK 326


>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 612

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 47  QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 105
           Q   +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC P
Sbjct: 221 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 279

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           NA + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 280 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 336


>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 481

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 71/279 (25%), Positives = 126/279 (45%), Gaps = 40/279 (14%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E+S+++I E F KL  NA T+       +G  + P+ ++ NHSC PNA   F+     VR
Sbjct: 162 ELSLHKIIEYFCKLDINAFTLTTPFYDHVGAAIEPLAALCNHSCSPNAATDFDKGKIWVR 221

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           A++ + KG +V +SYIET      RQ  L ++Y F C C +C       +I+++A    +
Sbjct: 222 ALRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKC-------EIEKNAPDTHF 274

Query: 180 RCKDDGCSGFLLRDSDDKG---FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 236
                      ++++  +    F   + G   S + IK + S ++ L + TL        
Sbjct: 275 LRAITAVDSKTIQNAQQEAMELFEMVKPG-APSTDSIKNLRSAMSALRRTTLW------- 326

Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRV 295
                       L ++ Y       ++ R +LI  LM    ++ A  +C +  + V   V
Sbjct: 327 -----------PLTRQPY-------VRLRGELIASLMGARQFQSAFVHCVIRHLRVNPVV 368

Query: 296 YP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
           YP ++HP+  +  +   KL  +L  T+     + E V++
Sbjct: 369 YPNRWHPISSMHKWVFVKLMRYL--TQAGDLGVAEGVDL 405


>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
           grubii H99]
          Length = 447

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVR 119
            ++E+    S +  NA T+ +  L P+G  + P++++ NHSC PNA+ VF   GR   + 
Sbjct: 110 GVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIFLV 169

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI---- 175
           A+  +P G E+L SYI+T+     RQ  L  +Y F C C  C K     + ++S +    
Sbjct: 170 ALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVDPRW 229

Query: 176 -LEGYRCKDDGCSGFLLRDSDDK-GFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 230
            +    C  +G +   L ++ DK    C  C     KE+ K  A +V  L ++ ++ 
Sbjct: 230 CVRHRECSKEGKARMPLENATDKVEVRCIAC-----KEDFKVDADKVRKLVQEGMSF 281


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 1/104 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E ++ E AE   KL  NA    +++L  +G  L P +++ NHSCLPNA + F GR AV+R
Sbjct: 167 EGAVREAAEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLR 226

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A Q +  G E+ ISY +   S   RQ AL   Y F C C RC +
Sbjct: 227 AEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFECRCRRCTQ 269


>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 582

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 26/241 (10%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 133
           NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+  V  A   +  G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 189
           Y++ +    TRQ  L+ +Y F C CP C      D +      +   C    C G     
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396

Query: 190 --LLRDSDDKGFTCQQCG--LVRSKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYK 244
             L + SD +   C  CG     S   ++  +   ++++           +   V     
Sbjct: 397 ADLRQMSDSETVKCDTCGASWTVSPNALRDLVNGGLSLVEADDQDQLDPDDLPRVAEVIA 456

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
            +E++      P S   +     + K L       EALAY    +    +VYP+ HP + 
Sbjct: 457 RLEEIM-----PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVA 511

Query: 305 L 305
           L
Sbjct: 512 L 512


>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
 gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
          Length = 498

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 58/103 (56%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           + +  IAE  +++  N+ T   S   PLG  + P    +NHSC PNAV+ F+  L  V+A
Sbjct: 180 LDLETIAEYLARIEVNSFTFTTSFGDPLGLCIQPFACYMNHSCDPNAVVGFDEGLITVKA 239

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           ++ +    +V ISYI+       RQK L E+Y FTC C +C++
Sbjct: 240 LRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282


>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 656

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 80/174 (45%), Gaps = 16/174 (9%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  ++ +  S+   N+ T+    L P+G  + P+I++INHSC PNAV+VF          
Sbjct: 239 SARDLVDLTSRFTSNSFTLATPALAPIGVSVSPLIALINHSCDPNAVVVFPRSSTASTPA 298

Query: 112 --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
             E ++ VV A++ +    E+L SYI+T      RQ ALKE Y FTC C  C +      
Sbjct: 299 KLEPQMQVV-AIKDIEPDTEILTSYIDTTLPQPIRQSALKETYCFTCACTLCARPQDATT 357

Query: 170 IQ-ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 222
           I   SA+    RC   G +       +     C  CG   S    + I   +N+
Sbjct: 358 IDPRSAVWCAQRC--GGVASVYSAGDELVVGRCNTCGFSASMTTAESILDAINV 409


>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
          Length = 475

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/343 (23%), Positives = 137/343 (39%), Gaps = 38/343 (11%)

Query: 13  DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI----SINEIAE 68
           D+  P+   D++  V  L +H   +D  Q           NL  + P      S  ++ +
Sbjct: 116 DSDEPNNKIDSFIHVLQLESHFDRLDPSQQEDLGIFLMKFNLYWKKPIPGSLRSSRKLLD 175

Query: 69  NFSKLACNAHTICNSE--LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
             + +  N   IC+ E  +  +G+ LY   S+INHSC PNA  VF G   V++A++ +  
Sbjct: 176 LVAAIKNNQFAICDEESSICDIGSALYLNHSLINHSCKPNAFPVFNGTNLVIKALEKIAP 235

Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
           G E+ I+Y +T      R+  L + Y F C C  C    + D           R K    
Sbjct: 236 GEEIKIAYTDTKAVIQDRRDYLNDIYRFVCQCQGCTNDDEVD-----------RKKHLDK 284

Query: 187 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 246
            G ++R SD    + +   +VR  EE KK   E N++             +E    +   
Sbjct: 285 KGNVIRRSDAIWQSAET--MVRDMEEFKK-NKEWNLM-------------KEAAQGW--- 325

Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
             L +K     ++  ++  E      +E ++W   L      I  Y+ +Y   HP LG+ 
Sbjct: 326 --LARKFLPDENIFWIRLNEFAFDAGIETQEWSLCLETGASLIVNYKEIYGPMHPTLGIH 383

Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
                K+   +     A +    A  I+ + +   S   K+L+
Sbjct: 384 LMKFAKILLHIEKPNEAEEYFRRAFAIMSLFYEPESAVRKQLL 426


>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
 gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
          Length = 787

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 567 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 626

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 627 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 663



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 150 EQYLFTCTCPRC 161
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
          Length = 802

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSRCQRDAESF--LWAE-HAMVGAIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 337 THGTNSPFMKELILKLEE 354
            HG +S  M   + KL +
Sbjct: 730 RHGPSSIEMGHELFKLAQ 747


>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
 gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
          Length = 844

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 624 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 683

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 684 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 720



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 150 EQYLFTCTCPRC 161
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 616

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 49/82 (59%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG G  P+++  NHSC PNA +VF     V+RA++ + KG E+ + YI+ +     RQ  
Sbjct: 284 LGIGFDPILNSSNHSCDPNAAVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAE 343

Query: 148 LKEQYLFTCTCPRCIKLGQFDD 169
           LKE Y F+C C +C K   FD+
Sbjct: 344 LKESYFFSCRCSKCKKGAIFDE 365


>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 764

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 11/147 (7%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNA 77
           D  SLVE      +++ E       Q  N+  L    P +++ +  E  ++L      NA
Sbjct: 291 DGRSLVEHHDRLAAEVREG---FRQQARNVRWLAAHGPGLALPDADETVARLFAVAQANA 347

Query: 78  HTICN-SELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISY 134
            T+ + S LRP+G GLY   +++NHSCLPNA   +  EGRL  VRAV+ +  G EV ++Y
Sbjct: 348 LTLTDPSSLRPIGQGLYASAALLNHSCLPNANWSVDGEGRL-CVRAVRPIEAGEEVTVAY 406

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
           ++       RQ+AL++ + F C C +C
Sbjct: 407 VDPTLPYHARQQALQDHFFFACRCLQC 433


>gi|255072485|ref|XP_002499917.1| predicted protein [Micromonas sp. RCC299]
 gi|226515179|gb|ACO61175.1| predicted protein [Micromonas sp. RCC299]
          Length = 676

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 55/89 (61%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           +A   +K++CNAHT+C+ EL  +G G+YP  +++NH   P A   F+G+  V+RA + + 
Sbjct: 228 VARLLAKISCNAHTVCDEELNAIGVGVYPAAAMVNHGDAPTAAQSFKGKKIVLRATRDLK 287

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLF 154
           +G EV ++Y+E   +   R+ AL   Y F
Sbjct: 288 RGDEVTMAYVELLATRQERRAALHAGYNF 316



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 2/105 (1%)

Query: 259 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
           ++L++ R  + + +  +   D++ A       +P Y+  YP  HP LGL      K+E  
Sbjct: 549 IHLLRVRVLDAMQRAAVTAGDFESARDAAYAVLPAYRLSYPPCHPPLGLHLALIAKIEAH 608

Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
           L D   A++   EA+  L ++HG NS  ++ +   L E++AE  Y
Sbjct: 609 LVDLNAAVQFAREAISCLNVSHGRNSDVVRMMETLLGESEAELRY 653


>gi|308499663|ref|XP_003112017.1| CRE-SET-18 protein [Caenorhabditis remanei]
 gi|308268498|gb|EFP12451.1| CRE-SET-18 protein [Caenorhabditis remanei]
          Length = 507

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 73/325 (22%), Positives = 138/325 (42%), Gaps = 26/325 (8%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H  DI  DE+++  + ++   + L      I   +  + F K + N+H+I  +   
Sbjct: 139 SLVDHKVDIREDEEKMKHFEKVFTKMGLFRGDEMIEKGQFFDVFCKASINSHSIHTNAGN 198

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S+ NHSC P   +VF+G    +R +  VP        +  ISYI+   S 
Sbjct: 199 EVGMALDLGVSMYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAESTEQAFISYIDVGRSK 256

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R+K LK ++ F C C RC+      D ++ A L   RC +  C   +L    ++    
Sbjct: 257 YVRRKDLKSRWYFDCECTRCM------DPEDDA-LTAIRCANPACDAPVLTSETEEPMNI 309

Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
            C +C  +  + ++KK    +  L             Q+++S  +++        HP +V
Sbjct: 310 ACDKCKTIIEENDVKKAQEFMKSLPASFDPQCPADILQDLLSKAEVV-------LHPSNV 362

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            + + R  L  +  +L     +  + Q+    Y+  +P+    +G Q     K      +
Sbjct: 363 YVARLRTALFHVTGQLTMDNLSTMHTQI-YNNYKLCFPKADRHVGFQLLHIVKALIEKDE 421

Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
            + A+    +A+ I  +  G + P+
Sbjct: 422 RDEAMPYAFDAMNIFEVCFGLDHPY 446


>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 547

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 82/164 (50%), Gaps = 15/164 (9%)

Query: 4   LYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYA----QIANLVNLILQW 58
           L LRR    D +IP   +++ +L    +    +  E K  LL A    + +NL +L  +W
Sbjct: 152 LMLRRA---DKIIPEVYSESLALTYPKIYDCVESQETKDHLLMAKALREYSNLTDLDNKW 208

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
                  +A  F +L  N+  + ++  R  G   +P  +  NHSC PNA   F      +
Sbjct: 209 -------VASLFGRLNANSFCLTSAFGRRRGVYFHPGPARFNHSCDPNASYSFAKGKCYI 261

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           RA++ + K  ++ ISY++T  S  TR+  L+E+Y F C CP+C+
Sbjct: 262 RAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCL 305


>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
 gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
          Length = 650

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 34/277 (12%)

Query: 10  LQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---- 65
           LQND  + S   D+Y  V   V H    + +  L +  ++ L+N +L     SI+     
Sbjct: 352 LQND-FVDSLFFDDYRKVYNFVTHGQQRNAEDYLKWTVMSVLLNTVLVAAGFSISGSLKG 410

Query: 66  -----IAENFSKLACNAHTICNSEL---RP--------LGTGLYPVISIINHSCLPNAVL 109
                +  N   +  N+H I  SEL   +P        +G GLYP + + NHSC P    
Sbjct: 411 FLGKILLHNLQIVTYNSHEI--SELQRKKPKDSGFSVCIGAGLYPTLVLFNHSCDPGITR 468

Query: 110 VFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
            F G    +R ++++P G+ V  +Y  + T      R+K L E Y F C C  C +   +
Sbjct: 469 YFVGNAVYIRTIKNIPAGSMVAENYGQLYTRAGRRERRKLLAENYKFDCCCQACEE--DW 526

Query: 168 DDIQE-SAILEGYRC-KDDGCSGFLLR--DSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 223
             + E + ++  ++C  ++GC   LL   +S +    C +CG +       +   +V+ L
Sbjct: 527 PSLHEMNPMIRRFKCGANEGCGNELLFKLNSTENEMECNKCGGLTEVNASFETLKQVDFL 586

Query: 224 SKKTLA--LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
           ++   A  L S G+ +  +S Y  +I  L + L  P+
Sbjct: 587 NRYNDAARLYSQGDFERALSKYAALINSLDEILVPPY 623


>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Ailuropoda melanoleuca]
 gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
          Length = 793

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 128/307 (41%), Gaps = 61/307 (19%)

Query: 54  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           L LQ    +I  I +  SK     + I +S L  L TG++PV+S++NHSC PN  + F  
Sbjct: 487 LQLQCNAQAITTIQQTGSK----ENNITDSRLVRLATGIFPVVSLLNHSCSPNTSMSFVS 542

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
            +A +RA + + KG E+L  Y    +      RQ+ L+ QY F C CP C K     +  
Sbjct: 543 TVATIRASEKIGKGQEILHCYGPHHSRMGVADRQQKLRSQYFFDCGCPACQK-----EKH 597

Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
            +A+   +                 + F C +CG V    ++    S            T
Sbjct: 598 RAAVGPRW-----------------EAFCCSRCGAVLQGGDVLSCGS------------T 628

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
           SC    E VS   ++ +LQ           ++ R    + L+     ++A+   QL + V
Sbjct: 629 SC---TESVSREHLVSRLQD----------LRQRVGSARNLLSNGQLEQAI---QLLL-V 671

Query: 292 YQRVYPQF----HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
           Y+     F    H ++G       +    LGD + +   + +++ ++ + HG  S  M  
Sbjct: 672 YRHDAENFLSAEHSMVGEMEDALAQAYAALGDWQKSATHLQKSLRVVEVQHGPASVEMGH 731

Query: 348 LILKLEE 354
            + KL +
Sbjct: 732 ELFKLAQ 738


>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
          Length = 776

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 59/282 (20%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 137
           I NS+   L TG++PV+S++NHSC PN  + F   +A VRA   + KG E+L  Y   E+
Sbjct: 492 ITNSKQVRLATGIFPVVSLLNHSCSPNTSVSFISTVATVRASVQIGKGQEILHCYGPHES 551

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
                 RQ+ L+ QY F C CP C               E +R             S  +
Sbjct: 552 RMGAAERQQKLRSQYFFDCNCPACQN-------------EKHR---------TTTGSKWE 589

Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYH 255
            F C +C  +   +++                  SCGN    E VS  +++ +LQ     
Sbjct: 590 AFCCNRCRTLMQGDDV-----------------LSCGNTSCTEAVSRDRLVSQLQD---- 628

Query: 256 PFSVNLMQTREKLIKILM---ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
                 +Q + K+ + L+   +LE   + L  C+     +    P  H ++G       +
Sbjct: 629 ------LQQQIKMAQKLLKNGKLEQAIQLLLGCRYDAESF---LPAEHSVVGEIEDDLAQ 679

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
               LGD + +   + +++ ++ + HG +S  M   + KL +
Sbjct: 680 AYAALGDWQKSAAHLQKSLHVVEVRHGPSSVEMGHELFKLAQ 721


>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
          Length = 170

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 51/77 (66%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 145 QKALKEQYLFTCTCPRC 161
           +K L++QY F C C RC
Sbjct: 61  RKQLRDQYCFECDCFRC 77


>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
          Length = 806

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 337 THGTNSPFMKELILKLEE 354
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 677

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 18/201 (8%)

Query: 66  IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F G
Sbjct: 446 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 505

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
              VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +  
Sbjct: 506 TTMVVRAIRSISSGEEISENYGQIFTTTPESERKRKLRLQYFFDCNCEACREHWPLLEEI 565

Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
           +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K   
Sbjct: 566 DPTILR-FKCETGKECGNVLPVRTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 624

Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
                GNH+E +  Y  I KL
Sbjct: 625 RYLEQGNHREALKNYLKILKL 645


>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
           mulatta]
          Length = 802

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 337 THGTNSPFMKELILKLEE 354
            HG +S  M   + KL +
Sbjct: 730 RHGPSSVEMGHELFKLAQ 747


>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
          Length = 806

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 337 THGTNSPFMKELILKLEE 354
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 645

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 6/104 (5%)

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGR-LAV 117
           +++ +  S+   N  TI +  L PLG  + P +++INHSC PNA +VF     EG  L  
Sbjct: 264 SDLVDLISRFTTNTFTITSPTLAPLGACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQ 323

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           V A++++    E+L +YI+T   T  RQ+ALKE Y F C CP C
Sbjct: 324 VVALKYIGPDEEILTAYIDTTLPTGLRQQALKETYHFVCECPLC 367


>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 635

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 60/107 (56%), Gaps = 7/107 (6%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 117
           SI+E+ +  SK   N++T+    L P+G  + P+  ++NHSC PN  +VF   L+     
Sbjct: 261 SISELVDLISKFTLNSYTLTTPSLSPIGVSVSPLAGLLNHSCDPNVSVVFPRILSADKEP 320

Query: 118 ---VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
              + A+Q +P  +E+L SY++     + RQK LKE Y F C+C  C
Sbjct: 321 ALHIVAIQDIPADSELLTSYVDVTLPVLQRQKDLKETYSFNCSCHSC 367


>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Meleagris gallopavo]
          Length = 706

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 62/264 (23%), Positives = 112/264 (42%), Gaps = 9/264 (3%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 146
           L T  +PV+S++NHSC PN  + F G +A VRA Q +P G E+   Y        +  ++
Sbjct: 431 LATAFFPVLSLLNHSCSPNISVSFSGTVATVRASQPIPSGQEIFHCYGPHRCRMRVAERQ 490

Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC--QQC 204
            L  QY F C C  C  L + +   ES  +    C    C         ++   C  + C
Sbjct: 491 QLLSQYFFECRCQAC--LDELESDVESVSMRNSFC----CPSCQASMQGEEMLCCSNEAC 544

Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            L  S+E + +   ++    +K L L    N  E +      +   +    P  + + + 
Sbjct: 545 ALSVSRERLSRRLLDLQQQMEKALELLRDSNADEAIRMLLKCQMDARNFLSPEHLLMGEL 604

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
            + L +I   L  W+EA  + + +I + +  +      +G + +   ++ +       A+
Sbjct: 605 EDHLAQIYATLGKWQEAARHLERSIQLVEMHHGPSSVEMGHELFKLAQILFNGFAVSEAL 664

Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
            ++  A EIL +  G  S  ++EL
Sbjct: 665 STIQRAEEILSVHCGPQSTQIQEL 688


>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 678

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 91/201 (45%), Gaps = 18/201 (8%)

Query: 66  IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F G
Sbjct: 447 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 506

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
              VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +  
Sbjct: 507 TTMVVRAIRSISSGEEISENYGQIFTTTPEKERKRKLRLQYFFDCNCEACREHWPLLEEI 566

Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
           +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K   
Sbjct: 567 DPTILR-FKCETGKECGNVLPVKTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 625

Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
                GNH+E + +Y  I KL
Sbjct: 626 RNLEQGNHREALKSYLKILKL 646


>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 530

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 61/114 (53%), Gaps = 9/114 (7%)

Query: 64  NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGR 114
            ++ +  SK   N  T+ +S L  LG  + P++++INHSC PNAV+V+         E  
Sbjct: 180 GDLVDIISKFITNTFTLTSSSLSALGVSVSPLVALINHSCDPNAVIVYPRCSNEPSTEEP 239

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
           L  V A++ +    E+L +YI+T     +RQK LKE Y F C CP C K    D
Sbjct: 240 LMQVVAIRDIEVDEEILTAYIDTTLPRFSRQKFLKETYNFDCQCPSCTKYSGVD 293


>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
          Length = 750

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 138/339 (40%), Gaps = 64/339 (18%)

Query: 50  NLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 108
            LV  IL   E +++E I    +    N   + +S   P+G  +YP  +++NHSC PN +
Sbjct: 437 GLVLRILGCKEKTLDEAIRHTLTAFQQNNFGVTDSLYAPIGEAVYPHAALLNHSCSPNCI 496

Query: 109 LVFE-------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           L ++        +L +V A + +  G E++ SY++    T TR+  L++ + F C C RC
Sbjct: 497 LRYKIGLESSPPQLEIV-ACKDISSGEELVHSYVDLVLPTETRRNRLRDTHGFICECKRC 555

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASE 219
              GQ                   C+  L RD        T ++ G + S    ++I  E
Sbjct: 556 A--GQ-------------------CTVELPRDRQSWMLWPTKERLGEIPSSTPTQQIDIE 594

Query: 220 VNILSKKTLALTSCGNHQEVVSTYKM-----------------IEKLQKKL--------Y 254
             I  +   A      H ++V+T ++                 I+ LQ+ +        +
Sbjct: 595 EAIGGRSGEA-----EHLQIVNTSRLLQDQATRAMVNGNNDLEIQLLQQAVGVFLLSGGF 649

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP--QFHPLLGLQYYTCGK 312
            PF   L   R      L+      E +  C+  +          + HPLLGLQ YT G 
Sbjct: 650 SPFHNELYTARCAYFSALLANGKIDEGIEQCEHIVSSLSVCLGNVKHHPLLGLQLYTLGD 709

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
           L    G  + A  +   A EI+ ++HG   P ++ LI K
Sbjct: 710 LCSGGGCGQKAANAYRWAYEIMSVSHGAQDPMVQALIRK 748


>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 496

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 15/141 (10%)

Query: 52  VNLILQWPEISINEIAENFSKLACNAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVL 109
           +N +LQ     I+ + +    + CNA TI  C S +  LGTGLYP ++ INHSC PN  +
Sbjct: 206 INQVLQS---EIDVLYDALCSINCNAFTIIDCVSGVL-LGTGLYPGLTFINHSCDPNLQV 261

Query: 110 VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC--IKLGQF 167
              G++  ++AV+ + KG E  ISYI+     + R + L E + F C C +C  ++  + 
Sbjct: 262 TQIGKILTLKAVRPIKKGEEFTISYIDRTEGAIQRNEELMETFFFECQCHKCASVRSEKR 321

Query: 168 DDIQESAILEGYRCKDDGCSG 188
           +D         + CKD  C G
Sbjct: 322 NDFY-------FVCKDPSCKG 335


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 7/143 (4%)

Query: 50  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--GTGLYPVISIINHSCLPNA 107
           +L+ L+ +W    + E+ +   ++  NAH + +     L  G G++P+ ++INH+C PN 
Sbjct: 252 DLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACRPNC 311

Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
             V+ G    VR ++ V  GAE+ + YI+   ST  R++ L     F C C RC      
Sbjct: 312 TFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENPSSM 371

Query: 168 DDIQESAILEGYRCKDDGCSGFL 190
           DD      L+G  C D G  G L
Sbjct: 372 DD-----YLDGVCCTDCGERGCL 389


>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
          Length = 804

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
 gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
          Length = 804

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
 gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
          Length = 801

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 137
           I NS    L TG++PV+S++NHSC PN  + F   +A +RA Q + KG E+L  Y   E+
Sbjct: 517 ITNSRQVRLATGVFPVVSLLNHSCSPNTSVSFTSTVATIRAAQQIAKGQEILHCYGPHES 576

Query: 138 AGSTMTRQKALKEQYLFTCTCPRC 161
                 RQ+ L  QY F C+CP C
Sbjct: 577 RMGVAERQQRLSSQYFFDCSCPAC 600


>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
           jacchus]
          Length = 442

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/341 (23%), Positives = 132/341 (38%), Gaps = 110/341 (32%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           +SYI+    +  R+K LK+QY F CTC  C K                + KDD    FL 
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289

Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
                        G+  + +  +++  E+   SK TL       S G + EVV       
Sbjct: 290 -------------GVKDNPKASQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKL----- 331

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
             ++KLYHP +                                            LG+  
Sbjct: 332 -CREKLYHPNNAQ------------------------------------------LGMAV 348

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G   W  G+ E     + +A  IL +THG + P  K+L
Sbjct: 349 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 389


>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
           distachyon]
          Length = 791

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/311 (22%), Positives = 135/311 (43%), Gaps = 52/311 (16%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
           +C+ E   +   +Y   S+ NHSC PN    F  R   +R+ +++  G+ + +SY   AG
Sbjct: 467 MCSVEQIRVAQAIYMSGSLFNHSCRPNVHTYFHSRTLFLRSTEYIESGSPLELSYGPQAG 526

Query: 140 ST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD- 184
               + RQK+L+E Y F+C C  C +L   D + +S          A+ E   YR +++ 
Sbjct: 527 EMDLLERQKSLQENYKFSCRCSSCSELNLSDLVIDSFCCPRSFCLGAVSESTCYRSEENF 586

Query: 185 --------------------------GCSGFLLRDSD-----DKGF--TCQ-QCGLVRSK 210
                                            R+ D     D G+  +C+ Q  L    
Sbjct: 587 VHVSLNGSAVCKLSLPDVSKVDEDMEKVGKLFFRNDDADLKIDPGYCMSCRSQLDLSSVV 646

Query: 211 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 270
             + + AS++N    K L +    +   + +  + +E++ KKL HP+S  L Q  +K+ +
Sbjct: 647 ATLDRAASKINRF--KELIVIDDVSEVLITNALQSLEQI-KKLRHPYSKALAQAEDKIAE 703

Query: 271 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
              ++ D ++A  +C+ +I + +++Y   H  +  +      ++  +GD   A  ++T A
Sbjct: 704 AFAKVGDQEQARKHCEASIKILEKLYHPKHIAIAHELIKLISIKLCMGDKAGAAATVTRA 763

Query: 331 VEILRITHGTN 341
             I  + +G +
Sbjct: 764 EAIFLLYYGPD 774


>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
          Length = 800

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 57/283 (20%)

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 135
           +++ NS    L TG++PVIS++NHSC PN  + F   +A VRA Q + KG E+L  Y   
Sbjct: 506 NSVTNSRQVRLATGIFPVISLLNHSCSPNTSVSFISTVATVRASQQIGKGQEILHCYGPH 565

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
           ++      RQ+ L+ QY F C+C  C         ++ +   G R               
Sbjct: 566 QSRMRAAERQQKLRSQYFFDCSCAACQN-------EKHSAATGPRW-------------- 604

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
            + F C+ CG +   +++    S            TSC    E VS   +I +LQ     
Sbjct: 605 -EAFCCRSCGALMQGDDVLICGS------------TSC---TESVSRDHLISRLQD---- 644

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----HPLLGLQYYTCG 311
                 +Q +  + + L+  +  + A+   QL +   QR    F    H L+G       
Sbjct: 645 ------LQQQVGMARKLLRNDKPERAI---QLLLGC-QRDAQSFLSGEHSLVGEMEDDLA 694

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           +    LGD + +   + ++++++ + HG +S  M   + KL +
Sbjct: 695 QAYAALGDWQKSATHLQKSLQVVEVRHGPSSVEMGHELFKLAQ 737


>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4-like [Ornithorhynchus anatinus]
          Length = 815

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
           A   H I +S    L TGL+PV+S++NHSC PN  + F GR+AVV+A Q + +G E+L  
Sbjct: 508 ASEDHVIASSRQVRLATGLFPVVSLLNHSCSPNTSVSFSGRVAVVQATQPIGRGQEILHC 567

Query: 134 YI--ETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           Y           R++ LK QY F C C  C++
Sbjct: 568 YGPHRCRMDVAERRQKLKAQYFFDCRCQACLE 599


>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
 gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
          Length = 445

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+P+ +I+NH C PNA   F+ G LAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 225 GLFPLTAIMNHECTPNASHYFDNGTLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 284

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 285 MTKNFNCDCVRC-----NDNTENGTYLSALFCREQGCRGLVI 321


>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 115/294 (39%), Gaps = 58/294 (19%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           +    CN+  I +++L  +   +Y   S+ NHSC  N ++VF+ R   +R V+ V +G E
Sbjct: 241 YGATNCNSFGIFDNDLIVISDAIYLRASMANHSCDYNCIVVFDERKLQLRTVKDVQEGEE 300

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDG 185
             I Y++       R+  L+E+Y FTC C +CI+    LG                 DDG
Sbjct: 301 CTIGYVDVIHPAKERRAELEEKYHFTCKCVKCIEEINALG----------------PDDG 344

Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
                                         +  E+  L K    +    N  +     ++
Sbjct: 345 ------------------------------LGEELRDLKKSLEQIVDAENSHDWAKVIQL 374

Query: 246 IEKLQKKLYHPFSVN-----LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
            E   K +    S+      L+  R+      ++ + WK+A    QL I  Y   Y +++
Sbjct: 375 CEPYLKPMDSSSSLPANHQLLVMLRDTAFFACIQSQSWKKAAEMGQLNIESYIYHYGRYN 434

Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM---KELILK 351
           P +G+     G++   L     A K   EA  + ++THG     M   K+L+ K
Sbjct: 435 PNVGMYLLKIGEVLLNLDRLREARKCFKEAESVFKVTHGLQHSLMASVKKLLFK 488



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 38/66 (57%)

Query: 98  IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 157
           + NHSC  N   VF+G    +R ++ V +G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCAGVFDGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 158 CPRCIK 163
           C +C++
Sbjct: 61  CVKCVE 66


>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
           127.97]
          Length = 499

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+  L  V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
          Length = 804

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
          Length = 526

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E +I    E   K+  NA    + +L  +G  L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNIRRAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A + +  G E+ ISY +      TR++AL EQY F CTC RC
Sbjct: 216 AERPIHAGDEIEISYTDYTMPLSTRREAL-EQYSFECTCARC 256


>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
          Length = 499

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNAV+ F+  L  V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 823

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 114/269 (42%), Gaps = 35/269 (13%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           +I E+A++      + H      +R +   +Y   S+ NHSC PN VL +EGR+  +RA 
Sbjct: 513 AIYEVADD------DEHGGGTKPIRRVAEAMYATGSLFNHSCRPNTVLNYEGRVLTIRAC 566

Query: 122 QHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
           + + KG EVL  Y   +    S   RQKAL+ QY FTC C  C           S++ +G
Sbjct: 567 EDIAKGKEVLNCYGPHVAHIPSRDERQKALRHQYFFTCRCDAC----------SSSVEQG 616

Query: 179 YR------CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
            +      C  +GC+  L  + +      Q+  +   + EI K+A E    +++ L    
Sbjct: 617 GQDFDYLVCGSEGCTRELQINPEQ---LRQEVDIRVVQREI-KLAREFYERARRALDGGG 672

Query: 233 CGNH-----QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
                    +    +     +L+ K  HP    L +T + L +    L  ++ A  + + 
Sbjct: 673 GDGSGDGDVERAAQSLAHALRLRAKYLHPQHRELGRTHDALAEAHARLGRYEAAATHLER 732

Query: 288 TIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
            + V +  +  F   L  QY    +  WF
Sbjct: 733 AVAVLESRFAPFSFELADQYAKLAQA-WF 760


>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGP 514

Query: 80  ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
           gorilla gorilla]
          Length = 804

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P++++  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTVWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 CPNTSVSFISTVATIRASQRIQKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
           troglodytes]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514

Query: 80  ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
          Length = 835

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514

Query: 80  ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
 gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
 gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
          Length = 799

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
          Length = 799

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
           garnettii]
          Length = 749

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 17/120 (14%)

Query: 59  PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 103
           P++ I  +A   +  +L CNA  I              NS    L TG++PV+S++NHSC
Sbjct: 483 PDLDIWGVAMLRHMLQLQCNAQAITTIQQTGSEENIITNSRQVRLATGVFPVVSLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   E+      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSISFCSTVATIRASQQIGKGQEILHCYGPHESRMGVAKRQQKLRSQYFFDCNCPAC 602


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 130/307 (42%), Gaps = 27/307 (8%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL 85
           V A++ H  +  ++ +    + A LV  +L  P  +S  E  +  +++  N+H   +  +
Sbjct: 188 VMAMMGHEDECPQQWVDSITETAKLVMPLLPKPARLSAREFVKVCARINTNSHRQHHMFV 247

Query: 86  --RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
             R LG GLYP+ S+INHSC PN      G    +R +  V +G E+  SYI+   S   
Sbjct: 248 PQRILGVGLYPLASLINHSCQPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSK 307

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDG---CSGFL-LRDSD-- 195
           R+  L E   F C C RC        I +S    L G++C +     C G L L D +  
Sbjct: 308 RKAELLETKHFDCLCNRCSP-----PITDSVDRYLSGFQCPNKAKTSCDGLLVLPDGEGL 362

Query: 196 --DKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
             +K  +C   CG  ++  ++ K      I+   + +L    + +    +  M++KL   
Sbjct: 363 ASEKPVSCTAGCGETKTAGDLLKAQKRAEIMLTASQSLYY--DQKRTSDSKTMLDKLLTG 420

Query: 253 LY------HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
            Y      HP+   + Q     I +   L     ++ +C+  +   + V P  H      
Sbjct: 421 GYSPDVQLHPYHPTVFQAHVDSINVSDALGYTHMSIDHCEAVLRCAEAVLPMNHLETSNY 480

Query: 307 YYTCGKL 313
           YY  G L
Sbjct: 481 YYYQGML 487


>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
           boliviensis boliviensis]
          Length = 800

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 32/312 (10%)

Query: 59  PEISINEIA--ENFSKLACNA-------HTIC------NSELRPLGTGLYPVISIINHSC 103
           P++++  +A   +  +L CNA       HT C       S    L TG++PV+S++NHSC
Sbjct: 480 PDVTVWGVAMLRHMLQLQCNAQAITTIQHTGCKEGIVTGSRQVRLATGIFPVVSLLNHSC 539

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E++  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 540 SPNTSVSFISTVATIRASQRIGKGQEIVHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 599

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--IASE 219
              G    +      E + C    C   +      +G     CG     E +++  +AS 
Sbjct: 600 HSEGH--GMAARPRWEAFCCSS--CRAPM------QGDDVLSCGRRGCAESVRRDHLASR 649

Query: 220 VNILSKK---TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 276
           +  L ++      L S G  +  +      ++  +         L +  + L +    L 
Sbjct: 650 LQDLRQQVGMAQKLLSDGELEPAIQQLSGCQRDAESFLSAEHAVLGEIEDGLAQACAALG 709

Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
           DW+++  + Q ++ V +  +      +G + +   ++ +       A+ ++ +A E+L +
Sbjct: 710 DWQKSATHLQNSLRVVEARHGPSSVEMGHELFKLAQIFFNGFAVPEALSTIQKAEEVLSL 769

Query: 337 THGTNSPFMKEL 348
             G     ++EL
Sbjct: 770 HCGPWDDEIQEL 781


>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
           harrisii]
          Length = 802

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 117/276 (42%), Gaps = 32/276 (11%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
           L TGL+P IS++NHSC PN  L F G +  V A Q + +G E+L  Y    G     TRQ
Sbjct: 525 LATGLFPAISLMNHSCRPNTSLSFRGSVGSVHASQLIARGQEILHCYGPHEGRMDAATRQ 584

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILE---GYRCKDD----------GCSGFLLR 192
           + L+ QY F C C  C    Q +++  + +     G+RC             GCS    R
Sbjct: 585 QKLRSQYFFDCRCQAC----QEEEVHGAGVTPKQGGFRCPTCRAALQGLDPLGCSSGSCR 640

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
               +     Q G ++ + E+       +  ++    L +C    E   T +        
Sbjct: 641 AQVSRAHLLGQLGDLQQRVEMAGELLRGDRTARAVQLLLACRREAEDFLTSEH------- 693

Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
                 V + +  ++L +    L DW ++ A+ Q ++ V +  +      +G + +   +
Sbjct: 694 ------VLMGEIEDRLAQAYASLGDWPKSAAHLQNSLQVVKAQHGPASVEMGHELFKLAQ 747

Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
           + +       A+  + +A ++L + +G  +  ++EL
Sbjct: 748 VLFNGFAVTEAVSVIEKAEKVLSLHYGPENEQVQEL 783


>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Metaseiulus occidentalis]
          Length = 775

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 105/249 (42%), Gaps = 29/249 (11%)

Query: 69  NFSKLACNAHTICN-----------------SELRPLGTGLYPVISIINHSCLPNAVLVF 111
           +  +L CNAH I                    E   + T +YP  S++NHSC PN +  F
Sbjct: 455 HIQQLICNAHAITTIQQPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHSCNPNIISGF 514

Query: 112 E-GRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIKLGQF 167
             G   VV++V+ +  G EV   Y       MT   RQ AL+EQY F C C  C K G  
Sbjct: 515 RSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQTALQEQYFFRCDCTACQK-GDL 572

Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
           DD     I    RC  + C G L          C QC   +   E ++    ++ L  + 
Sbjct: 573 DD----QISMALRC--EYCEGPLSAVQSSGKADCLQCSTTQDCLEKEQKVFRMHDLFVQG 626

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
           L L    +H E +   +     Q+KL +  +  LM TR+ + K L  L  +++A+   + 
Sbjct: 627 LQLAEMDSHGEALERLQKCLASQEKLLYRHNKQLMITRDMVAKSLCALGRFRDAVGVLKS 686

Query: 288 TIPVYQRVY 296
            +   + +Y
Sbjct: 687 AVESVRHMY 695


>gi|405969486|gb|EKC34455.1| SET and MYND domain-containing protein 4 [Crassostrea gigas]
          Length = 324

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 76/147 (51%), Gaps = 19/147 (12%)

Query: 73  LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
           L CNAH I     +P          +G+G Y ++S+INHSC P+ V    G + VVRA++
Sbjct: 103 LPCNAHEISEILWKPGDPTVTNSIEIGSGAYALLSLINHSCDPSVVRHNYGNICVVRAIK 162

Query: 123 HVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGY 179
            + KG E+L +Y      T+   R+  L+ QY F C C  C ++L  + DI +   +  +
Sbjct: 163 PIKKGEEILDNYGALYPLTIREERRAKLRPQYFFDCNCDACQLELPLYFDIPDDVPV--F 220

Query: 180 RCKDDGCSG--FLLRDSDDKGFTCQQC 204
           +CKD  CSG  F+ +D D     C  C
Sbjct: 221 KCKD--CSGPIFISQDKDLAEAECSSC 245


>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
 gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
          Length = 680

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 78/152 (51%), Gaps = 14/152 (9%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLV----------NLILQWPEISINEIAENFS 71
           D Y  +E L   +  +D +Q+  + + A+ +          + IL++  I++NEI E +S
Sbjct: 332 DEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAVISKLRGEDAILKY--ITLNEIIELYS 389

Query: 72  KLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAE 129
            +  N H + +    R  G G++P  S +NHSC PNA     E  + V R+++ + KG E
Sbjct: 390 IVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVFRSLRPLKKGEE 449

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +L SY +       R+K L +QY F C C +C
Sbjct: 450 LLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQC 481


>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
          Length = 483

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 78/152 (51%), Gaps = 25/152 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K+L +       V++ LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 131 VEHFGEEEQKELRVD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 244

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +SYI+    +  R++ LK+QY F CTC  C K
Sbjct: 245 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 276


>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
 gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
          Length = 575

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 59/124 (47%), Gaps = 11/124 (8%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGRLAVVRAVQHV 124
            S+L  N  TI  SE  PLG G+YP  S+INHSC PNAV  F        +  +   + V
Sbjct: 257 MSRLIMNGFTISTSEQLPLGVGVYPAASMINHSCRPNAVPTFWFSTPSPPMLQITMCKSV 316

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
             G E+ ISY + +     R++ L + Y F C C  C       D +    + G +C+  
Sbjct: 317 RVGDEIAISYCDVSAPRYVRREGLIKNYKFACDCSHC------GDFETDGDMIGLKCQAS 370

Query: 185 GCSG 188
           GC G
Sbjct: 371 GCIG 374


>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
          Length = 828

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 540 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 599

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 600 PHESRMGVAERQQRLSSQYFFDCRCGAC 627


>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
          Length = 352

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 49/74 (66%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K 
Sbjct: 4   VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQ 63

Query: 148 LKEQYLFTCTCPRC 161
           L++QY F C C RC
Sbjct: 64  LRDQYCFECDCSRC 77


>gi|344296426|ref|XP_003419908.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Loxodonta
           africana]
          Length = 427

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/351 (22%), Positives = 149/351 (42%), Gaps = 41/351 (11%)

Query: 8   RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
           R L    + P  T     L V+   +H+  +D EK+ L+ + IA L +     L++P+  
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDT- 169

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
              +   F+++ CN  TI + EL  +G+ ++P   +   S +P ++ VF+  L    +  
Sbjct: 170 -ESLVVLFAQVNCNGFTIEDEELSHVGSAIFPDPVV---SGVPGSLTVFQAHLF---SQM 222

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
                  V  SYI+    T  R   L++ Y FTC C  C         +++A +E  R  
Sbjct: 223 FASLCGMVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDNAKVE-IRKL 276

Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
            D      +RD            +VR    + +           +  L  C   QE +S+
Sbjct: 277 TDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEMCALSQEKMSS 324

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 325 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLN 374

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
           +   +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 375 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKHEIE 425


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 449

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 59/107 (55%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
           +N     F KL+ NA ++       +GT +   ++ INHSC PN V + EG  A + A+Q
Sbjct: 155 VNAFLSLFCKLSINAMSLVTPAFDAIGTCMDSTLARINHSCQPNCVFMVEGATARLVALQ 214

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
            + KG  V ISY++T      R   L+++Y FTCTC +C++  +  D
Sbjct: 215 TLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSD 261


>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
          Length = 706

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 116/265 (43%), Gaps = 10/265 (3%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 146
           L T  +PV+S++NHSC PN  + F G  A VRA Q +P G E+   Y        +  ++
Sbjct: 430 LATAFFPVLSLLNHSCCPNTSVSFSGTTATVRASQLIPSGQEIFHCYGPHRCRMRVAERQ 489

Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
            L  QY F C C  C    + D     ++   + C   GC G  ++  D    + + C +
Sbjct: 490 QLLSQYFFECRCQACCDELESDVKSVVSLRNTFCCP--GCRG-PMQGEDMLCCSNEACAI 546

Query: 207 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ---KKLYHPFSVNLMQ 263
             S+E +     ++    K+ L L      ++     KM+ K Q   +    P  + + +
Sbjct: 547 SVSRESLSCRLQDLQQRIKRALELL---RDRKADQAIKMLLKCQVDAECFLSPEHLLMGE 603

Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
             + L ++   L  W+EA  + + +I + +  +      +G + +   ++ +       A
Sbjct: 604 LEDHLAQVYATLGKWQEAARHLERSIEIVEMHHGPSSIEIGHELFKLAQILFNGFAVSEA 663

Query: 324 IKSMTEAVEILRITHGTNSPFMKEL 348
           + ++  A EIL +  G  SP ++EL
Sbjct: 664 LSTIQRAEEILSVHCGPQSPQIQEL 688


>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 396

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI---ETAGSTMT 143
           G+GLY + S  NHSCLPNA + F     +  V A + +P G E+ ISY+   + + S  +
Sbjct: 296 GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSGQEICISYLSECDLSRSRHS 355

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           RQ  LKE YLFTC CP+C+      D+
Sbjct: 356 RQNILKENYLFTCDCPKCLSEADEPDV 382


>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
          Length = 743

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 455 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 514

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 515 PHESRMGVAERQQRLSSQYFFDCRCGAC 542


>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
          Length = 777

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/299 (23%), Positives = 129/299 (43%), Gaps = 15/299 (5%)

Query: 54  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           L LQ    +I  I +  SK     + + +S    L TGL+PV+S++NHSC PN  + F  
Sbjct: 471 LQLQCNAQAITTIQQTGSK----ENMVTDSRQVRLATGLFPVVSLLNHSCSPNTSVSFVS 526

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
            +A VRA Q + +G E+L  Y   E+      RQ+ L+ QY F C C  C    Q    +
Sbjct: 527 TVATVRASQQIGEGQEILHCYGPHESRMGVAERQQKLRSQYFFDCDCLAC----QNGKCR 582

Query: 172 ESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 229
            +A    E +RC    C G L++  D        C    S++ +     ++     K   
Sbjct: 583 AAAGPRWEAFRCH---CCGELMQGDDVLSCGSASCTESASRDRLVSRLQDLRQRVGKAQT 639

Query: 230 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 289
           L   G  +  +      ++  +         + +  + L +    L DW ++ A+ Q ++
Sbjct: 640 LLRNGKVERAIQLLLGCQRDAESFLSAEHSLVGEIEDALAQAYAALGDWHKSAAHLQRSL 699

Query: 290 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            V +  +      +G + +   ++ +       A+ ++ +A ++L + +G  +  ++EL
Sbjct: 700 RVVEVRHGPASVEMGHELFKLAQIFFNGFAIPEALNTIQKAEKVLLVHYGRGNEEIQEL 758


>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
          Length = 716

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 428 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 487

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 488 PHESRMGVAERQQRLSSQYFFDCRCGAC 515


>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
 gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
          Length = 666

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
          Length = 666

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
          Length = 423

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 6/105 (5%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           PE+ +    E + K+  N++ ICN E   +GTGLY   SI++HSC PNA  V+EG    +
Sbjct: 151 PEVGL----EIYGKMIINSYCICNDEHTAIGTGLYMGPSILDHSCSPNAHAVYEGHKLHL 206

Query: 119 RAVQHV--PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           RA + +       + +SYI+       RQ+ L+ QY F C C +C
Sbjct: 207 RAAEDINCSNFDGIRVSYIDVMAPKKVRQEELRSQYYFECNCMKC 251


>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
 gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
 gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
          Length = 804

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 130/315 (41%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PV+S++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVVSLLNHSC 542

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A ++A Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSMSFISTVATIQASQRIRKGQEILHCYGPHKSRMGVAERQQELRSQYFFDCACPAC 602

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                          E +R         +  +   + F C  CG     +++    S   
Sbjct: 603 QT-------------EAHR---------MAAEPRWEAFCCNSCGAPMQGDDVLHCGSR-- 638

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELERAV 677

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L  CQ     +  ++ + H L+G       +    LGD + A   +  ++ ++ + HG
Sbjct: 678 QQLLGCQRDAESF--LWAE-HALVGEIADGLARACAALGDWQKAATHLQRSLRVVEVRHG 734

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
 gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
          Length = 626

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 16/120 (13%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVL----------VFEGRLAVVRAVQHVPKGAEVLI 132
           ++ R +GTGL+P++S+ +HSC PN              + G + +V+ V+ V KG E+ I
Sbjct: 287 ADFRDIGTGLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCI 346

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS-GFLL 191
           SYI+    T  R++ L     F C C RC+      + +E+ ++  Y C +  C+ G+L+
Sbjct: 347 SYIDIMNPTCIRRRELWYSKYFVCRCSRCMS-----ETEENRMVRAYSCGESECTEGYLV 401


>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
 gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
          Length = 451

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 121/285 (42%), Gaps = 26/285 (9%)

Query: 76  NAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 131
           N H I +   L  +G GLY  +   +HSC PNA+   EG +A ++A+    ++       
Sbjct: 168 NRHAIGDQLYLNEIGKGLYLDLCAYDHSCRPNAIYTCEGFVARLKALDDSVNLSDRKRTH 227

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
             YI    S   R++ LKE + F C C RC             +L   +C  + CS  + 
Sbjct: 228 TCYITPLDSRAERRRLLKESWYFDCFCERCTD-------PNDHLLTALKC--NNCSTAVP 278

Query: 192 RDS--------DDKGFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
             +        DD  FTC +C     S+ ++++   ++  + K+     S    +  + +
Sbjct: 279 IHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIEKQLAKTGSETLSKSELDS 338

Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
             ++ K      + F   L+Q + +LI    +  D K+ L      +P  +  YP+ HP 
Sbjct: 339 MLLVCKSVLADENVFFAKLLQRKLQLI----DCNDSKQLLETSLQALPCLRVCYPEMHPT 394

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
           L   Y T G     L D + A++ + +A+   R T G+N    K+
Sbjct: 395 LAYHYMTIGIYYKNLADRKTAMQYLNDALRRFRFTLGSNHLLTKQ 439


>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
          Length = 799

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
           N   I NS    L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGVFPVVSLLNHSCRPNTSVSFTGTVATIRAAQRIAKGQEILHCYG 570

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
          Length = 582

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/325 (21%), Positives = 138/325 (42%), Gaps = 26/325 (8%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H +DI  DE+++  + ++   + L  +   I   E  + F K + N+H+I  +   
Sbjct: 200 SLVDHKTDIREDEEKMTHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGN 259

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S  NHSC P   +VF+G    +R +  VP        +  ISYI+   S 
Sbjct: 260 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSK 317

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R++ LK ++ F C C RC+      D ++ A L   RC +  C   +L    ++    
Sbjct: 318 FVRRRDLKSRWYFNCECTRCM------DPEDDA-LTAIRCANPSCDAPILTSETEEPMNI 370

Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
            C +C  +  + ++K     +        +L +  + +      + +    +++ HP +V
Sbjct: 371 ACDKCKTIVEENDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNV 423

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K      +
Sbjct: 424 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 482

Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
            + A+    +A+ I  +  G + P+
Sbjct: 483 RDEAMPYAFDAMNIFEVCFGLDHPY 507


>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 508

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 1/94 (1%)

Query: 73  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
           L  NA T+ N     +G  + P  S  NHSC PNA + F+G +  ++A+Q +    ++ I
Sbjct: 187 LETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQIFI 246

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
           SYI+       RQ  LK++Y FTC CP+C++ GQ
Sbjct: 247 SYIDNTEPWEKRQSELKKRYFFTCKCPKCVQ-GQ 279


>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
 gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
          Length = 503

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 249 LQK 251
           LQ+
Sbjct: 383 LQQ 385


>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 311

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 64/119 (53%)

Query: 43  LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
           + Y  I  + N +    EI+ +E+ + F+K++ N+  I N +    G+ L    S I+HS
Sbjct: 132 MFYRYIGIIKNKLGGALEINESEVLDIFTKMSINSVFILNGKFLNFGSCLSLEFSAIDHS 191

Query: 103 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           C PNAV +F GR  VV+A+  +    +V ++Y +       R+  LK+QY F CTC  C
Sbjct: 192 CRPNAVYMFIGRTLVVKALCDIANFDDVRVTYTDITQPRRMRRNFLKDQYFFDCTCEEC 250


>gi|428180198|gb|EKX49066.1| hypothetical protein GUITHDRAFT_136248 [Guillardia theta CCMP2712]
          Length = 719

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 15/112 (13%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           EI     +L     TIC+ E RP+GTGLYP+ S+ NH C PN ++ F G    +RA + +
Sbjct: 230 EILMTICRLYNYGFTICDEEERPIGTGLYPLASLCNHDCAPNCLVTFCGPSLQLRATEEI 289

Query: 125 PKG---------------AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
             G                E+ ISY+++A     R+K L ++YL  C+C  C
Sbjct: 290 EMGQVQMARVDWRLRSGLQEITISYVDSAEPHWLRRKTLHKRYLINCSCSLC 341



 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 13/138 (9%)

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
           S     E++  + +  +  + + H  +++L++ RE +    +E + WK AL+     +  
Sbjct: 572 SADRANELLQMFILAVQKAESILHDNNLSLLRLREAMQSQAVEAKSWKFALSTSYKVLRR 631

Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFL------------GDT-ENAIKSMTEAVEILRITH 338
             ++YP+  P +GLQ    GKL  FL            GD  + A+K + EA  IL  TH
Sbjct: 632 LYKIYPRCSPYVGLQEAFLGKLCMFLSGEPDKTLCHGFGDPLQEAVKHLREAKLILSGTH 691

Query: 339 GTNSPFMKELILKLEEAQ 356
           G+    ++++   L+EA 
Sbjct: 692 GSEHVLVQDINRLLQEAH 709


>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
          Length = 565

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 91/175 (52%), Gaps = 21/175 (12%)

Query: 40  KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
           + L+LY +I +   L  ++   S  ++ +  SK   N+ T+ ++ L  +G    P+ ++I
Sbjct: 194 QSLVLYMKIESPEGL-REYGIESGKDLVDLMSKFTTNSFTLTSTFLNAIGVATAPIPALI 252

Query: 100 NHSCLPNAVLVFEGR-------LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 152
           NHSC PNAV+VF          L V+ A+Q + +G EVL +Y++       RQK+LKE Y
Sbjct: 253 NHSCQPNAVVVFPATRKGAPPTLDVI-AIQPIRRGEEVLAAYVDITLPREIRQKSLKETY 311

Query: 153 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCG 205
            F C+C  C KL    D++E       +C +    GFL   S  KG +  C +CG
Sbjct: 312 AFECSCTLC-KLPGTLDLRECVFCP--KCNE----GFL---SSHKGQSAPCSKCG 356


>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
          Length = 338

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 57/94 (60%), Gaps = 1/94 (1%)

Query: 69  NFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
           +F+KL  N    C++ ++   G  +Y  +S  NHSC PNA +V+ G  AV+R+++ +P+G
Sbjct: 109 SFNKLMPNTFGWCDTFDMSSFGAVVYAELSRANHSCQPNAAVVYNGAAAVLRSMRDIPEG 168

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            EV ISY++   +   R++ L + Y F C C RC
Sbjct: 169 EEVCISYVDPTLARDVRRRELVQSYGFACDCARC 202


>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
          Length = 800

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 66/127 (51%), Gaps = 14/127 (11%)

Query: 68  ENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL-----VFEGRLAVVRAV 121
           E +++   N + I +S L   G G+YP+ S + NHSC+PNAV       FE     V A+
Sbjct: 531 EAYTRFG-NNNFIIHSHLNTFGHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVAL 589

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------QESA 174
           + + +G E+ I Y++ A    TRQ+AL+  Y FTCTCP C   G  D          E A
Sbjct: 590 RDIAEGEEITIPYLDPALPLHTRQEALRLNYGFTCTCPLCSWAGHLDPAPPPARGSAELA 649

Query: 175 ILEGYRC 181
           ILE   C
Sbjct: 650 ILESQLC 656


>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 570

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 51/91 (56%)

Query: 73  LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
           L+ NA  I   +  P G    P +++ NHSC PNA++V   R   +RA++ + K  ++ I
Sbjct: 194 LSTNAFRITLPDNTPFGMCFSPTLALANHSCKPNAIIVSNSRSISLRALRPIKKNEQIFI 253

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           SYI+      +RQ  LKE+Y FTC C  C K
Sbjct: 254 SYIDPTEDLPSRQSKLKERYFFTCKCDSCEK 284


>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
          Length = 558

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 67/315 (21%)

Query: 59  PEISINEIA--ENFSKLACNAHT-------------ICNSELRPLGTGLYPVISIINHSC 103
           P+++I  +A   +  +L CNA               + +S    L TG++PVIS++NHSC
Sbjct: 287 PDVTIWGVAMLRHMLQLQCNAQAMTTIQRTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 346

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 347 SPNTSVSFISTVATIRASQWIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 406

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 407 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 442

Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 443 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 481

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 482 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 538

Query: 340 TNSPFMKELILKLEE 354
            +S  M   + KL +
Sbjct: 539 PSSVEMGHELFKLAQ 553


>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
          Length = 542

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E S+ +  E   KL  NA    +++L  +G  L P +++ NHSCLPNA + F GR AV+R
Sbjct: 166 EESVRQAVEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLR 225

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A   + KG E+ ISY +   S++ ++KA    Y F C C RC +
Sbjct: 226 AESRIQKGDEIEISYTDYT-SSLGKRKAALAPYNFECRCRRCTQ 268


>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 499

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 72/155 (46%), Gaps = 19/155 (12%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  ++ +  S+   N  T+ ++ L P+G  + P ++  NHSC PNAV+VF          
Sbjct: 148 SAGDLVDLISRFTTNTFTLTSASLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASK 207

Query: 112 EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
           E  L +V A++ +  G E+ ISY++T      RQK L E Y FTC C  C K    D   
Sbjct: 208 EPSLNLV-ALRDIAPGKEIRISYVDTTLPKDLRQKELTEVYNFTCQCKLCTKPPAVDP-- 264

Query: 172 ESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCG 205
                E   C    C G   L   +D  F C +CG
Sbjct: 265 ----RESLWCPKK-CGGTCPLPTEEDNLFRCAKCG 294


>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
 gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
          Length = 497

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 249 LQK 251
           LQ+
Sbjct: 383 LQQ 385


>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
           castaneum]
          Length = 393

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 150/368 (40%), Gaps = 71/368 (19%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE-- 65
           +NY     L++++     K L  Y  +A++V   LQ       W       P+  +NE  
Sbjct: 28  NNYRYFNRLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELV 85

Query: 66  ------IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAV 108
                 I ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN  
Sbjct: 86  KLVGGFITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVT 145

Query: 109 LVFEGRLAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQ 166
             F     VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C     
Sbjct: 146 NYFMSDTIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS---- 201

Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 226
            D   E  +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K
Sbjct: 202 -DPAHELDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQK 254

Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
            +   S    + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  
Sbjct: 255 YMESDSSNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFH 310

Query: 287 LTI----------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
           L +                 +Y+      H L G Q   C  L+ FL + E  IK   EA
Sbjct: 311 LWLENEKARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EA 367

Query: 331 VEILRITH 338
             IL + +
Sbjct: 368 KAILNLYY 375


>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 798

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 586

Query: 206 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 236
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 587 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 646

Query: 237 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 273
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 647 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 706

Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 707 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 766

Query: 334 LRITHGTNSPFMKELILKLEEAQAEA 359
               +G+++  +   +  L++  A+A
Sbjct: 767 FSKYYGSHAETLFSYLPCLKQETAKA 792


>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
 gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
          Length = 499

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PN+   F+ +    VVRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305


>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
          Length = 781

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 458 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 517

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 518 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 569

Query: 206 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 236
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 570 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 629

Query: 237 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 273
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 630 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 689

Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 690 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 749

Query: 334 LRITHGTNSPFMKELILKLEEAQAEA 359
               +G+++  +   +  L++  A+A
Sbjct: 750 FSKYYGSHAETLFSYLPCLKQETAKA 775


>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
 gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305


>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
          Length = 732

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 74/307 (24%), Positives = 137/307 (44%), Gaps = 43/307 (14%)

Query: 72  KLACNAHTIC--------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
           ++  NAH I         N     + T +YP  S++NH+C P  +  F+G   +VRA+++
Sbjct: 426 QMVSNAHAITELCLIDENNERQERIATAIYPSASLMNHNCDPTVINSFQGNTLIVRAIRN 485

Query: 124 VPKGAEVLISYIETAGSTMTRQK------ALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
           V +G EV   Y    G    R +      AL+ QY FTCTC  C+     +D Q+  ++ 
Sbjct: 486 VRQGDEVFNCY----GPHYRRMRRSERVEALEAQYSFTCTCDSCLD-KNTEDFQD--VIY 538

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-------EIKKIASEVNILSKKTLAL 230
            + C    C G L+  + +      Q  L RS +       ++K     V++ ++ T A+
Sbjct: 539 SFSCP--SCQGSLINPTGNNSSAQNQMALCRSCKTPQSYFTQLKADLEAVSLDARGTEAM 596

Query: 231 TSCGNHQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
               +  +++   K++ K   L+ K       +L ++ +KL +    +  ++E     ++
Sbjct: 597 ----DQGDIMGAIKLLTKCVQLRSKALFKGHPDLGKSADKLAQCYAFIGKYEECEKMLRI 652

Query: 288 TIPVYQRVYPQFHPLLG--LQYYTCGKLEWFLGDTENA----IKSMTEAVEILRITHGTN 341
           ++   +  Y ++   +   LQ +T   +E   G  + A    +  + EA+ I RI +G  
Sbjct: 653 SLTAVEHRYGRYSIEMANELQKFTDVLMELASGRNQRARDELVNYLEEAMLIYRIHYGPW 712

Query: 342 SPFMKEL 348
           S   KEL
Sbjct: 713 STSYKEL 719


>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 510

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 125/303 (41%), Gaps = 35/303 (11%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
           + ++  +E F +  C    I   E+R +      GLYP ++++NHSC+ N   ++   + 
Sbjct: 179 LKLDRFSEEFIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDY 238

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
           R+ + R    VP G E+  SY  +   TM R++ L E   F C CPRC      D  +  
Sbjct: 239 RIRL-RTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRC-----SDPTELG 292

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
             +   +C        L  DS D   T  C  C    S + ++KI   +        A++
Sbjct: 293 THMSSLKCNKCDNGVILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKIIQAEVDTVEAIS 352

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
                  ++    +I+K +  L HP    L   R  L ++   ++++          +  
Sbjct: 353 GADGADAILERENVIKKYRSVL-HPHHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 411

Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVE 332
           +  C+L + V   V P +  + G+  Y          K +W  G  + A +KS M EA  
Sbjct: 412 VDMCRLLLQVLDIVEPGYSRIRGMTLYELHAPLLFLAKGQWNAGVIDEAGLKSKMIEAAT 471

Query: 333 ILR 335
           IL+
Sbjct: 472 ILK 474


>gi|16553701|dbj|BAB71564.1| unnamed protein product [Homo sapiens]
          Length = 313

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 113/271 (41%), Gaps = 52/271 (19%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 36  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQ 95

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
           + L+ QY F C CP C         +   +  G R                + F C  CG
Sbjct: 96  QKLRSQYFFDCACPAC-------QTEAHRMAAGPRW---------------EAFCCNSCG 133

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
                +++ +  S             SC   +  VS   ++ +LQ         +L Q  
Sbjct: 134 APMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ---------DLQQQV 170

Query: 266 EKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
               K+L   ELE   + L+ CQ     +  ++ + H ++G       +    LGD + +
Sbjct: 171 RVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKS 227

Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEE 354
              +  ++ ++ + HG +S  M   + KL +
Sbjct: 228 ATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 258


>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCAGLVV 305


>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
 gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
          Length = 497

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305


>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
 gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
          Length = 503

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PN+   FE +    +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKNFTCKCPRC-----SDPTEKGAYISGLYCRDTSCNGLVV 305


>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
           jacchus]
          Length = 798

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 78/158 (49%), Gaps = 9/158 (5%)

Query: 6   LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE 65
           L R+L+  +  P TT  + S ++A VA     D   + ++  +     L LQ    +I  
Sbjct: 447 LCRQLEAASFQPVTTGASSSQLKAAVAPELCPD---MTVWGVVMLRHMLQLQCNAQAITT 503

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           I    SK       +  S    L TG++PVIS++NHSC PN  + F   +A +RA Q + 
Sbjct: 504 ILHTGSKEG----IVTGSRQVRLATGIFPVISLLNHSCSPNTSVSFVSTVATIRASQRIG 559

Query: 126 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
           KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 560 KGQEILHCYGPHKSRIGVAERQQKLRSQYFFDCTCPAC 597


>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 319

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 7/123 (5%)

Query: 65  EIAENFSKLACNAHTICNSELRPL-GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
           E+ + + KL  N+ ++ +   R + G  LY   S+ NHSC PN   VFEG    VRA++ 
Sbjct: 178 EVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFVFEGSRLSVRAIKR 237

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           +  G E  ISY+ +   +  R++ L+  Y FTC CPRC+      D     ++   +C +
Sbjct: 238 IEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRCL------DSARDNLMLCVKCPN 291

Query: 184 DGC 186
           + C
Sbjct: 292 ESC 294


>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
 gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
          Length = 496

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 27/209 (12%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKVNSNSLRTLVQYLNEKSDSFIPNSNYVVIDAKMSVLQR 382

Query: 249 LQKKLYHPFSVNLMQTREKLIKILMELED 277
           LQ+            TR +  + + +L D
Sbjct: 383 LQQGTRGLLRGAGAHTRLRYSRDITQLMD 411


>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 386

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 79/341 (23%), Positives = 139/341 (40%), Gaps = 53/341 (15%)

Query: 27  VEALVAHMSDIDEKQL-LLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
           VE L  H++D+ ++QL  L+A +         W       S  +IA     +  N  T+ 
Sbjct: 79  VEELQDHVADLPQEQLQRLHADVRTFQEF---WSCGRSRPSREDIAHVLGIIKTNGFTLS 135

Query: 82  NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEG-------------RLAVVRAVQHVPKG 127
           +   L+ +G GL+P + ++NH C PN  +V                R   +RA+  V +G
Sbjct: 136 DQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHGNQSATNPALHSQRRVELRALGKVCQG 195

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
            E+ +SY++   ++  RQ+ L E + F CTC RC +  + D +  +A             
Sbjct: 196 EELTVSYVDFLDTSAERQRKLSELFFFHCTCQRCSQHLKDDAMTAAADGA---------- 245

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 247
                                  +E+   + E   L K    LT   ++QEV++      
Sbjct: 246 ------------------DAAEVQEVTAFSQEC--LEKIDRCLTG-RDYQEVLALCSACL 284

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           + Q+ L     +  ++     ++ L  L  + EA AY    +  Y+++YP  H  LG+  
Sbjct: 285 QKQRGLLADTHLCHLRVLSAAVEALSYLRCFPEAAAYALRLLQGYRKLYPPNHAQLGVAA 344

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
              G      G  E A + + +A   L ++HG N    ++L
Sbjct: 345 MRAGVAHLQAGMPEAAHELICQAYRTLLVSHGPNHSTTRDL 385


>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
 gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
          Length = 492

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC CPRC+     D  ++ + + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRCL-----DPTEKGSFISGLYCRDTNCSGLVV 305


>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
           troglodytes]
          Length = 452

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 76/152 (50%), Gaps = 25/152 (16%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
           V H  + ++K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
           L+ +G G++P + ++NH C PN  ++F  G    V+++ H            + +G E+ 
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +SYI+    +  R++ LK+QY F CTC  C K
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281


>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Monodelphis domestica]
          Length = 845

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 12/266 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           L TG +PVIS++NHSC PN  L F G + +++A + + +G E+L  Y   E      TRQ
Sbjct: 513 LATGFFPVISLLNHSCRPNTSLSFRGSVGIIQASRLIAQGEEILHCYGPHECRMDVTTRQ 572

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC---Q 202
           + L+ QY F C C  C +  +   +  +    G+RC    C   L  D     F C   +
Sbjct: 573 QKLRSQYFFDCHCQAC-QNEEVCPVSTAPKPSGFRCPS--CKAALQGDD----FLCCDNR 625

Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
            C    S+ +++    ++    K  L L       + +      ++  +       + + 
Sbjct: 626 TCTASASRAQLEHQVQDLQQQVKAALELLRDDRSAQAIQLLLGCQQDAENFLSSEHMLMG 685

Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
           +  + L +    L DW  + A+ Q ++ V +  +      LG + +   ++ +       
Sbjct: 686 EIEDHLAQAYASLGDWSRSAAHLQKSLLVVEARHGPASVELGHELFKLAQVLFNGFAVSK 745

Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
           A+ ++ +A ++L I +   +  M+EL
Sbjct: 746 ALCAIEKAEKVLSIHYDPENEQMQEL 771


>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
 gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
 gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
          Length = 391

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 32/65 (49%), Positives = 47/65 (72%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDV 229

Query: 125 PKGAE 129
             G E
Sbjct: 230 EAGEE 234



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 339 G 339
           G
Sbjct: 368 G 368


>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
          Length = 513

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E ++ +  E   K+  NA    + +L  +G  L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNVRKAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A + +  G E+ ISY +       R++AL EQY F CTC RC
Sbjct: 216 AERPIQAGDEIEISYTDYTMPLSARREAL-EQYSFECTCARC 256


>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
          Length = 544

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 75/140 (53%), Gaps = 2/140 (1%)

Query: 26  LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
           L   L +HM++I  +    + +I+     + ++   +++E  +    +KL  N++   + 
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSYNFHSP 246

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
               LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+       
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKI 306

Query: 144 RQKALKEQYLFTCTCPRCIK 163
           RQ  L E+Y F C CP+C K
Sbjct: 307 RQTELLERYYFKCRCPKCGK 326


>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
          Length = 558

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 78/153 (50%), Gaps = 19/153 (12%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-----------EISINEIAENF 70
           + ++ VEAL+    D  E +   +A +     ++  W            E++   + E  
Sbjct: 135 EKFAAVEALLDGNVDRFEGRAEAWAGMEVPAKVMGGWVAAGAKGKGKAVEVTERRVVELL 194

Query: 71  SKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
            K+  NA T   +     G GL+    ++++NHSC+PNA++ F GR A +RA++ + +G 
Sbjct: 195 CKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAFLRALRDIKEGE 249

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           E+ ISYI+   S   R+KAL E Y F C C RC
Sbjct: 250 EIEISYIDCTQSLEHRRKAL-ELYFFQCVCTRC 281


>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
          Length = 713

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 150/368 (40%), Gaps = 71/368 (19%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE-- 65
           +NY     L++++     K L  Y  +A++V   LQ       W       P+  +NE  
Sbjct: 348 NNYRYFNRLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELV 405

Query: 66  ------IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAV 108
                 I ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN  
Sbjct: 406 KLVGGFITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVT 465

Query: 109 LVFEGRLAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQ 166
             F     VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C     
Sbjct: 466 NYFMSDTIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS---- 521

Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 226
            D   E  +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K
Sbjct: 522 -DPAHELDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQK 574

Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
            +   S    + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  
Sbjct: 575 YMESDSSNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFH 630

Query: 287 LTI----------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
           L +                 +Y+      H L G Q   C  L+ FL + E  IK   EA
Sbjct: 631 LWLENEKARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EA 687

Query: 331 VEILRITH 338
             IL + +
Sbjct: 688 KAILNLYY 695


>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 302

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 74/163 (45%), Gaps = 17/163 (10%)

Query: 6   LRRKLQNDNVIPSTTTDNYSLVEALVAHM-----SDIDEKQLLLYAQIANLVNLILQWPE 60
           LRR L+  +  P   +D    +  +  +       D+DE   +   + A         P 
Sbjct: 149 LRRLLKLQSHEPEVDSDEREWLSGIARNTLRLMEQDVDEGVKVFGEKDA---------PG 199

Query: 61  ISINEIAENFSKLACNAHTICNSE---LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
             + E+      + CN+HT+  SE     P+GT +Y   S  NHSCLP+A    EG    
Sbjct: 200 YGVEELVRLMCCVRCNSHTLYASEEWPSEPVGTAVYLKGSAFNHSCLPSAEFYNEGTSLR 259

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
           VR+V+ +  G EV ISY+    +   R++AL  QY F C C R
Sbjct: 260 VRSVRDISAGEEVTISYVPVTETLWDRRQALWRQYKFDCECDR 302


>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
           24927]
          Length = 526

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 76/140 (54%), Gaps = 7/140 (5%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
           ++++ +   +  M+++  K    +++++  ++     PE     + + + KL  N+ ++ 
Sbjct: 155 EDFAKIGGDIWEMANLTAKAAAEFSKVSKQISF---EPEF----LRDMYCKLLINSASVV 207

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
                P+G  L    ++ NHSC PNAV++F+GR   +R+++ + K  E+ ISYI+   S 
Sbjct: 208 TQTFDPIGLCLAYQSAMFNHSCDPNAVMMFDGRQLSIRSLKEITKDTEITISYIDNLASR 267

Query: 142 MTRQKALKEQYLFTCTCPRC 161
             R++ LK +Y F C+C  C
Sbjct: 268 KERKEELKSRYFFDCSCSLC 287


>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 683

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 62/265 (23%), Positives = 111/265 (41%), Gaps = 11/265 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
           L T L+PV+S++NHSC PN  + F+GR A  RA + +P+G E+L  Y           R+
Sbjct: 413 LATALFPVLSLLNHSCDPNTSVTFDGRTATARASRAIPRGQEILHCYGPHRCRMKPSERR 472

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDSDDKGFTCQQ 203
           + L  QY F C C  C      +    S +     C+    G   FL   S         
Sbjct: 473 QRLLAQYFFECRCSACTDETGPESASGSTLFCCPTCRAPMQGEGLFLCCSSAS------- 525

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           C  +  KE  +    E+    +  + L + G     V   +      ++L  P  + + +
Sbjct: 526 CKAMLPKEGFQHQLRELRQHMEAAMDLLAQGKADRSVEQLRKCHLEAQQLLSPGHLVVGE 585

Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
             + L ++     +W EA A+ + +I V +  Y      +G + +   ++ +       A
Sbjct: 586 IEDHLAQVHATQGNWPEAAAHLRRSIEVVEAHYGAGSIEVGQELFKLAQVLFNGRFVPEA 645

Query: 324 IKSMTEAVEILRITHGTNSPFMKEL 348
           +++  +A  +L +  G   P ++EL
Sbjct: 646 LQAAEKAEAVLSVQLGPRGPEVQEL 670


>gi|380801673|gb|AFE72712.1| SET and MYND domain-containing protein 4, partial [Macaca mulatta]
          Length = 212

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 26  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQ 85

Query: 146 KALKEQYLFTCTCPRC 161
           + L+ QY F CTCP C
Sbjct: 86  QKLRSQYFFDCTCPAC 101


>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
 gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 537

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 3/142 (2%)

Query: 40  KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
           K   L A  A +   +LQ  E ++ +  E   K+  NA    +++    G  L+P +S++
Sbjct: 171 KDFELQATGAVVYAGLLQSDE-TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMV 229

Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
           NHSC+PNA + FE R A ++A + +  G E+LISYI+       RQ++L+  Y F C C 
Sbjct: 230 NHSCVPNAYITFEKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLR-LYHFQCNCV 288

Query: 160 RC-IKLGQFDDIQESAILEGYR 180
           RC   L  ++ IQ   +  G +
Sbjct: 289 RCKDDLNVYEVIQACGVSPGVK 310


>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
 gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
          Length = 646

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 72/325 (22%), Positives = 136/325 (41%), Gaps = 54/325 (16%)

Query: 13  DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP------------- 59
           D ++P    ++Y  V  L+ ++   + K L  YA  A+L+ L L+               
Sbjct: 282 DGIVP-CENNSYHNVYGLITNLEKYEGKHLFQYALTASLLALYLKTHTSYFSNESNDDDD 340

Query: 60  ---EISINEIAENFSKLACNAHTICNSEL-------------RPLGTGLYPVISIINHSC 103
              EI  + + ++  +L CN H I +                + +GT +YP  S++NHSC
Sbjct: 341 NNLEIVASTLFKHILQLICNGHAITDVITDDSNDSNAVDLIEKRIGTAIYPSASMMNHSC 400

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC 161
            P+ +  F     +VRA + +  G E+   Y   A       RQ+ L++QY F C C  C
Sbjct: 401 EPSIITSFSNNHLIVRASKPIKAGEEIFNCYGPHAKRMKKEERQEILRKQYFFNCHCMAC 460

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
            +  Q+D ++       Y C +  C G L+  +D     C  C             ++ +
Sbjct: 461 HQ--QYDIMER---FMAYMCFE--CKGPLIIINDSNNGYCLDC-------------NKNS 500

Query: 222 ILSKKTL--ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 279
           I+S+  +  +    G   + +  +     + +K+ + F+ +L  + ++      +LED+ 
Sbjct: 501 IISQDVIRESFKEQGKILDALKKFLNFFIIGRKILYKFNKDLCYSADQAASCFSKLEDYD 560

Query: 280 EALAYCQLTIPVYQRVYPQFHPLLG 304
           + L   + TI   ++ +  F   LG
Sbjct: 561 KCLVILKCTILSEEKRFGSFSIELG 585


>gi|32564211|ref|NP_871849.1| Protein SET-18, isoform b [Caenorhabditis elegans]
 gi|25005018|emb|CAD56600.1| Protein SET-18, isoform b [Caenorhabditis elegans]
          Length = 507

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 138/330 (41%), Gaps = 30/330 (9%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H  DI  DE+++  + +I   +        I   E  + F K   N+H+I  +   
Sbjct: 135 SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGN 194

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S  NHSC P   +VF+G    +R +  VP        E  ISYI+   S 
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSK 252

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R++ L  ++ F C C RC+      D ++ A L   RC +  C   +L    ++    
Sbjct: 253 YIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNI 305

Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALT---SCGNHQEVVSTYKMIEKLQK--KLY 254
            C++C  +  ++ +K     +     KTL  +    C    E +   K+ E L K  ++ 
Sbjct: 306 ACEKCKTIVEEDTVKAAQEYM-----KTLPASFDPKCPAEIEALPG-KLKELLAKAEQIL 359

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
           HP +V + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K  
Sbjct: 360 HPSNVYVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKAL 418

Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPF 344
               + + A+    +A+ I  +  G + P+
Sbjct: 419 IEKDERDEAMPYAFDAMNIFEVCFGLDHPY 448


>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 606

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 58/123 (47%), Gaps = 5/123 (4%)

Query: 76  NAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
           NAH +   NS       GLYP +S++NHSCLPN V    G    VRA++ V  G ++ ++
Sbjct: 306 NAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVT 365

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           YI        R + L +   F C C RC+      +      LE  +C   GC G+LL  
Sbjct: 366 YINIMEPRRIRARELMDTKHFACACERCV---SPLETHPDRFLEAVKCGAGGCDGWLLEQ 422

Query: 194 SDD 196
             D
Sbjct: 423 RAD 425


>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
           africana]
          Length = 478

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 44/308 (14%)

Query: 52  VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 105
           VN  L+ WP    + S   I+  F  + CN  T+ +   L+ +G G++P + ++NH C P
Sbjct: 151 VNTFLEYWPPQNQQFSTQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210

Query: 106 NAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           N  ++F  G    V+++ H        +S  +  G T   Q  +K   L TC    C+ L
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQ------MSKAQDPGKT-GPQFVIKLVGLTTCFFS-CLAL 262

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 224
           G        A+ EG   KDD      L   DD           +  +E+ K   E+  LS
Sbjct: 263 GV------EALPEGL--KDD----LFLGVKDDP----------KPSQEVVK---EMIQLS 297

Query: 225 KKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 280
           K TL       S G + EVV   +   + Q+ ++   ++  ++    + ++L  L+ ++E
Sbjct: 298 KDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFEE 357

Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 340
           A  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG 
Sbjct: 358 AADYAKRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGP 417

Query: 341 NSPFMKEL 348
           + P  K+L
Sbjct: 418 SHPITKDL 425


>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
 gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
          Length = 498

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
 gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
          Length = 496

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
 gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
          Length = 584

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/313 (21%), Positives = 128/313 (40%), Gaps = 61/313 (19%)

Query: 62  SINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           S+++  + F KL      + +S E  P+G  +Y   S+ NHSCLPN    F  R   +R 
Sbjct: 240 SVDDPPDQFRKLTSVGDALTSSLEQVPVGQAIYKAASLFNHSCLPNIHAYFLSRTLFIRT 299

Query: 121 VQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
            ++V  G  + +SY    G +    R + L ++Y F C C  C +L   D      +L  
Sbjct: 300 TEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYSFRCQCRGCSQLNLSD-----LVLNA 354

Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE--------VNILSKKTLAL 230
           + C +  C+G +L  +   G T +     R+ E+ +K  S         V+I    +LAL
Sbjct: 355 FCCVNHNCAGVVLESTIINGETRKLNNFPRAPEK-QKFDSHLQGHKLNIVDINDVASLAL 413

Query: 231 T--------------SCGNHQEVVSTYKMIEK---------------------------- 248
                           CG+H+++ ++++ I K                            
Sbjct: 414 KFNNSSLHIHPGFCLHCGSHRDLDASHEAINKAWSYIKRLQEAIISKDISGTTLLDASRA 473

Query: 249 --LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
             + +   H ++ ++ +  + L +    + D++ A  +C+ +I + Q +Y   H ++G +
Sbjct: 474 LGILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHCKESIKILQTLYDPDHIVIGYE 533

Query: 307 YYTCGKLEWFLGD 319
                 ++  L D
Sbjct: 534 LVKLASIQLSLDD 546


>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
           2509]
          Length = 547

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 2/120 (1%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           ++ +  E   K+  NA    +++    G  L+P +S++NHSC+PNA + FE R AV++A 
Sbjct: 172 TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAVLKAE 231

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 180
           + +  G E+LISYI+       RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 232 RDLEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 290


>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 422

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            G  +Y   S+ NHSC PNA  V  GR   + A++ +    E+ ISY+     +  R+  
Sbjct: 251 FGLAIYLEASLFNHSCYPNAARVQRGRSIDIIAIRDIEPNEEICISYLNITNGSHERKDH 310

Query: 148 LKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
           LK  YLF C C RC +     ++I  S I    R K   CSG+L      +G   +QC  
Sbjct: 311 LKNNYLFDCVCIRCTQTNPDIENIVRSFICRNPRVK---CSGYLYL---PQGSDSRQCNF 364

Query: 207 VRSKE--EIKKI 216
            + KE  EIK+I
Sbjct: 365 CQWKESSEIKRI 376


>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 740

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/156 (32%), Positives = 79/156 (50%), Gaps = 13/156 (8%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIAN-LVNLI--LQWPE-----ISINEIAENFSKL 73
           D+Y  + +L   +S I+ KQL  +    N + N    ++ P+      +  EI + + K+
Sbjct: 400 DSYEDLASLTTDVSKIESKQLQSFTVATNSITNFFAAVKGPKYLSDPFTKQEIFDLYPKI 459

Query: 74  ACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVL 131
             NAH   +      +G G+YP  S INH+CLPN     +   L + R+ + + KG E+ 
Sbjct: 460 LFNAHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHGLILYRSSRDILKGEEIT 519

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCI--KLG 165
            SY++     + R+K LK QY F C C RC+  KLG
Sbjct: 520 TSYLDILKPKLQRRKDLK-QYSFVCQCERCLNEKLG 554


>gi|443926535|gb|ELU45157.1| zf-MYND domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 584

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 46/264 (17%)

Query: 64  NEIAENFSK------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----- 112
           N +   F K      +  NAHT+    L P+G  L PV ++INHSC+PN V+VF      
Sbjct: 189 NSVKAGFGKRYRGSIIEANAHTLSTPSLTPIGVALSPVAALINHSCVPNCVVVFPKASET 248

Query: 113 ---GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-----IKL 164
                  ++ A+  +  G EV I +  T G      K L+E+Y F C CP C     +K 
Sbjct: 249 KKVPNEMLIIAIGKILPGEEVQIPH--TLGK-FADNKILQERYFFKCHCPNCEMTSLVKT 305

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 224
              D  +       +RC    C GFL   S       +Q                 N+ S
Sbjct: 306 PYVDSRR------AFRCASKTCEGFLPMPSMQSQLGQKQ---------------RFNVYS 344

Query: 225 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV--NLMQTREKLIKILMELEDWKEAL 282
             + AL++    Q    + K   K ++      S   +   T  + + +   L D + A 
Sbjct: 345 AISCALSNPRRSQMQSDSEKKGSKGRRNFNSLVSYFRSPGLTYHQSLTLCAGLLDPERAF 404

Query: 283 AYCQLTIPVYQRVY-PQFHPLLGL 305
            Y    IP+  R + P  HPLL L
Sbjct: 405 KYTSNLIPMVSRFFHPSAHPLLAL 428


>gi|388854233|emb|CCF52152.1| uncharacterized protein [Ustilago hordei]
          Length = 938

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  E+ +   + A N+ T+ +S+L PLG  ++P +++INH+C PNA +VF          
Sbjct: 481 SAKELLDLVCQFASNSFTLTDSDLNPLGVCMHPSMAMINHACTPNAAVVFPFGGAAKNGQ 540

Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
                 E R+  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C K
Sbjct: 541 QKWNDGEDRIMQLVALRAIEPGEELLISYVDIADTYQDRRNYLKKRYCFDCRCELCRK 598


>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
          Length = 666

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 16/139 (11%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQ 145
           +G GL+P  +++NHSC P A++ + G  AVVRA++ + K  E+ I+Y  T      ++R+
Sbjct: 472 IGIGLFPTAALLNHSCNPEAIVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDEELSRR 531

Query: 146 KALKEQYLFTCTCPRCIKLG----QFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGF 199
             LKE + F CTC  C++      + D  Q   + E        C   LL D   D+K  
Sbjct: 532 HQLKETHFFHCTCKACLEGWPMWLEMDQNQPDWLCE-------ACGSILLSDKIEDNKFA 584

Query: 200 TCQQCGLVRSKEE-IKKIA 217
            C++C   ++ E+ I K+A
Sbjct: 585 KCKKCSHRQNLEDAINKLA 603


>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 495

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 75/142 (52%), Gaps = 4/142 (2%)

Query: 26  LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
           + + L  H+++I E Q  L  +I      +  +    ++E  +A   +KL  N+  +  S
Sbjct: 140 VFDGLETHINEISESQGQL-DRINLTARAVKNYSGTEMDEGTVASYAAKLDLNSFNLTTS 198

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
               +G  ++P   +INHSC  N+ + F+G    V+A++ + KG ++ ISYI+T      
Sbjct: 199 MYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVKAMRPIKKGEQIFISYIDTTTPYDI 258

Query: 144 RQKALKEQYLFTCTCPRCIKLG 165
           R+  LKE+Y F C C +C K+G
Sbjct: 259 RRNELKERYFFDCQCTKC-KMG 279


>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
 gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
          Length = 544

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 74/140 (52%), Gaps = 2/140 (1%)

Query: 26  LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
           L   L +HM++I  +    + +I+     + ++   +++E  +    +KL  N+    + 
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSFNFHSP 246

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
               LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+       
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKI 306

Query: 144 RQKALKEQYLFTCTCPRCIK 163
           RQ  L E+Y F C CP+C K
Sbjct: 307 RQTELLERYYFKCRCPKCGK 326


>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
 gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
          Length = 506

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTKCSGLVV 305


>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 564

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 117
           E++   + E   K+  NA T   +     G GL+    ++++NHSC+PNA++ F GR A 
Sbjct: 190 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 244

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +RA++ + +G E+ ISYI+   S   R+KAL E Y F C C RC
Sbjct: 245 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCMRC 287


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 403 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 458

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 459 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 505


>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Taeniopygia guttata]
          Length = 797

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 120/294 (40%), Gaps = 22/294 (7%)

Query: 72  KLACNAHTI-CNSELRP------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           +L CNA  I    EL P            L T  +PV+S++NHSC PN  + F G  A V
Sbjct: 491 QLQCNAQAITVMQELGPGDGAVVDKKPVRLATAFFPVLSLLNHSCCPNTSMSFSGTAATV 550

Query: 119 RAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
           RA Q +  G EVL  Y        +  ++ L  QY F C CP C++  +       +I  
Sbjct: 551 RASQPISSGQEVLHCYGPHWCRMRVAERQQLLRQYFFECRCPACLEELESGVKSVVSIRN 610

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
            + C    C   +  + D    + + C    S+  +     ++    KK L +   G   
Sbjct: 611 SFCCPK--CQAQMQGEEDTLCCSNEACATSASRNHLSGRLQDLQQQIKKALGMLRVGKAD 668

Query: 238 EVVSTYKMIEKLQ---KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
           + +   KM+ K Q        P  + + +  + L ++   L  W+EA  + + +I + + 
Sbjct: 669 QAI---KMLLKCQMDAGTFLSPEHLLMGEMEDHLAQVYATLGKWQEAARHLKKSIEIVEM 725

Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            +       G + +   ++ +       A+ ++  A  IL +  G  +  ++EL
Sbjct: 726 HHGPSSVETGHELFKLAQILFNGFAVSEALSTIQRAEGILSVHFGPQNAQIQEL 779


>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
           FGSC 2508]
          Length = 668

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 2/120 (1%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           ++ +  E   K+  NA    +++    G  L+P +S++NHSC+PNA + FE R A ++A 
Sbjct: 192 TLKQAMEVLCKMQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAE 251

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYR 180
           + +  G E+LISYI+       RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 252 RDIEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 310


>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
 gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 593

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)

Query: 64  NEIAENFSK---LACNAHTICNSELR--------PLGTGLYPVISIINHSCLPNAVLVFE 112
           +++ +N SK   L   AH +  +  R        P+G  L P+++  NH C PNA + F+
Sbjct: 201 DDLEKNNSKWDDLLLQAHALSTNAFRVESNVGNGPIGLCLDPLLARANHCCYPNAAITFD 260

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           G+ A +RA+  +  G ++ ISYI+       RQ AL+E + F C C RC   
Sbjct: 261 GKRATLRALFPIKNGEQIFISYIDETQRQEVRQAALEETWFFKCRCSRCTNF 312


>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
           Y34]
 gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
           P131]
          Length = 542

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 55/90 (61%), Gaps = 1/90 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           K+  N+    +++    GT L PV+++ NHSC+PNAV++F  R A +RA   + +G+E+ 
Sbjct: 188 KIQTNSFDRFDADTGASGTFLDPVLAMANHSCVPNAVVLFWRRKAYLRAEMPIKQGSEIS 247

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           ISYI+       RQ+ L   Y FTC CPRC
Sbjct: 248 ISYIDYTKPVRFRQEDL-WLYHFTCKCPRC 276


>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
           aries]
          Length = 391

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 31/65 (47%), Positives = 46/65 (70%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           +I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229

Query: 125 PKGAE 129
             G E
Sbjct: 230 EAGEE 234



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 339 G 339
           G
Sbjct: 368 G 368


>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
          Length = 403

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 110 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 165

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 166 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 212


>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
           scrofa]
          Length = 788

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 59/108 (54%), Gaps = 6/108 (5%)

Query: 58  WPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
           +PE +I  +A   +  +L CNA  I  + ++  G  ++P++++  HSC PNA + F   +
Sbjct: 481 YPEWNIWGVAMLRHMLQLQCNAQAI--TAIQQTGDSIFPIVNVFXHSCSPNASVSFISTV 538

Query: 116 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
           A VRA Q + KG EVL  Y   E+      R + L+ QY F C CP C
Sbjct: 539 ATVRASQPIRKGQEVLHCYGPHESRMGVAERHQKLRSQYFFDCNCPPC 586


>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
 gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
          Length = 73

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 46/73 (63%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           GTG++P    +NHSC PN+V VF G    ++A++ +P G E+ ISYI+      TRQ+ L
Sbjct: 1   GTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEEL 60

Query: 149 KEQYLFTCTCPRC 161
           + Q+ F C C RC
Sbjct: 61  QTQFCFYCQCHRC 73


>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
          Length = 675

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 85/179 (47%), Gaps = 6/179 (3%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
            A T+  ++   +G G+YP ++++NHSC P  V  F G   +VRA++ +  G E+  +Y 
Sbjct: 466 GAKTLAKAKSMFIGGGVYPTVAMLNHSCNPGVVRYFIGTTMIVRAIRTISVGEEISENYG 525

Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLR 192
            I T    + R++ L+ QY F C C  C       D  +  IL  ++C+    C   LL 
Sbjct: 526 PIFTTTPEIERKRKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLLV 584

Query: 193 DSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
            SD K F   C +CG   +  +  K+  + + L +        G +++ +  Y  I KL
Sbjct: 585 RSDTKEFMIGCAKCGKSTNILKGLKVLQDTDALFRVASTNLEEGRNEQALKAYLEILKL 643


>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 675

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 78/168 (46%), Gaps = 8/168 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G GL+P I++ NHSC P  +  F G   VVRA++ +  G E+  +Y  I T      R+
Sbjct: 478 IGGGLFPTIALFNHSCNPGVIRYFIGTTMVVRAIRSIAAGEEISENYGPIFTMSPEAERK 537

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDSDDKGFT--C 201
           + L+ QY F C C  C+      D  +  IL  ++C D G  C   L   +D   F   C
Sbjct: 538 RTLRWQYWFDCNCEACVGHWPLLDDIDPTILR-FKC-DTGRECGNILPVKTDTNEFMIRC 595

Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
            +CG   +  +  K   + + L +        G H E +S+Y  I KL
Sbjct: 596 PKCGKNTNILKGLKALQDTDSLFRVASNNLEDGKHVEALSSYLKILKL 643


>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
 gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
          Length = 495

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 73/143 (51%), Gaps = 3/143 (2%)

Query: 21  TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAH 78
           ++ + + + L  HM++I   Q  L  +I      + ++    ++E  +A   +KL  N+ 
Sbjct: 135 SEEFKVFDGLETHMNEISGSQGQL-DRINLTARAVKRYSGTDVDEGTVASYAAKLDLNSF 193

Query: 79  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
            +  S    +G  ++P   +INHSC  N+ + F+G    V+A+  + KG ++ ISYI+T 
Sbjct: 194 NLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVKAMHPIKKGEQIFISYIDTT 253

Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
                R   LKE+Y F C C +C
Sbjct: 254 TPYDIRCNELKERYFFDCQCTKC 276


>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 306

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 2/87 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LGT L    S +NHSC P A + FEGR   VRA+  +P G E+  +Y++ +GS  +RQ  
Sbjct: 9   LGTTLDLAASFMNHSCNPGAFVFFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQAT 68

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESA 174
            + +Y F C C RC      +D+Q+ A
Sbjct: 69  TEAEYFFQCHCVRCED--DLEDLQQIA 93


>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 448

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 4/125 (3%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           K+ CN   I + + + LG  + P+ S  NHSC PN V V    L   +A+  +PKG+++ 
Sbjct: 272 KIRCNQFGIWSKKDKCLGVSVTPIASYFNHSCCPNIVDVRGTTLLEFKALHFIPKGSQLC 331

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG-FL 190
           ISY++   +T +RQ  L   Y F C C RC   G   D  ++ I + Y      CSG + 
Sbjct: 332 ISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKG---DSIDNWISQFYCQGTKKCSGTYF 388

Query: 191 LRDSD 195
           L D +
Sbjct: 389 LEDEE 393


>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
           1558]
          Length = 885

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 1/102 (0%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           T ++P +S+INHSCLPN V +  G +AVVRA+  +  G E+ ISY        TR    K
Sbjct: 553 TAIWPTVSMINHSCLPNTVHIAWGDVAVVRALNDMSAGTELTISYFSREQPYETRAAKAK 612

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
            QYLFTC C  C      + I+   ++E +          LL
Sbjct: 613 -QYLFTCDCNLCAADRMDESIRRGKLMEKFDMAKQASRNILL 653


>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 389

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 62/119 (52%)

Query: 45  YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
           +  + + VN  L     +   + +  ++L  NA T+ N     +G  + P  S  NHSC 
Sbjct: 73  FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 132

Query: 105 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           PNA + F+G +  ++A+Q +    ++ ISYI+       RQ  L+++Y FTC CP+C +
Sbjct: 133 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWGKRQSELQKRYFFTCKCPKCAQ 191


>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 555

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 74/145 (51%), Gaps = 12/145 (8%)

Query: 26  LVEALVAHMSDI-------DEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACN 76
           L + L +HM D+       D+ Q   + +I      +  +   +++E  IAE  +KL  N
Sbjct: 187 LFKGLESHMKDVCIRNANGDKDQ---WERILLTAKAVKNYSGCAMDESLIAEYAAKLDVN 243

Query: 77  AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
           +    N     LG  ++P  +++NHSC  NA + F+     V+A+  + KG ++ I+Y++
Sbjct: 244 SFNFDNVMDERLGIYMHPYAALMNHSCDYNATVAFDDDRLHVKALHPIKKGEQIFITYVD 303

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC 161
           T      RQK L ++Y FTC C +C
Sbjct: 304 TTNPYKIRQKELSDRYYFTCRCSKC 328


>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
 gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
          Length = 781

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/364 (19%), Positives = 153/364 (42%), Gaps = 52/364 (14%)

Query: 40  KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISI 98
           + ++L +Q+      +++   +  +  +++  KL+ +   + +S E  P+G  +Y   S+
Sbjct: 423 QAIILVSQVRVNAMAVVRMKSVDAHCPSDHLVKLSHSGDALTSSVEQVPVGQAIYTAGSL 482

Query: 99  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTC 156
            NHSC PN    F  R   +R  +H+  G  + +SY    G      R K L+++Y F C
Sbjct: 483 FNHSCQPNVHAYFLSRTLFIRTTEHLATGCPLELSYGPQVGQRDCKDRLKFLQDKYSFRC 542

Query: 157 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK-K 215
            C  C  +   D +Q +     +RC D  C G +L  S       +     R+ E  +  
Sbjct: 543 HCNGCSIVNLSDLVQNA-----FRCIDLNCVGVVLDRSVINSEIKKLKNFPRAPERQRLD 597

Query: 216 IASEVNILSKKTLALT------------SCGN-------HQEVVSTYKMIEKLQ------ 250
           +  +V+ L+   L L+            +CG+       H+ + + +   ++LQ      
Sbjct: 598 LCLQVDDLAHLALELSNGPLHIQPGCCLNCGSYCDLEAVHEGMRTAWIYFKRLQDAIVVK 657

Query: 251 -----------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
                            + + H ++ ++ +  + L +    +ED++ A  +C+ +I + +
Sbjct: 658 KISTTVITDASRALGALRSILHAYNKHIAEAEDILAQAFCLVEDFQSARDHCRASIKILE 717

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
            +Y   H ++G +      ++  +GD  +A+ S+     I     G+++ F+   +  L 
Sbjct: 718 MLYGPDHIVIGYELIKLSTIQLSMGDL-SAVDSINRLGAIFERYFGSHADFIFPYLQTLR 776

Query: 354 EAQA 357
           E  A
Sbjct: 777 EKLA 780


>gi|17509133|ref|NP_492772.1| Protein SET-18, isoform a [Caenorhabditis elegans]
 gi|3924878|emb|CAB03382.1| Protein SET-18, isoform a [Caenorhabditis elegans]
          Length = 501

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/325 (22%), Positives = 132/325 (40%), Gaps = 26/325 (8%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H  DI  DE+++  + +I   +        I   E  + F K   N+H+I  +   
Sbjct: 135 SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGN 194

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S  NHSC P   +VF+G    +R +  VP        E  ISYI+   S 
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSK 252

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R++ L  ++ F C C RC+      D ++ A L   RC +  C   +L    ++    
Sbjct: 253 YIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNI 305

Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
            C++C  +  ++ +K     +  L             +E+++  + I        HP +V
Sbjct: 306 ACEKCKTIVEEDTVKAAQEYMKTLPASFDPKCPAEILKELLAKAEQI-------LHPSNV 358

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K      +
Sbjct: 359 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 417

Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
            + A+    +A+ I  +  G + P+
Sbjct: 418 RDEAMPYAFDAMNIFEVCFGLDHPY 442


>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 552

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 131/320 (40%), Gaps = 97/320 (30%)

Query: 7   RRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISIN 64
           R+ LQ     PST   +  L  A+V ++      QL  Y     A+LV+LI         
Sbjct: 160 RKDLQ-----PSTYESHTHLAHAVVKYLGLDAPGQLGAYGITTAADLVDLI--------- 205

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGR 114
                 S+   N+ T+    L PLG  + P++++INHSC PNAV+V+          E +
Sbjct: 206 ------SRFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRSTSDPSQQEPQ 259

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 174
           + VV A++++    E+  +YI+T      RQ AL   Y FTC C  C +    D      
Sbjct: 260 MQVV-AIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDGMD------ 312

Query: 175 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
           + E   C  + C G           TC              + +E N LS+       CG
Sbjct: 313 LREAIWCPKN-CGG-----------TC-------------PLPTEENTLSR-------CG 340

Query: 235 NHQEVV-STYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-- 288
             + +V ST  +++ L   Q+ L    S+ L +T + L                 QLT  
Sbjct: 341 KCKAIVSSTDSVLDALRVGQEALQKATSLQLTETAKAL-----------------QLTTN 383

Query: 289 -IPVY--QRVYPQFHPLLGL 305
            IP+    ++ P  HPLL L
Sbjct: 384 IIPILTSSKLTPSSHPLLAL 403


>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 704

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 79/382 (20%), Positives = 158/382 (41%), Gaps = 76/382 (19%)

Query: 23  NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS----------- 71
           NY  +  L AH  +   K+L+ YA  + L+     + E+ + +  E FS           
Sbjct: 335 NYECIVGLEAHWFNHSHKELVQYAVTSILLAKCF-YRELVLPKTCETFSEEELITEIASL 393

Query: 72  ------KLACNAHTICNSELRP----------------------LGTGLYPVISIINHSC 103
                 +L  N+H I  +E+R                       + T +YP +S++NH+C
Sbjct: 394 LLLHTRQLKSNSHAI--TEVRSSEGENTAGESVGGSVQQISQGRIATAVYPTVSLMNHAC 451

Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRC 161
            PN +  F   +  VRA++ + +G E+   Y    G  +T  RQ+AL  QY FTC C  C
Sbjct: 452 QPNVIASFRKGIISVRAIEKIMRGDEIQHCYGPQVGHMTTSDRQQALLNQYCFTCRCRAC 511

Query: 162 IKLGQFDDIQESAILEGYRCKD---------DGCS-----GFLLRDSDDKGFTCQQCGLV 207
            +  +  D +E   ++  +C             C+     G L+ +  + G T     L+
Sbjct: 512 TRKPRTFDKEEDLCIKCPQCGQPLNIQTSMCGKCAERIDVGVLIHELTNAGTT-----LI 566

Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
             +E      ++  ++             +EV+S  K    + +++  P  + L    + 
Sbjct: 567 GLEEMFSAAVNDDTLM-------------REVISKTKSCIDVLERIIIPPDMQLATAYDD 613

Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 327
           + K  +EL+++KEA  +     P  +  + +    +  + Y   ++ +   +   A++++
Sbjct: 614 MAKCHVELDEFKEAALWLAKATPSIESRFGRDSIEVAHELYKLAQIYFNGKEIAPAMETI 673

Query: 328 TEAVEILRITHGTNSPFMKELI 349
             A+E+    +G ++  + EL+
Sbjct: 674 DRALELFIRHYGNSNEEVLELV 695


>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
 gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe]
          Length = 483

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 52/92 (56%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           F +LA NA  +  S    LG  L  ++  +NHSC PN  ++F+G +  + + + + K  +
Sbjct: 163 FCRLAVNAMNLVTSSFDSLGMCLDTILCRLNHSCDPNCQIIFDGAIVQLVSKRDIKKDEQ 222

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + ISYI+       RQK L ++Y F+C CPRC
Sbjct: 223 LFISYIDIRLPKSIRQKQLLKKYFFSCYCPRC 254


>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
 gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
          Length = 650

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 109/267 (40%), Gaps = 27/267 (10%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL---------QWPEISINEI 66
           I     D+Y  V  LV H S    +       +A L+N  L         Q        +
Sbjct: 358 IDKLPVDDYRKVYKLVTHESTRSPEDFFQRTLMATLLNACLTLGGYGACPQEQNFIGGLL 417

Query: 67  AENFSKLACNAHTICN---------SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
             N   L  NAH +            +   +G GLYP +++ NHSC P     + G    
Sbjct: 418 VHNLQLLQFNAHEVSEMIRETAEDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQVC 477

Query: 118 VRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
           VR V+++P  + V  +Y  + T      R+  L  QY FTC C  C++        +  +
Sbjct: 478 VRTVKNIPADSMVAENYGPLFTQVRRDERRDTLLHQYRFTCQCVPCVENWPLFTEMDPGV 537

Query: 176 LEGYRCKDDG--CSGFLLRDS--DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
           +  +RC D G  CS  LL  +  +D    C +CG   +  +  K   + ++L K    L 
Sbjct: 538 IR-FRC-DSGKICSNVLLIPAAVNDFMVKCTECGEHTNIMKGLKSLQDTDMLFKTATRLH 595

Query: 232 SCGNHQEVVSTY-KMIEKLQKKLYHPF 257
           S G ++  +  Y +M+E + + L  P+
Sbjct: 596 SAGEYEAALRKYIEMMETMSEVLVPPY 622


>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 377

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 64/122 (52%), Gaps = 1/122 (0%)

Query: 45  YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
           +  + + VN  L     +   + +  ++L  NA T+ N     +G  + P  S  NHSC 
Sbjct: 62  FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 121

Query: 105 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           PNA + F+G +  ++A+Q +    ++ ISYI+       RQ  L+++Y FTC CP+C + 
Sbjct: 122 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELQKRYFFTCKCPKCAQ- 180

Query: 165 GQ 166
           GQ
Sbjct: 181 GQ 182


>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
 gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
          Length = 499

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            LYP+  ++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
 gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
          Length = 649

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 112/272 (41%), Gaps = 35/272 (12%)

Query: 15  VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---- 70
           V      D+Y  V  LV H      +       +A L+N  L       ++ +E+F    
Sbjct: 356 VTDKLNNDDYRKVYKLVTHEDKRSPEDYFQRTLMATLLNACLSLGGYYKSKESESFIGGL 415

Query: 71  -----SKLACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGR 114
                  L  NAH I  SEL+            +G GLYP +++ NHSC P     + G 
Sbjct: 416 LLHNLQFLQYNAHEI--SELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRYYRGN 473

Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQ 171
              VR V+ +P G+ V  +Y  + T  S   R+  L  QY FTC C  C     +F D+ 
Sbjct: 474 SVCVRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACAGDWPKFADMD 533

Query: 172 ESAILEGYRCKDDG---CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKK 226
            + +    R K DG   CS  L   ++   F   C +CG   +  +  K   + ++L K 
Sbjct: 534 SNVL----RFKCDGGKNCSNVLEIPAEINEFMVQCTECGEHTNIMKGLKSLQDTDMLFKS 589

Query: 227 TLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
              L   G ++  +  Y +M+  L + L  P+
Sbjct: 590 ATKLHEAGEYEFALKKYVEMMNTLDEVLVPPY 621


>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
 gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
          Length = 405

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 45/73 (61%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 148
           GL+P  S +NHSCLPNAV    G + VV A + + +G+EVLI+Y+  A       RQ  L
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVVGGIMVVVAARKIRQGSEVLINYLGRASLRPVGERQGQL 181

Query: 149 KEQYLFTCTCPRC 161
            E Y F+C CPRC
Sbjct: 182 AEGYHFSCDCPRC 194


>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
          Length = 333

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 40  VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 95

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 96  MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 142


>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
          Length = 576

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 58  WP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
           WP  + S+    E   K+  N+    + ++   GT L   +++ NHSC+PNAV+ F GR 
Sbjct: 206 WPATDASLAIAVEAMCKIHTNSFDRSDVDVAYTGTFLDAHLAMANHSCVPNAVVSFAGRK 265

Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A +RA Q +  G EV ISYI+       RQ+ L + Y FTC C RC
Sbjct: 266 AFLRAEQAIRAGDEVTISYIDYTKPKSVRQRGL-DLYHFTCDCARC 310


>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
           magnipapillata]
          Length = 1044

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)

Query: 73  LACNAHTIC----------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
           L CNAH +           +SEL+ +G+ +Y  +S++NHSC P+ V    G   V+RA++
Sbjct: 385 LPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTCVLRAIK 444

Query: 123 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
           H+ +G+E++ +Y  +    S + RQ  L EQY F C C  C
Sbjct: 445 HIKEGSEIVDNYGFLYAVESKVIRQSHLMEQYYFACQCEAC 485


>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 426

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 71/137 (51%), Gaps = 7/137 (5%)

Query: 30  LVAHMSDI-----DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 84
           L++H  +I      +K+ L Y  I  ++ ++    E + +E+   F+KL  NA  + N +
Sbjct: 116 LMSHKENILMNPEHKKRFLTYLDI--IIYMLGGSLETNESEVLNIFTKLMINASFMLNEK 173

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           L   G  L    S I+HSC PNA+ +F G   VV+A+  +    +V ++Y++ +     R
Sbjct: 174 LIDFGGCLCLEFSAIDHSCRPNAIYMFNGHTLVVKALCEIANFDDVRVAYVDMSQPRSIR 233

Query: 145 QKALKEQYLFTCTCPRC 161
           Q+ LK Q+ F C C  C
Sbjct: 234 QELLKNQFFFDCNCEEC 250


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 84/165 (50%), Gaps = 21/165 (12%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIAN-LVN-LILQWPE-------ISINEIAENFSKLACNA 77
           ++++ +H S +  +    +A +A+ LV  L L  PE        S  ++ +  S+   NA
Sbjct: 163 IDSMQSHRSSLRPESFESHAYLAHALVRYLGLDAPEKLASFGLRSAGDVVDLISRFTTNA 222

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVPKGA 128
            T+    L PLG  + P ++++NHSC PNAV+VF         E  + V+ A++ +    
Sbjct: 223 ITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQEPVMQVI-ALRDIHPDE 281

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
           EV  +YI+T      RQK L++ Y FTC+C  C    +  D +ES
Sbjct: 282 EVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCA--AEDPDPRES 324


>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
 gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
          Length = 634

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 78/159 (49%), Gaps = 18/159 (11%)

Query: 69  NFSKLACNAH----------TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           +   L CNAH          ++  SE   +G G+Y  +S+ NHSC PN    F G   VV
Sbjct: 408 HLQSLPCNAHEISELQLSLKSVATSEAAEIGAGIYGTLSLFNHSCEPNVTRFFYGDKCVV 467

Query: 119 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDI--QES 173
           RA   +P   EV+ +Y  +        RQ++L+ QY F C C  C++    + ++  Q+ 
Sbjct: 468 RAFSSIPCRGEVVDNYGILSALTPRKQRQESLQSQYYFKCNCHACLEDSPLYSELIKQDV 527

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 212
             L+   C+    +G +L  +D K   C++CG+ +S E+
Sbjct: 528 PQLKCANCR-MALAGEIL--TDGKLVKCEKCGVPQSLED 563


>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
 gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
          Length = 497

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 17/188 (9%)

Query: 91  GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           GLYP + ++ HSC+PN V      +G   + R    +P+G ++   Y  T   T TRQ A
Sbjct: 200 GLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAA 259

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQ 203
           LK    FTC CPRC+     D  +        +C+   C   L+  S    D+  + C  
Sbjct: 260 LKSSKYFTCRCPRCL-----DPTELGTHFSSLKCQK--CDNGLIVSSNPLDDEAEWRCSN 312

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           C        ++K    + ++  +   L+   N  E ++ ++ + K  + + HP       
Sbjct: 313 CEFKTRGAAVQKA---IQVMQAEIDQLSCLENGPEKLTAFEKLYKKYRSILHPLHFIKTS 369

Query: 264 TREKLIKI 271
            R  LI++
Sbjct: 370 IRHSLIEL 377


>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 459

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           + SI    E   K+  NA    + +    G  L PV+++ NHSC+P+A + F+ R AV+R
Sbjct: 89  DFSIPRAREILCKIQTNAFNRLDPDAGMTGIYLDPVLAMANHSCMPSAFVSFDQRNAVLR 148

Query: 120 AVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRC 161
           A + + +G E+ I Y++ T  +   RQ+ALK  Y F C CPRC
Sbjct: 149 AWRDIKEGDEITICYVDVTLPNKAARQEALK-LYHFECRCPRC 190


>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
          Length = 435

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 2/146 (1%)

Query: 20  TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNA 77
           T     L   L  H+S+I ++    + +IA     +  +    + E  I+   +KL  N+
Sbjct: 69  TNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNS 128

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
               N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++ ISYI+ 
Sbjct: 129 FNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDA 188

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIK 163
                 R+  L+E+Y F C C +C K
Sbjct: 189 TNPVKLRRSELRERYYFDCHCAKCAK 214


>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
 gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
          Length = 750

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 118/272 (43%), Gaps = 11/272 (4%)

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           E R + + +YP  S++NH+C P+ ++ F   + V RA  ++  G+ +   Y         
Sbjct: 468 EQRRIASAIYPTASLLNHACDPDVLVSFVDGVLVARATHNIAPGSGITHCYGPHVNHMPR 527

Query: 144 --RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 199
             RQK L +QY FTC C  C    + ++ +       + C    C    ++ S  +    
Sbjct: 528 EERQKLLYKQYFFTCQCSACTSDEEMENTR--LCFSAFACPRCKCP---MKTSPLEPSLA 582

Query: 200 TCQ--QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
            CQ  +C L +S EE    + +  +L  K +        QE +  ++   + + ++ HP 
Sbjct: 583 RCQNKKCTLEKSIEEELSHSRQAELLFFKAVRTMERIGVQEALGLFQECLRTRTQILHPH 642

Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
             +L +T + L +    + D+K A  +C  +    ++ +         + +   +L +  
Sbjct: 643 HKDLAETHDALARCYAMIGDFKLASQHCLQSSEAVEKAFGSTSVEYAHELHKLSQLLFND 702

Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
              + A+  + +A  +L   +G N P ++EL+
Sbjct: 703 RQAKKALPMIDKAASLLATYYGRNHPDVQELV 734


>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 415

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 2/146 (1%)

Query: 20  TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNA 77
           T     L   L  H+S+I ++    + +IA     +  +    + E  I+   +KL  N+
Sbjct: 49  TNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNS 108

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
               N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++ ISYI+ 
Sbjct: 109 FNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDA 168

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIK 163
                 R+  L+E+Y F C C +C K
Sbjct: 169 TNPVKLRRSELRERYYFDCHCAKCAK 194


>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
          Length = 492

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 3/109 (2%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           + ++  N+ T+  S + PLG    P+    NHSC PNA ++ +G     RA++ + K  E
Sbjct: 181 YGRVFTNSLTLIGSAMEPLGICFDPLACSANHSCDPNAFVIMDGAQLSFRALKPIAKDEE 240

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
           +LISYI+       RQ  L  +Y FTC+C +C    Q   ++E   L G
Sbjct: 241 ILISYIDGTLPFERRQTDLTRRYYFTCSCTKC---AQGPTLREDRWLPG 286


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 120/285 (42%), Gaps = 35/285 (12%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           G  +Y  +S +NHSC+PNA L F  G +A +  +  +  G ++ ISY++   S  TRQ+ 
Sbjct: 183 GVAIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQ 242

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
           LKEQY F CTC  C      +  Q+        C     S   L  S             
Sbjct: 243 LKEQYYFDCTCRLCTANANGESRQQ--------CISQFDSAVDLYQS------------- 281

Query: 208 RSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQT 264
            +K++    ++  +IL++  ++L L  C +H  ++     +IE L  +  +   + L   
Sbjct: 282 -TKKQTNDASASQSILTRALESLELIVCKSHSVLIRLRTDLIEILVGQRNYAKVIELSND 340

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-WFLGDTENA 323
           +    + +     WK + +  +  I      YP + P+  +      K   W + D  + 
Sbjct: 341 QLDAFRCV-----WKASQSI-EKGISESDNAYPVW-PMESVSSVDIFKASMWTMDDDLDR 393

Query: 324 IKSM-TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
           +  +  +A   LRITHG   P   +L   LE+   E   + S  D
Sbjct: 394 VAELGKQAAHNLRITHGEKHPMYLDLTEGLEDVLRELEARHSGVD 438


>gi|341883792|gb|EGT39727.1| CBN-SET-18 protein [Caenorhabditis brenneri]
          Length = 499

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 69/325 (21%), Positives = 134/325 (41%), Gaps = 28/325 (8%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H +DI  DE+++  + ++   + L  +   I   E  + F K + N+H+I  +   
Sbjct: 135 SLVDHKTDIKEDEEKMKHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGN 194

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S  NHSC P   +VF+G    +R +  VP        +  ISYI+   S 
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSK 252

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R++ LK  + F C C RC+         E   L   RC +  C   +L    ++    
Sbjct: 253 FVRRRDLK--WYFNCECTRCMD-------PEDDALTAIRCANPSCDAPILTSETEEPMNI 303

Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
            C +C  +  + ++K     +        +L +  + +      + +    +++ HP +V
Sbjct: 304 ACDKCKTIVEENDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNV 356

Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
            + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K      +
Sbjct: 357 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 415

Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
            + A+    +A+ I  +  G + P+
Sbjct: 416 RDEAMPYAFDAMNIFEVCFGLDHPY 440


>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
          Length = 802

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
           + L+ QY F C CP C +  Q          E +RC
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGR--EAFRC 619


>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 647

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 90/210 (42%), Gaps = 30/210 (14%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---------IAENFSK 72
           D+Y  +   V H    D ++ L +A +A L+N +L       +          +  N   
Sbjct: 362 DDYRRIYNFVTHSEGRDTEESLKWAFMAALLNTVLSLGYFYRSRYLDSYIGTLLLHNLQV 421

Query: 73  LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           +  N+H I  SEL+            +G GLYP + + NHSC P     F G    VR V
Sbjct: 422 VTFNSHEI--SELQRKSPRDTGISVCIGAGLYPTLVLFNHSCDPGITRYFVGSAVFVRTV 479

Query: 122 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEG 178
           +++P G+ V  +Y  +        R+K+LK+ Y F C C  C +    F D+  S +   
Sbjct: 480 RNIPAGSVVAENYGQLFVRAPRHERRKSLKKLYKFDCYCQACYEDWPTFFDMNPSVV--R 537

Query: 179 YRC-KDDGCSGFLLRDS--DDKGFTCQQCG 205
           +RC   +GC   L+     D     C++C 
Sbjct: 538 FRCAATEGCENGLIYTERLDQNVMKCEKCN 567


>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
           [Megachile rotundata]
          Length = 523

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 95/210 (45%), Gaps = 17/210 (8%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW---PEISINEIAENFSKLACNAH 78
           ++Y+ +  L +H   I  K+  ++ + +NLVN I ++    +I   +I +    L  N H
Sbjct: 162 ESYNKLLTLESHSDKIKNKETFIFEESSNLVNFIKRFFKSEDIPEEDIRKIIGVLQVNGH 221

Query: 79  TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIE 136
            +  ++  P    +Y + S++ H+C  N    F   G L ++ A   + KG  + I Y +
Sbjct: 222 EVPLTD--PPYVAVYELASLLEHNCKANCSKSFTDTGGL-IIHAATPITKGDHISICYTD 278

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS-- 194
               TM R+  L E   F CTC RC    +F      A+    RC +  CSG++L  +  
Sbjct: 279 PLWGTMNRRHHLLETKYFECTCDRCRDPTEF-----GAMFNAIRCSEIDCSGYVLPKTFL 333

Query: 195 --DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
               + + C  C  V   E I++   ++ I
Sbjct: 334 GDHREDYVCTNCTTVVPLEIIEETLEDIGI 363


>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
 gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 386

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 82/191 (42%), Gaps = 19/191 (9%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLILQWPEIS----------- 62
           + +T  D + LVE  V   S+   K+ +  +   I+ L NL+L     +           
Sbjct: 157 LTNTLDDVFDLVENQVTEESNPAAKERIDSIVEFISELFNLVLLGSTTTKSIINNDDKIE 216

Query: 63  ----INEIAENF-SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
               INE + +   K  CN   I     + +G  + P  S  NHSC+PN   V +G    
Sbjct: 217 MIRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVAVSPSSSYFNHSCIPNCTDVRDGSNMT 276

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
            +++  + KG ++ ISYIE       R+  LK  Y F C CPRC       D  ++ I +
Sbjct: 277 FKSLYPIKKGDQLTISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSIDSMDNWISK 336

Query: 178 GYRCKDDGCSG 188
            Y C    C+G
Sbjct: 337 FY-CSQKKCTG 346


>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
 gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
          Length = 393

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 123
           I E ++K+   A    N+E    G+GLY + S INHSC+PNA     +   + V++AV  
Sbjct: 267 IDELYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKAVAP 322

Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
           + KG E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IEKGDEICISYLDECQLERSRHSRHKTLRENYIFVCQCPKC 363


>gi|222613106|gb|EEE51238.1| hypothetical protein OsJ_32096 [Oryza sativa Japonica Group]
          Length = 718

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 150/355 (42%), Gaps = 66/355 (18%)

Query: 42  LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           +LL  QI  N + ++       +  + + FS  + +   +C+ E   +   +Y   S  N
Sbjct: 352 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 409

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 157
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 410 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 469

Query: 158 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 181
                            CP+   LG           ++    +I E + C          
Sbjct: 470 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 529

Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
            +D    G L   SD     D GF C  C   RS+ ++        +  S++N L +   
Sbjct: 530 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 582

Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
            L S  N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+
Sbjct: 583 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 640

Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
            +I + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 641 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 695


>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
          Length = 802

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
           + L+ QY F C CP C +  Q             +RC+     DD   CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636


>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
          Length = 802

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
           + L+ QY F C CP C +  Q             +RC+     DD   CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636


>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
          Length = 802

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
           + L+ QY F C CP C +  Q             +RC+     DD   CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636


>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
           morsitans]
          Length = 395

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%)

Query: 56  LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 113
           L  PE   N++ E    +           L   G+GLY + S INHSCLPNA + F    
Sbjct: 253 LPLPETDKNKLDEYIDDIYHKVGEFAGEFLNNEGSGLYLLQSKINHSCLPNAQVTFPYSN 312

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
            + V++A+Q +  G E+ ISY++      S  +RQK LKE Y+F C C +C
Sbjct: 313 DIVVLKALQPIQVGEEICISYLDEGQLERSRHSRQKILKENYIFVCECFKC 363


>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
 gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
          Length = 562

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E ++ +  E   K+  NA    +++    G  L   ++++NHSC+PNA + F+ R AV+R
Sbjct: 189 EETLEKAREVLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGFDRRTAVLR 248

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A + + +G E+ ISYI+ A     RQ+AL+  Y F C CPRC
Sbjct: 249 AERPIQEGEEITISYIDNALPKSARQEALR-LYHFRCDCPRC 289


>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
 gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
          Length = 546

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 123/291 (42%), Gaps = 34/291 (11%)

Query: 57  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
           +W E   +EI +    L  N H +  +E  P    +Y   S++ HSC PN    F  +  
Sbjct: 224 KWSE---DEILKIVGILQVNGHEVPLTE--PPSVAIYNNASMLEHSCRPNLSKSFTSKKE 278

Query: 117 VV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
           +V  A   + +G  + ISY +    T  RQ  L++  LF CTC RC+   +F     +  
Sbjct: 279 IVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGTYLSALK 338

Query: 176 LEGYRCKDDGCSGFLLRDSDDK---GFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALT 231
             G++ KD  CSG LL ++      G+ C +C GLV  KE         NIL +  +   
Sbjct: 339 CSGFK-KDSNCSGLLLPENLKNWYGGYICNKCRGLVDGKE-------ITNILDRARVDHE 390

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL----------MELEDWKEA 281
           +     E    +K I    + L  P    L+  +  L +I+          +  ED    
Sbjct: 391 AMQKDNE-QHCHKYIAHYGRWL-GPNHHLLVDVKISLSQIMGGGDPNAIQKISDEDLMTK 448

Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD-TENAIKSMTEAV 331
           +  CQ  I V+QRV P    ++G   +   +L   L +     ++SM  AV
Sbjct: 449 MKICQELIDVFQRVCPAEARVIGTTRF---ELHAALAEFARRGVESMNSAV 496


>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 669

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 15/123 (12%)

Query: 73  LACNAHTICNSELR----------PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
           + CNAH I   EL            +G+GLYP +S++NHSC P       G   VVRA++
Sbjct: 449 IPCNAHEISEYELWRSDITKCHFVEVGSGLYPTMSLVNHSCDPVVTRNCYGETCVVRAIR 508

Query: 123 HVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGY 179
           ++ KG E+     Y+        RQ  LK QY F C C  C+     + DI++  +L  Y
Sbjct: 509 NIYKGEEITDNYGYLYPVHDKSERQTRLKWQYFFECKCDACVHDWPLYPDIED--LLPKY 566

Query: 180 RCK 182
           +C+
Sbjct: 567 KCE 569


>gi|71019315|ref|XP_759888.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
 gi|46099457|gb|EAK84690.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
          Length = 739

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  E+ +   + +CN+ T+ +S+L  LG  ++  ++++NH+C+PNA +VF          
Sbjct: 423 SATELLDLVCQFSCNSFTLADSDLNALGVCMHASMAMLNHACIPNAAVVFPFGGAAKGGQ 482

Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
                 + ++  + A++ +  G E+LISY++   +   RQK LK++Y F C C  C K
Sbjct: 483 QRWNDGDDKIMQLVALRAIEPGEELLISYVDLCDTVEERQKQLKQRYCFDCRCDLCCK 540


>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
 gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 793

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)

Query: 42  LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           +LL  QI  N + ++       +  + + FS  +     +C+ E   +   +Y   S  N
Sbjct: 427 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 484

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 157
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 485 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 544

Query: 158 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 181
                            CP+   LG           ++    +I E + C          
Sbjct: 545 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 604

Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
            +D    G L   SD     D GF C  C   RS+ ++        +  S++N L +   
Sbjct: 605 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 657

Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
            L S  N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+
Sbjct: 658 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 715

Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
            +I + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 716 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 770


>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
 gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%)

Query: 65  EIAENFSKLACNAHTIC---NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           E+ +    + CNAHT+     S L P+G  +Y   S  NHSC+P+A     G    VR++
Sbjct: 162 ELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVPSAEFCNVGTSLTVRSL 221

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           + V  G E+ +SY+ T  +   R++ L+ QY F C C RC+
Sbjct: 222 RRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARCV 262


>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
          Length = 561

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 24/154 (15%)

Query: 21  TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL-----ILQWPEISINEIAENFSKLAC 75
           +DN  L+E L    S+        Y +I  L+N       +   E  IN  A+N  K+  
Sbjct: 419 SDNKILIEELEKLFSN------KFYDEILGLINYNSISTFVYGEETVINTGAKN-KKMQP 471

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV------QHVPKGAE 129
                C      LG+ L P+ + +NHSC PN  +  E R  V +A+      Q + KG E
Sbjct: 472 KLRQYC------LGSALIPIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQE 525

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +L SYI+ +     RQ  L  QY FTC C +C K
Sbjct: 526 LLTSYIDESNPFKDRQSILSSQYGFTCNCNKCSK 559


>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
          Length = 687

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)

Query: 42  LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
           +LL  QI  N + ++       +  + + FS  +     +C+ E   +   +Y   S  N
Sbjct: 321 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 378

Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTC-- 156
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 379 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 438

Query: 157 ----------------TCPRCIKLGQF---------DDIQESAILEGYRC---------- 181
                            CP+   LG           ++    +I E + C          
Sbjct: 439 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 498

Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
            +D    G L   SD     D GF C  C   RS+ ++        +  S++N L +   
Sbjct: 499 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 551

Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
            L S  N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+
Sbjct: 552 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 609

Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
            +I + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 610 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 664


>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 644

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 127/294 (43%), Gaps = 13/294 (4%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           +++ + E  +K   +   +  ++   +   +Y   S++NHSC PN +  ++G    +RA 
Sbjct: 346 NVHAVTEVATKTDSSTSFVATTQQIRIAVAVYGTASMLNHSCTPNVIAGYDGNQLTIRAT 405

Query: 122 QHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
           + + KG EVL  Y           R K L++QY FTC C  C  + Q + I  S  L   
Sbjct: 406 EMIKKGGEVLHCYGPRVSDMFRDERLKVLRDQYYFTCKCMFC-GIPQ-EAITPSGALRCP 463

Query: 180 RCKDDGCSGFLLRDSDDKG----FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
           RC D   SG + +  +  G    F C     +   +E  KI +   +       +     
Sbjct: 464 RCSD--MSG-VYQTPEVTGQCCNFDCHHSWKLGETDE--KIENARQLFRVGKTLIGDKNK 518

Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
            +E +ST     K+Q+ + +  +  L +T + L        ++  A  Y +L+I     +
Sbjct: 519 VKECLSTLHKAYKIQQSVLYKHNKELAETEDCLALCYASFGNYHYAAKYLKLSIKSNIIL 578

Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
           Y +    +G + Y   ++ +   +    +  + +AV+I+   +G N   +KEL+
Sbjct: 579 YGEDSIEIGHELYKLAQILFNCKNVTEGLGVIDKAVKIISDHYGENHSDVKELL 632


>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 572

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 73  LACNAHTICN---SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           L CNAH I     S L  +G G Y  +S+INHSC PN V        ++RA++ + KG E
Sbjct: 384 LPCNAHEISEYYCSNLLEIGAGAYATLSLINHSCDPNVVRHSCRNTVILRAIKPIKKGEE 443

Query: 130 VL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           +     Y         RQKAL +QY FTC C  CI
Sbjct: 444 LFDNYGYHYATHELQERQKALLKQYYFTCQCKACI 478


>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
          Length = 348

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/320 (23%), Positives = 136/320 (42%), Gaps = 37/320 (11%)

Query: 61  ISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           +S +E+ E   +   N H I +   +  +G GLY  +   +HSC PN +      +A +R
Sbjct: 37  VSKDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNTIYTCNSFVATLR 96

Query: 120 AVQHVPKGAEVL------ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
            +     G ++        SYI+   +T  R+K LK+ + F C C RC      DD+  S
Sbjct: 97  GLT---AGVDLRNLNSTHYSYIDLINTTQQRRKLLKDTWYFECHCTRC---DDPDDVLLS 150

Query: 174 AILEGYRCKDDGC----SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK--KT 227
           +IL     K   C         +D + +  TC +C  + S E + +  + +  + K  + 
Sbjct: 151 SILCPNCPKKRECLCIFGNVPYKDRNTQIITCPKCHNIVSPEYVVEAIAAMRFIDKIVEN 210

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFS-VNLMQTREKLIKILMELEDWKEALAYCQ 286
             +      Q +    + +  L+K+     S VN+     K+I++L+ L D    ++ CQ
Sbjct: 211 REVEKMSREQSI----RFLTDLKKRFSELLSKVNVFLC--KVIQLLIPLID----ISDCQ 260

Query: 287 LTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
           L + ++       +  +P  HP +G+ Y +             A+    +A E+++ T G
Sbjct: 261 LLLDLHLETEECVRFCFPYNHPAVGIHYRSIATFFLKCKQPHRALLYYKKAYEVVKFTLG 320

Query: 340 TNSPFMKELILKLEEAQAEA 359
           +  P   E    L++A  E 
Sbjct: 321 SKHPMTIETDAMLKDASREV 340


>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
 gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
          Length = 849

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 62/233 (26%), Positives = 103/233 (44%), Gaps = 19/233 (8%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
            AH IC  E   +   L+   S  NHSC PN  + +  R       + +P    + ISY 
Sbjct: 391 GAH-ICGIEHVVVAQALFVRGSKFNHSCSPNVHVSYVKRTLRAHCTEALPAFCPLEISYG 449

Query: 136 ETAGSTMTRQKA--LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR- 192
              G +   Q+   L++ YLF C+C  C +LG   D+   A    Y C   GC G +L  
Sbjct: 450 VQVGESEKSQRKIWLRDHYLFECSCEPC-RLGLVSDLLHIA----YYCCQPGCQGIVLDP 504

Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA-LTSCG---NHQEVVSTYKMIEK 248
           D+ D+        L      ++ +    NIL +K L  L  CG   N  E ++    + +
Sbjct: 505 DASDEDIL-----LPGGSGTVRFLIEHCNILPQKELEDLQICGSRLNGNEAIADGLRLLE 559

Query: 249 LQKKLYHPFSVNLMQTREKLIKILMEL-EDWKEALAYCQLTIPVYQRVYPQFH 300
             +K +HPF+  L Q  + + ++  E  +    A+ + + +I + +R+Y + H
Sbjct: 560 NGRKCFHPFNKQLAQMEDIVARLYCEAGQQPAAAIVHARRSIEILERIYGKDH 612


>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
 gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 380

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 68/128 (53%), Gaps = 7/128 (5%)

Query: 24  YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNA 77
           YS+ E   +H++D+ E++      +   + + L      ++ +          +++ CN 
Sbjct: 124 YSIAEH-QSHLADMSEEKTEGLKHLCTTLQVYLAEENCDLSRLPSGLDPVSLLARVTCNC 182

Query: 78  HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
            +I + EL+ +G GLYP +S++NH C PN +++FEG+   +RAV+ +    E+ ISY + 
Sbjct: 183 FSISDGELQDVGVGLYPSMSLLNHDCQPNCIMMFEGKRLTLRAVRVIRSAEELTISYTDI 242

Query: 138 AGSTMTRQ 145
              +  R+
Sbjct: 243 LAPSKDRR 250



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 44/86 (51%)

Query: 273 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 332
           + L+D++ AL Y    +  Y+  +   HP   ++    GKL+ FLG  E A  S T+A  
Sbjct: 291 ISLDDYETALEYGNRALGPYKLYFSDPHPSRAVELLRVGKLQHFLGRLEEARGSFTQAYS 350

Query: 333 ILRITHGTNSPFMKELILKLEEAQAE 358
           I+ +THG       ++  +L E QAE
Sbjct: 351 IMSVTHGAAHALTNQVQRRLSECQAE 376


>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
          Length = 256

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 10/106 (9%)

Query: 62  SINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVV 118
           S++E I E ++K+     T  N+E    G+ LY + S INHSC PNA +VF     +  +
Sbjct: 125 SVDELIDELYNKMDEVVGTFLNNE----GSALYAMQSKINHSCTPNAEIVFPKSNHVLAL 180

Query: 119 RAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
           RA++ +  G E+ ISY++      S  +RQK L+E YLF C C RC
Sbjct: 181 RALRDLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERC 226


>gi|403414107|emb|CCM00807.1| predicted protein [Fibroporia radiculosa]
          Length = 304

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 9/104 (8%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 112
           S  ++ +  S+   NA T+ +S L  +G  + P ++++NHSC PN V+VF         E
Sbjct: 10  SPGDLVDLISRFTTNAFTLTSSSLSAIGVAISPTVALVNHSCDPNVVIVFPRNPSTSHAE 69

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 156
             +  V A++ +  G E+L +Y++       RQK LKE Y FTC
Sbjct: 70  EPMMTVVAIKPILPGEEILSAYVDVTQPRELRQKELKETYNFTC 113


>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
           1015]
          Length = 386

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
           L  H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    
Sbjct: 136 LGTHIKDIRDQSASQWERISLSSKAIRAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 195

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 255

Query: 148 LKEQYLFTCTCPRCIK 163
           L+ +Y FTC C +C +
Sbjct: 256 LQSRYYFTCHCSKCTR 271


>gi|302838330|ref|XP_002950723.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
 gi|300263840|gb|EFJ48038.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
          Length = 610

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKAL 148
           GL+P  + +NH+C PN V    G   VVRAVQ +P+G E+L+SY+  +       RQ AL
Sbjct: 246 GLWPYFNQLNHACAPNCVHYVVGSTMVVRAVQVIPEGCELLVSYLGRDDLAPRQVRQAAL 305

Query: 149 KEQYLFTCTCPRC 161
           K +Y F C C RC
Sbjct: 306 KARYGFICNCTRC 318


>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 676

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)

Query: 54  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           +++Q+    I+E+A     +   ++ +  ++ + +G G+Y  +S+ NHSC P  +  F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
              VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +  
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564

Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
           +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + +   
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623

Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
                G HQE + +Y  I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644


>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
           TFB-10046 SS5]
          Length = 531

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 69/153 (45%), Gaps = 13/153 (8%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 117
           S  ++ +  S+   N+ T+ +  L P+G  + P +++ NHSC PNAV+VF    A     
Sbjct: 188 SAGDLVDLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVVVFPSAAAPLGDE 247

Query: 118 ----VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
               + A++ +    EVL SY++ +     R++ LKE Y FTC C  C  L   DD    
Sbjct: 248 TLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSACSSL---DDPSVG 304

Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCG 205
                  C   GC G         G   C +CG
Sbjct: 305 DPRYAVWCP-KGCGGVCRIPELGSGIVICLRCG 336


>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 676

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)

Query: 54  LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
           +++Q+    I+E+A     +   ++ +  ++ + +G G+Y  +S+ NHSC P  +  F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504

Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
              VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +  
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564

Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
           +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + +   
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623

Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
                G HQE + +Y  I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644


>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
 gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
 gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
          Length = 420

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 310 GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAGEEICISYLDCCQRERSRHS 369

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 370 RNKILRENYLFTCSCPKC--LAQADD 393


>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
 gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
          Length = 824

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 1/98 (1%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV-LVFEGRLAVVRAVQH 123
           EI E ++    N HT+ N   +  G G YP+    NHSC+PN    V +  +   RA + 
Sbjct: 509 EIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFNHSCVPNVTWYVDDNGMLAFRATRT 568

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + +G E+ ++Y +   +   R+  L +QY F C CP+C
Sbjct: 569 ISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCPQC 606


>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
          Length = 484

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 69/134 (51%), Gaps = 2/134 (1%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
           L  H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    
Sbjct: 136 LETHIKDIRDESASHWERISLSSKAIKAYSGTEMSEEVISAMGAKLDLNSFNLTNAVYDR 195

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKAIRPIQKDEQIFITYIDVTDPYPIRQHN 255

Query: 148 LKEQYLFTCTCPRC 161
           L+ +Y FTC C +C
Sbjct: 256 LQSRYYFTCHCSKC 269


>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 236

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 60/105 (57%), Gaps = 14/105 (13%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVR 119
           I+ + ENF K         N+E    G+GLY + S INHSC PNA + F     +LAVV 
Sbjct: 122 IDAVYENFEK---GVGDFLNNE----GSGLYELQSTINHSCSPNAEVTFPHSNYQLAVV- 173

Query: 120 AVQHVPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 161
           A  ++  G E+ ISY++    + S  +RQK L+E YLFTC C +C
Sbjct: 174 ATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218


>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
          Length = 383

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 273 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 332

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 333 RNKILRENYLFTCSCPKC--LAQADD 356


>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
 gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
          Length = 393

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 89  GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S INHSC+PNA     +   + V++A+  + KG E+ ISY++      S  +
Sbjct: 286 GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKALAPIQKGDEICISYLDDCQLERSRHS 345

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           R K L+E Y+F C CP+C       DI
Sbjct: 346 RHKTLRENYIFVCQCPKCRAQASDPDI 372


>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
          Length = 415

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
           L  H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    
Sbjct: 104 LGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 163

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  
Sbjct: 164 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 223

Query: 148 LKEQYLFTCTCPRCIK 163
           L+ +Y FTC C +C +
Sbjct: 224 LQSRYYFTCHCSKCSR 239


>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Taeniopygia guttata]
          Length = 404

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 294 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 353

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 354 RNKILRENYLFTCSCPKC--LAQADD 377


>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
          Length = 324

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 50/93 (53%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           +KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++
Sbjct: 11  AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 70

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
            ISYI+       R+  L+E+Y F C C +C K
Sbjct: 71  FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 103


>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
          Length = 481

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
           L  H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    
Sbjct: 136 LGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 195

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 255

Query: 148 LKEQYLFTCTCPRCIK 163
           L+ +Y FTC C +C +
Sbjct: 256 LQSRYYFTCHCSKCSR 271


>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
          Length = 428

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 75/333 (22%), Positives = 138/333 (41%), Gaps = 20/333 (6%)

Query: 30  LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
           LV H S +D      Y +  N    +   PE  +  +   F K++ N  ++ NS    +G
Sbjct: 111 LVDHGSFLDSGAETFYQEYLNYA--LPPHPEPDV--VKSVFKKISINGFSVSNSMGSAIG 166

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ--HVPKGAE-VLISYIETAGSTMTRQK 146
            GL   +S  NHSC P   + +  R A++  +     P   E    SYI+   S   RQ 
Sbjct: 167 HGLCIKLSAANHSCKPLTRVCYRNRTAMLVPIDPYKPPTSLEGACHSYIDELQSVSARQS 226

Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG- 205
            LK++Y F C C  C+      D + +  +E + C D  C+   + + +D    C  C  
Sbjct: 227 ELKKKYNFICICEGCM------DDERNESMEAWACGD--CANGKVPNVEDG--RCASCSW 276

Query: 206 -LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
            + R   E+ + A E  I +K  +   S           K++E     L H F+V+ +  
Sbjct: 277 KMTRDHYELCRTAEESAIAAKPKIENESISLETRKALCEKLMELFDDTL-HEFNVHRVVP 335

Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
           +  L    + ++D   A  + +  + +  R   +  P++ L+ +   ++    G+    +
Sbjct: 336 QRLLFFASLAVQDVNGASKHGKALLAMMMRYQGKHDPIVMLKKWQIAQICHANGEYNKVV 395

Query: 325 KSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
           K + E  E L+  +   +P    +   +E A++
Sbjct: 396 KLLEEIKEPLKKIYTAGAPISVNIRRLMEMARS 428


>gi|343424810|emb|CBQ68348.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 855

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 64/116 (55%), Gaps = 16/116 (13%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  E+ +   + +CN+ T+ +S+L PLG  ++  ++++NH+C PNA +VF          
Sbjct: 415 SATELLDLVCQFSCNSFTLTDSDLNPLGVCMHASMAMLNHACTPNAAVVFPFGGAAKSGQ 474

Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
                 E ++  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C
Sbjct: 475 QKWRDGEDKIMQLVALRAIEPGEELLISYVDIADTFEERRSYLKKRYCFDCRCELC 530


>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
 gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
          Length = 660

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 99/219 (45%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           IA+LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 IASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  I T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGNTIHINSVRPIEAGLPINENYGPIYTQDERSERQARLKELYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC+  + C+  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDII--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALTKFIDLIRIMYEVLAPPF 636


>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 323

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           +A    KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + 
Sbjct: 5   VANVLIKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQ 64

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           KG ++ ISYI+       R+  L+E+Y F C C +C K
Sbjct: 65  KGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAK 102


>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
 gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 413

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 6/122 (4%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           K+ CN   I     + +G  + P  S  NHSC+PN   V +G     +++  + KG ++ 
Sbjct: 248 KVRCNQFGIWTKNDKCIGMAVSPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQIN 307

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-----LGQFDDIQESAILEGYRCKDDGC 186
           ISY+    ST  R+  LK  Y F C CPRC        G+ +D  ++ I + Y C    C
Sbjct: 308 ISYLALDKSTKRRRDYLKFGYYFHCQCPRCNSTDIDPTGKLEDSLDNWISKFY-CHQKKC 366

Query: 187 SG 188
           +G
Sbjct: 367 TG 368


>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 7/112 (6%)

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 139
           N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G  + +SY    G  
Sbjct: 452 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 511

Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               RQK LK++Y F C C  C +L   D      +L  +RC +  C G +L
Sbjct: 512 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVL 558


>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 513

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 81  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAG 139
           C + LR    GLYP+ ++ NH C+PN    F+ +  + V A   +  G E+ +SY +   
Sbjct: 213 CTTSLR----GLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLPIAAGEEITMSYTDLLW 268

Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
            T +R++ LK    F+C C RC    +F   Q  A+L    C  D CSG LL  S+   F
Sbjct: 269 DTSSRRQFLKATKHFSCNCSRCSDPSEFGS-QLGALL----CAKDDCSGHLL-PSNPLNF 322

Query: 200 ----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
                C +C +  + ++I+ I S++N L    + +T      E+V   K IE    KL  
Sbjct: 323 ISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTP----HEIV---KFIETALSKLVP 375

Query: 256 PFSVNLMQTREKLI 269
             +  L+  + ++I
Sbjct: 376 TSNFILIDVKFRII 389


>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
 gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
          Length = 729

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 54/91 (59%), Gaps = 7/91 (7%)

Query: 76  NAHTICNSELRPLGT--GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
            A  + NSE  PL +  G++P +++INHSC PNAV V      VVRA + +P+G EVL++
Sbjct: 601 GAAAVANSE--PLTSRMGVWPEVALINHSCGPNAVAVMLYDRLVVRATRRIPRGREVLLN 658

Query: 134 YIETAGS---TMTRQKALKEQYLFTCTCPRC 161
           ++ + G+      R+  L + Y F C C RC
Sbjct: 659 WLGSQGALAPARERRAQLSQMYDFACRCARC 689


>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
 gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
          Length = 429

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 60/223 (26%), Positives = 98/223 (43%), Gaps = 15/223 (6%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           P      I   F K++ N+ ++ NS    +G  L   +S  NHSC P+  + + GR A++
Sbjct: 136 PHSHPETIKSIFKKVSVNSFSLSNSTGNSIGIALCVKLSAANHSCKPSTRVCYRGRTAML 195

Query: 119 RAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
             V   +P   E    SYI+   +  TRQ  LK++Y F C C  C       D + +  +
Sbjct: 196 VPVDDRLPTSLEGACHSYIDELQTLSTRQATLKKKYKFDCACEGCT------DDERNGRM 249

Query: 177 EGYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCG 234
           E + C  + C    +R+ +  G +C  CG  L R + E+ + A E  I S+  L   S  
Sbjct: 250 EAWAC--EICKTGWIRNVE--GASCNPCGYVLTRDQYELCRTAEEAAIASRPKLENDSIP 305

Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 277
                    K++E  Q  L H ++V+ +     L    + + D
Sbjct: 306 LETRRHLCEKLLELFQDTL-HAYNVHRIPVLRCLFVATLAIRD 347


>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 619

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 6/123 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G G+YP ++++NHSC P  +  F G   +VRAV+ +  G E+  +Y  I T      R+
Sbjct: 422 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMIVRAVRTINAGEEISENYGPIFTTMPESERK 481

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFT--CQ 202
           + L+ QY F C C  C       D  +  IL  ++C+    C   L+  SD   F   C 
Sbjct: 482 RKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLMVKSDTNEFMIGCA 540

Query: 203 QCG 205
           +CG
Sbjct: 541 KCG 543


>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 493

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)

Query: 61  ISINEIAENFSKLACNAHT---ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           +S   I +++   A NA T   +    L P   GL+P+ ++INHSC+ NAV V    + V
Sbjct: 188 LSYTNIEQSWRTTALNAPTPLDLSRLSLPPRLLGLFPLAAMINHSCVGNAVRVLVDDVMV 247

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           V A   +P G E++ SY         R   L+  Y F C CPRC
Sbjct: 248 VHATTDLPAGTELVWSYGPPTTPFAQRNTRLRAHYGFVCDCPRC 291


>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
          Length = 505

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 61/118 (51%), Gaps = 3/118 (2%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           I + F K+  N   I + E  P G  L  V S +NHSC PNA + FEG    +R+++ + 
Sbjct: 193 IQDLFCKVLTNVMCINHQEGSP-GITLDLVGSFVNHSCDPNAFVFFEGSQLRMRSLKPIN 251

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
            G E+ ++Y E     + R++ L   Y F C C RC K  +  ++ E +  EG+  +D
Sbjct: 252 AGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKK--EHKELHELSFREGFSIRD 307


>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
           [Anolis carolinensis]
          Length = 163

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 45  GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPGEEICISYLDCCQRERSRHS 104

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L Q DD
Sbjct: 105 RHKVLRENYLFVCSCPKC--LAQADD 128


>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
           antarctica T-34]
          Length = 882

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 65/119 (54%), Gaps = 17/119 (14%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  E+ +   + + N+ T+ +S+L PLG  ++  ++++NHSC PNA +VF          
Sbjct: 463 SATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAAVVFPFGGAAKSGQ 522

Query: 112 -------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
                  + ++  + A++ +  G E+LISY++ A +   R+ ALK++Y F C C  C K
Sbjct: 523 QNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKRYCFECRCELCRK 581


>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
          Length = 354

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 55/98 (56%)

Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
           QK + H F++  ++T +      + LE W++A  Y +  +P Y   Y + HPL+GL Y T
Sbjct: 234 QKDILHSFNIQHIRTLDTAFLAAVNLEYWEDAELYSKQLLPGYLLYYGEVHPLIGLLYLT 293

Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
            GK++  L   + A++ + +A  +L ITHG   P M+E
Sbjct: 294 MGKIQLHLRKLKQALEILKKASAVLTITHGDKHPIMRE 331



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 55/99 (55%), Gaps = 3/99 (3%)

Query: 30  LVAHMSDI--DEKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           L++H SDI  D K++  +  +   L+  + +    +  E+   + ++  N+  I + ++ 
Sbjct: 117 LMSHYSDIKKDPKRMEHFISLCGVLLEFLGEALMPNTAELIGIYGRICINSFNILDLDMN 176

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
            +G G+Y   S+I+HSC PNAV+ FEG   +VR +  +P
Sbjct: 177 SIGVGIYLGASVIDHSCKPNAVVTFEGTTIIVRTLIDLP 215


>gi|66800595|ref|XP_629223.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
 gi|60462643|gb|EAL60846.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
          Length = 508

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVL---VFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 142
           G GL+P+ S +NHSC PN  +   + +G  +V   V+A +++P G+E+L SY +   S  
Sbjct: 428 GVGLFPIFSCMNHSCFPNVEISNEIIDGVTSVRMVVKAKKNIPAGSEILHSYCDETLSNK 487

Query: 143 TRQKALKEQYLFTCTCPRCIK 163
            R+  L  QY F CTC +C K
Sbjct: 488 ERKDILFSQYGFKCTCNKCSK 508


>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
          Length = 896

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 1/92 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           K+  NA    +++L  +G  L P +++ NHSC+PNA++ F GR A++RA + +    E+ 
Sbjct: 535 KIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGRKAILRAEKPIKVDEEIE 594

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           ISY +       R+ AL   Y F C CPRC K
Sbjct: 595 ISYTDYTFPRSKRKHALAP-YFFDCQCPRCEK 625


>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 697

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 51/202 (25%)

Query: 44  LYAQIANLVNLILQ--WPEISINEIAENFSKLACNAHTICNSELRP-------------- 87
            + Q+ +L+N++ +  W       I  +  +L CN H I  SELR               
Sbjct: 353 FFQQLGDLINVMSKENWSIYCGAVIFRHIGQLVCNGHAI--SELRGSFASENNCLEADSF 410

Query: 88  ---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
                          + TG++P IS+ NHSC PN    F      V A + +P G E+  
Sbjct: 411 NIKAGFLHRYFESTRVFTGIFPQISMFNHSCEPNIRNTFNKNTLTVYAAKDIPVGGEIFN 470

Query: 133 SYIETAGSTMT------RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
            Y    G          R+ AL++QY F C C RC      D+  ES   E Y+C    C
Sbjct: 471 CY----GPNFKLMCKDERKSALRQQYGFDCKCTRCAMNN--DEAYES--FEHYKCPFAKC 522

Query: 187 SG-FLLRDSDD---KGFTCQQC 204
           S  F+L+++ D   K   C  C
Sbjct: 523 SKYFMLKENADPFEKDIKCPMC 544


>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
          Length = 458

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 87/186 (46%), Gaps = 17/186 (9%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           GLYP  S++NH C+ N   VF+     +R  A + +P G ++  +Y  +  +T+ R+  L
Sbjct: 225 GLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTLDRRLHL 284

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCG 205
           K    F C C RC      D  +   +L   +C   GC G +L  +       + C  CG
Sbjct: 285 KSTKHFWCQCSRCA-----DPRELGTLLSAVKCT--GCGGAVLSQNPLYQMSNWECSDCG 337

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
            V+  E++K++     +  K+   L    N  E++  +  I K    + HP S ++++ +
Sbjct: 338 SVQKVEQVKRVHDSARMELKQIAQLAR--NRPELLEDF--IRKYSGAI-HPDSCHVIEAK 392

Query: 266 EKLIKI 271
             L+++
Sbjct: 393 YALVQL 398


>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
           B]
          Length = 604

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 74/163 (45%), Gaps = 26/163 (15%)

Query: 17  PSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLA 74
           PS    +  +  +LV ++   +  +L  +      NLV+LI               S+ A
Sbjct: 219 PSAVESHTHMAHSLVRYLGIANPAELAPFGLNSAGNLVDLI---------------SRFA 263

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVP 125
            N  T+ +  L P+G  + P I++ NHSC PNA +VF         +  L  + A++ + 
Sbjct: 264 TNTFTLTSFSLTPIGICICPSIALANHSCEPNAAIVFPRSSSLPQAQEPLMHLMAIRDIA 323

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
              +V+ +Y++       RQKAL+E Y FTC C  C K    D
Sbjct: 324 PNEQVVAAYVDVTLPRELRQKALQETYSFTCKCKLCTKRMAVD 366


>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 541

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           + ++N  AE   K+  N+    +++    GT L P+++++NHSC+PNAV++F  R A +R
Sbjct: 173 DANLNLAAEILCKIQTNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLR 232

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           A   V  G ++ ISYI+       R++ L E Y F C C RC K
Sbjct: 233 AETPVKAGEDITISYIDYTKPLSFRRQDL-ELYHFQCGCLRCEK 275


>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
          Length = 816

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 9/125 (7%)

Query: 69  NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
           NFS  A  A T  N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G 
Sbjct: 62  NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 119

Query: 129 EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
            + +SY    G      RQK LK++Y F C C  C +L   D      +L  +RC +  C
Sbjct: 120 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 174

Query: 187 SGFLL 191
            G +L
Sbjct: 175 FGTVL 179


>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
          Length = 661

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 80/267 (29%), Positives = 121/267 (45%), Gaps = 43/267 (16%)

Query: 21  TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS--INE--------IAENF 70
           T++Y  +  LV+H +   ++ +L   ++A     +L+  EIS   N+        I +N 
Sbjct: 374 TEDYRNIYHLVSHENKRTKQDILHRTEMAIF---LLKLLEISGYFNDDAASFGCLILKNL 430

Query: 71  SKLACNAHTICNSE-LRP---------------LGTGLYPVISIINHSCLPNAVLVFEGR 114
             L  NAH +   + L+P               +G  +YP +++ NHSC P  V  F G 
Sbjct: 431 QVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPTLALFNHSCEPGIVRYFCGS 490

Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQ 171
             VV AV+++ KG EV  +Y  I T      RQ  LKEQY F C C  C +   +++D+ 
Sbjct: 491 RIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQSQLKEQYWFDCKCLPCEQNWPKYEDMT 550

Query: 172 ESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQCGLVRSKEEI-KKIAS--EVNILSKKT 227
           E+ +   ++C  D  CS  +    D   F   QCGL +    I K + S  +   L K  
Sbjct: 551 ENYLR--FKCDSDQPCSNVIPVPYDCMEFMV-QCGLCQQYTNILKGLKSLQDTETLYKLG 607

Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLY 254
            A    G + E +   K IE L  KLY
Sbjct: 608 RAAMGEGKYGEAIK--KFIETL--KLY 630


>gi|307182513|gb|EFN69720.1| Histone-lysine N-methyltransferase ASHR1 [Camponotus floridanus]
          Length = 132

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 55/100 (55%)

Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
           K QK + H F++  ++T E      M L+ W++A  Y +  IP Y   Y + HPL+GL Y
Sbjct: 12  KKQKNILHRFNIQHIRTLEAAHIAAMNLKYWEDAELYGKELIPGYLLYYGEIHPLIGLLY 71

Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
              GK++  L   + A+K + +A E+L ITHG     M+E
Sbjct: 72  LMTGKIQLHLNKPKEALKVLEKASEVLMITHGDKHSLMRE 111


>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 487

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 51/87 (58%)

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           CN+  I +++L  +  G+Y   S++NHSC  N ++VF+ R   +R V+ V +G E  I Y
Sbjct: 181 CNSFGIFDNDLIAISDGIYLRASMVNHSCDYNCIVVFDERKLQLRTVKDVKEGEECTIGY 240

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
           ++       R+  L+E+Y FTC C +C
Sbjct: 241 VDVIHPAKERRAELEEKYHFTCKCVKC 267



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 40/67 (59%)

Query: 97  SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 156
           S++NHSC PN   V +GR   +  V+ V +G E  I+Y++       RQ  LKE+Y FTC
Sbjct: 364 SMLNHSCDPNCAWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYHFTC 423

Query: 157 TCPRCIK 163
            C +CI+
Sbjct: 424 KCVKCIE 430


>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 501

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 66/135 (48%), Gaps = 11/135 (8%)

Query: 91  GLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            L+PV ++ NH C+PN   ++  EG L V  AV  + +G E+ +SY +    T  R+  L
Sbjct: 208 ALFPVGALTNHQCVPNTRHIVNAEGELLVYAAVP-IAEGQEITMSYADVLWDTQMRRHFL 266

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCG 205
                F C CPRC      D  +  ++L    C  D C G LL ++    +  +TC +CG
Sbjct: 267 LATKHFACQCPRCT-----DVTECGSLLGALSCAIDNCQGNLLPNTPLSLNCPWTCNECG 321

Query: 206 LVRSKEEIKKIASEV 220
           L+    +I  I + +
Sbjct: 322 LIIKGRQISSIRAGI 336


>gi|302834810|ref|XP_002948967.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
 gi|300265712|gb|EFJ49902.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
          Length = 679

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           ++ N   E+    A +A    ++EL     GL+P  S+INHSC P A     G + VVRA
Sbjct: 218 VTFNAYGESKPDPAVSAVRSLSAELLSGCVGLWPPFSLINHSCAPVASYGLVGDVMVVRA 277

Query: 121 VQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCI 162
              +P G +V ISY      G    R+  L++ Y F C C RC+
Sbjct: 278 AADLPAGQQVTISYFGRRALGPLELRRAYLRQHYGFVCACERCV 321


>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 459

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 59/106 (55%), Gaps = 11/106 (10%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF----EGRLAVVRA 120
           I E FS+ + N + I +S L  +G G++P+ S + NHSCLPNA++ +    EG   VV+A
Sbjct: 189 IKEIFSRFS-NNNFIVHSHLSAVGHGIFPLASRLFNHSCLPNAIVTYSFTSEGIQMVVKA 247

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC-----PRC 161
           +  + +G E+ I Y + A     RQ   +  Y F CTC     PRC
Sbjct: 248 LTPIKQGEEITIPYFDPALPYHQRQAICRYSYGFECTCSVCMFPRC 293


>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
          Length = 1064

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 72   KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
            K+ CN   I +   + +G    P  S  NHSC PN   V    + V +A+  + KG  + 
Sbjct: 910  KIRCNQFGIWSKRDKCIGVSCTPAASYFNHSCCPNLADVRGTTVVVFKALHFISKGTPIS 969

Query: 132  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFL 190
            ISY++    T  RQ  LK  Y FTC C RC      D   ES   +  + C    CSG  
Sbjct: 970  ISYLDLDQPTHERQSYLKTFYYFTCQCLRCK-----DQTDESDNWISRFYCDRFKCSGTY 1024

Query: 191  LRD 193
              D
Sbjct: 1025 FLD 1027


>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
          Length = 697

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 70/159 (44%), Gaps = 15/159 (9%)

Query: 15  VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEISINEIAE 68
           V P   +D   LV  L     D   K    Y +      LIL W       E ++   A 
Sbjct: 487 VRPPKWSDTARLVTNLSVLSKDSRSKFRCAYRR---FTKLILPWLGEGGSTEANLTVTAT 543

Query: 69  NFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQH 123
            F++L     CN+  + N++   +G  LYP  S  NHSC PN   V + G LA   A++ 
Sbjct: 544 FFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALRE 603

Query: 124 VPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
           + KG  + I Y++    ST  R++ L   Y F C C RC
Sbjct: 604 IRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
          Length = 429

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 7/103 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            L+P+ +++NH C+PN+   FE +    V+RA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHLFL 267

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
           K +  FTC C RC      D  ++ A +    C+D  CSG  +
Sbjct: 268 KMKKNFTCKCSRC-----SDPTEKGAFISAVYCRDTNCSGLAV 305


>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 278

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 40/76 (52%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
            +YP +S  NH C PN      G +    AV+ +P G E+ ISYI    S   RQK L E
Sbjct: 69  AVYPKVSRFNHDCRPNVHYRLSGLIHTTVAVRDIPAGTELTISYIYVRKSRAERQKELSE 128

Query: 151 QYLFTCTCPRCIKLGQ 166
            + FTCTC +C K  +
Sbjct: 129 SWNFTCTCEQCSKSAE 144


>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 483

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/303 (21%), Positives = 125/303 (41%), Gaps = 35/303 (11%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
           + ++  +E + +  C    I   E+R        GLYP ++++NHSC+ N   ++   + 
Sbjct: 152 LKLDRFSEKYIQTICGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDY 211

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
           R+ + R    +P G E+  SY  +   T+ R++ L E   F C CPRC      D  +  
Sbjct: 212 RIRL-RTTLKIPAGGELYASYTHSLLPTILRREHLLEGKHFACACPRC-----SDPTELG 265

Query: 174 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
             +   +C   D+G    L     +  + C  C    + + ++KI   +        A++
Sbjct: 266 THMSSLKCNKCDNGIVLPLDSLDSESTWKCTHCDFSTNGQAVRKILRIIQAQVDAAEAIS 325

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
              +  + +   + + K  + + HP    L   R  L ++   ++++          +  
Sbjct: 326 G-ADGADAIYKRETVMKKYRLVLHPHHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHK 384

Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVE 332
           +  C+L + V   V P +  + G+  Y          K +W  G  + A +KS M EA  
Sbjct: 385 VDMCRLLLQVLDVVEPGYSRVRGMTLYELHAPLLFLAKDQWNAGVIDEAKLKSKMIEAAN 444

Query: 333 ILR 335
           IL+
Sbjct: 445 ILK 447


>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
 gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
          Length = 392

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 127
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 128 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 161
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|281347298|gb|EFB22882.1| hypothetical protein PANDA_013199 [Ailuropoda melanoleuca]
          Length = 1019

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 43/59 (72%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
           S++ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 130 SQVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEE 188


>gi|452846662|gb|EME48594.1| hypothetical protein DOTSEDRAFT_19117 [Dothistroma septosporum
           NZE10]
          Length = 199

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 16/91 (17%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           S L CN+  +      PLG  +  +IS +NHSC+PNA +VFE                E+
Sbjct: 39  SLLLCNSLILMTPANDPLGIVVELLISNVNHSCVPNACIVFE----------------EI 82

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            +SY++  G    RQ AL  +Y FTCTCP+C
Sbjct: 83  FVSYVDQHGLFHARQAALWHRYYFTCTCPKC 113


>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 490

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGS 140
           S+L   G+ L+P+++  NH C PNAV+ +      V  R+++ +P   EV ISYI+   S
Sbjct: 406 SDLCISGSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVS 465

Query: 141 TMTRQKALKEQYLFTCTCPRCI 162
              R+K L+ +YLF C C RC+
Sbjct: 466 CSQRRKELQHKYLFNCKCTRCL 487


>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
 gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
          Length = 391

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 127
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 128 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 161
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 662

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TGL+   S +NHSC+PN+V  F G L VVRA+QH+  G E+  +Y +  G    RQ  L+
Sbjct: 515 TGLWARASYLNHSCMPNSVKDFAGDLIVVRALQHIRTGDEITHAYQDN-GDYDARQALLQ 573

Query: 150 EQYLFTCTCPRCI 162
             + FTC C  C+
Sbjct: 574 TTWGFTCRCKLCV 586


>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
          Length = 420

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 363 RQKILRENYLFVCSCPKC--LAQADD 386


>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
          Length = 816

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 83/363 (22%), Positives = 148/363 (40%), Gaps = 85/363 (23%)

Query: 50  NLVNLILQWPEISINEIA--------------ENFSKLACNAHTICNSELRPLGTGLYPV 95
           +L  L+L   +I +N IA              + FS  +     +C+ E   +   +Y  
Sbjct: 445 SLSQLVLLICQIKVNSIAIVHMKSMDGGKVLTKGFSGFS--GDVMCSVEQVRVAQAIYMS 502

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYL 153
            S  NHSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y 
Sbjct: 503 GSFFNHSCQPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYY 562

Query: 154 FTCT------------------CPRCIKLGQF---------DDIQESAILEGYRC----- 181
           F+C                   CP+   LG           ++    +I E + C     
Sbjct: 563 FSCGCSSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRRKENFVHVSIGESHVCTLSLP 622

Query: 182 ------KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNIL 223
                 +D    G L  +SD     D GF C  C   RS+ ++        +  S++N L
Sbjct: 623 DVSKFDEDIVKVGKLFFNSDTMLNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRL 678

Query: 224 SKKTLALTSCGNHQEVVSTYKMIEKLQ-----KKLYHPFSVNLMQTREKLIKILMELEDW 278
           ++    L S  N  EV+    + E LQ     +KL HP+S  L Q  + + +   ++ D 
Sbjct: 679 NE----LPSPDNVPEVL----IAEALQSLEHIEKLRHPYSKALAQAHDTIAEAFAKVGDQ 730

Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
           ++A  +C+ +  + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +
Sbjct: 731 EQARKHCEASFKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYY 790

Query: 339 GTN 341
           G +
Sbjct: 791 GPD 793


>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
 gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
          Length = 421

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 305 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 364

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 365 RQKILRENYLFVCSCPKC--LAQADD 388


>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 72/165 (43%), Gaps = 15/165 (9%)

Query: 9   KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEIS 62
           K +   V P   +D   LV  L     D   K    Y +      LIL W       E +
Sbjct: 481 KEETHAVRPPRWSDTARLVTNLSMLSKDSRSKFRCAYRR---FTKLILPWLGEGGSTEAN 537

Query: 63  INEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAV 117
           +   A  F++L     CN+  + N++   +G  LYP  S  NHSC PN   V + G LA 
Sbjct: 538 LTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAA 597

Query: 118 VRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
             A++ + KG  + I Y++    ST  R++ L   Y F C C RC
Sbjct: 598 FHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
 gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
 gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
          Length = 448

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 6/102 (5%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
               F C C RC      D  +    L    C++ GC G ++
Sbjct: 288 MTKHFICKCVRCQ-----DPTENGTYLSALFCREQGCRGLVI 324


>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
           NZE10]
          Length = 549

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 40/74 (54%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG G  P++   NHSC PNA  +F     ++RA++ + KG E+ + Y +       RQ  
Sbjct: 223 LGMGFDPIMCSANHSCDPNAAALFNQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAE 282

Query: 148 LKEQYLFTCTCPRC 161
           LK  Y FTC C RC
Sbjct: 283 LKGYYFFTCHCARC 296


>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
          Length = 559

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 52/91 (57%), Gaps = 1/91 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           ++  NA    +++    G  L P ++ +NHSC+PNA + F+ R A +RA + + +G E+ 
Sbjct: 199 QIQTNAFNRLDADTGMSGIFLDPALARVNHSCVPNAFIGFDKRTATLRAERPIKEGEEIT 258

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           ISYI        R++ L+  Y F C CPRC+
Sbjct: 259 ISYIANDKPRSIRREGLR-LYYFECDCPRCV 288


>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
          Length = 409

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 59/104 (56%), Gaps = 8/104 (7%)

Query: 67  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVV 118
           A+ F+ L C    N+++I +   + +G  LY +IS  NHSC PNA ++FE     R A +
Sbjct: 300 AQMFNYLVCILLTNSYSIEDKNGQEIGAALYSLISCCNHSCAPNAQVIFEDSEDAREATL 359

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             ++   +G E+ ISYI   G +++ ++    Q+ FTC C RC+
Sbjct: 360 VLLRPCVQGEELYISYITDLGRSVSERRRELAQWCFTCQCTRCL 403


>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
           C-169]
          Length = 329

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 61/149 (40%), Gaps = 18/149 (12%)

Query: 31  VAHMSDIDEKQLLLYAQIANLVNLILQWP---EISINEIAENFSKLACNAHTIC-----N 82
            AH    D   L      A L   + Q      +S  E+A+  +K   N + I       
Sbjct: 77  AAHSRSKDSSDLFRRLSTATLSAQLPQLAGSRPLSPEEVADALAKEELNGYGIMAPSGPE 136

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLI 132
            E R  GT LYP  S++NH CLPN   V           E      +A+ ++P G E   
Sbjct: 137 GERRIRGTALYPKASLLNHECLPNVARVDNFDEADVNAPENTAVHFKALHNLPAGEEFTQ 196

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           SY     S   RQ+  ++QY F C CPRC
Sbjct: 197 SYFPMHTSYHVRQQRCQDQYGFACNCPRC 225


>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 668

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 89/183 (48%), Gaps = 23/183 (12%)

Query: 66  IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRL 115
           I  +   + CNA+ I  +         E R +G  +YP +S+INHSC PN V   +   +
Sbjct: 442 ILHHLQAINCNAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI 501

Query: 116 AVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
            VVR ++ V KG E+L  Y     + + ++R + L ++Y F CTC  CI+  Q+   +  
Sbjct: 502 VVVRTLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACIQNWQYPLPE-- 559

Query: 174 AILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 229
             +  Y+CK   CS  +    L + D +    +QC   ++   +KKI +++    KK L 
Sbjct: 560 --IMKYKCK--TCSEIIGIIALNEKDMQNVLNKQCNCNKTI-NLKKIKNQLQNSIKKRLN 614

Query: 230 LTS 232
             S
Sbjct: 615 AIS 617


>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Acyrthosiphon pisum]
          Length = 387

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 47/80 (58%), Gaps = 10/80 (12%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYI---ETAGST 141
           G+ L+   S INHSC PNA  VF+G    RL  +R ++    G E+ ISYI   E   S 
Sbjct: 289 GSALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIE---PGDEINISYIAPCELDHSR 345

Query: 142 MTRQKALKEQYLFTCTCPRC 161
            TRQK L+E Y+FTC C +C
Sbjct: 346 HTRQKYLQENYVFTCRCIKC 365


>gi|307187924|gb|EFN72837.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 651

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 8/124 (6%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G G+YP ++++NHSC P  +  F G   VVRA + +  G E+  +Y  I T      R+
Sbjct: 454 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMVVRAARTIRAGEEISENYGPIFTTTPENERK 513

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDD-GCSGFLLRDSDDKGFT--C 201
           + L+ QY F C C  C   G +  ++E    +  ++C+    C   LL  SD   F   C
Sbjct: 514 RRLRVQYWFDCNCEACS--GHWPLLEELDPTILRFKCETGLSCGNVLLVRSDTNEFMIGC 571

Query: 202 QQCG 205
            +CG
Sbjct: 572 AKCG 575


>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
          Length = 697

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 2/93 (2%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAE 129
           + L CN+  + N++   +G  LYP  S  NHSC PN   V + G  A   A++ +PKG  
Sbjct: 550 AALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEP 609

Query: 130 VLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
           + I Y++    ST  R++ L   Y F C C RC
Sbjct: 610 LTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 513

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/316 (22%), Positives = 133/316 (42%), Gaps = 41/316 (12%)

Query: 50  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 105
           N+V+ I +  ++ ++  +E     AC    I   E+R         LYP ++++NHSC+ 
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230

Query: 106 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           N   ++   + R+ + R    +P G E+  SY  +   T+ R++ L E   F C CPRC 
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRC- 288

Query: 163 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
                D  +    +   +C   D+G    L     +  + C  C    S   ++K+   +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFTTSGSAVRKV---L 341

Query: 221 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
           +I+  +  A+   S  +  + +   + + K  + + HP    L   R  L ++   ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401

Query: 279 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-T 320
                     +  +  C+L + V   + P +  + G+  Y          K +W  G   
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFIAKTQWNAGVID 461

Query: 321 ENAIKS-MTEAVEILR 335
           E A+KS M EA  IL+
Sbjct: 462 EAALKSKMIEAANILK 477


>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
 gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
 gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
          Length = 421

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           RQK L+E YLF C+CP+C+      DI
Sbjct: 363 RQKILRENYLFMCSCPKCLAQADEPDI 389


>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 514

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/308 (22%), Positives = 126/308 (40%), Gaps = 43/308 (13%)

Query: 60  EISINEIAENFSKLACNAHTICNSELR-PLGTG---LYPVISIINHSCLPN---AVLVFE 112
           ++ ++  +E   + AC    I   E+R   G G   LYP ++++NHSC+ N   ++   +
Sbjct: 182 QLKLDRFSEELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTD 241

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
            R+ + R   HVP   E+  SY  +   TM R++ L E   F C C RC      D  + 
Sbjct: 242 YRIRL-RTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC-----SDPTEL 295

Query: 173 SAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK----IASEVNILSKK 226
              +   +C   D+G    L     D  + C  C    S   ++K    I +EV+ + + 
Sbjct: 296 GTHMSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQ- 354

Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-------- 278
                S  +  + +   + + K  + + HP    L   R  L ++   ++++        
Sbjct: 355 ----ISGADGADAIQERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDV 410

Query: 279 --KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT--------ENAIKS-M 327
             +  +  C+  + V   + P +  + G+  Y       F+  T        E A+KS M
Sbjct: 411 VLEHKIDMCRFLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTLWNAGVIDEAALKSKM 470

Query: 328 TEAVEILR 335
            EA  IL+
Sbjct: 471 IEASNILK 478


>gi|402590196|gb|EJW84127.1| hypothetical protein WUBG_04964 [Wuchereria bancrofti]
          Length = 283

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 132/288 (45%), Gaps = 32/288 (11%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           S  E+ + F K+  N   I +S L   G G+Y  +S ++HSC P+A ++F G  A++R++
Sbjct: 3   SSTEMFDIFCKIVINTVVITDSCLSRTGIGVYLGLSALDHSCKPDAFIIFNGTKAILRSL 62

Query: 122 QH--VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
                     + I Y +    T TR K ++ Q+ F C C  C  + + D  ++   L   
Sbjct: 63  SKNITEYNNNLRIPYCDLLELTSTRCKYMQLQHNFICNCEICQNV-ELD--RQKCSLRCT 119

Query: 180 RCKDDGCSGFLLRDSDDKGFT-CQQCGLVR--SKEEIKKIASEVNILSKKTLALTSCGNH 236
           +C D  C        D++  T C+ CG +   + + ++K+  ++      T   ++  N 
Sbjct: 120 KCTDGFCP---YSPEDEQAETRCKVCGEISLFNFDHLQKLYQQL------TTDDSTEKNL 170

Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
            E++  Y   E    +++ P++V L +  EK++   M+   + EA           +  Y
Sbjct: 171 NELIDLYCESE----EVFSPYNVPLCKFAEKIMISAMKHHKYDEAA----------KTFY 216

Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 344
           P+  P L  +     KL   L   E +   + EA++++R ++G+ S F
Sbjct: 217 PKGLPPLPTRMLEYAKL-LMLQHDEASFPILREALKMIRESYGSESNF 263


>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
          Length = 369

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 84/195 (43%), Gaps = 18/195 (9%)

Query: 29  ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           +LV H  DI  DE+++  + +I   +    +   I      + F K + N+H+I  +   
Sbjct: 137 SLVDHKKDIREDEEKMKHFEKIFVKMGNFRKDDMIEKGHFFDVFCKASINSHSIHTNAGN 196

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
            +G  L   +S  NHSC P   +VF+G    +R +  VP        +  ISYI+   S 
Sbjct: 197 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEQAFISYIDVGRSK 254

Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
             R++ LK ++ F C C RC+         E   L   +C +  C   +L    ++    
Sbjct: 255 FVRRRDLKSRWYFDCECTRCMD-------PEDDSLTAIKCANPACDAPILTSETEEPMNI 307

Query: 200 TCQQCGLVRSKEEIK 214
            C +C  V  +  +K
Sbjct: 308 ACDKCKTVVEESTVK 322


>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 423

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 314 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 372

Query: 143 TRQKALKEQYLFTCTCPRC 161
           TR+K L   YLF C CP+C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391


>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 72/165 (43%), Gaps = 15/165 (9%)

Query: 9   KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEIS 62
           K +   V P   +D   LV  L     D   K    Y +      LIL W       E +
Sbjct: 481 KEETHAVRPPRWSDTARLVTNLSVLSKDSRSKFRCAYRR---FTKLILPWLGEGGSIEAN 537

Query: 63  INEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAV 117
           +   A  F++L     CN+  + N++   +G  LYP  S  NHSC PN   V + G LA 
Sbjct: 538 LTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAA 597

Query: 118 VRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
             A++ + KG  + I Y++    ST  R++ L   Y F C C RC
Sbjct: 598 FHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 429

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 320 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 378

Query: 143 TRQKALKEQYLFTCTCPRC 161
           TR+K L   YLF C CP+C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397


>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
 gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
 gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
          Length = 393

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQGEEICISYLDECMLERSRHS 345

Query: 144 RQKALKEQYLFTCTCPRC 161
           R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363


>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
           TFB-10046 SS5]
          Length = 451

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 3/75 (4%)

Query: 90  TGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETA-GSTMTRQK 146
           +G++P  S ++HSC PNA+  ++ G + + +RA +H+ +GAEV ISY+      T  RQ+
Sbjct: 244 SGVFPTFSRVSHSCCPNAIPRWDSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQE 303

Query: 147 ALKEQYLFTCTCPRC 161
            L++ Y F CTCP C
Sbjct: 304 FLRDGYHFECTCPAC 318


>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 697

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G G++P +++ NHSC P  V  F G   VVRA++ +P GAE+  +Y  I T      R+
Sbjct: 499 IGGGVFPTVALFNHSCNPGVVRYFIGNTMVVRAIKTIPAGAEISENYGPIFTEEEENDRK 558

Query: 146 KALKEQYLFTCTCPRC 161
           + L+ QY F C C  C
Sbjct: 559 RKLRLQYWFDCDCEAC 574


>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
           max]
          Length = 419

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 55/120 (45%), Gaps = 15/120 (12%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLA-----------VVRAVQHVPKGAEVLISYIETAG 139
           G+YP  S  NH CLPNA   F+   A           ++R +  VP+G E+ +SY     
Sbjct: 231 GIYPYASFFNHDCLPNACR-FDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNE 289

Query: 140 STMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
              +RQK L E Y FTC C RC         D ++++A  E     +D C      D+DD
Sbjct: 290 KYSSRQKRLIEDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDD 349


>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 673

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 22/180 (12%)

Query: 69  NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 118
           N   + CNA+ I  +         E R +G  +YP IS+INHSC PN V   +     V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRNIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509

Query: 119 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
           R ++ + KG E+L  Y     +   ++R + L ++Y F CTC  C +  QF  + E+   
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYCFLCTCEACTQNWQF-PLPETM-- 566

Query: 177 EGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
             Y+CK   CS  +    L +   +  + +QC     K ++KKI ++     +K L   S
Sbjct: 567 -NYKCK--MCSEIIGTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623


>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 433

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 324 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 382

Query: 143 TRQKALKEQYLFTCTCPRC 161
           TR+K L   YLF C CP+C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401


>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
 gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
          Length = 633

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 67/131 (51%), Gaps = 12/131 (9%)

Query: 46  AQIANLVNLILQW----PEISINEIAENFSKLAC-NAH---TICNSE---LRPLGTGLYP 94
           +QI  +++L   W     E+ I E A     + C NA     +   E   +R   TGL+ 
Sbjct: 430 SQIEKVMDLFGDWRSIGKEVLIGEDAFQVHDVVCRNAFGPGAVIAGEAENIRKASTGLWL 489

Query: 95  VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
           + + INHSCLPNA   F G L VVRA +++  G+E+  SY +++     RQ+AL + + F
Sbjct: 490 MAAYINHSCLPNAEKSFLGDLMVVRATRNITAGSEITHSY-DSSSDYDARQEALMKTWGF 548

Query: 155 TCTCPRCIKLG 165
            C C  C   G
Sbjct: 549 RCRCELCKADG 559


>gi|384252143|gb|EIE25620.1| hypothetical protein COCSUDRAFT_83627 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 75/308 (24%), Positives = 136/308 (44%), Gaps = 39/308 (12%)

Query: 15  VIPSTTTDNYSLVEALVAHMSDIDEKQLL-------LYAQIANL-VNLILQ-WPEISINE 65
           V+   T+ +  LV  L  H+ D+   QL        + A  A L   L+L+ + ++ IN 
Sbjct: 99  VVAQVTSRD--LVAGLETHVGDVPSHQLFQEAVMCFIAAACAGLEPQLVLKAYRQVVINA 156

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           IA   ++ A       ++E R +G  +YP  +++NH+C PN    F G    + A   +P
Sbjct: 157 IAITPAEHA-------SAEDR-MGLAIYPRAAMLNHACSPNVAAAFAGTRLHICATSDLP 208

Query: 126 KGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
            G  +   Y   AG ++   R + LKEQY F C C  C     +  +QE+A++ G +C+ 
Sbjct: 209 PGTTLRFCYGPQAGESIREVRLRQLKEQYHFWCRCSAC--EASYSGVQEAAMI-GMKCRL 265

Query: 184 DGCSGFLLRDSDDKGFTCQQCGL-VRSKEEIKKIASEVNILSKKTLALTSCGNHQ----E 238
             C+G ++            CG  + S+E+I+++A      +    A  S   H     E
Sbjct: 266 --CNGPVVPSIG--------CGAEMSSEEQIERMARLSQADAAFASACLSLAQHSAVSAE 315

Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
            + +     ++Q+ L    +  L +T + L +      D + A+ + Q  + + +  Y  
Sbjct: 316 TLQSMHFCLEVQEVLLVTSNQVLGRTHDALARAYGPRGDTENAMLHAQKALDIVRSNYGD 375

Query: 299 FHPLLGLQ 306
            H ++  Q
Sbjct: 376 GHLVVAHQ 383


>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
           NIH/UT8656]
          Length = 550

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)

Query: 72  KLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLAVVRA 120
           +L+C    N+ T+ N+    +G  L+P  +++NHSC PNA + F+       G ++V  A
Sbjct: 212 RLSCAVIVNSFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSISV-HA 270

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           ++ + K  E+ ISYI+T      RQ+ L E+Y FTC C  C
Sbjct: 271 LRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLC 311


>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
          Length = 391

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           + +G+GLY   S  NHSC PN   V +  G   V+ A++ + +G E+  +YI+ + S   
Sbjct: 268 KSIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEA 327

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
           R K LKE Y F C CP C      +       L+ Y CK   C+G +
Sbjct: 328 RIKKLKESYFFNCQCPGCK-----NAPSHKQFLDKYLCKVKQCNGVI 369


>gi|332025390|gb|EGI65557.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 401

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 7/124 (5%)

Query: 86  RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 141
           RP  LG  +YP ++  NH C P     F GR  V+RA++ +  G  +  +Y  I T  + 
Sbjct: 200 RPVYLGVAIYPSVARFNHDCYPAVTRYFIGRHIVIRAIRGLGPGDVIAENYGPIFTKRTL 259

Query: 142 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
             RQ+ L  +Y F CTC  C +    F+++   ++    RC   GC G  LR    K   
Sbjct: 260 AERQRTLTGRYWFQCTCKACQEDWPCFENLTNDSV--KLRCPTVGCGGLHLRSRQGKPIK 317

Query: 201 CQQC 204
           C  C
Sbjct: 318 CPDC 321


>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
          Length = 510

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 111/269 (41%), Gaps = 31/269 (11%)

Query: 91  GLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           GLYP ++++NHSC+ N   ++   + R+ + R    +P G E+  SY  +   TM R++ 
Sbjct: 213 GLYPTVAMMNHSCVSNTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTMLRREH 271

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCG 205
           L E   F C CPRC      D  +    +   +C   D+G    L     +  + C  C 
Sbjct: 272 LLEGKHFACACPRC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDSESKWKCTHCD 326

Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
              + + ++K+   +        A++   +  + +   + I K  + + HP    L   R
Sbjct: 327 FSTTGQAVRKVFRIIQAEVDAAEAISG-ADGADAIHERETIMKKYRSVLHPQHAFLSMLR 385

Query: 266 EKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT------ 309
             L ++   ++++          +  +  C+  + V   V P +  + G+  Y       
Sbjct: 386 HSLTQMYGRVDEYLLDDLPDVVLEHKVDLCRSLLQVLNVVEPGYSRVRGMTLYELHAPLL 445

Query: 310 -CGKLEWFLGD-TENAIKS-MTEAVEILR 335
              K +W  G   E  +KS M EA  IL+
Sbjct: 446 FLAKGQWNAGVIDETGLKSKMIEAANILK 474


>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
 gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
          Length = 392

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
           I E ++K+   A    N+E    G+GLY + S INHSC+PN+   F     + V++A+  
Sbjct: 267 IDEIYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNSCSTFPYSNDIVVLKALTP 322

Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
           + +G E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IQEGDEICISYLDECQLERSRHSRHKVLRENYIFVCQCPKC 363


>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
           Shintoku]
          Length = 150

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 79  TICNSELRP--LGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYI 135
           TI   EL P  +G GL+  +S +NHSC PN  + + +  +A +  +  VP+G E  ISYI
Sbjct: 56  TIQKKELYPETMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYI 115

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 165
           +   S   RQ+ L + Y F C C +CI  G
Sbjct: 116 DEKDSVENRQEKLYKNYGFKCDCNKCILEG 145


>gi|330796724|ref|XP_003286415.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
 gi|325083610|gb|EGC37058.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
          Length = 499

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 6/79 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE-----GRLA-VVRAVQHVPKGAEVLISYIETAGSTM 142
           G GL+P+ S +NHSCLPN  +  E      R+  VV+A + +P G+E+L SY +    T 
Sbjct: 419 GVGLFPIFSCMNHSCLPNIEISNEIQDGVDRVKMVVKAKKLIPAGSEILHSYCDENLPTK 478

Query: 143 TRQKALKEQYLFTCTCPRC 161
            R++ L  QY F C CP+C
Sbjct: 479 ERKQLLLSQYGFKCLCPKC 497


>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
 gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
           commune H4-8]
          Length = 360

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G++   S INHSC PNAV  F+      VVRA++H+ KG E+ ISY ET     TR++  
Sbjct: 175 GVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAATRRQNS 234

Query: 149 KEQYLFTCTCPRC 161
            + Y F C CP C
Sbjct: 235 LQDYGFRCACPAC 247


>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
          Length = 497

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 91/206 (44%), Gaps = 26/206 (12%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
           ++ S+V+ L AH SD          Q    V+++     +++ E  E F +  C+     
Sbjct: 141 EDKSVVKCLKAHKSD----------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSN 190

Query: 82  NSEL-------RPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLIS 133
             E+       +    GLYP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  S
Sbjct: 191 AFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHS 250

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           Y     ST  R+  L     F C C RC    +F     S +     CK       +  +
Sbjct: 251 YSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTN 305

Query: 194 S--DDKGFTCQQCGLVRSKEEIKKIA 217
           S   DK + C+ CG +  KEE+  I 
Sbjct: 306 SLQTDK-WQCEVCGSLIKKEEVANIT 330


>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
 gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
          Length = 701

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 24/135 (17%)

Query: 51  LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 110
           L + +     +  +E+ +   K   N+H + + +LR LGTG+YP  S+ NHSC PNA + 
Sbjct: 246 LASAVRHAAALDPSEVEDMLGKTRENSHGVVDWKLRQLGTGIYPEASMFNHSCAPNAAVS 305

Query: 111 F----------------EGRLAVVRAVQHVP--------KGAEVLISYIETAGSTMTRQK 146
           F                E R   V    +VP        KG EV I+Y E      +R+ 
Sbjct: 306 FGAGGTLNTCCIGVDDYEARAMEVGPKGYVPLGRPRAIRKGEEVCIAYTELYRPAASRRL 365

Query: 147 ALKEQYLFTCTCPRC 161
            L++   F C C RC
Sbjct: 366 TLEKSKGFVCECVRC 380


>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
 gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
          Length = 402

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 45  YAQIANLVNLILQ---WPEISINEIAENFSKLACNAHTI------CNSELRPLGTGLYPV 95
           Y+ I   + LI      P ISI  + +   +   NA  I       + E    G G+YP 
Sbjct: 271 YSNIYKFIRLIAPEEYLPFISIQAVRDIIGRNLTNAFGIWSPVTSSDEEREYFGFGVYPS 330

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLF 154
            S  NHSC PN   +  G      A + +P GAE+ ISY I  +     RQ AL+E + F
Sbjct: 331 ASFFNHSCKPNVTKIRRGASYEYIAKEDIPVGAELCISYGIRESDGLKQRQDALRE-WFF 389

Query: 155 TCTCPRCIKLGQ 166
            C C RC   G+
Sbjct: 390 ECGCSRCTTEGK 401


>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
 gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
          Length = 660

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARLKELYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAIIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYLKALAKFIDLIRIMYEVLAPPF 636


>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
          Length = 431

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 46/80 (57%), Gaps = 9/80 (11%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-----GRLA----VVRAVQHVPKGAEVLISYIETAGST 141
           G Y  ++  NHSC PNA    +      RL     V+RAV  + +G EV ISYIE   +T
Sbjct: 174 GFYLRMAFCNHSCRPNACQYIDPNSTRARLNSPSIVLRAVSDIAEGEEVCISYIELMDTT 233

Query: 142 MTRQKALKEQYLFTCTCPRC 161
             R++AL+E Y FTC CPRC
Sbjct: 234 PERREALQELYYFTCQCPRC 253


>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
           206040]
          Length = 742

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TG++   S  NHSCLPNA   F G + +VRA++ +P G E+L+ Y+        RQ+ ++
Sbjct: 501 TGIWLQASYANHSCLPNATRAFIGDMMIVRALREIPVGGEILMQYVPQDKPFDKRQEVVE 560

Query: 150 EQYLFTCTCPRC 161
             Y F C C  C
Sbjct: 561 NHYGFKCDCDLC 572


>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
 gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
          Length = 660

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + +  +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALPKFIDLIRIMYEVLAPPF 636


>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 425

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 110/262 (41%), Gaps = 50/262 (19%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           PE +I+      +KL  N+  +  +    +G  L+P  ++INHSC  NAV+ F+G     
Sbjct: 140 PEDTISAFG---AKLDVNSFNMTTALADRIGLYLHPYAALINHSCAYNAVIGFDGAELFA 196

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
            A++ + +  ++ ISY++       R+  L+E+Y F C C +C    + D          
Sbjct: 197 TALRPITRDEQIFISYVDATNPVAVRRNELRERYFFDCRCAKCA--AEMD---------- 244

Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGN 235
                DG       D+  K +T  +      V   E ++ + S +  L +   A      
Sbjct: 245 ---APDGGRAPAAADAQRKAYTLMEVATGAGVDPAEAVRALESAMRTLRESEGA------ 295

Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ-- 293
                 ++++ E+            L+  R++LI  L+    ++   A+ Q  I V +  
Sbjct: 296 ------SWRITEQ-----------PLVSLRDELIASLLGAGRFQ--TAFVQAAIRVVRVD 336

Query: 294 -RVYP-QFHPLLGLQYYTCGKL 313
             VYP + HP+  L  +T  +L
Sbjct: 337 CVVYPVRGHPIRQLHAWTLARL 358


>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
 gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
          Length = 660

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    V +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHVNSVRPIEAGLPINENYGPMYTQDGRPERQARLKELYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC   + C+  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 636


>gi|159472132|ref|XP_001694205.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276868|gb|EDP02638.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 468

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 148
           GL+   S++NHSC PNAV    G    V A  HV +G+E+LI+Y+  A       RQ  L
Sbjct: 139 GLWHSFSMLNHSCCPNAVNYVVGDSMCVFAAGHVEQGSELLINYLGAASLRPVEERQAGL 198

Query: 149 KEQYLFTCTCPRC 161
            E Y F+C CPRC
Sbjct: 199 AEAYAFSCGCPRC 211


>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
          Length = 513

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 67/316 (21%), Positives = 130/316 (41%), Gaps = 41/316 (12%)

Query: 50  NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 105
           N+V+ I +  ++ ++  +E     AC    I   E+R         LYP ++++NHSC+ 
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230

Query: 106 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           N   ++   + R+ + R    +P G E+  SY  +   T+ R++ L E   F C C RC 
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRC- 288

Query: 163 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
                D  +    +   +C   D+G    L     D  + C  C    S   ++K+   +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFTTSGSAVRKV---L 341

Query: 221 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
           +I+  +  A+   S  +  + +   + + K  + + HP    L   R  L ++   ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401

Query: 279 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTE 321
                     +  +  C+L + V   + P +  + G+  Y          K +W  G  +
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTQWNAGVID 461

Query: 322 NAI--KSMTEAVEILR 335
            A+    M EA  IL+
Sbjct: 462 EAVLKSKMIEAANILK 477


>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
          Length = 636

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
            G GLYP  ++ NHSC PN        G L VV AV+ VP G+++ ISY++       RQ
Sbjct: 228 FGVGLYPAGAMFNHSCRPNCSWRTNRSGELCVV-AVEDVPAGSQLFISYVDILQPWPVRQ 286

Query: 146 KALKEQYLFTCTCPRCIK 163
             L+  + F C CPRC +
Sbjct: 287 DLLRCHFFFECACPRCCR 304


>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 413

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 8/106 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
           G GLYP+ S+  HSC PNA   F   +  L++V A++ +  G EV +SYI+      S  
Sbjct: 309 GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLV-ALRDIRPGEEVTVSYIDECSLSRSRH 367

Query: 143 TRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCS 187
           +R K L+E +LFTCTCPRC +   Q  D+     +E     ++ CS
Sbjct: 368 SRIKMLRESHLFTCTCPRCEEEATQQPDVTSDEEMEDSSDAEEACS 413


>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
          Length = 397

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 97/230 (42%), Gaps = 27/230 (11%)

Query: 14  NVIPSTTTDNYSLVEALVAHMS---DIDEKQLLLYAQIANL------VNLILQWPEISIN 64
           N +P    D+  L  AL A  +   + D+K LL   Q   L      +  + Q+ +I ++
Sbjct: 118 NKVPIEEVDDTLLSRALTAIRALLLNEDQKYLLTSLQANKLPQYGSEIRDLKQYFDIPLH 177

Query: 65  EIAENFSKLA-CNAHTICNSELRPLGT------GLYPVISIINHSCLPNAVLVFEGRLAV 117
           E  E F  L  C  +T       P G       GLYPV SI+NH+C+PN    F G L +
Sbjct: 178 E--EEFMILVICILNTNAFQMATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQM 235

Query: 118 -VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
            V+A + +  G+E+   Y      T  R+  L +   F C C RC      D  +   +L
Sbjct: 236 TVKATKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHFLCDCERCA-----DPTERGTLL 290

Query: 177 EGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKIASEVNIL 223
              +C    C G LL     K    + C +CG+    + I  I S +  L
Sbjct: 291 AALKCFSTECQGSLLPIQPLKTTTAWRCLECGMRVPNDNICVIQSALGSL 340


>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
           1558]
          Length = 660

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 11/129 (8%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EG----RLA 116
           S   I +  S    N  +  +  L  +G  L P +++ NHSC PNAV+VF EG    R+ 
Sbjct: 355 STGSILQVISSFMINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEGADSMRIV 414

Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQES 173
            ++A++    G EVL  Y++ A     RQ  L+  Y F C CP C    K G    +   
Sbjct: 415 AIKAIE---AGEEVLTHYVDLALPYAQRQAELRRTYHFECKCPTCDNRTKDGDKPVLDPR 471

Query: 174 AILEGYRCK 182
           +IL    CK
Sbjct: 472 SILIHRECK 480


>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 440

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 95/213 (44%), Gaps = 23/213 (10%)

Query: 19  TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
           T  ++Y  + +L +H ++++  +  L   IA+ +    +  +IS  EIA     L  N H
Sbjct: 158 TNPESYKKLISLESHCNEMNNSKEPL--NIAHFIKRFFKADDISEEEIATIIGILQVNGH 215

Query: 79  TICNSELRPLG----TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLI 132
            +      PL       +Y + S+I H+C  N    F   G L ++RA   + KG  + I
Sbjct: 216 EV------PLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHISI 268

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
            Y +    T  R+  L +   F C C RC    +F       +    +C +  CSG++L 
Sbjct: 269 CYTDPLWGTANRRHHLFKTKFFECICNRCQDPTEF-----QTMFNALKCNNINCSGYILP 323

Query: 193 DS---DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
            +    ++ + C+ C  V S  EI+K+  ++ I
Sbjct: 324 KTFLEQEQDYICKICESVVSCTEIEKVLEDIGI 356


>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
 gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
          Length = 393

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQGDEICISYLDECMLERSRHS 345

Query: 144 RQKALKEQYLFTCTCPRC 161
           R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363


>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 46/87 (52%), Gaps = 14/87 (16%)

Query: 91  GLYPVISIINHSCLPNAVLVF-------------EGR-LAVVRAVQHVPKGAEVLISYIE 136
           GLYP+ ++INHSC PNAV VF             +GR + +V A   + KG E+  SY+ 
Sbjct: 244 GLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWSYLP 303

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIK 163
                  R+  L  ++ FTC C RCIK
Sbjct: 304 PCSPFNERRAVLSSKFGFTCQCVRCIK 330


>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 735

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
           + TG+YP  S++NHSC PN + +F G+  +VRA + + +   +   Y       +T  RQ
Sbjct: 456 VATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIDQSEGIFNCYGPHYRHMTTEDRQ 515

Query: 146 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           K LK QY FTC C  C   KL  F  I+    ++  +C    C+   ++DS      C  
Sbjct: 516 KILKSQYCFTCKCKACTLPKLQYF--IERFNAVKCLKCNGPVCN---IKDS----IYCLD 566

Query: 204 CG 205
           CG
Sbjct: 567 CG 568


>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
 gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
          Length = 517

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 11/163 (6%)

Query: 46  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           A I   +    +  + +  +I      L  N H I  S+  P   G++   S   +SCLP
Sbjct: 189 ASIGQFIPKFFKTQKFTEEQIMRAVGALQINGHEIPTSD--PPHVGVFYTASFTENSCLP 246

Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           N    F +    ++ A Q + K A + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 247 NLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCADV 306

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQC 204
            + D           +C+D  C G +L    D+    + C++C
Sbjct: 307 TELD-----TFYSALKCEDRQCGGLMLPSKSDEWNGSWRCREC 344


>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
 gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
          Length = 646

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 94/216 (43%), Gaps = 26/216 (12%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 222
              +FDD+    I   +RC           D+ +    C  CG + +  +  K+  +  +
Sbjct: 540 NWPKFDDLPRDVI--RFRC-----------DAPNNCSACVTCGEITNILKGLKVMQDTEM 586

Query: 223 LSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
           +++    L   G + + ++ +  +I  + + L  PF
Sbjct: 587 MTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 622


>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
          Length = 1187

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 81/171 (47%), Gaps = 20/171 (11%)

Query: 22   DNYSLVEALVAHMSDIDEKQLLLY-AQIANLVNL--ILQWPE----ISINEIAENFSKLA 74
            D+Y  +  L    S+ID KQ+  +     +++N+  + + P+    ++  +I + + K+ 
Sbjct: 858  DSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKSLEPLTNQQILDLYPKML 917

Query: 75   CNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLI 132
             NAH   +      +  G+YP  + +NHSC PN V   +   +   R+++ +  G E+  
Sbjct: 918  FNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIRDIKAGEEITT 977

Query: 133  SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
            +YI+      TRQ  L  QY F C C RC         Q+ A   G++C D
Sbjct: 978  TYIDITKYKSTRQLNLLSQYAFLCQCARC---------QDRAT--GFKCLD 1017


>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 449

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 84/185 (45%), Gaps = 21/185 (11%)

Query: 19  TTTDNYSLVEALVAHMSDIDEKQLLLYAQ----IANLVNLILQWP----EISINEIAENF 70
           T  D+Y     L + +  +D++ +  + +    I++L++ +         I+I+EI E F
Sbjct: 128 TPIDDYQDFMGLTSTLDKVDKEHMTKFKRGVTSISSLISSVRGVGYLKNTITIHEILECF 187

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAE 129
           S +  NAH    +  + +G G+ P     NHSC+PN     + + + +     +V KG E
Sbjct: 188 SSVLTNAHQFSYATSKEIGRGVCPT-GYFNHSCMPNTTWSLDDQGMLLFSTSSNVKKGDE 246

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
           + + Y+        R++ L + Y F C CP C         +  + L GY C  + C   
Sbjct: 247 LSLGYLANEYPLKNRRRELLDGYYFFCQCPLC---------EFQSNLSGYLC--EKCKEP 295

Query: 190 LLRDS 194
           LL DS
Sbjct: 296 LLNDS 300


>gi|452988674|gb|EME88429.1| hypothetical protein MYCFIDRAFT_43814 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 334

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 8/100 (8%)

Query: 69  NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA--VVRAV 121
           NF         I  S   PLG      G++P++S +NHSCLPNA   +  RL   +V  +
Sbjct: 89  NFPGSPTPFSNIVRSNGYPLGPSSSIGGIFPLVSRLNHSCLPNAQHAYNERLGKMLVHII 148

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + +    E+ +SYI   G +  RQ  LK  +LFTCTC  C
Sbjct: 149 RPILPNEEITLSYI-PGGPSPQRQTELKSNFLFTCTCTLC 187


>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
 gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
          Length = 597

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAV 121
           S+ A N++++   +L P+G  + P++S++NHSC PNA  VF         +  L  V A+
Sbjct: 252 SRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTPSTDEPLISVVAI 311

Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + +    ++  SYI+T      R++ L+E Y F C C  C
Sbjct: 312 RDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLC 351


>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
 gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
          Length = 775

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 65/311 (20%), Positives = 122/311 (39%), Gaps = 46/311 (14%)

Query: 81  CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG- 139
           CN E   +G  +Y   S+ NHSC PN    F  R   +RA   +  G  + +SY    G 
Sbjct: 464 CNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRATVFMAVGCPLELSYGPQVGQ 523

Query: 140 -STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
                R + LK++Y F C C  C  +   D      ++  + C +  C G +L   D   
Sbjct: 524 LDCKDRLQLLKDEYSFNCQCSGCSTVHISD-----LVINAFCCINPNCRGVVL---DRSI 575

Query: 199 FTCQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK----- 248
           F+C+         V  +  ++      + L         CG++ ++ S+   ++K     
Sbjct: 576 FSCENTKTKDFLTVNDQMILEPFMQTDSFLHAGPSHCLKCGSYCDIKSSRLTVDKAGIHF 635

Query: 249 --LQKKL-----------------------YHPFSVNLMQTREKLIKILMELEDWKEALA 283
             LQ+++                        H ++  + +  + L +    L   + A  
Sbjct: 636 TRLQQEINLNRVSETTVSDALGALISLKSTLHEYNRRIAEAEDNLSQAFSLLGKLELAAE 695

Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
           +C+ +I + +++Y + H  +G +      +   +GD  NA+  +    +I R  +G+N  
Sbjct: 696 HCKASIRILEKLYGENHIAIGNELSKLSSILISVGD-HNAVDCIKRLSKIFRCYYGSNVN 754

Query: 344 FMKELILKLEE 354
            M   +  LEE
Sbjct: 755 TMFPFLNILEE 765


>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
 gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
          Length = 343

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLY + S INHSC PNA + F     A +R ++ +  G E+ ISY +T+   + R+  L 
Sbjct: 260 GLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLF 319

Query: 150 EQYLFTCTCPRCI 162
           E Y F C CP+C+
Sbjct: 320 ENYGFNCECPKCL 332


>gi|149238794|ref|XP_001525273.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450766|gb|EDK45022.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 619

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 68  ENFSKLA-CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP- 125
           ENF  L   N +T+ + + R +G  L P  S+INHSC+PN+  V E      R +  +P 
Sbjct: 213 ENFVNLIFINCNTLVDYKFRIVGYNLDPQFSMINHSCMPNSCQV-ETEYGCYRLINTLPI 271

Query: 126 -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
             G E+ ++YI +   T  RQ  L  +Y F C CP C
Sbjct: 272 NAGDEITVNYIASGIPTELRQVQLFSRYFFRCKCPLC 308


>gi|324507181|gb|ADY43048.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 614

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/296 (22%), Positives = 127/296 (42%), Gaps = 21/296 (7%)

Query: 76  NAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 131
           N H I +   L  +G GLY  +   +HSC PN +   +G +A +RA+    ++       
Sbjct: 231 NRHAISDDGYLEEIGKGLYLDLCAYDHSCRPNTIYTCKGFVATLRALDLTVNLMDRTTTF 290

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF-- 189
            SYI+   +T  R+K L++ + F C C RC+     D +  S      R K +    F  
Sbjct: 291 YSYIDLLSTTQERKKLLRDTWYFDCQCVRCV--DNDDHVLSSMFCPSCREKPERLCIFGE 348

Query: 190 -LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS--TYKMI 246
              ++   +  TC +C +   KE + +    +  + +    + +    Q++     ++ +
Sbjct: 349 SPYKNPVTQILTCPKCHIEVPKERVLEAVDAMRFIDR----IVAKNEIQQMAKEQAFRFL 404

Query: 247 EKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV---YPQFHPL 302
           + ++ +      SVN+     K+I++L+ L D  ++     L +   + V   +P  HP 
Sbjct: 405 KGIKDRFSKILPSVNVYMC--KIIQLLIPLIDPTDSKTLLSLHLESEECVRFCFPPNHPA 462

Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
           + +   + G     +G    A   +  A +IL+ T G   P   + +  LEEA  E
Sbjct: 463 VAVHLSSIGLFYLRIGHPHRAELYLEMACDILKFTLGHTHPLTVDKLSILEEAHRE 518


>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 652

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 21/133 (15%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISY 134
           L P+G  + P +++ NHSC PNA LVF          E  L++V A+++V  G E+ ISY
Sbjct: 314 LTPIGICVSPTVALANHSCDPNAALVFPRADGGSRVKEPLLSLV-ALRNVAPGKEIRISY 372

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG---FLL 191
           ++T      RQ  LKE Y F+C C  C +    D        E   C    C G   +  
Sbjct: 373 VDTTLPNRLRQNELKEVYSFSCQCKLCCRTTAADP------REALWCPKS-CGGICPYPT 425

Query: 192 RDSDDKGFTCQQC 204
            +S D+   C +C
Sbjct: 426 EESGDRATHCVKC 438


>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
          Length = 598

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 127/300 (42%), Gaps = 28/300 (9%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           P +S  + A   S L  N+H +        G+GL+ +  ++ HSC PN      G    V
Sbjct: 257 PTVSDEDAARMLSILNTNSHELGEWG----GSGLFLLGCLMEHSCAPNCNFSTYGTQLWV 312

Query: 119 RAVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
            A+  + KG  + I Y + T   T  R+K LK+ Y F C CPRC  L    D+  + I  
Sbjct: 313 TALNSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTL---PDVMRAFICP 369

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
             +CK   C   L   S    + C++C     K ++ K + +  +  +K L +     H 
Sbjct: 370 VKQCKGKVCP--LGEGSKPSDWVCEKC-----KSQLDKKSRKRFLEIEKAL-MEEGPEHD 421

Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----KEALAYCQLTIPVYQ 293
               T+ ++ +  ++L   ++ +     E+ +      E W    K A     + IP+  
Sbjct: 422 VFHETHHIVLEALRELGSVYAKD-----EEAVADGSAEEIWLRVLKGADQARSILIPL-- 474

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
            V P+FHP   + +    +L     +   A +++  + E+ R   G  +P  ++ +  +E
Sbjct: 475 -VLPKFHPTKAIWWDNVAQLRLSRKNQAGAQEALKRSYEVTRRCCGDATPSTQDALRLVE 533


>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
          Length = 739

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 64/295 (21%)

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAG 139
           N  +R L T ++PV+S++NHSC PN  + F GR   VRA + + +  EV   Y   +   
Sbjct: 465 NKSIR-LATAVFPVLSLLNHSCDPNTTVSFTGRFVTVRANRPIRRDEEVTHCYGPHKLRM 523

Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
               RQ+ LK+QY F C C  C +                           L+     GF
Sbjct: 524 DVAERQQLLKDQYFFVCQCKACTE--------------------------ELKGKKTHGF 557

Query: 200 TCQQCGLVRSKEEI--------KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
            C  C      EE            AS+  + S+    L   GNH ++        +LQ 
Sbjct: 558 FCPLCKAQLEGEEALYCTGARCTYTASQTQLTSR----LNQLGNHIQIAKV-----QLQD 608

Query: 252 KLYHPFSVNLM-----------QTREKLIKILMELE-------DWKEALAYCQLTIPVYQ 293
                  + LM           Q    L +I+  L        +WK A  + + +I + +
Sbjct: 609 NKTDNAKMTLMSCLSEAECFLSQDHLLLGEIMDHLAQAEASDGNWKAAAGHLRKSISIVK 668

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
             Y      LG + +   ++ +   +  +A++++ EA ++L + +G +   ++EL
Sbjct: 669 VHYGSSSMELGHELFKLAQILFNGFEVPDAMRTIMEAQKVLSMHYGPDHNLVQEL 723


>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
           rotundata]
          Length = 388

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 52/86 (60%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSAVNHSCIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           RQKAL   YLF C C +C  L Q +D
Sbjct: 347 RQKALSSLYLFICCCSKC--LSQTND 370


>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 673

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 83/178 (46%), Gaps = 18/178 (10%)

Query: 69  NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 118
           N   + CNA+ I  +         E R +G  +YP IS+INHSC PN V   +     V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRHIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509

Query: 119 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
           R ++ + KG E+L  Y     +   ++R + L ++Y F C C  C +  QF  + E+   
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYRFLCACEACTQNWQF-PLPETM-- 566

Query: 177 EGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
             Y+CK   +  S   L +   +  + +QC     K ++KKI ++     +K L   S
Sbjct: 567 -NYKCKMCSEIISTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623


>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 628

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 65/300 (21%), Positives = 124/300 (41%), Gaps = 28/300 (9%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
            +K+A +   +C  +   + T +YP  S++NHSC PN +  F  +  +VRA++ +  G E
Sbjct: 326 LNKIASDKDKLCVEQQDRIATAIYPSASMMNHSCDPNIITSFVDQYLIVRAMKDIQAGEE 385

Query: 130 VLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
           V   Y       S   RQ+ LK QY F C C  C+ + ++++  E       +C +  C 
Sbjct: 386 VSNCYGPNFRRMSREHRQEILKNQYGFECRCDPCV-MPEYENFMER--FHAIKCPE--CG 440

Query: 188 GFLLRDSDDKGFTCQQCG---LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
           G  L +       C  CG    +  + E+K   +  +               +E V   K
Sbjct: 441 G-ALEEFHSYSMHCLDCGATPYINCQIELKNAVTAFDAAQ----IYIELEREEEAVVKLK 495

Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP------- 297
               +++ L + +  +L  T + L K+   +  W E++++ + ++   +  Y        
Sbjct: 496 ECLSIRRNLLYKYHQDLTATLDILGKVYAIMGRWLESISHIEHSMAAIEERYGCSSMEVA 555

Query: 298 -QFHPLLG-----LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
            + + L       LQ       +W+    +   + +  A EI+  T+G      +E+I K
Sbjct: 556 NELNKLTDICIRYLQEEPNRTTKWYKNVLKKTRRYLDRAEEIVNFTYGPWHEIHQEIIEK 615


>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
          Length = 338

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA + F        V+A+ ++  G E+ ISY++      S  +
Sbjct: 234 GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTGEEICISYLDECALERSRHS 293

Query: 144 RQKALKEQYLFTCTCPRC 161
           R K L+E YLF C C +C
Sbjct: 294 RHKILRENYLFNCNCSKC 311


>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 419

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 524

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           G+GLY   +  NHSC PNA +  E     A +R++  +P G ++ ISYIE    + TRQ 
Sbjct: 444 GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQIEISYIEEDQPSQTRQS 503

Query: 147 ALKEQYLFTCTCPRC 161
            L ++Y F C C +C
Sbjct: 504 ELIDKYKFKCHCQKC 518


>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 519

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 12/167 (7%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV- 118
           + S +EIA     +  N H +  SE  P+   +Y   S + HSC  N    F     +V 
Sbjct: 198 DFSEDEIARAAGIMQINGHEVPISE--PVLIAVYDESSYVEHSCRANCSKSFTSSGGIVI 255

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
           RA   + KG  ++I Y +    T  R+  L     F CTCPRC    +F      ++   
Sbjct: 256 RAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFFECTCPRCSDPAEF-----GSMFNA 310

Query: 179 YRCKDDGCSGFLLRDS----DDKGFTCQQCGLVRSKEEIKKIASEVN 221
             C    CSG +L  S    +   + C +C   RS   + ++  ++ 
Sbjct: 311 IECSKSDCSGCMLPKSFTVQEIPDYRCNKCDNSRSSNSVDEMLEQIG 357


>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
 gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
 gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
          Length = 567

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 334 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 386

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LK+ Y F C+C  CI 
Sbjct: 387 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 446

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 447 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 504

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 505 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 543


>gi|391338176|ref|XP_003743437.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Metaseiulus
           occidentalis]
          Length = 409

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI--NEIAENFSKLACNAHTICNS- 83
           ++ LV+H+ D+++ +       A  +  IL    +SI   E+ E + K   N H + +  
Sbjct: 111 LDDLVSHLEDLNDDERQKVKAKAQQIKSILDRVGVSIERKELMEIYMKCKINCHLVIDHN 170

Query: 84  --ELRPLGTGLYPVISIINHSCLP--NAVLVFEGRLAVVRAVQH--VPKGAEVLISYIET 137
             E    G  +Y   S ++H+C+P  N  L+F+GR  V+RA+++  +   +++ I Y+ +
Sbjct: 171 EPEYLSRGRAIYLAASKLDHTCVPTENYTLMFDGRRLVIRAMKNFTLKDPSDLRIHYVPS 230

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQF----DDIQESAILE 177
             S   R+  L + Y F C+C +C+   +F    DD +E  + E
Sbjct: 231 NLSYNDRRDRLLKNYFFLCSCDQCVSQLRFPAADDDSREERVSE 274


>gi|328770867|gb|EGF80908.1| hypothetical protein BATDEDRAFT_88247 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 428

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 47/265 (17%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQK 146
           G+G+YP  S+INH                 RA +++ KG  + ISY E+AG ++  TRQ+
Sbjct: 153 GSGIYPTASLINH-----------------RASKNIRKGNAITISYGESAGRSLRATRQQ 195

Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
            LK  + F C C  CI      D Q    L G +C   GC  + L + + +   C +C  
Sbjct: 196 NLKAGWGFECRCSACISEDDTKDRQ----LAGLKCP--GCP-YPLYEGEPR---CPKC-- 243

Query: 207 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ------KKLYHPFSVN 260
                 I  I+ +  +++K       C +H++  +   +I+ LQ      + + HP S  
Sbjct: 244 ----HSIVDISRQQKVMAKIDAVWHECQDHRKHDNVASLIQTLQGIQTDLESIMHPSSSR 299

Query: 261 LMQTREKLIKILMELE---DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-CGKLEWF 316
           L    + L  + ++L    D +  L++C+  +    R+      L  ++      K+ +F
Sbjct: 300 LGCMYDNLASLNVQLAKQGDNRILLSWCEKLLESSIRIVEGIFGLTSIEAANEHNKMAYF 359

Query: 317 LG--DTENAIKSMTEAVEILRITHG 339
           +   D E  IK    A  + R  +G
Sbjct: 360 MMAVDAEKGIKIARHAEGLFRTLYG 384


>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
          Length = 387

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 91/205 (44%), Gaps = 26/205 (12%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
           ++ S+V+ L AH SD          Q    V+++     +++ E  E F +  C+     
Sbjct: 141 EDKSVVKCLKAHKSD----------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSN 190

Query: 82  NSEL-------RPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLIS 133
             E+       +    GLYP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  S
Sbjct: 191 AFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHS 250

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
           Y     ST  R+  L     F C C RC    +F     S +     CK       +  +
Sbjct: 251 YSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTN 305

Query: 194 S--DDKGFTCQQCGLVRSKEEIKKI 216
           S   DK + C+ CG +  KEE+  I
Sbjct: 306 SLQTDK-WQCEVCGSLIKKEEVANI 329


>gi|116179288|ref|XP_001219493.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
 gi|88184569|gb|EAQ92037.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
          Length = 588

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 1/70 (1%)

Query: 92  LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 151
           L+P  ++INHSC+PN+   F G L V+RA +++ KG E++ SY E+ G    RQ+AL   
Sbjct: 425 LWPWAALINHSCIPNSEREFVGDLMVIRATKNIAKGEEIVHSYDES-GVYDDRQRALMTT 483

Query: 152 YLFTCTCPRC 161
           + F C+C  C
Sbjct: 484 WGFECSCALC 493


>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 654

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           LG  +Y +    NHSC PNA   F   G L V+RA   + +G E+ I Y    G+T  R+
Sbjct: 439 LGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTERR 498

Query: 146 KALKEQYLFTCTCPRCI 162
           + L +QY F C C  CI
Sbjct: 499 QHLYKQYRFNCDCRACI 515


>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
 gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
          Length = 392

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S +NHSC+PNA     +   + V++AV  + KG E+ ISY++      S  +
Sbjct: 286 GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHS 345

Query: 144 RQKALKEQYLFTCTCPRC 161
           R K L+E Y+F C C +C
Sbjct: 346 RHKILRENYIFVCQCSKC 363


>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
 gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
          Length = 515

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)

Query: 46  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           A I   +    +  + S  EI      L  N H +  S+  P    ++   S   +SC+P
Sbjct: 187 ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTASFTENSCVP 244

Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           N    F +    ++ A + + KGA + I Y +    T  RQ+ L +  LF CTC RC+ +
Sbjct: 245 NLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERCVDV 304

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
            +             +C+D  C+G LL    D    C +C
Sbjct: 305 TEL-----GTYYSALKCEDRKCTGLLLPIKADDWNGCWRC 339


>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
 gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
          Length = 660

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  + T      RQ  LK+ Y F C+C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
              +FDD+    I   +RC   + CS  +      +D    C  CG + +  +  K+  +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G + + ++ +  +I  + + L  PF
Sbjct: 598 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 636


>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 681

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 17/144 (11%)

Query: 76  NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           NAH +  + L         RPL  G  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 454 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 513

Query: 125 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 181
             G  V  +Y  I T  S   R++ L  +Y F C C  C +   +F+ +    +    RC
Sbjct: 514 RAGDAVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 571

Query: 182 KDDGCSGFLLRDSD-DKGFTCQQC 204
             +GCS    R  D D+   C  C
Sbjct: 572 PTEGCSKLHPRPRDPDRSIECSSC 595


>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
 gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
          Length = 391

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 15/170 (8%)

Query: 1   MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA-HMSDIDEKQLLLYAQIANLVNLILQWP 59
           M+++ L   + +D +    T + +S + ALV  +   I    L LY      V+ I  +P
Sbjct: 207 MIRILLSEAMYDDRLEQWFTPEGFSSLFALVGTNGQGIGTSSLSLY------VHNIDSYP 260

Query: 60  EISINE---IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 114
            +S +E   I    ++L      +    L   G GLY + S  NHSC PNA + F     
Sbjct: 261 ALSDDERQAIDIFLNQLYEEMERVSGQFLNCEGAGLYALQSSCNHSCAPNAEVTFPKNNS 320

Query: 115 LAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
             V++A+  +  G E+ ISY+E      S  +R K L+E Y+F CTC +C
Sbjct: 321 TLVLKALHPIKNGEEICISYLEECQRERSRHSRLKYLRENYIFDCTCTKC 370


>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 2/101 (1%)

Query: 70  FSKLACNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
           F+ +  N   IC+ E     +G+ +Y   +++NHSC PNA  VF     + +A++ +  G
Sbjct: 30  FAAIKNNQFAICDEESGDYDVGSAVYIDHALVNHSCRPNAYPVFNKTNMIFKALRKIEPG 89

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
            E+  +Y +T      R++ L + + F C CP C K  + D
Sbjct: 90  EEITHAYTDTISPIQERREYLNDVWRFMCNCPGCTKSNEID 130


>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 662

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
           + TG+YP  S++NHSC PN + +F G+  +VRA + + +  E+   Y       +T  RQ
Sbjct: 377 IATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIGQSEEIFNCYGPHYRHMTTEDRQ 436

Query: 146 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           K LK  Y FTC C  C   KL  F +   +A  +  +C    C+   ++DS      C  
Sbjct: 437 KILKSHYCFTCKCKACTLPKLQYFTERFNAA--KCLKCNGPVCN---IKDS----IYCLD 487

Query: 204 CG 205
           CG
Sbjct: 488 CG 489


>gi|50552992|ref|XP_503906.1| YALI0E13552p [Yarrowia lipolytica]
 gi|49649775|emb|CAG79499.1| YALI0E13552p [Yarrowia lipolytica CLIB122]
          Length = 637

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 50/101 (49%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
           + +I E    L  N +T C+   R  G    P +++INHSC+PNA L+F GR   +   +
Sbjct: 179 LEQILEYAGMLFVNQYTRCDDLGRQGGYIFDPTLALINHSCVPNAYLLFRGRKVHLVCWK 238

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
            +  G EV +SY      T  R+  L   + F C CP C+ 
Sbjct: 239 PINDGDEVFLSYTRFMHPTPERRTLLYMHFRFWCECPGCVS 279


>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
          Length = 763

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 42/73 (57%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G+GL+ + S INHSC  N+   F G L +VRA + + +G E+ I+Y+E       RQKA 
Sbjct: 515 GSGLWIMPSYINHSCWQNSTRSFLGDLLIVRAARDLSEGEEITINYMENESGVQKRQKAF 574

Query: 149 KEQYLFTCTCPRC 161
             ++ F C C  C
Sbjct: 575 LSEWGFECKCTMC 587


>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
          Length = 159

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 46  GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 105

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 106 RHKILRENYLFVCSCPKCL 124


>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
 gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
          Length = 518

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 103/242 (42%), Gaps = 35/242 (14%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-KGAEVL-ISYIETAGS 140
           S++  L  GLYP  +++ HSC+PN +L  +G  + VR     P K  E+L ISY  +   
Sbjct: 190 SQMGNLARGLYPKTALMMHSCMPNTLLSVDG-YSNVRVFTSAPVKMGEILNISYTRSLFG 248

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKG 198
           T  RQ  L++   F C+C RC      D  +    L   RC   DDG   F     D+  
Sbjct: 249 TYDRQTHLRKGKYFICSCRRCK-----DPTELGTHLSSIRCSECDDGLCSFY---PDEPR 300

Query: 199 FTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL--Y 254
           + C +C  +  +E + +I  A+  +I++             EV    ++I K  K L  +
Sbjct: 301 WECNKCHKLLKREYVNEILCAARDDIMACPL----------EVRDLERVISKHSKILNPH 350

Query: 255 HPFSVNLMQTREKLIKILMELED--------WKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
           H   +   QT    +K +    D         +     C+  +P+ + + P    L+G+ 
Sbjct: 351 HSLVLEAKQTLMGELKAICMSTDPTAVPKPVLRRKFELCEEILPILRTLEPGLSRLVGIA 410

Query: 307 YY 308
            Y
Sbjct: 411 MY 412


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 15/148 (10%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            +Y   S++ H C+ NA+  F  +  +V RA   +PKG ++ + Y E    TM RQ+ L 
Sbjct: 231 AVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLS 290

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDS---DDKGFTCQQC 204
           +   F C C RC    + D         G+ C+      +G LL ++   D   + C+QC
Sbjct: 291 QTKFFQCVCERCKDPTELD-----TFTSGFYCQKCPQQSAGILLTENPLDDASDWICRQC 345

Query: 205 GLVR----SKEEIKKIASEVNILSKKTL 228
           G  +    + E I+ +  E+  L K ++
Sbjct: 346 GARQPANYAAEIIESVGKEIVALKKGSV 373


>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Oreochromis niloticus]
          Length = 725

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 74/322 (22%), Positives = 128/322 (39%), Gaps = 57/322 (17%)

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 111
           A N+    N E+R + T ++P +S++NHSC PN  LVF                      
Sbjct: 405 ATNSLVQSNREIR-IATAVFPALSLLNHSCCPNTSLVFNTGGIIEPSGSALSADFSEGEA 463

Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 163
                 +G    VRA + +  G E+L  Y   +   +T  R++ L+EQY F C C  C +
Sbjct: 464 EWRHKAQGVTVTVRAAKLITPGQEILHCYGPHSSRMATQERRRLLQEQYYFLCQCEACSQ 523

Query: 164 LGQFDDIQESAILE-GYRCKDDG-----CSGFLLRDSDDK--GFTC--QQCGLVRSKEEI 213
             Q  D++E    + G   +D G     C G L + S+D+  GF C    C    S  ++
Sbjct: 524 HQQ--DVEEPRQQQPGAEGRDLGLLCIKCKGSLKKRSEDREAGFVCSLSSCSHCMSLSDV 581

Query: 214 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR-------E 266
                E+ +  +K + L       E +       +L KK      V L +T        +
Sbjct: 582 NNRLQEIRVNLEKAVELMETDRPGEAL-------RLLKKTLCQSGVILAETHPLQGELAD 634

Query: 267 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 326
              +    + DW  A ++ + +       + +    LG Q +   +L +  G    A+  
Sbjct: 635 ATARAYASMGDWNNAASHLERSTVAIASQFGKESVELGRQLFKLAQLHFNSGSRGPALSV 694

Query: 327 MTEAVEILRITHGTNSPFMKEL 348
           + +   +L +  G +   ++EL
Sbjct: 695 IPKVRRLLCLHCGPHCHELQEL 716


>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
          Length = 359

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 89/191 (46%), Gaps = 32/191 (16%)

Query: 2   LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVN 53
           L+L      +      ++TT+N +  E L   M+ +        +E  L   A+ + L+ 
Sbjct: 143 LRLIRWEAFRKFRATHTSTTENDTTDEFLATGMTLLGRILRFTQEECTLFTVARWSELLG 202

Query: 54  LIL-----QWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 94
            +L     + P  + + + E   +L C   T+            EL+ L     G G+Y 
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVQRLPCGESTMDAFEQEVQMAGKELQHLLQSSRGQGVYT 262

Query: 95  VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
           V  + NHSC PN  +V+    +  L+VV A++++  G E+ ISYI+ + S   RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRNIELGEELCISYIDESLSYPERQQELYE 321

Query: 151 QYLFTCTCPRC 161
            YLF C CP+C
Sbjct: 322 HYLFVCQCPKC 332


>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 363

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GL+ + S  NHSC PN  + F   + AV+ A++ + KG E+ ISY ++      RQ  L 
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338

Query: 150 EQYLFTCTCPRCIK 163
           + Y F C CP+C +
Sbjct: 339 DNYGFNCDCPKCTQ 352


>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
 gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
          Length = 785

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 7/118 (5%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           + H   N E   +G  +Y V S+ NHSC PN    F  R   +R  Q V  G ++ +SY 
Sbjct: 462 SVHLTSNVEQVRVGKAIYKVGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYG 521

Query: 136 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
              G      RQ  LK++Y F C C  C ++   D      +L  + C +  CSG +L
Sbjct: 522 PQVGLWDCKDRQSFLKDEYAFHCQCTGCSEVNLSD-----IVLNAFHCVNPNCSGAVL 574


>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 481

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 12/156 (7%)

Query: 42  LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL---GTGLYPVISI 98
           L++ A      + +LQ   + ++EI E   +L  N     ++  R     G+ LY   ++
Sbjct: 288 LIMEALSGRATSDVLQG--LDVDEIVEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAM 345

Query: 99  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
            NHSC P+ V  F+G+   +RA++ +  G E+ ++YI     T  RQ  L++ Y F C C
Sbjct: 346 FNHSCFPSVVRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRC 405

Query: 159 PRCIKLG-QFDDIQESAILEGYRCKDDG--CSGFLL 191
            RC   G + +   E  +L    C +D   C G L+
Sbjct: 406 QRCEARGAEVERYMEECLL----CPNDKRECDGLLV 437


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           K+  NA    ++++  +G  L P +++ NHSC+PNA + F GR A++RA + +    E+ 
Sbjct: 169 KIQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADEEIE 228

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           ISY +       R++AL   Y FTC C RC
Sbjct: 229 ISYTDYNYPLSKRKEALAP-YFFTCECTRC 257


>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
          Length = 372

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 46/79 (58%), Gaps = 7/79 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYI---ETAGSTM 142
           G GLY + S  NHSC PNA + F     RLAV +A + +  G E+ ISY+   + A    
Sbjct: 281 GAGLYRIQSKCNHSCEPNAEVCFPNNNHRLAV-KACRDIAAGEEITISYLSQCQIARGCR 339

Query: 143 TRQKALKEQYLFTCTCPRC 161
           +RQ+ LKE YLF C C +C
Sbjct: 340 SRQQYLKENYLFHCCCSKC 358


>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
 gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
          Length = 392

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 267 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 322

Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 323 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 363


>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
 gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
          Length = 400

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 50/96 (52%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
           + E + K+A N+  I +  +  +G G+YP  S+INHSC  N + +F G    +RA + + 
Sbjct: 170 MEEIYGKIASNSFAILDENMCSIGIGVYPQASMINHSCKSNCIGMFYGPQIQIRANEFIR 229

Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            G ++   YI     T  RQ+ L + Y F C C  C
Sbjct: 230 PGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADC 265



 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 52/104 (50%)

Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
           H  +V +++  + L+++ +EL  W++A+ Y +   P Y+    ++    GL Y       
Sbjct: 281 HSDNVYVIRILKDLVEVCVELGLWEKAVVYGKRIEPGYKMFLKRYDLDQGLLYQKMALAY 340

Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
           + LGD   A+  + +A   L IT G +S  ++E+   L E  A+
Sbjct: 341 YRLGDQSAALPYLRQAKTTLTITSGEDSGLVQEVSDMLAECVAK 384


>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
          Length = 399

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 46/98 (46%), Gaps = 17/98 (17%)

Query: 91  GLYPVISIINHSCLPNAV-----------------LVFEGRLAVVRAVQHVPKGAEVLIS 133
           GL+P  + INHSC PNA                  L  EG   ++R+V  + +G  + IS
Sbjct: 132 GLWPQAAAINHSCRPNATHYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITIS 191

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
           Y+E       RQ+AL+  Y F CTC RC +    D  Q
Sbjct: 192 YVELGDPWPVRQEALRTGYGFACTCIRCTEEAALDSEQ 229


>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 300

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 187 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 246

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 247 RHKILRENYLFNCSCPKC--LAEADD 270


>gi|414870819|tpg|DAA49376.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
          Length = 489

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 129/308 (41%), Gaps = 56/308 (18%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
           S++R +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G   
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229

Query: 143 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD---- 184
              RQK+L+E Y F+C C  C +L   D +  S          AI E   YR K++    
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSDLVMNSFCCPQSNCLGAISESTYYRSKENFVNV 289

Query: 185 -----------------------GCSGFLLRDSD-----DKG--FTCQQC-GLVRSKEEI 213
                                    +  LL++S      D G   +C+ C  L  +    
Sbjct: 290 SLGGSYVCKLSLPDVSKVDMDMENVAKSLLQNSGVSLNIDHGCCMSCRSCIDLSSALATS 349

Query: 214 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 273
            +  S ++ L K T       +   +    + +++L KKL HP+S  L Q  + + +   
Sbjct: 350 HREESTIDRLKKHTFL-----DKTLITEALQSLKQL-KKLRHPYSKALAQAEDTIAEAFA 403

Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
           ++ D + A  +C+ +I + +++Y   H ++  +      +   LGD  +A  +  +A  I
Sbjct: 404 KVGDQELARKHCEASIQILEKLYHPKHIIIAHELIKLVSILLSLGDGASAAATFAQAEAI 463

Query: 334 LRITHGTN 341
             + +G++
Sbjct: 464 FSLYYGSH 471


>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLYP+ S+ NH C+PN    F+ +  + VRA   +  G E+ +SY      T  R++ L 
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 206
               F+C C RC    +F+  + SA+L    C  D CSG LL  +       + C +C +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNS-KLSALL----CASDKCSGELLPKNPLNMRTSWICNKCSI 328

Query: 207 VRSKEEIKKIAS 218
             +  +I  I S
Sbjct: 329 TVNHRQICSIRS 340


>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
           vitripennis]
          Length = 386

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 7/88 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
           G+GLY + S INHSC PNA + F   + RL VVRA++ +    E+ I+Y++      S  
Sbjct: 288 GSGLYVLQSAINHSCAPNASVEFPHSDSRL-VVRALRDIKPDEEICIAYLDECHLERSRH 346

Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDI 170
           +RQKAL   YLF C C +C +     D+
Sbjct: 347 SRQKALSSLYLFVCKCDKCQQQASDPDV 374


>gi|428180005|gb|EKX48874.1| hypothetical protein GUITHDRAFT_105498 [Guillardia theta CCMP2712]
          Length = 197

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 22/158 (13%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           NAH I N   + +G G++P   ++NH+C  NAV  F            VP GAEV  + I
Sbjct: 7   NAHPITNGSFKRIGLGIFPSACMLNHACNYNAVYSF------------VPGGAEVHANAI 54

Query: 136 E----TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY-RCKDDGCSGFL 190
           +    T+   + RQK ++E Y F C C  C      +    S +   +  C    C   +
Sbjct: 55  DFGLNTSSQLIRRQKLIREGYFFQCACEWCAGTSGPERYFNSLVCNPWPDCDGHKCEAAI 114

Query: 191 LRDSD-DKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
           L D++ D+G     C   + +  I    +E+N L+K  
Sbjct: 115 LPDANGDEGQAVVICDACKRRHPI----TEMNALNKSA 148


>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 467

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/213 (24%), Positives = 93/213 (43%), Gaps = 23/213 (10%)

Query: 19  TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
           T  ++Y  + +L +H ++I+  +  L   IA+ +    +  +IS  EI      L  N H
Sbjct: 158 TNPESYKKLISLESHCNEINNSKEPL--NIAHFIKRFFKADDISEEEIVTIIGILQVNGH 215

Query: 79  TICNSELRPLG----TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLI 132
            +      PL       +Y + S+I H+C  N    F   G L ++RA   + KG  + I
Sbjct: 216 EV------PLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHISI 268

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
            Y +    T  R+  L +   F C C RC    +F       +    +C    CSG++L 
Sbjct: 269 CYTDPLWGTANRRHHLLKTKFFECICNRCQDPTEF-----QTMFNALKCNKINCSGYVLP 323

Query: 193 DS---DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
            +    ++ + C+ C  + S  EI+K+  ++ I
Sbjct: 324 KTFLEQEQDYVCKTCESIVSCTEIEKMLEDIGI 356


>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
 gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
          Length = 311

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 19/159 (11%)

Query: 22  DNYSLVEALVAHMSDIDEKQL-----LLYAQIANLVNLILQWP-EISINEIAENFSKLAC 75
           D Y  +E L++ +   D+ +L        A++A     +L      S++E      +L  
Sbjct: 148 DGYDALEGLMSGVDGGDDGELPHDAVATIAEVAKQARFLLAASCRCSMDECVRTLGRLQL 207

Query: 76  NAHTICNSEL------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLA-VVRAVQ 122
           N   +  SE             RP+G G++P  S  NHSC PN    F+G    VV   +
Sbjct: 208 NGFEMTASEPEEGADEAEGGGHRPVGVGVFPSASYTNHSCAPNCAQRFDGHGCIVVETAR 267

Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            V  G E+ I Y++       R++ L++ + F C C RC
Sbjct: 268 DVRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306


>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 576

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEG--RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           +G GL+  +S+ NHSC PNA + +    + A V A++ + +  +VL+SYI+      +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478

Query: 146 KALKEQYLFTCTCPRC 161
           + LK  Y FTC C RC
Sbjct: 479 RLLKRHYKFTCACVRC 494


>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 303

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
           paniscus]
          Length = 304

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 419

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
           C     C+ ++  +G  L  V ++INHSC PNA +  EG    VR+++ +  G E+ +SY
Sbjct: 103 CKIARYCSKDV--IGYALDMVTAMINHSCAPNAFVTLEGCQLRVRSLKPIAAGEEITVSY 160

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
            +       RQK LKE Y F C C  C
Sbjct: 161 ADPTLPVFNRQKFLKETYFFDCRCKGC 187


>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
 gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
          Length = 625

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 11/106 (10%)

Query: 59  PEISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFEGRLAV 117
           PE+   E+   F++LA N   +  S +R   + G+YP  S +NHSC PNA  +  G    
Sbjct: 282 PELPPQEV---FARLAAN--VVKTSAVRNANSVGVYPFPSFLNHSCAPNACKLMVGHTMF 336

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTR--QKALKEQYLFTCTCPRC 161
           +RA + +  G EV + Y +    TM +  + A+ +++ F C CPRC
Sbjct: 337 IRAARDLVAGEEVFMKYFDV---TMPKPERSAVAKRWGFECACPRC 379


>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
          Length = 680

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 5/111 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           LG G+Y  ++++NHSC P  +  F G + VVRA++    G  V  +Y  I T    + RQ
Sbjct: 476 LGAGVYSTVALLNHSCEPGVIRHFIGDVMVVRAIKSFQPGEMVNENYGPIFTQKRRVDRQ 535

Query: 146 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
           ++LK++Y F C C  C +      ++ E A+   +RC D  C   L+  SD
Sbjct: 536 RSLKDRYWFDCRCNPCTENWPLIGEMTEEALC--FRCADRRCRKPLVVQSD 584


>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
           mellifera]
          Length = 387

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
          Length = 400

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 287 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 347 RHKILRENYLFNCSCPKC--LAEADD 370


>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
 gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
          Length = 515

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 72/173 (41%), Gaps = 11/173 (6%)

Query: 48  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
           I   +    +  + S  EI      L  N H +  S+  P    ++ V S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMRTVGALQINGHEVPTSD--PPHVAVFYVASFTENSCLPNL 246

Query: 108 VLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
              F      ++ A Q + K + + I Y +    T  RQ+ L +  LF C C RC     
Sbjct: 247 AKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRCQ---- 302

Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
            D  +        +C+D  CSG +L    D    G+ C++CG    K+ ++ I
Sbjct: 303 -DPTELGTNYSAIKCEDRKCSGLMLPTKADDWHGGWQCKECGKQVPKQYVEGI 354


>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
          Length = 604

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 10/117 (8%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 202
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCE 583


>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
           africana]
          Length = 417

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
 gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
 gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
          Length = 302

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
           florea]
          Length = 387

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
           troglodytes]
          Length = 299

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 61/127 (48%), Gaps = 17/127 (13%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
             GL+P+  +INHSC PN      G  +A VRA++ + +G E+ +SYI+       R+K 
Sbjct: 208 AVGLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKE 267

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCS------GFLLRDSDDKGF 199
           L+E   F C C RC        + ES    LEG+ C    CS       +LL   +DK  
Sbjct: 268 LRETKHFDCQCGRCAA-----PLSESVDRYLEGFCCP--RCSVMASGKDYLLAQVEDK-L 319

Query: 200 TCQQCGL 206
            C  C L
Sbjct: 320 VCSDCQL 326


>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
 gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 416

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 362

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 428

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           N+  + N E + +G GL+   S INHSC PN V +   R   ++++  +  G E+ ISYI
Sbjct: 154 NSFCLTNGEEQVIGIGLFDYASFINHSCCPNCVPLQNKREMSIKSLTSIKSGEEIFISYI 213

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRC 161
           +   S   R+K L+ ++ F+C C  C
Sbjct: 214 DITESFERREKELR-KWHFSCGCSLC 238


>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
 gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
          Length = 530

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 22/181 (12%)

Query: 43  LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
           +L A +   +N +L    WPE+ I  IA   + L  N   +     R     LYP  ++I
Sbjct: 199 VLRANLITFINTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPEAAMI 255

Query: 100 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
           +H C+PN    F+  + +V  A + + KG  + ISY +   ST+ R+  L++   F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315

Query: 159 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 212
            RC    +LG F   Q        +CK    +G ++  +   +   + CQ C   RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISQNPLLNSAPWKCQLCNFKRSAKE 366

Query: 213 I 213
           +
Sbjct: 367 V 367


>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
 gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
          Length = 573

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 2/122 (1%)

Query: 40  KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
           K + L A  A +   +LQ  E+ + +  E   K+  NA    +++    G  L   ++++
Sbjct: 181 KDMELQATAAVVYAGLLQGEEV-LEKAREILCKIQTNAFNRLDADTGMAGIFLDVGLAMV 239

Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
           NHSC+PNA + F+ R A++RA + + +G E+ ISYI+       R +AL+  Y F C C 
Sbjct: 240 NHSCVPNAFIGFDKRTAILRAERPIQEGEEITISYIDNTLPKAARYEALR-LYHFQCDCV 298

Query: 160 RC 161
           RC
Sbjct: 299 RC 300


>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 749

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/214 (21%), Positives = 89/214 (41%), Gaps = 10/214 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 145
           + T +YP  S++NHSC PN +  F G+  + +A + +  G EV   Y       +   RQ
Sbjct: 460 IATAIYPSASMMNHSCDPNIINSFLGQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQ 519

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQ 202
           + ++ QY F C C  C    +++DI +       +C +  CSG L  +          C 
Sbjct: 520 EKMESQYCFKCNCAAC-SAPEYEDILKK--FTAKKCPE--CSGPLNDNCSVLHRSSIHCM 574

Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
            CG    +        +     +K        N+ E +   K   +L+K   + +  ++ 
Sbjct: 575 DCGTAVYENIYDFTLRQAQHYFEKAEICIENENYDEALDKLKKCLRLRKDALYKYHDDIA 634

Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
            T + + K+   +  W  +++Y + +I   +  Y
Sbjct: 635 DTMDFIAKVYAIIGQWLNSISYLEHSIVAIEEKY 668


>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
 gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
          Length = 379

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 4/75 (5%)

Query: 91  GLYPVISIINHSCLPNAV----LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           G+YP  ++ NH CLPNA     +  +    ++R +  VP+G E+ +SY        TRQK
Sbjct: 209 GIYPKAALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQK 268

Query: 147 ALKEQYLFTCTCPRC 161
            L+E Y F C C RC
Sbjct: 269 RLREDYGFICDCDRC 283


>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
 gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
          Length = 391

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 137
           +  S L   G+ LY   S INHSC PNA  VF     +  +RA + +  G E+ ISY++ 
Sbjct: 279 VVGSFLNNEGSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDE 338

Query: 138 AG---STMTRQKALKEQYLFTCTCPRC 161
                S  +RQK LK+ YLF C C +C
Sbjct: 339 CNLQRSRHSRQKTLKDYYLFICQCEKC 365


>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
 gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
          Length = 746

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TG++   +  NH+C+PNA   F G + +VRA + +P GAE+ + Y   A    +R+   K
Sbjct: 503 TGMWLHAAYANHTCIPNATRAFIGDMMIVRAARDIPAGAEIFMGYASLAEPFESRRSKFK 562

Query: 150 EQYLFTCTCPRC 161
             Y F C C  C
Sbjct: 563 TSYGFECDCEMC 574


>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 402

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           LG GL+  +S+ NHSC PNA +   F    A V A++ + +  E+L+SYI+ +     RQ
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQVTALRDIAEKEEILLSYIDESLPLAERQ 322

Query: 146 KALKEQYLFTCTCPRC 161
           + LK  Y FTC C RC
Sbjct: 323 RLLKSHYKFTCGCQRC 338


>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
 gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
          Length = 348

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
           G++P +S  NH C PN      G      AV+ +P G E+ ISYI    S  TRQ  L+E
Sbjct: 173 GVFPEVSKFNHDCRPNVHYRLNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE 232

Query: 151 QYLFTCTCPRC 161
            + FTCTCP+C
Sbjct: 233 -WGFTCTCPQC 242


>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
 gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
          Length = 417

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 679

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/267 (22%), Positives = 118/267 (44%), Gaps = 21/267 (7%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--S 140
           S+ R + T +YP +S++NH+C+PN +  F   +  VRA + +  G E+   Y    G  +
Sbjct: 410 SQFR-VATAVYPTVSMMNHACIPNIIPSFRKGILCVRATKKIMHGDEIQHCYGPQVGHMT 468

Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
           T  RQ+AL  QY FTC C  C +  +  D   +  ++  +C          +  +     
Sbjct: 469 TSDRQQALLNQYCFTCRCRACTRDPKSRD---NLCMKCPKCG---------QPLNAMTSI 516

Query: 201 CQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
           C +C      LV  KE    + +   ++     A+      ++V+S  K    +  ++  
Sbjct: 517 CGKCAVWIDVLVLLKEVDDTMITFAELVEMYPAAVKDVTLMRDVISKTKTCHDVLDRILI 576

Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
           P  ++L      + K  + L+++KEA  +    IP  +  + +    +  + Y   ++ +
Sbjct: 577 PPHMHLAVAYNFMAKCHIALQEFKEAAPWLAKAIPTIECRFGRDSIEVANELYKLAQIYF 636

Query: 316 FLGDTENAIKSMTEAVEILRITHGTNS 342
              +   A++++  A+E+  I H  NS
Sbjct: 637 NGKEIAPAMEAIDRALELF-IRHYGNS 662


>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
          Length = 1624

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/373 (22%), Positives = 152/373 (40%), Gaps = 59/373 (15%)

Query: 1    MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLIL-- 56
            M ++ LR + + +N    T       ++  ++H+ +I   ++    + ++ NL+   L  
Sbjct: 1278 MARIILRWRRELNNPSSETLLGTRRQLKDCMSHLQEIKRDRVRGGAFVEMLNLLKQFLTA 1337

Query: 57   -QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
             Q   +S  +I E F  +  N   I N +    G  LY   S+I+HSC PN    F+G+ 
Sbjct: 1338 AQLEGVSDLDILEIFGIMCVNTIHISNDD-DSFGCALYLAPSLIDHSCYPNLTATFKGQK 1396

Query: 116  AVVRAVQHV-PKG-AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
             V++ ++   PK  A++ ++Y+    +   R+K L+E+Y FTC C  C            
Sbjct: 1397 IVLKVLRPCEPKTVADLSLAYMPVCTTKERRRKTLREEYYFTCECEMC------------ 1444

Query: 174  AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
                       G    +L ++D                   K+  EV  L K +L L+S 
Sbjct: 1445 ----------SGKVPEVLSEAD------------------PKLTDEVLELEKLSLDLSSP 1476

Query: 234  GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL--EDWKEALAYCQLTIPV 291
             NH++ +   +  E L  KL      ++ + R  L+     +    +  A  Y   ++PV
Sbjct: 1477 ENHRKALKGVE--ELLSTKLKDLDDSDVAKFRAILVAADASVCASSYDRAYNYYSRSLPV 1534

Query: 292  YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA-------IKSMTEAVEILRITHGTNSPF 344
             +RV+ +       +     +L   +  T          I  + EAV + RI  G +   
Sbjct: 1535 MKRVFTENKAEYAYKLVRLARLSTIVVKTSQDPEVFGPLIALLQEAVRVTRIALGEDHSD 1594

Query: 345  MKELILKLEEAQA 357
             K++ L  EE +A
Sbjct: 1595 TKDVALLYEELRA 1607


>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 687

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G G+YP ++++NHSC P  V  F G   ++RAV+ +  G E+  +Y  I T      R+
Sbjct: 490 IGGGVYPTVAMLNHSCNPGVVRYFVGTTMILRAVRTISAGEEISENYGPIFTTTPENERK 549

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLL---RDSDDKGFTC 201
           + L+ QY F C C  C   G +  ++E    +  ++C+     G +L    D+++    C
Sbjct: 550 RRLRVQYWFDCNCEACT--GHWPLLEELDPTVLRFKCETGPSCGNVLPIRSDTNEFMIGC 607

Query: 202 QQCG 205
            +CG
Sbjct: 608 VKCG 611


>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 435

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 322 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 381

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 382 RHKILRENYLFICSCPKC--LAEADD 405


>gi|294886191|ref|XP_002771602.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239875308|gb|EER03418.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 265

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 9/98 (9%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           GT +YPV S+ NHSC PN      G L   R    V +G E+ ISYI      +   KA+
Sbjct: 17  GTAVYPVASMFNHSCTPNVTRASLGDLTWFRTCTDVKRGQELTISYI--GSDLLCEPKAV 74

Query: 149 KEQYL---FTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           ++++L   F+C CP C K    +D  +S ++  +  +D
Sbjct: 75  RQKHLARDFSCNCPACTK----EDDDDSVVIMPFTLQD 108


>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 650

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 107/257 (41%), Gaps = 36/257 (14%)

Query: 22  DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---------SK 72
           D+Y  +  LV H      +       +A L+N  L        + AE+F           
Sbjct: 364 DDYRKIYKLVTHEETRSAEDFFQRTLMATLLNACLTLGGFYKTKEAESFIGGLLLHNLQL 423

Query: 73  LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
           L  NAH I  SEL+            +G GLYP +++ NHSC P     ++G    VR+V
Sbjct: 424 LQFNAHEI--SELQREDDRDVGKSVFIGGGLYPTLALFNHSCEPGVTRYYKGNSVCVRSV 481

Query: 122 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEG 178
           + +  G+ +  +Y  + T      R+  L  QY F+C C  C +    F ++ ++ +   
Sbjct: 482 RSIAAGSMIGENYGPLFTQTPRDERRATLLNQYRFSCNCRACSENWPLFSEMDDTVL--- 538

Query: 179 YRCKDDG---CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
            R K DG   CS  LL  S+   F   C  CG   +  +  K+  +   + K    + S 
Sbjct: 539 -RFKCDGGKICSNVLLIPSEINEFMIKCTDCGEHTNIMKGLKLVQDTETMFKLATKMHSA 597

Query: 234 GNHQEVVSTYKMIEKLQ 250
           G  +  +  YK +E++ 
Sbjct: 598 GEIEAAL--YKYVEEMN 612


>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
 gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
          Length = 565

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 38/199 (19%)

Query: 91  GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           GLYP+  ++ H C+PN    F   +G     +A +++ KG  +  +Y  +   T  R+  
Sbjct: 263 GLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKGEHLTTTYTHSLWGTQQRRDH 322

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC---KDDGCSGFL-----LRDSDDKGF 199
           LK    F+C+C RC      D  +    L   +C     + C+G+      L+++ D  +
Sbjct: 323 LKANKYFSCSCARCA-----DPTELGTYLSALKCMGIDGNACAGYQLPIDPLKETSD--W 375

Query: 200 TCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL-- 253
            C QC +    +++     KI  EV+             +  E  S+ K +E L  KL  
Sbjct: 376 KCNQCPITIESDQVNFLLSKIGEEVD-------------DAMERKSSVKQLEDLISKLLT 422

Query: 254 -YHPFSVNLMQTREKLIKI 271
             HP    L+Q +  LI++
Sbjct: 423 FLHPNHHFLLQLKHSLIQM 441


>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           impatiens]
          Length = 386

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY   S INHSC+PNAV+ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           RQKAL   YLF C C +C  L Q +D
Sbjct: 347 RQKALSSLYLFHCYCNKC--LSQVND 370


>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 419

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 47/98 (47%), Gaps = 1/98 (1%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQH 123
           E+ E  SK+  N   + + +L  +G  LY   S  NHSC PN      EG+     A   
Sbjct: 278 ELMEYISKIESNTFGMWSDKLVSMGMVLYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHD 337

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +P G EV I YI+    T +R+  L   Y FTC CP C
Sbjct: 338 IPAGDEVCIRYIDVDKPTTSRRSELLSHYHFTCMCPLC 375


>gi|392558999|gb|EIW52184.1| hypothetical protein TRAVEDRAFT_75273 [Trametes versicolor
           FP-101664 SS1]
          Length = 721

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/72 (50%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 96  ISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 153
           IS INHSC PNAV  FE       VRA+  +P GA+V ISYI+ A     RQ+AL   Y 
Sbjct: 179 ISRINHSCSPNAVYRFEPATLTFEVRALSPIPSGAQVFISYIDPALPRAKRQEALS-SYG 237

Query: 154 FTCTCPRCIKLG 165
           FTC C  C   G
Sbjct: 238 FTCACTACALTG 249


>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
          Length = 416

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERSRHS 362

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
          Length = 656

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TGL+   S +NH+C+ NAV  F G L V RA +H+  G E+  +Y E A     RQ AL+
Sbjct: 509 TGLWVRASYVNHACVANAVKDFAGDLMVFRAARHIRAGEEIFHAYDENADYD-ARQAALR 567

Query: 150 EQYLFTCTCPRC 161
           + + F CTC  C
Sbjct: 568 KTWGFACTCALC 579


>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
          Length = 420

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1240

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 117/277 (42%), Gaps = 30/277 (10%)

Query: 88   LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 146
            +   ++PV S++NHSC  N +L + G    +++++++ K  E+   Y   A     R++ 
Sbjct: 893  IAYAIFPVSSLLNHSCDNNTILQYNGSSITIKSLKNISKNDEITGCYGPHAFHLELRERL 952

Query: 147  -ALKEQYLFTCTCPRC-IKLGQFDDIQESAILE--GYRCK-----------------DDG 185
              LKE+Y F C C  C  K+G       +  LE  G +CK                 D  
Sbjct: 953  ECLKEEYFFICRCHACNAKIGPNLLRCPTQSLESGGEQCKGTLLERVDTNSLGQEEDDSY 1012

Query: 186  CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV----NILSKKTLALTSCGNHQEVVS 241
              G    D + + F C  C L   K +   + S+      I S  T  L    +HQ    
Sbjct: 1013 PEGASYDDYNGRKFVCSICSLELGKMDCFMLTSQSVMSGAIFSNATKLLIDTEHHQPPRE 1072

Query: 242  TYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYP 297
              +M+ K   ++K++YH  S  + +  + L +  +  + +KEA+ Y +  I  ++ R+  
Sbjct: 1073 VEEMLFKCLEIRKQIYHVKSRKIGEVYDALSRYYIAKDKYKEAVTYAERAINNIFLRLGH 1132

Query: 298  QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 334
                 L  ++     +    G+   AIK++  +  +L
Sbjct: 1133 SHSTELAREWAKLANIYINAGEPAKAIKAINTSESLL 1169


>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
 gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
          Length = 188

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 66  IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 62  IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 117

Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 118 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 158


>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
          Length = 391

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S +NHSC+PNAV+ F       V++A++++  G E+  SY++      S  +
Sbjct: 295 GSGLYVLQSSVNHSCIPNAVVEFPYSNNTLVLKAIRNIKVGEEICTSYLDECQLERSRHS 354

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQ+AL   YLF C C +C
Sbjct: 355 RQQALSSLYLFVCHCDKC 372


>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
           angustifolius]
          Length = 357

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 62/143 (43%), Gaps = 24/143 (16%)

Query: 91  GLYPVISIINHSCLPNAV---LVFEGRL--------AVVRAVQHVPKGAEVLISYIETAG 139
            +YP  S+ NH C+PNA     V    L         V+R +Q +P+G EV ISY   + 
Sbjct: 212 AIYPKASMFNHDCIPNACRFDYVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISR 271

Query: 140 STMTRQKALKEQYLFTCTCPRCI-----------KLGQFDDIQESAILEGYRCKDDGCSG 188
              TR++ L + Y F+C C RC             + ++ D+     +  Y C    C+G
Sbjct: 272 DYCTRKRILMDDYGFSCECDRCKIEANWPHDCQNYVEEYSDLPHVRFIAKYVCDRKNCNG 331

Query: 189 FLL--RDSDDKGFTCQQCGLVRS 209
            L    D+      C  CG ++S
Sbjct: 332 TLAPKDDAHTNVLECNFCGNLKS 354


>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 419

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
           porcellus]
          Length = 416

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
 gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
          Length = 391

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 91  GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
           G+YP  S+ NH CLPNA   F+            VVR +  VP+G E+ +SY     +  
Sbjct: 220 GIYPKASLFNHDCLPNACR-FDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYS 278

Query: 143 TRQKALKEQYLFTCTCPRC 161
           TR+K L E Y FTC C RC
Sbjct: 279 TRRKRLLEDYGFTCDCDRC 297


>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
          Length = 415

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
           boliviensis boliviensis]
          Length = 419

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
          Length = 421

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
 gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
          Length = 394

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 16/171 (9%)

Query: 1   MLKLYLR--RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV-NLILQ 57
           M +LYL   +  Q ++    TT D +  +  ++   S      +L  +Q   +V +L L 
Sbjct: 199 MEQLYLAFCQAFQGEDFARFTTPDAFKTLMGIMGTNSQGIATSVL--SQWVTIVSDLPLS 256

Query: 58  WPE-ISINEIAEN-FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 113
            PE + +++  +N ++K+   A    N+E    G+GLY + S INHSC+PNA   F    
Sbjct: 257 DPEKVQLDQEIDNLYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSN 312

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
            + V++A+  + +G E+ ISY++      S  +R K L   Y+F C CP+C
Sbjct: 313 DIVVLKALSPIQEGDEICISYLDECQLERSRHSRHKILCGNYIFICQCPKC 363


>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           paniscus]
          Length = 420

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
          Length = 419

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Nomascus leucogenys]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
          Length = 419

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
          Length = 648

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           +G+  +PV+S++NHSC PN V +  G + VV+ +  + +G E+L +Y     +   R++ 
Sbjct: 479 IGSAAFPVVSLMNHSCNPNVVHLCYGDVMVVKVIHRIARGEEILDNYGYHYATHEKRERQ 538

Query: 148 LK--EQYLFTCTCPRCIK 163
           LK  +QY F C C  C++
Sbjct: 539 LKLCQQYYFRCRCQSCVE 556


>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           troglodytes]
          Length = 415

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 5 [Callithrix jacchus]
          Length = 431

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 320 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 379

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 380 RHKILRENYLFVCSCPKCL 398


>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
          Length = 395

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYI---ETAGSTMT 143
           G+ LY + S  NHSC+PNA + F    A V   A+Q + +  E+ ISY+   +   S  +
Sbjct: 295 GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQENEEICISYLGECDIGRSRHS 354

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           RQK L+E YLF C C +C    Q DD
Sbjct: 355 RQKILRENYLFNCNCMKC--QSQCDD 378


>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
 gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
          Length = 507

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 25/199 (12%)

Query: 92  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           LYP   I+ H+C+PN   ++   EG    +RA+  +  G  +  SY  T   T  RQ+ L
Sbjct: 207 LYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLKDGQPLHHSYTYTLDGTAQRQRHL 266

Query: 149 KEQYLFTCTCPRC---IKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           K+   F+CTC RC    +LG  F  ++     EGY+               D  + C  C
Sbjct: 267 KQGKYFSCTCERCQDPTELGTHFSSLKCGQCTEGYQVPRQP-------TDSDSSWNCNNC 319

Query: 205 GLVRSKEEI----KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G   S+ E+    + + SEVN +    +        + ++  Y       K L HP    
Sbjct: 320 GASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEAERLLRKY-------KSLLHPLHFI 372

Query: 261 LMQTREKLIKILMELEDWK 279
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
 gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
 gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
          Length = 128

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 51  ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110

Query: 150 EQYLFTCTCPRC 161
               F C CPRC
Sbjct: 111 MTKHFICQCPRC 122


>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|449550207|gb|EMD41172.1| hypothetical protein CERSUDRAFT_91931 [Ceriporiopsis subvermispora
           B]
          Length = 329

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 90  TGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           +G +P++S INH C P+A     FE     + A + +P G E+ I+Y       + RQ  
Sbjct: 138 SGCFPILSRINHDCSPSANYFFAFEPYCGQLWAARDIPDGGEITIAYTTLPAPRVDRQAH 197

Query: 148 LKEQYLFTCTCPRC 161
           L E+Y FTCTC  C
Sbjct: 198 LSERYFFTCTCVAC 211


>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 415

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Ailuropoda melanoleuca]
 gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
          Length = 417

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
          Length = 407

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 293 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 352

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 353 RHKILRENYLFVCSCPKCL 371


>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
           mulatta]
 gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
 gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
          Length = 417

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 438

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 48/86 (55%), Gaps = 10/86 (11%)

Query: 82  NSELRPLGTGLYPVISIINHSCLPNA-VLVFEGRLAV----VRAVQHVPKGAEVLISYIE 136
           N E R  G G+Y   S  NHSC PN    V E  L V    +RAV+   KG E+ ISYI+
Sbjct: 324 NGESR--GCGVYVRNSFFNHSCNPNVNYWVVENTLEVECSLMRAVR---KGEELCISYID 378

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCI 162
           TA S   R++ L E YLF C C +CI
Sbjct: 379 TAASLRDRREKLSEGYLFHCRCEKCI 404


>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
          Length = 416

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
           [Strongylocentrotus purpuratus]
          Length = 396

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 7/79 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETA---GSTM 142
           G+ LY + S  NHSC+PNA + F      L++V A+  + +G EVLISY++      S  
Sbjct: 288 GSALYKLQSCCNHSCVPNAEISFLHNNSTLSLV-ALTDITEGQEVLISYLDECCKERSRH 346

Query: 143 TRQKALKEQYLFTCTCPRC 161
           +RQK L+E YLF+C C +C
Sbjct: 347 SRQKELRENYLFSCDCSKC 365


>gi|307183932|gb|EFN70520.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 421

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 59/126 (46%), Gaps = 9/126 (7%)

Query: 86  RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 141
           RP  LG  +YP ++  NH C P     F GR  V+RA++ +  G  +  +Y  I T  + 
Sbjct: 218 RPVYLGVAIYPTVARFNHDCYPAVTRYFVGRHIVIRAIRGLRPGDVIAENYGPIFTKRTL 277

Query: 142 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD--DKG 198
             RQ+ L  +Y F C+C  C +   +F+ +   ++    RC   GC+G   R     DK 
Sbjct: 278 AERQRTLAGRYWFRCSCRACQEDWPRFETLTNDSVR--LRCPTTGCNGLHSRPQQRPDKP 335

Query: 199 FTCQQC 204
             C  C
Sbjct: 336 IKCSAC 341


>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
          Length = 513

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/339 (21%), Positives = 137/339 (40%), Gaps = 42/339 (12%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
           V+ + AH     +K+     Q+ N+V  I +  ++ ++  +E   +  C    I   E+R
Sbjct: 151 VKDMEAHNKIRSQKKQWKLDQV-NIVEYIRK--QLKLDRFSEKEIQTVCGILEINTFEVR 207

Query: 87  PL----GTGLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
                    +YP ++++NHSC+ N     +   ++ RL   R    +P G E+  SY  +
Sbjct: 208 TAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPPGGELYGSYTHS 264

Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSD 195
              TM R++ L E   F C C RC      D  +    +   +C   D+G    L     
Sbjct: 265 LLPTMLRREHLLEGKHFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDP 319

Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
           +  + C  C    S   ++K+   +++      A++   +  + +   + + K  + + H
Sbjct: 320 ESSWKCTHCEFSTSGSAVRKVLQIIHMEVDAVEAISG-ADGADAIQERETVAKKYRSVLH 378

Query: 256 PFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGL 305
           P    L   R  L ++   +ED+          +  +  C+L + V   + P +  + G+
Sbjct: 379 PRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCKLLLQVLDVIEPGYSRIRGM 438

Query: 306 QYYTCGKLEWFLGDTE--------NAIKS-MTEAVEILR 335
             Y       F   T+         A+KS M EA +IL+
Sbjct: 439 TLYELHAPLLFTAKTQWNAEVIDKAALKSKMIEASKILK 477


>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
           caballus]
          Length = 417

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
 gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
 gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
 gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
          Length = 412

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 298 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 357

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 358 RHKILRENYLFVCSCPKCL 376


>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
          Length = 418

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 501

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 10/91 (10%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVL----VFEGRLAVVRAVQHVPKGAEVLISYI 135
           +   ++ P    ++P+ + INHSC+PNA +      + R+ VV A++ +  G E+ ISYI
Sbjct: 408 VVEDKVAPEIVAVFPLTARINHSCVPNAQVQSQEFVDARIDVV-ALRDIAAGEEITISYI 466

Query: 136 ---ETAGSTMT--RQKALKEQYLFTCTCPRC 161
               T+GS  T  R++ L  +YLFTC CPRC
Sbjct: 467 GCGRTSGSKSTSRRRRELLAKYLFTCECPRC 497


>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
 gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
          Length = 529

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 105/225 (46%), Gaps = 28/225 (12%)

Query: 19  TTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKL 73
           +  + ++ + AL  H+ +  E  L  +L A +   +  +L    WPE+ I  IA   + L
Sbjct: 171 SNPEAFARLAALEDHLKERIETPLYQVLRANLITFIKTVLGQRDWPELEILRIA---AIL 227

Query: 74  ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLI 132
             NA  +  +  R     ++P  ++I H C+PN    F+  + ++  A + +PKGA + I
Sbjct: 228 DTNAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFLAKRPIPKGAILSI 287

Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGF 189
           SY +   ST+ R+  LK+   F C C RC    +LG F      A L G +CK     G 
Sbjct: 288 SYTQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTF----AGAHLCG-KCK----VGK 338

Query: 190 LLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
           ++  +   +   + CQ C + +S  E+     ++  E+  L K T
Sbjct: 339 VISQNPLENAANWHCQLCNVKKSAREVLTQDARLQQEIESLDKTT 383


>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           terrestris]
          Length = 386

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 5/87 (5%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY   S INHSC+PNAV+ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           RQK L   YLF C C RC+      DI
Sbjct: 347 RQKVLSSLYLFHCYCNRCLSQVNNPDI 373


>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 359

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 84/174 (48%), Gaps = 32/174 (18%)

Query: 19  TTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVNLIL-----QWPEISINE 65
           +TT+N +  + L   M+ +        +E  L   A+ + L+  +L     + P  + + 
Sbjct: 160 STTENDTTDDFLATGMTLLGRILRFTQEECTLFTVARWSELLGAVLLNGQERSPPSNYDR 219

Query: 66  IAENFSKLACNAHTI---------CNSELRPL-----GTGLYPVISIINHSCLPNAVLVF 111
           + E   +L C   T+           +EL+ L     G G+Y V  + NHSC PN  +V+
Sbjct: 220 LKELVRRLPCGESTMDAFEQEVQMAGNELQHLRQSSRGQGVYTVGCLFNHSCEPNLQVVY 279

Query: 112 ----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
               +  L+VV A++ +  G E+ ISYI+ + S   RQ+ L E YLF C CP+C
Sbjct: 280 SESGDETLSVV-ALRDIEPGEELCISYIDESLSYPERQQELYEHYLFFCQCPKC 332


>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
 gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
          Length = 302

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285

Query: 150 EQYLFTCTCPRC 161
               F C CPRC
Sbjct: 286 MTKHFMCQCPRC 297


>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 442

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           G G+Y V  + NHSC PN  + +       L VV A++ V  G E+ ISYI+++     R
Sbjct: 357 GQGIYEVGCLFNHSCDPNLSVQYSSLNDETLTVV-ALRDVKAGEELTISYIDSSLPFAVR 415

Query: 145 QKALKEQYLFTCTCPRCIKLGQFD 168
           Q+ L + YLF C CPRC+  G  D
Sbjct: 416 QQQLLDHYLFECRCPRCVAEGTTD 439


>gi|443704468|gb|ELU01530.1| hypothetical protein CAPTEDRAFT_197951 [Capitella teleta]
          Length = 614

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/370 (21%), Positives = 147/370 (39%), Gaps = 68/370 (18%)

Query: 21  TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-----------IAEN 69
           + NY+ V  L+ H   +  + L+ YA  A L+   L   E   N            +  +
Sbjct: 260 SRNYTAVYHLLPHNESMLREDLIQYALTAALLLKCLDHTEYFKNHSPDFRFSIGGLLLRH 319

Query: 70  FSKLACNAHTI-------CNSELRP-----LGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
             +L CNAH I       C+  +       + T +YP  S++NHSC P +++  +  L  
Sbjct: 320 ICQLVCNAHAITRLEQGLCSQAVVECQQVRIATAIYPTTSLLNHSCDP-SIIARKNEL-F 377

Query: 118 VRAVQHVPKGAEVLISYIETAGS--TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
           VR V+ V  G E+   Y           RQ+ L+ QY F C C  C      ++     +
Sbjct: 378 VRLVKDVKAGEEIFNCYGPHYARMPKKERQEVLQSQYFFKCDCSECTAEEPLEN-----L 432

Query: 176 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
           L+ YRC+   CS  ++     +   C +C  +    ++ ++++     S   ++   C  
Sbjct: 433 LKAYRCQK--CSHAIISTGTQEVLKCSKCVTMV---DMNQLSAAEKQSSDDFMSSLKCLQ 487

Query: 236 HQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
            +++    K ++    ++K +YH           +  K+L E +D   ALA C      +
Sbjct: 488 VEDIDGAIKKLKNSLDIRKMIYH-----------RNHKLLTEAKD---ALARCYCIKGQF 533

Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDT--------------ENAIKSMTEAVEILRITH 338
            R  P     L       G     L +               + A++++ ++++ILR+ +
Sbjct: 534 TRAVPLLKDCLLCVETAFGSRSIELANELHKYAEVLVNARKLDEALETVNQSIDILRLNY 593

Query: 339 GTNSPFMKEL 348
           G      KE+
Sbjct: 594 GPEYSTAKEM 603


>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
 gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
          Length = 514

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 48  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
           I   +    +  + S  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCACERCVDVTE 306

Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
            D    SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 LDTFY-SAI----KCEDRQCGGLMLPTKTDDWNGNWRCRECHKQVQKHYVERI 354


>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
          Length = 633

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 520 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 579

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+CP+C  L + DD
Sbjct: 580 RHKILRENYLFICSCPKC--LAEADD 603


>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 421

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 90/194 (46%), Gaps = 17/194 (8%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLYP+ ++ NH C+PN    F+ +  + V AV  +  G E+ +SY +    T +R++ LK
Sbjct: 147 GLYPMGALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLK 206

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD---SDDKGFTCQQCGL 206
               F+C C RC      D ++  + L    C  D CSG LL     + +  + C +C +
Sbjct: 207 VTKRFSCNCNRC-----SDPLEFGSQLSTLLCAKDNCSGCLLPRNPLNHESSWICDKCQI 261

Query: 207 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
            V  ++ I+ I S +N      +  T     +E++   +  E +  +L    +  +M  +
Sbjct: 262 SVNYRQVIECIHSGLNTFVSDAMYKTP----REIL---RFTEAILSRLVPATNYIMMDMK 314

Query: 266 EKLIKILMELEDWK 279
            ++I     + D K
Sbjct: 315 YRIISYFGRVPDLK 328


>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
          Length = 675

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 112/267 (41%), Gaps = 38/267 (14%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA-------- 67
           I     ++Y  V  LV H +    + L     +A L+   L   + + ++ +        
Sbjct: 373 IDRLPVNDYRKVYQLVTHEATRTPEDLFHRTLMATLLVSCLTLSDYAPDQASCEYLGGLM 432

Query: 68  -ENFSKLACNAHTICNSEL---RP--------LGTGLYPVISIINHSCLPNAVLVFEGRL 115
             N   L  NAH I  SE+   +P        +G GLYP +++ NHSC P     + G  
Sbjct: 433 LHNLQLLQFNAHEI--SEMIREKPSDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQ 490

Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESA 174
             VR V+++  G         T      R++ L  QY FTC C  C++    F D+  S 
Sbjct: 491 VCVRTVKNIAAG---------TRVRREERRETLLNQYRFTCYCEPCMQNWPLFSDMDPSI 541

Query: 175 ILEGYRCKDDG-CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
           I   +RC+    CS  LL  S+   F   C +CG   +  +  K   + ++L +    L 
Sbjct: 542 IR--FRCEGGKICSNVLLIPSEINDFMVKCTECGEHTNIMKGLKSLQDTDMLFQTATRLH 599

Query: 232 SCGNHQEVVSTY-KMIEKLQKKLYHPF 257
           S G ++  +  Y +M+  + + L  P+
Sbjct: 600 SAGEYEAALMKYVEMMAIMSEVLVPPY 626


>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 606

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 14/118 (11%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
           S  ++    S+   N  T+ +  L P+G  + P +++ NHSC PN V+VF          
Sbjct: 248 SAGDLVGLISRFTTNTFTLTSYTLSPVGICISPAMALTNHSCDPNVVIVFPRSSKKVTKE 307

Query: 112 -EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
            + +L  +R++ H     E++ +Y++       RQ ALKE Y FTC C  C   G  D
Sbjct: 308 PQMQLIALRSILHE---EEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLCKNTGPTD 362


>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
          Length = 386

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI-- 135
           +  S L   G+GLY + S +NHSC PNA+  F       V++A++ +  G E+  SY+  
Sbjct: 278 VVGSFLNNEGSGLYILQSSVNHSCTPNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDE 337

Query: 136 -ETAGSTMTRQKALKEQYLFTCTCPRC 161
            E   S  +RQKAL   YLF C C +C
Sbjct: 338 CELERSRYSRQKALSSLYLFVCHCDKC 364


>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Otolemur garnettii]
          Length = 300

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F        V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFICSCPKCL 266


>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 718

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 63/144 (43%), Gaps = 17/144 (11%)

Query: 76  NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
           NAH +  + L         RPL  G  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 491 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 550

Query: 125 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 181
             G  V  +Y  I T  S   R++ L  +Y F C C  C +   +F+ +    +    RC
Sbjct: 551 RPGDTVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 608

Query: 182 KDDGCSGFLLRDSD-DKGFTCQQC 204
             +GCS    R  D  +   C  C
Sbjct: 609 PTEGCSKLHSRPRDPSRSIECSSC 632


>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
           P131]
          Length = 582

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299

Query: 148 LKEQYLFTCTCPRC 161
           L++   F C C  C
Sbjct: 300 LRQTQFFHCECTTC 313


>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
           Y34]
          Length = 581

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299

Query: 148 LKEQYLFTCTCPRC 161
           L++   F C C  C
Sbjct: 300 LRQTQFFHCECTTC 313


>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
          Length = 385

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S +NHSC+PNA + F       V+RA++ +    E+  SY+E      S  +
Sbjct: 287 GSGLYILQSAVNHSCMPNASVEFPYSNNTLVLRAIRDIQPEEEICTSYLEECELERSRYS 346

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQKAL   YLF C C +C
Sbjct: 347 RQKALSSLYLFVCHCDKC 364


>gi|281206858|gb|EFA81042.1| hypothetical protein PPL_05877 [Polysphondylium pallidum PN500]
          Length = 425

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG------RLAVVRAVQHVPKGAEVLISYIETAGSTM 142
           G GLYP+ S +NHSC PN  +  E          V RA +++  G E+L +Y +    T 
Sbjct: 346 GIGLYPIFSCMNHSCQPNVEICNERTDGVTFNKVVFRAKKNIKAGQELLNNYCDVTLPTK 405

Query: 143 TRQKALKEQYLFTCTCPRC 161
            RQ  LK QY F C C +C
Sbjct: 406 ERQSQLKSQYDFICKCNKC 424


>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 643

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TGL+   + INHSC+PNA   F G L V+R+++ + KG E+  SY E+ G    RQ AL 
Sbjct: 496 TGLWVRAAYINHSCVPNAKREFVGDLMVIRSLRKIKKGEEIFHSYDES-GDYEARQAALM 554

Query: 150 EQYLFTCTCPRC 161
             + F C C  C
Sbjct: 555 TTWGFECGCALC 566


>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
           [Oreochromis niloticus]
          Length = 417

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 7/87 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  + A+  + +G E+ ISY++      S  +
Sbjct: 303 GSGLFLLQSSCNHSCIPNAEASFPDNNFLLHLSALCDINQGEEICISYLDCCQRDRSRHS 362

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
           R K L+E YLF C+CP+C+   Q D++
Sbjct: 363 RHKILRENYLFVCSCPKCV--SQMDEL 387


>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
 gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
          Length = 392

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
           G+GLY + S INHSC+PNA   F     + V++A+  + +G E+ ISY++      S  +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQEGDEICISYLDECMLERSRHS 345

Query: 144 RQKALKEQYLFTCTCPRC 161
           R K L+E Y+F C C +C
Sbjct: 346 RHKVLRENYVFICQCSKC 363


>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 140

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 4/99 (4%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEV 130
           +  CN   I +     +  G+YP  ++ NHSC PN +  F  G     RA++ +  G E+
Sbjct: 41  RFPCNNFAIVDDLWSGIAAGVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEI 100

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
             SY+E    +  R+  L   Y F C C RC   G+ DD
Sbjct: 101 CHSYVELTLPSWKRRDVLLRDYEFLCECERC---GKRDD 136


>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
          Length = 776

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 86/393 (21%), Positives = 158/393 (40%), Gaps = 73/393 (18%)

Query: 13  DNV-IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ--------WPE--- 60
           DN+ I  T TD +  +  LV + + +D    + YA  A ++ L L+         P    
Sbjct: 367 DNIQIYKTGTDPFYRMFNLVTNFNKMDNTDYIQYALTATMLTLYLENFTSFFDYLPSKMP 426

Query: 61  ISINE----------IAENFSKLACNAH-----------------TICNSELRPLGTGLY 93
            S++E          I  +  +L CN H                 TI   E+R   T +Y
Sbjct: 427 CSMSESQLKLFAAAVILRSMGQLVCNGHATLSLAVVEEDDGRNGKTITEKEVR-RATAIY 485

Query: 94  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQ 151
           P  +++NHSC PN +  F     +VR  + +P G EV   Y           R+KALK Q
Sbjct: 486 PSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYGPHRARAPAAARRKALKAQ 545

Query: 152 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 211
           Y+FTC C  C    + D +   ++   Y C+   C G +          C QC      E
Sbjct: 546 YMFTCHCADCNDTERKDFV---SLFSAYLCQ--SCKGPVWAHCVRP--LCTQCRSALHLE 598

Query: 212 EIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 270
               +    + L+ +   + S     +++ ++Y    +L+++++H    +L    ++L +
Sbjct: 599 RAHTLLDRADDLATQAEQVVSLEERCEKMAASY----RLKQQVWHRHHASLRMAADRLAR 654

Query: 271 ILMELEDWKEALAYCQLTIPVYQRVYPQF-----HPLLGLQYYTCGKL----------EW 315
           +  +  D+ +++   +  I   +  +  F     H L  L      ++          EW
Sbjct: 655 LYADTGDFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSDVMLERILNSPQHLEYREW 714

Query: 316 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
            L     A K + +A++++ + +G+  P +  L
Sbjct: 715 CL----EAHKVVKKAIQLMELNYGSWEPLVSRL 743


>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 322

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTR 144
           RP+G G+YP  ++ NH C PNA   F+    V V   + V KG E+ I Y++       R
Sbjct: 241 RPIGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREER 300

Query: 145 QKALKEQYLFTCTCPRC 161
           +  L++ + F C C RC
Sbjct: 301 RGKLRKNFAFECACARC 317


>gi|395329994|gb|EJF62379.1| hypothetical protein DICSQDRAFT_104880 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 810

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           TGLYP+ S+ NHSC+ NA+    G + ++RA + VP G E+ I Y     S   RQ  LK
Sbjct: 556 TGLYPLASLFNHSCVANAIWYCIGDVMIIRAAEPVPAGTEITIPY-SVEESYFARQSVLK 614

Query: 150 EQYLFTCTCPRC 161
           +  L  CTC  C
Sbjct: 615 KHMLEHCTCWLC 626


>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
          Length = 125

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 43/66 (65%)

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
           +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY F 
Sbjct: 16  MSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75

Query: 156 CTCPRC 161
           C C RC
Sbjct: 76  CDCFRC 81


>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
          Length = 359

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 5/77 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           G G+Y V  + NHSC PN  +V+    +  L+VV A++ +  G E+ ISYI+ + S   R
Sbjct: 257 GQGVYTVGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPER 315

Query: 145 QKALKEQYLFTCTCPRC 161
           Q+ L E YLF C CP+C
Sbjct: 316 QQELYEHYLFVCHCPKC 332


>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
          Length = 389

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S +NHSC+PNAV+ F       V++A++ +  G E+  SY++      S  +
Sbjct: 294 GSGLYVLQSSVNHSCVPNAVVEFPYSNNTLVLKAIRDIKVGEEICTSYLDECQLERSRHS 353

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQ+AL   YLF C C +C
Sbjct: 354 RQQALSSLYLFVCHCDKC 371


>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 181

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 1/75 (1%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           LG+GLY  +++ NHSC PNA   F G R   V++++ V  G EV  SYI+     + RQ 
Sbjct: 91  LGSGLYRGVAVTNHSCSPNAEASFGGSRCLRVKSLRPVHAGEEVFQSYIDENLPLVERQS 150

Query: 147 ALKEQYLFTCTCPRC 161
            L++ Y F C C +C
Sbjct: 151 KLRQAYGFACRCGKC 165


>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
           harrisii]
          Length = 609

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 496 GSGLFVLQSCCNHSCVPNAETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERSRYS 555

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 556 RHKILRENYLFVCSCPKCL 574


>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
 gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
          Length = 1033

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 11/156 (7%)

Query: 65  EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQH 123
           EI +    L  N H +  ++  P    ++   S   +SCLPN    F +    ++ A + 
Sbjct: 725 EIMKAVGALQINGHEVPTTD--PPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPRE 782

Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
           + K A + I Y +    T  RQ+ L +  LF C C RC+ + +  D   SAI    +C+D
Sbjct: 783 IKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTEL-DTNYSAI----KCED 837

Query: 184 DGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
             C G +L    D  +  + C++C     K  +++I
Sbjct: 838 RQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 873


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 61/127 (48%), Gaps = 16/127 (12%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
             GL+ +  +INHSC PN      G   + VRA++ + +G E+ +SYI+       RQK 
Sbjct: 162 AVGLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKE 221

Query: 148 LKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS-------GFLLRDSDDKGF 199
           L++   F C C RC   L +  D     +L+G+RC    CS        +LL   +DK  
Sbjct: 222 LRDTKHFDCQCERCSTPLSESVD----RVLDGFRCP--RCSVKASEEENYLLAQVEDK-L 274

Query: 200 TCQQCGL 206
            C  C L
Sbjct: 275 VCPDCQL 281


>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 742

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 80/183 (43%), Gaps = 31/183 (16%)

Query: 18  STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ--------WPEISI----NE 65
           S    N   +  L+ H+ D+   +L  Y+  A L+ + LQ         P + +    NE
Sbjct: 355 SQANSNNDQIYNLLTHIDDLKSLELYQYSLTATLLLIYLQKKTDFFEKHPNLVLDSVGNE 414

Query: 66  IAENFSKLACNAHTICN---------------SELRP-LGTGLYPVISIINHSCLPNAVL 109
           +  + ++L CN + I                  E +P +GT ++P  S++NHSC PN   
Sbjct: 415 LLHHMTRLVCNGNAISTHMLSDYDSGSRTPIIDESQPRIGTAIFPTSSLLNHSCDPNIFS 474

Query: 110 VFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
               +  V++A + + +G E+   Y         + RQ +LK QY F C C  C+   Q 
Sbjct: 475 SNILKYVVIKASRDISEGEEITNCYGPNFLRMRVVDRQASLKNQYHFDCECNTCLD-PQA 533

Query: 168 DDI 170
           DD+
Sbjct: 534 DDL 536


>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
 gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
 gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
          Length = 429

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 14/164 (8%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV-QHVPKGA 128
           F K++ N+  + NS   P+G GL   +S  NHSC P   + +  R A++  V   +P   
Sbjct: 147 FQKVSINSFVVGNSTGNPIGVGLCIKLSAANHSCKPLTRVCYRNRTAMLVPVSSELPSTL 206

Query: 129 E-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
           E    SYI+       R+  LK++Y F C C  C+      D   +A +E + C    C 
Sbjct: 207 EGACHSYIDELMPRDMRRDTLKKKYKFLCQCDGCL------DEDRNARMEAWTC--GICV 258

Query: 188 GFLLRDSDDKGFTCQQCGLVRSKE--EIKKIASEVNILSKKTLA 229
              +R+ ++    C+ CG   SK+  E+ + A E  I ++  LA
Sbjct: 259 KGWMRNKENG--QCELCGWTMSKDHFELCRTAEEAGIAARSRLA 300


>gi|344303810|gb|EGW34059.1| hypothetical protein SPAPADRAFT_65242 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 619

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 2/90 (2%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLIS 133
           N + + + +  P+G GL P  S+INHSC PN +L  +   +    V  VP  K +E+ ++
Sbjct: 226 NHNILLDQDNEPIGIGLDPDFSLINHSCAPNCILKHKQNWSGFELVNTVPIVKNSELTVT 285

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           YI+T      R   L+ +Y F C C  C K
Sbjct: 286 YIDTCFPKELRMLDLRSRYFFNCKCELCSK 315


>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
          Length = 374

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
           G  LYP  +  NHSCLPN     +G+  +V  A + + KG E +I+Y +  T     +RQ
Sbjct: 261 GMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQ 320

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
              +EQ+ F CTC RC+K    ++ +E+   +    K + C       SD++G
Sbjct: 321 NHTQEQFQFKCTCERCLK----EEAEENIEFQELELKLEPCPISSATTSDNEG 369


>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 737

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/288 (21%), Positives = 108/288 (37%), Gaps = 66/288 (22%)

Query: 72  KLACNAHTICNSELRP-------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
           +L CN H I    +               + T +YP  S++NHSC PN +  F+ +  +V
Sbjct: 422 QLICNGHAITRLNISDSESGNVVTEYQCRIATAIYPSASMMNHSCDPNIINSFKDQYLIV 481

Query: 119 RAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
           +A + +    EV   Y           RQ AL+ QY FTC C                  
Sbjct: 482 KATKDIAAKEEVFNCYGPHYRRMRKKDRQIALQNQYCFTCEC------------------ 523

Query: 177 EGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
                  + C+   L++  DK   F C++C               V I+S  ++    C 
Sbjct: 524 -------EACTQRALQNFSDKFQRFNCEECN------------GPVEIISHSSMRCLDCE 564

Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
              ++V +  +  +   KL+    +NL   +             KEAL   +  + + +R
Sbjct: 565 TTFDLVKSQLLELEEANKLFEAAKINLKSQKV------------KEALENAKQCLEIRKR 612

Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
           +  ++H  + L Y   GK+    G   ++I  +  ++  +    G +S
Sbjct: 613 ILYEYHESVTLTYDLIGKIFAVTGRWLDSISHLEHSLAAVEERFGPDS 660


>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 227

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 37/75 (49%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           +G  ++P  S  NHSC PN   +        R ++ V KG  + ISYI+T      R+  
Sbjct: 96  MGRAVFPAASYFNHSCFPNCQSIKHDHKMAFRTLKDVSKGEMLTISYIDTNMPVSARRAR 155

Query: 148 LKEQYLFTCTCPRCI 162
           L + Y F C C RCI
Sbjct: 156 LMDDYFFECMCERCI 170


>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
           B]
          Length = 454

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 6/93 (6%)

Query: 76  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAE 129
           N + I +S L P   G++P+ S   NHSC+PNAV+ +     E     V A++ + +G E
Sbjct: 184 NNNFIVHSHLYPYAHGIFPLASRTFNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEE 243

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           ++I Y++ A S   R+ AL+  Y F C+C  C+
Sbjct: 244 IVIPYLDPALSYAARRDALQTNYGFICSCALCV 276


>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
          Length = 418

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+G + + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGFFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|159465319|ref|XP_001690870.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158279556|gb|EDP05316.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1470

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 19/126 (15%)

Query: 61   ISINEIAENFSKLACNAHTICNSELR---PLG-TGLYPVISIINHSCLPNAVLVFEGRLA 116
            +++N + E+F  LA     +C  ELR   P G  GL+P  +   HSC P A     G   
Sbjct: 1174 VNLNCMGEDFQDLA-----LC--ELRGEAPRGHIGLWPEAAFAAHSCAPTATAYSIGDRL 1226

Query: 117  VVRAVQHVPKGAEVLISYIETAGSTMT-----RQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
            ++RA   +PKG EV ++++   GS +T     R+  L+ QY FTC C RC    +     
Sbjct: 1227 LIRAAAEIPKGGEVSLNFL---GSLLTSPLSVRRAELRSQYGFTCGCSRCAAEARHTGTP 1283

Query: 172  ESAILE 177
             +A++E
Sbjct: 1284 LAALVE 1289


>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
 gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
          Length = 660

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  I T      RQ  +KE Y F C C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREDRQAKMKELYWFECCCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
               F+D+    I   +RC+  + C+  +      +D    C  CG V +  +  K+  +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G++ + +  +  +I  +   L  PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636


>gi|159110558|ref|XP_001705534.1| Hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
 gi|157433620|gb|EDO77860.1| hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
          Length = 409

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 8/104 (7%)

Query: 67  AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAVV 118
           A+ F+   C    N+++I +   + +G GLY +IS  NHSC PNA ++F      R A +
Sbjct: 300 AQMFNYFVCILLTNSYSIEDKTGQEIGAGLYSLISCCNHSCTPNAQVIFGDSENAREATL 359

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             ++   +  E+ ISYI   G ++  ++    Q+ FTC C RC+
Sbjct: 360 VLLRPCAQKEELYISYITDLGRSVVERRRELAQWCFTCQCTRCL 403


>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
 gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
          Length = 660

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           I +LV   LQ+ + + +E+AE   FS                +G  +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
             V  F G    + +V+ +  G  +  +Y  I T      RQ  +KE Y F C C  CI 
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREERQAKMKELYWFECCCDACID 539

Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
               F+D+    I   +RC+  + C+  +      +D    C  CG V +  +  K+  +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597

Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
             ++++    L   G++ + +  +  +I  +   L  PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636


>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
          Length = 575

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)

Query: 92  LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LYP   ++ H CL N V  F+    G    VRA   + KG  +   Y      T  R++ 
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 201
           LKE   F+CTC RC      D  +  + L   RC     D C G+ L      D+  + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366

Query: 202 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
            +C +  +  E    I +I  EV+ +   +  +    N   ++S  KM+  L    YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421

Query: 258 SV 259
           SV
Sbjct: 422 SV 423


>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
          Length = 268

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 39/77 (50%), Gaps = 7/77 (9%)

Query: 92  LYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMTR 144
           +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         R
Sbjct: 79  VYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADR 138

Query: 145 QKALKEQYLFTCTCPRC 161
           Q+ L E Y F C C RC
Sbjct: 139 QRRLLEDYGFRCECDRC 155


>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 627

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 5/100 (5%)

Query: 69  NFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVP 125
           N  KL  NA +I + +   PL   LYP IS+ NHSC  N     V   R+ V++AVQ +P
Sbjct: 441 NARKLNINAPSISHQQFTFPLALTLYPTISLFNHSCDGNIKRSGVISDRIRVMKAVQPIP 500

Query: 126 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           KG ++  +Y  +       +RQ A  +++ F C C  CIK
Sbjct: 501 KGTQLCCNYGIMFKEHDKESRQSACNDRFNFNCYCDPCIK 540


>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Otolemur garnettii]
          Length = 416

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F        V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFICSCPKCL 382


>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
 gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
          Length = 387

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 137
           +  S L   G+ LY + S +NHSC PNA   F     +  +  VQ +  G E+ ISY++ 
Sbjct: 276 VVGSFLNNEGSALYSLQSKVNHSCAPNAECQFPHSNNVLALTTVQDIAAGEEICISYLDE 335

Query: 138 AG---STMTRQKALKEQYLFTCTCPRC 161
                S  +RQK L+E YLF C C +C
Sbjct: 336 CALERSRHSRQKMLRENYLFQCQCEKC 362


>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
 gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
          Length = 666

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 8/194 (4%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
           +G  +YP +++ NHSC P  V  F G    +  V+ V  G  +  +Y  I T      RQ
Sbjct: 468 IGGAIYPTLALFNHSCDPGVVRYFRGNTIHINTVRPVEAGLPINENYGPIYTQDKREDRQ 527

Query: 146 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTC 201
             LKE Y F C C  C++    FDD+    I   +RC+  + C+  +      +D    C
Sbjct: 528 ARLKELYWFECNCDACLENWPLFDDLPRDII--RFRCEAPNNCTAVIEVPPSCNDFMIKC 585

Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 261
             CG + +  +  K+  +  ++++    L   G++ + ++ +  + ++  ++  P   + 
Sbjct: 586 VTCGELTNILKGLKVMQDTEMMTRTAKRLYDTGDYAKALNKFVDLLRIMYEVLAPPFPDF 645

Query: 262 MQTREKLIKILMEL 275
            + ++ L    + L
Sbjct: 646 CECQQHLKDCFLNL 659


>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
          Length = 596

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 12/101 (11%)

Query: 73  LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
             CNAH +    L P          LG G+Y  +S+ NHSC P     F G   VVRA++
Sbjct: 377 FPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFYGDTCVVRAIK 436

Query: 123 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
            + KG +V  +Y  +    +   R   L+ QY F+C C  C
Sbjct: 437 TIRKGHQVSDNYGALYATNTLKERHDKLQPQYFFSCRCEPC 477


>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
           intestinalis]
          Length = 773

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 72/154 (46%), Gaps = 8/154 (5%)

Query: 34  MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF-SKLACNAHTICNSELRPLGTGL 92
           M+ I+  QL    ++   + L   + ++ IN  + +F ++  C+  T+   ++  + +  
Sbjct: 440 MTKIEPGQLTSMGEVIEYL-LHRHYLQVPINGQSISFVTEELCDNVTVTRRDI--VASAF 496

Query: 93  YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA--GSTMTRQKALKE 150
           +P +S++NHSC  N   +F G     R+ Q +P GAE+   Y  +    S   RQK LKE
Sbjct: 497 FPTMSMMNHSCDCNTDALFNGSTVTFRSNQFIPVGAEITHCYGPSVFHASFEERQKTLKE 556

Query: 151 QYLFTCTCPRCI--KLGQFDDIQESAILEGYRCK 182
            Y F C C  C   K G  D       L   +CK
Sbjct: 557 NYSFDCDCTPCAMHKSGNTDKEALHTQLRAMKCK 590


>gi|361129398|gb|EHL01305.1| putative SET domain-containing protein 5 [Glarea lozoyensis 74030]
          Length = 286

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 11/103 (10%)

Query: 69  NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA-----VV 118
           NF        +I  S   PLG      G++  IS INH CLPNAV  + G L       V
Sbjct: 49  NFPGQGNPLTSIIRSNGYPLGADAGCGGVFENISRINHGCLPNAVQNWNGLLGEEGEETV 108

Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            A++ + +G E+  SY+ + G++  R+  LK+ + F CTC  C
Sbjct: 109 YAIKDIKEGEEITTSYL-SGGTSKERRAVLKQSFGFDCTCKLC 150


>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
          Length = 563

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)

Query: 92  LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LYP   ++ H CL N V  F+    G    VRA   + KG  +   Y      T  R++ 
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311

Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 201
           LKE   F+CTC RC      D  +  + L   RC     D C G+ L      D+  + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366

Query: 202 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
            +C +  +  E    I +I  EV+ +   +  +    N   ++S  KM+  L    YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421

Query: 258 SV 259
           SV
Sbjct: 422 SV 423


>gi|320168174|gb|EFW45073.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 291

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 46/76 (60%), Gaps = 4/76 (5%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR---LAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           G+GL+ + + +NHSC PN V +       L+VV AV  +P   EV ISYI+T      RQ
Sbjct: 213 GSGLFEIGNTMNHSCQPNVVSMTRATDFTLSVV-AVATIPVNTEVCISYIDTDLPKAKRQ 271

Query: 146 KALKEQYLFTCTCPRC 161
            AL+E Y F+C+C +C
Sbjct: 272 AALEELYYFSCSCAKC 287


>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
          Length = 666

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 9/235 (3%)

Query: 48  IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           IA+L+   LQ  + + +EI+E  N ++        C+ +   +G GLYP +++ NHSC P
Sbjct: 423 IASLILRHLQILQFNSHEISELRNLNEEMVTNGIQCHYKSEYIGAGLYPTLALFNHSCDP 482

Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           + V    G   +VR ++ +  G  +  +Y  + T+     R+  L+ +Y F C C  C +
Sbjct: 483 SIVRYNIGNRMIVRTIKPIKAGEIIYENYGPLYTSMDADERRVTLQNRYWFECYCTPCQQ 542

Query: 164 -LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD--KGFTCQQCGLVRSKEEIKKIASEV 220
               F+ +  + I  G  C+ + C        DD    F C  C  V       K  S++
Sbjct: 543 EWPLFEYMDPNQIKIG--CQKENCPFEFTLYKDDLCPYFQCDYCDGVTKIFPSLKGLSQL 600

Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
            I+  K   L S G  +E +  +     +  K   P   ++++ +++L  + + L
Sbjct: 601 AIMLPKAEDLYSAGETREAMKLFMQSLDILYKYSRPPCPDMIKVQQRLKTLFVHL 655


>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
          Length = 439

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 26/149 (17%)

Query: 73  LACNAH----TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHV 124
           L CN H    T+       +G+ +Y   S+ NHSC PN   +      G L ++ ++  +
Sbjct: 261 LECNTHEIGITVDEYNYCSIGSAVYEKASLFNHSCQPNVCRINRAGEWGALEMI-SLTDI 319

Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------------- 170
             G E++ +YI+ +  T  RQ  L E Y F C C  C+                      
Sbjct: 320 AAGTELVYNYIQISLPTEDRQSKLSENYFFECKCNGCVNNTSISSRQKQQKQQQKQRQQK 379

Query: 171 ---QESAILEGYRCKDDGCSGFLLRDSDD 196
               + + L+ Y C  D C+G L+ D++D
Sbjct: 380 QSNSQKSFLKKYLCTRDTCNGVLITDAND 408


>gi|406863858|gb|EKD16905.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 739

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 6/91 (6%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           +GL+ + S INHSC PN    F G + +VRA + +P   E+ +SYIET       Q+ L 
Sbjct: 508 SGLWILASYINHSCEPNCRRAFIGDVQIVRAARDMPADTEITLSYIETDDPAKMNQR-LF 566

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           + + F CTC  C+     DD +  A ++  R
Sbjct: 567 DGWGFDCTCAMCV-----DDRETPAAVKNRR 592


>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
          Length = 513

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 59/271 (21%), Positives = 109/271 (40%), Gaps = 35/271 (12%)

Query: 91  GLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
            +YP ++++NHSC+ N     +   ++ RL   R    +P G E+  SY  +   TM R+
Sbjct: 216 AIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPVGGELYGSYTHSLLPTMLRR 272

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQ 203
           + L E   F C C RC      D  +    +   +C   D+G    L     +  + C  
Sbjct: 273 EHLLEGKNFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTH 327

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           C    S   ++K+   + +      A++   +  + +   + + K  + + HP    L  
Sbjct: 328 CEFSTSGSAVRKVLQIIQMEVDAVEAISG-ADGADAIQERETVAKKYRSVLHPRHAFLTM 386

Query: 264 TREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
            R  L ++   +ED+          +  +  C+L + V   + P +  + G+  Y     
Sbjct: 387 LRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCRLLLQVLDVIEPGYSRIRGMTLYELHAP 446

Query: 314 EWFLGDTE--------NAIKS-MTEAVEILR 335
             F   T+         A+KS M EA +IL+
Sbjct: 447 LLFTAKTQWNAEVIDKAALKSKMIEASKILK 477


>gi|391334364|ref|XP_003741575.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Metaseiulus occidentalis]
          Length = 434

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 69/141 (48%), Gaps = 5/141 (3%)

Query: 27  VEALVAHMSDIDEKQLLLYAQIA-NLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 85
           +  L  H+ D+ E   L   + A  + +L+  +    ++E+A    +L  N   + +   
Sbjct: 112 ISDLETHLEDLSEDDRLEVQEKAQRIASLMRAFTNTEVHEVALMLQRLRINMFQLSDHRA 171

Query: 86  RPLGTGLYPVISIINHSC--LPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGST 141
              G   Y  IS+++H+C    N VL F+GR  ++RA+++  V    +  I+YI+    T
Sbjct: 172 VTKGIACYLGISVVDHTCEDSGNFVLAFKGREIILRALKNFTVQDMGDFRINYIDATIPT 231

Query: 142 MTRQKALKEQYLFTCTCPRCI 162
             R++AL   Y F C C +C+
Sbjct: 232 EERRRALLAGYFFACQCAKCL 252


>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
          Length = 690

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 70/142 (49%), Gaps = 6/142 (4%)

Query: 25  SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC--- 81
           ++V+ L+ + S + E+ L L      +   +++  E+ + +  +    + CNA  +    
Sbjct: 439 AVVQKLLNNPSQV-ERILGLQGDYKGIGQKLIELDEVPVIDTFQIHDIVQCNAFGLGQQT 497

Query: 82  -NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
            + ++    TGL+   S INHSC+PNA     G L + RA + +  G E+  +Y E+  S
Sbjct: 498 EDEDISNASTGLWVRASYINHSCIPNAKKDLIGDLILFRATRRIASGEEITHAYDEST-S 556

Query: 141 TMTRQKALKEQYLFTCTCPRCI 162
              RQ A +  + F C CP C+
Sbjct: 557 YEARQAAFRRTWNFECRCPLCL 578


>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
 gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
          Length = 515

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 11/173 (6%)

Query: 48  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
           I   +    +  + +  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
              F +    ++ A   + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRCVDVTE 306

Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
             D   SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354


>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
          Length = 382

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GLY + S  NHSC PNA   F  G   + ++A++++  G E+ ISY++      S  +
Sbjct: 283 GSGLYQLQSACNHSCAPNAESSFPYGNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHS 342

Query: 144 RQKALKEQYLFTCTCPRCI 162
           RQ+ L E YLF C C RC+
Sbjct: 343 RQRELSENYLFVCCCERCM 361


>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
 gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
          Length = 430

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 8/117 (6%)

Query: 75  CNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
           CN+ +I N + + P+G+GLY  ++  NHSC   + +VFEG    +R  Q      E+ IS
Sbjct: 159 CNSFSIINEKRVEPIGSGLYVGVAKHNHSCASTSHVVFEGNQVFLRTNQE-EYSKELTIS 217

Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
           Y+     T  R+K ++  +  TC C  C K  + D I  S+     +C+   C+GF+
Sbjct: 218 YVSRMLPTSERRKTIRGVHFLTCQCEMC-KNEELDLIGLSS-----KCRTKNCTGFV 268


>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
 gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
 gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
 gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
 gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
          Length = 513

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 48  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
           I   +    +  + +  EI +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306

Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
             D   SAI    +C+D  C G +L    D+    + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADEWNGSWRCRECHKQVQKHYVERI 354


>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
           vinifera]
          Length = 405

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294


>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 691

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 3/80 (3%)

Query: 84  ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGST 141
           ++  + TG+YP  S++NHSC PN + ++  +  +VRAV+ +    E+  SY+ T     T
Sbjct: 403 QINIVATGIYPSASMMNHSCNPNIIKIYMDQYLIVRAVEDIFPTEEIFNSYVATYRYKKT 462

Query: 142 MTRQKALKEQYLFTCTCPRC 161
             RQK L E Y F+C C  C
Sbjct: 463 KARQKLL-ELYYFSCKCEAC 481


>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 712

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)

Query: 75  CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLIS 133
           CN   + +S+   +G  ++P  S  NHSCLPN   V + G +A   A+Q + KG  + I 
Sbjct: 559 CNNFGLFSSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIAAFYALQSIRKGEPLTIC 618

Query: 134 YIET-AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
           Y++    ST  R++ L   Y F C C RC      D   ++ + E   C      G+L
Sbjct: 619 YVDVQEASTAERRRTLLTSYRFFCECRRCHGYSDEDGKGDTVMSEIRFCDTCDARGYL 676


>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
 gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
          Length = 527

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 21/192 (10%)

Query: 49  ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
           ANL++ I       +WP++ I  IA   + L  NA  +  +  R    GL+P  ++ +H 
Sbjct: 200 ANLISFIKTVLGLREWPDLDILRIA---AILDTNAFEVRQAGDRIKVRGLFPGGAMFSHD 256

Query: 103 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           C+PN    F+  + ++  A + + KG  + ISY +   ST+ R+  LK+   F C C RC
Sbjct: 257 CVPNMRHRFDDDMNIMFLAKRPIAKGEILSISYTQQLRSTIQRRLHLKQVKCFECACARC 316

Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCGLVRSKEEI----KK 215
                 D  +       + C       F+  D   +   + C+ C L+R  +E      K
Sbjct: 317 A-----DPTELGTYAGAHMCAKCKVGKFISMDPLQNAANWRCEVCNLIRPAKEYLINDAK 371

Query: 216 IASEVNILSKKT 227
           I +E+  L K T
Sbjct: 372 IEAELESLDKTT 383


>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
          Length = 417

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 8/89 (8%)

Query: 91  GLYPVISIINHSCLPNAVLVFE-GRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            ++P  S +NH+C PNA  V +   L+  VR  + + +G E+ ISY      T TRQ  L
Sbjct: 234 AVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHL 293

Query: 149 KEQYLFTCTCPRC------IKLGQFDDIQ 171
            + + FTCTCPRC      I L   +D+Q
Sbjct: 294 AQGFHFTCTCPRCTSPITDITLSIINDLQ 322


>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 976

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 761 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKAGEEITFAYFDVL-SPLEKRKEM 819

Query: 149 KEQYLFTCTCPRC 161
            E + F C C RC
Sbjct: 820 AESWGFRCGCSRC 832


>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294


>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
          Length = 404

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)

Query: 91  GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 143
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 144 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 167
           RQ+ L E Y F C C RC    Q+                                    
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEG 333

Query: 168 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 213
                   DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
          Length = 704

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 80/174 (45%), Gaps = 27/174 (15%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQLL------LYAQIANLVNLILQWPEISINE---- 65
           + S  + +Y  V +L +H +D D    L      ++   + + +L    PEI  +E    
Sbjct: 420 LASYESRDYKTVFSLESHCADADPADNLQRAIHSVFFAKSLIYSLNYFHPEIDNHERHLH 479

Query: 66  -----IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-V 110
                +  N   + CNA+ I  +         E R +G  +Y  +S+ NHSC PN V   
Sbjct: 480 ILAVALLHNMQAIKCNAYEIVENVRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRHS 539

Query: 111 FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCI 162
           F     VV +++++P+G+E+L  Y +        +R++ L E+Y F C C  C+
Sbjct: 540 FPNGTVVVTSLRYIPEGSEILDCYGQHFLENKRDSRRRLLAEKYYFDCQCEPCL 593


>gi|145347416|ref|XP_001418163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578392|gb|ABO96456.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 639

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 42/67 (62%)

Query: 70  FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
            S+   N  TI + +++ +G G+YP  S+ NHS  PNA ++F+G+  VV+ ++ +  G E
Sbjct: 218 LSRFEINGFTIADDDMQRVGFGIYPEASLFNHSSTPNAQVMFKGKTLVVKTLREIAVGEE 277

Query: 130 VLISYIE 136
           + ISY E
Sbjct: 278 ITISYGE 284


>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
 gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
          Length = 398

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 3/77 (3%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETA-GSTMTRQ 145
           G+GLY + S  NH C PN  + F    A   V+A++++ +G E+ ISYI++   +   RQ
Sbjct: 298 GSGLYLLQSCCNHDCSPNVEINFLDNNATLTVKAIRNISEGQELCISYIDSDIKNWKKRQ 357

Query: 146 KALKEQYLFTCTCPRCI 162
             L E YLF CTC RC+
Sbjct: 358 AILMENYLFECTCNRCM 374


>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
          Length = 548

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 1/90 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           KL  NA +         G  L   +++INHSC+PNA++ F GR A +R+   +  G+E+ 
Sbjct: 183 KLQTNAFSRSEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRSASFIHPGSEIE 242

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           ISYI+       RQ  L   Y F C C +C
Sbjct: 243 ISYIDQTQPKSRRQHELS-LYHFECHCSKC 271


>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 499

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 19/167 (11%)

Query: 57  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 114
           +WP   ++EI      +  N H +  ++  P    +Y   S++ HSC+PN    F  +G 
Sbjct: 168 EWP---VDEILRVCGIVQVNGHEVPLTD--PPYVAIYDAGSMLEHSCVPNCSKSFTRDGH 222

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 174
           L +  A   V  G  + ISY +    T  R   L +   F C CPRC      D  +   
Sbjct: 223 LLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHLADTKFFVCKCPRC-----SDPTELGT 277

Query: 175 ILEGYRCKDDGCSGFLL-------RDSDDKGFTCQQCGLVRSKEEIK 214
              G +C  + C G+ L        D  D  + C  C  V     I+
Sbjct: 278 YFSGVKCATEDCMGYSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIE 324


>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
          Length = 490

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 9/145 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLYP+ ++ NH C+PN+   F+ +  + V A   +  G E+ +SY +    T TR++ L+
Sbjct: 216 GLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPIAAGEEITMSYTDLLWDTSTRRQFLR 275

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 206
               F+C C RC      D ++  + L    C  D C G LL  +    +  + C +C  
Sbjct: 276 ITKRFSCNCNRC-----SDPLEFGSRLSALLCAKDECLGHLLPRNPLNYESSWICDKCQT 330

Query: 207 VRSKEEIKKIASEVNILSKKTLALT 231
             +  +I+ I S +N      +  T
Sbjct: 331 SVNHRQIECIHSGLNTFVSNVMYKT 355


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 214 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGREVCLSYFPVNMNYSS 273

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y F C C RC
Sbjct: 274 RQKRLLEDYGFKCDCDRC 291


>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 779

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 74  ACNAHTICNSELRPLGT-------GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
           +C   T+ N+  RP G        G++P  +I+NH+C+ N+   F G L + RA + +P 
Sbjct: 459 SCPPLTVDNTRPRPEGVPEPPQTRGIWPHAAIMNHACVANSTRAFCGDLFITRASRDIPA 518

Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           G E+   Y+        RQ  L+  + F C C  C
Sbjct: 519 GDEITQQYVPVRADWGERQAQLRHWWGFECACALC 553


>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 647

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 11/102 (10%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAV-------VR 119
           S+   N+ T+ +  L  +G    P  + INHSC PNAV+VF    EG  +        V 
Sbjct: 277 SRFIDNSFTLTSIILDQIGVVFVPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVI 336

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A++ +  G E++ SYI++AG+   R+  L ++Y F C C  C
Sbjct: 337 AIKPIEPGEEIVTSYIDSAGTRQERRNELVKRYKFVCDCQAC 378


>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
          Length = 383

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           G+  Y + S++NHSC P+  + F     +A   A +H+ +G ++ +SY++       R++
Sbjct: 300 GSAAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQ 359

Query: 147 ALKEQYLFTCTCPRCIK 163
           AL   Y FTC CPRC++
Sbjct: 360 ALAWAYGFTCRCPRCVE 376


>gi|393233731|gb|EJD41300.1| hypothetical protein AURDEDRAFT_186645 [Auricularia delicata
           TFB-10046 SS5]
          Length = 293

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)

Query: 76  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 128
           N + + +S L PL   +YP  S  +NHSC  NAV +F        R+ VV  V+ V  G 
Sbjct: 83  NNNFVLHSHLTPLAAAVYPAASRSLNHSCASNAVPLFVFAPATPPRMEVV-LVRDVAPGD 141

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           E+ I YI+ A +   R++ L+  Y F C C RCI
Sbjct: 142 EITIPYIDPALAPSARRERLRASYGFECACARCI 175


>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
 gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
          Length = 507

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)

Query: 92  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           K+   FTC C RC+   +LG  F  ++     EGY+               D  ++C  C
Sbjct: 267 KQGKFFTCQCDRCLDPTELGTHFSSLKCGQCAEGYQVPRQPTEA-------DTSWSCASC 319

Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G   S  E    ++ + SEVN +    +        + ++  Y       K L HP    
Sbjct: 320 GSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEEVERLLRKY-------KSLLHPLHFI 372

Query: 261 LMQTREKLIKILMELEDWK 279
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Apis mellifera]
          Length = 444

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
           + TG+YP  SI+NHSC PN + +F  +  +VRA + + +G E+   Y       +T  RQ
Sbjct: 373 VATGIYPSASIMNHSCDPNIINIFVNQYLIVRASRDISQGEEIFNCYGPHYRHMTTENRQ 432

Query: 146 KALKEQYLFTC 156
           K LK QY F C
Sbjct: 433 KILKNQYCFIC 443


>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
           TFB-10046 SS5]
          Length = 373

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)

Query: 76  NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 128
           N + + +S L PL   +YP  S  +NHSC  NAV +F        R+ VV  V+ V  G 
Sbjct: 163 NNNFVLHSHLTPLAAAVYPAASRALNHSCATNAVPLFVFAPATPPRMEVV-LVRDVAPGD 221

Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
           E+ I YI+ A +   R++ L+  Y F C C RCI
Sbjct: 222 EITIPYIDPALAPSARRERLRASYGFECACARCI 255


>gi|256071345|ref|XP_002572001.1| set and mynd domain containing [Schistosoma mansoni]
 gi|350645143|emb|CCD60151.1| set and mynd domain containing, putative [Schistosoma mansoni]
          Length = 527

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 3/87 (3%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY---IE 136
           IC  +   + TGL+P +S++NHSC PN    FE    ++R ++ +  G EV   Y     
Sbjct: 384 ICKFQQIRIATGLFPCVSLLNHSCDPNTAHNFEESFLILRCLKPILPGTEVFHCYGPHYL 443

Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIK 163
              S++ R   L++QY F C C  C K
Sbjct: 444 HYPSSVQRVTLLQQQYFFICDCEHCSK 470


>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
 gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 101/228 (44%), Gaps = 28/228 (12%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENF 70
           + S+  + ++ + AL  H+ +     L  +L A +   +  IL    W E+ I  IA   
Sbjct: 169 LKSSDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA--- 225

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAE 129
           + L  N   +     R     L+P  ++I+H C PN    F+  + ++  A + + KG  
Sbjct: 226 AILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEI 285

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGC 186
           + ISY +   ST+ R+  L++   F C C RC    +LG F   Q        +CK    
Sbjct: 286 LSISYTQPLRSTIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTCV-----KCK---- 336

Query: 187 SGFLLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
           +G ++  +   +   + CQ C L RS +E+     K+  E+  L K T
Sbjct: 337 AGKIISVNPLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384


>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
 gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
          Length = 404

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 65/179 (36%), Gaps = 57/179 (31%)

Query: 92  LYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMTR 144
           +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         R
Sbjct: 215 VYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADR 274

Query: 145 QKALKEQYLFTCTCPRCIKLGQF------------------------------------- 167
           Q+ L E Y F C C RC    Q+                                     
Sbjct: 275 QRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEDEDDAGDRGDDGIEEEEGD 334

Query: 168 -------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 213
                  DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 335 GATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
 gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
          Length = 515

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)

Query: 46  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           A I   +    +  + S  E+      L  N H +  S+  P    ++   S   HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244

Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           N    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
            +      SAI    +C+D  C G +L    D     + C++C     K  I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354


>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
 gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
          Length = 515

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)

Query: 46  AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
           A I   +    +  + S  E+      L  N H +  S+  P    ++   S   HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244

Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
           N    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C C RC  +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304

Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
            +      SAI    +C+D  C G +L    D     + C++C     K  I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354


>gi|302836969|ref|XP_002950044.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
           nagariensis]
 gi|300264517|gb|EFJ48712.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
           nagariensis]
          Length = 430

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
           GL+P  ++ NHSC PNAV V  GR   +R    V  G EV I+Y      G    R+  L
Sbjct: 286 GLWPAHAMFNHSCAPNAVAVVAGRELHIRCSGPVSAGEEVCITYSGALGLGPLPLRRALL 345

Query: 149 KEQYLFTCTCPRC 161
           ++ + F CTCPRC
Sbjct: 346 EKNHRFRCTCPRC 358


>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
          Length = 786

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 84/309 (27%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           GTGLY + S  NHSC+PNA   F   L V+RA Q + +  E+ ISY   A S  +R+K L
Sbjct: 522 GTGLYLLASYFNHSCIPNAGHQFFSDLMVIRATQSIKQDEEITISYCSHA-SYASREKNL 580

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
           K  +   C C  CI                    DD  +G   R                
Sbjct: 581 KP-WFDQCDCQLCI--------------------DDRLAGSNRRS--------------- 604

Query: 209 SKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQT 264
           ++E + K + +    L +   AL      Q++VSTY   E+++ +L   YH FS  L Q 
Sbjct: 605 ARESLSKAVRNAATPLPQARDAL------QKMVSTYSSQERVRPELSTAYHAFSHRLQQM 658

Query: 265 REKLIKILMELE---------DW----------KEALAYCQL-TIPVYQRVYP------- 297
             + +      E         +W           E   +  L T+P+  R  P       
Sbjct: 659 AHEFVSKAAATEAIEKEIQSLEWLGVEVTEKGITEVSDHSHLPTLPISTRRIPNTFGEPD 718

Query: 298 -QFHPLLGLQYYTCG----KLEWFLGDTENAIKSMTEAVEILRITHG----TNSPFMKEL 348
            QF  + G+ +   G     + WF      + K++   + + + TH        P + E 
Sbjct: 719 MQFLMIFGM-FVLLGCPNRAMSWFRAAIWMSEKTIGGGIPMFKRTHSDIIRVLGPVLAEF 777

Query: 349 ILKLEEAQA 357
           + K+E  +A
Sbjct: 778 LSKIESERA 786


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 1/92 (1%)

Query: 72  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
           KL  NA +         G  L   +++INHSC+PNA++ F GR A +RA   +  G E+ 
Sbjct: 183 KLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIE 242

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           ISYI+       R   L + Y F C+C +C K
Sbjct: 243 ISYIDQTQPRGKRHGEL-DLYHFECSCYKCQK 273


>gi|281212379|gb|EFA86539.1| hypothetical protein PPL_00338 [Polysphondylium pallidum PN500]
          Length = 468

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           G G++ + ++INHSC+PN ++VFE    LA ++A++ +  G E+  SYIE +     R +
Sbjct: 373 GFGVFGLQAMINHSCVPNCLVVFEQGSSLAYIKALRDIVPGEELFHSYIEESAPFEERSQ 432

Query: 147 ALKEQYLFTCTCPRC 161
            L   Y F C CP+C
Sbjct: 433 ELV-TYGFNCDCPKC 446


>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
           Full=ASH1-related protein 2; AltName: Full=Protein SET
           DOMAIN GROUP 39
 gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 398

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296


>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
          Length = 445

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 104/231 (45%), Gaps = 29/231 (12%)

Query: 65  EIAENFSKLACNAHTICNS-------ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           E  +N   +A     +CNS        + P+G+G+Y  ++  NHSC     +VF+G  AV
Sbjct: 163 ESMDNNKMIAIICSVLCNSFGIIAEKRVEPIGSGMYVGLATHNHSCASTCHVVFDGNQAV 222

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
           +R+ +         ISY+     T  RQK+++  +  TC C  C+     +D+  + +  
Sbjct: 223 LRS-RDRQYCKNTTISYVSRMLPTFERQKSIRNVHFITCRCEMCLN----EDLDLTGL-- 275

Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLV---RSKEEIKKIASEVNILSKKTLALTSCG 234
             +C+   C GF ++ ++    TC +  ++   +S +   K+   +  L K         
Sbjct: 276 ASKCQTSKCQGF-VKGANGSCTTCGKPAMIPFEQSTQSTSKLLDTLENLHK--------S 326

Query: 235 NHQEVVSTYKMIEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEAL 282
              + V  Y+ ++ LQ+   ++    +V ++Q  E+L     E+++  E +
Sbjct: 327 QQLDTVQEYRHLQNLQEEYGRILADCNVAILQLDEQLAYCASEMKNVPEDM 377


>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
          Length = 327

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY     +   
Sbjct: 148 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 207

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y FTC C RC
Sbjct: 208 RQKRLLEDYGFTCYCDRC 225


>gi|224029645|gb|ACN33898.1| unknown [Zea mays]
          Length = 369

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 51/111 (45%), Gaps = 7/111 (6%)

Query: 80  ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
           +C+ E   +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G
Sbjct: 122 MCSVEQVRVAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVG 181

Query: 140 STMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
                 RQK+L+E Y F+C C  C +L   D      ++  + C    C G
Sbjct: 182 EMHISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCLG 227


>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
          Length = 786

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 7/118 (5%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
            AH+  N E   +G  +Y   S+ NHSC PN    F  R   +R    V  G+++ +SY 
Sbjct: 461 GAHSTSNVEQVRVGKAIYKAGSLFNHSCQPNVHAYFLSRALYLRTTNVVAAGSQLELSYG 520

Query: 136 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
              G      R   LK +Y F C C  C ++ + D      +L  + C +  CSG +L
Sbjct: 521 PQVGLWDCKDRLNFLKNEYAFHCLCTGCSEVNRSD-----LVLNAFHCVNPNCSGAVL 573


>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
 gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
 gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
          Length = 338

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G++P +S  NH C PN++  +   L  AVV  V+ V +G E+ +SY+E   +   RQ+ L
Sbjct: 119 GIFPKVSRFNHGCRPNSMRSYHPVLDQAVVHVVKDVSEGEEITVSYVEPGLAFYLRQEQL 178

Query: 149 KEQYLFTCTCPRCI 162
           KE++ F C C  C+
Sbjct: 179 KEKFGFICGCNLCL 192


>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 466

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 79/170 (46%), Gaps = 28/170 (16%)

Query: 28  EALVAHMSDID--EKQLLLYAQIANLVNLILQWPE------ISINEIAENFSKLACNAHT 79
           E L+A  + +   E+  L + +   +    L W E      +S + +     ++  NAH 
Sbjct: 288 ELLIASGTSLSAVERHTLAFNEHQAVPAPALLWAEALVQVLLSFDTLRTLVGQMVLNAHA 347

Query: 80  ICNSELRPLGT-------------GLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHV 124
           I +  L P+G              GLY +++  NHSC+PN AV   +G   +V +  + +
Sbjct: 348 INDYVLLPVGAPSPSAFDWVLKGAGLYSLLACFNHSCVPNVAVSNVDGTHEIVLKTTRPI 407

Query: 125 PKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIK-LGQFDDI 170
             G  + I+YI  +   M+   RQ+ L+  Y FTC CPRCIK  G   DI
Sbjct: 408 QAGEPLTITYIPLSAGAMSRAERQRQLR-NYFFTCHCPRCIKEAGVAGDI 456


>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 514

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/303 (20%), Positives = 122/303 (40%), Gaps = 35/303 (11%)

Query: 61  ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
           + ++  +E   ++AC    I   E+R         LYP ++++NHSC+ N    +   + 
Sbjct: 183 LKLDRFSEECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDY 242

Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
           R+ + R    +P   E+  SY  +   T+ R++ L E   F C CPRC      D  + +
Sbjct: 243 RIRL-RTTLKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRC-----SDPTELA 296

Query: 174 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
             +   +C   D+G    L     +  + C  C    + + ++K+   +        A++
Sbjct: 297 THMSSLKCNKCDNGIVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAIS 356

Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
              +  + +   + + +  + + HP    L   R  L ++   ++++          +  
Sbjct: 357 G-ADGADAIHERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 415

Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD--------TENAIKS-MTEAVE 332
           +  C+L + +   V P +  + G+  Y       FL           E  +KS MTEA  
Sbjct: 416 VDMCRLLLQILDVVEPGYSRIRGMTLYELHAPLLFLAKGQWNADVIDEAGLKSKMTEAAN 475

Query: 333 ILR 335
           IL+
Sbjct: 476 ILK 478


>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 558

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 38/76 (50%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           LG    P++   NHSC PN  ++F     ++RA   + K  E+ + Y++       RQ  
Sbjct: 221 LGIAFDPILCSANHSCEPNTAVIFNQPQLILRAQTKIKKDEEIFMKYVDITNPFSVRQAE 280

Query: 148 LKEQYLFTCTCPRCIK 163
           L++ Y F C C +C K
Sbjct: 281 LRQSYFFGCRCTKCRK 296


>gi|440800514|gb|ELR21550.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 333

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           G GL+P+ +++NHSC PN  + F   R AVV A+  + +G E+  SYIE       RQ  
Sbjct: 254 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEGEELTHSYIENDRPLAERQAD 313

Query: 148 LKEQYLFTCTCPRCIK 163
           L E Y F C C RC+K
Sbjct: 314 LLE-YNFVCQCVRCLK 328


>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
          Length = 969

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 754 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 812

Query: 149 KEQYLFTCTCPRC 161
            E + F C C RC
Sbjct: 813 AESWGFCCGCSRC 825


>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
 gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
          Length = 383

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 7/88 (7%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
           G  L+ + S  NHSC+PNA   +     +L++V AV+ + +G E+ ISY++      S  
Sbjct: 284 GVALFTLQSACNHSCVPNAEPTYLHNNNKLSLV-AVRDIQEGEEICISYLDECNLHRSRH 342

Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDI 170
           +R+K L E YLF C CP+C +     D+
Sbjct: 343 SRRKELMENYLFACNCPKCEEQADEPDV 370


>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
 gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
          Length = 662

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
           GL+  + +INHSC+PNA  +  G +  VRA + +  G EVL+SY+E   A     R+K +
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 504

Query: 149 KEQYLFTCTCPRC 161
            +QY F C+C RC
Sbjct: 505 -QQYNFECSCERC 516


>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 438

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           G G+Y V S+ NHSC PN  ++         VV A++    G E+ ISYI+ +     RQ
Sbjct: 350 GQGVYAVGSLFNHSCEPNLQVLNSATGDETLVVEALRDCEPGEELYISYIDESLPYAIRQ 409

Query: 146 KALKEQYLFTCTCPRCIK 163
           + L+E YLF C C +C++
Sbjct: 410 QQLREHYLFECRCSKCVR 427


>gi|169606119|ref|XP_001796480.1| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
 gi|160706916|gb|EAT87158.2| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
          Length = 323

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 21/155 (13%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           GL+P I+ INHSC PNA   +   L   +V A + + KG E+ +SYI    +   RQK L
Sbjct: 117 GLFPKIARINHSCRPNASYYWSQTLNKRIVYATRRIAKGDEIFVSYIPLLLTQEQRQKHL 176

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
            ++Y F CTC  C         QE A +E     DD       R +  K F   +  L  
Sbjct: 177 -DRYGFKCTCEAC--------AQEHAAME---ASDD------RRTTISKAFVDFEPQLTL 218

Query: 209 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
           +  + KK   +    ++ +L L     H+E ++ Y
Sbjct: 219 TSPKTKKATQQARKNARASLKLAEL-VHEEGLADY 252


>gi|325188967|emb|CCA23495.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 497

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           GT L+ +I  +NHSCLPN ++V+     A V A+Q +    E+ I YI+T  S   RQ  
Sbjct: 415 GTALFSLICTMNHSCLPNCIVVYGHNGEAHVHAIQAIQPKDELCIEYIDTDRSYDERQYE 474

Query: 148 LKEQYLFTCTCPRCIK 163
           L+E Y F C C +C++
Sbjct: 475 LRE-YHFRCQCMKCMR 489


>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
 gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
          Length = 427

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 41/71 (57%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
            GT L  V+S+INH C PNA + FEG    VR+++ +  G E+ +SY +     + RQ+ 
Sbjct: 128 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 187

Query: 148 LKEQYLFTCTC 158
           L++   F C C
Sbjct: 188 LRQTQFFHCEC 198


>gi|328855353|gb|EGG04480.1| hypothetical protein MELLADRAFT_108480 [Melampsora larici-populina
           98AG31]
          Length = 398

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 9/108 (8%)

Query: 58  WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR--- 114
           W   S++ I+     L  NA +I +++L      ++P +S INH C   +  V+  R   
Sbjct: 171 WQNQSLSNISNYIRILQTNAISIGSNQL-----AIFPSLSRINHGCAGASNSVYNWREKE 225

Query: 115 -LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            + VV A + +  G E+LISY ++  S   RQ  LK  Y F CTC  C
Sbjct: 226 GVEVVHATKLIEVGEEILISYWDSKRSRSDRQDYLKSNYGFQCTCQTC 273


>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
 gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
          Length = 530

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 81/178 (45%), Gaps = 25/178 (14%)

Query: 49  ANLVNLIL------QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
           ANL+  I        WPE+ I  IA   + L  N   +     R     LYP  ++I+H 
Sbjct: 202 ANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMISHD 258

Query: 103 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           C+PN    F+  + +V  A + + KG  + ISY +   ST+ R+  L++   F C+C RC
Sbjct: 259 CVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARC 318

Query: 162 I---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEI 213
               +LG F   Q        +CK    +G ++  +   +   + CQ C   RS +E+
Sbjct: 319 QDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKEV 367


>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
 gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
          Length = 545

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           G GL+ + S INHSC+PNA  +  G   +V A + +  G E+  +Y +   S + ++K +
Sbjct: 330 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 388

Query: 149 KEQYLFTCTCPRC 161
            E + F C C RC
Sbjct: 389 AESWGFCCGCSRC 401


>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
 gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 393

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 96  ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
           IS  NHSC PN V V E +   + ++  + KG E+ ISYI+   S   R   LKE Y F 
Sbjct: 273 ISYFNHSCFPNCVRVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332

Query: 156 CTCPRCI--KLGQFDDIQESAILEGYRCKDDG--CSGFL 190
           C C RC    L     ++    +E Y CK+    C+G L
Sbjct: 333 CKCKRCTLPPLSNL-SLEIEKTIENYTCKNQSIKCTGIL 370


>gi|426226456|ref|XP_004007359.1| PREDICTED: SET and MYND domain-containing protein 5, partial [Ovis
           aries]
          Length = 391

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 89  GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G  + ISY++      S  +
Sbjct: 278 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGERLGISYLDCCQRERSRHS 337

Query: 144 RQKALKEQYLFTCTCPRCI 162
           R K L+E YLF C+CP+C+
Sbjct: 338 RHKILRENYLFVCSCPKCL 356


>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
 gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
          Length = 664

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
           GL+  + +INHSC+PNA  +  G +  VRA + +  G EVL+SY+E   A     R+K +
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 506

Query: 149 KEQYLFTCTCPRC 161
            +QY F C+C RC
Sbjct: 507 -QQYNFECSCERC 518


>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 341

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 91  GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
           G+YP  S  NH CLPNA          +G   ++ R +  VP+G EV +SY     +  +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278

Query: 144 RQKALKEQYLFTCTCPRC 161
           RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296


>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
 gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
          Length = 522

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 10/183 (5%)

Query: 57  QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
           +W E   +EI      +  N H +  +E  P    +Y + S++ HSC PN    F  R  
Sbjct: 200 KWDE---DEILRVVGIIQVNGHEVPMTE--PSSVAIYNMASMLEHSCRPNLAKSFTNRGE 254

Query: 117 VVR-AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
           VV  A   + +G  + I Y +   +T  R + L++  +F C C RC    +++    +  
Sbjct: 255 VVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCSDRTEYETYFSAVR 314

Query: 176 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
             G++ KD  C G+LL   + +     QC  +R  +E+   A+   IL +  + L +   
Sbjct: 315 CSGFQ-KDSKCKGYLLPVDNAQWTGEWQC--LRCSKEVSG-ATVCQILERARMDLEAMEK 370

Query: 236 HQE 238
           H+E
Sbjct: 371 HRE 373


>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
          Length = 257

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           R   TGL+   + INHSC+ NA   + G L V+RA+  + KG E+  SY E+A     RQ
Sbjct: 104 RNASTGLWIWAAYINHSCIANAKKEYVGDLMVLRALGAIKKGEEIFHSYDESA-DYEARQ 162

Query: 146 KALKEQYLFTCTCPRCIKLGQFDD 169
           KAL   + F C C  C      DD
Sbjct: 163 KALLTTWGFECGCALCAAERTDDD 186


>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 551

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 61  ISINEIAENFS---KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
           I++  + E+F    KL  NA +         G  L   +++INHSC+PNA++ F GR A 
Sbjct: 171 ITLKGLEESFQVLCKLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNAMVQFGGRTAT 230

Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +R+   +  G E+ ISY +       RQ  L + Y F C+C +C
Sbjct: 231 LRSATFINPGDEIEISYADHTQPKGKRQGEL-DLYHFECSCYKC 273


>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 494

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 9/142 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLYP+ ++ NH+C PN    F E     V A   +  G E+ +SY      T  R++ L 
Sbjct: 214 GLYPLGALQNHNCAPNTRHHFDENYRLYVSATVPISAGEELTMSYTSLFWDTTLRRQFLS 273

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RDSDDKG--FTCQQCGL 206
               F+CTC RC      D  +  ++L    C  D CSG LL RD    G  + C  CG 
Sbjct: 274 ITKHFSCTCKRCS-----DPTESGSMLCALLCAFDNCSGNLLPRDPLRIGTPWVCDMCGK 328

Query: 207 VRSKEEIKKIASEVNILSKKTL 228
             +  +I  I S +  ++++ L
Sbjct: 329 SINYRQICSIRSGLAAITEEAL 350


>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
          Length = 382

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 9/91 (9%)

Query: 85  LRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYI---ETA 138
           L   G GLY   S  NHSC PNA + F     RL+++ A++ +  G E+ ISY+   +  
Sbjct: 280 LNSEGVGLYARQSCANHSCDPNAEISFVHNNYRLSLI-ALKDIQPGEEICISYLGDCDNE 338

Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
            S  +R+K L E YLF C C +C  L + DD
Sbjct: 339 RSRHSRRKTLMENYLFACECSKC--LAEIDD 367


>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
 gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
          Length = 514

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)

Query: 48  IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
           I   +    +  + +  +I +    L  N H +  ++  P    ++   S   +SCLPN 
Sbjct: 189 IGQFIPKFFKTQKFTEEQIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246

Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
              F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306

Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
             D   SAI    +C+D  C G +L    D  +  + C++C     K  +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354


>gi|159471331|ref|XP_001693810.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158283313|gb|EDP09064.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 484

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 49/103 (47%), Gaps = 10/103 (9%)

Query: 63  INEIAENFSKLACNAHTICNSELRPLG-TGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
             + AE+ +  AC           P G  GL+P  +++NHSC PN V    G   VVRAV
Sbjct: 185 FGDDAEDLAACACRGE-------EPRGHIGLWPEFALLNHSCAPNTVNYVVGGSMVVRAV 237

Query: 122 QHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCI 162
             + +G EV I Y+          R   L + Y F C+CPRC+
Sbjct: 238 APISQGEEVTICYLGRPQLLPFNKRIAILADDYGFECSCPRCV 280


>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
          Length = 589

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 24/121 (19%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLG-------TGLYPVISIINHSCLPNAVLVFE 112
           E+S  EI     +L  N HT+   +  PLG        G++P  S++NHSC PNA   F 
Sbjct: 299 EVSAEEIQTVHRRLRRNFHTV---QPEPLGRLLPPTVVGVFPAASLVNHSCEPNA--CFH 353

Query: 113 GRLA---------VVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPR 160
            R A          +R    V  G EV +SY+       T   R++ L+  + F+C CPR
Sbjct: 354 SRRAGPEGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPR 413

Query: 161 C 161
           C
Sbjct: 414 C 414


>gi|345570043|gb|EGX52868.1| hypothetical protein AOL_s00007g204 [Arthrobotrys oligospora ATCC
           24927]
          Length = 772

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%)

Query: 90  TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           +GL+ + +  NHSC+ NA   F G + + RAV  +PK  EVLISY +       RQ  L+
Sbjct: 519 SGLWILPAYTNHSCISNAHRTFLGDMMIFRAVVDMPKDTEVLISYTDHNLPYEERQGILQ 578

Query: 150 EQYLFTCTCPRCI 162
             + FTC C  C+
Sbjct: 579 TSWKFTCQCKLCV 591


>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
 gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
          Length = 378

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 20/132 (15%)

Query: 73  LACNAHTIC-----NSELR-----PLGTGLYPVISIINHSCLPNAVLVFE------GRLA 116
           L CN+H I      NS +       +G G+Y   S+ NHSC PN   V E      G  +
Sbjct: 226 LECNSHDISFPFRTNSSITTFNYYSIGIGIYFNSSLFNHSCQPNICKVIESCKNNFGCHS 285

Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
           +V A++ + +  E+  +YI+   +   R + L+  Y FTC CP C      +       L
Sbjct: 286 MV-AIRDIKENEEISFNYIQLNQNKQERIEKLQSSYYFTCNCPSCCTN---NPNPHKEFL 341

Query: 177 EGYRCKDDGCSG 188
             Y+CK   CSG
Sbjct: 342 NKYKCKKLNCSG 353


>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
          Length = 486

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 14/158 (8%)

Query: 87  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 145
           P    +Y   S++ HSC PN    F  R  +V  A   + +G  + I Y +   +T  R 
Sbjct: 189 PSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRL 248

Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD---KGFTC 201
           + L++  +F C C RC    +++    +    G++ KD  C G++L  DSD+     ++C
Sbjct: 249 EHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQ-KDSKCKGYILPVDSDNWTTGDWSC 307

Query: 202 QQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
           Q+C G+V         A+   IL +  + L +   H+E
Sbjct: 308 QKCRGVVTG-------AAVGQILERARMDLEAMQKHRE 338


>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
 gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
          Length = 388

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 146
            G G +P  S  NHSC PN     +GR    RA + + KG E+ I+Y+      ++R K 
Sbjct: 309 FGYGCWPAASYFNHSCHPNVEKQRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKR 368

Query: 147 --ALKEQYLFTCTCPRC 161
              LK+ + F C+C RC
Sbjct: 369 MLRLKKTWGFDCSCERC 385


>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
 gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
          Length = 404

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 64/179 (35%), Gaps = 57/179 (31%)

Query: 91  GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 143
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 144 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 167
           RQ+ L E Y F C C RC    Q+                                    
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDDNNGGDGDDTMEEEGEDDAGDRGDDGMEEEE 333

Query: 168 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEE 212
                   DD   +     Y C ++GC G L  L  S +      F C  CG +R +E+
Sbjct: 334 GDGATDGNDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEED 392


>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
 gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
          Length = 325

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)

Query: 92  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 25  LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 84

Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           K+   FTC C RC+   +LG  F  ++     EG++               D  ++C  C
Sbjct: 85  KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQ-------PTEPDTSWSCAGC 137

Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 138 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 190

Query: 261 LMQTREKLIKILMELEDWK 279
               R+ LI++   ++ ++
Sbjct: 191 ATGLRQLLIEMYGRVQGYE 209


>gi|51859341|gb|AAH81479.1| SMYD family member 5 [Danio rerio]
          Length = 300

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+GL+ + S  NHSC+PNA   F     L  + A+  +  G E+ ISY++      S  +
Sbjct: 188 GSGLFLLQSSCNHSCVPNAEASFPDNNFLLHLTALGDIGPGEEICISYLDCCQRDRSRHS 247

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
           R K L+E YLF C+C +C  L Q DD
Sbjct: 248 RHKILRENYLFICSCQKC--LSQMDD 271


>gi|351737851|gb|AEQ60886.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
          Length = 263

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203


>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
          Length = 657

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           R   TGL+   + INHSC+ NA   + G L V+RA++ + +G E+  SY E+A    TRQ
Sbjct: 503 RNASTGLWVWAAYINHSCVANAKKEYVGDLMVLRALRPIKEGEEIFHSYDESA-DYETRQ 561

Query: 146 KALKEQYLFTCTCPRC 161
           +AL   + F C+C  C
Sbjct: 562 RALMTTWGFECSCALC 577


>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
 gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
          Length = 532

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 101/228 (44%), Gaps = 28/228 (12%)

Query: 16  IPSTTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENF 70
           + ++  + ++ + AL  H+ +     L  +L A +   +  IL    W E+ I  IA   
Sbjct: 169 LKASDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA--- 225

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAE 129
           + L  N   +     R     L+P  ++I+H C PN    F+  + ++  A + + KG  
Sbjct: 226 AILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEI 285

Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGC 186
           + ISY +   ST+ R+  L++   F C C RC    +LG F   Q        +CK    
Sbjct: 286 LTISYTQPLRSTIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTCV-----KCK---- 336

Query: 187 SGFLLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
           +G ++  +   +   + CQ C L RS +E+     K+  E+  L K T
Sbjct: 337 AGKIISVNPLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384


>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 450

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 60/107 (56%), Gaps = 10/107 (9%)

Query: 62  SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGR 114
           +I ++ + FS+ + N   + +S L  +  G++P+ S + NHSCLPNA   +        R
Sbjct: 167 TIPDVNDLFSRFSNNNFAV-HSHLTTVAHGVFPLASRLFNHSCLPNAAARYILSEDAAPR 225

Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           + +V A+Q +  G E+ + Y++ A    +RQ+  +  Y FTCTCP C
Sbjct: 226 MEIV-ALQDIGAGEEICVPYLDPA-LLQSRQQGFQLTYGFTCTCPSC 270


>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
          Length = 584

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 40/234 (17%)

Query: 75  CNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
           CNA  +C++   P+G  L+   V S  NHSCLPN   +  G +   RA++    G E+ I
Sbjct: 327 CNALAVCDASGLPVGQALHAASVTSYFNHSCLPNCA-IEAGAIVTTRAIR---PGEELTI 382

Query: 133 SYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFL 190
           SY+ +       R++ L E+Y F C   RC++ G  D    ESA+    R          
Sbjct: 383 SYLPQLYWPAWLRREELAERYFFDC---RCVRCGDGDRSPFESALSATLRLGGS------ 433

Query: 191 LRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
            R+ + +  +  Q  CG VR+K+       +V++  +  L                ++++
Sbjct: 434 -REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDAL--------------LHLLQE 473

Query: 249 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
            ++ L+ PF     +    L  +   L D +  L  C   + +++ + P  HP+
Sbjct: 474 CRQHLF-PFHYLCHELHNTLSFVYAVLGDTRACLCGCLRELVMWEAIVPGAHPV 526


>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
          Length = 501

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 85/195 (43%), Gaps = 15/195 (7%)

Query: 92  LYPVISIINHSCLPNAVLVFE----GRL----AVVRAVQHVPKGAEVLISYIETAGSTMT 143
           L+P ++I++H+C+ N     +    G L      VRA   +PKG E+  SY  +   T+ 
Sbjct: 198 LFPKLAILSHNCVSNIHHAVDCNGDGELKDCVVTVRAAVDIPKGGELFSSYTFSLWPTLV 257

Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
           R++ L+E   F CTCPRC    + D    S  L+  RC D+G         D   + C  
Sbjct: 258 RREFLRESKFFDCTCPRCSSPTELDTHMSS--LKCQRC-DNGVIISTNTADDACEWKCTH 314

Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
           C    +   ++K+ + +     +   +   G  Q +     +  K +  L HP +  +  
Sbjct: 315 CDYKTNGRAVRKVFAAIQ---NEIDQVEYIGGPQGIQQRETIFRKYRSVL-HPNNSYMTT 370

Query: 264 TREKLIKILMELEDW 278
            R  LI++  + E +
Sbjct: 371 LRSALIQLYGKTEGY 385


>gi|348668850|gb|EGZ08673.1| hypothetical protein PHYSODRAFT_352669 [Phytophthora sojae]
          Length = 1354

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 70/140 (50%), Gaps = 20/140 (14%)

Query: 38   DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--------- 88
            DE Q ++ +     V  +L W E  + ++ +   +L  N   + +SE  PL         
Sbjct: 1167 DEDQTIIAS-----VRRVLGWEEERL-QVLQTQERLERNDGEL-DSEFDPLMPLGYPSIE 1219

Query: 89   GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
            GT L+P+I  +NHSC PN  +++   G   VV A++ + KG E+ I YI+       R+ 
Sbjct: 1220 GTALFPIICTMNHSCDPNCTVLYTKNGDGHVV-AIRDIHKGEELCICYIDVDMDVQMREA 1278

Query: 147  ALKEQYLFTCTCPRCIKLGQ 166
             L+E Y F C C RC++  Q
Sbjct: 1279 NLRE-YKFKCFCSRCVQERQ 1297



 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 89  GTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
           GT L+  I  +NHSC PN  VL  +   A V AVQ + +G E+ ISYI+       R + 
Sbjct: 390 GTALFSGICTMNHSCDPNCTVLYTKDGAAHVFAVQDIQEGEELCISYIDVDQEVEERAEC 449

Query: 148 LK 149
           L+
Sbjct: 450 LR 451


>gi|311978082|ref|YP_003987202.1| putative SET domain-containing protein [Acanthamoeba polyphaga
           mimivirus]
 gi|81999695|sp|Q5UNT8.1|YL678_MIMIV RecName: Full=Putative SET domain-containing protein L678
 gi|55417289|gb|AAV50939.1| unknown [Acanthamoeba polyphaga mimivirus]
 gi|308204535|gb|ADO18336.1| putative SET domain-containing protein [Acanthamoeba polyphaga
           mimivirus]
 gi|339061617|gb|AEJ34921.1| hypothetical protein MIMI_L678 [Acanthamoeba polyphaga mimivirus]
          Length = 255

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203


>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
          Length = 423

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 78/167 (46%), Gaps = 8/167 (4%)

Query: 18  STTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLAC 75
           S + D Y+ ++ L +H+++  +  L  ++ + +   +  +L   + S   I E  + L  
Sbjct: 141 SHSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGLQQYSAETILEVCAMLDT 200

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 134
           N   I   E R     LYP+ ++++H C PN    F+  L +V  A   + +GA +  SY
Sbjct: 201 NCFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDALRMVLVATVDIERGATISTSY 260

Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
            +    T+ R+ ALK+   F C C RC      D  +    L G+RC
Sbjct: 261 TQPLLGTLHRRLALKQSKHFDCQCERCT-----DPTELGTNLSGFRC 302


>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
 gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
 gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
 gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
 gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
          Length = 507

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)

Query: 92  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           K+   FTC C RC+   +LG  F  ++     EG++               D  + C  C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 319

Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 320 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372

Query: 261 LMQTREKLIKILMELEDWK 279
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
          Length = 520

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 76  NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
           NA    +++L  +G  L P +++ NHSC+PNA++ F GR A++RA + +    E+ ISY 
Sbjct: 163 NAFHRYDTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEISYT 222

Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIK 163
           +       R+ AL   Y F C C RC K
Sbjct: 223 DYTFPLSKRKLALAP-YFFDCMCLRCEK 249


>gi|453080542|gb|EMF08593.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 350

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            ++  I+ INHSCLPNA   +   L  +R  +++ +  G E+ +SYI   G + TR+K L
Sbjct: 122 AIFLTIARINHSCLPNAQHSWNAPLQRMRIHSIKTLQPGEEITLSYILPCGPSSTRKKLL 181

Query: 149 KEQYLFTCTCPRC 161
           KE + F C CP C
Sbjct: 182 KENFCFNCLCPLC 194


>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
 gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
          Length = 504

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 33/234 (14%)

Query: 65  EIAENFSK---------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---E 112
           ++AE FS+         L  NA     S+  PL   LYP   I+ H+C+PN        E
Sbjct: 169 KLAERFSEELIMQVVGVLEVNAFEARTSQGYPLRC-LYPYTGILAHNCVPNTARTIHPSE 227

Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
           G    +RA+  +  G ++  SY  T   T  RQ  L+E   FTC+C RC+   +      
Sbjct: 228 GYKIRLRAMVALEAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFS 287

Query: 173 SAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGLVRSKEEIKKIA----SEVNILSK 225
           S  ++  +C +    G+LL       +  + C+ C    S EE+++I     +EVN +  
Sbjct: 288 S--MKCGQCTE----GWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQA 341

Query: 226 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 279
             +        + ++  Y       K L HP        R+ LI++   ++ ++
Sbjct: 342 LDMGPKRLEESERLLRKY-------KSLLHPSHYIATSLRQLLIEMYGRVQGYE 388


>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
 gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
          Length = 507

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)

Query: 92  LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
           L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T  RQK L
Sbjct: 207 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266

Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
           K+   FTC C RC+   +LG  F  ++     EG++               D  ++C  C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCASC 319

Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
           G   S  +    ++ + SEVN +    +A       + ++  Y       K L HP    
Sbjct: 320 GSETSNVDALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372

Query: 261 LMQTREKLIKILMELEDWK 279
               R+ LI++   ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391


>gi|321478979|gb|EFX89935.1| SET and MYND domain-containing 5 protein [Daphnia pulex]
          Length = 387

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 89  GTGLYPVISIINHSCLPNAV--LVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
           G+ L+ + S  NHSC PN +    F      + A + + +G E+ ISY++      S  +
Sbjct: 290 GSALFAIQSACNHSCEPNCISTFPFSNHTVALVASKDLEEGEEIFISYLDECAQSRSRHS 349

Query: 144 RQKALKEQYLFTCTCPRC 161
           R+K LKE YLF C C RC
Sbjct: 350 RRKILKENYLFHCNCSRC 367


>gi|398257480|gb|EJN41088.1| hypothetical protein lvs_L585 [Acanthamoeba polyphaga
           lentillevirus]
          Length = 237

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 71  SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
           +K  CNA    N      G+ +    +  NHSC+PN + V +        V+++  G E+
Sbjct: 100 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 153

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
             +Y++   +T TR+  L  QY F C C RCI
Sbjct: 154 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 185


>gi|300176601|emb|CBK24266.2| unnamed protein product [Blastocystis hominis]
          Length = 167

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 15/134 (11%)

Query: 14  NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ---WPEISI---NEIA 67
           +++    T +Y+ +  L  ++  I    L  Y QIA L++ I+Q   W   S+   + I 
Sbjct: 28  SMVDGVVTGDYTDIIDLCDNIDIIQPSMLSSYEQIATLLHSIVQSEPWYSDSLCPLSTIT 87

Query: 68  ENFSKLACNAHTICNSELR-PLG--------TGLYPVISIINHSCLPNAVLVFEGRLAVV 118
               KL  N   +   +L  PLG          LYP++S+ NH C PNA +VF+G  A +
Sbjct: 88  SCIGKLYSNRFAVTTIDLSAPLGRSFTQETAIALYPLLSLANHRCTPNATVVFDGLKATL 147

Query: 119 RAVQHVPKGAEVLI 132
           RA+Q + KG E+ +
Sbjct: 148 RALQPIHKGEEITV 161


>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 420

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%), Gaps = 4/82 (4%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG 139
           S+    G GLY  ++ +NHSC PN VL     + +L+++ A++ +  G E+ ISYI+ + 
Sbjct: 337 SKFSVEGEGLYKYLNTLNHSCDPNCVLACTTDDFKLSLI-ALKDIKAGEELTISYIDNSM 395

Query: 140 STMTRQKALKEQYLFTCTCPRC 161
           +  TR K L +QY F C C +C
Sbjct: 396 NKETRLKTLMDQYNFDCKCKKC 417


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 67  AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
           AE   K+  N+    +SEL   G  L P +++ NHSC+PNA + F GR A++ A   +  
Sbjct: 168 AEMLCKIQNNSFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRA 227

Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           G E+ I+Y         R++AL   Y FTC C RC
Sbjct: 228 GDEIEIAYTFYTDPLPKRREALA-HYKFTCQCLRC 261


>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
           B]
          Length = 410

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 88  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-STMTRQK 146
           LG  +YP +S  NH C PNA    +GR       + V +G E+ ISY    G +   R+K
Sbjct: 321 LGFAVYPRLSFFNHDCAPNARKERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRK 380

Query: 147 ALKEQYLFTCTCPRCI 162
            L + + F C C RC+
Sbjct: 381 ELSDGWFFDCACGRCV 396


>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
 gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
          Length = 510

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 19/121 (15%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE-----------------GRLAV-VRAVQHVPKGAEV 130
           G+ +Y  +S+ NH C PN   + E                  RL++ +  ++ VP+ +E+
Sbjct: 361 GSAVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSEL 420

Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
           LI+YI        R   LK  ++F C C RC       ++ E  I     CK+  CSG L
Sbjct: 421 LITYIPLNQKGHERYHNLKSNWMFPCDCLRCFHEKTNPELTE-KIFNDSSCKNSKCSGGL 479

Query: 191 L 191
           L
Sbjct: 480 L 480


>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 631

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 12/107 (11%)

Query: 73  LACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVVRAVQ 122
           + CNA+ I  +         E R +G  +Y  +S++NHSC PN V   +   + VVRA++
Sbjct: 411 INCNAYEIVENVHEEVTRVWEPRNIGGAIYTTVSLVNHSCYPNVVRHSYPSGIVVVRALR 470

Query: 123 HVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
            + KG E+L  Y     + S M R++ L ++Y F C C  C +  +F
Sbjct: 471 FIGKGCEILDCYGPQFLSESRMARRELLWKKYRFLCECDACTQDWKF 517


>gi|302832792|ref|XP_002947960.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
           nagariensis]
 gi|300266762|gb|EFJ50948.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
           nagariensis]
          Length = 1170

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 13/76 (17%)

Query: 86  RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
           RP  TGL+   +++NHSC+PN      G   +VR  + V  G EV             R+
Sbjct: 703 RPGHTGLWTEHALMNHSCVPNVCTYVLGEAMLVRVSRPVQGGEEV-------------RR 749

Query: 146 KALKEQYLFTCTCPRC 161
           KAL +QY FTC CPRC
Sbjct: 750 KALYDQYGFTCCCPRC 765


>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 377

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 12/114 (10%)

Query: 60  EISINEIAENFSKLACNAHTI----CNSELRPL-GTGLYPVISIINHSCLPNAV------ 108
           E+S+    +  ++ ACN   +    C  E R + G  ++   S+ NH CLPNA       
Sbjct: 165 ELSLELTGKLLARDACNTFGLMAPSCAGEERKVRGYAMFAQASMFNHDCLPNACRFEYVD 224

Query: 109 LVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           +  +G   V VRA+  + +G EV +SY         RQ+ L+E+Y F CTC RC
Sbjct: 225 IDGDGNTDVIVRALHDMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARC 278


>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 480

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 8/102 (7%)

Query: 67  AENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVR 119
           +E +S+   N + + +S L P+  G++PV S + NHSC PNA   +       G   VV 
Sbjct: 189 SELYSRFGRN-NFVVHSHLSPIAHGIFPVASRLFNHSCTPNAAAKYIFGRDKPGVRMVVV 247

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A++++    E+ I Y++ A   + RQ +L++ Y F CTC  C
Sbjct: 248 ALRNISHNEEITIPYLDPALPFLERQLSLEQNYGFRCTCHVC 289


>gi|167525114|ref|XP_001746892.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774672|gb|EDQ88299.1| predicted protein [Monosiga brevicollis MX1]
          Length = 500

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 1/97 (1%)

Query: 86  RPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
           R    G +PVI+++NHSC PN AV+   G    VR +  V  GAE+ +SY++       R
Sbjct: 233 RETAWGHFPVIALVNHSCHPNCAVVSRPGGELEVRTLGTVRAGAELFVSYVDLTLPRAER 292

Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
           Q  L     FTCTC RC     F    E+++    +C
Sbjct: 293 QAHLLASKEFTCTCYRCQHPDAFPHEHEASMPSCPQC 329


>gi|358342712|dbj|GAA50167.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
          Length = 761

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 87/196 (44%), Gaps = 47/196 (23%)

Query: 5   YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQ----IANLVNLILQWPE 60
           +LRRK  N   I S        ++ LV+H  + + + L  YA     +A L++L L   E
Sbjct: 395 WLRRK--NHQQISSQQLT----IDRLVSHFDEFEVEDLFQYALTAWLLAKLISLSLP-DE 447

Query: 61  ISINEIAENF------SKLACNAHTICN---------------------SELRPLGTGLY 93
            +I E+ +         +L CNAH I +                      ++R + T L+
Sbjct: 448 GAITELIDGLWCFDTLRRLQCNAHAITDVQETFQLETKEDETHHAVPRLGQVR-VATALF 506

Query: 94  PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMTRQKALKE 150
           P +S++NH+C P+    FE +  V+R  + +    EV   Y        S+ +R+  L++
Sbjct: 507 PCVSMLNHACEPSVSNSFENQFIVLRCAKPIHPSDEVYNCYGPHYLHDTSSSSRRAQLQK 566

Query: 151 QYLFTCTC-----PRC 161
           QY F C+C     PRC
Sbjct: 567 QYFFNCSCQHCTNPRC 582


>gi|406859672|gb|EKD12735.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 797

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 70/154 (45%), Gaps = 11/154 (7%)

Query: 8   RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA 67
           +KLQN+    +T  D YS        +S +D + ++    I  +V         S+N  +
Sbjct: 472 QKLQNNPSTAATVHDLYSGSYKRATQVS-VDGRPVVDSFLIERIV---------SLNVFS 521

Query: 68  ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
              +            + R   TGL+   + INHSC+ N    F G + +VRA + +P  
Sbjct: 522 SPLTSREAYGQDRQADDPRFGSTGLWVYAAYINHSCIVNCYRTFIGDMMIVRAAKDMPAD 581

Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
            E+  SY +      T+ ++L++ + FTCTCP C
Sbjct: 582 TELEWSYADPINRAKTK-RSLQDNWGFTCTCPVC 614


>gi|146263138|gb|ABQ15327.1| Mcg1p [Magnaporthe grisea]
          Length = 456

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 90  TGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
            G +  +S +NH C PN A  V +  +    A + +  G E+ ISY+++  S   RQ+ L
Sbjct: 258 AGTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERL 317

Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
           K  + FTCTCP C   G    + +S +   Y+
Sbjct: 318 KRNWGFTCTCPHCSLSGAMGKLSDSRLYAIYQ 349


>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 510

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/156 (32%), Positives = 71/156 (45%), Gaps = 27/156 (17%)

Query: 32  AHMSDIDEKQLLL--YAQIANLVNLILQWPE------ISINEIAENFSKLACNAHTICNS 83
           A +SD+    L    + Q   +    L W E       S + +     ++  NAH I + 
Sbjct: 340 APLSDVQRHTLTFDEHQQQQTVPAPTLLWAEALVQVIFSTDTLRTLVGQMVLNAHAINDY 399

Query: 84  ELRP-------------LGTGLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHVPKGA 128
            L P              G GLY ++S  NHSC+PN AV   +G   +V +  + +  G 
Sbjct: 400 VLPPGEAPSSGVFDWVLKGAGLYSLLSCFNHSCVPNVAVSAVDGTHEIVLKTTRSIRAGE 459

Query: 129 EVLISYIE-TAGST--MTRQKALKEQYLFTCTCPRC 161
            + I+YI  TAG+T    RQ+ LK  Y FTC CPRC
Sbjct: 460 PLAITYIPLTAGATSRAERQRQLK-NYFFTCHCPRC 494


>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
           [Acyrthosiphon pisum]
          Length = 645

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 19/125 (15%)

Query: 56  LQWPEISINEIAENFSKLAC----NAHTI-----CNSELR--------PLGTGLYPVISI 98
           LQ P   +  + E+   + C    NAH I     C ++L+        P+ + L PV+S+
Sbjct: 423 LQIPNHQLGTVGESLVHIICAVSSNAHGITQPSDCKTQLKLSLDNRFVPVASLLMPVLSL 482

Query: 99  INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTC 156
           +NH C PN V        V+ A+Q + KG+++  +Y  +       +R + LK QY F+C
Sbjct: 483 LNHHCDPNVVRHNYNGTIVLTAIQPISKGSQLFDNYGLLYATHPKESRLQILKNQYYFSC 542

Query: 157 TCPRC 161
            C  C
Sbjct: 543 ECSSC 547


>gi|157115758|ref|XP_001652682.1| hypothetical protein AaeL_AAEL007346 [Aedes aegypti]
 gi|108876750|gb|EAT40975.1| AAEL007346-PA [Aedes aegypti]
          Length = 354

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 52/135 (38%), Gaps = 33/135 (24%)

Query: 58  WPEISINEIAENFSKLACNAHTICNSELR-----------------------------PL 88
           W   S   I  +  +L CN H I  SELR                              +
Sbjct: 42  WFLYSSAVILRHIGQLVCNGHAI--SELRVCTASENNCLEADSFNIKAGFLHRCFESTRV 99

Query: 89  GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQK 146
            TG++P IS+ NHSC PN    F      V A + V  G E+   Y       S   RQ 
Sbjct: 100 FTGIFPQISMFNHSCDPNIRNCFSKSTLTVYATRDVEAGGEIFNCYGPNFKLMSREDRQS 159

Query: 147 ALKEQYLFTCTCPRC 161
           ALK+QY F C C RC
Sbjct: 160 ALKQQYCFDCDCIRC 174


>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
           206040]
          Length = 543

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 1/102 (0%)

Query: 60  EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
           E  I + AE   K+  N+    +S+L  +G  L P +++ NHSC+PNA + F GR A++ 
Sbjct: 166 EEDIRKAAEMLCKIQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLI 225

Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
           A   +  G E+ ++Y         R++AL   Y F C C RC
Sbjct: 226 AENPIRAGDEMELAYTFYTDPLPKRKEALA-HYKFVCQCLRC 266


>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
 gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
          Length = 458

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 76/364 (20%), Positives = 142/364 (39%), Gaps = 65/364 (17%)

Query: 24  YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
           + L+  + +H  D   +         N+V +I Q   + ++E +E+     C   T+   
Sbjct: 123 WKLISGMESHTEDQTREYRWGMEMGRNVVRVIRQ--TLGMDEFSEHEIMKVCGILTVNAF 180

Query: 84  EL-----------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVL 131
           EL            P    +YP   ++NH CL N +   + +  + VR    V KG  + 
Sbjct: 181 ELFTTGPNKDRDSTPRAWAVYPTTYLMNHDCLANTLTSIDSKNKMHVRTRMPVKKGEALT 240

Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
             Y E    T  R+  L     F C+C RC      D  +  + +  +RC +  C G ++
Sbjct: 241 AEYSECLWGTEIRRHQLHRYKYFWCSCQRC-----RDPTELGSFISSHRCTN--CGGNVV 293

Query: 192 R----DSDDKGFTCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTY 243
                D +  GF C+ C ++   E +K    ++  EV  L  K L        +  +  Y
Sbjct: 294 ATKPLDFNSTGFRCESCRVIIRGEVLKGTNHQVEEEVFELESKEL--------ETPIEEY 345

Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKI--------LMELEDWK--EALAYCQLTIPVYQ 293
           + + +  + + HP    +++ ++ L           L EL + +  + +  C+  I   +
Sbjct: 346 EDLVERHESILHPNHYAMLRCKKFLAHAYGRQPGYSLAELSEDRLNKKMEMCRTLIDTLE 405

Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA-VEILRITHGTNSPFMKELILKL 352
            V P+F+                  DT + +  +  A VE+ R  H       +E+++ L
Sbjct: 406 IVSPRFN-----------------DDTAHFLYELHAALVEVARRRHDLGLISNQEMVVAL 448

Query: 353 EEAQ 356
            EA+
Sbjct: 449 TEAK 452


>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
 gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
          Length = 311

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 89  GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQK 146
           GT +Y   S+ NHSC PNA + +E G  A+ +R ++ V  G E  I+Y++    + +R+ 
Sbjct: 232 GTAVYFTASLFNHSCAPNAHVSWENGDAAITIRTLRPVRAGEEFNITYVDANERSASRRA 291

Query: 147 ALKEQYLFTCTCPRC 161
            LKE Y F C C RC
Sbjct: 292 RLKEWYGFDCACERC 306


>gi|226508960|ref|NP_001146688.1| uncharacterized protein LOC100280288 [Zea mays]
 gi|219888317|gb|ACL54533.1| unknown [Zea mays]
          Length = 318

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 83  SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
           S++R +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G   
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229

Query: 143 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
              RQK+L+E Y F+C C  C +L   D      ++  + C    C G
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCHG 272


>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 478

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 61/139 (43%), Gaps = 9/139 (6%)

Query: 91  GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
           GLYPV + +NH C+PN +  F  +L ++ +A   + KG E+  SY  +   T  RQ  L 
Sbjct: 216 GLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQEITTSYTYSIWPTSLRQNHLL 275

Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKG-FTCQQCGL 206
               F CTC RC    +F        L    C    C G +L     D K  + C+ C  
Sbjct: 276 TSKQFICTCSRCCDTEEF-----GTKLAALNCIVKNCDGRILPINPLDKKSIWQCKMCTK 330

Query: 207 VRSKEEIKKIASEVNILSK 225
           + S  E+     +V  L K
Sbjct: 331 LVSSTEMICFYKDVETLMK 349


>gi|310795375|gb|EFQ30836.1| hypothetical protein GLRG_05980 [Glomerella graminicola M1.001]
          Length = 411

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 41/83 (49%), Gaps = 5/83 (6%)

Query: 82  NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI---ETA 138
            SEL   G G +P  S  NHSC PN     EGR    RA + V KG ++ I+Y+   E A
Sbjct: 321 GSEL--FGHGCWPAASYFNHSCGPNVDKRREGRAWEFRASRDVGKGEQLCITYLGGEEKA 378

Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
               TR   L+  + F C C RC
Sbjct: 379 MPRETRMSTLRRNWGFECGCKRC 401


>gi|341876422|gb|EGT32357.1| CBN-SET-10 protein [Caenorhabditis brenneri]
          Length = 428

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 69/145 (47%), Gaps = 19/145 (13%)

Query: 68  ENFSKLACNAHTICNS-----ELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
           EN   +A     +CNS     E R  P+G+G+Y  ++  NHSC   + +VF+    ++RA
Sbjct: 145 ENNRMIAMICSVLCNSFGIIDEKRGEPIGSGMYIGLAKHNHSCASTSHVVFDKNQVLLRA 204

Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
            +         ISY+     T  RQK+++  +  TC C  C    Q +D+    I    R
Sbjct: 205 RKE-EYCRNTTISYVSRMLPTAERQKSIRSVHFITCRCEMC----QNEDLD--LIGLASR 257

Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCG 205
           C+ DGC+GF+   S     +C  CG
Sbjct: 258 CQTDGCNGFVKGSS-----SCGSCG 277


>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 333

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 65  EIAENFSKLACNAHTICN--------------SELRPLGTGLYPVISIINHSCLPNAVLV 110
           E A ++  L CN H I                 E   + T +YP  S++NHSC PN +  
Sbjct: 187 EDAFSYGILVCNGHAITKINVTADDHENKLLIEEQIRIATAIYPSASMMNHSCDPNIINS 246

Query: 111 FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRC 161
           F  +  +V+A+Q + +G E+L  Y           RQ++L+ QY FTC C  C
Sbjct: 247 FLDQTLIVKAIQDIKEGEEILHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.135    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,281,502,673
Number of Sequences: 23463169
Number of extensions: 207196425
Number of successful extensions: 560448
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1239
Number of HSP's successfully gapped in prelim test: 1462
Number of HSP's that attempted gapping in prelim test: 556083
Number of HSP's gapped (non-prelim): 3803
length of query: 368
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 224
effective length of database: 8,980,499,031
effective search space: 2011631782944
effective search space used: 2011631782944
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)