BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012829
         (455 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score =  755 bits (1950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/447 (77%), Positives = 404/447 (90%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVIISQEPYVCVPNNS++ SRCD CF+SSN+KKCSACQV WYCGS+CQKL+WKLHR+EC
Sbjct: 34  GEVIISQEPYVCVPNNSATESRCDRCFSSSNVKKCSACQVAWYCGSSCQKLEWKLHRIEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + LS+LDK++RK VTPSIRLM++L +RRKLQ++ +I ST TDNY LVEALVAHM DIDEK
Sbjct: 94  EALSKLDKDRRKYVTPSIRLMVRLLIRRKLQSEKIISSTATDNYDLVEALVAHMKDIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLVNLIL+WP+++I EIAENFSKLACNAHTIC+SELRPLGTGLYPV+SIIN
Sbjct: 154 QLVLYAQMANLVNLILRWPDVNIKEIAENFSKLACNAHTICDSELRPLGTGLYPVVSIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSCLPNAVLVF+GRLAVV  VQH+PKG+EVLISYIETAGSTMTRQKALK+QY FTCTCPR
Sbjct: 214 HSCLPNAVLVFDGRLAVVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           CIK+G  DDIQESAILEGYRCKD+ C+GFLLRDSDD+GF CQQCGL+RSKEE+KK A+E+
Sbjct: 274 CIKMGLLDDIQESAILEGYRCKDNRCNGFLLRDSDDRGFICQQCGLLRSKEEVKKSAAEI 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
              S K     S GN QE VS YK+IEKLQ+KL HPFS +LMQTREKL+K+LMELEDW E
Sbjct: 334 KATSDKASKSISSGNLQEAVSIYKLIEKLQRKLCHPFSTSLMQTREKLLKMLMELEDWGE 393

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL+YC+LTIPVYQRVYP+FHPLLGLQY++CGKLEW LGDTE AIKS+T+A+++LRITHGT
Sbjct: 394 ALSYCKLTIPVYQRVYPEFHPLLGLQYFSCGKLEWLLGDTEAAIKSLTKALDVLRITHGT 453

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKD 454
            +PFMKEL++KLEEA+AE SY  +S D
Sbjct: 454 KTPFMKELMMKLEEARAEVSYLRASND 480


>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/448 (77%), Positives = 406/448 (90%), Gaps = 1/448 (0%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVIISQEPYV VPNNS+  SRC+GCF SSNLKKCSAC VVWYCGS CQK DWKLHRLEC
Sbjct: 34  GEVIISQEPYVSVPNNSAVHSRCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             LSRL+KE++KS+TPSIRLM+KLY+RRKLQ++ ++P+T  DNY+LVEALV+H++DIDEK
Sbjct: 94  NALSRLEKERQKSLTPSIRLMVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 154 QLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPR
Sbjct: 214 HSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+
Sbjct: 274 CRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASEL 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             LS K   ++S  ++ E  S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ 
Sbjct: 334 KPLSDKA-TMSSSSHYVEATSIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRA 392

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGT
Sbjct: 393 ALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGT 452

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
           N+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 453 NTPFMKELLFKLEEARAEASHKRSSKDK 480


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/448 (77%), Positives = 403/448 (89%), Gaps = 4/448 (0%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVIISQEPYV VPNNS+  SRC+GCF SSNLKKCSAC VVWYCGS CQK DWKLHRLEC
Sbjct: 34  GEVIISQEPYVSVPNNSAVHSRCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             LSRL+KE++KS+TPSIRLM+KLY+RRKLQ++ ++P+T  DNY+LVEALV+H++DIDEK
Sbjct: 94  NALSRLEKERQKSLTPSIRLMVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLVNLILQWP+I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 154 QLVLYAQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSCLPN+VLVFE RLAVVRAVQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPR
Sbjct: 214 HSCLPNSVLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C ++GQ+DDIQESAILEGYRCKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+
Sbjct: 274 CRRMGQYDDIQESAILEGYRCKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASEL 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             LS K    +S     +  S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ 
Sbjct: 334 KPLSDKATMSSS----SQATSIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRA 389

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+LTIPVYQRVYP FHPLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGT
Sbjct: 390 ALTYCKLTIPVYQRVYPGFHPLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGT 449

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
           N+PFMKEL+ KLEEA+AEAS+K SSKD+
Sbjct: 450 NTPFMKELLFKLEEARAEASHKRSSKDK 477


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/448 (74%), Positives = 386/448 (86%), Gaps = 23/448 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVI+ QEPYVCVPNNSS++SRCDGCFAS +LKKCSACQVVWYCGS CQK +WKLHRLEC
Sbjct: 34  GEVILRQEPYVCVPNNSSTVSRCDGCFASESLKKCSACQVVWYCGSTCQKSEWKLHRLEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             LSRL+KEKRK+VTPSIRLM++LYLRRKLQN+                      D+DEK
Sbjct: 94  NALSRLEKEKRKAVTPSIRLMVRLYLRRKLQNE---------------------MDLDEK 132

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV+ ILQWPEI++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 133 QLVLYAQMANLVHFILQWPEINLKEIAENFSKLACNAHTICDCELRPLGTGLYPVVSIIN 192

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL FEG+ +VVRAV+H+P+GAEV I+YI+TAGSTMTRQKALKEQY FTCTCPR
Sbjct: 193 HSCMPNAVLTFEGKSSVVRAVEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPR 252

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           CIK+  +DDIQESAILEGYRCKDD C+GFLLRDS+DKGF CQ CGL RSKEE+K+I  E+
Sbjct: 253 CIKV--YDDIQESAILEGYRCKDDRCNGFLLRDSEDKGFICQTCGLRRSKEEVKRIVCEI 310

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             +S K L  TS GNH+EV+S YKMIEKLQ +L HPFS++LM+T+E+L+KILMEL DW+E
Sbjct: 311 TAISDKKLKSTSPGNHEEVISLYKMIEKLQMELCHPFSISLMRTQEELLKILMELGDWRE 370

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           ALAYC+LTI  YQRVYP+ HPLLGLQYYTCGK+EW LG TE+AIKS+T AV+ILRITHGT
Sbjct: 371 ALAYCRLTITGYQRVYPEPHPLLGLQYYTCGKIEWLLGYTEDAIKSLTRAVDILRITHGT 430

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
           NSPFMKEL++KL+EA AEASY LSSKDE
Sbjct: 431 NSPFMKELMMKLDEAHAEASYNLSSKDE 458


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score =  695 bits (1793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/448 (73%), Positives = 380/448 (84%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GE IISQEPYVCVPNNS   SRCD CFAS NLKKCSAC+V WYC S CQK +WKLHRLEC
Sbjct: 34  GEEIISQEPYVCVPNNSPVESRCDRCFASINLKKCSACKVAWYCSSVCQKSEWKLHRLEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L+RLDK++ KSVTPSIRLM+KL++RRKLQ++ VIP   TDNY LVE L+ HMS IDEK
Sbjct: 94  DALARLDKDRHKSVTPSIRLMIKLFIRRKLQSEKVIPMAATDNYKLVEELIDHMSGIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+  LVN ILQWP +++ EIA NFSKLACNAHTIC+SELRPLGTGLYPVISIIN
Sbjct: 154 QLVLYAQMGTLVNFILQWPGMNVKEIAVNFSKLACNAHTICDSELRPLGTGLYPVISIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSCLPNAVLVFEGR AVVRAVQH+P GAEV ISYIETAGSTMTRQK LKE YLFTCTC R
Sbjct: 214 HSCLPNAVLVFEGRTAVVRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K+ Q D+I+ESAILEGYRC++D C GFLLR+SDD GFTCQQCGLVRSKE+IK IAS++
Sbjct: 274 CVKVAQEDEIKESAILEGYRCRNDQCDGFLLRNSDDTGFTCQQCGLVRSKEDIKNIASKI 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             +S +     S  ++ E +  Y+ +EKLQ+ L HP+S++LMQTREKL+KI MELE+W +
Sbjct: 334 KSISDEASTSLSSQSYAEALFMYEKVEKLQRILCHPYSISLMQTREKLLKISMELENWTK 393

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+LTI VYQ++YP  HPLLGLQ+YTCGKLEW LG TE+AIKS T+A +ILRITHGT
Sbjct: 394 ALTYCKLTISVYQKLYPGIHPLLGLQFYTCGKLEWLLGHTEDAIKSYTKAFDILRITHGT 453

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
           NS FMKEL+LKLEEA+AEASYKLSS D+
Sbjct: 454 NSSFMKELLLKLEEARAEASYKLSSTDD 481


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score =  677 bits (1746), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/452 (73%), Positives = 386/452 (85%), Gaps = 6/452 (1%)

Query: 8   GEVIISQEPYVCVPNNSS--SISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDWKL 62
           GEVIISQEPYVCVPNNSS     RCDGCF + N   L +CS CQ+ +YCG+ CQ+ +WKL
Sbjct: 34  GEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVLSRCSRCQLAFYCGTACQRSEWKL 93

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           HRLEC+VLS L K KRKS+TPSIRLML+LYLRRKLQND +IPST  DNY+LVEALVAHMS
Sbjct: 94  HRLECEVLSSLHKYKRKSLTPSIRLMLRLYLRRKLQNDKIIPSTAMDNYNLVEALVAHMS 153

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           DI E+QL+LYAQ+ANLVN IL+WP I+I EIAENFSK ACNAHTIC+SELRP+GTGLYPV
Sbjct: 154 DITEEQLVLYAQMANLVNSILEWPGINIKEIAENFSKFACNAHTICDSELRPVGTGLYPV 213

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ISIINHSCLPN+VLVFEG  A+VRAVQH+P G EVLISYIETA STMTRQKALKEQYLFT
Sbjct: 214 ISIINHSCLPNSVLVFEGSSALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYLFT 273

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           CTCPRC K+GQ+DDIQESAILEGY+CK + C GFLLR +D KGF CQ CGL+R KEEIK+
Sbjct: 274 CTCPRCSKVGQYDDIQESAILEGYKCKSEKCGGFLLRTTDGKGFQCQGCGLIRDKEEIKR 333

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
           I +E+ +LS+   +  S   +QE +S YK IEKLQ +L+HP S+NLM TREK++K LMEL
Sbjct: 334 ITTEIKLLSEDA-SKPSATYYQEAISIYKRIEKLQTELFHPLSINLMHTREKILKSLMEL 392

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           E W EALAYC+LTIP YQRVYP  HPL GLQYYTCGKLEW+LGDTE A+KS+T+AV+ILR
Sbjct: 393 EHWTEALAYCKLTIPFYQRVYPAVHPLPGLQYYTCGKLEWYLGDTEEAVKSLTKAVDILR 452

Query: 423 ITHGTNSPFMKELILKLEEAQAEASYKLSSKD 454
           ITHGTN+PFMK+L++KLEEA+ EASYK SSK+
Sbjct: 453 ITHGTNTPFMKDLLMKLEEARTEASYKFSSKE 484


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score =  673 bits (1736), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/482 (67%), Positives = 381/482 (79%), Gaps = 43/482 (8%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           FH  G+VIISQEPYVCVP       RCDGCF+++NL KCS CQVVWYCG+ CQK +WKLH
Sbjct: 31  FH-PGDVIISQEPYVCVPTQK----RCDGCFSTTNLSKCSRCQVVWYCGTPCQKSEWKLH 85

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--- 120
           RLECQ LSRLD  KRKSVTPSIRLMLKLYLRRKLQ+  +IPST  DNY LVEALVA    
Sbjct: 86  RLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILF 145

Query: 121 ----------------------------------MSDIDEKQLLLYAQIANLVNLILQWP 146
                                             MSDI E+QL+LYAQ+ANLV+LILQWP
Sbjct: 146 IIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWP 205

Query: 147 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 206
           EI+I EIA  FSK ACNAHT+C+SELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A VR
Sbjct: 206 EINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSIINHSCLPNSVLVFDGREASVR 265

Query: 207 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 266
           A+QH+PKG EVLISYIETAGST+TRQKAL+EQYLF C CP C K+GQ++D++E+AILEGY
Sbjct: 266 ALQHIPKGTEVLISYIETAGSTVTRQKALREQYLFQCVCPLCSKVGQYEDVRENAILEGY 325

Query: 267 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 326
           RCK++ C GFLLR +D K F CQ+CGLVR KEEIK+IA+E+  L ++    +S  +H E 
Sbjct: 326 RCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEIKQIATEIKFLLEEASKPSSNDSH-EA 384

Query: 327 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 386
           +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM+LE W+EALAYC+LTIP+YQRVYP  
Sbjct: 385 ISIHKMIEKLQTKLYHPFSINLMQTRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAV 444

Query: 387 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 446
           HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+ILRITHGT +PF+KEL + L EA+AEA
Sbjct: 445 HPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEA 504

Query: 447 SY 448
           S+
Sbjct: 505 SF 506


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 301/444 (67%), Positives = 378/444 (85%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVI+SQ+PY+CVPNN+S  SRCDGCF ++NLKKCS CQVVWYCGS CQK +WKLHR EC
Sbjct: 34  GEVILSQKPYICVPNNTSLESRCDGCFKTNNLKKCSGCQVVWYCGSFCQKSEWKLHRHEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + L+RL+KEKRK VTP+IRLM+KLY++R LQN+ V+P T+TDNYSLVEALV+HMS+IDEK
Sbjct: 94  KALTRLEKEKRKFVTPTIRLMVKLYIKRNLQNEKVLPITSTDNYSLVEALVSHMSEIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           Q+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+P++SIIN
Sbjct: 154 QMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C R
Sbjct: 214 HSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCAR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C  +G+  DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++
Sbjct: 274 CSNVGKPHDIEESAILEGYRCSNEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDL 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             +S+K  A  S  N Q  +  YK IEKLQ KLYH FS+ LM+TREKL+K+LM+LE W+E
Sbjct: 334 KTVSEKAHASPSAENKQAAIELYKTIEKLQVKLYHSFSITLMRTREKLLKMLMDLEIWRE 393

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+L +PVYQRVYP  HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HGT
Sbjct: 394 ALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGT 453

Query: 428 NSPFMKELILKLEEAQAEASYKLS 451
           ++PFMKEL  KLEEA+AEASYKL+
Sbjct: 454 STPFMKELSAKLEEARAEASYKLA 477


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score =  651 bits (1679), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 298/442 (67%), Positives = 376/442 (85%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVI+SQ+PY+CVPNN+SS SRCDGCF ++NLKKCSACQVVWYCGS+CQK +WKLHR EC
Sbjct: 34  GEVILSQKPYICVPNNTSSESRCDGCFKTNNLKKCSACQVVWYCGSSCQKSEWKLHRDEC 93

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + L+RL+KEKRK VTP+IRLM++LY++R LQN+ V+P TTTDNYSLVEALV+HMS+IDEK
Sbjct: 94  KALTRLEKEKRKFVTPTIRLMVRLYIKRNLQNEKVLPITTTDNYSLVEALVSHMSEIDEK 153

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           Q+LLYAQ+ANLVNLILQ+P + + EIAENFSK +CNAH+IC+SELRP G GL+P++SIIN
Sbjct: 154 QMLLYAQMANLVNLILQFPSVDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIIN 213

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC PNAVLVFE ++AVVRA+ ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C R
Sbjct: 214 HSCSPNAVLVFEEQMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCAR 273

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C   G+  DI+ESAILEGYRC ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++
Sbjct: 274 CSNFGKPHDIEESAILEGYRCANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDL 333

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             +S+K     S  + Q  +  YK IEKLQ KLYH FS+ LM+TREKL+K+LM++E W+E
Sbjct: 334 KTVSEKAPTSPSAEDKQAAIELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLMDVEIWRE 393

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+L +PVYQRVYP  HPL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HG 
Sbjct: 394 ALNYCRLIVPVYQRVYPATHPLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGI 453

Query: 428 NSPFMKELILKLEEAQAEASYK 449
           ++PFMKEL  KLEEA+AEASYK
Sbjct: 454 STPFMKELSAKLEEARAEASYK 475


>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
          Length = 477

 Score =  604 bits (1558), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 297/482 (61%), Positives = 351/482 (72%), Gaps = 77/482 (15%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           FH  G+VIISQEPYVCVP       RCDGCF+++NL KCS CQVVWYCG+ CQK +WKLH
Sbjct: 31  FH-PGDVIISQEPYVCVPTQK----RCDGCFSTTNLSKCSRCQVVWYCGTPCQKSEWKLH 85

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--- 120
           RLECQ LSRLD  KRKSVTPSIRLMLKLYLRRKLQ+  +IPST  DNY LVEALVA    
Sbjct: 86  RLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILF 145

Query: 121 ----------------------------------MSDIDEKQLLLYAQIANLVNLILQWP 146
                                             MSDI E+QL+LYAQ+ANLV+LILQWP
Sbjct: 146 IIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWP 205

Query: 147 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 206
           EI+I EIA  FSK ACNAHT+C+SELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A+VR
Sbjct: 206 EINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSIINHSCLPNSVLVFDGREALVR 265

Query: 207 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 266
           A+QH+PKG E                                   GQ++D++E+AILEGY
Sbjct: 266 ALQHIPKGTE-----------------------------------GQYEDVRENAILEGY 290

Query: 267 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 326
           RCK++ C GFLLR +D K F CQ+CGLVR KEEIK+IA+E+  L ++    +S GN  E 
Sbjct: 291 RCKNETCDGFLLRTTDGKAFQCQECGLVRDKEEIKQIATEIKFLLEEASKPSSNGNSHEA 350

Query: 327 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 386
           +S +KMIEKLQ KLYHPFS+NLMQTRE ++K LM+LE W+EALAYC+LTIP+YQRVYP  
Sbjct: 351 ISIHKMIEKLQTKLYHPFSINLMQTRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAV 410

Query: 387 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 446
           HPLLGLQYYTCGKLEW+LGDTE AIKS+T+AV+ILRITHGT +PF+KEL + L EA+AEA
Sbjct: 411 HPLLGLQYYTCGKLEWYLGDTEEAIKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEA 470

Query: 447 SY 448
           S+
Sbjct: 471 SF 472


>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR1-like [Glycine max]
          Length = 401

 Score =  595 bits (1533), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 291/399 (72%), Positives = 345/399 (86%), Gaps = 3/399 (0%)

Query: 56  QKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           ++ +WKLHRLECQVLSRLD +KRKS+TPSIRLML+L+LRRKLQND ++PST  DNY+LVE
Sbjct: 6   ERSEWKLHRLECQVLSRLDHDKRKSLTPSIRLMLRLHLRRKLQNDKIVPSTAMDNYNLVE 65

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
           ALVA+MSDI E+QL+LYA++ANLVN ILQWPEI+I EIAENFSK ACNAHTIC+SELRP+
Sbjct: 66  ALVAYMSDITEEQLVLYAKMANLVNSILQWPEINIKEIAENFSKFACNAHTICDSELRPV 125

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           GTGLYPVISIINHSCLPN+VLVFEGR A+V  VQH+P G EV ISYIETAGSTMT+Q AL
Sbjct: 126 GTGLYPVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNAL 185

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           K    FTCTCPRC K+G++DDIQESAILEGY+CK + C GFLL     KGF CQ CGLVR
Sbjct: 186 KSSTXFTCTCPRCSKVGEYDDIQESAILEGYKCKSEKCGGFLLY---GKGFQCQGCGLVR 242

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
            KEEIK+I +E+ +LS++ +  +S  N+QE +S YK IEKLQ +LYHP SVNLMQ REK+
Sbjct: 243 DKEEIKRITTEIKLLSEEKIISSSTCNYQEAISIYKWIEKLQTELYHPLSVNLMQNREKI 302

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
           +K LMELE W EALAYC+LTIP YQRVYP  HPLLGLQYYTCGKLEW+LGDT+ A+KS+ 
Sbjct: 303 LKSLMELEHWAEALAYCKLTIPFYQRVYPAVHPLLGLQYYTCGKLEWYLGDTDEAVKSLI 362

Query: 416 EAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 454
           +AV+ILRITHGTN+PFMK+L++KLEEA+AEASY+LS K+
Sbjct: 363 KAVDILRITHGTNTPFMKDLLMKLEEARAEASYRLSPKE 401


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score =  594 bits (1532), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 277/448 (61%), Positives = 352/448 (78%), Gaps = 6/448 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V
Sbjct: 277 CVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDV 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
            +LS K  +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ 
Sbjct: 332 LLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQT 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT
Sbjct: 392 ALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGT 451

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
            S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 452 KSEFMKELLGKLEEVRAEASFRLSAGDE 479


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score =  588 bits (1516), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 276/449 (61%), Positives = 352/449 (78%), Gaps = 7/449 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V
Sbjct: 277 CVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDV 331

Query: 308 NILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 366
            +LS K  +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+
Sbjct: 332 LLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQ 391

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
            AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHG
Sbjct: 392 TALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHG 451

Query: 427 TNSPFMKELILKLEEAQAEASYKLSSKDE 455
           T S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 452 TKSEFMKELLGKLEEVRAEASFRLSAGDE 480


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score =  585 bits (1508), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 275/449 (61%), Positives = 351/449 (78%), Gaps = 7/449 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELR LGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRSLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V
Sbjct: 277 CVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDV 331

Query: 308 NILSKKTLALTSCG-NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 366
            +LS K  +L S G ++ EV S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+
Sbjct: 332 LLLSDKVSSLVSSGIDNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQ 391

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
            AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHG
Sbjct: 392 TALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHG 451

Query: 427 TNSPFMKELILKLEEAQAEASYKLSSKDE 455
           T S FMKEL+ KLEE +AEAS++LS+ DE
Sbjct: 452 TKSEFMKELLGKLEEVRAEASFRLSAGDE 480


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 270/448 (60%), Positives = 341/448 (76%), Gaps = 6/448 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVI+  EPY   PN     S CD CF SSNL+KCS C+V WYCGS CQK +WKLH+LEC
Sbjct: 38  GEVILCGEPYASTPNKILVGSNCDHCFTSSNLRKCSVCRVAWYCGSVCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + +S L  +++K +TP+IRLM++L LRRKLQN+ VIPST TDNY LV+AL +H+S++D+K
Sbjct: 98  RAISALTDDRKKMLTPTIRLMVRLILRRKLQNEKVIPSTGTDNYGLVDALESHISEVDDK 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + E A  FSK +CNAHTIC+ ELRP+GTGL+P IS IN
Sbjct: 158 QLVLYAQMANLVQLILPAIELDLKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q +    EV ISYIETA +T+ R   LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLLFEGRTAYVRALQPLSNNTEVSISYIETAATTLKRHNDLK-HYFFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           CIK       +E  +LEGYRCKD  C GFLL DS  K +TCQ+C + R  EE+KK++SE+
Sbjct: 277 CIK-----GSEEDPLLEGYRCKDQNCDGFLLPDSGKKAYTCQKCSMCRDGEEVKKLSSEI 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
            +LS K  +  S GN+ E  S YK+IE+L++KLYH FS+ L+ TRE L+K+ MEL+DW+ 
Sbjct: 332 LLLSDKASSFVSSGNNNEAGSVYKIIEQLERKLYHSFSITLLHTRETLLKLYMELQDWRS 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+ TIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A ++LRITHGT
Sbjct: 392 ALMYCRSTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSLTRAADVLRITHGT 451

Query: 428 NSPFMKELILKLEEAQAEASYKLSSKDE 455
           NS FMKEL  KLEEA+AE S++LS  +E
Sbjct: 452 NSQFMKELFGKLEEARAEVSFRLSPGNE 479


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 264/446 (59%), Positives = 340/446 (76%), Gaps = 6/446 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+VI++QEPY   PN     S CD CF S NL+KCS CQV WYC +NCQK +WKLH+LEC
Sbjct: 38  GDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCQVTWYCSTNCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L L+RKLQN+ VIPS++ DNY+LV+AL +H+  +DE 
Sbjct: 98  RAMAALTEDRKKMLTPTIRLMVRLVLKRKLQNEKVIPSSSIDNYNLVDALESHIWKVDEN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
            L+LYAQ+ANLV+LIL   E+ + EIA  FSK ACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 158 HLVLYAQMANLVSLILPLIELDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTC+CPR
Sbjct: 218 HSCVPNAVLIFDGRTAYVRALQPIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTCSCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K       +E A+LEG+RCK+  C GFLL DS  K +TCQ+C + R +EEI+K+ SE+
Sbjct: 277 CVK-----GSEEDALLEGFRCKNQTCDGFLLPDSGKKAYTCQKCSVSRDEEEIQKMRSEI 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             LS K  +  S GN  E  S YK IE+L++ LYH FS  L+ T E L+KI MEL+DW+ 
Sbjct: 332 LQLSDKASSFLSSGNKAEAGSVYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQDWRT 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           ALAYC+LT+PVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++THGT
Sbjct: 392 ALAYCRLTVPVYERVYPPFHPMIGLQFYTCGKLEWLLECTEDALKSLTRATDILKVTHGT 451

Query: 428 NSPFMKELILKLEEAQAEASYKLSSK 453
            S FMKEL  KLEEA+AE S+++SS+
Sbjct: 452 KSQFMKELFGKLEEARAEVSFRISSR 477


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 267/446 (59%), Positives = 340/446 (76%), Gaps = 6/446 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+VI++QEPY   PN     S CD CF S NL+KCS C+V WYC SNCQK +WKLH+LEC
Sbjct: 38  GDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCRVTWYCSSNCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE 
Sbjct: 98  RAMAALTEDRKKMLTPTIRLMVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDEN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 158 QLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPR
Sbjct: 218 HSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K   FD   E A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEIK + SE+
Sbjct: 277 CVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEI 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  
Sbjct: 332 LQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWT 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +IL++THG 
Sbjct: 392 ALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADILKVTHGV 451

Query: 428 NSPFMKELILKLEEAQAEASYKLSSK 453
            S FMKEL  KLEEA+AE S++LSS+
Sbjct: 452 KSQFMKELFGKLEEARAEVSFRLSSR 477


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score =  560 bits (1444), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 269/452 (59%), Positives = 341/452 (75%), Gaps = 7/452 (1%)

Query: 2   FSFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWK 61
           F+F   G+VI++QEPY   PN     S CD CF S NL+KCS C+V WYC SNCQK +WK
Sbjct: 33  FTF-FPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCRVTWYCSSNCQKEEWK 91

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           LH+LEC+ ++ L + ++K +TP+IRLM++L L+RKLQN+ VIPS++ DNY+LV+AL +H+
Sbjct: 92  LHQLECRAMAALTENRKKMLTPTIRLMVRLALKRKLQNEKVIPSSSIDNYNLVDALESHI 151

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           S +DE QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYP
Sbjct: 152 SKVDENQLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYP 211

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           VISIINHSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY F
Sbjct: 212 VISIINHSCVPNAVLIFDGRTAYVRALQPIDKDEEVSISYIETAAVTKKRNNDLK-QYFF 270

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
           TCTCPRC+K   FD   E  +LEG+RCK+  C GFLL  S  K +TCQ+CG  R  EEIK
Sbjct: 271 TCTCPRCVK--GFD---EDPLLEGFRCKNQTCDGFLLPSSGKKAYTCQKCGASRDVEEIK 325

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
            + SE+  LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +E
Sbjct: 326 NMRSEILQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLE 385

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           L+DW  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A +IL
Sbjct: 386 LQDWWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWLLEFTEDALKSLTRAADIL 445

Query: 422 RITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
           ++THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 446 KVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 477


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 243/394 (61%), Positives = 309/394 (78%), Gaps = 6/394 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V
Sbjct: 277 CVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDV 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
            +LS K  +L S GN+ EV S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ 
Sbjct: 332 LLLSDKVSSLVSSGNNSEVGSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQT 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
           AL YC+LTIPVY+R+YP FHP++GLQ+YTCGKLE
Sbjct: 392 ALMYCRLTIPVYERIYPPFHPMIGLQFYTCGKLE 425


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 237/396 (59%), Positives = 301/396 (76%), Gaps = 6/396 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+VI++QEPY   PN     S CD CF S NL+KCS C+V WYC SNCQK +WKLH+LEC
Sbjct: 38  GDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCRVTWYCSSNCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE 
Sbjct: 98  RAMAALTEDRKKMLTPTIRLMVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDEN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 158 QLVLYAQMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPR
Sbjct: 218 HSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K   FD   E A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEIK + SE+
Sbjct: 277 CVK--GFD---EDALLEGFRCKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEI 331

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             LS K  +  S GN  E  S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  
Sbjct: 332 LQLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWT 391

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
           AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW+
Sbjct: 392 ALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEWY 427


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 212/450 (47%), Positives = 304/450 (67%), Gaps = 2/450 (0%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           +FH  GEVI+ QEPYVC  ++SS   RCD C+  SSNLK+CSAC+ V+YC +NCQ+  W 
Sbjct: 36  AFH-RGEVILQQEPYVCALDDSSQALRCDRCYRQSSNLKRCSACKTVFYCCANCQRSGWG 94

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           LH+ EC +L RL KEK++S TPS+RL+++  ++R+LQ + V      DNY + E L  HM
Sbjct: 95  LHKFECAILVRLLKEKKRSPTPSLRLVMRFLIKRRLQAERVQLKAAVDNYDVCEVLPTHM 154

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           S+  E +L++YAQ+A++V  ++   E+++ E+ +   + ACNAHTIC+ E+RPLGTGLYP
Sbjct: 155 SETSEDRLVMYAQMASIVQQMMAPDEVNVKEVTQMICRFACNAHTICDEEVRPLGTGLYP 214

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           VISI+NHSC+PNAVL F+G  A +RA++   +G E+ ISY+E A ST TR+KAL++QY F
Sbjct: 215 VISIVNHSCVPNAVLHFDGNRAALRALEDTQEGTEITISYVELAASTNTRRKALRDQYYF 274

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
            C C RC +L   +  +E A LEGY C +  C+G L+ D       C+ CGL R  ++ K
Sbjct: 275 DCNCIRCSRLVTREGSREDAFLEGYGCVNSDCNGPLIEDPGSDKVICEVCGLKREVQQTK 334

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
             A EV +   +   L + G  +     Y  +E  Q+KL+HP+SV L++T + L+KI M+
Sbjct: 335 SAAKEVELDVLEASNLYAAGKLESARRLYSEVEAKQRKLWHPYSVPLLRTHDALLKICMD 394

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           +EDW  AL +CQ TIP Y+R YP F PLLGLQY+T GKL+W LGD+  A+ ++ +A  ++
Sbjct: 395 MEDWASALEFCQSTIPAYERAYPPFSPLLGLQYFTLGKLQWLLGDSAKAVNTLRKAYTVI 454

Query: 422 RITHGTNSPFMKELILKLEEAQAEASYKLS 451
           ++THG+ S  +  L   L+EAQAE +YK S
Sbjct: 455 QVTHGSKSELLHGLTSTLQEAQAEVAYKRS 484


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 208/443 (46%), Positives = 284/443 (64%), Gaps = 60/443 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE+++ Q+PYV V +++S   RCD CF   +NLK+CS C+  WYCG  CQ+  W+LH+ E
Sbjct: 47  GEIVLLQDPYVSVLDSASVNKRCDVCFRLCTNLKRCSVCKTTWYCGGTCQRNGWRLHQHE 106

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ ++ L +EK+++ TPS    L+L LR                                
Sbjct: 107 CKAITSLKEEKQQTPTPS----LQLMLR-------------------------------- 130

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 186
             LL+  ++ N                    +  +CNAHTIC+SELRP+GTGLYPVISII
Sbjct: 131 --LLIKRKLQN--------------------AGFSCNAHTICDSELRPMGTGLYPVISII 168

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
           NHSC PNAVL+FEGR AVVRAV+ + +G+E+ +SYIE A ST +R+K+LKEQY F C C 
Sbjct: 169 NHSCFPNAVLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKKSLKEQYFFDCKCL 228

Query: 247 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD-KGFTCQQCGLVRSKEEIKKIAS 305
           RC+K+   D + E AILEG+RC  D C GFLL D DD +   CQ CG  R++EE KK A 
Sbjct: 229 RCLKVDTPDGLHEDAILEGFRCSSDHCEGFLLHDPDDAQSLVCQLCGCGRNEEETKKQAR 288

Query: 306 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 365
           +V+ L K+   L S GN+ E  S Y+ I++LQ +L+HP+SV L++T + L+KI MEL DW
Sbjct: 289 KVDKLGKEASKLLSSGNYSEARSLYEQIQQLQTQLWHPYSVILLRTGDTLLKICMELYDW 348

Query: 366 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 425
           K+AL YC+LTIP Y+R YP  HP++GLQYY CGKLEWFL +T  A+    +A +IL +TH
Sbjct: 349 KQALKYCRLTIPAYERAYPTCHPMMGLQYYACGKLEWFLENTLEALNFFEKAAKILTVTH 408

Query: 426 GTNSPFMKELILKLEEAQAEASY 448
           G NS F+ +L  +++EA AEA++
Sbjct: 409 GRNSEFLTQLFDRIQEAHAEAAH 431


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 190/315 (60%), Positives = 242/315 (76%), Gaps = 6/315 (1%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+FEGR A VRA+Q + K  EV ISYIETA +TM RQ  LK  Y FTCTCPR
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPR 276

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K     D +E A+LEGYRC D  C GFLL ++ +KG+TCQ+C   R  EE++K+AS+V
Sbjct: 277 CVK-----DSEEDALLEGYRCNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDV 331

Query: 308 NILSKKTLALTSCGN 322
            +LS K  +L S G+
Sbjct: 332 LLLSDKVSSLVSSGS 346


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 216/451 (47%), Positives = 276/451 (61%), Gaps = 88/451 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+VI++QEPY   PN     S CD CF S NL+KCS C+V WYC SNCQK +WKLH+LEC
Sbjct: 38  GDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCRVTWYCSSNCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L L+RKLQN+                           
Sbjct: 98  RAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE--------------------------- 130

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
                 ++ANLV+LIL + E+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIIN
Sbjct: 131 ------KMANLVSLILPFIELDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIIN 184

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC+PNAVL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPR
Sbjct: 185 HSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPR 243

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C+K   FD   E A+LEG+RCK+  C GFLL +SD                         
Sbjct: 244 CVK--GFD---EDALLEGFRCKNQACDGFLLPNSDK------------------------ 274

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
                            E  S YK+IE+L++ LYH FS  L+ T E L+KI +EL+DW  
Sbjct: 275 ----------------AEAGSIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWT 318

Query: 368 ALAYCQLTIPVYQR-----VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           AL YC+LTIPVY+        P   P    + Y  G LE+    TE+A+KS+T A +IL+
Sbjct: 319 ALTYCRLTIPVYESKAWRLKIPFPRPRTVREAYGTGLLEF----TEDALKSLTRAADILK 374

Query: 423 ITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
           +THG  S FMKEL  KLEEA+AE S++LSS+
Sbjct: 375 VTHGVKSQFMKELFGKLEEARAEVSFRLSSR 405


>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
 gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
          Length = 453

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 185/445 (41%), Positives = 286/445 (64%), Gaps = 11/445 (2%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+ ++ Q+PYV V +  S  +RCD CF  S  LK+CSAC+ V+YC   CQ+  WK+H+ E
Sbjct: 4   GQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALKRCSACKFVFYCSPTCQRSQWKIHQEE 63

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+VL RL  E+++   P++ LML+L ++R+LQ   V+P T  DNY +V AL  H S+  +
Sbjct: 64  CKVLVRLSTEQKR---PTLLLMLRLLVKRELQATGVLPVTALDNYEIVRALPTHFSETGD 120

Query: 127 KQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           ++L++YAQ+A L+  IL  ++ E  + EI ++  +++CN HTIC+ ELRP+G GL+PV+S
Sbjct: 121 ERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIGLFPVVS 179

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           +INHSC  N++L+F+G+ AVVRA+  + +G EV +SYIE   ST +R++AL +QY F C 
Sbjct: 180 VINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHCK 239

Query: 245 CPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 303
           CPRC  +  Q   + +  +LE   C D  C  F+     +  + C  CG  R   E+ K+
Sbjct: 240 CPRCTDVSPQAHGLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREPNEVNKL 297

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
           ++E   + +K   L + G+       ++  EKLQ +L++P SV LM+TR+ L+++ + LE
Sbjct: 298 STEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLE 357

Query: 364 DWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           DW  AL  C+LT+P Y+  YP   HPLLGLQYYT GKLE   G    A+++  +A +IL 
Sbjct: 358 DWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILS 417

Query: 423 ITHGTNSPFMKELILKLEEAQAEAS 447
           +THG+   F+++L  +L++A+  AS
Sbjct: 418 VTHGSRGEFVRKLRNELDQAKMVAS 442


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 184/445 (41%), Positives = 286/445 (64%), Gaps = 11/445 (2%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+ ++ Q+PYV V +  S  +RCD CF  S  L++CSAC+ V+YC   CQ+  WK+H+ 
Sbjct: 36  AGQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALRRCSACKFVFYCSPTCQRSQWKIHQE 95

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC+VL RL  E+++   P++ LML+L ++R+LQ   V+P T  DNY +V AL  H S+  
Sbjct: 96  ECKVLVRLSTEQKR---PTLLLMLRLLVKRELQATGVLPVTALDNYEIVRALPTHFSETG 152

Query: 126 EKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
           +++L++YAQ+A L+  IL  ++ E  + EI ++  +++CN HTIC+ ELRP+G GL+PV+
Sbjct: 153 DERLVMYAQMAVLIKTILNARYAE-DVKEITKDICRISCNGHTICDDELRPVGIGLFPVV 211

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           SIINHSC  N++L+F+G+ AVVRA+  + +G EV +SYIE   ST +R++AL +QY F C
Sbjct: 212 SIINHSCSSNSLLLFDGKHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHC 271

Query: 244 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 303
            CPRC    +   + +  +LE   C D  C  F+     +  + C  CG  R   E+ K+
Sbjct: 272 KCPRCTDDSEA-GLYKDDVLEAVACLDPACESFM--RLSNGSWRCMSCGSSREPNEVNKL 328

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
           ++E   + +K   L + G+       ++  EKLQ +L++P SV LM+TR+ L+++ + LE
Sbjct: 329 STEAEGMIEKANRLQAAGDLHGARMAFQQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLE 388

Query: 364 DWKEALAYCQLTIPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           DW  AL  C+LT+P Y+  YP   HPLLGLQYYT GKLE   G    A+++  +A +IL 
Sbjct: 389 DWDLALHVCRLTLPAYETAYPGSKHPLLGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILS 448

Query: 423 ITHGTNSPFMKELILKLEEAQAEAS 447
           +THG+   F+++L  +L++A+  AS
Sbjct: 449 VTHGSRGEFVRKLRNELDQAKMVAS 473


>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 282

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 133/211 (63%), Positives = 170/211 (80%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 38  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H+S++D+ 
Sbjct: 98  RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ ELRPLGTGLYPV+SIIN
Sbjct: 158 QLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIIN 217

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 218
           HSC+PNAVL+FEGR A VRA+Q + K  EV+
Sbjct: 218 HSCVPNAVLIFEGRTAYVRALQPISKNEEVM 248


>gi|222625640|gb|EEE59772.1| hypothetical protein OsJ_12271 [Oryza sativa Japonica Group]
          Length = 384

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 119/235 (50%), Positives = 157/235 (66%), Gaps = 39/235 (16%)

Query: 53  SNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           S+C + +WKLH+LEC+ ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+
Sbjct: 95  SSCLREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYN 154

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
           LV+AL +H+S++D+ QL+LYAQ+ANLV LIL   E+ + EI   FSK ACNAHTIC+ EL
Sbjct: 155 LVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEITHTFSKFACNAHTICDPEL 214

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           RPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA+Q + K  E                
Sbjct: 215 RPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPISKNEE---------------- 258

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
                                  D +E A+LEGYRC D  C GFLL ++D  G T
Sbjct: 259 -----------------------DSEEDALLEGYRCNDQKCDGFLLPNADIYGVT 290



 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 53/80 (66%), Positives = 67/80 (83%)

Query: 376 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            PVY+R+YP FHP++GLQ+YTCGKLEW L  TE+A+ S+T A +ILRITHGT S FMKEL
Sbjct: 304 FPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKEL 363

Query: 436 ILKLEEAQAEASYKLSSKDE 455
           + KLEE +AEAS++LS+ DE
Sbjct: 364 LGKLEEVRAEASFRLSAGDE 383


>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
 gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
          Length = 463

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 136/437 (31%), Positives = 226/437 (51%), Gaps = 28/437 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G ++ + +PY  V  NS    RCD CFA   ++ +CS C+   YC   CQK  W  H+ E
Sbjct: 26  GNLVRAADPYAYVLCNSERGKRCDFCFARKDDMSRCSGCKFARYCDGKCQKAAWTEHKSE 85

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ +  +  E   + T SIRL+ ++  + K  +  V P  + D       L +++ ++ E
Sbjct: 86  CKSIKTVKPE---TPTDSIRLIARIINKTKTDSPGV-PGNSIDE------LQSNLREMPE 135

Query: 127 KQLLLYAQIANLVNLILQWPEIS-INEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
               ++AQ+A ++ + +    +    EI E F ++ CN  +IC+ E++ +G G+YP +S+
Sbjct: 136 NVKEMFAQLAVVLRMYVGKDVMDDAREIFELFGRMTCNTFSICDPEMQYIGIGIYPKMSL 195

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
            NHSC PN V VF G    VRA+Q++  G E+LISY+E    +  R++ L +QY FTC C
Sbjct: 196 FNHSCEPNCVAVFNGLRMEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKC 255

Query: 246 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 305
           PRC       D  +  ++   +C +  C   +++  D    TC+ C      E+  K   
Sbjct: 256 PRC------QDQTKDGMMMAVKCGNINCKKVIIQ-VDGAYETCKVC--CHDNEKDPKFWK 306

Query: 306 EVNILSKKT----LALTSCGNHQEVVSTYKMIEKL---QKKLYHPFSVNLMQTREKLIKI 358
           EVN +++ +     A+TS    QE     ++ E+L   Q+++ H   + +++  +K +  
Sbjct: 307 EVNKVTQFSEEMLAAITSADQRQEPKEGLRLAERLLERQQQVLHNNHLFVLKCLDKALDQ 366

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + L  W  AL     TI  Y+  +P +HP LG+QY   GKL  +L     A++++  A 
Sbjct: 367 AVVLRKWNRALRCALQTIEPYKVHFPAYHPSLGIQYMRIGKLLLYLEKRLAALEALQMAE 426

Query: 419 EILRITHGTNSPFMKEL 435
            IL +THG + P  KEL
Sbjct: 427 HILNVTHGKDHPINKEL 443


>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
          Length = 520

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 168/495 (33%), Positives = 221/495 (44%), Gaps = 150/495 (30%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEVII QEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQK +WKLH+LEC
Sbjct: 38  GEVIICQEPYASTPNRISVGSSCDHCFASGNLRKCSVCRVAWYCGSVCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           Q +S L +E++K +TP+IRLM+KL LRRKLQ++                           
Sbjct: 98  QAISALTEERKKMLTPTIRLMVKLILRRKLQSE--------------------------- 130

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL-RPLGTGLYPVISII 186
                 ++ANLV LIL   E+ + EI          AHT   S +   +G     V+ I 
Sbjct: 131 ------KMANLVQLILPSIELDLKEI----------AHTFSKSFVCTEVGIAYCHVLEI- 173

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
                          LAV+ +++  P G    +  +               Q  F C   
Sbjct: 174 ---------------LAVIGSLRPNPLGLGSAVIAVHPT------------QVQFACNAH 206

Query: 247 -------RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ-CGLVRSKE 298
                  R +  G F  I     +  +R      S    RD+D +  +    C L R + 
Sbjct: 207 TICDPELRPLGTGLFPAIS----IINHRILRKMLS---WRDTDARTKSVMAFCCLTRERR 259

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK- 357
            I   A  V  L  K     S  ++ E  S YK IE+L++K YH FS  L+ TRE L+K 
Sbjct: 260 LIH--ARNVAFLEMKK----SNIDNSEAGSVYKTIEQLEQKHYHSFSTTLLHTRETLLKA 313

Query: 358 --------------------------ILMELEDWKEALAYCQL-------TIPV------ 378
                                     +L    +    LAY  L       +IP+      
Sbjct: 314 WRANVKTSQSFGYETHLGESSTRMGDLLGSPRERVSYLAYPNLFGIKGFVSIPLLQIYME 373

Query: 379 -----------------YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
                            Y+RVYP FHP++GLQ+YTCGKLEW L  TE+A+KS+T A ++L
Sbjct: 374 LQDWQTALMYCRLTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSLTRAADVL 433

Query: 422 RITHGTNSPFMKELI 436
           RITHGT S FMKEL 
Sbjct: 434 RITHGTQSQFMKELF 448


>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
          Length = 441

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 126/449 (28%), Positives = 227/449 (50%), Gaps = 43/449 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS----NLKKCSACQVVWYCGSNCQKLDWKLH 63
           GE  + ++P+V V + +   SRCD CF        + +CS+C+VV +C   CQ   W +H
Sbjct: 6   GETWVKEDPFVHVLSQTERDSRCDFCFRRPENDIQIPRCSSCKVVRFCNRKCQSSAWSVH 65

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
           + EC+ L R+     K  T S+RL+ +L ++ +   ++V       N++    L++H+ +
Sbjct: 66  KKECRSLKRV---APKIPTDSVRLIFRLLIKLESDIEDVEVFGKKRNWA---DLISHVDE 119

Query: 124 IDEKQ------LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGT 177
           + E Q      ++L   + +    ++  P  S+ E+   F ++  N+ +IC+ E+ P+G 
Sbjct: 120 VQEDQIRLQQFMVLMTTLKSFSENVMSMP--SVEELFVIFGRVCVNSFSICDPEMNPIGV 177

Query: 178 GLY----PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
           G+Y       S+++HSC PNAV VFEG    +R V+ V    ++ ISYI+T   T TR++
Sbjct: 178 GVYIRQIKPCSVLDHSCRPNAVAVFEGTTLRIRCVEPVDSEQDLRISYIDTLDDTTTRRR 237

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG- 292
            L++QY F C C      G+  D ++  I   + C    C G + +   D  F+C  CG 
Sbjct: 238 NLQQQYYFNCLC------GECKDSEKDLIKFSFNCTSVECKGHVTQVPADDRFSCDICGT 291

Query: 293 ------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
                 L ++ E+ +KI   +N L K+         H+ +           + + HP++V
Sbjct: 292 TVDDVNLNQAAEKAQKIIKHLNELKKQR-------EHESIRMKGLSCVSEMRAILHPYNV 344

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
           + ++  +  +   +E   W +AL   +LT+  Y+  Y + HP +G+  +  GK+  +L D
Sbjct: 345 HFIKLCDLSLDACIETSQWDDALELGRLTVVAYRFYYGELHPSVGILLFKIGKILSYLAD 404

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E A++ + +A  +LR+THG +    K+L
Sbjct: 405 RE-ALQFLKQAATVLRVTHGDDHSLYKDL 432


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 141/460 (30%), Positives = 228/460 (49%), Gaps = 62/460 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++IS EP  C  +N  +   C  CF    +L +CS C++  YC + CQK  W  H+ E
Sbjct: 27  GELVISAEPLACCVSNRLAKEVCHHCFTRHESLLRCSQCKMARYCNTTCQKQAWSGHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS-TTTDNYSLVEALVAHMSDID 125
           C+ L  L     +  T S+RL  KL          + PS ++++    +E   +H++ + 
Sbjct: 87  CKCLQSLLP---RLPTDSVRLAAKLIFAL------LSPSKSSSEELYTLEEHESHLTSMP 137

Query: 126 EKQLLLYAQIANLVNLILQW--PEI----------SINEIAENFSKLACNAHTICNSELR 173
           E++    +Q+A+++ L LQ   P++          S  E     +K+ CN  TI + EL+
Sbjct: 138 EQKKQGLSQLASMLELYLQQEAPDLLQEMGSALPPSCREPLSLIAKVTCNCFTISDGELQ 197

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
            +G GLYP +S++NH C PN V+VFEG    +RAV+ +    E+ ISYIET   T  RQ+
Sbjct: 198 EIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDIDPEDELTISYIETLSLTEDRQR 257

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            L+EQY FTC C RC                  R KD    G +L   + K    ++  L
Sbjct: 258 QLEEQYHFTCHCQRC----------------DSRDKD----GLMLSGEEGKWRLLKE-AL 296

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
            R  E +K  ++   +L   +  L+S G      + YK+                 +  +
Sbjct: 297 PR-LEGLKAESNWETLLESCSHLLSSVGGEVPEENLYKL-----------------RITD 338

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
             +   + L  W+EA++Y   T+P Y++ YP  HP+ G+Q    GKL+ +L + E+A+ +
Sbjct: 339 MALDASVHLGRWEEAMSYGVTTLPAYRQYYPDPHPVHGIQLMRVGKLQHYLENIEDALDT 398

Query: 414 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
             +A EI++ THG + P   EL +K++E  +E  +  SS+
Sbjct: 399 FRQAYEIVKFTHGEDHPLTTELTMKMKECHSEMDHHSSSR 438


>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
 gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
          Length = 453

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 135/463 (29%), Positives = 226/463 (48%), Gaps = 24/463 (5%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           + H  G+VI+ ++P+ CV +     SRCD CF  + + KCS C  V YCG +CQK  W  
Sbjct: 4   TIHRRGDVILQEKPFACVLDPRYRDSRCDRCFKETKVMKCSNCLYVRYCGRSCQKEAWSD 63

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ-NDNVIPSTTTDNYSLVEALVAHM 121
           H+ EC+ L  L       V PS  LM+   +RR L+  D      T+  Y     L+ H 
Sbjct: 64  HKEECEKLKALPP---GLVVPSAALMIARIVRRLLKGGDTHKGYYTSKQYRKFCDLMPHE 120

Query: 122 SDI--DEKQLLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +I  D K++  +  +   L  L+ +    +  E+   + K+  N   I ++E+  +GTG
Sbjct: 121 ENIRADSKRMEHFGTLYVVLQRLLDEASRPTKAELLRIYGKMCINTFNILDAEMSTIGTG 180

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG----AEVLISYIETAGSTMTRQKA 234
           +Y   SII+HSC PN V+ F+G    +R ++  P+      ++ ISYI+   +   RQ+ 
Sbjct: 181 MYIGASIIDHSCRPNVVVSFDGETLRMRLLEDYPEQELDFGKLFISYIDLIDTAEVRQEQ 240

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           L E+Y F C C RC       D QE   +    C +  C   L     ++   C  CG  
Sbjct: 241 LAERYYFHCACERC------RDEQEQKRMNAAACPNTTCHEPLDFSDSEQLNQCPACGTA 294

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
            +  + +  A E++  ++  LA      + +V    ++  + Q  + H ++V+ ++T + 
Sbjct: 295 VTHSDREAFA-EISSFTRDHLAQMKSVAYLDVS---RLCLEKQANVLHRYNVHHIKTLDN 350

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
            ++  + LE W EA  Y    +  +++ Y  +HPLLGL Y   GKL+ +      A+K +
Sbjct: 351 AMESALNLEKWTEATGYGLRLLDGFRQYYSTYHPLLGLTYLKVGKLQLYQCQFAEALKQL 410

Query: 415 TEAVEILRITHGTNSPFMKELILKL--EEAQAEASYKLSSKDE 455
            +A +ILR+THG      K +++ L  + AQ +  + L+  DE
Sbjct: 411 QQAAKILRVTHGEQDDLYKRVLVPLLCDAAQGDLGH-LAIADE 452


>gi|413933332|gb|AFW67883.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 190

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 86/153 (56%), Positives = 120/153 (78%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+VI++QEPY   PN     S CD CF S NL+KCS C+V WYC SNCQK +WKLH+LEC
Sbjct: 38  GDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCRVTWYCSSNCQKEEWKLHQLEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           + ++ L ++++K +TP+IRLM++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE 
Sbjct: 98  RAMAALTEDRKKMLTPTIRLMVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDEN 157

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFSKL 160
           QL+LYAQ+ANLV+LIL + E+ + EIA+ FSK 
Sbjct: 158 QLVLYAQMANLVSLILPFIELDLKEIAQTFSKF 190


>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           impatiens]
          Length = 439

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 129/439 (29%), Positives = 218/439 (49%), Gaps = 29/439 (6%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +F   G  I++ +P+  V  +     RCD CF S  L +CSACQ V+YC  +CQ++ W +
Sbjct: 6   NFIKKGTTILTAKPFAYVLCSKYKNVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPM 65

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H  EC   +RL K     ++   RLM ++ ++     D         +Y   + L++H S
Sbjct: 66  HSKEC---ARLKKCSPWGISNVARLMARIIIKLNQGGDEERGYYNETSYRKFKDLMSHCS 122

Query: 123 DI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGL 179
           +I  DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +I NS++  +G G+
Sbjct: 123 EIEKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNSDMN-IGVGI 181

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKE 237
           Y   SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+   +T  R+  L+ 
Sbjct: 182 YLGPSILDHSCKPNAVATFEGTTITVKAIEDLPSLDLSQIRIPYIDVIKTTRDRRAELQS 241

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
            Y F C C RC K           + E   C +  C+     ++D     C++C   +  
Sbjct: 242 SYYFWCDCERCEK--------PEPMAEAAACPNKLCTYPCAPNAD----LCEKCN-TKFP 288

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           E  K+I  E++  +   L      N + +  +   KM    Q+ + HP +V  +QT +  
Sbjct: 289 ENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTA 343

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
               + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE +L   + AIK++ 
Sbjct: 344 FDSSINLQHWEEAESYAKKLINGYLTYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLR 403

Query: 416 EAVEILRITHGTNSPFMKE 434
           +A  IL ITHG     + E
Sbjct: 404 KASSILTITHGAQHSVIVE 422


>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 445

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 138/464 (29%), Positives = 225/464 (48%), Gaps = 37/464 (7%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           FH  G +I+ + P+ CV  +     RCD CF    + KCS+C  V YC   CQK  W  H
Sbjct: 5   FHKQGSLILKETPFACVLQSRYRAERCDKCFKPGKVLKCSSCLYVRYCNRLCQKEAWPDH 64

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMS 122
           + EC  L    KE    V P   LM+   +R+ L+  +V+    TD  Y     L+ H  
Sbjct: 65  QEECGKL----KEIGDRVVPDAGLMMSRIIRKLLKGGDVMKGYYTDKCYRRFWDLMPHEE 120

Query: 123 DI--DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGL 179
           DI  DEK++  +  +  ++  ++    +  N E+   F K+  N+  I + E+  +GTG+
Sbjct: 121 DIKKDEKRMEHFQSLTVVLRSLIDEAAMPGNQELLRIFGKMCINSFNILDDEMNSIGTGM 180

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQKA 234
           Y   SI++HSC PNAV +F+G    VR ++    GAE+      ISYI+    T  R+  
Sbjct: 181 YLGASIMDHSCRPNAVAIFDGCNLNVRLLEDY-HGAEIDFSKIFISYIDLLNPTDVRRDM 239

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGL 293
           L+++Y F C C RC       D QE  ++ G  C +  C   + +    D+   C     
Sbjct: 240 LRKRYYFECGCERC------RDEQELKLMNGAACANAQCDEPISMTARQDRCPGCSTAIK 293

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL----QKKLYHPFSVNLM 349
              +++ ++I++           +T     ++V  TY  I +L    Q+K++HPF+V  +
Sbjct: 294 QTERDKFREISA---------FTMTQLEQMKDV--TYLDICQLCLSKQEKVFHPFNVWYL 342

Query: 350 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +T +   +  +E+E W++A+ Y       ++R    FHPL  L     GK+E +L   + 
Sbjct: 343 KTLDLAFESAIEMEKWEDAIDYGSRLKDGFKRFNGAFHPLYALLLLKLGKIEIYLKHGKE 402

Query: 410 AIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
           A+ ++ E+ +ILRITHG      K+ ++ L   QA   Y+   K
Sbjct: 403 ALMNVNESEKILRITHGEEHDLYKKQLIPL-LCQAATEYEQMDK 445


>gi|326496881|dbj|BAJ98467.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 153

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 86/140 (61%), Positives = 111/140 (79%)

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 375
            L S  ++ E  S YK+IE+L++K YH FS+ L+ TRE L+KI MEL+DW+ AL YC+LT
Sbjct: 11  TLISDTDNSEAGSVYKIIEQLEQKHYHSFSITLLHTRETLLKIYMELQDWQTALMYCRLT 70

Query: 376 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           IPVY+RVYP FHP++GLQ+YTCGKLEW L  TE+A+K +T A ++LRITHGT S FMKEL
Sbjct: 71  IPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKLLTRAADVLRITHGTQSQFMKEL 130

Query: 436 ILKLEEAQAEASYKLSSKDE 455
             KLEEA+AE S++LSS +E
Sbjct: 131 FGKLEEARAEVSFRLSSGEE 150


>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           terrestris]
          Length = 439

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 212/432 (49%), Gaps = 29/432 (6%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +F   G  I++ +P+  V  +     RCD CF S  L +CSACQ V+YC  +CQ++ W +
Sbjct: 6   NFIKKGTTILTAKPFAYVLCSKYKNVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPM 65

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H  EC   +RL K     ++   RLM ++ ++     D         NY   + L++H S
Sbjct: 66  HSKEC---ARLKKFSPWGISNVARLMARIIIKLNQGGDEEKGYYNETNYRKFKDLMSHCS 122

Query: 123 DI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHTICNSELRPLGTGL 179
           +I  DEK++  +  + N+++  L+  P  S  E+   + ++  N+ +I N ++  +G G+
Sbjct: 123 EIKKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSIFNLDMN-IGVGI 181

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIETAGSTMTRQKALKE 237
           Y   SI++HSC PNAV  FEG    V+A++ +P    +++ I YI+   +   R+  L+ 
Sbjct: 182 YLGPSILDHSCKPNAVATFEGTTINVKAIEDLPSLDLSQIRIPYIDVIKTAGDRRAELQS 241

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
            Y F C C +C K           + E   C +  C+       D     C++C   +  
Sbjct: 242 SYYFWCDCEKCEK--------PEPMAEAAACPNKLCT----YPCDPNADLCEKCN-TKFP 288

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           E  K+I  E++  +   L      N + +  +   KM    Q+ + HP +V  +QT +  
Sbjct: 289 ENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILHPLNVQYVQTLQTA 343

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
               + L+ W+EA +Y +  I  Y   Y + HP  G+ Y + GKLE +L   + AIK++ 
Sbjct: 344 FDSSINLQHWEEAESYAKKLINGYLAYYGEIHPSTGILYLSIGKLEVYLKKLKQAIKTLR 403

Query: 416 EAVEILRITHGT 427
           +A  IL ITHG 
Sbjct: 404 KASLILTITHGA 415


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
           harrisii]
          Length = 428

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 131/448 (29%), Positives = 218/448 (48%), Gaps = 62/448 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   EP        S    C+ C      L +CS C+V  YCGS CQK  W+ H+ E
Sbjct: 27  GELLFRSEPLAYTVCKGSRGVVCERCLCRKEKLLRCSQCKVARYCGSECQKQAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTP-SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ L    K       P S+RL+ K+    KL  ++   S    ++  +E+ + ++S+  
Sbjct: 87  CKCL----KSSEPKFPPDSVRLLGKVVF--KLLRESSCASEKLYSFFDLESNIKNLSEEK 140

Query: 126 EK---------QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 176
           ++         QL L  +I N   L   +      +I E+F+K+ CN  TI N E++ +G
Sbjct: 141 KEGLRHLAMTLQLYLKEEIQNASQLPPAF------DIFESFAKVICNGFTISNGEMQEVG 194

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            GLYP +S++NHSC PN V+VFEG    +RA++++P G E+ I Y++    T  RQK LK
Sbjct: 195 VGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAIRNIPLGEELTICYLDVLMPTAERQKQLK 254

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 296
           EQY F C CP C    +  D     +L G         G L++  D           ++S
Sbjct: 255 EQYCFDCDCPLCKTQSKDAD-----MLAGEEQAWKEIQGSLIKIED-----------LQS 298

Query: 297 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
           +E+ +++ +    L      + +CGN     + Y+                 ++  E  +
Sbjct: 299 QEKWEQVLAMCQTL------INNCGNRLPDRNIYQ-----------------LKMLECAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L  W++AL Y   T+  Y+  YP FHP+ G+Q    GKL+   G    A++++ +
Sbjct: 336 DACINLSLWEDALLYGSRTLEPYRLYYPGFHPVRGVQVMKVGKLQQHQGLYPQALETLKQ 395

Query: 417 AVEILRITHGTNSPFMKELILKLEEAQA 444
           A E++++THG +   +++L+L L + +A
Sbjct: 396 AFELIKVTHGRDHSLIEDLMLLLGDCEA 423


>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
          Length = 455

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 134/459 (29%), Positives = 214/459 (46%), Gaps = 29/459 (6%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
            H  G++I+ ++P+  V        RCD CF    + KCS C  V YC   CQ+  W  H
Sbjct: 5   LHKRGDLILQEQPFAYVLLPQFRHERCDRCFKLGKVLKCSGCLYVRYCNRACQQEAWPDH 64

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMS 122
           + EC+ L  L       V PS  LML   +RR  +  +      TD  Y     L+ H  
Sbjct: 65  QAECEKLKLLPA---TLVVPSAALMLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEE 121

Query: 123 DI--DEKQLLLYAQIANLVNLILQWPEISI-NEIAENFSKLACNAHTICNSELRPLGTGL 179
           DI  D K++  +  +  ++  +L  P I   +E+   F K+  N+  +C+ E+  +GTG+
Sbjct: 122 DIRKDVKRIEHFHTLNVVLQRLLDEPAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGM 181

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE-----VLISYIETAGSTMTRQKA 234
           Y   SI++HSC PNAV  F G    +R ++    G E     + ISYI+    + TR++ 
Sbjct: 182 YLGASILDHSCRPNAVATFVGEQLQLRLLEDF-AGPELDFSRIFISYIDLIDPSDTRREQ 240

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           L E+Y F C C RC       D  E  ++    C++  C    +RD       C+     
Sbjct: 241 LSERYYFRCECVRC------RDEAERELMGAAACQNRKCDE-PIRDGQTLCSACE----A 289

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
              +  +    EV   ++  LA      + +V    ++  + Q  + HP +   ++T + 
Sbjct: 290 PFDQSARDRFDEVTSFTRDRLAEMKDVAYFDVC---RLCLEKQSGVLHPLNAQHIKTLDY 346

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
                ++LE W+ AL Y    +  Y+R  P  +PLLGL     GK++ +LGD + A+ S+
Sbjct: 347 AFDSAIKLEKWEAALRYGAGAVAGYRRYSPS-NPLLGLMLANIGKIQLYLGDAKTALSSL 405

Query: 415 TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
            EA +ILR+THG      K+ ++ L    A   Y+LS +
Sbjct: 406 HEADKILRVTHGEQHDLYKDQLVPL-LCDAAQQYELSQR 443


>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
           mellifera]
          Length = 440

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 123/433 (28%), Positives = 215/433 (49%), Gaps = 32/433 (7%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +F   G  + + +P+  V  +     RCD CF S  L +CS C+ ++YC  +CQ++ W +
Sbjct: 6   NFIKKGTTLFTAKPFAYVLYSKYRNERCDYCFKSGKLFRCSVCKCIYYCNQSCQQMSWTI 65

Query: 63  HRLECQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           H  EC  L R   +    V P + RLM ++ ++        I   +   Y   + L++H 
Sbjct: 66  HSKECASLKRFSSK----VIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHY 121

Query: 122 SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 176
           SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+  I + ++  +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 234
            G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+   +T  R++ 
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
           L+  Y F C C +C         +ES  ++E   C +  C+     D+D     C+ C  
Sbjct: 240 LQSSYYFWCNCKKC---------EESEPMVEAAACPNKFCTYPCSLDAD----MCENCN- 285

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
            +  E  K+   E++ L+   L       + +V +   M  K Q+ + HP ++  +QT +
Sbjct: 286 TKFPENFKETFYEISDLTAYHLQNMKNIAYLDVST---MCLKKQEGVLHPLNIQHVQTLQ 342

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
                 + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ +L   + AI++
Sbjct: 343 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKKLKQAIET 402

Query: 414 MTEAVEILRITHG 426
           + +A  IL ITHG
Sbjct: 403 LRKASAILTITHG 415


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
           magnipapillata]
          Length = 1037

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 129/440 (29%), Positives = 215/440 (48%), Gaps = 24/440 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSIS-RCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G  I+ +EP  C+  NS     RCD C++    L KCS C+ + YCG  CQ  DWK+H+ 
Sbjct: 28  GVEILKEEPLACILTNSKYRGIRCDYCYSEPEKLLKCSKCKFIAYCGKVCQASDWKMHKY 87

Query: 66  ECQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           EC+ L++   ++       + +L+   Y  +K               +L+  L A+  +I
Sbjct: 88  ECKCLTKSAPKQPPDFCRLVSQLIFNFYYNKK--------------NTLINNLYANKGNI 133

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPV 182
              +   +   A ++   LQ   I+IN+I       K +CN+  I N+EL  LGTG++  
Sbjct: 134 SNARKEAFFTFAAVLVEYLQDVNININDIDIYGLMCKASCNSFAITNAELNSLGTGIFSS 193

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
            S+ NHSC PN V  F GR   +RA++ + +G E+++SYI    ++  RQ  L+E Y+FT
Sbjct: 194 ASLFNHSCDPNCVATFNGRDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFT 253

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C +    D + +S      +C    C  FL+   +    +CQ+     S + I K
Sbjct: 254 CKCTVCSRKEVNDSLMKSVKCSQPQC---LCMKFLITAPESNKCSCQK-NCEASNDYILK 309

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
               ++ L     +++     ++  S  ++I   ++ L  P ++ L+Q  E  +   +E 
Sbjct: 310 ANDCMDKLQALYNSISVVPTIEQQKSLTQLIRCGEEILCSP-NIALLQCYEVAMDGCIES 368

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
            DWK A  Y       Y+    ++HP LGL Y+  GKL   + D  N I  + +  ++L 
Sbjct: 369 GDWKGAFQYGIKLECSYKNYLSEYHPTLGLHYFKLGKLALQIEDLRNGITYLEKGYKVLS 428

Query: 423 ITHGTNSPFMKELILKLEEA 442
           ITHG++  F+++L   L+EA
Sbjct: 429 ITHGSSCHFVQKLKSYLDEA 448


>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
           rotundata]
          Length = 438

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 118/433 (27%), Positives = 209/433 (48%), Gaps = 26/433 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  I + +P+  V ++    +RCD C  S  L +CSACQ V+YC  +CQ+  W  H  EC
Sbjct: 11  GTTIFTAKPFAYVLSSKYRTNRCDYCLTSGKLFRCSACQYVYYCNRSCQQGSWPTHNTEC 70

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--D 125
             L R+     K V    RLM ++ ++     D+ +   T   +   + L++H SDI  D
Sbjct: 71  VNLKRVSP---KVVPDMARLMARIIIKLSQGGDDEVEYYTKTKFRRFKDLMSHYSDIKKD 127

Query: 126 EKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK++  +  +   L   +   P  +  E+   + ++  N+  I + ++  +G G+Y   S
Sbjct: 128 EKKMEHFMFLCGVLFGFLGDTPMPNSAELMGIYGRICINSFNIFDLDMNSIGVGIYLAPS 187

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +++HSC+PNAV  FEG    +R ++ +P    +++ ISYI+   +T  R+  L+  Y F 
Sbjct: 188 VVDHSCVPNAVATFEGITLNIRTIEDLPSLDWSQIRISYIDVLKTTKERRSELQSSYYFW 247

Query: 243 CTCPRCIKLGQFDDIQESAIL-EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
           C C +C         +E  ++ E   C +  C+     D +    +C +C  ++  E  K
Sbjct: 248 CNCKKC---------EEPELMAEAAACSNKNCTNPCSPDME----SCPECN-IKLLENFK 293

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
           +I  EV+  +   L       +   +   KM  K Q+ + H  ++  +QT +      + 
Sbjct: 294 EIFDEVSSFTAHHLQNMK---NMAYLDVSKMCLKKQEGVLHSLNIQHVQTLQTAFDSSIS 350

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           L+ W+EA  Y +  I  Y   Y + HPL G+     GK++ +L   + A++++ +A  IL
Sbjct: 351 LQHWEEAEFYAKKLIKGYLVYYGEVHPLTGILLLMTGKIQLYLEKPKQALEALRKANSIL 410

Query: 422 RITHGTNSPFMKE 434
            ITHG     ++E
Sbjct: 411 AITHGEQHTLVRE 423


>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
           latipes]
          Length = 495

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 130/421 (30%), Positives = 208/421 (49%), Gaps = 61/421 (14%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVL-SRLDKEKRKSVTPSIRLMLKLYLRRK 96
            L +CS C++  YC S CQK  W  H+ EC+ L S L +    SV  + RL+  L     
Sbjct: 115 TLLRCSQCKMARYCNSACQKRAWPGHKKECRCLCSLLPRHPTDSVRLAARLIFTLL---- 170

Query: 97  LQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL--QWPEI------ 148
               N + S+  + YS+ E   +H+  + E++    +++A ++ L +  + P++      
Sbjct: 171 ----NPLQSSKEELYSIDEH-ESHLDSMPEQKKEGLSELATMLELYMKTEIPDLKQRMTS 225

Query: 149 ----SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
               S  E     +K+ CN  TI + EL+ +G GLYP +S++NH C PN V+VF G    
Sbjct: 226 ELPLSCREALSIIAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFNGTELQ 285

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 264
           +RAVQ +  G E+ ISYIET   T  RQK L+EQY FTC C RC         Q+  +L 
Sbjct: 286 LRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQRC-----SSKEQDGVMLS 340

Query: 265 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 324
           G                       ++    + KE + ++   + + S     L SC N  
Sbjct: 341 G-----------------------REAAWRQLKEALPRLEG-LQLESNWLELLESCSN-- 374

Query: 325 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 384
             + T    E   + LY       ++  +  +   + L  W+EAL++   T+P Y+  Y 
Sbjct: 375 --LLTSAGEEVPDENLYK------LKITDMALDASVHLGRWEEALSFGLKTLPSYRLYYA 426

Query: 385 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 444
             HP+ G+Q    GKL+  LG  E+A++S  EA +IL++THG + P + +L++K+EE ++
Sbjct: 427 DPHPVHGVQLMRVGKLQHHLGCIEDALESFKEAFKILKLTHGEDHPLIVDLMMKMEECRS 486

Query: 445 E 445
           E
Sbjct: 487 E 487


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 127/446 (28%), Positives = 208/446 (46%), Gaps = 52/446 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GE++ + EP     +N+ +   C  CF+    L +CS C+V  YC   CQK  W  H+ 
Sbjct: 25  AGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLLRCSQCKVARYCDVTCQKQAWSDHKR 84

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC+ L  L     +  T S+RL  ++  R       + PS T+     +E   +H+ D+ 
Sbjct: 85  ECKCLRSLHP---RIPTDSVRLAARIIFRL------LSPSQTSPGLCSLEEHESHLCDMG 135

Query: 126 EKQLLLYAQIANLVNLILQW--PEISINEIAEN----FSKLACNAHTICNSELRPLGTGL 179
           E++    +Q+++++ L +Q   P I+    A +     +KL CN  TI + ELR +G GL
Sbjct: 136 EERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLLAKLTCNCFTISDGELREIGVGL 195

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           YP +S++NH C P+ V++FEG    +RAV+ +    E+ ISYI T   T  R+  L+EQY
Sbjct: 196 YPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEELTISYIGTLAPTRDRRTQLEEQY 255

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            FTC C RC      D +      + +    +            + +  Q C      E 
Sbjct: 256 HFTCQCQRCTT-ADMDPVMLCGEEKAWTPLREAIPTL-------ENWQTQACW----NEL 303

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           + + +S ++ L     A+  C                        +V  ++  +  +   
Sbjct: 304 LYECSSLLDPLDGDVPAVPDC------------------------NVYRLRVTDLALDAC 339

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + L  W  ALAY  +T+  Y++ YP  HP   +Q     KL+ FL    +A  ++  A +
Sbjct: 340 INLAAWDTALAYGTITLEPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLGDAQHTLRLAYD 399

Query: 420 ILRITHGTNSPFMKELILKLEEAQAE 445
           I+++THG   P   +LI KLEE +AE
Sbjct: 400 IMKVTHGNQHPLTSDLIRKLEECRAE 425


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
           gallopavo]
          Length = 427

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 135/448 (30%), Positives = 217/448 (48%), Gaps = 63/448 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   EP+  V         C+ CF  + +L +CS C+V  YCG +CQK  W  H+ E
Sbjct: 27  GELLYRAEPFAYVVTKEQLGGVCEQCFQRNEHLHRCSQCKVAKYCGKSCQKEGWLDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD--- 123
           C+ L  +      SV  + R++ KL  +    ++ +   + +D  S  E L   M +   
Sbjct: 87  CKCLRNVKPNFPXSVRLAGRIVFKLLRQSACLSEKLY--SFSDLQSNAEQLSEEMKEGLR 144

Query: 124 --IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
                 QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLYP
Sbjct: 145 HLAHTLQLYLRAEIQDASHLS---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYP 198

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            +S++NHSC PN V++FEG   ++R+V+ +  G E+ +SYIE+   T  RQ+ LK QY F
Sbjct: 199 SMSLLNHSCDPNCVIIFEGYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCF 258

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
            C C  C+   Q  D ++ A  E              ++  D      +    +SKEE +
Sbjct: 259 ECDC--CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWE 302

Query: 302 KIASEV-NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K+ ++  ++LS  T  L                             N+ Q   KL+   M
Sbjct: 303 KVLAKCQHLLSSHTSRLP--------------------------DTNIYQL--KLLDCAM 334

Query: 361 E----LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           +    LE W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A++++ +
Sbjct: 335 DACINLEAWEQALCYGSRTLGPYRLYYPNFHPLRAVQLMRVGKLQYSQGMLPQALETLKQ 394

Query: 417 AVEILRITHGTNSPFMKELILKLEEAQA 444
           A +I+++THGT+   M+ L+   E+ +A
Sbjct: 395 AYDIMKVTHGTDHSLMQALMDLKEQCEA 422


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 130/442 (29%), Positives = 217/442 (49%), Gaps = 45/442 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G  +  + P+  V +     S C  C    S L KCS C+++ YC  +CQK DW+ H+ E
Sbjct: 32  GTSVGKENPFCHVVSQDMLSSYCHSCLLMQSELYKCSRCKIIMYCCKSCQKEDWQWHKYE 91

Query: 67  CQVLSRLDKEKRKSVTP-SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ ++RL  +    V P SIRL+ ++        D        D +  +         + 
Sbjct: 92  CKSITRLGPK----VPPDSIRLLGRVAYTILQGQDR------ADQFKFL---------LS 132

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENF-------SKLACNAHTICNSELRPLGTG 178
            ++LL  ++   +V+ I    E   N++A N        S++ CN  TICNSE++ +G G
Sbjct: 133 NRELLEGSRKNTIVDGINLLKEYLSNKVAINENEIIEIISRVTCNTFTICNSEMQTVGIG 192

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           +YP +S++NHSC PN    F G+   +R +++   G E+LISYI+      +RQ  L+ Q
Sbjct: 193 VYPGLSLVNHSCSPNCSATFRGKQMQLRIIENTKIGDELLISYIDPMQVLSSRQNQLQSQ 252

Query: 239 YLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 296
           Y F C C RCI     D  ++S   +++  RC    C      DS      C +CG +  
Sbjct: 253 YCFKCICERCI-----DTTKDSCNNLMDSVRCPKKICKAASSLDSLLANKLCPECGSIVD 307

Query: 297 KE---EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
           +    EI+   +++N    KT++L   G  Q  +   K +    K+     ++  ++  E
Sbjct: 308 QSFFAEIENFQAQIN----KTISL---GYQQAKLEDLKKLFIEGKQRLGECNMLYIRIIE 360

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
            L+   +E + ++EAL Y +     Y R+YP+++P+ G+     GKLE +LG    A+KS
Sbjct: 361 NLMDAYIESQRYEEALEYARRLEEPYCRLYPRYYPVTGVHLMKQGKLECYLGKFGEAVKS 420

Query: 414 MTEAVEILRITHGTNSPFMKEL 435
           + +A EIL ++HG     + E+
Sbjct: 421 LGKAKEILLVSHGKECGLIHEM 442


>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 434

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/450 (28%), Positives = 218/450 (48%), Gaps = 60/450 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   EP     +   S + C  CF     L +CS C++  YC + CQ+  W  H+ E
Sbjct: 26  GELVYVTEPLAYCVSQKQSRNVCHQCFTRHETLLRCSQCKMARYCSATCQRRAWSDHKRE 85

Query: 67  CQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ L S L +    SV  + RL+  +       ++ +        Y+L E   +H++ + 
Sbjct: 86  CKCLQSLLPRIPTDSVRLAARLIFAMLSSCSSSSEEL--------YTL-EEHESHLTSLS 136

Query: 126 EKQLLLYAQIANLVNLIL--QWPEI--------SINEIAENFSKLACNAHTICNSELRPL 175
           E++    +Q+A ++ L L  + P++        S  +     +K+ CN  TI + EL+ +
Sbjct: 137 EQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSCRDALSLIAKVTCNCFTISDGELQEI 196

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G GLYP +S++NH C PN V+VFEG   ++RAV+ +  G E+ ISYIET      RQ+ L
Sbjct: 197 GVGLYPSLSLLNHDCRPNCVMVFEGTKLLLRAVRGLSPGEELTISYIETLSLNEDRQQRL 256

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           ++QY FTC C +C      D +  S     +R         +L+++            + 
Sbjct: 257 EDQYCFTCHC-QCCNSPDNDKLMLSGEESSWR---------VLKEA------------LP 294

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           + EE+K  ++   ++      L++ G      + YK                 ++  +  
Sbjct: 295 TLEELKAKSNWQTLVESCHRLLSTEGKRVPEENLYK-----------------LRVTDMA 337

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
           +   ++L  W EA+ Y Q T+PVY   YP  HP+ G+Q    GKL+ +L   E A+ +  
Sbjct: 338 LDASIQLGLWTEAVEYGQKTLPVYHLYYPDPHPVHGVQLVRVGKLQHYLAHIEEALDTFK 397

Query: 416 EAVEILRITHGTNSPFMKELILKLEEAQAE 445
           +A  IL++THG ++    +L++K+EE +AE
Sbjct: 398 QAYRILKVTHGNDNSITTDLLMKMEECRAE 427


>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
 gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
          Length = 439

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 125/453 (27%), Positives = 225/453 (49%), Gaps = 35/453 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ ++KCS C+ V YC  +CQ   W  H+ EC
Sbjct: 9   GQRILTEKPFAFVLKSQYRLERCDNCLEATKVRKCSNCRYVSYCNRSCQTQAWAQHKHEC 68

Query: 68  QVLSRLDKEKRKSVTPSI-----RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
             L        KS+ P I     R++ +L LR +   D +    T         L++H +
Sbjct: 69  PFL--------KSIHPRIVPDAARMLCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYA 120

Query: 123 DI--DEKQL-LLYAQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGT 177
           +I  D K+L  L +  A L +++   P    N  E+   + +L  N   + ++E+  + T
Sbjct: 121 EIKNDPKRLEHLDSLHAVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIAT 180

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKAL 235
            +Y  +SI +HSC PNAV  FEG    + A++ +P    +++ ISYI+   +   R++ L
Sbjct: 181 AIYLGVSITDHSCQPNAVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDL 240

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           K+ Y F C C +C       D +E+  +    C +  CS  +  + ++    C++C +  
Sbjct: 241 KDHYYFLCVCSKC------RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGI 290

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           S + ++   +E+  L+K  L       + +V    K+    Q  L HP +V  ++T +  
Sbjct: 291 SPK-LRNAYNEIMALTKHNLEAMKDVAYLDVC---KVCLDKQAGLMHPLNVWHVKTLDAA 346

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +  +E+  W+EAL Y Q  +P +Q  + Q++PL+GL +   GK++ F    + A   + 
Sbjct: 347 FEAAIEVGKWEEALDYGQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLE 406

Query: 416 EAVEILRITHGTNSPFMKELILKL-EEAQAEAS 447
           EA  IL +THG +   + E +  L  +A+ EA+
Sbjct: 407 EAQRILSVTHGRDHRLLVEQLYPLIFQARQEAN 439


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
          Length = 428

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 139/454 (30%), Positives = 220/454 (48%), Gaps = 64/454 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   EP+  V         C+ C   + +L +CS C+V  YCG +CQK  W  H+ E
Sbjct: 27  GELLYRAEPFAYVVTKEQLGGVCEQCLQRNEHLHRCSQCKVAKYCGKSCQKEAWLDHKRE 86

Query: 67  CQVLSRLDKE-KRKSVTPSIRLMLKLY-----LRRKLQNDNVIPSTTTDNYSLVEALVAH 120
           C+ L  +       SV  + R++ KL      L  +L +   + S        ++  + H
Sbjct: 87  CKCLQNVKPNFPPDSVRLAGRIVFKLLRQSACLSERLYSFKDLQSNAEQLSEEMKEGLGH 146

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
           ++     QL L A+I +  +L    P I   +I   F+K+ CN  TI N E++ +G GLY
Sbjct: 147 LAHT--LQLYLRAEIQDASHLP---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN V++FEG   ++R+++ +  G E+ ISYIE+   T  RQK LK QY 
Sbjct: 199 PSMSLLNHSCDPNCVIIFEGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C  C+   Q  D ++ A  E              ++  D      +    +SKEE 
Sbjct: 259 FECDC--CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEW 302

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           +K      +L+K    L+S  +H    + Y++                     KL+   M
Sbjct: 303 EK------VLAKCQHLLSSHTSHLPDTNIYQL---------------------KLLDCAM 335

Query: 361 E----LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           +    LE W++AL Y   T+  Y+  YP FHPL  +Q    GKL++  G    A+ ++ +
Sbjct: 336 DACINLEAWEQALCYGSRTLGPYRLYYPDFHPLRAVQLMRVGKLQYSQGMLPQALGTLKQ 395

Query: 417 AVEILRITHGTNSPFMKELILKLEEAQAEASYKL 450
           A +I+++THGT+   M+ L + L+E Q EA  KL
Sbjct: 396 AYDIMKVTHGTDHSLMQAL-MDLKE-QCEAMMKL 427


>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
 gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
 gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
 gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
          Length = 468

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 119/449 (26%), Positives = 221/449 (49%), Gaps = 29/449 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ + KCS C+ V YC  +CQ   W  H+ EC
Sbjct: 38  GQRILTEKPFAFVLKSQYRLERCDNCLEATKVLKCSNCRYVSYCHRSCQMQAWGQHKHEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L ++     + V  + R++ +L LR +   D +    T         L++H ++I   
Sbjct: 98  PFLKKVHP---RVVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 154

Query: 128 QLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            + L    +  A L +++ + P    N  E+   + +L  N   I ++E+  + T +Y  
Sbjct: 155 PMRLEHLDSLHAVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLG 214

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y 
Sbjct: 215 VSITDHSCQPNAVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYY 274

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C +C       D +ES  +    C +  C   +  D ++    C +C    S + +
Sbjct: 275 FLCVCSKCT------DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-L 323

Query: 301 KKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
           +   +E   L++  L      N ++V  +   K+    Q  ++HP +V  ++T +   + 
Sbjct: 324 RNAFNEAMTLTRHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEA 378

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +E+  W +AL Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A+  + EA 
Sbjct: 379 AIEVGKWSDALDYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQ 438

Query: 419 EILRITHGTNSPFMKE-LILKLEEAQAEA 446
            IL +THG +   + E L + + +A+ EA
Sbjct: 439 RILTVTHGRDHRLLTEQLYMLVLQARQEA 467


>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
          Length = 425

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 132/453 (29%), Positives = 215/453 (47%), Gaps = 47/453 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK---CSACQVVWYCGSNCQKLDWKLHR 64
           G  I++  P+V V  +    +RCD CF S  L K   CS+CQ V+YC  NCQK  W++H+
Sbjct: 4   GTTILTGRPFVFVLRSKYRTTRCDNCFKSEQLVKLLKCSSCQYVYYCNRNCQKQSWRIHK 63

Query: 65  LECQVLSRLDKEKRKSVTP-SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            EC  L ++       + P S+RL+ ++ ++ K    + +   T +NY     L++H SD
Sbjct: 64  RECVCLKKI----LPVILPDSVRLVARIIIKLKQGGADEMDYYTKENYRKFTDLMSHYSD 119

Query: 124 I--DEKQL----LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGT 177
           +  D K +     LY  +A  ++  L  P  SI E+   + K++ N   I +  L  +G 
Sbjct: 120 MRADAKHMEPFIKLYRFLAEFLDETLM-P--SIAELISIYGKMSTNRLNIQDEWLNVIGF 176

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           G+Y   S++ HSC PNAV  FEG   ++           + ISY++   S   R++ L  
Sbjct: 177 GIYLGASVVGHSCKPNAVATFEGTTIII-----------IKISYVDLIKSKKDRREELYS 225

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL---V 294
            Y F C C RC K        E ++ E   C +  C      D+D     C++C     V
Sbjct: 226 LYHFWCDCERCEK--------EESLTEAAACPNSSCDSPCSIDADK----CEKCNTRISV 273

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
           + KE  +++    +   +  +++    N  +  S  K I + QK + H F+V+L+ T  K
Sbjct: 274 KFKETFQEVVDLTDYHFEHIISILF-FNDLDTFSWIKEILEKQKNVMHKFNVHLICTLRK 332

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
           +      L+ WK+   Y +     Y   Y + +P +G  Y T G     LG  + A K++
Sbjct: 333 IHCAAWNLKCWKDVEFYGKKLEAGYLFYYGKVYPPIGTLYLTLGIALLHLGKPKEAFKTL 392

Query: 415 TEAVEILRITHGTNSPFMKELILKLEEAQAEAS 447
            +A  +L+ITHG     +KE+   LE  + +AS
Sbjct: 393 NKAYTVLKITHGNKHFLVKEI---LEPLRFDAS 422


>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
 gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
          Length = 466

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 120/449 (26%), Positives = 219/449 (48%), Gaps = 29/449 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ + KCS C+ V YC  +CQ   W  H+ EC
Sbjct: 36  GQRILTEKPFAFVLKSQYRLERCDNCLEATKVLKCSNCRYVSYCHRSCQMQAWAQHKHEC 95

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L ++     + V  + R++ +L LR +   D +    T         L++H ++I   
Sbjct: 96  PFLKKVHP---RVVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 152

Query: 128 QLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            + L    +  A L +++ + P    N  E+   + +L  N   I ++E+  + T +Y  
Sbjct: 153 PMRLEHLDSLHAVLSDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLG 212

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y 
Sbjct: 213 VSITDHSCQPNAVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYY 272

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C +C       D +ES  +    C +  C   +  D  +    C +C    S + +
Sbjct: 273 FLCVCSKCT------DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISPK-L 321

Query: 301 KKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
           +   +E   L++  L      N ++V  +   K+    Q  ++HP +V  ++T +   + 
Sbjct: 322 RNAFNEAMTLTRHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEA 376

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +E+  W +AL Y Q  +P +++ +  ++PLLGL +   GK++ + G  + A+  + EA 
Sbjct: 377 AIEVGKWSDALDYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHAKEALHHLEEAQ 436

Query: 419 EILRITHGTNSPFMKE-LILKLEEAQAEA 446
            IL +THG +   + E L L + +A+ EA
Sbjct: 437 RILTVTHGRDHRLLTEQLYLLVLQARQEA 465


>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3-like [Apis florea]
          Length = 414

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 120/433 (27%), Positives = 209/433 (48%), Gaps = 58/433 (13%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +F   G  + + +P+  V  +     RCD CF S  L +CS C+ ++YC  +CQ++ W +
Sbjct: 6   NFIKKGTTLFTAKPFAYVLYSKYRNERCDYCFKSGKLFRCSVCKCIYYCNQSCQQMSWTI 65

Query: 63  HRLECQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           H  EC  L R   +    V P + RLM ++ ++        I   +   Y   + L++H 
Sbjct: 66  HSKECASLKRFSSK----VIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHY 121

Query: 122 SDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLG 176
           SDI  DEK++  +  +  ++   L   ++SI   AE    + ++  N+  I + ++  +G
Sbjct: 122 SDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSFNISDLDMNNIG 179

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 234
            G+Y   SI++HSC PNAV  FEG   ++R  + +P    +++ ISYI+   +T  R++ 
Sbjct: 180 AGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYIDVIKTTKDRREE 239

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
           L+  Y F C C +C         +ES  ++E   C +                       
Sbjct: 240 LQNSYYFWCNCKKC---------EESEPMVEAAACPN----------------------- 267

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
            R+  EI  + +  ++ + K +A          VST  M  K Q+ + HP ++  +QT +
Sbjct: 268 -RTFYEISDLTA-YHLQNMKNIAYLD-------VST--MCLKKQEGVLHPLNIQHVQTLQ 316

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
                 + L+ W+EA +Y +  I  Y   Y +FHP  G+ Y + GKL+ +L + + AI++
Sbjct: 317 SAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQVYLKNXKQAIET 376

Query: 414 MTEAVEILRITHG 426
           + +A  IL ITHG
Sbjct: 377 LRKASAILTITHG 389


>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
 gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
          Length = 469

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 118/451 (26%), Positives = 221/451 (49%), Gaps = 29/451 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ + KCS C+ V YC  +CQ   W  H+ EC
Sbjct: 38  GQRILTEKPFAFVLKSQYRLERCDNCLEATKVLKCSNCRYVSYCHRSCQMQAWAQHKHEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L ++     + V  + R++ +L LR +   D +    T         L++H ++I   
Sbjct: 98  PFLKKVHP---RVVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 154

Query: 128 QLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            + L    +  A L +++   P    N  E+   + +L  N   I ++E+  + T +Y  
Sbjct: 155 PMRLEHLDSLHAVLTDMMADSPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLG 214

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y 
Sbjct: 215 VSITDHSCQPNAVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYY 274

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C +C       D +ES  +    C +  C   +  D  +    C +C    S + +
Sbjct: 275 FLCVCSKCT------DAKESKEMLSALCPNRNCGVGITVDRTN----CPRCDAGISPK-L 323

Query: 301 KKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
           +   ++   L++  L      N ++V  +   K+    Q  ++HP +V  ++T +   + 
Sbjct: 324 RNAFNDAMTLTRHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEA 378

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +E+  W +AL Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A+  + EA 
Sbjct: 379 AIEVGKWTDALDYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGRSKEALHHLEEAQ 438

Query: 419 EILRITHGTNSPFMKE-LILKLEEAQAEASY 448
            IL +THG +   + E L + + +A+ EA++
Sbjct: 439 RILTVTHGRDHRLLTEQLYVLILQARQEAAH 469


>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
          Length = 436

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 119/437 (27%), Positives = 204/437 (46%), Gaps = 37/437 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V   S   S CDGCFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 33  GELLFSCPAYTAVLTVSERGSHCDGCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 92

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 93  CAAMCAFGQNWNPSET--VRLTARILAKQKIHPER----TQSEKLLAVKEFESHLDKLDN 146

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+   IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 147 EKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 206

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 207 LMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCD 266

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C    +  D ++  I    R  +D  S   +RD            ++      K   
Sbjct: 267 CRECTM--KEKDKEKLKI----RKLNDPPSAEAVRD-----MIKYARNVIEEFRRAKHYK 315

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
           S   +L    L+L   G                  ++   +V ++    + + + + ++D
Sbjct: 316 SPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLYVQD 358

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + 
Sbjct: 359 WEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEVA 418

Query: 425 HGTNSPFMKELILKLEE 441
           HG + P++ E+  +LE+
Sbjct: 419 HGKDHPYISEIKKELED 435


>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
 gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
          Length = 461

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 123/441 (27%), Positives = 209/441 (47%), Gaps = 28/441 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +     RCD C     ++KCS C+ V+YC   CQ   W +H+ EC
Sbjct: 25  GQRILTEKPFAFVLKSKYRQQRCDNCLEEGKVRKCSNCRYVYYCNRACQTQAWLMHKHEC 84

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--D 125
             L R      + V  + R++ +L LR +   D +    T         L++H ++I  D
Sbjct: 85  PFLKR---SYPRIVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 141

Query: 126 EKQL-LLYAQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            K+L  L +  A L ++++  P    N  E+   + +L  N   I + E+  + T +Y  
Sbjct: 142 PKRLEHLESLHAVLTDMMVDSPSTVPNKTELMSIYGRLITNGFNILDGEMNSIATAIYLG 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    + A++ +P    +++ ISYI+   +   R+  LKE Y 
Sbjct: 202 VSITDHSCQPNAVATFEGNELHIYAIEDMPCLDWSKIFISYIDLLNTPEQRRLDLKEHYY 261

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC--GLVRSKE 298
           F C C +C       D QE   +E   C    C   +    + K   C+ C  G+     
Sbjct: 262 FLCVCSKCT------DRQEQREMEAAVCPFRKCGSSI----NVKWTHCRNCQAGIT---P 308

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
           +++   +EV  ++K  L       + +V    K+    Q  + HP +V  ++T +   + 
Sbjct: 309 KLRNTYNEVMAMTKHHLESMKDVAYLDVC---KVCLDKQTGVLHPLNVWHVKTLDAAFEA 365

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +E+  W EAL Y Q  +  +++    +HPLLGL +   GK++ + G  + A+    EA 
Sbjct: 366 AIEVGKWPEALNYGQQLLQGFRKYNGPWHPLLGLLHMKLGKIQLYEGQCKEAMHHFEEAR 425

Query: 419 EILRITHGTNSPFMKELILKL 439
            IL +THG     + E +L L
Sbjct: 426 RILCVTHGREHRLIGEQLLPL 446


>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
          Length = 429

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 125/454 (27%), Positives = 208/454 (45%), Gaps = 58/454 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           ++GE++ + EP  C  ++      C  CF+    L +CS C+V  YC   CQK  W +H+
Sbjct: 24  LAGELLYTAEPLACCVSSQCRNVVCHSCFSRHETLLRCSQCKVARYCDVTCQKRAWSIHK 83

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC+ L  L     +  T S+RL  ++  R       + PS  T      E   +H+ D+
Sbjct: 84  RECKCLLSLHP---RIPTDSVRLAARIIFRL------LCPSQMTPQLYSFEEHESHLCDM 134

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNAHTICNSELRPLGTG 178
            E++    +Q+++++ L L+  +  I +   +F      +K+ CN  TI ++EL+ +G G
Sbjct: 135 GEEKREGLSQLSSMLQLYLKQEQPDIIQKVPSFDPISLLAKVTCNCFTISDAELQEIGVG 194

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP +S++NH C P+ V++F+G+   +RA++ +    EV ISYI     T  RQ  L EQ
Sbjct: 195 LYPSMSLLNHDCRPSCVMLFQGKTLQLRAIRDIQPTEEVTISYIGVLLPTRERQTQLMEQ 254

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F+C C  C    + D +                                 CG+  +  
Sbjct: 255 YHFSCQCGLC-STAELDPLMF-------------------------------CGVKEAWT 282

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTR--EKL 355
            +K+    + IL          GN +E++     ++  +   +      N+ + R  +  
Sbjct: 283 PMKEAIPRLEILQTN-------GNWEELLQECSSLLAPVGGAVPAVPDSNVYRLRVTDLA 335

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
               + L  W+ ALAY   T+  Y++ YP  HP   +Q     KL+ FL   E+A  ++ 
Sbjct: 336 FDACINLARWETALAYGLKTLGPYRQYYPDPHPAHAIQLMRVAKLQHFLVHLEDAQHTLR 395

Query: 416 EAVEILRITHGTNSPFMKELILKLEEAQAEASYK 449
            A +I++ITHG       +LI KLEE  AE   K
Sbjct: 396 LAYDIMKITHGNEHSLTSDLIRKLEECLAEMDCK 429


>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
          Length = 441

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 118/417 (28%), Positives = 202/417 (48%), Gaps = 40/417 (9%)

Query: 34  FASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYL 93
           + S  L KCS CQ V+YC  NCQK  W +H+ EC  L ++     K +  + RLM ++ L
Sbjct: 33  YCSGKLLKCSNCQYVYYCDRNCQKESWPIHKAECTCLKKVSP---KVLPDAARLMARIIL 89

Query: 94  RRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEI-SI 150
           +        +   T  N+   + L++H SDI  D K++  +  +  +++  L    + +I
Sbjct: 90  KLNQGGAEEVGYYTEKNFRRFKDLMSHYSDIKVDVKRMEHFTMLCGVLSQFLDETFMPNI 149

Query: 151 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
            E+   + ++  N+  I +  +  +G G+Y   S+I+HSC PN ++VFEG   +VR +  
Sbjct: 150 AELMGIYGRICTNSFNILDINMNTIGVGIYLGASVIDHSCKPNVIVVFEGTTIIVRTLTD 209

Query: 211 VPK--------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 262
           +P           ++ ISY++   S   R++ L   Y F C C RC K        E  +
Sbjct: 210 LPSLDWSQASIDKDIRISYVDLLNSNKDRREELHSSYYFWCDCERCKK--------EEPM 261

Query: 263 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIAS-EVNILSK-KTLAL 317
           +E   C +  C      ++D+    C++C     V  KE  +++    ++ L K KT+A 
Sbjct: 262 VEAAACPNLLCDSPCSIEADE----CEKCNKEISVEFKETFREVVDFTIHHLEKMKTMAY 317

Query: 318 TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 377
                    +   K+  K QK + H F++  ++T E      M L+ W++A  Y +  +P
Sbjct: 318 ---------LDVSKICLKKQKGIMHKFNIQHVRTLEMAHIAAMNLKCWEDAEFYGKELVP 368

Query: 378 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 434
            Y   Y + HPL GL Y T GK++  L   + A +++T+A  +L ITHG     ++E
Sbjct: 369 GYLLYYGEIHPLTGLLYLTVGKIQLHLEKPKEAFQALTKANTVLTITHGDKHSIVEE 425


>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
 gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Cofactor Product Adohcy
 gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Methyltransferase Inhibitor
           Sinefungin
 gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 433

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 113/442 (25%), Positives = 205/442 (46%), Gaps = 42/442 (9%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y CV         C+ CFA    L KC  C+  +YC   CQK DW L
Sbjct: 27  FHV-GDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFYCDVECQKEDWPL 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +  L +    S T  +RL  ++  ++K+  +     T ++    V    +H+ 
Sbjct: 86  HKLECSSMVVLGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVREFESHLD 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y FTC C  C                              +D D      ++       E
Sbjct: 258 YFFTCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAE 289

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            I+ +      + ++        +  E++   ++ ++    ++   +V ++    + + +
Sbjct: 290 AIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGV 349

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+
Sbjct: 350 CLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAI 409

Query: 419 EILRITHGTNSPFMKELILKLE 440
            I+ + HG + P++ E+  ++E
Sbjct: 410 AIMEVAHGKDHPYISEIKQEIE 431


>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
          Length = 436

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/437 (25%), Positives = 202/437 (46%), Gaps = 37/437 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V   S   S CDGCFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 33  GELLFSCPAYTAVLTVSERGSHCDGCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 92

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 93  CAAMCAFGQNWNPSET--VRLTARILAKQKIHPER----TQSEKLLAVKEFESHLDKLDN 146

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+   IA L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 147 EKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 206

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAV+ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 207 LMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCD 266

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                             +++ D +    ++     S E ++ + 
Sbjct: 267 CRECT----------------------------MKEKDKEKLKIRKLNDPPSAEAVRDMI 298

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                + ++           E++   ++       ++   +V ++    + + + + ++D
Sbjct: 299 KYARNVIEEFRRAKHYKPPSELLEICELSLDKMGAVFEDSNVYMLHMMYQAMGVCLYVQD 358

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + 
Sbjct: 359 WEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALENRPAGDKALKKAIAIMEVA 418

Query: 425 HGTNSPFMKELILKLEE 441
           HG + P++ E+  +LE+
Sbjct: 419 HGKDHPYISEIKKELED 435


>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
 gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
          Length = 442

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 116/450 (25%), Positives = 223/450 (49%), Gaps = 34/450 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  I  ++P+V V ++      CD C       KCS C  V+YCG  CQK  W +H+ EC
Sbjct: 10  GCTIYKEKPFVYVLSSKLRTEYCDFCLKKGQFMKCSGCHYVYYCGKVCQKDGWSVHKSEC 69

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKL-YLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDI- 124
           + L R+     + +  + R + +L ++ RK    +++ S   +N + + + L++H  +I 
Sbjct: 70  RGLKRV---APRILPDAARFIARLIHILRK--GGDLVKSYYLENCFRMYKDLMSHYPNIK 124

Query: 125 -DEKQLLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            D++++  +  + A L   +      +  E+   + ++  N+  I + EL+ +GTG+Y  
Sbjct: 125 GDQQRMEHFTSLCAVLFEFLGDDSLPNSAELMGMYGRMCINSFNIIDQELQCIGTGMYLG 184

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
            S+I+HSC PNAV +F+G +  +RA+Q       +++ ISYI+   +T  RQ  L+  Y 
Sbjct: 185 ASVIDHSCSPNAVAIFDGPILSIRALQTFQYLDWSQIKISYIDILNTTKDRQSELEAAYY 244

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C CP+C++        E   +    C ++ C   +  +    G  C +C  V S+  +
Sbjct: 245 FLCKCPKCLE-------PEPPEINAAACPNEKCDNHIDTEIITPGDKCAKCDTVVSETFL 297

Query: 301 KKIA-----SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           K+       +++++ + K LA              ++  K Q+ + H F++  ++T +  
Sbjct: 298 KRFKEVIEFTDLHLQNMKQLA---------YFDVCEICLKKQEGVLHKFNIKHVKTLDLA 348

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +  ++ + W  A  +    +  + + Y   HP+ GL +   GK+  F  + + A++ +T
Sbjct: 349 FQSSIDFQKWDFARKFALELVDAFYKYYGHVHPITGLLHLKLGKILLFEENDQLALEHLT 408

Query: 416 EAVEILRITHGTNSPFMK-ELILKLEEAQA 444
           +A +IL+I HG  S   K EL+  L++A+A
Sbjct: 409 KAYQILKIIHGVGSHLFKDELVPLLQQARA 438


>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
          Length = 436

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/440 (26%), Positives = 208/440 (47%), Gaps = 43/440 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE+++S   YV V   S   S CDGCFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 33  GELLLSCPAYVAVLTVSERGSHCDGCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 92

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K   +     T ++    V+   +H+  +D 
Sbjct: 93  CASMCTFGQNWHPSET--VRLTARILAKQKTHPER----TQSEKLLAVKEFESHLDKLDN 146

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+   IA L +     L +P+ +   +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 147 EKRELIQNDIAALHHFYSKHLDYPDNAA--LVVLFAQVNCNGFTIEDEELSHLGSAIFPD 204

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAV+ +  G E+  SYI+    T  R   L++ Y F+
Sbjct: 205 VALMNHSCCPNVIVTYKGTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFS 264

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
           C C  C    +  D ++  I     CK +D  S   ++D            ++      K
Sbjct: 265 CDCRECTT--KEKDKEKLEI-----CKLNDPPSAETVKDMIKYARN-----VIEEFRRAK 312

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
              S   +L    L+L   G                  ++   +V ++    + + + + 
Sbjct: 313 HYKSPSELLEICELSLDKMG-----------------AVFEDSNVYMLHMMYQAMGVCLY 355

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +K++  A+ I+
Sbjct: 356 VQDWEGALCYGQKIIRPYSKHYPSYSLNVASMWLKLGRLYMALKNRTAGVKALKRAIAIM 415

Query: 422 RITHGTNSPFMKELILKLEE 441
            + HG + P++ E+  +LE+
Sbjct: 416 EVAHGKDHPYISEIKKELED 435


>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
 gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
          Length = 433

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 113/442 (25%), Positives = 204/442 (46%), Gaps = 42/442 (9%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y CV         C+ CFA    L KC  C+  +YC   CQK DW L
Sbjct: 27  FHV-GDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFYCDVECQKEDWPL 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V    +H+ 
Sbjct: 86  HKLECSSMVVFGENWNPSET--VRLTARILAKQKMHPER----TPSEKLLAVREFESHLD 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y FTC C  C                              +D D      ++       E
Sbjct: 258 YFFTCECRECT----------------------------TKDKDKAKVEIRKLSNPPQAE 289

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            I+ +      + ++        +  E++   ++ ++    ++   +V ++    + + +
Sbjct: 290 AIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGV 349

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+
Sbjct: 350 CLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAI 409

Query: 419 EILRITHGTNSPFMKELILKLE 440
            I+ I HG + P++ E+  ++E
Sbjct: 410 AIMEIAHGKDHPYISEIKQEIE 431


>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
 gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
          Length = 468

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 118/449 (26%), Positives = 215/449 (47%), Gaps = 29/449 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ + KCS C+ V YC   CQ   W  H+ EC
Sbjct: 38  GQRILTEKPFAFVLKSQYRLERCDNCLEATKVLKCSNCRYVSYCNRACQMQAWAQHKHEC 97

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L ++     + V  + R++ +L LR +   D +    T         L++H ++I   
Sbjct: 98  PFLKKVHP---RVVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 154

Query: 128 QLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            + L    +  A L +++   P    N  E+   + +L  N   + ++++  + T +Y  
Sbjct: 155 PMRLEHLDSLHAVLTDMMADSPSTVPNKTELMSIYGRLITNGFNVLDTDMNSIATAIYLG 214

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    V A++ +     +++ ISYI+   +   R+  LKE Y 
Sbjct: 215 VSITDHSCQPNAVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYY 274

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C +C       D +ES  +    C +  C   +  D  +    C +C    S    
Sbjct: 275 FLCVCSKCT------DAKESKEMLAALCPNRNCGVGISVDRTN----CPRCDAGISP--- 321

Query: 301 KKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            K+ +  N      L L +  N ++V  +   K+    Q  ++HP +V  ++T +   + 
Sbjct: 322 -KLRNAFN--DAMALTLHNLENMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEA 378

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +E+  W +AL Y Q  +P +++ +  ++PLLGL +   GK++ + G  + A+  + EA 
Sbjct: 379 AIEVGKWSDALDYGQRLLPGFRKYHGPWNPLLGLVHMKLGKIQLYTGQCKEALHHLEEAQ 438

Query: 419 EILRITHGTNSPFMKE-LILKLEEAQAEA 446
            IL +THG +   + E L L + +A+ +A
Sbjct: 439 RILTVTHGRDHRLLTEQLYLLILQARQQA 467


>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
          Length = 432

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/442 (24%), Positives = 208/442 (47%), Gaps = 42/442 (9%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +
Sbjct: 27  FHV-GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPM 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+ 
Sbjct: 86  HKLECSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLE 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + +A L +     L++P+   + +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKRDLIQSDVAALHHFYSKHLEFPDT--DSLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y FTC C  C                              +D D      ++       E
Sbjct: 258 YFFTCQCQECT----------------------------TKDKDKAKVEIRKLSDPPKAE 289

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            I+++      + ++        +  E++   ++ ++    ++   +V ++    + + +
Sbjct: 290 TIREMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGSVFEDSNVYMLHMMYQAMGV 349

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L D     K++ +A+
Sbjct: 350 CLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAI 409

Query: 419 EILRITHGTNSPFMKELILKLE 440
            I+ + HG + P++ E+  ++E
Sbjct: 410 AIMEVAHGKDHPYISEIKQEIE 431


>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
          Length = 433

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/436 (25%), Positives = 201/436 (46%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + CD CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCDYCFARKEGLSKCGRCKQAFYCDVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKELIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R  +D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLNDPPKAETVRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 450

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 116/442 (26%), Positives = 209/442 (47%), Gaps = 25/442 (5%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y CV         C+ CFA    L KC  C+  +YC   CQK DW L
Sbjct: 27  FHV-GDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFYCDVECQKEDWPL 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +  L +    S T  +RL  ++  ++K+  +     T ++    V    +H+ 
Sbjct: 86  HKLECSSMVVLGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVREFESHLD 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y FTC C  C         ++ A +E  R          +RD            ++    
Sbjct: 258 YFFTCECRECTTKD-----KDKAKVE-VRKLSSPPQAEAIRD-----MVRYARNVIEEFR 306

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
             K     +++   +  A        E++   ++ ++    ++   +V ++    + + +
Sbjct: 307 RAKHYKYNLSVWPARGTAPCLTDGVCELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGV 366

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+
Sbjct: 367 CLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAI 426

Query: 419 EILRITHGTNSPFMKELILKLE 440
            I+ + HG + P++ E+  ++E
Sbjct: 427 AIMEVAHGKDHPYISEIKQEIE 448


>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
           carolinensis]
          Length = 437

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/437 (26%), Positives = 205/437 (46%), Gaps = 39/437 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   + CV   +   + C+GCFA    L KC  C+  +YC   CQK DW LH+LE
Sbjct: 34  GELLFACPAFTCVLTVNERGNHCEGCFARREGLFKCGKCKQAFYCNVECQKQDWPLHKLE 93

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K Q +       ++    V+   +H+  +D 
Sbjct: 94  CSAMCVFGQNWNPSET--VRLTARILAKQKTQTERC----ESEKLLSVKEFESHLDKLDN 147

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK  L+ + I+ L +   +  E   N  +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 148 EKMELIQSDISALHHFYSKHIEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 207

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R   LK+ Y FTC 
Sbjct: 208 LMNHSCCPNVIVTYKGTVAEVRAVQEIEPGDEIFTSYIDLLYPTEDRNDRLKDSYFFTCD 267

Query: 245 CPRCI-KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 303
           C  CI K+   D ++        R  ++      +RD            ++      K  
Sbjct: 268 CRECITKVKDKDKLE-------IRKLNEPPPAEAVRDMIKYARN-----VIEEFRRAKHY 315

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
            S   +L    L+L   G+                 ++   +V ++    + + + + ++
Sbjct: 316 KSPSELLEICELSLEKMGS-----------------VFAESNVYMLHMMYQAMGVCLYMQ 358

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
           DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +    +K++ +A+ I+ I
Sbjct: 359 DWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRTAGVKALKKAIAIMEI 418

Query: 424 THGTNSPFMKELILKLE 440
            HG + P++ E+  +LE
Sbjct: 419 AHGKDHPYVIEIKKELE 435


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 128/460 (27%), Positives = 210/460 (45%), Gaps = 28/460 (6%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           FH  G +I+ ++P+ CV ++     RCD CF    + KCS C  V YC   CQK  W  H
Sbjct: 5   FHKKGSLILKEKPFACVLHSRYRAERCDKCFKPGKVLKCSNCLYVRYCNRFCQKEAWPDH 64

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ--NDNVIPSTTTDNYSLVEALVAHM 121
           + EC  L  +         P   LM+   + RKLQ   D      T   Y     L+ H 
Sbjct: 65  QDECGKLKAIGTR----TVPDAALMISRII-RKLQKGGDYQKGYYTAKFYRRFHDLMTHE 119

Query: 122 SDI--DEKQLLLYAQ-IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            DI  D  ++  +   +  L +L+ +    S  E+ + F K+  N+  I + E+  +GTG
Sbjct: 120 DDIKNDAPRMEHFQSLLVVLRSLVEEAAMPSKAELLQIFGKMCINSFNILDDEMNSIGTG 179

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV-----LISYIETAGSTMTRQK 233
           +Y  ISII+HSC PNA+  F+G    +R ++    G+EV      ISYI+       R+ 
Sbjct: 180 MYLGISIIDHSCRPNALATFDGTTIHLRLLEDY-HGSEVDFSKIFISYIDLMNPAEERKA 238

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            L+ QY F C C RC       D QE  ++    C +  C    L   D K   C +C  
Sbjct: 239 RLRAQYYFECNCERC------RDEQEQQLMIAGACPNVDCDE-PLSMLDRKVEKCPRCDT 291

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
              K   +++  EV   +   LA      + ++    ++  K Q+ + H ++V  ++T +
Sbjct: 292 A-IKHSAREMFREVTDFTLAQLAKMKDITYMDIC---QLCLKKQENILHYYNVWYLKTLD 347

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
              +  + ++ W++A+ Y       +++    FHPL GL     GK++ +      A+++
Sbjct: 348 LAFESAINMQKWEDAIGYGLRLKDGFKKYNGPFHPLYGLLLLKIGKIQLYTKHVVEALQN 407

Query: 414 MTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 453
           + ++ +ILR+THG      K  ++ L   QA   Y   +K
Sbjct: 408 INDSEKILRVTHGEEHDLYKSQLIPL-LCQAAGEYDALNK 446


>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
 gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
          Length = 420

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 120/453 (26%), Positives = 220/453 (48%), Gaps = 54/453 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ ++KCS C+ V YC  +CQ   W  H+ EC
Sbjct: 9   GQRILTEKPFAFVLKSQYRLERCDNCLEATKVRKCSNCRYVSYCNRSCQTQAWAQHKHEC 68

Query: 68  QVLSRLDKEKRKSVTPSI-----RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
             L        KS+ P I     R++ +L LR +   D +    T         L++H +
Sbjct: 69  PFL--------KSIHPRIVPDAARMLCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYA 120

Query: 123 DI--DEKQL-LLYAQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGT 177
           +I  D K+L  L +  A L +++   P    N  E+   + +L  N   + ++E+  + T
Sbjct: 121 EIKNDRKRLEHLDSLHAVLTDMMADSPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIAT 180

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKAL 235
            +Y  +SI +HSC PNAV  FEG    + A++ +P    +++ ISYI+   +   R++ L
Sbjct: 181 AIYLGVSITDHSCQPNAVATFEGNELHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDL 240

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           K+ Y F C C +C       D +E+  +    C +  CS  +  + ++    C++C +  
Sbjct: 241 KDHYYFLCVCSKC------RDPKEARQMTAAACPNRKCSASINIEWNN----CKRCSVGI 290

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
           S  +++   +E+  L+K  L                  E ++   Y    +++ +T +  
Sbjct: 291 SP-KLRNAYNEIMALTKHNL------------------EAMKDVAY----LDVCKTLDAA 327

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +  +E+  W+EAL Y Q  +P +Q  + Q++PL+GL +   GK++ F    + A   + 
Sbjct: 328 CEAAIEVGKWEEALNYGQQLLPGFQMYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLE 387

Query: 416 EAVEILRITHGTNSPFMKELILKL-EEAQAEAS 447
           EA  IL +THG +   + E +  L  +A+ EA+
Sbjct: 388 EAQRILSVTHGRDHRLLVEQLYPLIFQARQEAN 420


>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
          Length = 433

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 200/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCRAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
          Length = 433

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/436 (26%), Positives = 205/436 (47%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLIVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C         ++ A +E  R   D     ++RD            +VR    + +  
Sbjct: 264 CQECTTKD-----KDKAKVE-IRKLSDPPKAEVIRD------------MVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
           mulatta]
          Length = 433

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 200/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
          Length = 433

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 200/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSRHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
          Length = 433

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 200/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
           boliviensis]
          Length = 433

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 200/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSERLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
 gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
 gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
          Length = 433

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 111/436 (25%), Positives = 201/436 (46%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   S   + C+ CFA    L KC  C+  +YC   CQ+ DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVSERGNHCEFCFARKEGLSKCGRCKQAFYCNVECQREDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R  +D         +VR    + +  
Sbjct: 264 CQECTT------------------KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S           ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+ + 
Sbjct: 356 WEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
 gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
          Length = 448

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 117/442 (26%), Positives = 209/442 (47%), Gaps = 29/442 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  I++++P+  V  +   + RCD C  +  + KCS C+ V YC   CQ+  W LH+ EC
Sbjct: 17  GTRILTEKPFAYVLKSKYRLERCDNCLEAVKVLKCSNCRYVSYCNRACQQQAWPLHKHEC 76

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-- 125
             L R      + V  + R++ +L LR +   D      T         L++H ++I   
Sbjct: 77  PFLKR---ALPRIVPDAARMLCRLILRLEHGGDLERGYYTAHGSRKFRDLMSHYAEIKND 133

Query: 126 -EKQLLLYAQIANLVNLILQWPEISI---NEIAENFSKLACNAHTICNSELRPLGTGLYP 181
            +++  L +  A L  ++      ++    E+   + +L  N   I ++E+  + T +Y 
Sbjct: 134 PKRREHLESLHAVLTEMMTDSSGSTVPNPTELMSIYGRLITNGFNILDAEMNSIATAIYL 193

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQY 239
            +SI +HSC PNAV  FEG    V A++ +P    +++ ISYI+   +   R+  LKE Y
Sbjct: 194 GVSITDHSCQPNAVATFEGNELHVHAIEDLPCLDWSKIYISYIDLLNTPEQRRADLKEHY 253

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F C C +CI      D QE   +    C +  C   +  D       CQ C       E
Sbjct: 254 YFLCVCSKCI------DPQEMHEMTAAVCPNASCDASVNIDL----AKCQSCD-ASVTPE 302

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 357
           ++   +E+  L++ +L      + +EV  +   K+    Q+ + HP +V  ++T +   +
Sbjct: 303 LRSAYNEIMSLTQSSL-----DSMKEVAYLDVCKVCLAKQRGVLHPLNVWHVKTLDAAFE 357

Query: 358 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 417
             + +  W +AL Y Q  +P + + +  ++PLLGL +   GK++ +  + + A+  + EA
Sbjct: 358 AAINVGKWTDALEYGQQLLPGFAKYHGAWNPLLGLLHMKLGKIQLYERNYKQAVHHLQEA 417

Query: 418 VEILRITHGTNSPFMKELILKL 439
             IL +THG +   + E +  L
Sbjct: 418 QRILNVTHGRDHRLLLEQLRPL 439


>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
          Length = 433

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 199/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLIVVFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ I 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEIA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
          Length = 433

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/438 (24%), Positives = 202/438 (46%), Gaps = 41/438 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ +   Y  V       S C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFACPAYASVLTVGERGSHCEHCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEMLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L       L++P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKKDLIQSDIAMLHQFYSKHLEFPDH--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ V  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                              RD D      ++       E I+ 
Sbjct: 262 CECQECTT----------------------------RDKDKAKVEIRKLSNPPKAEAIRD 293

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
           +      + ++        +  E++   ++ ++    ++   +V ++    + + + + L
Sbjct: 294 MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSAVFEDSNVYMLHMMYQAMGVCLYL 353

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y R YP +   +   +   G+L   L +     +++ +A+ I+ 
Sbjct: 354 QDWEGALRYGQKIIKPYSRHYPLYSLNVASMWLKLGRLYLGLENRVAGERALRKALAIME 413

Query: 423 ITHGTNSPFMKELILKLE 440
           + HG + P++ E+  ++E
Sbjct: 414 VAHGKDHPYISEIKREIE 431


>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
 gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
 gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
          Length = 433

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 111/436 (25%), Positives = 199/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R   D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + + ++D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYMQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ + 
Sbjct: 356 WEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|414864678|tpg|DAA43235.1| TPA: hypothetical protein ZEAMMB73_073593 [Zea mays]
          Length = 342

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 104/157 (66%), Gaps = 6/157 (3%)

Query: 189 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SC+PNAVL+F+GR   VRA+Q + K  EV ISYIETA  T  R   LK QY FT TCPRC
Sbjct: 182 SCVPNAVLIFDGRTTYVRALQPIDKDEEVSISYIETAAVTKKRNNYLK-QYFFTYTCPRC 240

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
           +K   FDD    A+LEG+RCK+  C GFLL +S  K +TCQ+CG  R  EEI K+ SE+ 
Sbjct: 241 VK--GFDD---DALLEGFRCKNQTCDGFLLPNSGKKAYTCQKCGASRDVEEINKMRSEIL 295

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
            LS K  +  S GN  E  S YK+IE+L++ LYH FS
Sbjct: 296 QLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFS 332


>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
          Length = 536

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 116/441 (26%), Positives = 207/441 (46%), Gaps = 38/441 (8%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK 61
           +F V G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW 
Sbjct: 129 AFQV-GDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFYCNVECQKEDWP 187

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           +H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+
Sbjct: 188 MHKLECSPMVVFGENWNPSET--VRLTARILAKQKIYPER----TPSEKLLAVKEFESHL 241

Query: 122 SDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGL 179
             +D EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ +
Sbjct: 242 DKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAI 301

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y
Sbjct: 302 FPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKTGEEVFTSYIDLLYPTEDRNDRLRDSY 361

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            FTC C  C         ++ A +E  R   D      +RD            +VR    
Sbjct: 362 FFTCECQECTTKD-----KDKAKVE-IRKLSDPPKAEAIRD------------MVRYARN 403

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           + +           +  L  C   QE +S+          ++   +V ++    + + + 
Sbjct: 404 VIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVC 453

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ 
Sbjct: 454 LYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEYKAAGEKALKKAIA 513

Query: 420 ILRITHGTNSPFMKELILKLE 440
           I+ + HG + P++ E+  ++E
Sbjct: 514 IMEVAHGKDHPYISEIKQEIE 534


>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 433

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 198/436 (45%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEHCFARKEGLSKCGRCKQAFYCDVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++    D     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQTSHPDR----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + I+ L     +  E   NE +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 144 EKKDLIQSDISALHRFYSKHLEFPDNESLVTLFAQVNCNGFTIEDEELSHLGSAIFPDVA 203

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 204 LMNHSCCPNVIVTYKGTLAEVRAVQAISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 263

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C                    KD   +   +R  +D         +VR    + +  
Sbjct: 264 CQECT------------------TKDKDKAKVKIRKLNDPPKADAVRDMVRYARNVIEEF 305

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
                    +  L  C   QE +S+          ++   +V ++    + + + +  +D
Sbjct: 306 RRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYTQD 355

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + 
Sbjct: 356 WEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEVA 415

Query: 425 HGTNSPFMKELILKLE 440
           HG + P++ E+  ++E
Sbjct: 416 HGKDHPYISEIKQEIE 431


>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
 gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
          Length = 459

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 113/448 (25%), Positives = 215/448 (47%), Gaps = 25/448 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ I++++P+  V  +   + RCD C  ++ + KCS C+ V YC   CQ   W  H+ EC
Sbjct: 29  GQRILTEKPFAFVLKSKYRLERCDNCLEATKVLKCSNCRYVSYCNRACQSQAWAQHKHEC 88

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
             L ++     + V  + R++ +L LR +   D +    T         L++H ++I   
Sbjct: 89  PFLKKVHP---RIVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKND 145

Query: 128 QLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            + L    +  A L +++ + P    N  E+   + +L  N   + ++E+  + T +Y  
Sbjct: 146 PMRLEHLDSLHAVLTDMMAESPSTVPNKTELMSIYGRLITNGFNVLDAEMNSIATAIYLG 205

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYL 240
           +SI +HSC PNAV  FEG    + A++ +     +++ ISYI+   +   R+  LKE Y 
Sbjct: 206 VSITDHSCQPNAVATFEGNELHIHAIEDMECLDWSKIFISYIDLLNTPEERRLDLKEHYY 265

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C +CI      D QE+  +    C +  C   +    + +   C++C    S  ++
Sbjct: 266 FLCCCSKCI------DPQETKEMMAALCPNRNCGIGI----NPERTNCKRCDTGISP-KL 314

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           +   + V   ++  L       + +V    K+    Q  + HP +V  ++T +   +  +
Sbjct: 315 RNAYNSVMAFTRTNLEAMKDVAYLDVC---KVCLNKQTGVLHPLNVWHVKTLDAAFEAAI 371

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
           ++  W +AL Y Q  +P Y++ +  ++PLLGL +   GK++      + A+  + +A+ I
Sbjct: 372 DVGKWTDALDYGQRLLPGYRKYHGAWNPLLGLLHMKLGKIQLHEQRAKEALPHLEDALRI 431

Query: 421 LRITHGTNSPFMKELILKLE-EAQAEAS 447
           L +THG +   + E +  L  +A+ EA+
Sbjct: 432 LTVTHGRDHRLLAEQLYPLLIQARHEAN 459


>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
          Length = 433

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 112/438 (25%), Positives = 198/438 (45%), Gaps = 37/438 (8%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           +SG    S   Y    + +   + C+ CF     L KC  C+  +YC   CQK DW +H+
Sbjct: 28  MSGTCCYSSPAYTLRAHVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +
Sbjct: 88  LECSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKL 141

Query: 125 D-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPV 182
           D EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 142 DNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                    KD   +   +R   D         +VR    + +
Sbjct: 262 CECQECTT------------------KDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIE 303

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
                      +  L  C   QE +S+          ++   +V ++    + + + + +
Sbjct: 304 EFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 353

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ 
Sbjct: 354 QDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIME 413

Query: 423 ITHGTNSPFMKELILKLE 440
           I HG + P++ E+  ++E
Sbjct: 414 IAHGKDHPYISEIKQEIE 431


>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
 gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
          Length = 433

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 203/438 (46%), Gaps = 41/438 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CFA    L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEFCFARKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKRDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSXIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                    KD   +   +R  +D         +VR    + +
Sbjct: 262 CECQECTT------------------KDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIE 303

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
                      +  L  C   QE +S           ++   +V ++    + + + + +
Sbjct: 304 EFRRAKHYKSPSELLEICELSQEKMSC----------VFEDSNVYMLHMMYQAMGVCLYM 353

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+ I+ 
Sbjct: 354 QDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALRKAIAIME 413

Query: 423 ITHGTNSPFMKELILKLE 440
           + HG + P++ E+  ++E
Sbjct: 414 VAHGKDHPYISEIKQEIE 431


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 129/448 (28%), Positives = 210/448 (46%), Gaps = 62/448 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS CQV  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLMRCSQCQVAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L +++  + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVIF--KLMEET--PSESEKLYSFYD-LESNIKKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL +  Q  +   +  + Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLAMTFQHFMREEIQDVSQLP--PSFDVFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCLRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QTIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A++++  
Sbjct: 336 DACINLGLLEEALFYATRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMQNLRL 395

Query: 417 AVEILRITHGTNSPFMKELILKLEEAQA 444
           A +I+R+THG      ++LIL LEE  A
Sbjct: 396 AFDIIRVTHGREHSLTEDLILLLEECDA 423


>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
 gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
          Length = 448

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/447 (27%), Positives = 221/447 (49%), Gaps = 34/447 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  I++++P+  V  +   + RCD C  ++ + KCS C+ V YC  +CQ+  W +H+ EC
Sbjct: 12  GTRILTEKPFAYVLKSKYRLERCDNCLEATKVLKCSNCRYVSYCNRSCQQQAWSIHKHEC 71

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--D 125
             L R      + V  + R++ +L LR +   D      T         L++H ++I  D
Sbjct: 72  LFLKR---ALPRIVPDAARMLCRLILRLEHGGDLERGYYTEHGSRKFRDLMSHYAEIKND 128

Query: 126 EKQL-LLYAQIANLVNLILQWPEISI--------NEIAENFSKLACNAHTICNSELRPLG 176
            K++  L +  A L +++ +  + S+        NE+   + +L  N   I ++E+  + 
Sbjct: 129 PKRIEHLESLHAVLTDMMAESGKGSLTGSLVPNMNELMSIYGRLITNGFNILDAEMNSIA 188

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKA 234
           TG+Y  +SI +HSC PNAV  FEG    V A++ +P    +++ ISYI+   +   R+  
Sbjct: 189 TGIYLGVSITDHSCQPNAVATFEGNELHVHAIEDLPCLDWSKIYISYIDLLNTPEQRRAD 248

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           LKE Y F C C +CI      D QE   +    C +  C   +    D     C+ CG  
Sbjct: 249 LKEHYYFLCVCSKCI------DPQEMHEMTAAVCPNGSCDASVNIQLD----KCESCGTP 298

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTR 352
            S E+++   SEV  L++ +L      + +EV  +   K+     + + HP +V  ++T 
Sbjct: 299 VS-EQLRTDYSEVMALTQSSL-----DSMKEVAYLDVCKVCLAKHRGILHPLNVWHVKTL 352

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
           +   +  +++  W EAL Y +  +P + + + +++PLLGL     GK+E +  + + A++
Sbjct: 353 DAAFEASIQVGKWTEALDYGRQLLPGFAKYHGEWNPLLGLLQLKLGKIELYERNYKQALQ 412

Query: 413 SMTEAVEILRITHGTNSPFMKELILKL 439
            + EA  IL +THG +   + E +  L
Sbjct: 413 HLQEAQRILNVTHGRDHRLLLEQLRPL 439


>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
          Length = 432

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/453 (28%), Positives = 205/453 (45%), Gaps = 69/453 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V + +   S CD CF     L +C  C+  +YC ++CQK DW +HRLE
Sbjct: 29  GELLFSCPAYSSVLSVTERGSCCDLCFRRKEGLARCGKCKKAFYCNTSCQKADWAMHRLE 88

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T   RL+ ++  ++K+Q         ++   L+  + +H+ D D 
Sbjct: 89  CSAMVAFGENWCPSETS--RLVARILAKKKMQKGRC----DSEKILLIGEMQSHLEDEDN 142

Query: 126 EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+      IA L     +  E+ S  ++   FS++ACN  TI + EL  +GT +YP ++
Sbjct: 143 EKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIEDDELSHMGTAVYPDVA 202

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           +INHSCLP+ ++ F G LA VRAVQ +  G EVLISYI+    T  R   L+E Y FTC 
Sbjct: 203 LINHSCLPSVIVTFNGTLAQVRAVQDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTCN 262

Query: 245 CPRC-------IKL---GQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGL 293
           C  C       +KL    Q D I+   I    R   +    F  ++ +       + C  
Sbjct: 263 CLECQNKQKDKVKLKVRKQSDPIEPQVISNMTRYAKNTIREFRAMKSTKTPSELLEMC-- 320

Query: 294 VRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---KLYHPFSVNL 348
            +S EE+  +   S V +L     A+  C   Q+     +  EK+ +   K YHP+S+N+
Sbjct: 321 EQSLEEMGAVFEDSNVYVLHMMYQAMGVCIYMQDADGALRYGEKVARYYSKRYHPYSLNV 380

Query: 349 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 408
                KL +I M                                           LG   
Sbjct: 381 SSLYLKLGRIYMA------------------------------------------LGRHS 398

Query: 409 NAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
             I ++T+A+ I+ + HG +  ++K++  +L +
Sbjct: 399 AGINALTKAMAIMEVAHGKDHQYLKDIRKELSQ 431


>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
 gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
          Length = 457

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/437 (25%), Positives = 212/437 (48%), Gaps = 29/437 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G  I+S++P+  V  +   + RCD C  A   ++KCS C+ V YC   CQ+  W LH+ E
Sbjct: 22  GTRILSEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAWPLHKHE 81

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI-- 124
           C  L R+   + + V  + R++ +L LR +   D      T         L++H ++I  
Sbjct: 82  CPFLKRV---QPRIVPDAARMLCRLILRLEHGGDLERGYYTEHGSRKFRDLMSHYAEIKN 138

Query: 125 DEKQL----LLYAQIANLVNLILQWPEI---SINEIAENFSKLACNAHTICNSELRPLGT 177
           D K+      L+A + ++++           +I E+   + +L  N   I ++E+  + T
Sbjct: 139 DPKRREHLESLHAVLTDMISDSNSGSSSIVPNITELMSIYGRLITNGFNILDAEMNSIAT 198

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKAL 235
            +Y  +SI +HSC PNAV  FEG    V  ++ +P    +++ ISYI+   +   R+  L
Sbjct: 199 AIYLGVSITDHSCQPNAVATFEGNELHVHVIEDLPCLDWSKIYISYIDLLNTPEQRRADL 258

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           +E Y F C C +CI      D QE   +    C +  C   +    + +   CQQC    
Sbjct: 259 REHYYFLCVCSKCI------DPQEMHEMTAAVCPNASCDASV----NIELAKCQQCD-AS 307

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
              +++   +++   ++ +L       + +V +    +EK Q+ + HP +V  ++T +  
Sbjct: 308 VTPQLRTAYNDIMAFTQSSLDSMKEVAYLDVCNV--CLEK-QRGVLHPLNVWHVKTLDAA 364

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +  + +  W +ALA+ Q  +P + + +  ++PL+GL +   GK++ +  + + A+  + 
Sbjct: 365 FEAAINVGKWTDALAFGQQLLPGFGKYHGAWNPLVGLLHMKLGKIQLYERNYKLAVHHLQ 424

Query: 416 EAVEILRITHGTNSPFM 432
           EA  IL +THG +   +
Sbjct: 425 EAQRILSVTHGHDHRLL 441


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 200/438 (45%), Gaps = 41/438 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                    KD   +   +R   D         +VR    + +
Sbjct: 262 CECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIE 303

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
                      +  L  C   QE +S+          ++   +V ++    + + + + +
Sbjct: 304 EFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 353

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ 
Sbjct: 354 QDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIME 413

Query: 423 ITHGTNSPFMKELILKLE 440
           + HG + P++ E+  ++E
Sbjct: 414 VAHGKDHPYISEIKQEIE 431


>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
 gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=HSKM-B; AltName: Full=Histone methyltransferase
           SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
           AltName: Full=SET and MYND domain-containing protein 2
 gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
 gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
 gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
          Length = 433

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 200/438 (45%), Gaps = 41/438 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                    KD   +   +R   D         +VR    + +
Sbjct: 262 CECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIE 303

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
                      +  L  C   QE +S+          ++   +V ++    + + + + +
Sbjct: 304 EFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 353

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ I+ 
Sbjct: 354 QDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIAIME 413

Query: 423 ITHGTNSPFMKELILKLE 440
           + HG + P++ E+  ++E
Sbjct: 414 VAHGKDHPYISEIKQEIE 431


>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 122/444 (27%), Positives = 208/444 (46%), Gaps = 54/444 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L  CS C++  YC + CQK  W  HR E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMHCSQCRIAKYCSAKCQKKAWPDHRRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  L      K +    S+RL+ ++ +  KL ++   PS +   YS  + L +++S + E
Sbjct: 87  CSCLKSC---KPRYPPDSVRLLGRVIV--KLMDEK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA------ENFSKLACNAHTICNSELRPLGTGLY 180
            +     Q+A      ++      +++       E F+K+ CN+ TICN+E++ +G GLY
Sbjct: 139 DKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY 
Sbjct: 199 PSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C RC    Q  D                      +D+D          ++   E+I
Sbjct: 259 FECDCIRC----QTQD----------------------KDAD----------MLTGDEQI 282

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            K   EV    KK   L +    ++V++  + I           ++  ++  +  +   +
Sbjct: 283 WK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPNINIYQLKVLDCAMDACI 339

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +I
Sbjct: 340 NLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDI 399

Query: 421 LRITHGTNSPFMKELILKLEEAQA 444
           +++THG     +++LI+ LEE  A
Sbjct: 400 MKVTHGREHSLIEDLIVLLEECDA 423


>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
 gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
          Length = 457

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 113/437 (25%), Positives = 212/437 (48%), Gaps = 29/437 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G  I++++P+  V  +   + RCD C  A   ++KCS C+ V YC   CQ+  W LH+ E
Sbjct: 22  GTRILTEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAWPLHKHE 81

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI-- 124
           C  L R+   + + V  + R++ +L LR +   D      T         L++H ++I  
Sbjct: 82  CPFLKRV---QPRIVPDAARMLCRLILRLEHGGDLERGYYTEHGSRKFRDLMSHYAEIKN 138

Query: 125 DEKQL----LLYAQIANLVNLILQWPEI---SINEIAENFSKLACNAHTICNSELRPLGT 177
           D K+      L+A + ++++           +I E+   + +L  N   I ++E+  + T
Sbjct: 139 DPKRREHLESLHAVLTDMISDSNSGSSSIVPNITELMSIYGRLITNGFNILDAEMNSIAT 198

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKAL 235
            +Y  +SI +HSC PNAV  FEG    V  ++ +P    +++ ISYI+   +   R+  L
Sbjct: 199 AIYLGVSITDHSCQPNAVATFEGNELHVHVIEDLPCLDWSKIYISYIDLLNTPEQRRADL 258

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 295
           +E Y F C C +CI      D QE   +    C +  C   +    + +   CQQC    
Sbjct: 259 REHYYFLCVCSKCI------DPQEMHEMTAAVCPNASCDASV----NIELAKCQQCD-AS 307

Query: 296 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
              +++   +++   ++ +L       + +V +    +EK Q+ + HP +V  ++T +  
Sbjct: 308 VTPQLRTAYNDIMAFTQSSLDSMKEVAYLDVCNV--CLEK-QRGVLHPLNVWHVKTLDAA 364

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +  + +  W +ALA+ Q  +P + + +  ++PL+GL Y   GK++ +  + + A+  + 
Sbjct: 365 FEAAINVGKWTDALAFGQQLLPGFGKYHGAWNPLVGLLYMKLGKIQLYERNYKLAVHHLQ 424

Query: 416 EAVEILRITHGTNSPFM 432
           EA  IL +THG +   +
Sbjct: 425 EAQCILSVTHGHDHRLL 441


>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
 gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
          Length = 430

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 114/436 (26%), Positives = 202/436 (46%), Gaps = 37/436 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V  ++   + CD CFA    L KC  C+  +YC  +CQK DW +H+LE
Sbjct: 28  GELLFTCPAYTYVLTDTERGNHCDFCFARKEGLSKCGKCKQAFYCNVDCQKGDWPMHKLE 87

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K Q +     T ++ +  V+   +H+S +D 
Sbjct: 88  CSAMCSYGQNWCPSET--VRLTARILAKQKTQTER----TPSETFLSVKEFESHLSKLDN 141

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+ + IA L     +    + N  +   F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 142 EKKELIESDIAALHRFYSKNLHYTDNAALVFLFAQVNCNGFTIEDEELSHLGSAIFPDVA 201

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ F+G +A +RAVQ +  G EV  SYI+    T  R   L + Y FTC 
Sbjct: 202 LMNHSCCPNIIVTFKGTVAEIRAVQEIHAGDEVFTSYIDLLYPTEDRNDRLMDSYFFTCD 261

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C         ++ A LE  +  D      +      K        +V      K   
Sbjct: 262 CRECST-----KQKDPAKLEIRKLSDPPSHQTV------KDMIKYARNIVEEFRRAKHYK 310

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
           +   +L    L+L   G+                 ++   +V ++    + + + + L++
Sbjct: 311 TPSELLEMCELSLDKMGS-----------------VFVDSNVYMLHMMYQAMGVCLYLQE 353

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W  AL Y +  I  Y + YP +   +   +   G+L   L  T    K++ +A+ I++I 
Sbjct: 354 WDGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRLYMGLEKTTIGTKALKKALAIMQIA 413

Query: 425 HGTNSPFMKELILKLE 440
           HG +  ++ E+  +LE
Sbjct: 414 HGPDHHYIAEIKKELE 429


>gi|297722479|ref|NP_001173603.1| Os03g0704200 [Oryza sativa Japonica Group]
 gi|255674817|dbj|BAH92331.1| Os03g0704200 [Oryza sativa Japonica Group]
          Length = 174

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 68/113 (60%), Positives = 90/113 (79%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           GEV+ISQEPY   PN  S  S CD CFAS NL+KCS C+V WYCGS CQ+ +WKLH+LEC
Sbjct: 59  GEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVCRVAWYCGSACQREEWKLHQLEC 118

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
           + ++ L ++++K +TP+IRLM++L LRRKLQ+D  IPS+ TDNY+LV+AL +H
Sbjct: 119 RAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDDKAIPSSGTDNYNLVDALESH 171


>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
          Length = 386

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 106/407 (26%), Positives = 194/407 (47%), Gaps = 38/407 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K 
Sbjct: 14  GLSKCGRCRQAFYCDVECQKEDWPMHKLECSAMCVFGENWNPSET--VRLTARILAKQKT 71

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAE 155
            ++     T+++    V+   +H+  +D EK+ L+ + IA L +   +  E   N  +  
Sbjct: 72  HHER----TSSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHLEYPDNCSLVT 127

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 128 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGD 187

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS 274
           EV  SYI+    T  R   LK+ Y FTC C  C  K      ++   + E  + +D    
Sbjct: 188 EVFTSYIDLLYPTEDRNDRLKDSYFFTCECRECTTKAKDKAKVEIRKLSEPPKAED---- 243

Query: 275 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 334
              +RD            +V+    + +           +  L  C   QE +S+     
Sbjct: 244 ---IRD------------MVKYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----- 283

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 394
                L+   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +
Sbjct: 284 -----LFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMW 338

Query: 395 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
              G+L   L    + +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 339 LKLGRLYMGLEHKASGVKALKKAIVIMEVAHGKDHPYISEIKKEIED 385


>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
           griseus]
          Length = 404

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 100/407 (24%), Positives = 190/407 (46%), Gaps = 40/407 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YCG  CQK DW LH+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 32  GLSKCGRCKQAFYCGVECQKEDWPLHKLECSSMVVFGENWNPSET--VRLTARILAKQKI 89

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEI 153
             +     T ++    V+   +H+  +D EK+ L+ + IA L       L++P+ S   +
Sbjct: 90  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHQFYSRHLEFPDHS--SL 143

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
              F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  
Sbjct: 144 VVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHP 203

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           G EV  SYI+    T  R   L++ Y FTC C  C                         
Sbjct: 204 GDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT------------------------ 239

Query: 274 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 333
                +D D      ++       E ++ +      + ++        +  E++   ++ 
Sbjct: 240 ----TKDKDKAKVEIRKLSSPPQAEAVRDMVRYARNVIEEFRRAKHYKSPSELLEICELS 295

Query: 334 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
           ++    ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   
Sbjct: 296 QEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASM 355

Query: 394 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 356 WLKLGRLYMGLENKAAGQKALKKAIAIMEVAHGKDHPYISEIKQEIE 402


>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
          Length = 390

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/408 (26%), Positives = 189/408 (46%), Gaps = 32/408 (7%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K 
Sbjct: 8   GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQK- 64

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
              N   +T ++    V+   +H+  +D EK+ L+ + I+ L +   +  E   N+ +  
Sbjct: 65  ---NHPETTPSEKLLAVKDFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDNKSLVV 121

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 122 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGE 181

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C                    K    + 
Sbjct: 182 EVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKHKDKAK 223

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
             +R   D       C +VR     + +  E          L  C          ++ E 
Sbjct: 224 VKIRKLSDPPKAEAICDMVRY---ARNVIEEFRRAKHYKYILYRCPGVAPPSELLEICEL 280

Query: 336 LQKKLYHPF---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 392
            Q+K+   F   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +  
Sbjct: 281 SQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVAS 340

Query: 393 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
            +   G+L   L +     K++ +AVEI+ + HG + P++ E+  ++E
Sbjct: 341 MWLKLGRLYMGLENKAAGEKALRKAVEIMEVAHGKDHPYISEIKQEIE 388


>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
           rubripes]
          Length = 432

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 119/448 (26%), Positives = 208/448 (46%), Gaps = 50/448 (11%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK 61
           +F V GE+++S + Y  V +       C+ CF    +L +C  C+  +YC   CQK DW 
Sbjct: 25  AFQV-GELLLSSQAYSYVLSVKERGEHCESCFTRKKSLARCGKCKKAFYCDVKCQKGDWA 83

Query: 62  LHRLECQVLSRLDKEKRKSVTPS--IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           +HRLEC  ++   ++      PS   RL+ ++  ++K+Q D       ++   L+  + +
Sbjct: 84  MHRLECSAMNAFGEK----WCPSEITRLVARILTKKKMQKDRC----ASEKLLLLGEMQS 135

Query: 120 HMSDID-EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGT 177
           H  D+D E++  + A +A L     +  EI    ++   FS++ACN  TI + EL  LGT
Sbjct: 136 HTEDMDNERRETMEADVAGLHQFFSKHLEIPGHKDLLTLFSQVACNGFTIEDEELSHLGT 195

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
            +YP +++INHSC P+ ++ + G  A VRA++ +  G EVLISYI+    T  R   L+E
Sbjct: 196 AVYPDVALINHSCRPSVIVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRE 255

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
            Y FTC C  C          ES                  R  D      ++    RS+
Sbjct: 256 SYYFTCQCEEC----------ES------------------RSMDQAKLKARK----RSE 283

Query: 298 E-EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTRE 353
             E + I++ V    K         N     +  +M E+   ++   F   +V ++    
Sbjct: 284 PIEPEAISNMVRYARKCIREFRVFKNTSPASTLLEMCEQSLDEMGAVFDDSNVYMLHMMY 343

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 413
           + + + + ++D   A+ Y +     Y+++YP +   L   Y   G+L + +      I  
Sbjct: 344 QAMGVCLYMQDPDGAIRYGEKIAKYYRKLYPAYSLNLSSLYLKLGRLYFGMERNAECIGM 403

Query: 414 MTEAVEILRITHGTNSPFMKELILKLEE 441
           + +A  I+ +THG    ++ EL  +++E
Sbjct: 404 LKKAKAIMEVTHGKEHFYLTELDRQMKE 431


>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 391

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/406 (26%), Positives = 191/406 (47%), Gaps = 37/406 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 18  GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 75

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
             +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N+ +  
Sbjct: 76  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVV 131

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 132 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGE 191

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C                    KD   + 
Sbjct: 192 EVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDKDKAK 233

Query: 276 FLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 334
             +R   D         +VR ++  I++     +  S  +  L  C   QE +S+     
Sbjct: 234 VEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPPSELLEICELSQEKMSS----- 288

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 394
                ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   +
Sbjct: 289 -----VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMW 343

Query: 395 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
              G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 344 LKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 389


>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
           mutus]
          Length = 383

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/415 (25%), Positives = 195/415 (46%), Gaps = 38/415 (9%)

Query: 30  CDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  CF     L KC  C+  +YC   CQ+ DW +H+LEC  +    +    S T  +RL 
Sbjct: 1   CPFCFLRKEGLSKCGRCKQAFYCNVECQREDWPMHKLECSPMVVFGENWNPSET--VRLT 58

Query: 89  LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPE 147
            ++  ++K+  +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E
Sbjct: 59  ARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLE 114

Query: 148 ISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 206
              N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VR
Sbjct: 115 FPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVR 174

Query: 207 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 266
           AVQ +  G EV  SYI+    T  R   L++ Y FTC C  C                  
Sbjct: 175 AVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT---------------- 218

Query: 267 RCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQE 325
             KD   +   +R  +D         +VR ++  I++     +  S  +  L  C   QE
Sbjct: 219 --KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEEFRRAKHYKSPPSELLEICELSQE 276

Query: 326 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 385
            +S           ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP 
Sbjct: 277 KMSC----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPL 326

Query: 386 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   +   +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 327 YSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 381


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/438 (26%), Positives = 206/438 (47%), Gaps = 41/438 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V  ++   + CD CF     L KC  C+  +YC  +CQK DW +H+LE
Sbjct: 28  GELLFSCPAYTYVLTDNERGNHCDFCFTRKEGLSKCGKCKQAFYCNVDCQKGDWPMHKLE 87

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K Q +     T ++ +  V+   +H+S +D 
Sbjct: 88  CSSMCSSGQNWCPSET--VRLTARILAKQKTQTER----TASERFMSVKEFESHLSKLDN 141

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAEN-FSKLACNAHTICNSELRPLGTGLYPVIS 184
           EK+ L+   I+ L     +      N   E  F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 142 EKKELIENDISALHRFYSKNVHNCDNAALEFLFAQVNCNGFTIEDEELSHLGSAIFPDVA 201

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G +A VRAVQ +  G EV  SYI+    T  R   LK+ Y F+C 
Sbjct: 202 LMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEEVFTSYIDLLYPTEDRNDRLKDSYFFSCD 261

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C         ++ A LE  R   D  S   +RD            +V      K   
Sbjct: 262 CRECST-----KQKDPAKLE-LRKLSDPPSPQTVRD-----MITYARNVVEEFRRAKHYK 310

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
           +   +L    L+L   G+                 ++   +V ++    + + + + ++D
Sbjct: 311 TPSELLEICELSLDKMGS-----------------VFVDSNVYMLHMMYQAMGVCLYMQD 353

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KSMTEAVEILR 422
           W+ AL Y +  I  Y + YP +   +   +   G+L  ++G  +N I  K++ +A+ I+ 
Sbjct: 354 WEGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRL--YMGLEKNTIGTKALKKALAIME 411

Query: 423 ITHGTNSPFMKELILKLE 440
           I HG +  ++ E+  +LE
Sbjct: 412 IAHGPDHYYIAEIKKELE 429


>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
           [Ailuropoda melanoleuca]
          Length = 385

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/407 (25%), Positives = 188/407 (46%), Gaps = 40/407 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 13  GLSKCGRCKQAFYCDVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 70

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEI 153
             +     T ++    V+   +H+  +D EK+ L+ + IA L +     L++P+   + +
Sbjct: 71  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS--DSL 124

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
              F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  
Sbjct: 125 VVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISP 184

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           G EV  SYI+    T  R   L++ Y FTC C  C                    KD   
Sbjct: 185 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT------------------TKDKDK 226

Query: 274 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 333
           +   +R   D         +VR    + +           +  L  C   QE + +    
Sbjct: 227 AKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS---- 282

Query: 334 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
                 ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   
Sbjct: 283 ------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASM 336

Query: 394 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   G+L   L D     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 337 WLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 383


>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
           gorilla]
          Length = 384

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 187/411 (45%), Gaps = 36/411 (8%)

Query: 32  GCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKL 91
           G      L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++
Sbjct: 6   GRVRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARI 63

Query: 92  YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISI 150
             ++K+  +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   
Sbjct: 64  LAKQKIHPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPD 119

Query: 151 NE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
           N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 120 NDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 179

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 269
            +  G EV  SYI+    T  R   L++ Y FTC C  C                    K
Sbjct: 180 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TK 221

Query: 270 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
           D   +   +R   D         +VR    + +           +  L  C   QE +S+
Sbjct: 222 DKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 281

Query: 330 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 282 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 331

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 332 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
          Length = 386

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 109/413 (26%), Positives = 193/413 (46%), Gaps = 46/413 (11%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 8   GLSKCGRCKQAFYCDVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 65

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEI 153
             +     T ++    V+   +H+  +D EK+ L+ + IA L +     L++P+   + +
Sbjct: 66  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS--DSL 119

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
              F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  
Sbjct: 120 VVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISP 179

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           G EV  SYI+    T  R   L++ Y FTC C  C                    KD   
Sbjct: 180 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT------------------TKDKDK 221

Query: 274 SGFLLRDSDDKGFTCQQCGLVRSK----EEIKKIASEVNIL--SKKTLALTSCGNHQEVV 327
           +   +R   D         +VR      EE ++      IL  S  +  L  C   QE +
Sbjct: 222 AKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKYILYNSPPSELLEICELSQEKM 281

Query: 328 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 387
            +          ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP + 
Sbjct: 282 GS----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYS 331

Query: 388 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
             +   +   G+L   L D     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 332 LNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 384


>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
          Length = 384

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 192/411 (46%), Gaps = 36/411 (8%)

Query: 32  GCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKL 91
           G      L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++
Sbjct: 6   GRVRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARI 63

Query: 92  YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISI 150
             ++K+  +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   
Sbjct: 64  LAKQKIHAER----TPSEKLLAVKEFESHLEKLDNEKKDLIQSDIAALHHFYSKHLEFPD 119

Query: 151 NE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
           N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 120 NDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 179

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 269
            +  G EV  SYI+    T  R   L++ Y FTC C  C         ++ A +E  R  
Sbjct: 180 EINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTRD-----KDKAKVE-IRKL 233

Query: 270 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
            D      +RD            +VR    + +           +  L  C   QE +S+
Sbjct: 234 TDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 281

Query: 330 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 282 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 331

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   +   G+L   L +     K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 332 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
          Length = 395

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/415 (25%), Positives = 191/415 (46%), Gaps = 37/415 (8%)

Query: 29  RCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRL 87
           +C  CF     L KC  C+  +YC   CQ+ DW +H+LEC  +    +    S T  +RL
Sbjct: 13  QCPFCFLRKEGLSKCGRCKQAFYCNVECQREDWPMHKLECSPMVVFGENWNPSET--VRL 70

Query: 88  MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWP 146
             ++  ++K+  +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  
Sbjct: 71  TARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHL 126

Query: 147 EISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
           E   N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA V
Sbjct: 127 EFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEV 186

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
           RAVQ +  G EV  SYI+    T  R   L++ Y FTC C  C                 
Sbjct: 187 RAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT--------------- 231

Query: 266 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 325
              KD   +   +R  +D         +VR    + +           +  L  C   QE
Sbjct: 232 ---KDKDKAKVEIRKLNDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQE 288

Query: 326 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 385
            +S           ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP 
Sbjct: 289 KMSC----------VFEDTNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPL 338

Query: 386 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   +   +   G+L   L +     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 339 YSLNVASMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 393


>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
          Length = 380

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 186/405 (45%), Gaps = 36/405 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 8   GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 65

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
             +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N+ +  
Sbjct: 66  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVV 121

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 122 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGE 181

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C                    KD   + 
Sbjct: 182 EVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDKDKAK 223

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
             +R   D         +VR    + +           +  L  C   QE +S+      
Sbjct: 224 VEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS------ 277

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 278 ----VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWL 333

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
             G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 334 KLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 127/430 (29%), Positives = 203/430 (47%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMEET--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P  S  +I E F+K+ CNA TICN+E++ +G G
Sbjct: 139 DKKDGLRQLVMTFQHFMREEIQDASQLP--SSFDIFEAFAKVICNAFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K LK+Q
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLKDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +    + +E
Sbjct: 257 YCFECDCVRCET----------------------------QDKDADMLTGDEQVWRQVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q R  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QTIISSN--AERLP-------DINVYQLRVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
           familiaris]
          Length = 523

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 98/405 (24%), Positives = 188/405 (46%), Gaps = 36/405 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 151 GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 208

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
             +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   +E +  
Sbjct: 209 HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDSESLVV 264

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 265 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGE 324

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C                           
Sbjct: 325 EVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTT------------------------- 359

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
              +D D      ++       E I+ +      + ++        +  E++   ++ ++
Sbjct: 360 ---KDKDKAKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQE 416

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 417 KMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWL 476

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
             G+L   L D     +++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 477 KLGRLYMGLEDKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 521


>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
 gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 435

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 113/443 (25%), Positives = 205/443 (46%), Gaps = 48/443 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ +   Y  V   +    RC+ CF     L KC  C+  +YC   CQ+ DW +H+LE
Sbjct: 30  GDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAYYCNVECQRGDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL+ ++ L++K Q +     T ++    +  L AH+  +D 
Sbjct: 90  CSAMCAYGENWCPSET--VRLVARIILKQKHQTER----TPSERVLTLRELEAHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK  +    IA L +     L +P+ +   + E  +++ CN  TI + EL  LG+ L+P 
Sbjct: 144 EKNEMNDTDIAALHHFYSRHLDFPDNAA--LTELIAQVNCNGFTIEDEELSHLGSALFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G +A VRAVQ +    E+  SYI+    T  R + LK+ Y F 
Sbjct: 202 VALMNHSCSPNVIVTYKGTVAEVRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFN 261

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C                  + KD+            K    Q+  +   +EEIK+
Sbjct: 262 CDCKECTS----------------KSKDEA-----------KMEIRQKLSIPPEEEEIKQ 294

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSVNLMQTREKLIKI 358
           +     I ++  +       H +  S    I +L  +    ++   +V ++    + + +
Sbjct: 295 MV----IYARNVIEEFRRAKHYKTPSELLEICELSMEKMGAIFAETNVYMLHMMYQAMGV 350

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW  A+ Y +  I  Y   YP +   +   Y   G+L   L      +K++ +A+
Sbjct: 351 CLYMQDWDGAMKYGEKIIHPYSVHYPPYSLNVASMYLKLGRLYLGLEKRTQGVKALKKAL 410

Query: 419 EILRITHGTNSPFMKELILKLEE 441
            I+ I HG + P++ E+  ++EE
Sbjct: 411 AIMDIAHGKDHPYIDEIKKEMEE 433


>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
          Length = 384

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 187/411 (45%), Gaps = 36/411 (8%)

Query: 32  GCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKL 91
           G      L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++
Sbjct: 6   GRVRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARI 63

Query: 92  YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISI 150
             ++K+  +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   
Sbjct: 64  LAKQKIHPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFLD 119

Query: 151 NE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
           N+ +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 120 NDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 179

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 269
            +  G EV  SYI+    T  R   L++ Y FTC C  C                    K
Sbjct: 180 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TK 221

Query: 270 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
           D   +   +R   D         +VR    + +           +  L  C   QE +S+
Sbjct: 222 DKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 281

Query: 330 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
                     ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   
Sbjct: 282 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 331

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   +   G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 332 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 129/449 (28%), Positives = 209/449 (46%), Gaps = 64/449 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++   +P        S  + CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLLRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTP-SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ L    K  R    P S+RL+ ++    KL      PS +   YS  + L ++++ + 
Sbjct: 87  CKCL----KNCRPRYPPDSVRLLGRVVC--KLMEGT--PSESEKLYSFYD-LESNINKLT 137

Query: 126 EKQLLLYAQIANLVNLIL--------QWPEISINEIAENFSKLACNAHTICNSELRPLGT 177
           E +     Q+A      +        Q P     +I E F+K+ CN+ TIC++E++ +G 
Sbjct: 138 EDKKEGLRQLAMTFQDFMREEIQDASQLP--PSFDIFEAFAKVICNSFTICDAEMQEVGV 195

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++
Sbjct: 196 GLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRD 255

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           QY F C C                    +RC+         +D D    T  +      +
Sbjct: 256 QYCFECDC--------------------FRCR--------TQDKDADMLTGNEQAWKEVQ 287

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKL 355
           E +KKI  E+    K    L  C   Q ++S+    E+L          N+ Q +  +  
Sbjct: 288 ESLKKI-EELKAHRKWEQVLAMC---QTILSSNS--EQLP-------DTNIYQLKVLDCA 334

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
           +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++ 
Sbjct: 335 MDACINLGLLEEALFYGIRTMEPYRIFYPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLR 394

Query: 416 EAVEILRITHGTNSPFMKELILKLEEAQA 444
            A +I+R+THG     +++LI  LEE  A
Sbjct: 395 LAFDIMRVTHGREHSLIEDLIRLLEECDA 423


>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
          Length = 380

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 186/405 (45%), Gaps = 36/405 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 8   GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 65

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
             +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N+ +  
Sbjct: 66  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLIV 121

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 122 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGE 181

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C                    KD   + 
Sbjct: 182 EVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDKDKAK 223

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
             +R   D         +VR    + +           +  L  C   QE +S+      
Sbjct: 224 VEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS------ 277

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 278 ----VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWL 333

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
             G+L   L       K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 334 KLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378


>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
          Length = 402

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 98/406 (24%), Positives = 188/406 (46%), Gaps = 36/406 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K 
Sbjct: 30  GLSKCGRCRQAFYCNVECQKEDWPMHKLECSAMCVFGENWNPSET--VRLTARILAKQKT 87

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAE 155
            ++     T+++    V+   +H+  +D EK+ L+ + I+ L +   +  E   N  +  
Sbjct: 88  HHER----TSSEKLLAVKEFESHLDKLDNEKRELIQSDISALHHFYSKHIEYPDNASLVT 143

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 144 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGD 203

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   LK+ Y FTC C  C                           
Sbjct: 204 EVFTSYIDLLYPTEDRNDRLKDSYFFTCECRECTT------------------------- 238

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
              +  D      ++       E I+ +      + ++           E++   ++ ++
Sbjct: 239 ---KAKDKAKVEIRKLSEPPKAESIRDMVKYARNVIEEFRRAKHYKTPSELLEICELSQE 295

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               L+   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 296 KMGSLFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWL 355

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
             G+L   L    + +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 356 KLGRLYMGLEHKASGVKALKKAIAIMEVAHGKDHPYISEIKKEIED 401


>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
          Length = 385

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/408 (26%), Positives = 197/408 (48%), Gaps = 35/408 (8%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K  
Sbjct: 7   LSKCGRCKQAFYCNVECQKEDWPMHKLECSAMCAFGQNWNPSET--VRLTARILAKQKTH 64

Query: 99  NDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIA 154
            +     T ++    V+   +H+  +D EK+ L+   IA L +     L++P+ +   + 
Sbjct: 65  PER----TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHLEYPDNAA--LV 118

Query: 155 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
             F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +  G
Sbjct: 119 VLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEIEPG 178

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 274
            EV  SYI+    T  R   L++ Y F C C  C+   +  D ++  I    R  +D  S
Sbjct: 179 EEVFTSYIDLLYPTEDRNDRLRDSYFFNCDCRECVTKEK--DKEKLEI----RKLNDPPS 232

Query: 275 GFLLRDSDDKGFTCQQCGLVRSK-EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 333
              +RD              R+  EE ++      IL   TLA        E++   ++ 
Sbjct: 233 AETVRD---------MIKYARNVIEEFRRAKHYKYILCLTTLAC-------ELLEICELS 276

Query: 334 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
                 ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   
Sbjct: 277 LDKMGAVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVASM 336

Query: 394 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
           +   G+L   L +    +K++  A+ I+ + HG + P++ E+  +LE+
Sbjct: 337 WLKLGRLYMALENRAAGVKALKRAIAIMEVAHGKDHPYISEIKKELED 384


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 125/430 (29%), Positives = 202/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++M+ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVIF--KLMEET--PSESEKLYSFYD-LESNMNKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL+L  Q  +   +    Q P     +I E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 EKKEGLRQLVLTFQHFMREEIQDASQLP--PSFDIFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QTIISSN--AERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
           anatinus]
          Length = 412

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 192/406 (47%), Gaps = 36/406 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW  H+LEC  +  L +    S T  +RL  ++  ++K 
Sbjct: 40  GLSKCGRCKQAFYCNVECQKEDWPTHKLECSAMCVLGESWNPSET--VRLTARILAKQKA 97

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAE 155
             +     T ++    V+   +H+  +D EK+ L+ + IA L +   +  E   N  +  
Sbjct: 98  HPER----TPSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHIEYPDNASLVV 153

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G 
Sbjct: 154 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGD 213

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           E+  SYI+    T  R   L++ Y FTC C  CI        ++ A +E  R  +D    
Sbjct: 214 EIFTSYIDLLYPTEDRNDRLRDSYFFTCECRECITKE-----KDKAKVE-IRKLNDPPKA 267

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
             +RD            +++    + +           +  L  C   QE + +      
Sbjct: 268 EAIRD------------MIKYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS------ 309

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 310 ----VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWL 365

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
             G+L   L +    +K++ +A+ I+ + HG + P++ E+  ++E+
Sbjct: 366 KLGRLYMGLENKSAGVKALKKALTIMEVAHGKDHPYISEIKKEIED 411


>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
          Length = 386

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/402 (25%), Positives = 186/402 (46%), Gaps = 40/402 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 14  GLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKI 71

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEI 153
             +     T ++    V+   +H+  +D EK+ L+ + +A L +     L++P+   + +
Sbjct: 72  HPER----TPSEKLLAVKEFESHLDKLDNEKKDLIQSDVAALHHFYSKHLEFPDS--DSL 125

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
              F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  
Sbjct: 126 VVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINP 185

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           G EV  SYI+    T  R   L++ Y FTC C  C                    KD   
Sbjct: 186 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT------------------TKDKDK 227

Query: 274 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 333
           +   +R   D         +VR    + +           +  L  C   QE +S+    
Sbjct: 228 AKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS---- 283

Query: 334 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
                 ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   
Sbjct: 284 ------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASM 337

Query: 394 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           +   G+L   L +     +++ +A+ I+ + HG + P++ E+
Sbjct: 338 WLKLGRLHMGLENKAAGERALKKAIAIMEVAHGKDHPYISEI 379


>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
           cuniculus]
          Length = 394

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/407 (26%), Positives = 192/407 (47%), Gaps = 40/407 (9%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC  +CQK DW +H+LEC  +    +    S T  +RL  ++  ++K 
Sbjct: 22  GLSKCGRCKQAFYCDVDCQKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKS 79

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEI 153
             +     T ++    VE   +H+  +D EK+ L+ + I+ L +     L++P+   + +
Sbjct: 80  HPER----TPSEKLLAVEEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDH--DSL 133

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
              F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  
Sbjct: 134 VVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISP 193

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           G EV  SYI+    T  R + L++ Y FTC C  C         ++ A +E  R  D   
Sbjct: 194 GEEVFTSYIDLLYPTEDRNERLRDSYFFTCECLECTTKD-----KDKAKVEIRRLSDPP- 247

Query: 274 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 333
               +RD            +VR    + +              L  C   QE + +    
Sbjct: 248 KAEAIRD------------MVRYARNVIEEFRRAKHYKSPPELLEICELSQEKMGS---- 291

Query: 334 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
                 ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   
Sbjct: 292 ------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASM 345

Query: 394 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
           +   G+L   L +     K++ +A+ I+ + HG + P++ E+  +LE
Sbjct: 346 WLKLGRLYLGLENKAAGEKALRKAIAIMEVAHGKDHPYISEIKQELE 392


>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 378

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 195/406 (48%), Gaps = 47/406 (11%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           KCS C  V YC   CQK  W +H+ EC  L R+   K   +  + RL+ ++ ++      
Sbjct: 2   KCSGCAFVRYCDRTCQKNGWIIHKYECHNLKRIAPRK---LPDAARLLSRIIIKLNKGGA 58

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL 160
           +     T  NY   + L++           L+    +L +++ ++ +   +++  N ++L
Sbjct: 59  DEKSYYTKSNYRKFKDLMSRK---------LFKHFMSLNSVLHEYLQ---DQLIPNTAEL 106

Query: 161 --ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAE 216
               N   I +SE++ +G+GLY   SII+HSC PNAV VF+G    +R ++ +P    ++
Sbjct: 107 LMVINTFNILDSEMQSIGSGLYLASSIIDHSCSPNAVAVFKGTTIFIRTLEDIPIMDWSK 166

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           V ISYIE       RQ+ L   Y F C C +C       D      ++   C+++ C  F
Sbjct: 167 VFISYIELLNLPEIRQQELLSSYYFLCQCSKCT------DSDNLNFMKSIYCQNEKCKNF 220

Query: 277 LLRDSDDKGFTCQQCGLVRSKE---EIKKI----ASEVNILSKKTLALTSCGNHQEVVST 329
           L+ +  +    C +CG   SKE   EIK+I     SE+N +      L  C         
Sbjct: 221 LMPNEVE----CHKCGQSISKEDEDEIKEIIQYTESELNKMENMAY-LDIC--------- 266

Query: 330 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
            ++  + Q+K   P ++  ++  +  ++  + LE+W+ A+ Y +  +  + + Y ++HPL
Sbjct: 267 -RICLQKQEKYLTPLNIYKVKILDLAMESSIALENWESAVFYGRQLLEPFLKYYGEYHPL 325

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+     GK+ + L + + A K   EA  IL +THG N  F +EL
Sbjct: 326 RGIFLMKLGKILFLLNNIDEAKKYFKEANLILSVTHGKNHCFSREL 371


>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
           gallopavo]
          Length = 575

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 106/406 (26%), Positives = 189/406 (46%), Gaps = 36/406 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
            L KC  C+  +YC   CQK DW +H+LEC  +    +    S T  +RL  ++  ++K+
Sbjct: 203 GLSKCGRCKQAFYCNVECQKEDWPMHKLECSAMCAFGQNWNPSET--VRLTARILAKQKI 260

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAE 155
             +     T ++    V+   +H+  +D EK+ L+   IA L +   +  E   N  +  
Sbjct: 261 HPER----TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVV 316

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
            F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +  G 
Sbjct: 317 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGILAEVRAVKEIEPGE 376

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           EV  SYI+    T  R   L++ Y FTC C  C    +  D ++  I    R  +D  S 
Sbjct: 377 EVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTM--KEKDKEKLKI----RKLNDPPSA 430

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
             +RD            ++      K   S   +L    L+L   G              
Sbjct: 431 EAVRD-----MIKYARNVIEEFRRAKHYKSPSELLEICELSLDKMG-------------- 471

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               ++   +V ++    + + + + ++DW+ AL Y Q  I  Y + YP +   +   + 
Sbjct: 472 ---AVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKYYPSYSLNVASMWL 528

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
             G+L   L +     K++ +A+ I+ + HG + P++ E+  +LE+
Sbjct: 529 KLGRLYMALENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 574


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 121/428 (28%), Positives = 200/428 (46%), Gaps = 58/428 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++ +  KL  +   PS +   YS  + L +++S + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVV--KLMEET--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISIN------EIAENFSKLACNAHTICNSELRPLGTGLY 180
           ++     Q+A      ++     ++      +I E F+K+ CNA TICN+E++ +G GLY
Sbjct: 139 EKKEGIRQLAMTFQHFMREEIQDVSQLLPAFDIFEAFAKVFCNAFTICNAEMQGVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN  +VF G   ++RAV+ +  G E+ + Y++   ++  R+K L+ QY 
Sbjct: 199 PSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIETGEELTVCYLDLLMTSEERRKHLRSQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C RC                              +D D    T  +      +E +
Sbjct: 259 FDCDCVRC----------------------------QTQDKDADMLTGDEQVWKEVQESL 290

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIKI 358
           KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +  
Sbjct: 291 KKI-EELKAHWKWEQVLAMC---QAIISSN--AERLP-------DINIYQLKVLDCAMDA 337

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A 
Sbjct: 338 CIHLGLLEEALFYGLRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAF 397

Query: 419 EILRITHG 426
           +I+R+THG
Sbjct: 398 DIMRVTHG 405


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 117/427 (27%), Positives = 202/427 (47%), Gaps = 56/427 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C++  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++ +  KL +    PS +   YS  + L +++S + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVV--KLMDGK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISIN-------EIAENFSKLACNAHTICNSELRPLGTGL 179
            +     Q+A      ++  EI          ++ E F+K+ CN+ TICN+E++ +G GL
Sbjct: 139 DKKEGLRQLAMTFQHFMR-EEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           YP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY
Sbjct: 198 YPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQY 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F C C RC    Q  D                      +D+D          ++   E+
Sbjct: 258 CFECDCIRC----QTQD----------------------KDAD----------MLTGDEQ 281

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           I K   EV    KK   L +    ++V++  + I     +     ++  ++  +  +   
Sbjct: 282 IWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSERLPDINIYQLKVLDCAMDAC 338

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +
Sbjct: 339 INLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFD 398

Query: 420 ILRITHG 426
           I+++THG
Sbjct: 399 IMKVTHG 405


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 150/277 (54%), Gaps = 23/277 (8%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           SG+ I+S++P V + +N     RCD C    S+L++CS C+   YCG++CQ+  W++H+ 
Sbjct: 28  SGDTILSEQPVVYMLSNMLRGQRCDFCLEKLSDLQRCSRCKFARYCGASCQRAAWRIHKS 87

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC+ L R+     +  T  + LM +++   +L++ N             ++LV+++  ID
Sbjct: 88  ECERLKRVFP---RVPTDLVLLMFRVW---QLKSQN----------GWYDSLVSNVEKID 131

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
                 +  +  ++N  L   EIS  E  E FSK++CN+  IC+ E++ +GTG++P    
Sbjct: 132 SDAKEDFVSVLMVLNEYLG-SEISPPEGLELFSKISCNSFAICDGEMQAIGTGIFPNAVC 190

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
           +NHSC PN+V VF G    ++A++ +P G E+ ISYI+      TRQ+ L+ Q+ F C C
Sbjct: 191 LNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQC 250

Query: 246 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 282
            RC+     D    + +L    C +  C   + +  D
Sbjct: 251 HRCL-----DASDNNKMLTSLICPNKSCEAIVYQTFD 282



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG-DTENAIKSMTEAVEIL 421
           +DW +AL Y    + VY   YP++HP LG+  Y  GKL      D E A+K + +A  IL
Sbjct: 288 QDWTKALEYATRNLEVYTWFYPKYHPCLGVHLYKIGKLLAVTHQDLELAVKRLEQARRIL 347

Query: 422 RITHGTNSPFMKELILKLEEAQAE 445
            +THG + P ++EL   L +A  E
Sbjct: 348 EVTHGQSHPLVQELCEYLCQASEE 371


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 199/426 (46%), Gaps = 54/426 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C++  YC + CQK  W  HR E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHRRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  L      K +    S+RL+ ++ +  KL ++   PS +   YS  + L +++S + E
Sbjct: 87  CSCLKSC---KPRYPPDSVRLLGRVIV--KLMDEK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA------ENFSKLACNAHTICNSELRPLGTGLY 180
            +     Q+A      ++      +++       E F+K+ CN+ TICN+E++ +G GLY
Sbjct: 139 DKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY 
Sbjct: 199 PSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C RC    Q  D                      +D+D          ++   E+I
Sbjct: 259 FECDCIRC----QTQD----------------------KDAD----------MLTGDEQI 282

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            K   EV    KK   L +    ++V++  + I           ++  ++  +  +   +
Sbjct: 283 WK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKVLDCAMDACI 339

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +I
Sbjct: 340 NLGMLEEALLYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDI 399

Query: 421 LRITHG 426
           +++THG
Sbjct: 400 MKVTHG 405


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 200/427 (46%), Gaps = 56/427 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C++  YC + CQK  W  HR E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHRRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  L      K +    S+RL+ ++ +  KL ++   PS +   YS  + L +++S + E
Sbjct: 87  CSCLKSC---KPRYPPDSVRLLGRVIV--KLMDEK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISIN-------EIAENFSKLACNAHTICNSELRPLGTGL 179
            +     Q+A      ++  EI          ++ E F+K+ CN+ TICN+E++ +G GL
Sbjct: 139 DKKEGLRQLAMTFQHFMR-EEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           YP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY
Sbjct: 198 YPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQY 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F C C RC    Q  D                      +D+D          ++   E+
Sbjct: 258 CFECDCIRC----QTQD----------------------KDAD----------MLTGDEQ 281

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           I K   EV    KK   L +    ++V++  + I           ++  ++  +  +   
Sbjct: 282 IWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKVLDCAMDAC 338

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +
Sbjct: 339 INLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFD 398

Query: 420 ILRITHG 426
           I+++THG
Sbjct: 399 IMKVTHG 405


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 124/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C++  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMQET--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL L  Q  +   +    Q P     +I E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLALTFQHFMREEIQDASQLP--PSFDIFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP +S++NHSC PN  +VF G   ++R V+ V  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDVEAGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFDCDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QSIISSN--AERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 199/426 (46%), Gaps = 54/426 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C++  YC + CQK  W  HR E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHRRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  L      K +    S+RL+ ++ +  KL ++   PS +   YS  + L +++S + E
Sbjct: 87  CSCLKSC---KPRYPPDSVRLLGRVIV--KLMDEK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA------ENFSKLACNAHTICNSELRPLGTGLY 180
            +     Q+A      ++      +++       E F+K+ CN+ TICN+E++ +G GLY
Sbjct: 139 DKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY 
Sbjct: 199 PSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C RC    Q  D                      +D+D          ++   E+I
Sbjct: 259 FECDCIRC----QTQD----------------------KDAD----------MLTGDEQI 282

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            K   EV    KK   L +    ++V++  + I           ++  ++  +  +   +
Sbjct: 283 WK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKVLDCAMDACI 339

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +I
Sbjct: 340 NLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDI 399

Query: 421 LRITHG 426
           +++THG
Sbjct: 400 MKVTHG 405


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 123/428 (28%), Positives = 200/428 (46%), Gaps = 58/428 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C+V  YC S CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSSKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL   +  PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMEGS--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     +I E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDIFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ V  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDVEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDERVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            +K I  E+    K    L  C   Q ++S+    E+L        ++  ++  +  +  
Sbjct: 289 SLKHI-EELKAHWKWEQVLAMC---QAIISSNS--ERLPD-----LNIYQLKVLDCAMDA 337

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A 
Sbjct: 338 CINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAF 397

Query: 419 EILRITHG 426
           +I+R+THG
Sbjct: 398 DIMRVTHG 405


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYATRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDGT--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 63  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 122

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 123 CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 174

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 175 DRKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 232

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 233 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 292

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 293 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 324

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 325 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 371

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 372 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 431

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 432 AFDIMRVTHG 441


>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3 [Equus caballus]
          Length = 440

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 121/442 (27%), Positives = 196/442 (44%), Gaps = 74/442 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++  +       ++  T +++  L   L   +S  D 
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVFK-------LMEETPSESEKLYHVLHNDISVDDG 136

Query: 127 KQLLLYAQIANLV--------NLILQWPEISINEIA------------ENFSKLACNAHT 166
             +  +A I  L          L + +      EI             E F+K+ CN+ T
Sbjct: 137 PHISWWAHINKLTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDVFEAFAKVICNSFT 196

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 226
           ICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   
Sbjct: 197 ICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLM 256

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
           ++  R+K L++QY F C C RC                              +D D    
Sbjct: 257 TSEERRKQLRDQYCFECDCFRC----------------------------QTQDKDADML 288

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
           T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L         +
Sbjct: 289 TGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS--ERLP-------DI 335

Query: 347 NLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 404
           N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   
Sbjct: 336 NIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPVRGVQVMKVGKLQLHQ 395

Query: 405 GDTENAIKSMTEAVEILRITHG 426
           G    A+K++  A +I+R+THG
Sbjct: 396 GMFPQAMKNLRLAFDIMRVTHG 417


>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 442

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 116/455 (25%), Positives = 209/455 (45%), Gaps = 41/455 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN--LKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G V++ + PY  V  N  +++ C  C  SS   L +CS C+ V YC + CQK  W  H+ 
Sbjct: 2   GPVLVEEYPYAHVVGNEHAVTVCAHCMRSSKRPLSRCSRCRFVHYCDTQCQKGGWPAHKE 61

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  L +        V P +   +   L R +   N   +    N    ++L++H  +I 
Sbjct: 62  ECFFLVK--------VQPRVPTSMARLLARVIIRTNQKETIRAFNGRTFDSLLSHSDEIK 113

Query: 125 -DEKQLLLYAQIANLVNLILQWPEIS-INEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
            D ++   +  +++++   +    +   +E+   F K+  N  TI N +L  +G GLY  
Sbjct: 114 EDGEKSEFFVTLSHVLFEYMGADYLPPASELLTIFGKVMVNVFTISNDDLNTIGLGLYLG 173

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           +S+++HSC P+A ++F G  AV+R ++       + + I+Y +    T  R+  LK+Q+ 
Sbjct: 174 LSVLDHSCDPDAFVLFNGTKAVLRPLKQYITAYDSSLRIAYCDLLDLTSMRRNQLKQQFF 233

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSK 297
           FTC C  C+      D++        RC+     G+   D ++    C QCG    V   
Sbjct: 234 FTCECSACL------DLEREKTARSVRCR-HCVDGYCPLDVNENSLVCWQCGATSEVHVD 286

Query: 298 EEI---KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
           E +   +++  EV  L ++           + + T K +      +    ++ L Q  ++
Sbjct: 287 EAVHLMQQVEWEVKRLQER---------RNDSICTAKDLYNEASIILSSLNIPLCQLADR 337

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT-ENAIKS 413
           +    ++  ++  A  Y + TI  ++R YP  HP L LQY+  GKL      T E A + 
Sbjct: 338 IASWGIDNGNFSLAADYIEKTISCFKRFYPTAHPSLSLQYFKAGKLHSLNDLTLELAAER 397

Query: 414 MTEAVEILRITHGTN---SPFMKELILKLEEAQAE 445
           + +A+E L  +HG     S    EL+ K+++ + +
Sbjct: 398 LEQAIESLHASHGVQHSLSAEAHELLCKVQQQRRD 432


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DRKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
          Length = 420

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 178/388 (45%), Gaps = 42/388 (10%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y CV         C+ CFA    L KC  C+  +YC   CQK DW L
Sbjct: 27  FHV-GDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFYCDVECQKEDWPL 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +  L +    S T  +RL  ++  ++K+  +     T ++    V    +H+ 
Sbjct: 86  HKLECSSMVVLGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVREFESHLD 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKKDLIQSDIAALHQFYSKYLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y FTC C  C                              +D D      ++       E
Sbjct: 258 YFFTCECRECT----------------------------TKDKDKAKVEVRKLSSPPQAE 289

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            I+ +      + ++        +  E++   ++ ++    ++   +V ++    + + +
Sbjct: 290 AIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGV 349

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQF 386
            + ++DW+ AL Y Q  I  Y + YP +
Sbjct: 350 CLYMQDWEGALKYGQKIIKPYSKHYPVY 377


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 125/429 (29%), Positives = 208/429 (48%), Gaps = 60/429 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C+V  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLMRCSQCRVAKYCSAKCQKKAWQDHKQE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++ +  KL      PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVI--KLMEKT--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISIN-------EIAENFSKLACNAHTICNSELRPLGTGL 179
            +     Q+A      ++  EI          +I E F+K+ CN+ TICN+E++ +G GL
Sbjct: 139 DKKEGLRQLAMTFQHFMR-EEIQDASQLPPSFDIFEAFAKVICNSFTICNAEMQEVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           YP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY
Sbjct: 198 YPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQY 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F C C RC    Q  D +++ +L G                D++ +   Q       E 
Sbjct: 258 CFECDCFRC----QTQD-KDADMLTG----------------DEQVWKGVQ-------ES 289

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIK 357
           +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  + 
Sbjct: 290 LKKI-EELKTHWKWEQVLAMC---QSIISSNS--ERLP-------DINIYQLKVLDCAMD 336

Query: 358 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 417
             + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A
Sbjct: 337 ACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLA 396

Query: 418 VEILRITHG 426
            +I+R+THG
Sbjct: 397 FDIMRVTHG 405


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 28  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 87

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 88  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 139

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 140 DRKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 198 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 257

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 258 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 289

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 290 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 336

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 337 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 396

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 397 AFDIMRVTHG 406


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 114/426 (26%), Positives = 201/426 (47%), Gaps = 54/426 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C++  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++ +  KL +    PS +   YS  + L +++S + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVV--KLMDGK--PSESEKLYSFYD-LESNISKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA------ENFSKLACNAHTICNSELRPLGTGLY 180
            +     Q+A      ++      +++       E F+K+ CN+ TICN+E++ +G GLY
Sbjct: 139 DKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLY 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY 
Sbjct: 199 PSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYC 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C C RC         Q+                   +D+D          ++   E+I
Sbjct: 259 FECDCIRC-------PTQD-------------------KDAD----------MLTGDEQI 282

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            K   EV    KK   L +    ++V++  + I     +     ++  ++  +  +   +
Sbjct: 283 WK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSERLPDINIYQLKVLDCAMDACI 339

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +I
Sbjct: 340 NLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDI 399

Query: 421 LRITHG 426
           +++THG
Sbjct: 400 MKVTHG 405


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DRKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 197 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 257 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 288

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 289 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 335

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 336 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 395

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 396 AFDIMRVTHG 405


>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
          Length = 437

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 203/437 (46%), Gaps = 46/437 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V +       C+ CF     L KC  C+  +YC +NCQK +W +H+LE
Sbjct: 31  GELLFSCPAYSYVLSVGERGLICEQCFTRKKGLAKCGKCKKAFYCNANCQKKNWPMHKLE 90

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           CQ +    +  R S T  +RL+ ++  R K Q +     + ++   L+  + AH+ D+D 
Sbjct: 91  CQAMCAFGENWRPSET--VRLVARIIARLKAQKER----SPSEILLLLGEMEAHLEDMDN 144

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG  ++P 
Sbjct: 145 EKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLGLAIFPD 202

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L++ Y F+
Sbjct: 203 IALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFS 262

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR----SKE 298
           C C  C      D ++ S      R + D      ++D            +VR    S E
Sbjct: 263 CDCKECTT-KSMDVVKMSV-----RKRSDEIGEKEIKD------------MVRYARNSME 304

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
             ++   + NIL   +  L  C    + +ST          ++   +V ++    + + I
Sbjct: 305 NFRRAKQDKNILC-GSFILEMCELSIDKMST----------VFDDSNVYILHMMYQAMGI 353

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L      I +  +A+
Sbjct: 354 CLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDRKLAGIDAFQKAL 413

Query: 419 EILRITHGTNSPFMKEL 435
            I+ + HG +  ++ EL
Sbjct: 414 TIMEVVHGKDHTYVTEL 430


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/430 (28%), Positives = 201/430 (46%), Gaps = 62/430 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 29  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 88

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 89  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG--APSESEKLYSFYD-LESNINKLTE 140

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E++ +G G
Sbjct: 141 DKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAEMQEVGVG 198

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           LYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++Q
Sbjct: 199 LYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQ 258

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F C C RC                              +D D    T  +      +E
Sbjct: 259 YCFECDCFRC----------------------------QTQDKDADMLTGDEQVWKEVQE 290

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLI 356
            +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  +
Sbjct: 291 SLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDCAM 337

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  
Sbjct: 338 DACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRL 397

Query: 417 AVEILRITHG 426
           A +I+R+THG
Sbjct: 398 AFDIMRVTHG 407


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 202/427 (47%), Gaps = 56/427 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +  IPS +   Y+  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMEE--IPSESEKLYTFYD-LESNINKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISIN-------EIAENFSKLACNAHTICNSELRPLGTGL 179
            +     Q+A      ++  EI          +I E F+K+ CN+ TICN+E++ +G GL
Sbjct: 139 DKKEGLRQLAMTFQHFMR-EEIQDASQLPPSFDIFEAFAKVICNSFTICNAEMQEVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           YP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++QY
Sbjct: 198 YPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQY 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F C C RC       + Q+                   +D+D          ++   E+
Sbjct: 258 CFECDCFRC-------ETQD-------------------KDAD----------MLTGDEQ 281

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           I K   EV    KK   L +    ++V++  + I     +     ++  ++  +  +   
Sbjct: 282 IWK---EVQESLKKIEELKAHWKWEQVLALCQTIISSNSERLPDINIYQLKVLDCAMDAC 338

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++  A +
Sbjct: 339 INLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFD 398

Query: 420 ILRITHG 426
           I+R+THG
Sbjct: 399 IMRVTHG 405


>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 125/427 (29%), Positives = 199/427 (46%), Gaps = 68/427 (15%)

Query: 28  SRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSI 85
           S C  CFA+   L +C  C   +YC   CQ+ DW   HR EC+ +++L +   +S   S 
Sbjct: 52  SHCHKCFATGVRLSRCGRCHTAFYCSKACQQADWAPDHRRECKSMAQLAQIGLRSDQVSD 111

Query: 86  RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW 145
            L+L   LRRK   + + PS           LV +  D+D+++LLL A +A  +NL+ + 
Sbjct: 112 VLLLGRVLRRK-GGEGLQPSE----------LVWYEEDMDDQELLLLAALAQKINLVDE- 159

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
              S++E+A   S+   N  +IC+  L  LG G +P+ ++INHSC PN  + F  +   +
Sbjct: 160 -SFSMDEMARMLSRFRNNNFSICDELLLELGAGCFPLGAMINHSCDPNCAVTFVPKTLDM 218

Query: 206 --RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
             RA++ +  G E+  +Y++ A     RQ+ L+ +Y FTC CPRC +      +QE   L
Sbjct: 219 EFRAMKPIKSGEEITQTYVDIALPRRERQQRLQRKYHFTCGCPRCSQ-----PLQEPGSL 273

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 323
           + Y             D+D  G   ++    R +E                         
Sbjct: 274 DAY------------LDADIDGVPQERWTEERQQE------------------------- 296

Query: 324 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 383
            + +   + +   Q  + H  SV  +QT   +    ME    +EA+ Y +  +  Y+RVY
Sbjct: 297 -QCIDALQNLADRQSTILHCHSVARLQTLATIFSAEMERGSVEEAIGYGERMLEFYRRVY 355

Query: 384 PQFHPLLGLQYYTCGKL-----EWFLGDTENAIKS---MTEAVEILRITHGTNSPFMKEL 435
              HP+ GL  +T G L     +   G  ++  KS   + EA  IL+ITHG +  F+K L
Sbjct: 356 NPNHPMTGLHVFTLGDLYGQQAQAGTGAEDSKQKSAEYLAEAQRILQITHGKDHRFVKML 415

Query: 436 ILKLEEA 442
             +LE A
Sbjct: 416 GDRLETA 422


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 116/440 (26%), Positives = 200/440 (45%), Gaps = 53/440 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L++C  C+   YC   CQ+  W  H+ 
Sbjct: 24  AGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHYCDRTCQRAGWAEHKQ 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDI 124
           EC  +    K   K    +IRL+ ++  R  L  D    ST +D   + ++ L  H++D+
Sbjct: 84  ECSAI----KAYGKVPNENIRLVARILWR--LDKDG---STVSDMQLTTLDELEDHITDM 134

Query: 125 DEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTG 178
            E +L  L   I N ++    WP  S    +++I+  F  + CN  ++ +   L+ +G G
Sbjct: 135 QEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQAVGVG 191

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           L+P + ++NH C PN  ++       +R++  + +G E+ ++Y++    +  RQ+ LK Q
Sbjct: 192 LFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQ 251

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F CTC  C      + I++   L G             R+ D         G+  S+E
Sbjct: 252 YFFDCTCEHCK-----NKIKDDIKLGG-------------REVD---------GVKPSEE 284

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM-IEKLQKKLY--HPFSVNLMQTREKL 355
           ++K+       + +K       G++ EVV   +  IEK +  L   H + + +  T  ++
Sbjct: 285 QVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVLAETHIYLLRMWSTMSEV 344

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
              L  L D   A  Y +  +  Y ++YP  +  LG+     G + W  GD E A   + 
Sbjct: 345 QAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAGVIHWQAGDIEVAHGMVC 401

Query: 416 EAVEILRITHGTNSPFMKEL 435
           +A  IL +THG   P  K+L
Sbjct: 402 KAYAILMVTHGPTHPITKDL 421


>gi|413955566|gb|AFW88215.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 484

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 254
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 255 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 314
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 315 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 374
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 375 TIPVYQRVY 383
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|413955565|gb|AFW88214.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 529

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 254
           VL+F+GR A VRA+Q + K  EV ISYIETA  T  R   LK QY FTCTCPRC+K   F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267

Query: 255 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 314
           D   E A+LEG+RCK   C GFLL +SD                                
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292

Query: 315 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 374
                     E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+DW  AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344

Query: 375 TIPVYQRVY 383
           TIPVY++++
Sbjct: 345 TIPVYEKIF 353


>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
          Length = 421

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 193/427 (45%), Gaps = 68/427 (15%)

Query: 28  SRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSI 85
           S C  CFAS + L +C  C   +YC   CQ+ DWK  HR EC+VL++L +   ++   + 
Sbjct: 50  SHCHKCFASGTRLSRCGRCNTAFYCSKACQQADWKPDHRKECKVLAQLAQLGLRNDQTAD 109

Query: 86  RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW 145
            L+L   LRR+   + + P          + LV +  D+++++L+L A +A  + L+   
Sbjct: 110 VLLLGRVLRRE-DAEGLQP----------KELVWYEEDMEDQELMLLAALAQKLELVD-- 156

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
              S++E+    S+   N  +IC+  L   G G +P+ ++INHSC PN  + F  +   +
Sbjct: 157 GSYSMDEMLRMLSRFRNNNFSICDELLLEQGAGCFPLGAMINHSCDPNCAITFVPKTLEM 216

Query: 206 --RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
             RA++ +  G E+  +Y++ A     R + L+ +Y F C C RC        +QES  L
Sbjct: 217 EFRAMRPIKAGEEITQTYVDVALPRRERHERLQRKYHFNCACSRCSV-----PLQESGSL 271

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 323
           + +             D+D  G   +Q    R  E                         
Sbjct: 272 DAF------------LDADIDGVPKEQWSQERKDE------------------------- 294

Query: 324 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 383
            + +   + + + Q  + H  S+  +QT   +    ME    +EA+ Y +  +  YQRVY
Sbjct: 295 -QCIDALQKLAERQSNILHRDSIARLQTLATIFSAEMERGSVEEAVVYGEKMLEFYQRVY 353

Query: 384 PQFHPLLGLQYYTCGKLEWFL-----GDTENAIKS---MTEAVEILRITHGTNSPFMKEL 435
              HP+ GL  +T G L   L     G   +  KS   +TEA  IL+IT G    F+K L
Sbjct: 354 NANHPMTGLHLFTLGDLYAQLAQVGTGPENSKAKSSEYLTEARRILQITQGKEHRFVKML 413

Query: 436 ILKLEEA 442
             +L+ A
Sbjct: 414 ADRLQAA 420


>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
          Length = 452

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 126/443 (28%), Positives = 199/443 (44%), Gaps = 77/443 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G  I+   P+V    +S     CD C  A+SNL+KC  C VV YCG  CQ+  WK H+ E
Sbjct: 61  GTTILESVPFVYCLKSSFRRELCDFCLKANSNLRKCLGCMVVSYCGRVCQREGWKDHKGE 120

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL--VEALVAHMSDI 124
           C+   R+   K    T S+RL+ +L L+ ++ N  ++  +     SL      V +  +I
Sbjct: 121 CKNFVRV---KPNVPTDSVRLIARLILKLQVINGYILLLSNFQQKSLNNYREEVIYRKEI 177

Query: 125 --DEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSELRPL 175
             D K++  +  I  ++N  L       NEI  N       + ++  N+  I N E++ +
Sbjct: 178 KEDTKRMEYFMTICGVLNEYLS------NEILPNSVELLGIYGRMCINSFNILNGEMQAI 231

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQK 233
           GTG+Y   SI++HSC PNAV  F+G    ++  Q +PK     + ISYI+   S   R+K
Sbjct: 232 GTGIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLMNSKSHRKK 291

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            LK++Y F C CPRC    + DDI               C  +       K  TCQ    
Sbjct: 292 ELKDRYYFDCDCPRC----KNDDID--------------CYHYAA-----KCPTCQ---- 324

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTRE 353
                  K + ++V+        L  C          K+    Q+ L+H   +   +  +
Sbjct: 325 -------KPVIAKVD--------LDVC----------KLCIAKQENLFHDLCLVRSKIMD 359

Query: 354 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI-- 411
                 ++L+ W+ A+ Y    I  Y+  Y + HPL  +      K+   +  T +++  
Sbjct: 360 AAFDSAIQLQLWEAAIQYGIPLIQAYKLWYGEEHPLTAILLLKLFKITLLISTTNDSVAQ 419

Query: 412 KSMTEAVEILRITHGTNSPFMKE 434
           K   EA +IL ITHG +S F ++
Sbjct: 420 KYYEEAAKILEITHGRDSSFYRQ 442


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/442 (23%), Positives = 202/442 (45%), Gaps = 42/442 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ+ DW +H+LE
Sbjct: 30  GELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYYCNVDCQRGDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL+ ++ +++++  +     T ++   L++   +H+  +D 
Sbjct: 90  CVAMCTHGENWCPSET--VRLVSRIIMKQRVTTER----TPSERLLLLKEFESHLDKMDS 143

Query: 126 EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           EK  +  A IA L       I   P+    E+ E F+++ CN  TI + EL  LG+ ++P
Sbjct: 144 EKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQVNCNGFTIEDEELSHLGSAVFP 201

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            ++++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R++ L + Y F
Sbjct: 202 DVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNSYIDLLYPTEDRKERLLDSYFF 261

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
            C C  C                              +  D      ++       EEIK
Sbjct: 262 GCQCTECTT----------------------------KSKDKAKMEIRKLSSPPEPEEIK 293

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
            +      + ++           E++   ++  +    ++   +V ++    + + + + 
Sbjct: 294 SMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAIFADTNVYMLHMMYQAMGVCLY 353

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           ++DW  A++Y +  +  Y   YP +   +   Y   G+L   L      +K++ +A+ I+
Sbjct: 354 MQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRLYLGLEKKTQGVKALKKALAIM 413

Query: 422 RITHGTNSPFMKELILKLEEAQ 443
            I HG +  ++ E+  ++EE +
Sbjct: 414 EIAHGKDHHYVAEVKREIEEQK 435


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/435 (23%), Positives = 192/435 (44%), Gaps = 43/435 (9%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+V+ ++ P+  V  +S + S C  CF     L++C  C+   YC   CQ+  W+ H+L
Sbjct: 24  AGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQCRFAQYCDKTCQRAGWEEHKL 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   ++V  + R++ ++  +  + +DN + +        +E L  H+ DI 
Sbjct: 84  ECAAIKTYGKPPSENVRLAARILWRMDKQGSVVSDNQLTT--------LEDLEDHICDIS 135

Query: 126 EKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           E  L  +    ++ N +  WP  S    ++ ++     + CN   + +   L+ +G GL+
Sbjct: 136 EDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGVGLF 193

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++       +RA+  +  G EV ++Y++    +  RQ+ LK+QY 
Sbjct: 194 PNLCLVNHDCWPNCTVILNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYF 253

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C +  + D     A ++G +  +                           E++
Sbjct: 254 FDCTCKHCTEKIKDDLKMAGAEVDGVKVPE---------------------------EQV 286

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K++        +K        N+ EVV   +   + Q+ +     +  ++    L ++L 
Sbjct: 287 KEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADTHIYYLRVCCTLSEVLS 346

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G  E A   + +A  I
Sbjct: 347 YLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQAGFIEVAHGMICKAFAI 406

Query: 421 LRITHGTNSPFMKEL 435
           L ITHG   P  K+L
Sbjct: 407 LMITHGPTHPITKDL 421


>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
           griseus]
          Length = 428

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/432 (27%), Positives = 199/432 (46%), Gaps = 66/432 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C++  YC + CQK  W  H+ E
Sbjct: 27  GELLFRCDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHKQE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++ +  KL ++   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLARVIV--KLMDEK--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA------------ENFSKLACNAHTICNSELRP 174
            +         L  L L +   +  EI             E F+K+ CN+ TICN+E++ 
Sbjct: 139 DK------KEGLRQLALTFQHFTREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEMQE 192

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K 
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQ 252

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           L++QY F C C RC    Q  D                      +D+D          ++
Sbjct: 253 LRDQYCFECDCIRC----QTHD----------------------KDAD----------ML 276

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
              E+I K   EV    KK   L +    ++V++  + I           ++  ++  + 
Sbjct: 277 TGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKVLDC 333

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
            +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++
Sbjct: 334 AMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNL 393

Query: 415 TEAVEILRITHG 426
             A +I+++THG
Sbjct: 394 RLAFDIMKVTHG 405


>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/434 (24%), Positives = 208/434 (47%), Gaps = 51/434 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF----------ASSNLKKCSACQVVWYCGSNCQK 57
           G +I+  EP+  V  +  +   C  CF           ++ L +CS+C+   YC   CQK
Sbjct: 124 GNLILKAEPFAFVIFDHMAEHVCHHCFNMVVRDRQGQPTTQLLRCSSCKFARYCSRECQK 183

Query: 58  LDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL 117
             W +H+ EC  + R+     ++ +  +R++ ++  +   Q +       ++    VE L
Sbjct: 184 KAWSMHKKECMAIKRI---APRTASDEVRMVSQILWK---QAERGEKRAKSEELCRVEEL 237

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSE-LRP 174
             H++D+  + +    + +  +     +  +  ++  I   F  ++CN  +I +   L+ 
Sbjct: 238 CDHLNDMSFEDVNKLEEQSKEIGDYFGYENLPDSDEYIDHLFGIVSCNGMSITDMRGLQY 297

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           LG  ++P +++INH C PN V V  G    VRA++ + +G E+ ISYI+T+ ++ TR+  
Sbjct: 298 LGVAIHPTLNLINHDCNPNVVAVSCGPNIFVRAIKPIKEGDELFISYIDTSATSETRKNI 357

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           LK+QY F CTC  C + G+ D+++ + ++      +   S ++ +++D            
Sbjct: 358 LKDQYYFDCTCKMC-ESGEKDELKSAYVMPKEDVSEKR-STYIEKNTDIM---------- 405

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCG--NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
                +KKI S   + + + +A  + G    QE +     ++KL           ++QT 
Sbjct: 406 -----MKKIESSKKVQAWERVAAQAGGCLLQQENLFDDTHLKKLM----------ILQTC 450

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
            ++  IL   +D   A  Y +  +  Y+++YP++   +G+Q Y  G   W L   E+AIK
Sbjct: 451 SEVSAILNHYDD---AAQYAERVLAAYEQLYPEYSTQIGMQAYRLGVHYWHLQRVEDAIK 507

Query: 413 SMTEAVEILRITHG 426
            +  A++ L ITHG
Sbjct: 508 MLGRALKHLEITHG 521


>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
           niloticus]
          Length = 435

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 99/442 (22%), Positives = 202/442 (45%), Gaps = 42/442 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ+ DW +H+LE
Sbjct: 30  GELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYYCNVDCQRGDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL+ ++ ++++   +     T ++   L++   AH+  +D 
Sbjct: 90  CVAMCSYGENWCPSET--VRLVARIIMKQRATTER----TPSERLLLLKEFEAHLDKMDS 143

Query: 126 EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           EK+ +    IA L +     I   P+     + E F+++ CN  TI + EL  LG+ ++P
Sbjct: 144 EKEEMNQTDIAALHHFYSRHISNLPDEQA--LTELFAQVNCNGFTIEDEELSHLGSAVFP 201

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            ++++NHSC PN ++ ++G +A VRAV+ +  G E+  SYI+    T  R++ L + Y F
Sbjct: 202 DVALMNHSCSPNVIVTYKGTVAEVRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFF 261

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
           TC C  C                              R  D      ++       EEI+
Sbjct: 262 TCQCTECTS----------------------------RSKDKAKMEIRKLNPPPEPEEIR 293

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
            +      + ++           E++   ++  +    ++   +V ++    + + + + 
Sbjct: 294 SMVRYAKNVIEEFRRAKHYKTPSELLEMCELSLEKMGAIFADTNVYMLHMMYQAMGVCLY 353

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           ++DW  A++Y +  +  Y   YP +   +   Y   G+L   L      +K++ +A+ ++
Sbjct: 354 MQDWDGAMSYGEKIVQPYSVHYPAYSLNVASMYLKLGRLYLGLEKKTQGVKALKKALAVM 413

Query: 422 RITHGTNSPFMKELILKLEEAQ 443
            + HG +  ++ E+  ++EE +
Sbjct: 414 EVAHGKDHHYVAEVKREIEEQK 435


>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
           niloticus]
          Length = 440

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/433 (26%), Positives = 197/433 (45%), Gaps = 41/433 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   +  V +       C+ CF  + +L +C  C+  +YC   CQK DW +H+LE
Sbjct: 37  GELLFSCPAFSHVLSVKERGCYCEFCFTRTQHLARCGKCKKAFYCNVKCQKGDWAMHKLE 96

Query: 67  CQVLSRLDKEKRKSVTPS--IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           C  +    +    +  PS   RL+ ++  ++K+Q +  I     +   L+  + +H  D 
Sbjct: 97  CSAMVAFGE----NWCPSELSRLVARILAKKKMQKERCI----CEKMLLIGEMQSHTEDE 148

Query: 125 D-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           D EK+ +  A IA L     +  ++  + E+   FS++ACN  TI + EL  LGT +YP 
Sbjct: 149 DNEKREMTEADIAGLHRFYSKHLDVPDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPD 208

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +++INHSCLP+ ++ F+G  A VRAVQ +  G EVLISYI+    T  R   L+E Y F 
Sbjct: 209 MALINHSCLPSVIVTFKGTSAEVRAVQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFI 268

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C    Q  D  +  + +     +      ++R +            +R   E K 
Sbjct: 269 CDCQEC--KSQSKDKAKLKVRKQRDSIEPDVINNMVRYARKA---------IREFREFKH 317

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
             +   +L     +L   G                  ++   +V ++    + + I M +
Sbjct: 318 TKTPSELLEMCEQSLEEMG-----------------AVFDDSNVYMLHMMYQAMGICMYM 360

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +D   A+ Y +  +  Y  +YP +   +   Y   G+L   L        ++ +A+ I+ 
Sbjct: 361 QDLDGAIRYGEKLLKPYSHLYPPYSLNVSSVYLKLGRLYLGLERQSACTSALKKAMAIME 420

Query: 423 ITHGTNSPFMKEL 435
           + HG +  +++EL
Sbjct: 421 VAHGKDHFYIEEL 433


>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
          Length = 369

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 191/396 (48%), Gaps = 61/396 (15%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           +CS C+V  YC + CQK  W+ H+ EC+ L      K +    S+RL+ ++    KL  +
Sbjct: 2   RCSQCRVAKYCSAKCQKKAWQDHKRECKCLKSC---KPRYPPDSVRLLGRVIF--KLMEE 56

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINE 152
              PS +   YS  + L ++M+ + E      +QL+L  Q  +   +    Q P     +
Sbjct: 57  T--PSESEKLYSFYD-LESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQLP--PSFD 111

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 112 IFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIE 171

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 172 AGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 207

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 208 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN-- 257

Query: 333 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 390
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 258 AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPVR 310

Query: 391 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 311 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
          Length = 434

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/443 (25%), Positives = 200/443 (45%), Gaps = 61/443 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V +       C+ CF     L KC  C+  +YC +NCQK +W +H+LE
Sbjct: 31  GELLFSCPAYSYVLSVGERGLICEQCFTRKKGLAKCGKCKKAFYCNANCQKKNWPMHKLE 90

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           CQ +    +  R S T  +RL+ ++  R K Q +     + ++   L+  + AH+ D+D 
Sbjct: 91  CQAMCAFGENWRPSET--VRLVARIIARLKAQKER----SPSEILLLLGEMEAHLEDMDN 144

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG  ++P 
Sbjct: 145 EKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLGLAIFPD 202

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L++ Y F+
Sbjct: 203 IALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFS 262

Query: 243 CTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           C C  C       +K+    + D+I E  I +  R   +    F                
Sbjct: 263 CDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------------- 307

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
             R+K++              T  L  C    + +ST          ++   +V ++   
Sbjct: 308 --RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNVYILHMM 344

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
            + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L      I 
Sbjct: 345 YQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDRKLAGID 404

Query: 413 SMTEAVEILRITHGTNSPFMKEL 435
           +  +A+ I+ + HG +  ++ EL
Sbjct: 405 AFQKALTIMEVVHGKDHTYVTEL 427


>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Monodelphis domestica]
          Length = 441

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 115/448 (25%), Positives = 199/448 (44%), Gaps = 69/448 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQK-------LD 59
           GE++   EP        S    C+ C      L +CS C+V  YCGS CQ        +D
Sbjct: 27  GELLFRSEPLAYTVCKESLGVVCERCLCRKEKLLRCSQCKVARYCGSACQADSSEVSFVD 86

Query: 60  WKLHRLECQVLSRLDKEKRKSVTPS----IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
               +           E  K +  S    I L++ +Y+ RK    N +       + +  
Sbjct: 87  LFGRQSPTSTFPHFPSEPPKHLANSGSVCINLIVYVYIPRKY---NAMLPLMIGPFPIFS 143

Query: 116 ALVAHMSDIDEK---------QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHT 166
             + ++S+  +K         QL L  +I +   L   +      +I E+F+K+ CN   
Sbjct: 144 PDIKNLSEEKKKGLGHLAVTLQLYLKEEIQDASQLPPAF------DIFESFAKVICNGFA 197

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 226
           I N E++ +G GLYP +S++NHSC PN V+VFEG    +RAV+ + +G E+ I Y++   
Sbjct: 198 ISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAVRDIQQGEELTICYLDVLM 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  RQK LKEQY F C CP C                              R  DD   
Sbjct: 258 PSAERQKQLKEQYCFDCDCPGCET----------------------------RSKDDDML 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
           + ++      ++ +KK+  ++    K    L  C   Q +++ Y   ++L  +     ++
Sbjct: 290 SGEEQAWKEVQDSLKKV-EDLRAQEKWEQILAIC---QTLINNYG--DRLPDR-----NI 338

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
             ++  +  +   + L  W++AL Y   T+  Y+  Y  FHP+ G+Q    GKL+   G 
Sbjct: 339 YQLKMLDCAMDACINLCLWEDALLYGSRTLEPYRLYYSGFHPISGVQVMRVGKLQHHQGL 398

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKE 434
              A++++ +A E++++THG +   +++
Sbjct: 399 YPQALETLKQAFELMKVTHGRDHSLVED 426


>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
           Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
           Methionine
          Length = 429

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 121/431 (28%), Positives = 192/431 (44%), Gaps = 64/431 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 28  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLXRCSQCRVAKYCSAKCQKKAWPDHKRE 87

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL +    PS +   YS  + L ++++ + E
Sbjct: 88  CKCLKSC---KPRYPPDSVRLLGRVVF--KLXDG--APSESEKLYSFYD-LESNINKLTE 139

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIA---------ENFSKLACNAHTICNSELRPLGT 177
            +      +  LV     +    I + +         E F+K+ CN+ TICN+E + +G 
Sbjct: 140 DR---KEGLRQLVXTFQHFXREEIQDASQLPPAFDLFEAFAKVICNSFTICNAEXQEVGV 196

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L++
Sbjct: 197 GLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDXLXTSEERRKQLRD 256

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           QY F C C RC                              +D D    T  +      +
Sbjct: 257 QYCFECDCFRC----------------------------QTQDKDADXLTGDEQVWKEVQ 288

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKL 355
           E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  +  
Sbjct: 289 ESLKKI-EELKAHWKWEQVLAXC---QAIISSNS--ERLP-------DINIYQLKVLDCA 335

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
               + L   +EAL Y   T   Y+  +P  HP+ G+Q    GKL+   G    A K++ 
Sbjct: 336 XDACINLGLLEEALFYGTRTXEPYRIFFPGSHPVRGVQVXKVGKLQLHQGXFPQAXKNLR 395

Query: 416 EAVEILRITHG 426
            A +I R+THG
Sbjct: 396 LAFDIXRVTHG 406


>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
 gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
          Length = 430

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 113/416 (27%), Positives = 187/416 (44%), Gaps = 76/416 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G++I+ ++P+  + ++      C+ CF A+  L +C AC+V  YC  NC   D   H  E
Sbjct: 38  GKLILHEKPFCFIIDDRERTHCCNYCFKANIKLSRCKACKVSHYCSMNCYHSDH--HLKE 95

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYL-----RRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           C+ L+   K     VT SIRL+LK +L     ++     N+ PS+++D           +
Sbjct: 96  CKALATHPK-----VTNSIRLLLKCFLSCSDEKQFEMIQNLAPSSSSD---------EKV 141

Query: 122 SDIDEKQLLLYAQIANLVNL---ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            D   K  +L+A     V+L    L   +  ++ I     KL  N  +ICN E+  +G+G
Sbjct: 142 EDNLIKLAILFADYVKDVDLSSSFLDNRKEDLDFIYLLLLKLQRNTFSICNEEMNAIGSG 201

Query: 179 LYPVISIINHSCLPNAVLVFEGR----LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +Y   S+ NHSC+PN  ++F+      + ++     + +G  + I+Y++    T  RQK 
Sbjct: 202 IYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEGTPLTINYVDLMDLTANRQKK 261

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           LKEQY FTCTCPRC+   +                                         
Sbjct: 262 LKEQYHFTCTCPRCLNSNE----------------------------------------- 280

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
              E + K+ S  N   K      +    Q +VS  +    L  ++Y   ++N       
Sbjct: 281 EVNENVDKMISIANEHRKNQNLQEAVNYFQRIVSKKRSSPSLFHEIYMGIALN------S 334

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 410
           L  + ++L D+K+A  +   ++  +++ YP F+PLLGLQY TC KL  +L +   A
Sbjct: 335 LTHLNVDLSDFKKAYEFGIESLEYFEKYYPPFYPLLGLQYLTCAKLASYLEENGQA 390


>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 2-like [Xenopus (Silurana) tropicalis]
          Length = 430

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 112/445 (25%), Positives = 206/445 (46%), Gaps = 55/445 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V   +   + C+ CFA    L KC  C+  +YC  +CQK DW +H+LE
Sbjct: 28  GELLFSCPAYTYVLTVNERGNHCEFCFARKEGLSKCGKCKQAFYCNVDCQKGDWPMHKLE 87

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  +    +    S T  +RL  ++  ++K Q +     T ++ +  V+   +H+S +D 
Sbjct: 88  CSAMCTYGQNWCPSET--VRLTARILAKQKTQTER----TASERFLSVKDFESHLSKLDN 141

Query: 127 KQL-LLYAQIANLVNLILQWPEISINEIAEN-FSKLACNAHTICNSELRPLGTGLYPVIS 184
           ++L L+   IA L     +    S N      F+++ CN  TI + EL  LG+ ++P ++
Sbjct: 142 EKLELIQNDIAALHRFYSKNLHYSDNAAQVFLFAQVNCNGFTIEDEELSHLGSAIFPDVA 201

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ ++G +A VRAVQ +  G EV+ SYI+    T  R   L +     C 
Sbjct: 202 LMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDEVITSYIDLLYPTEDRNDRLIDSXFCNCD 261

Query: 245 CPRC-------IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           C  C       +K+ +        + + ++ K+  CS  LL                   
Sbjct: 262 CRECSTKQKGRLKIAEISLKIHPPLDKXWKNKERSCSVPLL------------------- 302

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 357
                       L   T+A  S     E++   ++  +    ++   +V ++    + + 
Sbjct: 303 -----------YLFPLTIAPPS-----ELLEICELSLEKMGSVFEDSNVYMLHMMYQAMG 346

Query: 358 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KSMT 415
           + + ++DW+ AL Y +  I  Y + YP +   +   +   G+L  ++G  + AI  K++ 
Sbjct: 347 VCLYMQDWEGALKYGEKIIKPYSKHYPPYSLNVASMWLKLGRL--YMGLEKKAIGTKALK 404

Query: 416 EAVEILRITHGTNSPFMKELILKLE 440
           +A+ I+ I HG +  ++ E+  +LE
Sbjct: 405 KAIAIMDIAHGPDHHYIAEIKKELE 429


>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
          Length = 349

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/250 (34%), Positives = 132/250 (52%), Gaps = 14/250 (5%)

Query: 4   FHVSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           FHV G+++ S   Y CV         C+ CFA    L KC  C+  +YC   CQK DW L
Sbjct: 27  FHV-GDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFYCDVECQKEDWPL 85

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V    +H+ 
Sbjct: 86  HKLECSSMVVFGENWNPSET--VRLTARILAKQKMHPER----TPSEKLLAVREFESHLD 139

Query: 123 DID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
            +D EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ 
Sbjct: 140 KLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSA 197

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ 
Sbjct: 198 IFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDS 257

Query: 239 YLFTCTCPRC 248
           Y FTC C  C
Sbjct: 258 YFFTCECREC 267


>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
           guttata]
          Length = 289

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 153/303 (50%), Gaps = 50/303 (16%)

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
           P I   +I   F+K+ CN  TI N E++ +G GLYP +S++NHSC PN V+VFEG   ++
Sbjct: 28  PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPNCVIVFEGYQLLL 84

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
            +V+ +  G E+ ISY+E+   T  RQK L  QY F C CP C    Q  D ++ A  E 
Sbjct: 85  HSVRDIQIGEELTISYVESLMPTRERQKQLMRQYCFECDCPLC--QNQEKDAEKLAG-EV 141

Query: 266 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 325
           +  K+       ++D+        + G  +SKEE K      ++L++    L S   H  
Sbjct: 142 HAWKE-------VKDA------VNEVGYPKSKEEWK------HVLARCQNLLRSNKGHLP 182

Query: 326 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LEDWKEALAYCQLTIPVYQR 381
             + Y++                     K++   M+    LE W+EAL Y   T+  Y+ 
Sbjct: 183 DTNIYQL---------------------KMLDCAMDACINLESWEEALYYGSRTLEPYRL 221

Query: 382 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
            YP FHPL  +Q    GKL++  G    A++++ +A  I+++THGT+   MK L+   EE
Sbjct: 222 YYPGFHPLRAVQLMRVGKLQYSQGMVPQALETLKQAYNIMKVTHGTDHSLMKVLMEMKEE 281

Query: 442 AQA 444
            +A
Sbjct: 282 CEA 284


>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
          Length = 373

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 191/398 (47%), Gaps = 61/398 (15%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           L +CS C+V  YC + CQK  W  H+ EC+ L      K +    S+RL+ ++    KL 
Sbjct: 4   LMRCSQCRVAKYCSAKCQKKAWPDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLM 58

Query: 99  NDNVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISI 150
           +    PS +   YS  + L ++++ + E      +QL++  Q  +   +    Q P    
Sbjct: 59  DGT--PSESEKLYSFYD-LESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLP--PA 113

Query: 151 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
            ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ 
Sbjct: 114 FDLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRD 173

Query: 211 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
           +  G E+ I Y++   ++  R+K L++QY F C C RC                      
Sbjct: 174 IEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC---------------------- 211

Query: 271 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 330
                   +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+ 
Sbjct: 212 ------QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSN 261

Query: 331 KMIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHP 388
              E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP
Sbjct: 262 S--ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHP 312

Query: 389 LLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 313 VRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 350


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/452 (23%), Positives = 200/452 (44%), Gaps = 63/452 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEV+ ++  +  V  +S S+  C  CF    N  +C+ C+   YC   CQ+  W  HR 
Sbjct: 29  AGEVVFAEASFAAVVLDSLSLQVCHSCFRRQVNPHRCAQCKFAHYCDRTCQRAAWDEHRK 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +  + K   ++V    R++ ++     L +D+ +  TT D   ++E  ++ M+  D
Sbjct: 89  ECSAIRNIGKAPNENVRLVARILWRIQKHTGLVSDSQL--TTLD---MLEDHLSRMTPED 143

Query: 126 EKQLLLYAQIANLVNLILQWPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
            K+L      A++      WP+    +  + ++  F  ++CN  T+ +   L+ +G GL+
Sbjct: 144 LKELK-----ADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLF 198

Query: 181 PVISIINHSCLPNAVLVFE-------------GRLAVVRAVQHVPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++                R   +RA++ +  G E+ +SY++    
Sbjct: 199 PNLCLVNHDCWPNCTVILNHGDQSALDASFHSSRRIELRALEPISAGQELTVSYVDFLSV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  RQ+ L++QY F C C  C+  G  D++     +   +  +DG               
Sbjct: 259 STDRQRLLQQQYYFDCKCEHCVN-GTKDEL-----MTAVKPTEDG--------------- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
                    K+    +  ++   S + L    A  + GN  EV+   +   + Q  ++  
Sbjct: 298 ---------KQPSADVVKQLTDFSLQALVKIEAARAQGNFHEVIRICRECLEKQDPVFAD 348

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            +V++++      ++L  L+ ++EA  Y Q  +  Y ++Y   +  LG+     G   W 
Sbjct: 349 TNVHVLRVLSTASEVLSFLQQFQEAAGYAQRMVDGYMKLYHPNNAQLGMAIMRAGVTHWH 408

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G  E A   +  A  IL ITHG + P  ++L
Sbjct: 409 AGLIEAAHGLICRAYGILMITHGPHHPISRDL 440


>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
 gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
 gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
          Length = 369

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 190/396 (47%), Gaps = 61/396 (15%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           +CS C+V  YC + CQK  W  H+ EC+ L      K +    S+RL+ ++    KL + 
Sbjct: 2   RCSQCRVAKYCSAKCQKKAWPDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG 56

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINE 152
              PS +   YS  + L ++++ + E      +QL++  Q  +   +    Q P     +
Sbjct: 57  --APSESEKLYSFYD-LESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLP--PAFD 111

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           + E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 112 LFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 171

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 172 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 207

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 208 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 258

Query: 333 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 390
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 259 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 310

Query: 391 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 311 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 119/453 (26%), Positives = 203/453 (44%), Gaps = 66/453 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L++C  C+   YC   CQ+  W  H+ 
Sbjct: 24  AGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHYCDRTCQRAGWAEHKQ 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDI 124
           EC  +    K   K    +IRL+ ++  R  L  D    ST +D   + ++ L  H++D+
Sbjct: 84  ECSAI----KAYGKVPNENIRLVARILWR--LDKDG---STVSDMQLTTLDELEDHITDM 134

Query: 125 DEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTG 178
            E +L  L   I N ++    WP  S    +++I+  F  + CN  ++ +   L+ +G G
Sbjct: 135 QEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFGVINCNGFSVSDQRGLQAVGVG 191

Query: 179 LYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETA 225
           L+P + ++NH C PN  V++  G  + V            R++  + +G E+ ++Y++  
Sbjct: 192 LFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAYVDYM 251

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  RQ+ LK QY F CTC  C      + I++   L G             R+ D   
Sbjct: 252 NLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLGG-------------REVD--- 290

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM-IEKLQKKLY--H 342
                 G+  S+E++K+       + +K       G++ EVV   +  IEK +  L   H
Sbjct: 291 ------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKTEPVLAETH 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
            + + +  T  ++   L  L D   A  Y +  +  Y ++YP  +  LG+     G + W
Sbjct: 345 IYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAAMRAGVIHW 401

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             GD E A   + +A  IL +THG   P  K+L
Sbjct: 402 QAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 434


>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
           leucogenys]
          Length = 369

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 115/396 (29%), Positives = 190/396 (47%), Gaps = 61/396 (15%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           +CS C+V  YC + CQK  W  H+ EC+ L      K +    S+RL+ ++    KL + 
Sbjct: 2   RCSQCRVAKYCSAKCQKKAWPDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMDG 56

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINE 152
              PS +   YS  + L ++++ + E      +QL++  Q  +   +    Q P     +
Sbjct: 57  --APSESEKLYSFYD-LESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLP--PAFD 111

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           + E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 112 LFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 171

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 172 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC------------------------ 207

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 208 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 258

Query: 333 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 390
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 259 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 310

Query: 391 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 311 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
           boliviensis boliviensis]
          Length = 369

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 191/394 (48%), Gaps = 57/394 (14%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           +CS C+V  YC S CQK  W  H+ EC+ L      K +    S+RL+ ++    KL   
Sbjct: 2   RCSQCRVAKYCSSKCQKKAWPDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMEG 56

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINE 152
           +  PS +   YS  + L ++++ + E      +QL++  Q  +   +    Q P     +
Sbjct: 57  S--PSESEKLYSFYD-LESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLP--PAFD 111

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 112 IFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDIE 171

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++T  ++  R+K L++QY F C C RC    Q  D                
Sbjct: 172 VGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 211

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D+D          ++   E++ K   EV    K    L +    ++V++  + 
Sbjct: 212 ------KDAD----------MLTGDEQVWK---EVQKSLKTIEELKAHWKWEQVLAMCQA 252

Query: 333 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 392
           I     +     ++  ++  +  +   + L   ++AL Y   T+  Y+  +P  HP+ G+
Sbjct: 253 IISSNSERLPDINIYQLKVLDCAMDACIHLGLLEKALFYGTRTMEPYRIFFPGSHPVRGV 312

Query: 393 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 313 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
           aries]
          Length = 369

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 116/396 (29%), Positives = 189/396 (47%), Gaps = 61/396 (15%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           +CS C++  YC + CQK  W+ H+ EC+ L      K +    S+RL+ ++    KL  +
Sbjct: 2   RCSQCRIAKYCSAKCQKKAWQDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMQE 56

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINE 152
              PS +   YS  + L ++++ + E      +QL L  Q  +   +    Q P     +
Sbjct: 57  T--PSESEKLYSFYD-LESNINKLTEDKKEGLRQLALTFQHFMREEIQDASQLP--PSFD 111

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++R V+ V 
Sbjct: 112 IFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDVE 171

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 172 AGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC------------------------ 207

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 208 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNA- 258

Query: 333 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 390
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+ 
Sbjct: 259 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPVR 310

Query: 391 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 311 GVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 136/246 (55%), Gaps = 8/246 (3%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G+ +IS +P+  V ++    + CD CF S  L KCS CQ V+YC  +CQK  W +H+ EC
Sbjct: 6   GDCLISSKPFAYVLSSKHKDNHCDYCFKSGKLLKCSGCQYVYYCDRSCQKESWSVHKSEC 65

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--D 125
             L R+     +++  + RLM ++ ++ +    +       + Y   + L++H +DI  D
Sbjct: 66  INLKRI---APRTIPDAARLMARIIVKLQKGGGDEKDYYAKNAYRKFKDLMSHYTDIKND 122

Query: 126 EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
            K++  +  +  ++   ++   + +  EI   + ++  N++ I + ++  +G G+Y   S
Sbjct: 123 PKRIEHFVSLCQVLEDFMEGTTLPNSAEILGLYGRICVNSYNILDPDMNSIGVGIYLGPS 182

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +I+HSC PNAV VFEG   ++RA++ +P+   +++ ISYI+   +T TR   L+  Y F 
Sbjct: 183 VIDHSCKPNAVAVFEGTTILIRALEDIPRLDWSQIHISYIDVLNTTSTRCTELQNTYYFL 242

Query: 243 CTCPRC 248
           C C RC
Sbjct: 243 CECERC 248



 Score = 46.2 bits (108), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query: 383 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 434
           Y + HPL G+ Y   GK++  L   ++A+  +T+A +ILRITHG      KE
Sbjct: 325 YGEIHPLTGILYLMLGKIQLHLDKPKSALDMLTKADKILRITHGEKHSLFKE 376


>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 471

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 126/470 (26%), Positives = 217/470 (46%), Gaps = 61/470 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEV+ ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQ   W+ H+ 
Sbjct: 29  TGEVVFAEPSYSAVVFDSFASQVCHSCFRHQAQLHRCGQCKFAHYCNRTCQTACWEEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           ECQ + +L       V  + R+M +++    + +D+ + S        VE L  H++D+ 
Sbjct: 89  ECQAIRKLGTVPGDKVRLAARVMWRIHKDTGVASDSQLLS--------VEELEDHVADLP 140

Query: 126 EKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           E  L    +I   V++ LQ+      + S++EIA  F  + CN  T+ +   L+ +G GL
Sbjct: 141 EDHL---KRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNGFTLSDQRGLKAVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEG-------------RLAVVRAVQHVPKGAEVLISYIETAG 226
           +P + ++NH C PN  +V                R   +RA++ + +G E+ +SY++   
Sbjct: 198 FPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIELRALRKICEGEELTVSYVDFLD 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
           ++  RQ+ LKE + F CTC  C +  + DD+  +A         DG  G   + S D+  
Sbjct: 258 TSAERQRKLKEHFYFECTCEHCRQHIK-DDLMTAAAA-------DGPGG---KPSADQ-- 304

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV-VSTYKMIEKLQKKLYHPFS 345
              Q     S+E ++KI  E +++ K         + QEV +   K +EK QK L     
Sbjct: 305 --VQEVTAFSQECLEKI--ERSLMDK---------DFQEVRMLCSKCLEK-QKNLLADTH 350

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           ++ ++     +++L  L  + EA AY +  +  Y ++Y      LG+     G      G
Sbjct: 351 LHHLRVLSAAVEVLSYLRCFSEAAAYARRMVQGYTKLYHPNSAQLGIAVMRAGVTHLQAG 410

Query: 406 DTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
             E A + + +A  IL +THG N    ++  L+    Q E   KL  +DE
Sbjct: 411 MIEMAHELICKAYRILLVTHGPNHSVTRD--LEAMRRQTEVELKLLKQDE 458


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 194/436 (44%), Gaps = 45/436 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L++C  C+   YC   CQ+  W  H+L
Sbjct: 24  AGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHYCDRTCQRAGWAEHKL 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   +++    R+M +L       +D  +  TT D    +E  +A M + D
Sbjct: 84  ECSAIKAYGKVPNENIRLVARIMWRLDKEGSTVSD--MQLTTLDE---LEDHIADMPEDD 138

Query: 126 EKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
            K+L +     ++ N +  WP  S    I++I+  F  + CN  ++ +   L+ +G GL+
Sbjct: 139 LKELKV-----DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLF 193

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++       +RA+  + +G E+ ++Y++    +  RQ+ LK QY 
Sbjct: 194 PNLCLVNHDCWPNCTVILNHGKIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYF 253

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C                  R KDD   G   R+ D         G+  S+E++
Sbjct: 254 FDCTCEHCKN----------------RIKDDIKIGG--REED---------GVKPSEEQV 286

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKM-IEKLQKKLYHPFSVNLMQTREKLIKIL 359
           K+       + +K       G++ EVV   +  IEK +  L     + L++    + ++ 
Sbjct: 287 KEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HIYLLRMWSTMSEVQ 345

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
             L+ + +A  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  
Sbjct: 346 AYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKAYA 405

Query: 420 ILRITHGTNSPFMKEL 435
           IL ITHG   P  K+L
Sbjct: 406 ILMITHGPIHPITKDL 421


>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
          Length = 370

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 84/275 (30%), Positives = 150/275 (54%), Gaps = 19/275 (6%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           +G++I+S+EP+  V ++    SRCD C     + KCS CQ V YC  +CQK  W+ H+ E
Sbjct: 12  TGDLILSEEPFAYVLSSKEKGSRCDFCLEKGKVLKCSGCQFVHYCNRSCQKDAWEDHKWE 71

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS-TTTDNYSLVEALVAHMSDI- 124
           C  L R+     K++  + RL+ ++  R +  N     +  T  ++ + + L++H SD+ 
Sbjct: 72  CANLKRI---APKTIPDAARLLARILNRLQRGNGGAYKAFYTPTSFRVWKDLMSHYSDLK 128

Query: 125 -DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAHTICNSELRPLGTGLY 180
            D+K++  ++ ++ ++   L+  +IS+   A+    + ++  N+ TI + E+  +GTG+Y
Sbjct: 129 SDKKRMDHFSTLSMVLFEYLK--DISLPNTADLMGLYGRMVINSFTILDIEMNSIGTGIY 186

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQ 238
              S+I+HSC PNAV VF+G+   +RA++ +      ++ ISYI+   +   RQ  L++ 
Sbjct: 187 LASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDWKKIRISYIDLMKTPYERQMELRQS 246

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           Y F C C RC+      D      +   +C  DGC
Sbjct: 247 YYFLCQCDRCL------DENRIKYVHAAKCLKDGC 275


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 120/432 (27%), Positives = 203/432 (46%), Gaps = 66/432 (15%)

Query: 8   GEVIISQEPY---VCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           GE++   +P    VC  +      RC        L  CS C+++  CG+ C+K  W+ H+
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCVRC--IIGRKYLLYCSRCRLLAQCGAKCRKKAWQDHK 84

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ +
Sbjct: 85  RECKCLKSC---KPRYPPDSVRLLGRVVF--KLMEET--PSESEKLYSFYD-LESNINKL 136

Query: 125 DE------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 176
            E      +QL++  Q  +   +    Q P     +I E F+K+ CN+ T+CN+E++ +G
Sbjct: 137 TEDKKEGLRQLVMTFQHFMREEIQDASQLP--PSFDIFEAFAKVICNSFTVCNAEMQEVG 194

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K L+
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLR 254

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 296
           +QY F C C RC                              +D D    T  +      
Sbjct: 255 DQYCFECDCIRC----------------------------QTQDKDADMLTGDEQVWKEV 286

Query: 297 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EK 354
           +E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q +  + 
Sbjct: 287 QESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQLKVLDC 333

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
            +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+K++
Sbjct: 334 AMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAMKNL 393

Query: 415 TEAVEILRITHG 426
             A +I+R+THG
Sbjct: 394 RLAFDIMRVTHG 405


>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 454

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/447 (25%), Positives = 197/447 (44%), Gaps = 70/447 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEVI S+  +  V  +S +   C  CF   +NL +C+ C+   YC   CQ   W  H+ 
Sbjct: 29  TGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHYCDRTCQTACWNEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +L     +++  + R++ +++    + +D+ + S        V+ L  H++D+ 
Sbjct: 89  ECGAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLIS--------VDQLQEHVADLA 140

Query: 125 --DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGT 177
             D KQL      A++   +  W     + S++ I+  F  + CN  T+ +   L+ +G 
Sbjct: 141 AEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFTLSDQRGLQAVGV 195

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           GL+P + ++NH C PN  ++       +RA+  +P+G E+ +SY++    +  RQK LKE
Sbjct: 196 GLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIPEGEELTVSYVDFLNLSADRQKKLKE 255

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           ++ F CTC  C K                  KDD     +   +D K           S 
Sbjct: 256 RFHFDCTCEHCSK----------------HIKDD----LMTAVADSKP----------SA 285

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK-----LQKKLYHPFSVNLMQTR 352
           +++K    EV   SK++L          V   Y  + K     LQK+       NL + R
Sbjct: 286 DKVK----EVTAFSKESLEKIE---KSRVERDYNEVLKLCYECLQKQENVLADTNLYKLR 338

Query: 353 EKLIKILMELEDWKEALA----YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 408
             ++ I  E+  ++ + A    Y    +  Y+++Y   +  LG+     G   W  G  E
Sbjct: 339 --VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLYHPNNAQLGMAIMRAGVTHWHAGQIE 396

Query: 409 NAIKSMTEAVEILRITHGTNSPFMKEL 435
                + +A  IL +THG N    K+L
Sbjct: 397 EGHSLICQAYRILMVTHGPNHAITKDL 423


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 104/448 (23%), Positives = 195/448 (43%), Gaps = 56/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+V+ ++ P+  V  +S + S C  CF     L++C  C+   YC   CQ+  W+ H+L
Sbjct: 24  AGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQCRFAQYCDKTCQRAGWEEHKL 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   ++V  + R++ ++  +  + +DN +        + +E L  H+ DI 
Sbjct: 84  ECAAIKTYGKPPSENVRLAARILWRMDKQGSVVSDNQL--------TTLEDLEDHICDIS 135

Query: 126 EKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           E  L  +    ++ N +  WP  S    ++ ++     + CN   + +   L+ +G GL+
Sbjct: 136 EDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGVGLF 193

Query: 181 PVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGS 227
           P + ++NH C PN  V++  G  + +            RA+  +  G EV ++Y++    
Sbjct: 194 PNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQKRIELRALGKISAGEEVTVAYVDYLNV 253

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  RQ+ LK+QY F CTC  C +  + D     A ++G +  +                 
Sbjct: 254 SADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE----------------- 296

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                     E++K++        +K        N+ EVV   +   + Q+ +     + 
Sbjct: 297 ----------EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQENVLADTHIY 346

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
            ++    L ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 347 YLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAGVTHWQAGFI 406

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E A   + +A  IL ITHG   P  K+L
Sbjct: 407 EVAHGMICKAFAILMITHGPTHPITKDL 434


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/452 (22%), Positives = 193/452 (42%), Gaps = 73/452 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ+ DW +H+LE
Sbjct: 30  GELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYYCNVDCQRGDWPMHKLE 89

Query: 67  CQVL-----------SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           C  +           +R+  E+    TPS RL+L       L  D +       N + + 
Sbjct: 90  CVAMCTHGENWCPSETRVTTER----TPSERLLLLKEFESHL--DKMDSEKDEMNQADIA 143

Query: 116 AL----VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
           AL      H+SD+ + Q                       E+ E F+++ CN  TI + E
Sbjct: 144 ALHYFYSKHISDLPDDQ-----------------------ELTELFAQVNCNGFTIEDEE 180

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           L  LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ +  G E+  SYI+    T  R
Sbjct: 181 LSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNSYIDLLYPTEDR 240

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           ++ L + Y F C C  C                              +  D      ++ 
Sbjct: 241 KERLLDSYFFGCQCTECT----------------------------TKSKDKAKMEIRKL 272

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                 EEIK +      + ++           E++   ++  +    ++   +V ++  
Sbjct: 273 SSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAIFADTNVYMLHM 332

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
             + + + + ++DW  A++Y +  +  Y   YP +   +   Y   G+L   L      +
Sbjct: 333 MYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRLYLGLEKKTQGV 392

Query: 412 KSMTEAVEILRITHGTNSPFMKELILKLEEAQ 443
           K++ +A+ I+ I HG +  ++ E+  ++EE +
Sbjct: 393 KALKKALAIMEIAHGKDHHYVAEVKREIEEQK 424


>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
 gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
          Length = 478

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/459 (24%), Positives = 203/459 (44%), Gaps = 54/459 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G++I ++  Y  V  ++ S S C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AGDIIFAEPAYSAVVFDNLSHSVCHSCFKRQEKLLRCGQCKFAHYCDRTCQKESWANHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K+  K+   +IRL  ++  R + +   +           ++ L  H+   D
Sbjct: 84  ECVAI----KKAGKAPNENIRLAARILWRIEREGSGLTEGCLVS----IDDLQNHIDKFD 135

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           E +  L  +  ++   +  WP    +  +  I+  FS ++CN  T+ +   L+ +G G++
Sbjct: 136 EAEKGLLME--DVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVGIF 193

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + + NH C PN  ++F      +RA+  + KG E+ +SY++    T  R+  LK+QY 
Sbjct: 194 PNLCLANHDCWPNCTVIFNNGKIELRALGKINKGEELTVSYVDFLNLTEDRKAQLKKQYY 253

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C K  +       A+L              ++D +DK             EE 
Sbjct: 254 FDCTCEHCTKKTK------DALL------------LAVKDGEDK------------PEE- 282

Query: 301 KKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
            ++  EV   SK T+       S G + +VV   +   K Q+ ++   ++ +++      
Sbjct: 283 -RVVKEVIQYSKDTMEKIEKARSEGLYNDVVKLCRDCLKRQEPIFADTNIYMLRILSIYS 341

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           ++L  L+ + +A    +  +  Y ++Y Q +  LG+     G   W  G  E     + +
Sbjct: 342 EVLSYLQMFDDAAENAKKMVDGYLKIYHQNNAQLGMAVMRAGVTHWHAGMIEVGHGMICK 401

Query: 417 AVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           A  IL ITHG   P  K+  L++  AQ E   ++  ++E
Sbjct: 402 AFAILLITHGPLHPITKD--LEVMRAQTEMELRMFKENE 438


>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ornithorhynchus anatinus]
          Length = 415

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 182/374 (48%), Gaps = 49/374 (13%)

Query: 57  KLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
           K  W+ H+ EC  L      K +    S+RL+ ++    KL  ++  PS    ++S +E+
Sbjct: 51  KQAWQDHKRECSCLL---SSKPRFPPDSVRLLGRVVF--KLLRESSCPSEKLYSFSELES 105

Query: 117 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI----NEIAENFSKLACNAHTICNSEL 172
            + ++S+ ++++ L Y  +   + L  +  + S       I E+F+K+ CN  TI N E+
Sbjct: 106 NIKNLSE-EKREGLGYLAVTLKLYLKEEIQDASQLPPGFNIFESFAKVICNGFTISNGEM 164

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           + +G GLYP +S++NHSC PN V+VFEG    +RAV+ + KG E+ I Y++    +  RQ
Sbjct: 165 QEVGVGLYPSMSLLNHSCDPNCVIVFEGTSLFLRAVREIQKGEELTICYLDVLLPSQERQ 224

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           K LKEQY F C C RC    Q  D+   A  E +                          
Sbjct: 225 KQLKEQYCFACDCIRCKT--QDKDVDMLAGEEPF-------------------------- 256

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
                   K++   V+ + +    L S    ++V++T + +    K      ++  ++  
Sbjct: 257 -------WKEVKDAVDTVEE----LQSQKKWEQVLATCQALINNHKDRIPDRNIYQLKML 305

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
           +  +   + L  W++AL Y + T+  Y+  YP FHP+  +Q    GKL+   G    A++
Sbjct: 306 DCAMDACINLSLWEDALLYGRRTLDPYRLYYPGFHPIKAIQIMKIGKLQQHQGMFREALE 365

Query: 413 SMTEAVEILRITHG 426
           ++ EA ++L++THG
Sbjct: 366 TLREAFDLLKVTHG 379


>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Meleagris gallopavo]
          Length = 478

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 116/461 (25%), Positives = 203/461 (44%), Gaps = 56/461 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + G+VI ++  Y  V  +S +   C  CF     L +C  C+  +YC   CQ+  W  H+
Sbjct: 28  LPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQCKFAYYCDRTCQRDAWLNHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  +    K+  K+ T +IRL  ++  R + +   +  +        ++ L  H+ + 
Sbjct: 88  NECSAI----KKHGKAPTENIRLAARILWRIEREGSGLSENCLVS----IDDLQNHVENF 139

Query: 125 DEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
           DE++   L   + + +     WP    +  +  I+  F  + CNA T+ +   L+ +G G
Sbjct: 140 DEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P +   NH C PN  ++F      +RA+  +  G E+ +SY++    +  RQK LK+Q
Sbjct: 197 IFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F CTC  C K                + KDD      +++ D K           S E
Sbjct: 257 YYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP----------SAE 288

Query: 299 EIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
            +K    EV   SK TL   +     G + EVV   +   K Q+ +    ++ L++    
Sbjct: 289 TVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSI 344

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
             ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     +
Sbjct: 345 ASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMI 404

Query: 415 TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
            +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 405 CKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 131/246 (53%), Gaps = 13/246 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L++P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPDS--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRC 248
           C C  C
Sbjct: 262 CECQEC 267


>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
          Length = 473

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/461 (24%), Positives = 202/461 (43%), Gaps = 56/461 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + G+VI ++  Y  V  +S +   C  CF     L +C  C+  +YC   CQ+  W  H+
Sbjct: 23  LPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAYYCDRTCQRDAWLNHK 82

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  +    K+  K+ T +IRL  ++  R + +   +  +        ++ L  H+   
Sbjct: 83  NECSAI----KKHGKAPTENIRLAARILWRIEREGGGLSENCLVS----IDDLQNHVESF 134

Query: 125 DEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
           DE++   L   + + +     WP    +  +  I+  F  + CNA T+ +   L+ +G G
Sbjct: 135 DEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVG 191

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P +   NH C PN  ++F      +RA+  +  G E+ +SY++    +  R+K LK+Q
Sbjct: 192 IFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQ 251

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F CTC  C K                + KDD      +++ D K           S E
Sbjct: 252 YYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP----------SAE 283

Query: 299 EIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
            +K    EV   SK TL   +     G + EVV   +   K Q+ +    ++ L++    
Sbjct: 284 TVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGDTNIYLLRILSI 339

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
             ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     +
Sbjct: 340 ASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTHWHAGLIEAGHSMI 399

Query: 415 TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
            +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 400 CKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 438


>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Cavia porcellus]
          Length = 477

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/439 (23%), Positives = 191/439 (43%), Gaps = 53/439 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKH 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   ++V  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENVRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  +  G E+ +SYI+    +  R++ LK+QY 
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGKIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYY 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C K G  DD+                  FL              G+    +  
Sbjct: 259 FDCTCEHCQK-GLKDDL------------------FL--------------GVKEDPKPS 285

Query: 301 KKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
           +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    + 
Sbjct: 286 QEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYTLRMLSVVS 345

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +
Sbjct: 346 EVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICK 405

Query: 417 AVEILRITHGTNSPFMKEL 435
           A  IL +THG + P  K+L
Sbjct: 406 AYAILLVTHGPSHPITKDL 424


>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
          Length = 464

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/447 (25%), Positives = 195/447 (43%), Gaps = 50/447 (11%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +VI  + P+  V   +   + C  CF         L++C+ C+ + YC + CQK DWK H
Sbjct: 46  DVIKGKRPFAHVVEKAKMETVCQTCFVQCEQLRRPLQRCAGCKALRYCSAACQKADWKDH 105

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM-- 121
           + EC  L R+       V P+  +M    + RK++           N   ++ L  HM  
Sbjct: 106 KPECAALKRI-----SPVVPATFVMFLARILRKMER----------NTGEMDVLQLHMPG 150

Query: 122 --SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL---ACNAHTICNSELRPLG 176
             SD  ++Q  L+A + +L + +   P+   + +   +  L   + N+  I   E   LG
Sbjct: 151 EPSD-PQQQRGLFAILEHLRHFL---PDAEKHLLKSAYPVLRITSANSFGISGVEGNNLG 206

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            GLY  +S INHSC PN  + F G  A VR+V  +P   E+ I+YI+       R+  LK
Sbjct: 207 VGLYDTVSYINHSCAPNCSITFSGVYARVRSVHDLPPNQELTIAYIDPCDPRAKRRAHLK 266

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF----LLRDSDDKGF-TCQQC 291
            Q++F C C RC +  + DD     +     C   GC       L    DD  F   ++C
Sbjct: 267 SQFMFDCECSRCER--ERDDDPLLTLC----CPGSGCDQIIKYQLTAPPDDTAFEVTKKC 320

Query: 292 --GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 349
             G +         + +V    K+ L +    N         M     ++L  P +  + 
Sbjct: 321 ERGHLDDPATFDSWSRKV----KRVLGIIGEANKARRPVQGDMASF--ERLLPPTNYLMH 374

Query: 350 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
                ++   +    ++EAL + +  I VY  +     PL G++     KL+    D   
Sbjct: 375 IAYTGVLDQAIGTGQFQEALQWARKVILVYDALNSPMDPLRGIELLRVAKLQALARDGRG 434

Query: 410 AIKSMTEAVEILRITHGTNSPFMKELI 436
            ++++ +AV+++R+THG +S   ++++
Sbjct: 435 FLETIQKAVDVIRLTHGEDSVVYQQVL 461


>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
           cuniculus]
          Length = 477

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/435 (23%), Positives = 191/435 (43%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVDHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L      A++   +  WP      S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P ++++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY 
Sbjct: 199 PNLALVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYY 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C K G  DD+                  FL    + K           S+E +
Sbjct: 259 FDCTCEHCQK-GLKDDL------------------FLGVKDNPKP----------SQEVV 289

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K++        +K     S G + EVV   +   + Q+ ++   ++  ++    + ++L 
Sbjct: 290 KEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLS 349

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  I
Sbjct: 350 YLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAI 409

Query: 421 LRITHGTNSPFMKEL 435
           L +THG + P  K+L
Sbjct: 410 LLVTHGPSHPITKDL 424


>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
          Length = 497

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/439 (23%), Positives = 194/439 (44%), Gaps = 53/439 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY 
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYY 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C K                + KDD    FL              G+  + +  
Sbjct: 259 FDCTCEHCQK----------------KLKDD---LFL--------------GVKDNPKPS 285

Query: 301 KKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
           +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ +++    + 
Sbjct: 286 QEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVS 345

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +
Sbjct: 346 EVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICK 405

Query: 417 AVEILRITHGTNSPFMKEL 435
           A  IL +THG + P  K+L
Sbjct: 406 AYAILLVTHGPSHPITKDL 424


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 130/246 (52%), Gaps = 13/246 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y  V   +   + C+ CF     L KC  C+  +YC   CQK DW +H+LE
Sbjct: 30  GDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + EL  LG+ ++P 
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 202 VALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 261

Query: 243 CTCPRC 248
           C C  C
Sbjct: 262 CECQEC 267


>gi|21430850|gb|AAM51103.1| SD20045p [Drosophila melanogaster]
          Length = 382

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 191/397 (48%), Gaps = 29/397 (7%)

Query: 60  WKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           W  H+ EC  L ++     + V  + R++ +L LR +   D +    T         L++
Sbjct: 4   WGQHKHECPFLKKVHP---RVVPDAARMLCRLILRLEHGGDLIRGYYTEHGSRKFRDLMS 60

Query: 120 HMSDIDEKQLLLY---AQIANLVNLILQWPEISIN--EIAENFSKLACNAHTICNSELRP 174
           H ++I    + L    +  A L +++ + P    N  E+   + +L  N   I ++E+  
Sbjct: 61  HYAEIKNDPMRLEHLDSLHAVLTDMMAESPSTVPNKTELMSIYGRLITNGFNILDAEMNS 120

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIETAGSTMTRQ 232
           + T +Y  +SI +HSC PNAV  FEG    V A++ +     +++ ISYI+   +   R+
Sbjct: 121 IATAIYLGVSITDHSCQPNAVATFEGNELHVHAIEDMECLDWSKIFISYIDLLNTPEQRR 180

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
             LKE Y F C C +C       D +ES  +    C +  C   +  D ++    C +C 
Sbjct: 181 LDLKEHYYFLCVCSKCT------DAKESKEMLAALCPNRNCGAGISVDRNN----CPRCD 230

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQ 350
              S + ++   +E   L++  L      N ++V  +   K+    Q  ++HP +V  ++
Sbjct: 231 AGISPK-LRNAFNEAMTLTRHNLE-----NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVK 284

Query: 351 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 410
           T +   +  +E+  W +AL Y Q  +P +++ +  ++PLLGL +   GK++ + G ++ A
Sbjct: 285 TLDAAFEAAIEVGKWSDALDYGQRLLPGFRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEA 344

Query: 411 IKSMTEAVEILRITHGTNSPFMKE-LILKLEEAQAEA 446
           +  + EA  IL +THG +   + E L + + +A+ EA
Sbjct: 345 LHHLEEAQRILTVTHGRDHRLLTEQLYVLVLQARQEA 381


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 115/449 (25%), Positives = 197/449 (43%), Gaps = 58/449 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L++C  C+   YC   CQ+  W  H+L
Sbjct: 24  AGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHYCDRTCQRAGWAEHKL 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   +++    R+M +L       +D  +  TT D    +E  +A M + D
Sbjct: 84  ECSAIKAYGKVPNENIRLVARIMWRLDKEGSTVSD--MQLTTLDE---LEDHIADMPEDD 138

Query: 126 EKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
            K+L +     ++ N +  WP  S    I++I+  F  + CN  ++ +   L+ +G GL+
Sbjct: 139 LKELKV-----DIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFSVSDQRGLQAVGVGLF 193

Query: 181 PVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAGS 227
           P + ++NH C PN  V++  G  + V            RA+  + +G E+ ++Y++    
Sbjct: 194 PNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRALGKIAEGEELTVAYVDFLNL 253

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  RQ+ LK QY F CTC  C                  R KDD   G   R+ D     
Sbjct: 254 SEERQRLLKTQYFFDCTCEHCKN----------------RIKDDIKIGG--REED----- 290

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM-IEKLQKKLYHPFSV 346
               G+  S+E++K+       + +K       G++ EVV   +  IEK +  L     +
Sbjct: 291 ----GVKPSEEQVKEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKTEPVLADT-HI 345

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            L++    + ++   L+ + +A  Y +  +  Y ++Y   +  LG+     G   W  G 
Sbjct: 346 YLLRMWSTMSEVQAYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMRAGVTHWQAGQ 405

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL ITHG   P  K+L
Sbjct: 406 IEVGHGMICKAYAILMITHGPIHPITKDL 434


>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
 gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
          Length = 423

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 111/443 (25%), Positives = 193/443 (43%), Gaps = 72/443 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ S   Y  V +       C+ CF     L KC  C+  +YC +NCQK +W +H+LE
Sbjct: 31  GELLFSCPAYSYVLSVGERGLICEQCFTRKKGLAKCGKCKKAFYCNANCQKKNWPMHKLE 90

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           CQ +    +  R S T             K Q +     + ++   L+  + AH+ D+D 
Sbjct: 91  CQAMCAFGENWRPSET-------------KAQKER----SPSEILLLLGEMEAHLEDMDN 133

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ +  A IA L       L +P+     +   FS++ CN  T+ + EL  LG  ++P 
Sbjct: 134 EKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVEDEELSNLGLAIFPD 191

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           I+++NHSC PN ++ + G  A VRAV+ +  G E+  SYI+    T  R + L++ Y F+
Sbjct: 192 IALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFS 251

Query: 243 CTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           C C  C       +K+    + D+I E  I +  R   +    F                
Sbjct: 252 CDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR--------------- 296

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
             R+K++              T  L  C    + +ST          ++   +V ++   
Sbjct: 297 --RAKQD-----------KSPTELLEMCELSIDKMST----------VFDDSNVYILHMM 333

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
            + + I +  ED++ A+ Y +  I  +  +YP +   +   +   G+L   L      I 
Sbjct: 334 YQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIALDRKLAGID 393

Query: 413 SMTEAVEILRITHGTNSPFMKEL 435
           +  +A+ I+ + HG +  ++ EL
Sbjct: 394 AFQKALTIMEVVHGKDHTYVTEL 416


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 103/435 (23%), Positives = 192/435 (44%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY 
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYY 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C+C  C K G  DD+                  FL    D K           S+E +
Sbjct: 259 FDCSCEHCQK-GLKDDL------------------FLAVKEDPKP----------SQEVV 289

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K++        +K     S G + EVV   +   + Q+ ++   ++ +++    + ++L 
Sbjct: 290 KEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIVSEVLS 349

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  I
Sbjct: 350 YLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAI 409

Query: 421 LRITHGTNSPFMKEL 435
           L +THG + P  K+L
Sbjct: 410 LLVTHGPSHPITKDL 424


>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 116/460 (25%), Positives = 200/460 (43%), Gaps = 83/460 (18%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEVI S+  +  V  +S +   C  CF   +NL +C+ C+   YC   CQ   W  H+ 
Sbjct: 29  TGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHYCDRTCQTACWNEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +L     +++  + R++ +++    + +D+ + S        V+ L  H++D+ 
Sbjct: 89  ECGAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLIS--------VDQLQEHVADLA 140

Query: 125 --DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGT 177
             D KQL      A++   +  W     + S++ I+  F  + CN  T+ +   L+ +G 
Sbjct: 141 AEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFTLSDQRGLQAVGV 195

Query: 178 GLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIET 224
           GL+P + ++NH C PN  V++  G  + V            RA+  +P+G E+ +SY++ 
Sbjct: 196 GLFPNLCLVNHDCWPNCTVILNHGNQSAVSSALHSQRRIELRALGKIPEGEELTVSYVDF 255

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
              +  RQK LKE++ F CTC  C K                  KDD     +   +D K
Sbjct: 256 LNLSADRQKKLKERFHFDCTCEHCSK----------------HIKDD----LMTAVADSK 295

Query: 285 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK-----LQKK 339
                      S +++K    EV   SK++L          V   Y  + K     LQK+
Sbjct: 296 P----------SADKVK----EVTAFSKESLEKIE---KSRVERDYNEVLKLCYECLQKQ 338

Query: 340 LYHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPVYQRVYPQFHPLLGLQYY 395
                  NL + R  ++ I  E+  ++ + A    Y    +  Y+++Y   +  LG+   
Sbjct: 339 ENVLADTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLYHPNNAQLGMAIM 396

Query: 396 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G   W  G  E     + +A  IL +THG N    K+L
Sbjct: 397 RAGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 436


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 112/467 (23%), Positives = 207/467 (44%), Gaps = 50/467 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVWYCGSNCQKL 58
           +G+ ++ QEPY     +S     C  C  +         +L +C+ C+   YC   CQK 
Sbjct: 26  TGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGAPVEDLHRCTGCKFAQYCTKECQKK 85

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ---NDNVIPSTTTDNYSL-- 113
            W  H+ EC  + R+   K    T   RL+ ++  RRK +   N         D   +  
Sbjct: 86  AWPEHKQECAAIKRITPGKPVDQT---RLVGRILWRRKREENLNGEKKDGKENDEKKVEL 142

Query: 114 --VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI--SINEIAENFSKLACNAHTICN 169
             +E L  H+S  + ++     +          + E+  S  E+A  F+ + CNA  + +
Sbjct: 143 VKIEELEDHLSKRNAEEKEAIDEKVYSFGDYFTYDEMPDSDEEMAHLFAIIDCNAIGLND 202

Query: 170 SE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
              ++ +G G+YP IS++NH C PN V +  G    VRA++ +  G E+ ISYI++  +T
Sbjct: 203 HRGVQTIGVGIYPGISMLNHDCSPNCVAMNNGPRLEVRALRVIQPGEELCISYIDSLETT 262

Query: 229 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 288
             R++ LK QY F C C  C K  + ++++ + + E  +   +    ++ + S D     
Sbjct: 263 EKRREKLKLQYYFDCECDTCTKGEELENLKHALVSEDIK---EESVKYINQFSKD---ML 316

Query: 289 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 348
           ++    +  +  +++ ++    +  TLA   C     V++   +++     L H   V  
Sbjct: 317 KRIHKTKQNQNWERMCNQ----TLGTLAQQDC-----VIADTNVLKIAM--LNHAVEVQS 365

Query: 349 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 408
              R+            + AL Y +     Y+++ P  HP LG+     G ++W +  TE
Sbjct: 366 FLRRQ------------EPALEYAERVAAAYEKLLPPVHPTLGMYLMRLGVIQWQIQKTE 413

Query: 409 NAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
            A+ ++  A  I+  THG +    KEL+  + + + EA     ++ E
Sbjct: 414 AAVATLGRAASIISKTHGDDHGMFKELLGLIHQCKMEAHMSKGAQRE 460


>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
          Length = 417

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 97/389 (24%), Positives = 178/389 (45%), Gaps = 40/389 (10%)

Query: 56  QKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           QK DW +H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+
Sbjct: 63  QKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVK 116

Query: 116 ALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSE 171
              +H+  +D EK+ L+ + IA L +     L +P+   + +   F+++ CN  TI + E
Sbjct: 117 EFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN--DSLVVLFAQVNCNGFTIEDEE 174

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           L  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R
Sbjct: 175 LSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDR 234

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
              L++ Y FTC C  C                    KD   +   +R   D        
Sbjct: 235 NDRLRDSYFFTCECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIR 276

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
            +VR    + +           +  L  C   QE +S+          ++   +V ++  
Sbjct: 277 DMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHM 326

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
             + + + + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       
Sbjct: 327 MYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGE 386

Query: 412 KSMTEAVEILRITHGTNSPFMKELILKLE 440
           K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 387 KALKKAIAIMEVAHGKDHPYISEIKQEME 415


>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
           harrisii]
          Length = 490

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 111/453 (24%), Positives = 196/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +V+ ++  Y  V  +S +   C  CF     L++C  C+   YCG  CQK  W  H+ 
Sbjct: 29  AADVVFAEPAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQCKFARYCGRPCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +  K   +++  + R+M ++        +  + S        V+ L  H+ +  
Sbjct: 89  ECAAIKKHGKPPSENIRLAARIMWRIEREGTGLTEGCLVS--------VDDLQNHVDNFG 140

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ +   L+ +G G+
Sbjct: 141 EEEQKDLRMDVDSFLNF---WPPQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K G  DD+                  FL    D K  
Sbjct: 258 ISQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKEDPK-- 296

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                         + +  EV   SK TL       S G + EVV   +   + Q+ +  
Sbjct: 297 ------------PSQDVVKEVTQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++++++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFQEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G  E     + +A  IL +THG   P  K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
           domestica]
          Length = 490

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 110/453 (24%), Positives = 196/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S +   C  CF     L++C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQCKFAHYCDRTCQKDAWLNHKH 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +  K   +++  + R+M ++        +  + S        V+ L  H+ +  
Sbjct: 89  ECAAIKKHGKPPSENIRLAARIMWRIEREGTGLTEGCLVS--------VDDLQNHVDNFG 140

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +E+Q  L   + + +N    WP    + S+  I+  F  + CNA T+ +   L+ +G G+
Sbjct: 141 EEEQKDLRVDVDSFLNF---WPAQSQQFSMQYISHIFGVINCNAFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SY++   
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYVDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K G  DD+                  FL   +D K  
Sbjct: 258 VSQERKKQLKKQYYFDCTCEHCEK-GIKDDL------------------FLAVKADPK-- 296

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                         + +  EV   SK+TL       S G + EVV   +   + Q+ +  
Sbjct: 297 ------------PSQDVVKEVTQFSKETLEKIDKARSEGLYHEVVKLCRECLQKQEPVLA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++++++      ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIHVLRILSIASEVLSYLQSFHEASDYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTHW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G  E     + +A  IL +THG   P  K+L
Sbjct: 405 HAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/435 (23%), Positives = 191/435 (43%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 84  ECAAIKKYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 138

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 139 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 193

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY 
Sbjct: 194 PNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYY 253

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C+C  C K G  DD+                  FL    D K           S+E +
Sbjct: 254 FDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEVV 284

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K++        +K     S G + EVV   +   + Q+ ++   ++ +++      ++L 
Sbjct: 285 KEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLS 344

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  I
Sbjct: 345 YLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAI 404

Query: 421 LRITHGTNSPFMKEL 435
           L +THG + P  K+L
Sbjct: 405 LLVTHGPSHPITKDL 419


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/435 (23%), Positives = 191/435 (43%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECAAIKKYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY 
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYY 258

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F C+C  C K G  DD+                  FL    D K           S+E +
Sbjct: 259 FDCSCEHCQK-GLKDDL------------------FLAAKEDPKP----------SQEVV 289

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           K++        +K     S G + EVV   +   + Q+ ++   ++ +++      ++L 
Sbjct: 290 KEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRLLSIASEVLS 349

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  I
Sbjct: 350 YLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAI 409

Query: 421 LRITHGTNSPFMKEL 435
           L +THG + P  K+L
Sbjct: 410 LLVTHGPSHPITKDL 424


>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
 gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
          Length = 265

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 123/237 (51%), Gaps = 16/237 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDW-KLHRL 65
           G V++ Q+PYV V ++  +   CD CF       +C+  ++  YC    Q+L W   +++
Sbjct: 42  GAVVLQQDPYVSVLSDERTPGFCDFCFRPCERPLRCTRSKLARYCCKEHQRLAWVAGYKM 101

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC+ L R      +   P++RL  +L  RR                + +  L  H  ++D
Sbjct: 102 ECEALVRCAP---RVPPPTVRLAARLLWRR----------ARCGGINGLWRLEHHWDELD 148

Query: 126 EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           +++  LYAQ+A +   + +W        +A+  S L+CN HT+C+ ELRPLG  LYP  +
Sbjct: 149 DRRKQLYAQMAVVTWWVARWGTWPGFRTVAQLLSLLSCNCHTVCDEELRPLGVALYPTGA 208

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           ++NHSC P+ V  F G    +RA++ +  G E+ I+YIE A +   R++ L + Y F
Sbjct: 209 LVNHSCSPSTVQTFHGSTLELRALRQLAPGDEITIAYIELAATRQERRETLADSYFF 265


>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
          Length = 490

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 109/453 (24%), Positives = 198/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGSGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L L       V++ LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRLD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P ++++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK QY F CTC  C K G  DD+                  FL         
Sbjct: 258 VSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
           rubripes]
          Length = 434

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/250 (30%), Positives = 136/250 (54%), Gaps = 18/250 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ+ DW +H++E
Sbjct: 30  GELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAYYCNVDCQRGDWSMHKME 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  +    +    S T  +RL+ ++ L++K+  +     T ++   L++   +H+  +D 
Sbjct: 90  CLAMCAYGENWCPSET--VRLVARIILKQKVTTEQ----TPSERLLLLKDFESHLDKMDS 143

Query: 127 -KQLLLYAQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGL 179
            K  +  A IA L +   ++    IN+I ++      F+++ CN  TI + EL  LG+ +
Sbjct: 144 IKDEMNQADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAI 199

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NHSC PN ++ ++G +A VRAVQ +  G E+L SYI+    T  R++ L + Y
Sbjct: 200 FPDVALMNHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSY 259

Query: 240 LFTCTCPRCI 249
            FTC C  C 
Sbjct: 260 FFTCLCSECF 269



 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 21/99 (21%), Positives = 49/99 (49%)

Query: 345 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 404
           +V ++    + + + + ++DW+ A+ Y +  I  Y   YP +   +   Y   G L   L
Sbjct: 336 NVYMLHMMYQAMGVCLYMQDWEGAMRYGEKIIHPYSFHYPAYSLNVASMYLKLGHLYLGL 395

Query: 405 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 443
                 +K++ +A+ I+ + HG +  ++ E+  ++EE +
Sbjct: 396 EKRTQGVKALKKALSIMEVAHGKDHYYVAEVKREIEEQK 434


>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
          Length = 433

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/250 (30%), Positives = 136/250 (54%), Gaps = 18/250 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ+ DW +H++E
Sbjct: 30  GELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAYYCNVDCQRGDWSMHKME 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C  +    +    S T  +RL+ ++ L++K+  +     T ++   L++   +H+  +D 
Sbjct: 90  CLAMCAYGENWCPSET--VRLVARIILKQKVTTEQ----TPSERLLLLKDFESHLDKMDS 143

Query: 127 -KQLLLYAQIANLVNLILQWPEISINEIAEN------FSKLACNAHTICNSELRPLGTGL 179
            K  +  A IA L +   ++    IN+I ++      F+++ CN  TI + EL  LG+ +
Sbjct: 144 IKDEMNQADIAALHHFYSRY----INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAI 199

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NHSC PN ++ ++G +A VRAVQ +  G E+L SYI+    T  R++ L + Y
Sbjct: 200 FPDVALMNHSCNPNVIVTYKGTVAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSY 259

Query: 240 LFTCTCPRCI 249
            FTC C  C 
Sbjct: 260 FFTCLCSECF 269


>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Meleagris gallopavo]
          Length = 486

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/474 (24%), Positives = 205/474 (43%), Gaps = 69/474 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + G+VI ++  Y  V  +S +   C  CF     L +C  C+  +YC   CQ+  W  H+
Sbjct: 23  LPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQCKFAYYCDRTCQRDAWLNHK 82

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  +    K+  K+ T +IRL  ++  R + +   +  +        ++ L  H+ + 
Sbjct: 83  NECSAI----KKHGKAPTENIRLAARILWRIEREGSGLSENCLVS----IDDLQNHVENF 134

Query: 125 DEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
           DE++   L   + + +     WP    +  +  I+  F  + CNA T+ +   L+ +G G
Sbjct: 135 DEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVG 191

Query: 179 LYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETA 225
           ++P +   NH C PN  ++F  G    VR++ H            +  G E+ +SY++  
Sbjct: 192 IFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFL 251

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  RQK LK+QY F CTC  C K                + KDD      +++ D K 
Sbjct: 252 NVSEERQKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP 293

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLY 341
                     S E +K    EV   SK TL   +     G + EVV   +   K Q+ + 
Sbjct: 294 ----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVL 339

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
              ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   
Sbjct: 340 GDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTH 399

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 400 WHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 107/456 (23%), Positives = 202/456 (44%), Gaps = 48/456 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI ++  Y  V  ++ S   C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AGDVIFAEPAYSAVVFDNLSHCVCHSCFKRQEKLLRCGQCKFAHYCDRTCQKESWVNHKH 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  +    K+  K+   +IRL  ++  R + +   +           ++ L  H+    
Sbjct: 84  ECLAI----KKAGKAPNENIRLAARILWRIEREGGGLTEGCLVS----IDDLQNHLDKFG 135

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           D ++  L   +   ++    WP    +  +  I+  FS ++CN  T+ +   L+ +G  +
Sbjct: 136 DGEKSSLMEDVQKFMDF---WPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVAI 192

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P + + NH C PN  ++F      +RA+  + KG E+ +SY++    T  R + LK+QY
Sbjct: 193 FPNLCLTNHDCWPNCTVIFNNGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQY 252

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F CTC  C    +  D    A+ +G    +D     +++ S D            + E+
Sbjct: 253 YFDCTCEHCT--NKTKDALLLAVNDGESKPEDRVVKEVIQYSKD------------TMEK 298

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           I+K  SE              G++ +VV   +   K Q+ ++   ++ +++      ++L
Sbjct: 299 IEKARSE--------------GSYHQVVKLCRECLKRQEPIFADTNIYMLRILSIYSEVL 344

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
             L+ +++A    +  I  Y ++Y   +  LG+     G  +W  G  E     + +A  
Sbjct: 345 SYLQMFQDAAENAKKMIDGYMKIYHPNNAQLGMAIMRAGVTQWHAGMIEVGHGMICKAFA 404

Query: 420 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           IL ITHG + P  K+  L++  +Q E   ++  ++E
Sbjct: 405 ILLITHGPSHPITKD--LEMMRSQTEMELRMFKENE 438


>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
          Length = 491

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 117/474 (24%), Positives = 204/474 (43%), Gaps = 69/474 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + G+VI ++  Y  V  +S +   C  CF     L +C  C+  +YC   CQ+  W  H+
Sbjct: 28  LPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAYYCDRTCQRAAWLNHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  + R  K   +++  + R+M K+       ++  + S        ++ L  H+   
Sbjct: 88  NECSAIKRHGKAPTENIRLAARIMWKIEREGSGLSEGCLVS--------IDDLQNHVDSF 139

Query: 125 DEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
           DE++   L A + + +     WP    +  +  I+  F  ++CN  T+ +   L+ +G G
Sbjct: 140 DEEEKKELRADVESFLEF---WPPHSQQFGMQYISHIFGVISCNGFTLSDQRGLQAVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETA 225
           ++P +   NH C PN  ++F  G    VR++ H            +  G E+ +SY++  
Sbjct: 197 IFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPGDELTVSYVDFL 256

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  R+K LK+QY F CTC  C K                + KDD     +L   + +G
Sbjct: 257 NVSEERRKQLKKQYYFDCTCEHCKK----------------QIKDD----LMLAVKEGEG 296

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLY 341
                     S E +K    EV   SK TL   S     G + EVV   +   K Q+ + 
Sbjct: 297 KP--------SAETVK----EVIQFSKDTLEKISKARMEGLYHEVVKLCRECLKKQEPVL 344

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
              ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   
Sbjct: 345 GDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTH 404

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 405 WHAGLIEAGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456


>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
 gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
          Length = 490

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 109/453 (24%), Positives = 197/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGSGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L L       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P ++++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK QY F CTC  C K G  DD+                  FL         
Sbjct: 258 VSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAIMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
          Length = 490

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 109/449 (24%), Positives = 195/449 (43%), Gaps = 60/449 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGSGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L L       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P ++++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK QY F CTC  C K G  DD+                  FL    D K  
Sbjct: 258 ISEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP- 297

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
                    S+E +K++        +K     S G + EVV   +   + Q+ ++   ++
Sbjct: 298 ---------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNI 348

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
             ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+
Sbjct: 349 YTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNWHAGN 408

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL +THG + P  K+L
Sbjct: 409 IEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Ornithorhynchus anatinus]
          Length = 477

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 106/440 (24%), Positives = 190/440 (43%), Gaps = 55/440 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I S+  Y  V  +S +   C  CF     L++C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +  K   +++  + R+M ++        +  + S        ++ L  H+    
Sbjct: 89  ECSAIKKHGKAPNENIRLAARIMWRIEREGTGLTEGCLVS--------IDDLQNHVDSFG 140

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +E+Q  L   + + ++    WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 141 EEEQKELRLDVESFLHF---WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R+K LK+QY
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQY 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            F CTC  C K G  DD+                  FL              G+    + 
Sbjct: 258 YFDCTCEHCEK-GIKDDL------------------FL--------------GVKDEPKP 284

Query: 300 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 355
            + +  EV   SK  L       S G + EVV   +   + Q+ +    ++ L++     
Sbjct: 285 SQDVVKEVIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVA 344

Query: 356 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + 
Sbjct: 345 SEVLSYLQSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMIC 404

Query: 416 EAVEILRITHGTNSPFMKEL 435
           +A  IL +THG   P  K+L
Sbjct: 405 KAYAILLVTHGPTHPITKDL 424


>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
           garnettii]
          Length = 490

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 112/450 (24%), Positives = 196/450 (43%), Gaps = 62/450 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WPE----ISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP      SI  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+A+ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL-LRDSDDKG 285
            T  R+K LK+QY F C C  C K                + KDD    FL ++D+    
Sbjct: 258 VTEERRKQLKKQYYFDCACEHCQK----------------KLKDD---LFLAVKDNPKPS 298

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
               +  +  SKE + KIA              S G + EVV   +   + Q+ ++   +
Sbjct: 299 QEVVKEMIQFSKETLDKIAK-----------ARSEGLYHEVVKLCRECLQKQEPVFADTN 347

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           + +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G
Sbjct: 348 LYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAG 407

Query: 406 DTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           + E     + +A  IL +THG + P  K+L
Sbjct: 408 NIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 110/435 (25%), Positives = 195/435 (44%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQCKFAHYCDRTCQKDAWVNHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K+  K+   +IRL  ++  R + +   +         SL +  V H  + +
Sbjct: 84  ECAAI----KKHGKAPNENIRLAARIMWRIEREGGGLTEGCFVSIDSL-QNHVEHFGEEE 138

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L L     +L + +  WP    + ++  I+     + CN  T+ +   L+ +G G++
Sbjct: 139 KKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIF 193

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P + ++NH C PN  ++F      +RA+  +  G E+ +SY++       R++ LK+QY 
Sbjct: 194 PNLCLVNHDCWPNCTVIFNNGKIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYY 253

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F CTC  C K G  DD+   A+ E     ++  S  +++D            +  SKE I
Sbjct: 254 FDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV-----------IQFSKEAI 295

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           +KI               S G ++EVV   +   + Q+ +    ++ +++    L ++L 
Sbjct: 296 EKIDK-----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIYMLRILSILSEVLS 344

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+   EA  Y +     Y ++Y   +  LG+     G   W  G  E     + +A  I
Sbjct: 345 YLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHGMICKAYAI 404

Query: 421 LRITHGTNSPFMKEL 435
           L +THG   P  K+L
Sbjct: 405 LLVTHGPTHPITKDL 419


>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
          Length = 490

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 111/453 (24%), Positives = 199/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + +        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVAVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  ++CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD    FL    D K  
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP- 297

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                    S+E +K++      LSK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 298 ---------SQEAVKEMIQ----LSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNLYTLRMLSTVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Taeniopygia guttata]
          Length = 478

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 109/457 (23%), Positives = 202/457 (44%), Gaps = 48/457 (10%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           ++G+VI ++  Y  V  +S +   C  CF     L +C  C+   YC   CQ+  W  H+
Sbjct: 28  LAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAHYCDRTCQRAAWLSHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  + R      K+ T +IRL  ++  + + +   +           +E L  H+   
Sbjct: 88  NECSAIKR----HGKAPTENIRLAARILWKMEREGSGLSEGCLV----AIEELQNHVDSF 139

Query: 125 -DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
            +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ +   L+ +G G
Sbjct: 140 GEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P +   NH C PN  +VF      +RA+  +  G E+ +SY++    +  R++ LK+Q
Sbjct: 197 IFPNLCQANHDCWPNCTVVFNNGKIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQ 256

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
           Y F CTC  C K                + KDD        +S     T ++  +  SK+
Sbjct: 257 YYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSADTVKEV-IQLSKD 299

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            ++KI       +K  +     G++ EVV   +   + Q+ +    ++ L++      ++
Sbjct: 300 TLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIYLLRILSIASEV 348

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
           L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A 
Sbjct: 349 LSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGLICKAY 408

Query: 419 EILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
            IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 409 AILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443


>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
          Length = 490

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 106/453 (23%), Positives = 197/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLR-RKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           EC  + R  K   +++  + R+M ++      L    ++P     N+      V H  + 
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVPVDDLQNH------VEHFGEE 142

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           ++K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 143 EQKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD    FL         
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
 gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
          Length = 486

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/474 (24%), Positives = 204/474 (43%), Gaps = 69/474 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + G+VI ++  Y  V  +S +   C  CF     L +C  C+  +YC   CQ+  W  H+
Sbjct: 23  LPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAYYCDRTCQRDAWLNHK 82

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  +    K+  K+ T +IRL  ++  R + +   +  +        ++ L  H+   
Sbjct: 83  NECSAI----KKHGKAPTENIRLAARILWRIEREGGGLSENCLVS----IDDLQNHVESF 134

Query: 125 DEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
           DE++   L   + + +     WP    +  +  I+  F  + CNA T+ +   L+ +G G
Sbjct: 135 DEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLSDQRGLQAVGVG 191

Query: 179 LYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETA 225
           ++P +   NH C PN  ++F  G    VR++ H            +  G E+ +SY++  
Sbjct: 192 IFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFL 251

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  R+K LK+QY F CTC  C K                + KDD      +++ D K 
Sbjct: 252 NVSEERRKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP 293

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLY 341
                     S E +K    EV   SK TL   +     G + EVV   +   K Q+ + 
Sbjct: 294 ----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVL 339

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
              ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   
Sbjct: 340 GDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTH 399

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 400 WHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 451


>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
           [Oryzias latipes]
          Length = 476

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 111/441 (25%), Positives = 194/441 (43%), Gaps = 49/441 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEV+ ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQ   W+ H+ 
Sbjct: 29  TGEVVFAEPSYSAVVFDSLAFQVCHCCFRRQAQLHRCGQCRFAHYCNRTCQTACWEEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +SV  + RL+ +L+    L +D+ + S        V+ L  H   +D
Sbjct: 89  ECAAIRKSGKAPSESVRLAARLLWRLHKDNGLASDSQLVS--------VDQLQEH---VD 137

Query: 126 EKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +       Q+   V ++  +      + S+  I+  F  + CN  T+ +   L+ +G GL
Sbjct: 138 QLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGIIKCNGFTLTDQRGLQAVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NH C PN   +       VR +  + +G E+ ISY++    +  RQK LKEQ+
Sbjct: 198 FPNLALVNHDCWPNCTAILNHGKIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQF 257

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKE 298
            F C+C  C +    DD+  +A         DG         D   F      L R S E
Sbjct: 258 HFECSCKHCSQHIS-DDLMMAAA--------DG--------QDSPWFI-----LCRPSAE 295

Query: 299 EIKKIASEVNILSKKTLALTS----CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
           +I+    EV+  SK +L         G++ EV+   +   + Q+ +     ++ ++    
Sbjct: 296 QIQ----EVSAFSKDSLEKIEKSRLDGDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSI 351

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
             ++L  L  + EA    +  +  Y ++Y   + LLG+     G   W  G  E     +
Sbjct: 352 AAEVLSSLRQFSEAADCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLI 411

Query: 415 TEAVEILRITHGTNSPFMKEL 435
            +A  IL +THG N    ++L
Sbjct: 412 CKAYSILMVTHGPNHSVTRDL 432


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/254 (32%), Positives = 131/254 (51%), Gaps = 27/254 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P        S    CD C      L +CS C+V  YC + CQK  W  H+ E
Sbjct: 27  GELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRVAKYCSAKCQKKAWPDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMEET--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 K------------QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 174
           +            Q  +  +I +   L L +      +I E F+K+ CNA TICN+E++ 
Sbjct: 139 EKKEGLRQLVMTFQHFMREEIQDASQLPLPF------DIFEAFAKVICNAFTICNAEMQE 192

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R+K 
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQ 252

Query: 235 LKEQYLFTCTCPRC 248
           L++QY F C C RC
Sbjct: 253 LRDQYCFECDCFRC 266


>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
 gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
          Length = 477

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/435 (23%), Positives = 191/435 (43%), Gaps = 45/435 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECLAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVIS 184
           +K+L +      L   + Q  + S+  I+  F  + CN  T+ +   L+ +G G++P + 
Sbjct: 144 QKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLG 202

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F C 
Sbjct: 203 LVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCA 262

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C K                + KDD    FL              G+  + +  +++ 
Sbjct: 263 CEHCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVV 289

Query: 305 SEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            E+   SK TL       S G + EVV   +   + Q+ ++   ++  ++    + ++L 
Sbjct: 290 KEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRMLSIVSEVLS 349

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  I
Sbjct: 350 YLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAI 409

Query: 421 LRITHGTNSPFMKEL 435
           L +THG + P  K+L
Sbjct: 410 LLVTHGPSHPITKDL 424


>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 334

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 136/262 (51%), Gaps = 23/262 (8%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G ++  +EP+          + C+ C      L +CS C+V  YC + CQK  W+ H+ 
Sbjct: 56  AGALLYREEPFAYAVTKKRLGAACERCLRRKERLLRCSQCKVARYCDARCQKEAWEDHKR 115

Query: 66  ECQVLSRLDKEKRKSVTP-----SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
           EC+ +        KSV P     S+RL+ ++    KL   +  PS    ++S +++ V  
Sbjct: 116 ECKCI--------KSVEPNFPPDSVRLVGRIIF--KLLRQSTCPSEELYSFSDLQSNVEK 165

Query: 121 MSDIDEKQLLLYAQIANL-----VNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
           +S+  ++ L   A+   L     +  + Q P     ++ E F+K+ACN  +I N E++ +
Sbjct: 166 LSEEMKEGLGHLAKTLQLYLKVEIQDVSQLP--PALDVFETFAKVACNCFSISNGEMQDV 223

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G GLYP +S++N+SC PN  +VFEG    +R+++ + +G E+ I Y+ET   T  RQ+ L
Sbjct: 224 GVGLYPSMSLLNNSCDPNCAIVFEGPELHLRSIREIQEGEELTICYVETMMPTPERQERL 283

Query: 236 KEQYLFTCTCPRCIKLGQFDDI 257
           K QY F C C  C    +  DI
Sbjct: 284 KRQYCFECDCLMCCTKSKAYDI 305


>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Canis lupus familiaris]
          Length = 490

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/448 (23%), Positives = 194/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L  C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + +        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVAVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L L     ++   +  WP    + S+  I+  F  ++CN  T+ +   L+ +G G++
Sbjct: 144 QKELRL-----DVDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNL 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R+K LK+QY F CTC  C K                + KDD    FL    D K   
Sbjct: 259 SEDRKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
            ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 350 TLRMLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
 gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
 gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
          Length = 490

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/453 (23%), Positives = 198/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD    FL         
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
           boliviensis boliviensis]
          Length = 490

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/452 (23%), Positives = 196/452 (43%), Gaps = 66/452 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R+K LK+QY F CTC  C K                + KDD    FL          
Sbjct: 259 SEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL---------- 289

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++  
Sbjct: 290 ----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFAD 345

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 346 TNIYTLRMLSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWH 405

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+ E     + +A  IL +THG + P  K+L
Sbjct: 406 AGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Cavia porcellus]
          Length = 490

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/452 (23%), Positives = 193/452 (42%), Gaps = 66/452 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKH 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   ++V  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENVRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            +  G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K G  DD+                  FL          
Sbjct: 259 SEERKRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL---------- 289

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               G+    +  +++  E+   SK TL       S G + EVV   +   + Q+ ++  
Sbjct: 290 ----GVKEDPKPSQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFAD 345

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            ++  ++    + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 346 TNLYTLRMLSVVSEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWH 405

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+ E     + +A  IL +THG + P  K+L
Sbjct: 406 AGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
 gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 490

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/453 (24%), Positives = 197/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+     L+ +G G+
Sbjct: 144 QKELRMD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH+C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K G  DD+                  FL         
Sbjct: 258 FSEERKKQLKKQYYFDCTCEHCQK-GLKDDL------------------FL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSKGLYHEVVKLCRECLEKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++  ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHSMICKAYAILLVTHGPSHPITKDL 437


>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 425

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 87/279 (31%), Positives = 134/279 (48%), Gaps = 42/279 (15%)

Query: 16  PYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLD 74
           P  C  ++SS    C  CFA   +L++C+ C+ V YC   CQKLDW  HR ECQ    + 
Sbjct: 32  PAFCAVSSSS----CGWCFAPQLSLQRCTGCRQVQYCSRRCQKLDWSQHRRECQAWRSIP 87

Query: 75  KEKRKSVTPSI----RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLL 130
            +   +  P++    RL  KL+L  K+  ++          + +  L  H+ D    +  
Sbjct: 88  VD---ATLPTVLLVCRLAAKLFLSSKVDQED---------KNRILNLRHHLDDHTAPKQQ 135

Query: 131 LYAQIANLVNLIL---------QWPEIS------INEIAENFSKLACNAHTICNSEL-RP 174
            + +   LV+L+L         Q P  +        EI + F +++CN  +I N     P
Sbjct: 136 QFCETTPLVHLLLSEYKVDKQEQTPSFAELQESLKPEILKLFGQVSCNGFSIMNGVTNEP 195

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +G GL+   S+ NH C PN V+ F+G+   V  ++ V +G E+ ISY+E   ST  RQK 
Sbjct: 196 VGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKDVKEGQELTISYVEVLQSTKKRQKI 255

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
           LK+ Y F C C RC       +  +   L+G +C + GC
Sbjct: 256 LKDSYFFECQCSRCTT-----ETTDDWYLDGLQCGNKGC 289


>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
           [Equus caballus]
          Length = 358

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 181/383 (47%), Gaps = 59/383 (15%)

Query: 53  SNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           + CQK  W+ H+ EC+ L      K +    S+RL+ ++    KL  +   PS +   YS
Sbjct: 3   AKCQKKAWQDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMEET--PSESEKLYS 55

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISIN-------EIAENFSKLACNAH 165
             + L ++++ + E +     Q+A      ++  EI          ++ E F+K+ CN+ 
Sbjct: 56  FYD-LESNINKLTEDKKEGLRQLAMTFQHFMR-EEIQDASQLPPSFDVFEAFAKVICNSF 113

Query: 166 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 225
           TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++  
Sbjct: 114 TICNAEMQEVGVGLYPSMSLLNHSCEPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDML 173

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
            ++  R+K L++QY F C C RC                              +D D   
Sbjct: 174 MTSEERRKQLRDQYCFECDCFRC----------------------------QTQDKDADM 205

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
            T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L         
Sbjct: 206 LTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS--ERLP-------D 252

Query: 346 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
           +N+ Q +  +  +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+  
Sbjct: 253 INIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPVRGVQVMKVGKLQLH 312

Query: 404 LGDTENAIKSMTEAVEILRITHG 426
            G    A+K++  A +I+R+THG
Sbjct: 313 QGMFPQAMKNLRLAFDIMRVTHG 335


>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 732

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 106/414 (25%), Positives = 172/414 (41%), Gaps = 51/414 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF--ASSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G+V+++ +P VC  ++  +   C  C+        +C AC    YCG  C+  D   H  
Sbjct: 234 GDVVMTCDPLVCAVHDRHAERACAFCYRDVDGGGVRCDACDAALYCGRRCRAADTS-HVG 292

Query: 66  ECQVLSRLDKEKR--KSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
           EC +  R   + R   +       +  LY+R          +T    +  V+AL +   D
Sbjct: 293 ECALARRAKTDPRLSSATRGLRLFLRLLYVR----------ATAPGLFDAVKALDSPFRD 342

Query: 124 IDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           +  ++   Y  +AN VN +L      + ++ +AE  SK+  N+H + ++  R LGTG+YP
Sbjct: 343 VSAEKRATYVGMANAVNSMLPPGPARMDVDALAEVMSKVHVNSHGVVDAAGRALGTGVYP 402

Query: 182 VISIINHSCLPNAVLVF------------------------EGRLAVVRAVQHVPKGAEV 217
             ++ NHSC PNAV+ F                         G    VR V  V +G E+
Sbjct: 403 PAALFNHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGSTNNGARLTVRCVSPVEEGEEI 462

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC-KDDGCSGF 276
            I+Y E   +  TR++AL E+  F C C RC         +  A L G+RC +   C G 
Sbjct: 463 CIAYAEVYATRETRREALWEKKAFACECRRCADAAS---ARRDAPLGGWRCPRLSACDGV 519

Query: 277 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC--GNHQEVVSTYKMIE 334
           +     D G +C  CG   ++   K+ A+E   + +   ALT+    +H   VS    + 
Sbjct: 520 V----PDAGDSCLTCGAKCARARSKRDAAERRWIGRHAAALTALRENDHATAVSIACEVV 575

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHP 388
           +          V   + R  L+     L+ W+ A    +      +R     HP
Sbjct: 576 RESDDALCDAHVIRHECRLVLMDAHAALKQWRHAADVSRDVADAMRRFVNVNHP 629


>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
           AFUA_2G10080) [Aspergillus nidulans FGSC A4]
          Length = 497

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 78/254 (30%), Positives = 128/254 (50%), Gaps = 22/254 (8%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSN----------LKKCSACQVVWYCGSNCQKL 58
           +++  Q+P+V V         C GCF   +          LK C+ C VV YC  +CQ  
Sbjct: 38  DILHIQDPFVAVLKTERLQDTCSGCFGKRHFDSYSGQEVSLKACTGCHVVKYCDKSCQSK 97

Query: 59  DWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL 117
           DWKL H  EC +   L   K K +  + R +L++ LR + + +    + T +   L + L
Sbjct: 98  DWKLTHSRECVIFRNL---KPKVLPVNARALLRMVLRTEARKN----AYTEEELVLFQTL 150

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPL 175
             H+ DI  +      +IA     + ++ +  + E  I    ++L  N+  + N +   +
Sbjct: 151 ETHIDDILNRNAPQAERIALTSRAVKEYSKTDMEEEKIVAYHARLDLNSFNLTNDD--DI 208

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G  L+P  ++INHSC  NAV+ F+G    V+A++ +  G ++ ISYI+T   T  RQK L
Sbjct: 209 GIYLHPYAALINHSCDYNAVVGFDGSEIFVKAIRPIATGEQIFISYIDTTYPTRIRQKEL 268

Query: 236 KEQYLFTCTCPRCI 249
           +E+Y FTC C +C+
Sbjct: 269 QERYFFTCNCAKCL 282


>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
          Length = 500

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 115/469 (24%), Positives = 199/469 (42%), Gaps = 72/469 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS----NLKKCSACQVVWYCGSNCQKLDWKL- 62
           G+ ++   PY           RC+ CF       +L +CS+C++  YCG  CQ   WK  
Sbjct: 67  GDTVLRSRPYAFEIFPELREERCNECFRRPAEGISLLRCSSCKITRYCGKECQARAWKRS 126

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRK--LQNDNVIPSTTTDNY----SLVEA 116
           H+ EC +   L+ ++  S+  S+ + + + +R    L +   + + ++D+Y      V+A
Sbjct: 127 HKYECSLQRELE-DRFGSLPSSVYIDVTIIIRIAILLMSGKAVNAMSSDDYVQDHDDVKA 185

Query: 117 LVAHMSDIDEKQLLLYAQIANLVNLI--------------LQWP-EISINEIAENFSKLA 161
           ++ HM+ + +     +A    +V +               L W  + +  E+ +   K A
Sbjct: 186 MIDHMAQMRKSNAQEFAGNQEIVRIAEHLLDMLQARSPKGLDWSLKPTEEELLKVLCKFA 245

Query: 162 CN-----AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 214
           CN     A  I +  +   G G+YP+ +I+NHSC PN V+ +  E      R ++ +  G
Sbjct: 246 CNNFSHAARQIWDDLIVSHGMGVYPLGAILNHSCKPNCVIYYHPETHEQEFRCIEDIQVG 305

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 274
            ++  SYI+ A  + TR++ L+  Y F C C  C    + D+            K     
Sbjct: 306 EDICHSYIDLAAVSKTRKEKLQSTYYFDCDCQCCKFPEELDN------------KLGARD 353

Query: 275 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 334
           G +    D         G   S+ EI+   S +  L +  LA    GN            
Sbjct: 354 GKVTEKCDRAAELLAAAG---SRTEIEHALSRLKDLDENVLA--DRGN------------ 396

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 394
                      ++ +  + K+++  +EL     A+  C+  +  Y+ +YP  HPLLGLQ 
Sbjct: 397 ---------VDLDRLSVKSKMLQASIELGMMDSAIRACKQVVEGYRGIYPPLHPLLGLQL 447

Query: 395 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 443
           YT G L +  G  E A   + E  +IL  TH   S  ++ +   L EA+
Sbjct: 448 YTLGNLLFDDGRGEEAADVLQEGQKILLATHDRRSTMVQGITELLAEAR 496


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 109/449 (24%), Positives = 195/449 (43%), Gaps = 58/449 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L+KCS C+   YC   CQ+  W  H+ 
Sbjct: 24  AGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHYCDRTCQRAGWAEHKQ 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   +++    RLM +L     + +D  + +        VE L  H++D+ 
Sbjct: 84  ECGAIKAYGKAPNENIRLVARLMWRLDKEGSVVSDMQLIT--------VEELEDHVADMQ 135

Query: 126 EKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           E ++  L   I N ++    WP  S    I++I+  F  + CN  T+ +   L+ +G GL
Sbjct: 136 EDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGL 192

Query: 180 YPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKGAEVLISYIETAG 226
           +P + ++NH+C PN  V++  G  + V            R++  + +G E+ ++Y++   
Sbjct: 193 FPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELTVAYVDFLN 252

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK QY F C C  C K G  DD++ +                  R+ D    
Sbjct: 253 LSEERRRLLKTQYFFDCQCDYC-KNGTKDDLKLAG-----------------REVDGVKP 294

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
           + QQ      KE       ++  + K  L     G++ +VV   + +    + +     +
Sbjct: 295 SEQQV-----KEATDYCFQKLETMDKARLD----GDYHQVVKICRDVIDRTEPVLADTHI 345

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            L++    L ++   L+ + +A  Y +  +  Y ++Y   +  LG+     G   W  G 
Sbjct: 346 YLLRMWSTLSEVQAYLQYFNDAAEYSRKMVEGYMKLYHPNNAALGMAAMRAGVNHWQAGL 405

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL +THG   P  K+L
Sbjct: 406 IEVGHGMVCKAYAILLVTHGPTHPITKDL 434


>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
 gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
 gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
 gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
 gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
 gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
 gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
 gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
          Length = 490

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 104/452 (23%), Positives = 196/452 (43%), Gaps = 66/452 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K                + KDD    FL          
Sbjct: 259 SEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL---------- 289

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++  
Sbjct: 290 ----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFAD 345

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 346 TNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWH 405

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+ E     + +A  IL +THG + P  K+L
Sbjct: 406 AGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
          Length = 493

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 104/452 (23%), Positives = 196/452 (43%), Gaps = 66/452 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K                + KDD    FL          
Sbjct: 259 SEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL---------- 289

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++  
Sbjct: 290 ----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFAD 345

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 346 TNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWH 405

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+ E     + +A  IL +THG + P  K+L
Sbjct: 406 AGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
          Length = 490

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 107/453 (23%), Positives = 197/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD    FL         
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
          Length = 510

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 107/453 (23%), Positives = 197/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD    FL         
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
          Length = 484

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 104/452 (23%), Positives = 196/452 (43%), Gaps = 66/452 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 23  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 82

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 83  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 137

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 138 QKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 192

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 193 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 252

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K                + KDD    FL          
Sbjct: 253 SEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL---------- 283

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++  
Sbjct: 284 ----GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFAD 339

Query: 344 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W 
Sbjct: 340 TNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWH 399

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G+ E     + +A  IL +THG + P  K+L
Sbjct: 400 AGNIEVGHGMICKAYAILLVTHGPSHPITKDL 431


>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Ailuropoda melanoleuca]
          Length = 490

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 108/448 (24%), Positives = 194/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K      + SIRL  ++  R + +   +          L +  V H  + +
Sbjct: 89  ECSAIKRYGK----VPSESIRLAARILWRVEREGTGLTEGCLVAVDEL-QNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH+C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K                + KDD    FL    D K   
Sbjct: 259 SEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLGVKDDPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
            ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 350 TLRLLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
           cuniculus]
          Length = 490

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 104/448 (23%), Positives = 193/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVDHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L      A++   +  WP      S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P ++++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F CTC  C K G  DD+                  FL    + K   
Sbjct: 259 SEERRRQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDNPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
            ++    + ++L  L+ + EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 350 TLRMLSIVSEVLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
          Length = 349

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 173/381 (45%), Gaps = 36/381 (9%)

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           +H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+
Sbjct: 1   MHKLECSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHL 54

Query: 122 SDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGL 179
             +D EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ +
Sbjct: 55  DKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAI 114

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y
Sbjct: 115 FPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSY 174

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
            FTC C  C                    KD   +   +R   D         +VR    
Sbjct: 175 FFTCECQECT------------------TKDKDKAKVEIRKLSDPPKAEAIRDMVRYARN 216

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           + +           +  L  C   QE +S+          ++   +V ++    + + + 
Sbjct: 217 VIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGVC 266

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
           + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L       K++ +A+ 
Sbjct: 267 LYMQDWEGALQYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLEHKAAGEKALKKAIA 326

Query: 420 ILRITHGTNSPFMKELILKLE 440
           I+ + HG + P++ E+  ++E
Sbjct: 327 IMEVAHGKDHPYISEIKQEIE 347


>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
           bisporus H97]
          Length = 674

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 153/327 (46%), Gaps = 52/327 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDWKLHR 64
           GE+I ++       +     + C  CF  ++   LK+C  C+++ YC S CQ  DW LH+
Sbjct: 110 GEIIFTEVQQSAALSTGLLATHCSACFVEASGVPLKRCPTCRIIHYCDSECQSRDWTLHK 169

Query: 65  LECQVLSRL------------DKEKRKSVTPS--IRLMLKLYLRRKLQNDNVIPSTTTDN 110
            EC  L +             + +K+  V PS  IR + ++  R++ +        + D 
Sbjct: 170 RECPALQKWATSVPSPSPETSEDKKQGKVIPSDAIRALGRMLWRKQKKG------LSGDW 223

Query: 111 YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLIL------QWPEISINEIA---ENFSK 159
              V+ + +H + +  +EK   L+  + + +   L      ++    I+ IA   +  S+
Sbjct: 224 AKQVDTMQSHRARLSSNEKSSQLHTHMVHALVRYLGLESLDEFSSYGIDSIAGLVDLVSR 283

Query: 160 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR---------LAVVRAVQH 210
              NA T+ +  L P+G  + P+I++ NHSC PNAV VF            +A V  ++ 
Sbjct: 284 FTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRD 343

Query: 211 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDIQESAILEGYRC 268
           +P   EV+ISYI+T  +   RQKALKE Y FTC+C  C K   G + D +ES       C
Sbjct: 344 IPANEEVVISYIDTTLTKRERQKALKETYYFTCSCSLCEKPRDGAYVDSRES-----LYC 398

Query: 269 KDDGCSGF-LLRDSDDKGFTCQQCGLV 294
             + C G   L   DD    C +CG V
Sbjct: 399 PKN-CGGLCWLPTEDDPLARCAKCGTV 424


>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
          Length = 391

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/372 (25%), Positives = 178/372 (47%), Gaps = 27/372 (7%)

Query: 94  RRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 152
           R +++  +    T ++    V+   +H+  +D EK+ L+ + I+ L +   +  E    E
Sbjct: 20  RSRMRKKSHPERTPSEKLLTVKEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDQE 79

Query: 153 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
            +   F+++ CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 80  SLVALFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 139

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---------IKLGQFDDIQES-A 261
             G EV  SYI+    T  R   L++ Y FTC C  C         +K+ + +D  ++ A
Sbjct: 140 GPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAKVKIRKLNDPPKAEA 199

Query: 262 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKT----- 314
           I +  R   +    F  R +    +    CG    R   E+ +  + V++L  KT     
Sbjct: 200 IRDMVRYARNVIEEF--RRAKHYKYILDSCGRSPARGAGELGQELNVVDVLLSKTTENGS 257

Query: 315 ---LALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEA 368
              L L     H       ++ E  Q+K+   F   +V ++    + + + + ++DW+ A
Sbjct: 258 LKVLILGHYFGHTPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYIQDWEGA 317

Query: 369 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 428
           L Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + HG +
Sbjct: 318 LRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEVAHGKD 377

Query: 429 SPFMKELILKLE 440
            P++ E+  ++E
Sbjct: 378 HPYISEIKQEIE 389


>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 528

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 149/321 (46%), Gaps = 69/321 (21%)

Query: 24  SSSISRCDGCFAS------------------SNLKKCSACQVVWYCGSNCQKLDWKL--H 63
           S S+  C+ CF+S                   +L +CS C+V  YC   CQK DW    H
Sbjct: 37  SISLQACNWCFSSPQERANSSFFALLPSETADSLFRCSRCKVARYCTKKCQKCDWNEGNH 96

Query: 64  RLEC-------QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
           R EC       Q L   + EKR +  P+I  +++L      Q      S  T N      
Sbjct: 97  RSECISWKTVEQALRTRNSEKRSASYPTILFVMRL----AAQMYQTASSDPTGN------ 146

Query: 117 LVAHM--SDIDEKQLLLYAQIANLVNL------ILQWPEI------------SINEIAEN 156
           L  H   ++ D+++L  ++++A LV +      I+Q  E+            ++ +I + 
Sbjct: 147 LCHHYGRNEFDQEKLQEFSEMAQLVFVHLLRVSIIQNNELEPRNSKQNDTKSTLLQITKL 206

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKG 214
           F  + CN  TICN+   PLG GLYP  +++NHSC PN ++ F+   + AV+  ++ +  G
Sbjct: 207 FCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIETIHSG 266

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY---RCKDD 271
            E+ ISY++    T  R+K L+  Y F CTC RC         QE A LEG+   +C++ 
Sbjct: 267 QELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCT------SSQEDAFLEGFLCTQCRNT 320

Query: 272 GCSGFLLRDSDDKGFTCQQCG 292
             S  +L D   + F C  C 
Sbjct: 321 SESARIL-DEKKRIFRCCVCA 340


>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
 gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
          Length = 490

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 104/448 (23%), Positives = 194/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVF-EGRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHL 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F C+C  C K G  DD+                  FL    D K   
Sbjct: 259 SEERRQQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAVKEDPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 350 VLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Equus caballus]
          Length = 490

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 106/453 (23%), Positives = 196/453 (43%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +++L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QRELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K G  DD+                  FL         
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+    +  + +  EV   SK TL       + G + EVV   +   + Q+ ++ 
Sbjct: 290 -----GVKDDPKPSQDVVKEVVQFSKDTLEKIDKARTEGLYHEVVKLCRECLQKQEPVFA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNLYMLRILSIVSEVLSYLQAFEEASYYARKMVDGYMKLYHPNNAQLGMAVMRAGLTNW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 405 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 530

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 130/274 (47%), Gaps = 31/274 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC---FASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +G ++IS +P++ V +N S  + C  C      S LK+C  C++VWYC  NC   DW LH
Sbjct: 26  AGSILISVKPHISVLSNQSLHTHCSSCCGPAPPSGLKRCVRCRMVWYCDVNCSSNDWTLH 85

Query: 64  RLECQVLSRLDKEKRKS--VTPS--IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           +LEC  L +            PS  +R + ++  +R+ +    I +   D      ++ +
Sbjct: 86  KLECSALKKWSSSAPSPDVAIPSDAVRCLGRILWKRRAEGPESIWAKEID------SMQS 139

Query: 120 HMSDIDEKQLLLYAQIANLVNLIL---------QWPEISINEIAENFSKLACNAHTICNS 170
           H   +       +  +A+ +   L         ++   +  ++ +  SK   N  T+ +S
Sbjct: 140 HRGSLQPSAFESHTHLAHFLVRFLDLSSPAELSEYGLSTAGDLVDIISKFITNTFTLTSS 199

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVPKGAEVLISY 221
            L  LG  + P++++INHSC PNAV+V+         E  L  V A++ +    E+L +Y
Sbjct: 200 SLSALGVSVSPLVALINHSCDPNAVIVYPRCSNEPSTEEPLMQVVAIRDIEVDEEILTAY 259

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
           I+T     +RQK LKE Y F C CP C K    D
Sbjct: 260 IDTTLPRFSRQKFLKETYNFDCQCPSCTKYSGVD 293


>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 489

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 113/454 (24%), Positives = 196/454 (43%), Gaps = 62/454 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEV+ ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQ   W+ H+ 
Sbjct: 29  TGEVVFAEPSYSAVVFDSLAFQVCHCCFRRQAQLHRCGQCRFAHYCNRTCQTACWEEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +SV  + RL+ +L+    L +D+ + S        V+ L  H   +D
Sbjct: 89  ECAAIRKSGKAPSESVRLAARLLWRLHKDNGLASDSQLVS--------VDQLQEH---VD 137

Query: 126 EKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +       Q+   V ++  +      + S+  I+  F  + CN  T+ +   L+ +G GL
Sbjct: 138 QLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGIIKCNGFTLTDQRGLQAVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPK------------GAEVLISYIETAG 226
           +P ++++NH C PN   +   G  + V +  H P+            G E+ ISY++   
Sbjct: 198 FPNLALVNHDCWPNCTAILNHGNHSAVSSALHSPRRIEVRTLGKISEGEELTISYVDFLQ 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  RQK LKEQ+ F C+C  C +    DD+  +A         DG         D   F
Sbjct: 258 LSADRQKQLKEQFHFECSCKHCSQHIS-DDLMMAAA--------DG--------QDSPWF 300

Query: 287 TCQQCGLVR-SKEEIKKIASEVNILSKKTLALTS----CGNHQEVVSTYKMIEKLQKKLY 341
                 L R S E+I+    EV+  SK +L         G++ EV+   +   + Q+ + 
Sbjct: 301 I-----LCRPSAEQIQ----EVSAFSKDSLEKIEKSRLDGDYSEVLRLCRECLQKQRGVL 351

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
               ++ ++      ++L  L  + EA    +  +  Y ++Y   + LLG+     G   
Sbjct: 352 ADTHLHRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKLYHPNNALLGMAIMRAGVTL 411

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           W  G  E     + +A  IL +THG N    ++L
Sbjct: 412 WHAGQIEEGHSLICKAYSILMVTHGPNHSVTRDL 445


>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
          Length = 485

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 103/448 (22%), Positives = 193/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 84  ECAAIKKYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 138

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 139 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 193

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 194 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHL 253

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F C+C  C K G  DD+                  FL    D K   
Sbjct: 254 SEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP-- 292

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 293 --------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 344

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           +++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 345 VLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHI 404

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 405 EVGHGMICKAYAILLVTHGPSHPITKDL 432


>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 643

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 128/481 (26%), Positives = 200/481 (41%), Gaps = 110/481 (22%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDWKLHR 64
           GE+I ++       +     + C  CF  ++   LK+C  C+++ YC + CQ  DW LH+
Sbjct: 75  GEIIFTEVQQSAALSTGLLATHCSACFVEASGVPLKRCPTCRIIHYCDTECQSRDWTLHK 134

Query: 65  LECQVL-----------SRLDKEKRKS-VTPS--IRLMLKLYLRRKLQNDNVIPSTTTDN 110
            EC  L           S   ++KR+  V PS  IR + ++  R++ +  +   +   D 
Sbjct: 135 RECPALQKWATSVPSPSSETSEDKRQGKVIPSDAIRALGRMMWRKQKKGSSGDWAKQVDT 194

Query: 111 YS--------------------LVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEI 148
                                 +V ALV ++    +DE        IA LV+L+      
Sbjct: 195 MQSRKHRVRLSSNEKSSQLHTHMVHALVRYLGLESLDELSSYGIDSIAGLVDLV------ 248

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR------- 201
                    S+   NA T+ +  L P+G  + P+I++ NHSC PNAV VF          
Sbjct: 249 ---------SRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKAN 299

Query: 202 --LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDI 257
             +A V  ++ +P   EV+ISYI+T  +   RQKALKE Y FTC C  C K   G + D 
Sbjct: 300 EPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCNCSLCEKPRDGAYVDS 359

Query: 258 QESAILEGYRCKDDGCSGF-LLRDSDDKGFTCQQCGLVRSKEE------------IKKIA 304
           +ES       C  + C G   L   DD    C +CG V  + +            +KK+ 
Sbjct: 360 RES-----LYCPKN-CGGLCWLPTEDDPLARCAKCGTVVKESDAILDVIRIGFVALKKV- 412

Query: 305 SEVNILS-KKTLALTS-----------CGNHQEVVSTYKMIEKLQKKLYH-PFSVNLM-- 349
           ++V I   +K + LT+           C   Q +++  ++   L    Y  PF  N+   
Sbjct: 413 NKVQISDPQKAIQLTTNLIRILASARVCPFTQPLLALSRLHTTLLISNYRSPFDPNVAEI 472

Query: 350 ---------QTREKLIKILMELED-WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 399
                    QT ++ I  L E ++   E +   + T     R+ P  HP+  L     GK
Sbjct: 473 SSPEIQEQRQTNQEDISALSEKQNQLDETIQSAKRTTMGLNRLLPFGHPIRALALTELGK 532

Query: 400 L 400
           L
Sbjct: 533 L 533


>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
 gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
           Full=CD8b-opposite; AltName: Full=Zinc finger protein
           BOP; Short=m-BOP
 gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
           Methyltransferase Smyd1
 gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
 gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
 gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
          Length = 490

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 103/448 (22%), Positives = 193/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECAAIKKYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHL 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R++ LK+QY F C+C  C K G  DD+                  FL    D K   
Sbjct: 259 SEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           +++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G  
Sbjct: 350 VLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/253 (29%), Positives = 123/253 (48%), Gaps = 17/253 (6%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHR 64
           +SG+V+ S+ P+  V  +S +   C  CF   + L++C  C+   YC   CQ   WK H+
Sbjct: 23  LSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDKLQRCGQCKFAQYCDKTCQSAGWKEHK 82

Query: 65  LEC---QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           LEC   +    +     K+   +IRL  ++  R  L  +  + S      + ++ L  H+
Sbjct: 83  LECGAIKAFGAIGDNFGKAPNENIRLAARIMWR--LDKEGGVMSDM--QMTSLDDLENHI 138

Query: 122 SDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPL 175
           SD+ E  L  L   I N ++    WP  S    +++I+  F  + CN  T+ +   L  +
Sbjct: 139 SDMPEDDLKELKVDIHNFLDY---WPRNSKQHRVDDISHIFGVINCNGFTVSDQRGLNAV 195

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G GL+P + ++NH C PN  ++       +RA+  +  G EV +SY++    T  RQ+ L
Sbjct: 196 GVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQL 255

Query: 236 KEQYLFTCTCPRC 248
           K  Y F CTC  C
Sbjct: 256 KMHYFFDCTCEHC 268


>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 1-like [Pongo abelii]
          Length = 490

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/448 (23%), Positives = 194/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVIS 184
           +K+L +      L   + Q  + S+  I+  F  + CN  T+ +   L+ +G G++P + 
Sbjct: 144 QKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLG 202

Query: 185 IINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTR 231
           ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    +  R
Sbjct: 203 LVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEER 262

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           ++ LK+QY F CTC  C K                + KDD    FL              
Sbjct: 263 KRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL-------------- 289

Query: 292 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
           G+  + +  +++  E+   SK TL       S G + EVV   +   + Q+ ++   ++ 
Sbjct: 290 GVKDNPKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
            ++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 350 TLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 409

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG + P  K+L
Sbjct: 410 EVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Anolis carolinensis]
          Length = 485

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 112/448 (25%), Positives = 197/448 (43%), Gaps = 58/448 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQCKFAHYCDRTCQKDAWVNHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K+  K+   +IRL  ++  R + +   +         SL +  V H  + +
Sbjct: 84  ECAAI----KKHGKAPNENIRLAARIMWRIEREGGGLTEGCFVSIDSL-QNHVEHFGEEE 138

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L L     +L + +  WP    + ++  I+     + CN  T+ +   L+ +G G++
Sbjct: 139 KKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGLQAVGVGIF 193

Query: 181 PVISIINHSCLPNAVLVF-EGRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    VR++ H            +  G E+ +SY++    
Sbjct: 194 PNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALGKISIGEELTVSYVDFLNV 253

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
              R++ LK+QY F CTC  C K G  DD+   A+ E     ++  S  +++D       
Sbjct: 254 CEDRRQQLKKQYYFDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV------ 300

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                +  SKE I+KI               S G ++EVV   +   + Q+ +    ++ 
Sbjct: 301 -----IQFSKEAIEKIDK-----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIY 344

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           +++    L ++L  L+   EA  Y +     Y ++Y   +  LG+     G   W  G  
Sbjct: 345 MLRILSILSEVLSYLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLI 404

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           E     + +A  IL +THG   P  K+L
Sbjct: 405 EAGHGMICKAYAILLVTHGPTHPITKDL 432


>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 465

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 107/413 (25%), Positives = 180/413 (43%), Gaps = 35/413 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G  ++S  P+  V + +    RC  CF   + + +C AC +  YC  NCQK DW+L HR+
Sbjct: 26  GSRVLSSLPFAKVLSVTYWQKRCSVCFQQLHCVSRCGACHIAHYCSKNCQKDDWRLDHRI 85

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRR-----KLQNDNVIPSTTTDNYSLVEALVAH 120
           EC    +L   +  S   S  L+L   +RR      L  D ++P             + H
Sbjct: 86  ECATFRQLATLRLHSDQISDLLLLGRVVRRIDGIEPLAKDGILPDKANATSVFPMDSMWH 145

Query: 121 MSDI-DEKQLL-LYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
             D+ +E  L+ L AQ   LV          + ++ E  ++  CN   I +     +G G
Sbjct: 146 SVDLTNETHLISLLAQKLGLVR-----ESWHVRDLQEMLARFQCNNFCILDENFFEVGAG 200

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            YP+ +++NHSC PN V +F    A +   A++ + K  EV ISY++ A     R+K L+
Sbjct: 201 CYPLGAMVNHSCDPNCVTIFARGSAQLELWAMKSIGKDEEVTISYVDPANCMNKRRKYLQ 260

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG---CSGFLLRDSDDKGFTCQQ-CG 292
           ++Y F C C RC     +     +A ++ +   D+       + L ++  +G    Q  G
Sbjct: 261 KRYHFDCRCQRC-----YCKASHAATIDTFLVADENGIPSEKWSLNEATKRGTVLSQITG 315

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE----KLQKKLYHPFSVNL 348
            +      +K+  +  IL+            + V +   M      +LQ +L+   ++ +
Sbjct: 316 KIEEFRVCEKLTRKCAILADIDSDFV-----RNVFTHLHMYNIDAFRLQARLFSDLTIMI 370

Query: 349 MQTREKLIKILMELEDWKE-ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
            +++    +   ELE  ++ AL Y       Y  VYP  HPL  L     G+L
Sbjct: 371 AESQSGNSREFAELEFVQQKALVYGYRISKFYAFVYPALHPLSNLHNIRMGEL 423


>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 478

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 106/449 (23%), Positives = 191/449 (42%), Gaps = 63/449 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEV+ ++  Y  V  +S +   C  CF   + L +C  C+   YC   CQ   W+ H+ 
Sbjct: 29  TGEVVFAEPSYSAVVFDSLAFQVCHCCFRRQAQLHRCGQCRFAHYCNRTCQTACWEEHKQ 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + +  K   +SV  + RL+ +L+    L +D+ + S        V+ L  H   +D
Sbjct: 89  ECAAIRKSGKAPSESVRLAARLLWRLHKDNGLASDSQLVS--------VDQLQEH---VD 137

Query: 126 EKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +       Q+   V ++  +      + S+  I+  F  + CN  T+ +   L+ +G GL
Sbjct: 138 QLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGIIKCNGFTLTDQRGLQAVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPK------------GAEVLISYIETAG 226
           +P ++++NH C PN   +   G  + V +  H P+            G E+ ISY++   
Sbjct: 198 FPNLALVNHDCWPNCTAILNHGNHSAVSSALHSPRRIEVRTLGKISEGEELTISYVDFLQ 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  RQK LKEQ+ F C+C  C +                   DD     ++  +D +  
Sbjct: 258 LSADRQKQLKEQFHFECSCKHCSQ----------------HISDD----LMMAAADGQDS 297

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
             Q+     SK+ ++KI        K  L     G++ EV+   +   + Q+ +     +
Sbjct: 298 PIQEVSAF-SKDSLEKI-------EKSRLD----GDYSEVLRLCRECLQKQRGVLADTHL 345

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
           + ++      ++L  L  + EA    +  +  Y ++Y   + LLG+     G   W  G 
Sbjct: 346 HRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQ 405

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL +THG N    ++L
Sbjct: 406 IEEGHSLICKAYSILMVTHGPNHSVTRDL 434


>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
           griseus]
          Length = 498

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 104/449 (23%), Positives = 194/449 (43%), Gaps = 60/449 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +V+ ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAGRIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVF-EGRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLH 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ L++QY F C+C  C K G  DD+                  F     D K  
Sbjct: 258 LSEERRRQLRKQYYFDCSCEHCQK-GLKDDL------------------FQAVKEDPKP- 297

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
                    S E +K++        +K     S G + EVV   +   + Q+ ++   ++
Sbjct: 298 ---------SPEVVKEMVQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNL 348

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+
Sbjct: 349 YVLRLLSIVSEVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGN 408

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL +THG + P  K+L
Sbjct: 409 IEVGHGMICKAYAILLVTHGPSHPITKDL 437


>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Taeniopygia guttata]
          Length = 491

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 111/470 (23%), Positives = 204/470 (43%), Gaps = 61/470 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           ++G+VI ++  Y  V  +S +   C  CF     L +C  C+   YC   CQ+  W  H+
Sbjct: 28  LAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAHYCDRTCQRAAWLSHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  + R      K+ T +IRL  ++  + + +   +           +E L  H+   
Sbjct: 88  NECSAIKR----HGKAPTENIRLAARILWKMEREGSGLSEGCLV----AIEELQNHVDSF 139

Query: 125 -DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
            +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ +   L+ +G G
Sbjct: 140 GEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETA 225
           ++P +   NH C PN  +VF  G    VR++ H            +  G E+ +SY++  
Sbjct: 197 IFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFL 256

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  R++ LK+QY F CTC  C K                + KDD        +S    
Sbjct: 257 SLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSA 300

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
            T ++  +  SK+ ++KI       +K  +     G++ EVV   +   + Q+ +    +
Sbjct: 301 DTVKEV-IQLSKDTLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTN 348

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           + L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G
Sbjct: 349 IYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAG 408

Query: 406 DTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
             E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 409 LIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 456


>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
           [Taeniopygia guttata]
          Length = 490

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 115/474 (24%), Positives = 203/474 (42%), Gaps = 70/474 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           ++G+VI ++  Y  V  +S +   C  CF     L +C  C+   YC   CQ+  W  H+
Sbjct: 28  LAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQCKFAHYCDRTCQRAAWLSHK 87

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC  + R      K+ T +IRL  ++  + + +   +           +E L  H+   
Sbjct: 88  NECSAIKR----HGKAPTENIRLAARILWKMEREGSGLSEGCLV----AIEELQNHVDSF 139

Query: 125 -DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTG 178
            +E++  L A + + +     WP    +  +  I+  F  ++CN  T+ +   L+ +G G
Sbjct: 140 GEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETA 225
           ++P +   NH C PN  +VF  G    VR++ H            +  G E+ +SY++  
Sbjct: 197 IFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFL 256

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             +  R++ LK+QY F CTC  C K                + KDD     +L   D   
Sbjct: 257 SLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDD----LMLAVKDSP- 295

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLY 341
                     S + +K    EV  LSK TL   +     G++ EVV   +   + Q+ + 
Sbjct: 296 --------TPSADTVK----EVIQLSKDTLEKINKARMEGHYHEVVKLCRDCLQKQEPVL 343

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
              ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   
Sbjct: 344 GDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTH 403

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
           W  G  E     + +A  IL ITHG + P  K+  L++   Q E   ++  ++E
Sbjct: 404 WHAGLIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 455


>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 115/478 (24%), Positives = 203/478 (42%), Gaps = 78/478 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQ---------- 56
           GE++ +   Y  V   +   + C+ CF    +L KC  C+  +YC  +CQ          
Sbjct: 30  GELVFACPAYSYVLTVNERGAYCEHCFTRREDLFKCGKCKQAYYCNVDCQVPVLPSLCFC 89

Query: 57  -----KLDWKLHRLEC-----------------QVLSR--LDKEKRKSVTPSIRLMLKLY 92
                ++     R EC                 ++++R  L ++     TPS RL+L   
Sbjct: 90  PPDCREVIGPCIRCECVSHVVHYGENWCPSETVRLVARIILKQKVTPERTPSERLLLLRE 149

Query: 93  LRRKLQNDNVIPS--TTTDNYSLVEALVAHMSDIDEKQLL--LYAQI-----ANLVNLIL 143
               L   + +       D  +L      H+SD+ + Q L  L+AQ+     A    LI 
Sbjct: 150 FESHLDKMDSMKDEMNQADIAALHHFYSRHISDLPDDQALTQLFAQVRSRTGAGGEKLIR 209

Query: 144 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
             P  S+N          CN  TI + EL  LG+ ++P ++++NHSC PN ++ ++G +A
Sbjct: 210 GSPPASVN----------CNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVA 259

Query: 204 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
            VRAVQ +  G E+  SYI+    T  R++ L + Y FTC C  C    + ++  +S   
Sbjct: 260 EVRAVQEINPGDEIFNSYIDLLYPTEDRKERLLDSYFFTCQCAECTTRSKDEEKMKSTKP 319

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 323
           +             L   + +        L+   EE ++     NILS     L  C   
Sbjct: 320 KSS-----------LEPEEVQSMVVYARNLI---EEFRRAKHYKNILSFPVSLLEMCE-- 363

Query: 324 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 383
              +S  KM       ++   +V ++    + + + + ++DW  A++Y +  I  Y   Y
Sbjct: 364 ---LSLDKM-----GSVFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIIQPYSVHY 415

Query: 384 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
           P +   +   Y   G+L   L  T   +K++ +A+ I+ + HG +  ++ E+  ++EE
Sbjct: 416 PAYSLNVASMYLKLGRLYLGLEKTPQGVKALKKALSIMEVAHGKDHYYVAEVKREIEE 473


>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 497

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 128/265 (48%), Gaps = 25/265 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-------------CSACQVVWYCGSN 54
           G+ I+  + ++ V + +     C  CF    L+              C+ C+ V YC   
Sbjct: 32  GQPILELDTWLAVLDTARLADTCSNCFGGKTLRDREVDGAPQVPLKLCTGCRTVRYCSKG 91

Query: 55  CQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL 113
           CQK +W  +H+ EC++   L        TPS R +L++ L +K Q D        D    
Sbjct: 92  CQKENWAAIHKHECKIFKNLYPNVLP--TPS-RAVLRILLLKKHQEDQ------GDRLQR 142

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP--EISINEIAENFSKLACNAHTICNSE 171
            ++L +H+++    +   +  +      I ++   E+S+ E+ + F KL  NA T+    
Sbjct: 143 FDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSKTELSLQEVVDCFGKLDINAFTLTTPF 202

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
              +G  + P+ S+ NHSC PNA + F+     +RA+QH+ +G ++ +SY+E   + + R
Sbjct: 203 YDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLRALQHIEEGEQIFVSYVEPTDACLHR 262

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDD 256
           Q  L ++Y F C CPRCI+  +  D
Sbjct: 263 QAELSKRYYFECECPRCIREKEIGD 287


>gi|224033847|gb|ACN35999.1| unknown [Zea mays]
 gi|413955568|gb|AFW88217.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
          Length = 129

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 56/98 (57%), Positives = 73/98 (74%)

Query: 305 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
           S++  LS K  +  S GN  E  S YK+I++L++ LYH FS  L+ T E L+KI +EL+D
Sbjct: 3   SKILQLSDKASSFLSSGNKAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQD 62

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
           W  AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW
Sbjct: 63  WWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEW 100


>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
          Length = 497

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 128/265 (48%), Gaps = 25/265 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-------------CSACQVVWYCGSN 54
           G+ I+  + ++ V + +     C  CF    L+              C+ C+ V YC   
Sbjct: 32  GQPILELDTWLAVLDTARLADTCSNCFGGKTLRDREVDGAPQVPLKLCTGCRTVRYCSKG 91

Query: 55  CQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL 113
           CQK +W  +H+ EC++   L        TPS R +L++ L +K Q D        D    
Sbjct: 92  CQKENWAAIHKHECKIFKNLYPNVLP--TPS-RAVLRILLLKKHQEDQ------GDRMQR 142

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP--EISINEIAENFSKLACNAHTICNSE 171
            ++L +H+++    +   +  +      I ++   E+S+ E+ + F KL  NA T+    
Sbjct: 143 FDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSKTELSLQEVVDCFGKLDINAFTLTTPF 202

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
              +G  + P+ S+ NHSC PNA + F+     +RA+QH+ +G ++ +SY+E   + + R
Sbjct: 203 YDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWLRALQHIEEGEQIFVSYVEPTDACLHR 262

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDD 256
           Q  L ++Y F C CPRCI+  +  D
Sbjct: 263 QAELSKRYYFECECPRCIREKEVGD 287


>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Ornithorhynchus anatinus]
          Length = 490

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 107/453 (23%), Positives = 192/453 (42%), Gaps = 68/453 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I S+  Y  V  +S +   C  CF     L++C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + +  K   +++  + R+M ++        +  + S        ++ L  H+    
Sbjct: 89  ECSAIKKHGKAPNENIRLAARIMWRIEREGTGLTEGCLVS--------IDDLQNHVDSFG 140

Query: 125 DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +E+Q  L   + + ++    WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 141 EEEQKELRLDVESFLHF---WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K G  DD+                  FL         
Sbjct: 258 VSADRKKLLKKQYYFDCTCEHCEK-GIKDDL------------------FL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+    +  + +  EV   SK  L       S G + EVV   +   + Q+ +  
Sbjct: 290 -----GVKDEPKPSQDVVKEVIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLA 344

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             ++ L++      ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W
Sbjct: 345 DTNIYLLRILSVASEVLSYLQSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHW 404

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G  E     + +A  IL +THG   P  K+L
Sbjct: 405 HAGLIEVGHGMICKAYAILLVTHGPTHPITKDL 437


>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
          Length = 352

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/377 (28%), Positives = 178/377 (47%), Gaps = 61/377 (16%)

Query: 60  WKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           W  H+ EC+ L      K +    S+RL+ ++    KL +    PS +   YS  + L +
Sbjct: 4   WPDHKRECKCLKSC---KPRYPPDSVRLLGRVVF--KLMDGT--PSESEKLYSFYD-LES 55

Query: 120 HMSDIDE------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
           +++ + E      +QL++  Q  +   +    Q P     ++ E F+K+ CN+ TICN+E
Sbjct: 56  NINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLP--PAFDLFEAFAKVICNSFTICNAE 113

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 114 MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 173

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 174 RKQLRDQYCFECDCFRC----------------------------QTQDKDADMLTGDEQ 205

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                +E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q 
Sbjct: 206 VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 252

Query: 352 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 253 KVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 312

Query: 410 AIKSMTEAVEILRITHG 426
           A+K++  A +I+R+THG
Sbjct: 313 AMKNLRLAFDIMRVTHG 329


>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
          Length = 456

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/425 (23%), Positives = 183/425 (43%), Gaps = 58/425 (13%)

Query: 30  CDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  CF     L +C  C+   YC   CQK  W  H+ EC  + R  K   +++  + R+M
Sbjct: 18  CHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKNECAAIKRYGKVPNENIRLAARIM 77

Query: 89  LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-- 146
            ++        +  + S        ++  V H  + ++K+L +     ++   +  WP  
Sbjct: 78  WRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEEQKELRV-----DVDTFLQYWPLQ 127

Query: 147 --EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRL 202
             + S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F  G  
Sbjct: 128 SQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNH 187

Query: 203 AVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
             V+++ H            + +G E+ +SYI+    +  R++ LK+QY F C+C  C K
Sbjct: 188 EAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK 247

Query: 251 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 310
            G  DD+                  FL    D K           S+E +K++       
Sbjct: 248 -GLKDDL------------------FLAAKEDPKP----------SQEVVKEMIQFSKDT 278

Query: 311 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 370
            +K     S G + EVV   +   + Q+ ++   ++ +++      ++L  L+ ++EA  
Sbjct: 279 LEKIDKARSEGLYHEVVKLCRECLEKQEPVFTDTNLYVLRLLSIASEVLSYLQAYEEASH 338

Query: 371 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 430
           Y +  +  Y ++Y   +  LG+     G   W  G  E     + +A  IL +THG + P
Sbjct: 339 YARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHP 398

Query: 431 FMKEL 435
             K+L
Sbjct: 399 ITKDL 403


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/290 (30%), Positives = 136/290 (46%), Gaps = 33/290 (11%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNL-KKCSACQVVWYCGSNCQKLDWK 61
           S      V+I+    VC    +SS S C  CFA  ++  +CSAC+   YC   CQ+ DW 
Sbjct: 21  SLRAGDRVLITS--AVCA---ASSPSSCGWCFAPGDVFSRCSACRKARYCSRACQQRDWP 75

Query: 62  LHRLECQVLSRL-DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            HR EC     + ++    +V    RL  KL+L  ++  +           + V  L  H
Sbjct: 76  QHRHECAAWRSIPERNPSPTVLLVARLAAKLFLGSQVDQEE---------KNGVLKLRDH 126

Query: 121 MSDIDEKQLLLYAQIANLVNLIL-------QWPEISINEIAEN--------FSKLACNAH 165
           ++D  E +   + ++  LV L+L       +    +++E+  +        F ++ CNA 
Sbjct: 127 LADHTELKRHQFDEMTQLVLLLLSRYKAEKREKNATLDELHRDLELEILKLFGRVNCNAF 186

Query: 166 TICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
           ++ N      +G GLYP  ++ NH C PN V+ F+GR   VR V+ +    E+ +SY+E 
Sbjct: 187 SVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGREMQVRVVRDIEVDEELTVSYVEL 246

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 274
             ST  R++ LKE Y F C C RC K        E   L+G RC +  C+
Sbjct: 247 LQSTKARRRELKESYFFDCECKRC-KAATNGQSNEDWYLDGLRCSNKKCA 295


>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
 gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
          Length = 544

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/257 (31%), Positives = 125/257 (48%), Gaps = 22/257 (8%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSN--------LKKCSACQVVWYCGSNCQK 57
           V G V++ Q P+V V +       C GC    +        LK C+ C+VV YC   CQ 
Sbjct: 80  VGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKACTGCRVVKYCNKTCQA 139

Query: 58  LDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
            DWK  H  EC+V     KE    V PS  R +L++ LR +            +   L  
Sbjct: 140 KDWKFAHSFECRVY----KELMPRVLPSNARAILRMVLRSERGK------YPKEELELFR 189

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELR 173
            L +HM++I  +    + +I+     + ++   +++E  I    +KL  N++   +    
Sbjct: 190 KLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMICHYGAKLELNSYNFHSPLAD 249

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
            LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+      TRQ 
Sbjct: 250 RLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKTRQT 309

Query: 234 ALKEQYLFTCTCPRCIK 250
            L E+Y F C CP+C K
Sbjct: 310 ELSERYYFKCRCPKCAK 326


>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
          Length = 464

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/269 (33%), Positives = 133/269 (49%), Gaps = 34/269 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK--LHR 64
           G  I+ +E Y  V    +  + C  CF  S +LKKCS C+   YCG +CQK  WK   HR
Sbjct: 12  GNSILVEEAYSWVTFRDNGPNICHTCFKFSKSLKKCSQCKFARYCGLDCQKKAWKENSHR 71

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDN----VIPSTTTDNYSLVEALVAH 120
            EC+ ++   + K     PSI  +  L+L R L +++    ++    +  YS     +  
Sbjct: 72  WECKAIASSVEAK----IPSIVRLAALFLFRALNSNDKDIRLVDRCRSFPYSYSN--MDS 125

Query: 121 MSDIDEKQLLLYAQIANLVNLILQ------WPEISINEIAE-NF-----SKLACNAHTIC 168
            SD ++  +L   ++  LV  +L+      +P+  ++E+ E NF       L  NAHTI 
Sbjct: 126 RSDENDVDVLSRQRVDCLVEDLLRFIRLSHYPQHEVSELFEDNFVASLIRMLEMNAHTIY 185

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV---------RAVQHVPKGAEVLI 219
           +SEL  LG G +P  S +NH C PN V +F G    V         R ++ +  G E++I
Sbjct: 186 DSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSVSGKPISIHIRCIRPIEAGEEIVI 245

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SY++   S M R + LKE Y F C C RC
Sbjct: 246 SYLDVCLSWMDRLEWLKEHYQFECCCSRC 274


>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
           leucogenys]
          Length = 512

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/445 (23%), Positives = 193/445 (43%), Gaps = 60/445 (13%)

Query: 30  CDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  CF     L +C  C+   YC   CQK  W  H+ EC  + R  K   +++  + R+M
Sbjct: 74  CHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIM 133

Query: 89  LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI 148
            ++        +  + S        ++  V H  + ++K+L +      L   + Q  + 
Sbjct: 134 WRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEEQKELRVDVDTF-LQYWLPQSQQF 187

Query: 149 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVR 206
           S+  I+  F  + CN  T+ +   L+ +G G++P + ++NH C PN  ++F  G    V+
Sbjct: 188 SMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVK 247

Query: 207 AVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 254
           ++ H            + +G E+ +SYI+    +  R++ LK+QY F CTC  C K    
Sbjct: 248 SMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK---- 303

Query: 255 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 314
                       + KDD    FL              G+  + +  +++  E+   SK T
Sbjct: 304 ------------KLKDD---LFL--------------GVKDNPKPSQEVVKEMIQFSKDT 334

Query: 315 L----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 370
           L       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L+ ++EA  
Sbjct: 335 LEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASF 394

Query: 371 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 430
           Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL +THG + P
Sbjct: 395 YARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHP 454

Query: 431 FMKELILKLEEAQAEASYKLSSKDE 455
             K+  L+    Q E   ++  ++E
Sbjct: 455 ITKD--LEAMRVQTEMELRMFRQNE 477


>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
          Length = 449

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 109/446 (24%), Positives = 193/446 (43%), Gaps = 46/446 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L  C  C+   YC   CQK  W  H+ 
Sbjct: 29  ASDVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQCKFARYCDRTCQKDAWVDHKK 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI- 124
           EC  + R  K   +++  + R+M ++        +  + S        V+ L  H+ +  
Sbjct: 89  ECLAVKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVS--------VDDLQNHVENFG 140

Query: 125 DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPV 182
           +E+Q  L   +   +   L Q  + S+  I+  F  + CN  T+ +   L+ +G G++P 
Sbjct: 141 EEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIFPN 200

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R+K LK+QY F 
Sbjct: 201 LGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFD 260

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           CTC  C K G  DD+                  FL    D K           S++ +K+
Sbjct: 261 CTCEHCQK-GLKDDL------------------FLGVKDDPKP----------SQDVVKE 291

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
           I        +K     S G + EVV   +   + Q  ++   ++ +++    + ++L  L
Sbjct: 292 IIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQDSVFADTNIYMLRVLSIVSEVLSYL 351

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E A++  TE    LR
Sbjct: 352 QAFEEASYYARRMVDGYVKLYHPNNAQLGMAVMRAGLTNWHAGNIE-AMRMQTEME--LR 408

Query: 423 ITHGTNSPF--MKELILKLEEAQAEA 446
           + H     +  M+E  L  +  Q  A
Sbjct: 409 LFHQNEFMYRKMREAALNNQPMQVMA 434


>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
 gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
          Length = 390

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 32/278 (11%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           + K  CN   I N +LR +G GLYP  ++INHSC  N V  F G    +RA+  +  G E
Sbjct: 106 YGKTTCNCFAIHNLDLREIGVGLYPQAAMINHSCKSNCVSTFRGPTLQIRALVDIQPGEE 165

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           V  SY E    T  R+  L+ +Y F C CP C+      D    AI++  +C    C G 
Sbjct: 166 VCYSYTEKGNVTHERRDELR-KYFFECQCPHCL------DTDRDAIMKSVKCP--SCQGQ 216

Query: 277 LLRDSDDKGFTCQQCGLVRSKEEIKK-----IASEVNILSKKTLALTSCGNHQEVVSTYK 331
           +   S D+   C  CG      E  +     I  E ++L ++                  
Sbjct: 217 VKPTSSDRYEKCSSCGFTDFTTEFYEDLEIYIHVEFDLLFRENC---------------- 260

Query: 332 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 391
           ++E    K+ HP ++++++         ++LE+W +A+ Y +     +    P   P  G
Sbjct: 261 LVE--LDKILHPDNIHVVRILVGAFAASVKLEEWTKAIDYGKRLDRAFGLYLPPNEPDTG 318

Query: 392 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 429
           L YY  GK  + L D ENA+ S+ +A  +L I +G +S
Sbjct: 319 LLYYKMGKAYYHLDDIENAVTSLRKAKTLLSIAYGRDS 356


>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
          Length = 287

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)

Query: 152 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           ++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +
Sbjct: 29  DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 88

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 271
             G E+ I Y++   ++  R+K L++QY F C C RC                       
Sbjct: 89  EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQT--------------------- 127

Query: 272 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 331
                  +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+  
Sbjct: 128 -------QDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 176

Query: 332 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
             E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+
Sbjct: 177 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 227

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
            G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 228 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 264


>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
 gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
          Length = 532

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 105/438 (23%), Positives = 190/438 (43%), Gaps = 61/438 (13%)

Query: 39  LKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL 97
           + KC+AC    YC   CQK  W + H+ EC+VL  +     + +  +    ++L  RRK 
Sbjct: 91  VDKCAACARFRYCSKACQKAAWNRGHKHECKVLKPM---AGRGLPKAFLACIELLTRRK- 146

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI------- 150
               +I   +  ++ +V  L +H+ D             N+  + +  P+ ++       
Sbjct: 147 --HGLI---SDQDWEMVCRLPSHVDDFKRN-----GTYGNIEMMAMGAPQFALPPNMFDR 196

Query: 151 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
           + IA  ++++  NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ 
Sbjct: 197 DFIAAMYARVMSNALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRP 256

Query: 211 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
           + K  E+ ISYI+T      RQ+ L+ ++ FTC C +C         QE A L+      
Sbjct: 257 IRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLV 307

Query: 271 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 330
              S F+L     +     Q        E++K+++E  I   K + L SC          
Sbjct: 308 PADSKFVLDPEAKQAMAQTQEQTFALYGELQKLSTEHVIHGLKQI-LASC---------- 356

Query: 331 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHP 388
                 + + Y  +     + R+ L+  L+ +  +++A A C      +  ++YP  FHP
Sbjct: 357 -----YESRFYPMYRQPYAEARDVLVVNLLSVGKFQDAWAQCAKRYKYILPKLYPVPFHP 411

Query: 389 LLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELI 436
           +  +Q +    L  +L  TE  + +            + + +++  ++HG N+ F K + 
Sbjct: 412 VRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVK 471

Query: 437 LKLEEAQAEASYKLSSKD 454
            K EE   E    + + D
Sbjct: 472 EKAEEMIEELKRSVGNPD 489


>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
          Length = 423

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 81/255 (31%), Positives = 121/255 (47%), Gaps = 22/255 (8%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+ I+S  P+V V         CD CF  S  LK+CSAC+ V++C   CQKL W  H+ 
Sbjct: 6   AGDEILSSTPFVYVLGKEGRGLVCDFCFLESGTLKRCSACKYVYFCNKQCQKLAWPDHKA 65

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT--DNYSLVEALVAHMSD 123
           EC  L ++      +   S+R   +L +  KL N +    T            LV+H+  
Sbjct: 66  ECAGLCKVSPNVPDT---SVRYFCRLLV--KLSNKDAWSETEKVFGKQRCFTDLVSHVDA 120

Query: 124 IDEKQLLLYAQIANLV--------NLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
           I + Q     +   L         +  L  PE+ +    E + K+  N++ ICN E   +
Sbjct: 121 IKKDQPRYLKEFKRLWETSKMFLDDKYLPEPEVGL----EIYGKMIINSYCICNDEHTAI 176

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV--PKGAEVLISYIETAGSTMTRQK 233
           GTGLY   SI++HSC PNA  V+EG    +RA + +       + +SYI+       RQ+
Sbjct: 177 GTGLYMGPSILDHSCSPNAHAVYEGHKLHLRAAEDINCSNFDGIRVSYIDVMAPKKVRQE 236

Query: 234 ALKEQYLFTCTCPRC 248
            L+ QY F C C +C
Sbjct: 237 ELRSQYYFECNCMKC 251


>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
          Length = 476

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 104/435 (23%), Positives = 189/435 (43%), Gaps = 60/435 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L  C  C+  +YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQCKFAYYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECLAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVMHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +KQL    Q+   V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKQL----QVD--VDTFLQYWPPKSQQFSMQYISNIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K G  DD+                    LR  DD   
Sbjct: 258 VSEERKKQLKKQYYFDCTCEHCQK-GLKDDL-------------------FLRVKDDPK- 296

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
                    S++ +K++    N   +K     S G + EVV   +   + Q+ ++   ++
Sbjct: 297 --------PSQDVVKEMIQFSNDTLEKIDKARSEGLYHEVVKLCRECLEKQESVFADTNL 348

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+
Sbjct: 349 YMLRMLSIVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGN 408

Query: 407 TENAIKSMTEAVEIL 421
            E     + +A  IL
Sbjct: 409 IEVGHGMICKAYAIL 423


>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
          Length = 489

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 100/420 (23%), Positives = 179/420 (42%), Gaps = 68/420 (16%)

Query: 7   SGEVIIS-QEPYVCVPNNSSSISRCDGCF--------------ASSNLKKCSACQVVWYC 51
           +G  I+S  +P V +P+++   S C  C               +   L  C  C++V YC
Sbjct: 24  AGTQILSIPKPLVTIPDDAHLQSTCSNCLHHAPDTPSVLTTYSSPRKLSACLGCKIVKYC 83

Query: 52  GSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
           G  CQ  DWK +H+ EC+  ++L +    SV  ++R++++         +  + +   D+
Sbjct: 84  GKECQTEDWKRVHKHECKTFAKLPRALPGSVRVTMRVLMQ-------NAEGGVLAQCEDH 136

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              VE           + + +  + A+  +   +       E+   +  +  N+ T+   
Sbjct: 137 ---VEKFRTEAGGKRWEGVFVMGKGAHGYSRTRR----GEGEVRRLYCAVLVNSMTLVTE 189

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 230
              P+G    P+ + INH C+PNAV++F GR+  +RA++ +P  AEVLISYI+       
Sbjct: 190 TFDPIGIAFDPLTASINHDCIPNAVMLFNGRILQIRALEDIPANAEVLISYIDNTVPRER 249

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 290
           RQ  L E+Y FTCTC RC           +  ++G  C    C G  +     +   C +
Sbjct: 250 RQLELSEKYFFTCTCSRCTG--------PAGPMDGLLC---ACGGVGM--PSREAVLCNR 296

Query: 291 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK-----LQKKLYHPFS 345
           C             ++V IL +   +       + ++S  + + K     L ++      
Sbjct: 297 CA------RALPATADVEILEQTAWSALDSDKSEAMISALRHLHKSPMWPLPRQPLAALH 350

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV----YPQ-FHPLLGLQYYTCGKL 400
           + L+Q         +   +W+ AL +  L   +Y R+    YPQ FHP+  +  +T   L
Sbjct: 351 LALVQGE------YIPSGNWERALLHHLL---IYLRLDPVFYPQRFHPMRVVHGFTLANL 401


>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 656

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 148/332 (44%), Gaps = 38/332 (11%)

Query: 7   SGEVIISQEPYV---CVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +G V+IS  P+V    +P+  ++ S C      + LK+C+ CQ V YC + CQK DW+LH
Sbjct: 87  AGTVLISVRPHVHVLSIPHLDTACSYCANAAPKNPLKRCTGCQRVRYCDAECQKSDWRLH 146

Query: 64  RLECQVLSRLDKEKRKS--VTPS--IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           + EC  L    K    +    PS  +R M +    R+ +         +     ++A+ +
Sbjct: 147 KHECGALQNWAKAAPSADLAVPSDAVRCMGRALRDRQKKGPE------SAWVKEMDAMQS 200

Query: 120 HMSDIDEKQLLLYAQIANLV--NLILQWPE-------ISINEIAENFSKLACNAHTICNS 170
           H S +       +  +A+ +   L L  P         S  ++ +  S+   N+ T+   
Sbjct: 201 HRSSLHPSSFESHTHLAHSIVRYLGLSSPGDLTAFGITSARDLVDLTSRFTSNSFTLATP 260

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVF------------EGRLAVVRAVQHVPKGAEVL 218
            L P+G  + P+I++INHSC PNAV+VF            E ++ VV A++ +    E+L
Sbjct: 261 ALAPIGVSVSPLIALINHSCDPNAVVVFPRSSTASTPAKLEPQMQVV-AIKDIEPDTEIL 319

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFL 277
            SYI+T      RQ ALKE Y FTC C  C +      I   SA+    RC   G +   
Sbjct: 320 TSYIDTTLPQPIRQSALKETYCFTCACTLCARPQDATTIDPRSAVWCAQRC--GGVASVY 377

Query: 278 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 309
               +     C  CG   S    + I   +N+
Sbjct: 378 SAGDELVVGRCNTCGFSASMTTAESILDAINV 409


>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 635

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/264 (29%), Positives = 130/264 (49%), Gaps = 23/264 (8%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA---SSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +G V+ S +P+V   + S     C  C A   S  LKKC+ C+ + YCG+ CQ  DW+ H
Sbjct: 105 AGTVLCSVKPHVHALSRSQLHLYCTSCCAPQPSVGLKKCTKCKAIHYCGTTCQNADWQFH 164

Query: 64  RLECQVLSRL-------DKEKRKSVTP-SIRLMLK-LYLRRKLQNDNVIPSTTTDNYSLV 114
           +LEC+ L R          + + +V P ++R + + ++ R+KL + ++      +  S  
Sbjct: 165 KLECEALQRWSVSAPPESDDAKYAVPPEAVRCLARTIWRRKKLGSGSIWWREINEMQSKR 224

Query: 115 EALVAHMSDID---EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
           E +   M D         + Y  I+    L       SI+E+ +  SK   N++T+    
Sbjct: 225 EGVAQSMIDAHVHLAHATVRYMGISGQEELKAHGVH-SISELVDLISKFTLNSYTLTTPS 283

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-------VRAVQHVPKGAEVLISYIET 224
           L P+G  + P+  ++NHSC PN  +VF   L+        + A+Q +P  +E+L SY++ 
Sbjct: 284 LSPIGVSVSPLAGLLNHSCDPNVSVVFPRILSADKEPALHIVAIQDIPADSELLTSYVDV 343

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
               + RQK LKE Y F C+C  C
Sbjct: 344 TLPVLQRQKDLKETYSFNCSCHSC 367


>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 315

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/255 (30%), Positives = 129/255 (50%), Gaps = 23/255 (9%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWK-LH 63
            +G+ ++ + PY  V  N      CD C    S L +CS+C+ V YC   CQK DWK  H
Sbjct: 30  AAGKCLLEELPYAYVLCNKKRGLFCDFCLKQCSTLLRCSSCKYVRYCSRPCQKGDWKRCH 89

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY----SLVEALVA 119
           + +C+ L R+       V P +  +L   +R++ Q+    P  T D+     + V+ L +
Sbjct: 90  KQDCKTLKRI----HPRVPPDLAQLLAQIIRKQRQS----PPCTQDDEDCFPTTVDQLES 141

Query: 120 H---MSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 174
           H   +SD   D  + L +     +   +L  P    + + + F    CN+ +IC+++L  
Sbjct: 142 HHEKLSDARKDHFESLWFVLQQCIEEDVLPQP----SSLVKMFGATICNSFSICDNDLNG 197

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +  G+Y   S++NHSC PN V+VF+ R   +R V+ V +G    ISY++       RQ  
Sbjct: 198 IAVGIYLRASMLNHSCYPNCVVVFDERKLQLRTVRDVKEGDVCTISYVDVINPAKERQTE 257

Query: 235 LKEQYLFTCTCPRCI 249
           L+E+Y F+C C +CI
Sbjct: 258 LEERYHFSCNCVKCI 272


>gi|312076846|ref|XP_003141043.1| MYND finger family protein [Loa loa]
 gi|307763792|gb|EFO23026.1| MYND finger family protein [Loa loa]
          Length = 439

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 103/422 (24%), Positives = 200/422 (47%), Gaps = 39/422 (9%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           L +C  C+   YC   CQK DW  H+ EC  LSR+     +S+    RL+ ++ ++ +  
Sbjct: 43  LSRCGRCKFTHYCNMKCQKKDWLTHKSECSYLSRVAPRIPESMP---RLIGRIIMKLRRC 99

Query: 99  NDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISIN-EIAE 155
            D     +   N  +  +L +H  DI  DE++   +  I +++   L + ++  N EI +
Sbjct: 100 GDK----SPAFNGRVFASLKSHTVDIEKDEEKRNGFIAIIHVIKDYLPFDKMPSNTEIFD 155

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH--VPK 213
            F K+  NA  I +S L  +G  +Y  +S ++HSC P+A ++F G  A++R++       
Sbjct: 156 IFCKILINALVITDSCLNRIGLAVYLGLSALDHSCKPDAFIIFSGAKAILRSLNENITEY 215

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
              + I Y +       R +AL++Q+ F C C  C     FD  ++ + +   +C D  C
Sbjct: 216 NDNLHIPYCDLLDLRSARCEALQKQHNFVCNCDIC---QDFDLDRQKSSVRCTKCTDGFC 272

Query: 274 SGFLLRDSDDKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS 328
                  S D   T ++C +     V + + ++K+  ++      T    +  N  E++ 
Sbjct: 273 PY-----SPDDDHTVKRCKVCHEISVFNSDHVQKLYQQL------TAPRPAEKNLNELID 321

Query: 329 TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHP 388
            Y  +E    +++ P++V L +  E ++   +  E + EA+ Y + T+  Y+  YP+ HP
Sbjct: 322 LYHELE----EVFSPYNVPLCKLAESIMISALNNEKYDEAVEYAEKTLLCYRTYYPKGHP 377

Query: 389 LLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF---MKELILKLEEAQAE 445
              ++ +   KL   L     ++  + +A++++  ++G+ S F      L+  LE+  + 
Sbjct: 378 SPSVRMFEYAKL-LMLQHNRESLPVLRKALKMICESYGSESSFAFNAATLLSDLEKCVST 436

Query: 446 AS 447
           AS
Sbjct: 437 AS 438


>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
           gorilla gorilla]
          Length = 459

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/469 (22%), Positives = 194/469 (41%), Gaps = 93/469 (19%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ LK+QY F CTC  C K                + KDD   G  ++D+     
Sbjct: 258 VSEERKRQLKKQYYFDCTCEHCQK----------------KLKDDLFLG--VKDNPKVVK 299

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
            C++C     K+E   + ++ NI   + L++ S                           
Sbjct: 300 LCREC---LQKQE--PVFADTNIYMLRMLSIVS--------------------------- 327

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
                     ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+
Sbjct: 328 ----------EVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGN 377

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 455
            E     + +A  IL +THG + P  K+  L+    Q E   ++  ++E
Sbjct: 378 IEVGHGMICKAYAILLVTHGPSHPITKD--LEAMRVQTEMELRMFRQNE 424


>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
          Length = 462

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 100/429 (23%), Positives = 186/429 (43%), Gaps = 59/429 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K+L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNL 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
           +  R+K LK+QY F CTC  C K G  DD+                  FL    D K   
Sbjct: 259 SEERRKQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP-- 297

Query: 288 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                   S+E +K++        +K     S G + EVV   +   + Q+ ++   ++ 
Sbjct: 298 --------SQEAVKEMTQFSKDTLEKIEKARSEGLYHEVVKLCRECLQKQEPVFADTNLY 349

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           +++    + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ 
Sbjct: 350 VLRMLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 409

Query: 408 ENAIKSMTE 416
           E A++  TE
Sbjct: 410 E-AMRVQTE 417


>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 555

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 128/256 (50%), Gaps = 29/256 (11%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASS-------NLKKCSACQVVWYCGSNCQKLDWKL 62
           V++ Q+P+V V + +   S+C GC  +         LK C+ C+VV YC   CQ  DWKL
Sbjct: 85  VVLIQKPFVAVLDTAQLESKCSGCLGAHANRQEAVELKACTGCRVVKYCNKTCQAKDWKL 144

Query: 63  -HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
            H LEC++   L   K + +  + R +L++ +   L      P+   +   L + L +HM
Sbjct: 145 FHSLECRIFQNL---KPRVLPNNARAILRMVM---LSERGRYPAGEME---LFKGLESHM 195

Query: 122 SDI-------DEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSEL 172
            D+       D+ Q   + +I      +  +   +++E  IAE  +KL  N+    N   
Sbjct: 196 KDVCIRNANGDKDQ---WERILLTAKAVKNYSGCAMDESLIAEYAAKLDVNSFNFDNVMD 252

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
             LG  ++P  +++NHSC  NA + F+     V+A+  + KG ++ I+Y++T      RQ
Sbjct: 253 ERLGIYMHPYAALMNHSCDYNATVAFDDDRLHVKALHPIKKGEQIFITYVDTTNPYKIRQ 312

Query: 233 KALKEQYLFTCTCPRC 248
           K L ++Y FTC C +C
Sbjct: 313 KELSDRYYFTCRCSKC 328


>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
          Length = 544

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 124/257 (48%), Gaps = 22/257 (8%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSN--------LKKCSACQVVWYCGSNCQK 57
           V G V++ Q P+V V +       C GC    +        LK C+ C+VV YC   CQ 
Sbjct: 80  VGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKACTGCRVVKYCNKTCQA 139

Query: 58  LDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
            DWK  H  EC+V     KE    V PS  R +L++ LR +            +   L  
Sbjct: 140 KDWKFAHSFECRVY----KELMPRVLPSNARAILRMVLRSERGK------YPKEELELFR 189

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELR 173
            L +HM++I  +    + +I+     + ++   +++E  +    +KL  N++   +    
Sbjct: 190 KLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSYNFHSPLAD 249

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
            LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+       RQ 
Sbjct: 250 RLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKIRQT 309

Query: 234 ALKEQYLFTCTCPRCIK 250
            L E+Y F C CP+C K
Sbjct: 310 ELLERYYFKCRCPKCGK 326


>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
           ND90Pr]
          Length = 529

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 103/443 (23%), Positives = 191/443 (43%), Gaps = 62/443 (13%)

Query: 36  SSNLKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR 94
            + + KC+ACQ   YC   CQK  W + HR EC++L  +     + +  +    ++L  R
Sbjct: 88  GAEVSKCAACQRFRYCSKACQKEAWNRGHRHECKILKPM---AGRGLPKAFLASMELLTR 144

Query: 95  RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-- 152
           RK     +IP      + L+  L +H+ D          ++  +       P+   ++  
Sbjct: 145 RK---HGLIPDK---EWELLCQLPSHIDDFKRNGTYQNIEMMAMGAAQFALPQNIFDKDF 198

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R ++ + 
Sbjct: 199 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLISIRTLRPIR 258

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
           K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +        
Sbjct: 259 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQ-------- 301

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                    +D          V SKEE++ +AS    +      +    N + V+   K 
Sbjct: 302 ---------EDNWLVPANSDFVASKEELEALASNQEQMFTLYEQVQGTPNAEAVIPVLKE 352

Query: 333 IEKLQKK-----LY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP- 384
             K  ++     LY  P++      R++LI  L+ +  ++EA A+       +  ++YP 
Sbjct: 353 SLKACRESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAHSAKRYKYILPKLYPV 408

Query: 385 QFHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFM 432
            FHP+  +Q +    L  +L  TE  + +            + + +++  ++HG ++ F 
Sbjct: 409 PFHPVRVVQTWQMAMLAAYLAGTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHGQDNAFT 468

Query: 433 KELILKLEEAQAEASYKLSSKDE 455
           K ++ K  E   E    + + D+
Sbjct: 469 KSVMEKTREMVEELKRSVGNPDK 491


>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
           intestinalis]
          Length = 430

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/439 (25%), Positives = 193/439 (43%), Gaps = 58/439 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDW-KLHRL 65
           G  ++S EPY  + +       CD CF   + L +CS C+ + YC  NCQK+ W + H+ 
Sbjct: 26  GSTVLSSEPYAYLLSKKQKGVYCDFCFKKQDGLLQCSGCKYMKYCNRNCQKMAWNEHHKA 85

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  L        K+V P       + L R L N     S      + +  L ++ + + 
Sbjct: 86  ECPAL--------KNVMPKRPPDFVILLGRLLWNMQQYSSAKLPEKNSILDLESNYNKLS 137

Query: 126 EKQLLLYAQIANLVNLILQW------PEISINE-IAENFSKLACNAHTICNSELRP-LGT 177
           + Q    A +  LV L   W      P+++ N+ + E  +++  N+  ICN EL+  +GT
Sbjct: 138 QNQK--EALMNFLVILHTFWSPKPLPPQVTDNKMLLELCARIKNNSFAICNEELQSDVGT 195

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           G+Y   S INHSC PN V  F  R   +RAV+++  G EVLISY++   ++  RQ+ L  
Sbjct: 196 GVYLNCSFINHSCEPNCVAEFNMRTLKIRAVKNITAGEEVLISYVDLFATSFERQRELMS 255

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
            Y                          ++C    C+         K    Q       K
Sbjct: 256 IY-------------------------HFQCTCHSCNA--------KTDDDQMMQDFDGK 282

Query: 298 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF-SVNLMQTREKLI 356
               K++S  ++LS+       C ++Q++     ++E   K+   P  ++ + +  +  +
Sbjct: 283 ITESKLSSVKDMLSQMEELRKQC-DYQKIKD---LVEGCVKRKILPHENIYMAKVLDFGM 338

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              +EL    +A  +  + +  Y+      HP+LG+Q    GK+       ++A++ + E
Sbjct: 339 DACIELGVLTQAFEFGSMALFSYKLYLHANHPMLGIQLMKLGKILLHEEKNQDAMQFLRE 398

Query: 417 AVEILRITHGTNSPFMKEL 435
           A +IL ITH  NS  + EL
Sbjct: 399 AFKILTITHSPNSSVLTEL 417


>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
           anubis]
          Length = 258

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 134/274 (48%), Gaps = 39/274 (14%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC    Q  D                
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 100

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D+D          ++   E++ K   EV    KK   L +    ++V++  + 
Sbjct: 101 ------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQA 141

Query: 333 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 392
           I     +     +V  ++  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+
Sbjct: 142 IISSNSERLPDINVYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGV 201

Query: 393 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 202 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 503

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/473 (24%), Positives = 204/473 (43%), Gaps = 92/473 (19%)

Query: 30  CDGCFASSN-------LKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLD--KEKRK 79
           C+ C ++S+       L++C+ C+ V YC   CQK DW   H   C VL  +D     R+
Sbjct: 71  CEYCLSTSDIQGQHTQLQRCTGCRQVVYCSVVCQKADWIAGHNSACAVLKTIDCSATTRQ 130

Query: 80  SVTPSIRLMLKLY-------LRRKLQNDNVIPSTTTDNYSLVE-----------ALVAHM 121
           S    I ++ K+         +    +D +      DN +LV            +L +H+
Sbjct: 131 SNRNDIAMLFKVVRIISNPSFQSTTLDDRMDLDCPMDNVALVRNIRSIQPLVFLSLQSHV 190

Query: 122 SDIDEKQLLLYAQIANLVN------LILQWPE-------ISINEIAENFSKLACNAHTIC 168
            D +  Q   Y    N +N      +  Q P        + ++++  +  +  CN  TI 
Sbjct: 191 LDFETCQAS-YPHATNALNGHSMKQIESQLPSAAVKLMNLPVSDLIHHLGRFRCNNFTII 249

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
           +S L P+G G YP+ S+ NH C PN + +F+G   V++ ++ + KG E+ ISYI+     
Sbjct: 250 DSNLFPVGEGTYPLASLFNHDCWPNCIAIFDGSRVVIQTIRDIAKGDELCISYIDPILDH 309

Query: 229 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF-- 286
            +R+ +L+ +Y F C C  C+        +    L     K D      L +  D G+  
Sbjct: 310 DSRRMSLETKYCFNCQCSVCMS-------ESCTPLHSPTTKKDAL--LSLDEKSDVGWRL 360

Query: 287 -----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 341
                TC    L +S +E ++ A +  I   KT+AL S    + VV+      ++     
Sbjct: 361 LKWFHTC----LKKSDDEFER-AQQDQI--SKTIALVS---KRAVVAEIGNATQM----- 405

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
             F  +LM+            +DW++A       +  Y   YP+++PL+    +   +  
Sbjct: 406 --FQSHLMR------------QDWEQAYKSGLHLLGQYLLRYPRYYPLVSQHMFFVAQCM 451

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMK--ELILKLEEAQAEASYKLSS 452
           W  G T+  I  +    + L IT+G+    MK   LI +++   A+A Y+ S+
Sbjct: 452 WNSGQTDETIFLLGVIKKCLEITYGS---VMKSHHLIGQVDSLLAQALYEQSA 501


>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 547

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 129/260 (49%), Gaps = 29/260 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSN-----------LKKCSACQVVWYCGSNC 55
             +V++   P + +   +   + C GC+ +S            +K C+ C+VV+YC  NC
Sbjct: 58  GADVLVVDNPLIALVEEAQLQNICSGCYDTSKAGSIDNRRPDLVKACTRCKVVYYCDKNC 117

Query: 56  QKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLV 114
           Q+ DWK  H LEC+  + L     K +   +R +L++ + R+   D +IP   +++ +L 
Sbjct: 118 QRKDWKAGHSLECKTYAEL---YPKILPLPVRAVLRILMLRRA--DKIIPEVYSESLALT 172

Query: 115 EALVAHMSDIDE-KQLLLYA----QIANLVNLILQWPEISINEIAENFSKLACNAHTICN 169
              +    +  E K  LL A    + +NL +L  +W       +A  F +L  N+  + +
Sbjct: 173 YPKIYDCVESQETKDHLLMAKALREYSNLTDLDNKW-------VASLFGRLNANSFCLTS 225

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
           +  R  G   +P  +  NHSC PNA   F      +RA++ + K  ++ ISY++T  S  
Sbjct: 226 AFGRRRGVYFHPGPARFNHSCDPNASYSFAKGKCYIRAIRPIAKDEQIFISYVDTTYSVG 285

Query: 230 TRQKALKEQYLFTCTCPRCI 249
           TR+  L+E+Y F C CP+C+
Sbjct: 286 TRRHELQERYRFECQCPKCL 305


>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
           abelii]
 gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
           gorilla gorilla]
 gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
 gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
          Length = 258

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/276 (30%), Positives = 133/276 (48%), Gaps = 43/276 (15%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           + E   K+ CN+ TICN+E++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ + 
Sbjct: 1   MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
            G E+ I Y++   ++  R+K L++QY F C C RC                        
Sbjct: 61  VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 96

Query: 273 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
                 +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+   
Sbjct: 97  ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 147

Query: 333 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 390
            E+L         +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ 
Sbjct: 148 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 199

Query: 391 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           G+Q    GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235


>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 477

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/432 (25%), Positives = 183/432 (42%), Gaps = 42/432 (9%)

Query: 16  PYVCVPNNSSSISRCDGCF---ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSR 72
           P+  +  N+     C  C     +  L KC++C+   YC   CQ++ WK HR EC+ L  
Sbjct: 23  PFAHICMNNKVDEFCSNCLRIPKTKKLLKCASCEFARYCDKECQRIAWKHHRNECRRL-- 80

Query: 73  LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE-----ALVAHMSDI--D 125
                 K+V P++ L   L++ R + +  +      D Y          L+ H  DI  D
Sbjct: 81  ------KAVFPNLPLTEVLFMSR-IVDRVLFLEANGDKYGWERDRKWSELLGHEEDIKAD 133

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +   + +I N +++  +   I  ++    F K + N+H+I  S    +G  L   +S 
Sbjct: 134 QAKYEHFEKIFNKMSIFRKDEMIDKDKFYIIFCKTSINSHSIHTSAGTEVGMALDLGVSA 193

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAE---VLISYIETAGSTMTRQKALKEQYLFT 242
            +HSC PN  LVF+G  A +R +      ++     ISYI+   S   R+K LK ++ F 
Sbjct: 194 YDHSCRPNCSLVFDGYQACLRPLTPQTNASDPRTSFISYIDIGRSRYQRRKELKAKWYFD 253

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQQCGLVRSKEEI 300
           C C RC             IL   RC D  C   L+   D++    TC +CG V  +E +
Sbjct: 254 CACERCCDPA-------DNILTSIRCSDASCDEPLITAEDAEATAITCPKCGQVADEEYV 306

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
           +     +  L  K    ++  + QE++     +   QK +Y    +  +QT   ++ I  
Sbjct: 307 RAAQQMMLRLPAKFSPESNPDSLQELLDEASNVLH-QKNIY----ITRLQT--AIMHITG 359

Query: 361 ELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
            L+D    L + Q  +   Y+  +P     +G Q     K     G  + A+    EA+ 
Sbjct: 360 TLKD---NLPFIQHQVYENYKMCFPYADRHIGYQLLQIVKAYIEKGQRKEAVSYAFEAMN 416

Query: 420 ILRITHGTNSPF 431
           I  I  G   P+
Sbjct: 417 IFEICFGLQHPY 428


>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
          Length = 326

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 27/319 (8%)

Query: 125 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           +EK+ L+ + IA L       L++P+ S   +   F+++ CN  TI + EL  LG+ ++P
Sbjct: 30  NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 87

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y F
Sbjct: 88  DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 147

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 301
           TC C  C                    KD   +   +R    K  +  Q   VR      
Sbjct: 148 TCECRECT------------------TKDKDKAKVEIR----KLSSPPQAEAVRDMVRYA 185

Query: 302 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
           +   E    +K      S     E++   ++ ++    ++   +V ++    + + + + 
Sbjct: 186 RNVIEEFRRAKHYKYNLSVSPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 245

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
           ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+
Sbjct: 246 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 305

Query: 422 RITHGTNSPFMKELILKLE 440
            + HG + P++ E+  ++E
Sbjct: 306 EVAHGKDHPYISEIKQEIE 324


>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
 gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
          Length = 544

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 124/257 (48%), Gaps = 22/257 (8%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSN--------LKKCSACQVVWYCGSNCQK 57
           V G V++ Q P+V V +       C GC    +        LK C+ C+VV YC   CQ 
Sbjct: 80  VGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKACTGCRVVKYCNKTCQA 139

Query: 58  LDWK-LHRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
            DWK +H  EC+V     KE    V PS  R +L++ LR +            +   L  
Sbjct: 140 KDWKFVHSFECRVY----KELMPRVLPSNARAILRMILRSERGK------YPKEELELFR 189

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELR 173
            L +HM++I  +    + +I+     + ++   +++E  +    +KL  N+    +    
Sbjct: 190 KLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSFNFHSPLAD 249

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
            LG  L+P  ++INHSC  N+V+  +G    V+A++ + KG E+L+SYI+       RQ 
Sbjct: 250 RLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKIRQT 309

Query: 234 ALKEQYLFTCTCPRCIK 250
            L E+Y F C CP+C K
Sbjct: 310 ELLERYYFKCRCPKCGK 326


>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
          Length = 565

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 91/311 (29%), Positives = 150/311 (48%), Gaps = 49/311 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  ++S  P++   +       C  C +  N+++C+ C+ V YC + CQ  DW +H+ EC
Sbjct: 69  GTRLLSVRPHIHAVSARFLEDNCTLCTSEENVRRCTRCKKVAYCSTECQTADWGIHKQEC 128

Query: 68  QVLSRLDKEKRKSVTP--SIRLMLKLYLRRKLQNDNVI---------------PSTTTDN 110
           Q L R  +      TP  SIR + +L   R ++  + I                S+T ++
Sbjct: 129 QSLRRWAEASGSDSTPADSIRAISRLLWMRNIKGADSIWWRQIAAMQSNREHLSSSTQES 188

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
           Y+       H++    + L+LY +I +   L  ++   S  ++ +  SK   N+ T+ ++
Sbjct: 189 YT-------HLA----QSLVLYMKIESPEGL-REYGIESGKDLVDLMSKFTTNSFTLTST 236

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEG-------RLAVVRAVQHVPKGAEVLISYIE 223
            L  +G    P+ ++INHSC PNAV+VF          L V+ A+Q + +G EVL +Y++
Sbjct: 237 FLNAIGVATAPIPALINHSCQPNAVVVFPATRKGAPPTLDVI-AIQPIRRGEEVLAAYVD 295

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 283
                  RQK+LKE Y F C+C  C KL    D++E       +C +    GFL   S  
Sbjct: 296 ITLPREIRQKSLKETYAFECSCTLC-KLPGTLDLRECVFCP--KCNE----GFL---SSH 345

Query: 284 KGFT--CQQCG 292
           KG +  C +CG
Sbjct: 346 KGQSAPCSKCG 356


>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
          Length = 527

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 78/263 (29%), Positives = 127/263 (48%), Gaps = 34/263 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSN-----------LKKCSACQVVWYCGSNC 55
             +V++   P V +       + C GCF +S            +K C  C+VV+YC  NC
Sbjct: 39  GADVLVIDTPMVALVEEKQLKNICSGCFDTSKAASIDSRRPGLVKACVRCKVVYYCDKNC 98

Query: 56  QKLDWKL-HRLECQVLSRLDKEKRKSVTP-SIRLMLKLYLRRKLQNDNVIPSTTTDNYSL 113
           Q+ DWK  H +EC + S+L       + P  +R +L++ L +    D V P    +   L
Sbjct: 99  QRKDWKAGHSVECAIYSKL----YPRILPLPVRAILRVILSQ--DKDKVSPYIHEEFLDL 152

Query: 114 VEALVAHMSDIDEKQLLLYAQIAN----LVNLILQWPEISI---NEIAENFSKLACNAHT 166
             +    + D DE +       AN    +   +L++ +I     + + + F KL+ N+ T
Sbjct: 153 --SYPKEIYDDDENEE------ANDHFLMAEAVLKYGDIEYLDEDYVEQLFGKLSANSFT 204

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 226
           + ++  +  G  L+P  +  NHSC PNA   F+     +RA + + KG ++ I YI+T  
Sbjct: 205 LTSAFGKRRGVYLHPAAARFNHSCNPNASYSFDKGKCYIRATKPIAKGEQIFIPYIDTTY 264

Query: 227 STMTRQKALKEQYLFTCTCPRCI 249
           S  TR+  LKE+Y F C CPRC+
Sbjct: 265 SVGTRRHELKERYKFDCQCPRCL 287


>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
          Length = 475

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 111/440 (25%), Positives = 184/440 (41%), Gaps = 48/440 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC-FASSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G  I++  PYV          RC  C      L +C+ C++ +YC  +CQK  WK+H+ 
Sbjct: 25  TGMNILNSTPYVHTLALKYHGVRCAQCCVFKPKLLRCAGCKLQYYCSKDCQKSAWKIHKE 84

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN---DNVIPSTTTDNYSLVEALVAHMS 122
           EC++L     +  KS+   I L L   L R ++N   D+  P+   D++  V  L +H  
Sbjct: 85  ECKML-----KSEKSMPDDITLFLGRILIR-VENEFGDSDEPNNKIDSFIHVLQLESHFD 138

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEI----SINEIAENFSKLACNAHTICNSE--LRPLG 176
            +D  Q           NL  + P      S  ++ +  + +  N   IC+ E  +  +G
Sbjct: 139 RLDPSQQEDLGIFLMKFNLYWKKPIPGSLRSSRKLLDLVAAIKNNQFAICDEESSICDIG 198

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           + LY   S+INHSC PNA  VF G   V++A++ +  G E+ I+Y +T      R+  L 
Sbjct: 199 SALYLNHSLINHSCKPNAFPVFNGTNLVIKALEKIAPGEEIKIAYTDTKAVIQDRRDYLN 258

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 296
           + Y F C C  C    + D           R K     G ++R SD    + +   +VR 
Sbjct: 259 DIYRFVCQCQGCTNDDEVD-----------RKKHLDKKGNVIRRSDAIWQSAET--MVRD 305

Query: 297 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
            EE KK   E N++             +E    +     L +K     ++  ++  E   
Sbjct: 306 MEEFKK-NKEWNLM-------------KEAAQGW-----LARKFLPDENIFWIRLNEFAF 346

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
              +E ++W   L      I  Y+ +Y   HP LG+      K+   +     A +    
Sbjct: 347 DAGIETQEWSLCLETGASLIVNYKEIYGPMHPTLGIHLMKFAKILLHIEKPNEAEEYFRR 406

Query: 417 AVEILRITHGTNSPFMKELI 436
           A  I+ + +   S   K+L+
Sbjct: 407 AFAIMSLFYEPESAVRKQLL 426


>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 456

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 115/455 (25%), Positives = 188/455 (41%), Gaps = 54/455 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISR-CDGCFASS----NLKKCSACQVVWYCGSNCQKLDWK- 61
           G+ ++   P   V  +   ++R C+ CF S      L  CS C    YC   CQ  +WK 
Sbjct: 23  GQTVLLNPPLAFVLRHEERVARRCEDCFVSEKPEHRLANCSLCHTAAYCSKPCQTRNWKR 82

Query: 62  LHRLECQVLSRLDKEKR-----KSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE- 115
            H+  C++L  L +  +      +   ++  ++ L  R KL+ D      + D  S V  
Sbjct: 83  AHKHVCKLLQTLPENPQPPHIIDAAAMTVATLVALERRAKLE-DKESEQASPDPGSAVRQ 141

Query: 116 -------ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 168
                  A+  H   ++ ++L     +  LV +           + +   +  CN  +I 
Sbjct: 142 PRCADFWAMAQHTPTLNSEEL---DDVLQLVAVTQCPGSTDKQRVMDVLQRADCNNFSIW 198

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAG 226
           +  L P G G+YP  +I+NHSC PN V+ + G L    V+A++ +  G E+  SYI+   
Sbjct: 199 DELLLPRGAGVYPWGAILNHSCEPNCVMTYRGPLHAQAVKALRDIAVGEELCHSYIDLYA 258

Query: 227 STMTRQKALKEQYLFTCTCPRCIK--LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
            T  R   L +QY F C C   +   L + D + E    E          G +L + + +
Sbjct: 259 PTGQRHSHLGDQYGFECDCALYLDGALAELDALPEVTAAEMR------LPGKMLHELECE 312

Query: 285 GFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 343
               Q+   L R+ E +K  A                   +E+V   +    L ++L HP
Sbjct: 313 AEERQRLLSLYRAAETLKDYAWST--------------PEEELVCLLQGYNIL-RRLAHP 357

Query: 344 FSVNLMQTREKLIKILME---LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
            +++L     +L  +  E   LED   AL   Q     Y  VYP+ +PL GLQYY  G +
Sbjct: 358 ANISLTSIMTRLQNVATECGRLEDI--ALPVGQHLALAYDHVYPEHNPLCGLQYYRLGDV 415

Query: 401 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
                    A+     A ++LR TH  +   + +L
Sbjct: 416 ANLAQKQALALLWHRRAYQVLRTTHDADHWLLTQL 450


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 111/455 (24%), Positives = 199/455 (43%), Gaps = 53/455 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G   + ++PYV V +++S    C GCF  ++++++CS+C+VV YC   CQ+ DW +H+ E
Sbjct: 29  GLEFMMEKPYVAVVDDASLNQTCSGCFRLAAHMQQCSSCKVVQYCSQTCQRSDWSIHKPE 88

Query: 67  CQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+      K  +  + PS +RL+ ++  +R             +   +V  L +H     
Sbjct: 89  CEGF----KAVQPRIPPSPVRLLGRMMFKR--------AKDCNEFERVVGQLESHRDKRA 136

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
            K +   A +  + +  +  P + ++  A+  + L C      N+     G  +Y  +S 
Sbjct: 137 SKDIEHIAAMLQMASGFIP-PALLLSTTADMIA-LCCKIQV--NTMTTERGVAIYDRLST 192

Query: 186 INHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           +NHSC+PNA L F  G +A +  +  +  G ++ ISY++   S  TRQ+ LKEQY F CT
Sbjct: 193 VNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQLKEQYYFDCT 252

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
           C  C      +  Q+        C     S   L  S              +K++    +
Sbjct: 253 CRLCTANANGESRQQ--------CISQFDSAVDLYQS--------------TKKQTNDAS 290

Query: 305 SEVNILSK--KTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 361
           +  +IL++  ++L L  C +H  ++     +IE L  +  +   + L   +    + +  
Sbjct: 291 ASQSILTRALESLELIVCKSHSVLIRLRTDLIEILVGQRNYAKVIELSNDQLDAFRCV-- 348

Query: 362 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-WFLGDTENAIKSM-TEAVE 419
              WK + +  +  I      YP + P+  +      K   W + D  + +  +  +A  
Sbjct: 349 ---WKASQSI-EKGISESDNAYPVW-PMESVSSVDIFKASMWTMDDDLDRVAELGKQAAH 403

Query: 420 ILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 454
            LRITHG   P   +L   LE+   E   + S  D
Sbjct: 404 NLRITHGEKHPMYLDLTEGLEDVLRELEARHSGVD 438


>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
          Length = 467

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 97/449 (21%), Positives = 187/449 (41%), Gaps = 91/449 (20%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +V+ ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAGRIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVF-EGRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLH 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R++ L++QY F C+C  C K G  DD+ ++                 +++      
Sbjct: 258 LSEERRRQLRKQYYFDCSCEHCQK-GLKDDLFQA-----------------VKEDPKVVK 299

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
            C++C      E+ + + ++ N+   + L++ S                           
Sbjct: 300 LCREC-----LEKQESVFADTNLYVLRLLSIVS--------------------------- 327

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
                     ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+
Sbjct: 328 ----------EVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGN 377

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
            E     + +A  IL +THG + P  K+L
Sbjct: 378 IEVGHGMICKAYAILLVTHGPSHPITKDL 406


>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
          Length = 251

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 82/267 (30%), Positives = 129/267 (48%), Gaps = 43/267 (16%)

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y
Sbjct: 3   CNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICY 62

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 281
           ++   ++  R+K L++QY F C C RC                              +D 
Sbjct: 63  LDMLMTSEERRKQLRDQYCFECDCFRCQT----------------------------QDK 94

Query: 282 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 341
           D    T  +      +E +KKI  E+    K    L  C   Q V+S+    E+L     
Sbjct: 95  DADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAVISSNA--ERLP---- 144

Query: 342 HPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 399
               +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GK
Sbjct: 145 ---DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGK 201

Query: 400 LEWFLGDTENAIKSMTEAVEILRITHG 426
           L+   G    A+K++  A +I+R+THG
Sbjct: 202 LQLHQGMFPQAMKNLRLAFDIMRVTHG 228


>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 556

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 133/310 (42%), Gaps = 48/310 (15%)

Query: 11  IISQEPYVCVPNNSSSISRCDGCFASSNLK-------KCSACQVVWYCGSNCQKLDWKLH 63
           I++ +P   V   S + + C  CF S + +       +CSAC  + YC   CQK DW+ H
Sbjct: 6   IMAVQPLSAVLEESQARTHCARCFESMDNQHSGGRGSRCSACSRICYCSRKCQKADWREH 65

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS----------- 112
           R EC+  +       ++ T ++RL  ++       +++   S  T N             
Sbjct: 66  RPECKAWA--SNSSARTPTRTLRLAGRILNAINRSDNSNTGSMGTVNGGEDDGSSHAPSV 123

Query: 113 --LVEALVAHMSDIDEKQLLLYAQIANLVNLI-------------LQWPEIS-------I 150
              V+ LV H  D   +Q   Y  +AN V  +             L WP          +
Sbjct: 124 REAVDELVHHNDDRSPEQKEEYMLMANFVARLCLAGCGDSKKGSALLWPSAQGRGLPGLV 183

Query: 151 NEIAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
           +       KL+CN  +I  S L   +G GLY   +  NHSC PNA   F G+   +R  +
Sbjct: 184 DAAYAVLGKLSCNVFSIAESALNGEVGCGLYLEAAAANHSCNPNAAQSFSGKTLSLRCTR 243

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRC 268
            + KG E+ I   +       R+++L++ Y F C C RC    G+ +D++    LE + C
Sbjct: 244 PIRKGEEITIGITQIQKPGPARRESLRKTYFFECRCERCESPEGRAEDMR----LEAFAC 299

Query: 269 KDDGCSGFLL 278
            D  CSGF L
Sbjct: 300 PDSECSGFCL 309


>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 645

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 125/277 (45%), Gaps = 40/277 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA--SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G+V++S +P V   +       C  CF   S  LK+C+ C+VV YC S CQ  DW  H+ 
Sbjct: 95  GDVLLSVKPNVAALSIQHLEDYCSNCFGPGSEPLKRCTICKVVHYCDSKCQSADWIFHKQ 154

Query: 66  ECQVLSRLDKEKRKSV-----------------TPSIRLMLKLYLRRKLQNDNVIPSTTT 108
           EC  L R    +  +                    +IR + ++  RR+   D  I +   
Sbjct: 155 ECVALQRWVSNRSAAAEPSSSLSDSSPPLPRVPNDAIRCLARILWRRQKMGDASIWAREI 214

Query: 109 DNYSLVEALVAHMSDIDEKQL-----------LLYAQIANLVNLILQWPEISINEIAENF 157
           D        ++   +  + QL           L  + +  L    +Q    + +++ +  
Sbjct: 215 DAMQSHRTSLSKDPNARDSQLHTFLAHSLVRYLGLSSLEELAAFGMQ----NTSDLVDLI 270

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGR-LAVVRAVQHV 211
           S+   N  TI +  L PLG  + P +++INHSC PNA +VF     EG  L  V A++++
Sbjct: 271 SRFTTNTFTITSPTLAPLGACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQVVALKYI 330

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
               E+L +YI+T   T  RQ+ALKE Y F C CP C
Sbjct: 331 GPDEEILTAYIDTTLPTGLRQQALKETYHFVCECPLC 367


>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 311

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 74/248 (29%), Positives = 122/248 (49%), Gaps = 10/248 (4%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN--LKKCSACQVVWYCGSNCQKLDWK-LHR 64
           G+ I+   P+            CD CF++    + +C+ C V++YCG  CQ+  WK  H+
Sbjct: 6   GDEILKCYPFSYAFREEHQHKVCDFCFSTGQAIISRCARCCVIYYCGKQCQRSSWKETHK 65

Query: 65  LECQVLSRLD-KEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            EC+ L  LD KE    +   IR + KL           +P+  +  +     L++H  +
Sbjct: 66  HECKYLKMLDIKEPHMDLLLIIRTLCKLRYDGGYSKKVSLPNGCSRRFG---DLMSHKEN 122

Query: 124 I-DEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
           +  + QL  + Y  I  + N +    EI+ +E+ + F+K++ N+  I N +    G+ L 
Sbjct: 123 VLMDPQLYTMFYRYIGIIKNKLGGALEINESEVLDIFTKMSINSVFILNGKFLNFGSCLS 182

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
              S I+HSC PNAV +F GR  VV+A+  +    +V ++Y +       R+  LK+QY 
Sbjct: 183 LEFSAIDHSCRPNAVYMFIGRTLVVKALCDIANFDDVRVTYTDITQPRRMRRNFLKDQYF 242

Query: 241 FTCTCPRC 248
           F CTC  C
Sbjct: 243 FDCTCEEC 250


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 131/280 (46%), Gaps = 37/280 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-----LKKCSACQVVWYCGSNCQKLDWKL 62
           G  ++S  P+V V +     S C  C A ++     LK+C+ C VV YC   CQ  DWK 
Sbjct: 55  GSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPERPQLKRCAKCHVVHYCSQKCQNADWKA 114

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLY----LRRKLQNDNVIPSTTTDNYSLVEALV 118
           H+ EC  L R       +  PS  L +       L R L      PS+       ++++ 
Sbjct: 115 HKPECDALQRW-----ATAAPSPDLAVPEEAVRCLGRMLWQKQRNPSSIWSRE--IDSMQ 167

Query: 119 AHMSDIDEKQLLLYAQIAN-LVN-LILQWPE-------ISINEIAENFSKLACNAHTICN 169
           +H S +  +    +A +A+ LV  L L  PE        S  ++ +  S+   NA T+  
Sbjct: 168 SHRSSLRPESFESHAYLAHALVRYLGLDAPEKLASFGLRSAGDVVDLISRFTTNAITLAA 227

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVPKGAEVLIS 220
             L PLG  + P ++++NHSC PNAV+VF         E  + V+ A++ +    EV  +
Sbjct: 228 PSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQEPVMQVI-ALRDIHPDEEVTTA 286

Query: 221 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
           YI+T      RQK L++ Y FTC+C  C    +  D +ES
Sbjct: 287 YIDTTVPREQRQKILRDTYHFTCSCSLCA--AEDPDPRES 324


>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
           glaber]
          Length = 252

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/270 (29%), Positives = 131/270 (48%), Gaps = 43/270 (15%)

Query: 159 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 218
           ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ 
Sbjct: 1   QVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFSGPHLLLRAVRDIEVGEELT 60

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           I Y++   ++  R+K L++QY F C C RC                              
Sbjct: 61  ICYLDMLMTSEERRKQLRDQYCFDCDCFRCQT---------------------------- 92

Query: 279 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 338
           +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L  
Sbjct: 93  QDKDADMLTGDEQVWKGVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNS--ERLP- 145

Query: 339 KLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 396
                  +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q   
Sbjct: 146 ------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMK 199

Query: 397 CGKLEWFLGDTENAIKSMTEAVEILRITHG 426
            GKL+   G    A+K++  A +I+R+THG
Sbjct: 200 VGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 229


>gi|344296426|ref|XP_003419908.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Loxodonta
           africana]
          Length = 427

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 99/438 (22%), Positives = 190/438 (43%), Gaps = 47/438 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+++ S   Y CV             F+    L KC  C+  +YC  +CQ+ DW +H+LE
Sbjct: 30  GDLLFSCPAYACVLTGQERGIYLPFLFSRKEGLSKCGRCKQAFYCNVDCQREDWPMHKLE 89

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID- 125
           C  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+  +D 
Sbjct: 90  CTPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHLDKLDN 143

Query: 126 EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           EK+ L+ + IA L +     L++P+     +   F+++ CN  TI + EL  +G+ ++P 
Sbjct: 144 EKKDLIQSDIAALHHFYSKHLEFPD--TESLVVLFAQVNCNGFTIEDEELSHVGSAIFPD 201

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
             +   S +P ++ VF+  L    +         V  SYI+    T  R   L++ Y FT
Sbjct: 202 PVV---SGVPGSLTVFQAHLF---SQMFASLCGMVFTSYIDLLYPTEDRNDRLRDSYFFT 255

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C  C         +++A +E  R   D      +RD            +VR    + +
Sbjct: 256 CECQECTT-----KDKDNAKVE-IRKLTDPPKAEAIRD------------MVRYARNVIE 297

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
                      +  L  C   QE +S+          ++   +V ++    + + + + +
Sbjct: 298 EFRRAKHYKSPSELLEMCALSQEKMSS----------VFEDSNVYMLHMMYQAMGVCLYM 347

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           +DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ 
Sbjct: 348 QDWEGALRYGQKIIQPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIME 407

Query: 423 ITHGTNSPFMKELILKLE 440
           + HG + P++ E+  ++E
Sbjct: 408 VAHGKDHPYISEIKHEIE 425


>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
 gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe]
          Length = 483

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 122/247 (49%), Gaps = 25/247 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G++II +   +   ++++    C  C       ++C+AC+++ YC   CQK DW  H+LE
Sbjct: 27  GKIIIRKRVDILSLDSANLTRTCSTCTEEKVKTQRCAACKIIHYCSKGCQKADWPFHKLE 86

Query: 67  CQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ L      K+  + PS+ RL+++LYL    Q +  I      + +  +A+ +  SD +
Sbjct: 87  CKALQ---ASKQNGILPSVCRLLIRLYLL--WQKNPAIIEPMEGHQNEFQAVSSSWSDAE 141

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLAC----NAHTICNSELRPLGTGLYP 181
                L A  A+    I Q         AE F KL C    NA  +  S    LG  L  
Sbjct: 142 -----LIASAASHYTQIYQ---------AELFQKLFCRLAVNAMNLVTSSFDSLGMCLDT 187

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           ++  +NHSC PN  ++F+G +  + + + + K  ++ ISYI+       RQK L ++Y F
Sbjct: 188 ILCRLNHSCDPNCQIIFDGAIVQLVSKRDIKKDEQLFISYIDIRLPKSIRQKQLLKKYFF 247

Query: 242 TCTCPRC 248
           +C CPRC
Sbjct: 248 SCYCPRC 254


>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 582

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 112/439 (25%), Positives = 181/439 (41%), Gaps = 68/439 (15%)

Query: 6   VSGEVIISQEPYVCV---PNNSSSIS------RCDGCFASSNLKKCSACQVVWYCGSNCQ 56
            +GE ++  +P V V   PN     +       C+ C+     ++CS C+ V+YCG+ CQ
Sbjct: 90  TAGETLLKTKPLVSVLSKPNLYEPAAGQKMELHCNHCYLLKPAQRCSGCKGVYYCGAACQ 149

Query: 57  KLDWKLHRLECQVLSRLDK-------EKRKSV-------TPSIRLMLKLYLRRKLQNDNV 102
           + DW  H+ EC+ L+R+ +       EK K             R +  L   RK   D  
Sbjct: 150 QADWPSHKTECKALTRVRQLWVQSYPEKAKDGLNNSWIQAEGARALGLLTWARKAYRDQH 209

Query: 103 IPSTTTDNYSLVEALVAHMSDIDEK------------QLLLYAQIA------NLVNLILQ 144
                 D +  VE + A   D+                L  Y   A      N + LI  
Sbjct: 210 --GRDPDYWPKVEKMYAEGPDMGTGVHGIPAADQMAIHLSYYVGAAEPPKDPNNLELIDM 267

Query: 145 WPEISINEIAENFSK-LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GR 201
            P    +E   +F +    NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+
Sbjct: 268 EPYGFQDEQLMSFVRSFVLNAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGK 327

Query: 202 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 261
             V  A   +  G EVL +Y++ +    TRQ  L+ +Y F C CP C      D +    
Sbjct: 328 EMVAVANADIAAGEEVLTTYVDISDDKETRQGDLQSRYGFECECPACT----LDAV---- 379

Query: 262 ILEGYRCKDDGCSGF------LLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKK 313
             +   C    C G       L + SD +   C  CG     S   ++ + +    LS  
Sbjct: 380 --DPRNCLLHECGGLARMPADLRQMSDSETVKCDTCGASWTVSPNALRDLVN--GGLSLV 435

Query: 314 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 373
                   +  ++    ++I +L+ ++  P S  L++    + K L       EALAY  
Sbjct: 436 EADDQDQLDPDDLPRVAEVIARLE-EIMPPSSRPLLRLH-TIAKNLHVPAAQPEALAYAH 493

Query: 374 LTIPVYQRVYPQFHPLLGL 392
             +    +VYP+ HP + L
Sbjct: 494 KALAGAYKVYPRNHPSVAL 512


>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
          Length = 511

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 121/495 (24%), Positives = 198/495 (40%), Gaps = 80/495 (16%)

Query: 7   SGEVIISQEPYVCVP-----------------NNSSSISRCDGCFASSN-LKKCSACQVV 48
           SG  I+   P + +P                 NN  +I+ C  CF   N    C  C+ V
Sbjct: 22  SGYCIVESHPEIAIPLCVKFMAPRIVDSTLKKNNYKTINICFYCFEKVNKCIYCPNCKYV 81

Query: 49  WYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND--NVIPST 106
            YC  +C +  WK HR EC +  + +   R   T ++RL++  YL      D    I   
Sbjct: 82  AYCSDSCLERAWKFHREECDIY-KSNIFDRYCPTITMRLVIHSYLTHFNFYDYSGTITDL 140

Query: 107 TTDNYSLVE-----ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 161
           T + Y  ++       VA MS    K+  +++      N++          + E F K++
Sbjct: 141 TKEKYENLKYPAYIVAVALMS----KKKKIFSNFDENKNIL--------KNVIEKFIKVS 188

Query: 162 CNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 219
            N   I ++EL P G G Y  PV    NHSCL N + +F+ +   +R +  +  G E+ I
Sbjct: 189 KNTLQIIDNELEPCGLGFYKKPV-PYFNHSCLSNCITIFKNQKLYIRTLMDIYPGEELTI 247

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRCI---------------------KLGQFDDIQ 258
           SY++ A    TR     +QY FTCTC  C                         +F +  
Sbjct: 248 SYLDIAFDRNTRLAICTDQYFFTCTCKLCKVNIPSECHNMFNNDFICTRSENCKKFVNYM 307

Query: 259 ESAILE------GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----------EEIKK 302
           E  ++        Y  K    +  +L+ S DK     +C L + +          E+ K+
Sbjct: 308 EMVLISELERKLNYHNKLHFKAFPILKKSTDKNENIWKCMLCKHETNDSVIKGVIEKEKE 367

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNLMQTREKLIKILM 360
              EV  L        S  N   + S  K+  K+      YH    +L + R K++ I +
Sbjct: 368 TVKEVEYLETLFAEKYSYDNKIILQSLNKIKSKIDYLTTFYHHSRYSLQKMRAKILYISI 427

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
           +L+++K A       +   +  Y ++ P+ G   +  GKL  FL      +  + +A + 
Sbjct: 428 QLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSAGLNLIHKAKKN 487

Query: 421 LRITHGTNSPFMKEL 435
           +  T+G  SP  K+L
Sbjct: 488 IIKTYGPESPIYKDL 502


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 449

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 126/252 (50%), Gaps = 19/252 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE ++ +   + V NN      C  C + +   K+C+AC+ V YC   CQK DW +H+LE
Sbjct: 26  GECLLEKRLDLYVLNNELLNEACSYCCSQTKPTKRCAACKQVHYCSKICQKQDWSMHKLE 85

Query: 67  CQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C+ L          + P++ RL+++LY   + Q D  + +   D+ + ++   A  SD +
Sbjct: 86  CKALR---NASTNGLLPTVCRLLIRLY--SQTQKDQSLFADCKDHEAEIQHNTALWSDAE 140

Query: 126 EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
                L +  A+       + E   +N     F KL+ NA ++       +GT +   ++
Sbjct: 141 -----LISSAAS------HYTEARDVNAFLSLFCKLSINAMSLVTPAFDAIGTCMDSTLA 189

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
            INHSC PN V + EG  A + A+Q + KG  V ISY++T      R   L+++Y FTCT
Sbjct: 190 RINHSCQPNCVFMVEGATARLVALQTLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCT 249

Query: 245 CPRCIKLGQFDD 256
           C +C++  +  D
Sbjct: 250 CQKCMEEARRSD 261


>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 495

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 74/254 (29%), Positives = 125/254 (49%), Gaps = 22/254 (8%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFA------SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +V+  + P++ V ++      C GCF        + LK C+ C+VV YC   CQ  DWK 
Sbjct: 38  DVLHIKTPFLAVLDSPRLEDTCAGCFGKRQVETGNELKACTGCRVVKYCDRTCQSKDWKF 97

Query: 63  -HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            H LEC +     K  +  V P+  R +L++ LR      +      ++   + + L  H
Sbjct: 98  AHSLECPIF----KNVKPMVLPNNARALLRVVLRTARNKYD------SEESKVFDGLETH 147

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPLGTG 178
           +++I E Q  L  +I      +  +    ++E  +A   +KL  N+  +  S    +G  
Sbjct: 148 INEISESQGQL-DRINLTARAVKNYSGTEMDEGTVASYAAKLDLNSFNLTTSMYDRIGLY 206

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P   +INHSC  N+ + F+G    V+A++ + KG ++ ISYI+T      R+  LKE+
Sbjct: 207 MHPYAGLINHSCDYNSTVGFDGEELYVKAMRPIKKGEQIFISYIDTTTPYDIRRNELKER 266

Query: 239 YLFTCTCPRCIKLG 252
           Y F C C +C K+G
Sbjct: 267 YFFDCQCTKC-KMG 279


>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
          Length = 518

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 72/238 (30%), Positives = 114/238 (47%), Gaps = 29/238 (12%)

Query: 30  CDGCFASS--------NLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKS 80
           C  CF +         +LK C+ C VV YCG  CQ   W   H+ EC++         K 
Sbjct: 58  CSNCFVTVGDELNPDLSLKACAGCHVVKYCGKKCQIESWAASHKKECKIY--------KQ 109

Query: 81  VTPSIRLM-LKLYLRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDIDEK------QLLLY 132
             P I  M  +  LR   Q DN +     D ++++   L  H S +DE+      ++ + 
Sbjct: 110 CQPRILPMNARAILRIISQPDNKVSKEIYDTHHAVFRTLGHHFSKMDERGGEQAHRITVS 169

Query: 133 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 192
           A+    ++      E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC P
Sbjct: 170 AEALKAISNT----EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEP 225

Query: 193 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           NA + F+G++  ++A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 226 NAYIGFDGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
           mulatta]
          Length = 399

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 119/251 (47%), Gaps = 18/251 (7%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    +  +  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P + ++NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 257

Query: 240 LFTCTCPRCIK 250
            F CTC  C K
Sbjct: 258 YFDCTCEHCQK 268


>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 426

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 124/250 (49%), Gaps = 14/250 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK--KCSACQVVWYCGSNCQKLDW-KLHR 64
           G+ ++   P+       +    CD CF+S   K  +CS C +++YCG  CQ+L W ++H+
Sbjct: 6   GDEVLRCRPFSYAFKEKNRDKICDYCFSSRQAKILRCSRCCIIYYCGRQCQRLSWSEIHK 65

Query: 65  LECQVLSRLD-KEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            EC+ L  LD KE    +   +R + KL        +  +P+  +  +     L++H  +
Sbjct: 66  KECKYLKMLDLKEPPLDLLFIVRTLCKLKYDGGYSKEVSLPNGRSRKFI---DLMSHKEN 122

Query: 124 I-----DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
           I      +K+ L Y  I  ++ ++    E + +E+   F+KL  NA  + N +L   G  
Sbjct: 123 ILMNPEHKKRFLTYLDI--IIYMLGGSLETNESEVLNIFTKLMINASFMLNEKLIDFGGC 180

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           L    S I+HSC PNA+ +F G   VV+A+  +    +V ++Y++ +     RQ+ LK Q
Sbjct: 181 LCLEFSAIDHSCRPNAIYMFNGHTLVVKALCEIANFDDVRVAYVDMSQPRSIRQELLKNQ 240

Query: 239 YLFTCTCPRC 248
           + F C C  C
Sbjct: 241 FFFDCNCEEC 250


>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 573

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 109/430 (25%), Positives = 172/430 (40%), Gaps = 63/430 (14%)

Query: 8   GEVIISQEPYVCV---PNNSSSIS------RCDGCFASSNLKKCSACQVVWYCGSNCQKL 58
           GE ++  +P V V   PN     +       C+ C+     ++CS C+ V+YCG+ CQ+ 
Sbjct: 92  GETLLKTKPLVSVLSKPNLYEPAAGQKMELHCNHCYLLKPAQRCSGCKGVYYCGAACQQD 151

Query: 59  DWKLHRLECQVLSRLDK-------EKRKSV-------TPSIRLMLKLYLRRKLQNDNVIP 104
           DW  H+ EC+ L+R+ +       EK K             R +  L   RK   D    
Sbjct: 152 DWPSHKTECKALTRVRQLWVQSYPEKAKEGLNNSWIQAEGARALGLLTWARKAYRDQH-- 209

Query: 105 STTTDNYSLVEALVAHMSDIDEK------------QLLLYAQIA------NLVNLILQWP 146
               D +  VE + A   D+                L  Y   A      N + LI   P
Sbjct: 210 GRDPDYWPKVEKMYAEGPDMGTGVHGIPAADQMAIHLSYYVGAAEPPKDPNNLELIDMEP 269

Query: 147 EISINEIAENFSK-LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLA 203
               +E   +F +    NA T+ + +L+P+G  + P++++ NHSC PNA +VF   G+  
Sbjct: 270 YGFQDEQLMSFVRSFVLNAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEM 329

Query: 204 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
           V  A   +  G EVL +Y++ +    TRQ  L+ +Y F C CP C      D +      
Sbjct: 330 VAVANADIAAGEEVLTTYVDISDDKETRQGDLQSRYGFECECPACT----LDAV------ 379

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGN 322
           +   C    C G        K  TC     V S   ++ + +E ++++           +
Sbjct: 380 DPRNCLLHECGGLARMPETVKCDTCGASWTV-SPNALRDLVNEGLSLVEADDQDQLDPDD 438

Query: 323 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 382
              V      +EK+      P S   +     + K L       EALAY    +    +V
Sbjct: 439 LPRVAEVIARLEKIM-----PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKV 493

Query: 383 YPQFHPLLGL 392
           YP+ HP + L
Sbjct: 494 YPRNHPSVAL 503


>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
 gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
          Length = 495

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/250 (29%), Positives = 121/250 (48%), Gaps = 21/250 (8%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFA------SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +V+  + P+V V ++      C GCF        + LK C+ C+VV YC   CQ  DWK 
Sbjct: 38  DVLHIKTPFVAVLDSPRLEDTCAGCFGKRQMQNGNELKACTGCRVVKYCDRACQSKDWKF 97

Query: 63  -HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            H +EC +     K  +  V P+  R +L++ LR       V     ++ + + + L  H
Sbjct: 98  AHSVECLIF----KNVKPMVLPNNARALLRIVLR------TVRNKYDSEEFKVFDGLETH 147

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPLGTG 178
           M++I   Q  L  +I      + ++    ++E  +A   +KL  N+  +  S    +G  
Sbjct: 148 MNEISGSQGQL-DRINLTARAVKRYSGTDVDEGTVASYAAKLDLNSFNLTTSMYDRIGLY 206

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           ++P   +INHSC  N+ + F+G    V+A+  + KG ++ ISYI+T      R   LKE+
Sbjct: 207 MHPYAGLINHSCDYNSTVGFDGEELYVKAMHPIKKGEQIFISYIDTTTPYDIRCNELKER 266

Query: 239 YLFTCTCPRC 248
           Y F C C +C
Sbjct: 267 YFFDCQCTKC 276


>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
 gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
          Length = 498

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 114/234 (48%), Gaps = 23/234 (9%)

Query: 30  CDGCFA----------SSNLK--KCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C  CFA          S ++K   CS C+VV YC   CQ  +WKL H+ EC +  +L   
Sbjct: 55  CSNCFANLPFGVNVMASVDMKLWACSGCKVVKYCDRKCQSANWKLIHKHECAIYKKL--- 111

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++   R    D+V       + ++   L +H+ +I       Y +I 
Sbjct: 112 YPKILPVNSRAVLRIVKLRSSNEDHV-----KSDLNMFLTLRSHLDEITSSNQEQYERIM 166

Query: 137 NLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++   E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNA
Sbjct: 167 LCAKAEKEYSGSELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNA 226

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V+ F+G L +V+A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 227 VVGFDGGLIIVKALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280


>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
          Length = 553

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 72/254 (28%), Positives = 120/254 (47%), Gaps = 12/254 (4%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDW-KLH 63
           G VII+ +P   V    +    C+ CF       L++CS C+ V++C   C    W   H
Sbjct: 33  GTVIITSQPLGTVALPQTINEYCNYCFRKQTRPPLQRCSRCKSVYFCDMGCFSNAWLSYH 92

Query: 64  RLECQVLSRLDKEKRKS-----VTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           +  C   +R+  E  ++     +   + L +  Y +R          T         +L+
Sbjct: 93  QFVCDP-ARVRHEDAENELDLEMLEKVALNVSRYRKRAKTETEAEGETVEVTMEAFFSLM 151

Query: 119 AHMSDIDEKQLLLYAQIAN--LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 176
            H +    + L  + ++A   L    +Q   I  +E+    +    N  T+ + E+  +G
Sbjct: 152 GHDALQAGRVLASHRRLAREALKRAHVQQTSIDEDELVHYLNVFKSNNFTLDDQEMFAVG 211

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            G YPV S+ NH+C PNAV+VF+G LA +RA+  +    E+ ISYI+ A +   R++AL+
Sbjct: 212 EGTYPVASLFNHTCRPNAVIVFDGALAEIRAIDTIEPDTEITISYIDPAHARAHRKRALR 271

Query: 237 EQYLFTCTCPRCIK 250
           E+Y F C C RC +
Sbjct: 272 EKYFFDCRCVRCTR 285


>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
          Length = 518

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 72/238 (30%), Positives = 114/238 (47%), Gaps = 29/238 (12%)

Query: 30  CDGCFASS--------NLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKS 80
           C  CF +         +LK C+ C VV YCG  CQ   W   H+ EC++         K 
Sbjct: 58  CSNCFVTVGDELNPDLSLKACAGCHVVKYCGRKCQIESWAASHKKECKIY--------KQ 109

Query: 81  VTPSIRLM-LKLYLRRKLQNDN-VIPSTTTDNYSLVEALVAHMSDIDEK------QLLLY 132
             P I  M  +  LR   Q DN V+      ++++   L  H S +DE+      ++ + 
Sbjct: 110 CQPRILPMNARAILRIISQPDNKVLKEIYDTHHAVFRTLGHHFSRMDERGGEQAHRITVS 169

Query: 133 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 192
           A+    ++      E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC P
Sbjct: 170 AEALKAISNT----EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEP 225

Query: 193 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           NA + F+G++  ++A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 226 NAYIGFDGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283


>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
          Length = 492

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 75/258 (29%), Positives = 129/258 (50%), Gaps = 21/258 (8%)

Query: 6   VSGEVIIS-QEPYVCVPNNSSSISRCDGCFASS---------NLKKCSACQVVWYCGSNC 55
            +G++I S + P + V +N S  S C  CFAS+         +L+ C+ C  + YCG +C
Sbjct: 25  AAGDLIFSLKRPLIAVLDNVSLDSCCANCFASTGFGATNNDLSLRACTGCSTLKYCGRSC 84

Query: 56  QKLDWKLH-RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLV 114
           Q   WK H + EC VL  L  E++  +  ++R +++  + RK     +I   + +++  +
Sbjct: 85  QSQSWKRHHKHECPVLKTLGAERQ--LPNAVRAVIQTLVMRK---SGLI---SDEDWKNL 136

Query: 115 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPE--ISINEIAENFSKLACNAHTICNSEL 172
           + L AH+ ++ ++       +  L  L     E   S N     + ++  N+ T+  S +
Sbjct: 137 QELPAHLDELRQEPDWDKHAVLALGALKYSMAEDKFSSNIATGIYGRVFTNSLTLIGSAM 196

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
            PLG    P+    NHSC PNA ++ +G     RA++ + K  E+LISYI+       RQ
Sbjct: 197 EPLGICFDPLACSANHSCDPNAFVIMDGAQLSFRALKPIAKDEEILISYIDGTLPFERRQ 256

Query: 233 KALKEQYLFTCTCPRCIK 250
             L  +Y FTC+C +C +
Sbjct: 257 TDLTRRYYFTCSCTKCAQ 274


>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
 gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
          Length = 495

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/394 (26%), Positives = 169/394 (42%), Gaps = 67/394 (17%)

Query: 30  CDGCFASS------------NLKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKE 76
           C  CFA++             L+ C+ C+VV YCG  CQ  +W  +H+ EC    RL   
Sbjct: 52  CSNCFANAPFAANVMADVGIKLRFCTGCRVVKYCGKGCQTENWAAIHKHECATYKRL--- 108

Query: 77  KRKSVTPSIRLMLK-LYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQI 135
             K +  + R +L+ L LR   Q D         +  +  +L +H  +I E     Y +I
Sbjct: 109 YPKVLPVNSRAVLRILKLRSSKQPD------VQQDLGMFRSLRSHFKEIAETNKEQYERI 162

Query: 136 ANLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN 193
                   ++   ++ I  IAE  +K+  N  T       PLG  + P    +NHSC PN
Sbjct: 163 MLCAKAEKEYSHSDLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPN 222

Query: 194 AVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 252
           AV+ F EGR+  V+A++ +    +V ISYI+       RQK L E+Y FTC C +C+   
Sbjct: 223 AVVGFDEGRI-TVKALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKCL--- 278

Query: 253 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK 312
           Q D  +E   +      +D                      + + +E +K A E+     
Sbjct: 279 QGDKAREDQFIPASPPSND----------------------IETLKEAEKQAQEL----- 311

Query: 313 KTLALTSCGNHQEVVSTYKMIEKLQ----KKLYHPFSVNLMQTREKLIKILMELEDWKEA 368
               L S  N + V +  ++   ++     K++        Q R +LI  L+++E +  A
Sbjct: 312 ----LASAKNARPVFAVRQLKAAIKILHDTKMWPITRQPYPQIRSELIVALLQIESFWPA 367

Query: 369 LAYCQLT-IPVYQRVYPQ-FHPLLGLQYYTCGKL 400
           L +  +  + V   +YPQ +HP+     Y   +L
Sbjct: 368 LQHSAVRYLFVDPVLYPQEWHPIRNNHAYILSQL 401


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/262 (28%), Positives = 124/262 (47%), Gaps = 9/262 (3%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK--KCSACQVVWYCGSNCQKLDW 60
           S ++ G+ I +  P V     + + + C  C  S   K  +CS C+   YC   C K  +
Sbjct: 27  SHYIPGDSIATYRPIVYCVTPTMAKACCHWCLNSDAPKYYQCSGCRYAVYCSKECAKAAY 86

Query: 61  KL--HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           KL  HR ECQ++ +L  E   S  P    +    L   +Q D  +  +  +     ++  
Sbjct: 87  KLGTHRRECQLIKKLPAEHASSA-PLTTFLAAAKLHWLVQEDEAVRHSVDEMCRHADSSD 145

Query: 119 AHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
              +D     +LL   +      ++  P + +  + +    L  NA TI N  L+ +  G
Sbjct: 146 TLEADSGSTAILLSRYLDGTQADLIYSPPVPV--MLDLLRVLRYNAVTITNDSLQDVALG 203

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ-KALKE 237
           LY  +S +NHSC PN VL+F G    +R ++ V  GAE+ ISY++   S   ++ + L++
Sbjct: 204 LYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDGAELFISYVDVCISPKAKRCQRLRD 263

Query: 238 QYLFTCTCPRCIKLGQF-DDIQ 258
           QY F C+C RC +  ++ DD+ 
Sbjct: 264 QYKFDCSCERCTREDEYLDDVD 285


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/259 (27%), Positives = 124/259 (47%), Gaps = 33/259 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+VI S+     V  +S +   C  CF     L+KCS C+   YC   CQ+  W  H+ 
Sbjct: 24  AGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHYCDRTCQRAGWAEHKQ 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   K+   +IR++  + L                    VE L  H++D+ 
Sbjct: 84  ECGAI----KAYGKAPNENIRVVSHMQL------------------ITVEELEDHVADMQ 121

Query: 126 EKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           E ++  L   I N ++    WP  S    I++I+  F  + CN  T+ +   L+ +G GL
Sbjct: 122 EDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVSDQRGLQAVGVGL 178

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P + ++NH+C PN  ++       +R++  + +G E+ ++Y++    +  R++ LK QY
Sbjct: 179 FPNLCMVNHNCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQY 238

Query: 240 LFTCTCPRCIKLGQFDDIQ 258
            F C C  C K G  DD++
Sbjct: 239 FFDCQCDYC-KNGTKDDLK 256


>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
 gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
          Length = 498

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 112/234 (47%), Gaps = 23/234 (9%)

Query: 30  CDGCFAS------------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C  CFA+              L  CS C+VV YC   CQ  +WKL H+ EC +  +L   
Sbjct: 55  CSNCFANLPFGVNIMANVDMKLWACSGCKVVKYCDRKCQSANWKLIHKHECAIYKKL--- 111

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++   R    D+V       +  +  +L +H+ +I +     Y +I 
Sbjct: 112 YPKILPVNSRAVLRIVKLRNSNEDHV-----KSDLIMFLSLRSHLDEITKSNQEQYERIM 166

Query: 137 NLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++   E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNA
Sbjct: 167 LSAKAEKEYSGSELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNA 226

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V+ F+G L  V+A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 227 VVGFDGGLITVKALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280


>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 517

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 186/466 (39%), Gaps = 78/466 (16%)

Query: 26  SISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSR--LDKEKRKSVT 82
           +++ C  CF   S    C  C+ V YC   C +  WKLHR EC+V      DK       
Sbjct: 65  TVNLCFYCFEKFSKSFYCPNCKYVVYCSEVCLERGWKLHRDECEVFKSNIFDK-----FC 119

Query: 83  PSI--RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVN 140
           PSI  R+++  YL                +++  E     ++D+ +++   +   A +V 
Sbjct: 120 PSIIMRMVINSYL---------------SHFNFYE-YCGSVADLPKEKYEYFKYPAYIVA 163

Query: 141 LILQ------WPEISINE-----IAENFSKLACNAHTICNSELRPLGTGLYP-VISIINH 188
           + L       +     NE     I E F K++ N   I ++EL P G G Y   +   NH
Sbjct: 164 VALMSKKKKIFANFEDNESILRNIIEKFIKISKNTLQIIDNELEPAGLGFYKKPVPYFNH 223

Query: 189 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SCL N V +F+ +   +R +  V  G E+ ISY++ A    TR     +QY FTCTC  C
Sbjct: 224 SCLSNCVTIFKNQKLFIRTLMDVYPGEELTISYVDIAFDKNTRLAICMDQYFFTCTCKLC 283

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGF------------------------------LL 278
            K+    + Q     E    + + C  +                              L 
Sbjct: 284 -KVNVVSECQNVFNTEFVCTQSESCKKYVNHMELILISELERKSCYVNKSQFKNSYPVLK 342

Query: 279 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 338
           + ++   + C  C        IK +  +     K+T  L S  N +       +++ L K
Sbjct: 343 KANEQHVWKCSICKNEVHDNVIKSLLEKEKETVKETAHLESLFNEKYTYDNKSVLQSLNK 402

Query: 339 ---------KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
                      YH    +L + R K++ I ++L+++K A       +   +  Y ++ P+
Sbjct: 403 IKSKIDYLTTFYHHTRYSLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPI 462

Query: 390 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            G   +  GKL  FL      +  + +A + +  T+G +SP  K+L
Sbjct: 463 YGYYIFLTGKLALFLDLKSEGLSLIHKAKKNIIKTYGPDSPIYKDL 508


>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
 gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
          Length = 498

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 113/234 (48%), Gaps = 23/234 (9%)

Query: 30  CDGCFA----------SSNLK--KCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C  CFA          S ++K   CS C+VV YC   CQ  +WKL H+ EC +  +L   
Sbjct: 55  CSNCFANLPFGVNVMASVDMKLWACSGCKVVKYCDRKCQSANWKLIHKHECAIYKKL--- 111

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++   R    D+V       + ++   L +H+ +I       Y +I 
Sbjct: 112 YPKVLPVNSRAVLRIVKLRDSNEDHV-----KSDLNMFLTLRSHLVEITNSNQEQYERIM 166

Query: 137 NLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++   E+ +  IAE  +++  N+ T   +   PLG  + P    +NHSC PNA
Sbjct: 167 LCAKAEKEYSGSELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNA 226

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V+ F+G L  V+A++ +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 227 VVGFDGGLITVKALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280


>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 386

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 100/428 (23%), Positives = 175/428 (40%), Gaps = 66/428 (15%)

Query: 30  CDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  CF   + L +C+ C+   YC   CQ   W+ HR ECQ + RL +     +  + R+M
Sbjct: 2   CHSCFRQQAQLHRCAQCRFAHYCDRTCQTACWEEHRQECQAIRRLGEVPADRIRLAARVM 61

Query: 89  LKLYLRRKLQNDN-VIPSTTTDNYSLVEALVAHMSDIDEKQL-LLYAQIANLVNLILQW- 145
            +++    + +D+ ++P         VE L  H++D+ ++QL  L+A +         W 
Sbjct: 62  WRMHKDTGVASDSQLLP---------VEELQDHVADLPQEQLQRLHADVRTFQEF---WS 109

Query: 146 ---PEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEG- 200
                 S  +IA     +  N  T+ +   L+ +G GL+P + ++NH C PN  +V    
Sbjct: 110 CGRSRPSREDIAHVLGIIKTNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHG 169

Query: 201 ------------RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
                       R   +RA+  V +G E+ +SY++   ++  RQ+ L E + F CTC RC
Sbjct: 170 NQSATNPALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFHCTCQRC 229

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI-ASEV 307
                                              K              E++++ A   
Sbjct: 230 -------------------------------SQHLKDDAMTAAADGADAAEVQEVTAFSQ 258

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 367
             L K    LT   ++QEV++      + Q+ L     +  ++     ++ L  L  + E
Sbjct: 259 ECLEKIDRCLTG-RDYQEVLALCSACLQKQRGLLADTHLCHLRVLSAAVEALSYLRCFPE 317

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 427
           A AY    +  Y+++YP  H  LG+     G      G  E A + + +A   L ++HG 
Sbjct: 318 AAAYALRLLQGYRKLYPPNHAQLGVAAMRAGVAHLQAGMPEAAHELICQAYRTLLVSHGP 377

Query: 428 NSPFMKEL 435
           N    ++L
Sbjct: 378 NHSTTRDL 385


>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 519

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 107/235 (45%), Gaps = 23/235 (9%)

Query: 30  CDGCFAS--------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKS 80
           C  CFA+          LK C+ C VV YC   CQ   W   H+ EC++         K+
Sbjct: 59  CSNCFATIGDEVNPDFTLKACTGCNVVRYCDRRCQSESWAASHKKECKIY--------KN 110

Query: 81  VTPSIRLM-LKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDIDEK---QLLLYAQI 135
             P I  M ++  LR     DN       DN Y     L  H S + ++   Q       
Sbjct: 111 CQPRILPMNVRAVLRIISPPDNKAAKEMYDNHYESFRTLGHHFSKMKQRGGEQAYRTTVT 170

Query: 136 ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 195
           A  +  I    +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA 
Sbjct: 171 AEALKAISN-TDIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAY 229

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           + F+G +  ++A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 230 IGFDGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
           ER-3]
          Length = 519

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 107/235 (45%), Gaps = 23/235 (9%)

Query: 30  CDGCFAS--------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKS 80
           C  CFA+          LK C+ C VV YC   CQ   W   H+ EC++         K+
Sbjct: 59  CSNCFATIGDEVNPDFTLKACTGCNVVRYCDRRCQSESWAASHKKECKIY--------KN 110

Query: 81  VTPSIRLM-LKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDIDEK---QLLLYAQI 135
             P I  M ++  LR     DN       DN Y     L  H S + ++   Q       
Sbjct: 111 CQPRILPMNVRAVLRIISPPDNKAAKEMYDNHYESFRTLGHHFSKMKQRGGEQAYRTTVT 170

Query: 136 ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 195
           A  +  I    +I ++ +A  F+KL  NA T+ N     +G    P  S INHSC PNA 
Sbjct: 171 AEALKAISN-TDIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAY 229

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           + F+G +  ++A+Q +    ++ ISYI+      TRQ  L+ QY F C CP+C++
Sbjct: 230 IGFDGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284


>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
          Length = 517

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 110/435 (25%), Positives = 187/435 (42%), Gaps = 86/435 (19%)

Query: 30  CDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQ----------------VLSR 72
           C  CF  SS LK+CS C+   YC  +CQK  W  HR EC+                +L  
Sbjct: 103 CSNCFKTSSTLKQCSQCKFTHYCQQSCQKDHWATHRSECKSPKSAISLKSMPQSLRLLLS 162

Query: 73  LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY 132
           L   K  S  P     L+++      N+ VI   +  +  L++      S    ++++ Y
Sbjct: 163 LLYLKTDSKNPKFMSNLQIF------NEQVITMISNASKILMK------SQEKREEVMNY 210

Query: 133 AQIANLVNLILQWPEISINE---IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 189
           A +  L        E++I+    I   +    CN   I ++++  +GTGLY   +++NHS
Sbjct: 211 AMVCFLKTAQSSNFEVNIDNFKLIVHLYFLSICNGFGIADNQMLRIGTGLYYPSNLLNHS 270

Query: 190 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST-MTRQKALKEQYLFTCTCPRC 248
           C PN +++F G+   +   + +    E+ I YI+   S  + RQ+ L+EQY F C C RC
Sbjct: 271 CDPNCMVLFRGQTQFIVTCRPIEADEEITICYIDNGISERIIRQQYLQEQYHFNCMCARC 330

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
           +K           I EG   K+                          K +I + + E  
Sbjct: 331 LK----------QIGEGTELKE-------------------------QKVQI-QFSEEQQ 354

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR------EKLIKILMEL 362
           IL+K+ L   S GN  + +S    + +L K +    S  + Q        E+L+ + +  
Sbjct: 355 ILNKQALHEESLGNFDKSLS---QLSQLYKSVETNKSTQIFQNSMIKELLERLVYLSVMN 411

Query: 363 EDWKEALAYCQLTIPVYQRVYPQF--------HPLLGLQYYTCGKLEWFLGDTENAIKSM 414
           +D+K A+ + Q  + + + +Y +         HPL+ L YY  GKL       + AI ++
Sbjct: 412 KDYKFAVKFQQKLVDLCEIMYIKEVKREDELPHPLIALHYYQLGKLLSQRKKYQEAIDNL 471

Query: 415 TEAVEILRITHGTNS 429
            +A+ ++   +G  S
Sbjct: 472 QKAMTLVSAYYGLKS 486


>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
           jacchus]
          Length = 442

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 106/453 (23%), Positives = 177/453 (39%), Gaps = 116/453 (25%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIRRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 257

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            +  R+K LK+QY F CTC  C K                + KDD    FL         
Sbjct: 258 VSEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL--------- 289

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                G+  + +  +++  E+   SK TL       S G + EVV         ++KLYH
Sbjct: 290 -----GVKDNPKASQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKL------CREKLYH 338

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
           P +                                            LG+     G   W
Sbjct: 339 PNNAQ------------------------------------------LGMAVMRAGLTNW 356

Query: 403 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             G+ E     + +A  IL +THG + P  K+L
Sbjct: 357 HAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 389


>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 552

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 113/430 (26%), Positives = 183/430 (42%), Gaps = 115/430 (26%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +G  +++ +P+V V +  +    C  C +    + LK+C+ C+V+WYC + CQ  DW +H
Sbjct: 45  AGATLMALKPHVHVLSTRNLELYCSACTSPAPVTGLKRCTRCRVIWYCSTACQNNDWTMH 104

Query: 64  RLECQVLSR--LDKEKRKSVTPS--IRLMLKLYLRRK--------------LQN--DNVI 103
           + EC  + R   +        P+  IR + +L  R++              LQ+   ++ 
Sbjct: 105 KPECAAIQRWAANAPSPDVSIPAEPIRCLGRLLWRKQKLRSSSTWAKEVDGLQSHRKDLQ 164

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLA 161
           PST   +  L  A+V ++      QL  Y     A+LV+LI               S+  
Sbjct: 165 PSTYESHTHLAHAVVKYLGLDAPGQLGAYGITTAADLVDLI---------------SRFI 209

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHV 211
            N+ T+    L PLG  + P++++INHSC PNAV+V+          E ++ VV A++++
Sbjct: 210 TNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRSTSDPSQQEPQMQVV-AIKNI 268

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 271
               E+  +YI+T      RQ AL   Y FTC C  C +    D      + E   C  +
Sbjct: 269 APDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDGMD------LREAIWCPKN 322

Query: 272 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV-STY 330
            C G           TC              + +E N LS+       CG  + +V ST 
Sbjct: 323 -CGG-----------TC-------------PLPTEENTLSR-------CGKCKAIVSSTD 350

Query: 331 KMIEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT---IPVY--QRV 382
            +++ L   Q+ L    S+ L +T + L                 QLT   IP+    ++
Sbjct: 351 SVLDALRVGQEALQKATSLQLTETAKAL-----------------QLTTNIIPILTSSKL 393

Query: 383 YPQFHPLLGL 392
            P  HPLL L
Sbjct: 394 TPSSHPLLAL 403


>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
 gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 537

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 79/281 (28%), Positives = 137/281 (48%), Gaps = 24/281 (8%)

Query: 7   SGEVI-ISQEPYVCVPNNSSSISRCDGCFASSNL--------------KKCSACQVVWYC 51
           +GE I I + P + +P++++  + C+ C   S                K C+ C+   YC
Sbjct: 34  AGETIAILENPLLALPDDANMRTTCNYCLYVSGTIEFEGDVKAGPRTCKACTGCKAAVYC 93

Query: 52  GSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTD 109
            + CQ+  WKL H+ EC++  R+ +   K   P+ +R + ++ L  K  ++ V  +    
Sbjct: 94  NAECQRQHWKLVHKAECKMFKRIRERTGKDWVPTPVRAVAQVMLLLKAGDEEVTKAFGPG 153

Query: 110 NYSLVEALVAHMSDIDE--KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTI 167
               +E+ V      +E  K   L A  A +   +LQ  E ++ +  E   K+  NA   
Sbjct: 154 G--TLESNVEGFKTDEELWKDFELQATGAVVYAGLLQSDE-TLKQAMEVLCKIQTNAFNR 210

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 227
            +++    G  L+P +S++NHSC+PNA + FE R A ++A + +  G E+LISYI+    
Sbjct: 211 FDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAFLKAERDIEPGDEILISYIDHTAP 270

Query: 228 TMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 267
              RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 271 RRARQESLR-LYHFQCNCVRCKDDLNVYEVIQACGVSPGVK 310


>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 666

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 164/387 (42%), Gaps = 53/387 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE I+ ++PY  V      +  C  CF  +     C  C  V +C  NC++     H+ E
Sbjct: 258 GETILVEKPYCSVLLEDYHLINCQHCFKRTQGPYPCENCSQVIFCSLNCRE-SANYHKYE 316

Query: 67  CQVLSRLDKEKRK-SVTPSIRLM----LKLYLRRK---LQNDNVIPSTTTDNYSLVEALV 118
           C +L  L K     +V  ++R++    LK +L  K   +   N      + +Y  V  LV
Sbjct: 317 CSILQTLHKSGVSITVLMALRMITQNNLKYFLNLKSKLVDEKNFDGVYKSSDYLRVYNLV 376

Query: 119 AH-----MSDIDEKQL--LLYAQIANLVNLI---------LQWPEISINE-IAENFSKLA 161
            H       D  E+ +  L   +   LVN           L   EI I   I  +   L 
Sbjct: 377 RHEDLRETKDFFERTVMALFLLKCLKLVNYFNKTDNLLETLTEDEIFIGGLILRHLQILQ 436

Query: 162 CNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
            NAH I   E+             +G GLYP +++ NHSC P  V  F G   +VRA + 
Sbjct: 437 FNAHEISELEMENKNILDDSKSLFVGGGLYPTLALFNHSCDPGIVRYFIGTQVIVRAAKP 496

Query: 211 VPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKL-GQFDDIQESAILEGYR 267
           + KG  V  +Y  I +    + RQ+ LK QY FTCTC  C++L   FD++    I   +R
Sbjct: 497 IKKGEIVAENYGPIYSQMKKLERQEKLKSQYWFTCTCTPCLELWPTFDELDTKTI--RFR 554

Query: 268 C-------KDDGCSGFLL--RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 318
           C       K + C   L+   ++DD    C QC    +  +  K   + +IL K+ +   
Sbjct: 555 CSGYGTDEKRENCKNILIVPVNTDDFMIKCSQCNQHTNLFKGLKALQDTDILYKQAIKYA 614

Query: 319 SCGNHQEVVSTY-KMIEKLQKKLYHPF 344
             GN Q  +  Y ++++ L   L  PF
Sbjct: 615 QTGNFQNALDIYLEILKTLHGILAPPF 641


>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
           B]
          Length = 604

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 79/283 (27%), Positives = 128/283 (45%), Gaps = 49/283 (17%)

Query: 7   SGEVIISQEPYVCVPNNS---SSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +G  +I+ +P+V V + +   S  S C G    + LK+C+ C+VVWYC S CQ  DW +H
Sbjct: 99  AGSTLIATKPHVTVLSTTYLDSYCSACCGPAPETGLKRCTRCRVVWYCDSCCQNNDWPIH 158

Query: 64  RLECQVLSRLDKEKRKS--VTPS--IRLMLKLYLRRKLQNDNVI---------------- 103
           + EC  + R            PS  IR + ++ L ++ +  + I                
Sbjct: 159 KDECIAIQRWASSAPSPDVAIPSDAIRCLGRILLTQQHRGLDSIWSKEINAMQSHRSSMQ 218

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLA 161
           PS    +  +  +LV ++   +  +L  +      NLV+LI               S+ A
Sbjct: 219 PSAVESHTHMAHSLVRYLGIANPAELAPFGLNSAGNLVDLI---------------SRFA 263

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVP 212
            N  T+ +  L P+G  + P I++ NHSC PNA +VF         +  L  + A++ + 
Sbjct: 264 TNTFTLTSFSLTPIGICICPSIALANHSCEPNAAIVFPRSSSLPQAQEPLMHLMAIRDIA 323

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
              +V+ +Y++       RQKAL+E Y FTC C  C K    D
Sbjct: 324 PNEQVVAAYVDVTLPRELRQKALQETYSFTCKCKLCTKRMAVD 366


>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
           castellanii str. Neff]
          Length = 365

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 103/189 (54%), Gaps = 14/189 (7%)

Query: 50  YCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRR--KLQNDNVIPST 106
           YC + CQ+ DW  +H+ EC ++  +   K  S   S+ LM ++ +RR  + Q    +P  
Sbjct: 4   YCNTECQRKDWAAVHKEECGMIQSVAPHKPTS---SMLLMWRVLIRRIKEEQEGKSLP-- 58

Query: 107 TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV--NLILQWP-EISINEIAENFSKLACN 163
              N+ L+  L  H+     ++   +  +A L+  +L    P +I+  EI   F   ACN
Sbjct: 59  ---NFDLIRYLTTHIDAFPSEKKEHFGVMAALIKKSLPATLPTDITPQEIMHLFCLFACN 115

Query: 164 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 223
             T+ + EL+PLG G+YP  ++INHSC PN V++FEGR   VR+++ + K  E+  +Y+E
Sbjct: 116 NFTVSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQCTVRSLRDITKDEEITFNYVE 175

Query: 224 TAGSTMTRQ 232
               T TR+
Sbjct: 176 VGDPTETRR 184



 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 56/85 (65%)

Query: 365 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
           W+EAL YC  ++ V++R+YP   PL+GLQY   GKL ++L  TE A++SM  A+ IL IT
Sbjct: 269 WREALLYCSASLAVFERIYPPRWPLVGLQYLIHGKLSFYLKYTEAALESMQRALPILTIT 328

Query: 425 HGTNSPFMKELILKLEEAQAEASYK 449
           H +  P ++ L   L EA AE +Y+
Sbjct: 329 HSSGHPLVRTLHTMLAEATAEWNYE 353


>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
          Length = 354

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 105/208 (50%), Gaps = 6/208 (2%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  I + +P+  V  +  +   CD C  S  L KCS CQ V+YC  NCQK  W +H+ EC
Sbjct: 11  GITIFTSKPFAYVLTSKCNAICCDNCLKSGKLSKCSGCQYVYYCDRNCQKESWPIHKAEC 70

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--D 125
             L R+     K +  + RLM ++ ++      N +       +   + L++H SDI  D
Sbjct: 71  ANLKRISP---KIIPDAARLMARIIIKLNQGGANEVGYYCETKFRKFKDLMSHYSDIKKD 127

Query: 126 EKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
            K++  +  +   L+  + +    +  E+   + ++  N+  I + ++  +G G+Y   S
Sbjct: 128 PKRMEHFISLCGVLLEFLGEALMPNTAELIGIYGRICINSFNILDLDMNSIGVGIYLGAS 187

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           +I+HSC PNAV+ FEG   +VR +  +P
Sbjct: 188 VIDHSCKPNAVVTFEGTTIIVRTLIDLP 215



 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 55/98 (56%)

Query: 337 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 396
           QK + H F++  ++T +      + LE W++A  Y +  +P Y   Y + HPL+GL Y T
Sbjct: 234 QKDILHSFNIQHIRTLDTAFLAAVNLEYWEDAELYSKQLLPGYLLYYGEVHPLIGLLYLT 293

Query: 397 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 434
            GK++  L   + A++ + +A  +L ITHG   P M+E
Sbjct: 294 MGKIQLHLRKLKQALEILKKASAVLTITHGDKHPIMRE 331


>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
 gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
          Length = 509

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 116/459 (25%), Positives = 191/459 (41%), Gaps = 57/459 (12%)

Query: 22  NNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKS 80
           NN   I+ C  C    N    C  C+ V YC   C +  WK HR EC +  R +   R  
Sbjct: 54  NNFKIINTCFYCLEKFNKCICCPNCKYVVYCSDMCLERAWKSHREECDIF-RSNIFDRYC 112

Query: 81  VTPSIRLMLKLYLRRKLQNDNVIPST--TTDNYSLVE--ALVAHMSDIDEKQLLLYAQIA 136
            + ++RL++  YL      D    ST  + + Y  ++  A V  ++ + +++ + +    
Sbjct: 113 PSITMRLVINCYLNHFNFYDYCGSSTDISKEKYERLKYPAYVVAVALMSKRKKIFHNFDN 172

Query: 137 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNA 194
           N         E  +  I E F K++ N+  I ++EL P G  +Y  PV    NHSCL N 
Sbjct: 173 N---------ESILKNIIEKFIKISKNSLQIIDNELEPAGLAIYKKPV-PFFNHSCLSNC 222

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP-------- 246
           V VF  +   +R +  V  G E+ ISYI+ A    TR     +QY FTC+C         
Sbjct: 223 VTVFRNQKLYIRTLMDVYPGEELTISYIDIAFDRCTRLSICMDQYFFTCSCKLCKVNIAS 282

Query: 247 ----------------RCIKLGQFDDIQESAILEGYRCKDDGCSGF----LLRDSD-DKG 285
                            C K   + +I   + LE  +C  +  S F    +LR S+ D  
Sbjct: 283 ECHNIFNTEIVCTNSENCKKFLGYMEIVLISELERKQCYINK-SSFKTYPVLRKSNIDNV 341

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK------- 338
           + C  C        IK +  +     K+T+ L +  N +       +++ L K       
Sbjct: 342 WRCMLCKAEVGDNIIKGLVDKEKETFKETVYLETLFNEKYTYDNKSVLQSLNKIKSKIDY 401

Query: 339 --KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 396
               YH    +L + R K++ I ++L ++K A       +   +  Y ++ P+ G   + 
Sbjct: 402 LTNYYHHAKYSLQKMRAKILYISIQLHEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFL 461

Query: 397 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            GKL  FL      +  + +A + +  T+G +S   K+L
Sbjct: 462 TGKLALFLDLKNEGLGLIHKAKKNIIKTYGPDSLIYKDL 500


>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
           1015]
          Length = 386

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 119/255 (46%), Gaps = 19/255 (7%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASS-------NLKKCSACQVVWYCGSNCQKL 58
           V  EV+    P+V V +       C GCF           LK C  C +V YC   CQ  
Sbjct: 26  VGEEVLHIPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGITLKGCRGCGIVKYCDKTCQAK 85

Query: 59  DWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL 117
           DWKL H LEC +   L   K + +  + R +L++ LR + Q        +++       L
Sbjct: 86  DWKLGHSLECSIYQNL---KPRVLPINARAVLRMVLRSERQK------YSSEELDQFLQL 136

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPL 175
             H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    L
Sbjct: 137 GTHIKDIRDQSASQWERISLSSKAIRAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDRL 196

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  L
Sbjct: 197 GVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQANL 256

Query: 236 KEQYLFTCTCPRCIK 250
           + +Y FTC C +C +
Sbjct: 257 QSRYYFTCHCSKCTR 271


>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 463

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 118/469 (25%), Positives = 190/469 (40%), Gaps = 94/469 (20%)

Query: 15  EPYVCVPNNSSSISRCDGCFASS--------NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           +P V V +     + C  CF S         +L +C  C+++ YC   CQK DW++ H+ 
Sbjct: 41  KPLVAVLDEKHIKNTCSNCFISEEDKDDDIFHLIQCKGCKILQYCCQKCQKEDWEIFHQY 100

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC     L K   K V  SIR+ ++L    +    ++  +T  D       L +H S+I 
Sbjct: 101 ECAFF--LSKSP-KIVPGSIRICMRLIFYGRCYPSSLEWNTIMD-------LESHRSEIM 150

Query: 126 EKQ-----LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
             +      +L   I N  N      E++   + + F K+  N+ ++  S    +GT + 
Sbjct: 151 SSEKGDVIWMLSKGIQNFTN------EMNETFVLDLFCKIMINSFSLMTSSYDTIGTAID 204

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P+IS INHSC PN  LVF+     +R++Q +    E+ +SYI+   +   R   L  +Y 
Sbjct: 205 PIISRINHSCYPNTALVFDRNTVALRSLQKILPNQEITVSYIDIYNTQKNRHDELLSRYY 264

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEI 300
           F+C C RCI                     DG   +++   +    T      +     I
Sbjct: 265 FSCKCTRCI-------------------VSDGFESYIVLKKEMTSNTFSYLETI-----I 300

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            +  SE NI   K  AL+     +   ST   + +L     H F++N            +
Sbjct: 301 NRALSEKNIYLFK--ALSMLHRLKGWNSTIYPLNEL-----HRFALNY----------FL 343

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQ----FHPLLGLQYYTCGKL-------------EWF 403
           +  ++  AL +      V   VY Q    F+PL   Q Y   +L             +W 
Sbjct: 344 DENNFHNALYHGLFIYLVGSSVYNQYISNFNPLYVTQIYLLVRLMIYQASENDQKQFKWL 403

Query: 404 LGDTEN----AIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 448
             DTE     A K + E VE+   +HG +S F K +    +E   + S+
Sbjct: 404 --DTEKIMKYAYKLLYELVELSYKSHGLSSRFSKRIQKTFKETDEDISF 450


>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
          Length = 483

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/264 (26%), Positives = 123/264 (46%), Gaps = 31/264 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 24  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 84  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 138

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V++ LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 139 QKELRVD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 192

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAG 226
           +P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+   
Sbjct: 193 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLN 252

Query: 227 STMTRQKALKEQYLFTCTCPRCIK 250
            +  R++ LK+QY F CTC  C K
Sbjct: 253 VSEERKRQLKKQYYFDCTCEHCQK 276


>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
 gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 380

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/233 (28%), Positives = 118/233 (50%), Gaps = 17/233 (7%)

Query: 8   GEVIISQEPYV-CVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           GEVI S EP+  CV  +    + C  C     +L +CS C+   YC   CQK  W  H+ 
Sbjct: 27  GEVIYSCEPFAFCVARDFLKTA-CQSCLKRGESLSRCSQCKTARYCNVQCQKQAWPDHKR 85

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC+ L  L   + +  T S+RL+ ++  +   Q++    S   + YS+ E   +H++D+ 
Sbjct: 86  ECKCLKHL---QPRIPTDSVRLVARIIFKLLSQSE----SDQEELYSIAEH-QSHLADMS 137

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNAHTICNSELRPLGTGL 179
           E++      +   + + L      ++ +          +++ CN  +I + EL+ +G GL
Sbjct: 138 EEKTEGLKHLCTTLQVYLAEENCDLSRLPSGLDPVSLLARVTCNCFSISDGELQDVGVGL 197

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           YP +S++NH C PN +++FEG+   +RAV+ +    E+ ISY +    +  R+
Sbjct: 198 YPSMSLLNHDCQPNCIMMFEGKRLTLRAVRVIRSAEELTISYTDILAPSKDRR 250



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 4/96 (4%)

Query: 354 KLIKILME----LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +L+ + M+    L+D++ AL Y    +  Y+  +   HP   ++    GKL+ FLG  E 
Sbjct: 281 RLLDLAMDACISLDDYETALEYGNRALGPYKLYFSDPHPSRAVELLRVGKLQHFLGRLEE 340

Query: 410 AIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 445
           A  S T+A  I+ +THG       ++  +L E QAE
Sbjct: 341 ARGSFTQAYSIMSVTHGAAHALTNQVQRRLSECQAE 376


>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
           africana]
          Length = 478

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/441 (24%), Positives = 189/441 (42%), Gaps = 56/441 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S   S C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLINSVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYAKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K L +       VN  L+ WP    + S   I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKALRVD------VNTFLEYWPPQNQQFSTQYISHIFGVINCNGFTLSDQRGLQAVGVGV 197

Query: 180 YPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           +P + ++NH C PN  ++F  G    V+++ H        +S  +  G T   Q  +K  
Sbjct: 198 FPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQ------MSKAQDPGKT-GPQFVIKLV 250

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
            L TC    C+ LG        A+ EG   KDD      L   DD           +  +
Sbjct: 251 GLTTCFFS-CLALGV------EALPEGL--KDD----LFLGVKDDP----------KPSQ 287

Query: 299 EIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
           E+ K   E+  LSK TL       S G + EVV   +   + Q+ ++   ++  ++    
Sbjct: 288 EVVK---EMIQLSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSI 344

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
           + ++L  L+ ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     +
Sbjct: 345 VSEVLSYLQAFEEAADYAKRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMI 404

Query: 415 TEAVEILRITHGTNSPFMKEL 435
            +A  IL +THG + P  K+L
Sbjct: 405 CKAYAILLVTHGPSHPITKDL 425


>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
          Length = 484

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 115/243 (47%), Gaps = 19/243 (7%)

Query: 16  PYVCVPNNSSSISRCDGCFASSNL-------KKCSACQVVWYCGSNCQKLDWKL-HRLEC 67
           P+V V +       C GCF    L       K C  C VV YC   CQ  DWKL H  EC
Sbjct: 36  PFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRGCGVVKYCDKTCQAKDWKLGHSFEC 95

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
            +  +L   K + +  + R +L++ LR + Q  +       D +  +E    H+ DI ++
Sbjct: 96  TIYQKL---KPRILPINARAVLRMVLRSERQKYS---DEELDQFLQLET---HIKDIRDE 146

Query: 128 QLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
               + +I+     I  +    ++E  I+   +KL  N+  + N+    LG  L+P  +I
Sbjct: 147 SASHWERISLSSKAIKAYSGTEMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAI 206

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
            NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  L+ +Y FTC C
Sbjct: 207 FNHSCDHNAAVSFDGPNLHIKAIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHC 266

Query: 246 PRC 248
            +C
Sbjct: 267 SKC 269


>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
 gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
          Length = 498

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 113/236 (47%), Gaps = 23/236 (9%)

Query: 30  CDGCFAS------------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C GCF++              L+ CS C+VV YC   CQ  +W L H+ EC +  +L   
Sbjct: 55  CSGCFSNLPFGANVMASVDMKLRACSGCKVVKYCSKGCQAENWALIHKHECAIYKKL--- 111

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++   R  ++ +V       + ++   L +H+ +I       Y +I 
Sbjct: 112 YPKILPVNSRAVLRIVKLRNSKDADV-----QSDMTMFLGLRSHLKEIASSNQEQYERIM 166

Query: 137 NLVNLILQWPE--ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++    + +  IAE  +++  N+ T   S   PLG  + P    +NHSC PNA
Sbjct: 167 LCAKAEKEYSGSGLDLETIAEYLARIEVNSFTFTTSFGDPLGLCIQPFACYMNHSCDPNA 226

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           V+ F+  L  V+A++ +    +V ISYI+       RQK L E+Y FTC C +C++
Sbjct: 227 VVGFDEGLITVKALRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282


>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
           127.97]
          Length = 499

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 113/234 (48%), Gaps = 23/234 (9%)

Query: 30  CDGCFAS------------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C  CFA+              L+ CS C+VV YC   CQ  +WKL H+ EC +  +L   
Sbjct: 56  CSNCFANLPFGVNVMASVDMKLRACSGCKVVKYCDEKCQSANWKLIHKHECAIYKKL--- 112

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++    KL+N N        + ++  +L +H+++I       Y +I 
Sbjct: 113 YPKILPVNSRAVLRIV---KLRNSN--EEHVKSDLTMFLSLRSHLNEITNSNQEQYERIM 167

Query: 137 NLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++   ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNA
Sbjct: 168 LSAKAEKEYSGSDLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNA 227

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V+ F+  L  V+A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 228 VVGFDEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 454

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 158/366 (43%), Gaps = 51/366 (13%)

Query: 110 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-------NEIAENFSKLAC 162
           ++ +V  L +H+ D   K         N+  + +  P+ ++       + IA  ++++  
Sbjct: 76  DWEMVCRLPSHVDDFKRK-----GTYGNIEMMAMGAPQFALPPTMFDKDFIAAMYARVMS 130

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 222
           NA TI    L PLG  L P +  +NHSC PNA ++ +G    +R ++ + K  E+ ISYI
Sbjct: 131 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIRKDKEIFISYI 190

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 282
           +T      RQ+ L+ ++ FTC C +C         QE A L+         S F+L    
Sbjct: 191 DTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLLPVDSKFVLDAEA 241

Query: 283 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
            K     Q       EE+ K+++E  I   K + L SC                + + Y 
Sbjct: 242 KKAMAQTQEQTFAVYEELHKLSTEHVIYGLKQI-LASC---------------YESRFYP 285

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLLGLQYYTCGKL 400
            +     + R+ LI  L+ +  +++A A C      +  ++YP  FHP+  +Q +    L
Sbjct: 286 IYRQPYAEARDVLIVNLLSMGKFQDAWAQCAKRYKYILPKLYPVPFHPVRVVQTWQMAML 345

Query: 401 EWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 448
             +L  TE  + +            + + +++  ++HG N+ F K +  K EE   E   
Sbjct: 346 AAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKGKAEEMIEELKR 405

Query: 449 KLSSKD 454
            + + D
Sbjct: 406 SVGNPD 411


>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
          Length = 499

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 113/234 (48%), Gaps = 23/234 (9%)

Query: 30  CDGCFAS------------SNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE 76
           C  CFA+              L+ CS C+VV YC   CQ  +WKL H+ EC +  +L   
Sbjct: 56  CSNCFANLPFGVNVMASVDMKLRACSGCKVVKYCDEKCQSANWKLIHKHECAIYKKL--- 112

Query: 77  KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
             K +  + R +L++    KL+N N        + ++  +L +H+++I       Y +I 
Sbjct: 113 YPKILPVNSRAVLRIV---KLRNSN--EEHVKSDLTMFLSLRSHLNEITNSNQEQYERIM 167

Query: 137 NLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 194
                  ++   ++ I  IAE  +++  N+ T   +   PLG  + P    +NHSC PNA
Sbjct: 168 LSAKAEKEYSGSDLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNA 227

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V+ F+  L  V+A+  +    +V ISYI+       RQK L E+Y FTC C +C
Sbjct: 228 VVGFDEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
 gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
           scapularis]
          Length = 374

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 162/381 (42%), Gaps = 47/381 (12%)

Query: 60  WKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           W  H++EC  L ++  E   +    +R   +L ++ K  +                 LVA
Sbjct: 17  WPDHKVECACLEKVSPEVPAN---PVRFFCRLLIKLKDTSAAAQTEPVFGRGRSFADLVA 73

Query: 120 HMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIA-ENFSKLACNAHTICNSELRPLG 176
           H+ +I  D K++  + ++       L    +   E   E F K+  N +TI N E  P+G
Sbjct: 74  HVEEIKRDAKRVAQFQRLWMTAGAFLGDKYLPPAETGLEIFGKMVVNTYTITNYEECPIG 133

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA--EVLISYIETAGSTMTRQKA 234
           TGLY   SI++HSC PNA  VF+GR   +RA   +   +   + +SYI+      TR + 
Sbjct: 134 TGLYIGPSILDHSCTPNAQAVFDGRTLRLRAAADIECDSVDGIRVSYIDLKELRRTRIEE 193

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 294
           L+E+Y F+C CPRC      D+ QE                           T +   L 
Sbjct: 194 LRERYYFSCNCPRC----SADEAQEY-------------------------LTEKSPSLT 224

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 354
              +E+ +   +  + +++ L     G  + ++S++K+                 +  E 
Sbjct: 225 AKAKELLREFMQAEVHAREDLVRLR-GQAEALLSSHKLPAT---------DAARSEALEL 274

Query: 355 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
           L K  +E+ D+  AL Y       Y+R Y  + P+ G+  +T  KL       + A++  
Sbjct: 275 LSKSCLEMGDFDAALPYYLAKEETYRRCYGPYSPVYGVLLFTIAKLYHHKVLLQEALQYF 334

Query: 415 TEAVEILRITHGTNSPFMKEL 435
            +A E++ ++HG N    ++L
Sbjct: 335 EKASEVVTVSHGRNHRLYRDL 355


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 80/276 (28%), Positives = 120/276 (43%), Gaps = 17/276 (6%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL-HR 64
            +G V+    P + VP+ S     C  C      + CS C+  +YC + CQ   W   H 
Sbjct: 28  AAGGVLAVFSPLILVPSLSHLTIVCSFCLRGVTPRPCSRCRAAYYCDAQCQAAAWSSGHA 87

Query: 65  LECQVLSRL--DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
            EC  L+       +R+ +   +R +++L L  +    +     T D    +E   A   
Sbjct: 88  AECAALTGAIPPARRRREIPAPVRALVQLLLLLRSGGGH-----TRDLLDGLEGHAAQRR 142

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
                  +    +A      ++  E ++ E AE   KL  NA    +++L  +G  L P 
Sbjct: 143 RAPGWADMELMAMAGCAFAGMETSEGAVREAAEILCKLQTNAFHRFDADLGHVGIFLEPT 202

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +++ NHSCLPNA + F GR AV+RA Q +  G E+ ISY +   S   RQ AL   Y F 
Sbjct: 203 LAMANHSCLPNAFVQFVGRTAVLRAEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFE 261

Query: 243 CTCPRCIKLGQFDDI---QESAILEGYRCKDDGCSG 275
           C C RC +     D+   Q SA    +   DD   G
Sbjct: 262 CRCRRCTQ-----DLCVYQASAHYRDFALNDDSVLG 292


>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 226

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 77/269 (28%), Positives = 127/269 (47%), Gaps = 55/269 (20%)

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +S++NHSC PN V+VFEG   ++R V+ +PKG E+ ISYI+    T  R+  L+ QY F 
Sbjct: 1   MSLLNHSCDPNCVIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           C C RC                            LLRD D+         L    E  ++
Sbjct: 61  CDCQRC----------------------------LLRDKDEDM-------LAGDAEASRE 85

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL--- 359
           + S V+ L +    L S    +E ++   + + L  + Y P   N+ Q     +KIL   
Sbjct: 86  VESSVSRLEE----LLSQNTAEEALN---LCKTLMNRYYLP-DKNIYQ-----LKILDCA 132

Query: 360 ----MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
               ++L  W+EAL +   T+  Y   Y  +HP+  +Q    GKL+ + G    A+K++ 
Sbjct: 133 MDASIDLGLWEEALHFGLRTLEPYSLYYSNYHPVRAVQLMKVGKLQNYQGLFAEAMKTLK 192

Query: 416 EAVEILRITHGTNSPFMKELILKLEEAQA 444
           +A +I+++THG +    ++L   + + +A
Sbjct: 193 QAFDIMKVTHGRDHGQTQQLAELMNDCEA 221


>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
          Length = 481

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 72/253 (28%), Positives = 118/253 (46%), Gaps = 19/253 (7%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNL-------KKCSACQVVWYCGSNCQKL 58
           V  EV+    P+V V +       C GCF    L       K C  C +V YC   CQ  
Sbjct: 26  VGEEVLHIPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRGCGIVKYCDKTCQAK 85

Query: 59  DWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL 117
           +WKL H LEC +   L   K + +  + R +L++ LR + Q        +++       L
Sbjct: 86  NWKLGHSLECSIYQNL---KPRVLPINARAVLRMVLRSERQK------YSSEELDQFLQL 136

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRPL 175
             H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  + N+    L
Sbjct: 137 GTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDRL 196

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+       RQ  L
Sbjct: 197 GVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQANL 256

Query: 236 KEQYLFTCTCPRC 248
           + +Y FTC C +C
Sbjct: 257 QSRYYFTCHCSKC 269


>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
          Length = 428

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 100/422 (23%), Positives = 182/422 (43%), Gaps = 26/422 (6%)

Query: 30  CDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C+ C   ++ LKKC+ C   +YC   CQK DWK+H+ EC+ +    +E     + S+RL 
Sbjct: 26  CNLCLTNTTRLKKCAGCNEFFYCSQECQKNDWKVHKPECKSI----QEHGTLASDSMRLA 81

Query: 89  LKLYLRRKLQN-DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 147
           +++ +    +N  +V  S  +        LV H S +D      Y +  N    +   PE
Sbjct: 82  MRIIVLLGKKNFGHVTASYISTGSRSFLTLVDHGSFLDSGAETFYQEYLNYA--LPPHPE 139

Query: 148 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 207
             +  +   F K++ N  ++ NS    +G GL   +S  NHSC P   + +  R A++  
Sbjct: 140 PDV--VKSVFKKISINGFSVSNSMGSAIGHGLCIKLSAANHSCKPLTRVCYRNRTAMLVP 197

Query: 208 VQ--HVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 264
           +     P   E    SYI+   S   RQ  LK++Y F C C  C+      D + +  +E
Sbjct: 198 IDPYKPPTSLEGACHSYIDELQSVSARQSELKKKYNFICICEGCM------DDERNESME 251

Query: 265 GYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCGN 322
            + C D  C+   + + +D    C  C   + R   E+ + A E  I +K  +   S   
Sbjct: 252 AWACGD--CANGKVPNVEDG--RCASCSWKMTRDHYELCRTAEESAIAAKPKIENESISL 307

Query: 323 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 382
                   K++E     L H F+V+ +  +  L    + ++D   A  + +  + +  R 
Sbjct: 308 ETRKALCEKLMELFDDTL-HEFNVHRVVPQRLLFFASLAVQDVNGASKHGKALLAMMMRY 366

Query: 383 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEA 442
             +  P++ L+ +   ++    G+    +K + E  E L+  +   +P    +   +E A
Sbjct: 367 QGKHDPIVMLKKWQIAQICHANGEYNKVVKLLEEIKEPLKKIYTAGAPISVNIRRLMEMA 426

Query: 443 QA 444
           ++
Sbjct: 427 RS 428


>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
           troglodytes]
          Length = 452

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 120/263 (45%), Gaps = 29/263 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + ++I ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           +K L +     ++   +  WP    + S+  I+  F  + CN  T+ +   L+ +G G++
Sbjct: 144 QKDLRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGIF 198

Query: 181 PVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGS 227
           P + ++NH C PN  ++F  G    V+++ H            + +G E+ +SYI+    
Sbjct: 199 PNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNV 258

Query: 228 TMTRQKALKEQYLFTCTCPRCIK 250
           +  R++ LK+QY F CTC  C K
Sbjct: 259 SEERKRQLKKQYYFDCTCEHCQK 281


>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
          Length = 230

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 103/189 (54%), Gaps = 8/189 (4%)

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           +H+LEC  +    +    S T  +RL  ++  ++K+  +     T ++    V+   +H+
Sbjct: 1   MHKLECSPMVVFGENWNPSET--VRLTARILAKQKIHPER----TPSEKLLAVKEFESHL 54

Query: 122 SDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGL 179
             +D EK+ L+ + IA L +   +  E   N+ +   F+++ CN  TI + EL  LG+ +
Sbjct: 55  DKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAI 114

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +P ++++NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y
Sbjct: 115 FPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSY 174

Query: 240 LFTCTCPRC 248
            FTC C  C
Sbjct: 175 FFTCECQEC 183


>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
 gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
 gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
          Length = 391

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 71/218 (32%), Positives = 113/218 (51%), Gaps = 19/218 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C++  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMQET--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL L  Q  +   +    Q P     +I E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLALTFQHFMREEIQDASQLP--PSFDIFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           LYP +S++NHSC PN  +VF G   ++RAV+ V  G E
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 234



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 366 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 425
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 426 G 426
           G
Sbjct: 368 G 368


>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
           FGSC 2508]
          Length = 668

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 136/287 (47%), Gaps = 36/287 (12%)

Query: 7   SGEVI-ISQEPYVCVPNNSSSISRCDGCFASSNL--------------KKCSACQVVWYC 51
           +GE I I   P + +P++++  + C+ C   S                K C+ C+   YC
Sbjct: 34  AGETIAIIDNPLLALPDDANMRTTCNYCLYVSGTIEFEGDVEAGPRTCKACTGCKAAVYC 93

Query: 52  GSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVI----PS 105
            + CQ+  WKL H+ EC++  R+ +   K   P+ +R + ++ L  K  ++  +    P 
Sbjct: 94  NAECQRAHWKLVHKAECKMFKRIKERTGKDWLPTPVRAVAQVMLLLKAGDEETVKAFGPG 153

Query: 106 TT----TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 161
            T     + +   E L     ++     ++YA +       LQ  E ++ +  E   K+ 
Sbjct: 154 GTLESNVEGFKTDEGLWGDF-ELQATGAVVYAGL-------LQSDE-TLKQAMEVLCKMQ 204

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
            NA    +++    G  L+P +S++NHSC+PNA + FE R A ++A + +  G E+LISY
Sbjct: 205 TNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAERDIEPGDEILISY 264

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 267
           I+       RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 265 IDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 310


>gi|384246814|gb|EIE20303.1| hypothetical protein COCSUDRAFT_48675 [Coccomyxa subellipsoidea
           C-169]
          Length = 377

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 59/90 (65%)

Query: 152 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           ++A   ++ ACN HTIC+ ELRP+G G+YP  +++NHSC PN +  F G   V RAVQH+
Sbjct: 22  DVAYLLARFACNNHTICDDELRPIGVGIYPQGAMLNHSCTPNTMQSFHGSRIVFRAVQHI 81

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           P GAE  I+Y+E A +   R  AL+  Y F
Sbjct: 82  PAGAEATIAYVELAATRAERSAALRANYFF 111


>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
 gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
          Length = 597

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 125/269 (46%), Gaps = 37/269 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF---ASSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           G VII+ +P+V V    +  S C  CF     + LK+C+ C+VV YC S CQ  DW  H+
Sbjct: 92  GTVIIAVKPHVSVLTTQNLASYCSNCFEEAPEAGLKRCAHCRVVHYCNSECQNKDWATHK 151

Query: 65  LECQVLSRLDKEKRKS--VTPS--IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            EC  L    K    +    PS  +R + +L  +R+ +  +          +  + + A 
Sbjct: 152 RECTALQEWAKHAPAAEVSVPSDAVRCLGRLLWKRQKKGLD---------STWAQQIAAM 202

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF------------SKLACNAHTIC 168
            S     Q   Y    +L + ++++  +S  +  ++F            S+ A N++++ 
Sbjct: 203 QSSRKSLQPSGYEFHTHLSHSLVRYLGLSSPDGLQDFGISSPGDLLDLTSRFATNSYSVT 262

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVPKGAEVLI 219
             +L P+G  + P++S++NHSC PNA  VF         +  L  V A++ +    ++  
Sbjct: 263 TPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTPSTDEPLISVVAIRDIQPDEQIFT 322

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SYI+T      R++ L+E Y F C C  C
Sbjct: 323 SYIDTTLPRALRRRELQEGYNFLCKCSLC 351


>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
          Length = 623

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 103/226 (45%), Gaps = 31/226 (13%)

Query: 30  CDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL--HRLECQVLSRLDKEKRKSVTPSIR 86
           CD C A ++   +CSAC++ WY     Q+  W+   HRLEC  L       R  V  +  
Sbjct: 50  CDCCLAPAAQPLRCSACKLAWYASREHQRRAWQAGAHRLECAAL-------RSCVPHTPP 102

Query: 87  LMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP 146
             ++L LRR               Y  V  L  H  D+ +   L  AQ+    + +LQ  
Sbjct: 103 TAVRLALRR---------------YGEVLGLRHHWGDLQDAAKLECAQMGAAAHHLLQAA 147

Query: 147 E------ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
                  +   E+A   ++  CN+HTI + EL+PL  G++P+ ++ NH C PN +  F G
Sbjct: 148 APEAAEAVGPRELALLIARFGCNSHTISDDELQPLAVGIFPLGAMANHDCRPNTLHAFRG 207

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
              V RAV+ +  G EV  SY E A     R+  L + +LF    P
Sbjct: 208 GRMVFRAVRAIQPGEEVTTSYTELAAPRWERRAVLLQHHLFDIDAP 253


>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 770

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 108/439 (24%), Positives = 176/439 (40%), Gaps = 71/439 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHRL 65
           G+ I  + PY  V     + + C  C     N   C+ C  V YC   C K  W   HR 
Sbjct: 266 GDAIFVERPYASVLLPGHTKTNCQHCHKRLLNAVPCAQCNQVRYCSFACAKDSWNSYHRW 325

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLR------RKLQNDNVIPSTTTDNYSLVEALVA 119
           EC  L+ L       +  ++R++L   L       R L    V        Y  V  LV 
Sbjct: 326 ECGNLNLLYSVGIAHL--AVRVLLVTGLSGLADFCRHLAEGKV-DVDKNGGYGSVHELVT 382

Query: 120 HMSDIDEKQLLLYAQIANLVNLILQ-----------W--PEISINE-------------- 152
           H   +  + LL Y+  A L++++L            W   E+S                 
Sbjct: 383 HSEKMHVEDLLQYSLTAALLSMLLDHVLFFNVDEASWKMSELSFGSRNGSSYRLASTDAA 442

Query: 153 ---------IAENFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHS 189
                    +  +  +L CNAH I + E +               + T +YP  S++NHS
Sbjct: 443 ALKTVVGGLLLRHIQQLVCNAHAITSLESKTSQEDDVVMTTEQVRIATAIYPSASLMNHS 502

Query: 190 CLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCP 246
           C PN    F  G   VVRA++ + +G EVL  Y       S   RQ+ L+EQY F C+C 
Sbjct: 503 CDPNIFSSFRCGSTLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQLLQEQYFFVCSCT 562

Query: 247 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 306
            C      +D ++   L+  +C  + C G L +  D    TC  CG  +     ++ A  
Sbjct: 563 AC---SSGEDAEQR--LQALKC--EYCEGPLKQPDDSGKATCLDCGTTQECLGSEQKAFR 615

Query: 307 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 366
           ++ L  + + L   GNH E +       K ++K+ +  ++ L++ ++ + K L  L D++
Sbjct: 616 MHDLYVQGVQLAEKGNHVEALQRLNKCLKTREKVMYKHNLKLLEVKDMVAKCLCSLGDFR 675

Query: 367 EALAYCQLTIPVYQRVYPQ 385
            A    +  +   + +Y +
Sbjct: 676 SASGVLKPAVDAVRVIYGE 694


>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 838

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 116/300 (38%), Gaps = 69/300 (23%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSN----LKKCSACQVVWYCGSNCQKLDWKL 62
           +G+  +   PY C  ++S   +RCD  F  ++    L +C+ C+   Y G + Q   WK 
Sbjct: 43  AGDEALVARPYACALHDSQRAARCDHTFKRADDGGALLRCARCKHARYLGRDAQAAAWKR 102

Query: 63  -HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN----------- 110
            HR EC  +    K  R  V P+   M    L RK + +         N           
Sbjct: 103 GHREECAAIESAAKGGR--VPPATARMAARALWRKAREEKEKSDREKKNADDADDSDAAD 160

Query: 111 -------------------YSLVEALVAHMSDID---EKQLLLYAQIA------------ 136
                              Y  + +L  H  D D     +   +AQ+A            
Sbjct: 161 ARDDAEDVAAAAAMGLGEGYDALRSLTHHWDDTDALPPARKATFAQMAMLTCAYIARVLD 220

Query: 137 --------------NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
                         N V+ +   P+    ++A   +  ACN HTI + EL P+G G+YP 
Sbjct: 221 PSAAANPKHDPKSPNFVDPLAGGPDP--RDVARLLAAFACNVHTIHDDELNPVGMGVYPT 278

Query: 183 ISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            +++NH+C P     F EG  A  R V+ V KG E+ ISY+E   +   R+  L   Y F
Sbjct: 279 AAMMNHACAPTVAQTFGEGTTATFRCVRDVRKGEEIAISYVELLATRAERRAELLNHYRF 338


>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
          Length = 241

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 43/260 (16%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           F +L CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 1   FPQLICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIQAGEE 60

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           + I Y++   ++  R++ L+ QY F C C RC                            
Sbjct: 61  LTICYLDMLMTSAERREQLRNQYCFDCDCARC---------------------------- 92

Query: 277 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 336
           L +D D    T  +      +E +KKI  E+    K    L  C   Q ++S+    E+L
Sbjct: 93  LTQDKDADMLTGDEKVWKEVQESLKKI-EELKAEWKWEQVLALC---QAIISSNS--ERL 146

Query: 337 QKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 394
                    +N+ Q +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q 
Sbjct: 147 P-------DINIYQLKVLDCAMDACIRLGLLEEALFYGIRTMEPYRIFFPGCHPVRGVQV 199

Query: 395 YTCGKLEWFLGDTENAIKSM 414
              GKL+   G    A++++
Sbjct: 200 MKVGKLQLHQGMFPQAMRNL 219


>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
           aries]
          Length = 391

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 70/218 (32%), Positives = 112/218 (51%), Gaps = 19/218 (8%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE++   +P     +  S    CD C      L +CS C++  YC + CQK  W+ H+ E
Sbjct: 27  GELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWQDHKRE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L      K +    S+RL+ ++    KL  +   PS +   YS  + L ++++ + E
Sbjct: 87  CKCLKSC---KPRYPPDSVRLLGRVVF--KLMQET--PSESEKLYSFYD-LESNINKLTE 138

Query: 127 ------KQLLLYAQ--IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
                 +QL L  Q  +   +    Q P     +I E F+K+ CN+ TICN+E++ +G G
Sbjct: 139 DKKEGLRQLALTFQHFMREEIQDASQLP--PSFDIFEAFAKVICNSFTICNAEMQEVGVG 196

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           LYP +S++NHSC PN  +VF G   ++R V+ V  G E
Sbjct: 197 LYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDVEAGEE 234



 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%)

Query: 366 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 425
           +EAL Y   T+  Y+  YP  HP+ G+Q    GKL+   G    A+K++  A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367

Query: 426 G 426
           G
Sbjct: 368 G 368


>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
 gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
          Length = 429

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 154/351 (43%), Gaps = 45/351 (12%)

Query: 30  CDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C+ C  +   LKKCSAC+ V YC + CQ+ DWKLH+ EC+ +    +   +    SIRL+
Sbjct: 26  CNQCLTNMVELKKCSACRKVAYCSAECQRADWKLHKRECKAI----QAHGEVAIDSIRLV 81

Query: 89  LKLYLRRKLQNDN--------VIPSTTTDNYSLVE---ALVAHMSDIDEKQLLLYAQIAN 137
           +++     L N N         IP       +L++    L A   D  E+ L        
Sbjct: 82  MRIV---GLLNQNEVGQFSEEYIPGGIRSFLTLMDHGNHLNAEAEDFAEQYL-------- 130

Query: 138 LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
             N  L  P      I   F K++ N+ ++ NS    +G  L   +S  NHSC P+  + 
Sbjct: 131 --NFALP-PHSHPETIKSIFKKVSVNSFSLSNSTGNSIGIALCVKLSAANHSCKPSTRVC 187

Query: 198 FEGRLAVVRAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
           + GR A++  V   +P   E    SYI+   +  TRQ  LK++Y F C C  C       
Sbjct: 188 YRGRTAMLVPVDDRLPTSLEGACHSYIDELQTLSTRQATLKKKYKFDCACEGCT------ 241

Query: 256 DIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKK 313
           D + +  +E + C  + C    +R+ +  G +C  CG  L R + E+ + A E  I S+ 
Sbjct: 242 DDERNGRMEAWAC--EICKTGWIRNVE--GASCNPCGYVLTRDQYELCRTAEEAAIASRP 297

Query: 314 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 364
            L   S           K++E  Q  L H ++V+ +     L    + + D
Sbjct: 298 KLENDSIPLETRRHLCEKLLELFQDTL-HAYNVHRIPVLRCLFVATLAIRD 347


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 119/248 (47%), Gaps = 24/248 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWK-LHRLE 66
           G+ I++  P + +P+ S + + C  CF     + CS C  V YC ++CQ   WK +H  E
Sbjct: 27  GDTILTFTPTILIPSLSHASTVCSHCFKPGEPRSCSRCHAVAYCDASCQSAAWKAIHSKE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ------NDNVIPSTTTDNYSLVEALVAH 120
           C+VL ++  + R  +   +R +++  ++ ++         N +    +D ++ +E +   
Sbjct: 87  CKVLQKVSAQGRPGLPTPVRAVVQALVKAEVGAALEDLEGNDLSWRGSDKWADMEMMAMG 146

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
            S         +A +           + ++ +      K+  NA    ++++  +G  L 
Sbjct: 147 AS--------AFAGLGTT--------QEAVKKALSLLCKIQTNAFHRYDADIGQVGIFLE 190

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           P +++ NHSC+PNA + F GR A++RA + +    E+ ISY +       R++AL   Y 
Sbjct: 191 PRLAMANHSCIPNATVQFVGRRAILRAERPIKADEEIEISYTDYNYPLSKRKEALAP-YF 249

Query: 241 FTCTCPRC 248
           FTC C RC
Sbjct: 250 FTCECTRC 257


>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 518

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 115/499 (23%), Positives = 197/499 (39%), Gaps = 88/499 (17%)

Query: 7   SGEVIISQEPYVCVP-----------------NNSSSISRCDGCFAS-SNLKKCSACQVV 48
           +G  I+   P + VP                  N  +++ C  CF   S    C  C+ V
Sbjct: 29  AGYCIVEAHPEIFVPLSVKYMAPRVIDAENEETNYRTVNICFYCFEKFSKSVYCPNCKYV 88

Query: 49  WYCGSNCQKLDWKLHRLECQVLSR--LDKEKRKSVTPSIRLMLKLYLRRKLQNDNV---- 102
            YC   C  L WKLHR EC+V      DK       PSI  ++++ +   L + N     
Sbjct: 89  VYCSDVCLDLAWKLHREECEVFKSNIFDK-----FCPSI--IMRMVINSYLNHFNFYEYC 141

Query: 103 -----IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF 157
                +P    + +     +VA      +K+L      +N  +      E  +  I E F
Sbjct: 142 GSVEELPKEKYEYFKYPAYIVAVALMSKKKKLF-----SNFED-----NESILKNIIEKF 191

Query: 158 SKLACNAHTICNSELRPLGTGLY-PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
            K++ N+  I +++L P G   Y   +   NHSCL N V +F+ +   +R +  V  G E
Sbjct: 192 VKISKNSLQIIDNDLEPAGLAFYKKPVPYFNHSCLSNCVTIFKNQKLFIRTLVDVYPGEE 251

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTC---------------------------TCPRCI 249
           + ISY++ A    TR     +QY FTC                           +C +C+
Sbjct: 252 LTISYVDIAFDRNTRLAICMDQYFFTCSCKLCKVNMVSECQHIFNTELLCTQSESCKKCV 311

Query: 250 KLGQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKG--FTCQQC----------GLVRS 296
              +   I E      Y  K+   S + + + ++++   + C  C           LV  
Sbjct: 312 NHMELLLISELERKSCYMNKNQFKSAYPVFKKTNEQQHVWKCSICKGEVHDNVIKSLVEK 371

Query: 297 KEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
           ++E  K AS +  L  +     +    Q +      I+ L    YH    +L + R K++
Sbjct: 372 EKETVKEASHLESLFSEKYTYDNKSVLQALNKIKSKIDYLT-TFYHHTKYSLQKMRAKIL 430

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
            I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL      +  + +
Sbjct: 431 YISIQLQEFKLAYNIANQYLKSIEVTYGKYSPIYGYYIFLTGKLALFLDLKSEGLSLIHK 490

Query: 417 AVEILRITHGTNSPFMKEL 435
           A + +  T+G +SP  K+L
Sbjct: 491 AKKNIIKTYGPDSPIYKDL 509


>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 382

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 82/331 (24%), Positives = 147/331 (44%), Gaps = 38/331 (11%)

Query: 112 SLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHT 166
           + VE L  H++D+ E +L  L   I N ++    WP  S    I++I+  F  + CN  T
Sbjct: 28  TTVEELEDHIADLQEDELKELKVDIHNFLDY---WPRNSKQHTIDDISHLFGVINCNGFT 84

Query: 167 ICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 225
           + +   L+ +G GL+P + ++NH C PN  ++       +R++  + +G E+ ++Y++  
Sbjct: 85  VSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFL 144

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDK 284
             +  R++ LK QY F C C  C                  R KDD   +G   R+ D  
Sbjct: 145 NLSEERRRLLKTQYFFDCQCEHCRN----------------RTKDDLKLAG---REVD-- 183

Query: 285 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 344
                  GL  S+E++K+       + +K       G++ EVV   +      + +    
Sbjct: 184 -------GLKPSEEQVKEATDYCFQMLEKMDKARLDGDYHEVVKICRDAIDRTEPVLADT 236

Query: 345 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 404
            + L++    L ++   L+ + +A  Y +  +  Y ++YP  +  LG+     G   W  
Sbjct: 237 HIYLLRMWSTLSEVQAYLQYFNDAATYSRKMVEGYMKLYPPNNAALGMAAMRAGVNHWQA 296

Query: 405 GDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           G  E     + +A  IL +THG   P  K+L
Sbjct: 297 GLIEVGHGMVCKAYAILLVTHGPTHPITKDL 327


>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
           24927]
          Length = 526

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 121/262 (46%), Gaps = 30/262 (11%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSN---------------LKKCSACQVVWYCGS 53
           E++   +P +C+P+ +   S C  C A +                L  C  C+VV YC  
Sbjct: 34  EILSVADPLICIPDEAHLDSCCHYCMAEATDEAPSVNQAYRAPVKLSYCLGCRVVKYCSK 93

Query: 54  NCQKLDWK--LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS-----T 106
            CQ  DWK   H+ EC V+ +    +   VT    L +      +    N++        
Sbjct: 94  ACQTTDWKQKAHKYEC-VIYKAQYPRILPVTARAILRMAKQFLSETPGSNIVGGIGALKA 152

Query: 107 TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHT 166
             ++++ +   +  M+++  K    +++++  ++     PE     + + + KL  N+ +
Sbjct: 153 HAEDFAKIGGDIWEMANLTAKAAAEFSKVSKQISF---EPEF----LRDMYCKLLINSAS 205

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 226
           +      P+G  L    ++ NHSC PNAV++F+GR   +R+++ + K  E+ ISYI+   
Sbjct: 206 VVTQTFDPIGLCLAYQSAMFNHSCDPNAVMMFDGRQLSIRSLKEITKDTEITISYIDNLA 265

Query: 227 STMTRQKALKEQYLFTCTCPRC 248
           S   R++ LK +Y F C+C  C
Sbjct: 266 SRKERKEELKSRYFFDCSCSLC 287


>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
 gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
 gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
          Length = 429

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 141/298 (47%), Gaps = 30/298 (10%)

Query: 28  SRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIR 86
           S C+ C  S + LKKCSAC+ + YC   CQ+ DWKLH++EC+ +    K   +    SIR
Sbjct: 24  SYCNQCLTSMAELKKCSACRRLAYCSQECQRADWKLHKVECKAI----KTHNEVANDSIR 79

Query: 87  LMLKLYLRRKLQNDNVIPSTTTDNYSL-VEALVAHMS--DIDEKQLLL-YAQIANLVNLI 142
           L++++  +     D  I +      +   + L  H S  D DE+  +  Y Q A     I
Sbjct: 80  LVMRIAGKLSRNEDGEIEAYYIPGVARNFQNLEHHPSSYDADEESFVKEYFQFA-----I 134

Query: 143 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 202
              P+  + ++   F K++ N+  + NS   P+G GL   +S  NHSC P   + +  R 
Sbjct: 135 APHPDRDLIKLI--FQKVSINSFVVGNSTGNPIGVGLCIKLSAANHSCKPLTRVCYRNRT 192

Query: 203 AVVRAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
           A++  V   +P   E    SYI+       R+  LK++Y F C C  C+      D   +
Sbjct: 193 AMLVPVSSELPSTLEGACHSYIDELMPRDMRRDTLKKKYKFLCQCDGCL------DEDRN 246

Query: 261 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE--EIKKIASEVNILSKKTLA 316
           A +E + C    C    +R+ ++    C+ CG   SK+  E+ + A E  I ++  LA
Sbjct: 247 ARMEAWTC--GICVKGWMRNKENG--QCELCGWTMSKDHFELCRTAEEAGIAARSRLA 300


>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
          Length = 542

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 68/224 (30%), Positives = 104/224 (46%), Gaps = 10/224 (4%)

Query: 30  CDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDK--EKRKSVTPSIR 86
           C  C  +   + CS C+  +YC + CQ   W   H LEC  L R  K  +KR+ +   +R
Sbjct: 52  CSFCLRAGTPRPCSRCRAAYYCDARCQAAAWSGGHSLECAALVRAVKSSKKRREIPTPVR 111

Query: 87  LMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP 146
            ++K+ L          P   + N   +E  VA          +    +        +  
Sbjct: 112 ALVKVLL------SCGQPEDLSKNMDGLEGHVAERRREPGWADMEMMAMGGCAFAGRETS 165

Query: 147 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 206
           E S+ +  E   KL  NA    +++L  +G  L P +++ NHSCLPNA + F GR AV+R
Sbjct: 166 EESVRQAVEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLR 225

Query: 207 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           A   + KG E+ ISY +   S++ ++KA    Y F C C RC +
Sbjct: 226 AESRIQKGDEIEISYTDYT-SSLGKRKAALAPYNFECRCRRCTQ 268


>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 570

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 125/256 (48%), Gaps = 30/256 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISR-CDGCF-ASSNLKKCSACQVVWYCGSNCQKLDWK-LH 63
           SG VI+       +     S+SR C  C   S  +K+CSAC++  YCG  CQ   W+ +H
Sbjct: 47  SGSVILKLHSPNLLLVEKESLSRVCSFCMRESQTMKRCSACKIPHYCGKACQIGHWREIH 106

Query: 64  RLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
             EC +L +L       V P+ +R ++ + LR+  +++        D     +AL +H+ 
Sbjct: 107 GRECALLKQL-----PDVPPTAVRALIVMLLRKNSESEG-------DEEKGWKALESHVG 154

Query: 123 DI--DEK---QLLLYAQIANLVNLILQWPEISINEIAENFSKL---ACNAHTICNSELRP 174
            +  D K   ++LL A+        +++ +  +  + E    L   + NA  I   +  P
Sbjct: 155 KLQGDRKRWEEVLLQARAG------IEFTKSRVERMEEAIMYLCVLSTNAFRITLPDNTP 208

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
            G    P +++ NHSC PNA++V   R   +RA++ + K  ++ ISYI+      +RQ  
Sbjct: 209 FGMCFSPTLALANHSCKPNAIIVSNSRSISLRALRPIKKNEQIFISYIDPTEDLPSRQSK 268

Query: 235 LKEQYLFTCTCPRCIK 250
           LKE+Y FTC C  C K
Sbjct: 269 LKERYFFTCKCDSCEK 284


>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
          Length = 576

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 115/260 (44%), Gaps = 29/260 (11%)

Query: 15  EPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSR 72
           EP V VP   ++   C+ C       K C+ C+ V YCGS CQ+  W  +H+LEC+VL +
Sbjct: 54  EPLVVVPFGVAAAQTCNHCLDPRRPAKACTGCRAVAYCGSRCQRAHWTAVHKLECRVLKK 113

Query: 73  L----------------------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
                                  D       TPS   ++   +R  LQ     P+   D 
Sbjct: 114 ALEMSREAEEEKEKEEQKKGDESDSTALTRTTPSTLRIVPAPVRALLQILLQWPAVK-DT 172

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP--EISINEIAENFSKLACNAHTIC 168
            + +E  VA      +       Q A+    +  WP  + S+    E   K+  N+    
Sbjct: 173 VAGLEGNVAGFRARPDLWKDFEVQ-ASAACALAGWPATDASLAIAVEAMCKIHTNSFDRS 231

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
           + ++   GT L   +++ NHSC+PNAV+ F GR A +RA Q +  G EV ISYI+     
Sbjct: 232 DVDVAYTGTFLDAHLAMANHSCVPNAVVSFAGRKAFLRAEQAIRAGDEVTISYIDYTKPK 291

Query: 229 MTRQKALKEQYLFTCTCPRC 248
             RQ+ L + Y FTC C RC
Sbjct: 292 SVRQRGL-DLYHFTCDCARC 310


>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 319

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 127/285 (44%), Gaps = 34/285 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNL--------KKCSACQVVWYCGSNCQKL 58
           +G  ++  EPYV + N      RC  C  + +L        K C +C+ V YC   C+  
Sbjct: 26  AGTCVLEDEPYVYILNEEQLQLRCGHCLRALSLGNTSEVESKNCKSCKRVVYCNKACKVA 85

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRK---LQNDNV----IPSTTTD-- 109
             K HR EC++L   +K+       ++RL++K+ L+ K   L+  N      PS+  D  
Sbjct: 86  GRKEHRYECKLLQGKEKDD-----VTLRLLMKMILKSKNYELKKSNGTLCEFPSSYRDLC 140

Query: 110 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICN 169
           ++ L      H+       L    +      L          E+ + + KL  N+ ++ +
Sbjct: 141 SFQLSTKAQEHLLS-----LFRCYEKGWRSKLRSHLSPFRSQEVLDTYQKLTINSFSMYD 195

Query: 170 SELRPL-GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
              R + G  LY   S+ NHSC PN   VFEG    VRA++ +  G E  ISY+ +   +
Sbjct: 196 EMTRTIVGEALYIRASMFNHSCEPNCTFVFEGSRLSVRAIKRIEIGEECCISYMSSLLPS 255

Query: 229 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
             R++ L+  Y FTC CPRC+      D     ++   +C ++ C
Sbjct: 256 PLRKEKLRSIYGFTCQCPRCL------DSARDNLMLCVKCPNESC 294


>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
           mutus]
          Length = 221

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 73/258 (28%), Positives = 123/258 (47%), Gaps = 39/258 (15%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           F ++ CN+ TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ V  G E
Sbjct: 1   FLQVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 60

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           + I Y++   ++  R+K L+ QY F C C RC    Q  D                    
Sbjct: 61  LTICYLDMLMTSEERRKQLRGQYCFDCDCFRC----QTQD-------------------- 96

Query: 277 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 336
             +D+D          ++   E++ K   EV    KK   L +    ++V++  + I   
Sbjct: 97  --KDAD----------MLTGDEQVWK---EVQESLKKIEDLKAHWKWEQVLAMCQSIISS 141

Query: 337 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 396
             +     ++  ++  +  +   + L   +EAL Y   T+  Y+  YP  HP+ G+Q   
Sbjct: 142 NAERLPDINIYQLKVLDCAMDACLNLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMK 201

Query: 397 CGKLEWFLGDTENAIKSM 414
            GKL+   G    A+K++
Sbjct: 202 VGKLQLHQGMFPQAMKNL 219


>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
          Length = 526

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 114/245 (46%), Gaps = 19/245 (7%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KLHRLEC 67
           ++I +  P +  P+ S   S C  C    + + CS C   +YC ++CQ+  W  +H  EC
Sbjct: 26  QIIHTFHPLILHPSLSQLASVCTHCLRPGSPRACSRCHAAYYCNTSCQQAAWTAVHSKEC 85

Query: 68  QVLS--RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           + L   +L       +   +R +L+  L ++ ++              + AL  H  +  
Sbjct: 86  KALQQRKLGSRTGADLPTPVRALLQTLLNKETEDG-------------IAALDGHTEERR 132

Query: 126 EKQLL--LYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
           + +    L            +  E +I    E   K+  NA    + +L  +G  L P +
Sbjct: 133 KTKSWPDLEMMAMAACAFAGRQGESNIRRAVELLCKIQTNAFHRWDVDLGQVGVFLEPTL 192

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           ++ NHSC+PNAV+ F GR A++RA + +  G E+ ISY +      TR++AL EQY F C
Sbjct: 193 AMANHSCVPNAVVQFVGRKAILRAERPIHAGDEIEISYTDYTMPLSTRREAL-EQYSFEC 251

Query: 244 TCPRC 248
           TC RC
Sbjct: 252 TCARC 256


>gi|308499663|ref|XP_003112017.1| CRE-SET-18 protein [Caenorhabditis remanei]
 gi|308268498|gb|EFP12451.1| CRE-SET-18 protein [Caenorhabditis remanei]
          Length = 507

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/446 (21%), Positives = 179/446 (40%), Gaps = 48/446 (10%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           +   + I + P+    +N      C  C         L +C  C    YC   CQ   W 
Sbjct: 29  IHARIKIFETPFATQVSNPKVNEFCANCMRGPAPGEKLLRCGGCNFSMYCSKECQATAWL 88

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-------QNDNVIPSTTTDNYSLV 114
           +H+ EC+ L        K+  P++ L   L+L + +       +N + +   +   +S  
Sbjct: 89  VHKPECRRL--------KASFPNLPLTEVLFLSKIIDRIQFLEKNGDKLGVESARKFS-- 138

Query: 115 EALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +LV H  DI  DE+++  + ++   + L      I   +  + F K + N+H+I  +  
Sbjct: 139 -SLVDHKVDIREDEEKMKHFEKVFTKMGLFRGDEMIEKGQFFDVFCKASINSHSIHTNAG 197

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGS 227
             +G  L   +S+ NHSC P   +VF+G    +R +  VP        +  ISYI+   S
Sbjct: 198 NEVGMALDLGVSMYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAESTEQAFISYIDVGRS 255

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--G 285
              R+K LK ++ F C C RC+         E   L   RC +  C   +L    ++   
Sbjct: 256 KYVRRKDLKSRWYFDCECTRCMD-------PEDDALTAIRCANPACDAPVLTSETEEPMN 308

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
             C +C  +  + ++KK    +  L             Q+++S  +++        HP +
Sbjct: 309 IACDKCKTIIEENDVKKAQEFMKSLPASFDPQCPADILQDLLSKAEVV-------LHPSN 361

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           V + + R  L  +  +L     +  + Q+    Y+  +P+    +G Q     K      
Sbjct: 362 VYVARLRTALFHVTGQLTMDNLSTMHTQI-YNNYKLCFPKADRHVGFQLLHIVKALIEKD 420

Query: 406 DTENAIKSMTEAVEILRITHGTNSPF 431
           + + A+    +A+ I  +  G + P+
Sbjct: 421 ERDEAMPYAFDAMNIFEVCFGLDHPY 446


>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
          Length = 558

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/246 (28%), Positives = 117/246 (47%), Gaps = 36/246 (14%)

Query: 18  VCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDK 75
           + +P  ++  + C  C A+    ++C+AC+   YC + CQK  W  +H  EC  L     
Sbjct: 57  LALPPMAARTTTCSSCLATPPTPRRCTACKATAYCDAACQKSHWSSVHARECAALRAAGG 116

Query: 76  EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQI 135
           E   +V   +R+ +++ L               + ++ VEAL+    D  E +   +A +
Sbjct: 117 EG--AVPAYVRMAMQVLL-------------WPEKFAAVEALLDGNVDRFEGRAEAWAGM 161

Query: 136 ANLVNLILQWP-----------EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PV 182
                ++  W            E++   + E   K+  NA T   +     G GL+    
Sbjct: 162 EVPAKVMGGWVAAGAKGKGKAVEVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTT 216

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC+PNA++ F GR A +RA++ + +G E+ ISYI+   S   R+KAL E Y F 
Sbjct: 217 LAMVNHSCVPNAIVAFSGRRAFLRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQ 275

Query: 243 CTCPRC 248
           C C RC
Sbjct: 276 CVCTRC 281


>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
           2509]
          Length = 547

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 73/281 (25%), Positives = 126/281 (44%), Gaps = 37/281 (13%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNL--------------KKCSACQVV 48
           SF     + I   P + +P++++  + C+ C   S                K C+ C+  
Sbjct: 31  SFGAGETIAIIDNPLLALPDDANMRTTCNYCLYVSGTIEFEGDVEAGPRTCKACTGCKAA 90

Query: 49  WYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
            YC + CQ+  WKL H+ EC++  R+ +   K   P+        L            + 
Sbjct: 91  VYCNAECQRAHWKLVHKAECKMFKRIKERTGKDWLPTPAFGPGGTLE-----------SN 139

Query: 108 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTI 167
            + +   E L     ++     ++YA +       LQ  E ++ +  E   K+  NA   
Sbjct: 140 VEGFKTDEGLWGDF-ELQATGAVVYAGL-------LQSDE-TLKQAMEVLCKIQTNAFNR 190

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 227
            +++    G  L+P +S++NHSC+PNA + FE R AV++A + +  G E+LISYI+    
Sbjct: 191 FDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAVLKAERDLEPGDEILISYIDHTMP 250

Query: 228 TMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 267
              RQ++L+  Y F C C RC   L  ++ IQ   +  G +
Sbjct: 251 RRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 290


>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
          Length = 374

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 122/254 (48%), Gaps = 18/254 (7%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           +A   S+++ N+ TI + +   LG GLY   S ++HSC PNA   FEG    VR ++ + 
Sbjct: 95  MAFYLSQISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANFEGTKVCVRCIEPIE 154

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 272
              ++ ISY+     T TR+K L ++YLF CTC  C+      DI +  +   ++C   G
Sbjct: 155 DEKDLRISYVSPLDDTATRRKNLLQKYLFECTCEACL------DIDQDQMKFSFKCVRGG 208

Query: 273 CSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVN------ILSKKTLALTSCGNHQE 325
           C G + R  D++ F C  CG  + +++EI+ +   V        +SKK   L + G + E
Sbjct: 209 CKGHMTRMQDNR-FRCDYCGEKQVTEDEIQTLNVAVEKTRRMMDISKK---LKAQGRYAE 264

Query: 326 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRVYP 384
           ++       K      H  ++ +++ R+ L+      ++W      C +  +  Y+  Y 
Sbjct: 265 IMVAGFACLKDAVDFLHSDNLLIVELRDCLLNACRMTQNWSVEAIQCGIANLEPYRSFYG 324

Query: 385 QFHPLLGLQYYTCG 398
           + HP L +  Y  G
Sbjct: 325 RHHPALAILLYDIG 338


>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 551

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 119/256 (46%), Gaps = 29/256 (11%)

Query: 8   GEVI-ISQEPYVCVPNNSSSISRCDGCF------ASSNLKKCSACQVVWYCGSNCQKLDW 60
           G+VI +  EP V +P    ++  C+ C           L+ C+ C+ V YCG+ CQK +W
Sbjct: 32  GDVIALFDEPAVVLPPGHRALEYCNHCLKRQGHAGVGKLRACTGCKTVAYCGAACQKANW 91

Query: 61  KL-HRLECQVLSRLDKEKRKS----VTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
            L H++EC+ +  L + K       V   IR   ++ LR KL+          +++  +E
Sbjct: 92  TLIHKVECKAIQFLHEVKPAHQPDWVPTPIRAGAQVMLRPKLK----------ESFDALE 141

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS---KLACNAHTICNSEL 172
             V      DE  L L A   + V        I++  + E+F    KL  NA +      
Sbjct: 142 GHVEEWRKNDEMDLQLQA---HGVLRCCGLDAITLKGLEESFQVLCKLQTNAFSRTEEYY 198

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
              G  L   +++INHSC+PNA++ F GR A +R+   +  G E+ ISY +       RQ
Sbjct: 199 ETGGVFLDTTLAMINHSCVPNAMVQFGGRTATLRSATFINPGDEIEISYADHTQPKGKRQ 258

Query: 233 KALKEQYLFTCTCPRC 248
             L + Y F C+C +C
Sbjct: 259 GEL-DLYHFECSCYKC 273


>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
 gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
          Length = 909

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 23/299 (7%)

Query: 153 IAENFSKLACNAHTIC------------NSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
           +  +  +L CNA  I             + ++R L T ++P  +++NHSC PN  + F+G
Sbjct: 606 LCHHMQQLRCNAQAITTLQEQDSVSLLEDKQVR-LATAVFPTEALLNHSCRPNVFVSFQG 664

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQ 258
           +  +VRAV H+  G E+L  Y   AG  +   RQ ALKEQY F+C+C  C +  Q  +  
Sbjct: 665 KTLIVRAVSHIKPGEELLHCYGPHAGRMVYGERQAALKEQYFFSCSCDACQE--QVGNPN 722

Query: 259 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ--QCGLVRSKEEIKKIASEVNILSKKTLA 316
              +   Y+C     +  L     DK   C    C      ++IK  + ++  L  ++ A
Sbjct: 723 TVDMFSAYKCPVCNNAAKL----QDKKLVCTSPNCDAQGDTDDIKTTSKKIQDLFVQSSA 778

Query: 317 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 376
               G  QE VS  K    LQ K+ HP + ++ +T + L +      D K+++ + + ++
Sbjct: 779 FLEEGQIQEAVSGLKQCLVLQWKILHPSNKDIARTHDALARCYATSGDCKKSVEHLKKSL 838

Query: 377 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
              +  +      L  + +   +L++        + ++  A+ I    +G + P  KEL
Sbjct: 839 STVELQFGACSVELAHELHKLAQLQFNGQQVAECLDTIERALTIFGCHYGDHHPKSKEL 897


>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
           heterostrophus C5]
          Length = 508

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/445 (21%), Positives = 189/445 (42%), Gaps = 66/445 (14%)

Query: 39  LKKCSACQVVWYCGSN-------CQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLK 90
           L  C+ C  +W  G++       CQK  W + H+ EC++L  +     + +  +    ++
Sbjct: 64  LDTCANC-YLWTEGASAGTRLYACQKEAWNRGHKHECKILKPM---AGRGLPKAFLASME 119

Query: 91  LYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI 150
           L  RRK     +IP     ++ L+  L +H+ D          ++  +       P+   
Sbjct: 120 LLTRRK---HGLIPDK---DWELLCQLPSHIDDFKRNATYQNIEMMAMGAAQFSLPQNVF 173

Query: 151 NE--IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           ++  +A  + ++  NA T+    L PLG  + P +  +NHSC PNA ++ +G L  +R +
Sbjct: 174 DKDFVAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLVSIRTL 233

Query: 209 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 268
           + + K  E+ ISYI+T      RQ  L+ ++ FTC C +C         Q+ A  +    
Sbjct: 234 RPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQEDNW 284

Query: 269 KDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS 328
              G S F+      +     Q  +    E+++ + +   ++     AL +C        
Sbjct: 285 LVPGNSDFVASKEQVQALASNQEQMFTLYEQVQGMPNAEAVIPVLKEALKACH------- 337

Query: 329 TYKMIEKLQKKLY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ-- 385
                E     LY  P++      R++LI  L+ +  ++EA A+   +   Y+ + P+  
Sbjct: 338 -----ESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAH---SAKRYKYILPKLY 385

Query: 386 ---FHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSP 430
              FHP+  +Q +    L  +L  TE  + +            + + +++  ++H  ++ 
Sbjct: 386 SVPFHPVRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHSQDNA 445

Query: 431 FMKELILKLEEAQAEASYKLSSKDE 455
           F K ++ K  E   E    + + D+
Sbjct: 446 FTKSVVEKTREMVEELKRSVGNPDK 470


>gi|391338176|ref|XP_003743437.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Metaseiulus
           occidentalis]
          Length = 409

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 132/274 (48%), Gaps = 21/274 (7%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           +G+ +   EPY  + + +S  S C  CF +    +CS C  + YC   CQ+ D    R+E
Sbjct: 6   AGDTVFEDEPYAAIVSATSLDSVCSFCFKNYVEYRCSLCNRLSYCTEGCQQADVFHDRVE 65

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL---VEALVAHMSD 123
           C  L+  D    K V   +RL+++   R K +      S   + + L   ++ LV+H+ D
Sbjct: 66  CIALADQD----KIVEDEVRLVIRTAARYKHEQQRNYAS-LGNFFGLERGLDDLVSHLED 120

Query: 124 IDEKQLLLYAQIANLVNLILQWPEISI--NEIAENFSKLACNAHTICNS---ELRPLGTG 178
           +++ +       A  +  IL    +SI   E+ E + K   N H + +    E    G  
Sbjct: 121 LNDDERQKVKAKAQQIKSILDRVGVSIERKELMEIYMKCKINCHLVIDHNEPEYLSRGRA 180

Query: 179 LYPVISIINHSCLP--NAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKA 234
           +Y   S ++H+C+P  N  L+F+GR  V+RA+++  +   +++ I Y+ +  S   R+  
Sbjct: 181 IYLAASKLDHTCVPTENYTLMFDGRRLVIRAMKNFTLKDPSDLRIHYVPSNLSYNDRRDR 240

Query: 235 LKEQYLFTCTCPRCIKLGQF----DDIQESAILE 264
           L + Y F C+C +C+   +F    DD +E  + E
Sbjct: 241 LLKNYFFLCSCDQCVSQLRFPAADDDSREERVSE 274


>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 487

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 127/254 (50%), Gaps = 29/254 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDW-KLHRL 65
           G+ I+ + PY    +++     CD C    S LKKCS+C  V YC  +CQK DW + H+ 
Sbjct: 30  GQCILEELPYAYTLHDNKRGLFCDFCLKKCSTLKKCSSCNYVSYCNKSCQKKDWARCHKQ 89

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ-------NDNVIPSTTTDNYSLVEALV 118
           +C+ L ++       V   +  +L   +R++ +       +++  P+T       V+ L 
Sbjct: 90  DCKTLKKI----HPRVPSDLAQLLSQIIRKRRKSAPCSQDDEDCFPTT-------VDQLE 138

Query: 119 AHMSDIDEKQLLLYAQIANLVNL--ILQWPEISIN--EIAENFSKLACNAHTICNSELRP 174
           +H      ++L + + ++ L  L   ++  ++  +   + + +    CN+  I +++L  
Sbjct: 139 SH-----HEKLSIESSVSALHKLKHCIEEEDVPTDPRSLLKMYGATNCNSFGIFDNDLIA 193

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +  G+Y   S++NHSC  N ++VF+ R   +R V+ V +G E  I Y++       R+  
Sbjct: 194 ISDGIYLRASMVNHSCDYNCIVVFDERKLQLRTVKDVKEGEECTIGYVDVIHPAKERRAE 253

Query: 235 LKEQYLFTCTCPRC 248
           L+E+Y FTC C +C
Sbjct: 254 LEEKYHFTCKCVKC 267



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 40/67 (59%)

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           S++NHSC PN   V +GR   +  V+ V +G E  I+Y++       RQ  LKE+Y FTC
Sbjct: 364 SMLNHSCDPNCAWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYHFTC 423

Query: 244 TCPRCIK 250
            C +CI+
Sbjct: 424 KCVKCIE 430


>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
          Length = 513

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 116/245 (47%), Gaps = 19/245 (7%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KLHRLEC 67
           + I +  P +  P+ S   S C  C    + + CS C   +YC ++CQ+  W  +H  EC
Sbjct: 26  QTIHTFHPLILHPSLSHLASVCTHCLRPGSPRACSRCHAAYYCNTSCQQAAWTAVHSKEC 85

Query: 68  QVLS--RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           + L   +L       +   +R +L+  L +++++              V AL  H  +  
Sbjct: 86  KALQQRKLGSRTGADLPTPVRALLQTLLIKEIEDG-------------VAALDGHTEERR 132

Query: 126 EKQLLLYAQIANLVNLIL--QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
           + +     ++  L       +  E ++ +  E   K+  NA    + +L  +G  L P +
Sbjct: 133 KTKSWPDLEMMALAACAFAGRQGESNVRKAVELLCKIQTNAFHRWDVDLGQVGVFLEPTL 192

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           ++ NHSC+PNAV+ F GR A++RA + +  G E+ ISY +       R++AL EQY F C
Sbjct: 193 AMANHSCVPNAVVQFVGRKAILRAERPIQAGDEIEISYTDYTMPLSARREAL-EQYSFEC 251

Query: 244 TCPRC 248
           TC RC
Sbjct: 252 TCARC 256


>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 612

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 113/234 (48%), Gaps = 16/234 (6%)

Query: 20  VPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSR--LDKEK 77
           V N S + S C    A+ N   C  C+ V YC   CQ LD  LH  +C+ +    L    
Sbjct: 115 VTNTSCAYSNCRSKKATVN---CIGCKAVDYCSKKCQDLDHSLHEKQCKAIQSQSLGNSL 171

Query: 78  RKSVTPSIRLMLKLYLR-RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
            K +  ++R +++L      L  D    S   D  S  + L    S  D+  L  +A   
Sbjct: 172 NKVLPTAVRGIIQLVSNPAALAKD----SPFMDLASHEDELRRVQSKWDDISLQAHA--- 224

Query: 137 NLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLPNAV 195
            +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC PNA 
Sbjct: 225 -VVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRPNAA 282

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
           + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 283 ITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 336


>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
          Length = 648

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 157/361 (43%), Gaps = 41/361 (11%)

Query: 11  IISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKL-DWKLHRLECQ 68
           I+ ++P+        S + C  CF     L  C  C  V +C   C+K  +   HR EC 
Sbjct: 250 ILKEKPHGATLVQEYSSTHCSTCFERVEILFCCPNCVDVVFCSGRCEKTANQSYHRYECG 309

Query: 69  VLSRLDKEKRKSVTP-SIRLML-KLY-----LRRKLQN--DNVIPSTTTDNYSLVEALVA 119
            L  L       V+  ++R++  K Y     +R +L N   N     ++D+Y  V  LV 
Sbjct: 310 FLRSLWSSGATIVSLLALRIVTQKPYSYFEAIRDELPNLTANFTDKLSSDDYRKVFNLVT 369

Query: 120 HMSDIDEKQLLLYAQIANLVNLILQWPEISINE----------IAENFSKLACNAHTICN 169
           H    D +  L++  +A ++N IL+    +  E          +  N   +  +AH +  
Sbjct: 370 HSDKRDPEDYLVWTLMAAMLNTILRHGNYTNTEQPDDGFIGYLLLHNLQIVNYSAHDVAE 429

Query: 170 -SELRP--------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 220
               RP        +G  LYP++++ NHSC P  V  F G    VR ++++  G  +  +
Sbjct: 430 LQRKRPNEAGKSVAIGAALYPLLALFNHSCDPGIVRYFSGTTVHVRTIKNIAAGQIIAEN 489

Query: 221 Y--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQESAILEGYRCKDDGCSGFL 277
           Y  + T      R+++L   Y F C C  C +  Q +  + ES I   +RC    C G L
Sbjct: 490 YGPLYTKMERTERRQSLAANYKFQCNCDACRQDWQLYTHMDESVIR--FRCTGPACQGAL 547

Query: 278 LRD--SDDKGFTCQQCG-LVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEVVSTYKM 332
           L D  S+  G  C  CG LV  +E I ++  E N+LS+   A  L + G  +  +S Y  
Sbjct: 548 LFDLTSECYGMQCDDCGELVDIRERI-RLLQEANMLSRFNEAAHLYAIGLFEHALSKYAA 606

Query: 333 I 333
           I
Sbjct: 607 I 607


>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 418

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 105/406 (25%), Positives = 187/406 (46%), Gaps = 49/406 (12%)

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           DWK H+LEC+    + K K  ++T    +++++Y   K QN      +  D++      +
Sbjct: 49  DWKQHKLECKAYKAI-KAKNFNITIDFHILMRIY---KTQNSLTNFLSNGDDFP-----I 99

Query: 119 AHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS--ELRPLG 176
                  E+ L+L   +   V  I +  E  I+  A    K A NA TI +S  E   +G
Sbjct: 100 EKQQFFTEQALMLLKSLEEEV--IPEKMEFLISAQA----KFATNALTIQDSLFETDGIG 153

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKA 234
            GLY  ++ +NHSC PN + VF  +L  VR  A++ + +G E++ SYI+T      R++ 
Sbjct: 154 AGLYEEVNYMNHSCTPNVICVF-NKLPQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRRRF 212

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL- 293
           LK+ Y F C C RCIK       +++  +   RC+   C    +   + K   C  C   
Sbjct: 213 LKQNYFFLCECKRCIK-------EQNEGVSFVRCQK--CMKGRI---NSKTLNCSDCNTQ 260

Query: 294 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
              +++E ++I +++  + K+    TS   +++ +   +  EKL   L  P+  ++  +R
Sbjct: 261 GQMTEQEYQRIITQMEEIQKQ----TSQSTYEKTIELTETAEKLC-HLEAPYFNDIYNSR 315

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF------HPLLGLQYYTCGKLEWFLGD 406
              I   + ++ +K+A  Y +  I + +  Y +        P  G +     KL + L  
Sbjct: 316 ---ILATLGVQQYKKANKYLKNFIELSKHWYEEIIGNTLEMPFFGWKLNELSKLNFHLDK 372

Query: 407 TENAIKSMTEAVEILRITHGTNS-PFMKELILKLEEAQAEASYKLS 451
              A K    A++IL   +   S P + EL L+L + ++E   K +
Sbjct: 373 CLEAEKYSERALKILTQYYTKESCPELNELFLRLNDIKSEIKMKYN 418


>gi|444714585|gb|ELW55465.1| N-lysine methyltransferase SMYD2 [Tupaia chinensis]
          Length = 494

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 99/442 (22%), Positives = 179/442 (40%), Gaps = 93/442 (21%)

Query: 56  QKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           QK DW +H+LEC  +    +    S T  +RL  ++  ++K   +     T ++    V+
Sbjct: 87  QKEDWPMHKLECSPMVVFGENWNPSET--VRLTARILAKQKTHPER----TPSEKLLAVK 140

Query: 116 ALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPE------------ISINEIAENFSK 159
              +H+  +D EK+ L+ + IA L +     L++P+            + +N  A    +
Sbjct: 141 EFESHLDKLDNEKKDLIQSDIAALRHFYSKHLEFPDNDNLVVLFAQRLLWVNGAASQ--Q 198

Query: 160 LACNAHTICNS------------------------ELRPLGTGLYPVI-----------S 184
           LA   +T+                           E +PL    +  I           +
Sbjct: 199 LAAEGNTLLQQHRGVEEHGVESSGMHTSPAAWDEQEFQPLPPTRFLTIDKFLLLSDFSVA 258

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           ++NHSC PN ++ + G LA VRAV  +  G EV  SYI+    T  R   L++ Y FTC 
Sbjct: 259 LMNHSCCPNVIVTYRGTLAEVRAVHEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 318

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----EEI 300
           C  C                    KD   +   +R  +D         +VR      EE 
Sbjct: 319 CHECT------------------TKDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIEEF 360

Query: 301 KKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
           ++      IL  S  +  L  C   QE +S+          ++   +V ++    + + +
Sbjct: 361 RRAKHYKYILYNSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAMGV 410

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAV 418
            + ++DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +     +++ +A+
Sbjct: 411 CLYMQDWEGALRYGQRIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKKAI 470

Query: 419 EILRITHGTNSPFMKELILKLE 440
            I+ + HG + P++ E+  ++E
Sbjct: 471 AIMEVAHGKDHPYISEIKQEIE 492


>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
          Length = 239

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 43/257 (16%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 61  RKQLRDQYCFECDCFRCQT----------------------------QDKDADMLTGDEQ 92

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                +E +KKI  E+    K    L  C   Q ++S+    E+L         +N+ Q 
Sbjct: 93  VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 139

Query: 352 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 140 KVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199

Query: 410 AIKSMTEAVEILRITHG 426
           A+K++  A +I+R+THG
Sbjct: 200 AMKNLRLAFDIMRVTHG 216


>gi|66818090|ref|XP_642738.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74856989|sp|Q54ZX8.1|Y7331_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0277331
 gi|60470880|gb|EAL68852.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 549

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 125/265 (47%), Gaps = 28/265 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-------NLKKCSACQVVWYCGSNCQKLDW 60
           GE I+  EPYV      + +  CD C  +         LK+CS C++V+YC ++CQ   W
Sbjct: 51  GEKIMKIEPYVWSVAKHAIV--CDECLKNKLDLEEGKTLKRCSNCKLVYYCSTDCQTKAW 108

Query: 61  KLHRLECQVLSRL--DKEKRKSVTPSIRLMLKLYLRRKLQ--------------NDNVIP 104
           K+H+ EC++LS +    +K+   T S  ++L+L+++R L+              N+N   
Sbjct: 109 KIHKQECKILSTIPSTTDKKNINTKSTTMLLRLFIKRNLELINNNNNNNNNNNNNNNNND 168

Query: 105 STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNA 164
           +  T  Y +++ L+ H  DI        +  +   +L+ + P++    + E   KL  N 
Sbjct: 169 NHITGQYEIIDGLLNH-KDIRSDNNEYKSFSSGFCSLLGEDPQLKAPIVLEYLLKLEPNC 227

Query: 165 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
            TI   E   +  GLYP++   NHSC PN  ++   +  ++   + + K  E+ I+Y   
Sbjct: 228 ITIPRCEASSI--GLYPLMLFFNHSCKPNISIINNRKELLIITNKIIEKDEELFINYSPA 285

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCI 249
                 R   LK+ + F C C  C+
Sbjct: 286 ICYRNERLDNLKQCFFFNCKCTLCL 310


>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 606

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 77/284 (27%), Positives = 127/284 (44%), Gaps = 54/284 (19%)

Query: 8   GEVIISQEPYVCVPNNSS---SISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           G +++S  P + V ++S      S C G    + LK+C+ C+   YC + CQ  DW +H+
Sbjct: 97  GTILLSVRPCISVLSSSQLGFYCSHCAGPAPETGLKRCTGCRTTRYCNAMCQNSDWAVHK 156

Query: 65  LECQVLSR-LDKEKRKSVT-PS--IRLMLKLY--LRRK------------LQN--DNVIP 104
           +ECQ L R  D    + V  PS  +R + ++   ++RK            LQ+   ++ P
Sbjct: 157 MECQALQRWADAAPSEDVAVPSDAVRCLGRMLWSMQRKGFDSQWTKEMSVLQSHRGSLPP 216

Query: 105 STTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLAC 162
           S    +  L  ++V ++       L  +      +LV LI               S+   
Sbjct: 217 SQFESHTHLAHSVVRYIGASGPSDLASFGLNSAGDLVGLI---------------SRFTT 261

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----------EGRLAVVRAVQHV 211
           N  T+ +  L P+G  + P +++ NHSC PN V+VF           + +L  +R++ H 
Sbjct: 262 NTFTLTSYTLSPVGICISPAMALTNHSCDPNVVIVFPRSSKKVTKEPQMQLIALRSILHE 321

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
               E++ +Y++       RQ ALKE Y FTC C  C   G  D
Sbjct: 322 ---EEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLCKNTGPTD 362


>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
 gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
          Length = 614

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/448 (24%), Positives = 179/448 (39%), Gaps = 59/448 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC---FASSNLK-KCSACQVVWYCGSNCQKLDWKL 62
           +GEV+++++    VP      + C  C    A +  K +CSAC    +C   C K     
Sbjct: 86  AGEVVVAEQAIAFVPRGQYRKAICHVCGVNVAENGPKFECSACNYTIFC-EECYKPPQGS 144

Query: 63  HRLECQVLSRLDKEKRKSVTPS--IRLMLKLYLRRK--------------------LQN- 99
           H   C V   +    R     +  +RL+  L L++                      QN 
Sbjct: 145 HENYCAVYREIASIARDENCDADLLRLVFNLALKKAGTAQARSSLLQLELLIETLPFQNS 204

Query: 100 -------DNVIPSTTTDNYSL----VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-E 147
                  D VI S   D   L     +A  + +  +      L+ +IA + N     P  
Sbjct: 205 KDETCTVDGVIHSCFKDALGLETHRSKAPSSWLEAVRNGCTRLH-KIACVGNGGKPAPVS 263

Query: 148 ISINEIAENFSKLACNAHTICNSELRP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
            SI E+    + +  N+H +    L+   +  GL+P +SI+NHSC PN     EG +  V
Sbjct: 264 CSIEELEMLATLINTNSHGMGAQNLQNTHVALGLFPYVSILNHSCRPNCCFASEGSVMYV 323

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
           RAVQ +PKGAE+ +SYI    S   R+  L     F CTC RC+   +  +      LEG
Sbjct: 324 RAVQDIPKGAELCLSYINLYESRRVRKTLLVATKHFDCTCDRCV---EPLNTSIDRFLEG 380

Query: 266 YRCKDDGCSGFLLRDS---DDKG----FTCQQCGLVRSKEEIKKIASEVNILSK---KTL 315
             C   GC G  LR +   +D+     + C  C  +   E ++       +++K   + +
Sbjct: 381 CVCHVRGCGGVFLRTAALHEDQASSTTWECDLCSRILDPESMRFKDPPWEVIAKAEERLV 440

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQTREKLIKILMELEDWKEALAYC 372
           A     + +      K++E    +     HP  V L      L+     + D +E L  C
Sbjct: 441 AAVRIYSERRFKEARKLLESYLSEFTGKLHPLHVFLFDALTPLMNCCRAMGDAEEGLRVC 500

Query: 373 QLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
           +L +   ++V P     L   Y+  G++
Sbjct: 501 RLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 508

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 108/234 (46%), Gaps = 22/234 (9%)

Query: 30  CDGCFASSN--------LKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKS 80
           C  CF +          LK C+ C+VV YC   CQ   W   H+ EC +        RK 
Sbjct: 58  CSNCFVTVGDEVNPGLTLKACTGCRVVKYCNERCQTESWAASHKKECSIY-------RKC 110

Query: 81  VTPSIRLMLKLYLRRKLQNDNVI-PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV 139
               + +  +  L    + DNVI       +++ V+ L  H +++ E   L   QI  ++
Sbjct: 111 HPNILPMHARAVLSMISEPDNVILKEMHRTHHTAVDTLGNHFTEMME---LAGEQIERVL 167

Query: 140 NLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE 199
                  EIS  ++      L  NA T+ N     +G  + P  S  NHSC PNA + F+
Sbjct: 168 TSAEGLKEISNTDVGLP-RLLETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFD 226

Query: 200 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 253
           G +  ++A+Q +    ++ ISYI+       RQ  LK++Y FTC CP+C++ GQ
Sbjct: 227 GPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELKKRYFFTCKCPKCVQ-GQ 279


>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
          Length = 520

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 120/254 (47%), Gaps = 38/254 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KLHR 64
           V G+VI+   P + +P+ S   + C  CF  + ++ CS C  V YC + CQ  +W  +H 
Sbjct: 25  VPGDVILPFTPRILIPSLSHINNICSHCFKQAEVRACSRCHAVSYCDAACQAANWTAVHS 84

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC+VL ++ ++ R  +   IR +++  ++ ++ N              +E L  + +  
Sbjct: 85  KECKVLRKVTEQGRPGLPTPIRAVVQALVKPEIGN-------------AIEDLEGNGTSW 131

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEI-AENFSKLA-------CNAHTICNSELRPLG 176
            + +               +W ++ +  + A  F+ L         NA    +++L  +G
Sbjct: 132 RKSE---------------KWADMEMMAMGASAFAGLGTGQEDIQTNAFHRYDTDLGQVG 176

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
             L P +++ NHSC+PNA++ F GR A++RA + +    E+ ISY +       R+ AL 
Sbjct: 177 IFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEISYTDYTFPLSKRKLALA 236

Query: 237 EQYLFTCTCPRCIK 250
             Y F C C RC K
Sbjct: 237 P-YFFDCMCLRCEK 249


>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 499

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 126/317 (39%), Gaps = 67/317 (21%)

Query: 16  PYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSR 72
           P+V V         C  C A    + LK+C  C+ V+YC   CQ  DW  H+ EC  L  
Sbjct: 5   PHVSVLATPYLDQHCSACAAPAPETGLKRCPKCKAVYYCDKACQNRDWVWHKHECSAL-- 62

Query: 73  LDKEKRKSVTPSIRLMLK----------LYLRRKLQNDN---------------VIPSTT 107
              +   +  PS  +M+           L+  +K   D+               V  S+ 
Sbjct: 63  ---QTWAASAPSPDVMIPSDAVRCLGRILWGSQKEGPDSTWAREIRMMQSNRSTVHASSV 119

Query: 108 TDNYSLVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 165
            D+  L  +LV ++      E Q    +   +LV+LI               S+   N  
Sbjct: 120 EDHTYLAHSLVRYLGVRAPAELQPFSLSSAGDLVDLI---------------SRFTTNTF 164

Query: 166 TICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGA 215
           T+ ++ L P+G  + P ++  NHSC PNAV+VF          E  L +V A++ +  G 
Sbjct: 165 TLTSASLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASKEPSLNLV-ALRDIAPGK 223

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
           E+ ISY++T      RQK L E Y FTC C  C K    D        E   C       
Sbjct: 224 EIRISYVDTTLPKDLRQKELTEVYNFTCQCKLCTKPPAVDP------RESLWCPKKCGGT 277

Query: 276 FLLRDSDDKGFTCQQCG 292
             L   +D  F C +CG
Sbjct: 278 CPLPTEEDNLFRCAKCG 294


>gi|389584498|dbj|GAB67230.1| hypothetical protein PCYB_112510 [Plasmodium cynomolgi strain B]
          Length = 501

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/443 (24%), Positives = 182/443 (41%), Gaps = 80/443 (18%)

Query: 50  YCGSNCQKLDWKLHRLECQVLSR--LDKEKRKSVTPSIRLMLKLYLRRKLQNDNV----- 102
           YC   C +L WKLHR EC+V      DK       PSI  ++++ +   L + N      
Sbjct: 73  YCSDVCMELAWKLHRDECEVFKSNIFDK-----FCPSI--IMRMVINSYLSHFNFYEYCG 125

Query: 103 ----IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS 158
               +P    + +     +VA      +K+       AN  +      E  +  I E F 
Sbjct: 126 SVTELPKEKYEYFKYPAYIVAVALMSKKKKCF-----ANFED-----NESILKNIIEKFV 175

Query: 159 KLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           K++ N+  I ++EL P G   Y  PV    NHSCL N V +F+ +   +R +  V  G E
Sbjct: 176 KISKNSLQIIDNELEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRTLVDVYPGEE 234

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC--- 273
           + ISYI+ A    TR     +QY FTC+C  C K+    + Q     E      + C   
Sbjct: 235 LTISYIDIAFDKNTRLAICMDQYFFTCSCKLC-KVNVVSECQNIFNTEFVCTHSESCKKC 293

Query: 274 ---------------------SGF-----LLRDSDDKG--FTCQQCG-----------LV 294
                                S F     +L+ S+++   + C  C            + 
Sbjct: 294 VNNMELILISELERKNCYMNKSQFKNSYPILKKSNEQQHMWKCALCKNEVHDNVIKSLME 353

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNLMQTR 352
           + KE +K+ A   ++ S+K     S  N   + S  K+  K+      YH    +L + R
Sbjct: 354 KEKETVKETAHLESLFSEKY----SYDNKSVLQSLNKIKSKIDYLTTFYHHTRYSLQKMR 409

Query: 353 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 412
            K++ I ++L+++K A       +   +  Y ++ P+ G   +  GKL  FL      + 
Sbjct: 410 AKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSEGLS 469

Query: 413 SMTEAVEILRITHGTNSPFMKEL 435
            + +A + +  T+G +SP  K+L
Sbjct: 470 LIHKAKKNIIKTYGPDSPIYKDL 492


>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
 gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
          Length = 614

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 112/448 (25%), Positives = 179/448 (39%), Gaps = 59/448 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC---FASSNLK-KCSACQVVWYCGSNCQKLDWKL 62
           +GEV+++++    VP      + C  C    A +  K +CSAC    +C   C K     
Sbjct: 86  AGEVVVAEQAIAFVPRGQYRKAICHVCGVNVAENGPKFECSACNYTIFC-EECYKPPQGS 144

Query: 63  HRLECQVLSRLDKEKRKSVTPS--IRLMLKLYLRRK--------------------LQN- 99
           H   C V   +    R     +  +RL+  L L++                      QN 
Sbjct: 145 HENYCAVYREIASIARDENCDADLLRLVFNLALKKAGTAQARSSFLQLELLIETLPFQNS 204

Query: 100 -------DNVIPSTTTDNYSL----VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-E 147
                  D VI S   D   L     +A  + +  +      L+ +IA + N     P  
Sbjct: 205 KDETCTVDGVILSCFKDALGLETHRSKAPSSWLEAVRNGCTRLH-KIACVGNGGKLAPIS 263

Query: 148 ISINEIAENFSKLACNAHTICNSELRP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
            SI E+    + +  N+H +    L+   +  GL+P +SI+NHSC PN     EG +  V
Sbjct: 264 CSIEELEMLATLINTNSHGMGAQNLQNTHVALGLFPYVSILNHSCRPNCCFASEGSVMYV 323

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
           RAVQ +PKGAE+ +SYI    S   R+  L     F CTC RC+   +  +      LEG
Sbjct: 324 RAVQDIPKGAELCLSYINLYESRRVRKTLLVATKHFDCTCDRCV---EPLNTSIDRFLEG 380

Query: 266 YRCKDDGCSGFLLRDS---DDKG----FTCQQCGLVRSKEEIKKIASEVNILSK---KTL 315
             C   GC G  LR +   +D+     + C  C  V   E ++       +++K   + +
Sbjct: 381 CVCHVRGCGGVFLRTAALHEDQASSTTWECDLCSRVLDPESMRFKDPPWEVIAKAEERLV 440

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQTREKLIKILMELEDWKEALAYC 372
           A     + +      K++E    +     HP  V L      L+     + D +E L  C
Sbjct: 441 AAVRIYSERRFKEARKLLESYLSEFTGKLHPLHVFLFDALTPLMNCCRAMGDAEEGLRVC 500

Query: 373 QLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
           +L +   ++V P     L   Y+  G++
Sbjct: 501 RLILQCMEKVLPGPSLELANFYFCLGEM 528


>gi|443926535|gb|ELU45157.1| zf-MYND domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 584

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/411 (25%), Positives = 152/411 (36%), Gaps = 83/411 (20%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           G  ++   P+V   +     S C GC     ++  L +C  C++V YC   CQK DW LH
Sbjct: 75  GATMLRLVPHVTALSTQYLTSHCSGCHEEQKSNKPLLRCQRCKIVCYCDQACQKSDWPLH 134

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
           + EC  L     EKR +                         T+  +    E  V  +  
Sbjct: 135 KFECPALV-AHAEKRGT-------------------------TSESDEGSSEGTVGTVFV 168

Query: 124 IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSK------LACNAHTICNSELRPLGT 177
             E        +  L  L+        N +   F K      +  NAHT+    L P+G 
Sbjct: 169 PSET-------VRALGRLVWLHSREKSNSVKAGFGKRYRGSIIEANAHTLSTPSLTPIGV 221

Query: 178 GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
            L PV ++INHSC+PN V+VF             ++ A+  +  G EV I +  T G   
Sbjct: 222 ALSPVAALINHSCVPNCVVVFPKASETKKVPNEMLIIAIGKILPGEEVQIPH--TLGK-F 278

Query: 230 TRQKALKEQYLFTCTCPRC-----IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
              K L+E+Y F C CP C     +K    D  +       +RC    C GFL   S   
Sbjct: 279 ADNKILQERYFFKCHCPNCEMTSLVKTPYVDSRR------AFRCASKTCEGFLPMPSMQS 332

Query: 285 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 344
               +Q                 N+ S  + AL++    Q    + K   K ++      
Sbjct: 333 QLGQKQ---------------RFNVYSAISCALSNPRRSQMQSDSEKKGSKGRRNFNSLV 377

Query: 345 SV--NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY-PQFHPLLGL 392
           S   +   T  + + +   L D + A  Y    IP+  R + P  HPLL L
Sbjct: 378 SYFRSPGLTYHQSLTLCAGLLDPERAFKYTSNLIPMVSRFFHPSAHPLLAL 428


>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
 gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
          Length = 239

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKV 141

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 412 KSMTEAVEILRITHG 426
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
          Length = 239

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKV 141

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 412 KSMTEAVEILRITHG 426
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 28/257 (10%)

Query: 8   GEVIIS-QEPYVCVPNNSSSISRCDGCF-----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           G+VI     P V +P    ++  C+ C      A   L+ C+ C+ V YCG  CQ+ +W 
Sbjct: 31  GDVIARFDNPAVVLPPGHRALEYCNHCVKKQRPAGVKLRACTGCKTVAYCGPACQRANWS 90

Query: 62  L-HRLECQVLSRLDKEKRKS----VTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
           L H+LEC+ + RL + K       V   IR   ++ LR ++              +  E 
Sbjct: 91  LVHKLECKAIQRLHEAKPAHQPDWVPTPIRAAAQVMLRPQV-------------LARFEE 137

Query: 117 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS---KLACNAHTICNSELR 173
           L  H+    +K        ++ V   L    ++   +  +F    KL  NA +       
Sbjct: 138 LEGHVEQWRKKDGTDLQLQSHGVVKCLGLDTVTFERLETSFQVLCKLQTNAFSRTEEYYE 197

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
             G  L   +++INHSC+PNA++ F GR A +RA   +  G E+ ISYI+       R  
Sbjct: 198 TGGVFLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIEISYIDQTQPRGKRHG 257

Query: 234 ALKEQYLFTCTCPRCIK 250
            L + Y F C+C +C K
Sbjct: 258 EL-DLYHFECSCYKCQK 273


>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
          Length = 548

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 72/253 (28%), Positives = 113/253 (44%), Gaps = 24/253 (9%)

Query: 8   GEVIIS-QEPYVCVPNNSSSISRCDGCF-----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           G+VI   +EP V +P    ++  C+ C          L+ C+ C+ V YCG  CQ+ +W 
Sbjct: 31  GDVIACFEEPAVVLPPGHRALEYCNHCLKKQQPGGPKLRACTGCKTVAYCGPACQRANWS 90

Query: 62  L-HRLECQVLSRLDKEKRKS----VTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
           L H+LEC+ + RL + K       V   IR   ++ LR  +             +  +E 
Sbjct: 91  LVHKLECKAIQRLHEIKPADEPAWVPTPIRAAAQVMLRPHV----------LAQFGELEG 140

Query: 117 LVAHMSDIDEKQLLLYAQ-IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
            V       E  L L +  +   + L +   E ++    +   KL  NA +         
Sbjct: 141 HVEQWRKKSETDLQLQSHGVVRCLGLDMGTFE-ALEAAFQVLCKLQTNAFSRSEEYYETG 199

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G  L   +++INHSC+PNA++ F GR A +R+   +  G+E+ ISYI+       RQ  L
Sbjct: 200 GVFLDTTLAMINHSCVPNALVQFGGRTATLRSASFIHPGSEIEISYIDQTQPKSRRQHEL 259

Query: 236 KEQYLFTCTCPRC 248
              Y F C C +C
Sbjct: 260 S-LYHFECHCSKC 271


>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
          Length = 239

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC    Q  D                      +D+D         
Sbjct: 61  RKQLRDQYCFECDCIRC----QTHD----------------------KDAD--------- 85

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
            ++   E+I K   EV    KK   L +    ++V++  + I           ++  ++ 
Sbjct: 86  -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKV 141

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
            +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    A+
Sbjct: 142 LDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201

Query: 412 KSMTEAVEILRITHG 426
           K++  A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216


>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 541

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 121/247 (48%), Gaps = 21/247 (8%)

Query: 12  ISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSAC-QVVWYCGSNCQKLDWKL-HRLECQ 68
           +  +P V  P+ + S + C  C   ++ +  C+ C + V YC   CQK++WKL H  EC+
Sbjct: 42  VFDDPLVAFPDAAGSKTTCHHCLVHNAKVFGCTGCDKAVSYCSDECQKVNWKLIHSKECK 101

Query: 69  VLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           V  ++     K+  P+ +R ++++ LR             ++ + +V  L  H      +
Sbjct: 102 VFRKVQAAVSKNWLPTPVRALVQILLR------------LSEVHEVVTGLEGHGDKFRAR 149

Query: 128 QLLL----YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
           + L         A +     +  + ++N  AE   K+  N+    +++    GT L P++
Sbjct: 150 KELWENMKLQAYAGIHYAGRKEDDANLNLAAEILCKIQTNSFDRFDADTGQSGTFLDPLL 209

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           +++NHSC+PNAV++F  R A +RA   V  G ++ ISYI+       R++ L E Y F C
Sbjct: 210 AMVNHSCIPNAVVLFWKRKAYLRAETPVKAGEDITISYIDYTKPLSFRRQDL-ELYHFQC 268

Query: 244 TCPRCIK 250
            C RC K
Sbjct: 269 GCLRCEK 275


>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
           norvegicus]
          Length = 239

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 71/257 (27%), Positives = 120/257 (46%), Gaps = 43/257 (16%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
           +K L++QY F C C RC                              +D D    T  + 
Sbjct: 61  RKQLRDQYCFECDCIRCPT----------------------------QDKDADMLTGDEQ 92

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                +E +KKI  E+    K    L  C   Q ++++    E+L         +N+ Q 
Sbjct: 93  IWKEVQESLKKI-EELKAHWKWEQVLALC---QAIINSNS--ERLP-------DINIYQL 139

Query: 352 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +  +  +   + L   +EAL Y   T+  Y+  +P  HP+ G+Q    GKL+   G    
Sbjct: 140 KVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199

Query: 410 AIKSMTEAVEILRITHG 426
           A+K++  A +I+++THG
Sbjct: 200 AMKNLRLAFDIMKVTHG 216


>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
          Length = 582

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 90/416 (21%), Positives = 172/416 (41%), Gaps = 44/416 (10%)

Query: 32  GCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKL 91
           G      L +C  C    YC   CQ   W +H+ EC+ L        K+  P++ L   L
Sbjct: 120 GPAPGEKLLRCGGCNFSMYCSKECQATAWLVHKPECKRL--------KASFPNLPLTEVL 171

Query: 92  YLRRKL-------QNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLI 142
           +L + +       +N + +       +S   +LV H +DI  DE+++  + ++   + L 
Sbjct: 172 FLSKVIDRIQFLEKNGDKLGIEAERKFS---SLVDHKTDIREDEEKMTHFEKVFTKMGLF 228

Query: 143 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 202
            +   I   E  + F K + N+H+I  +    +G  L   +S  NHSC P   +VF+G  
Sbjct: 229 RKDDMIEKGEFFDIFCKASINSHSIHTNAGNEVGMALDLGVSKYNHSCRPTCSMVFDGYR 288

Query: 203 AVVRAVQHVP-----KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
             +R +  VP        +  ISYI+   S   R++ LK ++ F C C RC+      D 
Sbjct: 289 VCLRPL--VPGVDAEDTEQAFISYIDVGRSKFVRRRDLKSRWYFNCECTRCM------DP 340

Query: 258 QESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKIASEVNILSKKTL 315
           ++ A L   RC +  C   +L    ++     C +C  +  + ++K     +        
Sbjct: 341 EDDA-LTAIRCANPSCDAPILTSETEEPMNIACDKCKTIVEENDVKAAQEYMK------- 392

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 375
           +L +  + +      + +    +++ HP +V + + R  L  +   L     +  + Q+ 
Sbjct: 393 SLPASFDPKCPADILRDLLAKAEQVLHPSNVYVARLRTALFHVTGTLTMDNLSSMHTQI- 451

Query: 376 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 431
              Y+  +P+    +G Q     K      + + A+    +A+ I  +  G + P+
Sbjct: 452 YNNYKMCFPKADRHVGFQLLHIVKALIEKDERDEAMPYAFDAMNIFEVCFGLDHPY 507


>gi|324507181|gb|ADY43048.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 614

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 186/434 (42%), Gaps = 39/434 (8%)

Query: 36  SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSI--RLMLKLYL 93
           S+NL++C  C    +C  NCQ L WK H+ EC+ L      K     P I  RL+ ++  
Sbjct: 100 SANLRRCKGCNRAMFCDQNCQTLGWKDHKAECRAL------KGSPTVPDIEVRLLGRIIT 153

Query: 94  RRK-LQNDNVIPSTT-----TDNYSLVEALVAHMSDI--DEKQLLLYAQI-ANLVNLILQ 144
           R K ++N N     T     +   S+++ + AH   I  DE  +  +  + + LV     
Sbjct: 154 RFKAIKNGNDKKDATFYLQRSSRRSIMD-IWAHTDQIRNDEFAMKKFNDVYSRLVTFYGT 212

Query: 145 WPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
              ++  E  E   +   N H I +   L  +G GLY  +   +HSC PN +   +G +A
Sbjct: 213 KALLTKEEAFELHCRDYINRHAISDDGYLEEIGKGLYLDLCAYDHSCRPNTIYTCKGFVA 272

Query: 204 VVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
            +RA+    ++        SYI+   +T  R+K L++ + F C C RC+     D +  S
Sbjct: 273 TLRALDLTVNLMDRTTTFYSYIDLLSTTQERKKLLRDTWYFDCQCVRCV--DNDDHVLSS 330

Query: 261 AILEGYRCKDDGCSGF---LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 317
                 R K +    F     ++   +  TC +C +   KE + +    +  + +    +
Sbjct: 331 MFCPSCREKPERLCIFGESPYKNPVTQILTCPKCHIEVPKERVLEAVDAMRFIDR----I 386

Query: 318 TSCGNHQEVVS--TYKMIEKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQL 374
            +    Q++     ++ ++ ++ +      SVN+     K+I++L+ L D  ++     L
Sbjct: 387 VAKNEIQQMAKEQAFRFLKGIKDRFSKILPSVNVYMC--KIIQLLIPLIDPTDSKTLLSL 444

Query: 375 TIPVYQRV---YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 431
            +   + V   +P  HP + +   + G     +G    A   +  A +IL+ T G   P 
Sbjct: 445 HLESEECVRFCFPPNHPAVAVHLSSIGLFYLRIGHPHRAELYLEMACDILKFTLGHTHPL 504

Query: 432 MKELILKLEEAQAE 445
             + +  LEEA  E
Sbjct: 505 TVDKLSILEEAHRE 518


>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 361

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 73/281 (25%), Positives = 128/281 (45%), Gaps = 31/281 (11%)

Query: 164 AHTIC--NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           +HT+   + EL  LGT +YP +++INHSCLP+ ++ + G  A VRAV+ +  G EVLISY
Sbjct: 109 SHTLFSEDEELSHLGTAVYPDVALINHSCLPSVIVTYNGTSADVRAVRDMNPGDEVLISY 168

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 281
           I+    T  R   L+E Y FTC C  C                G +  D   +   LR  
Sbjct: 169 IDVLYPTEDRNTRLRESYYFTCQCQEC----------------GSQSSDQ--AKLKLRKR 210

Query: 282 DDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
            D         +VR +++ I++     N  +  +  L  C    + +            +
Sbjct: 211 SDPAEAEVINTMVRYARKCIREFRVFKNSNTPASTLLEMCEQSLDEMGA----------V 260

Query: 341 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
           +   +V ++    + + + + ++D   AL Y +     Y+++YP +   L   Y   G+L
Sbjct: 261 FDESNVYMLHMMYQAMGVCLYMQDPDGALRYGEKIAKYYRKLYPAYSLNLSSLYLKLGRL 320

Query: 401 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
            + +      I  + +A  I+ +THG N  ++ EL  +++E
Sbjct: 321 YFGMERNSECISVLKKAKAIMEVTHGKNHFYVTELDRQMKE 361


>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 459

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 109/242 (45%), Gaps = 33/242 (13%)

Query: 29  RCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIR 86
           RCD CF  ++ L++CS C   WYCG +CQ   W  +H+  C+ +S          +P I 
Sbjct: 63  RCDACFRKAHTLRRCSGCVAYWYCGKDCQTRHWNVIHKYMCKNISNFCISSDFQASP-IH 121

Query: 87  LMLKLYL----------RRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA 136
                +L          R ++  +N+         + V  L    + + +  L     I 
Sbjct: 122 ARTDAFLLSHLVAEQSERLEISGENLREGFNDPLATFVSLLPLSAASVPDTPL-----IC 176

Query: 137 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAV 195
            L    +  P + I EI   FS    N + I +S L  +G G++P+ S + NHSCLPNA+
Sbjct: 177 PLSRSSIASPAV-IKEIFSRFS----NNNFIVHSHLSAVGHGIFPLASRLFNHSCLPNAI 231

Query: 196 LVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC-----P 246
           + +    EG   VV+A+  + +G E+ I Y + A     RQ   +  Y F CTC     P
Sbjct: 232 VTYSFTSEGIQMVVKALTPIKQGEEITIPYFDPALPYHQRQAICRYSYGFECTCSVCMFP 291

Query: 247 RC 248
           RC
Sbjct: 292 RC 293


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 114/245 (46%), Gaps = 15/245 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWK-LHRLE 66
           G VI S  P + +P  +   S C  C      + CS C    YC + CQ   WK +H  E
Sbjct: 28  GAVIFSFTPLLLLPTVACLTSVCSHCLRPGEPRLCSRCHAAAYCDATCQAAAWKAVHSRE 87

Query: 67  CQVLSRL--DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA-LVAHMSD 123
           C+ L++   D+E+R+ +    R +++  L  K++          D  + +E  ++   ++
Sbjct: 88  CKALTKTIRDEERRRMLPTPTRALIQALLWGKIR----------DGLADLEGHVLEKRAE 137

Query: 124 IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
            DE + +    +A      L   +  +   AE   K+  N+    +SEL   G  L P +
Sbjct: 138 GDEWRDIEMMAVAGCAFSGLGRGDQDVRMAAEMLCKIQNNSFQRFDSELGVAGLYLEPTL 197

Query: 184 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           ++ NHSC+PNA + F GR A++ A   +  G E+ I+Y         R++AL   Y FTC
Sbjct: 198 AMANHSCIPNASVQFIGRNALLIAENPIRAGDEIEIAYTFYTDPLPKRREALA-HYKFTC 256

Query: 244 TCPRC 248
            C RC
Sbjct: 257 QCLRC 261


>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 652

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 138/323 (42%), Gaps = 61/323 (18%)

Query: 7   SGEVIISQEPYVCV---PNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +GE+I+S  P+V V   PN     S C    A++ LK+C  C+ V YC S+CQ  DW  H
Sbjct: 139 AGEIILSIIPHVFVLSTPNLEYYCSACAAPAATAGLKRCPKCKTVRYCNSDCQNRDWAWH 198

Query: 64  RLECQVLSRLDKEKRKS--VTPS--IRLMLKLY---------------LRRKLQNDNVI- 103
           R EC  L +       S    P   +R + ++                +R    N N + 
Sbjct: 199 RRECNALQKWAASAPSSDVAIPGEPVRCLGRIMWGSQKEGLDSVWAQEIRMMHSNRNSLQ 258

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLA 161
           PS    +  L  ++V ++     ++L  Y      +LV+LI               S+  
Sbjct: 259 PSAFGSHTHLAHSVVRYLGVSSPQELEPYGLKSAGDLVDLI---------------SRFT 303

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHV 211
            N  T+    L P+G  + P +++ NHSC PNA LVF          E  L++V A+++V
Sbjct: 304 TNTFTLTTPALTPIGICVSPTVALANHSCDPNAALVFPRADGGSRVKEPLLSLV-ALRNV 362

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 271
             G E+ ISY++T      RQ  LKE Y F+C C  C +    D        E   C   
Sbjct: 363 APGKEIRISYVDTTLPNRLRQNELKEVYSFSCQCKLCCRTTAADP------REALWCPKS 416

Query: 272 GCSG---FLLRDSDDKGFTCQQC 291
            C G   +   +S D+   C +C
Sbjct: 417 -CGGICPYPTEESGDRATHCVKC 438


>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
          Length = 530

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/250 (25%), Positives = 118/250 (47%), Gaps = 24/250 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KLHRLE 66
           G+VI+   P + +P+ S   + C  CF  + ++ CS C  V YC + CQ  +W  +H  E
Sbjct: 27  GDVILPFTPTILIPSLSHINTICSHCFKQAEVRACSRCHAVSYCDAACQAANWTAVHSKE 86

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN------DNVIPSTTTDNYSLVEALVAH 120
           C+VL ++ ++ R  +   IR +++  ++ ++ N       N      ++ ++ +E +   
Sbjct: 87  CKVLRKVTEQGRPGLPTPIRAVVQALVKPEIGNAIEDLEGNAASWRKSEKWADMEMMAMG 146

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
            S         +A +           +  + +      K+  NA    +++L  +G  L 
Sbjct: 147 AS--------AFAGLGT--------GQEDVQKALTILCKIQTNAFHRYDTDLGQVGIFLG 190

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
             +++ NHSC+PNA++ F GR A++RA + +    E+ ISY +       R+ AL   Y 
Sbjct: 191 TKLAMANHSCIPNAMVQFIGRRAILRAEKPIKIDDEIEISYTDYTFPLSNRKLALAP-YF 249

Query: 241 FTCTCPRCIK 250
           F C C RC K
Sbjct: 250 FDCMCLRCEK 259


>gi|32564211|ref|NP_871849.1| Protein SET-18, isoform b [Caenorhabditis elegans]
 gi|25005018|emb|CAD56600.1| Protein SET-18, isoform b [Caenorhabditis elegans]
          Length = 507

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 102/451 (22%), Positives = 180/451 (39%), Gaps = 52/451 (11%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           +   + I + P+     N      C  C         L +C  C    YC   CQ   W 
Sbjct: 25  IHARIKIFETPFATQVLNPKVSEFCANCLRGPAPGEKLLRCGGCNFSMYCSKECQATAWL 84

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-------QNDNVIPSTTTDNYSLV 114
           +H+ EC+ L        K+  P++ L   L+L + +       +N + +       +S  
Sbjct: 85  VHKPECKRL--------KASFPNLPLTEVLFLSKVIDRIQFLEKNGDKLGIEAERKFS-- 134

Query: 115 EALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +LV H  DI  DE+++  + +I   +        I   E  + F K   N+H+I  +  
Sbjct: 135 -SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAG 193

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGS 227
             +G  L   +S  NHSC P   +VF+G    +R +  VP        E  ISYI+   S
Sbjct: 194 NEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRS 251

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--G 285
              R++ L  ++ F C C RC+      D ++ A L   RC +  C   +L    ++   
Sbjct: 252 KYIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMN 304

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALT---SCGNHQEVVSTYKMIEKLQK--KL 340
             C++C  +  ++ +K     +     KTL  +    C    E +   K+ E L K  ++
Sbjct: 305 IACEKCKTIVEEDTVKAAQEYM-----KTLPASFDPKCPAEIEALPG-KLKELLAKAEQI 358

Query: 341 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
            HP +V + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K 
Sbjct: 359 LHPSNVYVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKA 417

Query: 401 EWFLGDTENAIKSMTEAVEILRITHGTNSPF 431
                + + A+    +A+ I  +  G + P+
Sbjct: 418 LIEKDERDEAMPYAFDAMNIFEVCFGLDHPY 448


>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 564

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 115/250 (46%), Gaps = 42/250 (16%)

Query: 20  VPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEK 77
           VP   +  + C  C A+  + ++C+AC+   YC + CQK  W  +H  EC  L     E 
Sbjct: 59  VPPMVARTTICSSCLATPPSPRRCTACKATAYCDAACQKSHWSSVHARECAALQAAGGEG 118

Query: 78  RKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIAN 137
             +V   +R+ +++ L               + ++ VEA +    D  E +   +A +  
Sbjct: 119 --AVPAYVRMAMQVLL-------------WPEKFAAVEAQLDGNVDRFEGRAEAWAGMEV 163

Query: 138 LVNLILQWP-----------------EISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
              ++  W                  E++   + E   K+  NA T   +     G GL+
Sbjct: 164 PAKVVGGWVAAGAKGKGKGKGKEKAVEVTERRVVELLCKIKTNAFTRSEA-----GEGLF 218

Query: 181 --PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
               ++++NHSC+PNA++ F GR A +RA++ + +G E+ ISYI+   S   R+KAL E 
Sbjct: 219 LDTTLAMVNHSCVPNAIVAFSGRRAFLRALRDIKEGEEIEISYIDCTQSLEHRRKAL-EL 277

Query: 239 YLFTCTCPRC 248
           Y F C C RC
Sbjct: 278 YFFQCVCMRC 287


>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
 gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
          Length = 573

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 70/275 (25%), Positives = 124/275 (45%), Gaps = 31/275 (11%)

Query: 3   SFHVSGEVIISQEPYVCVPNNSSSISRCDGCFASSN--------------LKKCSACQVV 48
           +F     +     P + +P+ ++  + C+ C  ++                K C+AC+  
Sbjct: 28  AFSPGSTIATFSNPLLALPDGATMRTTCNYCLRTTAPSPSPSQSAATPPAFKACTACKAA 87

Query: 49  WYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYL-----------RR 95
            YCG  CQ+  WK  H+ EC++ +R+ ++  K   P+ +R + ++ L           R 
Sbjct: 88  VYCGPACQRAHWKAAHKAECKMFARVREQAGKDWLPTPVRAVAQVLLTLQQGKSGGGGRE 147

Query: 96  KLQNDNVIPSTTTDNYSLVEALVAHMSDIDE--KQLLLYAQIANLVNLILQWPEISINEI 153
                  I S   +    +E  V       E  K + L A  A +   +LQ  E+ + + 
Sbjct: 148 AEMRRAFIGSADGEEEDGLEGNVEGFKKDGEVWKDMELQATAAVVYAGLLQGEEV-LEKA 206

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
            E   K+  NA    +++    G  L   ++++NHSC+PNA + F+ R A++RA + + +
Sbjct: 207 REILCKIQTNAFNRLDADTGMAGIFLDVGLAMVNHSCVPNAFIGFDKRTAILRAERPIQE 266

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           G E+ ISYI+       R +AL+  Y F C C RC
Sbjct: 267 GEEITISYIDNTLPKAARYEALR-LYHFQCDCVRC 300


>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
           1558]
          Length = 660

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 129/316 (40%), Gaps = 63/316 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-------------SNLKKCSACQVVWYCGSN 54
           G V++++ P+V V +      RC  C  S             S L++C  C+ V YC S+
Sbjct: 174 GSVVLTRPPFVSVLSTPQLPERCSYCLLSPLELSISRRSLDPSPLERCEVCETVAYCSSS 233

Query: 55  CQKLDWKLHRLECQVLSRLDKE--KRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNY 111
           C   D  LH  EC+ L R+ K+     SV P  IR + +L      +      S T+D +
Sbjct: 234 CATKDRSLHDYECRAL-RMSKQVADAASVPPEHIRALARLAWSFHTER-----SGTSDKH 287

Query: 112 SLVEA---LVAHMSDIDEKQLLLYAQI-------------------ANLVNLILQWPEI- 148
           S + A   L  H+    +    L  QI                     L N   Q P + 
Sbjct: 288 SRINAMKVLEDHLKTTFQGDKELEKQIEEHRAQCLRISKAVMMFIAMGLPNFDPQNPSLD 347

Query: 149 -------SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EG 200
                  S   I +  S    N  +  +  L  +G  L P +++ NHSC PNAV+VF EG
Sbjct: 348 LTSIFNKSTGSILQVISSFMINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEG 407

Query: 201 ----RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQ 253
               R+  ++A++    G EVL  Y++ A     RQ  L+  Y F C CP C    K G 
Sbjct: 408 ADSMRIVAIKAIE---AGEEVLTHYVDLALPYAQRQAELRRTYHFECKCPTCDNRTKDGD 464

Query: 254 FDDIQESAILEGYRCK 269
              +   +IL    CK
Sbjct: 465 KPVLDPRSILIHRECK 480


>gi|299738408|ref|XP_001838336.2| hypothetical protein CC1G_04780 [Coprinopsis cinerea okayama7#130]
 gi|298403294|gb|EAU83524.2| hypothetical protein CC1G_04780 [Coprinopsis cinerea okayama7#130]
          Length = 604

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/247 (26%), Positives = 102/247 (41%), Gaps = 69/247 (27%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA---SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           G V++S +P++   + S   S C  CF    S+ LK+C  C++V YCGS CQ  DW LH+
Sbjct: 84  GAVLVSTKPHIATLSTSQLSSYCSACFGPGTSAPLKRCPNCKIVMYCGSACQSRDWSLHK 143

Query: 65  LECQVLSRLDKEKRKSVTP--------------------SIRLMLKLYLRRK-------- 96
           LEC  L R   + R    P                    +IR + ++  R++        
Sbjct: 144 LECSALQRWMSQPRPQPPPGSSSEPQVVSSGETRAPPSDAIRTLARILWRKQKVGLTSTW 203

Query: 97  -------------LQNDNVIPSTTTDNYS------------LVEALVAHMSDIDEKQLLL 131
                        L    V  S T+++ S            L   L+ +M     ++L  
Sbjct: 204 AKEIDLLQSHRASLSKPTVSQSPTSNDASSNITKAAELHTHLSHGLIHYMGLTSPQELEP 263

Query: 132 YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 191
           Y      +N        S  ++ +  S+   N  T+ +  L PLG  + PV ++ NHSC 
Sbjct: 264 YG-----IN--------SAGDLVDLLSRFTTNTFTLTSPSLTPLGACISPVAALFNHSCD 310

Query: 192 PNAVLVF 198
           PNAV+VF
Sbjct: 311 PNAVIVF 317


>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 677

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 156/383 (40%), Gaps = 55/383 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL---KKCSACQVVWYCGSNCQKLDWKLHR 64
           GE++  ++PY         +  C  CF    +     C  C  V YC  +C+  D K+H 
Sbjct: 264 GEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPIPAVCQTCSCVAYCSISCRDKDAKIHE 323

Query: 65  LECQVLSRLDKEK---------RKSVTPSIRLMLKLYLRRKLQNDNVIPSTT----TDNY 111
            EC +L  L   K         R  V  S   + KL   ++   D    S +    ++++
Sbjct: 324 NECSILPTLWASKTSINCFLALRIIVQQSFEKLYKLKDVKENSKDKFEVSASEPYRSNDF 383

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIAN-LVNLILQWP--------------EISINE---- 152
            ++  LV H      + L     IA+ L+ L+ + P              ++S  E    
Sbjct: 384 KIMFRLVTHEDTRTVEDLFHRTYIASWLLRLLKKGPYFPKHVKTPDTIEAKLSDGELYIG 443

Query: 153 --IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
             I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F
Sbjct: 444 GLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYF 503

Query: 199 EGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
            G   VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +
Sbjct: 504 IGTTMVVRAIRSISSGEEISENYGQIFTTTPESERKRKLRLQYFFDCNCEACREHWPLLE 563

Query: 257 IQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 313
             +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K 
Sbjct: 564 EIDPTILR-FKCETGKECGNVLPVRTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKI 622

Query: 314 TLALTSCGNHQEVVSTYKMIEKL 336
                  GNH+E +  Y  I KL
Sbjct: 623 ASRYLEQGNHREALKNYLKILKL 645


>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
 gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
          Length = 650

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/379 (26%), Positives = 161/379 (42%), Gaps = 52/379 (13%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHR-LEC 67
           V++ + P+  V  +  S S CD C +    L  C  C  V YC   CQK     H   EC
Sbjct: 253 VLLKELPHASVVMSECSGSHCDQCCSRVEVLFSCPNCVDVVYCSEECQKQAISGHHAFEC 312

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRR-------------KLQNDNVIPSTTTDNYSLV 114
             LS L       V+    L L++  ++              LQND  + S   D+Y  V
Sbjct: 313 GFLSFLRNSGANVVS---MLALRIVSQKSEKYFYELQDELDNLQND-FVDSLFFDDYRKV 368

Query: 115 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---------IAENFSKLACNAH 165
              V H    + +  L +  ++ L+N +L     SI+          +  N   +  N+H
Sbjct: 369 YNFVTHGQQRNAEDYLKWTVMSVLLNTVLVAAGFSISGSLKGFLGKILLHNLQIVTYNSH 428

Query: 166 TICNSEL---RP--------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
            I  SEL   +P        +G GLYP + + NHSC P     F G    +R ++++P G
Sbjct: 429 EI--SELQRKKPKDSGFSVCIGAGLYPTLVLFNHSCDPGITRYFVGNAVYIRTIKNIPAG 486

Query: 215 AEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRC-KD 270
           + V  +Y  + T      R+K L E Y F C C  C +   +  + E + ++  ++C  +
Sbjct: 487 SMVAENYGQLYTRAGRRERRKLLAENYKFDCCCQACEE--DWPSLHEMNPMIRRFKCGAN 544

Query: 271 DGCSGFLLR--DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA--LTSCGNHQEV 326
           +GC   LL   +S +    C +CG +       +   +V+ L++   A  L S G+ +  
Sbjct: 545 EGCGNELLFKLNSTENEMECNKCGGLTEVNASFETLKQVDFLNRYNDAARLYSQGDFERA 604

Query: 327 VSTY-KMIEKLQKKLYHPF 344
           +S Y  +I  L + L  P+
Sbjct: 605 LSKYAALINSLDEILVPPY 623


>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
           magnipapillata]
          Length = 1044

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/254 (27%), Positives = 126/254 (49%), Gaps = 30/254 (11%)

Query: 7   SGEVIISQEPY--VCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDW-KL 62
           +GEVII ++P+  +C+P   ++   C  C     +   C  C  V YC  +C+K  W K 
Sbjct: 250 AGEVIIIEKPFASLCLPECYNT--HCYHCLTRFKINYPCRLCSTVNYCSISCEKESWEKF 307

Query: 63  HRLECQVLSRLDKEKRK------SVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
           H  EC+ L  L  +          +  ++ + + L  +     D++ P ++TD Y+ + +
Sbjct: 308 HCFECEYLGVLINDDVGLAHLAFKIITNVGISMLLSFKENNSFDDLKPYSSTD-YNSIFS 366

Query: 117 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 176
           L+ ++  +           A+L+  I   P  + +E++E    L   A    +SEL+ +G
Sbjct: 367 LIENLKIV----------CAHLLKHIQMLP-CNAHEVSE----LQLKASNYKDSELKEIG 411

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKA 234
           + +Y  +S++NHSC P+ V    G   V+RA++H+ +G+E++ +Y  +    S + RQ  
Sbjct: 412 SAVYATLSLLNHSCDPSVVRHCYGDTCVLRAIKHIKEGSEIVDNYGFLYAVESKVIRQSH 471

Query: 235 LKEQYLFTCTCPRC 248
           L EQY F C C  C
Sbjct: 472 LMEQYYFACQCEAC 485


>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 678

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 156/383 (40%), Gaps = 55/383 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL---KKCSACQVVWYCGSNCQKLDWKLHR 64
           GE++  ++PY         +  C  CF    +     C  C  V YC  +C+  D K+H 
Sbjct: 265 GEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPVPAVCQTCSCVAYCSISCRNKDAKIHE 324

Query: 65  LECQVLSRLDKEK---------RKSVTPSIRLMLKLYLRRKLQNDNVIPSTT----TDNY 111
            EC +L  L   K         R  V  S   + KL   +    D    S +    +D++
Sbjct: 325 NECPILPTLWLSKTSINCFLALRIIVQQSFDKLYKLKDLKTNSKDKFEVSASEPYRSDDF 384

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIAN-LVNLILQWP--------------EISINE---- 152
            ++  LV H      + L     IA+ L+ L+ + P              ++S  E    
Sbjct: 385 KIMFRLVTHEDKRTIEDLFHRTYIASWLLRLLKKGPYFPKHVKTPDTVEAKLSDGELYIG 444

Query: 153 --IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
             I  N   +  NAH I             N++ + +G GLYP IS+ NHSC P  +  F
Sbjct: 445 GLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYF 504

Query: 199 EGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
            G   VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C +     +
Sbjct: 505 IGTTMVVRAIRSISSGEEISENYGQIFTTTPEKERKRKLRLQYFFDCNCEACREHWPLLE 564

Query: 257 IQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 313
             +  IL  ++C+     G +L    DS++    C +CG   +  +  K   E + + K 
Sbjct: 565 EIDPTILR-FKCETGKECGNVLPVKTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKI 623

Query: 314 TLALTSCGNHQEVVSTYKMIEKL 336
                  GNH+E + +Y  I KL
Sbjct: 624 ASRNLEQGNHREALKSYLKILKL 646


>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
 gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
          Length = 562

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 103/213 (48%), Gaps = 19/213 (8%)

Query: 37  SNLKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRR 95
           + L+ C+AC+   YCG+ CQ+  W+ +HR EC++ SR+ +   K   P+           
Sbjct: 95  TPLRACTACKAAVYCGAACQRAHWRAVHRAECKMFSRVRESAGKEWLPTPAFGGGGGAWA 154

Query: 96  KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAE 155
              N         + +   EAL     ++      +YA +           E ++ +  E
Sbjct: 155 LEGN--------VEGFRADEALWRDF-ELQAAAAAVYAGLLE--------SEETLEKARE 197

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
              K+  NA    +++    G  L   ++++NHSC+PNA + F+ R AV+RA + + +G 
Sbjct: 198 VLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGFDRRTAVLRAERPIQEGE 257

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           E+ ISYI+ A     RQ+AL+  Y F C CPRC
Sbjct: 258 EITISYIDNALPKSARQEALR-LYHFRCDCPRC 289


>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 669

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 131/309 (42%), Gaps = 51/309 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+V+I + PY  V     ++S C  C+        C  C  + YC   C+   W++ H L
Sbjct: 265 GDVLIVENPYSSVGLQPCNVSHCHHCYIRVLASIPCLQCAGIVYCSKECRNASWEMYHNL 324

Query: 66  ECQVLSRLDK----EKRKSVTPSIRLMLKLYLRRKLQNDNV-IPST-----TTD-----N 110
           EC  L  + +        ++   IR  L   L+ + Q+ NV IP       T D     N
Sbjct: 325 ECHHLDLIQELGLGMGHLALRTIIRTGLAFLLKFREQSANVNIPDESFHGCTVDGEYESN 384

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLI--LQWPEISINEIAENFSKLA------- 161
           Y  V  LV H  D   K   L+ ++   V L+  LQ      +  A+N   +A       
Sbjct: 385 YYSVYNLVGHSED--RKPGDLFKRVVKAVCLLRCLQQTNFFQSVGADNEEDVAIFIGGHM 442

Query: 162 --------CNAHTICNSELR----------PLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
                   CNAH I   EL            +G+GLYP +S++NHSC P       G   
Sbjct: 443 LTHLQTIPCNAHEISEYELWRSDITKCHFVEVGSGLYPTMSLVNHSCDPVVTRNCYGETC 502

Query: 204 VVRAVQHVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQES 260
           VVRA++++ KG E+     Y+        RQ  LK QY F C C  C+     + DI++ 
Sbjct: 503 VVRAIRNIYKGEEITDNYGYLYPVHDKSERQTRLKWQYFFECKCDACVHDWPLYPDIED- 561

Query: 261 AILEGYRCK 269
            +L  Y+C+
Sbjct: 562 -LLPKYKCE 569


>gi|17509133|ref|NP_492772.1| Protein SET-18, isoform a [Caenorhabditis elegans]
 gi|3924878|emb|CAB03382.1| Protein SET-18, isoform a [Caenorhabditis elegans]
          Length = 501

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/446 (21%), Positives = 174/446 (39%), Gaps = 48/446 (10%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           +   + I + P+     N      C  C         L +C  C    YC   CQ   W 
Sbjct: 25  IHARIKIFETPFATQVLNPKVSEFCANCLRGPAPGEKLLRCGGCNFSMYCSKECQATAWL 84

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-------QNDNVIPSTTTDNYSLV 114
           +H+ EC+ L        K+  P++ L   L+L + +       +N + +       +S  
Sbjct: 85  VHKPECKRL--------KASFPNLPLTEVLFLSKVIDRIQFLEKNGDKLGIEAERKFS-- 134

Query: 115 EALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +LV H  DI  DE+++  + +I   +        I   E  + F K   N+H+I  +  
Sbjct: 135 -SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAG 193

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGS 227
             +G  L   +S  NHSC P   +VF+G    +R +  VP        E  ISYI+   S
Sbjct: 194 NEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRS 251

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--G 285
              R++ L  ++ F C C RC+      D ++ A L   RC +  C   +L    ++   
Sbjct: 252 KYIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMN 304

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
             C++C  +  ++ +K     +  L             +E+++  + I        HP +
Sbjct: 305 IACEKCKTIVEEDTVKAAQEYMKTLPASFDPKCPAEILKELLAKAEQI-------LHPSN 357

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           V + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K      
Sbjct: 358 VYVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKD 416

Query: 406 DTENAIKSMTEAVEILRITHGTNSPF 431
           + + A+    +A+ I  +  G + P+
Sbjct: 417 ERDEAMPYAFDAMNIFEVCFGLDHPY 442


>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
           Y34]
 gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
           P131]
          Length = 542

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 123/263 (46%), Gaps = 48/263 (18%)

Query: 8   GEVIIS-QEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQ-VVWYCGSNCQKLDWKL-H 63
           G++I     P    P+ + + + C  C   ++ +  C  C+  V YC S CQ+ +WKL H
Sbjct: 40  GDLIAEFDNPLAAFPDAARASTTCHHCLDQNAKVFGCMGCEKAVKYCSSECQRANWKLVH 99

Query: 64  RLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
             EC+V  ++     K   P+ +R +++L +R                ++ V+ LV+ + 
Sbjct: 100 SKECKVFRKVQTAVGKDWLPTPVRTLVQLLVR----------------WAEVQQLVSQLE 143

Query: 123 DIDEK-----------QLLLYAQI------ANLVNLILQWPEISINEIAENFSKLACNAH 165
             +++           +L  YA I       +  NL L           +   K+  N+ 
Sbjct: 144 GNEDRFKERKQLWEDMKLQAYAGIHYAGRKEDDANLFLS---------LDVLCKIQTNSF 194

Query: 166 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 225
              +++    GT L PV+++ NHSC+PNAV++F  R A +RA   + +G+E+ ISYI+  
Sbjct: 195 DRFDADTGASGTFLDPVLAMANHSCVPNAVVLFWRRKAYLRAEMPIKQGSEISISYIDYT 254

Query: 226 GSTMTRQKALKEQYLFTCTCPRC 248
                RQ+ L   Y FTC CPRC
Sbjct: 255 KPVRFRQEDL-WLYHFTCKCPRC 276


>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
          Length = 527

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/274 (25%), Positives = 120/274 (43%), Gaps = 34/274 (12%)

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           ++++NHSC PN ++ + G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FT
Sbjct: 270 VALMNHSCCPNVIVTYRGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 329

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 298
           C C  C   G     Q+ A LE  R   D      +RD            +VR      E
Sbjct: 330 CECRECSTKG-----QDQAKLE-IRKLSDPPRAEAVRD------------MVRYARGVIE 371

Query: 299 EIKKIASEVNILSKKTLALTSCGNH------------QEVVSTYKMIEKLQKKLYHPFSV 346
           E ++      IL            H             E++   ++ ++    ++   +V
Sbjct: 372 EFRRAKHYKYILGCSVQGPMPRSPHCEGREAGPGSPPSELLEICELSQEQMSSVFEDSNV 431

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            ++    + + + + L+DW+ AL Y Q  I  Y + YP +   +   +   G+L   L +
Sbjct: 432 YMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLEN 491

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
                K++ +A+ I+ + HG + P++ E+  ++E
Sbjct: 492 RAAGEKALKKALAIMEVAHGKDHPYISEIRREIE 525



 Score = 45.4 bits (106), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 12/130 (9%)

Query: 56  QKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           +K DW +H+LEC  +         S T  +RL  ++  ++K+  +     T ++    V+
Sbjct: 75  KKEDWSMHKLECSPMVVFGGNWTPSET--VRLTARILAKQKIHPER----TPSEKLLAVK 128

Query: 116 ALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSE 171
              +H+  +D EK+ L+ + IA L       L  P+   + +   F+++ CN  TI + E
Sbjct: 129 EFESHLDKLDNEKRDLIQSDIATLHQFYSKHLGLPDH--DSLVVLFAQVNCNGFTIEDEE 186

Query: 172 LRPLGTGLYP 181
           L  LG+ ++P
Sbjct: 187 LSHLGSAIFP 196


>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 765

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 145 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
           WP     E+A+  ++LA N HT+C+ ELRPLGT LYP  ++ NHSC P+ V VF GR   
Sbjct: 94  WP--GFKEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVFRGRTLQ 151

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           +RA++ +  G EV + Y+E A +   R++AL + Y F
Sbjct: 152 LRALRPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188


>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 353

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 138/326 (42%), Gaps = 40/326 (12%)

Query: 150 INEIAENFSKLACNAHTICNSELRPLGTGLYPV-ISIINHSCLPNAVLVFEGRLAVVRAV 208
           +  + E F K++ N   I ++EL P G G+Y   I   NHSCL N + +F+ +   +R +
Sbjct: 19  LKNVIEKFIKVSKNTLQIIDNELEPCGLGIYKKPIPYFNHSCLSNCITIFKNQRLYIRTL 78

Query: 209 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP---------------------- 246
             +  G E+ ISY++ A     R     +QY FTCTC                       
Sbjct: 79  MDIYPGEELTISYLDVAFDRNARLAICADQYFFTCTCKLCKVNIASECHNIFNNDFICTQ 138

Query: 247 --RCIKLGQFDDIQESAILE---GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 297
              C K   + +I   + LE    Y  K    +  +L+ S +K  T  +C L +++    
Sbjct: 139 SENCKKFINYMEIVLMSELERKISYLNKSHFKTFPILKKSTEKNETVWKCMLCKNETNDS 198

Query: 298 ------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLM 349
                 E+ K+   EV  L        S  N   + S  K+  K+      YH    +L 
Sbjct: 199 VIKALIEKEKETIKEVEYLDTLFAEKYSYDNKNVLQSLTKIKSKIDDLTDFYHHSKYSLQ 258

Query: 350 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           + R K++ + ++L+D+K A +     +   +  Y ++ P+ G   +  GKL  FL     
Sbjct: 259 KMRAKILYVSIQLQDFKLAYSIATQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSA 318

Query: 410 AIKSMTEAVEILRITHGTNSPFMKEL 435
            +  + +A + +  T+G  SP  K+L
Sbjct: 319 GLSLIHKAKKNIIKTYGPESPIYKDL 344


>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
 gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
          Length = 400

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 140/337 (41%), Gaps = 68/337 (20%)

Query: 42  CSACQVVWYCGSNCQKLDWKLH-RLECQVLSRLDKEKRKS--VTPS----IRLMLKL--- 91
           C+ C  +WYC   C+K    +H   EC+   ++   K     + P     +R+M+ +   
Sbjct: 74  CNVCNEIWYCSEFCKKESQPIHAHYECKFFKKIKAPKLSEWEIDPDTFTEVRMMVGVISR 133

Query: 92  ---------------YLR----RKLQNDNVIPSTTT--DNYSLVEALVAHMSDIDEKQLL 130
                          YL+    ++L N   + S  T  D + LVE  +   S+   K+L+
Sbjct: 134 FYQERVLNKKFNLSNYLKEQQEKRLLNQEELSSEDTLDDIFDLVENTIDDGSNKAAKELI 193

Query: 131 --LYAQIANLVNLI--------LQWPEIS-------INEIAENFSKLACNAHTICNSELR 173
             +   IANL NL+        L  PE+        I+ I E   K+ CN   I     +
Sbjct: 194 DIITDYIANLFNLVITSSDNHDLPTPELKKEAVEKVISVIRELIHKVRCNQFGIWTKNDK 253

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
            +   + P  S  NHSC+PN + + +G     +A+  V KG  + ISY++      +R++
Sbjct: 254 CIAVAISPSSSFFNHSCIPNCINIRDGNKMTFKALYPVKKGEPLAISYLDLDLPVESRKE 313

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            LK  Y F C CPRC +    D+  ++ I + Y C+   C                  GL
Sbjct: 314 YLKYGYYFDCGCPRCDEKTNQDECMDNWISKFY-CQRKKC-----------------VGL 355

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 330
             SK ++     +VN   K TL  + C N   + STY
Sbjct: 356 YYSKTKVN--IDQVNKNDKITLNCSDCNNEFIINSTY 390


>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
           rerio]
          Length = 231

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 104/214 (48%), Gaps = 17/214 (7%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G+V+ ++ P+  V  +S + S C  CF     L++C  C+   YC   CQ+  W+ H+L
Sbjct: 24  AGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQCRFAQYCDKTCQRAGWEEHKL 83

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  +    K   ++V  + R++ ++  +  + +DN +        + +E L  H+ DI 
Sbjct: 84  ECAAIKTYGKPPSENVRLAARILWRMDKQGSVVSDNQL--------TTLEDLEDHICDIS 135

Query: 126 EKQLLLYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
           E  L  +    ++ N +  WP  S    ++ ++     + CN   + +   L+ +G GL+
Sbjct: 136 EDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGVGLF 193

Query: 181 PVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPK 213
           P + ++NH C PN  V++  G  + +  V H  K
Sbjct: 194 PNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQK 227


>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
           TFB-10046 SS5]
          Length = 531

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 98/400 (24%), Positives = 159/400 (39%), Gaps = 42/400 (10%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLEC---QVLSRLDKEKRKSVTP-SIRLMLKLYLR 94
           L++CS C    YC S+CQ  DW+ H+ EC   Q  +++  +   SV   +IR + ++  R
Sbjct: 56  LRRCSKCGAAHYCSSSCQTSDWQFHKTECVSLQQWAKMAPDPSLSVPGEAIRCLARVLRR 115

Query: 95  RKLQNDNVI-----------PSTTTDNYSLVEALVAHM--SDIDEKQLLLYAQIANLVNL 141
           R+ +  +              S+T  N  L++  V         E    L   +   + +
Sbjct: 116 REREGPDSPWTKQISVMQSCASSTAGNTLLLKTPVRQQLPPSAAETHTQLAHGVVKFMGI 175

Query: 142 -----ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVL 196
                + ++   S  ++ +  S+   N+ T+ +  L P+G  + P +++ NHSC PNAV+
Sbjct: 176 SRPQDLAKFGINSAGDLVDLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVV 235

Query: 197 VFEGRLAV--------VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           VF    A         + A++ +    EVL SY++ +     R++ LKE Y FTC C  C
Sbjct: 236 VFPSAAAPLGDETLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSAC 295

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCGL----VRSKEEIKKI 303
             L   DD           C   GC G         G   C +CG      +  E + ++
Sbjct: 296 SSL---DDPSVGDPRYAVWCP-KGCGGVCRIPELGSGIVICLRCGAEYSPAKQDEALDRV 351

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
                 L K T    S  +    ++T  +         HP    L      LI  L    
Sbjct: 352 RLGTEALEKATRLELSDPDRALRLTTNMLPLIGMPYSSHPLLGLLRLHSSLLISFLAASP 411

Query: 364 DWK---EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
             K   EA+A         Q V+ + HP+ G+     GKL
Sbjct: 412 SNKTLDEAIAMNFRIQRGMQDVFVEGHPVRGVALAELGKL 451


>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 481

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 95/424 (22%), Positives = 186/424 (43%), Gaps = 57/424 (13%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWK-LHR 64
            +G  I+  + ++ V + +     C  CF    L+     + V      CQ  +W  +H+
Sbjct: 30  TAGHPILELDTWLTVLDTTRLADTCSSCFGVKTLRD----REVDGTPEACQVSNWAAVHK 85

Query: 65  LECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            EC++  +L      ++ P+  R++L++ + +  + D+        N    +AL +H   
Sbjct: 86  HECKIFKKL----HPNILPTNSRVVLRIIIFKTYRQDD-----PGGNMQRFDALESHQIQ 136

Query: 124 IDEKQLLLYAQIANLVNLILQWP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
             + +   + ++A     + ++   E+S+++I E F KL  NA T+       +G  + P
Sbjct: 137 TLKSKPEYFQKLALSARAVREYSGTELSLHKIIEYFCKLDINAFTLTTPFYDHVGAAIEP 196

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           + ++ NHSC PNA   F+     VRA++ + KG +V +SYIET      RQ  L ++Y F
Sbjct: 197 LAALCNHSCSPNAATDFDKGKIWVRALRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYF 256

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG---FTCQQCGLVRSKE 298
            C C +C       +I+++A    +           ++++  +    F   + G   S +
Sbjct: 257 NCKCNKC-------EIEKNAPDTHFLRAITAVDSKTIQNAQQEAMELFEMVKPG-APSTD 308

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
            IK + S ++ L + TL                    L ++ Y       ++ R +LI  
Sbjct: 309 SIKNLRSAMSALRRTTLW------------------PLTRQPY-------VRLRGELIAS 343

Query: 359 LMELEDWKEALAYCQLT-IPVYQRVYP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 416
           LM    ++ A  +C +  + V   VYP ++HP+  +  +   KL  +L  T+     + E
Sbjct: 344 LMGARQFQSAFVHCVIRHLRVNPVVYPNRWHPISSMHKWVFVKLMRYL--TQAGDLGVAE 401

Query: 417 AVEI 420
            V++
Sbjct: 402 GVDL 405


>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 675

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 59/385 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL---KKCSACQVVWYCGSNCQKLDWKLHR 64
           GE+++ ++P+         ++ C  C     +    KC  C  + YC ++C+  D K+H 
Sbjct: 262 GEILVVEKPHCSFILAEYRLTHCHLCMNKIFVPIPSKCQVCSYLAYCSTSCRDADAKIHE 321

Query: 65  LECQVLSRLDKEKRKSVT--PSIRLMLKLYLRRKLQNDNVIPSTT------------TDN 110
            EC VL  L   K  SVT   ++R + +      ++  + + ST             +D+
Sbjct: 322 SECAVLPSLWMSK-TSVTCFLALRAITQRSFEEFIKLKDKLKSTKGRFEITQQRPYRSDD 380

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIA----NLVNLILQWPE-ISINEIAE---------- 155
           +     LV H  +  ++ +L    IA     L+ L   +PE +   + AE          
Sbjct: 381 FEAYYGLVTHEDERTDEDMLHRTYIAVWLLRLLKLSRYFPENVKTPDTAEAKPSEDELFI 440

Query: 156 ------NFSKLACNAH------------TICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
                     L  NAH            T+  ++   +G GL+P I++ NHSC P  +  
Sbjct: 441 GSLILHGLMLLQFNAHEISELTIPRGEKTLAKAKSTFIGGGLFPTIALFNHSCNPGVIRY 500

Query: 198 FEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
           F G   VVRA++ +  G E+  +Y  I T      R++ L+ QY F C C  C+      
Sbjct: 501 FIGTTMVVRAIRSIAAGEEISENYGPIFTMSPEAERKRTLRWQYWFDCNCEACVGHWPLL 560

Query: 256 DIQESAILEGYRCKDDG--CSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILS 311
           D  +  IL  ++C D G  C   L   +D   F   C +CG   +  +  K   + + L 
Sbjct: 561 DDIDPTILR-FKC-DTGRECGNILPVKTDTNEFMIRCPKCGKNTNILKGLKALQDTDSLF 618

Query: 312 KKTLALTSCGNHQEVVSTYKMIEKL 336
           +        G H E +S+Y  I KL
Sbjct: 619 RVASNNLEDGKHVEALSSYLKILKL 643


>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
          Length = 535

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 111/462 (24%), Positives = 189/462 (40%), Gaps = 54/462 (11%)

Query: 16  PYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLD 74
           P+    ++    SRC  C A ++ LK+C AC+   +C   CQ L WK HR EC+     D
Sbjct: 73  PFAYSLHDEQLSSRCWYCLAKADTLKRCRACRKGMFCNQKCQTLGWKDHRSECKAFQLHD 132

Query: 75  KEKRKSVTPSI--RLMLKLYLRRKL------QNDNVIPSTTTDNYSLVEALVAHMSDI-- 124
           K       P+I  RL+ ++  R K       + DN      T   S+++ + +H   I  
Sbjct: 133 K------IPNIEVRLLGRIVTRYKAIKLGKDKEDNNFYKDRTSKRSIMD-IWSHTDLIKQ 185

Query: 125 DEKQLLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPV 182
           D   +  +  I A+L+        +  +E+ E   +   N H I +   +  +G GLY  
Sbjct: 186 DPGAMKKFNGIYADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLD 245

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +   +HSC PN +   +G +A +R +     +   +    SYI+   +T  R+K LK+ +
Sbjct: 246 LCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTW 305

Query: 240 LFTCTCPRCIKLGQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
            F C C RC      DD   S+IL         Y C          +D + +  TC +C 
Sbjct: 306 YFECHCTRC---SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCH 359

Query: 293 LVRSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 350
              S E + +    +  + K  +   +      Q +     + E+  K L     VN+  
Sbjct: 360 NKVSPEYVVEAIGAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFL 416

Query: 351 TREKLIKILMELEDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWF 403
              K+I++L+ L +  +    CQL + ++       +  +P  HP +G+ Y +       
Sbjct: 417 C--KIIQLLIPLINMSD----CQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLK 470

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 445
                 A+    +A EI+  T G       E    L++A  E
Sbjct: 471 CKQPHRALLYCKKAYEIISFTLGPKHLMTIETDAMLKDASNE 512


>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
          Length = 732

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 114/486 (23%), Positives = 204/486 (41%), Gaps = 75/486 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK--CSACQVVWYCGSNCQKLDWKLHR 64
           +GE +  ++P   V       SRC  C   S+ K+  C AC  +W+C  +C++     H 
Sbjct: 252 AGETLFVEQPNALVVLPDFQTSRCHHCTRHSSAKRYPCLACGKIWFCSDSCRQESSCYHN 311

Query: 65  LECQVLSRLDKEKRKSVTPSIRLM--LKLYLRRKLQNDNV--IPSTT----TDNYSLVEA 116
            EC + + L+      +   I L   L   L      D V  +P       T +Y ++  
Sbjct: 312 FECGLEAVLNSVGIAHLGARIVLSHGLDSVLAFLKDTDKVKKVPGIDGPYDTKSYQVMFH 371

Query: 117 LVAHMSDIDEKQLLLYA-----------------QIANLVNLILQWPEISINEIAENFSK 159
           LV+H   +  ++L  YA                 Q A+L +  L    I ++       +
Sbjct: 372 LVSHTERMAPEELYQYALTAAFLTLLLEQHSSFFQSASLESQYLVGGLILVH-----VCQ 426

Query: 160 LACNAHTIC--------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           +  NAH I         N     + T +YP  S++NH+C P  +  F+G   +VRA+++V
Sbjct: 427 MVSNAHAITELCLIDENNERQERIATAIYPSASLMNHNCDPTVINSFQGNTLIVRAIRNV 486

Query: 212 PKGAEVLISYIETAGSTMTRQK------ALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
            +G EV   Y    G    R +      AL+ QY FTCTC  C+     +D Q+  ++  
Sbjct: 487 RQGDEVFNCY----GPHYRRMRRSERVEALEAQYSFTCTCDSCLDKNT-EDFQD--VIYS 539

Query: 266 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-------EIKKIASEVNILSKKTLALT 318
           + C    C G L+  + +      Q  L RS +       ++K     V++ ++ T A+ 
Sbjct: 540 FSCP--SCQGSLINPTGNNSSAQNQMALCRSCKTPQSYFTQLKADLEAVSLDARGTEAM- 596

Query: 319 SCGNHQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 375
              +  +++   K++ K   L+ K       +L ++ +KL +    +  ++E     +++
Sbjct: 597 ---DQGDIMGAIKLLTKCVQLRSKALFKGHPDLGKSADKLAQCYAFIGKYEECEKMLRIS 653

Query: 376 IPVYQRVYPQFHPLLG--LQYYTCGKLEWFLGDTENA----IKSMTEAVEILRITHGTNS 429
           +   +  Y ++   +   LQ +T   +E   G  + A    +  + EA+ I RI +G  S
Sbjct: 654 LTAVEHRYGRYSIEMANELQKFTDVLMELASGRNQRARDELVNYLEEAMLIYRIHYGPWS 713

Query: 430 PFMKEL 435
              KEL
Sbjct: 714 TSYKEL 719


>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 647

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 128/323 (39%), Gaps = 47/323 (14%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWK-LHRLEC 67
           V++ Q P+  V       S CD C      L  C  C  V YC S CQK   +  HR EC
Sbjct: 252 VLLRQLPHAAVVTGEFCESHCDHCCRRVEILFSCPRCMDVIYCSSECQKTAQESYHRFEC 311

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ------------NDNVIPSTTTDNYSLVE 115
             LS L   K         L L++  ++ L                 +     D+Y  + 
Sbjct: 312 GFLSYL---KNSGANVVAMLALRIVTQKSLDYFVEIKDELGSLGSEEVDRLPVDDYRRIY 368

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---------IAENFSKLACNAHT 166
             V H    D ++ L +A +A L+N +L       +          +  N   +  N+H 
Sbjct: 369 NFVTHSEGRDTEESLKWAFMAALLNTVLSLGYFYRSRYLDSYIGTLLLHNLQVVTFNSHE 428

Query: 167 ICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
           I  SEL+            +G GLYP + + NHSC P     F G    VR V+++P G+
Sbjct: 429 I--SELQRKSPRDTGISVCIGAGLYPTLVLFNHSCDPGITRYFVGSAVFVRTVRNIPAGS 486

Query: 216 EVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC-KDD 271
            V  +Y  +        R+K+LK+ Y F C C  C +    F D+  S +   +RC   +
Sbjct: 487 VVAENYGQLFVRAPRHERRKSLKKLYKFDCYCQACYEDWPTFFDMNPSVV--RFRCAATE 544

Query: 272 GCSGFLLRDS--DDKGFTCQQCG 292
           GC   L+     D     C++C 
Sbjct: 545 GCENGLIYTERLDQNVMKCEKCN 567


>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 740

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 123/273 (45%), Gaps = 30/273 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL---KKCSACQVVWYCGSNCQKLDW--KL 62
           GE+I+   P+ C  ++      C  CF        +KC++C+    C   C K      +
Sbjct: 284 GELILRVAPFGCALSDDQIDLNCGSCFRKIKYYKSEKCNSCKTNLLC-EICIKDPTIVSM 342

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT-----------DNY 111
           H+ EC ++S L K    + T   R M+++ L  +      I    T           D+Y
Sbjct: 343 HQEECSLISFLQKHYPNAQTRDFRFMIRVLLSGRANKLGRITKEKTPQLWNQQPFIYDSY 402

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIAN-LVNLI--LQWPE-----ISINEIAENFSKLACN 163
             + +L   +S I+ KQL  +    N + N    ++ P+      +  EI + + K+  N
Sbjct: 403 EDLASLTTDVSKIESKQLQSFTVATNSITNFFAAVKGPKYLSDPFTKQEIFDLYPKILFN 462

Query: 164 AHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISY 221
           AH   +      +G G+YP  S INH+CLPN     +   L + R+ + + KG E+  SY
Sbjct: 463 AHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHGLILYRSSRDILKGEEITTSY 522

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCI--KLG 252
           ++     + R+K LK QY F C C RC+  KLG
Sbjct: 523 LDILKPKLQRRKDLK-QYSFVCQCERCLNEKLG 554


>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
          Length = 713

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 116/481 (24%), Positives = 192/481 (39%), Gaps = 83/481 (17%)

Query: 8   GEVIISQEPYVCVP----NNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDW-K 61
           G+++  ++ +V  P    N      +C  C +   +   C+ C V  YC   C+  +W K
Sbjct: 235 GDLLFIEKAFVFAPVFKENKEFYSFKCYNCLSDIVSGVPCNTCTVCVYCDEKCRDQNWTK 294

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLK------LYLRRKLQNDNVIPSTTTDNYSLVE 115
            HR EC  +            P+ ++ML+        L    ++D        +NY    
Sbjct: 295 CHRWECDGMQANIWFDLGIALPAFKVMLEGALSGFQTLEGGFEDDVKHFGDKENNYRYFN 354

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE--------I 153
            L++++     K L  Y  +A++V   LQ       W       P+  +NE        I
Sbjct: 355 RLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELVKLVGGFI 412

Query: 154 AENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAVLVFEGRL 202
            ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN    F    
Sbjct: 413 TKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVTNYFMSDT 472

Query: 203 AVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
            VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C      D   E 
Sbjct: 473 IVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS-----DPAHEL 527

Query: 261 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 320
            +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K +   S 
Sbjct: 528 DMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQKYMESDSS 581

Query: 321 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI---- 376
              + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  L +    
Sbjct: 582 NQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFHLWLENEK 637

Query: 377 ------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 424
                        +Y+      H L G Q   C  L+ FL + E  IK   EA  IL + 
Sbjct: 638 ARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EAKAILNLY 694

Query: 425 H 425
           +
Sbjct: 695 Y 695


>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
          Length = 545

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 111/462 (24%), Positives = 190/462 (41%), Gaps = 54/462 (11%)

Query: 16  PYVCVPNNSSSISRCDGCFASSN-LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLD 74
           P+    ++    SRC  C A ++ LK+C AC+   +C   CQ L WK HR EC+     D
Sbjct: 73  PFAYSLHDEQLSSRCWYCLAKADTLKRCRACRKGMFCNQKCQTLGWKDHRSECKAFQLHD 132

Query: 75  KEKRKSVTPSI--RLMLKLYLRRKL------QNDNVIPSTTTDNYSLVEALVAHMSDI-- 124
           K       P+I  RL+ ++  R K       + DN      T   S+++ + +H   I  
Sbjct: 133 K------IPNIEVRLLGRIVTRYKAIKLGKDKEDNNFYKDRTSKRSIMD-IWSHTDLIKQ 185

Query: 125 DEKQLLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPV 182
           D   +  +  I A+L+        +  +E+ E   +   N H I +   +  +G GLY  
Sbjct: 186 DPGAMKKFNGIYADLLAFYGSKAMVGKDEVFELHCRDYINRHAISDCGYIEEIGKGLYLD 245

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQY 239
           +   +HSC PN +   +G +A +R +     +   +    SYI+   +T  R+K LK+ +
Sbjct: 246 LCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAHYSYIDLINTTQQRRKLLKDTW 305

Query: 240 LFTCTCPRCIKLGQFDDIQESAIL-------EGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
            F C C RC      DD   S+IL         Y C          +D + +  TC +C 
Sbjct: 306 YFECHCTRC---SDPDDALLSSILCPNCPEKREYLCI---FGDLPYKDRNTQIITCPKCH 359

Query: 293 LVRSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 350
              S E + +    +  + K  +   +      Q +     + E+  K L     VN+  
Sbjct: 360 NKVSPEYVVEAIGAMRFIDKIVENHEVEQMSREQSIKFLTDLKERFSKLLS---KVNVFL 416

Query: 351 TREKLIKILMELEDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWF 403
              K+I++L+ L +    ++ CQL + ++       +  +P  HP +G+ Y +       
Sbjct: 417 C--KIIQLLIPLIN----MSDCQLLLDLHLEAEECVRFCFPFNHPAVGIHYRSIATFYLK 470

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 445
                 A+    +A EI+  T G       E    L++A  E
Sbjct: 471 CKQPHRALLYCKKAYEIISFTLGPKHLMTIETDAMLKDASNE 512


>gi|341883792|gb|EGT39727.1| CBN-SET-18 protein [Caenorhabditis brenneri]
          Length = 499

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/446 (21%), Positives = 176/446 (39%), Gaps = 50/446 (11%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           V   + I + P+     N      C  C         L +C  C    YC   CQ   W 
Sbjct: 25  VHARIKIFETPFATQVLNPKVNEFCANCLRGPAPGEKLLRCGGCNFSMYCSKECQATAWL 84

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-------QNDNVIPSTTTDNYSLV 114
           +H+ EC+ L        K+  P++ L   L+L + +       +N + +       +S  
Sbjct: 85  VHKPECKRL--------KASFPNLPLTEVLFLSKVIDRIQFLDKNGDKLGIEAERKFS-- 134

Query: 115 EALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +LV H +DI  DE+++  + ++   + L  +   I   E  + F K + N+H+I  +  
Sbjct: 135 -SLVDHKTDIKEDEEKMKHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAG 193

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGS 227
             +G  L   +S  NHSC P   +VF+G    +R +  VP        +  ISYI+   S
Sbjct: 194 NEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRS 251

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--G 285
              R++ LK  + F C C RC+         E   L   RC +  C   +L    ++   
Sbjct: 252 KFVRRRDLK--WYFNCECTRCMD-------PEDDALTAIRCANPSCDAPILTSETEEPMN 302

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
             C +C  +  + ++K     +        +L +  + +      + +    +++ HP +
Sbjct: 303 IACDKCKTIVEENDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSN 355

Query: 346 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 405
           V + + R  L  +   L     +  + Q+    Y+  +P+    +G Q     K      
Sbjct: 356 VYVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKD 414

Query: 406 DTENAIKSMTEAVEILRITHGTNSPF 431
           + + A+    +A+ I  +  G + P+
Sbjct: 415 ERDEAMPYAFDAMNIFEVCFGLDHPY 440


>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
          Length = 896

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/225 (26%), Positives = 110/225 (48%), Gaps = 24/225 (10%)

Query: 33  CFASSNLKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKL 91
           CF  ++++ CS C  V YC + CQ  +W  +H  EC++L ++ ++    +   +R +++ 
Sbjct: 418 CFKPADVRSCSGCHAVSYCDAACQSANWTAVHSKECKLLRKVTEQGHPGIPTPVRAVIQA 477

Query: 92  YLR----RKLQN--DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW 145
            ++      L+N   NV     +D ++ +E ++A  +     Q     ++   + L+   
Sbjct: 478 LVKPGIGAALENLEGNVESWRKSDKWADME-MMAMGATAFTGQGTGQEELQKTLALLC-- 534

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
                        K+  NA    +++L  +G  L P +++ NHSC+PNA++ F GR A++
Sbjct: 535 -------------KIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGRKAIL 581

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           RA + +    E+ ISY +       R+ AL   Y F C CPRC K
Sbjct: 582 RAEKPIKVDEEIEISYTDYTFPRSKRKHALAP-YFFDCQCPRCEK 625


>gi|336382331|gb|EGO23481.1| hypothetical protein SERLADRAFT_449865 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 435

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 73/271 (26%), Positives = 125/271 (46%), Gaps = 28/271 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           SG ++IS  P   V   S    RCD CF       +L++C+ C   WYCG+ CQ + W +
Sbjct: 30  SGSIVISLPPLTTVLLPSEKGCRCDYCFRRDSPDVHLRRCTGCASYWYCGAQCQDMHWAV 89

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNV--IPSTTTDNYSLVEALVAH 120
           H+  C+  +R         +P     +KL    KL    +  + +  T N S      + 
Sbjct: 90  HKNLCKAFNRY------CASPD---FVKLAEHEKLDALLLTHLAAQITSNSSPRSTPASS 140

Query: 121 MSDIDEKQLLLYAQIANLV--NLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
           +S     +   +++ +  V  ++ ++   +S +E+   +S+   N   I +S L     G
Sbjct: 141 LSTFLNLRPGPFSKGSRPVVCSMTVKRSGLSSDELEAVYSRFGNNNFAI-HSHLTTYAHG 199

Query: 179 LYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           ++P+ S + NHSCLPNA   F     E     + A++ +  G E+ I+Y++ A    +RQ
Sbjct: 200 VFPLASRLFNHSCLPNAAAKFVLSPSEPIHMEIVAIRDISPGEEICITYLDPA-LLQSRQ 258

Query: 233 KALKEQYLFTCTCPRCI---KLGQFDDIQES 260
           +  +  Y FTC CP C    ++G    + ES
Sbjct: 259 QIFELTYGFTCLCPSCALLNRIGLIPPLPES 289


>gi|336369557|gb|EGN97898.1| hypothetical protein SERLA73DRAFT_109167 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 452

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 73/271 (26%), Positives = 125/271 (46%), Gaps = 28/271 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           SG ++IS  P   V   S    RCD CF       +L++C+ C   WYCG+ CQ + W +
Sbjct: 30  SGSIVISLPPLTTVLLPSEKGCRCDYCFRRDSPDVHLRRCTGCASYWYCGAQCQDMHWAV 89

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNV--IPSTTTDNYSLVEALVAH 120
           H+  C+  +R         +P     +KL    KL    +  + +  T N S      + 
Sbjct: 90  HKNLCKAFNRY------CASPD---FVKLAEHEKLDALLLTHLAAQITSNSSPRSTPASS 140

Query: 121 MSDIDEKQLLLYAQIANLV--NLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
           +S     +   +++ +  V  ++ ++   +S +E+   +S+   N   I +S L     G
Sbjct: 141 LSTFLNLRPGPFSKGSRPVVCSMTVKRSGLSSDELEAVYSRFGNNNFAI-HSHLTTYAHG 199

Query: 179 LYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           ++P+ S + NHSCLPNA   F     E     + A++ +  G E+ I+Y++ A    +RQ
Sbjct: 200 VFPLASRLFNHSCLPNAAAKFVLSPSEPIHMEIVAIRDISPGEEICITYLDPA-LLQSRQ 258

Query: 233 KALKEQYLFTCTCPRCI---KLGQFDDIQES 260
           +  +  Y FTC CP C    ++G    + ES
Sbjct: 259 QIFELTYGFTCLCPSCALLNRIGLIPPLPES 289


>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
          Length = 559

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 107/243 (44%), Gaps = 29/243 (11%)

Query: 16  PYVCVPNNSSSISRCDGC--------FASSNLKKCSACQVVWYCGSNCQKLDWK-LHRLE 66
           P + +P+  S  + C+ C        F+  +LK C+ C+   YCG  CQ+  WK +H+ E
Sbjct: 66  PTLALPDGPSMRTTCNYCLRVGSNEGFSPVSLKACTGCKAAVYCGPTCQRAHWKSIHKAE 125

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C++ +R+ +   K             +R      + + S    N    +A      D + 
Sbjct: 126 CKMFARVRETTGKEWH---------GIRDPFDGSDPLES----NLEGFKADEQVWGDFEL 172

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 186
           + +         +N      E  +        ++  NA    +++    G  L P ++ +
Sbjct: 173 QAMAAMTYSGVFMN------EDGLRVAMRFLCQIQTNAFNRLDADTGMSGIFLDPALARV 226

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
           NHSC+PNA + F+ R A +RA + + +G E+ ISYI        R++ L+  Y F C CP
Sbjct: 227 NHSCVPNAFIGFDKRTATLRAERPIKEGEEITISYIANDKPRSIRREGLR-LYYFECDCP 285

Query: 247 RCI 249
           RC+
Sbjct: 286 RCV 288


>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
 gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
          Length = 649

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 157/376 (41%), Gaps = 48/376 (12%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKKC-SACQVVWYCGSNCQK-LDWKLHRLEC 67
           +I+ ++P+V V     S+S C  CF    +  C   C  V +C   C+K  +   HR EC
Sbjct: 253 IILLEKPHVSVLLEEYSLSHCSTCFKRVTIPVCCPKCSDVIFCSEACEKKANSSYHRYEC 312

Query: 68  QVLSRLDKEKRKSVT--PSIRLMLK------LYLRRKLQN--DNVIPSTTTDNYSLVEAL 117
             L    K    S+T   ++R++ +      + LR +L      V      D+Y  V  L
Sbjct: 313 GFLPIFWKSG-ASITCHMALRMITQKTEEYFVGLRHELDGLTSEVTDKLNNDDYRKVYKL 371

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---------SKLACNAHTIC 168
           V H      +       +A L+N  L       ++ +E+F           L  NAH I 
Sbjct: 372 VTHEDKRSPEDYFQRTLMATLLNACLSLGGYYKSKESESFIGGLLLHNLQFLQYNAHEI- 430

Query: 169 NSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
            SEL+            +G GLYP +++ NHSC P     + G    VR V+ +P G+ V
Sbjct: 431 -SELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRYYRGNSVCVRTVKGIPAGSMV 489

Query: 218 LISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCKDDG-- 272
             +Y  + T  S   R+  L  QY FTC C  C     +F D+  + +    R K DG  
Sbjct: 490 AENYGPLFTQVSREERRSTLLNQYKFTCNCRACAGDWPKFADMDSNVL----RFKCDGGK 545

Query: 273 -CSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
            CS  L   ++   F   C +CG   +  +  K   + ++L K    L   G ++  +  
Sbjct: 546 NCSNVLEIPAEINEFMVQCTECGEHTNIMKGLKSLQDTDMLFKSATKLHEAGEYEFALKK 605

Query: 330 Y-KMIEKLQKKLYHPF 344
           Y +M+  L + L  P+
Sbjct: 606 YVEMMNTLDEVLVPPY 621


>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
          Length = 661

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 111/377 (29%), Positives = 168/377 (44%), Gaps = 53/377 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWK-LHR 64
           +GE+++ +EP+  V  +  S S C  CF    +   C  C  V +C   C  +  K  H 
Sbjct: 265 AGEILLIEEPHGGVLLSEFSKSHCQNCFNKCLIPLPCPKCPNVIFCSEKCLDIALKSYHG 324

Query: 65  LECQVLSRLDKEKRKSVTPSIRLML-----KLYLRRKLQNDNVIPSTT--TDNYSLVEAL 117
            EC +L  L K    SVT  I L +     K Y  + +Q+    P+    T++Y  +  L
Sbjct: 325 YECHILPLLWKSG-CSVTCHIALRMITQNSKDYFMKIMQDLKEKPTGPYKTEDYRNIYHL 383

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEIS--INE--------IAENFSKLACNAHTI 167
           V+H +   ++ +L   ++A     +L+  EIS   N+        I +N   L  NAH +
Sbjct: 384 VSHENKRTKQDILHRTEMAIF---LLKLLEISGYFNDDAASFGCLILKNLQVLQFNAHEV 440

Query: 168 CNSE-LRP---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
              + L+P               +G  +YP +++ NHSC P  V  F G   VV AV+++
Sbjct: 441 FEIQCLKPKDGTRFLKHEGKSVFIGGAVYPTLALFNHSCEPGIVRYFCGSRIVVCAVKNI 500

Query: 212 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 268
            KG EV  +Y  I T      RQ  LKEQY F C C  C +   +++D+ E+ +   ++C
Sbjct: 501 RKGEEVAENYGPIFTTVPKDKRQSQLKEQYWFDCKCLPCEQNWPKYEDMTENYLR--FKC 558

Query: 269 KDD-GCSGFLLRDSDDKGFTCQQCGLVRSKEEI-KKIAS--EVNILSKKTLALTSCGNHQ 324
             D  CS  +    D   F   QCGL +    I K + S  +   L K   A    G + 
Sbjct: 559 DSDQPCSNVIPVPYDCMEFMV-QCGLCQQYTNILKGLKSLQDTETLYKLGRAAMGEGKYG 617

Query: 325 EVVSTYKMIEKLQKKLY 341
           E +   K IE L  KLY
Sbjct: 618 EAIK--KFIETL--KLY 630


>gi|38566954|emb|CAE76256.1| conserved hypothetical protein [Neurospora crassa]
          Length = 533

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/244 (26%), Positives = 117/244 (47%), Gaps = 34/244 (13%)

Query: 7   SGEVI-ISQEPYVCVPNNSSSISRCDGCFASSNL--------------KKCSACQVVWYC 51
           +GE I I + P + +P++++  + C+ C   S                K C+ C+   YC
Sbjct: 34  AGETIAIIENPLLALPDDANMRTTCNYCLYVSGTIEFEGDVEAGARTCKACTGCKAAVYC 93

Query: 52  GSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVI----PS 105
            + CQ+  WKL H+ EC++  R+ +   K   P+ +R + ++ L  K  ++  +    P 
Sbjct: 94  NAECQRAHWKLVHKAECKMFKRIQERTGKDWLPTPVRAVAQVMLLLKAGDEETVKAFGPG 153

Query: 106 TT----TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 161
            T     + +   E L     ++     ++YA +       LQ  E ++ +  E   K+ 
Sbjct: 154 GTLESNVEGFKTDEGLWGDF-ELQATGAVVYAGL-------LQSDE-TLKQAMEVLCKIQ 204

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
            NA    +++    G  L+P +S++NHSC+PNA + FE R A ++A + +  G E+LISY
Sbjct: 205 TNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAERDLEPGDEILISY 264

Query: 222 IETA 225
           I  A
Sbjct: 265 IGMA 268


>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
           NIH/UT8656]
          Length = 550

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/225 (29%), Positives = 111/225 (49%), Gaps = 26/225 (11%)

Query: 37  SNLKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLR 94
            +LK CS C+V  +C   CQ   W + HRLEC++ SRL       V PS +R +++L  +
Sbjct: 100 DSLKTCSRCKVAKFCDQKCQTEAWSQYHRLECKLFSRL----YPRVLPSTVRAVIRLLKQ 155

Query: 95  RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA 154
            K     ++P      +  + AL +H  D+       +  +  +   I  +     +E  
Sbjct: 156 HKA---GILPPG---EWEQLLALQSHQQDLANAGGQRWQDLLIMSQGIKGYS--GTDEDD 207

Query: 155 ENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLA 203
           +   +L+C    N+ T+ N+    +G  L+P  +++NHSC PNA + F+       G ++
Sbjct: 208 DLILRLSCAVIVNSFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSIS 267

Query: 204 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V  A++ + K  E+ ISYI+T      RQ+ L E+Y FTC C  C
Sbjct: 268 V-HALRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLC 311


>gi|85117964|ref|XP_965349.1| hypothetical protein NCU02962 [Neurospora crassa OR74A]
 gi|28927157|gb|EAA36113.1| predicted protein [Neurospora crassa OR74A]
          Length = 610

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/244 (26%), Positives = 117/244 (47%), Gaps = 34/244 (13%)

Query: 7   SGEVI-ISQEPYVCVPNNSSSISRCDGCFASSNL--------------KKCSACQVVWYC 51
           +GE I I + P + +P++++  + C+ C   S                K C+ C+   YC
Sbjct: 34  AGETIAIIENPLLALPDDANMRTTCNYCLYVSGTIEFEGDVEAGARTCKACTGCKAAVYC 93

Query: 52  GSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVI----PS 105
            + CQ+  WKL H+ EC++  R+ +   K   P+ +R + ++ L  K  ++  +    P 
Sbjct: 94  NAECQRAHWKLVHKAECKMFKRIQERTGKDWLPTPVRAVAQVMLLLKAGDEETVKAFGPG 153

Query: 106 TT----TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 161
            T     + +   E L     ++     ++YA +       LQ  E ++ +  E   K+ 
Sbjct: 154 GTLESNVEGFKTDEGLWGDF-ELQATGAVVYAGL-------LQSDE-TLKQAMEVLCKIQ 204

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
            NA    +++    G  L+P +S++NHSC+PNA + FE R A ++A + +  G E+LISY
Sbjct: 205 TNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAERDLEPGDEILISY 264

Query: 222 IETA 225
           I  A
Sbjct: 265 IGMA 268


>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 676

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 92/383 (24%), Positives = 159/383 (41%), Gaps = 55/383 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK---KCSACQVVWYCGSNCQKLDWKLHR 64
           GE ++ ++P+  V      +S C  CFA   +     C  C  V YC   C+  D ++H+
Sbjct: 263 GETLVIEKPHCAVLLAEYRLSNCHHCFAKIFVPIPTSCDTCNFVAYCSIPCRNKDAEIHK 322

Query: 65  LECQVLSRL-DKEKRKSVTPSIRLMLK------LYLRRKLQNDNVIPSTTT------DNY 111
            EC +L  L   E   +   +++ +++      L L+ KL+       T+T      D++
Sbjct: 323 NECMILPSLWFSETSVNCFLALKAIVQKPFEELLALKDKLKATKGRFETSTQRPRRHDDF 382

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIAN-LVNLILQWP-------------------EISIN 151
             +  L+ H  +   + L     IA  L+ L+ + P                   E+ I 
Sbjct: 383 EAIYGLITHEEERTSEDLFHRTYIATWLLRLLKRNPYFPEWVKTPDSAEAIPSDGELYIG 442

Query: 152 E-IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
             I  N   +  NAH I              ++ + +G G+Y  +S+ NHSC P  +  F
Sbjct: 443 SLILHNLMLIQFNAHEISELAVPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYF 502

Query: 199 EGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
            G   VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +
Sbjct: 503 IGTTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLE 562

Query: 257 IQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 313
             +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + + 
Sbjct: 563 EIDPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRT 621

Query: 314 TLALTSCGNHQEVVSTYKMIEKL 336
                  G HQE + +Y  I KL
Sbjct: 622 ASRNLEEGKHQEALKSYLEILKL 644


>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
          Length = 229

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/255 (21%), Positives = 110/255 (43%), Gaps = 28/255 (10%)

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
           +NHSC PN ++ ++G LA VRAVQ +  G EV  SYI+    T  R   L++ Y FTC C
Sbjct: 1   MNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 60

Query: 246 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 305
             C                              +D D      ++       E I+ +  
Sbjct: 61  RECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIRDMVR 92

Query: 306 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 365
               + ++        +  E++   ++ ++    ++   +V ++    + + + + ++DW
Sbjct: 93  YARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDW 152

Query: 366 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 425
           + AL Y Q  I  Y + YP +   +   +   G+L   L +     K++ +A+ I+ + H
Sbjct: 153 EGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAH 212

Query: 426 GTNSPFMKELILKLE 440
           G + P++ E+  ++E
Sbjct: 213 GKDHPYISEIKQEIE 227


>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 616

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 103/237 (43%), Gaps = 21/237 (8%)

Query: 38  NLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRK 96
           + K C+ C+ + YC   CQ   WK  H+ EC VL   +   R  +   +R ++KL  R K
Sbjct: 132 DTKACAGCKKIRYCSKTCQARSWKREHKYECNVLKHPN---RPDLPHGVRAVIKLLGRLK 188

Query: 97  LQ---NDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS---- 149
                 D ++ S      +     +  +   D ++   +  +          PE S    
Sbjct: 189 ADPEGKDELLLSILQFKPAADSKALDFIKQHDPQRFEDFNMLGYGAWKYAGEPEFSHPNS 248

Query: 150 ----INEIAEN-----FSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFE 199
               ++   E      F  +  N   + N  +   LG G  P+++  NHSC PNA +VF 
Sbjct: 249 PGVKLSNSGEATAKAFFFNVISNLMQLSNPIDDTKLGIGFDPILNSSNHSCDPNAAVVFN 308

Query: 200 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
               V+RA++ + KG E+ + YI+ +     RQ  LKE Y F+C C +C K   FD+
Sbjct: 309 QPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAELKESYFFSCRCSKCKKGAIFDE 365


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 98/423 (23%), Positives = 171/423 (40%), Gaps = 42/423 (9%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC------FASSNLKKCSACQVVWYCGSNCQKLDW 60
           +G  I+++      P     +S C  C      +A    +    C  + YC   C++   
Sbjct: 78  AGTTILAEPAIAWQPLEDRIMSVCHHCMTEVPRWAVGCGEGAGGCSGLGYCSPKCREASE 137

Query: 61  KLHRLECQVL--SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
            LHR+E +V   +R    + K+    I+L  ++   R L   + +          V A++
Sbjct: 138 ALHRVEHKVFLQARDIANRTKADITLIKLATRIIALRSLSEGHRVQFERG-----VMAMM 192

Query: 119 AHMSDIDEKQLLLYAQIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL--RPL 175
            H  +  ++ +    + A LV  +L  P  +S  E  +  +++  N+H   +  +  R L
Sbjct: 193 GHEDECPQQWVDSITETAKLVMPLLPKPARLSAREFVKVCARINTNSHRQHHMFVPQRIL 252

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G GLYP+ S+INHSC PN      G    +R +  V +G E+  SYI+   S   R+  L
Sbjct: 253 GVGLYPLASLINHSCQPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAEL 312

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDG---CSGFLLRDSDDKGFTCQQ 290
            E   F C C RC        I +S    L G++C +     C G L+   D +G   ++
Sbjct: 313 LETKHFDCLCNRCSP-----PITDSVDRYLSGFQCPNKAKTSCDGLLVL-PDGEGLASEK 366

Query: 291 -------CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY-- 341
                  CG  ++  ++ K      I+   + +L    + +    +  M++KL    Y  
Sbjct: 367 PVSCTAGCGETKTAGDLLKAQKRAEIMLTASQSLYY--DQKRTSDSKTMLDKLLTGGYSP 424

Query: 342 ----HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 397
               HP+   + Q     I +   L     ++ +C+  +   + V P  H      YY  
Sbjct: 425 DVQLHPYHPTVFQAHVDSINVSDALGYTHMSIDHCEAVLRCAEAVLPMNHLETSNYYYYQ 484

Query: 398 GKL 400
           G L
Sbjct: 485 GML 487


>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
 gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 521

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 122/293 (41%), Gaps = 59/293 (20%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKK---------CSACQVVWYCGSNCQKLDW 60
           VI+   PY    ++++  S C  CF    L +         C  CQ V YC +NC+ +D+
Sbjct: 147 VILRDLPYTWAVDHATCDSVCQHCFLEVPLNQQILPTDFYMCEGCQRVGYCSANCRCIDY 206

Query: 61  KLHRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQ-----------NDNVIPSTTT 108
             HR ECQ+   LD E+      S I+L+++   R+ L+           ND  I    T
Sbjct: 207 SQHRFECQIFKELDTEEYSPFLMSEIKLLVRTLSRKWLEDSITQTAGIDINDETIKKQNT 266

Query: 109 DN-YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE------------------IS 149
            N Y   ++L+       +   L Y   A LV+ +  + E                    
Sbjct: 267 YNQYKNPQSLIP------QDNGLRYNDYAELVSNVENYNESLKESLSYWICKYVVKLSAK 320

Query: 150 INEIAENFSKLACNAHTICNS---ELRPL--------GTGLYPVISIINHSCLPNA-VLV 197
           + +I + F  L       CN+   + RP         G G+Y   S  NHSC PN    V
Sbjct: 321 LGKIEDEFDLLNILLRNRCNAFYIQGRPRDGSSGESRGCGVYVRNSFFNHSCDPNVNYWV 380

Query: 198 FEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
               L V    +++V +G E+ ISYI+T      R++ L E YLF C C +C+
Sbjct: 381 VNNTLEVECTLLKNVKEGDELTISYIDTTSPLNKRREKLLEGYLFNCLCTKCV 433


>gi|393904729|gb|EJD73778.1| hypothetical protein LOAG_18824 [Loa loa]
          Length = 420

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 112/436 (25%), Positives = 176/436 (40%), Gaps = 76/436 (17%)

Query: 15  EP--YVCVPNNSSSISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDWK-LHRLECQ 68
           EP  Y+C+ N  ++   C  C    +   L KCS C+   YC   CQ+L WK  HR+ECQ
Sbjct: 21  EPFAYICINNQVNNY--CSYCLRKPDKSILYKCSKCEFAKYCNKECQRLAWKKHHRMECQ 78

Query: 69  VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVI-PSTTTDNYSLVE-----ALVAHMS 122
            L          V P++ L   L+L R +  D V+      D Y   +     +L+ H  
Sbjct: 79  RLV--------MVYPNLPLTEVLFLSRII--DKVLFIEQNGDKYKWEKDRKWNSLIGHED 128

Query: 123 DI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
           DI  DE + + + +I   + +  +   I   +  E F K   N+H I  +    +G  L 
Sbjct: 129 DIRNDELKYVHFEKIYEKMAIFRKDEMIEKEKFYEIFCKTTINSHAIHTNAGDEIGLAL- 187

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
                       NA  + +                   ISYI+   S   RQK LK ++ 
Sbjct: 188 --------DLETNASDIHKA-----------------FISYIDIGCSRYQRQKVLKLKWY 222

Query: 241 FTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQQCGLVRSKE 298
           F C C RC+     DD     IL   RC ++ C   L+   DS+     C++C  +  ++
Sbjct: 223 FDCQCDRCMDPS--DD-----ILTSIRCMNEQCDEALIITEDSEPVNIICRKCKQITDED 275

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK--LI 356
            +KK    +  L  +    +   N Q         EKL + + +  S N+  TR K  L+
Sbjct: 276 HVKKCQQLMLNLPVRFSIESKAENIQ---------EKLNEAMKYLHSKNVYVTRLKAALL 326

Query: 357 KILMELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 415
            +   L+D    L + Q ++   Y+  +P+    L  Q     K     G+ + AI    
Sbjct: 327 YVTGTLDD---NLLFIQKSVYENYRLCFPRSDRHLAYQLLQIAKSHIEKGERKEAISYAY 383

Query: 416 EAVEILRITHGTNSPF 431
           EA+ I  +  G N P+
Sbjct: 384 EAMCIFEVCFGLNHPY 399


>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
          Length = 369

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 74/316 (23%), Positives = 129/316 (40%), Gaps = 40/316 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           +   + I + P+     N  +   C  C         L +C  C    YC  +CQ + W 
Sbjct: 27  IHARIKIFETPFATQVLNPKANEFCANCLRGPAPGEKLLRCGGCNFSMYCSKDCQAIAWL 86

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-------QNDNVIPSTTTDNYSLV 114
           +H+ EC+ L        K+  P++ L   L+L + +       +N + +       +S  
Sbjct: 87  VHKPECKRL--------KASFPNLPLTEVLFLSKVIDRIQFLEKNGDKLGVEAERTFS-- 136

Query: 115 EALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +LV H  DI  DE+++  + +I   +    +   I      + F K + N+H+I  +  
Sbjct: 137 -SLVDHKKDIREDEEKMKHFEKIFVKMGNFRKDDMIEKGHFFDVFCKASINSHSIHTNAG 195

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGS 227
             +G  L   +S  NHSC P   +VF+G    +R +  VP        +  ISYI+   S
Sbjct: 196 NEVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEQAFISYIDVGRS 253

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--G 285
              R++ LK ++ F C C RC+         E   L   +C +  C   +L    ++   
Sbjct: 254 KFVRRRDLKSRWYFDCECTRCMD-------PEDDSLTAIKCANPACDAPILTSETEEPMN 306

Query: 286 FTCQQCGLVRSKEEIK 301
             C +C  V  +  +K
Sbjct: 307 IACDKCKTVVEESTVK 322


>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
           NZE10]
          Length = 549

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 66/235 (28%), Positives = 105/235 (44%), Gaps = 26/235 (11%)

Query: 32  GCFASS--NLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLM 88
           G FA+   + K C+ C+ V YC  +CQ   WK  H+ EC+VL+  D   R  +   +R +
Sbjct: 70  GNFANRIVDTKACTGCKRVRYCSRSCQSKAWKREHKYECKVLAPTD---RPDLPHGVRAV 126

Query: 89  LKLYLRRKLQND------------NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYA--Q 134
           +KL  R  ++ND               P+ + +  +L E    +    ++  +L Y   +
Sbjct: 127 VKLLGR--MKNDPEGKDEALLDVLQFWPAGSAERTALDEFKAQNQQRYEDFGMLAYGAWK 184

Query: 135 IANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPN 193
            A    +     E   N +   F  +  N   + N  +   LG G  P++   NHSC PN
Sbjct: 185 YAGEPKMGGTESEAIANGL---FFNVMSNTVQLSNPLDDTSLGMGFDPIMCSANHSCDPN 241

Query: 194 AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           A  +F     ++RA++ + KG E+ + Y +       RQ  LK  Y FTC C RC
Sbjct: 242 AAALFNQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAELKGYYFFTCHCARC 296


>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
 gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
          Length = 650

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 145/371 (39%), Gaps = 38/371 (10%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQ-KLDWKLHRLEC 67
           +++ + P+V V     S+  C  CF   ++   C  C  V +C   C+ K +   HR EC
Sbjct: 254 ILLLERPHVSVLLEDYSLDHCTHCFKRVSVPIACPLCADVVFCSDECETKANATYHRYEC 313

Query: 68  QVLSRL---------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
             L  L             R     S    LKL        +  I     D+Y  V  LV
Sbjct: 314 GFLPILWGSGASITCHMALRMITQKSEEYFLKLKPELAGLTNEQIDKLPVDDYRKVYKLV 373

Query: 119 AHMSDIDEKQLLLYAQIANLVNLIL---------QWPEISINEIAENFSKLACNAHTICN 169
            H S    +       +A L+N  L         Q        +  N   L  NAH +  
Sbjct: 374 THESTRSPEDFFQRTLMATLLNACLTLGGYGACPQEQNFIGGLLVHNLQLLQFNAHEVSE 433

Query: 170 ---------SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 220
                     +   +G GLYP +++ NHSC P     + G    VR V+++P  + V  +
Sbjct: 434 MIRETAEDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIPADSMVAEN 493

Query: 221 Y--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGF 276
           Y  + T      R+  L  QY FTC C  C++        +  ++  +RC D G  CS  
Sbjct: 494 YGPLFTQVRRDERRDTLLHQYRFTCQCVPCVENWPLFTEMDPGVIR-FRC-DSGKICSNV 551

Query: 277 LLRDS--DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY-KMI 333
           LL  +  +D    C +CG   +  +  K   + ++L K    L S G ++  +  Y +M+
Sbjct: 552 LLIPAAVNDFMVKCTECGEHTNIMKGLKSLQDTDMLFKTATRLHSAGEYEAALRKYIEMM 611

Query: 334 EKLQKKLYHPF 344
           E + + L  P+
Sbjct: 612 ETMSEVLVPPY 622


>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
          Length = 675

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 159/390 (40%), Gaps = 64/390 (16%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKL 62
           V GE++I +  +       + ++ C+ CF          C  C  V YC   C+  D ++
Sbjct: 259 VPGEIVIVERAHCAFLLEENRLTHCNLCFVRIFVPTPAACRTCNCVMYCSRRCRDADAEV 318

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLML--------KLYLRRKLQNDNVIPSTTTDN---- 110
           H  EC++L  L   K  SVT  + L           + L+ +L N       +T+N    
Sbjct: 319 HSRECKLLPALWHSK-ASVTCFLALRAITQKPFEETMKLKERLTNPGSASKISTENPYRG 377

Query: 111 -------YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA-- 161
                  Y+LV      M +    +  + A +  L+     +PE ++  +    SKL+  
Sbjct: 378 DDYANTFYNLVTHEDKRMPEDIFHRAYMAAWLLRLLRAGEYFPE-NVKTVDSVDSKLSDE 436

Query: 162 ----------------CNAHTICNSEL-RP-------------LGTGLYPVISIINHSCL 191
                            N+H I  SEL RP             +G G+YP ++++NHSC 
Sbjct: 437 ELFIAELLLHNLQLLQFNSHEI--SELVRPKGAKTLAKAKSMFIGGGVYPTVAMLNHSCN 494

Query: 192 PNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI 249
           P  V  F G   +VRA++ +  G E+  +Y  I T    + R++ L+ QY F C C  C 
Sbjct: 495 PGVVRYFIGTTMIVRAIRTISVGEEISENYGPIFTTTPEIERKRKLRVQYWFDCNCEACS 554

Query: 250 KLGQFDDIQESAILEGYRCK-DDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASE 306
                 D  +  IL  ++C+    C   LL  SD K F   C +CG   +  +  K+  +
Sbjct: 555 GHWPLLDELDPTILR-FKCETGPSCGNVLLVRSDTKEFMIGCAKCGKSTNILKGLKVLQD 613

Query: 307 VNILSKKTLALTSCGNHQEVVSTYKMIEKL 336
            + L +        G +++ +  Y  I KL
Sbjct: 614 TDALFRVASTNLEEGRNEQALKAYLEILKL 643


>gi|330792277|ref|XP_003284216.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
 gi|325085913|gb|EGC39312.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
          Length = 507

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/272 (26%), Positives = 131/272 (48%), Gaps = 29/272 (10%)

Query: 8   GEVIISQ-EPYV-CVPNNSSSISRCDGCFASS---------NLKKCSACQVVWYCGSNCQ 56
           GE I+S  EPYV  VP ++     CD C  +          +LK+C+ C++++YC  +CQ
Sbjct: 53  GEEILSNIEPYVHSVPQHNIV---CDYCLKNKNNENLNTTISLKRCTGCKLIYYCSIDCQ 109

Query: 57  KLDWKLHRLECQVLSRLDKEKRKSVTP----SIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           K  W +H+ EC++L    +E  +   P    S  ++L+++++R L+  N      T  + 
Sbjct: 110 KKAWPIHKHECKILLAAAQESLQINKPINTKSTVMLLRIFIKRMLEIQNSNHIDKTGQFE 169

Query: 113 LVEALVAHMSD-IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
           +++ L+ H  + +++        I   +N       I    + E   KL  N  TI   E
Sbjct: 170 IIDYLLNHKEERVNDNYKSFSMGICQQLNEDPSKAPI----VLEYLLKLEPNCITIPKCE 225

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
              +G  LYP++   NHSC PN  ++   +   +   + + +  E+ I+Y  +   +  R
Sbjct: 226 ASTIG--LYPLMVFFNHSCKPNLSILNNKKELKIICNRDINENEELFINYSPSICYSNER 283

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
           ++ LK+ + F C C  C+K    D++++S  L
Sbjct: 284 REMLKQCFFFDCKCELCLK----DELEKSMDL 311


>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 572

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 121/284 (42%), Gaps = 50/284 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GEV+I ++PY  +       + C  CF  S NL  CS C    +C   C  LD   H +E
Sbjct: 203 GEVLIVEKPYSSIVIPKHFYTHCSNCFRRSLNLIPCSNCCTGMFCREEC--LDIANHVIE 260

Query: 67  CQ---VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNV--IPSTTTDNYSLVEALVAHM 121
           C    +LS L+ ++++       L LK+ +    Q   +  + +TT ++ +  E      
Sbjct: 261 CSYLGILSELNADRKE------MLALKILIDASRQGTELEKLCTTTINHLNNEENNCQFY 314

Query: 122 SDIDEKQLL--------------LYAQIANLVNL-ILQWPEISINEIAEN---------- 156
             +D K +L               Y  I +++ + IL+      N I+EN          
Sbjct: 315 KSLDFKNILNLVTNSEKRNAADLFYRSIVSVILISILENNTNFFNYISENNNDKIKIFCG 374

Query: 157 ------FSKLACNAHTICN---SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 207
                    L CNAH I     S L  +G G Y  +S+INHSC PN V        ++RA
Sbjct: 375 GLILLFMQSLPCNAHEISEYYCSNLLEIGAGAYATLSLINHSCDPNVVRHSCRNTVILRA 434

Query: 208 VQHVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
           ++ + KG E+     Y         RQKAL +QY FTC C  CI
Sbjct: 435 IKPIKKGEELFDNYGYHYATHELQERQKALLKQYYFTCQCKACI 478


>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
 gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
          Length = 451

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/435 (22%), Positives = 181/435 (41%), Gaps = 51/435 (11%)

Query: 29  RCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSR-----LDKEKRKSVT 82
           RC  C +   NL +C  C+   +C   CQ+L W  H+ EC  L R      D E      
Sbjct: 27  RCWYCLSDEQNLLRCMRCRRALFCNEQCQRLGWWDHQAECGALKRAYPVVFDVE------ 80

Query: 83  PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL-------VEALVAHMSDIDE--KQLLLYA 133
             +RL+ ++   R +Q  +   +   +N+ L       +  L  H+  + E  +++  + 
Sbjct: 81  --VRLLGRI-AARHMQISSGKDAVDRENFYLNRQSERQLGDLWHHVGSMRENVEEMRKFR 137

Query: 134 QIANLVNLILQWPEISINE-IAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCL 191
           +I   +    +   +   + + +  S+   N H I +   L  +G GLY  +   +HSC 
Sbjct: 138 RIQASLERFYKAKYLPNEQTLFKLHSRNYINRHAIGDQLYLNEIGKGLYLDLCAYDHSCR 197

Query: 192 PNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           PNA+   EG +A ++A+    ++         YI    S   R++ LKE + F C C RC
Sbjct: 198 PNAIYTCEGFVARLKALDDSVNLSDRKRTHTCYITPLDSRAERRRLLKESWYFDCFCERC 257

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--------DDKGFTCQQC-GLVRSKEE 299
                        +L   +C  + CS  +   +        DD  FTC +C     S+ +
Sbjct: 258 TD-------PNDHLLTALKC--NNCSTAVPIHNADNDKDLIDDSFFTCPKCEQRCFSRPD 308

Query: 300 IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL 359
           +++   ++  + K+     S    +  + +  ++ K      + F   L+Q + +LI   
Sbjct: 309 VEQALFKMIDIEKQLAKTGSETLSKSELDSMLLVCKSVLADENVFFAKLLQRKLQLI--- 365

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
            +  D K+ L      +P  +  YP+ HP L   Y T G     L D + A++ + +A+ 
Sbjct: 366 -DCNDSKQLLETSLQALPCLRVCYPEMHPTLAYHYMTIGIYYKNLADRKTAMQYLNDALR 424

Query: 420 ILRITHGTNSPFMKE 434
             R T G+N    K+
Sbjct: 425 RFRFTLGSNHLLTKQ 439


>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 683

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 104/476 (21%), Positives = 180/476 (37%), Gaps = 55/476 (11%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWK-LH 63
           V GE+++ +E +V V         C  C +       C  C    YCGS C +  W+  H
Sbjct: 204 VPGEILLKEEAFVAVLCPGEEDIHCHHCLSPLVASVPCRGCSYAKYCGSACARAAWQSYH 263

Query: 64  RLECQVLSRL-------DKEKRKSVTPSIRLMLKLYLRRKLQNDNV-IPSTTTD-----N 110
           + EC     L           R         +       + Q   + IP    D     +
Sbjct: 264 QRECPFGGLLLAMGVFCHVALRTIFVAGFEEVTLFVENDEEQTTGISIPGCDADGRYRSS 323

Query: 111 YSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINEIAENFSK---------- 159
           Y  V  L+ H+     E + L    +A L  ++ +  E+ + +  EN ++          
Sbjct: 324 YQAVFGLLPHVEKHSPEFKFLCGFSVAALCRVMGK--ELFVGKDGENVAQDVLGEAVLRH 381

Query: 160 ---LACNAHTI---------CNSELR----PLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
              L CNA  +         C +  R     L T L+PV+S++NHSC PN  + F+GR A
Sbjct: 382 VLQLQCNAQAVTALRVSGERCEAVARQEEVTLATALFPVLSLLNHSCDPNTSVTFDGRTA 441

Query: 204 VVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 261
             RA + +P+G E+L  Y           R++ L  QY F C C  C      +    S 
Sbjct: 442 TARASRAIPRGQEILHCYGPHRCRMKPSERRQRLLAQYFFECRCSACTDETGPESASGST 501

Query: 262 ILEGYRCKD--DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 319
           +     C+    G   FL   S         C  +  KE  +    E+    +  + L +
Sbjct: 502 LFCCPTCRAPMQGEGLFLCCSSAS-------CKAMLPKEGFQHQLRELRQHMEAAMDLLA 554

Query: 320 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 379
            G     V   +      ++L  P  + + +  + L ++     +W EA A+ + +I V 
Sbjct: 555 QGKADRSVEQLRKCHLEAQQLLSPGHLVVGEIEDHLAQVHATQGNWPEAAAHLRRSIEVV 614

Query: 380 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           +  Y      +G + +   ++ +       A+++  +A  +L +  G   P ++EL
Sbjct: 615 EAHYGAGSIEVGQELFKLAQVLFNGRFVPEALQAAEKAEAVLSVQLGPRGPEVQEL 670


>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
 gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 593

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/230 (26%), Positives = 100/230 (43%), Gaps = 33/230 (14%)

Query: 22  NNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSV 81
           N + + SRC    A+     C  C+ V YC   C+ LD   H  +C+ +    +   K  
Sbjct: 116 NTACAYSRCRSKKATLT---CPGCKAVNYCDEKCENLDRSFHGQQCKDVQ--SQNSNKIF 170

Query: 82  TPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL 141
             ++R  ++L            P + T + S +E L +H  D+++         +   +L
Sbjct: 171 PTAVRATIQLISH---------PVSITKDSSFME-LSSHRDDLEKNN-------SKWDDL 213

Query: 142 ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 201
           +LQ   +S N                 N    P+G  L P+++  NH C PNA + F+G+
Sbjct: 214 LLQAHALSTNAF-----------RVESNVGNGPIGLCLDPLLARANHCCYPNAAITFDGK 262

Query: 202 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 251
            A +RA+  +  G ++ ISYI+       RQ AL+E + F C C RC   
Sbjct: 263 RATLRALFPIKNGEQIFISYIDETQRQEVRQAALEETWFFKCRCSRCTNF 312


>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 647

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/260 (27%), Positives = 117/260 (45%), Gaps = 50/260 (19%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVL---SRLDKE--------KRKSVTPSIRL 87
           +++C  C+V+ YCGSNCQ+LDW  H+ EC  L   ++L +E        K +S    I L
Sbjct: 119 IRRCGKCKVIAYCGSNCQQLDWASHKPECTALVNYAKLAEEAIKANQKSKSRSSPGGIGL 178

Query: 88  MLKLYLRRKLQNDNV-----IPSTTT------------------DNYSLVEALVAHMSDI 124
                L     +++      IPS T                   D ++ +  L  H+++ 
Sbjct: 179 KDSFGLGGDEPDNDPQTISRIPSATVRALGRLIWKKRKEEKLNPDWWTGLSELQHHLNEY 238

Query: 125 DEKQLLLYAQIA-NLVNLILQWPEISINEIAENF----SKLACNAHTICNSELRPLGTGL 179
           +  Q     Q++  L   +     +++ E A       S+   N+ T+ +  L  +G   
Sbjct: 239 NSTQKESLMQLSVTLSRYVGNQELLAVFESASALLPLCSRFIDNSFTLTSIILDQIGVVF 298

Query: 180 YPVISIINHSCLPNAVLVF----EGRLAV-------VRAVQHVPKGAEVLISYIETAGST 228
            P  + INHSC PNAV+VF    EG  +        V A++ +  G E++ SYI++AG+ 
Sbjct: 299 VPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVIAIKPIEPGEEIVTSYIDSAGTR 358

Query: 229 MTRQKALKEQYLFTCTCPRC 248
             R+  L ++Y F C C  C
Sbjct: 359 QERRNELVKRYKFVCDCQAC 378


>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
          Length = 750

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 113/489 (23%), Positives = 192/489 (39%), Gaps = 111/489 (22%)

Query: 29  RCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTP----- 83
           RC  CFA    ++         C  +CQK D+  H+LECQ + +  K+   ++TP     
Sbjct: 292 RCARCFA----QESDTGIKFGRCTRSCQKADFSQHKLECQYIMKRRKQF-GTITPESAEE 346

Query: 84  -SIRLMLKLYLRRKLQND---------NVIP--------------STTTDNYSLVEALVA 119
            +I L+L+ +   K   D         +  P              + ++D  S    L +
Sbjct: 347 DAIPLLLRTFCALKFAKDECETVHAETDHAPGQIVSCGREHFASLAVSSDQLSYPLTLDS 406

Query: 120 HMSDIDEKQLLLYA-------------QIANLVNLILQWPEISINE-IAENFSKLACNAH 165
               + ++ +L +A             +   LV  IL   E +++E I    +    N  
Sbjct: 407 PGMKLAKRLMLSFAAKGSTDTEEVSPQEKEGLVLRILGCKEKTLDEAIRHTLTAFQQNNF 466

Query: 166 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLAVVRAVQHVPKGAEVL 218
            + +S   P+G  +YP  +++NHSC PN +L ++        +L +V A + +  G E++
Sbjct: 467 GVTDSLYAPIGEAVYPHAALLNHSCSPNCILRYKIGLESSPPQLEIV-ACKDISSGEELV 525

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
            SY++    T TR+  L++ + F C C RC   GQ                   C+  L 
Sbjct: 526 HSYVDLVLPTETRRNRLRDTHGFICECKRCA--GQ-------------------CTVELP 564

Query: 279 RDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM---- 332
           RD        T ++ G + S    ++I  E  I  +   A      H ++V+T ++    
Sbjct: 565 RDRQSWMLWPTKERLGEIPSSTPTQQIDIEEAIGGRSGEA-----EHLQIVNTSRLLQDQ 619

Query: 333 -------------IEKLQKKL--------YHPFSVNLMQTREKLIKILMELEDWKEALAY 371
                        I+ LQ+ +        + PF   L   R      L+      E +  
Sbjct: 620 ATRAMVNGNNDLEIQLLQQAVGVFLLSGGFSPFHNELYTARCAYFSALLANGKIDEGIEQ 679

Query: 372 CQLTIPVYQRVYP--QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 429
           C+  +          + HPLLGLQ YT G L    G  + A  +   A EI+ ++HG   
Sbjct: 680 CEHIVSSLSVCLGNVKHHPLLGLQLYTLGDLCSGGGCGQKAANAYRWAYEIMSVSHGAQD 739

Query: 430 PFMKELILK 438
           P ++ LI K
Sbjct: 740 PMVQALIRK 748


>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 450

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/235 (28%), Positives = 110/235 (46%), Gaps = 32/235 (13%)

Query: 29  RCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKE---KRKSV 81
           RCD C     +  +LK+CS C   +YCG  CQ   W  HR  C+ + R       ++   
Sbjct: 53  RCDRCLRLESSGIDLKRCSGCASYYYCGPQCQTSQWSAHRSYCKNIQRFQASAEYQKMET 112

Query: 82  TPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL 141
              +  ML  +L  +L++++    T        + L   +S      LL    IAN   +
Sbjct: 113 HERMDSMLLTHLLAELRSNHTAECT--------QQLSTFLS------LLPRPDIANPPPI 158

Query: 142 -ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF- 198
             +     +I ++ + FS+ + N   + +S L  +  G++P+ S + NHSCLPNA   + 
Sbjct: 159 CAMNGRAGTIPDVNDLFSRFSNNNFAV-HSHLTTVAHGVFPLASRLFNHSCLPNAAARYI 217

Query: 199 -----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
                  R+ +V A+Q +  G E+ + Y++ A    +RQ+  +  Y FTCTCP C
Sbjct: 218 LSEDAAPRMEIV-ALQDIGAGEEICVPYLDPA-LLQSRQQGFQLTYGFTCTCPSC 270


>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 627

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 81/296 (27%), Positives = 127/296 (42%), Gaps = 61/296 (20%)

Query: 8   GEVIISQEPYVCVP-NNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHR 64
           G+V+   EPY+CVP  + S +   +GC      L +C  C + +YC  +C   D+K  H 
Sbjct: 253 GDVVAIDEPYICVPFPDESEVCNYNGCLKFDIALFRCPKCVMAYYCNKDCMNKDYKDGHH 312

Query: 65  LECQVLSRLDKEKRKSVTPSI----RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA---- 116
           L C ++  +       +TP I     L LK +L+  L+  +    +  DN+S  +     
Sbjct: 313 LVCPIICFI------KLTPGISKINELALKWFLKDYLKMGSKKYCSIVDNFSETKIDPIT 366

Query: 117 --------------LVAHMSDIDEKQLLLYAQIANLVNLI----LQWPEISINEIAE--- 155
                         L A+  D +E ++ +   +    N I    L +  +S  EI E   
Sbjct: 367 RGFDEIGQYKSDNFLTAYSLDSNENKMPI--DVLFFFNCIAVDMLHYLTLSGFEIPEQYM 424

Query: 156 ----------------NFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVL-- 196
                           N  KL  NA +I + +   PL   LYP IS+ NHSC  N     
Sbjct: 425 GSVGASLVRILTVLDLNARKLNINAPSISHQQFTFPLALTLYPTISLFNHSCDGNIKRSG 484

Query: 197 VFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           V   R+ V++AVQ +PKG ++  +Y  +       +RQ A  +++ F C C  CIK
Sbjct: 485 VISDRIRVMKAVQPIPKGTQLCCNYGIMFKEHDKESRQSACNDRFNFNCYCDPCIK 540


>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 404

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 87/193 (45%), Gaps = 12/193 (6%)

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNAHTI 167
           +L  AL  +   +   +L  Y Q+A +V   +  P  +     ++  +L     CNA  +
Sbjct: 192 ALTFALEPNTHALSPTRLAQYEQMAEVVAASISLPRGARKVTKDDVVRLLCIQQCNAFGL 251

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIE 223
            N E    G  LYP +S+ NHSC+PN   V +G    R+  ++ +  VP G E+ ISYI+
Sbjct: 252 SNGEGEMTGVALYPALSLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYID 311

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 283
              +   RQ  L+E Y F CTC RC      D    +A L    C    C G L+     
Sbjct: 312 LDLTRELRQDKLEESYAFRCTCARCRAPDADDSPAVAAHLRRITCGS--CGGSLIPAPPG 369

Query: 284 KGFT--CQQCGLV 294
           +  T  C  C LV
Sbjct: 370 QELTVACNSCPLV 382


>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
           africana]
          Length = 800

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 130/318 (40%), Gaps = 75/318 (23%)

Query: 146 PEISINEIA--ENFSKLACNAHTIC--------------NSELRPLGTGLYPVISIINHS 189
           PE++I  +    +  +L CNA  I               + E+R L TG++PV+S++NHS
Sbjct: 481 PELNIWGVVMLRHMLQLQCNAQAITTIQHTGSKESIITESREIR-LATGIFPVVSLLNHS 539

Query: 190 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPR 247
           C PN  + F   +A +RA Q + KG E+L  Y   E+  +   RQ+ L+ QY F CTCP 
Sbjct: 540 CRPNTSVSFTSTVATIRASQQIAKGQEILHCYGPHESRMAVAERQQKLRSQYFFECTCPA 599

Query: 248 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 307
           C K        E     G R                  F C  C  V   E++       
Sbjct: 600 CHK--------EPRATAGPRW---------------AAFCCHSCRAVMQGEDV------- 629

Query: 308 NILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 363
                      SCGN    E VS   ++ +LQ         +L Q      K+L   +LE
Sbjct: 630 ----------LSCGNRSCTESVSRSHLVSRLQ---------DLQQQVGMAQKLLRSGKLE 670

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
              + L  CQ     +    P  H ++G       +    LGD E +   + ++++++ +
Sbjct: 671 QAVQQLLGCQHAAESF---LPAEHTVVGEIDDHLAQAYAALGDWEKSATHLQKSLQVVEV 727

Query: 424 THGTNSPFMKELILKLEE 441
            HG +S  M   + KL +
Sbjct: 728 RHGPSSIEMGRELFKLAQ 745


>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
          Length = 479

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/231 (25%), Positives = 97/231 (41%), Gaps = 54/231 (23%)

Query: 30  CDGCFASS--------NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSV 81
           C  CF +         +LK C+ C VV YCG                             
Sbjct: 58  CSNCFVTVGDELNPDLSLKACAGCHVVKYCGR---------------------------- 89

Query: 82  TPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL 141
            P  +++ ++Y                 ++++   L  H S +DE+      +I      
Sbjct: 90  -PDNKVLKEIY---------------DTHHAVFRTLGHHFSRMDERGGEQAHRITVSAEA 133

Query: 142 I--LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE 199
           +  +   E+ ++ +   F+KL  NA T+ N     +G  L P  + INHSC PNA + F+
Sbjct: 134 LKAISNTEVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFD 193

Query: 200 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           G++  ++A+Q +    E+ ISY +      TRQ  L+ +Y F C CP+C+K
Sbjct: 194 GQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 244


>gi|145502993|ref|XP_001437474.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404624|emb|CAK70077.1| unnamed protein product [Paramecium tetraurelia]
          Length = 367

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/272 (27%), Positives = 119/272 (43%), Gaps = 40/272 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
            +V++  +P   V       + C+ C   S   +CS C+  +YC  +CQK DWK H+ EC
Sbjct: 19  DQVVMRCDPLAIVLFKEFRANHCNYCLQGSQTNRCSICKQYYYCSVSCQKNDWKQHKNEC 78

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEK 127
           Q+LS++ KE   ++   IRL          QN+  I +   D    V+       DID++
Sbjct: 79  QLLSKITKEMPCNILILIRL---------FQNNIDIQNFYGD----VD------KDIDQE 119

Query: 128 QLLLYAQIANLVNLILQWPEISINEIAENFS---KLACNAHTICNSELRPLGTGLYPVIS 184
               Y Q+ +    I++  ++     A+  S   K+  N  T+ +     LG  +Y   +
Sbjct: 120 ---TYQQVFDCAAYIVKIAQLENETFAKLLSIQVKIHLNTFTVTDINGDTLGIAIYTPAN 176

Query: 185 IINHSCL---------PNAVLVFEGRLAVVRA----VQHVPKGAEVLISYIETAGSTMTR 231
            +NH C+          N    F  R  ++      VQ+     E+ ISY         R
Sbjct: 177 FLNHKCIKTSANQKNVANCSHFFNQRQLIITTNNSFVQNENDPQELCISYGNIVNFK-DR 235

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
           QK LK QY F C C RCI+  Q +  +ES +L
Sbjct: 236 QKFLKAQYGFICDCDRCIQ-EQTNSEEESDLL 266


>gi|308468789|ref|XP_003096635.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
 gi|308241582|gb|EFO85534.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
          Length = 428

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 79/299 (26%), Positives = 140/299 (46%), Gaps = 37/299 (12%)

Query: 30  CDGCFASSNLKK------CSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVT 82
           C  CFA  ++        C  C  V YC   CQ+ DW+ +H+ EC++L R  K  +K +T
Sbjct: 23  CATCFAELDVNGETEILMCDDCSEVSYCSPKCQRQDWRSVHQFECEIL-RAQKANQKPMT 81

Query: 83  PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA----HMSDIDEKQLLLYAQIANL 138
            ++RL ++  L   L+N    PS    N +++E L      + S     Q L     +++
Sbjct: 82  TTMRLSIRTLLV-TLRNSERTPSF---NGAIIEDLETNYKEYRSSPSHNQFL-----SDM 132

Query: 139 VNLILQ-----WPE-ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 191
           V +I       +P+ +  N++      + CNA  I + + + P+G+GL+  ++  NHSC 
Sbjct: 133 VTIIKSVGHDVFPKSVETNKMIAIICTVLCNAFGIMDDKRVEPIGSGLFVGLAKHNHSCA 192

Query: 192 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 251
             + +VFE    ++R  +       + ISY+     T  RQK+++  +  TC C  C   
Sbjct: 193 STSHVVFEKNQVILRG-READYSKNITISYVSRMLPTSERQKSIRNVHFITCRCEMC--- 248

Query: 252 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 310
            + DD+    I    RC+   C G+ ++ S+  G  C++  +V   E     +  ++IL
Sbjct: 249 -RNDDLD--LIGLASRCQTINCLGY-VKGSNPCG-VCKKPAVVPIMESSSSTSKLIDIL 302


>gi|428180198|gb|EKX49066.1| hypothetical protein GUITHDRAFT_136248 [Guillardia theta CCMP2712]
          Length = 719

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 76/292 (26%), Positives = 119/292 (40%), Gaps = 60/292 (20%)

Query: 8   GEVIISQEP--YVCVPNNSSSISRCDGCFASSN---LKKCSACQVVWYCGSNCQKLDWKL 62
           G  ++ ++P  Y       S  + C    AS +   LK+C+ C+   Y  +  QK  W  
Sbjct: 59  GSQVLLEDPFAYALSSRFGSHENFCHHSLASQSRVRLKRCAGCRFARYASAEDQKKAWSK 118

Query: 63  HRLECQ------------------VLSRLDKEKRK--SVTPSIRLMLKLYLRRKLQNDNV 102
           HRLEC+                  V    D +K +  S T +   +L+L       + ++
Sbjct: 119 HRLECRRIKECIDHGYMPSSFLLTVARMFDAKKHRFNSSTATWEDILELDTNYDTWSTDL 178

Query: 103 IPSTTTDNYSLVEALVAHMSDIDEKQLL-----------LYAQIANLVNLILQWPEISIN 151
           + S     Y + +     M+D D   +            L  Q  N  N I      +  
Sbjct: 179 LASFAQITYVVRD----FMADGDVNNMFCPKEWNLEAARLLDQTFNATNGI-----ENPR 229

Query: 152 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           EI     +L     TIC+ E RP+GTGLYP+ S+ NH C PN ++ F G    +RA + +
Sbjct: 230 EILMTICRLYNYGFTICDEEERPIGTGLYPLASLCNHDCAPNCLVTFCGPSLQLRATEEI 289

Query: 212 PKG---------------AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
             G                E+ ISY+++A     R+K L ++YL  C+C  C
Sbjct: 290 EMGQVQMARVDWRLRSGLQEITISYVDSAEPHWLRRKTLHKRYLINCSCSLC 341



 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 80/174 (45%), Gaps = 20/174 (11%)

Query: 290 QCGLVR-----SKEEIKKIASEVNILSKKTLALT--SCGNHQEVVSTYKMIEKLQKKLYH 342
           Q G +R      + E+  +  E ++  K   A T  S     E++  + +  +  + + H
Sbjct: 536 QVGAIRKHMWQDENELIDLNKERSLQGKLHAAKTQPSADRANELLQMFILAVQKAESILH 595

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
             +++L++ RE +    +E + WK AL+     +    ++YP+  P +GLQ    GKL  
Sbjct: 596 DNNLSLLRLREAMQSQAVEAKSWKFALSTSYKVLRRLYKIYPRCSPYVGLQEAFLGKLCM 655

Query: 403 FL------------GDT-ENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 443
           FL            GD  + A+K + EA  IL  THG+    ++++   L+EA 
Sbjct: 656 FLSGEPDKTLCHGFGDPLQEAVKHLREAKLILSGTHGSEHVLVQDINRLLQEAH 709


>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
           CCMP2712]
          Length = 98

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 58/96 (60%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           IA   SK   N H I + + +  G GLYP+ S INHSC PNA++ F+G   VVRA++++P
Sbjct: 3   IASMISKAQRNMHGIVDLKGQNFGHGLYPLASFINHSCEPNAIISFDGNKLVVRALENIP 62

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           +G E+ I+Y+E       R+ AL  +  F C C RC
Sbjct: 63  RGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98


>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
          Length = 554

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 110/262 (41%), Gaps = 27/262 (10%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSN----LKKCSACQVVWYCGSNCQKLDW- 60
            +G+ +    P   V  NS  ++RC  C   ++    LK+CS+CQ+  +CG  C K  W 
Sbjct: 37  AAGDTLFETRPPFAVVGNSDILTRCFHCLKPTSPERELKRCSSCQLARFCGPACVKAGWT 96

Query: 61  -KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
            + H+ EC    R   +   +    + L     L R +      PS    +      L  
Sbjct: 97  PEGHKYECAAFKRCGFKDTPASLAKLPLTEARLLARMIWAQKHDPSQHRYD------LHG 150

Query: 120 HMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSEL 172
            +   D K+  L+A+     ++++   +   NE+  N        +K+  NAHT+C    
Sbjct: 151 DVDCFDNKEKALWARRIQEASVLVG--QSLFNEVLGNAKAALLQMAKVQMNAHTLCTPFG 208

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVF------EGRLAVVRAVQHVPKGAEVLISYIETAG 226
              G  L    ++INHSC PN   +       + +   V A + +  G E+ I+Y++   
Sbjct: 209 EACGVSLSSSFALINHSCAPNTFAMSSHWPDEKPKYLRVAACRPIKAGDEITIAYVDVEE 268

Query: 227 STMTRQKALKEQYLFTCTCPRC 248
             + R++ +K  Y F C C  C
Sbjct: 269 ENLQRRQTIKATYGFDCDCRLC 290


>gi|307207016|gb|EFN84839.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 638

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 130/294 (44%), Gaps = 53/294 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+++  ++PY  V       + C  C + S NL  CS C +  YC   C+KL W++ H  
Sbjct: 267 GDIVSIEDPYAHVIYEERYYTHCHYCLSRSYNLIPCSKCPIAQYCSEKCRKLAWEVCHMT 326

Query: 66  ECQVLSRL------DKEKRKSVTPSIRLMLKLYLR----RKLQNDNVIPSTTTDN----Y 111
           EC +L  L      DK+K + ++  IRL++ +        +LQ D  I  +  DN    +
Sbjct: 327 ECPILKLLTSLLNVDKDKIRMISKIIRLLIIVTANGSKIEELQQDMKIAESNPDNRTAGF 386

Query: 112 SLVEAL--------VAHMSDIDEKQLL---LYAQIANLVNLILQ---------WPEISIN 151
           +  E L        ++  +++  + L+    +A I+ L  ++L          +    + 
Sbjct: 387 TDTEILDSSSARSALSLATNMTTRPLIGISAFACISALAVILLATQTRFFSKTYKMDDLK 446

Query: 152 EIAENFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
           +I+E FS+L         AC     N  ++        G+GLY   S+ NHSC PN    
Sbjct: 447 DISE-FSELKFCASIMFRACVIMSSNCFSVQQEPGIVSGSGLYVAHSLYNHSCAPNTFRH 505

Query: 198 FEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI 249
           FEG   + RA+  +  G ++  SY  +        R++ + + Y F C CP C+
Sbjct: 506 FEGLTMITRALTPIRPGDQIFTSYGGVYAHMPKYERKQKILQDYFFDCDCPACV 559


>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
 gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
          Length = 400

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 66/247 (26%), Positives = 109/247 (44%), Gaps = 18/247 (7%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G +I+ +EPY     +   + +RC  C     N   C AC+   YC   C+K     H  
Sbjct: 31  GTLIVKEEPYSYTLTDGELLRTRCHYCLKRLENSVSCDACRTAKYCNEECKKAAKFHHTP 90

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLR---RKLQNDNVIP-STTTDNYSLVEALVAHM 121
           EC+  SRL      ++   +R+M ++  +   RK     + P S+   N   ++     +
Sbjct: 91  ECRGYSRL-----MNLPEHLRVMGRILYKMHARKTDMGALGPLSSLVSNVETLKNCEEGI 145

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
           + +D K   L   +          P+ +  E  E + K+A N+  I +  +  +G G+YP
Sbjct: 146 TSLDSKMECLSQHMEKDA-----LPDRAFME--EIYGKIASNSFAILDENMCSIGIGVYP 198

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
             S+INHSC  N + +F G    +RA + +  G ++   YI     T  RQ+ L + Y F
Sbjct: 199 QASMINHSCKSNCIGMFYGPQIQIRANEFIRPGEQIFHGYIPPLLPTAKRQEKLLKTYHF 258

Query: 242 TCTCPRC 248
            C C  C
Sbjct: 259 LCQCADC 265



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 52/104 (50%)

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
           H  +V +++  + L+++ +EL  W++A+ Y +   P Y+    ++    GL Y       
Sbjct: 281 HSDNVYVIRILKDLVEVCVELGLWEKAVVYGKRIEPGYKMFLKRYDLDQGLLYQKMALAY 340

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 445
           + LGD   A+  + +A   L IT G +S  ++E+   L E  A+
Sbjct: 341 YRLGDQSAALPYLRQAKTTLTITSGEDSGLVQEVSDMLAECVAK 384


>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 356

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 103/222 (46%), Gaps = 21/222 (9%)

Query: 51  CGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTD 109
           C     + DW + H+ +C+ L R    +   + P +  +++L L   +Q     P  T D
Sbjct: 64  CVEEINRGDWARCHKQDCKTLKR----RHPMIPPDLAEIVQL-LSHIIQKQRRSPPCTQD 118

Query: 110 NYSLVEALVAHMSDIDEKQLLLYAQIAN-----LVNLILQWPEISINEIA---ENFSKLA 161
           +       V  +    EK   L  QI        ++L+    +  + E++   + F    
Sbjct: 119 DEDCFPTTVDQLESHHEK---LSGQIRRDAFEIWLSLLKDCEDGVLPELSSWLKMFGATI 175

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           CN+ +IC+++L  +  G+Y   S++NHSC PN   V +GR   +  V+ V +G E  ISY
Sbjct: 176 CNSISICDNDLIDIAVGIYLRASMLNHSCDPNCAWVCDGRKLQIMTVKDVKEGDECTISY 235

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQE 259
           ++       RQ  LKE Y FTC C +CI+    LG  D + E
Sbjct: 236 VDAMKPAKVRQADLKESYHFTCKCVKCIEEINALGPDDGLGE 277



 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%)

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           + NHSC  N ++VF+ R   +R ++ V  G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 245 CPRCIK 250
           C +C++
Sbjct: 61  CVKCVE 66


>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
 gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 386

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 131/321 (40%), Gaps = 54/321 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--SNLK---KCSACQVVWYCGSNCQKLDWKL 62
           GE ++  + Y  V + +   + C  C     S +K   KC+ C  +WYC   C+  +   
Sbjct: 29  GEDLLKCKSYFAVTSETLKTTSCFNCIKQLPSVIKLSLKCNQCNEIWYCNEQCKNENINK 88

Query: 63  HR-LECQVLSRLDKEKRKSVTPS--------IRLMLKL---YLRRKLQNDNVI------- 103
           H+  EC+   +L   K K V P+        IR+++ L   Y +  L N+  I       
Sbjct: 89  HQHYECKFYKKLKSPKLK-VYPNFDIETFTEIRMIVGLLSRYYQDILLNNKFIEQQLNNN 147

Query: 104 ----------PSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLILQWPEIS-- 149
                      +T  D + LVE  V   S+   K+ +  +   I+ L NL+L     +  
Sbjct: 148 NNNNNDNEQLTNTLDDVFDLVENQVTEESNPAAKERIDSIVEFISELFNLVLLGSTTTKS 207

Query: 150 -------------INEIAENF-SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 195
                        INE + +   K  CN   I     + +G  + P  S  NHSC+PN  
Sbjct: 208 IINNDDKIEMIRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVAVSPSSSYFNHSCIPNCT 267

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
            V +G     +++  + KG ++ ISYIE       R+  LK  Y F C CPRC       
Sbjct: 268 DVRDGSNMTFKSLYPIKKGDQLTISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSI 327

Query: 256 DIQESAILEGYRCKDDGCSGF 276
           D  ++ I + Y C    C+G 
Sbjct: 328 DSMDNWISKFY-CSQKKCTGL 347


>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 676

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 90/383 (23%), Positives = 157/383 (40%), Gaps = 55/383 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK---KCSACQVVWYCGSNCQKLDWKLHR 64
           GE ++ + P+  V      ++ C  CF    +     C  C  V YC   C+  D ++H+
Sbjct: 263 GETLVIERPHCAVLLAEYRLTNCHHCFKKIFVPIPTSCDTCNFVAYCSILCRNKDAEIHK 322

Query: 65  LECQVLSRL-DKEKRKSVTPSIRLMLK------LYLRRKLQNDNVIPSTTT------DNY 111
            EC +L  L   E   +   +++ +++      L L+ KL+       T+T      D++
Sbjct: 323 NECMILPSLWFSETSVNCFLALKAIVQKPFEELLALKDKLKATKGRFETSTQRPRRHDDF 382

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIAN-LVNLILQWP-------------------EISIN 151
             +  L+ H  +   + L     IA  L+ L+ + P                   E+ I 
Sbjct: 383 EAIYGLITHEDERTSEDLFHRTYIAIWLLRLLKRSPYFPEWVKTPDSAEATPSDGELYIG 442

Query: 152 E-IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
             I  N   +  NAH I              ++ + +G G+Y  +S+ NHSC P  +  F
Sbjct: 443 SLILHNLMLIQFNAHEISELAVPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYF 502

Query: 199 EGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
            G   VVRA++ +P G E+  +Y  I T      R++ L+ QY F C C  C       +
Sbjct: 503 IGTTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLE 562

Query: 257 IQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 313
             +  IL  ++C+     G +L    D+++    C +CG   +  +  K   + + + + 
Sbjct: 563 EIDPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRT 621

Query: 314 TLALTSCGNHQEVVSTYKMIEKL 336
                  G HQE + +Y  I KL
Sbjct: 622 ASRNLEEGKHQEALKSYLEILKL 644


>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
          Length = 435

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 93/203 (45%), Gaps = 14/203 (6%)

Query: 52  GSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTD 109
           G  CQ+ DWK  H  EC +   L+      + PS  R +L++ +R            T  
Sbjct: 22  GKECQRKDWKFAHAHECAIFKNLNPR----ILPSNARALLRMIVR------TAHKKYTNG 71

Query: 110 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTI 167
              L   L  H+S+I ++    + +IA     +  +    + E  I+   +KL  N+   
Sbjct: 72  ELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNSFNF 131

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 227
            N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++ ISYI+    
Sbjct: 132 TNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDATNP 191

Query: 228 TMTRQKALKEQYLFTCTCPRCIK 250
              R+  L+E+Y F C C +C K
Sbjct: 192 VKLRRSELRERYYFDCHCAKCAK 214


>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
 gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
          Length = 462

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLK 301

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCARC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 496

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 115/250 (46%), Gaps = 23/250 (9%)

Query: 40  KKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRR---- 95
           + C  C    YC   C   D + H LEC +L  L++  +   T S+ L+ +  L R    
Sbjct: 95  RGCERCGKQSYCDKFCSTEDSQ-HLLECDLLDALERRVKAKPTRSVLLLARCLLARQSIL 153

Query: 96  KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS------ 149
           K  +DN       +N + ++ L+ H     ++       ++ +    L    I+      
Sbjct: 154 KRSDDNDDDDDEGNNANFIDYLINHKDKYSKEDKESRVGLSTMAGSFLTKKPINQVLQSE 213

Query: 150 INEIAENFSKLACNAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 207
           I+ + +    + CNA TI  C S +  LGTGLYP ++ INHSC PN  +   G++  ++A
Sbjct: 214 IDVLYDALCSINCNAFTIIDCVSGVL-LGTGLYPGLTFINHSCDPNLQVTQIGKILTLKA 272

Query: 208 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC--IKLGQFDDIQESAILEG 265
           V+ + KG E  ISYI+     + R + L E + F C C +C  ++  + +D         
Sbjct: 273 VRPIKKGEEFTISYIDRTEGAIQRNEELMETFFFECQCHKCASVRSEKRNDFY------- 325

Query: 266 YRCKDDGCSG 275
           + CKD  C G
Sbjct: 326 FVCKDPSCKG 335


>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
 gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
          Length = 766

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   F+ GR+AVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 546 GLFPLTAIMNHECTPNASHYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 605

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C CPRC      D+ +    L    C++ GC G ++
Sbjct: 606 MTKNFMCDCPRC-----NDNTENGTYLSALFCREQGCKGLVI 642



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281

Query: 237 EQYLFTCTCPRC 248
               F C CPRC
Sbjct: 282 MTKHFICQCPRC 293


>gi|409078830|gb|EKM79192.1| hypothetical protein AGABI1DRAFT_120631 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 443

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/271 (25%), Positives = 121/271 (44%), Gaps = 37/271 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA----SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +G V++S   +      +    RCD CF        LKKC+ C   +YC + CQK+ W L
Sbjct: 28  AGSVVVSVPAFTSALLEAEKGQRCDTCFRLPPDGEKLKKCTGCASYFYCDTRCQKVQWDL 87

Query: 63  -HRLECQVLS---------RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
            H+  C+  S         +  + +R  V     ++L  +L R L+   V  S+  +++ 
Sbjct: 88  EHKRVCKSYSLMISYLHGQQFSQHERMDV-----ILLSHFLGRTLKTKPV--SSADNSHD 140

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
                ++ +      Q  L +   +L+        +  + + + +S+   N  TI +S L
Sbjct: 141 PFSIFLSLLPGSHASQRTLDSIPKSLI--------LDDSLVQDIYSRFGNNNFTI-HSHL 191

Query: 173 RPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAEVLISYIETAG 226
             +G G++P+ S + NHSC+PNA   +     +  L  V A++    G E+ I Y++ A 
Sbjct: 192 NSIGHGVFPLASRLFNHSCIPNAAPRYVQGPAQPVLMEVVALRDTDVGEEICIPYLDPA- 250

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
            T +R +  +  Y FTC C  C  + Q   I
Sbjct: 251 LTQSRSQIFQYTYGFTCQCSSCRDISQLGQI 281


>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
 gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
          Length = 660

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/387 (23%), Positives = 160/387 (41%), Gaps = 60/387 (15%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHR 64
           +GE ++ + P+V V     + + C+ CF  S +   CS C  V YC   C ++   K H+
Sbjct: 261 AGEELLVERPFVAVLLEKFAKTHCENCFVRSGVPVACSRCADVIYCSEQCREEAANKFHK 320

Query: 65  LECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYS 112
            EC ++  L +    S+   I L             LK  +  +L  + +I S   D++ 
Sbjct: 321 YECGIVPILWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEELSPEKLI-SLPKDDFR 378

Query: 113 LVEALVAHMSDIDEK---QLLLYAQ-----------------------IANLVNLILQWP 146
            V  L  H  +       Q +L A+                       IA+LV   LQ+ 
Sbjct: 379 RVAQLERHQGERQPSNFFQHVLMARFLTRCLKAGGYFGPEPKQDQIQVIASLVLRSLQFI 438

Query: 147 EISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
           + + +E+AE   FS                +G  +YP +++ NHSC P  V  F G    
Sbjct: 439 QFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDPGVVRYFRGNTIH 491

Query: 205 VRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESA 261
           + +V+ +  G  +  +Y  I T      RQ  LKE Y F C+C  CI    +FDD+    
Sbjct: 492 INSVRPIEAGLPINENYGPIYTQDERSERQARLKELYWFECSCDACIDNWPKFDDLPRDI 551

Query: 262 ILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 318
           I   +RC+  + C+  +      +D    C  CG + +  +  K+  +  ++++    L 
Sbjct: 552 I--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLY 609

Query: 319 SCGNHQEVVSTY-KMIEKLQKKLYHPF 344
             G + + ++ +  +I  + + L  PF
Sbjct: 610 ETGEYSKALTKFIDLIRIMYEVLAPPF 636


>gi|341876422|gb|EGT32357.1| CBN-SET-10 protein [Caenorhabditis brenneri]
          Length = 428

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 127/280 (45%), Gaps = 42/280 (15%)

Query: 30  CDGCFASSNLKK-----CSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTP 83
           C  CFA  ++       C  C  V YC   CQ+ DWK +H+ EC++L    + +   +T 
Sbjct: 23  CATCFAEIDVDSEECLSCDDCSEVTYCSLKCQRKDWKAVHQFECEIL----RSQHTPMTV 78

Query: 84  SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI----DEKQLLLYAQIANLV 139
           ++RL +++ L    +N      T++ N +++E L  +  +        Q L     ++++
Sbjct: 79  TMRLCIRILL-TTFKNGT---QTSSFNGAVIEDLETNYKEFRSSPKHNQFL-----SDVL 129

Query: 140 NLILQWPEISINEIAENFSKLACNAHTICNS-----ELR--PLGTGLYPVISIINHSCLP 192
            +I            EN   +A     +CNS     E R  P+G+G+Y  ++  NHSC  
Sbjct: 130 TIIKSSGHTIFPASIENNRMIAMICSVLCNSFGIIDEKRGEPIGSGMYIGLAKHNHSCAS 189

Query: 193 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG 252
            + +VF+    ++RA +         ISY+     T  RQK+++  +  TC C  C    
Sbjct: 190 TSHVVFDKNQVLLRARKE-EYCRNTTISYVSRMLPTAERQKSIRSVHFITCRCEMC---- 244

Query: 253 QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           Q +D+    I    RC+ DGC+GF+   S     +C  CG
Sbjct: 245 QNEDLD--LIGLASRCQTDGCNGFVKGSS-----SCGSCG 277


>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
           Full=ASH1-related protein 2; AltName: Full=Protein SET
           DOMAIN GROUP 39
 gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 398

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 120/272 (44%), Gaps = 38/272 (13%)

Query: 7   SGEVIISQEP---YVCVPNNSSSISR-CDGCF---ASSNLKKCSACQVVWYCGSNC--QK 57
           +G+VI+ + P   Y   P  SSS+S  CD CF   ASS  +KC +C +V +C  NC    
Sbjct: 33  AGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAHQKCQSCSLVSFCSPNCFASH 92

Query: 58  LDWKLHRLECQVLSRLDKEKRKSVT--PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
             W      C+ L RL +    + +  PS R +   +L   L   N+  ++ +D   L+ 
Sbjct: 93  TPWL-----CESLRRLHQSSSSAFSDQPSDRQVQARFL---LSAYNLAAASPSDFQILLS 144

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQ-----WP-EISINEIAENFSKLACNAH---- 165
              +  S+ D       +  A  ++ +L       P  IS +  A   SK   NA     
Sbjct: 145 LQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDLTAALLSKDKVNAFGLME 204

Query: 166 --TICNSELRPLGTGLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAE 216
             ++ N +      G+YP  S  NH CLPNA          +G   ++ R +  VP+G E
Sbjct: 205 PCSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGRE 264

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V +SY     +  +RQK L E Y F C C RC
Sbjct: 265 VCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRC 296


>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Metaseiulus occidentalis]
          Length = 775

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 119/508 (23%), Positives = 186/508 (36%), Gaps = 94/508 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+ +  + PY  V   + + + C  C+        C  C  V YC  +C    W L H  
Sbjct: 269 GDNLFVESPYASVLLPAFTKTHCHHCYRRIKAAFPCRQCAQVRYCSMSCSGESWNLYHSQ 328

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  L  L       +   + L+  L L +         +T   +Y  V  LV H  D+ 
Sbjct: 329 ECGNLDLLISVGIAHLAERVILVTGLGLIKDFMKSC---NTLECSYLPVYQLVTHEEDMH 385

Query: 126 EKQLLLYAQIANLVNLILQWPEISINE--IAENFS------------------------- 158
              L  Y+  A L+   L+       E  ++E F                          
Sbjct: 386 IADLFQYSFTATLLLKYLERQTTFFTESHLSERFQDLNLRHAERAQPKGQIRIECSRPLK 445

Query: 159 ------------KLACNAHTICN-----------------SELRPLGTGLYPVISIINHS 189
                       +L CNAH I                    E   + T +YP  S++NHS
Sbjct: 446 LFVGGLLLRHIQQLICNAHAITTIQQPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHS 505

Query: 190 CLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTC 245
           C PN +  F  G   VV++V+ +  G EV   Y       MT   RQ AL+EQY F C C
Sbjct: 506 CNPNIISGFRSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQTALQEQYFFRCDC 564

Query: 246 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 305
             C K G  DD    A+    RC  + C G L          C QC   +   E ++   
Sbjct: 565 TACQK-GDLDDQISMAL----RC--EYCEGPLSAVQSSGKADCLQCSTTQDCLEKEQKVF 617

Query: 306 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 365
            ++ L  + L L    +H E +   +     Q+KL +  +  LM TR+ + K L  L  +
Sbjct: 618 RMHDLFVQGLQLAEMDSHGEALERLQKCLASQEKLLYRHNKQLMITRDMVAKSLCALGRF 677

Query: 366 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG------------------KLEWFLGDT 407
           ++A+   +  +   + +Y +    LG +    G                  +L   +GDT
Sbjct: 678 RDAVGVLKSAVESVRHMYGKNSIELGNELLKFGDVLMNATEESFAKSGYSRELGTLIGDT 737

Query: 408 ENAIKSMTEAVEILRITHGTNSPFMKEL 435
           +   + + +   I  + +G + P + EL
Sbjct: 738 K---RVLLQTEPIFLLHYGKSHPAIAEL 762


>gi|391334364|ref|XP_003741575.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Metaseiulus occidentalis]
          Length = 434

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/249 (24%), Positives = 119/249 (47%), Gaps = 12/249 (4%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQ 68
           +VI  + PY  V ++      C  C     ++ C  C +V+YC + C+  D + H +EC 
Sbjct: 8   DVIHRESPYCVVVDDGVLDRICSYCLYWKEVEPCE-CLLVFYCSNFCKLQDARDHEIEC- 65

Query: 69  VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL---VEALVAHMSDID 125
           +LSR  + +   +   +RL+++  L +  + + +   +  + Y     +  L  H+ D+ 
Sbjct: 66  LLSRT-RGRESLLDDEVRLVVRA-LAKNFREERIGSVSEGNFYGSRRSISDLETHLEDLS 123

Query: 126 EKQLLLYAQIA-NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           E   L   + A  + +L+  +    ++E+A    +L  N   + +      G   Y  IS
Sbjct: 124 EDDRLEVQEKAQRIASLMRAFTNTEVHEVALMLQRLRINMFQLSDHRAVTKGIACYLGIS 183

Query: 185 IINHSC--LPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           +++H+C    N VL F+GR  ++RA+++  V    +  I+YI+    T  R++AL   Y 
Sbjct: 184 VVDHTCEDSGNFVLAFKGREIILRALKNFTVQDMGDFRINYIDATIPTEERRRALLAGYF 243

Query: 241 FTCTCPRCI 249
           F C C +C+
Sbjct: 244 FACQCAKCL 252


>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
          Length = 415

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 99/202 (49%), Gaps = 12/202 (5%)

Query: 50  YCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT 108
           + G  CQ  +WKL H LEC +   L   K + +  + R +L++ LR + Q        ++
Sbjct: 45  HLGETCQAKNWKLGHSLECSIYQNL---KPRVLPINARAVLRMVLRSERQK------YSS 95

Query: 109 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHT 166
           +       L  H+ DI ++    + +I+     I  +    ++E  I+   +KL  N+  
Sbjct: 96  EELDQFLQLGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFN 155

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 226
           + N+    LG  L+P  +I NHSC  NA + F+G    ++A++ + K  ++ I+YI+   
Sbjct: 156 LTNAVYDRLGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTD 215

Query: 227 STMTRQKALKEQYLFTCTCPRC 248
               RQ  L+ +Y FTC C +C
Sbjct: 216 PYPIRQANLQSRYYFTCHCSKC 237


>gi|170595415|ref|XP_001902374.1| MYND finger family protein [Brugia malayi]
 gi|158589993|gb|EDP28779.1| MYND finger family protein [Brugia malayi]
          Length = 464

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 71/250 (28%), Positives = 110/250 (44%), Gaps = 22/250 (8%)

Query: 28  SRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIR 86
           SRC  C A    LK+C AC+   +C   CQ L WK HR EC+     D          +R
Sbjct: 84  SRCWYCLAKVDTLKRCHACRKGMFCNEKCQILGWKDHRSECKAFKSHD----AIANIEVR 139

Query: 87  LMLKLYLRRKL------QNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQI-AN 137
           L+ ++  R K       + DN      T   S+++ + +H   I  D   +  +  I A+
Sbjct: 140 LLGRIVTRYKAIKLGKDKEDNNFYKDRTSKRSIMD-IWSHTDLIKKDPAAMTKFNGIYAD 198

Query: 138 LVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNAVL 196
           L+        +S +E+ E   +   N H I +   +  +G GLY  +   +HSC PNA+ 
Sbjct: 199 LLAFYGSKAMVSRDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNAIY 258

Query: 197 VFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 253
                +A +R +     +   +    SYI+   +T  R+K LK+ + F C C RC     
Sbjct: 259 TCNSFVATLRGLTANVDLRNLSSTYYSYIDLINTTQQRRKLLKDTWYFECHCTRC---DD 315

Query: 254 FDDIQESAIL 263
            DD   S+IL
Sbjct: 316 PDDALLSSIL 325


>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
           SS1]
          Length = 531

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 26/273 (9%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF------EGRL 202
           SI ++    S+   N+ ++   +L  +G  + P+ ++I+HSC+PNAV+VF       G L
Sbjct: 182 SIRDLTNLLSRFQTNSFSLTTPDLTNVGVAISPLAALISHSCMPNAVVVFPTGLGRRGGL 241

Query: 203 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 262
            V+ A++ +  G EVL SY++ A     R K LK++YLF C C  C K    + I     
Sbjct: 242 EVI-ALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLFDCECVLCEKHHDHEWIDPR-- 298

Query: 263 LEGYRCKDDGCSGFLLRDSD--DKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTL 315
            E  RC   GC G +   +       TC  CG        S +E  K+A +V   ++   
Sbjct: 299 -EALRCSKKGCKGKMGTPTSLGTTEITCSICGTKLSVSTASLQEHVKLAEDVLHKAEAIQ 357

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED--------WKE 367
              S      +VS  +  +      Y P    L   ++ LI  + +L           ++
Sbjct: 358 FSDSDKARYLLVSILRDFDPFALSAY-PLLQILQLLQQLLITAVFDLPSTGTERDGLLQD 416

Query: 368 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
           A +   L +   + VYP  HP  G+QY T  KL
Sbjct: 417 AASCSALVLSGMEDVYPYGHPSKGIQYVTSAKL 449


>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
          Length = 704

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 127/295 (43%), Gaps = 53/295 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVWYCGSNCQKLDW-K 61
           G V+I   P+    + ++    C  C AS  L +     C  CQ V +C  +C++  W K
Sbjct: 299 GTVLIVDRPFASTTDVAALCRNCLCCHASLKLPESIRVPCEHCQAVQFCSEDCRRRAWNK 358

Query: 62  LHRLECQVLSRLDKEK----------RKSVTPSIRLMLKLYLRRKL----QNDN------ 101
            H+ EC++      ++          R  V  ++R   +  L ++     +ND       
Sbjct: 359 FHKYECRIFDYFYAKQAQKTNCLLAYRAVVAAALRNEKEFSLDQEFLKLHENDKDSLDIA 418

Query: 102 VIPSTTTDNYSLVEALVAHMSDIDEKQLL------LYAQIANLVNLILQWPEISINE--- 152
            + S  + +Y  V +L +H +D D    L      ++   + + +L    PEI  +E   
Sbjct: 419 SLASYESRDYKTVFSLESHCADADPADNLQRAIHSVFFAKSLIYSLNYFHPEIDNHERHL 478

Query: 153 ------IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL- 196
                 +  N   + CNA+ I  +         E R +G  +Y  +S+ NHSC PN V  
Sbjct: 479 HILAVALLHNMQAIKCNAYEIVENVRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRH 538

Query: 197 VFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCI 249
            F     VV +++++P+G+E+L  Y +        +R++ L E+Y F C C  C+
Sbjct: 539 SFPNGTVVVTSLRYIPEGSEILDCYGQHFLENKRDSRRRLLAEKYYFDCQCEPCL 593


>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 341

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 120/272 (44%), Gaps = 38/272 (13%)

Query: 7   SGEVIISQEP---YVCVPNNSSSISR-CDGCF---ASSNLKKCSACQVVWYCGSNC--QK 57
           +G+VI+ + P   Y   P  SSS+S  CD CF   ASS  +KC +C +V +C  NC    
Sbjct: 33  AGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAHQKCQSCSLVSFCSPNCFASH 92

Query: 58  LDWKLHRLECQVLSRLDKEKRKSVT--PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
             W      C+ L RL +    + +  PS R +   +L   L   N+  ++ +D   L+ 
Sbjct: 93  TPWL-----CESLRRLHQSSSSAFSDQPSDRQVQARFL---LSAYNLAAASPSDFQILLS 144

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQ-----WP-EISINEIAENFSKLACNAH---- 165
              +  S+ D       +  A  ++ +L       P  IS +  A   SK   NA     
Sbjct: 145 LQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDLTAALLSKDKVNAFGLME 204

Query: 166 --TICNSELRPLGTGLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAE 216
             ++ N +      G+YP  S  NH CLPNA          +G   ++ R +  VP+G E
Sbjct: 205 PCSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGRE 264

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V +SY     +  +RQK L E Y F C C RC
Sbjct: 265 VCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRC 296


>gi|307171148|gb|EFN63141.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 641

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 123/293 (41%), Gaps = 53/293 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G++I  ++PY  V       + C  C + S NL  C+ C +  YC   C+KL W+  H+ 
Sbjct: 270 GDIISIEDPYAHVIYEERYYTHCHYCLSKSYNLIPCAKCPIAQYCSEKCRKLGWEACHKT 329

Query: 66  ECQVLSRL------DKEKRKSVTPSIRLMLKLYLR----RKLQNDNVIPSTTTDNYSLVE 115
           EC +L  L      DK+K + ++  IRL++ +       ++L+ D  I  +  DN +   
Sbjct: 330 ECPILKLLTNLLNVDKDKIRMISKIIRLLIVVTENGIKIKELEQDMKIAESNPDNRTAGF 389

Query: 116 ALVAHMSDIDEKQLL---------------LYAQIANLVNLIL---------QWPEISIN 151
              A +     +  L                +A I+ L  ++L         ++    + 
Sbjct: 390 TDTAILDSFSARSALSLATNMTTRPLIGISAFACISALAVILLATQTKFFSKKYEIDDLK 449

Query: 152 EIAENFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
           +I+E FS+L         AC     N  +I        G+GLY   S+ NHSC PN    
Sbjct: 450 DISE-FSELKFCASIMFRACVIMSSNCFSIQQEPGIVSGSGLYIAHSLYNHSCAPNTFRH 508

Query: 198 FEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
           FEG   + RA+  +  G ++  SY  +        R++ + + Y   C CP C
Sbjct: 509 FEGLTMITRALTPIRVGDQIFTSYGGVYAHMPRFERKQKILQDYFLDCDCPAC 561


>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
 gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
          Length = 460

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 240 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 299

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 300 MTKYFVCNCDRC-----NDNSENGTYLSALFCREQGCKGLVI 336


>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
 gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
          Length = 600

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 380 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 439

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 440 MTKYFVCNCDRCN-----DNSENGTYLSALFCREQGCKGLVI 476



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 210 VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKM 269

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDI 257
            L     F C CPRC    Q D +
Sbjct: 270 FLGMTKHFMCQCPRCQDPTQTDAV 293


>gi|332025618|gb|EGI65780.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 630

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/293 (26%), Positives = 131/293 (44%), Gaps = 53/293 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G++I  ++PY  V       + C  C A S NL  CS C +  YC  NC+KL W+  H  
Sbjct: 259 GDIISIEDPYAHVIYEERYYTHCHYCLARSYNLIPCSECPIAQYCSENCRKLAWEACHET 318

Query: 66  ECQVLSRL------DKEKRKSVTPSIRLMLKLYLR----RKLQNDNVIPSTTTDNYS--- 112
           EC +L  L      DK+K + ++  IRL++ +       ++LQ D     + +DN +   
Sbjct: 319 ECPILKLLTNLLNVDKDKIRMISKIIRLLIVVTENGTRIKELQQDMKTAESNSDNRTAGF 378

Query: 113 ----LVEALVAH-----MSDIDEKQLL---LYAQIANLVNLILQ-----WP---EI-SIN 151
               ++++  A       +++  + L+    +A I+ L  ++L      +P   EI  + 
Sbjct: 379 TDTGILDSFSARSALSLATNMTTRPLIGISAFACISALAVILLATQTKFFPKKYEIDDLK 438

Query: 152 EIAENFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
           +I+E FS+L         AC     N  +I        G+GLY   S+ NHSC PN    
Sbjct: 439 DISE-FSELKFCASIMFRACVIMSSNCFSIQQEPGIVSGSGLYVAHSLYNHSCAPNTFRH 497

Query: 198 FEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
           FEG   + RA+  +  G ++  SY  +        R++ + + Y   C C  C
Sbjct: 498 FEGLTMITRALTPIHPGDQIFTSYGGVYAHMPRSERKQKILQDYFLDCDCSAC 550


>gi|391337345|ref|XP_003743030.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Metaseiulus occidentalis]
          Length = 437

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 116/259 (44%), Gaps = 24/259 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK------KCSACQVVWYCGSNCQKLDWK 61
           GEVI+   PY      S   S C  CF +          +C +C+V  YC   C +LD  
Sbjct: 7   GEVILFSRPYAFALYESFVDSTCSFCFRTQGDGHDFAPIECHSCKVYKYCSDECLRLDQM 66

Query: 62  LHRLECQVLSRLDKEKRKSVTPSI--RLMLKLYLR--RKLQNDNVIPSTTTDNYSLVEAL 117
            H LECQV  R  +E      PSI  R++ +  LR  R+ Q           +   + A 
Sbjct: 67  DHWLECQVAKRKLRE-----FPSIEARMIYRACLRYMREKQTQQESAEYIFGHKRSIHAY 121

Query: 118 VAHMSDID-EKQLLLYAQIANLVNLI--LQWPEISINEIAENFSKLACNAHTICNSELRP 174
           + H+  I  E++   Y  + + ++L+  +  P+  +  +  N  ++A N+H +   +   
Sbjct: 122 LDHVDKISTERREAFYEIVDDALDLVSSIYAPDRELMFLMLN--RIAINSHDVICPKNWS 179

Query: 175 LGTGLYPVISIINHSCLPNAVLV--FEGRLAVVRAVQHVPKGA--EVLISYIETAGSTMT 230
           +G  +Y   S +NHSC  +   V  F GR   + A++ V   +  ++ + Y+        
Sbjct: 180 MGRAIYMAGSKVNHSCEIHDRFVQQFFGRNYAIMALKTVEINSVDDLSVPYLPLMMDVHA 239

Query: 231 RQKALKEQYLFTCTCPRCI 249
           RQK LK+ Y F C C RC+
Sbjct: 240 RQKHLKDNYYFDCKCSRCL 258


>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
 gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
          Length = 611

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +   S +TR   LK
Sbjct: 391 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARAIPKGGEITTTYTKILWSNLTRGIFLK 450

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 451 MTKYFVCNCDRCN-----DNSENGTYLAALFCREQGCKGLVI 487



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
           +   L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+ 
Sbjct: 222 VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKM 281

Query: 234 ALKEQYLFTCTCPRC 248
            L     F C CPRC
Sbjct: 282 FLGMTKHFMCRCPRC 296


>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 480

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 110/247 (44%), Gaps = 42/247 (17%)

Query: 29  RCDGCFA----SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE-CQVLSRL---------- 73
           RCD C A    S  L KCS C   WYCG  CQK  W  H  + C+  +            
Sbjct: 58  RCDFCLALPSKSRKLGKCSGCAAYWYCGPECQKEAWSSHHKKICKTFATYEASPEYSCLA 117

Query: 74  DKEKRKSVTPSIRLMLKLYLRRKLQNDNV---IPSTTTDNYSLVE--ALVAHMSDIDEKQ 128
           D +K  +V  S  L+ ++ L+     D+V   +PS +++ +      +L+ H +      
Sbjct: 118 DDQKADAVLLS-HLIAEIALKSLSSVDSVTDDVPSLSSEPFMFGTFLSLLGHDNHPAPPP 176

Query: 129 LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IIN 187
                        +   P  +    +E +S+   N + + +S L P+  G++PV S + N
Sbjct: 177 -------------VACRPGHNSPSPSELYSRFGRN-NFVVHSHLSPIAHGIFPVASRLFN 222

Query: 188 HSCLPNAVLVF------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           HSC PNA   +       G   VV A++++    E+ I Y++ A   + RQ +L++ Y F
Sbjct: 223 HSCTPNAAAKYIFGRDKPGVRMVVVALRNISHNEEITIPYLDPALPFLERQLSLEQNYGF 282

Query: 242 TCTCPRC 248
            CTC  C
Sbjct: 283 RCTCHVC 289


>gi|330846003|ref|XP_003294847.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
 gi|325074603|gb|EGC28624.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
          Length = 428

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 122/287 (42%), Gaps = 51/287 (17%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKK---------CSACQVVWYCGSNCQKLDW 60
           VI+   PY    ++++    C  CF    L +         C  C  V YC  +C+ +D+
Sbjct: 92  VILRDLPYTWAVDHAACDIVCQHCFLEVPLNQQILPTDFYMCEGCNRVGYCSIHCKYIDY 151

Query: 61  KLHRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQN-------------------- 99
             HR ECQ+   LD E+      S I+L+++   R+ L+                     
Sbjct: 152 NQHRFECQIFKELDTEEYSPFLLSEIKLLVRTLSRKWLEESINSSAGVDESEISKINTYN 211

Query: 100 -----DNVIPSTTTDNYSLVEALVAHMSDIDE--KQLLLYAQIANLVNLILQWP-EISIN 151
                 ++IP      Y+    LV+++ + +E  K+ L Y     +V L  +   E   +
Sbjct: 212 QYKNPSSLIPQDNGLRYNDYADLVSNVENYNESLKESLSYWICKYIVKLAAKIDKEEDED 271

Query: 152 EIAENFSKLACNAHTICNSELRP--------LGTGLYPVISIINHSCLPNA-VLVFEGRL 202
           E+     +  CNA  I   + RP         G G+Y   S  NHSC PN    V    L
Sbjct: 272 ELLNILLRNRCNAFYI---QGRPRDGTSGESRGCGVYVRNSFFNHSCDPNVNYWVVNNTL 328

Query: 203 AV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            V    +++V +G E+ ISYI+T+     R++ L E YLFTC C +C
Sbjct: 329 EVECTLLKNVKEGEELTISYIDTSAPLNKRREKLLEGYLFTCLCTKC 375


>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
 gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
 gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|388496362|gb|AFK36247.1| unknown [Lotus japonicus]
          Length = 111

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/75 (56%), Positives = 51/75 (68%), Gaps = 9/75 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSIS---RCDGCFASSNLKKCSACQVVWYCGSNCQKLD----- 59
           GEVI++QEPYV VPNN+SS+S   RCD CF ++NL KCS CQ VWYCG+ CQ L      
Sbjct: 34  GEVILTQEPYVSVPNNNSSVSAQKRCDRCFTTTNLSKCSRCQTVWYCGTACQVLSSSSPF 93

Query: 60  -WKLHRLECQVLSRL 73
            + L RL  +VL  L
Sbjct: 94  PFPLFRLIREVLCTL 108


>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
 gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
          Length = 302

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 85  GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 144

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 145 MTKHFACDCVRCN-----DNTENGTYLSALFCREQGCRGLVI 181


>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
          Length = 462

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338


>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
          Length = 495

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 275 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 334

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 335 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 371


>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
           caballus]
          Length = 802

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 17/120 (14%)

Query: 146 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 190
           PE SI  +A   +  +L CNA  I              +S    L TG++PV+S++NHSC
Sbjct: 482 PESSIWGVAMLRHMLQLQCNAQAITAIQQTGSKENIITDSRQVRLATGIFPVVSLLNHSC 541

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A VRA QH+ KG E+L  Y   E+      RQ+ L+ QY F CTCP C
Sbjct: 542 SPNTSMSFVSTVATVRASQHIGKGQEILHCYGPHESRMGVAERQQKLRSQYFFDCTCPAC 601


>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
          Length = 991

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   FE GRLAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 771 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 830

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 831 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 867



 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287

Query: 237 EQYLFTCTCPRC 248
               F C C RC
Sbjct: 288 MTKHFICKCVRC 299


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 120/277 (43%), Gaps = 41/277 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL-----KKCSACQVVWYCGSNCQKLDWKL 62
           G +++   P+  V  +      C  CF   N      + C  C+    C   C   ++ L
Sbjct: 234 GTMVLRVSPFASVLEDHKIEKNCGFCFKKINKSIRINQTCKNCKNHLLCPQ-CSVDEYSL 292

Query: 63  --HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR--RKLQNDNVIPSTTT---------- 108
             H+ EC +L+ L +    S T   R M ++ L   +  +N +      +          
Sbjct: 293 NYHKDECDILNFLKQYYPSSQTRDFRFMFRVLLNVIKDKKNKSFSKENQSKQWLNHQNPF 352

Query: 109 --DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLI---LQWPE------ISINEIAENF 157
             D+Y  +  L   +  +  +Q+  + + A  V  I   L+ P+      ++I+EI E +
Sbjct: 353 IFDSYKYLINLSRTLDKVQPEQMEAFKRSAQSVIAIFNKLRGPKFFDECGVTIDEIIEIY 412

Query: 158 SKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHV 211
           S +  N H +    L PL     G G++P  S +NHSCLPNA    + + + V R ++ +
Sbjct: 413 SIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQGMMVFRTLRPI 468

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            KG E+L SY +       R+K L +QY F C C +C
Sbjct: 469 KKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 505


>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
 gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
          Length = 390

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 17/149 (11%)

Query: 154 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--- 206
           A+   +L C    NA T+ ++ L PLG  + P  ++INHSC PNAV+VF  R+  +    
Sbjct: 54  ADALMELVCQHETNAFTLADAHLNPLGVCIEPTFALINHSCDPNAVIVFPDRVRGMPSKM 113

Query: 207 ---AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
              A++ +  G EV ISY++ A     RQ  L E+Y F C C  C ++G  D        
Sbjct: 114 HLVAIRAISAGEEVRISYVDVASCQAERQATLLERYCFACECRLCKRMGWRDPRM----- 168

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFT-CQQC 291
               C    C+G+++  +    +T C QC
Sbjct: 169 -ALWCPQPKCTGWVVFQNAASSWTACTQC 196


>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 635

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 128/293 (43%), Gaps = 53/293 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+V+  + PY  V       + C  C + S NL  CS C V  YC   C+KL W++ H++
Sbjct: 265 GDVVTIENPYAYVIYTQRYYTHCHHCLSRSYNLIPCSHCPVAQYCSEKCRKLAWEMAHQI 324

Query: 66  ECQVLS------RLDKEKRKSVTPSIRLMLKLYLRRK----LQNDNVIPSTTTDNYSL-- 113
           EC +++       +DK+K + +T  IR ++ +  + K    L+ D  +  +  DN S   
Sbjct: 325 ECPIMALVGNLLNVDKDKIRMLTKIIRFLIVITSKGKKFDELRVDMELAESNPDNRSAGF 384

Query: 114 -------------VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP------EISINEIA 154
                          +L  +M+      +  +A I+ L  ++L         +  +N++ 
Sbjct: 385 TDEGILDSTSARSALSLATNMTTRPLIGISAFACISALAAILLATQTNFFCNKYEVNQLK 444

Query: 155 E--NFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
           +  N+ K+         AC     N  ++        G+GLY   S+ NHSC PN    F
Sbjct: 445 DISNYPKIIFSGSLMFRACVIMSSNCFSVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHF 504

Query: 199 EGRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRC 248
           E    + RA++ +  G ++  +Y   A + MT   R+K + + Y F C C  C
Sbjct: 505 EELTMITRALRPIYPGDQIFTNY-GAAYAYMTKSERRKKIIQDYFFECDCIAC 556


>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
 gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
          Length = 680

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 67/274 (24%), Positives = 117/274 (42%), Gaps = 35/274 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-----ASSNLKKCSACQVVWYCG--SNCQKLDW 60
           G +++   P+     +      C  C+     +  +  KC  C     C   S       
Sbjct: 210 GSMLLRVSPFGSCLEDDKIFKNCGFCYKKIEKSKRDQCKCKICNNFLLCERCSTTDPFAS 269

Query: 61  KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR----RKLQNDNVIPSTTT-------- 108
           + H+ EC +L  L +    + T   R M ++ L     +K Q+      ++         
Sbjct: 270 EYHKEECDILQFLKEYYPSTETKDFRFMFRVVLNALKEKKFQSFKKENCSSNWKKHEHPF 329

Query: 109 --DNYSLVEALVAHMSDIDEKQLLLYAQIANLV----------NLILQWPEISINEIAEN 156
             D Y  +E L   +  +D +Q+  + + A+ +          + IL++  I++NEI E 
Sbjct: 330 IFDEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAVISKLRGEDAILKY--ITLNEIIEL 387

Query: 157 FSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKG 214
           +S +  N H + +    R  G G++P  S +NHSC PNA     E  + V R+++ + KG
Sbjct: 388 YSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVFRSLRPLKKG 447

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            E+L SY +       R+K L +QY F C C +C
Sbjct: 448 EELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQC 481


>gi|328715975|ref|XP_003245797.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 632

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 124/291 (42%), Gaps = 48/291 (16%)

Query: 8   GEVIISQEPYVCVP-NNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK-LHR 64
           G+V+   EPY+C P  +   +   +GC      L +C  CQ+V+YC  +C   D+K  H 
Sbjct: 249 GDVVAIDEPYICGPFRDDIEVCHYNGCLKLGFALFRCPKCQLVYYCNEDCMNKDYKDGHN 308

Query: 65  LECQVLS--RLDKEKRKSVTPSIRLMLKLYLRRKLQN-----DNV--------------I 103
           L C ++   +L     +    +++  LK YL+  L+      DN               I
Sbjct: 309 LACPIIYFIKLTPGFPRMNELAMKWFLKDYLKMGLKKYCLIVDNFSESKIDPMTRGFDEI 368

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA--NLVNLIL---QWPEISINEIAE--- 155
               +DN+    +L    + +  + L  +  IA   L  LIL   + PE  I  +     
Sbjct: 369 GQYKSDNFLTAYSLDNSENKLPMEILFFFNCIAVDMLHYLILSGFKIPERYIGSVGASLV 428

Query: 156 --------NFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPNAVLVFE--GR 201
                   N  KL  NA T+     R L       LYP IS+ NHSC  N     E   R
Sbjct: 429 RILTVLDLNCRKLNINAPTVSFRRDRRLTLRIALTLYPTISLFNHSCDANIKRSGEITDR 488

Query: 202 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           + V++AVQ +PKG ++  +Y  I       +RQ+   +++ F C C  CIK
Sbjct: 489 IRVMKAVQPIPKGTQLCCTYGMIYNGHDKESRQEVCNDRFNFKCNCQPCIK 539


>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 415

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 55/200 (27%), Positives = 93/200 (46%), Gaps = 14/200 (7%)

Query: 55  CQKLDWKL-HRLECQVLSRLDKEKRKSVTPS-IRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           CQ+ DWK  H  EC +   L+      + PS  R +L++ +R   +        T     
Sbjct: 5   CQRKDWKFAHAHECAIFKNLNPR----ILPSNARALLRMIVRTAHKK------YTNGELE 54

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 170
           L   L  H+S+I ++    + +IA     +  +    + E  I+   +KL  N+    N 
Sbjct: 55  LFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNSFNFTNI 114

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 230
               +G  L+P  ++INHSC  NA + F+     ++A + + KG ++ ISYI+       
Sbjct: 115 LSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDATNPVKL 174

Query: 231 RQKALKEQYLFTCTCPRCIK 250
           R+  L+E+Y F C C +C K
Sbjct: 175 RRSELRERYYFDCHCAKCAK 194


>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
          Length = 576

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 127/284 (44%), Gaps = 29/284 (10%)

Query: 165 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 222
           + + NS    L TG++PV+S++NHSC PN  + F   +A VRA + + KG E+L  Y   
Sbjct: 296 NNVTNSRQVRLATGIFPVVSLLNHSCSPNTSMSFVSTVATVRASEKIAKGQEILHCYGPH 355

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 282
            +  +   RQ+ L+ QY F C+CP C K      +      E + C   G  G LL+  D
Sbjct: 356 HSRMAVAERQQKLRAQYFFDCSCPVCQKEKHRATV--GPRWEAFCC---GRCGALLQGGD 410

Query: 283 DKGFTCQQCGLVRSKEEIKK--IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--- 337
                   CG     E + +  + S V  L ++       G  Q+++   ++ + +Q   
Sbjct: 411 -----VLSCGGTSCMESVSRDHLISRVQDLQRQV------GGAQKLLGNGQLEQAIQLLL 459

Query: 338 ------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 391
                 +    P    + +  + L +    L DW+++ A+ Q ++ V +  +      +G
Sbjct: 460 GCRHDAENFLSPEHSVVGEMEDALAQAYAALGDWQKSAAHLQKSLQVVEAQHGPSSVEMG 519

Query: 392 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            + +   +L +       A+ ++  A ++L + +G  S  ++EL
Sbjct: 520 HELFKLAQLFFNGFAIPEALNTIERAEKVLLVHYGPCSDEIQEL 563


>gi|324508446|gb|ADY43564.1| SET domain-containing protein 14 [Ascaris suum]
          Length = 421

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 72/337 (21%), Positives = 139/337 (41%), Gaps = 24/337 (7%)

Query: 36  SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRR 95
           ++ L+KC+AC+ ++YC  +CQK DW +H++EC+      K    +   S RL+L++  + 
Sbjct: 34  TTKLEKCAACKSIFYCNRSCQKADWPMHKVECKFC----KAFSSAGDESYRLLLRIVKKL 89

Query: 96  KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAE 155
           +L  D  +             L+ H +++ E   +        +  I    +IS + +  
Sbjct: 90  ELGEDGTVAGNRK-----FSDLIDHKNELTEVYKMWRVGFDEYIGCIPNAVQISDDLVQS 144

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
              K+  N+  + +   + +G  L   +S ++HSC P A + F G    +   Q      
Sbjct: 145 IICKIFINSFGLTSVFGQNIGIALCLQLSALDHSCKPTARIAFRGNECRMVPTQSNTTNV 204

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
            +  SY++   +   R+K L ++Y F C C  C+      D + +  +  + C+      
Sbjct: 205 VLAHSYVDELLTRDERRKLLMDRYKFDCKCEGCM------DEERNKDMVAFSCE------ 252

Query: 276 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIE 334
              +   + G  C  C +  S E +       ++      AL+ C    +  +   K +E
Sbjct: 253 -TCKRPIEIGAVCSNCKIKMSAERMALCKLVADLAESSINALSRCPTATDRFTLCSKSLE 311

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAY 371
            +   LY  F+V  +       +    LE + EA  Y
Sbjct: 312 VVDDVLY-SFNVRRLALLRAAYETCFSLERFTEAGFY 347


>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
          Length = 799

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 126/325 (38%), Gaps = 56/325 (17%)

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
           NFS  A  A T  N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G 
Sbjct: 467 NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 524

Query: 216 EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
            + +SY    G      RQK LK++Y F C C  C +L   D      +L  +RC +  C
Sbjct: 525 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 579

Query: 274 SGFLL-----------------------------------RDSDDKGFT----------C 288
            G +L                                   R+   + F           C
Sbjct: 580 FGTVLDSCVIKYENKKFERFQGVPQDCISEPHLQLKNDGIREVAHQAFANSSFRAAPGYC 639

Query: 289 QQCGLVRSKEEIKKIASEVNI-LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
             CG  R  E       E  I +S+   A+ S        S       L K   H ++  
Sbjct: 640 LHCGAYRDLEASHATVGEAGIYISRLQEAIVSKEVPATTFSDALRSLDLLKSTLHAYNKG 699

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 407
           + +  + + +    + + + A+ +C+ +I + +++Y   H ++G +      ++  LGDT
Sbjct: 700 IAEAEDWIAQAFCMIGELQPAMHHCKASIEILEKLYGSNHIVIGYELMKLSSIQLSLGDT 759

Query: 408 ENAIKSMTEAVEILRITHGTNSPFM 432
             A+KS++    I    +G ++  M
Sbjct: 760 A-AMKSISRLAAIFSWYYGPHADMM 783


>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
           glaber]
          Length = 793

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 123/274 (44%), Gaps = 29/274 (10%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y   ++  S   RQ
Sbjct: 520 LATGIFPVVSLLNHSCSPNTSISFSGTVATIRAAQQIGKGQEILHCYGPHKSWMSIAERQ 579

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 290
           + L+ QY F C CP C     ++ ++ +A    E +RC +  C   +      +G     
Sbjct: 580 QKLRSQYFFDCNCPAC----HYEKLRTAAGPRWEAFRCNN--CRALM------QGDDVLS 627

Query: 291 CG-------LVRSK--EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 341
           CG       + R+    +++ +  +V I SK    L   G   + +       +  +   
Sbjct: 628 CGNRACLEPISRTHLVSQLQDLQQQVGIASK----LLRNGKLDQAIPRLLRCRQDAESFL 683

Query: 342 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 401
                   +  + L +    L DW+++  + Q ++ V +  +      +G + +   ++ 
Sbjct: 684 AAEHAVAGEIEDNLARAYAALGDWQKSATHLQKSLQVVEARHGPSSVEMGHELFKLAQVL 743

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           +       A+ ++ +A E+L +  G  S  ++EL
Sbjct: 744 FNGFAVPEALNAIQKAEEVLLVHCGPQSDEVQEL 777


>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
          Length = 287

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 54/86 (62%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 222
           N+  I ++    +G G  P++ + NHSC PNA L F+GR AV+ A++H+ +G E+ ISYI
Sbjct: 188 NSFRITDASGDEIGIGFDPLLGMANHSCAPNASLKFDGRCAVLTALKHIEEGEEITISYI 247

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRC 248
           +T      RQ  L++ Y FTCTC  C
Sbjct: 248 DTTLPRAARQAFLQKHYYFTCTCAAC 273


>gi|339236741|ref|XP_003379925.1| MYND finger protein [Trichinella spiralis]
 gi|316977375|gb|EFV60485.1| MYND finger protein [Trichinella spiralis]
          Length = 476

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/447 (23%), Positives = 174/447 (38%), Gaps = 46/447 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN------LKKCSACQVVWYCGSNCQKLDWK 61
           G+V I  +P+  V  N    + C  CF   N      L+ C  C    YC   CQ+  WK
Sbjct: 10  GKVTIFDKPFASVVLNQQVENVCGYCFQRPNGKTCKRLQICGGCHWYRYCNRACQRASWK 69

Query: 62  -LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT----DNYSLVEA 116
             H+LEC  L        + V P++ +   L+L R       I +             + 
Sbjct: 70  EHHKLECARL--------QLVFPNLPVTEVLFLGRICDRLRFIEANGDLKKWQAERRFDE 121

Query: 117 LVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 174
           L++H  +I  D++++  +  I +     L        +    F +   N+H+I ++    
Sbjct: 122 LMSHEEEIRQDKEKMKHFELIYDKAQKFLASAIPKREKFFLIFCRSWINSHSIHSNTGIE 181

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH---VPKGAEVLISYIETAGSTMTR 231
           +G  L   IS  +HSC PN  +VF G  AV+R + +       ++  I+Y++   S   R
Sbjct: 182 VGMALDLGISKYDHSCRPNTAMVFNGFRAVLRPLVNGIDTADPSQCFIAYVDVGRSRYQR 241

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTC 288
           +K L+ ++ F C C RC      DD      L   RC +  CS  +    D    K   C
Sbjct: 242 RKELQSKWYFWCECERCRDPS--DD-----RLTSIRCVNVDCSEPVCITEDQTNTKNVQC 294

Query: 289 QQCGLVRSKEEIKKIASEVNILSK-KTLALTS-CGN--HQEVVSTYKMIEKLQKKLYHPF 344
           + CG         KI   V I ++   LAL    GN    E V   K+     ++L H  
Sbjct: 295 RGCG--------SKIPENVIIEAQCFMLALPQHFGNVKSAEEVRRLKIYLNTAERLLHKE 346

Query: 345 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 404
           ++   +     +++   ++ +   L   +     Y+R  P     +G Q     +     
Sbjct: 347 NIYFCRLLTAYLQLTEGVDSFANNLELQKSVYTNYRRCLPAADRHVGFQLLHIVRTLITQ 406

Query: 405 GDTENAIKSMTEAVEILRITHGTNSPF 431
                +I    EA+ I  I  G + P+
Sbjct: 407 NRRAESIPYAHEAMRIFEICFGLDHPY 433


>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
 gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
          Length = 701

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 113/267 (42%), Gaps = 44/267 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           GE I+  +P+V V ++      C GCF +  S+  +C     V YC  +C++ D   H  
Sbjct: 132 GETILRCDPFVHVVHDRRREDHCAGCFKTLQSSCVECGDFCGVKYCADDCRESDVA-HVA 190

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC ++ R +      +    R+ L+L  +R+           TD     E + A      
Sbjct: 191 ECSMV-RTNAGTGADLR-GARMCLRLIHKRQ-----------TDPARFAEVMAALRC--- 234

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           EK+    A    L + +     +  +E+ +   K   N+H + + +LR LGTG+YP  S+
Sbjct: 235 EKKSPSPAAT-KLASAVRHAAALDPSEVEDMLGKTRENSHGVVDWKLRQLGTGIYPEASM 293

Query: 186 INHSCLPNAVLVF----------------EGRLAVVRAVQHVP--------KGAEVLISY 221
            NHSC PNA + F                E R   V    +VP        KG EV I+Y
Sbjct: 294 FNHSCAPNAAVSFGAGGTLNTCCIGVDDYEARAMEVGPKGYVPLGRPRAIRKGEEVCIAY 353

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRC 248
            E      +R+  L++   F C C RC
Sbjct: 354 TELYRPAASRRLTLEKSKGFVCECVRC 380


>gi|328697017|ref|XP_003240210.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 635

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 80/290 (27%), Positives = 127/290 (43%), Gaps = 48/290 (16%)

Query: 8   GEVIISQEPYVCVPN-NSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HR 64
           G+V+   EPY+C P+ N + +   +GC      L +C  C +V+YC  +C   D+K  H 
Sbjct: 253 GDVVAIDEPYICGPSGNDTEVCHYNGCLKMDFALFRCPKCFMVYYCTKDCMDKDYKEGHN 312

Query: 65  LECQVL----SRLDKEKRKSVTPSIRLMLKLYLRRKLQN-----DNV------------- 102
             C ++    SR+    R +   +++  LK YL+  L+      DN              
Sbjct: 313 FACPIIYFIRSRIPGISRMNEL-AMKWFLKDYLKMGLKKYCSIVDNFSESKIDHITRGFD 371

Query: 103 -IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA--NLVNLIL---QWPEISINEIAE- 155
            I    +DN+    +L    + +  + L  +  IA   L  LIL   + PE  +  +   
Sbjct: 372 EIGQYKSDNFLTAYSLDNRENKMPMEVLFFFNCIAVDMLHYLILSGFKIPEQYMGSVGAS 431

Query: 156 ----------NFSKLACNAHTIC-NSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRL 202
                     N  KL  NA T+   S+   +   LYP IS+ NHSC  N     E   R+
Sbjct: 432 LVHILTVLDLNCRKLNTNAPTVSFRSDTYTIALTLYPTISLFNHSCDANIKRSGERIDRI 491

Query: 203 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 250
            V++A+Q +PKG ++  SY  I     T +RQ+   +++ F C    CIK
Sbjct: 492 RVMKAIQPIPKGTQLCCSYGIIFNGHDTESRQEVCNDRFNFKCYSQPCIK 541


>gi|426195740|gb|EKV45669.1| hypothetical protein AGABI2DRAFT_223832 [Agaricus bisporus var.
           bisporus H97]
          Length = 443

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/279 (26%), Positives = 123/279 (44%), Gaps = 53/279 (18%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA----SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +G V++S   +      +    RCD CF        LKKC+ C   +YC + CQK+ W L
Sbjct: 28  AGSVVLSVPAFTSALLEAEKGRRCDTCFRLPPDGEKLKKCTGCASYFYCDTRCQKVQWDL 87

Query: 63  -HRLECQVLS---------RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT--DN 110
            H+  C+  S         +  + +R  V     ++L  +L R ++   V PS     D 
Sbjct: 88  EHKRVCKSYSLMISYLHGQQFSQHERMDV-----ILLSHFLGRTVKTKPV-PSADNPHDP 141

Query: 111 YSLVEALV--AHMS----DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNA 164
           +S+  +L+  +H S    D   K L+L   +                 + + +S+   N 
Sbjct: 142 FSVFLSLLPGSHASQRTLDSIPKSLILDDSL-----------------VQDIYSRFGNNN 184

Query: 165 HTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAEVL 218
            TI +S L  +G G++P+ S + NHSC+PNA   +     +  L  V A++    G E+ 
Sbjct: 185 FTI-HSHLNSIGHGVFPLASRLFNHSCIPNAAPRYVQGSAQPVLMEVVALRDTDIGEEIC 243

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
           I Y++ A  T +R +  +  Y FTC C  C  + Q   I
Sbjct: 244 IPYLDPA-LTQSRSQIFQYTYGFTCQCGPCRDISQLGQI 281


>gi|412986118|emb|CCO17318.1| SET and MYND domain-containing protein 3 [Bathycoccus prasinos]
          Length = 734

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 55/93 (59%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           S+ ++A+   KL  N HT+C+ ELRP G G+YPV +++NHS  PN    F G+  +VR +
Sbjct: 280 SVADLAKLIGKLRLNCHTLCDDELRPYGIGVYPVAAMMNHSENPNCFATFRGKKMIVRCL 339

Query: 209 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           + V  G E+ ISY E       R K+LK  Y F
Sbjct: 340 RDVLPGEELTISYDELMKPKRERAKSLKSNYGF 372


>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
          Length = 1187

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 43/292 (14%)

Query: 8    GEVIISQEPYVCVPNNSSSISRCDGCFASSNL------KKCSACQVVWYCGSNCQKLDWK 61
            GE+++   PY     + + ++ C  CF + +       +KC  C +   C S+   ++  
Sbjct: 740  GELLLRVAPYGAALVDDTLLTHCTSCFRNISYYKHHLCQKCKQCILCEECNSDVDLVN-- 797

Query: 62   LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRR--------------KLQNDNVIPSTT 107
             H  EC +L  L +    + T   R M+++ L+               K  + N +P   
Sbjct: 798  EHNEECDILVFLKQNVPGADTRDFRFMVRVMLKSIAILNGKLSKEQSPKCWSKNGVP-FI 856

Query: 108  TDNYSLVEALVAHMSDIDEKQLLLY-AQIANLVNL--ILQWPE----ISINEIAENFSKL 160
             D+Y  +  L    S+ID KQ+  +     +++N+  + + P+    ++  +I + + K+
Sbjct: 857  FDSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKSLEPLTNQQILDLYPKM 916

Query: 161  ACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVL 218
              NAH   +      +  G+YP  + +NHSC PN V   +   +   R+++ +  G E+ 
Sbjct: 917  LFNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIRDIKAGEEIT 976

Query: 219  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
             +YI+      TRQ  L  QY F C C RC         Q+ A   G++C D
Sbjct: 977  TTYIDITKYKSTRQLNLLSQYAFLCQCARC---------QDRAT--GFKCLD 1017


>gi|308485700|ref|XP_003105048.1| CRE-SET-10 protein [Caenorhabditis remanei]
 gi|308256993|gb|EFP00946.1| CRE-SET-10 protein [Caenorhabditis remanei]
          Length = 436

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 164/362 (45%), Gaps = 46/362 (12%)

Query: 30  CDGCFASSNLKK------CSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVT 82
           C  CFA  ++        C  C  V YC   CQ+ DW+ +H+ EC++L    +  +  +T
Sbjct: 23  CATCFAELDVNGEAEILMCDDCSEVSYCSLKCQRKDWRSVHQFECEILRT--QNNQTPMT 80

Query: 83  PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI----DEKQLLLYAQIANL 138
            ++RL ++  L   L+N    PS    N +++E L  +  +        Q L     ++L
Sbjct: 81  TTMRLCIRTLLV-TLRNSERSPSF---NGAIIEDLETNYKEFRSSPSHNQFL-----SDL 131

Query: 139 VNLILQ-----WPE-ISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCL 191
           V +I       +P+ +  N++      + CN+  I + + + P+G+GL+  ++  NHSC 
Sbjct: 132 VTIIKSVGHNVFPKSVETNKMIAIICTVLCNSFGIIDDKRVEPIGSGLFVGLAKHNHSCA 191

Query: 192 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 251
             + +VFE    ++R  + +       ISY+     T  RQK+++  +  TC C  C   
Sbjct: 192 STSHVVFEKNQVLLRG-RDMDYCKSTTISYVSRMLPTFERQKSIRSVHFITCRCEMC--- 247

Query: 252 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 311
            + DD+    +    RC+   CSG+ ++ SD  G  C++  ++   E     +  ++IL 
Sbjct: 248 -RNDDLDFIGL--ASRCQTVNCSGY-VKGSDSCG-VCKKPAVIPIMESASSTSKLIDILD 302

Query: 312 KKTLALTSCGNHQEVVSTYKMIEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEA 368
               +     N  +  + Y  ++ L+K   ++    +V ++Q  E++     +L+   + 
Sbjct: 303 NLHKS-----NEFDCTTQYDYLQNLRKEYIRILADCNVAILQLDEQIAYCASDLKKIPDD 357

Query: 369 LA 370
           L 
Sbjct: 358 LT 359


>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 704

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 106/516 (20%), Positives = 202/516 (39%), Gaps = 115/516 (22%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQK-LDWKLHRLEC 67
           E++I ++PY  +       S C  CF   +   CS C    YC S C+     + H +EC
Sbjct: 207 ELLIKEKPYAAIILKEEESSHCHQCFEQCSPIPCSNCIHARYCSSRCRSDCLSQYHSIEC 266

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIP----------------------- 104
                L   ++ SV   + L + +   R+  ++++                         
Sbjct: 267 GTEGLL---QQVSVFSRLSLRILITAGREALSNHICKLKTSSSPPPSSTSTSTSLTTAKG 323

Query: 105 ------STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS 158
                  +   NY  +  L AH  +   K+L+ YA  + L+     + E+ + +  E FS
Sbjct: 324 SSLGYLDSGLVNYECIVGLEAHWFNHSHKELVQYAVTSILLAKCF-YRELVLPKTCETFS 382

Query: 159 -----------------KLACNAHTICNSELRP----------------------LGTGL 179
                            +L  N+H I  +E+R                       + T +
Sbjct: 383 EEELITEIASLLLLHTRQLKSNSHAI--TEVRSSEGENTAGESVGGSVQQISQGRIATAV 440

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKE 237
           YP +S++NH+C PN +  F   +  VRA++ + +G E+   Y    G  +T  RQ+AL  
Sbjct: 441 YPTVSLMNHACQPNVIASFRKGIISVRAIEKIMRGDEIQHCYGPQVGHMTTSDRQQALLN 500

Query: 238 QYLFTCTC------PR-----------CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 280
           QY FTC C      PR           C + GQ  +IQ S      +C +    G L+ +
Sbjct: 501 QYCFTCRCRACTRKPRTFDKEEDLCIKCPQCGQPLNIQTSMC---GKCAERIDVGVLIHE 557

Query: 281 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
             + G T     L+  +E      ++  ++             +EV+S  K    + +++
Sbjct: 558 LTNAGTT-----LIGLEEMFSAAVNDDTLM-------------REVISKTKSCIDVLERI 599

Query: 341 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
             P  + L    + + K  +EL+++KEA  +     P  +  + +    +  + Y   ++
Sbjct: 600 IIPPDMQLATAYDDMAKCHVELDEFKEAALWLAKATPSIESRFGRDSIEVAHELYKLAQI 659

Query: 401 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 436
            +   +   A++++  A+E+    +G ++  + EL+
Sbjct: 660 YFNGKEIAPAMETIDRALELFIRHYGNSNEEVLELV 695


>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 509

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/266 (25%), Positives = 121/266 (45%), Gaps = 41/266 (15%)

Query: 160 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 217
           + CN   I N  L   G+G+YPV +++NH+C+ N VL ++   +   +RA++ +  G E+
Sbjct: 254 MQCNNFAIHNDILFARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEI 313

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGF 276
             ++ + A  T+ R+  LK  Y F C C RC      D  +E  A L   R  D   + F
Sbjct: 314 THAFTDAASPTVVRKAHLKSLYAFDCNCSRC---NDSDAAKELDAELVATRPIDTIPAYF 370

Query: 277 LLRDSDDKGFTCQQCG-----LVRSKEEIKKIAS--EVNILSKKTLALTSCGNHQEVVST 329
                  K F  ++       L R+  ++ K++S  +V  L + + +    G    + + 
Sbjct: 371 -------KRFRLERLAGLPSLLERATRDVPKVSSPADVEALLQASRSWLQEGMDPRLPTA 423

Query: 330 YKMI-----EKLQKKLYHPFSVNLMQTREKLIKILM-----ELEDWKEALAY-------- 371
             ++       +++ L   +++ L ++  K+  + +      L + +  LA         
Sbjct: 424 RAIVLLVTAWAVRQALLGDYNLELFESNVKIFGMALLWREEALSERQRDLALVSGLRNVL 483

Query: 372 ---CQLTIPVYQRVYPQFHPLLGLQY 394
               +  I VYQR+YP  HPLLGLQ+
Sbjct: 484 VDSARHVIRVYQRIYPANHPLLGLQW 509



 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 20/78 (25%), Positives = 38/78 (48%), Gaps = 12/78 (15%)

Query: 7   SGEVIISQEPYVCVPN-NSSSISRCDGCF----------ASSNLKKCSACQVVWYCGSNC 55
           +GE++++ +PY  V + +  +  +   CF          A++ +K+C+ C+   YC   C
Sbjct: 46  AGELVLASKPYAAVADTDGPAAGKAPECFQAQDEDADVAAAAEMKRCAQCRRAQYCSVEC 105

Query: 56  QKLDWK-LHRLECQVLSR 72
           Q+  W   H+ EC    R
Sbjct: 106 QRAAWHGGHKAECAAWVR 123


>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
          Length = 445

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/400 (21%), Positives = 169/400 (42%), Gaps = 65/400 (16%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFA-------SSNLKKCSACQVVWYCGSNCQKLDWK- 61
            +  ++P   V +     + C  CF+        S +  C  C  V YC   CQ+ DWK 
Sbjct: 3   TVAREKPLAAVLSPEFQDTYCATCFSEIDPSHLDSEILTCDDCTQVSYCSLKCQRKDWKT 62

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR---------------RKLQNDNVIPST 106
           +H+LEC++L    +   +++T ++RL +++ L                ++  +   +   
Sbjct: 63  VHQLECEIL----RGTAQNMTVTMRLCVRVLLNTIGNSNGPDIDALETKRKGSGGQLNRG 118

Query: 107 TTDNYSLVEALVAHMSDI----DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLAC 162
                +  E+LV H  +        Q L     ++++ +I         E  +N   +A 
Sbjct: 119 YAARRAAGESLVTHYKEFRSSPKHNQFL-----SDILTIIKASGHNIFPESMDNNKMIAI 173

Query: 163 NAHTICNS-------ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
               +CNS        + P+G+G+Y  ++  NHSC     +VF+G  AV+R+ +      
Sbjct: 174 ICSVLCNSFGIIAEKRVEPIGSGMYVGLATHNHSCASTCHVVFDGNQAVLRS-RDRQYCK 232

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 275
              ISY+     T  RQK+++  +  TC C  C+     +D+  + +    +C+   C G
Sbjct: 233 NTTISYVSRMLPTFERQKSIRNVHFITCRCEMCLN----EDLDLTGL--ASKCQTSKCQG 286

Query: 276 FLLRDSDDKGFTCQQCGLV---RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 332
           F ++ ++    TC +  ++   +S +   K+   +  L K            + V  Y+ 
Sbjct: 287 F-VKGANGSCTTCGKPAMIPFEQSTQSTSKLLDTLENLHK--------SQQLDTVQEYRH 337

Query: 333 IEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEAL 369
           ++ LQ+   ++    +V ++Q  E+L     E+++  E +
Sbjct: 338 LQNLQEEYGRILADCNVAILQLDEQLAYCASEMKNVPEDM 377


>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
           206040]
          Length = 543

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 106/227 (46%), Gaps = 22/227 (9%)

Query: 30  CDGCFASSNLKKCSACQVVWYCGSNCQKLDWK-LHRLECQVLSR--LDKEKRKSVTPSIR 86
           C  C    N + CS C    YC + CQ   WK +H  EC+VL +   D+++R+ +    R
Sbjct: 54  CSYCLRPGNPRACSRCHAASYCDATCQAAAWKAVHSRECKVLRQGIKDEDRRRRLPTPTR 113

Query: 87  LMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM-----SDIDEKQLLLYAQIANLVNL 141
            +++  LR+++ +              +E L  H+     ++ DE + +    +A     
Sbjct: 114 ALIQAVLRKEIGDG-------------LEGLEGHVLEKKAAEGDEWKDIEIMAMAACAFS 160

Query: 142 ILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 201
                E  I + AE   K+  N+    +S+L  +G  L P +++ NHSC+PNA + F GR
Sbjct: 161 GKGTAEEDIRKAAEMLCKIQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGR 220

Query: 202 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            A++ A   +  G E+ ++Y         R++AL   Y F C C RC
Sbjct: 221 NALLIAENPIRAGDEMELAYTFYTDPLPKRKEALA-HYKFVCQCLRC 266


>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
           B]
          Length = 454

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 110/265 (41%), Gaps = 37/265 (13%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWK 61
            +G +++S          S    RCD C      S  L +C+ C   WYCG  CQK  W+
Sbjct: 28  AAGSIVLSAPALSTTLLQSEKGRRCDACHRLESVSVKLLRCAGCAAYWYCGKPCQKKQWR 87

Query: 62  LHRLE-CQVLSR-------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN--- 110
            H  + C+   +       LD    + +     +ML   L     +++  P T  +N   
Sbjct: 88  AHHKKICKHYGQYTHSPTFLDLRPEEQID---AIMLSHLLAEAYPDND--PDTLPENNTA 142

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
           +S+   L  H         +          L  Q  E+ +     NF         I +S
Sbjct: 143 FSVFLDLQKHTPPTPNNPPV--CPPPENPALAKQATELYLRFGNNNF---------IVHS 191

Query: 171 ELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAEVLISYIET 224
            L P   G++P+ S   NHSC+PNAV+ +     E     V A++ + +G E++I Y++ 
Sbjct: 192 HLYPYAHGIFPLASRTFNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEEIVIPYLDP 251

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCI 249
           A S   R+ AL+  Y F C+C  C+
Sbjct: 252 ALSYAARRDALQTNYGFICSCALCV 276


>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 425

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/357 (22%), Positives = 147/357 (41%), Gaps = 59/357 (16%)

Query: 54  NCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
            CQ  DWK  H LEC + S+L   K + +  + R +L++  R   +        T     
Sbjct: 51  TCQSKDWKFAHSLECAIFSKL---KPRVLPVNARAVLRIVQRSARRK------YTPQELD 101

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 170
           L + L  H  +I  +    + +IA     +  + +    E  I+   +KL  N+  +  +
Sbjct: 102 LFQQLETHEKEIRHENAPQWERIALSSKAVKAYSQTDTPEDTISAFGAKLDVNSFNMTTA 161

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 230
               +G  L+P  ++INHSC  NAV+ F+G      A++ + +  ++ ISY++       
Sbjct: 162 LADRIGLYLHPYAALINHSCAYNAVIGFDGAELFATALRPITRDEQIFISYVDATNPVAV 221

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 290
           R+  L+E+Y F C C +C    + D               DG       D+  K +T  +
Sbjct: 222 RRNELRERYFFDCRCAKCA--AEMD-------------APDGGRAPAAADAQRKAYTLME 266

Query: 291 CGL---VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 347
                 V   E ++ + S +  L +   A            ++++ E+            
Sbjct: 267 VATGAGVDPAEAVRALESAMRTLRESEGA------------SWRITEQ-----------P 303

Query: 348 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQ---RVYP-QFHPLLGLQYYTCGKL 400
           L+  R++LI  L+    ++   A+ Q  I V +    VYP + HP+  L  +T  +L
Sbjct: 304 LVSLRDELIASLLGAGRFQ--TAFVQAAIRVVRVDCVVYPVRGHPIRQLHAWTLARL 358


>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
          Length = 801

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 19/269 (7%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 232
           L TG++PVIS++NHSC PN  + F G +A VRA QH+ +G E+L  Y           RQ
Sbjct: 525 LATGVFPVISLLNHSCSPNTSVSFVGTVATVRASQHIRRGQEILHCYGPHVSRLGIAARQ 584

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQ 290
           + LK QY F C+C  C         +  A     R +   CS    LL+  D    +   
Sbjct: 585 QKLKSQYFFNCSCLAC-------QTEAHATAAAPRWEAFCCSSCRALLQGGDVLSCSSGS 637

Query: 291 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSV 346
           C    S++    +A+ +  L ++T          E+    + +   Q+     L+   SV
Sbjct: 638 CTEAVSRDH---LAARLRDLQRQTGLARELLRSGELEKAIQQLSGCQRDAESFLWAEHSV 694

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 406
            L +  + L +    L DWK++ A+ Q ++ V +  +      +G + +   ++ +   +
Sbjct: 695 -LGEIEDDLAQAYAALGDWKKSAAHLQKSLTVVEARHGPSSVEMGHELFKLAQILFNGFE 753

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
              A+ ++ +A  +L +  G  S  ++EL
Sbjct: 754 VPEALSTIQKAETVLAVHCGPCSDEIQEL 782


>gi|193676233|ref|XP_001943351.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 636

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 126/297 (42%), Gaps = 60/297 (20%)

Query: 8   GEVIISQEPYVCVP-NNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNC-QKLDWKLHR 64
           G+V+   EPY+C P ++ + + +  GC      L  C  C +V+YC  +C  K +   H 
Sbjct: 250 GDVVAIDEPYICGPISDHTGVCQYSGCLNLDLALIPCPKCLLVYYCNKDCMNKANEDGHY 309

Query: 65  LECQVLSRLDKEKRKSVTPSIRLM--------LKLYLRRKLQN-----DNVIPSTT---- 107
           LEC ++  +        TP I  M        LK YL+  L+      DN   S T    
Sbjct: 310 LECPIMYFIKS------TPGITRMNELAMKWFLKDYLKMGLKKYCLIVDNFSKSKTDPQT 363

Query: 108 ----------TDNYSLVEALVAHMSDIDEKQLLLYAQIA--NLVNLIL---QWPEISINE 152
                     +DN+    +L +  + +    L  +  IA   L  LIL   + PE  I  
Sbjct: 364 RGFDGTGQYKSDNFLTAYSLDSSENKVSIDVLFFFNCIAVDMLHCLILSGFKIPEYYIGF 423

Query: 153 IAE-----------NFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPNAVLV 197
           +             N  KL  NA TI     R L       LYP IS+ NHSC PN    
Sbjct: 424 VGASLVRILIVLDLNCRKLNINAPTISFQGKRQLTLTIALTLYPTISLFNHSCDPNIKRS 483

Query: 198 FE--GRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 250
            E   R+ V++A+Q +PKG+++  +Y  I    +  +RQ   K+ + F C C  CIK
Sbjct: 484 GELSDRIRVMKAIQPIPKGSQLCCTYGIIFRGHTKESRQDICKKLFNFKCYCQPCIK 540


>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 697

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/296 (27%), Positives = 125/296 (42%), Gaps = 59/296 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGC---FASSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           GE+++ ++P+  +      ++ C  C     +     C  C  V YC  NC+ LD ++H 
Sbjct: 283 GELLVVEKPHCSMLLGEYRLTHCHRCSIRIVAPYPASCYLCSSVAYCSPNCRDLDERVHS 342

Query: 65  LECQVLSRLDKEKRKSVTPSIRL---MLKLY------------LRRKLQNDNVIPSTTTD 109
           +EC +L  L   K  SVT  + L   + K Y             + KL+     P    D
Sbjct: 343 IECGLLGSLWCSK-ASVTCMMALRAIIQKPYEEFIKAKSELKKTKGKLEIGKGKPFVGRD 401

Query: 110 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------------WPEISINE---- 152
            Y     LV H  +   + L   A ++  +  +L+               EI+++E    
Sbjct: 402 -YKAFCNLVTHEDERTAEDLFHRAYMSAWLLRVLKTSSYLPASVKTPDAAEIALSEGETL 460

Query: 153 ----IAENFSKLACNAHTICNSEL-RP-------------LGTGLYPVISIINHSCLPNA 194
               I  +   L  N+H I  SEL RP             +G G++P +++ NHSC P  
Sbjct: 461 VADAILYHLQMLQFNSHEI--SELVRPRGKPDLSKGKSLFIGGGVFPTVALFNHSCNPGV 518

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
           V  F G   VVRA++ +P GAE+  +Y  I T      R++ L+ QY F C C  C
Sbjct: 519 VRYFIGNTMVVRAIKTIPAGAEISENYGPIFTEEEENDRKRKLRLQYWFDCDCEAC 574


>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 401

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 8/148 (5%)

Query: 146 PEISINEIA-ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
           P  ++ ++A +   +++ N  T C+    P G GL+P  ++INHSC PN    + G    
Sbjct: 20  PSAALLDLARKELGRVSLNGITTCDG-TGPTGLGLFPSGAMINHSCSPNCQAWWRGSQLE 78

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 264
           +R  + V  G E+ +SYI     +  R+  L+  + F C C RC+   Q+D     A L 
Sbjct: 79  IRCTKPVATGEELCLSYIPIDQPSTVRRAQLRHSWFFACRCRRCVSR-QWD-----AELV 132

Query: 265 GYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           G RC   GC+G +       G  C  CG
Sbjct: 133 GLRCPTKGCAGAVPPPPWRGGHACSHCG 160


>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
           familiaris]
          Length = 794

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 6/112 (5%)

Query: 141 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
           L LQ    +I  I E  SK     + +CNS    L TG++PV+S++NHSC PN  + F  
Sbjct: 488 LQLQCNAQAITTIQETGSK----ENNVCNSRQIRLATGIFPVVSLLNHSCSPNTSVSFSS 543

Query: 201 RLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIK 250
            +A +RA Q + KG E+L  Y    +      RQ+ L+ QY F C CP C K
Sbjct: 544 TVATIRASQQIGKGQEILHCYGPHYSRMGVADRQQKLRSQYFFDCGCPACQK 595


>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
 gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
          Length = 634

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 141/341 (41%), Gaps = 53/341 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWK-LHRL 65
           G+V++ ++P+  V     S S C  CF        CS C  V YC   C K  W   H  
Sbjct: 227 GDVLLVEKPFASVLLQEQSKSHCHQCFVHILAPLPCSYCTTVRYCSEKCAKESWDAYHYA 286

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKL---YLR---RKLQND---------NVIPSTTTD- 109
           EC  L  +    +     ++R+++K    YL+   ++ +++            P    D 
Sbjct: 287 ECMNLEHVYVAGKYGHL-ALRVVVKAGFQYLKASVKQFESEEKKCDPAELGCNPDGVYDP 345

Query: 110 -NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---------------I 153
            +Y  +  LV H  +     L +    A  +   L+  E   +                +
Sbjct: 346 SDYRPIYHLVGHTHERTLNDLFVRTLNAIYLLRCLEGTEYYGDSTKLPSREDQAFIGGLL 405

Query: 154 AENFSKLACNAH----------TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
             +   L CNAH          ++  SE   +G G+Y  +S+ NHSC PN    F G   
Sbjct: 406 LRHLQSLPCNAHEISELQLSLKSVATSEAAEIGAGIYGTLSLFNHSCEPNVTRFFYGDKC 465

Query: 204 VVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDI--Q 258
           VVRA   +P   EV+ +Y  +        RQ++L+ QY F C C  C++    + ++  Q
Sbjct: 466 VVRAFSSIPCRGEVVDNYGILSALTPRKQRQESLQSQYYFKCNCHACLEDSPLYSELIKQ 525

Query: 259 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 299
           +   L+   C+    +G +L  +D K   C++CG+ +S E+
Sbjct: 526 DVPQLKCANCR-MALAGEIL--TDGKLVKCEKCGVPQSLED 563


>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
          Length = 273

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 113/253 (44%), Gaps = 37/253 (14%)

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
           NH C PN  ++F      +RA+  + +G E+ +SYI+    +  R++ LK+QY F CTC 
Sbjct: 1   NHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60

Query: 247 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 306
            C K                + KDD    FL              G+  + +  +++  E
Sbjct: 61  HCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVVKE 87

Query: 307 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 362
           +   SK TL       S G + EVV   +   + Q+ ++   ++ +++    + ++L  L
Sbjct: 88  MIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYL 147

Query: 363 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 422
           + ++EA  Y +  +  Y ++Y   +  LG+     G   W  G+ E     + +A  IL 
Sbjct: 148 QAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILL 207

Query: 423 ITHGTNSPFMKEL 435
           +THG + P  K+L
Sbjct: 208 VTHGPSHPITKDL 220


>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Nomascus leucogenys]
          Length = 703

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 132/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 382 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 441

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 442 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 501

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 502 -------QTEAHRMAAGPRW---------------EAFCCNGCGAPMQGDDVLRCGSR-- 537

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 538 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAI 576

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 577 QRLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQRSLRVVEVRHG 633

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 634 PSSVEMGHELFKLAQ 648


>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
 gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
          Length = 430

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 130/283 (45%), Gaps = 32/283 (11%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCF------ASSNLKKCSACQVVWYCGSNCQKLDWK-L 62
            +  ++P   V +   S   C  CF        + +  C  C  V YC   CQ+ DWK  
Sbjct: 3   TVAREQPLAAVLSPQFSELYCATCFLEIDSSQETEILTCDDCLAVSYCTLKCQRKDWKSC 62

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           H+ EC++L       + S TP + L +KL +R  L + +    T + N +++E L  +  
Sbjct: 63  HQFECEIL-----RCQGSSTP-MTLTMKLCIRVLLASRST--QTPSFNGAVLEDLETNYK 114

Query: 123 DI----DEKQLLLYAQIANLVNLILQ--WP-EISINEIAENFSKLACNAHTICNSE-LRP 174
           +     +  Q L  + +  +++   Q  +P  +  N+       + CN+ +I N + + P
Sbjct: 115 EFRSSPEHNQFL--SDVLTIISSSGQNIFPTSLETNKTIGIICSVLCNSFSIINEKRVEP 172

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +G+GLY  ++  NHSC   + +VFEG    +R  Q      E+ ISY+     T  R+K 
Sbjct: 173 IGSGLYVGVAKHNHSCASTSHVVFEGNQVFLRTNQE-EYSKELTISYVSRMLPTSERRKT 231

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 277
           ++  +  TC C  C K  + D I  S+     +C+   C+GF+
Sbjct: 232 IRGVHFLTCQCEMC-KNEELDLIGLSS-----KCRTKNCTGFV 268


>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
 gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
          Length = 849

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 94/397 (23%), Positives = 163/397 (41%), Gaps = 29/397 (7%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA--SSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           +G ++ S+E    +       + C GCF    ++   C  C VV YC   C+      H+
Sbjct: 229 AGTLVHSEEALAGILLKKHRPTHCHGCFGVVPADSVPCIGCGVVSYCNDACRDDATVEHK 288

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL-QNDNVIPSTTTDNYSLVEALVAHMSD 123
           LEC           + V  +  L+  L     L  + N +PS T     L   L A ++ 
Sbjct: 289 LECGGSGWAAALPEEGVLAARILVTNLQCEVDLCHHYNDLPSQTKVELYL---LAATLAK 345

Query: 124 IDEKQLLLYA-----QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTG 178
             +   L ++         L+  +L++  + I  I  +    + +   IC  E   +   
Sbjct: 346 CLKASWLAFSFEDLLAKLVLLLAMLRFNAMGIIHIYSSDETGSSSGAHICGIEHVVVAQA 405

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA--LK 236
           L+   S  NHSC PN  + +  R       + +P    + ISY    G +   Q+   L+
Sbjct: 406 LFVRGSKFNHSCSPNVHVSYVKRTLRAHCTEALPAFCPLEISYGVQVGESEKSQRKIWLR 465

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDDKGFTCQQCGLVR 295
           + YLF C+C  C +LG   D+   A    Y C   GC G +L  D+ D+        L  
Sbjct: 466 DHYLFECSCEPC-RLGLVSDLLHIA----YYCCQPGCQGIVLDPDASDEDIL-----LPG 515

Query: 296 SKEEIKKIASEVNILSKKTLA-LTSCG---NHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
               ++ +    NIL +K L  L  CG   N  E ++    + +  +K +HPF+  L Q 
Sbjct: 516 GSGTVRFLIEHCNILPQKELEDLQICGSRLNGNEAIADGLRLLENGRKCFHPFNKQLAQM 575

Query: 352 REKLIKILMEL-EDWKEALAYCQLTIPVYQRVYPQFH 387
            + + ++  E  +    A+ + + +I + +R+Y + H
Sbjct: 576 EDIVARLYCEAGQQPAAAIVHARRSIEILERIYGKDH 612


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 119/272 (43%), Gaps = 42/272 (15%)

Query: 7   SGEVIISQEP---YVCVPNNSSSISR-CDGCF---ASSNLKKCSACQVVWYCGSNC--QK 57
           +G+VI+ + P   Y   P  SSS+S  CD CF   ASS  +KC +C +V +C  NC    
Sbjct: 32  AGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAHQKCQSCSLVSFCSPNCFASH 91

Query: 58  LDWKLHRLECQVLSRLDKEKRK-SVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
             W      C+ L RL +     S  PS R +   +L   L   N+  ++ +D   L+  
Sbjct: 92  TPWL-----CESLLRLHQSSSAFSDQPSDRQVQARFL---LSAYNLAAASPSDFQILLSL 143

Query: 117 LVAHMSDIDEK---------QLLLYAQIANLVNLILQWPEISINEIAEN----FSKLACN 163
             +  S+ D             LL A    L   I   PE++   +A++    F  +   
Sbjct: 144 QGSGCSNGDPSSSATDSGFLHSLLSAVCPPLPVCI--SPELTAALLAKDKVNAFGLM--E 199

Query: 164 AHTICNSELRPLGTGLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAE 216
             ++ N +      G+YP  S  NH CLPNA          +G   ++ R +  VP+G E
Sbjct: 200 PFSVSNDKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGRE 259

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V +SY     +  +RQK L E Y F C C RC
Sbjct: 260 VCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRC 291


>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 679

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 107/500 (21%), Positives = 195/500 (39%), Gaps = 96/500 (19%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KLHR 64
            +GE++I ++PY  +       S C  CF   +   C  C    YC S C+     + H 
Sbjct: 183 AAGELLIKEKPYAAIILKEEESSHCHHCFEQCSPIPCPNCIHARYCSSRCRSDSLTQYHS 242

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYL---RRKLQNDNVIPSTTTD------------ 109
           +EC     L +     V+   RL L++ +   R +L ND   P T++             
Sbjct: 243 IECGTEQLLQQ-----VSVFSRLSLRILITAGREELSNDIRKPKTSSSQPSSSSTSTSLT 297

Query: 110 ---------------NYSLVEALVAHMSDIDEKQLL------------LYAQIANLVNLI 142
                          NY  +  L AH  +   ++L+             Y ++ +L    
Sbjct: 298 TAKGSSSGYLDGGLVNYESIVGLEAHWFNHSHEELIQYTVTSILLAKCFYRELVSLKTCE 357

Query: 143 LQWPEISINEIAE----NFSKLACNAHTICNSELRP----------------------LG 176
               E  I EIA     +  +L  N+H I  +E+RP                      + 
Sbjct: 358 TLSEEELIAEIASLLLLHTRQLKSNSHAI--TEVRPSEGENTAGKSAGGTVEEISQFRVA 415

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKA 234
           T +YP +S++NH+C+PN +  F   +  VRA + +  G E+   Y    G  +T  RQ+A
Sbjct: 416 TAVYPTVSMMNHACIPNIIPSFRKGILCVRATKKIMHGDEIQHCYGPQVGHMTTSDRQQA 475

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG-- 292
           L  QY FTC C  C +  +  D   +  ++  +C          +  +     C +C   
Sbjct: 476 LLNQYCFTCRCRACTRDPKSRD---NLCMKCPKCG---------QPLNAMTSICGKCAVW 523

Query: 293 ---LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 349
              LV  KE    + +   ++     A+      ++V+S  K    +  ++  P  ++L 
Sbjct: 524 IDVLVLLKEVDDTMITFAELVEMYPAAVKDVTLMRDVISKTKTCHDVLDRILIPPHMHLA 583

Query: 350 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
                + K  + L+++KEA  +    IP  +  + +    +  + Y   ++ +   +   
Sbjct: 584 VAYNFMAKCHIALQEFKEAAPWLAKAIPTIECRFGRDSIEVANELYKLAQIYFNGKEIAP 643

Query: 410 AIKSMTEAVEILRITHGTNS 429
           A++++  A+E+  I H  NS
Sbjct: 644 AMEAIDRALELF-IRHYGNS 662


>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
           porcellus]
          Length = 800

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 51/84 (60%), Gaps = 2/84 (2%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 224
           I +S    L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y   E+
Sbjct: 512 ITDSRQVRLATGIFPVVSLLNHSCSPNTSVSFTGTIATIRAAQQIRKGQEILHCYGPHES 571

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
             S   RQ+ L+ QY F C CP C
Sbjct: 572 HMSVAERQQKLRSQYFFDCCCPAC 595


>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
 gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
          Length = 787

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 567 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 626

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 627 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 663



 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 237 EQYLFTCTCPRC 248
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
           grubii H99]
          Length = 447

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 91/192 (47%), Gaps = 13/192 (6%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVR 206
            ++E+    S +  NA T+ +  L P+G  + P++++ NHSC PNA+ VF   GR   + 
Sbjct: 110 GVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIFLV 169

Query: 207 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI---- 262
           A+  +P G E+L SYI+T+     RQ  L  +Y F C C  C K     + ++S +    
Sbjct: 170 ALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVDPRW 229

Query: 263 -LEGYRCKDDGCSGFLLRDSDDK-GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 320
            +    C  +G +   L ++ DK    C  C     KE+ K  A +V  L ++ ++    
Sbjct: 230 CVRHRECSKEGKARMPLENATDKVEVRCIAC-----KEDFKVDADKVRKLVQEGMSFLES 284

Query: 321 GNHQEVVSTYKM 332
                +  +Y +
Sbjct: 285 DERGTLDKSYAL 296


>gi|255072485|ref|XP_002499917.1| predicted protein [Micromonas sp. RCC299]
 gi|226515179|gb|ACO61175.1| predicted protein [Micromonas sp. RCC299]
          Length = 676

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 55/89 (61%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           +A   +K++CNAHT+C+ EL  +G G+YP  +++NH   P A   F+G+  V+RA + + 
Sbjct: 228 VARLLAKISCNAHTVCDEELNAIGVGVYPAAAMVNHGDAPTAAQSFKGKKIVLRATRDLK 287

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLF 241
           +G EV ++Y+E   +   R+ AL   Y F
Sbjct: 288 RGDEVTMAYVELLATRQERRAALHAGYNF 316



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 2/105 (1%)

Query: 346 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
           ++L++ R  + + +  +   D++ A       +P Y+  YP  HP LGL      K+E  
Sbjct: 549 IHLLRVRVLDAMQRAAVTAGDFESARDAAYAVLPAYRLSYPPCHPPLGLHLALIAKIEAH 608

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 448
           L D   A++   EA+  L ++HG NS  ++ +   L E++AE  Y
Sbjct: 609 LVDLNAAVQFAREAISCLNVSHGRNSDVVRMMETLLGESEAELRY 653



 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 6/70 (8%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-----SSNLKKCSACQVVWYCGSNCQKLDWK 61
           +G+ I+S  PY  V N++S  SRCD  FA       +L +C+  +V  Y   + Q   WK
Sbjct: 40  AGDEILSLNPYAAVLNDASRTSRCDHTFAKPSDNGGSLLRCARSKVARYVSRDAQVAAWK 99

Query: 62  L-HRLECQVL 70
             ++ EC  L
Sbjct: 100 RGYKEECASL 109


>gi|392566265|gb|EIW59441.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 452

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 107/235 (45%), Gaps = 26/235 (11%)

Query: 29  RCDGCFA----SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE-CQVLSRLDKEKR-KSVT 82
           RCD C      +  LK+CS C   WYCG+ CQ   WK H  + C+  + L      +++T
Sbjct: 53  RCDACHILQSDAVTLKRCSGCASFWYCGTLCQMGAWKAHHKKLCKNFNTLTTSNEYQALT 112

Query: 83  PSIRLMLKLYLRRKLQND---NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV 139
           P  ++   L L + + +        ST+  + + ++ L    +D+ E  L L        
Sbjct: 113 PHDQVD-ALLLSQMIADSASWRAGQSTSGPHATFLDLLKGPRADVFELPLCLPKG----- 166

Query: 140 NLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF 198
                 P  S++   E + +   N   + +S L     G++P+ S + NHSC+PNA   +
Sbjct: 167 ----ALPSESLSLAKELYGRFGNNNFAL-HSHLNAYAHGVFPLASRLFNHSCIPNAACKY 221

Query: 199 -----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
                E     V A++ + +G E+ I Y++ A    TRQ+AL+  Y F C C  C
Sbjct: 222 IIRASEPVAMQVVALRDIAEGEEITIPYLDPALPYQTRQEALEVNYSFNCDCRLC 276


>gi|391347157|ref|XP_003747831.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Metaseiulus occidentalis]
          Length = 684

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 57/255 (22%), Positives = 115/255 (45%), Gaps = 11/255 (4%)

Query: 5   HVSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           + +G++I   +PY C  ++    + C  C +      C  C+ + YC   C+  D   HR
Sbjct: 248 YSTGDIIHEDDPYACALSSDVLSAVCSYCISRGTSHSCEKCKRLHYCSKRCKSKDRVFHR 307

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLR--RKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           LECQVL  L  E    +    R++++   R  R+ Q+ ++  ++       +  L++H+ 
Sbjct: 308 LECQVLRALGPED-DVLDEETRMVIRTLDRFIRERQSHSISEASFFSCTRTIYDLLSHIE 366

Query: 123 DIDEKQLLLYAQIA-NLVNLILQWPEISINEIAENFSKLACNAHTICNS---ELRPLGTG 178
           D+   +     + A  +++++  +    + E+ +   +   N H+I +         G  
Sbjct: 367 DLSPSEFEQAKERAETILSIVSPFVFTQLEEVMDILQRCRINCHSIVDHNDFHFFSRGRA 426

Query: 179 LYPVISIINHSCLPNA--VLVFEGRLAVVRAVQ--HVPKGAEVLISYIETAGSTMTRQKA 234
           +Y   S  NHSC+ +   V +F GR  ++RA+Q   +    ++ I Y+       +R + 
Sbjct: 427 VYLAASKTNHSCVTSNEYVQIFNGRKILLRAIQDFRIDDPLQMTIHYMPPTLPYESRLRR 486

Query: 235 LKEQYLFTCTCPRCI 249
               Y F C+C +C+
Sbjct: 487 CLNNYYFICSCRKCV 501


>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
 gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
          Length = 844

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   F+ GRLA+VRA + +PKG E+  +Y +     +TR   LK
Sbjct: 624 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 683

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 684 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 720



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   F +G   +V A + +  GAE+ +SY +   ST+ R+  L 
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296

Query: 237 EQYLFTCTCPRC 248
               F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308


>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 764

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 11/147 (7%)

Query: 109 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNA 164
           D  SLVE      +++ E       Q  N+  L    P +++ +  E  ++L      NA
Sbjct: 291 DGRSLVEHHDRLAAEVREG---FRQQARNVRWLAAHGPGLALPDADETVARLFAVAQANA 347

Query: 165 HTICN-SELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISY 221
            T+ + S LRP+G GLY   +++NHSCLPNA   +  EGRL  VRAV+ +  G EV ++Y
Sbjct: 348 LTLTDPSSLRPIGQGLYASAALLNHSCLPNANWSVDGEGRL-CVRAVRPIEAGEEVTVAY 406

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRC 248
           ++       RQ+AL++ + F C C +C
Sbjct: 407 VDPTLPYHARQQALQDHFFFACRCLQC 433


>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
          Length = 409

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)

Query: 134 QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 192
           Q   +VNL+ + PE  +    E+  +L+ N   +  N    P+G  L P+++  NHSC P
Sbjct: 58  QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 116

Query: 193 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
           NA + F+G+ A +RA+  + KG ++ ISYI+       R++AL + + F C C RC+
Sbjct: 117 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 173


>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
          Length = 1624

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 95/402 (23%), Positives = 167/402 (41%), Gaps = 62/402 (15%)

Query: 59   DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
            DW  H+ EC  L R+    R+   P+ RLM ++ LR + + +N    T       ++  +
Sbjct: 1252 DWVDHKFECANLRRV--APRRPAEPA-RLMARIILRWRRELNNPSSETLLGTRRQLKDCM 1308

Query: 119  AHMSDIDEKQLL--LYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELR 173
            +H+ +I   ++    + ++ NL+   L   Q   +S  +I E F  +  N   I N +  
Sbjct: 1309 SHLQEIKRDRVRGGAFVEMLNLLKQFLTAAQLEGVSDLDILEIFGIMCVNTIHISNDD-D 1367

Query: 174  PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV-PKG-AEVLISYIETAGSTMTR 231
              G  LY   S+I+HSC PN    F+G+  V++ ++   PK  A++ ++Y+    +   R
Sbjct: 1368 SFGCALYLAPSLIDHSCYPNLTATFKGQKIVLKVLRPCEPKTVADLSLAYMPVCTTKERR 1427

Query: 232  QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 291
            +K L+E+Y FTC C  C                       G    +L ++D         
Sbjct: 1428 RKTLREEYYFTCECEMC----------------------SGKVPEVLSEAD--------- 1456

Query: 292  GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                      K+  EV  L K +L L+S  NH++ +   +  E L  KL      ++ + 
Sbjct: 1457 ---------PKLTDEVLELEKLSLDLSSPENHRKALKGVE--ELLSTKLKDLDDSDVAKF 1505

Query: 352  REKLIKILMEL--EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
            R  L+     +    +  A  Y   ++PV +RV+ +       +     +L   +  T  
Sbjct: 1506 RAILVAADASVCASSYDRAYNYYSRSLPVMKRVFTENKAEYAYKLVRLARLSTIVVKTSQ 1565

Query: 410  A-------IKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 444
                    I  + EAV + RI  G +    K++ L  EE +A
Sbjct: 1566 DPEVFGPLIALLQEAVRVTRIALGEDHSDTKDVALLYEELRA 1607


>gi|350582164|ref|XP_003481215.1| PREDICTED: SET and MYND domain-containing protein 1-like [Sus
           scrofa]
          Length = 220

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 92/199 (46%), Gaps = 18/199 (9%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           + +VI ++  Y  V  +S     C  CF     L +C  C+   YC   CQK  W  H+ 
Sbjct: 29  AADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKN 88

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC  + R  K   +++  + R+M ++        +  + S        ++  V H  + +
Sbjct: 89  ECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGCLVSVDD-----LQNHVEHFGEEE 143

Query: 126 EKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGL 179
           +K+L +       V+  LQ WP    + S+  I+  F  + CN  T+ +   L+ +G G+
Sbjct: 144 QKELRMD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGI 197

Query: 180 YPVISIINHSCLPNAVLVF 198
           +P + ++NH+C PN  ++F
Sbjct: 198 FPNLGLVNHNCWPNCTVIF 216


>gi|395331093|gb|EJF63475.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 445

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 72/238 (30%), Positives = 107/238 (44%), Gaps = 33/238 (13%)

Query: 29  RCDGCFA----SSNLKKCSACQVVWYCGSNCQKLDWKLH-RLECQVLSR-LDKEKRKSVT 82
           RCDGC+     +  L +CS C   WYCG+ CQ  +W+LH R  C+  +  +   + +++T
Sbjct: 47  RCDGCYMLKSEAVELMRCSGCASSWYCGTTCQNREWRLHHRKMCKHYNAFVASGQYQALT 106

Query: 83  PSIRLMLKLYLRRKLQNDNVIPST---TTDNYSLVEALVAHMSDIDEKQLLL--YAQIAN 137
           P  ++     L  +L  D     T   T    + +  L     D     L L   AQ A 
Sbjct: 107 PHDKV--DAILLSQLVADPEAWGTDRLTDAGRTFLNLLKVPRLDGFVPPLCLSRAAQTAG 164

Query: 138 LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL 196
            V L            AE+      N + I +S L     G++P+ S ++NHSC+PNA  
Sbjct: 165 TVAL------------AEDLYSRFGNNNFILHSHLTSYAHGVFPLASRLLNHSCVPNAAC 212

Query: 197 VF------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            +        R+ VV A++ + +G EV I Y++ A    TRQ AL   Y F C C  C
Sbjct: 213 KYMLAPSEPVRMEVV-ALRDINEGDEVTIPYLDPALPFQTRQDALHVNYGFECGCRLC 269


>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
 gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
          Length = 530

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 149/342 (43%), Gaps = 61/342 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C  C  +         N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKEAPLVLGPKVASA-PLCLSCHRNLLAPEKPRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRLD----------KEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT 108
           D   H+ ECQ++S  +          +E+RK     + ++L     R +Q   + P    
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLL-----RCMQLKAIDPKAFL 179

Query: 109 DNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACN 163
             Y+L +    H+ +  E  L  +L A +   +N +L    WPE+ I  IA   + L  N
Sbjct: 180 KLYNLED----HLKERLETPLYQVLRANLITFINTVLGMKDWPEMDILRIA---AILDTN 232

Query: 164 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYI 222
              +     R     LYP  ++I+H C+PN    F+  + +V  A + + KG  + ISY 
Sbjct: 233 TFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYT 292

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLR 279
           +   ST+ R+  L++   F C+C RC    +LG F   Q        +CK    +G ++ 
Sbjct: 293 QPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC-----LKCK----AGKIIS 343

Query: 280 DS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
            +   +   + CQ C   RS +E+     ++  E+  L K T
Sbjct: 344 QNPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTT 385


>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 635

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 128/294 (43%), Gaps = 55/294 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+V+  + PY  V       + C  C + S NL  CS C V  YC   C+KL W++ H++
Sbjct: 265 GDVVTIENPYAYVIYTQRYYTHCHHCLSRSYNLIPCSYCPVAQYCSEKCRKLAWEMAHQI 324

Query: 66  ECQVLS------RLDKEKRKSVTPSIRLMLKLYLRRK----LQNDNVIPSTTTDNYSL-- 113
           EC +++       +DK+K + +T  IR ++ +  + K    L+ D  +  +  DN S   
Sbjct: 325 ECPIMALVGNLLNVDKDKIRMLTKIIRFLIVVTSKGKKFDELRVDMELAESNPDNRSAGF 384

Query: 114 -------------VEALVAHMSDIDEKQLLLYAQIANLVNLIL---------QWPEISIN 151
                          +L  +M+      +  +A I+ L  ++L         ++    + 
Sbjct: 385 TDEGILDSTSARSALSLATNMTTRPLIGISAFACISALAAILLATQTNFFCNKYEVDQLK 444

Query: 152 EIAENFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 197
           +I+ N+ K+         AC     N  ++        G+GLY   S+ NHSC PN    
Sbjct: 445 DIS-NYPKIIFSGSLMFRACVIMSSNCFSVQQEPGIKTGSGLYVTHSLYNHSCAPNTFRH 503

Query: 198 FEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRC 248
           FE    + RA++ +  G ++  +Y   A + MT   R++ + + Y F C C  C
Sbjct: 504 FEELTMITRALRPIYPGDQIFTNY-GAAYAYMTKSERREKIIQDYFFECDCIAC 556


>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
           distachyon]
          Length = 791

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 72/326 (22%), Positives = 140/326 (42%), Gaps = 52/326 (15%)

Query: 152 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           E+ E+      +   +C+ E   +   +Y   S+ NHSC PN    F  R   +R+ +++
Sbjct: 452 ELTESKGYSTASDAVMCSVEQIRVAQAIYMSGSLFNHSCRPNVHTYFHSRTLFLRSTEYI 511

Query: 212 PKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQES--------- 260
             G+ + +SY   AG    + RQK+L+E Y F+C C  C +L   D + +S         
Sbjct: 512 ESGSPLELSYGPQAGEMDLLERQKSLQENYKFSCRCSSCSELNLSDLVIDSFCCPRSFCL 571

Query: 261 -AILEG--YRCKDD---------------------------GCSGFLLRDSD-----DKG 285
            A+ E   YR +++                                  R+ D     D G
Sbjct: 572 GAVSESTCYRSEENFVHVSLNGSAVCKLSLPDVSKVDEDMEKVGKLFFRNDDADLKIDPG 631

Query: 286 F--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
           +  +C+ Q  L      + + AS++N    K L +    +   + +  + +E++ KKL H
Sbjct: 632 YCMSCRSQLDLSSVVATLDRAASKINRF--KELIVIDDVSEVLITNALQSLEQI-KKLRH 688

Query: 343 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 402
           P+S  L Q  +K+ +   ++ D ++A  +C+ +I + +++Y   H  +  +      ++ 
Sbjct: 689 PYSKALAQAEDKIAEAFAKVGDQEQARKHCEASIKILEKLYHPKHIAIAHELIKLISIKL 748

Query: 403 FLGDTENAIKSMTEAVEILRITHGTN 428
            +GD   A  ++T A  I  + +G +
Sbjct: 749 CMGDKAGAAATVTRAEAIFLLYYGPD 774


>gi|328777896|ref|XP_001120776.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 633

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 73/300 (24%), Positives = 133/300 (44%), Gaps = 57/300 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G++I  ++PY  V       + C  C + S NL  C  C V  YC   C+ L W++ H +
Sbjct: 263 GDIITIEDPYAYVIYTQRYYTHCHHCLSRSYNLIPCLHCPVAQYCSEKCRILAWEMAHDI 322

Query: 66  ECQVLS------RLDKEKRKSVTPSIRLMLKLYLRRK----LQNDNVIPSTTTDNYSL-- 113
           EC +++       +DK+K + +T  IR ++    + K    L+ D  +  +  DN +   
Sbjct: 323 ECPIMALIGNLLHVDKDKIRMLTKIIRFLIIATAKGKNINELRADMKLAESNPDNRTAGF 382

Query: 114 ----------VEALVAHMSDIDEKQLL---LYAQIANLVNLILQWP------EISINEIA 154
                       + ++  +++  + L+    +A I+ L  ++L         +  +N++ 
Sbjct: 383 TDEDILDSTSARSALSLATNMTMRPLIGISAFACISALAAILLATQTNFFCNKYEVNQLK 442

Query: 155 E--NFSKL---------AC-----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
           +  N+S +         AC     N  ++       +G+GLY   S+ NHSC PN    F
Sbjct: 443 DINNYSDIIFCSSIMFRACVIMSSNCFSVQQEPGIKIGSGLYVTNSLYNHSCAPNTFRHF 502

Query: 199 EGRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIKLGQFD 255
           EG   + RA++ +  G ++  SY   A + MT   R++ + + Y F C C  C    +FD
Sbjct: 503 EGLTMITRALKPLYPGDQIFTSY-GAAYAYMTRSERREKIMQDYFFECDCIAC----EFD 557


>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 86/398 (21%), Positives = 158/398 (39%), Gaps = 74/398 (18%)

Query: 59  DW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY----SL 113
           DW + H+ +C+ L ++       V   +  +L   +R++ Q+    P  T D+     + 
Sbjct: 147 DWARCHKQDCKTLKKI----HPRVPSDLAQLLSQVIRKQRQS----PPCTQDDGDCFPTT 198

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL 172
           V+ L +H  ++  +  L  + +  L + I +         + + +    CN+  I +++L
Sbjct: 199 VDQLESHHENLSTESSL--SALHKLKHCIEEEDVPTDPRSLLKMYGATNCNSFGIFDNDL 256

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
             +   +Y   S+ NHSC  N ++VF+ R   +R V+ V +G E  I Y++       R+
Sbjct: 257 IVISDAIYLRASMANHSCDYNCIVVFDERKLQLRTVKDVQEGEECTIGYVDVIHPAKERR 316

Query: 233 KALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 288
             L+E+Y FTC C +CI+    LG                 DDG                
Sbjct: 317 AELEEKYHFTCKCVKCIEEINALG----------------PDDG---------------- 344

Query: 289 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN- 347
                         +  E+  L K    +    N  +     ++ E   K +    S+  
Sbjct: 345 --------------LGEELRDLKKSLEQIVDAENSHDWAKVIQLCEPYLKPMDSSSSLPA 390

Query: 348 ----LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
               L+  R+      ++ + WK+A    QL I  Y   Y +++P +G+     G++   
Sbjct: 391 NHQLLVMLRDTAFFACIQSQSWKKAAEMGQLNIESYIYHYGRYNPNVGMYLLKIGEVLLN 450

Query: 404 LGDTENAIKSMTEAVEILRITHGTNSPFM---KELILK 438
           L     A K   EA  + ++THG     M   K+L+ K
Sbjct: 451 LDRLREARKCFKEAESVFKVTHGLQHSLMASVKKLLFK 488



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 38/66 (57%)

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
           + NHSC  N   VF+G    +R ++ V +G E  ISY++       RQ  L+E+Y FTC 
Sbjct: 1   MANHSCDYNCAGVFDGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCK 60

Query: 245 CPRCIK 250
           C +C++
Sbjct: 61  CVKCVE 66


>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
 gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
          Length = 646

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 84/376 (22%), Positives = 153/376 (40%), Gaps = 54/376 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           GE ++ + P+V V     + S C+ CF  + +   C  C  V YC   C ++   K H+ 
Sbjct: 262 GEELLVERPFVSVLLEKFAKSHCENCFMRTVVPVACPRCADVLYCSEQCREEASKKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D++  
Sbjct: 322 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEQLTPEQLI-SLPKDDFRR 379

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW----------PEISI--NEIAENFSKLA 161
           V  L  H  +        +A +A  +   L+            E+SI  + +  +   + 
Sbjct: 380 VAQLERHQGERQPPNFFQHALMARFLTHCLRAGGYFGSEPKPDEVSIICSLVLRSLQFIQ 439

Query: 162 CNAHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
            N H +      +S  R     +G  +YP +++ NHSC P  V  F G    + +V+ + 
Sbjct: 440 FNTHEVAELHKFSSSGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRPIE 499

Query: 213 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCK 269
            G  +  +Y  + T      RQ  LKE Y F C+C  CI    +FDD+    I   +RC 
Sbjct: 500 AGLPINENYGPMYTQDERSERQARLKELYWFECSCDACIDNWPKFDDLPRDVI--RFRC- 556

Query: 270 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
                     D+ +    C  CG + +  +  K+  +  ++++    L   G + + ++ 
Sbjct: 557 ----------DAPNNCSACVTCGEITNILKGLKVMQDTEMMTRTAKRLYETGEYSKALAK 606

Query: 330 Y-KMIEKLQKKLYHPF 344
           +  +I  + + L  PF
Sbjct: 607 FIDLIRIMYEVLAPPF 622


>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
           P131]
          Length = 582

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)

Query: 42  CSACQVVWYCGSNCQKLDWK-LHRLECQ-VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN 99
           CS C+V  YC   CQ+  WK  H+LEC+ VLS     +  ++ P+   M +L L   L+ 
Sbjct: 116 CSGCKVGRYCSQACQRNAWKTFHKLECKAVLS-----EAPAMGPTTLAMYRLLL---LKK 167

Query: 100 DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ----W-----PEISI 150
           D      T   +  +  L +H  + +E          +L N IL     W      ++  
Sbjct: 168 DGRF---TERQWKAIMQLESHFEEQNE----------DLRNDILAKVHGWGRRMGTDVDF 214

Query: 151 NEIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
             + E    +  N+  +  +E     GT L  V+S+INH C PNA + FEG    VR+++
Sbjct: 215 MTVWELLCIIFTNSMNVRPAEADVSFGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLK 274

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            +  G E+ +SY +     + RQ+ L++   F C C  C
Sbjct: 275 AIAAGGEITVSYCDPRLDVLLRQEILRQTQFFHCECTTC 313


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 133/306 (43%), Gaps = 28/306 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKC-SACQVVWYCGSNCQKLDWKLHRLE 66
           GE ++S+  Y  V  +    + C  C  +++   C   C  V++C   CQ+    +H  E
Sbjct: 29  GESVLSERAYGNVVLSQHRATLCAVCLCAADPDICCDDCSKVFFCSDACQEKLQDVHEKE 88

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD--- 123
           C  L   D   RK+ T    L L + +      D        D   +V A  A + D   
Sbjct: 89  CGALEEADLAARKTSTDVDLLRLLIRILAARSLDTADGKLRADEQGVVHASYAGVQDLVH 148

Query: 124 IDEKQLLLYAQ--IANLVNLILQWPE---ISINEIAENFSKLACNAHTICNSELRPL--G 176
           + +K+   +A    A    ++   P+   + + EI    +++  N++++   + + L   
Sbjct: 149 VLDKEGGAWADHVRAGAKKILEDLPDECHLPVEEILVIAAQINENSYSMDALDEKHLVAA 208

Query: 177 TGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            GL+P+  +INHSC PN      G  +A VRA++ + +G E+ +SYI+       R+K L
Sbjct: 209 VGLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKEL 268

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCS------GFLLRDSDDKGFT 287
           +E   F C C RC        + ES    LEG+ C    CS       +LL   +DK   
Sbjct: 269 RETKHFDCQCGRCAA-----PLSESVDRYLEGFCCP--RCSVMASGKDYLLAQVEDK-LV 320

Query: 288 CQQCGL 293
           C  C L
Sbjct: 321 CSDCQL 326


>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
           Y34]
          Length = 581

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 100/219 (45%), Gaps = 33/219 (15%)

Query: 42  CSACQVVWYCGSNCQKLDWK-LHRLECQ-VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN 99
           CS C+V  YC   CQ+  WK  H+LEC+ VLS     +  ++ P+   M +L L   L+ 
Sbjct: 116 CSGCKVGRYCSQACQRNAWKTFHKLECKAVLS-----EAPAMGPTTLAMYRLLL---LKK 167

Query: 100 DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ----W-----PEISI 150
           D      T   +  +  L +H  + +E          +L N IL     W      ++  
Sbjct: 168 DGRF---TERQWKAIMQLESHFEEQNE----------DLRNDILAKVHGWGRRMGTDVDF 214

Query: 151 NEIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
             + E    +  N+  +  +E     GT L  V+S+INH C PNA + FEG    VR+++
Sbjct: 215 MTVWELLCIIFTNSMNVRPAEADVSFGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLK 274

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            +  G E+ +SY +     + RQ+ L++   F C C  C
Sbjct: 275 AIAAGGEITVSYCDPRLDVLLRQEILRQTQFFHCECTTC 313


>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Ailuropoda melanoleuca]
 gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
          Length = 793

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 71/299 (23%), Positives = 130/299 (43%), Gaps = 15/299 (5%)

Query: 141 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
           L LQ    +I  I +  SK     + I +S L  L TG++PV+S++NHSC PN  + F  
Sbjct: 487 LQLQCNAQAITTIQQTGSK----ENNITDSRLVRLATGIFPVVSLLNHSCSPNTSMSFVS 542

Query: 201 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 258
            +A +RA + + KG E+L  Y    +      RQ+ L+ QY F C CP C K     +  
Sbjct: 543 TVATIRASEKIGKGQEILHCYGPHHSRMGVADRQQKLRSQYFFDCGCPACQK-----EKH 597

Query: 259 ESAILEGYRCKDDGCS--GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 316
            +A+  G R +   CS  G +L+  D        C    S+E +     ++         
Sbjct: 598 RAAV--GPRWEAFCCSRCGAVLQGGDVLSCGSTSCTESVSREHLVSRLQDLRQRVGSARN 655

Query: 317 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 376
           L S G  ++ +    +     +         + +  + L +    L DW+++  + Q ++
Sbjct: 656 LLSNGQLEQAIQLLLVYRHDAENFLSAEHSMVGEMEDALAQAYAALGDWQKSATHLQKSL 715

Query: 377 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            V +  +      +G + +   +L +       A+ ++  A ++L + +G  S  ++EL
Sbjct: 716 RVVEVQHGPASVEMGHELFKLAQLFFNGFAMAEALHTVQRAEKVLMVHYGPCSDEIQEL 774


>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Meleagris gallopavo]
          Length = 706

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 62/264 (23%), Positives = 112/264 (42%), Gaps = 9/264 (3%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 233
           L T  +PV+S++NHSC PN  + F G +A VRA Q +P G E+   Y        +  ++
Sbjct: 431 LATAFFPVLSLLNHSCSPNISVSFSGTVATVRASQPIPSGQEIFHCYGPHRCRMRVAERQ 490

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC--QQC 291
            L  QY F C C  C  L + +   ES  +    C    C         ++   C  + C
Sbjct: 491 QLLSQYFFECRCQAC--LDELESDVESVSMRNSFC----CPSCQASMQGEEMLCCSNEAC 544

Query: 292 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
            L  S+E + +   ++    +K L L    N  E +      +   +    P  + + + 
Sbjct: 545 ALSVSRERLSRRLLDLQQQMEKALELLRDSNADEAIRMLLKCQMDARNFLSPEHLLMGEL 604

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
            + L +I   L  W+EA  + + +I + +  +      +G + +   ++ +       A+
Sbjct: 605 EDHLAQIYATLGKWQEAARHLERSIQLVEMHHGPSSVEMGHELFKLAQILFNGFAVSEAL 664

Query: 412 KSMTEAVEILRITHGTNSPFMKEL 435
            ++  A EIL +  G  S  ++EL
Sbjct: 665 STIQRAEEILSVHCGPQSTQIQEL 688


>gi|443689223|gb|ELT91670.1| hypothetical protein CAPTEDRAFT_90747 [Capitella teleta]
          Length = 673

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 116/285 (40%), Gaps = 44/285 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWK-LHRL 65
           G+ ++ ++PYV V + S    RC  CF        C+ C  V YC   C+   W+  H+ 
Sbjct: 260 GDTVVKEDPYVWVLDPSQYGYRCYHCFKVLTYPVGCTQCMRVRYCSETCRTSAWESYHKT 319

Query: 66  ECQVLSRLDKEKRKSVTPSI-RLMLKLYLRRKLQNDNVIPSTTTD-NYSLVE-------- 115
           EC  L  +   +  ++T  + R +L   + + ++  N   S   D  +SL          
Sbjct: 320 ECPYLPAMLMSRTGNLTSLVLRTILVTGMSKIIRYKNNPKSDQHDAQFSLFTDSKGVYMG 379

Query: 116 ------ALVAHMSDIDEKQLLLYAQIANLVNLILQ---WPEISINEIAENFSKL------ 160
                  L+ H       +LL Y  +   +  IL+   + E   +EI  + S +      
Sbjct: 380 GFVGLYGLLTHTEHRSPSELLQYCFLTLFILAILEKSGFIEKHSSEIKGDISVVLGGIIL 439

Query: 161 ------ACNA---------HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
                 ACN            +  S    +G   +P + ++NHSC P   LVF     + 
Sbjct: 440 RFLQITACNGIEITEMSIGDDLTKSHPESIGLAFFPTVCLVNHSCDPVMELVFYENTCIA 499

Query: 206 RAVQHVPKGAEVLI--SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           RA++++ +G E+ I   Y+        RQ +LK QY F C+C  C
Sbjct: 500 RALRNIEEGQELTIDYGYLYYVSKKQPRQLSLKAQYFFDCSCNAC 544


>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
          Length = 802

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 309 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 363
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSRCQRDAESF--LWAE-HAMVGAIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 424 THGTNSPFMKELILKLEE 441
            HG +S  M   + KL +
Sbjct: 730 RHGPSSIEMGHELFKLAQ 747


>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
          Length = 776

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 59/282 (20%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 224
           I NS+   L TG++PV+S++NHSC PN  + F   +A VRA   + KG E+L  Y   E+
Sbjct: 492 ITNSKQVRLATGIFPVVSLLNHSCSPNTSVSFISTVATVRASVQIGKGQEILHCYGPHES 551

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
                 RQ+ L+ QY F C CP C               E +R             S  +
Sbjct: 552 RMGAAERQQKLRSQYFFDCNCPACQN-------------EKHRTTT---------GSKWE 589

Query: 285 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYH 342
            F C +C  +   +++                  SCGN    E VS  +++ +LQ     
Sbjct: 590 AFCCNRCRTLMQGDDV-----------------LSCGNTSCTEAVSRDRLVSQLQD---- 628

Query: 343 PFSVNLMQTREKLIKILM---ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 399
                 +Q + K+ + L+   +LE   + L  C+     +    P  H ++G       +
Sbjct: 629 ------LQQQIKMAQKLLKNGKLEQAIQLLLGCRYDAESF---LPAEHSVVGEIEDDLAQ 679

Query: 400 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
               LGD + +   + +++ ++ + HG +S  M   + KL +
Sbjct: 680 AYAALGDWQKSAAHLQKSLHVVEVRHGPSSVEMGHELFKLAQ 721


>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
          Length = 170

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 51/77 (66%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP IS++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRC 248
           +K L++QY F C C RC
Sbjct: 61  RKQLRDQYCFECDCFRC 77


>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 650

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 153/367 (41%), Gaps = 49/367 (13%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKKC-SACQVVWYCGSNCQK-LDWKLHRLEC 67
           +++ ++P+V V     S + C  CF   ++  C   C  V +C  +C+   +   H+ EC
Sbjct: 254 IVLVEKPHVSVLLEEYSKTHCSTCFKRVSVPVCCPKCSDVVFCSEDCESSANSGYHKYEC 313

Query: 68  QVLSRLDKEKRKSVTPSIRLML------KLYLRRKLQNDNVIPSTTT----DNYSLVEAL 117
             L    K    S+T  + L +      + +L+ + + D +    T     D+Y  +  L
Sbjct: 314 GFLPIFWKSG-ASITCHMALRIITQQSEEYFLQLRPELDGLTSEQTDKLKHDDYRKIYKL 372

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---------SKLACNAHTIC 168
           V H      +       +A L+N  L        + AE+F           L  NAH I 
Sbjct: 373 VTHEETRSAEDFFQRTLMATLLNACLTLGGFYKTKEAESFIGGLLLHNLQLLQFNAHEI- 431

Query: 169 NSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
            SEL+            +G GLYP +++ NHSC P     ++G    VR+V+ +  G+ +
Sbjct: 432 -SELQREDDRDVGKSVFIGGGLYPTLALFNHSCEPGVTRYYKGNSVCVRSVRSIAAGSMI 490

Query: 218 LISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG-- 272
             +Y  + T      R+  L  QY F+C C  C +    F ++ ++ +    R K DG  
Sbjct: 491 GENYGPLFTQTPRDERRATLLNQYRFSCNCRACSENWPLFSEMDDTVL----RFKCDGGK 546

Query: 273 -CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 329
            CS  LL  S+   F   C  CG   +  +  K+  +   + K    + S G  +  +  
Sbjct: 547 ICSNVLLIPSEINEFMIKCTDCGEHTNIMKGLKLVQDTETMFKLATKMHSAGEIEAAL-- 604

Query: 330 YKMIEKL 336
           YK +E++
Sbjct: 605 YKYVEEM 611


>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
 gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
          Length = 646

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/449 (22%), Positives = 175/449 (38%), Gaps = 82/449 (18%)

Query: 42  CSACQVVWYCGSNCQK-LDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR------ 94
           C  C  V YC   C+K  +  +H  EC       K K         L L+L L       
Sbjct: 213 CDKCLDVIYCSVECKKKFEKSVHSFEC------GKGKFFKSIGIAHLSLRLILAASSNYK 266

Query: 95  --RKLQN--------DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ 144
             +KL +        D ++P    ++Y  V  L+ ++   + K L  YA  A+L+ L L+
Sbjct: 267 ELKKLISSYTETKYFDGIVP-CENNSYHNVYGLITNLEKYEGKHLFQYALTASLLALYLK 325

Query: 145 WP----------------EISINEIAENFSKLACNAHTICNSEL-------------RPL 175
                             EI  + + ++  +L CN H I +                + +
Sbjct: 326 THTSYFSNESNDDDDNNLEIVASTLFKHILQLICNGHAITDVITDDSNDSNAVDLIEKRI 385

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT--RQK 233
           GT +YP  S++NHSC P+ +  F     +VRA + +  G E+   Y   A       RQ+
Sbjct: 386 GTAIYPSASMMNHSCEPSIITSFSNNHLIVRASKPIKAGEEIFNCYGPHAKRMKKEERQE 445

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            L++QY F C C  C +  Q+D ++       Y C +  C G L+  +D     C  C  
Sbjct: 446 ILRKQYFFNCHCMACHQ--QYDIMER---FMAYMCFE--CKGPLIIINDSNNGYCLDC-- 496

Query: 294 VRSKEEIKKIASEVNILSKKTL--ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 351
                      ++ +I+S+  +  +    G   + +  +     + +K+ + F+ +L  +
Sbjct: 497 -----------NKNSIISQDVIRESFKEQGKILDALKKFLNFFIIGRKILYKFNKDLCYS 545

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-----WFLGD 406
            ++      +LED+ + L   + TI   ++ +  F   LG      G             
Sbjct: 546 ADQAASCFSKLEDYDKCLVILKCTILSEEKRFGSFSIELGYALKKIGDASTNYCIQIDQA 605

Query: 407 TENAIKSMTEAVEILRITHGTNSPFMKEL 435
           T   I+    A  I++I  G ++   KE+
Sbjct: 606 TREGIEYYERAKGIIQILMGNDNNDFKEI 634


>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
           [Megachile rotundata]
          Length = 523

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 135/320 (42%), Gaps = 39/320 (12%)

Query: 7   SGEVIISQE-PYVCVPNNSSSISRCDGCFA---SSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +GE+++  + P V  P ++S +  C GC+     +    C+ C   W    NC++     
Sbjct: 66  AGEIVLKDDRPLVAGPMHNS-VPVCLGCYMVLHENTAVPCTKCG--WPLCQNCKE----- 117

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS-----TTTDNYSLVEAL 117
           H  EC   S      R+S   SI      Y     Q  NVI +        ++Y+ +  L
Sbjct: 118 HGTECDFTS-----SRRSDKVSITKFG--YPHPSYQCINVIRALSLKDNNIESYNKLLTL 170

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQW---PEISINEIAENFSKLACNAHTICNSELRP 174
            +H   I  K+  ++ + +NLVN I ++    +I   +I +    L  N H +  ++  P
Sbjct: 171 ESHSDKIKNKETFIFEESSNLVNFIKRFFKSEDIPEEDIRKIIGVLQVNGHEVPLTD--P 228

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
               +Y + S++ H+C  N    F      ++ A   + KG  + I Y +    TM R+ 
Sbjct: 229 PYVAVYELASLLEHNCKANCSKSFTDTGGLIIHAATPITKGDHISICYTDPLWGTMNRRH 288

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQ 289
            L E   F CTC RC    +F      A+    RC +  CSG++L  +      + + C 
Sbjct: 289 HLLETKYFECTCDRCRDPTEF-----GAMFNAIRCSEIDCSGYVLPKTFLGDHREDYVCT 343

Query: 290 QCGLVRSKEEIKKIASEVNI 309
            C  V   E I++   ++ I
Sbjct: 344 NCTTVVPLEIIEETLEDIGI 363


>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 668

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 163/386 (42%), Gaps = 82/386 (21%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGC---FASSNLKK--CSACQVVWYCGSNCQKLDWK 61
           +G V+I + P+    N  +    C  C     SSN  K  C  CQ V +C   C+   W+
Sbjct: 241 AGSVLIVETPFAFSTNKEALGRNCLHCHITLMSSNSVKIPCYYCQTVSFCSEKCRSKAWQ 300

Query: 62  L-HRLECQVLSRL---DKEKRKSVTPSIRLMLKLYL--------------RRKLQ--NDN 101
           + H+ EC +       D E+ +  T  + L  ++ +              ++K+   N+N
Sbjct: 301 IYHQYECFIFDVFFENDSEQIQRNTSYLLLAYRMIISGFLSSTEQIKNIEKKKISFLNNN 360

Query: 102 VIPSTTTD-----------------NYSLVEALVAHMSDIDEKQLLLYA----QIANLVN 140
            +    T+                 +Y  +  L  H + ++ K  L+ A     +A    
Sbjct: 361 FLQYYVTNINKERSNLGTNEIYSPYDYRTILNLETHCTKMEPKTNLIRAIEAIFLAKCFT 420

Query: 141 LILQWPEISINEIAENFSKLA-----------CNAHTICNS---------ELRPLGTGLY 180
            +L   ++ +  + E+F  LA           CNA+ I  +         E R +G  +Y
Sbjct: 421 FVLS--KMDVVYLKESFISLAVAILHHLQAINCNAYEIVENIYDKKTHIWEPRQIGGAIY 478

Query: 181 PVISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKE 237
           P +S+INHSC PN V   +   + VVR ++ V KG E+L  Y     + + ++R + L +
Sbjct: 479 PSVSLINHSCYPNVVRHTYPSGIVVVRTLRFVGKGTEILDCYGPHWFSENKLSRIEYLWK 538

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGL 293
           +Y F CTC  CI+  Q+   +    +  Y+CK   CS  +    L + D +    +QC  
Sbjct: 539 KYRFLCTCDACIQNWQYPLPE----IMKYKCK--TCSEIIGIIALNEKDMQNVLNKQCNC 592

Query: 294 VRSKEEIKKIASEVNILSKKTLALTS 319
            ++   +KKI +++    KK L   S
Sbjct: 593 NKTI-NLKKIKNQLQNSIKKRLNAIS 617


>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
 gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
          Length = 311

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/278 (25%), Positives = 111/278 (39%), Gaps = 42/278 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
           G  ++   P   VP  +     C GCF       C++C     C          LH LEC
Sbjct: 34  GARLLRVAPLFAVPYAAELTRLCGGCFQPRG-AVCASCGSARLCSRCGAGAAGTLHGLEC 92

Query: 68  QVLSRL-DKEKRKSVTPSIRLMLKLY-----LRRKL------------QNDNVIPSTTTD 109
             L+RL D E+  ++  S   +L        +RR +            ++ +VI     D
Sbjct: 93  HALARLRDGEEGLTLAHSDLRLLLRALAVRSMRRAVDAGGDPAAIAAAEDGDVI----VD 148

Query: 110 NYSLVEALVAHMSDIDEKQL-----LLYAQIANLVNLILQWP-EISINEIAENFSKLACN 163
            Y  +E L++ +   D+ +L        A++A     +L      S++E      +L  N
Sbjct: 149 GYDALEGLMSGVDGGDDGELPHDAVATIAEVAKQARFLLAASCRCSMDECVRTLGRLQLN 208

Query: 164 AHTICNSEL------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLA-VVRAVQH 210
              +  SE             RP+G G++P  S  NHSC PN    F+G    VV   + 
Sbjct: 209 GFEMTASEPEEGADEAEGGGHRPVGVGVFPSASYTNHSCAPNCAQRFDGHGCIVVETARD 268

Query: 211 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V  G E+ I Y++       R++ L++ + F C C RC
Sbjct: 269 VRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306


>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 438

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 101/242 (41%), Gaps = 40/242 (16%)

Query: 42  CSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKE-----------------KRKSVTPS 84
           C  C  V YC   C + D   H+ EC V   LD+                   RK +  S
Sbjct: 169 CEDCSFVGYCCKECMEKDSHQHKFECLVFQNLDQSEYSSSLLSEIKLLIRTLSRKWLEQS 228

Query: 85  IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI-----DEKQLLLYAQIANLV 139
           ++     Y + K   D+ +P      YS  + LV+++        D     +   +  L 
Sbjct: 229 LQEEGTQYRKYKQYEDSNVPQENGLRYSDYDQLVSNIEAFSSTLKDSLSYWICDPVIRLG 288

Query: 140 NLILQWPEISINEIAENFSKLACNAHTI-------CNSELRPLGTGLYPVISIINHSCLP 192
           +  +Q  E  + ++     +  CNA  I        N E R  G G+Y   S  NHSC P
Sbjct: 289 SKYVQKKEDGV-DLLNVLLRNRCNAFYIQGRPKNGGNGESR--GCGVYVRNSFFNHSCNP 345

Query: 193 NA-VLVFEGRLAV----VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           N    V E  L V    +RAV+   KG E+ ISYI+TA S   R++ L E YLF C C +
Sbjct: 346 NVNYWVVENTLEVECSLMRAVR---KGEELCISYIDTAASLRDRREKLSEGYLFHCRCEK 402

Query: 248 CI 249
           CI
Sbjct: 403 CI 404


>gi|254569888|ref|XP_002492054.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238031851|emb|CAY69774.1| Hypothetical protein PAS_chr2-2_0152 [Komagataella pastoris GS115]
 gi|328351455|emb|CCA37854.1| SET and MYND domain-containing protein 2 [Komagataella pastoris CBS
           7435]
          Length = 736

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 100/220 (45%), Gaps = 16/220 (7%)

Query: 39  LKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSI------RLMLKL 91
           L +CS C+VV YC  +CQ  DW + H  EC  L   D  K+ S+ PS       R++LK 
Sbjct: 139 LLRCSGCRVVNYCSLSCQLQDWNEFHSKECTYLK--DYLKKYSMVPSTTEVLVNRILLKY 196

Query: 92  YLRR--KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS 149
                 KLQ D +          +    V H+  +++ Q +L        N+  ++ E  
Sbjct: 197 KSNSLFKLQIDMLTSHLEVLEDEVYRGHVEHLQVLEQGQHIL----ERFSNVSSKYAE-D 251

Query: 150 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
           +  +      +  N+  + N  L P G       ++INHSC PN +++F+     ++++ 
Sbjct: 252 LTLLKRLVLTVFVNSTVMYNEFLEPNGLMFDSFFALINHSCEPNILVIFQNGKLSLKSIL 311

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
            +  G E+  +Y  T   T  R+  L+ ++ F C CP C+
Sbjct: 312 DIKPGTELTTNYEFTNLPTALRKLNLRSRFFFDCQCPLCL 351


>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
          Length = 806

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 309 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 363
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 424 THGTNSPFMKELILKLEE 441
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
          Length = 806

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                K F C  C      + + +      
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636

Query: 309 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 363
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733

Query: 424 THGTNSPFMKELILKLEE 441
            HG +S  M   + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751


>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
 gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
          Length = 660

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/386 (22%), Positives = 154/386 (39%), Gaps = 60/386 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           GE ++ + P+V V     + + C+ CF  + +   C  C  V YC   C ++   K H+ 
Sbjct: 262 GEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPKCADVLYCSEQCREEASKKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D+Y  
Sbjct: 322 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEELTPEKLI-SLPKDDYRR 379

Query: 114 VEALVAHMSDIDEKQLLLYAQIANL-------------------VNLI-------LQWPE 147
           V  L  H  +        +  +A                     V++I       LQ+ +
Sbjct: 380 VAQLERHQGERQPSNFFQHVLMARFLTHCLRAGGYFGSEPKPEAVSIICSLVLRSLQFIQ 439

Query: 148 ISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
            + +E+AE   FS                +G  +YP +++ NHSC P  V  F G    V
Sbjct: 440 FNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDPGVVRYFRGTTIHV 492

Query: 206 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAI 262
            +V+ +  G  +  +Y  + T      RQ  LKE Y F C+C  CI    +FDD+    I
Sbjct: 493 NSVRPIEAGLPINENYGPMYTQDGRPERQARLKELYWFECSCDACIDNWPKFDDLPRDVI 552

Query: 263 LEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 319
              +RC   + C+  +      +D    C  CG + +  +  K+  +  ++++    L  
Sbjct: 553 --RFRCDAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYE 610

Query: 320 CGNHQEVVSTY-KMIEKLQKKLYHPF 344
            G + + ++ +  +I  + + L  PF
Sbjct: 611 TGEYSKALAKFIDLIRIMYEVLAPPF 636


>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
           mulatta]
          Length = 802

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                K F C  C      + + +      
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632

Query: 309 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 363
                      CGN    + VVS   ++ +LQ         +L Q      K+L   ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
              + L+ CQ     +  ++ + H ++G       +    LGD + +   + +++ ++ +
Sbjct: 673 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 729

Query: 424 THGTNSPFMKELILKLEE 441
            HG +S  M   + KL +
Sbjct: 730 RHGPSSVEMGHELFKLAQ 747


>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
           harrisii]
          Length = 802

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 117/276 (42%), Gaps = 32/276 (11%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 232
           L TGL+P IS++NHSC PN  L F G +  V A Q + +G E+L  Y    G     TRQ
Sbjct: 525 LATGLFPAISLMNHSCRPNTSLSFRGSVGSVHASQLIARGQEILHCYGPHEGRMDAATRQ 584

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILE---GYRCK----------DDGCSGFLLR 279
           + L+ QY F C C  C    Q +++  + +     G+RC             GCS    R
Sbjct: 585 QKLRSQYFFDCRCQAC----QEEEVHGAGVTPKQGGFRCPTCRAALQGLDPLGCSSGSCR 640

Query: 280 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 339
               +     Q G ++ + E+       +  ++    L +C    E   T +        
Sbjct: 641 AQVSRAHLLGQLGDLQQRVEMAGELLRGDRTARAVQLLLACRREAEDFLTSE-------- 692

Query: 340 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 399
                 V + +  ++L +    L DW ++ A+ Q ++ V +  +      +G + +   +
Sbjct: 693 -----HVLMGEIEDRLAQAYASLGDWPKSAAHLQNSLQVVKAQHGPASVEMGHELFKLAQ 747

Query: 400 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           + +       A+  + +A ++L + +G  +  ++EL
Sbjct: 748 VLFNGFAVTEAVSVIEKAEKVLSLHYGPENEQVQEL 783


>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
 gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
          Length = 530

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 149/342 (43%), Gaps = 61/342 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPEKPRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRLD----------KEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT 108
           D   H+ ECQ++S  +          +E+RK     + ++L     R +Q   + P    
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVAGEEERKESAYCVIMLL-----RCMQLKAMDPDAFL 179

Query: 109 DNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACN 163
             Y+L +    H+ +  E  L  +L A +   +  +L    WPE+ I  IA   + L  N
Sbjct: 180 KLYNLED----HLKERLETPLYQVLRANLITFIKTVLGMKDWPEMDILRIA---AILDTN 232

Query: 164 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYI 222
              +     R     LYP  ++I+H C+PN    F+  + +V  A + + KG  + ISY 
Sbjct: 233 TFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYT 292

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLR 279
           +   ST+ R+  L++   F C+C RC    +LG F   Q        +CK    +G ++ 
Sbjct: 293 QPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC-----LKCK----AGKIIS 343

Query: 280 DS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
            +   +   + CQ C   RS +E+     ++  E+  L K T
Sbjct: 344 LNPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTT 385


>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 558

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/227 (25%), Positives = 98/227 (43%), Gaps = 16/227 (7%)

Query: 35  ASSNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYL 93
           A    K C+ C+ V YC   CQ   WK  H+ EC+VL+     +R  +   +R ++KL  
Sbjct: 75  AGIETKACTGCKRVRYCSKTCQTRAWKREHKYECKVLA---PAERPDLPHGVRAVVKLLG 131

Query: 94  RRK--------LQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW 145
           R K        L  D V     +D  +L +         ++  +L Y         +L  
Sbjct: 132 RLKADPEGKDQLLLDIVQFRPASDPKALEDIQRQDAQRFEDFSMLAYGAWKYAGEPMLG- 190

Query: 146 PEISINEIAENFS-KLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
            ++  N I++ F   +  N   + ++ +   LG    P++   NHSC PN  ++F     
Sbjct: 191 -DMDSNAISKAFFFNIMSNTLQLSSALDDTKLGIAFDPILCSANHSCEPNTAVIFNQPQL 249

Query: 204 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           ++RA   + K  E+ + Y++       RQ  L++ Y F C C +C K
Sbjct: 250 ILRAQTKIKKDEEIFMKYVDITNPFSVRQAELRQSYFFGCRCTKCRK 296


>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
 gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
          Length = 445

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+P+ +I+NH C PNA   F+ G LAVVRA + +PKG E+  +Y +     +TR   LK
Sbjct: 225 GLFPLTAIMNHECTPNASHYFDNGTLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 284

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D+ +    L    C++ GC G ++
Sbjct: 285 MTKNFNCDCVRC-----NDNTENGTYLSALFCREQGCRGLVI 321


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 7/143 (4%)

Query: 137 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--GTGLYPVISIINHSCLPNA 194
           +L+ L+ +W    + E+ +   ++  NAH + +     L  G G++P+ ++INH+C PN 
Sbjct: 252 DLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACRPNC 311

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 254
             V+ G    VR ++ V  GAE+ + YI+   ST  R++ L     F C C RC      
Sbjct: 312 TFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENPSSM 371

Query: 255 DDIQESAILEGYRCKDDGCSGFL 277
           DD      L+G  C D G  G L
Sbjct: 372 DD-----YLDGVCCTDCGERGCL 389


>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
 gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
          Length = 801

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 224
           I NS    L TG++PV+S++NHSC PN  + F   +A +RA Q + KG E+L  Y   E+
Sbjct: 517 ITNSRQVRLATGVFPVVSLLNHSCSPNTSVSFTSTVATIRAAQQIAKGQEILHCYGPHES 576

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
                 RQ+ L  QY F C+CP C
Sbjct: 577 RMGVAERQQRLSSQYFFDCSCPAC 600


>gi|195027211|ref|XP_001986477.1| GH21386 [Drosophila grimshawi]
 gi|193902477|gb|EDW01344.1| GH21386 [Drosophila grimshawi]
          Length = 664

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/377 (22%), Positives = 151/377 (40%), Gaps = 42/377 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDW-KLHRL 65
           G+ ++ + PYV V     + + C+ CF  + +   C  C  V YC   CQ+    K H+ 
Sbjct: 266 GQELLVEHPYVAVLLEKYAHTHCEYCFVRTVVPVACPGCSDVIYCSEQCQQRSADKYHKY 325

Query: 66  ECQVL-------SRLDKEKRKSVTPSIRLMLKLYLRRKLQND---NVIPSTTTDNYSLVE 115
           EC +L       + ++      +  S  L   L LR  L  D     + S   D++  V 
Sbjct: 326 ECGILPIIWRSGASINNHMALRIIASKPLDYFLQLRPSLDEDLSLEQLLSLPKDDFRRVA 385

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQ---------WPE--ISINEIA-ENFSKLACN 163
            L  H  +        Y  +A  +   LQ          P+   +IN +   +   +  N
Sbjct: 386 HLERHQKERAASNFFQYVLMARFLTRCLQSAGYFGTEPQPDQIRTINALLLRSLQFIQFN 445

Query: 164 AHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
            H +      +SE R     +G  +YP +++ NHSC P  V  F G    + +V+ +  G
Sbjct: 446 THEVAELHKYSSEGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRPIEAG 505

Query: 215 AEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCK-D 270
             +  +Y  I T      RQ  LK+ Y F C C  C+     FD++    I   +RC   
Sbjct: 506 LPINENYGPIYTQDKREDRQARLKDLYWFECNCDACLDNWPLFDELPRDLI--RFRCDAP 563

Query: 271 DGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS 328
           + C+  +      +D    C  CG   +  +  K+  +  ++++    L   G + + ++
Sbjct: 564 NNCAAIIEVPPTCNDFMINCVTCGETTNILKGLKVMQDTEMMTRTAKRLYDTGEYSKALN 623

Query: 329 TY-KMIEKLQKKLYHPF 344
            +  ++  + + L  PF
Sbjct: 624 KFVDLLRIMYEVLAPPF 640


>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 823

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 114/269 (42%), Gaps = 35/269 (13%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           +I E+A++      + H      +R +   +Y   S+ NHSC PN VL +EGR+  +RA 
Sbjct: 513 AIYEVADD------DEHGGGTKPIRRVAEAMYATGSLFNHSCRPNTVLNYEGRVLTIRAC 566

Query: 209 QHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
           + + KG EVL  Y   +    S   RQKAL+ QY FTC C  C           S++ +G
Sbjct: 567 EDIAKGKEVLNCYGPHVAHIPSRDERQKALRHQYFFTCRCDAC----------SSSVEQG 616

Query: 266 YR------CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 319
            +      C  +GC+  L  + +      Q+  +   + EI K+A E    +++ L    
Sbjct: 617 GQDFDYLVCGSEGCTRELQINPEQ---LRQEVDIRVVQREI-KLAREFYERARRALDGGG 672

Query: 320 CGNH-----QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 374
                    +    +     +L+ K  HP    L +T + L +    L  ++ A  + + 
Sbjct: 673 GDGSGDGDVERAAQSLAHALRLRAKYLHPQHRELGRTHDALAEAHARLGRYEAAATHLER 732

Query: 375 TIPVYQRVYPQFHPLLGLQYYTCGKLEWF 403
            + V +  +  F   L  QY    +  WF
Sbjct: 733 AVAVLESRFAPFSFELADQYAKLAQA-WF 760


>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
 gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
          Length = 575

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 59/124 (47%), Gaps = 11/124 (8%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGRLAVVRAVQHV 211
            S+L  N  TI  SE  PLG G+YP  S+INHSC PNAV  F        +  +   + V
Sbjct: 257 MSRLIMNGFTISTSEQLPLGVGVYPAASMINHSCRPNAVPTFWFSTPSPPMLQITMCKSV 316

Query: 212 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 271
             G E+ ISY + +     R++ L + Y F C C  C       D +    + G +C+  
Sbjct: 317 RVGDEIAISYCDVSAPRYVRREGLIKNYKFACDCSHC------GDFETDGDMIGLKCQAS 370

Query: 272 GCSG 275
           GC G
Sbjct: 371 GCIG 374


>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
 gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 148/337 (43%), Gaps = 51/337 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P +  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 69  GEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCS--WPLCGRECE-- 123

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLK-----LYLRRKLQNDNVIPSTTTDNYSL 113
           +   HR ECQ++S  + + + + TP      +     + L R +Q    + S+  + ++ 
Sbjct: 124 ESPHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQ----LKSSDPEGFAR 179

Query: 114 VEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTIC 168
           + AL  H+ +     L  +L A +   +  IL    W E+ I  IA   + L  N   + 
Sbjct: 180 LSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA---AILDTNTFELR 236

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGS 227
               R     L+P  ++I+H C PN    F+  + ++  A + + KG  + ISY +   S
Sbjct: 237 QPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILSISYTQPLRS 296

Query: 228 TMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--- 281
           T+ R+  L++   F C C RC    +LG F   Q        +CK    +G ++  +   
Sbjct: 297 TIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTCV-----KCK----AGKIISVNPLQ 347

Query: 282 DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
           +   + CQ C L RS +E+     K+  E+  L K T
Sbjct: 348 NTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384


>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
          Length = 675

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 153/375 (40%), Gaps = 56/375 (14%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQK-LDWKLHRLEC 67
           +++ + P+V       S+  C  CF   ++   C  C  V +C   C++  +   HR EC
Sbjct: 268 ILLLERPHVSALLEDYSLDHCSNCFKRVSVPIACPLCSDVVFCSDECERRANASFHRYEC 327

Query: 68  QVLSRLDKEKRKSVTPSIRLMLKLYLRR------KLQ-------NDNVIPSTTTDNYSLV 114
             L  L        + +  + L++  ++      KLQ        +  I     ++Y  V
Sbjct: 328 GFLPIL---WGSGASITCHMALRMITQKSSEYFAKLQPELAAGLTNEQIDRLPVNDYRKV 384

Query: 115 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA---------ENFSKLACNAH 165
             LV H +    + L     +A L+   L   + + ++ +          N   L  NAH
Sbjct: 385 YQLVTHEATRTPEDLFHRTLMATLLVSCLTLSDYAPDQASCEYLGGLMLHNLQLLQFNAH 444

Query: 166 TICNSEL---RP--------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
            I  SE+   +P        +G GLYP +++ NHSC P     + G    VR V+++  G
Sbjct: 445 EI--SEMIREKPSDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQVCVRTVKNIAAG 502

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDG- 272
                    T      R++ L  QY FTC C  C++    F D+  S I   +RC+    
Sbjct: 503 ---------TRVRREERRETLLNQYRFTCYCEPCMQNWPLFSDMDPSII--RFRCEGGKI 551

Query: 273 CSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 330
           CS  LL  S+   F   C +CG   +  +  K   + ++L +    L S G ++  +  Y
Sbjct: 552 CSNVLLIPSEINDFMVKCTECGEHTNIMKGLKSLQDTDMLFQTATRLHSAGEYEAALMKY 611

Query: 331 -KMIEKLQKKLYHPF 344
            +M+  + + L  P+
Sbjct: 612 VEMMAIMSEVLVPPY 626


>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4-like [Ornithorhynchus anatinus]
          Length = 815

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 161 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 220
           A   H I +S    L TGL+PV+S++NHSC PN  + F GR+AVV+A Q + +G E+L  
Sbjct: 508 ASEDHVIASSRQVRLATGLFPVVSLLNHSCSPNTSVSFSGRVAVVQATQPIGRGQEILHC 567

Query: 221 YI--ETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           Y           R++ LK QY F C C  C++
Sbjct: 568 YGPHRCRMDVAERRQKLKAQYFFDCRCQACLE 599


>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 322

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 66/286 (23%), Positives = 111/286 (38%), Gaps = 46/286 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-ASSNLKKCSACQVVWYCGS--NCQKLDWKLHR 64
           G +++   P+  VP        C GCF A    ++C AC V   C S   C+      H 
Sbjct: 33  GTLLVRVAPFAAVPYPDEMRRSCHGCFRACGAERECGACGVARLCASCAGCETT-KAYHA 91

Query: 65  LECQVLSRL-DKEKRKSVTPSIRLMLKLYLRRKLQNDNVI---------------PSTTT 108
            EC  L+RL D E+  ++  +   +L   L  + ++ +V+                    
Sbjct: 92  YECHALARLRDGERGLTLAHNDLRLLLRVLSVRRRHRDVVASSYATAASDAAAASGDVIV 151

Query: 109 DNYSLVEALVAHMSDIDEKQL------LLYAQIANLVNLILQWPEISINEIAENFSKLAC 162
           D+    +AL++ +   D+ +L      +L+        L+      S++       +L  
Sbjct: 152 DDVDAFDALMSGVDGGDDGELPESSIAMLHEVAKQAKFLVAAEARASVDACVRTLGRLQL 211

Query: 163 NAHTICNSEL-------------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
           N   +  S                     RP+G G+YP  ++ NH C PNA   F+    
Sbjct: 212 NGFEMTASASEEEEEGGRGGGGGGGGGGHRPIGVGVYPSAAMFNHDCAPNAAQRFDAFGC 271

Query: 204 V-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V V   + V KG E+ I Y++       R+  L++ + F C C RC
Sbjct: 272 VRVETTRRVRKGEELTIPYVDVMLGREERRGKLRKNFAFECACARC 317


>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
 gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
          Length = 660

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 155/379 (40%), Gaps = 46/379 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           G+ ++ + P+V V     + + C+ CF  + +   C  C  V YC   C Q+   K H+ 
Sbjct: 262 GDELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRCADVLYCSEQCRQEASKKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D++  
Sbjct: 322 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEELTPEQLI-SLPKDDFRR 379

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW----------PEISI--NEIAENFSKLA 161
           V  L  H  +        +  +A  +   L+            E+SI  + +  +   + 
Sbjct: 380 VAQLERHQGERQPSNFFQHVLMARFLTHCLRAGGYFGAEPKPAEVSIICSLVLRSLQFIQ 439

Query: 162 CNAHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
            N H +      +S  R     +G  +YP +++ NHSC P  V  F G    + +V+ + 
Sbjct: 440 FNTHEVAELHKFSSSGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRPIE 499

Query: 213 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCK 269
            G  +  +Y  + T      RQ  LKE Y F C+C  CI    +FDD+    I   +RC 
Sbjct: 500 AGLPINENYGPMYTQDERAERQARLKELYWFECSCDACIDNWPKFDDLPRDVI--RFRCD 557

Query: 270 -DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 326
             + CS  +      +D    C  CG + +  +  K+  +  ++++    L   G + + 
Sbjct: 558 APNNCSAIIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYETGEYLKA 617

Query: 327 VSTY-KMIEKLQKKLYHPF 344
           ++ +  +I  + + L  PF
Sbjct: 618 LAKFIDLIRIMYEVLAPPF 636


>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
          Length = 777

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/299 (23%), Positives = 129/299 (43%), Gaps = 15/299 (5%)

Query: 141 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
           L LQ    +I  I +  SK     + + +S    L TGL+PV+S++NHSC PN  + F  
Sbjct: 471 LQLQCNAQAITTIQQTGSK----ENMVTDSRQVRLATGLFPVVSLLNHSCSPNTSVSFVS 526

Query: 201 RLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 258
            +A VRA Q + +G E+L  Y   E+      RQ+ L+ QY F C C  C    Q    +
Sbjct: 527 TVATVRASQQIGEGQEILHCYGPHESRMGVAERQQKLRSQYFFDCDCLAC----QNGKCR 582

Query: 259 ESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 316
            +A    E +RC    C G L++  D        C    S++ +     ++     K   
Sbjct: 583 AAAGPRWEAFRCH---CCGELMQGDDVLSCGSASCTESASRDRLVSRLQDLRQRVGKAQT 639

Query: 317 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 376
           L   G  +  +      ++  +         + +  + L +    L DW ++ A+ Q ++
Sbjct: 640 LLRNGKVERAIQLLLGCQRDAESFLSAEHSLVGEIEDALAQAYAALGDWHKSAAHLQRSL 699

Query: 377 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
            V +  +      +G + +   ++ +       A+ ++ +A ++L + +G  +  ++EL
Sbjct: 700 RVVEVRHGPASVEMGHELFKLAQIFFNGFAIPEALNTIQKAEKVLLVHYGRGNEEIQEL 758


>gi|299743688|ref|XP_002910692.1| hypothetical protein CC1G_15023 [Coprinopsis cinerea okayama7#130]
 gi|298405772|gb|EFI27198.1| hypothetical protein CC1G_15023 [Coprinopsis cinerea okayama7#130]
          Length = 449

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 115/271 (42%), Gaps = 32/271 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           +G  I+    +  V   S   +RCD C         L++C+ C   WYC + CQ   W+ 
Sbjct: 32  TGSTILKIPAFATVLQFSQKGNRCDHCMRLPTEGQPLRRCTGCSAYWYCDAQCQSAQWQT 91

Query: 63  HRLECQVLSRLDKEKRKSVTPSI---RLMLKLYLRRKLQNDNVIPST----TTDNYSLVE 115
           H    ++  R+++    +V   +     M  L L   + + + +P+      +   +L  
Sbjct: 92  HHK--RICKRINQFTSSTVFQGLEEHEKMDALLLNHLVASLSTLPTAYNLEQSQEATLFL 149

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
           +L+   S   E  L+         +LI   P +        F +   N  T+ +S L  +
Sbjct: 150 SLLPGPSSGSEPPLICTISPPPKADLI---PTL--------FERFGNNNFTM-HSHLNSI 197

Query: 176 GTGLYPVIS-IINHSCLPNAV--LVFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 229
             G++P+ S   NHSC PNA     F     V   V A++ + +G E+ I Y++ A  T 
Sbjct: 198 AHGIFPLASRSFNHSCSPNAAPKYTFSAHQPVIMEVVALKDIHQGEEICIPYLDPA-LTQ 256

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
           T+++  +  Y F C CP C+ L     I E+
Sbjct: 257 TKRQIFQFTYGFNCNCPACLFLNSIGHIPET 287


>gi|66804135|ref|XP_635865.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
 gi|60464204|gb|EAL62363.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
          Length = 447

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 89/346 (25%), Positives = 141/346 (40%), Gaps = 70/346 (20%)

Query: 7   SGE-VIISQEPYVCVPNNSSSISR--CDGCFA-SSNLKK-----------CSACQVVWYC 51
           +GE ++  +EPYV   N      R  C  CF  SS   K           C  C +VWYC
Sbjct: 91  AGESIVFREEPYVVCMNYKDPYERKICHHCFGLSSTTSKSMITTPDFTLHCETCNIVWYC 150

Query: 52  GSNCQKLDWKLHRLEC------QVLSRLDKEKRKSVTPSIRLMLKLYLR-----RKLQND 100
            + CQ  D   H+ EC      Q  S+ D   + S+   ++L++K YL       K+ ND
Sbjct: 151 SNYCQSNDLTYHKHECFTYKRMQSSSQFDTSCKTSIKLLLKLIIKQYLEIKELSIKINND 210

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI---------- 150
           +  PS++   +  +  L  +++    +++  +  I+  +   +    + I          
Sbjct: 211 SSSPSSSVK-FKDILTLETNLNKFSTQRITEFRMISKFIEKTMDKEFLKIICPTNREVIE 269

Query: 151 --NEIAENFSKLACNAHTI---------CNSELRPLGTGLYPVISIINHSCLPNAVLVFE 199
             N + +    L CN+H I          + E   +G GL+   S+ NHSC PN   V E
Sbjct: 270 FQNNLIKLMCILECNSHDISFTIPQSTKSSYEYCSIGIGLFYHSSMFNHSCNPNICKVIE 329

Query: 200 ----------------GRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
                           G  A   + A++ + K  E+  +YI+   S + R K L+  Y F
Sbjct: 330 SKQHAQPISNEMVEYSGNFATHSMIAIKDIKKDDEISFNYIQITLSKVDRLKKLESAYHF 389

Query: 242 TCTCPRCI-KLGQFDDIQ-ESAILEGYRCKDD--GCSGFLLRDSDD 283
            C C  CI  +     I+  S  +E Y CK     CSG L   +D+
Sbjct: 390 QCKCSSCIGDINHKPSIKSHSQFIEKYICKSKSGNCSGILFPLNDN 435


>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
 gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
          Length = 530

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/280 (27%), Positives = 124/280 (44%), Gaps = 47/280 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRL-----------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
           D   H+ ECQ++S             ++E+++S    I L+  ++L+ K           
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDK----------D 174

Query: 108 TDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLAC 162
            D +  +  L  H+ +  E  L  +L A +   +  +L    WPE+ I  IA   + L  
Sbjct: 175 PDAFLKLYNLEDHLKERLETPLYQVLRANLITFIKTVLGMKDWPEMDILRIA---AILDT 231

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 221
           N   +     R     LYP  ++I+H C+PN    F+  + +V  A + + KG  + ISY
Sbjct: 232 NTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISY 291

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQ 258
            +   ST+ R+  L++   F C+C RC    +LG F   Q
Sbjct: 292 TQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQ 331


>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
          Length = 800

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 119/282 (42%), Gaps = 55/282 (19%)

Query: 165 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 222
           +++ NS    L TG++PVIS++NHSC PN  + F   +A VRA Q + KG E+L  Y   
Sbjct: 506 NSVTNSRQVRLATGIFPVISLLNHSCSPNTSVSFISTVATVRASQQIGKGQEILHCYGPH 565

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 282
           ++      RQ+ L+ QY F C+C  C         ++ +   G R               
Sbjct: 566 QSRMRAAERQQKLRSQYFFDCSCAACQN-------EKHSAATGPRW-------------- 604

Query: 283 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
            + F C+ CG +   +++    S            TSC    E VS   +I +LQ     
Sbjct: 605 -EAFCCRSCGALMQGDDVLICGS------------TSC---TESVSRDHLISRLQD---- 644

Query: 343 PFSVNLMQTREKLIKILMELEDWKEA---LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 399
                 +Q +  + + L+  +  + A   L  CQ      Q      H L+G       +
Sbjct: 645 ------LQQQVGMARKLLRNDKPERAIQLLLGCQRDA---QSFLSGEHSLVGEMEDDLAQ 695

Query: 400 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
               LGD + +   + ++++++ + HG +S  M   + KL +
Sbjct: 696 AYAALGDWQKSATHLQKSLQVVEVRHGPSSVEMGHELFKLAQ 737


>gi|409040050|gb|EKM49538.1| hypothetical protein PHACADRAFT_214100 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 461

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/239 (28%), Positives = 100/239 (41%), Gaps = 28/239 (11%)

Query: 29  RCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKE-KRKSVT 82
           RCD C         L KC+ C   WYCG+ CQ   W + HR  C+  +R     + +++ 
Sbjct: 50  RCDYCHLVPTEGEKLFKCTGCAAFWYCGTACQTKQWNVHHRKVCKRYNRYTASIEYQAMP 109

Query: 83  PSIR---LMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV 139
           P  R   LML   L      D       T         VAH  D+    L   A   N  
Sbjct: 110 PGHRTDALMLSQLLLEVFPKDEFGIEAATHRSD----AVAHFFDL----LKGPASRGNPT 161

Query: 140 NLILQWPE----ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPN- 193
            + L  P     +  + I E F++   N + I +S L     G++P+ S + NHSC+PN 
Sbjct: 162 QISLCRPSNSAAVPPSIIEEVFARFG-NNNFIVHSHLNSYAHGVFPLASRLFNHSCVPNC 220

Query: 194 ----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
                +   E     + A++ +  G E+ I Y++ A     RQ  L+E Y FTC C  C
Sbjct: 221 ASKYVITSTEMMGMEIVALRDIEFGDELTIPYLDPALPFDIRQNTLQESYGFTCNCSLC 279


>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
 gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
 gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
          Length = 799

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
          Length = 403

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 99/215 (46%), Gaps = 33/215 (15%)

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR--RKLQNDNVIPSTTT------------ 108
           H+ EC +L+ L +    S T   R M ++ L   +  +N +      +            
Sbjct: 2   HKDECDILNFLKQYYPSSQTRDFRFMFRVLLNVIKDKKNKSFSKENQSKQWLNHQNPFIF 61

Query: 109 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLI---LQWPE------ISINEIAENFSK 159
           D+Y  +  L   +  +  +Q+  + + A  V  I   L+ P+      ++I+EI E +S 
Sbjct: 62  DSYKYLINLSRTLDKVQPEQMEAFKRSAQSVIAIFNKLRGPKFFDECGVTIDEIIEIYSI 121

Query: 160 LACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPK 213
           +  N H +    L PL     G G++P  S +NHSCLPNA    + + + V R ++ + K
Sbjct: 122 VLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQGMMVFRTLRPIKK 177

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           G E+L SY +       R+K L +QY F C C +C
Sbjct: 178 GEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 212


>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
 gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
          Length = 532

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 148/337 (43%), Gaps = 51/337 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P +  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 69  GEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCS--WPLCGRECE-- 123

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLK-----LYLRRKLQNDNVIPSTTTDNYSL 113
           +   HR ECQ++S  + + + + TP      +     + L R +Q    + ++  + ++ 
Sbjct: 124 ESSHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQ----LKASDPEGFAR 179

Query: 114 VEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTIC 168
           + AL  H+ +     L  +L A +   +  IL    W E+ I  IA   + L  N   + 
Sbjct: 180 LSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA---AILDTNTFELR 236

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGS 227
               R     L+P  ++I+H C PN    F+  + ++  A + + KG  + ISY +   S
Sbjct: 237 QPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILTISYTQPLRS 296

Query: 228 TMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--- 281
           T+ R+  L++   F C C RC    +LG F   Q        +CK    +G ++  +   
Sbjct: 297 TIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTCV-----KCK----AGKIISVNPLQ 347

Query: 282 DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
           +   + CQ C L RS +E+     K+  E+  L K T
Sbjct: 348 NTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384


>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
          Length = 799

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
          Length = 804

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|383866011|ref|XP_003708465.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 636

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 68/292 (23%), Positives = 117/292 (40%), Gaps = 51/292 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           G+++  ++P+       S  + C  C A S NL  C  C V  YC   C+KL W++ H++
Sbjct: 266 GDIVTIEDPFAFTIYMQSYFTHCHHCLARSFNLIPCLKCPVAQYCSETCRKLAWEMAHQI 325

Query: 66  ECQVLS------RLDKEKRKSVTPSIRLMLKLYLRRK----LQNDNVIPSTTTDNYSL-- 113
           EC +L+       +DK K + +T  +R ++    + K    L  D  +  +  DN +   
Sbjct: 326 ECPILALVGNLLNVDKNKIRMLTKIVRFLIVATAKGKKIQELLEDMKVAESNPDNRTAGF 385

Query: 114 -------------VEALVAHMSDIDEKQLLLYAQIANLVNLIL----------------- 143
                          +L  +M+      +  +A I+ L  ++L                 
Sbjct: 386 TDEGILDSTSSRSALSLATNMTTRPLIGISAFACISALAAILLATQTNFFCKKYEVDQLR 445

Query: 144 ---QWPEISI-NEIAENFSKLAC-NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
                PEI     I      + C N   +        G+GLY   S+ NHSC PN    F
Sbjct: 446 DINDLPEIKFCGSIMFRACVIMCSNCFLVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHF 505

Query: 199 EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRC 248
           EG   + RA++ +  G ++  +Y  +    +   R++ + + Y F C C  C
Sbjct: 506 EGMTMITRALEPIFPGDQIFTNYCASYAYMTRSERREKIMQDYFFECDCIAC 557


>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 510

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 125/303 (41%), Gaps = 35/303 (11%)

Query: 148 ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 200
           + ++  +E F +  C    I   E+R +      GLYP ++++NHSC+ N   ++   + 
Sbjct: 179 LKLDRFSEEFIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDY 238

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 260
           R+ + R    VP G E+  SY  +   TM R++ L E   F C CPRC      D  +  
Sbjct: 239 RIRL-RTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRC-----SDPTELG 292

Query: 261 AILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALT 318
             +   +C        L  DS D   T  C  C    S + ++KI   +        A++
Sbjct: 293 THMSSLKCNKCDNGVILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKIIQAEVDTVEAIS 352

Query: 319 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 368
                  ++    +I+K +  L HP    L   R  L ++   ++++          +  
Sbjct: 353 GADGADAILERENVIKKYRSVL-HPHHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 411

Query: 369 LAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVE 419
           +  C+L + V   V P +  + G+  Y          K +W  G  + A +KS M EA  
Sbjct: 412 VDMCRLLLQVLDIVEPGYSRIRGMTLYELHAPLLFLAKGQWNAGVIDEAGLKSKMIEAAT 471

Query: 420 ILR 422
           IL+
Sbjct: 472 ILK 474


>gi|405969486|gb|EKC34455.1| SET and MYND domain-containing protein 4 [Crassostrea gigas]
          Length = 324

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 78/156 (50%), Gaps = 19/156 (12%)

Query: 151 NEIAENFSKLACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEG 200
           + I      L CNAH I     +P          +G+G Y ++S+INHSC P+ V    G
Sbjct: 94  SHILRQIQMLPCNAHEISEILWKPGDPTVTNSIEIGSGAYALLSLINHSCDPSVVRHNYG 153

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRC-IKLGQFDDI 257
            + VVRA++ + KG E+L +Y      T+   R+  L+ QY F C C  C ++L  + DI
Sbjct: 154 NICVVRAIKPIKKGEEILDNYGALYPLTIREERRAKLRPQYFFDCNCDACQLELPLYFDI 213

Query: 258 QESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQC 291
            +   +  ++CKD  CSG  F+ +D D     C  C
Sbjct: 214 PDDVPV--FKCKD--CSGPIFISQDKDLAEAECSSC 245


>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
          Length = 828

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 540 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 599

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 600 PHESRMGVAERQQRLSSQYFFDCRCGAC 627


>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
 gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
          Length = 660

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 154/379 (40%), Gaps = 46/379 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           GE ++ + P+V V     + + C+ CF  + +   C  C  V YC   C ++   K H+ 
Sbjct: 262 GEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRCADVLYCSEQCREEASKKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D++  
Sbjct: 322 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEQLTPEQLI-SLPKDDFRR 379

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW----------PEISI--NEIAENFSKLA 161
           V  L  H  +        +  +A  +   L+            E+SI  + +  +   + 
Sbjct: 380 VAQLERHQGERQPSNFFQHVLMARFLTHCLRAGGYFGSEPKPDEVSIICSLVLRSLQFIQ 439

Query: 162 CNAHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
            N H +      +S  R     +G  +YP +++ NHSC P  V  F G    + +V+ + 
Sbjct: 440 FNTHEVAELHKFSSSGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRPIE 499

Query: 213 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCK 269
            G  +  +Y  + T      RQ  LKE Y F C+C  CI    +FDD+    I   +RC 
Sbjct: 500 AGLPINENYGPMYTQDERSERQARLKELYWFECSCDACIDNWPKFDDLPRDVI--RFRCD 557

Query: 270 -DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 326
             + CS  +      +D    C  CG + +  +  K+  +  ++++    L   G + + 
Sbjct: 558 APNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYETGEYSKA 617

Query: 327 VSTY-KMIEKLQKKLYHPF 344
           +  +  +I  + + L  PF
Sbjct: 618 LPKFIDLIRIMYEVLAPPF 636


>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
          Length = 397

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 129/329 (39%), Gaps = 63/329 (19%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-----KCSACQVVWYCGSNCQKLDWK 61
           +GEV+    P V  P +   + R  GC    N+      KCS C ++  C   CQ  D+ 
Sbjct: 50  AGEVLFVDYPLVYGPRSGIIVQR--GCTVCKNIDSDIFFKCSKCALI-LCSVQCQNSDF- 105

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
            H  +C ++S                             N +P    D+  L  AL A  
Sbjct: 106 -HSGDCSIISHWP--------------------------NKVPIEEVDDTLLSRALTAIR 138

Query: 122 S---DIDEKQLLLYAQIANL------VNLILQWPEISINEIAENFSKLA-CNAHTICNSE 171
           +   + D+K LL   Q   L      +  + Q+ +I ++E  E F  L  C  +T     
Sbjct: 139 ALLLNEDQKYLLTSLQANKLPQYGSEIRDLKQYFDIPLHE--EEFMILVICILNTNAFQM 196

Query: 172 LRPLGT------GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIET 224
             P G       GLYPV SI+NH+C+PN    F G L + V+A + +  G+E+   Y   
Sbjct: 197 ATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQMTVKATKTINAGSEIFTCYSGM 256

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
              T  R+  L +   F C C RC      D  +   +L   +C    C G LL     K
Sbjct: 257 LWGTPARRLYLYKSKHFLCDCERCA-----DPTERGTLLAALKCFSTECQGSLLPIQPLK 311

Query: 285 GFT---CQQCGLVRSKEEIKKIASEVNIL 310
             T   C +CG+    + I  I S +  L
Sbjct: 312 TTTAWRCLECGMRVPNDNICVIQSALGSL 340


>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
 gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
          Length = 804

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
          Length = 505

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 27/236 (11%)

Query: 42  CSACQVVWYCGSNCQ-KLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           C+ C V  YC  +CQ K   K H+ EC+ L R         +PS  L  ++  R      
Sbjct: 92  CTQCGVCGYCSKDCQAKALEKYHKSECEALKR---------SPSADLWTRMLYR------ 136

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-----NE-IA 154
            V+       YS  E    H    +++Q          ++ I +  + SI     NE I 
Sbjct: 137 -VLAMHKQGTYSEQEWQALHHLCGNQRQNNSVFDSHKGID-ISKGAKTSIQSKLGNEAIQ 194

Query: 155 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
           + F K+  N   I + E  P G  L  V S +NHSC PNA + FEG    +R+++ +  G
Sbjct: 195 DLFCKVLTNVMCINHQEGSP-GITLDLVGSFVNHSCDPNAFVFFEGSQLRMRSLKPINAG 253

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
            E+ ++Y E     + R++ L   Y F C C RC K  +  ++ E +  EG+  +D
Sbjct: 254 DEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKK--EHKELHELSFREGFSIRD 307


>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
           boliviensis boliviensis]
          Length = 800

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 32/312 (10%)

Query: 146 PEISINEIA--ENFSKLACNA-------HTIC------NSELRPLGTGLYPVISIINHSC 190
           P++++  +A   +  +L CNA       HT C       S    L TG++PV+S++NHSC
Sbjct: 480 PDVTVWGVAMLRHMLQLQCNAQAITTIQHTGCKEGIVTGSRQVRLATGIFPVVSLLNHSC 539

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E++  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 540 SPNTSVSFISTVATIRASQRIGKGQEIVHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 599

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--IASE 306
              G    +      E + C    C   +      +G     CG     E +++  +AS 
Sbjct: 600 HSEGH--GMAARPRWEAFCCSS--CRAPM------QGDDVLSCGRRGCAESVRRDHLASR 649

Query: 307 VNILSKK---TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
           +  L ++      L S G  +  +      ++  +         L +  + L +    L 
Sbjct: 650 LQDLRQQVGMAQKLLSDGELEPAIQQLSGCQRDAESFLSAEHAVLGEIEDGLAQACAALG 709

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
           DW+++  + Q ++ V +  +      +G + +   ++ +       A+ ++ +A E+L +
Sbjct: 710 DWQKSATHLQNSLRVVEARHGPSSVEMGHELFKLAQIFFNGFAVPEALSTIQKAEEVLSL 769

Query: 424 THGTNSPFMKEL 435
             G     ++EL
Sbjct: 770 HCGPWDDEIQEL 781


>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
          Length = 423

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 124/286 (43%), Gaps = 41/286 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGC----FASSNLKK----------CSACQVVW-YCG 52
           GEVI    P V  P   +S+  C GC     A++ +++          CS C   W  CG
Sbjct: 33  GEVIFRDAPAVVGPK-MASVPVCLGCNRDLMATATVRQQGDPNFRFHQCSRCG--WPVCG 89

Query: 53  SNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIP-------S 105
           + C+  D   HR EC +++        S  P I    +   RR+     ++P       S
Sbjct: 90  AECEAADQ--HRSECSIMAG------SSYRPKICPAREEQARRESAYCVIVPLRVLLLQS 141

Query: 106 TTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLACN 163
            + D Y+ ++ L +H+++  +  L  ++ + +   +  +L   + S   I E  + L  N
Sbjct: 142 HSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGLQQYSAETILEVCAMLDTN 201

Query: 164 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR-AVQHVPKGAEVLISYI 222
              I   E R     LYP+ ++++H C PN    F+  L +V  A   + +GA +  SY 
Sbjct: 202 CFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDALRMVLVATVDIERGATISTSYT 261

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 268
           +    T+ R+ ALK+   F C C RC      D  +    L G+RC
Sbjct: 262 QPLLGTLHRRLALKQSKHFDCQCERCT-----DPTELGTNLSGFRC 302


>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
          Length = 501

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 161/376 (42%), Gaps = 37/376 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-CSACQVVW-YCGSNCQKLDWKLHRL 65
           G+VI S+ P+   P + SS   C GC +  +    CS C   W  CG  C++L  K+H+ 
Sbjct: 29  GDVIFSETPFAYGPKSDSS-CLCLGCHSPVDCAYLCSTCS--WPVCGPECEQL--KVHKE 83

Query: 66  -ECQVLSRLDKEKRKSVTPSIRLMLK---LYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
            ECQV S   K K ++V     + L+   +   R L      P    +  S++EA   H 
Sbjct: 84  NECQVFSNA-KVKFQTVEDPTDVCLQYECITPLRMLLEKERDPKRWEEEISIMEA---HN 139

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG----T 177
            +   + +  + Q  N+V  I + P   +++ +E    L C    I   E R        
Sbjct: 140 EERKSRPIWEFNQ-HNIVEYI-RGP-CKMDKYSEELIHLICGILEINAFEARTPSCYAIR 196

Query: 178 GLYPVISIINHSCLPNAVLVFE----GRL----AVVRAVQHVPKGAEVLISYIETAGSTM 229
            L+P ++I++H+C+ N     +    G L      VRA   +PKG E+  SY  +   T+
Sbjct: 197 CLFPKLAILSHNCVSNIHHAVDCNGDGELKDCVVTVRAAVDIPKGGELFSSYTFSLWPTL 256

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 289
            R++ L+E   F CTCPRC    + D    S  L+  RC D+G         D   + C 
Sbjct: 257 VRREFLRESKFFDCTCPRCSSPTELDTHMSS--LKCQRC-DNGVIISTNTADDACEWKCT 313

Query: 290 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 349
            C    +   ++K+ + +     +   +   G  Q +     +  K +  L HP +  + 
Sbjct: 314 HCDYKTNGRAVRKVFAAIQ---NEIDQVEYIGGPQGIQQRETIFRKYRSVL-HPNNSYMT 369

Query: 350 QTREKLIKILMELEDW 365
             R  LI++  + E +
Sbjct: 370 TLRSALIQLYGKTEGY 385


>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
          Length = 743

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 455 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 514

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 515 PHESRMGVAERQQRLSSQYFFDCRCGAC 542


>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
          Length = 706

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 116/265 (43%), Gaps = 10/265 (3%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 233
           L T  +PV+S++NHSC PN  + F G  A VRA Q +P G E+   Y        +  ++
Sbjct: 430 LATAFFPVLSLLNHSCCPNTSVSFSGTTATVRASQLIPSGQEIFHCYGPHRCRMRVAERQ 489

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            L  QY F C C  C    + D     ++   + C   GC G  ++  D    + + C +
Sbjct: 490 QLLSQYFFECRCQACCDELESDVKSVVSLRNTFCCP--GCRG-PMQGEDMLCCSNEACAI 546

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ---KKLYHPFSVNLMQ 350
             S+E +     ++    K+ L L      ++     KM+ K Q   +    P  + + +
Sbjct: 547 SVSRESLSCRLQDLQQRIKRALELL---RDRKADQAIKMLLKCQVDAECFLSPEHLLMGE 603

Query: 351 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 410
             + L ++   L  W+EA  + + +I + +  +      +G + +   ++ +       A
Sbjct: 604 LEDHLAQVYATLGKWQEAARHLERSIEIVEMHHGPSSIEIGHELFKLAQILFNGFAVSEA 663

Query: 411 IKSMTEAVEILRITHGTNSPFMKEL 435
           + ++  A EIL +  G  SP ++EL
Sbjct: 664 LSTIQRAEEILSVHCGPQSPQIQEL 688


>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
          Length = 716

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 428 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 487

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 488 PHESRMGVAERQQRLSSQYFFDCRCGAC 515


>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
           garnettii]
          Length = 749

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 17/120 (14%)

Query: 146 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 190
           P++ I  +A   +  +L CNA  I              NS    L TG++PV+S++NHSC
Sbjct: 483 PDLDIWGVAMLRHMLQLQCNAQAITTIQQTGSEENIITNSRQVRLATGVFPVVSLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   E+      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSISFCSTVATIRASQQIGKGQEILHCYGPHESRMGVAKRQQKLRSQYFFDCNCPAC 602


>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
          Length = 835

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 166
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514

Query: 167 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 280
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 281 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 341 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 398
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 399 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
           troglodytes]
          Length = 835

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 166
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514

Query: 167 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 280
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 281 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 341 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 398
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 399 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|270011320|gb|EFA07768.1| hypothetical protein TcasGA2_TC005322 [Tribolium castaneum]
          Length = 1086

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 73/273 (26%), Positives = 110/273 (40%), Gaps = 43/273 (15%)

Query: 8    GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQ-KLDWKLHRL 65
            GE+I  ++P+V     ++ +  C  C+    N   C  C    YCG  C+ K   K H+ 
Sbjct: 747  GEIIAVEKPFV-FTLAAADLYHCHECYQLCYNPIPCEICSQTLYCGEECRDKAREKYHQY 805

Query: 66   ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
            EC +L  L    +  V      +L + +   + ++N +P      Y+LVE L    +D  
Sbjct: 806  ECPILISL----KNIVGKHKAFLLAIKMSFMISDENDVPEV----YALVENLSRDNNDEV 857

Query: 126  EKQLLLYAQIANLVNLIL-QWPEISINEIAENFSK-------LACNAHTICNSELRP--- 174
                L+ A + +LV     ++PE  +   AEN  K         C  H     EL P   
Sbjct: 858  FTTALITALMYHLVKTYTGKFPEDDLE--AENKFKHFLMTHLRICLTHAAGIDELYPNQV 915

Query: 175  -----------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
                             +G  LYP  ++  H+C PN      G   V+RAV+ + +G E 
Sbjct: 916  SEGQEPGQELLSFKSETVGCALYPFYALFRHACCPNVFAHHHGTQRVLRAVRTIHEGQEC 975

Query: 218  LISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
             +SY       S   R+  L  QY FTC C  C
Sbjct: 976  FVSYGPYYVEHSKQERKSRLLSQYHFTCKCRAC 1008



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/264 (23%), Positives = 98/264 (37%), Gaps = 39/264 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDW-KLHRL 65
           G+VI  + P V    N   +  C  C+    N   C  C  V YC   C++  + K H+ 
Sbjct: 226 GQVIAVETPCVAALINVV-LFHCHDCYILCYNPIPCKTCTEVVYCSEACRENAFAKYHQK 284

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC +   L   K   +    +  LK+ L  + Q D       TD+    + L+A      
Sbjct: 285 ECPIY--LSMRKLVGIDTHFQWALKMTLLVQTQADKCCDRIDTDDDRTRKTLMA------ 336

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL------------- 172
               L Y  I N   ++    E  +    +NF ++  +   IC+  +             
Sbjct: 337 ---TLFYHLIKNCTTVLAGVDEAGV----KNFKRVLMSYMHICDYHVSDIDEIFVHGGSR 389

Query: 173 ------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 224
                       +YP    + HSC PN +  + G   V+RA++ + KG E   SY  + T
Sbjct: 390 DLELKQETFAKAMYPFSDKLRHSCCPNVMGWYHGVTRVLRAIRTIKKGEECFFSYGPLYT 449

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
                 RQ  +   Y F C C  C
Sbjct: 450 NIEKEERQNYIFFIYNFKCACRAC 473


>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
          Length = 835

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 166
            A +  I  ++++  +Q+   V   L+ P+++I  +A   +  +L CNA       HT  
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGP 514

Query: 167 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
               + +S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y 
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 280
             ++      RQ+ L+ QY F C CP C         +   +  G R             
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615

Query: 281 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
              + F C  CG     +++ +  S             SC   +  VS   ++ +LQ   
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655

Query: 341 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 398
                 +L Q      K+L   ELE   + L+ CQ     +  ++ + H ++G       
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706

Query: 399 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 441
           +    LGD + +   +  ++ ++ + HG +S  M   + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749


>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
          Length = 352

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 51/77 (66%)

Query: 172 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           ++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E+ I Y++   ++  R
Sbjct: 1   MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEER 60

Query: 232 QKALKEQYLFTCTCPRC 248
           +K L++QY F C C RC
Sbjct: 61  RKQLRDQYCFECDCSRC 77


>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
 gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
          Length = 666

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
          Length = 799

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A +RA Q + KG E+L  Y 
Sbjct: 511 NESIITNSRQIRLATGVFPVVSLLNHSCRPNTSVSFTGTVATIRAAQRIAKGQEILHCYG 570

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598


>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
          Length = 666

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 221
           N   I NS    L TG++PV+S++NHSC PN  + F G +A VRA Q + KG E+L  Y 
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437

Query: 222 -IETAGSTMTRQKALKEQYLFTCTCPRC 248
             E+      RQ+ L  QY F C C  C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465


>gi|91089701|ref|XP_974846.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 561

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 73/273 (26%), Positives = 110/273 (40%), Gaps = 43/273 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQ-KLDWKLHRL 65
           GE+I  ++P+V     ++ +  C  C+    N   C  C    YCG  C+ K   K H+ 
Sbjct: 222 GEIIAVEKPFV-FTLAAADLYHCHECYQLCYNPIPCEICSQTLYCGEECRDKAREKYHQY 280

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           EC +L  L    +  V      +L + +   + ++N +P      Y+LVE L    +D  
Sbjct: 281 ECPILISL----KNIVGKHKAFLLAIKMSFMISDENDVPEV----YALVENLSRDNNDEV 332

Query: 126 EKQLLLYAQIANLVNLIL-QWPEISINEIAENFSK-------LACNAHTICNSELRP--- 174
               L+ A + +LV     ++PE  +   AEN  K         C  H     EL P   
Sbjct: 333 FTTALITALMYHLVKTYTGKFPEDDLE--AENKFKHFLMTHLRICLTHAAGIDELYPNQV 390

Query: 175 -----------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
                            +G  LYP  ++  H+C PN      G   V+RAV+ + +G E 
Sbjct: 391 SEGQEPGQELLSFKSETVGCALYPFYALFRHACCPNVFAHHHGTQRVLRAVRTIHEGQEC 450

Query: 218 LISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            +SY       S   R+  L  QY FTC C  C
Sbjct: 451 FVSYGPYYVEHSKQERKSRLLSQYHFTCKCRAC 483


>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
 gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
          Length = 503

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 291
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 292 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 336 LQK 338
           LQ+
Sbjct: 383 LQQ 385


>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 483

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 111/494 (22%), Positives = 199/494 (40%), Gaps = 81/494 (16%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVW-YCGSNCQKLDW 60
           +GE II++ P+V  P  + +   C  CF    L+      CS C   W  CG  C+    
Sbjct: 9   AGEEIITEMPFVIGPK-ACTYPLCLSCFTPWPLEPDDKSLCSKCG--WPVCGEECENAP- 64

Query: 61  KLHRLECQVLSR----------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
           +    ECQV ++          LD      V P +  +  L  R  L+++  +     + 
Sbjct: 65  QHKDYECQVFAQANEKFNVDAALDGNSENGV-PQLECITPL--RLLLESERNVERWNKE- 120

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              V+ + AH     +K      QI N+V+ + +   + ++  +E + +  C    I   
Sbjct: 121 ---VKDMEAHNKTRCQKSQWKSDQI-NIVDYLRK--RLKLDRFSEKYIQTICGILEINTF 174

Query: 171 ELRPL----GTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIE 223
           E+R        GLYP ++++NHSC+ N   ++   + R+ + R    +P G E+  SY  
Sbjct: 175 EVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRL-RTTLKIPAGGELYASYTH 233

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDS 281
           +   T+ R++ L E   F C CPRC      D  +    +   +C   D+G    L    
Sbjct: 234 SLLPTILRREHLLEGKHFACACPRC-----SDPTELGTHMSSLKCNKCDNGIVLPLDSLD 288

Query: 282 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK--MIEKLQKK 339
            +  + C  C    + + ++KI   + I+  +  A  +        + YK   + K  + 
Sbjct: 289 SESTWKCTHCDFSTNGQAVRKI---LRIIQAQVDAAEAISGADGADAIYKRETVMKKYRL 345

Query: 340 LYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPL 389
           + HP    L   R  L ++   ++++          +  +  C+L + V   V P +  +
Sbjct: 346 VLHPHHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKVDMCRLLLQVLDVVEPGYSRV 405

Query: 390 LGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVEILR------------ITHGTN 428
            G+  Y          K +W  G  + A +KS M EA  IL+             + G  
Sbjct: 406 RGMTLYELHAPLLFLAKDQWNAGVIDEAKLKSKMIEAANILKEAVMILSLEPSETSEGQI 465

Query: 429 SPFMKELILKLEEA 442
               KE I++LE++
Sbjct: 466 GLVAKESIIQLEQS 479


>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
 gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
          Length = 503

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 94/390 (24%), Positives = 152/390 (38%), Gaps = 58/390 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-------SSNLKKCSACQVVW-YCGSNCQKLD 59
           GE+++ + P+   P   S I  C GC+          +L +C  C   W  CGS     D
Sbjct: 25  GELVLEELPFARGPKRDSGIV-CLGCYQFLQFGEDGDSLDRCELCD--WPLCGSCADDED 81

Query: 60  WKLHRLECQVLSRLDKEKRKSVTPS-----------IRLMLKLYLRRKLQNDNVIPSTTT 108
              HR ECQV S        +V+             +R++L      +  ++ V P    
Sbjct: 82  VTEHRGECQVFSAARVTFAGNVSDDGVCPQLDCITILRVLLAKEANPERWDNEVAPMEHH 141

Query: 109 DNYSLVEALVAHMSDIDEKQLLLYA-QIANLVNLILQWPEISINEIAENFSKLACNAHTI 167
                 +A V H   ++  Q L    Q+AN           S   I +    L  NA   
Sbjct: 142 KEERQRDADVWHADRVNIAQYLRGPCQLAN---------RFSEELIMQVVGVLEVNAFEA 192

Query: 168 CNSELRPLGTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
            + +  PL   L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T
Sbjct: 193 RSPKGYPLRC-LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYT 251

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRD 280
              T  RQK LK+   FTC C RC+   +LG  F  ++     EG++             
Sbjct: 252 LDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------T 304

Query: 281 SDDKGFTCQQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 336
             D  + C  CG   S  +    ++ + SEVN +    +A       + ++  Y      
Sbjct: 305 EPDTSWNCANCGSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY------ 358

Query: 337 QKKLYHPFSVNLMQTREKLIKILMELEDWK 366
            K L HP        R+ LI++   ++ ++
Sbjct: 359 -KSLLHPLHFIATGLRQLLIEMYGRVQGYE 387


>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 396

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI---ETAGSTMT 230
           G+GLY + S  NHSCLPNA + F     +  V A + +P G E+ ISY+   + + S  +
Sbjct: 296 GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSGQEICISYLSECDLSRSRHS 355

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDI 257
           RQ  LKE YLFTC CP+C+      D+
Sbjct: 356 RQNILKENYLFTCDCPKCLSEADEPDV 382


>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
 gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
          Length = 496

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 27/209 (12%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 291
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 292 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKVNSNSLRTLVQYLNEKSDSFIPNSNYVVIDAKMSVLQR 382

Query: 336 LQKKLYHPFSVNLMQTREKLIKILMELED 364
           LQ+            TR +  + + +L D
Sbjct: 383 LQQGTRGLLRGAGAHTRLRYSRDITQLMD 411


>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
           gorilla gorilla]
          Length = 804

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P++++  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTVWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 CPNTSVSFISTVATIRASQRIQKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
 gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
          Length = 499

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PN+   F+ +    VVRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305


>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
          Length = 804

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
 gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
          Length = 497

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 291
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++ +        + C +C 
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322

Query: 292 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 335
                           G + +K     + + V  L++K+ +     N+  + +   ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382

Query: 336 LQK 338
           LQ+
Sbjct: 383 LQQ 385


>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
          Length = 800

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 66/127 (51%), Gaps = 14/127 (11%)

Query: 155 ENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL-----VFEGRLAVVRAV 208
           E +++   N + I +S L   G G+YP+ S + NHSC+PNAV       FE     V A+
Sbjct: 531 EAYTRFGNN-NFIIHSHLNTFGHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVAL 589

Query: 209 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------QESA 261
           + + +G E+ I Y++ A    TRQ+AL+  Y FTCTCP C   G  D          E A
Sbjct: 590 RDIAEGEEITIPYLDPALPLHTRQEALRLNYGFTCTCPLCSWAGHLDPAPPPARGSAELA 649

Query: 262 ILEGYRC 268
           ILE   C
Sbjct: 650 ILESQLC 656


>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 798

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 70/326 (21%), Positives = 135/326 (41%), Gaps = 64/326 (19%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 232
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 586

Query: 293 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 323
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 587 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 646

Query: 324 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 360
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 647 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 706

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 707 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 766

Query: 421 LRITHGTNSPFMKELI--LKLEEAQA 444
               +G+++  +   +  LK E A+A
Sbjct: 767 FSKYYGSHAETLFSYLPCLKQETAKA 792


>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
 gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
          Length = 518

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 107/416 (25%), Positives = 169/416 (40%), Gaps = 60/416 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH-RLE 66
           GE I+ +EPYV  P   + +S C  CF  S    CS C+    CG      D K+H   E
Sbjct: 29  GEQILLEEPYVTGPYWDADVS-CLNCFRDS----CSTCR---QCGVAPLCYDCKIHDEAE 80

Query: 67  CQVL--SRLDKE---KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           C     S LD+       +V   +R ++ LY   + + D V+    T    L E     +
Sbjct: 81  CSFYRESSLDRHFLYNHFNVVMPVRCLM-LYRSNRERYDEVM----TMESRLEERRGTEI 135

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICN------SELRPL 175
            DI EK ++     + +     +  E++   I      L  N   I        S++  L
Sbjct: 136 WDIHEKFVVKPLMESGIFAEKFEGLEVTGELIQRICGILDANTFEIRGNVDSRESQMGNL 195

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-KGAEVL-ISYIETAGSTMTRQK 233
             GLYP  +++ HSC+PN +L  +G  + VR     P K  E+L ISY  +   T  RQ 
Sbjct: 196 ARGLYPKTALMMHSCMPNTLLSVDG-YSNVRVFTSAPVKMGEILNISYTRSLFGTYDRQT 254

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQC 291
            L++   F C+C RC      D  +    L   RC   DDG   F     D+  + C +C
Sbjct: 255 HLRKGKYFICSCRRCK-----DPTELGTHLSSIRCSECDDGLCSFY---PDEPRWECNKC 306

Query: 292 GLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL--YHPFSVN 347
             +  +E + +I  A+  +I++             EV    ++I K  K L  +H   + 
Sbjct: 307 HKLLKREYVNEILCAARDDIMACPL----------EVRDLERVISKHSKILNPHHSLVLE 356

Query: 348 LMQTREKLIKILMELED--------WKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
             QT    +K +    D         +     C+  +P+ + + P    L+G+  Y
Sbjct: 357 AKQTLMGELKAICMSTDPTAVPKPVLRRKFELCEEILPILRTLEPGLSRLVGIAMY 412


>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|449672885|ref|XP_002160254.2| PREDICTED: SET and MYND domain-containing protein 4-like [Hydra
           magnipapillata]
          Length = 729

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 112/486 (23%), Positives = 186/486 (38%), Gaps = 105/486 (21%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHR 64
           +G +IIS++PY  V       + C  CF    +L  C  C  V +C  +C    W   HR
Sbjct: 243 NGSIIISEKPYAAVLLPHWYKTHCQLCFDKVVSLFPCYECAEVVFCSLSCYNDAWATYHR 302

Query: 65  LECQVLSRLDKEKRKSVTPSIRLML----KLYLR------RKLQNDNVIPSTTTD----- 109
            EC+ LS +  EK      S+R++L    K  LR       K  +   +PS+  +     
Sbjct: 303 FECKKLSLM--EKVGIAHLSLRIVLVSDAKDLLRFLGSDLNKFTDSPTLPSSKIEGCNDQ 360

Query: 110 -----NYSLVEALVAHMSDIDEKQLLLYAQIANLV------------NLILQWPEISINE 152
                +Y  V  L  H   +  + L  Y+    L+            + +LQ     +  
Sbjct: 361 GIYQGDYESVYFLSTHSDRLPIEDLFQYSVAGFLLYKLLINSSFFKTHTVLQQHHFGVGS 420

Query: 153 I-AENFSKLACNAH-------------TICNSELRPLGTGLYPVISIINHSCLPNAVLVF 198
           +   +  +L CNAH             ++ + E   + T +YP  S++NHSC P  +  F
Sbjct: 421 LLIRHIQQLICNAHAVTCLSAEKLDTTSVIDQEQVRIATAIYPTTSLLNHSCEPTILNCF 480

Query: 199 EGRLAVVRAVQHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
                +V+ V+ V KG ++   Y    +  G    R+ AL +QY F C+C  C+      
Sbjct: 481 HKNQLIVKVVKDVVKGEQIFNCYGPHFKRMGYE-DRRAALMQQYFFLCSCEHCV------ 533

Query: 256 DIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 315
                               +    S+  GF C +C L  S EE K  + + +    K +
Sbjct: 534 --------------------YQNGHSNKNGFICFKCKLPLSNEE-KCTSCDTDFC--KDI 570

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 375
            ++      E+                   +N M    +++ +  E    ++ALA     
Sbjct: 571 YVSKANRCDEIF------------------LNAM----RVLSLNSERHSVEKALALFLEC 608

Query: 376 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           +   + +Y   H LL   Y   GK    L D E+A+K +  +V  ++  +G  S      
Sbjct: 609 LNHQKEIYIDNHFLLSRSYDVVGKCYAMLEDYESALKFVKNSVLAIKTIYGDRSIEYTNE 668

Query: 436 ILKLEE 441
           ILKL +
Sbjct: 669 ILKLTD 674


>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
 gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305


>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCAGLVV 305


>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
 gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
          Length = 660

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 82/383 (21%), Positives = 153/383 (39%), Gaps = 52/383 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQ-KLDWKLHR 64
           +GE ++ + P+V V     + S C+ CF  S +   C  C  V YC   C+ +   K H+
Sbjct: 261 AGEELLVERPFVSVLLEKFAKSHCENCFVRSVVPVACPRCADVIYCSEQCRDEAAKKYHK 320

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ---------NDNVIP----STTTDNY 111
            EC ++  +    R   + +  + L+++  + L+         +  + P    S   D++
Sbjct: 321 YECGIVPTI---WRSGASINNHIALRIFASKPLEYFLQLKPTIDKELSPEELLSLPKDDF 377

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-----WPEISINEIA-------ENFSK 159
             V  L  H +         +  +A  +   LQ      PE    ++         +   
Sbjct: 378 RRVAQLERHQNKRPPSNFFQHVLMARFLTKCLQASGYFGPEPKPEQVRAICSLVLRSLQF 437

Query: 160 LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           +  N H +  +EL             +G  +YP +++ NHSC P  V  F G    + +V
Sbjct: 438 IQFNTHEV--AELHKFSSTGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSV 495

Query: 209 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEG 265
           + +  G  +  +Y  I T      RQ  +KE Y F C C  CI     F+D+    I   
Sbjct: 496 RPIEAGLPINENYGPIYTQDKREERQAKMKELYWFECCCDACIDNWPLFEDLPRDVI--R 553

Query: 266 YRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
           +RC+  + C+  +      +D    C  CG V +  +  K+  +  ++++    L   G+
Sbjct: 554 FRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQDTEMMTRTGKRLYETGD 613

Query: 323 HQEVVSTY-KMIEKLQKKLYHPF 344
           + + +  +  +I  +   L  PF
Sbjct: 614 YAKALQKFIDLIRIMYDVLAPPF 636


>gi|242397545|gb|ACS92862.1| MIP11718p [Drosophila melanogaster]
          Length = 514

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 76/280 (27%), Positives = 124/280 (44%), Gaps = 47/280 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRL-----------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
           D   H+ ECQ++S             ++E+++S    I L+  ++L+ K           
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDK----------D 174

Query: 108 TDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLAC 162
            D +  +  L  H+ +  E  L  +L A +   +  +L    WPE+ I  IA   + L  
Sbjct: 175 PDAFLKLYNLEDHLKERLETPLYQVLRANLITFIKTVLGMKDWPEMDILRIA---AILDT 231

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 221
           N   +     R     LYP  ++I+H C+PN    F+  + ++  A + + KG  + ISY
Sbjct: 232 NTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNILFLAKRKIAKGEILSISY 291

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQ 258
            +   ST+ R+  L++   F C+C RC    +LG F   Q
Sbjct: 292 TQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQ 331


>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
          Length = 458

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 82/370 (22%), Positives = 160/370 (43%), Gaps = 50/370 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +GEVI+ + P V  P   +++  C GC+ S ++  +CS C             +W L   
Sbjct: 61  AGEVILQELPLVVGPK-LNTLPLCLGCYKSITDTYRCSRC-------------NWPLCSA 106

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY--------SLVEAL 117
            C+  +     + + + P+   ++  +L +   N  V    T   Y          ++ L
Sbjct: 107 ACEESALHKNGECRMIDPT---LMTNHLSQGAINSQVFQCITPLRYLTLPDSDRERLDEL 163

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGT 177
           V+H+     + + +Y  +   ++  L++  +     +E+  ++     T C  E+R  G 
Sbjct: 164 VSHLEQ--RRGMDIYRLVEQNISSFLRYRLLLTQYDSESIQRVCGILETNC-FEIRIQGR 220

Query: 178 ----GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTR 231
               GLYP  S++NH C+ N   VF+     +R  A + +P G ++  +Y  +  +T+ R
Sbjct: 221 VSVRGLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTLDR 280

Query: 232 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTC 288
           +  LK    F C C RC      D  +   +L   +C   GC G +L  +       + C
Sbjct: 281 RLHLKSTKHFWCQCSRCA-----DPRELGTLLSAVKCT--GCGGAVLSQNPLYQMSNWEC 333

Query: 289 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 348
             CG V+  E++K++     +  K+   L    N  E++  +  I K    + HP S ++
Sbjct: 334 SDCGSVQKVEQVKRVHDSARMELKQIAQLAR--NRPELLEDF--IRKYSGAI-HPDSCHV 388

Query: 349 MQTREKLIKI 358
           ++ +  L+++
Sbjct: 389 IEAKYALVQL 398


>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
 gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
          Length = 497

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PNA   FE +    +VRA   +P+G EV  +Y +     + R   L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC C RC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305


>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
 gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
          Length = 503

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PN+   FE +    +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC CPRC      D  ++ A + G  C+D  C+G ++
Sbjct: 268 KMKKNFTCKCPRC-----SDPTEKGAYISGLYCRDTSCNGLVV 305


>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
 gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
          Length = 666

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 83/380 (21%), Positives = 151/380 (39%), Gaps = 48/380 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQ-KLDWKLHRL 65
           G+ ++ + PYV V     + + C+ CF  + +   C  C  V YC   CQ K   K H+ 
Sbjct: 268 GQELLVEHPYVAVLLEKYAQTHCEFCFMRTVVPVSCPGCSDVIYCSEQCQQKAAAKYHKF 327

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT-------------DNYS 112
           EC +L  +    R   + +  + L++   + L     + ++               D++ 
Sbjct: 328 ECGLLPVI---WRSGASINNHMALRIIASKPLDYFMQLRASLDEELSLEQLLSLPKDDFR 384

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEI-AENFSKLAC--------- 162
            V  L  H  +        Y  +A  +   LQ      +E  +E  S +           
Sbjct: 385 RVAHLERHQKERPPSNFFQYVLMARFLTRCLQAAGYFGSEPKSEQVSAIGGLLLRCLQFI 444

Query: 163 --NAHTICN-----SELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
             N H +       +E R     +G  +YP +++ NHSC P  V  F G    +  V+ V
Sbjct: 445 QFNTHEVAELHKYAAEGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGNTIHINTVRPV 504

Query: 212 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 268
             G  +  +Y  I T      RQ  LKE Y F C C  C++    FDD+    I   +RC
Sbjct: 505 EAGLPINENYGPIYTQDKREDRQARLKELYWFECNCDACLENWPLFDDLPRDII--RFRC 562

Query: 269 K-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 325
           +  + C+  +      +D    C  CG + +  +  K+  +  ++++    L   G++ +
Sbjct: 563 EAPNNCTAVIEVPPSCNDFMIKCVTCGELTNILKGLKVMQDTEMMTRTAKRLYDTGDYAK 622

Query: 326 VVSTY-KMIEKLQKKLYHPF 344
            ++ +  ++  + + L  PF
Sbjct: 623 ALNKFVDLLRIMYEVLAPPF 642


>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Monodelphis domestica]
          Length = 845

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 12/266 (4%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           L TG +PVIS++NHSC PN  L F G + +++A + + +G E+L  Y   E      TRQ
Sbjct: 513 LATGFFPVISLLNHSCRPNTSLSFRGSVGIIQASRLIAQGEEILHCYGPHECRMDVTTRQ 572

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC---Q 289
           + L+ QY F C C  C +  +   +  +    G+RC    C   L  D     F C   +
Sbjct: 573 QKLRSQYFFDCHCQAC-QNEEVCPVSTAPKPSGFRCPS--CKAALQGDD----FLCCDNR 625

Query: 290 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 349
            C    S+ +++    ++    K  L L       + +      ++  +       + + 
Sbjct: 626 TCTASASRAQLEHQVQDLQQQVKAALELLRDDRSAQAIQLLLGCQQDAENFLSSEHMLMG 685

Query: 350 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 409
           +  + L +    L DW  + A+ Q ++ V +  +      LG + +   ++ +       
Sbjct: 686 EIEDHLAQAYASLGDWSRSAAHLQKSLLVVEARHGPASVELGHELFKLAQVLFNGFAVSK 745

Query: 410 AIKSMTEAVEILRITHGTNSPFMKEL 435
           A+ ++ +A ++L I +   +  M+EL
Sbjct: 746 ALCAIEKAEKVLSIHYDPENEQMQEL 771


>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
          Length = 338

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 57/94 (60%), Gaps = 1/94 (1%)

Query: 156 NFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
           +F+KL  N    C++ ++   G  +Y  +S  NHSC PNA +V+ G  AV+R+++ +P+G
Sbjct: 109 SFNKLMPNTFGWCDTFDMSSFGAVVYAELSRANHSCQPNAAVVYNGAAAVLRSMRDIPEG 168

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            EV ISY++   +   R++ L + Y F C C RC
Sbjct: 169 EEVCISYVDPTLARDVRRRELVQSYGFACDCARC 202



 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 35/89 (39%), Gaps = 22/89 (24%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS----------------------SNLKKCSA 44
           +G  I    P V V  +     RC GC  S                       + K+CS 
Sbjct: 25  AGTAIRQALPVVAVVEDDERFRRCAGCGLSVDRALAYGHPGAQAAVEMTGDRPSWKRCSR 84

Query: 45  CQVVWYCGSNCQKLDWKLHRLECQVLSRL 73
           C+ + YC   CQK DWK H+ EC   ++L
Sbjct: 85  CKNIAYCSPGCQKRDWKAHKRECASFNKL 113


>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
          Length = 781

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 70/326 (21%), Positives = 135/326 (41%), Gaps = 64/326 (19%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 232
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 458 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 517

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+   
Sbjct: 518 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 569

Query: 293 L-------------VRSKEEIKKIASEV--NILSKKTLAL-------TSCG-------NH 323
           L             V+ +E++     EV  ++LSK + +L         CG       +H
Sbjct: 570 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 629

Query: 324 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 360
            EV   +  + +++                       +   H ++ ++    +K+ +   
Sbjct: 630 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 689

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
              +  +A  +C+ +I + +R+Y   H ++G +      ++   GD+  A  +   + +I
Sbjct: 690 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 749

Query: 421 LRITHGTNSPFMKELI--LKLEEAQA 444
               +G+++  +   +  LK E A+A
Sbjct: 750 FSKYYGSHAETLFSYLPCLKQETAKA 775


>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
 gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
          Length = 584

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 67/313 (21%), Positives = 128/313 (40%), Gaps = 61/313 (19%)

Query: 149 SINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 207
           S+++  + F KL      + +S E  P+G  +Y   S+ NHSCLPN    F  R   +R 
Sbjct: 240 SVDDPPDQFRKLTSVGDALTSSLEQVPVGQAIYKAASLFNHSCLPNIHAYFLSRTLFIRT 299

Query: 208 VQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 265
            ++V  G  + +SY    G +    R + L ++Y F C C  C +L   D      +L  
Sbjct: 300 TEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYSFRCQCRGCSQLNLSD-----LVLNA 354

Query: 266 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE--------VNILSKKTLAL 317
           + C +  C+G +L  +   G T +     R+ E+ +K  S         V+I    +LAL
Sbjct: 355 FCCVNHNCAGVVLESTIINGETRKLNNFPRAPEK-QKFDSHLQGHKLNIVDINDVASLAL 413

Query: 318 T--------------SCGNHQEVVSTYKMIEK---------------------------- 335
                           CG+H+++ ++++ I K                            
Sbjct: 414 KFNNSSLHIHPGFCLHCGSHRDLDASHEAINKAWSYIKRLQEAIISKDISGTTLLDASRA 473

Query: 336 --LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 393
             + +   H ++ ++ +  + L +    + D++ A  +C+ +I + Q +Y   H ++G +
Sbjct: 474 LGILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHCKESIKILQTLYDPDHIVIGYE 533

Query: 394 YYTCGKLEWFLGD 406
                 ++  L D
Sbjct: 534 LVKLASIQLSLDD 546


>gi|391337809|ref|XP_003743257.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Metaseiulus
           occidentalis]
          Length = 442

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 69/254 (27%), Positives = 121/254 (47%), Gaps = 21/254 (8%)

Query: 6   VSGEVIISQEPYV-CVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKLH 63
           ++G  II  EPY  CV  N  S   C  C      L+ C  C V  YC   CQ+ D+  H
Sbjct: 5   LAGSEIIRCEPYSWCVRTNRLS-DVCSFCIKQKKRLQFCEKCSVAVYCSDKCQENDYVNH 63

Query: 64  RLECQVL----SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
           RLEC +L     RLD   R  +         L L+++  +   I        +L + LVA
Sbjct: 64  RLECSLLRSATERLDTTTRFCIRT-------LTLKKREISLGRITRGEKFRKTLAD-LVA 115

Query: 120 HMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGL 179
            + + D++  +  A+   +  LI    ++++ E+     +L  ++  + + +  PLG GL
Sbjct: 116 PLWEGDQEDKVRNAK--KIFALIQNRMKVTLREVLVCLLQLHVHSIAVEDYKGAPLGRGL 173

Query: 180 YPVISIINHSCLPNAVLV--FEGRLAVVRAVQH--VPKGAEVLISYIETAGSTMTRQKAL 235
           Y   +  +HSC   + +   F+G++ V+RA+Q+  V   A++ +S++  +   M RQ+ L
Sbjct: 174 YLDQTEFDHSCEAESRVGAHFDGKIFVLRALQNFRVCSLAKIRMSFVPRSLPVMRRQELL 233

Query: 236 KEQYLFTCTCPRCI 249
            +++   C C  C+
Sbjct: 234 AQEFYIQCECASCL 247


>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Oreochromis niloticus]
          Length = 725

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 123/556 (22%), Positives = 205/556 (36%), Gaps = 137/556 (24%)

Query: 6   VSGEVIISQEPYVCV--P------------NNSSSI------SRCDGCFASS-NLKKCSA 44
            +GEVI++  PY CV  P            +  S +       RC  C   + +L  C  
Sbjct: 172 AAGEVILTDRPYSCVLIPGMEEVKGKVGRYSRDSEVLFGVEDRRCHRCLTETLSLVPCDG 231

Query: 45  CQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKL------ 97
           C    YC + CQ+  W+  HR EC + + L      S   ++R+ LK  L+  L      
Sbjct: 232 CSYSRYCSTACQQEAWEEHHRWECALGAHLMAMGVMSQL-ALRVALKAGLKNFLLAREPI 290

Query: 98  --------QNDNVIPSTTT--DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW-- 145
                    N +  PS ++   +Y  V  L+ H++        + A  A  + L L    
Sbjct: 291 SDEQKECFNNHSSDPSMSSYGGSYLNVYNLLHHLNHHSPALRFMCAVTAATLYLKLSKAG 350

Query: 146 -PEISINEIAENFSK------------------------LACNAHTI------------- 167
            P +S +   +  +K                        L CNA  I             
Sbjct: 351 PPPVSWSHTPQGPNKEGEDADWNAELWMLGSAVLQHILQLRCNAQAIVVLQDTGATNSLV 410

Query: 168 -CNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------------- 198
             N E+R + T ++P +S++NHSC PN  LVF                            
Sbjct: 411 QSNREIR-IATAVFPALSLLNHSCCPNTSLVFNTGGIIEPSGSALSADFSEGEAEWRHKA 469

Query: 199 EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
           +G    VRA + +  G E+L  Y   +   +T  R++ L+EQY F C C  C +  Q  D
Sbjct: 470 QGVTVTVRAAKLITPGQEILHCYGPHSSRMATQERRRLLQEQYYFLCQCEACSQHQQ--D 527

Query: 257 IQESAILE-GYRCKDDG-----CSGFLLRDSDDK--GFTC--QQCGLVRSKEEIKKIASE 306
           ++E    + G   +D G     C G L + S+D+  GF C    C    S  ++     E
Sbjct: 528 VEEPRQQQPGAEGRDLGLLCIKCKGSLKKRSEDREAGFVCSLSSCSHCMSLSDVNNRLQE 587

Query: 307 VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR-------EKLIKIL 359
           + +  +K + L       E +       +L KK      V L +T        +   +  
Sbjct: 588 IRVNLEKAVELMETDRPGEAL-------RLLKKTLCQSGVILAETHPLQGELADATARAY 640

Query: 360 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
             + DW  A ++ + +       + +    LG Q +   +L +  G    A+  + +   
Sbjct: 641 ASMGDWNNAASHLERSTVAIASQFGKESVELGRQLFKLAQLHFNSGSRGPALSVIPKVRR 700

Query: 420 ILRITHGTNSPFMKEL 435
           +L +  G +   ++EL
Sbjct: 701 LLCLHCGPHCHELQEL 716


>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
 gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
          Length = 492

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC CPRC+     D  ++ + + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRCL-----DPTEKGSFISGLYCRDTNCSGLVV 305


>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
 gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
          Length = 660

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/379 (21%), Positives = 155/379 (40%), Gaps = 46/379 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           GE ++ + P+V V     + + C+ CF  + +   C  C  V YC   C ++   K H+ 
Sbjct: 262 GEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRCADVLYCSEQCREEASKKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D++  
Sbjct: 322 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEELTPEQLI-SLPKDDFRR 379

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW----------PEISI--NEIAENFSKLA 161
           V  L  H  +        +  +A  +   L+            E+SI  + +  +   + 
Sbjct: 380 VAQLERHQGERQPSNFFQHVLMARFLTNCLRAGGYFGSEPKPDEVSIICSLVLRSLQFIQ 439

Query: 162 CNAHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
            N H +      +S  R     +G  +YP +++ NHSC P  V  F G    + +V+ + 
Sbjct: 440 FNTHEVAELHKFSSSGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRPIE 499

Query: 213 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYRCK 269
            G  +  +Y  + T      RQ  LK+ Y F C+C  CI    +FDD+    I   +RC 
Sbjct: 500 AGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACIDNWPKFDDLPRDVI--RFRCD 557

Query: 270 -DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 326
             + CS  +      +D    C  CG + +  +  K+  +  ++++    L   G + + 
Sbjct: 558 APNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYETGEYPKA 617

Query: 327 VSTY-KMIEKLQKKLYHPF 344
           ++ +  +I  + + L  PF
Sbjct: 618 LAKFVDLIRIMYEVLAPPF 636


>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
 gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
          Length = 626

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 16/120 (13%)

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVL----------VFEGRLAVVRAVQHVPKGAEVLI 219
           ++ R +GTGL+P++S+ +HSC PN              + G + +V+ V+ V KG E+ I
Sbjct: 287 ADFRDIGTGLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCI 346

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS-GFLL 278
           SYI+    T  R++ L     F C C RC+      + +E+ ++  Y C +  C+ G+L+
Sbjct: 347 SYIDIMNPTCIRRRELWYSKYFVCRCSRCMS-----ETEENRMVRAYSCGESECTEGYLV 401


>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
           castaneum]
          Length = 393

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 150/368 (40%), Gaps = 71/368 (19%)

Query: 109 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE-- 152
           +NY     L++++     K L  Y  +A++V   LQ       W       P+  +NE  
Sbjct: 28  NNYRYFNRLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELV 85

Query: 153 ------IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAV 195
                 I ++ ++LACN+ TI    C+S        L  + +G++P +SI+NHSC PN  
Sbjct: 86  KLVGGFITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVT 145

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQ 253
             F     VV+A++ + +  E+   Y I+  G     RQ A KE Y F C C  C     
Sbjct: 146 NYFMSDTIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS---- 201

Query: 254 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 313
            D   E  +L+ Y C    C G +    +     C  CG     +  +KI  E    ++K
Sbjct: 202 -DPAHELDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQK 254

Query: 314 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 373
            +   S    + ++ + K+ EK+  K +  F     +   +L    ++L D +    Y  
Sbjct: 255 YMESDSSNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFH 310

Query: 374 LTI----------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 417
           L +                 +Y+      H L G Q   C  L+ FL + E  IK   EA
Sbjct: 311 LWLENEKARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EA 367

Query: 418 VEILRITH 425
             IL + +
Sbjct: 368 KAILNLYY 375


>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
 gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
 gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
          Length = 804

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 130/315 (41%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA       HT      + +S    L TG++PV+S++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVVSLLNHSC 542

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A ++A Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSMSFISTVATIQASQRIRKGQEILHCYGPHKSRMGVAERQQELRSQYFFDCACPAC 602

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                          E +R         +  +   + F C  CG     +++    S   
Sbjct: 603 QT-------------EAHR---------MAAEPRWEAFCCNSCGAPMQGDDVLHCGSR-- 638

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELERAV 677

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L  CQ     +  ++ + H L+G       +    LGD + A   +  ++ ++ + HG
Sbjct: 678 QQLLGCQRDAESF--LWAE-HALVGEIADGLARACAALGDWQKAATHLQRSLRVVEVRHG 734

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749


>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
 gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
 gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
          Length = 567

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/381 (21%), Positives = 155/381 (40%), Gaps = 50/381 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNC-QKLDWKLHRL 65
           GE ++ + P+V V     + + C+ CF  + +   C  C  V YC   C ++   K H+ 
Sbjct: 169 GEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRCADVLYCSEQCREEASKKYHKY 228

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM------------LKLYLRRKLQNDNVIPSTTTDNYSL 113
           EC ++  + +    S+   I L             LK  +  +L  + +I S   D++  
Sbjct: 229 ECGIVPIIWRSG-ASINNHIALRIIASKPLDYFLKLKPTIDEELTPEQLI-SLPKDDFRR 286

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW----------PEISI--NEIAENFSKLA 161
           V  L  H  +        +  +A  +   L+            E+SI  + +  +   + 
Sbjct: 287 VAQLERHQGERQPSNFFQHVLMARFLTNCLRAGGYFGSEPKPDEVSIICSLVLRSLQFIQ 346

Query: 162 CNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
            N H +  +EL             +G  +YP +++ NHSC P  V  F G    + +V+ 
Sbjct: 347 FNTHEV--AELHKFSSSGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRP 404

Query: 211 VPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYR 267
           +  G  +  +Y  + T      RQ  LK+ Y F C+C  CI    +FDD+    I   +R
Sbjct: 405 IEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACIDNWPKFDDLPRDVI--RFR 462

Query: 268 CK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 324
           C   + CS  +      +D    C  CG + +  +  K+  +  ++++    L   G + 
Sbjct: 463 CDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYETGEYP 522

Query: 325 EVVSTY-KMIEKLQKKLYHPF 344
           + ++ +  +I  + + L  PF
Sbjct: 523 KALAKFVDLIRIMYEVLAPPF 543


>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
          Length = 456

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 107/271 (39%), Gaps = 44/271 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH-RLE 66
           G+VI+            S   RCD C   + +K CS C+  +YC + CQ   WK H R  
Sbjct: 29  GQVILRSSSLATSLLPKSRGQRCDECCRQTTVKACSRCKEAFYCDTRCQSAAWKSHHRTT 88

Query: 67  CQVLSRLDKEKRKSVT------PSIRLMLKLY-----------------LRRKLQNDNVI 103
           C +LS   + K   ++        + L++ LY                 +RR+     ++
Sbjct: 89  CALLSNGYRAKHPYISQPEGKQADLDLLITLYGLVATSQPSMFEQSNWAVRRE---PTLL 145

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLAC 162
           P    + +S    L+ H   +    +           L   W     N  +  N S +  
Sbjct: 146 PEEPLECFS---TLLPHSGQVGPPYIPTTFGKTQAEFLKEAWARFENNNFVLHNISSMVP 202

Query: 163 NAHTICNSELRPLGTGLYPVISI-INHSCLPNA--VLVFEGRLAV--VRAVQHVPKGAEV 217
           N+    +        G++P  S   NHSC PNA    V E R A   +RA+  + +  E+
Sbjct: 203 NSGAYAH--------GIFPHASRGFNHSCSPNAWPAFVLEQRQAWLEIRALISIKESEEI 254

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            I Y++ A S   RQ  LK  Y F CTC RC
Sbjct: 255 TIPYLDPALSLPERQARLKATYGFDCTCSRC 285


>gi|384252143|gb|EIE25620.1| hypothetical protein COCSUDRAFT_83627 [Coccomyxa subellipsoidea
           C-169]
          Length = 394

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 156/368 (42%), Gaps = 54/368 (14%)

Query: 42  CSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDN 101
           C AC + WYC +  + LD       C             V   + L  +L L  +L    
Sbjct: 54  CQACPLAWYCCAQHRALDAHHGGASC------------GVPWPVLLPEQLVLATRL---A 98

Query: 102 VIPSTTTDNYSLVEALVAHMSDIDEKQLL-------LYAQIANL-VNLILQ-WPEISINE 152
           V+   T+ +  LV  L  H+ D+   QL        + A  A L   L+L+ + ++ IN 
Sbjct: 99  VVAQVTSRD--LVAGLETHVGDVPSHQLFQEAVMCFIAAACAGLEPQLVLKAYRQVVINA 156

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           IA   ++ A       ++E R +G  +YP  +++NH+C PN    F G    + A   +P
Sbjct: 157 IAITPAEHA-------SAEDR-MGLAIYPRAAMLNHACSPNVAAAFAGTRLHICATSDLP 208

Query: 213 KGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
            G  +   Y   AG ++   R + LKEQY F C C  C     +  +QE+A++ G +C+ 
Sbjct: 209 PGTTLRFCYGPQAGESIREVRLRQLKEQYHFWCRCSAC--EASYSGVQEAAMI-GMKCRL 265

Query: 271 DGCSGFLLRDSDDKGFTCQQCGL-VRSKEEIKKIASEVNILSKKTLALTSCGNHQ----E 325
             C+G ++            CG  + S+E+I+++A      +    A  S   H     E
Sbjct: 266 --CNGPVVPSIG--------CGAEMSSEEQIERMARLSQADAAFASACLSLAQHSAVSAE 315

Query: 326 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 385
            + +     ++Q+ L    +  L +T + L +      D + A+ + Q  + + +  Y  
Sbjct: 316 TLQSMHFCLEVQEVLLVTSNQVLGRTHDALARAYGPRGDTENAMLHAQKALDIVRSNYGD 375

Query: 386 FHPLLGLQ 393
            H ++  Q
Sbjct: 376 GHLVVAHQ 383


>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
           jacchus]
          Length = 798

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 77/156 (49%), Gaps = 9/156 (5%)

Query: 95  RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA 154
           R+L+  +  P TT  + S ++A VA     D   + ++  +     L LQ    +I  I 
Sbjct: 449 RQLEAASFQPVTTGASSSQLKAAVAPELCPD---MTVWGVVMLRHMLQLQCNAQAITTIL 505

Query: 155 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
              SK       +  S    L TG++PVIS++NHSC PN  + F   +A +RA Q + KG
Sbjct: 506 HTGSKEG----IVTGSRQVRLATGIFPVISLLNHSCSPNTSVSFVSTVATIRASQRIGKG 561

Query: 215 AEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            E+L  Y   ++      RQ+ L+ QY F CTCP C
Sbjct: 562 QEILHCYGPHKSRIGVAERQQKLRSQYFFDCTCPAC 597


>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
 gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
          Length = 458

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 105/477 (22%), Positives = 188/477 (39%), Gaps = 83/477 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVW-YCGSNCQ-----KLDW 60
           G ++IS+EP +  P+ ++S + C GCF S +   +C  C   W  C   C+     +L+ 
Sbjct: 18  GHLVISEEPILTGPSYTTS-ALCLGCFDSVDGSYRCKGCN--WPLCNRKCETAGSHQLEV 74

Query: 61  KLH---RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL 117
             H    L C +L   D +    V P+        LR  L     + S     + L+  +
Sbjct: 75  CTHDYSSLSCAMLKTTDIDITDFVGPNTSYQFITPLRCLL-----LKSRDPARWKLISGM 129

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL----- 172
            +H  D   +         N+V +I Q   + ++E +E+     C   T+   EL     
Sbjct: 130 ESHTEDQTREYRWGMEMGRNVVRVIRQ--TLGMDEFSEHEIMKVCGILTVNAFELFTTGP 187

Query: 173 ------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETA 225
                  P    +YP   ++NH CL N +   + +  + VR    V KG  +   Y E  
Sbjct: 188 NKDRDSTPRAWAVYPTTYLMNHDCLANTLTSIDSKNKMHVRTRMPVKKGEALTAEYSECL 247

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR----DS 281
             T  R+  L     F C+C RC      D  +  + +  +RC +  C G ++     D 
Sbjct: 248 WGTEIRRHQLHRYKYFWCSCQRC-----RDPTELGSFISSHRCTN--CGGNVVATKPLDF 300

Query: 282 DDKGFTCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 337
           +  GF C+ C ++   E +K    ++  EV  L  K L        +  +  Y+ + +  
Sbjct: 301 NSTGFRCESCRVIIRGEVLKGTNHQVEEEVFELESKEL--------ETPIEEYEDLVERH 352

Query: 338 KKLYHPFSVNLMQTREKLIKI--------LMELEDWK--EALAYCQLTIPVYQRVYPQFH 387
           + + HP    +++ ++ L           L EL + +  + +  C+  I   + V P+F+
Sbjct: 353 ESILHPNHYAMLRCKKFLAHAYGRQPGYSLAELSEDRLNKKMEMCRTLIDTLEIVSPRFN 412

Query: 388 PLLGLQYYTCGKLEWFLGDTENAIKSMTEA-VEILRITHGTNSPFMKELILKLEEAQ 443
                             DT + +  +  A VE+ R  H       +E+++ L EA+
Sbjct: 413 -----------------DDTAHFLYELHAALVEVARRRHDLGLISNQEMVVALTEAK 452


>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
          Length = 497

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 132/308 (42%), Gaps = 37/308 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSIS-RCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE+I    P +  P  + +    C  C++  +L+ C     +  C   C+K     H+ E
Sbjct: 49  GELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGKKCGLLVCSEKCEKS--LGHQKE 106

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+V+ R  + K  S   S++L+  L   R L           ++ S+V+ L AH SD   
Sbjct: 107 CKVV-RQWQSKPISEDLSVQLVRVLSPIRSLL-------LGEEDKSVVKCLKAHKSD--- 155

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL-------RPLGTGL 179
                  Q    V+++     +++ E  E F +  C+       E+       +    GL
Sbjct: 156 -------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSNAFEVLVGFEDNQASVKGL 208

Query: 180 YPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           YP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  SY     ST  R+  L   
Sbjct: 209 YPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRT 268

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLVRS 296
             F C C RC    +F     S +     CK       +  +S   DK + C+ CG +  
Sbjct: 269 KHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSLIK 322

Query: 297 KEEIKKIA 304
           KEE+  I 
Sbjct: 323 KEEVANIT 330


>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
 gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
          Length = 660

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/383 (21%), Positives = 153/383 (39%), Gaps = 52/383 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQ-KLDWKLHR 64
           +GE ++ + P+V V     + + C+ CF  S +   C  C  V YC   C+ +   K H+
Sbjct: 261 AGEELLVERPFVSVLLEKFAKTHCENCFVRSVVPVACPRCADVIYCSEQCRDEAAKKYHK 320

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ---------NDNVIP----STTTDNY 111
            EC ++  +    R   + +  + L+++  + L+         +  + P    S   D++
Sbjct: 321 YECGIVPTI---WRSGASINNHIALRIFASKPLEYFLQLKPTIDKELSPEELLSLPKDDF 377

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-----WPEISINEIA-------ENFSK 159
             V  L  H +         +  +A  +   LQ      PE    ++         +   
Sbjct: 378 RRVAQLERHQNKRPPSNFFQHVLMARFLTKCLQASGYFGPEPKPEQVRAICSLVLRSLQF 437

Query: 160 LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           +  N H +  +EL             +G  +YP +++ NHSC P  V  F G    + +V
Sbjct: 438 IQFNTHEV--AELHKFSSTGREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSV 495

Query: 209 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEG 265
           + +  G  +  +Y  I T      RQ  +KE Y F C C  CI     F+D+    I   
Sbjct: 496 RPIEAGLPINENYGPIYTQDKREDRQAKMKELYWFECCCDACIDNWPLFEDLPRDVI--R 553

Query: 266 YRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
           +RC+  + C+  +      +D    C  CG V +  +  K+  +  ++++    L   G+
Sbjct: 554 FRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQDTEMMTRTGKRLYETGD 613

Query: 323 HQEVVSTY-KMIEKLQKKLYHPF 344
           + + +  +  +I  +   L  PF
Sbjct: 614 YAKALQKFIDLIRIMYDVLAPPF 636


>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
          Length = 558

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 67/315 (21%)

Query: 146 PEISINEIA--ENFSKLACNAHT-------------ICNSELRPLGTGLYPVISIINHSC 190
           P+++I  +A   +  +L CNA               + +S    L TG++PVIS++NHSC
Sbjct: 287 PDVTIWGVAMLRHMLQLQCNAQAMTTIQRTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 346

Query: 191 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRC 248
            PN  + F   +A +RA Q + KG E+L  Y   ++      RQ+ L+ QY F C CP C
Sbjct: 347 SPNTSVSFISTVATIRASQWIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 406

Query: 249 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 308
                    +   +  G R                + F C  CG     +++ +  S   
Sbjct: 407 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 442

Query: 309 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 366
                     SC   +  VS   ++ +LQ         +L Q      K+L   ELE   
Sbjct: 443 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 481

Query: 367 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 426
           + L+ CQ     +  ++ + H ++G       +    LGD + +   +  ++ ++ + HG
Sbjct: 482 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 538

Query: 427 TNSPFMKELILKLEE 441
            +S  M   + KL +
Sbjct: 539 PSSVEMGHELFKLAQ 553


>gi|146413731|ref|XP_001482836.1| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 637

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 115/284 (40%), Gaps = 46/284 (16%)

Query: 42  CSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           C+ C++  YC   C  L W K+H+ EC V  +L  +        +RL L+L         
Sbjct: 88  CNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQL--KSSFGYQEFVRLALRLCTLYS---- 141

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLILQWPEISINEIAENFS 158
             IPS   +    +E L  H  ++++   L  L  +IA L+   ++  + ++ +I     
Sbjct: 142 --IPSYRKN----LERLTTHREEVEKNATLVRLSEKIAPLLPEKIKASQQTVLKI---LC 192

Query: 159 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAE 216
             A N+ T+ N     +G    P  S+INHSC+PN +      L V+  V   P   G E
Sbjct: 193 LAAINSSTMMNENFEQVGMAFDPTFSLINHSCVPN-LYSIPISLTVISFVATSPIKAGTE 251

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           V  SY      T  R++ L+ ++ FTC C  C     F           Y C + G   F
Sbjct: 252 VFTSYCFNGYPTEVRRRTLETRFYFTCKCSICRNKKNF--------FFSYNCPNCGIMMF 303

Query: 277 LLRDSDDKGF-----------------TCQQCGLVRSKEEIKKI 303
            +  S    F                  C  CGL  + ++++++
Sbjct: 304 SIALSSFFKFGREKMYQAISPTNGTPSVCSSCGLAVALKQVERV 347


>gi|16553701|dbj|BAB71564.1| unnamed protein product [Homo sapiens]
          Length = 313

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 113/271 (41%), Gaps = 52/271 (19%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 232
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 36  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQ 95

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 292
           + L+ QY F C CP C         +   +  G R                + F C  CG
Sbjct: 96  QKLRSQYFFDCACPAC-------QTEAHRMAAGPRW---------------EAFCCNSCG 133

Query: 293 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
                +++ +  S             SC   +  VS   ++ +LQ         +L Q  
Sbjct: 134 APMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ---------DLQQQV 170

Query: 353 EKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 410
               K+L   ELE   + L+ CQ     +  ++ + H ++G       +    LGD + +
Sbjct: 171 RVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKS 227

Query: 411 IKSMTEAVEILRITHGTNSPFMKELILKLEE 441
              +  ++ ++ + HG +S  M   + KL +
Sbjct: 228 ATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 258


>gi|190348269|gb|EDK40693.2| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 637

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 60/212 (28%), Positives = 95/212 (44%), Gaps = 21/212 (9%)

Query: 42  CSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQND 100
           C+ C++  YC   C  L W K+H+ EC V  +L  +        +RL L+L         
Sbjct: 88  CNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQL--KSSFGYQEFVRLALRLCTLYS---- 141

Query: 101 NVIPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLILQWPEISINEIAENFS 158
             IPS   +    +E L  H  ++++   L  L  +IA L+   ++  + ++ +I     
Sbjct: 142 --IPSYRKN----LERLTTHREEVEKNATLVRLSEKIAPLLPEKIKASQQTVLKI---LC 192

Query: 159 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAE 216
             A N+ T+ N     +G    P  S+INHSC+PN +      L V+  V   P   G E
Sbjct: 193 LAAINSSTMMNENFEQVGMAFDPTFSLINHSCVPN-LYSIPISLTVISFVATSPIKAGTE 251

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V  SY      T  R++ L+ ++ FTC C  C
Sbjct: 252 VFTSYCFNGYPTEVRRRTLETRFYFTCKCSIC 283


>gi|380801673|gb|AFE72712.1| SET and MYND domain-containing protein 4, partial [Macaca mulatta]
          Length = 212

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 232
           L TG++PVIS++NHSC PN  + F   +A +RA Q + KG E+L  Y   ++      RQ
Sbjct: 26  LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQ 85

Query: 233 KALKEQYLFTCTCPRC 248
           + L+ QY F CTCP C
Sbjct: 86  QKLRSQYFFDCTCPAC 101


>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
 gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
          Length = 496

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 58/106 (54%), Gaps = 7/106 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 281
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++ ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVVPET 308


>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
 gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
          Length = 498

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 58/106 (54%), Gaps = 7/106 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+ +++NH C+PN+   FE +  + +VRA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 281
           K +  FTC C RC      D  ++ A + G  C+D  CSG ++ ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVVPET 308


>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 302

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 74/163 (45%), Gaps = 17/163 (10%)

Query: 93  LRRKLQNDNVIPSTTTDNYSLVEALVAHM-----SDIDEKQLLLYAQIANLVNLILQWPE 147
           LRR L+  +  P   +D    +  +  +       D+DE   +   + A         P 
Sbjct: 149 LRRLLKLQSHEPEVDSDEREWLSGIARNTLRLMEQDVDEGVKVFGEKDA---------PG 199

Query: 148 ISINEIAENFSKLACNAHTICNSE---LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
             + E+      + CN+HT+  SE     P+GT +Y   S  NHSCLP+A    EG    
Sbjct: 200 YGVEELVRLMCCVRCNSHTLYASEEWPSEPVGTAVYLKGSAFNHSCLPSAEFYNEGTSLR 259

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           VR+V+ +  G EV ISY+    +   R++AL  QY F C C R
Sbjct: 260 VRSVRDISAGEEVTISYVPVTETLWDRRQALWRQYKFDCECDR 302


>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
          Length = 387

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 132/308 (42%), Gaps = 37/308 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSIS-RCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           GE+I    P +  P  + +    C  C++  +L+ C     +  C   C+K     H+ E
Sbjct: 49  GELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGKKCGLLVCSEKCEKSLG--HQKE 106

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+V+ R  + K  S   S++L+  L   R L           ++ S+V+ L AH SD   
Sbjct: 107 CKVV-RQWQSKPISEDLSVQLVRVLSPIRSLL-------LGEEDKSVVKCLKAHKSD--- 155

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL-------RPLGTGL 179
                  Q    V+++     +++ E  E F +  C+       E+       +    GL
Sbjct: 156 -------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSNAFEVLVGFEDNQASVKGL 208

Query: 180 YPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           YP+ S+ NHSC PN   VF+ +   VVRA + +P+G+E+  SY     ST  R+  L   
Sbjct: 209 YPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRT 268

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGFTCQQCGLVRS 296
             F C C RC    +F     S +     CK       +  +S   DK + C+ CG +  
Sbjct: 269 KHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTNSLQTDK-WQCEVCGSLIK 322

Query: 297 KEEIKKIA 304
           KEE+  I 
Sbjct: 323 KEEVANIT 330


>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Taeniopygia guttata]
          Length = 797

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/295 (23%), Positives = 121/295 (41%), Gaps = 24/295 (8%)

Query: 159 KLACNAHTIC-NSELRP------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
           +L CNA  I    EL P            L T  +PV+S++NHSC PN  + F G  A V
Sbjct: 491 QLQCNAQAITVMQELGPGDGAVVDKKPVRLATAFFPVLSLLNHSCCPNTSMSFSGTAATV 550

Query: 206 RAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 263
           RA Q +  G EVL  Y           RQ+ L+ QY F C CP C++  +       +I 
Sbjct: 551 RASQPISSGQEVLHCYGPHWCRMRVAERQQLLR-QYFFECRCPACLEELESGVKSVVSIR 609

Query: 264 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 323
             + C    C   +  + D    + + C    S+  +     ++    KK L +   G  
Sbjct: 610 NSFCCPK--CQAQMQGEEDTLCCSNEACATSASRNHLSGRLQDLQQQIKKALGMLRVGKA 667

Query: 324 QEVVSTYKMIEKLQ---KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 380
            + +   KM+ K Q        P  + + +  + L ++   L  W+EA  + + +I + +
Sbjct: 668 DQAI---KMLLKCQMDAGTFLSPEHLLMGEMEDHLAQVYATLGKWQEAARHLKKSIEIVE 724

Query: 381 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             +       G + +   ++ +       A+ ++  A  IL +  G  +  ++EL
Sbjct: 725 MHHGPSSVETGHELFKLAQILFNGFAVSEALSTIQRAEGILSVHFGPQNAQIQEL 779


>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
 gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
          Length = 497

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 152/373 (40%), Gaps = 41/373 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSS---ISRCDGCFASSNLKKCSACQVVWYCGSNCQ--KLDWK 61
           +G+VI+ + P+   P  +SS   +  C     S+N  KCS C   W     C   K +  
Sbjct: 25  AGDVILEEIPFAIGPKVNSSPLCLECCCPVDGSANGPKCSDCG--WPMCEECNEDKDNVV 82

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKL----YLRRKLQNDNVIPSTTTDNYSLVEAL 117
            H+ EC +  +  K K ++V  S    ++L     LR  L  ++  P       S++E  
Sbjct: 83  YHKKECALFVK-GKAKFQNVEDSTASCMQLDCITPLRMLLAKEDH-PERWDAELSVME-- 138

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWP-----EISINEIAENFSKLACNAHTICNSEL 172
             + +D  +     +    N+V   L+ P       S   I +    L  NA  +  S+ 
Sbjct: 139 --YHNDARKDGTTWHQDQQNIVGY-LRGPCGLKDRFSEEVIQQVIGILDINAFEVRTSKG 195

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
                GLYP + ++ HSC+PN V      +G   + R    +P+G ++   Y  T   T 
Sbjct: 196 YS-ARGLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTS 254

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKG 285
           TRQ ALK    FTC CPRC+     D  +        +C+   C   L+  S    D+  
Sbjct: 255 TRQAALKSSKYFTCRCPRCL-----DPTELGTHFSSLKCQK--CDNGLIVSSNPLDDEAE 307

Query: 286 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 345
           + C  C        ++K    + ++  +   L+   N  E ++ ++ + K  + + HP  
Sbjct: 308 WRCSNCEFKTRGAAVQKA---IQVMQAEIDQLSCLENGPEKLTAFEKLYKKYRSILHPLH 364

Query: 346 VNLMQTREKLIKI 358
                 R  LI++
Sbjct: 365 FIKTSIRHSLIEL 377


>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
 gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
          Length = 527

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 146/333 (43%), Gaps = 43/333 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF--------ASSNLKKCSACQVVW-YCGSNCQKL 58
           GE+I+ + P V  P   +S+  C GC          + N  KCSAC   W  CG  C++ 
Sbjct: 68  GELIMREAPLVLGPK-VASMPICLGCHRNLLPPQKPALNYYKCSACS--WPLCGPECERS 124

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLK-----LYLRRKLQNDNVIPSTTTDNYSL 113
            +  H  EC++++    + + +  P   +  +     + L R +Q     P+     ++ 
Sbjct: 125 AF--HVDECRLMAASKFQSKINYNPGQPVGKESAYCVIMLLRCMQLKESKPAA----FAR 178

Query: 114 VEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTIC 168
           +  L  H+ +  E  L  +L A + + +  +L   +WP++ I  IA   + L  NA  + 
Sbjct: 179 LCELEDHLKERLETPLYQVLRANLISFIKTVLGLREWPDLDILRIA---AILDTNAFEVR 235

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGS 227
            +  R    GL+P  ++ +H C+PN    F+  + ++  A + + KG  + ISY +   S
Sbjct: 236 QAGDRIKVRGLFPGGAMFSHDCVPNMRHRFDDDMNIMFLAKRPIAKGEILSISYTQQLRS 295

Query: 228 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKG 285
           T+ R+  LK+   F C C RC      D  +       + C       F+  D   +   
Sbjct: 296 TIQRRLHLKQVKCFECACARCA-----DPTELGTYAGAHMCAKCKVGKFISMDPLQNAAN 350

Query: 286 FTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
           + C+ C L+R  +E      KI +E+  L K T
Sbjct: 351 WRCEVCNLIRPAKEYLINDAKIEAELESLDKTT 383


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 140/332 (42%), Gaps = 45/332 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVW-YCGSNCQKLDWKLHR 64
           +G+VI+ ++P V  P +++    C GC++  + + +CS C   W  CG +        H 
Sbjct: 64  AGDVILQEKPIVMGPKHTAG-QICLGCYSGVDGRTRCSQCG--WPMCGRDDCHAHESDHA 120

Query: 65  LECQVLSRLDKEKRKSVT----PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            EC V++   +    S+      S+ ++  L LR   QN             + E     
Sbjct: 121 AECGVMASGGRPIVGSLPVQAYQSVMVLRCLALRD--QNPERWDELMQLEAHVQERRQKG 178

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG---- 176
           M D+D+   + + +      L LQ PE  I ++        C    + + E  P+     
Sbjct: 179 MEDVDQATAVRFIR----ETLGLQIPEELILQL--------CGILMVNSFEQPPMKGNSQ 226

Query: 177 ---TGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 232
                +Y   S++ H C+ NA+  F  +  +V RA   +PKG ++ + Y E    TM RQ
Sbjct: 227 HGLVAVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQ 286

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDS---DDKGFT 287
           + L +   F C C RC    + D         G+ C+      +G LL ++   D   + 
Sbjct: 287 RHLSQTKFFQCVCERCKDPTELD-----TFTSGFYCQKCPQQSAGILLTENPLDDASDWI 341

Query: 288 CQQCGLVR----SKEEIKKIASEVNILSKKTL 315
           C+QCG  +    + E I+ +  E+  L K ++
Sbjct: 342 CRQCGARQPANYAAEIIESVGKEIVALKKGSV 373



 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 33 CFASSNL-KKCSACQVVWYCGSNCQKLDWKLHRLECQV 69
          CF      ++CS CQ V+YC  + QK DWK HR  C+ 
Sbjct: 6  CFCGKEAHQRCSGCQAVFYCSRDHQKSDWKKHRSHCKA 43


>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
          Length = 648

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 66/287 (22%), Positives = 114/287 (39%), Gaps = 44/287 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDW-KLHRL 65
           GE +I ++ +  +       S C  C   +     C  C  V +C + C+   W   H+ 
Sbjct: 270 GETVIVEKAHASILQYEFKESHCHHCLHWTPGPVPCHKCSQVGFCSTQCRDEAWDSYHQF 329

Query: 66  ECQVLSRLDKEKRKSVTP-----SIRLMLKLYLR------RKLQNDNVIPSTTTD--NYS 112
           EC +   L +  R   T      ++R +LK   R       + ++     S   D  NY 
Sbjct: 330 ECGLTDFLCRTTRDVNTGQHGLLALRTVLKADRRLIIIANEQEKSPESFASQVFDSANYD 389

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQW-------PEISINEIAENFSKLACNAH 165
            V  LV + S      +   A +A  +  ++Q         E+    +        CNAH
Sbjct: 390 TVHRLVDNSSQRSTTDIFRRAVMAVYLTSLIQIRDGKDRPDEVLATAVLRLLHSYPCNAH 449

Query: 166 TICNSELR--------------------PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 205
            I +  +                      +G+  +PV+S++NHSC PN V +  G + VV
Sbjct: 450 EISHMAIPVPSGFCAQSKSLQLQQIQSCEIGSAAFPVVSLMNHSCNPNVVHLCYGDVMVV 509

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALK--EQYLFTCTCPRCIK 250
           + +  + +G E+L +Y     +   R++ LK  +QY F C C  C++
Sbjct: 510 KVIHRIARGEEILDNYGYHYATHEKRERQLKLCQQYYFRCRCQSCVE 556


>gi|308512211|ref|XP_003118288.1| CRE-SET-30 protein [Caenorhabditis remanei]
 gi|308238934|gb|EFO82886.1| CRE-SET-30 protein [Caenorhabditis remanei]
          Length = 548

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 106/245 (43%), Gaps = 17/245 (6%)

Query: 16  PYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDK 75
           P+     +S+    C  C   +    C  C+V  +C   CQ      H+ EC  +     
Sbjct: 27  PFAYSLIDSTKDHYCWTCLGETVELTCDQCKVARFCSKQCQVFGAFDHKYECGAI----- 81

Query: 76  EKRKSVTPSIRLMLKLYLRRKL---QNDNVIPSTTTDNYS--LVEALVAHMSDI--DEKQ 128
           +K   +    R+++++  R K     ND  I    T+  S   V  +  H +D+  DE  
Sbjct: 82  QKCADLNTDERMLIRIIGRYKDILDGNDKKIDGFYTNRESGRTVMQIWEHCADMKKDEHA 141

Query: 129 LLLYAQIANLVNLILQWPEISINEIA-ENFSKLACNAHTICNSE-LRPLGTGLYPVISII 186
           + ++ +I + V        +   E+A +  S+   N H+I N + LR +G GLY  +   
Sbjct: 142 MNVFKKIYDRVKQFGDKNYLVDEEVAFQLHSRNFINRHSISNVDYLREIGKGLYLDLCRY 201

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           +HSC PNA+    G +A +RA+     +        +YIE     + R+  LKE + F C
Sbjct: 202 DHSCRPNAIYSCNGTVAKLRALHDNVDLENVETTHYTYIELPPCKIQRRHMLKETWYFEC 261

Query: 244 TCPRC 248
            C RC
Sbjct: 262 HCERC 266


>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 606

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 58/123 (47%), Gaps = 5/123 (4%)

Query: 163 NAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 220
           NAH +   NS       GLYP +S++NHSCLPN V    G    VRA++ V  G ++ ++
Sbjct: 306 NAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVT 365

Query: 221 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 280
           YI        R + L +   F C C RC+      +      LE  +C   GC G+LL  
Sbjct: 366 YINIMEPRRIRARELMDTKHFACACERCV---SPLETHPDRFLEAVKCGAGGCDGWLLEQ 422

Query: 281 SDD 283
             D
Sbjct: 423 RAD 425


>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
          Length = 404

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 129/360 (35%), Gaps = 71/360 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           GEV++S+ P +  P+  SS+ S C  CF S   +    C +C+   +C   C        
Sbjct: 38  GEVLLSEPPVLLYPSTLSSLRSYCSACFRSLPPAATIPCVSCRAAAFCSPACAAASHP-- 95

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            L C  LS        +  P+  +   L     L   ++  ST     SL  A       
Sbjct: 96  HLLCAALSHGGGAGLAAAAPTEAIQEPLLF--LLSAYSLPESTMRSMLSLSSAPPPPPGA 153

Query: 124 IDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----FSKLACNAHTICNSELRPLGT 177
            D   L          +++ Q   P+++   +A++    F+ +    H +    L+    
Sbjct: 154 QDAAGLHAAVAALAPPHMLPQGLSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARAY 213

Query: 178 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 230
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 231 RQKALKEQYLFTCTCPRC------------------------------------------ 248
           RQ+ L E Y F C C RC                                          
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEG 333

Query: 249 --IKLGQFDDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 300
                G  DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
 gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
          Length = 506

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+ +++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTKCSGLVV 305


>gi|388854233|emb|CCF52152.1| uncharacterized protein [Ustilago hordei]
          Length = 938

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 198
           S  E+ +   + A N+ T+ +S+L PLG  ++P +++INH+C PNA +VF          
Sbjct: 481 SAKELLDLVCQFASNSFTLTDSDLNPLGVCMHPSMAMINHACTPNAAVVFPFGGAAKNGQ 540

Query: 199 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
                 E R+  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C K
Sbjct: 541 QKWNDGEDRIMQLVALRAIEPGEELLISYVDIADTYQDRRNYLKKRYCFDCRCELCRK 598



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 16/79 (20%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA----------------SSNLKKCSACQVVWYC 51
           G+V++   P V V + +    RC  C++                +  L++CS C+VV YC
Sbjct: 209 GDVLLRVRPEVAVLSTALLEQRCSACYSPRSVSEPIAQQAAASTAGKLQRCSGCKVVRYC 268

Query: 52  GSNCQKLDWKLHRLECQVL 70
            S CQ+ DW  HR EC+ L
Sbjct: 269 SSACQRRDWPTHRDECKAL 287


>gi|294659615|ref|XP_462014.2| DEHA2G10846p [Debaryomyces hansenii CBS767]
 gi|199434101|emb|CAG90495.2| DEHA2G10846p [Debaryomyces hansenii CBS767]
          Length = 725

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 155/356 (43%), Gaps = 46/356 (12%)

Query: 36  SSNLKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKR----KSVTPSIRLMLK 90
           SS +  C  C++  YC   C    W ++HR ECQ+  ++ K  +     S++  IR  +K
Sbjct: 127 SSKMVTCERCRIYKYCNQECYNAHWNQIHRYECQLFGQIMKNSQFKDEDSLSEFIRHGIK 186

Query: 91  LYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI 150
           LY+   L  D    S   D       L +H S ID      Y+ +      I++  ++++
Sbjct: 187 LYILCDL--DRAYKSRVFD-------LTSHSSIIDSDDE--YSWLHEYTKCIVRAIDLNM 235

Query: 151 NEIAENFS-KLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV--FEGRLA 203
           +     +  KL C    N+  + N     +G    P  S++NHSC+PN +++     + +
Sbjct: 236 DSKLYGYIWKLLCIILVNSSVLMNEYQEAIGFSFDPDFSLLNHSCIPNTLVIPLNHSKFS 295

Query: 204 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESA- 261
           +V A   V +  E+L +Y  T+     R   L+ ++ F C C  C  K   F     SA 
Sbjct: 296 LV-ATFPVAENKEILTNYCFTSCPKELRNSELQRRFFFRCNCILCKQKFDWFFSYNCSAC 354

Query: 262 --ILEGYRCK----DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--------IASEV 307
             +L     K    DD   G + ++  +  F C  CG + +K+ +K+        +A  +
Sbjct: 355 GMLLCSLTFKDFFSDDLSKGLVFKNQYNVDF-CSNCGKLINKDVLKETRKSHQQLLAIIL 413

Query: 308 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
             L   +L   S   H   +S  K  EK++K +     VNL  + +KLI+ +  L+
Sbjct: 414 YDLYNNSLDYKSDDTH---ISKEKFYEKIKKFMIDVDLVNL--SGDKLIEFIDSLQ 464


>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
          Length = 513

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 99/462 (21%), Positives = 189/462 (40%), Gaps = 69/462 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE I+++ P+V  P  + +   C  C+A+     S    CS C   W  C   C+  + 
Sbjct: 39  AGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPLCSKCG--WPVCSEECE--NQ 93

Query: 61  KLHR-LECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
             H+  ECQV  + ++         E  ++  P +  +  L L  + + +   P    D 
Sbjct: 94  PQHKDYECQVFVQANEKFNVQAALEEANENGVPQLECITPLRLLLESERN---PERWNDE 150

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              ++ + AH     +K+     QI N+V+ I +  ++ ++  +E     AC    I   
Sbjct: 151 ---IKNMEAHNKIRSQKKHWQSNQI-NIVDYIRK--QLKLDRFSEEQIHTACGILEINTF 204

Query: 171 ELRPL----GTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIE 223
           E+R         LYP ++++NHSC+ N   ++   + R+ + R    +P G E+  SY  
Sbjct: 205 EIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTH 263

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDS 281
           +   T+ R++ L E   F C C RC      D  +    +   +C   D+G    L    
Sbjct: 264 SLLPTLLRREHLLEGKYFACACSRC-----SDPTELGTHVSSLKCNKCDNGIVLSLDSLD 318

Query: 282 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKK 339
            D  + C  C    S   ++K+   ++I+  +  A+   S  +  + +   + + K  + 
Sbjct: 319 PDSSWKCTHCEFTTSGSAVRKV---LHIIQAEVDAVETISGADGADAIQERETVVKKYRS 375

Query: 340 LYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPL 389
           + HP    L   R  L ++   ++++          +  +  C+L + V   + P +  +
Sbjct: 376 VLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRI 435

Query: 390 LGLQYYT-------CGKLEWFLGDTENAI--KSMTEAVEILR 422
            G+  Y          K +W  G  + A+    M EA  IL+
Sbjct: 436 RGMTLYELHAPLLFVAKTQWNAGVIDEAVLKSKMIEAANILK 477


>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
 gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
          Length = 404

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 128/360 (35%), Gaps = 71/360 (19%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           GEV+ S+ P +  P+  SS+ S C  CF S   +    C +C+   +C   C        
Sbjct: 38  GEVLFSEPPVLLYPSTLSSLRSYCSACFRSLTPAATIPCVSCRAAAFCSPACAAASHP-- 95

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            L C  LS        +  P+  +   L     L   ++  ST     SL  A       
Sbjct: 96  HLLCAALSHGGGAGLAAAAPTEAIQEPLLF--LLSAYSLPESTMRSMLSLSSAPPPXPGA 153

Query: 124 IDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----FSKLACNAHTICNSELRPLGT 177
            D   L          +++ Q   P+++   +A++    F+ +    H +    L+    
Sbjct: 154 QDAAGLHAAVAALAPPHMLPQGLSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARAY 213

Query: 178 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 230
            +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY         
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273

Query: 231 RQKALKEQYLFTCTCPRC------------------------------------------ 248
           RQ+ L E Y F C C RC                                          
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEDEDDAGDRGDDGIEEEEG 333

Query: 249 --IKLGQFDDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 300
                G  DD   +     Y C ++GC G L  L  S +      F C  CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393


>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 422

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
            G  +Y   S+ NHSC PNA  V  GR   + A++ +    E+ ISY+     +  R+  
Sbjct: 251 FGLAIYLEASLFNHSCYPNAARVQRGRSIDIIAIRDIEPNEEICISYLNITNGSHERKDH 310

Query: 235 LKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
           LK  YLF C C RC +     ++I  S I    R K   CSG+L      +G   +QC  
Sbjct: 311 LKNNYLFDCVCIRCTQTNPDIENIVRSFICRNPRVK---CSGYLYL---PQGSDSRQCNF 364

Query: 294 VRSKE--EIKKI 303
            + KE  EIK+I
Sbjct: 365 CQWKESSEIKRI 376


>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
 gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
          Length = 529

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/343 (25%), Positives = 150/343 (43%), Gaps = 63/343 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF--------ASSNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +++   C GC          +    KCS+C   W  C   C+K 
Sbjct: 68  GEQIMCEAPLVVGPKVAAT-PLCLGCHRNLLAPQKPAMTFYKCSSCS--WPMCSKECEKS 124

Query: 59  DWKLHRLECQVLSR-----------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
            + L   EC++++             D E ++S    I L+  + L+R          + 
Sbjct: 125 PFHLD--ECRLMAASNFQSKINYNPADPEGKESAYCVIMLLRCMQLKR----------SN 172

Query: 108 TDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLAC 162
            + ++ + AL  H+ +  E  L  +L A +   +  +L    WPE+ I  IA   + L  
Sbjct: 173 PEAFARLAALEDHLKERIETPLYQVLRANLITFIKTVLGQRDWPELEILRIA---AILDT 229

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 221
           NA  +  +  R     ++P  ++I H C+PN    F+  + ++  A + +PKGA + ISY
Sbjct: 230 NAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFLAKRPIPKGAILSISY 289

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
            +   ST+ R+  LK+   F C C RC    +LG F      A L G +CK     G ++
Sbjct: 290 TQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTF----AGAHLCG-KCK----VGKVI 340

Query: 279 RDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
             +   +   + CQ C + +S  E+     ++  E+  L K T
Sbjct: 341 SQNPLENAANWHCQLCNVKKSAREVLTQDARLQQEIESLDKTT 383


>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
 gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
          Length = 781

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/364 (19%), Positives = 153/364 (42%), Gaps = 52/364 (14%)

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISI 185
           + ++L +Q+      +++   +  +  +++  KL+ +   + +S E  P+G  +Y   S+
Sbjct: 423 QAIILVSQVRVNAMAVVRMKSVDAHCPSDHLVKLSHSGDALTSSVEQVPVGQAIYTAGSL 482

Query: 186 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTC 243
            NHSC PN    F  R   +R  +H+  G  + +SY    G      R K L+++Y F C
Sbjct: 483 FNHSCQPNVHAYFLSRTLFIRTTEHLATGCPLELSYGPQVGQRDCKDRLKFLQDKYSFRC 542

Query: 244 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK-K 302
            C  C  +   D +Q +     +RC D  C G +L  S       +     R+ E  +  
Sbjct: 543 HCNGCSIVNLSDLVQNA-----FRCIDLNCVGVVLDRSVINSEIKKLKNFPRAPERQRLD 597

Query: 303 IASEVNILSKKTLALT------------SCGN-------HQEVVSTYKMIEKLQ------ 337
           +  +V+ L+   L L+            +CG+       H+ + + +   ++LQ      
Sbjct: 598 LCLQVDDLAHLALELSNGPLHIQPGCCLNCGSYCDLEAVHEGMRTAWIYFKRLQDAIVVK 657

Query: 338 -----------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 380
                            + + H ++ ++ +  + L +    +ED++ A  +C+ +I + +
Sbjct: 658 KISTTVITDASRALGALRSILHAYNKHIAEAEDILAQAFCLVEDFQSARDHCRASIKILE 717

Query: 381 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 440
            +Y   H ++G +      ++  +GD  +A+ S+     I     G+++ F+   +  L 
Sbjct: 718 MLYGPDHIVIGYELIKLSTIQLSMGDL-SAVDSINRLGAIFERYFGSHADFIFPYLQTLR 776

Query: 441 EAQA 444
           E  A
Sbjct: 777 EKLA 780


>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 673

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/385 (23%), Positives = 153/385 (39%), Gaps = 78/385 (20%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLK-KCSACQVVWYCGSNCQKLDWK 61
           +G V+I + P+    N  +    C  C     +S ++K  C  CQ V +C   C++  W+
Sbjct: 245 AGSVLIVETPFAFSTNKEALDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQ 304

Query: 62  L-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYL--------------------RRKLQND 100
           + H+ EC +         +S+  +  L+L   +                    +    ND
Sbjct: 305 MYHQYECFIFDAFYGNDFESMQHTSHLLLAYRMIISAFLSLSTEQINNNTEKSKIPFMND 364

Query: 101 NVI-----------------PSTTTDNYSLVEALVAHMSDIDEKQLLLYA----QIANLV 139
           N +                  +  T +Y  +  L  H + I+    L  A     +A   
Sbjct: 365 NFLRDYVTIKNKEYKDLGINKAYCTYDYHTILNLETHCTKIESSINLARAIEAIFLAKCF 424

Query: 140 NLIL-QWPEISINE--------IAENFSKLACNAHTICNS---------ELRPLGTGLYP 181
             +L +   I + E        +  N   + CNA+ I  +         E R +G  +YP
Sbjct: 425 TFVLSKMDVICLKESFIYLAVAMLHNLQAINCNAYEIVENIYDKKTHVWEPRNIGGAIYP 484

Query: 182 VISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQ 238
            IS+INHSC PN V   +     V+R ++ + KG E+L  Y     +   ++R + L ++
Sbjct: 485 SISLINHSCYPNVVRHSYPSGTVVLRTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKK 544

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLV 294
           Y F CTC  C +  QF  + E+     Y+CK   CS  +    L +   +  + +QC   
Sbjct: 545 YCFLCTCEACTQNWQF-PLPETM---NYKCK--MCSEIIGTITLDEKHTQNVSSKQCCKC 598

Query: 295 RSKEEIKKIASEVNILSKKTLALTS 319
             K ++KKI ++     +K L   S
Sbjct: 599 NKKTDLKKIKNQFRKSVEKRLDAIS 623


>gi|388581455|gb|EIM21763.1| hypothetical protein WALSEDRAFT_68691 [Wallemia sebi CBS 633.66]
          Length = 533

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 104/233 (44%), Gaps = 27/233 (11%)

Query: 28  SRCDGCFASS-----NLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEK--RKS 80
           S   G FAS       L +CS C+ ++Y     QK DW +H+ EC  L  L      + S
Sbjct: 110 SEYGGLFASELGTMKKLARCSLCKTIFYASKEAQKSDWPIHKNECLCLRSLGSSSIPQDS 169

Query: 81  VTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVN 140
           V    R +     +R L  DN    +  D Y       A+ +  +E+++ L  +     +
Sbjct: 170 VRALARFVWMKDHKRTL-GDNEWGQSVEDMYD------AYDNYTEEQKIGLGNKAKEFFD 222

Query: 141 LILQWPE---------ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 191
              Q  +         +SI+ I     K+  N   +  S+   +GT + P+ + INH+  
Sbjct: 223 FFTQGGKDREAMSRYNLSISYIMRMLCKIQTNGFALSGSDFMEIGTAICPLAACINHNQE 282

Query: 192 PNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            N+ ++F   + RL+VV A + + KG E+  +Y++ A     ++  L +QYLF
Sbjct: 283 YNSTVLFTQEDNRLSVV-AFKGIEKGDEITTTYVDGAEPWDAQRNQLLKQYLF 334


>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
           [Acyrthosiphon pisum]
          Length = 645

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 121/288 (42%), Gaps = 47/288 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           GEV+  ++PYV V    S    C  CF    N   C  C +V YC   C+   ++  H+ 
Sbjct: 260 GEVLAIEKPYVGVLRRESYEYNCRNCFKRCLNGIPCLKCTLVIYCDETCRIKSYESGHKY 319

Query: 66  ECQVLSR------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTD---------- 109
           EC + S       +D  +  S+   ++ + +L L + +     + + TTD          
Sbjct: 320 ECSLFSTFNNWPGMDHMEHLSLNIFLKSVCELGLDKYIATVCALNADTTDPMMRGFNNVG 379

Query: 110 -----NYSLVEALVAH-----MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSK 159
                 +  V  L  +     +SD+  +       ++ +    LQ P   +  + E+   
Sbjct: 380 KYLSDQFCSVYTLEGNETKRSVSDLFSRHCHAAVMVSIMKLAGLQIPNHQLGTVGESLVH 439

Query: 160 LAC----NAHTI-----CNSELR--------PLGTGLYPVISIINHSCLPNAVLVFEGRL 202
           + C    NAH I     C ++L+        P+ + L PV+S++NH C PN V       
Sbjct: 440 IICAVSSNAHGITQPSDCKTQLKLSLDNRFVPVASLLMPVLSLLNHHCDPNVVRHNYNGT 499

Query: 203 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            V+ A+Q + KG+++  +Y  +       +R + LK QY F+C C  C
Sbjct: 500 IVLTAIQPISKGSQLFDNYGLLYATHPKESRLQILKNQYYFSCECSSC 547


>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
 gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
          Length = 519

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 83/378 (21%), Positives = 157/378 (41%), Gaps = 48/378 (12%)

Query: 8   GEVIISQEPY----------VCVPNNSSSISRCDGCFASSNLKKCSACQVVW-YCGSNCQ 56
           GE+I   EPY          +C+  N + ++  D         +CS C   W  CG  C+
Sbjct: 67  GEIIYRDEPYAVGPKIANVPLCLGCNRNLMAGWDATRGLDRFHECSRCG--WPLCGPGCE 124

Query: 57  KLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIP-------STTTD 109
           ++    HR EC VL+           P+IR       +R+     ++P           +
Sbjct: 125 EV--AQHRPECSVLAG------SGYRPNIRPNPSNPEQRESAYCVIVPLRVLLLERIAPE 176

Query: 110 NYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTI 167
            Y+ V+   +H+ +     L  +L + +   +  +L+  + S   + +  + L  N + I
Sbjct: 177 RYATVQGFESHLDERLASPLYGVLRSNLVPFLRQVLRLQQYSEQTVLKLSAILDTNCYEI 236

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLISYIETAG 226
              E      GLYP+ ++++H C PN    F+ RL  V+ A   +P G  +  SY +   
Sbjct: 237 RLPEQHVKVRGLYPLGAMLSHDCRPNTKHYFDDRLHMVLVATVDIPAGGVIHASYTQPLL 296

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            T+ R+ AL++   F C C RC      D  +      G+RC +   +  L+  ++   +
Sbjct: 297 GTVQRRLALRQAKCFDCCCERCA-----DPTEYGTSASGFRCPNCRRTPSLVLAAEPTNY 351

Query: 287 -TCQQCGLVRS--KEEIKKIASEVNILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLY 341
            T  +C   R   +E   +  +    + ++ LAL  T    ++E ++ +           
Sbjct: 352 RTVWRCQNQRCAYQERPDQYVARCERMQQELLALDRTEPAGYEEFLARHATT-------L 404

Query: 342 HPFSVNLMQTREKLIKIL 359
           HP++  ++Q +  L ++L
Sbjct: 405 HPWNAYMLQAKYALTQLL 422



 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 11/75 (14%)

Query: 30  CDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  C A + L++C+ CQ V YCG + Q+ DWK  HR +C         +R  V  + RL 
Sbjct: 6   CRVCSAPA-LQRCAGCQQVGYCGRDHQRADWKAQHRDQC---------RRFKVVRNDRLG 55

Query: 89  LKLYLRRKLQNDNVI 103
             L   R ++   +I
Sbjct: 56  RHLVATRHIKQGEII 70


>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
 gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
          Length = 73

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 46/73 (63%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           GTG++P    +NHSC PN+V VF G    ++A++ +P G E+ ISYI+      TRQ+ L
Sbjct: 1   GTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEEL 60

Query: 236 KEQYLFTCTCPRC 248
           + Q+ F C C RC
Sbjct: 61  QTQFCFYCQCHRC 73


>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
           max]
          Length = 419

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 121/298 (40%), Gaps = 58/298 (19%)

Query: 30  CDGCF----------ASSNLKKCSACQVVWYCGSNC--QKLDWKLHRLECQVLSRLDKEK 77
           CD CF          +SS+   C  C+   +C SNC    L+       CQ LS L    
Sbjct: 66  CDHCFRILSPSLQGDSSSSTVLCPNCRHHCFCNSNCLSNALNSSHSSWVCQALSHLRANS 125

Query: 78  RKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSL--VEALVAHMSDIDEKQL----LL 131
                    L+L+  L  ++Q + ++ +    N S    + +++     D+  +     L
Sbjct: 126 ---------LLLEQPLEHQVQVNFLVAAYNLANISPSDFQIMLSLQGSPDDSTIAAAQFL 176

Query: 132 YAQIANLVNLILQWPE--ISINEIAENFSKLACNAHTICNSELRPL----------GTGL 179
           +  I++L +L L  P+   S+   +   +K   NA  I    ++P             G+
Sbjct: 177 HPLISSLCSLALIGPQNGFSLELTSAILAKDKLNAFGI----MQPFSEHDDQRSVRAYGI 232

Query: 180 YPVISIINHSCLPNAVLVFEGRLA-----------VVRAVQHVPKGAEVLISYIETAGST 228
           YP  S  NH CLPNA   F+   A           ++R +  VP+G E+ +SY       
Sbjct: 233 YPYASFFNHDCLPNACR-FDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEKY 291

Query: 229 MTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 283
            +RQK L E Y FTC C RC         D ++++A  E     +D C      D+DD
Sbjct: 292 SSRQKRLIEDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDD 349


>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
          Length = 666

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 16/139 (11%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQ 232
           +G GL+P  +++NHSC P A++ + G  AVVRA++ + K  E+ I+Y  T      ++R+
Sbjct: 472 IGIGLFPTAALLNHSCNPEAIVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDEELSRR 531

Query: 233 KALKEQYLFTCTCPRCIKLG----QFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGF 286
             LKE + F CTC  C++      + D  Q   + E        C   LL D   D+K  
Sbjct: 532 HQLKETHFFHCTCKACLEGWPMWLEMDQNQPDWLCE-------ACGSILLSDKIEDNKFA 584

Query: 287 TCQQCGLVRSKEE-IKKIA 304
            C++C   ++ E+ I K+A
Sbjct: 585 KCKKCSHRQNLEDAINKLA 603


>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 306

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 2/87 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           LGT L    S +NHSC P A + FEGR   VRA+  +P G E+  +Y++ +GS  +RQ  
Sbjct: 9   LGTTLDLAASFMNHSCNPGAFVFFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQAT 68

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESA 261
            + +Y F C C RC      +D+Q+ A
Sbjct: 69  TEAEYFFQCHCVRCED--DLEDLQQIA 93


>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
 gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
          Length = 499

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            LYP+  ++NH C+PN+   FE +    +VRA   + +G E+  +Y +     + R   L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
           K +  FTC CPRC      D  ++ A + G  C+D  CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305


>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 389

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 53/93 (56%)

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
           ++L  NA T+ N     +G  + P  S  NHSC PNA + F+G +  ++A+Q +    ++
Sbjct: 99  NELETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQI 158

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
            ISYI+       RQ  L+++Y FTC CP+C +
Sbjct: 159 FISYIDNTEPWGKRQSELQKRYFFTCKCPKCAQ 191


>gi|195430720|ref|XP_002063397.1| GK21416 [Drosophila willistoni]
 gi|194159482|gb|EDW74383.1| GK21416 [Drosophila willistoni]
          Length = 661

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 82/375 (21%), Positives = 151/375 (40%), Gaps = 54/375 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDW-KLHRL 65
           G+ ++ + PYV V     + + C+ CF  + +   C  C  V YC   CQ+L   K H+ 
Sbjct: 262 GQELLVERPYVAVLLEKYAKTHCEACFMRTVVPVACPRCADVVYCSEKCQQLAANKYHKY 321

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRR--------KLQNDNVIP-----STTTDNYS 112
           EC +L  +    R   + +  + L+++  +        K   D  +P     S   D++ 
Sbjct: 322 ECGILPSI---WRSGASINNHIALRIFASKPWSYFKDLKPLIDVELPVEKLISLPKDDFR 378

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-------------IAENFSK 159
            V  L  H ++        +  +A  +   L+      +E             I  +   
Sbjct: 379 RVAQLERHQTERQPSNFFQHVLMARFLAKCLRASNYLGSEEPQKEDVQALTSLILRSLQF 438

Query: 160 LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           +  N H +  +EL             +G  +YP +++ NHSC P  V  F G    + +V
Sbjct: 439 IQFNTHEV--AELHKFSSSKSEKSIFIGGAIYPTLALFNHSCDPGVVRYFRGSTIHINSV 496

Query: 209 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEG 265
           + +  G  +  +Y  I T      RQ  LKE Y F C C  C++    F+D+    I   
Sbjct: 497 RPIEAGLPINENYGPIYTQDKREERQSRLKELYWFECCCDACLENWPLFEDLPRDVI--R 554

Query: 266 YRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
           +RC+  + C+  +      +D    C  CG   +  +  K+  +  ++++    L   G+
Sbjct: 555 FRCEAPNNCAAVIEVPPTCNDFMVKCVTCGENTNILKGLKVMQDTEMMTRTGKRLYESGD 614

Query: 323 HQEVVSTYKMIEKLQ 337
           + + +  YK I+ L+
Sbjct: 615 YSKAL--YKFIDLLR 627


>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 377

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 1/96 (1%)

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
           ++L  NA T+ N     +G  + P  S  NHSC PNA + F+G +  ++A+Q +    ++
Sbjct: 88  NELETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQI 147

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 253
            ISYI+       RQ  L+++Y FTC CP+C + GQ
Sbjct: 148 FISYIDNTEPWEKRQSELQKRYFFTCKCPKCAQ-GQ 182


>gi|189242482|ref|XP_001810915.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
          Length = 538

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 111/266 (41%), Gaps = 24/266 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK-LHRL 65
           GE++  ++PY  +  +S S+  C  C     N+  C  C    YC  NC+   ++  H+ 
Sbjct: 109 GEILAIEKPYASIVTDSVSVY-CHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKY 167

Query: 66  ECQVLSRLDK---EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           EC +   LD    +  K +   I L+ +      L N++      +D +  V  L  ++ 
Sbjct: 168 ECPIHLSLDPLLIDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVR 227

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAEN-FSKLACNAHTICNS----------- 170
                 L     IA  V  +++     + E  E+ F ++  +   IC++           
Sbjct: 228 QRFTHDLFGRTTIACGVFYLIKKYTTFLQEYDEDRFKEILLSLLLICSTNTVRVNEVSST 287

Query: 171 ----ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET-- 224
               ++       YP  S+ NHSC PN    + G   V+RA++ + KG +  ++Y  +  
Sbjct: 288 LGEYDVCGFACSHYPFFSMFNHSCWPNVCRSYHGSQMVLRAIRTIKKGEQCFVTYGPSYL 347

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIK 250
           + + + RQ  L   Y F C C  C++
Sbjct: 348 SDNIVGRQAFLFFHYFFNCGCKACVE 373



 Score = 44.7 bits (104), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
             +  YP  S+ NHSC PN    + G   V+RA++ + KG +  ++Y
Sbjct: 491 FASSHYPFFSMFNHSCWPNVCRSYHGSQMVLRAIRTIKKGEQCFVTY 537


>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
 gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
          Length = 750

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 118/272 (43%), Gaps = 11/272 (4%)

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST-- 228
           E R + + +YP  S++NH+C P+ ++ F   + V RA  ++  G+ +   Y         
Sbjct: 468 EQRRIASAIYPTASLLNHACDPDVLVSFVDGVLVARATHNIAPGSGITHCYGPHVNHMPR 527

Query: 229 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 286
             RQK L +QY FTC C  C    + ++ +       + C    C    ++ S  +    
Sbjct: 528 EERQKLLYKQYFFTCQCSACTSDEEMENTR--LCFSAFACPRCKCP---MKTSPLEPSLA 582

Query: 287 TCQ--QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 344
            CQ  +C L +S EE    + +  +L  K +        QE +  ++   + + ++ HP 
Sbjct: 583 RCQNKKCTLEKSIEEELSHSRQAELLFFKAVRTMERIGVQEALGLFQECLRTRTQILHPH 642

Query: 345 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 404
             +L +T + L +    + D+K A  +C  +    ++ +         + +   +L +  
Sbjct: 643 HKDLAETHDALARCYAMIGDFKLASQHCLQSSEAVEKAFGSTSVEYAHELHKLSQLLFND 702

Query: 405 GDTENAIKSMTEAVEILRITHGTNSPFMKELI 436
              + A+  + +A  +L   +G N P ++EL+
Sbjct: 703 RQAKKALPMIDKAASLLATYYGRNHPDVQELV 734



 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 40/72 (55%), Gaps = 6/72 (8%)

Query: 7   SGEVIISQEPY--VCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDW-KL 62
           +G+ +I++EPY  V +P N+   + C+ C+ S      C+ C  V YC + C+   W + 
Sbjct: 181 AGDTLIAEEPYSAVLLPENAK--THCECCYKSLVAPVPCNHCSSVLYCSAACRNKAWSQY 238

Query: 63  HRLECQVLSRLD 74
           H +EC++   L+
Sbjct: 239 HHVECEIFPVLE 250


>gi|195440652|ref|XP_002068154.1| GK12548 [Drosophila willistoni]
 gi|194164239|gb|EDW79140.1| GK12548 [Drosophila willistoni]
          Length = 534

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 83/332 (25%), Positives = 148/332 (44%), Gaps = 53/332 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-------ASSNLKKCSACQVVW-YCGSNCQKLD 59
           GE I+ + P V  P  +S+   C GC         S N  KCS+C   W  CG  C++  
Sbjct: 69  GEQILREAPLVVGPKVASA-PICLGCHRNIGAPGKSMNYYKCSSC--TWPLCGKECEQSP 125

Query: 60  WKLHRLECQVLSRLD-------------KEKRKSVTPSIRLMLKLYLRRKLQNDNVIPST 106
              HR ECQ++++ +              ++RK     + ++L+  L         + ++
Sbjct: 126 H--HRGECQLMAQSNFQSKINYHPDQDQNQERKESAYCVIMLLRCML---------LKTS 174

Query: 107 TTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLA 161
             +++  + +L  H+ +  E  L  +L A +   +  +L   +W E+ I  IA   + L 
Sbjct: 175 QPEDFVRLFSLEDHLKERLETPLYQVLRANLITFIKTVLGLREWSEMEILRIA---AILD 231

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLIS 220
            N   +   + R     LYP  ++I+H C+PN    F+  + ++  A + + KG  + IS
Sbjct: 232 TNTFEVRQPKERRKVRALYPGAAMISHDCVPNMRHRFDDNMNIIFLAKRAISKGEILNIS 291

Query: 221 YIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFL 277
           Y +   ST+ R+  L++   F C C RC    +LG F     +A     +CK  G    L
Sbjct: 292 YTQPLRSTIQRRLHLRQAKCFDCACQRCSDPSELGTF-----AAAQMCLKCK-AGKIIPL 345

Query: 278 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 309
               +   + CQ C + +S +EI  I +++ +
Sbjct: 346 NPLQNSSPWKCQLCNVKKSAKEIITIDNQLQL 377


>gi|332023172|gb|EGI63428.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 624

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 147/365 (40%), Gaps = 76/365 (20%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVWYCGSNCQKLDWK- 61
           G V+I   P+    + S+    C  C A+  L+      C  CQ V +C   C+K  W+ 
Sbjct: 201 GTVLIVDRPFSFSTDASALDRNCLHCHATLKLEDSVRIPCRNCQTVAFCTEVCRKEAWET 260

Query: 62  LHRLECQVLS-----RLDKEKRKS----VTPSIRLMLKLYLRRKLQ-------------- 98
            H+ EC V +      L+ E+++S    +     ++  L LR + +              
Sbjct: 261 YHQYECSVFNYFFKNSLNNERQQSSYLLLAYRTTVIQALSLRNRTELTCVLNPDFLRYHV 320

Query: 99  ------NDNVIPSTTTD----------NYSLVEALVAHMSDIDEKQLLLYAQIANLVN-- 140
                  DN I     D          +Y  V  L  H +D++    L+    A  +   
Sbjct: 321 NSNAKDKDNDISKECADLGSKRTYSPLDYRTVFQLETHCADVEPHVNLIRTVEAIFLTKC 380

Query: 141 LILQWPEISINEIAENFSKLA-----------CNAHTICNS---------ELRPLGTGLY 180
           LIL   ++ I    E F  LA           CNA+ I  +         E R +G  +Y
Sbjct: 381 LILVLNKLDIICTTETFIVLAVAMLHHLQAINCNAYEIIENVHDETTRVWEPRNIGAAIY 440

Query: 181 PVISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKE 237
             +S++NHSC PN V   +   + VVRA++ + KG E+   Y     + S + R++ L +
Sbjct: 441 STVSLVNHSCYPNMVRHSYPNGIVVVRALRFIGKGCEIFDCYGPHFLSESKLNRREFLWK 500

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           +Y F C C  C +  +F  +QE   +  Y+C    CS  +    D     C +C  +   
Sbjct: 501 KYRFLCGCDACKQNWKF-PLQE---IMNYKCT--ACSEPIDLSIDATNQRCAKCEKIVDL 554

Query: 298 EEIKK 302
           + I+K
Sbjct: 555 KRIEK 559


>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
 gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
          Length = 404

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 103/261 (39%), Gaps = 27/261 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           GEV++S+ P +  P+  SS+ S C  CF S   +    C++C+   +C   C        
Sbjct: 38  GEVLLSEPPVLLYPSTLSSLASYCSACFRSLPAAATVPCASCRAAAFCSPACAAASHP-- 95

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMS 122
           RL C  LS        +  P+  +   L     L +   +P +T  +  SL  A      
Sbjct: 96  RLLCAALSHGGGAGVAAAAPTEAIQEPLLF---LLSAYSLPESTIHSILSLSSAPPPPPG 152

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE--------LRP 174
             D   L          +++ Q    S +  A   +K   N+  I            L+ 
Sbjct: 153 AQDAAGLHAAVAALAPPHMLPQG--FSPDLTAALLAKDRANSFAIMEPYRVGMSLELLKA 210

Query: 175 LGTGLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGS 227
               +YP  S+ NH CLPNA    +  R        VVRA+  +P+G EV ISY      
Sbjct: 211 RAYAVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWR 270

Query: 228 TMTRQKALKEQYLFTCTCPRC 248
              RQ+ L E Y F C C RC
Sbjct: 271 YADRQRRLLEDYGFRCECDRC 291


>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 673

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 87/383 (22%), Positives = 151/383 (39%), Gaps = 74/383 (19%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF----ASSNLK-KCSACQVVWYCGSNCQKLDWK 61
           +G V+I + P+    N  +    C  C     +S ++K  C  CQ V +C   C++  W+
Sbjct: 245 AGSVLIVETPFAFSTNKEALDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQ 304

Query: 62  L-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYL--------------------RRKLQND 100
           + H+ EC +         +S+  +  L+L   +                    +    ND
Sbjct: 305 MYHQYECFIFDAFYGNDFESMQHTSHLLLAYRMIISAFLSLNTEQINNNTEKSKIPFMND 364

Query: 101 NVI-----------------PSTTTDNYSLVEALVAHMSDIDEKQLLLYA----QIANLV 139
           N +                  +  T +Y  +  L  H + I+    L  A     +A   
Sbjct: 365 NFLRDYVTIKNNEYKDLGINKAYCTYDYHTILNLETHCTKIESSINLARAIEAIFLAKCF 424

Query: 140 NLIL-QWPEISINE--------IAENFSKLACNAHTICNS---------ELRPLGTGLYP 181
             +L +   I + E        +  N   + CNA+ I  +         E R +G  +YP
Sbjct: 425 TFVLSKMDVICLKESFIYLAVAMLHNLQAINCNAYEIVENIYDKKTHVWEPRHIGGAIYP 484

Query: 182 VISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQ 238
            IS+INHSC PN V   +     V+R ++ + KG E+L  Y     +   ++R + L ++
Sbjct: 485 SISLINHSCYPNVVRHSYPSGTVVLRTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKK 544

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRS 296
           Y F C C  C +  QF  + E+     Y+CK   +  S   L +   +  + +QC     
Sbjct: 545 YRFLCACEACTQNWQF-PLPETM---NYKCKMCSEIISTITLDEKHTQNVSSKQCCKCNK 600

Query: 297 KEEIKKIASEVNILSKKTLALTS 319
           K ++KKI ++     +K L   S
Sbjct: 601 KTDLKKIKNQFRKSVEKRLDAIS 623


>gi|194873875|ref|XP_001973295.1| GG13432 [Drosophila erecta]
 gi|190655078|gb|EDV52321.1| GG13432 [Drosophila erecta]
          Length = 530

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/343 (25%), Positives = 148/343 (43%), Gaps = 63/343 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKESPLVLGPKVASA-PLCLGCHRNLLAPENPRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRL-----------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
           D   H+ ECQ++S             ++E+++S    I L+  + L+ K           
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMQLKAK----------D 174

Query: 108 TDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLAC 162
            + +  ++ L  H+ +  E  L  +L A +   +  +L    W E+ I  IA   + L  
Sbjct: 175 PEAFLRLDTLEDHLKERLETPLYQVLRANLITFIKTVLGMKDWLEMDILRIA---AILDT 231

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 221
           N   +     R     LYP  ++I+H C+PN    F+  + +V  A + + KG  + ISY
Sbjct: 232 NTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRRIAKGEILSISY 291

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLL 278
            +   ST+ R+  L++   F C+C RC    +LG F   Q        +CK    +G ++
Sbjct: 292 TQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC-----LKCK----AGKII 342

Query: 279 RDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
             +   +   + CQ C   RS +E+     ++  E+  L K T
Sbjct: 343 SLNPLLNSAPWKCQLCSYKRSAKEVVTSDAELQQELESLDKTT 385


>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
           TFB-10046 SS5]
          Length = 373

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 105/258 (40%), Gaps = 63/258 (24%)

Query: 28  SRCDGCF-ASSNLKKCSACQVVWYCGSNC---QKLDWKL--HRLECQVLS---------R 72
           +RCD C  A+ +L++CS C   +YC + C   Q   W    HR  C++L          R
Sbjct: 49  ARCDWCVRAAPDLRRCSRCAAYYYCDAECKSRQNRHWTQGGHRRVCRLLPAFLAGAPFQR 108

Query: 73  LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY 132
           L +E+R         +L   L      D+V+     D     +      + +D  +  LY
Sbjct: 109 LAQEQRTDAL-----LLTHLLAAGQNTDDVL-----DLLPHSQPGPTPPTPLDGDKDALY 158

Query: 133 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCL 191
           A+  N                  NF         + +S L PL   +YP  S  +NHSC 
Sbjct: 159 ARFEN-----------------NNF---------VLHSHLTPLAAAVYPAASRALNHSCA 192

Query: 192 PNAVLVF------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
            NAV +F        R+ VV  V+ V  G E+ I YI+ A +   R++ L+  Y F C C
Sbjct: 193 TNAVPLFVFAPATPPRMEVV-LVRDVAPGDEITIPYIDPALAPSARRERLRASYGFECAC 251

Query: 246 PRCIKL----GQFDDIQE 259
            RCI       Q  D+ E
Sbjct: 252 ARCITPSGLEAQIPDLAE 269


>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
          Length = 333

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 148 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 201
           ++I+EI E +S +  N H +    L PL     G G++P  S +NHSCLPNA    + + 
Sbjct: 40  VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 95

Query: 202 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           + V R ++ + KG E+L SY +       R+K L +QY F C C +C
Sbjct: 96  MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 142


>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 448

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 4/125 (3%)

Query: 159 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 218
           K+ CN   I + + + LG  + P+ S  NHSC PN V V    L   +A+  +PKG+++ 
Sbjct: 272 KIRCNQFGIWSKKDKCLGVSVTPIASYFNHSCCPNIVDVRGTTLLEFKALHFIPKGSQLC 331

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG-FL 277
           ISY++   +T +RQ  L   Y F C C RC   G   D  ++ I + Y      CSG + 
Sbjct: 332 ISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKG---DSIDNWISQFYCQGTKKCSGTYF 388

Query: 278 LRDSD 282
           L D +
Sbjct: 389 LEDEE 393


>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
           scrofa]
          Length = 788

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 59/108 (54%), Gaps = 6/108 (5%)

Query: 145 WPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 202
           +PE +I  +A   +  +L CNA  I  + ++  G  ++P++++  HSC PNA + F   +
Sbjct: 481 YPEWNIWGVAMLRHMLQLQCNAQAI--TAIQQTGDSIFPIVNVFXHSCSPNASVSFISTV 538

Query: 203 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
           A VRA Q + KG EVL  Y   E+      R + L+ QY F C CP C
Sbjct: 539 ATVRASQPIRKGQEVLHCYGPHESRMGVAERHQKLRSQYFFDCNCPPC 586


>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
           1558]
          Length = 885

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 1/102 (0%)

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           T ++P +S+INHSCLPN V +  G +AVVRA+  +  G E+ ISY        TR    K
Sbjct: 553 TAIWPTVSMINHSCLPNTVHIAWGDVAVVRALNDMSAGTELTISYFSREQPYETRAAKAK 612

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
            QYLFTC C  C      + I+   ++E +          LL
Sbjct: 613 -QYLFTCDCNLCAADRMDESIRRGKLMEKFDMAKQASRNILL 653


>gi|158287331|ref|XP_309383.4| AGAP011257-PA [Anopheles gambiae str. PEST]
 gi|157019599|gb|EAA05326.4| AGAP011257-PA [Anopheles gambiae str. PEST]
          Length = 540

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 124/294 (42%), Gaps = 60/294 (20%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNC-QKLDWKLHR 64
           G+V++ + P+V V  +S    RCD C       L  C  C    YC   C  K     HR
Sbjct: 161 GDVVMIERPFVTVLKDSFRYVRCDFCHGERPFTLIPCEGCTAAMYCSEECLSKAYNNYHR 220

Query: 65  LECQVLSRLDKEKRKSVTPSIRLM-------------LKLYL---------------RRK 96
            +C +L  L ++  +     IR++             LK +L                + 
Sbjct: 221 YDCGILRDLYEDFEEVSLIDIRMIAIAITTFDNNPEALKDHLDALDESNVNGFTMDWNKA 280

Query: 97  LQND--NVIPSTTTDN---YSLVEALVAHMSDIDEKQLLLYAQI-----AN------LVN 140
            Q D  N +   TT+    +S+  A+    + I    +L   ++     AN      L++
Sbjct: 281 TQQDIFNTVHVLTTNQERRHSMFVAMFIFNATILHTLILERTELGPVCEANPATNKFLLD 340

Query: 141 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVF 198
           LIL++ +I    +  N   L+ NA+ +          G YP+IS++NHSC PN   + + 
Sbjct: 341 LILRYMQI----VNCNRKLLSFNAYKVNEYVAESFAVGCYPLISMLNHSCAPNVKRITLP 396

Query: 199 EGRLAVVRAVQHVPKGAEVLISYIETAGSTM----TRQKALKEQYLFTCTCPRC 248
           +GR AV   ++ V +G+++  SY   AG T+     RQ  L   Y F CTC  C
Sbjct: 397 DGRCAVF-VIRPVLEGSQLFDSY--EAGHTLHEREMRQSMLSFTYSFRCTCEAC 447


>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 513

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 101/462 (21%), Positives = 191/462 (41%), Gaps = 69/462 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE I+++ P+V  P  + +   C  C+A+     S    CS C   W  C   C+  + 
Sbjct: 39  AGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPLCSKCG--WPVCSEECE--NQ 93

Query: 61  KLHR-LECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
             H+  ECQV  +  +         E  ++  P +  +  L L  + + +   P    + 
Sbjct: 94  PQHKDYECQVFVQAKEKFNVQAALEEANENGVPQLECITPLRLLLESERN---PERWNNE 150

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              ++ + AH     +K+     QI N+V+ I +  ++ ++  +E     AC    I   
Sbjct: 151 ---IKDMEAHNKIRSQKKHWQSNQI-NIVDYIRK--QLKLDRFSEEQIHTACGILEINTF 204

Query: 171 ELRPL----GTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIE 223
           E+R         LYP ++++NHSC+ N   ++   + R+ + R    +P G E+  SY  
Sbjct: 205 EIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTH 263

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDS 281
           +   T+ R++ L E   F C CPRC      D  +    +   +C   D+G    L    
Sbjct: 264 SLLPTLLRREHLLEGKYFACACPRC-----SDPTELGTHVSSLKCNKCDNGIVLSLDSLD 318

Query: 282 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKK 339
            +  + C  C    S   ++K+   ++I+  +  A+   S  +  + +   + + K  + 
Sbjct: 319 PESSWKCTHCEFTTSGSAVRKV---LHIIQAEVDAVETISGADGADAIQERETVVKKYRS 375

Query: 340 LYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPL 389
           + HP    L   R  L ++   ++++          +  +  C+L + V   + P +  +
Sbjct: 376 VLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRI 435

Query: 390 LGLQYYT-------CGKLEWFLGD-TENAIKS-MTEAVEILR 422
            G+  Y          K +W  G   E A+KS M EA  IL+
Sbjct: 436 RGMTLYELHAPLLFIAKTQWNAGVIDEAALKSKMIEAANILK 477


>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
 gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
          Length = 507

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 152/389 (39%), Gaps = 56/389 (14%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCF---ASSNLKKCSACQVVWYCGSNCQKLDWKL 62
           V   ++I + P+   P  +  +  C GC+   A +  + CS C   W   S C + +   
Sbjct: 31  VGHGLLIEELPFAVGPKCNGPVV-CLGCYQFDADTEEEYCSECG--WPLCSECAQDEGNH 87

Query: 63  H-RLECQVLSRLDKE-----KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY---SL 113
           H R EC+VL           K  S  P +  +L L          V+ +  TD     S 
Sbjct: 88  HFRAECRVLQDARARFFPLPKGSSHCPQLDCILPL---------RVLLAKETDPERWDSQ 138

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-----EISINEIAENFSKLACNAHTIC 168
           VE +  H  +  E   + +A   N+    L+ P       S   I +    L  NA    
Sbjct: 139 VEPMEHHEEERKEDADVWHADRVNIAQ-YLRGPCKLSSRFSEELIMQVVGILEVNAFEAR 197

Query: 169 NSELRPLGTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 225
            +   PL   LYP   I+ H+C+PN   ++   EG    +RA+  +  G  +  SY  T 
Sbjct: 198 TTRGFPLRC-LYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLKDGQPLHHSYTYTL 256

Query: 226 GSTMTRQKALKEQYLFTCTCPRC---IKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDS 281
             T  RQ+ LK+   F+CTC RC    +LG  F  ++     EGY+              
Sbjct: 257 DGTAQRQRHLKQGKYFSCTCERCQDPTELGTHFSSLKCGQCTEGYQVPRQP-------TD 309

Query: 282 DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 337
            D  + C  CG   S+ E+    + + SEVN +    +        + ++  Y       
Sbjct: 310 SDSSWNCNNCGASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEAERLLRKY------- 362

Query: 338 KKLYHPFSVNLMQTREKLIKILMELEDWK 366
           K L HP        R+ LI++   ++ ++
Sbjct: 363 KSLLHPLHFIATGLRQLLIEMYGRVQGYE 391


>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
 gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
          Length = 515

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 161/395 (40%), Gaps = 29/395 (7%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVW-YCGSNCQKLDWKLHR 64
           E+++ + P +  P   S +  C GC  +   ++   C  C   W  CG  CQ LD   H+
Sbjct: 65  EIVLKEAPLMRGPAQIS-VPVCMGCLNAIEPNDHITCDKCG--WPLCGPECQALD--EHK 119

Query: 65  LECQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
            ECQ+  +R  K   +       L   +   R L      P+   + +  +E+L      
Sbjct: 120 AECQLTEARGQKVNVQEFNGPHPLYTCVSTVRCLLLSETNPANA-EKFQQLESLEQTRRG 178

Query: 124 IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
            ++ +  L A I   +    +  + S  EI      L  N H +  S+  P    ++   
Sbjct: 179 SNQWKADL-ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTA 235

Query: 184 SIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           S   +SC+PN    F +    ++ A + + KGA + I Y +    T  RQ+ L +  LF 
Sbjct: 236 SFTENSCVPNLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFK 295

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 302
           CTC RC+ + +             +C+D  C+G LL    D    C +C   + + + K 
Sbjct: 296 CTCERCVDVTEL-----GTYYSALKCEDRKCTGLLLPIKADDWNGCWRCRDCQKQVQRKY 350

Query: 303 IASEVNILSKKTLALTSCGNHQEVVSTY-KMIEK-LQKKLYHPFSVNLMQTR--EKLIKI 358
           +     IL +    + S     E    Y K  EK L  + YH   V ++Q +   K  K 
Sbjct: 351 VDG---ILERAIKDIQSMEKTAENGFKYLKHYEKWLPPQHYHLSEVKILQVQLIAKDQKE 407

Query: 359 LMELED--WKEALAYCQLTIPVYQRVYPQFHPLLG 391
           LM L D   +  L Y +  I +Y+ + P    +LG
Sbjct: 408 LMVLPDEQLQLKLKYAKELIELYEILAPCEVRMLG 442


>gi|268579277|ref|XP_002644621.1| C. briggsae CBR-SET-30 protein [Caenorhabditis briggsae]
          Length = 634

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 103/245 (42%), Gaps = 17/245 (6%)

Query: 16  PYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDK 75
           P+     +S+    C  C   +    C  C+V  +C   CQ      H+ EC  L     
Sbjct: 110 PFAYSLLDSTKNHYCWTCLGETVDLTCDKCKVAKFCSKQCQTSGAFDHKYECGPLHNC-- 167

Query: 76  EKRKSVTPSIRLMLKLYLRRKL---QNDNVIPSTTTDNYS--LVEALVAHMSDI--DEKQ 128
              K +    R++L++  R K     ND  I    T+  S   V  +  H +D+  DE  
Sbjct: 168 ---KDLNTDERMLLRIIGRYKDILDGNDKKIDGFYTNRESGRSVMQIWEHCADMKKDENA 224

Query: 129 LLLYAQIANLVNLILQWPEISINEIAENF-SKLACNAHTICNSE-LRPLGTGLYPVISII 186
           + ++ +I + V        +   + A    S+   N H+I N + LR +G GLY  +   
Sbjct: 225 MNVFKKIYDRVKQYGDKTHLVDKDTAFQLHSRNFINRHSISNVDYLREIGKGLYLDLCRY 284

Query: 187 NHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
           +HSC PNA+    G +A +RA+     +        +YIE     + R+  LKE + F C
Sbjct: 285 DHSCRPNAIYSCNGTVAKLRALHDNVDLENVETTHYTYIELPPCKIQRRHMLKETWYFEC 344

Query: 244 TCPRC 248
            C RC
Sbjct: 345 HCERC 349


>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
 gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
          Length = 546

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 123/291 (42%), Gaps = 34/291 (11%)

Query: 144 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 203
           +W E   +EI +    L  N H +  +E  P    +Y   S++ HSC PN    F  +  
Sbjct: 224 KWSE---DEILKIVGILQVNGHEVPLTE--PPSVAIYNNASMLEHSCRPNLSKSFTSKKE 278

Query: 204 VV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 262
           +V  A   + +G  + ISY +    T  RQ  L++  LF CTC RC+   +F     +  
Sbjct: 279 IVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGTYLSALK 338

Query: 263 LEGYRCKDDGCSGFLLRDSDDK---GFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALT 318
             G++ KD  CSG LL ++      G+ C +C GLV  KE         NIL +  +   
Sbjct: 339 CSGFK-KDSNCSGLLLPENLKNWYGGYICNKCRGLVDGKE-------ITNILDRARVDHE 390

Query: 319 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL----------MELEDWKEA 368
           +     E    +K I    + L  P    L+  +  L +I+          +  ED    
Sbjct: 391 AMQKDNE-QHCHKYIAHYGRWL-GPNHHLLVDVKISLSQIMGGGDPNAIQKISDEDLMTK 448

Query: 369 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD-TENAIKSMTEAV 418
           +  CQ  I V+QRV P    ++G   +   +L   L +     ++SM  AV
Sbjct: 449 MKICQELIDVFQRVCPAEARVIGTTRF---ELHAALAEFARRGVESMNSAV 496


>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
 gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
          Length = 405

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 45/73 (61%), Gaps = 2/73 (2%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 235
           GL+P  S +NHSCLPNAV    G + VV A + + +G+EVLI+Y+  A       RQ  L
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVVGGIMVVVAARKIRQGSEVLINYLGRASLRPVGERQGQL 181

Query: 236 KEQYLFTCTCPRC 248
            E Y F+C CPRC
Sbjct: 182 AEGYHFSCDCPRC 194


>gi|195496367|ref|XP_002095664.1| GE22530 [Drosophila yakuba]
 gi|194181765|gb|EDW95376.1| GE22530 [Drosophila yakuba]
          Length = 530

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 145/337 (43%), Gaps = 51/337 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF--------ASSNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P  +S+   C GC         +  N  KCS+C   W  CG  C+  
Sbjct: 70  GEQILKESPLVLGPKVASA-PLCLGCHRNLLAPEKSRGNYHKCSSCS--WPLCGKECE-- 124

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLK-----LYLRRKLQNDNVIPSTTTDNYSL 113
           D   H+ ECQ++S  + + + +  P      +     + L R +Q     P      Y+L
Sbjct: 125 DSVHHKAECQLMSGSNFQSKINYVPGEDERKESAYCVIMLLRCMQLKAKDPEAFLKLYTL 184

Query: 114 VEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTIC 168
            +    H+ +  E  L  +L A +   +  +L    W E+ I  IA   + L  N   + 
Sbjct: 185 ED----HLKERLETPLYQVLRANLITFIKTVLGMKNWLEMDILRIA---AILDTNTFEVR 237

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGS 227
               R     LYP  ++I+H C+PN    F+  + +V  A + + KG  + ISY +   S
Sbjct: 238 QPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRS 297

Query: 228 TMTRQKALKEQYLFTCTCPRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--- 281
           T+ R+  L++   F C+C RC    +LG F   Q        +CK    +G ++  +   
Sbjct: 298 TIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC-----LKCK----AGKIISLNPLL 348

Query: 282 DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
           +   + CQ C   RS +E+     ++  E+  L K T
Sbjct: 349 NSAPWKCQLCNYKRSAKEVVTSDAELQQELESLDKTT 385


>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 644

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 127/294 (43%), Gaps = 13/294 (4%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           +++ + E  +K   +   +  ++   +   +Y   S++NHSC PN +  ++G    +RA 
Sbjct: 346 NVHAVTEVATKTDSSTSFVATTQQIRIAVAVYGTASMLNHSCTPNVIAGYDGNQLTIRAT 405

Query: 209 QHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 266
           + + KG EVL  Y           R K L++QY FTC C  C  + Q + I  S  L   
Sbjct: 406 EMIKKGGEVLHCYGPRVSDMFRDERLKVLRDQYYFTCKCMFC-GIPQ-EAITPSGALRCP 463

Query: 267 RCKDDGCSGFLLRDSDDKG----FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
           RC D   SG + +  +  G    F C     +   +E  KI +   +       +     
Sbjct: 464 RCSD--MSG-VYQTPEVTGQCCNFDCHHSWKLGETDE--KIENARQLFRVGKTLIGDKNK 518

Query: 323 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 382
            +E +ST     K+Q+ + +  +  L +T + L        ++  A  Y +L+I     +
Sbjct: 519 VKECLSTLHKAYKIQQSVLYKHNKELAETEDCLALCYASFGNYHYAAKYLKLSIKSNIIL 578

Query: 383 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 436
           Y +    +G + Y   ++ +   +    +  + +AV+I+   +G N   +KEL+
Sbjct: 579 YGEDSIEIGHELYKLAQILFNCKNVTEGLGVIDKAVKIISDHYGENHSDVKELL 632



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 42/85 (49%), Gaps = 3/85 (3%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQKLDWK-LHRL 65
           GE+II ++PY CV   S   +RC  C   +     C  C  V YC   CQ+  WK  H +
Sbjct: 87  GEIIIKEKPYGCVLLPSHYNTRCYHCVRKTVAPIPCCTCTHVRYCSVECQQESWKSYHYI 146

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLK 90
           EC +   L +    S   S+R++LK
Sbjct: 147 ECPLWPFLSQAGNFSQL-SLRILLK 170


>gi|170048026|ref|XP_001851501.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870252|gb|EDS33635.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 521

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/283 (24%), Positives = 121/283 (42%), Gaps = 51/283 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNCQKLDW-KLHR 64
           GE+++ + P + V     + +RC+ C   +  +L  C  C    YC   C+K  + K HR
Sbjct: 184 GEIVLHENPSLVVIEPEVTFTRCNHCGRRNEYDLIPCKTCSSAMYCSEQCRKEAFTKYHR 243

Query: 65  LECQVLSRLDK-EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA---LVAH 120
            EC+++  L    K    T   +L L+L+         V+     D    ++    L A+
Sbjct: 244 FECEIVEDLKNLFKGPKTTRMFQLTLRLFWM-------VVADLIADRDQFLKRYADLSAY 296

Query: 121 MS--DIDEKQLLLYAQIANLVNL--------ILQW----------------PEISINE-- 152
            +   ID+  L L+    NL ++        + Q+                P   +NE  
Sbjct: 297 RNPLQIDKSTLHLHVLADNLPDMSADQTGKGVTQFLTALTYKLALEENDSVPRELLNEHQ 356

Query: 153 --IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
             + E   +LA  A  +C+     + + L+P+  ++NHSC PNA  V  G  +++ A + 
Sbjct: 357 DLLLEVLFRLALQARLVCDQSPADI-SCLFPLFRMVNHSCAPNAERVLNGERSMLVAKRP 415

Query: 211 VPKGAEVLISYIETAGSTMT-----RQKALKEQYLFTCTCPRC 248
           +  G +VL+ Y    G+T +     R+  L+ ++ F C C  C
Sbjct: 416 IRAGEQVLVCYFPN-GTTDSVPKDKRRAQLQREFKFDCQCLGC 457


>gi|358391549|gb|EHK40953.1| hypothetical protein TRIATDRAFT_29913 [Trichoderma atroviride IMI
           206040]
          Length = 554

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/275 (24%), Positives = 115/275 (41%), Gaps = 50/275 (18%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFAS-------------------SNLKKCSACQ 46
            +GE+I +  P V       S   CD CF                      ++ KC  C 
Sbjct: 76  AAGELIFTSVPLVLCAEVGDSKEACDFCFQQRRRAIHPVEDRLADPGETLPDVYKCMGCN 135

Query: 47  VVWYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLY--LRRKLQNDNVI 103
           +  YC  +C +  W   H  EC +L+    E        IR++ ++   LR+K+    ++
Sbjct: 136 LYQYCSESCWQRAWDTGHLYECGLLANAPYEL------EIRMLYRILILLRKKV----LL 185

Query: 104 PSTTTDNYSLVEAL--VAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSK 159
           P         V+AL  +AH  D  E+    +  + ++     +    E+ + ++ + +  
Sbjct: 186 PEQ-------VQALARLAHEQDKYEQLSSDWQGVKDIAAEAKRRMKSELDVADVLKLYCL 238

Query: 160 LACNA----HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
           + CNA     T  NS   PLG+ +    +++NH+C PN V+VF       RAV+ +  G 
Sbjct: 239 IRCNAVPVDQTFRNS---PLGSAIDLGAAMLNHNCEPNIVIVFNSTRVEARAVRSIKAGE 295

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           E+   Y + A     R   +  +Y F C C RCI+
Sbjct: 296 ELQHCYRDIAYDCTFRSPRIAARYQFKCQCDRCIR 330


>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
          Length = 583

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 105/265 (39%), Gaps = 37/265 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-------------------LKKCSACQVV 48
           GE+I +  P V        +  CD CF                       L  C+ C++ 
Sbjct: 109 GELIFTSVPLVTCAEVGPGMEACDFCFQQRRRVFHPVEDRFLQPGEVLPPLHICNGCRLY 168

Query: 49  WYCGSNCQKLDWKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTT 107
            YC  +C +  W   H  EC +L+        S     R + +L +   L    V+    
Sbjct: 169 QYCSQSCSQRAWDTGHLYECGLLAG------ASADVETRTLYRLLI---LMRKKVLLPQQ 219

Query: 108 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKLACNAH 165
               + +E  VA+     +K    + ++ NL     +    E+SI E+   +  + CN+ 
Sbjct: 220 VKALARLENEVANFEKRTKKS---WPRVLNLAREAKERTKSELSIGEVLMLYGIVRCNSL 276

Query: 166 TICNSELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 223
            + +   R  PLG  L    +++NH C PN V+VF      VRA++ +  G E+L  Y +
Sbjct: 277 PV-DQTFRNAPLGIALDMGGALLNHCCDPNVVIVFNSTQVQVRALRKIKDGEELLHCYRD 335

Query: 224 TAGSTMTRQKALKEQYLFTCTCPRC 248
            A     R   +  +Y F C C RC
Sbjct: 336 IAYDFTFRNPRITARYQFNCQCDRC 360


>gi|194751716|ref|XP_001958171.1| GF10789 [Drosophila ananassae]
 gi|190625453|gb|EDV40977.1| GF10789 [Drosophila ananassae]
          Length = 438

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/333 (25%), Positives = 145/333 (43%), Gaps = 43/333 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS--------SNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P +  P  +S+   C GC  +         N  KCS+C   W  CG  C+  
Sbjct: 71  GEQILKEAPLILGPKVASA-PLCLGCHRNLLAPEKRKGNYYKCSSCS--WPLCGKQCE-- 125

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSI-----RLMLKLYLRRKLQNDNVIPSTTTDNYSL 113
           D   H+ ECQ++S  + + + +  P            + L R +Q     P +    Y+L
Sbjct: 126 DSPHHKAECQLMSSSNFQSKINYVPGEAERKESAYCVIMLLRCMQLKTKDPESFAKLYTL 185

Query: 114 VEALVAHMSDIDEKQLLLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNS 170
            + L   + D    Q+L  A +   +  +L    W E+ I  IA   + L  N   +   
Sbjct: 186 EDHLKERL-DTPLYQVL-RANLITFIKTVLGLKDWSELDILRIA---AILDTNTFEVRQP 240

Query: 171 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTM 229
             R     L+P  ++I+H C+PN    F+  + ++  A + + KG  + ISY +   ST+
Sbjct: 241 RERIKIRALFPGAAMISHDCVPNMRHRFDDDMNIIFLAKRKISKGEILSISYTQPLRSTI 300

Query: 230 TRQKALKEQYLFTCTCPRCI---KLGQFDDIQESAILEGYRCKDDG-CSGFLLRDSDDKG 285
            R+  L++   F C+C RC    +LG F   Q        +CK     S   L++S +  
Sbjct: 301 QRRLHLRQAKCFDCSCARCQDPEELGSFAGAQNCV-----KCKAGKIVSMNPLQNSAN-- 353

Query: 286 FTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
           + CQ C L R+ +++     ++  E+  L K T
Sbjct: 354 WKCQICNLKRTAKDVLTSDAQLQQELETLDKST 386


>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 742

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 116/283 (40%), Gaps = 48/283 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK--CSACQVVWYCGSNC-QKLDWKLHR 64
           GE++I + P+  V       S C  C          C  C  + +CG  C Q+     HR
Sbjct: 269 GELLIFENPFAFVLLPEYYNSFCYNCCVPLKYYSIPCDNCCTILFCGDKCLQEARNSYHR 328

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC+        +  S+   I +   L LR  ++      S    N   +  L+ H+ D+
Sbjct: 329 WECK--------QGTSIFKCIGIA-HLALRLTIE-----TSQANSNNDQIYNLLTHIDDL 374

Query: 125 DEKQLLLYAQIANLVNLILQ--------WPEISI----NEIAENFSKLACNAHTICN--- 169
              +L  Y+  A L+ + LQ         P + +    NE+  + ++L CN + I     
Sbjct: 375 KSLELYQYSLTATLLLIYLQKKTDFFEKHPNLVLDSVGNELLHHMTRLVCNGNAISTHML 434

Query: 170 ------------SELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
                        E +P +GT ++P  S++NHSC PN       +  V++A + + +G E
Sbjct: 435 SDYDSGSRTPIIDESQPRIGTAIFPTSSLLNHSCDPNIFSSNILKYVVIKASRDISEGEE 494

Query: 217 VLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
           +   Y         + RQ +LK QY F C C  C+   Q DD+
Sbjct: 495 ITNCYGPNFLRMRVVDRQASLKNQYHFDCECNTCLD-PQADDL 536


>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 459

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 147 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 206
           + SI    E   K+  NA    + +    G  L PV+++ NHSC+P+A + F+ R AV+R
Sbjct: 89  DFSIPRAREILCKIQTNAFNRLDPDAGMTGIYLDPVLAMANHSCMPSAFVSFDQRNAVLR 148

Query: 207 AVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRC 248
           A + + +G E+ I Y++ T  +   RQ+ALK  Y F C CPRC
Sbjct: 149 AWRDIKEGDEITICYVDVTLPNKAARQEALK-LYHFECRCPRC 190


>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
          Length = 687

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/351 (22%), Positives = 148/351 (42%), Gaps = 58/351 (16%)

Query: 129 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           +LL  QI  N + ++       +  + + FS  +     +C+ E   +   +Y   S  N
Sbjct: 321 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 378

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 244
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 379 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 438

Query: 245 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 268
                            CP+   LG           ++    +I E + C          
Sbjct: 439 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 498

Query: 269 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 319
            +D    G L   SD     D GF  +C+ Q  L  +     +  S++N L +    L S
Sbjct: 499 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 554

Query: 320 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 377
             N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+ +I 
Sbjct: 555 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 613

Query: 378 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 428
           + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 614 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 664


>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 628

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 105/501 (20%), Positives = 192/501 (38%), Gaps = 91/501 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISR-CDGCFAS--SNLKKCSACQVVWYCGSNCQKLDWK-LH 63
            +++  ++P+  V       S  C+ C  S   NL  CS C    +C  NC    +   H
Sbjct: 136 SQILFVEKPFSFVTLECDGASDLCENCCRSCGKNLTPCSGCIDTVFCDVNCWNEAYSSHH 195

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
           R EC V +++   ++  V     L LK  L+  +  DN +       ++ V+ LV    +
Sbjct: 196 RWEC-VGNQMRLWQQIGVA---HLGLKTLLKCTMTTDNSM-------FNRVQQLVTGFDN 244

Query: 124 IDEKQLLLYAQIANLVNLIL----------QWPEISINEIAEN---FS------------ 158
           +    L++Y   A ++ L L             E  +++  +N   F+            
Sbjct: 245 LSANDLIVYGITATMLTLYLTKYTDYFKVCNVREHLVSKFTDNTFNFNNDLATESDERVY 304

Query: 159 ----------KLACNAHTI-------------CNSELRPLGTGLYPVISIINHSCLPNAV 195
                     +L CN H I             C  +   + T +YP  S++NHSC PN +
Sbjct: 305 VSSLLLRHVLQLICNGHAITKLNKIASDKDKLCVEQQDRIATAIYPSASMMNHSCDPNII 364

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQ 253
             F  +  +VRA++ +  G EV   Y       S   RQ+ LK QY F C C  C+ + +
Sbjct: 365 TSFVDQYLIVRAMKDIQAGEEVSNCYGPNFRRMSREHRQEILKNQYGFECRCDPCV-MPE 423

Query: 254 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG---LVRSKEEIKKIASEVNIL 310
           +++  E       +C +  C G  L +       C  CG    +  + E+K   +  +  
Sbjct: 424 YENFMER--FHAIKCPE--CGG-ALEEFHSYSMHCLDCGATPYINCQIELKNAVTAFDAA 478

Query: 311 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 370
                        +E V   K    +++ L + +  +L  T + L K+   +  W E+++
Sbjct: 479 Q----IYIELEREEEAVVKLKECLSIRRNLLYKYHQDLTATLDILGKVYAIMGRWLESIS 534

Query: 371 YCQLTIPVYQRVYP--------QFHPLLG-----LQYYTCGKLEWFLGDTENAIKSMTEA 417
           + + ++   +  Y         + + L       LQ       +W+    +   + +  A
Sbjct: 535 HIEHSMAAIEERYGCSSMEVANELNKLTDICIRYLQEEPNRTTKWYKNVLKKTRRYLDRA 594

Query: 418 VEILRITHGTNSPFMKELILK 438
            EI+  T+G      +E+I K
Sbjct: 595 EEIVNFTYGPWHEIHQEIIEK 615


>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
 gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 793

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/351 (22%), Positives = 149/351 (42%), Gaps = 58/351 (16%)

Query: 129 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           +LL  QI  N + ++       +  + + FS  + +   +C+ E   +   +Y   S  N
Sbjct: 427 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 484

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 244
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 485 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 544

Query: 245 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 268
                            CP+   LG           ++    +I E + C          
Sbjct: 545 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 604

Query: 269 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 319
            +D    G L   SD     D GF  +C+ Q  L  +     +  S++N L +    L S
Sbjct: 605 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 660

Query: 320 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 377
             N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+ +I 
Sbjct: 661 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 719

Query: 378 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 428
           + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 720 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 770


>gi|307195197|gb|EFN77181.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 693

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 119/314 (37%), Gaps = 73/314 (23%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVWYCGSNCQKLDW-K 61
           G V+I   P+    + S+ I  C  C A+  L+      C  CQ V +C   C+K  W +
Sbjct: 260 GAVLIVDRPFSYSTDASALIRNCLHCHATLKLENSVRIPCRNCQTVSFCTETCRKEAWQR 319

Query: 62  LHRLECQVLSRL-------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTD----- 109
            HR EC V           + ++R  +  + R  +   L     ND    S   +     
Sbjct: 320 YHRYECSVFDYFFESAPNGECQRRSHLLLAYRTTVLQALSVDTSNDTSETSCVLNSDFLR 379

Query: 110 ------------------------------NYSLVEALVAHMSDIDEKQLLLYAQIANLV 139
                                         +Y  V  L  H +D+     L+ A  A  +
Sbjct: 380 YHANGNANAEDDDISKECADLGTKKPYSPLDYRTVYQLETHYADMGANVKLIRAIEAVFL 439

Query: 140 N--LILQWPEISINEIAENFSKLA-----------CNAHTICNS---------ELRPLGT 177
              LI    ++ +    E F  LA           CNA+ I  +         E R +G 
Sbjct: 440 AKCLIFVLSKLDVVCTKETFVPLAVAMLHHLQAIDCNAYEIIENVHDEATRVWEPRNIGG 499

Query: 178 GLYPVISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKA 234
            +Y  +S++NHSC PN V   +   + VVRA++ + KG E+   Y     + S +TR++ 
Sbjct: 500 AIYTTVSLVNHSCYPNVVRHSYPNGMIVVRALRSISKGCEIFDCYGPQFLSESRLTRREF 559

Query: 235 LKEQYLFTCTCPRC 248
           L ++Y F C C  C
Sbjct: 560 LWKKYRFLCECNAC 573


>gi|222613106|gb|EEE51238.1| hypothetical protein OsJ_32096 [Oryza sativa Japonica Group]
          Length = 718

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/351 (22%), Positives = 149/351 (42%), Gaps = 58/351 (16%)

Query: 129 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 187
           +LL  QI  N + ++       +  + + FS  + +   +C+ E   +   +Y   S  N
Sbjct: 352 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 409

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 244
           HSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y F+C  
Sbjct: 410 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 469

Query: 245 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 268
                            CP+   LG           ++    +I E + C          
Sbjct: 470 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 529

Query: 269 -KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKTLALTS 319
            +D    G L   SD     D GF  +C+ Q  L  +     +  S++N L +    L S
Sbjct: 530 DEDIVKVGKLFFKSDTMFNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLKE----LPS 585

Query: 320 CGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP 377
             N  EV+     + +E+++ KL HP+S  L Q+ + + +   ++ D ++A  +C+ +I 
Sbjct: 586 LDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCEASIK 644

Query: 378 VYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 428
           + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 645 ILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 695


>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
 gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
          Length = 515

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/307 (25%), Positives = 127/307 (41%), Gaps = 29/307 (9%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVW-YCGSNCQKLDWKLHR 64
           E+I+ + P V  P   S+   C GC  S   ++  +C  C   W  CG  C+ LD   HR
Sbjct: 65  EIILKEAPLVRGPAQISA-PVCMGCLNSIEPTDHIECEQCG--WPLCGPECKSLD--EHR 119

Query: 65  LECQVLSRLDKEKRKSVT-PSIRLMLKLYLRRKLQNDNVIPSTTTDN---YSLVEALVAH 120
            EC    +L K++ + V     +    LY         ++  T   N   +S +E+L + 
Sbjct: 120 AEC----KLTKDRGQKVNVQEFQGPHPLYTCVSTVRCLLLGETNAANAKKFSDLESLEST 175

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLY 180
                + +  L + I   +    +  + S  EI      L  N H +  S+  P    ++
Sbjct: 176 RRGSSQWKADLTS-IGQFIPKFFKTQKFSEEEIMRTVGALQINGHEVPTSD--PPHVAVF 232

Query: 181 PVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
            V S   +SCLPN    F      ++ A Q + K + + I Y +    T  RQ+ L +  
Sbjct: 233 YVASFTENSCLPNLAKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTK 292

Query: 240 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRS 296
           LF C C RC      D  +        +C+D  CSG +L    D    G+ C++CG    
Sbjct: 293 LFKCACLRCQ-----DPTELGTNYSAIKCEDRKCSGLMLPTKADDWHGGWQCKECGKQVP 347

Query: 297 KEEIKKI 303
           K+ ++ I
Sbjct: 348 KQYVEGI 354


>gi|195382848|ref|XP_002050140.1| GJ20356 [Drosophila virilis]
 gi|194144937|gb|EDW61333.1| GJ20356 [Drosophila virilis]
          Length = 662

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/381 (20%), Positives = 153/381 (40%), Gaps = 50/381 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK-KCSACQVVWYCGSNCQ-KLDWKLHRL 65
           G+ ++ + PYV V     + + C+ CF  + +   C  C  V YC   CQ +   K H+ 
Sbjct: 264 GQELLVEHPYVAVLLEKFAQTHCEYCFVRTVVPVACPGCSDVIYCSEQCQERASAKYHKY 323

Query: 66  ECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT-------------DNYS 112
           EC +L  +    R   + +  + L++   + L     + S+               D++ 
Sbjct: 324 ECGILPVI---WRSGASINNHMALRIIASKPLDYFLQLKSSLDEELSLEQLLSLPKDDFR 380

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQ---------WPEISINEIA----ENFSK 159
            V  L  H           Y  +A  +   LQ          PE  ++ I+     +   
Sbjct: 381 RVAHLERHEGQRQPSNFFQYVLMARFLTKCLQSTGYFGSEPQPE-QVSAISALLLRSLQF 439

Query: 160 LACNAHTIC-----NSELRP----LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 210
           +  N H +       +E R     +G  +YP +++ NHSC P  V  F G    + +V+ 
Sbjct: 440 IQFNTHEVAELHKFQAERREKSIFIGGAIYPTLALFNHSCDPGVVRYFRGTTIHINSVRP 499

Query: 211 VPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYR 267
           +  G  +  +Y  I T      RQ  LK+ Y F C C  C++    F+++    I   +R
Sbjct: 500 IEAGLPINENYGPIYTQDRREDRQARLKDLYWFECNCDACLESWPLFEELPRDII--RFR 557

Query: 268 CK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 324
           C+  + C+  +      +D    C  CG + +  +  K+  +  ++++    L   G++ 
Sbjct: 558 CEAPNNCAAIIEVPPTCNDFMIKCVTCGEITNILKGLKVMQDTEMMTRTAKRLYDTGDYS 617

Query: 325 EVVSTY-KMIEKLQKKLYHPF 344
           + ++ +  ++  + + L  PF
Sbjct: 618 KALNKFVDLLRIMYEVLAPPF 638


>gi|50552992|ref|XP_503906.1| YALI0E13552p [Yarrowia lipolytica]
 gi|49649775|emb|CAG79499.1| YALI0E13552p [Yarrowia lipolytica CLIB122]
          Length = 637

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 69/277 (24%), Positives = 112/277 (40%), Gaps = 48/277 (17%)

Query: 3   SFHVSGEVIISQEPYVCVPNNS---------------------SSISRCDGC---FASSN 38
           S++ +G+VI S EP V VP +                       +   C  C   + + N
Sbjct: 22  SYYSTGDVICSFEPAVNVPLSGRLLVGGDADGTIEVEQEARLDPTTGVCAFCLETYPARN 81

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           +  C  C+++ YC   C + D  LH  EC  L    K        ++  +L +  + + +
Sbjct: 82  ITICQDCKLMSYCSQRCLEAD-TLHSPECTDLCGPQKLIMSGFLRALYRILCMAEQPRPR 140

Query: 99  NDNVIPSTTTDNYSLVEALVAHMSD-----IDEKQLLLYAQIANLVNLILQWPEISINEI 153
           N           Y+    L  H +D     +D  ++   A ++   N     P   + +I
Sbjct: 141 N----------TYA---QLTHHTTDTSWPAMDTLRMAAEALVSR--NASRGHP---LEQI 182

Query: 154 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
            E    L  N +T C+   R  G    P +++INHSC+PNA L+F GR   +   + +  
Sbjct: 183 LEYAGMLFVNQYTRCDDLGRQGGYIFDPTLALINHSCVPNAYLLFRGRKVHLVCWKPIND 242

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           G EV +SY      T  R+  L   + F C CP C+ 
Sbjct: 243 GDEVFLSYTRFMHPTPERRTLLYMHFRFWCECPGCVS 279


>gi|302682538|ref|XP_003030950.1| expressed protein [Schizophyllum commune H4-8]
 gi|300104642|gb|EFI96047.1| expressed protein, partial [Schizophyllum commune H4-8]
          Length = 423

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/253 (26%), Positives = 104/253 (41%), Gaps = 24/253 (9%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGC--FASSN-LKKCSACQVVWYCGSNCQKLDWKL 62
           + G VI  Q        ++    RCD C   AS   L++CS C   +YC  +CQ+  W+ 
Sbjct: 20  IEGSVIFGQYALATALLSTEKGRRCDACTRLASDKPLRRCSGCAEYYYCDEHCQRYHWQR 79

Query: 63  -HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
            H+  C++ S           P    M  + L   L    + P       S V+ L++  
Sbjct: 80  NHKRICKLYSSFSASL--GSLPEHGRMDAVLLSHLLAQ--MTPDDLATESSPVQTLLS-- 133

Query: 122 SDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 181
                  LL  +Q  N    +   P    + + +       N +   +S L   G G++P
Sbjct: 134 -------LLPSSQEENPAPPVCYVPRFVPSILVQTLFDRFQNNNFAMHSHLSTFGHGIFP 186

Query: 182 VIS-IINHSCLPNAV--LVFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           + S + NHSC+PNA    +   R  V   V A++ +  G E+ I Y++ A    TRQ+  
Sbjct: 187 LASRLFNHSCVPNAAARYILRPREPVKMEVVALRPITAGEEICIPYVDPA-LLETRQQIF 245

Query: 236 KEQYLFTCTCPRC 248
           K  Y F C C  C
Sbjct: 246 KLSYGFECRCTSC 258


>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 674

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/383 (21%), Positives = 151/383 (39%), Gaps = 74/383 (19%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVWYCGSNCQKLDWK 61
           +G ++I + P+    NN +    C  C  +  L       C  C+ V +C   C+   WK
Sbjct: 245 AGYILIVESPFAFSTNNEAFDRNCLNCHVTLKLNDSVKIPCYFCRTVSFCSEKCRNEAWK 304

Query: 62  L-HRLECQVLSRL---DKEKRKSVTPSIRLMLKLYLRRKLQ-----------------ND 100
           L HR EC +       D E+ +  +  + L  ++ +   L                  ND
Sbjct: 305 LYHRYECSIFDIFCGNDSEQPQRQSSYLLLAYRMTIAGCLLSNTDKVNNMDKTEIPVLND 364

Query: 101 NVIPSTTTD-----------------NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL 143
           N +    T+                 +Y  +  L  H   I+    L+ A  A  +    
Sbjct: 365 NFLRYHGTNTNQECSDLGINEVYSPRDYRTILKLKTHCEKIEPNINLIRAIEAIFLTKCF 424

Query: 144 QW--PEISINEIAENFSKLA-----------CNAHTICNS---------ELRPLGTGLYP 181
            +   ++ +  + E F  LA           CNA+ I  +         E R +G  +YP
Sbjct: 425 TFVLSKMDVVCLKETFISLAVAMLHHLQAINCNAYEIVENLYDKKAHVWEPRYVGGAIYP 484

Query: 182 VISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQ 238
            +S++NHSC PN V   +   + VVR+++ + KG E++  Y     +   + R++ L ++
Sbjct: 485 SVSLVNHSCYPNVVRHSYPSGVVVVRSLRFIGKGTEIVDCYGPHWLSEGRLPRREYLWKK 544

Query: 239 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDSDDKGFTCQQCGLVRS 296
           Y F C C  C +  Q+  + E+     Y+C+   +    F   + D +  + ++C     
Sbjct: 545 YRFLCACEACTQNWQY-PLPETI---NYKCRACLEVTDIFGSNEKDTQNISIKKCHKCNG 600

Query: 297 KEEIKKIASEVNILSKKTLALTS 319
           K + KKI ++     +K L   S
Sbjct: 601 KIDCKKIKNQFRKSVEKRLNAIS 623


>gi|341874291|gb|EGT30226.1| CBN-SET-30 protein [Caenorhabditis brenneri]
          Length = 555

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 102/454 (22%), Positives = 183/454 (40%), Gaps = 70/454 (15%)

Query: 26  SISRCDGCFASSNLK---KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVT 82
            I++ D C+A    K    C  C+V  +C   CQ      H+ EC  L     +K   + 
Sbjct: 34  DITKNDYCWACLGEKVELTCGECEVAKFCSKECQASGAFDHKYECGAL-----KKCSDLN 88

Query: 83  PSIRLMLKLYLRRK-LQ--NDNVIPSTTTDNYSL--VEALVAHMSDI--DEKQLLLYAQI 135
              R+++++  R K +Q  ND  I     +  S+  V  +  H +D+  DE  + ++ +I
Sbjct: 89  TDERMLIRIIGRYKEIQDGNDKKIEGFYKNRESIRSVMQIWEHCADMKKDESAMNVFKKI 148

Query: 136 ANLVNLILQWPEISINEIAENF-SKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN 193
            + V        +   E A    S+   N H+I N + LR +G GLY  +   +HSC PN
Sbjct: 149 YDRVKEFGDKNYLVDEETAFQLHSRNFINRHSISNVDYLREIGKGLYLDLCRYDHSCRPN 208

Query: 194 AVLVFEGRLAVVRAVQH---VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           A+    G +A +RA+     +        +YIE     + R+  LKE + F C C RC  
Sbjct: 209 AIYSCNGTIAKLRALHDNVDLENVETTHYTYIELPPCKIQRRHMLKETWYFECHCDRC-- 266

Query: 251 LGQFDDIQE---SAIL-------EGYR--CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 298
               DD  +   +A++         +R   K  G   +   + +     C++C     +E
Sbjct: 267 ----DDPTDNWLTAVICPICIAKTEFRKTIKLHGPEAYA--NPETLEIVCERCNTTLDRE 320

Query: 299 ----------EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 348
                      I++   +   +SK  + L     +Q  + +Y+ I  +    +      +
Sbjct: 321 YIYMALDGMRTIRRTIEDAEDVSKDPVELLKI--YQGALMSYERILPMSNAYFCQLIAAM 378

Query: 349 --------MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 400
                   M  +E+L+  L       +  A C+   P  + VY   HP   + +   G+L
Sbjct: 379 IPLISRVSMTKKERLVASL-------DLHAKCE---PFVRYVYRYAHPSKAMHFLQMGQL 428

Query: 401 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 434
              L + ++A + + EA  I+    G++    K+
Sbjct: 429 HLELNNDKDAARYLIEAHRIMDYLFGSDHFLSKQ 462


>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
           morsitans]
          Length = 395

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%)

Query: 143 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 200
           L  PE   N++ E    +           L   G+GLY + S INHSCLPNA + F    
Sbjct: 253 LPLPETDKNKLDEYIDDIYHKVGEFAGEFLNNEGSGLYLLQSKINHSCLPNAQVTFPYSN 312

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
            + V++A+Q +  G E+ ISY++      S  +RQK LKE Y+F C C +C
Sbjct: 313 DIVVLKALQPIQVGEEICISYLDEGQLERSRHSRQKILKENYIFVCECFKC 363


>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
 gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
          Length = 393

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 210
           I E ++K+   A    N+E    G+GLY + S INHSC+PNA     +   + V++AV  
Sbjct: 267 IDELYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKAVAP 322

Query: 211 VPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
           + KG E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IEKGDEICISYLDECQLERSRHSRHKTLRENYIFVCQCPKC 363


>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
          Length = 596

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 111/277 (40%), Gaps = 36/277 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS-NLKKCSACQVVWYCGSNCQKLDWK-LHRL 65
           GE+++ ++P+  V   ++  + C  C   +     C  C    +C   C+    +  H+ 
Sbjct: 201 GEIVLIEKPFASVLLRANYSNHCHHCLKHTLEGIPCRTCPDARFCSEACRDTAMQTYHQY 260

Query: 66  ECQVLSRLDKEK-RKSVTPSIRLMLKLY---LRRKLQNDNVIPSTTTD------NYSLVE 115
           EC VL+ L   +  K    + R + K     L+     D  +    +D      +Y+ + 
Sbjct: 261 ECSVLNTLHHSQINKFGCLAFRAITKQSYQSLKDIRAQDLPLNGCHSDGLYRPQDYNTII 320

Query: 116 ALVAHMSDIDEKQLLLYAQIA-NLVNLILQWPEISINEIAE-----------NFSKLACN 163
            LV H  D   + L     +A  L+ L+ Q    +  E  E           +     CN
Sbjct: 321 QLVTHAKDRPVQDLFHRTVMAVYLLKLLQQTSYFNGEEDVEMQAYIAGLFLSHLQSFPCN 380

Query: 164 AHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 213
           AH +    L P          LG G+Y  +S+ NHSC P     F G   VVRA++ + K
Sbjct: 381 AHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFYGDTCVVRAIKTIRK 440

Query: 214 GAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
           G +V  +Y  +    +   R   L+ QY F+C C  C
Sbjct: 441 GHQVSDNYGALYATNTLKERHDKLQPQYFFSCRCEPC 477


>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
 gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 58/237 (24%), Positives = 100/237 (42%), Gaps = 28/237 (11%)

Query: 41  KCSACQVVWYCGSNCQKLDWKLHRL---ECQVL-SRLDKEKRKSVTPSI--RLMLKLYLR 94
           +C  C++  YC + C+    +L+     EC    + ++ E        +  R +L++  +
Sbjct: 26  RCDGCKLECYCSTACKDEARRLYHSSDGECAAFEAGIEAEAIDFEFGDVPERFLLRVLSQ 85

Query: 95  RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA-NLVNLILQWPEISIN-- 151
                    P         V  L AH+   D ++    A I+ N + L+ +  E  +   
Sbjct: 86  AGGWRTPGGPMVVGAGVDRVRMLQAHVPAPDTEERGRLAGISRNTLRLMDESVEDGVRVV 145

Query: 152 ----------------EIAENFSKLACNAHTIC---NSELRPLGTGLYPVISIINHSCLP 192
                           E+ +    + CNAHT+     S L P+G  +Y   S  NHSC+P
Sbjct: 146 DPSGDEPATTSRYGEAELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVP 205

Query: 193 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
           +A     G    VR+++ V  G E+ +SY+ T  +   R++ L+ QY F C C RC+
Sbjct: 206 SAEFCNVGTSLTVRSLRRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARCV 262


>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 440

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 70/275 (25%), Positives = 116/275 (42%), Gaps = 41/275 (14%)

Query: 55  CQKLDWKL------HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS--- 105
           C+K  W L      H LEC    +    +R S    I +    Y     Q  NVI +   
Sbjct: 103 CEKCGWPLCQNCNEHGLEC----KFSSSRRDS---KISITEFGYPHPSYQCINVIRALSL 155

Query: 106 --TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACN 163
             T  ++Y  + +L +H ++++  +  L   IA+ +    +  +IS  EIA     L  N
Sbjct: 156 KDTNPESYKKLISLESHCNEMNNSKEPL--NIAHFIKRFFKADDISEEEIATIIGILQVN 213

Query: 164 AHTICNSELRPLG----TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 217
            H +      PL       +Y + S+I H+C  N    F   G L ++RA   + KG  +
Sbjct: 214 GHEV------PLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHI 266

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 277
            I Y +    T  R+  L +   F C C RC    +F       +    +C +  CSG++
Sbjct: 267 SICYTDPLWGTANRRHHLFKTKFFECICNRCQDPTEF-----QTMFNALKCNNINCSGYI 321

Query: 278 LRDS---DDKGFTCQQCGLVRSKEEIKKIASEVNI 309
           L  +    ++ + C+ C  V S  EI+K+  ++ I
Sbjct: 322 LPKTFLEQEQDYICKICESVVSCTEIEKVLEDIGI 356


>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
          Length = 513

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 97/462 (20%), Positives = 188/462 (40%), Gaps = 69/462 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE II++ P++  P  +++   C  C+ S     +    CS C   W  C + C+  + 
Sbjct: 39  AGEEIITEMPFIVGPK-AATYPLCLSCYVSWPPSLNEKPLCSKCG--WPVCSTECE--NQ 93

Query: 61  KLHR-LECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
             H+  ECQ+ ++ ++         E  ++  P +  +  L L  + + +   P    + 
Sbjct: 94  PQHKDYECQIFAQANEKFNVQAALEETNENGVPQLECITPLRLLLESEKN---PERWKNE 150

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              V+ + AH     +K+     Q+ N+V  I +  ++ ++  +E   +  C    I   
Sbjct: 151 ---VKDMEAHNKIRSQKKQWKLDQV-NIVEYIRK--QLKLDRFSEKEIQTVCGILEINTF 204

Query: 171 ELRPL----GTGLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           E+R         +YP ++++NHSC+ N     +   ++ RL   R    +P G E+  SY
Sbjct: 205 EVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPPGGELYGSY 261

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLR 279
             +   TM R++ L E   F C C RC      D  +    +   +C   D+G    L  
Sbjct: 262 THSLLPTMLRREHLLEGKHFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDS 316

Query: 280 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 339
              +  + C  C    S   ++K+   +++      A+ S  +  + +   + + K  + 
Sbjct: 317 LDPESSWKCTHCEFSTSGSAVRKVLQIIHMEVDAVEAI-SGADGADAIQERETVAKKYRS 375

Query: 340 LYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPL 389
           + HP    L   R  L ++   +ED+          +  +  C+L + V   + P +  +
Sbjct: 376 VLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCKLLLQVLDVIEPGYSRI 435

Query: 390 LGLQYYTCGKLEWFLGDTE--------NAIKS-MTEAVEILR 422
            G+  Y       F   T+         A+KS M EA +IL+
Sbjct: 436 RGMTLYELHAPLLFTAKTQWNAEVIDKAALKSKMIEASKILK 477


>gi|270016446|gb|EFA12892.1| hypothetical protein TcasGA2_TC004406 [Tribolium castaneum]
          Length = 1108

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 111/266 (41%), Gaps = 24/266 (9%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK-LHRL 65
           GE++  ++PY  +  +S S+  C  C     N+  C  C    YC  NC+   ++  H+ 
Sbjct: 186 GEILAIEKPYASIVTDSVSV-YCHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKY 244

Query: 66  ECQVLSRLDK---EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           EC +   LD    +  K +   I L+ +      L N++      +D +  V  L  ++ 
Sbjct: 245 ECPIHLSLDPLLIDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVR 304

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAEN-FSKLACNAHTICNS----------- 170
                 L     IA  V  +++     + E  E+ F ++  +   IC++           
Sbjct: 305 QRFTHDLFGRTTIACGVFYLIKKYTTFLQEYDEDRFKEILLSLLLICSTNTVRVNEVSST 364

Query: 171 ----ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET-- 224
               ++       YP  S+ NHSC PN    + G   V+RA++ + KG +  ++Y  +  
Sbjct: 365 LGEYDVCGFACSHYPFFSMFNHSCWPNVCRSYHGSQMVLRAIRTIKKGEQCFVTYGPSYL 424

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIK 250
           + + + RQ  L   Y F C C  C++
Sbjct: 425 SDNIVGRQAFLFFHYFFNCGCKACVE 450



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQ 238
           YP  S+ NHSC PN    + G   V+RA++ + KG +  ++Y +ET  +       +K+ 
Sbjct: 573 YPFFSMFNHSCWPNVCRSYHGSQMVLRAIRTIKKGEQCFVTYGMETPSNASAPSNNVKQP 632

Query: 239 Y 239
           +
Sbjct: 633 W 633


>gi|390603364|gb|EIN12756.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 450

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 101/261 (38%), Gaps = 45/261 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLE 66
           G+VI+  E        S +  RC+ C   +S+LK+C+ C    YC + CQ   W +H   
Sbjct: 31  GDVILVDEALTTAILPSDAGRRCNHCLRLTSDLKRCAGCAAYHYCSTQCQSQQWSIHH-- 88

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
                       K +   I   +     + L           ++  +   L++H+  I +
Sbjct: 89  ------------KRICKHISSFIASPAYQGLH----------EHEKITAVLLSHLCTISQ 126

Query: 127 KQLLLYAQIANLVNLILQWPEISINEI------------AENFSKLACNAHTICNSELRP 174
                     +    +L  PE +   I            A  +     N + + +S L  
Sbjct: 127 HGKDSLRPALSTFEDLLPGPESATPPICPPPFSTEEQQKATYYHSKFGNNNFVIHSHLTS 186

Query: 175 LGTGLYPVISI-INHSCLPNAVLVF------EGRLAVVRAVQHVPKGAEVLISYIETAGS 227
              G++P+ SI  NHSC PNA   +        R+ V+ A+ H+  G EV I Y++ A  
Sbjct: 187 FAHGIFPMSSICFNHSCAPNAAARYILTPHQVPRMEVI-ALTHIAAGTEVTIPYLDPALP 245

Query: 228 TMTRQKALKEQYLFTCTCPRC 248
              RQ+  +  Y F C CP C
Sbjct: 246 LANRQQITQITYGFICGCPLC 266


>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
          Length = 348

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 76/332 (22%), Positives = 139/332 (41%), Gaps = 37/332 (11%)

Query: 136 ANLVNLILQWPEISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNA 194
           A+L+        +S +E+ E   +   N H I +   +  +G GLY  +   +HSC PN 
Sbjct: 25  ADLLAFYGSKAMVSKDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNT 84

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVL------ISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           +      +A +R +     G ++        SYI+   +T  R+K LK+ + F C C RC
Sbjct: 85  IYTCNSFVATLRGLT---AGVDLRNLNSTHYSYIDLINTTQQRRKLLKDTWYFECHCTRC 141

Query: 249 IKLGQFDDIQESAILEGYRCKDDGC----SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA 304
                 DD+  S+IL     K   C         +D + +  TC +C  + S E + +  
Sbjct: 142 ---DDPDDVLLSSILCPNCPKKRECLCIFGNVPYKDRNTQIITCPKCHNIVSPEYVVEAI 198

Query: 305 SEVNILSK--KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS-VNLMQTREKLIKILME 361
           + +  + K  +   +      Q +    + +  L+K+     S VN+     K+I++L+ 
Sbjct: 199 AAMRFIDKIVENREVEKMSREQSI----RFLTDLKKRFSELLSKVNVFLC--KVIQLLIP 252

Query: 362 LEDWKEALAYCQLTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 414
           L D  +    CQL + ++       +  +P  HP +G+ Y +             A+   
Sbjct: 253 LIDISD----CQLLLDLHLETEECVRFCFPYNHPAVGIHYRSIATFFLKCKQPHRALLYY 308

Query: 415 TEAVEILRITHGTNSPFMKELILKLEEAQAEA 446
            +A E+++ T G+  P   E    L++A  E 
Sbjct: 309 KKAYEVVKFTLGSKHPMTIETDAMLKDASREV 340


>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
          Length = 802

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 232
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 268
           + L+ QY F C CP C +  Q          E +RC
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGR--EAFRC 619


>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 524

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 109/271 (40%), Gaps = 77/271 (28%)

Query: 42  CSACQVVWYCGSNCQKLDW-KLHRLECQ----------------------VLSRL----- 73
           C  C+V +YC  +CQ+L W + H+++C                       ++SR+     
Sbjct: 261 CGHCKVEYYCSESCQQLAWSQYHQIQCPHMSIIKQQLYPHCIENEQTNPLIISRMFAMVL 320

Query: 74  ------DKEKRKSVTP----------------SIRLMLKLYLRRKLQNDNVIPSTTTDNY 111
                  KE  +S+ P                 +  +LK   + + Q D V+   T + Y
Sbjct: 321 QSIIYQQKEFTESIIPFTFFISNGQWQSKGDEKVLSILKSIFKNESQLDQVL---TIERY 377

Query: 112 SLVEALVA-HMSDID---------EKQLLLYAQIANLVNLILQWPEISINEIAENF--SK 159
               +++  + S I+         EK  +    + + + L+   P++   E   N   S 
Sbjct: 378 RQFNSIIQCNASKINPLSRIHMLIEKNTIESPALQDYIQLVEDGPKVPFGEPLLNLLESL 437

Query: 160 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEV 217
             C  H          G+GLY   +  NHSC PNA +  E     A +R++  +P G ++
Sbjct: 438 SDCCVH----------GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQI 487

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
            ISYIE    + TRQ  L ++Y F C C +C
Sbjct: 488 EISYIEEDQPSQTRQSELIDKYKFKCHCQKC 518


>gi|407405659|gb|EKF30538.1| hypothetical protein MOQ_005654 [Trypanosoma cruzi marinkellei]
          Length = 585

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 142/345 (41%), Gaps = 46/345 (13%)

Query: 36  SSNLKKCSACQVVWYCGSNCQKLDWKLH-RLECQVLSRLDKEKRK-----SVTPSIRLML 89
           ++ L+KC  CQ V YC   C     + H   EC++  RL +  R      +  P   + +
Sbjct: 206 AAALRKCRRCQFVSYCSDTCASCHDRQHEEYECRLFFRLREMLRSMQSCDAAVPDDFVTM 265

Query: 90  KLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS 149
             +    +    V      + +  V  L +H  ++ +    L   + +L N      E  
Sbjct: 266 ATHCITTVSGVKV----RKEGHEAVLRLESHEVEVSQGLTPLLRLVQDLFN------EED 315

Query: 150 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 207
              I      + CNA  +C++   P+G  L+   V S  NHSCLPN  +  +  +   RA
Sbjct: 316 PTFITRILGVVRCNALAVCDASGLPVGQALHVASVTSYFNHSCLPNCAIEADA-IVTTRA 374

Query: 208 VQHVPKGAEVLISYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEG 265
           ++    G E+ ISY+ +       R++ L E+Y F C C RC   G  D    ESA+   
Sbjct: 375 IR---PGEELTISYLPQLYWPAWLRREELAERYFFDCRCMRC---GDGDRHPFESALSAT 428

Query: 266 YRCKDDGCSGFLLRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTL--ALTSCG 321
            R           R+ + +  +  Q  CG VR+K+       +V++  +  L   L  C 
Sbjct: 429 LRLGGS-------REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDALLHLLQECR 476

Query: 322 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 366
            H  +   + +  +LQ  L   ++V L  TR  L   L EL  W+
Sbjct: 477 QH--LFPFHYLCHELQNTLSFVYAV-LGDTRACLCSCLRELVLWE 518


>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
          Length = 802

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 233 KALKEQYLFTCTCPRC 248
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|91089699|ref|XP_974834.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
 gi|270011321|gb|EFA07769.1| hypothetical protein TcasGA2_TC005323 [Tribolium castaneum]
          Length = 543

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 69/275 (25%), Positives = 120/275 (43%), Gaps = 47/275 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKL-HRL 65
           GEV+  ++P V +  N  S + C  C +   NL  C  C    YC  +C+   + + H+ 
Sbjct: 208 GEVLAVEKPLVTLVVNELS-NHCHECVSLCYNLIPCKTCTQAMYCSESCRDYAFDMYHKY 266

Query: 66  ECQVLSRLDKEKRKSVTPSIRLM----LKLY-LRRKLQNDNVIPSTT----TDNYSLVEA 116
           EC +L+            ++R +    LKL+ L+  LQ +++  ++     +D Y  +  
Sbjct: 267 ECSILA------------TLRFLQFDKLKLFALKISLQYNDLGETSDLLYRSDRYREIHN 314

Query: 117 LVAHMS-----DIDEKQL---LLYAQIANLVNLILQWPEISINEI--------AENFSK- 159
           LV + +     D+ E+     L+Y  +    N    + + +  EI          NF + 
Sbjct: 315 LVTNTTKRSVPDLFERATAAALIYDLVKTHTNFFSAFNQNNFKEILLLHMQTGPSNFHEI 374

Query: 160 --LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
             L  N+  I   E   + +G +  +S++NHSC PN      G   V+RA+Q++ +G + 
Sbjct: 375 VELVPNSRGIYEPE--EIASGAFAFLSLLNHSCCPNVARFSYGSTLVLRAIQNIQEGEQC 432

Query: 218 LISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 250
             +Y           R+K L+ QY F C C  C K
Sbjct: 433 FDNYGYHFALMDKSERKKHLQSQYYFNCVCQACEK 467


>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
          Length = 392

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 107/260 (41%), Gaps = 27/260 (10%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS--SNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           GEVI++++P +  P + +S+ S C  CF S  +    C +C+   +C  +C        R
Sbjct: 27  GEVILTEQPLLLYPASLASLPSFCSACFRSLSAAASPCPSCRAAGFCSPSCAAASHP--R 84

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           L C  LS        +          L+L   L   ++   +     SL  A     SD 
Sbjct: 85  LLCTALSGGGGNGNLASAAEPHQEPLLFL---LSAYSLPEPSLRVLLSLSSAATPPPSDQ 141

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP---------L 175
           D   L  +A +A LV   +  P  S +  A   SK   N+ +I     RP          
Sbjct: 142 DPGSL--HAMVAALVPPQMLPPGFSPDLTAALLSKDRTNSFSIMEP-YRPEVPQPLRKAR 198

Query: 176 GTGLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGST 228
              +YP  S++NH CLPNA    +  R        VVRA+  + +G EV +SY       
Sbjct: 199 AYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYFAANWQY 258

Query: 229 MTRQKALKEQYLFTCTCPRC 248
             RQ+ L E Y F C C RC
Sbjct: 259 KDRQQRLLEDYGFRCECERC 278


>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
          Length = 802

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 232
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 233 KALKEQYLFTCTCPRC 248
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
          Length = 561

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 24/154 (15%)

Query: 108 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL-----ILQWPEISINEIAENFSKLAC 162
           +DN  L+E L    S+        Y +I  L+N       +   E  IN  A+N  K+  
Sbjct: 419 SDNKILIEELEKLFSN------KFYDEILGLINYNSISTFVYGEETVINTGAKN-KKMQP 471

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV------QHVPKGAE 216
                C      LG+ L P+ + +NHSC PN  +  E R  V +A+      Q + KG E
Sbjct: 472 KLRQYC------LGSALIPIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQE 525

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           +L SYI+ +     RQ  L  QY FTC C +C K
Sbjct: 526 LLTSYIDESNPFKDRQSILSSQYGFTCNCNKCSK 559


>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
          Length = 802

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 232
           L T L+PV+S++NHSC PN  + F G +A +RA Q +  G E+L  Y   E+      R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585

Query: 233 KALKEQYLFTCTCPRC 248
           + L+ QY F C CP C
Sbjct: 586 QKLRSQYFFDCDCPPC 601


>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 481

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 117/274 (42%), Gaps = 43/274 (15%)

Query: 42  CSACQVVWYCGSNCQKLDWK---LH-RLECQVLSRLDKE---KRKSV--TPSIRLMLKLY 92
           C+ C  V +C + C         +H + EC+ L RL      K  SV     +RL++++ 
Sbjct: 170 CARCGQVSFCSAACATPTPSGRFVHPKAECEALQRLHTTCNLKGFSVDEVAEVRLLIRIL 229

Query: 93  LRRKLQNDNVI------PSTTTDNYSLVEALVAHMSDIDEKQ----------------LL 130
           + R +  ++            +D  S VE   AH  ++   +                L+
Sbjct: 230 VLRWIAANSANVEEDGNNGEGSDTSSAVETSFAHFMELMSNKHSFGSAAINRFRQRAFLI 289

Query: 131 LYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL---GTGLYPVISIIN 187
           + A      + +LQ   + ++EI E   +L  N     ++  R     G+ LY   ++ N
Sbjct: 290 MEALSGRATSDVLQG--LDVDEIVEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAMFN 347

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
           HSC P+ V  F+G+   +RA++ +  G E+ ++YI     T  RQ  L++ Y F C C R
Sbjct: 348 HSCFPSVVRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRCQR 407

Query: 248 CIKLG-QFDDIQESAILEGYRCKDDG--CSGFLL 278
           C   G + +   E  +L    C +D   C G L+
Sbjct: 408 CEARGAEVERYMEECLL----CPNDKRECDGLLV 437


>gi|158284652|ref|XP_307655.3| Anopheles gambiae str. PEST AGAP012638-PA [Anopheles gambiae str.
           PEST]
 gi|157020944|gb|EAA03452.3| AGAP012638-PA [Anopheles gambiae str. PEST]
          Length = 452

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 117/289 (40%), Gaps = 50/289 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNC-QKLDWKLHR 64
           G+V++ + P+V V  +S    RCD C       L  C  C +  YC   C  K   K HR
Sbjct: 161 GDVVMIERPFVTVLKDSFRYVRCDFCHGERPFTLIPCEGCTMAMYCSEECLSKAYNKYHR 220

Query: 65  LECQVLSRLDKEKRKSVTPSIRLM-------------LKLYLRRKLQNDNVI-------P 104
            EC +L  + +   +     IR++             LK +L   L   NV         
Sbjct: 221 YECGLLRDMWEVFEEVSLIDIRMIAIAITTFDNNPEALKDHLD-ALDESNVNGFTMDWNK 279

Query: 105 STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVN-LILQWPEIS-------------- 149
           +T  D ++ V  L  +    D   +  Y   A +++ L+L+  E+               
Sbjct: 280 ATQQDIFNTVHVLTTNQERRDSFFVAFYIFNATILHTLVLERTELGPVCEANPATNKILL 339

Query: 150 -----INEIAENFSKL-ACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGR 201
                  +I E  SKL + NA+ +          G YP+IS++NHSC PN   + + +GR
Sbjct: 340 DLILRYEQIVECNSKLLSFNAYKVKEYVAESFAVGCYPLISMLNHSCAPNVQRITLPDGR 399

Query: 202 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 248
            AV   ++ V +G+++  SY           RQ  L   Y F CTC  C
Sbjct: 400 CAVF-VIRPVLEGSQLFDSYEADHILNKRAMRQSMLSFMYSFRCTCEAC 447


>gi|403414107|emb|CCM00807.1| predicted protein [Fibroporia radiculosa]
          Length = 304

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 9/104 (8%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 199
           S  ++ +  S+   NA T+ +S L  +G  + P ++++NHSC PN V+VF         E
Sbjct: 10  SPGDLVDLISRFTTNAFTLTSSSLSAIGVAISPTVALVNHSCDPNVVIVFPRNPSTSHAE 69

Query: 200 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
             +  V A++ +  G E+L +Y++       RQK LKE Y FTC
Sbjct: 70  EPMMTVVAIKPILPGEEILSAYVDVTQPRELRQKELKETYNFTC 113


>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
 gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
          Length = 565

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 91/404 (22%), Positives = 154/404 (38%), Gaps = 75/404 (18%)

Query: 10  VIISQEPYVCVPN-------NSSSISRCDGCFASSNLK--KCSACQVVW-YCGSNCQKLD 59
           ++ ++ P V  P          S    C GCFA   L   +C ACQ  W  C  +C  L+
Sbjct: 72  ILFTEAPMVIGPKWNLAEYEQRSMTVPCVGCFADCQLGMYRCEACQ--WPACKPDCPGLE 129

Query: 60  -WKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
              LH LEC +L R     +    P        Y R          +    N  L + ++
Sbjct: 130 NANLHALECGIL-RFGSGPKPRDDPEAVFD---YYRYDALLALKCLALQIRNRQLFDQML 185

Query: 119 AHMSDIDEKQLLLYAQIA----------NLVNLILQW---------PEISINEIAENFSK 159
              S  + ++   + + A          N +N + +          P   +  + +    
Sbjct: 186 QLESHYEARKGTPFYKEADERTVQYLLKNFLNPLKKMEKQQGKTVLPVCDVKTLHKISGI 245

Query: 160 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAE 216
           L  NA  I  S  R +  GLYP+  ++ H C+PN    F   +G     +A +++ KG  
Sbjct: 246 LEVNAMVIPLSNGREI-CGLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKGEH 304

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC---KDDGC 273
           +  +Y  +   T  R+  LK    F+C+C RC      D  +    L   +C     + C
Sbjct: 305 LTTTYTHSLWGTQQRRDHLKANKYFSCSCARCA-----DPTELGTYLSALKCMGIDGNAC 359

Query: 274 SGFL-----LRDSDDKGFTCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQ 324
           +G+      L+++ D  + C QC +    +++     KI  EV+             +  
Sbjct: 360 AGYQLPIDPLKETSD--WKCNQCPITIESDQVNFLLSKIGEEVD-------------DAM 404

Query: 325 EVVSTYKMIEKLQKKL---YHPFSVNLMQTREKLIKILMELEDW 365
           E  S+ K +E L  KL    HP    L+Q +  LI++    + +
Sbjct: 405 ERKSSVKQLEDLISKLLTFLHPNHHFLLQLKHSLIQMYGHFKGY 448


>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
           [Brachypodium distachyon]
 gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
           [Brachypodium distachyon]
          Length = 389

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 106/263 (40%), Gaps = 34/263 (12%)

Query: 6   VSGEVIISQEPYVCVPNNSSSI-SRCDGCF----ASSNLKKCSACQVVWYCGSNCQKLDW 60
           + GEV++S+ P +  P++ +S+ S C  CF     + +   C +C+   +C   C     
Sbjct: 25  LEGEVLLSEPPILLYPSSLASLPSYCSACFRCLPQAPHAAPCPSCRAAAFCSPACAAASH 84

Query: 61  KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH 120
              RL C  LSRL     +S    +  +L  Y    LQ            ++L+    A 
Sbjct: 85  P--RLLCAALSRLAAAP-ESHQEQLLFLLSAY---SLQEPAF--------HALLSLSSAP 130

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGL- 179
                +    L+A +++L    +     S +  A   SK   N+ +I       +  GL 
Sbjct: 131 QGTQQQDAASLHAMVSSLAPPHMLPAGFSPDLTAALLSKDRTNSFSIMEPYRPDVAQGLR 190

Query: 180 -------YPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETA 225
                  Y   S++NH CLPNA    +  R        V+RA+  +  G EV ISY    
Sbjct: 191 KARAYAVYHRASLLNHDCLPNACHFDYPDRPGPGNTDIVLRALHGITAGMEVRISYFAAN 250

Query: 226 GSTMTRQKALKEQYLFTCTCPRC 248
                RQ+ L E Y F C C RC
Sbjct: 251 WRYADRQRRLLEDYGFRCECERC 273


>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
 gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
          Length = 824

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 1/98 (1%)

Query: 152 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV-LVFEGRLAVVRAVQH 210
           EI E ++    N HT+ N   +  G G YP+    NHSC+PN    V +  +   RA + 
Sbjct: 509 EIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFNHSCVPNVTWYVDDNGMLAFRATRT 568

Query: 211 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           + +G E+ ++Y +   +   R+  L +QY F C CP+C
Sbjct: 569 ISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCPQC 606


>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
          Length = 256

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 10/106 (9%)

Query: 149 SINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVV 205
           S++E I E ++K+     T  N+E    G+ LY + S INHSC PNA +VF     +  +
Sbjct: 125 SVDELIDELYNKMDEVVGTFLNNE----GSALYAMQSKINHSCTPNAEIVFPKSNHVLAL 180

Query: 206 RAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
           RA++ +  G E+ ISY++      S  +RQK L+E YLF C C RC
Sbjct: 181 RALRDLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERC 226


>gi|71019315|ref|XP_759888.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
 gi|46099457|gb|EAK84690.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
          Length = 739

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 198
           S  E+ +   + +CN+ T+ +S+L  LG  ++  ++++NH+C+PNA +VF          
Sbjct: 423 SATELLDLVCQFSCNSFTLADSDLNALGVCMHASMAMLNHACIPNAAVVFPFGGAAKGGQ 482

Query: 199 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
                 + ++  + A++ +  G E+LISY++   +   RQK LK++Y F C C  C K
Sbjct: 483 QRWNDGDDKIMQLVALRAIEPGEELLISYVDLCDTVEERQKQLKQRYCFDCRCDLCCK 540



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 15/83 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS---------------SNLKKCSACQVVWYCG 52
           G+V++   P V V + +    RC  C++                  L++CS C+V  YC 
Sbjct: 174 GDVLLRVRPEVAVLSTALLEVRCSACYSPPSSESASASAPTATAGKLQRCSGCKVTRYCS 233

Query: 53  SNCQKLDWKLHRLECQVLSRLDK 75
           + CQK DW  HR EC+ L  + +
Sbjct: 234 AGCQKRDWPAHRDECKALKAMQQ 256


>gi|384498991|gb|EIE89482.1| hypothetical protein RO3G_14193 [Rhizopus delemar RA 99-880]
          Length = 511

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 115/262 (43%), Gaps = 36/262 (13%)

Query: 5   HVSGEVIISQEP-YVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDW-KL 62
           H+   + +  EP  V +P  S    RC+ C + + L+ CS C+  ++CG+ C +  W   
Sbjct: 25  HLEPGITLFIEPSLVSIPLPSKRHQRCNYCLSKAQLQCCSRCRSAYFCGNACFRNAWLHF 84

Query: 63  HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMS 122
           HR+ C+  +    +    V     L+ +  L            T + +Y + +    H++
Sbjct: 85  HRVLCEPQA---TDNYVHVDTDQWLLERAAL------------TLSSHYRMNKQQSPHLA 129

Query: 123 DIDEKQLLLYAQIANLVNLILQWPE----------ISINEIAENFSKLACNAHTICNSE- 171
                 L       NL N   +W E          IS  E+A  + ++      I + E 
Sbjct: 130 ----FALKALKDTPNLCNNPPEWLERVAELLKPQDISTQELAVLYGQIQACIFPIFDFEH 185

Query: 172 -LRPLGTGLYPVISI-INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
            +  +  GLYP+ ++ + HSC PN+ +V++     +  ++ +  G  + I+Y++   +  
Sbjct: 186 HMEQMAVGLYPITALHVKHSCRPNSAVVYKQGKQHLITIETIQPGDPITIAYVDMISNKK 245

Query: 230 TRQKALKEQYL--FTCTCPRCI 249
            R + LK+++   + CTC RCI
Sbjct: 246 QRSEQLKKRFGKDYECTCARCI 267


>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
          Length = 598

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 98/441 (22%), Positives = 180/441 (40%), Gaps = 55/441 (12%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVW--YCGSNCQKLDWKLHR 64
           +GE+I+ +EP V     S    RC  C       +CS  Q  +     +N ++++  L  
Sbjct: 141 AGEIILEEEPLVAA---SWHAHRCIECHEPHRSSECSRVQHKYPPAVAANMEEIERFLEG 197

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           LE   +  LD         S R  +KL           +  +++D   +V       +++
Sbjct: 198 LES--IEELD---------SARRFIKL-----------LSKSSSDPAVMVHLSPCTAANM 235

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           +  +  + A   + V   +  P +S  + A   S L  N+H +        G+GL+ +  
Sbjct: 236 ESCRKAVSAVRKHKVAKAIIPPTVSDEDAARMLSILNTNSHELGEWG----GSGLFLLGC 291

Query: 185 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTC 243
           ++ HSC PN      G    V A+  + KG  + I Y + T   T  R+K LK+ Y F C
Sbjct: 292 LMEHSCAPNCNFSTYGTQLWVTALNSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLC 351

Query: 244 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKI 303
            CPRC  L    D+  + I    +CK   C   L   S    + C++C     K ++ K 
Sbjct: 352 LCPRCTTL---PDVMRAFICPVKQCKGKVCP--LGEGSKPSDWVCEKC-----KSQLDKK 401

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
           + +  +  +K L +     H     T+ ++ +  ++L   ++ +     E+ +      E
Sbjct: 402 SRKRFLEIEKAL-MEEGPEHDVFHETHHIVLEALRELGSVYAKD-----EEAVADGSAEE 455

Query: 364 DW----KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 419
            W    K A     + IP+   V P+FHP   + +    +L     +   A +++  + E
Sbjct: 456 IWLRVLKGADQARSILIPL---VLPKFHPTKAIWWDNVAQLRLSRKNQAGAQEALKRSYE 512

Query: 420 ILRITHGTNSPFMKELILKLE 440
           + R   G  +P  ++ +  +E
Sbjct: 513 VTRRCCGDATPSTQDALRLVE 533


>gi|358388868|gb|EHK26461.1| hypothetical protein TRIVIDRAFT_35412 [Trichoderma virens Gv29-8]
          Length = 535

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 107/246 (43%), Gaps = 17/246 (6%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWK-LHRLE 66
           G VI    P + +P  S   S C  C    N + CS C    YC + CQ   WK +H  E
Sbjct: 28  GAVIFPFTPLLLLPTVSCLSSVCSYCLRPGNPRACSRCHAASYCDAACQAAAWKAVHSRE 87

Query: 67  CQVLSRLDKE---KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA-LVAHMS 122
           C+ L +  K+   +R+  TP+  LM  L           +     D    +E  ++   +
Sbjct: 88  CKALRQGIKDEGRRRQLPTPTRALMQAL-----------LCGEIGDGLKDLEGHVLEKKA 136

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           + DE + +    +A          E  +   AE   K+  N+    + +L  +G  L P 
Sbjct: 137 EGDEWRDIEMMAMAACAFSGKGTAEELVRRAAEMLCKIQNNSFQRFDPDLGVVGLFLEPT 196

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           +++ NHSC+PNA + F GR  ++ A   +  G E+ ++Y       + R++AL   Y F 
Sbjct: 197 LAMANHSCIPNAAVQFIGRNTLLVAESPIRAGDEIELAYTFYTDPLLKRREALA-HYKFV 255

Query: 243 CTCPRC 248
           C C RC
Sbjct: 256 CQCLRC 261


>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
          Length = 776

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 109/514 (21%), Positives = 194/514 (37%), Gaps = 118/514 (22%)

Query: 8   GEVIISQEPY--VCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           G+V+ S+EPY  V +P++ +    C  C    N        V  YCG  C       HR 
Sbjct: 262 GDVLFSEEPYAWVALPSDDAICEMC--CDTDIN-------PVPVYCGCECASRAISFHRW 312

Query: 66  ECQVLSRLDKEKRKSVTPSI-------RLML---------------------KLYLRRKL 97
           EC          + S+ P+I       R++L                     +L+    L
Sbjct: 313 ECV-------GAQCSLFPTIGIAHLALRVLLISTNNGFPPSPVSLPQACTAGELFRSYGL 365

Query: 98  QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL-------------- 143
            ++  I  T TD +  +  LV + + +D    + YA  A ++ L L              
Sbjct: 366 VDNIQIYKTGTDPFYRMFNLVTNFNKMDNTDYIQYALTATMLTLYLENFTSFFDYLPSKM 425

Query: 144 -------QWPEISINEIAENFSKLACNAH-----------------TICNSELRPLGTGL 179
                  Q    +   I  +  +L CN H                 TI   E+R   T +
Sbjct: 426 PCSMSESQLKLFAAAVILRSMGQLVCNGHATLSLAVVEEDDGRNGKTITEKEVR-RATAI 484

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKE 237
           YP  +++NHSC PN +  F     +VR  + +P G EV   Y           R+KALK 
Sbjct: 485 YPSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYGPHRARAPAAARRKALKA 544

Query: 238 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK 297
           QY+FTC C  C    + D +   ++   Y C+   C G +          C QC      
Sbjct: 545 QYMFTCHCADCNDTERKDFV---SLFSAYLCQ--SCKGPVWAHCVRP--LCTQCRSALHL 597

Query: 298 EEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 356
           E    +    + L+ +   + S     +++ ++Y    +L+++++H    +L    ++L 
Sbjct: 598 ERAHTLLDRADDLATQAEQVVSLEERCEKMAASY----RLKQQVWHRHHASLRMAADRLA 653

Query: 357 KILMELEDWKEALAYCQLTIPVYQRVYPQF-----HPLLGLQYYTCGKL----------E 401
           ++  +  D+ +++   +  I   +  +  F     H L  L      ++          E
Sbjct: 654 RLYADTGDFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSDVMLERILNSPQHLEYRE 713

Query: 402 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
           W L     A K + +A++++ + +G+  P +  L
Sbjct: 714 WCL----EAHKVVKKAIQLMELNYGSWEPLVSRL 743


>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 514

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 100/463 (21%), Positives = 188/463 (40%), Gaps = 73/463 (15%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDWK 61
           GE I+++ P+V  P  + +   C  C+A      +    CS C   W  C + C+  +  
Sbjct: 41  GEEIVTEMPFVVGPK-AFTYPLCLSCYAPWPPSLTEKPLCSRCG--WPVCSTECE--NQP 95

Query: 62  LHR-LECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY 111
            H+  ECQ+  + ++         E  ++  P +  +  L L  + + +   P    +  
Sbjct: 96  QHKDYECQIFVQANEKFNVQAALEEANENGVPQLECITPLRLLLESEKN---PERWNNE- 151

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
             V+ + AH + I  ++    +   N+V  +    ++ ++  +E   + AC    I   E
Sbjct: 152 --VKNMEAH-NKIRSQKAHWKSDQVNVVEYLRN--QLKLDRFSEELIQTACGILEINTFE 206

Query: 172 LR-PLGTG---LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
           +R   G G   LYP ++++NHSC+ N   ++   + R+ + R   HVP   E+  SY  +
Sbjct: 207 VRTSRGYGARALYPTVALMNHSCVSNTCHSISPTDYRIRL-RTTVHVPPSGELYGSYTHS 265

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSD 282
              TM R++ L E   F C C RC      D  +    +   +C   D+G    L     
Sbjct: 266 LYPTMLRREHLLEGKHFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDP 320

Query: 283 DKGFTCQQCGLVRSKEEIKK----IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 338
           D  + C  C    S   ++K    I +EV+ + +      S  +  + +   + + K  +
Sbjct: 321 DSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQ-----ISGADGADAIQERETVMKKYR 375

Query: 339 KLYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHP 388
            + HP    L   R  L ++   ++++          +  +  C+  + V   + P +  
Sbjct: 376 SVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKIDMCRFLLQVLDVIEPGYSR 435

Query: 389 LLGLQYYTCGKLEWFLGDT--------ENAIKS-MTEAVEILR 422
           + G+  Y       F+  T        E A+KS M EA  IL+
Sbjct: 436 IRGMTLYELHAPLLFVAKTLWNAGVIDEAALKSKMIEASNILK 478


>gi|195377589|ref|XP_002047571.1| GJ13517 [Drosophila virilis]
 gi|194154729|gb|EDW69913.1| GJ13517 [Drosophila virilis]
          Length = 528

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 89/339 (26%), Positives = 141/339 (41%), Gaps = 55/339 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF--------ASSNLKKCSACQVVW-YCGSNCQKL 58
           GE I+ + P V  P   +S+  C GC          + N  KCSAC   W  C + C++ 
Sbjct: 68  GEQIMREAPLVVGPK-VASVPLCLGCHKNLLPPAKPAQNYYKCSAC--TWPLCSAECEQS 124

Query: 59  DWKL--------HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
            + L           EC++  +  +++ K     + ++L+  ++ KL++    P+     
Sbjct: 125 PYHLAECRLMAGSNFECKINYKAAEQEHKESAYCVIMLLRC-VQLKLES----PAAFARL 179

Query: 111 YSLVEALVAHMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 168
           Y L +    H+ +  E  L  +L A +   +  +L     S  EI    + L  NA    
Sbjct: 180 YELED----HLKERLETPLYQVLRANLITFIKTVLGLNNCSELEILRIAAILDTNAF--- 232

Query: 169 NSELRPLG-----TGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYI 222
             ELR LG      GLYP  ++  H C+PN    F+  + +V  A + + KG  + ISY 
Sbjct: 233 --ELRQLGGCVKVRGLYPGAAMFAHDCVPNMRHRFDDDMNIVFLAKRPIAKGEVLSISYT 290

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG-YRCKDDGCSGFLLRDS 281
           +   ST+ R+  LK+   F C C RC      +D  E     G + C        +  D 
Sbjct: 291 QPLRSTIQRRVHLKQVKCFDCACARC------EDPTELGTFAGAHVCGKCKVGKIISMDP 344

Query: 282 DD--KGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 314
                 + C+ C L RS +E      KI  E+  L K T
Sbjct: 345 LQLAANWKCEVCNLKRSAKEFLTQDAKIEQELESLDKTT 383


>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 428

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 95/409 (23%), Positives = 163/409 (39%), Gaps = 68/409 (16%)

Query: 30  CDGCFAS--SNLKKCSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTPSIR 86
           C+ CF     N K    C    YC   CQK  + + H+ EC ++  L     + +   I 
Sbjct: 47  CEYCFKELKENSKISCGCGFN-YCSMECQKKAYDEYHKNECSIIKSL-----REIPDGIG 100

Query: 87  LMLKLYL--RRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ 144
            +L LY    + ++ ++       +NY  VE  V           LL+   ++L+  I  
Sbjct: 101 EILLLYRCSIKTVKWEDYCSLPNINNYKAVEYCV-----------LLFKSNSSLLKTI-- 147

Query: 145 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
           W  I              N+  + N E + +G GL+   S INHSC PN V +   R   
Sbjct: 148 WEVIQ------------ANSFCLTNGEEQVIGIGLFDYASFINHSCCPNCVPLQNKREMS 195

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 264
           ++++  +  G E+ ISYI+   S   R+K L+ ++ F+C C  C         +E     
Sbjct: 196 IKSLTSIKSGEEIFISYIDITESFERREKELR-KWHFSCGCSLC---------EEDKKRY 245

Query: 265 GYR--CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
            Y   C+   C G L  D   +   C+ C  V  + + K   +   I   + L      N
Sbjct: 246 RYHLLCE---CGGSLFVDEIGQFAKCKTCKKVFEEGDDKYEDALYQINEYRNLE-----N 297

Query: 323 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK-----------LIKILMELEDWKEALAY 371
             + +ST K  E++ +KL    SV      EK           +    ++L+ + EA   
Sbjct: 298 DIDKLSTNKQYEQVIEKLNENVSVITSLIDEKDVREHYKFNSIMFHSFIQLKKYNEAAQV 357

Query: 372 CQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
            +  + +   ++P  +    ++++   K+    GD EN  +   E V +
Sbjct: 358 GKKLMKMVDTIFPLLYITGVVEHFHLAKVFRMSGD-ENLFQEEIEKVRL 405


>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
 gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 413

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 73/341 (21%), Positives = 125/341 (36%), Gaps = 74/341 (21%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF------ASSNLKKCSACQVVWYCGSNCQKLDWK 61
           GE I+  + Y  V       + C  C       +   + +C  C  VWYC   C++ +  
Sbjct: 29  GESILKCKSYFAVTCEDFKKNSCYNCIKLIKSPSPQQVPRCFGCNEVWYCSEKCKQDNQA 88

Query: 62  LHR-LECQVL-----------SRLDKEKRKSVTPSIRLMLKLY----LRRKLQNDNVIPS 105
            H+  EC              S+LD +    +   + L+ + Y    L  K  +  +I +
Sbjct: 89  KHQHYECAFFNNIKSPKLIQNSKLDFDSYSEIRIILGLLSRYYQDKLLNNKFNSSIIINN 148

Query: 106 TTTDNYSLVEALVAHMSDIDEKQL------LLYAQIANLVNLILQWPEISINE------- 152
              D    ++  +  + D+ E  +      +    I N++  I+   +++IN        
Sbjct: 149 QQDDEEDFIKDTLDGVLDLVENDINEETNSVAKEYIDNIIEYIINILKLTINNNSNDNNN 208

Query: 153 ---------------------------------IAENFSKLACNAHTICNSELRPLGTGL 179
                                            I     K+ CN   I     + +G  +
Sbjct: 209 NNNNNNNNNNNNNNNNNNNNNNNNNNIEELIKLIRPLIQKVRCNQFGIWTKNDKCIGMAV 268

Query: 180 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 239
            P  S  NHSC+PN   V +G     +++  + KG ++ ISY+    ST  R+  LK  Y
Sbjct: 269 SPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQINISYLALDKSTKRRRDYLKFGY 328

Query: 240 LFTCTCPRCIK-----LGQFDDIQESAILEGYRCKDDGCSG 275
            F C CPRC        G+ +D  ++ I + Y C    C+G
Sbjct: 329 YFHCQCPRCNSTDIDPTGKLEDSLDNWISKFY-CHQKKCTG 368


>gi|302838330|ref|XP_002950723.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
 gi|300263840|gb|EFJ48038.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
           nagariensis]
          Length = 610

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKAL 235
           GL+P  + +NH+C PN V    G   VVRAVQ +P+G E+L+SY+  +       RQ AL
Sbjct: 246 GLWPYFNQLNHACAPNCVHYVVGSTMVVRAVQVIPEGCELLVSYLGRDDLAPRQVRQAAL 305

Query: 236 KEQYLFTCTCPRC 248
           K +Y F C C RC
Sbjct: 306 KARYGFICNCTRC 318


>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 236

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 61/105 (58%), Gaps = 14/105 (13%)

Query: 150 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVR 206
           I+ + ENF K   +     N+E    G+GLY + S INHSC PNA + F     +LAVV 
Sbjct: 122 IDAVYENFEKGVGD---FLNNE----GSGLYELQSTINHSCSPNAEVTFPHSNYQLAVV- 173

Query: 207 AVQHVPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
           A  ++  G E+ ISY++    + S  +RQK L+E YLFTC C +C
Sbjct: 174 ATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218


>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 513

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAG 226
           C + LR    GLYP+ ++ NH C+PN    F+ +  + V A   +  G E+ +SY +   
Sbjct: 213 CTTSLR----GLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLPIAAGEEITMSYTDLLW 268

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
            T +R++ LK    F+C C RC    +F   Q  A+L    C  D CSG LL  S+   F
Sbjct: 269 DTSSRRQFLKATKHFSCNCSRCSDPSEFGS-QLGALL----CAKDDCSGHLL-PSNPLNF 322

Query: 287 ----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
                C +C +  + ++I+ I S++N L    + +T      E+V   K IE    KL  
Sbjct: 323 ISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTP----HEIV---KFIETALSKLVP 375

Query: 343 PFSVNLMQTREKLI 356
             +  L+  + ++I
Sbjct: 376 TSNFILIDVKFRII 389


>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
          Length = 324

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 50/93 (53%)

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
           +KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + KG ++
Sbjct: 11  AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 70

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
            ISYI+       R+  L+E+Y F C C +C K
Sbjct: 71  FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 103


>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
          Length = 510

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 95/465 (20%), Positives = 183/465 (39%), Gaps = 76/465 (16%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE ++++ P+V  P  + +   C  C+        + + CS C   W  CG  C+    
Sbjct: 37  AGEELVTEMPFVVGPK-ACTYPLCLSCYTPWPPEPDDKQLCSTCG--WPVCGQECENA-L 92

Query: 61  KLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN-DNVIPSTT--------TDNY 111
           +    ECQV ++ ++              K  +   LQN +N +P            ++ 
Sbjct: 93  QHKDYECQVFAQANE--------------KFNIDAALQNNENGVPQLECITPLRLLLESE 138

Query: 112 SLVEALVAHMSDIDE------KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 165
             VE     + D++       ++    +   N+V+ + +   + ++  +E   +  C   
Sbjct: 139 RNVERWNKEIKDMEPHNKIRCQKAQWKSDHVNIVDYLRK--RLKLDRFSEEHIQTVCGIL 196

Query: 166 TICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVL 218
            I   E+R        GLYP ++++NHSC+ N   ++   + R+ + R    +P G E+ 
Sbjct: 197 EINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRL-RTTLKIPAGGELY 255

Query: 219 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGF 276
            SY  +   TM R++ L E   F C CPRC      D  +    +   +C   D+G    
Sbjct: 256 ASYTHSLLPTMLRREHLLEGKHFACACPRC-----SDPTELGTHMSSLKCNKCDNGIVLS 310

Query: 277 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 336
           L     +  + C  C    + + ++K+   +        A+ S  +  + +   + I K 
Sbjct: 311 LDSLDSESKWKCTHCDFSTTGQAVRKVFRIIQAEVDAAEAI-SGADGADAIHERETIMKK 369

Query: 337 QKKLYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQF 386
            + + HP    L   R  L ++   ++++          +  +  C+  + V   V P +
Sbjct: 370 YRSVLHPQHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKVDLCRSLLQVLNVVEPGY 429

Query: 387 HPLLGLQYYT-------CGKLEWFLGD-TENAIKS-MTEAVEILR 422
             + G+  Y          K +W  G   E  +KS M EA  IL+
Sbjct: 430 SRVRGMTLYELHAPLLFLAKGQWNAGVIDETGLKSKMIEAANILK 474


>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 323

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 212
           +A    KL  N+    N     +G  L+P  ++INHSC  NA + F+     ++A + + 
Sbjct: 5   VANVLIKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQ 64

Query: 213 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           KG ++ ISYI+       R+  L+E+Y F C C +C K
Sbjct: 65  KGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAK 102


>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 631

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 77/316 (24%), Positives = 126/316 (39%), Gaps = 69/316 (21%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-----CSACQVVWYCGSNCQKLDWK- 61
           G V+I  +P+    +  +    C  C A+  L+      C  CQ V +C   C+K  W+ 
Sbjct: 202 GAVLIVDQPFSFSTDGPALSRNCLYCHATLKLEDSVKIPCRNCQTVSFCTETCRKKAWEA 261

Query: 62  LHRLECQVLSRL---DKEKRKSVTPSIRLML-------KLYLRRKLQ-----NDNVIPST 106
            H+ EC V           R+  T S  L+         L LR  ++     N + +   
Sbjct: 262 YHQYECSVFDHFFENSSNDRRQQTASYLLLAYRTTVLQALSLRDNVETKCVLNPDFLRYH 321

Query: 107 TTD----------------------NYSLVEALVAHMSDIDEK-QLLLYAQIANLVNLIL 143
            +D                      +Y  V  L  H +D++    L+   Q   L   +L
Sbjct: 322 ASDKGFDDIDKEYTKSKSRRIYNPLDYRTVFQLETHCTDVEPNVNLIRTIQAIFLAKCLL 381

Query: 144 Q-WPEISINE-IAENFSKLA-----------CNAHTICNS---------ELRPLGTGLYP 181
               ++ + + + E F  LA           CNA+ I  +         E R +G  +Y 
Sbjct: 382 SVLSKLDVEDCMKEIFVPLAVAMLHHLQAINCNAYEIVENVHEEVTRVWEPRNIGGAIYT 441

Query: 182 VISIINHSCLPNAVL-VFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQ 238
            +S++NHSC PN V   +   + VVRA++ + KG E+L  Y     + S M R++ L ++
Sbjct: 442 TVSLVNHSCYPNVVRHSYPSGIVVVRALRFIGKGCEILDCYGPQFLSESRMARRELLWKK 501

Query: 239 YLFTCTCPRCIKLGQF 254
           Y F C C  C +  +F
Sbjct: 502 YRFLCECDACTQDWKF 517


>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
 gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
          Length = 517

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 115/290 (39%), Gaps = 19/290 (6%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSAC-QVVW-YCGSNCQKLDWKLHRLE 66
           E+I+ + P V  P   S+   C GC  S   +    C Q  W  CG  C+ L+   H+ E
Sbjct: 67  EIILREAPLVRGPAQISA-PVCMGCLNSIEAEDHIDCEQCGWPLCGPECKSLE--EHQAE 123

Query: 67  CQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C++   R  K           L   +   R L      P      +  +E+L +     +
Sbjct: 124 CKLTKDRGQKVNVNEFNGPHPLYTCVSTVRCLLIGETSPEKAA-KFQELESLESTRRGSN 182

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +  L A I   +    +  + +  +I      L  N H I  S+  P   G++   S 
Sbjct: 183 QWKADL-ASIGQFIPKFFKTQKFTEEQIMRAVGALQINGHEIPTSD--PPHVGVFYTASF 239

Query: 186 INHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
             +SCLPN    F +    ++ A Q + K A + I Y +    T  RQ+ L +  LF C 
Sbjct: 240 TENSCLPNLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCA 299

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQC 291
           C RC  + + D           +C+D  C G +L    D+    + C++C
Sbjct: 300 CARCADVTELD-----TFYSALKCEDRQCGGLMLPSKSDEWNGSWRCREC 344


>gi|307182513|gb|EFN69720.1| Histone-lysine N-methyltransferase ASHR1 [Camponotus floridanus]
          Length = 132

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 55/100 (55%)

Query: 335 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 394
           K QK + H F++  ++T E      M L+ W++A  Y +  IP Y   Y + HPL+GL Y
Sbjct: 12  KKQKNILHRFNIQHIRTLEAAHIAAMNLKYWEDAELYGKELIPGYLLYYGEIHPLIGLLY 71

Query: 395 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 434
              GK++  L   + A+K + +A E+L ITHG     M+E
Sbjct: 72  LMTGKIQLHLNKPKEALKVLEKASEVLMITHGDKHSLMRE 111


>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
 gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
          Length = 393

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 9/110 (8%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 210
           I + ++K+   A    N+E    G+GLY + S INHSC+PNA     +   + V++A+  
Sbjct: 267 IDQLYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKALAP 322

Query: 211 VPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
           + KG E+ ISY++      S  +R K L+E Y+F C CP+C       DI
Sbjct: 323 IQKGDEICISYLDDCQLERSRHSRHKTLRENYIFVCQCPKCRAQASDPDI 372


>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
           angustifolius]
          Length = 357

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 116/302 (38%), Gaps = 42/302 (13%)

Query: 30  CDGCFAS--SNLKKCSACQVVWYCGSNCQKLDWKLHR--LECQVLSRLDKEKRKSVTPSI 85
           CD CF +  +N   C +C    +C   C    +      L C  LS L   +     P  
Sbjct: 60  CDHCFRTLPTNTIPCPSCSYHHFCSHKCFSTAFNTFHSSLVCHALSHLKDSESLQQQPYE 119

Query: 86  RLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW 145
           R +   ++        + PS     ++ +         I E    L++ I+ L    +  
Sbjct: 120 RQVQARFVVAAYNLAIISPSGI---HAFLSLHGTPDDTIVEAAKFLHSLISPLFPPSVN- 175

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGT-------GLYPVISIINHSCLPNAV--- 195
             IS++  A+  +K   N+  + N    P G         +YP  S+ NH C+PNA    
Sbjct: 176 --ISVDLTAQLLAKDRINSFCLMN-PYSPDGPQRSIKAYAIYPKASMFNHDCIPNACRFD 232

Query: 196 LVFEGRL--------AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 247
            V    L         V+R +Q +P+G EV ISY   +    TR++ L + Y F+C C R
Sbjct: 233 YVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISRDYCTRKRILMDDYGFSCECDR 292

Query: 248 CI-----------KLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQQCGLV 294
           C             + ++ D+     +  Y C    C+G L    D+      C  CG +
Sbjct: 293 CKIEANWPHDCQNYVEEYSDLPHVRFIAKYVCDRKNCNGTLAPKDDAHTNVLECNFCGNL 352

Query: 295 RS 296
           +S
Sbjct: 353 KS 354


>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
          Length = 513

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 98/462 (21%), Positives = 186/462 (40%), Gaps = 69/462 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-----SNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE II++ P++  P  +++   C  C+ S     +    CS C   W  C + C+  + 
Sbjct: 39  AGEEIITEMPFIVGPK-AATYPLCLSCYVSWPPTLNEKPLCSKCG--WPVCSTECE--NQ 93

Query: 61  KLHR-LECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN 110
             H+  ECQ+ ++ ++         E  ++  P +  +  L L  + + +   P    + 
Sbjct: 94  PQHKDYECQIFAQANEKFNVQAALEETNENGVPQLECITPLRLLLESEKN---PERWKNE 150

Query: 111 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 170
              V+ + AH     +K+     QI N+V  I +  ++ +   +E   +  C    I   
Sbjct: 151 ---VKDMEAHNKIRSQKKQWKSDQI-NIVEYIRK--QLKLERFSEEQIQTVCGILEINTF 204

Query: 171 ELRPL----GTGLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           E+R         +YP ++++NHSC+ N     +   ++ RL   R    +P G E+  SY
Sbjct: 205 EVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPVGGELYGSY 261

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLR 279
             +   TM R++ L E   F C C RC      D  +    +   +C   D+G    L  
Sbjct: 262 THSLLPTMLRREHLLEGKNFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDS 316

Query: 280 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 339
              +  + C  C    S   ++K+   + +      A+ S  +  + +   + + K  + 
Sbjct: 317 LDPESSWKCTHCEFSTSGSAVRKVLQIIQMEVDAVEAI-SGADGADAIQERETVAKKYRS 375

Query: 340 LYHPFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPL 389
           + HP    L   R  L ++   +ED+          +  +  C+L + V   + P +  +
Sbjct: 376 VLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCRLLLQVLDVIEPGYSRI 435

Query: 390 LGLQYYTCGKLEWFLGDTE--------NAIKS-MTEAVEILR 422
            G+  Y       F   T+         A+KS M EA +IL+
Sbjct: 436 RGMTLYELHAPLLFTAKTQWNAEVIDKAALKSKMIEASKILK 477


>gi|343424810|emb|CBQ68348.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 855

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 64/116 (55%), Gaps = 16/116 (13%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 198
           S  E+ +   + +CN+ T+ +S+L PLG  ++  ++++NH+C PNA +VF          
Sbjct: 415 SATELLDLVCQFSCNSFTLTDSDLNPLGVCMHASMAMLNHACTPNAAVVFPFGGAAKSGQ 474

Query: 199 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
                 E ++  + A++ +  G E+LISY++ A +   R+  LK++Y F C C  C
Sbjct: 475 QKWRDGEDKIMQLVALRAIEPGEELLISYVDIADTFEERRSYLKKRYCFDCRCELC 530



 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 8/71 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA--------SSNLKKCSACQVVWYCGSNCQKLD 59
           G+V++   P V V + +    RC  C++        +  L++CSAC+++ YC + CQK D
Sbjct: 176 GDVLLRVRPEVAVLSTALLDQRCSACYSPPIISDSGAGKLQRCSACKLIRYCSAACQKRD 235

Query: 60  WKLHRLECQVL 70
           W  HR EC+ L
Sbjct: 236 WPAHRDECKAL 246


>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
 gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
 gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
          Length = 420

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 310 GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAGEEICISYLDCCQRERSRHS 369

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 370 RNKILRENYLFTCSCPKC--LAQADD 393


>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 7/112 (6%)

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 226
           N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G  + +SY    G  
Sbjct: 452 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 511

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               RQK LK++Y F C C  C +L   D      +L  +RC +  C G +L
Sbjct: 512 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVL 558


>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 493

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)

Query: 148 ISINEIAENFSKLACNAHT---ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
           +S   I +++   A NA T   +    L P   GL+P+ ++INHSC+ NAV V    + V
Sbjct: 188 LSYTNIEQSWRTTALNAPTPLDLSRLSLPPRLLGLFPLAAMINHSCVGNAVRVLVDDVMV 247

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           V A   +P G E++ SY         R   L+  Y F C CPRC
Sbjct: 248 VHATTDLPAGTELVWSYGPPTTPFAQRNTRLRAHYGFVCDCPRC 291


>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
           antarctica T-34]
          Length = 882

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 65/119 (54%), Gaps = 17/119 (14%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 198
           S  E+ +   + + N+ T+ +S+L PLG  ++  ++++NHSC PNA +VF          
Sbjct: 463 SATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAAVVFPFGGAAKSGQ 522

Query: 199 -------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
                  + ++  + A++ +  G E+LISY++ A +   R+ ALK++Y F C C  C K
Sbjct: 523 QNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKRYCFECRCELCRK 581



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 13/76 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFA-------------SSNLKKCSACQVVWYCGSN 54
           G+V++   P V V + +    RC  C++             ++ L++CS C+V+ YC + 
Sbjct: 196 GDVLLRLRPEVAVLSTALLEQRCSACYSPPSPADASNATAGANKLQRCSGCKVLRYCSAA 255

Query: 55  CQKLDWKLHRLECQVL 70
           CQ+ DW  HR EC+ L
Sbjct: 256 CQRRDWPSHRDECKAL 271


>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
          Length = 816

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 148/359 (41%), Gaps = 77/359 (21%)

Query: 137 NLVNLILQWPEISINEIA--------------ENFSKLACNAHTICNSELRPLGTGLYPV 182
           +L  L+L   +I +N IA              + FS  + +   +C+ E   +   +Y  
Sbjct: 445 SLSQLVLLICQIKVNSIAIVHMKSMDGGKVLTKGFSGFSGD--VMCSVEQVRVAQAIYMS 502

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYL 240
            S  NHSC PN    F  R  ++R+ +++  G+ + +SY    G      RQK+L+E Y 
Sbjct: 503 GSFFNHSCQPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYY 562

Query: 241 FTCT------------------CPRCIKLGQF---------DDIQESAILEGYRC----- 268
           F+C                   CP+   LG           ++    +I E + C     
Sbjct: 563 FSCGCSSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRRKENFVHVSIGESHVCTLSLP 622

Query: 269 ------KDDGCSGFLLRDSD-----DKGF--TCQ-QCGLVRSKEEIKKIASEVNILSKKT 314
                 +D    G L  +SD     D GF  +C+ Q  L  +     +  S++N L++  
Sbjct: 623 DVSKFDEDIVKVGKLFFNSDTMLNIDPGFCMSCRSQLDLSSAVAMSDRATSKINRLNE-- 680

Query: 315 LALTSCGNHQEVVSTYKMIEKLQ-----KKLYHPFSVNLMQTREKLIKILMELEDWKEAL 369
             L S  N  EV+    + E LQ     +KL HP+S  L Q  + + +   ++ D ++A 
Sbjct: 681 --LPSPDNVPEVL----IAEALQSLEHIEKLRHPYSKALAQAHDTIAEAFAKVGDQEQAR 734

Query: 370 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 428
            +C+ +  + +++Y   H ++  +      +E  +GD  +A  +   A  I  + +G +
Sbjct: 735 KHCEASFKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 793


>gi|328724231|ref|XP_003248075.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 624

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 129/294 (43%), Gaps = 67/294 (22%)

Query: 8   GEVIISQEPYVCVP-NNSSSISRCDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWK-LHR 64
           G+V+   EPY+C P +    +   +GC    S+L  C  C +V+YC  +C   D+K  H 
Sbjct: 255 GDVVAIDEPYICGPLSEFREVCHYNGCMKIHSSLIHCPKCLLVFYCNKDCMNKDYKDGHN 314

Query: 65  LECQVLSRLDKEKRKSVTP----SIRLMLKLYLRRKLQN-----DNV------------- 102
           L C ++  +  + R  +T     +I+  L+ Y +  L+      DN              
Sbjct: 315 LLCPIMYHI--KSRPGITKINELAIKWFLRAYSKMGLKKYCAIVDNFSESKIDPIKRGFD 372

Query: 103 -IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIA-NLVNLIL----QWPEISINEIAE- 155
            I    +D++    +L   ++++    L  +  IA ++++ ++    + PE  I  I   
Sbjct: 373 EIGQYKSDDFLTAYSLDIIVNEMSIDVLFFFNCIAVDMLHYLMLSGFKIPECYIGIIGTS 432

Query: 156 ----------NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG----R 201
                     N+SKL  +  +I           LYP IS+ NHSC  +A +   G    R
Sbjct: 433 LVRILTILDLNYSKLKLDDLSI--------SLTLYPTISLFNHSC--DANISPSGNILDR 482

Query: 202 LAVVRAVQHVPKGAEVLISYIETAGSTMTR-QKALKEQYL----FTCTCPRCIK 250
           + V++AVQ +PKG ++  +Y    G   T   K L++Q+     F C C  CIK
Sbjct: 483 IRVMKAVQPIPKGTQLCCNY----GIKFTAFNKELRQQFCNLLRFNCYCEPCIK 532


>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 519

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/311 (22%), Positives = 121/311 (38%), Gaps = 32/311 (10%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFA---SSNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +GEV+I ++  +       ++  C GC+    S   K C  C   W   ++C       H
Sbjct: 70  AGEVVIREDDALVQAPQQETVPICLGCYKPLESDTAKPCELCG--WPLCASCTS-----H 122

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
             EC    R  ++ + ++T    +    Y    +     +  T  + Y     L +   +
Sbjct: 123 GAECDFTKR-HRDTKVTIT-GFGIAHPTYKCIGVVRALALRETNPEAYERFTKLAS--GE 178

Query: 124 IDEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
           ++E +     +IA  +     +  + S +EIA     +  N H +  SE  P+   +Y  
Sbjct: 179 LEEPR-----EIARFIKRFFNKLHDFSEDEIARAAGIMQINGHEVPISE--PVLIAVYDE 231

Query: 183 ISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            S + HSC  N    F     +V RA   + KG  ++I Y +    T  R+  L     F
Sbjct: 232 SSYVEHSCRANCSKSFTSSGGIVIRAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFF 291

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQCGLVRSK 297
            CTCPRC    +F      ++     C    CSG +L  S    +   + C +C   RS 
Sbjct: 292 ECTCPRCSDPAEF-----GSMFNAIECSKSDCSGCMLPKSFTVQEIPDYRCNKCDNSRSS 346

Query: 298 EEIKKIASEVN 308
             + ++  ++ 
Sbjct: 347 NSVDEMLEQIG 357


>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
          Length = 429

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            L+P+ +++NH C+PN+   FE +    V+RA   +P+G E+  +Y +     + R   L
Sbjct: 208 ALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHLFL 267

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 276
           K +  FTC C RC      D  ++ A +    C+D  CSG 
Sbjct: 268 KMKKNFTCKCSRC-----SDPTEKGAFISAVYCRDTNCSGL 303


>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
 gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
          Length = 729

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 10/106 (9%)

Query: 148 ISINEIAENFSKLACNAHTICNSELRPLGT--GLYPVISIINHSCLPNAVLVFEGRLAVV 205
           + +N +AE        A  + NSE  PL +  G++P +++INHSC PNAV V      VV
Sbjct: 589 LQLNLVAEETQD---GAAAVANSE--PLTSRMGVWPEVALINHSCGPNAVAVMLYDRLVV 643

Query: 206 RAVQHVPKGAEVLISYIETAGS---TMTRQKALKEQYLFTCTCPRC 248
           RA + +P+G EVL++++ + G+      R+  L + Y F C C RC
Sbjct: 644 RATRRIPRGREVLLNWLGSQGALAPARERRAQLSQMYDFACRCARC 689


>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
          Length = 468

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 137/331 (41%), Gaps = 48/331 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK----CSACQVVW-YCGSNCQKLDWK 61
           SGEVI    P +  P  +S    C GC     L +    CS C   W  CG  C+     
Sbjct: 31  SGEVIAESLPLIVGPKMASP-PLCLGCHKKLALAESRYDCSKC--FWPLCGQACESS--P 85

Query: 62  LHRLECQVLSRL-------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLV 114
           LH  EC++ ++        +   +++V   I  +  L L++           + + ++L+
Sbjct: 86  LHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRALLLKK----------NSPERFNLL 135

Query: 115 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSEL 172
               +H+ D ++ Q+    +  +LV    Q  ++  NE  I    S    N   + +++ 
Sbjct: 136 LDFQSHLQDHEKTQVYQVLK-KSLVPFFTQLLKLDTNESEILTICSIFDTNCFEVRDTQR 194

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTM 229
                GLYP IS ++HSC  N    F G   RL V+ A   + KG  V  +Y +T   T+
Sbjct: 195 LVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRL-VLTATTPIKKGDLVTTTYTQTLWGTL 253

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD----DKG 285
           +R+  LK    F C C RC    +F        L    C   G  G  +  SD    +  
Sbjct: 254 SRRSHLKMAKHFDCLCERCTDPTEF-----GTYLSAVNCSVCG-DGSKVTSSDPLNCEAD 307

Query: 286 FTCQQCGLVRSKEEI----KKIASEVNILSK 312
           + C+ CG   +  ++    + +  E+N L+K
Sbjct: 308 WFCENCGFAITGSDMLWGNEVLQKEINHLNK 338


>gi|432896582|ref|XP_004076331.1| PREDICTED: SET and MYND domain-containing protein 4-like [Oryzias
           latipes]
          Length = 770

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 117/564 (20%), Positives = 196/564 (34%), Gaps = 145/564 (25%)

Query: 8   GEVIISQEPY--VCVPNN-------------SSSISRCDGCFASSNLK-KCSACQVVWYC 51
           G+VI++  PY  V +P                +   RC  C A +     C  C    YC
Sbjct: 207 GDVILTDRPYSFVLIPEMEEMREKDARADMFGTQYLRCHQCLAKTLCSVPCEGCSYSRYC 266

Query: 52  GSNCQKLDWKLHR-LECQ---------VLSRLD-----KEKRKSVTPSIRLMLKLYLRRK 96
            + CQ+  W+ H   EC          V++ L      K   K++  + +L+ + +   K
Sbjct: 267 SARCQREAWEEHHSWECSLGADLMAMGVMAHLALRITLKAGLKTIQEARQLVTETHEEAK 326

Query: 97  L--------------QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLI 142
                          + D+   S   D+Y+ V  L+ H S        LYA  A  +++ 
Sbjct: 327 SNGLISSPCNPNTCSKGDDPFASHYGDSYTSVFHLLHHTSHQSTAVRFLYAVTAATLSMK 386

Query: 143 L--------------------QWP----------EISINE------IAENFSKLACNAHT 166
           L                    +WP          E S  +      I  +  +L CNA  
Sbjct: 387 LSQAGFFLASRNINKHSSANSRWPCGPKKEEGDAERSSEQWLLGSAILRHLLQLRCNAQA 446

Query: 167 IC--------NSELRP-----LGTGLYPVISIINHSCLPNAVLVF--------------- 198
           I         NS ++      + T ++P +S++NHSC PN  LVF               
Sbjct: 447 ITVLQDSEMENSRVQSRQEIRIATAMFPSLSLLNHSCSPNTTLVFSTVTADAGCADGPAD 506

Query: 199 ------------EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCT 244
                        G    VRA + +  G EVL  Y   +   +   RQ+ L+EQY F C 
Sbjct: 507 MRQHLPEERQETRGVSVTVRAAKVISLGQEVLHCYGPHSSRMAAEERQRLLQEQYYFLCQ 566

Query: 245 CPRCIKLGQFDDIQESAI--------LEGYRCKDDGCSGFLLRDSDDKGFTC--QQCGLV 294
           C  CI   +  +     +        L  ++C        +L+ + +  F C    CG  
Sbjct: 567 CEACINPEKDGEATHPGVKGSPSEAELLCFKCNT------VLKKNHEGRFCCSLSTCGHQ 620

Query: 295 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE---KLQKKLYHPFSVNLMQT 351
            +  ++     E+ I+ +K + L       E +   K IE   +L     HP    L   
Sbjct: 621 TTSAKVSSKLQEIRIVLEKAVDLMEKDRPAEALQLLKKIESHSELMLAQTHPLQGELADA 680

Query: 352 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 411
             +    + E   W  A ++ + +       Y +    LG Q +   +L +  G    A+
Sbjct: 681 TARAYATMGE---WNNAASHLERSAVATCSQYGEDSIELGQQLFKLVQLHFNGGARSQAL 737

Query: 412 KSMTEAVEILRITHGTNSPFMKEL 435
             + +   +  +  G + P + EL
Sbjct: 738 AVIPKVRRLFSLHCGPHCPELLEL 761


>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 501

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 13/136 (9%)

Query: 178 GLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            L+PV ++ NH C+PN   ++  EG L V  AV  + +G E+ +SY +    T  R+  L
Sbjct: 208 ALFPVGALTNHQCVPNTRHIVNAEGELLVYAAVP-IAEGQEITMSYADVLWDTQMRRHFL 266

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQC 291
                F C CPRC       D+ E  ++L    C  D C G LL ++    +  +TC +C
Sbjct: 267 LATKHFACQCPRCT------DVTECGSLLGALSCAIDNCQGNLLPNTPLSLNCPWTCNEC 320

Query: 292 GLVRSKEEIKKIASEV 307
           GL+    +I  I + +
Sbjct: 321 GLIIKGRQISSIRAGI 336


>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 619

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 6/123 (4%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           +G G+YP ++++NHSC P  +  F G   +VRAV+ +  G E+  +Y  I T      R+
Sbjct: 422 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMIVRAVRTINAGEEISENYGPIFTTMPESERK 481

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFT--CQ 289
           + L+ QY F C C  C       D  +  IL  ++C+    C   L+  SD   F   C 
Sbjct: 482 RKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLMVKSDTNEFMIGCA 540

Query: 290 QCG 292
           +CG
Sbjct: 541 KCG 543



 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 6   VSGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQV---VWYCGSNCQKLDWKL 62
           + GE++I + P+       + ++ C  CF    +   +AC+    V YC   C+  D ++
Sbjct: 203 IPGEIVIVERPHCAFLLAETRLTHCHLCFVKIFVPTPAACRTYSCVAYCSRRCRDADAQV 262

Query: 63  HRLECQVLSRL 73
           H  EC++L  L
Sbjct: 263 HSQECKLLPAL 273


>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
          Length = 383

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 273 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 332

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 333 RNKILRENYLFTCSCPKC--LAQADD 356


>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 691

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/296 (25%), Positives = 120/296 (40%), Gaps = 67/296 (22%)

Query: 4   FHVSGEVIISQEPY--VCVPNNSSSISRCDGC--FASSNLKKCSACQVVWYCGSNCQKLD 59
           F   GE++  ++P   V +PN S S  RC  C  F       C  C +  YCG  C    
Sbjct: 202 FINKGEILFVEKPISIVLIPNESIS-DRCHNCNCFIGDIPIPCKTC-LYTYCGEKCLNEA 259

Query: 60  WKL-HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           W L H  EC   S++    +  +T    L  K+       +D +        ++ ++ LV
Sbjct: 260 WSLYHCWECPG-SQMRLWGQVGIT---HLASKVLFNCSTMSDKI-------RFNQLQNLV 308

Query: 119 AHMSDIDEKQLLL-----------------YAQIANLVNLILQWPEISINEIAENFS--- 158
           +H   I +  L +                 + +  NL N ++   + S N    NF+   
Sbjct: 309 SHFDKIPDADLRVNGIAAMMLTIYLSEYTNFFETTNLENCLIS--KFSDNAFNSNFNIST 366

Query: 159 -----------------KLACNAHTICNS-------ELRPLGTGLYPVISIINHSCLPNA 194
                            +L  NA  I +S       ++  + TG+YP  S++NHSC PN 
Sbjct: 367 KIGKHLYVSSLLLRYIHQLTVNAAGIIHSNVIEDDVQINIVATGIYPSASMMNHSCNPNI 426

Query: 195 VLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRC 248
           + ++  +  +VRAV+ +    E+  SY+ T     T  RQK L E Y F+C C  C
Sbjct: 427 IKIYMDQYLIVRAVEDIFPTEEIFNSYVATYRYKKTKARQKLL-ELYYFSCKCEAC 481


>gi|325187786|emb|CCA22331.1| SET and MYND domain containing 2 putative [Albugo laibachii Nc14]
          Length = 507

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/222 (25%), Positives = 105/222 (47%), Gaps = 15/222 (6%)

Query: 41  KCSACQVVWYCGSNCQKLDWK-LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN 99
           +C  C+   YC   C   D    H L+C+VL RL K +R   T  +RL++ +     +Q+
Sbjct: 169 QCDQCRFSVYCSMECLTTDATTFHELQCEVLIRL-KAERDCDTELLRLVIAVL---SMQH 224

Query: 100 DNVI-PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS 158
              + P        LV  +V +     ++QLL   + + L + +      S  +  +   
Sbjct: 225 CLALKPGNNPLQDLLVPPVVENTGQKYKEQLLKLLRGSKLSHFV------SPTQAHDVLL 278

Query: 159 KLACNAHTICNSELRPLGTGLYPVISII-NHSCLPNAVLVFEGRLAVVRA--VQHVPKGA 215
           K+  NAH +  +     G GL+P  +++ NHSC PN +L F+    ++RA  ++ +    
Sbjct: 279 KVRSNAHPLVLNGSVTCGLGLFPEAAMVFNHSCSPNIILAFQPGTRMLRAHSIRPIQPRQ 338

Query: 216 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 257
            +  +YI+   S   R++ L + + F C+C RC + G+ + +
Sbjct: 339 ALEYAYIDLLPSKSRRRQLLNDAFAFDCSCLRCYEEGKLERM 380


>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Taeniopygia guttata]
          Length = 404

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 294 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 353

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLFTC+CP+C  L Q DD
Sbjct: 354 RNKILRENYLFTCSCPKC--LAQADD 377


>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 467

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 131/318 (41%), Gaps = 43/318 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVV-WYCGSNCQKLDWKLHRLE 66
           GE+I+  +  +      +S+  C  C+   N +    C+   W    NC +     H LE
Sbjct: 66  GEIILRDDQPLITGLMYNSVPVCLQCYTMLNQEIAIPCEKCGWPLCQNCNE-----HGLE 120

Query: 67  CQV-LSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS-----TTTDNYSLVEALVAH 120
           C+   SR D +        I +    Y     Q  N+I +     T  ++Y  + +L +H
Sbjct: 121 CKFSCSRRDSK--------ISITEFGYPHPSYQCINIIRALSLKDTNPESYKKLISLESH 172

Query: 121 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG---- 176
            ++I+  +  L   IA+ +    +  +IS  EI      L  N H +      PL     
Sbjct: 173 CNEINNSKEPL--NIAHFIKRFFKADDISEEEIVTIIGILQVNGHEV------PLTDSPY 224

Query: 177 TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
             +Y + S+I H+C  N    F   G L ++RA   + KG  + I Y +    T  R+  
Sbjct: 225 VAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHISICYTDPLWGTANRRHH 283

Query: 235 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQC 291
           L +   F C C RC    +F       +    +C    CSG++L  +    ++ + C+ C
Sbjct: 284 LLKTKFFECICNRCQDPTEF-----QTMFNALKCNKINCSGYVLPKTFLEQEQDYVCKTC 338

Query: 292 GLVRSKEEIKKIASEVNI 309
             + S  EI+K+  ++ I
Sbjct: 339 ESIVSCTEIEKMLEDIGI 356


>gi|158287868|ref|XP_309762.4| AGAP010931-PA [Anopheles gambiae str. PEST]
 gi|157019395|gb|EAA05512.4| AGAP010931-PA [Anopheles gambiae str. PEST]
          Length = 383

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 71/274 (25%), Positives = 107/274 (39%), Gaps = 76/274 (27%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNC-QKLDWKLHR 64
           G+V++ ++P+V V   +    RCD C A     L  C  C V  YC   C  K   K HR
Sbjct: 125 GDVVMIEKPFVTVAKETFQYIRCDFCQAKRLFTLIPCEGCTVAMYCSEECISKAYGKYHR 184

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC VL  L      S   ++R++              I  TT DN   +E L  H+  +
Sbjct: 185 YECGVLRDLWTVLGISGVIALRMI-------------AIAITTFDND--LEKLKDHLDAL 229

Query: 125 DEKQLLLYA----------------------------QIANLV-------NLILQWPEIS 149
           DE ++  +                             ++A L        N +L+W E+ 
Sbjct: 230 DESKVDGFTMDWKKATLQDVFNTVHVLCTNQERRNIKELAGLTFFTVVMHNHLLEWTELG 289

Query: 150 -------------------INEIAE-NFSKLACNAHTICNSELRPLGTGLYPVISIINHS 189
                                +I E N+  L C   T  N E     T  YP+IS++NHS
Sbjct: 290 PACEANPTASKLLLDLILRYLQITECNYKLLTCIKITNRNPEDETFTTSCYPLISMLNHS 349

Query: 190 CLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISY 221
           C PN   +++ +GR AV+  +  V KG ++  +Y
Sbjct: 350 CAPNVRRLILPDGRCAVI-VIHTVAKGGQLFDNY 382


>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
          Length = 816

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 9/125 (7%)

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 215
           NFS  A  A T  N E   +G  +Y V S+ NHSC PN    F  R   +RA +HV  G 
Sbjct: 62  NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 119

Query: 216 EVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
            + +SY    G      RQK LK++Y F C C  C +L   D      +L  +RC +  C
Sbjct: 120 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 174

Query: 274 SGFLL 278
            G +L
Sbjct: 175 FGTVL 179


>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
          Length = 498

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 137/331 (41%), Gaps = 48/331 (14%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK----CSACQVVW-YCGSNCQKLDWK 61
           SGEVI    P +  P  +S    C GC     L +    CS C   W  CG  C+     
Sbjct: 31  SGEVIAESLPLIVGPKMASP-PLCLGCHKKLALAESRYDCSKC--FWPLCGQACESS--P 85

Query: 62  LHRLECQVLSRL-------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLV 114
           LH  EC++ ++        +   +++V   I  +  L L++           + + ++L+
Sbjct: 86  LHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRALLLKK----------NSPERFNLL 135

Query: 115 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSEL 172
               +H+ D ++ Q+    +  +LV    Q  ++  NE  I    S    N   + +++ 
Sbjct: 136 LDFQSHLQDHEKTQVYQVLK-KSLVPFFTQLLKLDTNESEILTICSIFDTNCFEVRDTQR 194

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTM 229
                GLYP IS ++HSC  N    F G   RL V+ A   + KG  V  +Y +T   T+
Sbjct: 195 LVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRL-VLTATTPIKKGDLVTTTYTQTLWGTL 253

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD----DKG 285
           +R+  LK    F C C RC    +F        L    C   G  G  +  SD    +  
Sbjct: 254 SRRSHLKMAKHFDCLCERCTDPTEF-----GTYLSAVNCSVCG-DGSKVTSSDPLNCEAD 307

Query: 286 FTCQQCGLVRSKEEI----KKIASEVNILSK 312
           + C+ CG   +  ++    + +  E+N L+K
Sbjct: 308 WFCENCGFAITGSDMLWGNEVLQKEINHLNK 338


>gi|340513811|gb|EGR44092.1| predicted protein [Trichoderma reesei QM6a]
          Length = 549

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 102/266 (38%), Gaps = 39/266 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN-------------------LKKCSACQVV 48
           GE+I +  P V        +  CD CF                       L  C+ C + 
Sbjct: 75  GELIFTSVPLVLCAEVGDGMEACDFCFQQRRRVFHPSESRFLGPGEMMPPLHVCNGCHMY 134

Query: 49  WYCGSNCQKLDWKL-HRLECQVLS--RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS 105
            YC  +C +  W   H  EC +LS    D E R      I +  K+ L  ++Q       
Sbjct: 135 QYCSKSCWQRAWDTGHLYECGLLSGATTDLETRTLYRLLILMRKKVLLPEQVQG------ 188

Query: 106 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKLACN 163
                   +  L + M++ + +    + ++  L     +    E+ I EI   +  + CN
Sbjct: 189 --------LSRLESEMANFERRAKKTWPKVLELALEAKERTKSELGIYEILILYGIIRCN 240

Query: 164 AHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 222
           A  +  +    PLG  L    +++NH C PN  +VF      VRA++ +  G E+L  Y 
Sbjct: 241 ALPVDQTYRNAPLGIALDLGGALLNHDCDPNVAIVFNNTQVQVRALRKLKAGEELLHCYR 300

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRC 248
           + A     R   +  +Y F C C RC
Sbjct: 301 DIAYDFTFRNPRITSRYQFRCHCERC 326


>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
           [Anolis carolinensis]
          Length = 163

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA   F     L  + A++ +  G E+ ISY++      S  +
Sbjct: 45  GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPGEEICISYLDCCQRERSRHS 104

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L Q DD
Sbjct: 105 RHKVLRENYLFVCSCPKC--LAQADD 128


>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
 gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
 gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
          Length = 448

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 6/102 (5%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
               F C C RC      D  +    L    C++ GC G ++
Sbjct: 288 MTKHFICKCVRCQ-----DPTENGTYLSALFCREQGCRGLVI 324


>gi|281347298|gb|EFB22882.1| hypothetical protein PANDA_013199 [Ailuropoda melanoleuca]
          Length = 1019

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 43/59 (72%)

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
           S++ CNA TICN+E++ +G GLYP +S++NHSC PN  +VF G   ++RAV+ +  G E
Sbjct: 130 SQVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEE 188


>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 449

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 23/211 (10%)

Query: 80  SVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQ----I 135
           S T  +RL+L++     + N      T  D+Y     L + +  +D++ +  + +    I
Sbjct: 104 SDTRDMRLLLRIIAN--IANGKQGMQTPIDDYQDFMGLTSTLDKVDKEHMTKFKRGVTSI 161

Query: 136 ANLVNLILQWP----EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 191
           ++L++ +         I+I+EI E FS +  NAH    +  + +G G+ P     NHSC+
Sbjct: 162 SSLISSVRGVGYLKNTITIHEILECFSSVLTNAHQFSYATSKEIGRGVCPT-GYFNHSCM 220

Query: 192 PNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           PN     + + + +     +V KG E+ + Y+        R++ L + Y F C CP C  
Sbjct: 221 PNTTWSLDDQGMLLFSTSSNVKKGDELSLGYLANEYPLKNRRRELLDGYYFFCQCPLC-- 278

Query: 251 LGQFDDIQESAILEGYRCKDDGCSGFLLRDS 281
                  +  + L GY C  + C   LL DS
Sbjct: 279 -------EFQSNLSGYLC--EKCKEPLLNDS 300


>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
 gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
          Length = 514

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 124/302 (41%), Gaps = 19/302 (6%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSAC-QVVW-YCGSNCQKLDWKLHRLE 66
           E+++ + P V  P   S+   C GC      +    C Q  W  CG  C+ LD   H+ E
Sbjct: 65  EIVLKEAPLVRGPAQISA-PVCLGCLNGIEAEDHIECEQCGWPLCGPECKSLD--EHKAE 121

Query: 67  CQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C +   R  K   +       L   L   R L      P   +  +  +E+L +     +
Sbjct: 122 CSLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKAS-KFQELESLESTRRGSN 180

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +  L + I   +    +  + S  EI +    L  N H +  ++  P    ++   S 
Sbjct: 181 QWKADLVS-IGQFIPKFFKTQKFSEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASF 237

Query: 186 INHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
             +SCLPN    F +    ++ A + + K A + I Y +    T  RQ+ L +  LF C 
Sbjct: 238 TENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCA 297

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIK 301
           C RC+ + + D    SAI    +C+D  C G +L    D  +  + C++C     K  ++
Sbjct: 298 CERCVDVTELDTFY-SAI----KCEDRQCGGLMLPTKTDDWNGNWRCRECHKQVQKHYVE 352

Query: 302 KI 303
           +I
Sbjct: 353 RI 354


>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
          Length = 392

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 103/269 (38%), Gaps = 45/269 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSI-SRCDGCFAS--SNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           GEVI++++P +  P + SS+ S C  CF S  +    C +C+   +C  +C        R
Sbjct: 27  GEVILTEQPLLLYPASLSSLPSFCSACFRSLSAAASPCPSCRAAGFCSPSCAAASHP--R 84

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEAL------- 117
           L C  LS        +          L+L                 YSL E         
Sbjct: 85  LLCTALSGGGGNGNLASAAEPHQEPLLFL--------------LSAYSLPEPSLRVLLSL 130

Query: 118 --VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP- 174
              A     D+    L+A +  LV   +  P  S +  A   SK   N+ +I     RP 
Sbjct: 131 SSAATPPPRDQDPGSLHAMVTALVPPQMLPPGFSPDLTAALLSKDRTNSFSIMEP-YRPE 189

Query: 175 --------LGTGLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLI 219
                       +YP  S++NH CLPNA    +  R        VVRA+  + +G EV +
Sbjct: 190 VPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCL 249

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SY         RQ+ L E Y F C C RC
Sbjct: 250 SYFAANWQYKDRQQRLLEDYGFRCECERC 278


>gi|157136250|ref|XP_001656795.1| hypothetical protein AaeL_AAEL003447 [Aedes aegypti]
 gi|108881063|gb|EAT45288.1| AAEL003447-PA [Aedes aegypti]
          Length = 537

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 75/297 (25%), Positives = 121/297 (40%), Gaps = 41/297 (13%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLK--KCSACQVVWYCGSNCQKLDWK-LHR 64
           GEV + + P + V    +  SRC+ C + + L    C  C  V YC   C+   +   H+
Sbjct: 195 GEVAVVERPLLVVVEPEAVQSRCNYCGSKNELDLIPCRKCVSVMYCSEKCRDEAYSCYHK 254

Query: 65  LECQVLSRLDKEKR-KSVTPSIRLMLKLYLR--RKLQND--NVIPSTTTDNYSLVEALVA 119
            EC V+  L    R    T   +L LKL+      L ND    +        S  + L  
Sbjct: 255 FECAVIKDLKNLFRGPKPTRMFQLTLKLFWMALNDLINDQEGFLKKYQEGLKSFRDPLEM 314

Query: 120 HMS-----------DIDEKQLLLYAQIANLVNLILQWPEISINE-------------IAE 155
            +S           + D    L    +   + +++    +  NE             + E
Sbjct: 315 DLSKQLHMHVLANDEPDTSSDLTGKGVTQFLTVLIYKIAVEENESVPTQVLKDNNSLLLE 374

Query: 156 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKG 214
              KL   A  IC+  +  L T  YP++ +INHSC PNA  +  G L +++   + +  G
Sbjct: 375 IMQKLVLMAKQICDQSVDDL-TCFYPLLQMINHSCAPNAERIVSGDLRSIILTKRPINAG 433

Query: 215 AEVLISYIETAGST-----MTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEG 265
            ++LI Y    GST       R++ L++++ F C C  C +       I+E+A L G
Sbjct: 434 EQILICYFPN-GSTDYKDKTKRKEMLQKEFQFECQCLGCSLDYPLLSTIEENAELRG 489


>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
           C-169]
          Length = 329

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 61/149 (40%), Gaps = 18/149 (12%)

Query: 118 VAHMSDIDEKQLLLYAQIANLVNLILQWP---EISINEIAENFSKLACNAHTIC-----N 169
            AH    D   L      A L   + Q      +S  E+A+  +K   N + I       
Sbjct: 77  AAHSRSKDSSDLFRRLSTATLSAQLPQLAGSRPLSPEEVADALAKEELNGYGIMAPSGPE 136

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLI 219
            E R  GT LYP  S++NH CLPN   V           E      +A+ ++P G E   
Sbjct: 137 GERRIRGTALYPKASLLNHECLPNVARVDNFDEADVNAPENTAVHFKALHNLPAGEEFTQ 196

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SY     S   RQ+  ++QY F C CPRC
Sbjct: 197 SYFPMHTSYHVRQQRCQDQYGFACNCPRC 225


>gi|91088043|ref|XP_966391.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270012080|gb|EFA08528.1| hypothetical protein TcasGA2_TC006181 [Tribolium castaneum]
          Length = 492

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 113/468 (24%), Positives = 192/468 (41%), Gaps = 85/468 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK-CSACQVVW-YCGSNCQKLDWKLHR- 64
           G++I ++ P+   P  S S   C GCFA  +    CS C   W  CG  C+      HR 
Sbjct: 26  GDLIFTETPFAYGPK-SDSPPLCLGCFAPVDCTVLCSHCS--WPVCGPECEAK--PSHRD 80

Query: 65  LECQVLSRLDKE------------KRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
            EC V +    +            + + +TP   L++K    ++ Q +            
Sbjct: 81  FECGVFASAGVKFQAVEDPTNVCLQYECITPLRVLLMKEKDPKRWQEE------------ 128

Query: 113 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 172
            +EA+ +H      K +  + Q+ N+V+  L+ P   ++  A+      C    I   E 
Sbjct: 129 -IEAMESHNDVRKTKPIWEFNQV-NIVDY-LKGP-CKLDRFADELIHTVCGILEINAFEA 184

Query: 173 RP----LGTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 225
           R     L   L+P ++I++H+C+ N   +V     +++V  +V+ VP+  E+  SY  + 
Sbjct: 185 RASSGYLIRCLFPKLAILSHNCVSNVHHSVDCDTFQVSVSASVE-VPESGELFSSYTYSL 243

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             T+ R++ LKE   F C C RC      D  +    L   +C  + C   ++  +D   
Sbjct: 244 WPTLVRREFLKESKYFECMCDRCK-----DKTELGTHLGTLKC--NKCDNGVILSTDPLS 296

Query: 286 FTCQ----QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 341
           FTC+     C    S   ++K+   V         L+      E +   + I +  + ++
Sbjct: 297 FTCEWKCTHCEFKTSAIAVRKVYEAVQTEIDAVEMLSG----PEGIEQREAIFRKYRSVF 352

Query: 342 HP-------FSVNLMQ--------TREKLIKILME--LEDWKEALAYCQLTIPVYQRV-- 382
           HP          +L Q        T E L  +L+E  +E   + L    +  P + R+  
Sbjct: 353 HPKNAYMTILRTSLSQLYGRAEGYTVEDLPDLLLERKVELCNQLLEVLDVVEPGHSRIRG 412

Query: 383 ---YPQFHPLLGL--QYYTCGKL--EWFLGDTENAIKSMTEAVEILRI 423
              Y    PLL L    YT G +  E F    E A++ + +AVEIL+I
Sbjct: 413 VTLYELHAPLLILARNQYTSGMIDKEQFRKKMEEAVEILGKAVEILKI 460


>gi|66800595|ref|XP_629223.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
 gi|60462643|gb|EAL60846.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
          Length = 508

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 176 GTGLYPVISIINHSCLPNAVL---VFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 229
           G GL+P+ S +NHSC PN  +   + +G  +V   V+A +++P G+E+L SY +   S  
Sbjct: 428 GVGLFPIFSCMNHSCFPNVEISNEIIDGVTSVRMVVKAKKNIPAGSEILHSYCDETLSNK 487

Query: 230 TRQKALKEQYLFTCTCPRCIK 250
            R+  L  QY F CTC +C K
Sbjct: 488 ERKDILFSQYGFKCTCNKCSK 508


>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 80/188 (42%), Gaps = 16/188 (8%)

Query: 74  DKEKRKSVTPSIRLMLKL-YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY 132
           D E    +  S+R  + L     K +   V P   +D   LV  L     D   K    Y
Sbjct: 458 DPETASQLYFSLRSNMSLGVFEEKEETHAVRPPRWSDTARLVTNLSMLSKDSRSKFRCAY 517

Query: 133 AQIANLVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPV 182
            +      LIL W       E ++   A  F++L     CN+  + N++   +G  LYP 
Sbjct: 518 RR---FTKLILPWLGEGGSTEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPE 574

Query: 183 ISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIET-AGSTMTRQKALKEQYL 240
            S  NHSC PN   V + G LA   A++ + KG  + I Y++    ST  R++ L   Y 
Sbjct: 575 ASYFNHSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYR 634

Query: 241 FTCTCPRC 248
           F C C RC
Sbjct: 635 FFCECARC 642


>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
          Length = 739

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 64/295 (21%)

Query: 169 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAG 226
           N  +R L T ++PV+S++NHSC PN  + F GR   VRA + + +  EV   Y   +   
Sbjct: 465 NKSIR-LATAVFPVLSLLNHSCDPNTTVSFTGRFVTVRANRPIRRDEEVTHCYGPHKLRM 523

Query: 227 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 286
               RQ+ LK+QY F C C  C +                           L+     GF
Sbjct: 524 DVAERQQLLKDQYFFVCQCKACTE--------------------------ELKGKKTHGF 557

Query: 287 TCQQCGLVRSKEEI--------KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 338
            C  C      EE            AS+  + S+    L   GNH ++        +LQ 
Sbjct: 558 FCPLCKAQLEGEEALYCTGARCTYTASQTQLTSR----LNQLGNHIQIAKV-----QLQD 608

Query: 339 KLYHPFSVNLM-----------QTREKLIKILMELE-------DWKEALAYCQLTIPVYQ 380
                  + LM           Q    L +I+  L        +WK A  + + +I + +
Sbjct: 609 NKTDNAKMTLMSCLSEAECFLSQDHLLLGEIMDHLAQAEASDGNWKAAAGHLRKSISIVK 668

Query: 381 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 435
             Y      LG + +   ++ +   +  +A++++ EA ++L + +G +   ++EL
Sbjct: 669 VHYGSSSMELGHELFKLAQILFNGFEVPDAMRTIMEAQKVLSMHYGPDHNLVQEL 723


>gi|402590196|gb|EJW84127.1| hypothetical protein WUBG_04964 [Wuchereria bancrofti]
          Length = 283

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 132/288 (45%), Gaps = 32/288 (11%)

Query: 149 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 208
           S  E+ + F K+  N   I +S L   G G+Y  +S ++HSC P+A ++F G  A++R++
Sbjct: 3   SSTEMFDIFCKIVINTVVITDSCLSRTGIGVYLGLSALDHSCKPDAFIIFNGTKAILRSL 62

Query: 209 QH--VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 266
                     + I Y +    T TR K ++ Q+ F C C  C  + + D  ++   L   
Sbjct: 63  SKNITEYNNNLRIPYCDLLELTSTRCKYMQLQHNFICNCEICQNV-ELD--RQKCSLRCT 119

Query: 267 RCKDDGCSGFLLRDSDDKGFT-CQQCGLVR--SKEEIKKIASEVNILSKKTLALTSCGNH 323
           +C D  C        D++  T C+ CG +   + + ++K+  ++      T   ++  N 
Sbjct: 120 KCTDGFCP---YSPEDEQAETRCKVCGEISLFNFDHLQKLYQQL------TTDDSTEKNL 170

Query: 324 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 383
            E++  Y   E    +++ P++V L +  EK++   M+   + EA           +  Y
Sbjct: 171 NELIDLYCESE----EVFSPYNVPLCKFAEKIMISAMKHHKYDEAA----------KTFY 216

Query: 384 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 431
           P+  P L  +     KL   L   E +   + EA++++R ++G+ S F
Sbjct: 217 PKGLPPLPTRMLEYAKL-LMLQHDEASFPILREALKMIRESYGSESNF 263


>gi|302834810|ref|XP_002948967.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
 gi|300265712|gb|EFJ49902.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
           nagariensis]
          Length = 679

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)

Query: 148 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 207
           ++ N   E+    A +A    ++EL     GL+P  S+INHSC P A     G + VVRA
Sbjct: 218 VTFNAYGESKPDPAVSAVRSLSAELLSGCVGLWPPFSLINHSCAPVASYGLVGDVMVVRA 277

Query: 208 VQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCI 249
              +P G +V ISY      G    R+  L++ Y F C C RC+
Sbjct: 278 AADLPAGQQVTISYFGRRALGPLELRRAYLRQHYGFVCACERCV 321


>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 697

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 51/202 (25%)

Query: 131 LYAQIANLVNLILQ--WPEISINEIAENFSKLACNAHTICNSELRP-------------- 174
            + Q+ +L+N++ +  W       I  +  +L CN H I  SELR               
Sbjct: 353 FFQQLGDLINVMSKENWSIYCGAVIFRHIGQLVCNGHAI--SELRGSFASENNCLEADSF 410

Query: 175 ---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 219
                          + TG++P IS+ NHSC PN    F      V A + +P G E+  
Sbjct: 411 NIKAGFLHRYFESTRVFTGIFPQISMFNHSCEPNIRNTFNKNTLTVYAAKDIPVGGEIFN 470

Query: 220 SYIETAGSTMT------RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 273
            Y    G          R+ AL++QY F C C RC      D+  ES   E Y+C    C
Sbjct: 471 CY----GPNFKLMCKDERKSALRQQYGFDCKCTRCAMNN--DEAYES--FEHYKCPFAKC 522

Query: 274 SG-FLLRDSDD---KGFTCQQC 291
           S  F+L+++ D   K   C  C
Sbjct: 523 SKYFMLKENADPFEKDIKCPMC 544


>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
 gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
          Length = 555

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 50/216 (23%), Positives = 100/216 (46%), Gaps = 17/216 (7%)

Query: 48  VWYCGSNCQKLDWKLHRLECQVLSRL-DKEKRKSVTPSIRLM-LKLYLRRKLQNDNVIPS 105
           +W+C + C   D  +H+LEC  +S+  +   +     SI L+ L++ ++ K+ ++ V   
Sbjct: 86  MWFCSNECA--DGMVHQLECSFISKFVELSLKHDCDASISLLALRVVMKNKVASE-VFEQ 142

Query: 106 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEI--AENFSKLACN 163
           T     +     + +     +K    Y+++ + V LI++     ++ I   E + ++ C+
Sbjct: 143 TVGKLTNQKSIFIQNNKSFLDK----YSKLFDQVYLIVKSENSELSSIFNKEEYLEVVCS 198

Query: 164 ------AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 217
                 A    N    P+  G +   +++NHSC PN     +     +R V+ + KG ++
Sbjct: 199 IFVNSFAGLSNNFNRTPISNGYFYKAALLNHSCEPNTFFSIQNSTLEMRVVKDINKGEQI 258

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 253
             SY++    T+ RQK L +   F C C RC  + +
Sbjct: 259 FDSYVDLLLPTIERQKILLKSKNFVCDCSRCTDITE 294


>gi|452846662|gb|EME48594.1| hypothetical protein DOTSEDRAFT_19117 [Dothistroma septosporum
           NZE10]
          Length = 199

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 48/92 (52%), Gaps = 16/92 (17%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 216
            S L CN+  +      PLG  +  +IS +NHSC+PNA +VFE                E
Sbjct: 38  ISLLLCNSLILMTPANDPLGIVVELLISNVNHSCVPNACIVFE----------------E 81

Query: 217 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           + +SY++  G    RQ AL  +Y FTCTCP+C
Sbjct: 82  IFVSYVDQHGLFHARQAALWHRYYFTCTCPKC 113


>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
 gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
          Length = 487

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 117/488 (23%), Positives = 205/488 (42%), Gaps = 79/488 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLHR 64
           GE+I+ ++P +  P+ +S+I  C GCF        K C  CQ+   CG +C  L  +  +
Sbjct: 15  GEIILKEQPLILGPSINSNI-LCLGCFRRILFEKFKFCRECQIAPVCGEDCS-LKSQHSK 72

Query: 65  LECQVLSRL---DKEKRKSVTPSIRLMLKLY-LRRKLQNDNVIPSTTTDNYSLVEALVAH 120
            EC+   +L   DK  +  +  + +++L L     KL    + P    D+  L +  +  
Sbjct: 73  YECKYYKKLFENDKIDKNFIVSNFQIVLPLKCFSLKLF---LFPDDDDDDDHLWDNFINM 129

Query: 121 MSDIDEKQ---LLLYAQ-----IANLVNLILQWPEISINEIAE----NFSKLAC--NAHT 166
            S +DE+Q   +  Y Q     + N VN++     + + +I      N  ++ C  +   
Sbjct: 130 ESHVDERQNEPIWNYYQENIIKVLNSVNILNDEECLLMQKICGILDVNSFEMRCPDDEQI 189

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETA 225
              ++LR    G+Y   + ++H C+ N  +       + + A   +  G  +  +Y    
Sbjct: 190 SAGNQLR----GIYREAAFMSHDCIGNTHVSLNDFFEMTIHASVPIKSGQAIFFNYANPF 245

Query: 226 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
             T  RQ+ LK+   F C C RC    +      S I    RC  +GC       SD  G
Sbjct: 246 ECTAARQQRLKKGKYFNCFCKRCTDPTELGTYSGSIICP--RCH-EGCIISQNPLSDSSG 302

Query: 286 FTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 344
           + C++C      + IK    E  N+LS          +  E  ST   +E L KK    F
Sbjct: 303 WNCEKCNHFFYNQLIKATNVEGQNLLS----------DVDETNST--QLENLLKKCLKSF 350

Query: 345 SVN---LMQTREKLI---------------KILMELEDW-KEALAYCQLTIPVYQR---- 381
             N   + + ++KLI               KIL+++  + +E +   ++  P   R    
Sbjct: 351 PQNNYVMTEIKQKLITSLRDTIRNQMNPQKKILIKIIRFCREMINLLEIIEPGISRHKGI 410

Query: 382 -VYPQFHPLLGL--QYYTCGKLEW--FLGDTENAIKSMTEAVEILRITHGTNSP---FMK 433
             Y  + P+L L  + Y   ++ +  F+ + E   K + +A++IL I    NSP   F+K
Sbjct: 411 VFYELYKPMLMLAEKKYELKEINYLEFVYEMEECEKILKKAIDIL-IYEPNNSPEGRFLK 469

Query: 434 ELILKLEE 441
             I +L++
Sbjct: 470 IAIHELKD 477


>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
 gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
          Length = 391

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 214
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 215 AEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
 gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
          Length = 392

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 214
           ++K+A  A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326

Query: 215 AEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 662

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           TGL+   S +NHSC+PN+V  F G L VVRA+QH+  G E+  +Y +  G    RQ  L+
Sbjct: 515 TGLWARASYLNHSCMPNSVKDFAGDLIVVRALQHIRTGDEITHAYQDN-GDYDARQALLQ 573

Query: 237 EQYLFTCTCPRCI 249
             + FTC C  C+
Sbjct: 574 TTWGFTCRCKLCV 586


>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 278

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 40/76 (52%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
            +YP +S  NH C PN      G +    AV+ +P G E+ ISYI    S   RQK L E
Sbjct: 69  AVYPKVSRFNHDCRPNVHYRLSGLIHTTVAVRDIPAGTELTISYIYVRKSRAERQKELSE 128

Query: 238 QYLFTCTCPRCIKLGQ 253
            + FTCTC +C K  +
Sbjct: 129 SWNFTCTCEQCSKSAE 144


>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Acyrthosiphon pisum]
          Length = 387

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 47/80 (58%), Gaps = 10/80 (12%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYI---ETAGST 228
           G+ L+   S INHSC PNA  VF+G    RL  +R ++    G E+ ISYI   E   S 
Sbjct: 289 GSALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIE---PGDEINISYIAPCELDHSR 345

Query: 229 MTRQKALKEQYLFTCTCPRC 248
            TRQK L+E Y+FTC C +C
Sbjct: 346 HTRQKYLQENYVFTCRCIKC 365


>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
          Length = 697

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 70/159 (44%), Gaps = 15/159 (9%)

Query: 102 VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEISINEIAE 155
           V P   +D   LV  L     D   K    Y +      LIL W       E ++   A 
Sbjct: 487 VRPPKWSDTARLVTNLSVLSKDSRSKFRCAYRR---FTKLILPWLGEGGSTEANLTVTAT 543

Query: 156 NFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQH 210
            F++L     CN+  + N++   +G  LYP  S  NHSC PN   V + G LA   A++ 
Sbjct: 544 FFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALRE 603

Query: 211 VPKGAEVLISYIET-AGSTMTRQKALKEQYLFTCTCPRC 248
           + KG  + I Y++    ST  R++ L   Y F C C RC
Sbjct: 604 IRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
          Length = 409

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 59/104 (56%), Gaps = 8/104 (7%)

Query: 154 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVV 205
           A+ F+ L C    N+++I +   + +G  LY +IS  NHSC PNA ++FE     R A +
Sbjct: 300 AQMFNYLVCILLTNSYSIEDKNGQEIGAALYSLISCCNHSCAPNAQVIFEDSEDAREATL 359

Query: 206 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 249
             ++   +G E+ ISYI   G +++ ++    Q+ FTC C RC+
Sbjct: 360 VLLRPCVQGEELYISYITDLGRSVSERRRELAQWCFTCQCTRCL 403


>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
          Length = 529

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 99/415 (23%), Positives = 165/415 (39%), Gaps = 47/415 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVW-YCGSNCQKLDWKL--HR 64
           G+VI+S+ P V  P     + +C GC+ +     C+ C   W  C   C+ L  +    +
Sbjct: 61  GDVILSEAPLVIGPCTGCKV-QCLGCYKTLESSFCTKCTCGWPLCSPKCKGLGKRFGHSQ 119

Query: 65  LECQVLSR------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           +EC VL        L+ +  + + P+   ++ L   R L    ++  T   +Y  +  + 
Sbjct: 120 VECAVLKATKSSKFLNYDNFEEMRPNFNAIVPL---RCL----LLKETDARSYQHLMTME 172

Query: 119 AH---MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
            H     +I E        + + + +     E S  EI      L  N+  I    +   
Sbjct: 173 THNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVNSFEIGQQGVNI- 231

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
             GLYP   +++H C+PN   + E    RL V RA   +  G  + +SY  T  ST+ R+
Sbjct: 232 -RGLYPSAFLMSHDCVPNTNHIDEESTFRLTV-RASTRIEPGEMITLSYAYTLQSTLKRR 289

Query: 233 KALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 289
           + L E   F C C RC    +LG F     ++ L   +CKD    GF+L  +        
Sbjct: 290 EHLLENKFFECQCRRCSDPTELGTF-----TSALICPKCKD----GFVLSSNPLDPEADW 340

Query: 290 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN-- 347
            C         K  A  + +L  K        +  +V++    ++K +  L+    +N  
Sbjct: 341 TCSNYNKCPGYKISAKSMQLLLDKITQEVDNIDCNDVITMETFLKKYRNVLHPTHYINLS 400

Query: 348 ----LMQTREKLIKILM-ELED--WKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               L Q   ++   L+ EL D      L  C+  I V+  + P F  L G+  Y
Sbjct: 401 VKLSLSQLYGRINGFLIDELSDEVLTRKLEICEEIIKVFNVIEPGFTRLRGVTLY 455


>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
          Length = 697

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 2/93 (2%)

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAE 216
           + L CN+  + N++   +G  LYP  S  NHSC PN   V + G  A   A++ +PKG  
Sbjct: 550 AALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEP 609

Query: 217 VLISYIET-AGSTMTRQKALKEQYLFTCTCPRC 248
           + I Y++    ST  R++ L   Y F C C RC
Sbjct: 610 LTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
          Length = 528

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 99/415 (23%), Positives = 165/415 (39%), Gaps = 47/415 (11%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVW-YCGSNCQKLDWKL--HR 64
           G+VI+S+ P V  P     + +C GC+ +     C+ C   W  C   C+ L  +    +
Sbjct: 60  GDVILSEAPLVIGPCTGCKV-QCLGCYKTLESSFCTKCTCGWPLCSPKCKGLGKRFGHSQ 118

Query: 65  LECQVLSR------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           +EC VL        L+ +  + + P+   ++ L   R L    ++  T   +Y  +  + 
Sbjct: 119 VECAVLKATKSSKFLNYDNFEEMRPNFNAIVPL---RCL----LLKETDARSYQHLMTME 171

Query: 119 AH---MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL 175
            H     +I E        + + + +     E S  EI      L  N+  I    +   
Sbjct: 172 THNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVNSFEIGQQGVNI- 230

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
             GLYP   +++H C+PN   + E    RL V RA   +  G  + +SY  T  ST+ R+
Sbjct: 231 -RGLYPSAFLMSHDCVPNTNHIDEESTFRLTV-RASTRIEPGEMITLSYAYTLQSTLKRR 288

Query: 233 KALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 289
           + L E   F C C RC    +LG F     ++ L   +CKD    GF+L  +        
Sbjct: 289 EHLLENKFFECQCRRCSDPTELGTF-----TSALICPKCKD----GFVLSSNPLDPEADW 339

Query: 290 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN-- 347
            C         K  A  + +L  K        +  +V++    ++K +  L+    +N  
Sbjct: 340 TCSNYNKCPGYKISAKSMQLLLDKITQEVDNIDCNDVITMETFLKKYRNVLHPTHYINLS 399

Query: 348 ----LMQTREKLIKILM-ELED--WKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 395
               L Q   ++   L+ EL D      L  C+  I V+  + P F  L G+  Y
Sbjct: 400 VKLSLSQLYGRINGFLIDELSDEVLTRKLEICEEIIKVFNVIEPGFTRLRGVTLY 454


>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
           Shintoku]
          Length = 150

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 166 TICNSELRP--LGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYI 222
           TI   EL P  +G GL+  +S +NHSC PN  + + +  +A +  +  VP+G E  ISYI
Sbjct: 56  TIQKKELYPETMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYI 115

Query: 223 ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 252
           +   S   RQ+ L + Y F C C +CI  G
Sbjct: 116 DEKDSVENRQEKLYKNYGFKCDCNKCILEG 145


>gi|113206685|gb|ABI34493.1| SET and MYND domain containing 2b [Danio rerio]
          Length = 129

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 8/111 (7%)

Query: 30  CDGCFA-SSNLKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C+ CF     L KC  C+  +YC +NCQK +W +H+LECQ +    +  R S T  +RL+
Sbjct: 2   CEQCFTRKKGLAKCGKCKKAFYCNANCQKKNWPMHKLECQAMCAFGENWRPSET--VRLV 59

Query: 89  LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANL 138
            ++  R K Q +     + ++   L+  + AH+ D+D EK+ +  A IA L
Sbjct: 60  ARIIARLKAQKER----SPSEILLLLGEMEAHLEDMDNEKREMTEAHIAGL 106


>gi|157104022|ref|XP_001648222.1| hypothetical protein AaeL_AAEL003992 [Aedes aegypti]
 gi|108880438|gb|EAT44663.1| AAEL003992-PA [Aedes aegypti]
          Length = 594

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 76/292 (26%), Positives = 115/292 (39%), Gaps = 52/292 (17%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNCQ-KLDWKLH 63
           +G+++I ++P+  +  N      CD C       L  C  C V  +C + CQ K     H
Sbjct: 213 AGDIVIEEKPFSSLLVNDHRYMNCDYCHDDKFLTLIPCKCCTVTMFCSTKCQQKAMDNYH 272

Query: 64  RLECQVLSRLDKEKRKSVTPSIRLM-------------LKLYLR---RKLQNDNVIPSTT 107
           R+EC V+  +     K +  ++R               L+LY+     K  N   +  TT
Sbjct: 273 RIECSVIKDMQLLFTKVILMALRTTTTAISTFDYNLEELRLYVESIDEKSLNPFKLDWTT 332

Query: 108 TDN---YSLVEALVAHM---SDIDEKQLLLYAQIANLVNLILQWPEISI----NEIAENF 157
            D+   YS +  L  +    S  D  Q  +YA I  +  L+ Q  E+      NE  +  
Sbjct: 333 IDSKQVYSTIHVLATNQDSRSTSDIVQRSVYAII--MSELLFQHTELGKLCDNNESHDLI 390

Query: 158 SKL--------ACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFE 199
             L          N H++   +  P          LG G +P++S+INHSC PN V +  
Sbjct: 391 RTLLFRHAQTAPVNMHSVMFMDYTPNEIEKYSQLKLGCGSFPILSMINHSCAPNLVRMTL 450

Query: 200 GRLAVVRAVQH-VPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
               VV  V   + KG ++     Y     +   RQ  L  QY F C C  C
Sbjct: 451 PNGHVVALVNRPIKKGGQLFDNYGYHHCLDTLDERQSGLLGQYCFRCQCEAC 502


>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 423

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 229
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 314 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 372

Query: 230 TRQKALKEQYLFTCTCPRC 248
           TR+K L   YLF C CP+C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391


>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
           vinifera]
          Length = 405

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/264 (27%), Positives = 102/264 (38%), Gaps = 27/264 (10%)

Query: 8   GEVIISQEP---YVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNC--QKL 58
           G++I++  P   Y   P +SSS + C  CF      S L  CS+C  + +C  +C    L
Sbjct: 35  GQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCSTLVSCSSCPCL-FCSPDCLTHAL 93

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
                   C  LS L      S++ S R +   +L        V PS      SL + + 
Sbjct: 94  SSSHSPWACLTLSLLRASPSLSLSHSERQVQARFLVAAYNLAIVSPSHFHILLSL-QGMA 152

Query: 119 AHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLGT 177
              SD D    L     +      +     S+       +K   NA  +     L P G 
Sbjct: 153 LPSSDSDAPTFLHSLLSSLSPPQGVAG--FSVELTTALLAKDKLNAFGLMEPPALAPGGE 210

Query: 178 ------GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
                 G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY   
Sbjct: 211 RSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPV 270

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
             +   RQK L E Y FTC C RC
Sbjct: 271 NETYADRQKRLLEDYGFTCYCDRC 294


>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
          Length = 420

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 363 RQKILRENYLFVCSCPKC--LAQADD 386


>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
 gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
          Length = 421

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 305 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 364

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           RQK L+E YLF C+CP+C  L Q DD
Sbjct: 365 RQKILRENYLFVCSCPKC--LAQADD 388


>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 737

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 98/486 (20%), Positives = 174/486 (35%), Gaps = 126/486 (25%)

Query: 8   GEVIISQEPYVCVP-NNSSSISRCDGC--FASSNLKKCSACQVVWYCGSNCQKLDWK-LH 63
           GEV+  ++P+  V  +N  S + C  C  F       C  C    YC   C+K  W   H
Sbjct: 237 GEVLFVEKPFAFVLLDNEYSDAVCANCLKFRGDVPVPCKFCASTVYCTEQCRKKAWSTYH 296

Query: 64  RLEC--QVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHM 121
           + EC    +   D+     +T  +R  L           N   +  T  ++ ++ LV ++
Sbjct: 297 QWECFGNQIGIWDQIGIAHLT--VRTFL-----------NCCYTDDTKKFNEIQRLVTNI 343

Query: 122 SDIDEKQLLLYAQIANLVNLILQ-----WPEISINE-------------------IAENF 157
             I  + + +Y   A ++ L L      +  I+I E                   + E +
Sbjct: 344 DKIATQDMFVYGVSALMMTLYLNKFTNFFKSINIYEKLYKKFDNKELNMYILSEFVPEKW 403

Query: 158 S-----------------KLACNAHTICNSELRP-------------LGTGLYPVISIIN 187
           +                 +L CN H I    +               + T +YP  S++N
Sbjct: 404 TEDLNFVYISGILLRHMLQLICNGHAITRLNISDSESGNVVTEYQCRIATAIYPSASMMN 463

Query: 188 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTC 245
           HSC PN +  F+ +  +V+A + +    EV   Y           RQ AL+ QY FTC C
Sbjct: 464 HSCDPNIINSFKDQYLIVKATKDIAAKEEVFNCYGPHYRRMRKKDRQIALQNQYCFTCEC 523

Query: 246 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKI 303
                                    + C+   L++  DK   F C++C            
Sbjct: 524 -------------------------EACTQRALQNFSDKFQRFNCEECN----------- 547

Query: 304 ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 363
              V I+S  ++    C    ++V +  +  +   KL+    +NL   +           
Sbjct: 548 -GPVEIISHSSMRCLDCETTFDLVKSQLLELEEANKLFEAAKINLKSQKV---------- 596

Query: 364 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 423
             KEAL   +  + + +R+  ++H  + L Y   GK+    G   ++I  +  ++  +  
Sbjct: 597 --KEALENAKQCLEIRKRILYEYHESVTLTYDLIGKIFAVTGRWLDSISHLEHSLAAVEE 654

Query: 424 THGTNS 429
             G +S
Sbjct: 655 RFGPDS 660


>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 81/188 (43%), Gaps = 16/188 (8%)

Query: 74  DKEKRKSVTPSIRLMLKLYL-RRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY 132
           D E    +  S+R  + L +   K +   V P   +D   LV  L     D   K    Y
Sbjct: 458 DPETTSQLYFSLRSNMSLGVCEEKEETHAVRPPRWSDTARLVTNLSVLSKDSRSKFRCAY 517

Query: 133 AQIANLVNLILQW------PEISINEIAENFSKLA----CNAHTICNSELRPLGTGLYPV 182
            +      LIL W       E ++   A  F++L     CN+  + N++   +G  LYP 
Sbjct: 518 RR---FTKLILPWLGEGGSIEANLTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPE 574

Query: 183 ISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLISYIET-AGSTMTRQKALKEQYL 240
            S  NHSC PN   V + G LA   A++ + KG  + I Y++    ST  R++ L   Y 
Sbjct: 575 ASYFNHSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYR 634

Query: 241 FTCTCPRC 248
           F C C RC
Sbjct: 635 FFCECARC 642


>gi|170061020|ref|XP_001866058.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879295|gb|EDS42678.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 494

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 122/318 (38%), Gaps = 73/318 (22%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL---- 62
           SGEVI  + P V  P  ++S+  C GC  + +     A +  +    NCQ   W L    
Sbjct: 26  SGEVIFRESPTVLGPK-TASVPLCLGCHRNLDPITTDAGKKYY----NCQHCGWPLCSPS 80

Query: 63  ------HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEA 116
                 HR ECQ+ +       KS  P IR             D ++PS     Y  +  
Sbjct: 81  CETSCYHRDECQLFAS------KSYRPQIRF------------DELVPSKKHSGYCAIVP 122

Query: 117 LVAHM----------------SDIDEKQ-LLLYAQI-ANLVNLILQ----WPEISINEIA 154
           L A +                S ++ +Q   LYA + +NLV  +        E+S+ ++ 
Sbjct: 123 LRAILLKRKEPARFAKLATLESHVETRQSTPLYAAVQSNLVPFVRDVLNLRNEVSVGQLM 182

Query: 155 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR-AVQHVPK 213
           E       N++ I   E       LY + +++ H C PN    F+  L +V  A   +PK
Sbjct: 183 EIAGIFDTNSYEIRIPERGIKIRALYELGAMMAHCCQPNTKHFFDEELNLVMIAAVDIPK 242

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL---------- 263
              + ISY +   +T+ R+  +K+   F C C RC    +F     S +           
Sbjct: 243 DEMISISYAQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCPQCSKSMVVA 302

Query: 264 -------EGYRCKDDGCS 274
                    +RC+D  CS
Sbjct: 303 VNPLDFRSDWRCEDKKCS 320


>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
          Length = 391

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)

Query: 173 RPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMT 230
           + +G+GLY   S  NHSC PN   V +  G   V+ A++ + +G E+  +YI+ + S   
Sbjct: 268 KSIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEA 327

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 277
           R K LKE Y F C CP C      +       L+ Y CK   C+G +
Sbjct: 328 RIKKLKESYFFNCQCPGCK-----NAPSHKQFLDKYLCKVKQCNGVI 369



 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 34/77 (44%), Gaps = 10/77 (12%)

Query: 7  SGEVIISQEPYVCVPNNSSSISR--CDGCFA--------SSNLKKCSACQVVWYCGSNCQ 56
          +G VI+ + P +   N    +    C  C          SS L  CS C++VWYC  +C 
Sbjct: 22 AGTVIVRELPIIQTVNYMDPLQNAFCHQCLTYHHQHQQHSSPLLHCSKCKLVWYCNRDCA 81

Query: 57 KLDWKLHRLECQVLSRL 73
            D   H LEC   SRL
Sbjct: 82 SKDADQHELECPSYSRL 98


>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 429

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 229
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 320 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 378

Query: 230 TRQKALKEQYLFTCTCPRC 248
           TR+K L   YLF C CP+C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397


>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 419

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 89/200 (44%), Gaps = 12/200 (6%)

Query: 61  KLHRLECQVLSRLDKEK-RKSVTPSIRLMLKLYLRRKLQN-DNVIPS---TTTDNYSLVE 115
           +LH L+C  L+ L   K  K     +R++L +  RR+L+  D   P+   +    ++   
Sbjct: 176 RLHPLQCPGLAFLASCKLTKDDKTVVRVLLHVLARRELEAIDGPPPAGGVSGKPRFADFL 235

Query: 116 ALVAHMSD--IDEKQLLLYAQIANLVNLILQ----WPEISINEIAENFSKLACNAHTICN 169
            LVA++    I   +     ++   V  +L       + +  E+ E  SK+  N   + +
Sbjct: 236 CLVANVPSLAISPDRYKGDMEVVRYVEKMLASCSLAHKYTSQELMEYISKIESNTFGMWS 295

Query: 170 SELRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
            +L  +G  LY   S  NHSC PN      EG+     A   +P G EV I YI+    T
Sbjct: 296 DKLVSMGMVLYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHDIPAGDEVCIRYIDVDKPT 355

Query: 229 MTRQKALKEQYLFTCTCPRC 248
            +R+  L   Y FTC CP C
Sbjct: 356 TSRRSELLSHYHFTCMCPLC 375


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 115/294 (39%), Gaps = 49/294 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKC-SACQVVWYCGSNCQKLDWKLHRLE 66
           GE I+    Y  V  + +    C  C  +++   C   C  V++C   C++    +H  E
Sbjct: 29  GESILRDCAYGNVVLSGNRAELCAVCLRAADADICCDDCSKVFFCSEECREKLRDIHENE 88

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           C+ L  +D                           +   T+ D +S  E  V        
Sbjct: 89  CEALEEVDL--------------------------IASKTSVDLHSDEEGNVRSS----- 117

Query: 127 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 186
                YA + +LV+++ +      + +     ++  +    C+        GL+ +  +I
Sbjct: 118 -----YANVKDLVHVLDKEAGPWADHVRAGAKRILEDLPDECHLPHLVAAVGLFSICGLI 172

Query: 187 NHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
           NHSC PN      G   + VRA++ + +G E+ +SYI+       RQK L++   F C C
Sbjct: 173 NHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKELRDTKHFDCQC 232

Query: 246 PRC-IKLGQFDDIQESAILEGYRC-----KDDGCSGFLLRDSDDKGFTCQQCGL 293
            RC   L +  D     +L+G+RC     K      +LL   +DK   C  C L
Sbjct: 233 ERCSTPLSESVD----RVLDGFRCPRCSVKASEEENYLLAQVEDK-LVCPDCQL 281


>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
          Length = 1064

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 6/123 (4%)

Query: 159  KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 218
            K+ CN   I +   + +G    P  S  NHSC PN   V    + V +A+  + KG  + 
Sbjct: 910  KIRCNQFGIWSKRDKCIGVSCTPAASYFNHSCCPNLADVRGTTVVVFKALHFISKGTPIS 969

Query: 219  ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFL 277
            ISY++    T  RQ  LK  Y FTC C RC      D   ES   +  + C    CSG  
Sbjct: 970  ISYLDLDQPTHERQSYLKTFYYFTCQCLRCK-----DQTDESDNWISRFYCDRFKCSGTY 1024

Query: 278  LRD 280
              D
Sbjct: 1025 FLD 1027


>gi|332025390|gb|EGI65557.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 401

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 7/124 (5%)

Query: 173 RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 228
           RP  LG  +YP ++  NH C P     F GR  V+RA++ +  G  +  +Y  I T  + 
Sbjct: 200 RPVYLGVAIYPSVARFNHDCYPAVTRYFIGRHIVIRAIRGLGPGDVIAENYGPIFTKRTL 259

Query: 229 MTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 287
             RQ+ L  +Y F CTC  C +    F+++   ++    RC   GC G  LR    K   
Sbjct: 260 AERQRTLTGRYWFQCTCKACQEDWPCFENLTNDSV--KLRCPTVGCGGLHLRSRQGKPIK 317

Query: 288 CQQC 291
           C  C
Sbjct: 318 CPDC 321


>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 490

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGS 227
           S+L   G+ L+P+++  NH C PNAV+ +        +R+++ +P   EV ISYI+   S
Sbjct: 406 SDLCISGSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVS 465

Query: 228 TMTRQKALKEQYLFTCTCPRCI 249
              R+K L+ +YLF C C RC+
Sbjct: 466 CSQRRKELQHKYLFNCKCTRCL 487


>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
          Length = 516

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 121/277 (43%), Gaps = 52/277 (18%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF----------ASSNLKKCSACQVVW-YCGSNCQ 56
           GEVI+ ++P V  P     + +C  C            + +  +CS+C   W  CG +C+
Sbjct: 46  GEVILKEKPAVIGPRMYCKV-QCLSCGRQLQPIPNDDGNMDFIRCSSCN--WPVCGIDCE 102

Query: 57  KLDWKLHRLECQVLSR-----------LDK-EKRKSVTPSIRLMLKLYLRRKLQNDNVIP 104
           K +  +HR EC+++ +            DK E    V   +R++L               
Sbjct: 103 KAE--VHREECKIMVQSKYKCDIKYECPDKAEAAYCVIAPLRVLL--------------- 145

Query: 105 STTTDNYSLVEALVAHMSDIDEKQLLLYAQI--ANLVNLILQWPEISINE--IAENFSKL 160
                N  + E+++   S +D++      Q+   NL+  I Q   ++ ++  I E  S  
Sbjct: 146 -MKDSNPRMFESIMNLESHVDQRINTPIYQVLRPNLIMFIRQVLGMNFDDKIILEISSIF 204

Query: 161 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR---LAVVRAVQHVPKGAEV 217
             N+  + +++       +Y + S++NHSC PN   ++ G    LA++  V H+ KG E+
Sbjct: 205 DNNSFDVRSADKTKRLRAIYLLASMMNHSCRPNTRHIYLGEDKTLALIATV-HIAKGEEI 263

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 254
             +Y +    T+ R+K LK    F C C RC    +F
Sbjct: 264 TATYTQPLWGTLDRRKFLKTNKYFDCKCERCADPTEF 300


>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
 gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
          Length = 402

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 132 YAQIANLVNLILQ---WPEISINEIAENFSKLACNAHTI------CNSELRPLGTGLYPV 182
           Y+ I   + LI      P ISI  + +   +   NA  I       + E    G G+YP 
Sbjct: 271 YSNIYKFIRLIAPEEYLPFISIQAVRDIIGRNLTNAFGIWSPVTSSDEEREYFGFGVYPS 330

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLF 241
            S  NHSC PN   +  G      A + +P GAE+ ISY I  +     RQ AL+E + F
Sbjct: 331 ASFFNHSCKPNVTKIRRGASYEYIAKEDIPVGAELCISYGIRESDGLKQRQDALRE-WFF 389

Query: 242 TCTCPRCIKLGQ 253
            C C RC   G+
Sbjct: 390 ECGCSRCTTEGK 401


>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
 gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
          Length = 515

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 124/302 (41%), Gaps = 19/302 (6%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSAC-QVVW-YCGSNCQKLDWKLHRLE 66
           E+++ + P V  P   S+   C GC      +    C Q  W  CG  C+ LD   H+ E
Sbjct: 65  EIVLKEAPLVRGPAQISA-PVCLGCLNGIEAEDHIECEQCGWPLCGPECKSLD--EHKAE 121

Query: 67  CQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           C++   R  K   +       L   L   R L      P   +  +  +E+L +     +
Sbjct: 122 CRLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKAS-KFQELESLESTRRGSN 180

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +  L + I   +    +  + +  EI +    L  N H +  ++  P    ++   S 
Sbjct: 181 QWKADLVS-IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASF 237

Query: 186 INHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
             +SCLPN    F +    ++ A   + K A + I Y +    T  RQ+ L +  LF C 
Sbjct: 238 TENSCLPNLAKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCA 297

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIK 301
           C RC+ + +  D   SAI    +C+D  C G +L    D  +  + C++C     K  ++
Sbjct: 298 CDRCVDVTEL-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVE 352

Query: 302 KI 303
           +I
Sbjct: 353 RI 354


>gi|443920735|gb|ELU40589.1| SET domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 380

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 95/223 (42%), Gaps = 25/223 (11%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKKCSACQVVWYCGSNCQKLDWKLHRL 65
           +G +++ + P     + +    RCDGC     +L+KCS C V WYCG +     WK H  
Sbjct: 31  AGNIVLVETPLAVAIHPTYKGRRCDGCLREFDSLQKCSGCGVYWYCGVSA---SWKRHHR 87

Query: 66  ECQVLSR------LDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA 119
                S+        K+  +     I L+  L      + + +  + TT N +     V 
Sbjct: 88  RLCGFSKAYSSTSAYKDATEDTQTDICLLAHLGAEHFYKFNTLEEAQTTSNQT-----VQ 142

Query: 120 HMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGL 179
              D     LL  AQ     NLI      S   ++   ++   N   I +++L P   G+
Sbjct: 143 IFWD-----LLASAQPHAGQNLISPLDFTSSGVLSSAAARFGNNNFVIHDAQLVPYAHGV 197

Query: 180 YPVIS-IINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEV 217
           + + S   NHSC PNAV +FE    G   V++ V++V  G EV
Sbjct: 198 FALASRSFNHSCRPNAVAMFEESEKGPQMVIKLVENVAAGEEV 240


>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
 gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
          Length = 433

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 229
           G GLY   S INHSC PNA++ F G   RL+VV A+  + +G E+ ISY++    +    
Sbjct: 324 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 382

Query: 230 TRQKALKEQYLFTCTCPRC 248
           TR+K L   YLF C CP+C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401


>gi|328849245|gb|EGF98429.1| hypothetical protein MELLADRAFT_69335 [Melampsora larici-populina
           98AG31]
          Length = 324

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 13/81 (16%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCF-------------ASSNLKKCSACQVVWYCGSN 54
           G  ++S  P+V V + SS  +RC  CF              S  +++C+ C+VV YCG N
Sbjct: 80  GTTVLSLSPHVAVLDTSSLSNRCSSCFLEDEDFEALPDPSVSRQIRRCTKCRVVSYCGEN 139

Query: 55  CQKLDWKLHRLECQVLSRLDK 75
           CQ+LDW  H+ ECQ L+   K
Sbjct: 140 CQRLDWVSHKAECQALTNYSK 160


>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
 gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
 gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
          Length = 421

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA   F     ++   A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDI 257
           RQK L+E YLF C+CP+C+      DI
Sbjct: 363 RQKILRENYLFMCSCPKCLAQADEPDI 389


>gi|71654175|ref|XP_815712.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70880787|gb|EAN93861.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 584

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 81/374 (21%), Positives = 148/374 (39%), Gaps = 62/374 (16%)

Query: 31  DGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHR-LECQVLSRLDK-----EKRKSVTPS 84
           DG   ++ L+ C  CQ V YC   C     + H   EC++L RL +        ++  P 
Sbjct: 200 DGEEPTAALRTCRRCQFVSYCSDRCASCHGRQHEEYECRLLFRLREMLGSMRSCEAAVPD 259

Query: 85  IRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ 144
               +  +    +    +      + +  V  L +H  ++ +        +  LV  +  
Sbjct: 260 DFFTMATHCITTMSGVKM----RKEGHEAVLRLESHEVEVSQG----LTPLVRLVQDLFS 311

Query: 145 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVLVFEGRL 202
             + +   +      + CNA  +C++   P+G  L+   V S  NHSCLPN   +  G +
Sbjct: 312 GEDPTF--VTRILGVVRCNALAVCDASGLPVGQALHAASVTSYFNHSCLPNCA-IEAGAI 368

Query: 203 AVVRAVQHVPKGAEVLISYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ--- 258
              RA++    G E+ ISY+ +       R++ L E+Y F C C RC      DD     
Sbjct: 369 VTTRAIR---PGEELTISYLPQLYWPAWLRREELAERYFFDCRCVRC------DDGDRSP 419

Query: 259 -ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTL 315
            ESA+    R           R+ + +  +  Q  CG VR+K+       +V++  +  L
Sbjct: 420 FESALSATLRLGGS-------REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDAL 467

Query: 316 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 375
                           ++++ ++ L+ PF     +    L  +   L D +  L  C   
Sbjct: 468 --------------LHLLQECRQHLF-PFHYLCHELHNTLSFVYAVLGDTRACLCSCLRE 512

Query: 376 IPVYQRVYPQFHPL 389
           + +++ + P  HP+
Sbjct: 513 LVMWEAIVPGAHPV 526


>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
 gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
          Length = 392

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 210
           I E ++K+   A    N+E    G+GLY + S INHSC+PN+   F     + V++A+  
Sbjct: 267 IDEIYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNSCSTFPYSNDIVVLKALTP 322

Query: 211 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
           + +G E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 323 IQEGDEICISYLDECQLERSRHSRHKVLRENYIFVCQCPKC 363


>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
          Length = 431

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 61/111 (54%), Gaps = 10/111 (9%)

Query: 148 ISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFE-----GR 201
           +S++++ +  + +  N+  + +     + T G Y  ++  NHSC PNA    +      R
Sbjct: 143 LSLDDLRQLLAVIHLNSFALSSERFPGMTTYGFYLRMAFCNHSCRPNACQYIDPNSTRAR 202

Query: 202 LA----VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           L     V+RAV  + +G EV ISYIE   +T  R++AL+E Y FTC CPRC
Sbjct: 203 LNSPSIVLRAVSDIAEGEEVCISYIELMDTTPERREALQELYYFTCQCPRC 253


>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
           206040]
          Length = 742

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%)

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           TG++   S  NHSCLPNA   F G + +VRA++ +P G E+L+ Y+        RQ+ ++
Sbjct: 501 TGIWLQASYANHSCLPNATRAFIGDMMIVRALREIPVGGEILMQYVPQDKPFDKRQEVVE 560

Query: 237 EQYLFTCTCPRC 248
             Y F C C  C
Sbjct: 561 NHYGFKCDCDLC 572


>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
 gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
          Length = 633

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 67/131 (51%), Gaps = 12/131 (9%)

Query: 133 AQIANLVNLILQW----PEISINEIAENFSKLAC-NAH---TICNSE---LRPLGTGLYP 181
           +QI  +++L   W     E+ I E A     + C NA     +   E   +R   TGL+ 
Sbjct: 430 SQIEKVMDLFGDWRSIGKEVLIGEDAFQVHDVVCRNAFGPGAVIAGEAENIRKASTGLWL 489

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
           + + INHSCLPNA   F G L VVRA +++  G+E+  SY +++     RQ+AL + + F
Sbjct: 490 MAAYINHSCLPNAEKSFLGDLMVVRATRNITAGSEITHSY-DSSSDYDARQEALMKTWGF 548

Query: 242 TCTCPRCIKLG 252
            C C  C   G
Sbjct: 549 RCRCELCKADG 559


>gi|414870819|tpg|DAA49376.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
          Length = 489

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 129/308 (41%), Gaps = 56/308 (18%)

Query: 170 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
           S++R +   LY   S+ NHSC PN    F  R  V+R  + +  G+ V +SY    G   
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229

Query: 230 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD---- 271
              RQK+L+E Y F+C C  C +L   D +  S          AI E   YR K++    
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSDLVMNSFCCPQSNCLGAISESTYYRSKENFVNV 289

Query: 272 -----------------------GCSGFLLRDSD-----DKG--FTCQQC-GLVRSKEEI 300
                                    +  LL++S      D G   +C+ C  L  +    
Sbjct: 290 SLGGSYVCKLSLPDVSKVDMDMENVAKSLLQNSGVSLNIDHGCCMSCRSCIDLSSALATS 349

Query: 301 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 360
            +  S ++ L K T       +   +    + +++L KKL HP+S  L Q  + + +   
Sbjct: 350 HREESTIDRLKKHTFL-----DKTLITEALQSLKQL-KKLRHPYSKALAQAEDTIAEAFA 403

Query: 361 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 420
           ++ D + A  +C+ +I + +++Y   H ++  +      +   LGD  +A  +  +A  I
Sbjct: 404 KVGDQELARKHCEASIQILEKLYHPKHIIIAHELIKLVSILLSLGDGASAAATFAQAEAI 463

Query: 421 LRITHGTN 428
             + +G++
Sbjct: 464 FSLYYGSH 471


>gi|307187924|gb|EFN72837.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 651

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 8/124 (6%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           +G G+YP ++++NHSC P  +  F G   VVRA + +  G E+  +Y  I T      R+
Sbjct: 454 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMVVRAARTIRAGEEISENYGPIFTTTPENERK 513

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDD-GCSGFLLRDSDDKGFT--C 288
           + L+ QY F C C  C   G +  ++E    +  ++C+    C   LL  SD   F   C
Sbjct: 514 RRLRVQYWFDCNCEACS--GHWPLLEELDPTILRFKCETGLSCGNVLLVRSDTNEFMIGC 571

Query: 289 QQCG 292
            +CG
Sbjct: 572 AKCG 575


>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
           TFB-10046 SS5]
          Length = 451

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 3/75 (4%)

Query: 177 TGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETA-GSTMTRQK 233
           +G++P  S ++HSC PNA+  ++ G + + +RA +H+ +GAEV ISY+      T  RQ+
Sbjct: 244 SGVFPTFSRVSHSCCPNAIPRWDSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQE 303

Query: 234 ALKEQYLFTCTCPRC 248
            L++ Y F CTCP C
Sbjct: 304 FLRDGYHFECTCPAC 318


>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/264 (27%), Positives = 102/264 (38%), Gaps = 27/264 (10%)

Query: 8   GEVIISQEP---YVCVPNNSSSISRCDGCF----ASSNLKKCSACQVVWYCGSNC--QKL 58
           G++I++  P   Y   P +SSS + C  CF      S L  CS+C  + +C  +C    L
Sbjct: 35  GQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTCSTLVSCSSCPCL-FCSPDCLTXAL 93

Query: 59  DWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
                   C  LS L      S++ S R +   +L        V PS      SL + + 
Sbjct: 94  SSSHSPWACLTLSLLRASPSLSLSHSERQVQARFLVAAYNLAIVSPSHFHILLSL-QGMA 152

Query: 119 AHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLGT 177
              SD D    L     +      +     S+       +K   NA  +     L P G 
Sbjct: 153 LPSSDSDAPTFLHSLLSSLSPPQGVAG--FSVELTTALLAKDKLNAFGLMEPPALAPGGE 210

Query: 178 ------GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
                 G+YP  S  NH CLPNA                 +R +  VP+G+E+ +SY   
Sbjct: 211 RSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPV 270

Query: 225 AGSTMTRQKALKEQYLFTCTCPRC 248
             +   RQK L E Y FTC C RC
Sbjct: 271 NETYADRQKRLLEDYGFTCYCDRC 294


>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
 gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
 gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
          Length = 393

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 9/97 (9%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 214
           ++K+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQG 326

Query: 215 AEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|330796724|ref|XP_003286415.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
 gi|325083610|gb|EGC37058.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
          Length = 499

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 6/79 (7%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAV------VRAVQHVPKGAEVLISYIETAGSTM 229
           G GL+P+ S +NHSCLPN  +  E +  V      V+A + +P G+E+L SY +    T 
Sbjct: 419 GVGLFPIFSCMNHSCLPNIEISNEIQDGVDRVKMVVKAKKLIPAGSEILHSYCDENLPTK 478

Query: 230 TRQKALKEQYLFTCTCPRC 248
            R++ L  QY F C CP+C
Sbjct: 479 ERKQLLLSQYGFKCLCPKC 497


>gi|452988674|gb|EME88429.1| hypothetical protein MYCFIDRAFT_43814 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 334

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 8/100 (8%)

Query: 156 NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA--VVRAV 208
           NF         I  S   PLG      G++P++S +NHSCLPNA   +  RL   +V  +
Sbjct: 89  NFPGSPTPFSNIVRSNGYPLGPSSSIGGIFPLVSRLNHSCLPNAQHAYNERLGKMLVHII 148

Query: 209 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           + +    E+ +SYI   G +  RQ  LK  +LFTCTC  C
Sbjct: 149 RPILPNEEITLSYI-PGGPSPQRQTELKSNFLFTCTCTLC 187


>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
 gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
          Length = 427

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 40/216 (18%)

Query: 42  CSACQVVWYCGSNCQKLDWK-LHRLECQ-VLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN 99
           CS C+        CQ+  WK  H+LEC+ VLS     +  ++ P+   M +L L   L+ 
Sbjct: 11  CSGCKA-------CQRNAWKTFHKLECKAVLS-----EAPAMGPTTLAMYRLLL---LKK 55

Query: 100 DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ----W-----PEISI 150
           D      T   +  +  L +H  + +E          +L N IL     W      ++  
Sbjct: 56  DGRF---TERQWKAIMQLESHFEEQNE----------DLRNDILAKVHGWGRRMGTDVDF 102

Query: 151 NEIAENFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 209
             + E    +  N+  +  +E     GT L  V+S+INH C PNA + FEG    VR+++
Sbjct: 103 MTVWELLCIIFTNSMNVRPAEADVSFGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLK 162

Query: 210 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 245
            +  G E+ +SY +     + RQ+ L++   F C C
Sbjct: 163 AIAAGGEITVSYCDPRLDVLLRQEILRQTQFFHCEC 198


>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
 gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
          Length = 515

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 119/302 (39%), Gaps = 19/302 (6%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWY--CGSNCQKLDWKLHRLE 66
           E+I+ + P V  P   +    C GC  S   +    C +  +  CG  C+ L    HR E
Sbjct: 65  EIILKEAPLVRGPAQITP-PVCMGCLNSIEPEDHIDCDLCGWPLCGPECKSLG--EHRAE 121

Query: 67  CQVLS-RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           CQ+   R  K        S  L   +   R L      P   +  +  +E+L +     +
Sbjct: 122 CQLTQERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKAS-KFQELESLESTRRGSN 180

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +  L A I   +    +  + S  E+      L  N H +  S+  P    ++   S 
Sbjct: 181 QWKADL-ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASF 237

Query: 186 INHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
             HSC+PN    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C 
Sbjct: 238 TEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCA 297

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIK 301
           C RC  + +      SAI    +C+D  C G +L    D     + C++C     K  I 
Sbjct: 298 CDRCTDVTEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYID 352

Query: 302 KI 303
           +I
Sbjct: 353 QI 354


>gi|350424957|ref|XP_003493967.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 631

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 116/290 (40%), Gaps = 50/290 (17%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASS--NLKKCSACQVVWYCGSNCQKLDWK-LHR 64
           GEVI  ++PY  +    +  + C  C   S  N+  C  C    YC   C+ ++WK  H 
Sbjct: 248 GEVIAVEKPYSLILTPDNIHTHCSNCLEVSWANIP-CEHCTYAMYCSEECKAMEWKKYHD 306

Query: 65  LECQVL-SRLDKEKRKSVTPSIRLMLKLY--------LRRKLQN-DNVIPSTT------- 107
           +EC+V  S L     K    S+RL ++          LR++L+  D+ + S T       
Sbjct: 307 VECRVFPSMLKMNFVKLDLFSLRLAIQAVREATNIQELRKELKEVDSCVASRTKGFSKDG 366

Query: 108 ---TDNYSLVEALVAHMSDIDEKQLLLYAQIANLV-NLILQWPEISINEIAENFSKLACN 163
              +D Y  +  LV +      + L   +  A+ +   +     I  N ++++ S L  N
Sbjct: 367 TFPSDKYRSLLGLVTNTEKRSVQDLFRRSLDASFILYFVATCTNIFGNPLSKDLSVLIKN 426

Query: 164 A--------------------HTI---CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
           A                    H+    C  E    G    P  S+INHSC PN +     
Sbjct: 427 ADVTFVGSLILRHQQMIPSNIHSFSEECGLEAVERGAAAMPFSSLINHSCNPNILRHSRS 486

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRC 248
           +  ++ A+  + +G ++   Y +    T    RQK L +QY F C C  C
Sbjct: 487 KYVIIYAIYPIEEGEQLYDIYTQHYAITPKAVRQKKLLKQYYFKCNCLPC 536


>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
 gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 393

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 76/326 (23%), Positives = 132/326 (40%), Gaps = 57/326 (17%)

Query: 2   FSFHVSGEVIISQEPYVCVPN----NSSSISRCDGCFASSNLKKCSACQVVWYCGSNC-- 55
           FSFH   + +++Q  + C  N    N++  ++       + + +C+ C+++++C   C  
Sbjct: 52  FSFH---KHLVNQFCFNCFSNSHEINNAKFNKFKVDINKNYIIRCNNCKLIYFCSDECFE 108

Query: 56  ---------QKLDWKLHR-LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN--DNVI 103
                          +H  LEC +LS    +   +++P I  +      R + N    + 
Sbjct: 109 KVMSIESFQDSTSTNIHTPLECLILSNYHDQ---TISPKINTLHDQTENRMIINYLSKIA 165

Query: 104 PSTTTDN-YSL----VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE------ 152
            ST  +N + L    +  L+   ++ +  Q L   +I N+ N      ++  N       
Sbjct: 166 YSTNNNNKFKLLLIEMNQLIGDFNNDNNNQTLSLNEIKNIKNKSFNLRKLFNNFFFNIDK 225

Query: 153 --IAENFSKLACNAHTICNSELRPLGTGLYP---------------VISIINHSCLPNAV 195
             I E ++K   N+  +  +     G  +Y                 IS  NHSC PN V
Sbjct: 226 VIIEELYAKSQRNSFGLWKNSDECFGLSMYGNQTIYNNNNDKDDNISISYFNHSCFPNCV 285

Query: 196 LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI--KLGQ 253
            V E +   + ++  + KG E+ ISYI+   S   R   LKE Y F C C RC    L  
Sbjct: 286 RVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFECKCKRCTLPPLSN 345

Query: 254 FDDIQESAILEGYRCKDDG--CSGFL 277
              ++    +E Y CK+    C+G L
Sbjct: 346 L-SLEIEKTIENYTCKNQSIKCTGIL 370


>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
 gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
           commune H4-8]
          Length = 360

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 177 TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
            G++   S INHSC PNAV  F+      VVRA++H+ KG E+ ISY ET     TR++ 
Sbjct: 174 VGVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAATRRQN 233

Query: 235 LKEQYLFTCTCPRC 248
             + Y F C CP C
Sbjct: 234 SLQDYGFRCACPAC 247


>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
          Length = 584

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 84/391 (21%), Positives = 153/391 (39%), Gaps = 60/391 (15%)

Query: 18  VCVPNNSSSISRCDGCFA----------SSNLKKCSACQVVWYCGSNCQKLDWKLHR-LE 66
           VC    + S+  C  C            ++ L+ C  CQ V YC   C     + H   E
Sbjct: 177 VCSYEETGSVGLCAHCGVVVMPDDVEEPTAALRTCRRCQFVSYCSDRCASCHGRQHEEYE 236

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTT--DNYSLVEALVAHMSDI 124
           C++L RL +E   S+      +   +          +       + +  V  L +H  ++
Sbjct: 237 CRLLFRL-REMLGSMRSCEAAVPDDFFTMATHCITTVSGVKMRKEGHEAVLRLESHEVEV 295

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP--V 182
            +        +  LV  +    + +   +      + CNA  +C++   P+G  L+   V
Sbjct: 296 SQG----LTPLVRLVQDLFSGEDPTF--VTRILGVVRCNALAVCDASGLPVGQALHAASV 349

Query: 183 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI-ETAGSTMTRQKALKEQYLF 241
            S  NHSCLPN   +  G +   RA++    G E+ ISY+ +       R++ L E+Y F
Sbjct: 350 TSYFNHSCLPNCA-IEAGAIVTTRAIR---PGEELTISYLPQLYWPAWLRREELAERYFF 405

Query: 242 TCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ--CGLVRSKE 298
            C   RC++ G  D    ESA+    R           R+ + +  +  Q  CG VR+K+
Sbjct: 406 DC---RCVRCGDGDRSPFESALSATLRLGGS-------REKEREYISSVQILCGRVRAKD 455

Query: 299 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKI 358
                  +V++  +  L                ++++ ++ L+ PF     +    L  +
Sbjct: 456 -----VGDVDVGDRDAL--------------LHLLQECRQHLF-PFHYLCHELHNTLSFV 495

Query: 359 LMELEDWKEALAYCQLTIPVYQRVYPQFHPL 389
              L D +  L  C   + +++ + P  HP+
Sbjct: 496 YAVLGDTRACLCGCLRELVMWEAIVPGAHPV 526


>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
 gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
          Length = 343

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GLY + S INHSC PNA + F     A +R ++ +  G E+ ISY +T+   + R+  L 
Sbjct: 260 GLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLF 319

Query: 237 EQYLFTCTCPRCI 249
           E Y F C CP+C+
Sbjct: 320 ENYGFNCECPKCL 332


>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 413

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 60/106 (56%), Gaps = 8/106 (7%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIET---AGSTM 229
           G GLYP+ S+  HSC PNA   F   +  L++V A++ +  G EV +SYI+    + S  
Sbjct: 309 GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLV-ALRDIRPGEEVTVSYIDECSLSRSRH 367

Query: 230 TRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCS 274
           +R K L+E +LFTCTCPRC +   Q  D+     +E     ++ CS
Sbjct: 368 SRIKMLRESHLFTCTCPRCEEEATQQPDVTSDEEMEDSSDAEEACS 413


>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
 gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
          Length = 515

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 119/302 (39%), Gaps = 19/302 (6%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWY--CGSNCQKLDWKLHRLE 66
           E+I+ + P V  P   +    C GC  S   +    C +  +  CG  C+ L    HR E
Sbjct: 65  EIILKEAPLVRGPAQITP-PVCMGCLNSIEPEDHIDCDLCGWPLCGPECKSLG--EHRAE 121

Query: 67  CQVLS-RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID 125
           CQ+   R  K        S  L   +   R L      P   +  +  +E+L +     +
Sbjct: 122 CQLTQERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKAS-KFQELESLESTRRGSN 180

Query: 126 EKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           + +  L A I   +    +  + S  E+      L  N H +  S+  P    ++   S 
Sbjct: 181 QWKADL-ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASF 237

Query: 186 INHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 244
             HSC+PN    F +    ++ A + + K + + I Y +    T  RQ+ L +  LF C 
Sbjct: 238 TEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCA 297

Query: 245 CPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIK 301
           C RC  + +      SAI    +C+D  C G +L    D     + C++C     K  I 
Sbjct: 298 CDRCTDVTEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYID 352

Query: 302 KI 303
           +I
Sbjct: 353 QI 354


>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
 gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
 gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
 gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
 gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
          Length = 513

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 25/305 (8%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSAC-QVVW-YCGSNCQKLDWKLHRLE 66
           E+++ + P V  P   S+   C GC      +    C Q  W  CG  C+ LD   H+ E
Sbjct: 65  EIVLKEAPLVRGPAQISA-PVCLGCLNGIEAEDHIECEQCGWPLCGPECKSLD--EHKAE 121

Query: 67  CQVL-SRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDN---YSLVEALVAHMS 122
           C +   R  K   +       L   L   R L    +I  T+T+    +  +E+L +   
Sbjct: 122 CGLTKDRGQKVNVQEFGGPHPLYTCLSTVRCL----LIGETSTEKASKFQDLESLESTRR 177

Query: 123 DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPV 182
             ++ +  L + I   +    +  + +  EI +    L  N H +  ++  P    ++  
Sbjct: 178 GSNQWKADLVS-IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYT 234

Query: 183 ISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 241
            S   +SCLPN    F +    ++ A + + K A + I Y +    T  RQ+ L +  LF
Sbjct: 235 ASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLF 294

Query: 242 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKE 298
            C C RC+ + +  D   SAI    +C+D  C G +L    D+    + C++C     K 
Sbjct: 295 KCACERCVDVTEL-DTNYSAI----KCEDRQCGGLMLPTKADEWNGSWRCRECHKQVQKH 349

Query: 299 EIKKI 303
            +++I
Sbjct: 350 YVERI 354


>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
          Length = 554

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 100/231 (43%), Gaps = 18/231 (7%)

Query: 30  CDGCFASSNLKK-CSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLM 88
           C  C A    K  CS C + +YC   C   D  LH+  C   S++      +      L 
Sbjct: 104 CHFCLADLTQKVICSNCSLSYYCSETCSSKDHTLHQPVCNAFSQVQAIAHATDVDVDLLR 163

Query: 89  LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVN--LILQWP 146
           L L L  +   D+         +  VE L++H+    E     + Q+    +  L+ + P
Sbjct: 164 LMLLLLARRHMDSQSNQEDVTPFWCVEDLLSHI----ESAPAAFKQVVTAASERLLSELP 219

Query: 147 EISINEIAENFSKLAC----NAHTICNSELR--PLGTGLYPVISI-INHSCLPNAVLVF- 198
           E S+    E+   LAC    NAH + ++  R   +  GL+P+ ++  NH C PN   V  
Sbjct: 220 E-SVQIPVEDMVSLACRINSNAHGLGDNHSRNTDVALGLFPLGALFFNHGCNPNTAFVGL 278

Query: 199 -EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
             G+LA  R ++ V K  E+++SYI+       R++ L     F C C RC
Sbjct: 279 PNGQLA-FRTIRSVQKDEELVVSYIDIYSDRDERRQELLTTKHFWCKCKRC 328


>gi|149238794|ref|XP_001525273.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450766|gb|EDK45022.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 619

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 155 ENFSKLA-CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP- 212
           ENF  L   N +T+ + + R +G  L P  S+INHSC+PN+  V E      R +  +P 
Sbjct: 213 ENFVNLIFINCNTLVDYKFRIVGYNLDPQFSMINHSCMPNSCQV-ETEYGCYRLINTLPI 271

Query: 213 -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
             G E+ ++YI +   T  RQ  L  +Y F C CP C
Sbjct: 272 NAGDEITVNYIASGIPTELRQVQLFSRYFFRCKCPLC 308


>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
          Length = 486

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 76/338 (22%), Positives = 129/338 (38%), Gaps = 74/338 (21%)

Query: 40  KKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQN 99
           K+CS C +V+YC    Q+  WK H+  C        E+             L   R+++ 
Sbjct: 23  KRCSRCALVYYCSVEHQQQHWKKHKATCNPFKVFSNEQFGRF---------LVATREIRA 73

Query: 100 DNVI---------PSTTTDNYSLVEALVAH-------------MSDIDEKQLLLYAQ--- 134
             V+         P+  T   ++    V H             +++ D  +  L  Q   
Sbjct: 74  GEVVIKEPPLVHGPAQITGPIAIQHFYVPHPMYQCLMPLRCLLLAESDPAKWELLQQLES 133

Query: 135 -----------------IANLVNLIL----QWPEISINEIAENFSKLACNAHTICNSELR 173
                            +A L+        +W E   +EI      +  N H I  S   
Sbjct: 134 HEEERRGSELWRNDREGVAKLIPRFFKTGDRWDE---DEILRIVGIIQVNGHEIPLS--V 188

Query: 174 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 232
           P    +Y   S++ HSC PN    F  R  +V  A   + +G  + I Y +   +T  R 
Sbjct: 189 PSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRL 248

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD---KGFTC 288
           + L++  +F C C RC    +++    +    G++ KD  C G++L  DSD+     ++C
Sbjct: 249 EHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQ-KDSKCKGYILPVDSDNWTTGDWSC 307

Query: 289 QQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 325
           Q+C G+V         A+   IL +  + L +   H+E
Sbjct: 308 QKCRGVVTG-------AAVGQILERARMDLEAMQKHRE 338


>gi|159472132|ref|XP_001694205.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276868|gb|EDP02638.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 468

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 235
           GL+   S++NHSC PNAV    G    V A  HV +G+E+LI+Y+  A       RQ  L
Sbjct: 139 GLWHSFSMLNHSCCPNAVNYVVGDSMCVFAAGHVEQGSELLINYLGAASLRPVEERQAGL 198

Query: 236 KEQYLFTCTCPRC 248
            E Y F+C CPRC
Sbjct: 199 AEAYAFSCGCPRC 211


>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
 gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
          Length = 514

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/304 (23%), Positives = 128/304 (42%), Gaps = 23/304 (7%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSAC-QVVW-YCGSNCQKLDWKLHRLE 66
           E+++ + P V  P   S+   C GC           C Q  W  CG  C+ LD   H+ E
Sbjct: 65  EIVLKEAPLVRGPAQISA-PVCLGCLNGIEPDDHIECEQCGWPLCGPECKSLD--EHKAE 121

Query: 67  CQVLSRLDKEKRKSVTP---SIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSD 123
           C++    D+++R +V        L   L   R L      P   +  +  +E+L +    
Sbjct: 122 CRLTK--DRDQRVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKAS-KFQDLESLESTRRG 178

Query: 124 IDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVI 183
            ++ +  L + I   +    +  + +  +I +    L  N H +  ++  P    ++   
Sbjct: 179 SNQWKADLVS-IGQFIPKFFKTQKFTEEQIMKAVGALQINGHEVPTTD--PSHVAVFYTA 235

Query: 184 SIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 242
           S   +SCLPN    F +    ++ A + + K A + I Y +    T  RQ+ L +  LF 
Sbjct: 236 SFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFK 295

Query: 243 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEE 299
           C C RC+ + +  D   SAI    +C+D  C G +L    D  +  + C++C     K  
Sbjct: 296 CACERCVDVTEL-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHY 350

Query: 300 IKKI 303
           +++I
Sbjct: 351 VERI 354


>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
 gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
          Length = 393

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 9/97 (9%)

Query: 157 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 214
           ++K+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  + +G
Sbjct: 271 YAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQG 326

Query: 215 AEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 248
            E+ ISY++      S  +R K L+E Y+F C CP+C
Sbjct: 327 DEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363


>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 92/199 (46%), Gaps = 27/199 (13%)

Query: 67  CQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE 126
           CQ+L   +   ++SV  + RL+  +  +   ++   +P T   + +++ A  +    I+ 
Sbjct: 144 CQLLEDEELHPKESVDIARRLLGAVVAQMSSED---VPQT---DMNVLLAASSQNDTINP 197

Query: 127 KQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISI 185
           +  L    I N +      P+  S N IA ++S    N H       R LG  LYP+ ++
Sbjct: 198 QSQLDNTTILNTIRRNAFGPDFNSYNTIATSWS----NKHANEAPYTRLLG--LYPLAAM 251

Query: 186 INHSCLPNAVLVF-------------EGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           INHSC PNAV VF             +GR + +V A   + KG E+  SY+        R
Sbjct: 252 INHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWSYLPPCSPFNER 311

Query: 232 QKALKEQYLFTCTCPRCIK 250
           +  L  ++ FTC C RCIK
Sbjct: 312 RAVLSSKFGFTCQCVRCIK 330


>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
 gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
          Length = 391

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 15/170 (8%)

Query: 88  MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA-HMSDIDEKQLLLYAQIANLVNLILQWP 146
           M+++ L   + +D +    T + +S + ALV  +   I    L LY      V+ I  +P
Sbjct: 207 MIRILLSEAMYDDRLEQWFTPEGFSSLFALVGTNGQGIGTSSLSLY------VHNIDSYP 260

Query: 147 EISINE---IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 201
            +S +E   I    ++L      +    L   G GLY + S  NHSC PNA + F     
Sbjct: 261 ALSDDERQAIDIFLNQLYEEMERVSGQFLNCEGAGLYALQSSCNHSCAPNAEVTFPKNNS 320

Query: 202 LAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
             V++A+  +  G E+ ISY+E      S  +R K L+E Y+F CTC +C
Sbjct: 321 TLVLKALHPIKNGEEICISYLEECQRERSRHSRLKYLRENYIFDCTCTKC 370


>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 662

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 68/293 (23%), Positives = 113/293 (38%), Gaps = 63/293 (21%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSN--LKKCSACQVVWYCGSNCQKLDWKL-HR 64
           G+++  ++P   V  +  + +RC  C   +      C+ C   +YC   C    W L H 
Sbjct: 171 GDILFLEKPVSFVVLSHDTYNRCPHCNCQNADIPVPCTTCLNTFYCNEYCSTKAWSLYHC 230

Query: 65  LECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
            EC      ++ K         L LK+ L      D +        ++ ++ LV +   +
Sbjct: 231 WECPG----NQMKLWEEIGIGHLALKVLLTCSTMTDEI-------KFNEIQNLVTNFDKL 279

Query: 125 DEKQLLLYAQIANLVNLIL------------------QWPEISINE-------------- 152
              +L +Y   A ++ + L                  ++P  S+N               
Sbjct: 280 SMDELTVYGVTAVMLTIYLLKYTNFFQSSNLEDCLKKKFPNNSLNTSFNILTTNDKQLYV 339

Query: 153 ---IAENFSKLACNAHTIC--NSELRP----------LGTGLYPVISIINHSCLPNAVLV 197
              +     +L  N H I   N+ LR           + TG+YP  S++NHSC PN + +
Sbjct: 340 SSLLLRYILQLIVNGHAITKSNTLLRKNDSSMKQQDIIATGIYPSASMMNHSCDPNIINI 399

Query: 198 FEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRC 248
           F G+  +VRA + + +  E+   Y       +T  RQK LK  Y FTC C  C
Sbjct: 400 FVGQYLIVRASRDIGQSEEIFNCYGPHYRHMTTEDRQKILKSHYCFTCKCKAC 452


>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
          Length = 636

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
            G GLYP  ++ NHSC PN        G L VV AV+ VP G+++ ISY++       RQ
Sbjct: 228 FGVGLYPAGAMFNHSCRPNCSWRTNRSGELCVV-AVEDVPAGSQLFISYVDILQPWPVRQ 286

Query: 233 KALKEQYLFTCTCPRCIK 250
             L+  + F C CPRC +
Sbjct: 287 DLLRCHFFFECACPRCCR 304


>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 654

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           LG  +Y +    NHSC PNA   F   G L V+RA   + +G E+ I Y    G+T  R+
Sbjct: 439 LGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTERR 498

Query: 233 KALKEQYLFTCTCPRCI 249
           + L +QY F C C  CI
Sbjct: 499 QHLYKQYRFNCDCRACI 515


>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 2/101 (1%)

Query: 157 FSKLACNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 214
           F+ +  N   IC+ E     +G+ +Y   +++NHSC PNA  VF     + +A++ +  G
Sbjct: 30  FAAIKNNQFAICDEESGDYDVGSAVYIDHALVNHSCRPNAYPVFNKTNMIFKALRKIEPG 89

Query: 215 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 255
            E+  +Y +T      R++ L + + F C CP C K  + D
Sbjct: 90  EEITHAYTDTISPIQERREYLNDVWRFMCNCPGCTKSNEID 130


>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 363

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GL+ + S  NHSC PN  + F   + AV+ A++ + KG E+ ISY ++      RQ  L 
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338

Query: 237 EQYLFTCTCPRCIK 250
           + Y F C CP+C +
Sbjct: 339 DNYGFNCDCPKCTQ 352


>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
          Length = 410

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/277 (24%), Positives = 122/277 (44%), Gaps = 36/277 (12%)

Query: 9   EVIISQEPYVCVPN-NSSSISR----CDGC-----FASSNLKKCSACQVVWYCGSNCQKL 58
           E++ ++EPYV  P     S++R    C  C     +A+   + C  C + +YC  +C+  
Sbjct: 118 EILFTEEPYVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHCKHCDM-YYCSKSCRAT 176

Query: 59  DW-KLHRLECQVLSRL--------DKEKRKSVTPSIRLMLKLYL--RRKLQNDNV--IPS 105
            W + H+LEC  L+          + E+ ++     R+  +L L  +R+  ND +  + +
Sbjct: 177 AWDQFHQLECTRLNPAITPFMNFCEGERWQAPMAVARIYARLILAHQREELNDVIGRLDA 236

Query: 106 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS---INEIAENF-SKLA 161
             T +    +A       +++    L+ +  +L+    + P        ++ E    KL 
Sbjct: 237 FATVSQEERQAKETEWIFMEQPTRELWNKARDLLRKAYKQPPAKCKITQDLPEALKEKLF 296

Query: 162 CNAHTICNS----ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA----VVRAVQHVPK 213
            +  T  N      +     G+Y V S INH+C PN  + +    +     VRAV+ + +
Sbjct: 297 EDEETFLNFMGKFNINNQNGGMYLVHSHINHNCYPNVSIDYPNPKSQYKIAVRAVRDIKQ 356

Query: 214 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 250
           G ++  +Y+    +  TRQ  L + YLF C C RC K
Sbjct: 357 GEQLFETYVNPRWNKETRQTYLDKSYLFNCKCERCEK 393


>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
 gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
          Length = 775

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 65/311 (20%), Positives = 122/311 (39%), Gaps = 46/311 (14%)

Query: 168 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG- 226
           CN E   +G  +Y   S+ NHSC PN    F  R   +RA   +  G  + +SY    G 
Sbjct: 464 CNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRATVFMAVGCPLELSYGPQVGQ 523

Query: 227 -STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
                R + LK++Y F C C  C  +   D      ++  + C +  C G +L   D   
Sbjct: 524 LDCKDRLQLLKDEYSFNCQCSGCSTVHISD-----LVINAFCCINPNCRGVVL---DRSI 575

Query: 286 FTCQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK----- 335
           F+C+         V  +  ++      + L         CG++ ++ S+   ++K     
Sbjct: 576 FSCENTKTKDFLTVNDQMILEPFMQTDSFLHAGPSHCLKCGSYCDIKSSRLTVDKAGIHF 635

Query: 336 --LQKKL-----------------------YHPFSVNLMQTREKLIKILMELEDWKEALA 370
             LQ+++                        H ++  + +  + L +    L   + A  
Sbjct: 636 TRLQQEINLNRVSETTVSDALGALISLKSTLHEYNRRIAEAEDNLSQAFSLLGKLELAAE 695

Query: 371 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 430
           +C+ +I + +++Y + H  +G +      +   +GD  NA+  +    +I R  +G+N  
Sbjct: 696 HCKASIRILEKLYGENHIAIGNELSKLSSILISVGD-HNAVDCIKRLSKIFRCYYGSNVN 754

Query: 431 FMKELILKLEE 441
            M   +  LEE
Sbjct: 755 TMFPFLNILEE 765


>gi|340052290|emb|CCC46562.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 667

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 59/237 (24%), Positives = 104/237 (43%), Gaps = 23/237 (9%)

Query: 31  DGCFASSNLKKCSACQVVWYCGSNCQKLDWKLH-RLECQVLSRL---------DKEKRKS 80
           D   A+     CS C ++ YC + C K   + H R EC++L RL          +++R+ 
Sbjct: 235 DKAEANGGTLSCSVCGLISYCSAGCAKAYREQHKRYECRLLVRLKEMEKALRKQQQERER 294

Query: 81  VTPSIRLMLKLYLRRKLQNDNVIPSTTT--DNYSLVEALVAHMSDIDEKQLLLYAQIANL 138
            +P    + + +    +     +         Y  V+   +H+ D   +    +A +  L
Sbjct: 295 DSPEWEDLPEEFFAMSMHCVTTLSGIKARRPGYKTVQ---SHLQDHAAEMSQRFAPLVRL 351

Query: 139 VNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVL 196
           V  +L+  E  ++ +      + CNA  +C+S    +G  L+   + S  NHSC PN  +
Sbjct: 352 VEGLLEGEETPVS-VVRLLGVIRCNALEVCDSTGLGIGQALHASTITSYFNHSCAPNCAM 410

Query: 197 VFEGRLAVVRAVQHVPKGAEVLISYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 252
             +G  A+V   + +P G E+ I+YI +       R++ L  +Y F C C RC   G
Sbjct: 411 --DGN-AIVTICEVLP-GEELTIAYIPQLYWPVELRREELAGRYFFHCCCTRCGDGG 463


>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 514

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 92/459 (20%), Positives = 185/459 (40%), Gaps = 63/459 (13%)

Query: 7   SGEVIISQEPYVCVPNNSSSISRCDGCF-----ASSNLKKCSACQVVW-YCGSNCQKLDW 60
           +GE I+++ P+V  P  + +   C  C+     +  +   CS C   W  CG  C+    
Sbjct: 40  AGEEIMTEMPFVVGPK-ACTYPLCLACYTPWPPSPDDKPLCSKCG--WPVCGEECEN-SL 95

Query: 61  KLHRLECQVLSRLDK---------EKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNY 111
           +    ECQV  + ++         E  ++  P +  +  L  R  L+++  +    ++  
Sbjct: 96  QHKDYECQVFVQANEKFNVTAALDENNENGVPQLECITPL--RLLLESEKSVERWNSE-- 151

Query: 112 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
             V  + AH + I  ++    +   N+V  + +   + ++  +E   ++AC    I   E
Sbjct: 152 --VRDMEAH-TKIRCQKSQWKSDHVNIVEYLRK--RLKLDRFSEECIQMACGILEINTFE 206

Query: 172 LRPL----GTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIET 224
           +R         LYP ++++NHSC+ N    +   + R+ + R    +P   E+  SY  +
Sbjct: 207 IRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDYRIRL-RTTLKIPAAGELYASYTHS 265

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSD 282
              T+ R++ L E   F C CPRC      D  + +  +   +C   D+G    L     
Sbjct: 266 LLPTLLRREHLLEGKHFACACPRC-----SDPTELATHMSSLKCNKCDNGIVLPLDSLDA 320

Query: 283 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
           +  + C  C    + + ++K+   +        A+ S  +  + +   + + +  + + H
Sbjct: 321 ESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAI-SGADGADAIHERETVLRKYRSVLH 379

Query: 343 PFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGL 392
           P    L   R  L ++   ++++          +  +  C+L + +   V P +  + G+
Sbjct: 380 PRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHKVDMCRLLLQILDVVEPGYSRIRGM 439

Query: 393 QYYTCGKLEWFLGD--------TENAIKS-MTEAVEILR 422
             Y       FL           E  +KS MTEA  IL+
Sbjct: 440 TLYELHAPLLFLAKGQWNADVIDEAGLKSKMTEAANILK 478


>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
 gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
          Length = 485

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/425 (22%), Positives = 172/425 (40%), Gaps = 60/425 (14%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL----- 62
           GE+I +  P +  P   +SI  C GC  +      S  QV +Y   +C+   W L     
Sbjct: 69  GELIFNDSPLLLGPK-IASIPVCLGCHRNLPPLMNSHLQVDFY---HCRHCSWPLCGPSC 124

Query: 63  -----HRLECQVLSRL-------------DKEKRKSVTPSIRLMLKLYLRRKLQNDNVIP 104
                H+ EC +L+R              D  +++SV  ++  +  L L+R        P
Sbjct: 125 ETATSHKDECPLLARSSKSIGSRLQFCFSDPTRKESVYCAVVPLRALLLKRS------NP 178

Query: 105 STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV----NLILQWPEISINEIAENFSKL 160
               D +  +E+   H+ +   K  L  A  +NLV    +++    E++  E+ E  + L
Sbjct: 179 ERFRDGFLTLES---HL-ETRIKTPLYAALRSNLVPFVRDILGMGGEVTERELLEIAAIL 234

Query: 161 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA-VVRAVQHVPKGAEVLI 219
             N + +   EL      LY + ++++H C PN    F+ RL  VV A   +PK A + I
Sbjct: 235 DTNCYDVRLPELDVKVRALYELGAMMSHHCRPNTKHYFDERLRLVVVATVDIPKDAVISI 294

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 279
           SY +   ST+ R+ A+++   F C C RC    +      S +     C+      F   
Sbjct: 295 SYTQPLLSTIQRRYAIQQSKCFECCCDRCRDPTELGTYLGSIVCPS--CRKAKVVAFDPL 352

Query: 280 D-SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 338
           D   D     ++C     +E          +   + +A+    + ++    + +  +L+ 
Sbjct: 353 DLRSDWQCESRKCSF---RESAVDAVRRNELRQAEIMAVARTVSPRDQYERFLVTHQLES 409

Query: 339 K-----LYHPFSVNLMQTREKLIKIL--MELEDWKEA-----LAYCQLTIPVYQRVYPQF 386
                 L HP++ N++Q +  L ++      E   EA     +  C+  + V  R+ P F
Sbjct: 410 SKEELGLLHPWNSNVLQVKYALTQLYRGAATEPQPEAHLRRQVELCRDLLEVADRLEPGF 469

Query: 387 HPLLG 391
            P  G
Sbjct: 470 SPFRG 474



 Score = 38.5 bits (88), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 2/46 (4%)

Query: 24 SSSISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKL-HRLECQ 68
           S   RC  C AS   K+CS CQ V YCG + Q+  WK  HR +C+
Sbjct: 2  GSMAGRCAVCDASGG-KQCSGCQQVSYCGRDHQRQHWKAGHREQCR 46


>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
          Length = 338

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA + F        V+A+ ++  G E+ ISY++      S  +
Sbjct: 234 GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTGEEICISYLDECALERSRHS 293

Query: 231 RQKALKEQYLFTCTCPRC 248
           R K L+E YLF C C +C
Sbjct: 294 RHKILRENYLFNCNCSKC 311


>gi|113206683|gb|ABI34492.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 151

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 77/155 (49%), Gaps = 14/155 (9%)

Query: 39  LKKCSACQVVWYCGSNCQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           L +CS C+   YC   CQK  W  H+ EC+ L  L   + +  T S+RL+ ++  +   Q
Sbjct: 1   LSRCSQCKTARYCSVQCQKQAWPDHKRECKCLKHL---QPRIPTDSVRLVARIIFKLLSQ 57

Query: 99  NDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAEN-- 156
           ++    S   + YS+ E   +H++D+ E++      +   + + L      ++ +     
Sbjct: 58  SE----SDQEELYSIAEH-QSHLADMSEEKKEGLKHLCTTLQVYLAEENCDLSRLPSGLD 112

Query: 157 ----FSKLACNAHTICNSELRPLGTGLYPVISIIN 187
                +++ CN  +I + EL+ +G GLYP +S++N
Sbjct: 113 PVSLLARVTCNCFSISDGELQDVGVGLYPSMSLLN 147


>gi|7510854|pir||T30001 hypothetical protein ZC8.3 - Caenorhabditis elegans
          Length = 306

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 108/251 (43%), Gaps = 28/251 (11%)

Query: 17  YVCVPNNSSSISRCDGCFA--SSNLK-KCSACQVVWYCGSNCQKLDWKLHRLECQVLSRL 73
           ++ + N SS+ ++ D C+     N++  C  C+V  +C   C+      H+ EC  L + 
Sbjct: 3   HIFIKNESSNCTKDDYCWTCLGENVELTCEQCKVAKFCSKQCETSGAIDHKYECGPLKKC 62

Query: 74  DKEKRKSVTPSIRLMLKLYLRRK---LQNDNVIPSTTTDNYSL--VEALVAHMSDI--DE 126
                  +    R+++++  R K      D  I     +  S   V  +  H +D+  DE
Sbjct: 63  -----PDLNTDERMLIRIVGRYKDIHSGKDKSIDGFYNNRESKRSVMEIWEHCADMKKDE 117

Query: 127 KQLLLYAQIANLVNLI-----LQWPEISINEIAENFSKLACNAHTICNSE-LRPLGTGLY 180
             +  + +  + V        L   E++    + NF     N H+I N + LR +G GLY
Sbjct: 118 NAMKSFKKTYDRVKQFGDTNHLMDEEVTFQLHSRNFI----NRHSISNVDYLREIGKGLY 173

Query: 181 PVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVLISYIETAGSTMTRQKALKE 237
             +   +HSC PNA+    G +A +RA+     +        +YIE     + R+  LKE
Sbjct: 174 LDLCKYDHSCRPNAIYSCNGIVAKLRALHDNVDLENVETTHYTYIELPPCKIQRRHMLKE 233

Query: 238 QYLFTCTCPRC 248
            + F C C RC
Sbjct: 234 TWYFECHCERC 244


>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
          Length = 359

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 85/175 (48%), Gaps = 32/175 (18%)

Query: 105 STTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVNLIL-----QWPEISIN 151
           ++TT+N +  E L   M+ +        +E  L   A+ + L+  +L     + P  + +
Sbjct: 159 TSTTENDTTDEFLATGMTLLGRILRFTQEECTLFTVARWSELLGAVLLNGQERSPPSNYD 218

Query: 152 EIAENFSKLACNAHTI---------CNSELRPL-----GTGLYPVISIINHSCLPNAVLV 197
            + E   +L C   T+            EL+ L     G G+Y V  + NHSC PN  +V
Sbjct: 219 RLKELVQRLPCGESTMDAFEQEVQMAGKELQHLLQSSRGQGVYTVGCLFNHSCEPNLQVV 278

Query: 198 F----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           +    +  L+VV A++++  G E+ ISYI+ + S   RQ+ L E YLF C CP+C
Sbjct: 279 YSESGDETLSVV-ALRNIELGEELCISYIDESLSYPERQQELYEHYLFVCQCPKC 332


>gi|193683766|ref|XP_001950081.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 533

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 88/364 (24%), Positives = 156/364 (42%), Gaps = 52/364 (14%)

Query: 9   EVIISQEPYVCVPNNSSSISRCDGCFA----SSNLKKCSACQVVW-YCGSNC-QKLDWKL 62
           E+II +EP V  P   +S   C  C      S +L +    + +W  CG+ C   ++  +
Sbjct: 79  EIIIQEEPLVIGPKFPTSEPICIKCLQRLKRSESLVESLCEKCLWPICGTGCVTSINKNI 138

Query: 63  HRLECQVLSRLDKEKRK-------SVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE 115
           H  EC VL +  ++  K       ++TP   L+L+               T  + ++ + 
Sbjct: 139 HEGECTVLVKGSEKIAKNNDYMYDALTPLKCLLLQF--------------TDKNKWNRLM 184

Query: 116 ALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS----- 170
            L +HM +    +  +Y +I ++ N  L+   +S     E F   +   HTIC       
Sbjct: 185 ELKSHM-EYRGPESEVYEEINSVYNY-LKNNYLS----GEEFETSSDLIHTICGILDVNA 238

Query: 171 -ELRPLG---TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIET 224
            +++  G   T +YP +S + H+CLPN  + F+  GR+  V A + + KG  +   Y   
Sbjct: 239 LDVQVAGLELTAIYPTVSKLEHNCLPNTGISFDKCGRI-YVYASRKITKGEHITTMYTNA 297

Query: 225 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI--LEGYRCKDDGCSGFLLRDSD 282
              T  R+  L     F C C RC    +      + +  ++G+ CK +      L DS 
Sbjct: 298 LWGTRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVCNVKGF-CKGNLTPIHPLDDSS 356

Query: 283 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 342
           +  + C +C    S ++I  I +E+N +  K L   S  + ++  S  K+  ++    Y 
Sbjct: 357 E--WECDRCPNTVSSDKIDAILTELNHIVDKALQNPSINSLEDAFS--KLKTRIHSNHYL 412

Query: 343 PFSV 346
            F+V
Sbjct: 413 CFNV 416



 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 26 SISRCDGCFASSNLKKCSACQVVWYCGSNCQKLDWKLHRLEC 67
          +IS C  C  S++  KC+AC +V YC    QK  WK H+ EC
Sbjct: 14 NISHCAFCGNSAS-SKCAACTLVVYCSKEHQKAHWKQHKNEC 54


>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
           rotundata]
          Length = 388

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 49/79 (62%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSAVNHSCIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHS 346

Query: 231 RQKALKEQYLFTCTCPRCI 249
           RQKAL   YLF C C +C+
Sbjct: 347 RQKALSSLYLFICCCSKCL 365


>gi|328770867|gb|EGF80908.1| hypothetical protein BATDEDRAFT_88247 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 428

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 47/265 (17%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQK 233
           G+G+YP  S+INH                 RA +++ KG  + ISY E+AG ++  TRQ+
Sbjct: 153 GSGIYPTASLINH-----------------RASKNIRKGNAITISYGESAGRSLRATRQQ 195

Query: 234 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 293
            LK  + F C C  CI      D Q    L G +C   GC  + L + + +   C +C  
Sbjct: 196 NLKAGWGFECRCSACISEDDTKDRQ----LAGLKCP--GCP-YPLYEGEPR---CPKC-- 243

Query: 294 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ------KKLYHPFSVN 347
                 I  I+ +  +++K       C +H++  +   +I+ LQ      + + HP S  
Sbjct: 244 ----HSIVDISRQQKVMAKIDAVWHECQDHRKHDNVASLIQTLQGIQTDLESIMHPSSSR 299

Query: 348 LMQTREKLIKILMELE---DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-CGKLEWF 403
           L    + L  + ++L    D +  L++C+  +    R+      L  ++      K+ +F
Sbjct: 300 LGCMYDNLASLNVQLAKQGDNRILLSWCEKLLESSIRIVEGIFGLTSIEAANEHNKMAYF 359

Query: 404 LG--DTENAIKSMTEAVEILRITHG 426
           +   D E  IK    A  + R  +G
Sbjct: 360 MMAVDAEKGIKIARHAEGLFRTLYG 384


>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 681

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 17/144 (11%)

Query: 163 NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           NAH +  + L         RPL  G  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 454 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 513

Query: 212 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 268
             G  V  +Y  I T  S   R++ L  +Y F C C  C +   +F+ +    +    RC
Sbjct: 514 RAGDAVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 571

Query: 269 KDDGCSGFLLRDSD-DKGFTCQQC 291
             +GCS    R  D D+   C  C
Sbjct: 572 PTEGCSKLHPRPRDPDRSIECSSC 595


>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 735

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 232
           + TG+YP  S++NHSC PN + +F G+  +VRA + + +   +   Y       +T  RQ
Sbjct: 456 VATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIDQSEGIFNCYGPHYRHMTTEDRQ 515

Query: 233 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 290
           K LK QY FTC C  C   KL  F  I+    ++  +C    C+   ++DS      C  
Sbjct: 516 KILKSQYCFTCKCKACTLPKLQYF--IERFNAVKCLKCNGPVCN---IKDS----IYCLD 566

Query: 291 CG 292
           CG
Sbjct: 567 CG 568


>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
 gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
          Length = 785

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 7/118 (5%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 222
           + H   N E   +G  +Y V S+ NHSC PN    F  R   +R  Q V  G ++ +SY 
Sbjct: 462 SVHLTSNVEQVRVGKAIYKVGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYG 521

Query: 223 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 278
              G      RQ  LK++Y F C C  C ++   D      +L  + C +  CSG +L
Sbjct: 522 PQVGLWDCKDRQSFLKDEYAFHCQCTGCSEVNLSD-----IVLNAFHCVNPNCSGAVL 574


>gi|116179288|ref|XP_001219493.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
 gi|88184569|gb|EAQ92037.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
          Length = 588

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 1/70 (1%)

Query: 179 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 238
           L+P  ++INHSC+PN+   F G L V+RA +++ KG E++ SY E+ G    RQ+AL   
Sbjct: 425 LWPWAALINHSCIPNSEREFVGDLMVIRATKNIAKGEEIVHSYDES-GVYDDRQRALMTT 483

Query: 239 YLFTCTCPRC 248
           + F C+C  C
Sbjct: 484 WGFECSCALC 493


>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
          Length = 399

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 46/98 (46%), Gaps = 17/98 (17%)

Query: 178 GLYPVISIINHSCLPNAV-----------------LVFEGRLAVVRAVQHVPKGAEVLIS 220
           GL+P  + INHSC PNA                  L  EG   ++R+V  + +G  + IS
Sbjct: 132 GLWPQAAAINHSCRPNATHYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITIS 191

Query: 221 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 258
           Y+E       RQ+AL+  Y F CTC RC +    D  Q
Sbjct: 192 YVELGDPWPVRQEALRTGYGFACTCIRCTEEAALDSEQ 229


>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
 gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
          Length = 392

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 210
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 267 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 322

Query: 211 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 323 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 363


>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
 gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
 gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
 gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
 gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
          Length = 507

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 58/386 (15%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKK---CSACQVVW-YCGSNCQKLDWKLHRL 65
           ++I + P+   P  +  +  C GC+  +   +   CS C   W  C    Q+ D    RL
Sbjct: 35  LLIEELPFAVGPKCNGPVV-CLGCYEPNPDPEEELCSECG--WPLCVECAQQADNAHFRL 91

Query: 66  ECQVLSRLDKEKRKSVTPS-------------IRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           EC  L   D   R    PS             +R++L      +  ++ V P        
Sbjct: 92  ECSQLK--DARARFFRLPSGSRHCPQLDCIMPLRVLLAKEANPERWDNEVAPMEHHKEER 149

Query: 113 LVEALVAHMSDIDEKQLLLYA-QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
             +A V H   ++  Q L    Q+AN           S   I +    L  NA    + +
Sbjct: 150 QRDADVWHADRVNIAQYLRGPCQLAN---------RFSEELIMQVVGVLEVNAFEARSPK 200

Query: 172 LRPLGTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
             PL   L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T
Sbjct: 201 GYPLRC-LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGT 259

Query: 229 MTRQKALKEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
             RQK LK+   FTC C RC+   +LG  F  ++     EG++               D 
Sbjct: 260 AQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDT 312

Query: 285 GFTCQQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
            + C  CG   S  +    ++ + SEVN +    +A       + ++  Y       K L
Sbjct: 313 SWNCANCGSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSL 365

Query: 341 YHPFSVNLMQTREKLIKILMELEDWK 366
            HP        R+ LI++   ++ ++
Sbjct: 366 LHPLHFIATGLRQLLIEMYGRVQGYE 391


>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 419

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
 gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
          Length = 392

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 176 GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIET---AGSTMT 230
           G+GLY + S +NHSC+PNA     +   + V++AV  + KG E+ ISY++      S  +
Sbjct: 286 GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHS 345

Query: 231 RQKALKEQYLFTCTCPRC 248
           R K L+E Y+F C C +C
Sbjct: 346 RHKILRENYIFVCQCSKC 363


>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
          Length = 159

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 46  GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 105

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 106 RHKILRENYLFVCSCPKCL 124


>gi|350632678|gb|EHA21045.1| hypothetical protein ASPNIDRAFT_45712 [Aspergillus niger ATCC 1015]
          Length = 367

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 120/290 (41%), Gaps = 48/290 (16%)

Query: 7   SGEVIISQEPYVCVP-----NNSSSISRCDGC------FASSNLKKCSACQV-VWYCGSN 54
           +GE++++  PY  +P         SI    GC       A++++   + C + +W+C  +
Sbjct: 74  AGELLMADLPYAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVTCETGCSLDIWWCNES 133

Query: 55  CQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR----RKLQNDNVIPSTTTDN 110
           C+  D   H +EC  L +     R+ +      ML + +R    R L+     P+     
Sbjct: 134 CKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAARYLELQGTTPTNVHQQ 193

Query: 111 YSLVEALVA---------------------HMSDIDEKQLLLYA------QIANLVNLIL 143
            +  +  V+                     H S + EK LL  +       +  LV+LI 
Sbjct: 194 LTFQDRFVSGWKGVQLLRSNRDSWPTSQLQHWSALAEKYLLQSSLLPETLDLDTLVDLIC 253

Query: 144 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR-L 202
              E ++ E+    +++  +         +  G  ++  I++ NHSC PN     + R  
Sbjct: 254 A-EETNVFELCPGPTEIIPHQSPGVERGTQ-YGLAVFLRITLANHSCAPNVTHQADDRGR 311

Query: 203 AVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 250
            +V A++ +  G E   SY + +       R+K  +E + FTCTCPRC++
Sbjct: 312 MMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTFTCTCPRCLQ 361


>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 576

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEG--RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           +G GL+  +S+ NHSC PNA + +    + A V A++ + +  +VL+SYI+      +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478

Query: 233 KALKEQYLFTCTCPRC 248
           + LK  Y FTC C RC
Sbjct: 479 RLLKRHYKFTCACVRC 494


>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 749

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/217 (22%), Positives = 90/217 (41%), Gaps = 16/217 (7%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 232
           + T +YP  S++NHSC PN +  F G+  + +A + +  G EV   Y       +   RQ
Sbjct: 460 IATAIYPSASMMNHSCDPNIINSFLGQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQ 519

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK------DDGCSGFLLRDSDDKGF 286
           + ++ QY F C C  C    +++DI +       +C       +D CS  L R S     
Sbjct: 520 EKMESQYCFKCNCAAC-SAPEYEDILKK--FTAKKCPECSGPLNDNCS-VLHRSS----I 571

Query: 287 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 346
            C  CG    +        +     +K        N+ E +   K   +L+K   + +  
Sbjct: 572 HCMDCGTAVYENIYDFTLRQAQHYFEKAEICIENENYDEALDKLKKCLRLRKDALYKYHD 631

Query: 347 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 383
           ++  T + + K+   +  W  +++Y + +I   +  Y
Sbjct: 632 DIADTMDFIAKVYAIIGQWLNSISYLEHSIVAIEEKY 668


>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
          Length = 763

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 42/73 (57%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           G+GL+ + S INHSC  N+   F G L +VRA + + +G E+ I+Y+E       RQKA 
Sbjct: 515 GSGLWIMPSYINHSCWQNSTRSFLGDLLIVRAARDLSEGEEITINYMENESGVQKRQKAF 574

Query: 236 KEQYLFTCTCPRC 248
             ++ F C C  C
Sbjct: 575 LSEWGFECKCTMC 587


>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 402

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 175 LGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 232
           LG GL+  +S+ NHSC PNA +   F    A V A++ + +  E+L+SYI+ +     RQ
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQVTALRDIAEKEEILLSYIDESLPLAERQ 322

Query: 233 KALKEQYLFTCTCPRC 248
           + LK  Y FTC C RC
Sbjct: 323 RLLKSHYKFTCGCQRC 338


>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
 gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
          Length = 379

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 4/75 (5%)

Query: 178 GLYPVISIINHSCLPNAV----LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 233
           G+YP  ++ NH CLPNA     +  +    ++R +  VP+G E+ +SY        TRQK
Sbjct: 209 GIYPKAALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQK 268

Query: 234 ALKEQYLFTCTCPRC 248
            L+E Y F C C RC
Sbjct: 269 RLREDYGFICDCDRC 283


>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
          Length = 604

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 10/117 (8%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 232
           +G  LY   S+ NHSC PN  L F  R  +++  + VP G  + +SY    G      R 
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 289
           + L+E+Y F C C  C ++   D      ++ GY C +  C+G +L   D    TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCE 583


>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
 gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
          Length = 128

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   FE G   VV A + +PKG E+ ++Y +   ST+ R+  L 
Sbjct: 51  ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110

Query: 237 EQYLFTCTCPRC 248
               F C CPRC
Sbjct: 111 MTKHFICQCPRC 122


>gi|428180005|gb|EKX48874.1| hypothetical protein GUITHDRAFT_105498 [Guillardia theta CCMP2712]
          Length = 197

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 22/156 (14%)

Query: 163 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 222
           NAH I N   + +G G++P   ++NH+C  NAV  F            VP GAEV  + I
Sbjct: 7   NAHPITNGSFKRIGLGIFPSACMLNHACNYNAVYSF------------VPGGAEVHANAI 54

Query: 223 E----TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY-RCKDDGCSGFL 277
           +    T+   + RQK ++E Y F C C  C      +    S +   +  C    C   +
Sbjct: 55  DFGLNTSSQLIRRQKLIREGYFFQCACEWCAGTSGPERYFNSLVCNPWPDCDGHKCEAAI 114

Query: 278 LRDSD-DKGFTCQQCGLVRSKEEIKKIASEVNILSK 312
           L D++ D+G     C   + +  I    +E+N L+K
Sbjct: 115 LPDANGDEGQAVVICDACKRRHPI----TEMNALNK 146


>gi|66815955|ref|XP_641994.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
 gi|60470035|gb|EAL68016.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
          Length = 563

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 96/209 (45%), Gaps = 11/209 (5%)

Query: 48  VWYCGSNCQKLDWKLHRLECQVLSRLDKE--KRKSVTPSIRLMLKLYLRRKLQNDNVIPS 105
           +W+C   C   +   H+LEC  +++L +   K +    +  L L++ +R K+++D     
Sbjct: 86  MWFCSEICS--NDMTHQLECSFINKLIESSIKNECDVSTCLLALRIMIRNKIESDKY-QE 142

Query: 106 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI----SINEIAENFSKLA 161
           T     + +E  +       EK  + + Q  N +      P +    + +E  +  S + 
Sbjct: 143 TVGKLSNQLEKFIEINKSFIEKYDINFQQFINQLQQENSDPSLLAIFNKDEFLQIISSIY 202

Query: 162 CNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 219
            N+ +  +++   +P+  G +   +++NHSC PN     + +   +RA + + K  E++ 
Sbjct: 203 INSFSGLSNDFNRKPISNGYFYKPALLNHSCEPNIFFTIKDKNLEMRACKKIEKDEEIVD 262

Query: 220 SYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           SY++    T+ RQK L     F C C RC
Sbjct: 263 SYVDLLLPTIERQKILYNSKNFLCKCSRC 291


>gi|255081676|ref|XP_002508060.1| predicted protein [Micromonas sp. RCC299]
 gi|226523336|gb|ACO69318.1| predicted protein [Micromonas sp. RCC299]
          Length = 541

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 65/300 (21%), Positives = 107/300 (35%), Gaps = 88/300 (29%)

Query: 8   GEVIISQEPY-VCVPNNSSSISRCDGCFASSNLKK----CSACQVVWYCGSNCQKLDWKL 62
           G+++++  PY + VP       RC GCF   +  +    C AC +VW+C   C  +D  +
Sbjct: 45  GDLVLANAPYGLPVPMGD----RCAGCFMKEDDDRVKMRCKACDMVWFCSERC--MDGAM 98

Query: 63  ---HRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVE---A 116
              H +EC  + R   +                       D+ I       +S +     
Sbjct: 99  DGGHAVECAAMRRYFAD---------------------DPDDAIRENGFVQFSALHRNCG 137

Query: 117 LVAHMSDIDEKQ----LLLYAQIANLVNLILQWP--EISINEIAENFSKLACNAHTICN- 169
             AH  D DE        L    A+  + I +W        E   +F+  A      C  
Sbjct: 138 KAAHPEDTDEPLDDHLPYLVPTAADKADTIARWALHGFPWREYCGDFAAEATAGARRCGL 197

Query: 170 -----------------------------SELRPLGTGLYPVISIINHSCLPNAVLVFEG 200
                                         ++  +    YP+ +++NHSC P  V  +  
Sbjct: 198 IPSSATDETIKKEVVRSRMNDFFIQQWRGGDIDLMAGATYPLGALLNHSCAPTCVCSY-- 255

Query: 201 RLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
           RL +  AVQ+            V KG E+  +Y++ +  +  R+  L ++Y F C CPRC
Sbjct: 256 RLDISDAVQYPTWIQEFRCVADVGKGTELTHAYVDASDWSNHRRAELLDRYGFVCDCPRC 315


>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
          Length = 680

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 5/111 (4%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           LG G+Y  ++++NHSC P  +  F G + VVRA++    G  V  +Y  I T    + RQ
Sbjct: 476 LGAGVYSTVALLNHSCEPGVIRHFIGDVMVVRAIKSFQPGEMVNENYGPIFTQKRRVDRQ 535

Query: 233 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 282
           ++LK++Y F C C  C +      ++ E A+   +RC D  C   L+  SD
Sbjct: 536 RSLKDRYWFDCRCNPCTENWPLIGEMTEEALC--FRCADRRCRKPLVVQSD 584



 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 33/68 (48%), Gaps = 4/68 (5%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNLKK---CSACQVVWYCGSNCQKLDWKL-H 63
           G+VI    PY  V N     + C  C+    L +   CS C +V +C  NC+    ++ H
Sbjct: 266 GDVIAVDAPYASVMNPEKFSTHCHHCYQILELGEVLPCSHCDLVSFCSVNCRSRAMEIYH 325

Query: 64  RLECQVLS 71
            +EC +LS
Sbjct: 326 AIECPILS 333


>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
           vitripennis]
          Length = 386

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 7/88 (7%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 229
           G+GLY + S INHSC PNA + F   + RL VVRA++ +    E+ I+Y++      S  
Sbjct: 288 GSGLYVLQSAINHSCAPNASVEFPHSDSRL-VVRALRDIKPDEEICIAYLDECHLERSRH 346

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDI 257
           +RQKAL   YLF C C +C +     D+
Sbjct: 347 SRQKALSSLYLFVCKCDKCQQQASDPDV 374


>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
 gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
          Length = 391

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 224
           +  S L   G+ LY   S INHSC PNA  VF     +  +RA + +  G E+ ISY++ 
Sbjct: 279 VVGSFLNNEGSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDE 338

Query: 225 AG---STMTRQKALKEQYLFTCTCPRC 248
                S  +RQK LK+ YLF C C +C
Sbjct: 339 CNLQRSRHSRQKTLKDYYLFICQCEKC 365


>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 419

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 162 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 221
           C     C+ ++  +G  L  V ++INHSC PNA +  EG    VR+++ +  G E+ +SY
Sbjct: 103 CKIARYCSKDV--IGYALDMVTAMINHSCAPNAFVTLEGCQLRVRSLKPIAAGEEITVSY 160

Query: 222 IETAGSTMTRQKALKEQYLFTCTCPRC 248
            +       RQK LKE Y F C C  C
Sbjct: 161 ADPTLPVFNRQKFLKETYFFDCRCKGC 187


>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
          Length = 372

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 49/89 (55%), Gaps = 7/89 (7%)

Query: 166 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYI 222
            +  S L   G GLY + S  NHSC PNA + F     RLAV +A + +  G E+ ISY+
Sbjct: 271 NVAGSFLNCEGAGLYRIQSKCNHSCEPNAEVCFPNNNHRLAV-KACRDIAAGEEITISYL 329

Query: 223 ---ETAGSTMTRQKALKEQYLFTCTCPRC 248
              + A    +RQ+ LKE YLF C C +C
Sbjct: 330 SQCQIARGCRSRQQYLKENYLFHCCCSKC 358


>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
 gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
          Length = 625

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 11/106 (10%)

Query: 146 PEISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFEGRLAV 204
           PE+   E+   F++LA N   +  S +R   + G+YP  S +NHSC PNA  +  G    
Sbjct: 282 PELPPQEV---FARLAAN--VVKTSAVRNANSVGVYPFPSFLNHSCAPNACKLMVGHTMF 336

Query: 205 VRAVQHVPKGAEVLISYIETAGSTMTR--QKALKEQYLFTCTCPRC 248
           +RA + +  G EV + Y +    TM +  + A+ +++ F C CPRC
Sbjct: 337 IRAARDLVAGEEVFMKYFDV---TMPKPERSAVAKRWGFECACPRC 379


>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 140

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 4/99 (4%)

Query: 159 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEV 217
           +  CN   I +     +  G+YP  ++ NHSC PN +  F  G     RA++ +  G E+
Sbjct: 41  RFPCNNFAIVDDLWSGIAAGVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEI 100

Query: 218 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 256
             SY+E    +  R+  L   Y F C C RC   G+ DD
Sbjct: 101 CHSYVELTLPSWKRRDVLLRDYEFLCECERC---GKRDD 136


>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
           mellifera]
          Length = 387

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET---AGSTMT 230
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDI 257
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
 gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
          Length = 746

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%)

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           TG++   +  NH+C+PNA   F G + +VRA + +P GAE+ + Y   A    +R+   K
Sbjct: 503 TGMWLHAAYANHTCIPNATRAFIGDMMIVRAARDIPAGAEIFMGYASLAEPFESRRSKFK 562

Query: 237 EQYLFTCTCPRC 248
             Y F C C  C
Sbjct: 563 TSYGFECDCEMC 574


>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 490

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 9/132 (6%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GLYP+ S+ NH C+PN    F+ +  + VRA   +  G E+ +SY      T  R++ L 
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 293
               F+C C RC    +F+  + SA+L    C  D CSG LL  +       + C +C +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNS-KLSALL----CASDKCSGELLPKNPLNMRTSWICNKCSI 328

Query: 294 VRSKEEIKKIAS 305
             +  +I  I S
Sbjct: 329 TVNHRQICSIRS 340


>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
 gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
          Length = 394

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 16/171 (9%)

Query: 88  MLKLYLR--RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV-NLILQ 144
           M +LYL   +  Q ++    TT D +  +  ++   S      +L  +Q   +V +L L 
Sbjct: 199 MEQLYLAFCQAFQGEDFARFTTPDAFKTLMGIMGTNSQGIATSVL--SQWVTIVSDLPLS 256

Query: 145 WPE-ISINEIAEN-FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 200
            PE + +++  +N ++K+   A    N+E    G+GLY + S INHSC+PNA   F    
Sbjct: 257 DPEKVQLDQEIDNLYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSN 312

Query: 201 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
            + V++A+  + +G E+ ISY++      S  +R K L   Y+F C CP+C
Sbjct: 313 DIVVLKALSPIQEGDEICISYLDECQLERSRHSRHKILCGNYIFICQCPKC 363


>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
           florea]
          Length = 387

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 230
           G+GLY + S +NHSC+PNA++ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDDI 257
           RQKAL   YLF C C +C  L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371


>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
 gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
          Length = 302

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
            L+P+  ++NH C PNA   FE G   VV A + +P GAE+ +SY +   ST+ R+  L 
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285

Query: 237 EQYLFTCTCPRC 248
               F C CPRC
Sbjct: 286 MTKHFMCQCPRC 297


>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
 gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
          Length = 188

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 153 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 210
           I E +SK+   A    N+E    G+GLY + S INHSC+PNA   F     + V++A+  
Sbjct: 62  IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 117

Query: 211 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 248
           + +G E+ ISY++      S  +R K L+E Y+F C C +C
Sbjct: 118 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 158


>gi|294886191|ref|XP_002771602.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239875308|gb|EER03418.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 265

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 9/98 (9%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
           GT +YPV S+ NHSC PN      G L   R    V +G E+ ISYI      +   KA+
Sbjct: 17  GTAVYPVASMFNHSCTPNVTRASLGDLTWFRTCTDVKRGQELTISYI--GSDLLCEPKAV 74

Query: 236 KEQYL---FTCTCPRCIKLGQFDDIQESAILEGYRCKD 270
           ++++L   F+C CP C K    +D  +S ++  +  +D
Sbjct: 75  RQKHLARDFSCNCPACTK----EDDDDSVVIMPFTLQD 108


>gi|443704468|gb|ELU01530.1| hypothetical protein CAPTEDRAFT_197951 [Capitella teleta]
          Length = 614

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/370 (21%), Positives = 145/370 (39%), Gaps = 68/370 (18%)

Query: 108 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-----------IAEN 156
           + NY+ V  L+ H   +  + L+ YA  A L+   L   E   N            +  +
Sbjct: 260 SRNYTAVYHLLPHNESMLREDLIQYALTAALLLKCLDHTEYFKNHSPDFRFSIGGLLLRH 319

Query: 157 FSKLACNAHTICNSELR------------PLGTGLYPVISIINHSCLPNAVLVFEGRLAV 204
             +L CNAH I   E               + T +YP  S++NHSC P +++  +  L  
Sbjct: 320 ICQLVCNAHAITRLEQGLCSQAVVECQQVRIATAIYPTTSLLNHSCDP-SIIARKNEL-F 377

Query: 205 VRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 262
           VR V+ V  G E+   Y           RQ+ L+ QY F C C  C      ++     +
Sbjct: 378 VRLVKDVKAGEEIFNCYGPHYARMPKKERQEVLQSQYFFKCDCSECTAEEPLEN-----L 432

Query: 263 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 322
           L+ YRC+   CS  ++     +   C +C  +    ++ ++++     S   ++   C  
Sbjct: 433 LKAYRCQK--CSHAIISTGTQEVLKCSKCVTMV---DMNQLSAAEKQSSDDFMSSLKCLQ 487

Query: 323 HQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 379
            +++    K ++    ++K +YH           +  K+L E +D   ALA C      +
Sbjct: 488 VEDIDGAIKKLKNSLDIRKMIYH-----------RNHKLLTEAKD---ALARCYCIKGQF 533

Query: 380 QRVYPQFHPLLGLQYYTCGKLEWFLGDT--------------ENAIKSMTEAVEILRITH 425
            R  P     L       G     L +               + A++++ ++++ILR+ +
Sbjct: 534 TRAVPLLKDCLLCVETAFGSRSIELANELHKYAEVLVNARKLDEALETVNQSIDILRLNY 593

Query: 426 GTNSPFMKEL 435
           G      KE+
Sbjct: 594 GPEYSTAKEM 603


>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
 gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
          Length = 348

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 237
           G++P +S  NH C PN      G      AV+ +P G E+ ISYI    S  TRQ  L+E
Sbjct: 173 GVFPEVSKFNHDCRPNVHYRLNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE 232

Query: 238 QYLFTCTCPRC 248
            + FTCTCP+C
Sbjct: 233 -WGFTCTCPQC 242


>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 687

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)

Query: 175 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 232
           +G G+YP ++++NHSC P  V  F G   ++RAV+ +  G E+  +Y  I T      R+
Sbjct: 490 IGGGVYPTVAMLNHSCNPGVVRYFVGTTMILRAVRTISAGEEISENYGPIFTTTPENERK 549

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLL---RDSDDKGFTC 288
           + L+ QY F C C  C   G +  ++E    +  ++C+     G +L    D+++    C
Sbjct: 550 RRLRVQYWFDCNCEACT--GHWPLLEELDPTVLRFKCETGPSCGNVLPIRSDTNEFMIGC 607

Query: 289 QQCG 292
            +CG
Sbjct: 608 VKCG 611



 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFASSNL---KKCSACQVVWYCGSNCQKLDWKLHR 64
           GE++I + PY       + ++ C  CFA   +     C  C  V YC   C+  D ++H 
Sbjct: 274 GEIVIMERPYCASLLAENRLTHCHLCFARIFVPIPAACRTCSCVAYCSRRCRDRDAQVHL 333

Query: 65  LECQVLSRL 73
            EC +L  L
Sbjct: 334 RECSLLPVL 342


>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 442

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 231
           G G+Y V  + NHSC PN  + +    +  L VV A++ V  G E+ ISYI+++     R
Sbjct: 357 GQGIYEVGCLFNHSCDPNLSVQYSSLNDETLTVV-ALRDVKAGEELTISYIDSSLPFAVR 415

Query: 232 QKALKEQYLFTCTCPRCIKLGQFD 255
           Q+ L + YLF C CPRC+  G  D
Sbjct: 416 QQQLLDHYLFECRCPRCVAEGTTD 439


>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 501

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 10/91 (10%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVL----VFEGRLAVVRAVQHVPKGAEVLISYI 222
           +   ++ P    ++P+ + INHSC+PNA +      + R+ VV A++ +  G E+ ISYI
Sbjct: 408 VVEDKVAPEIVAVFPLTARINHSCVPNAQVQSQEFVDARIDVV-ALRDIAAGEEITISYI 466

Query: 223 ---ETAGSTMT--RQKALKEQYLFTCTCPRC 248
               T+GS  T  R++ L  +YLFTC CPRC
Sbjct: 467 GCGRTSGSKSTSRRRRELLAKYLFTCECPRC 497


>gi|384490543|gb|EIE81765.1| hypothetical protein RO3G_06470 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 84/324 (25%), Positives = 141/324 (43%), Gaps = 39/324 (12%)

Query: 8   GEVIISQEPYVCVPNNSSSISRCDGCFAS-SNLKK-CSACQVVW---YCGSNCQKLDW-K 61
           G  I+ ++P++   N+++  + C  CF     +KK C      W   YC  NCQ+  W  
Sbjct: 55  GRTILEEDPFIRQLNDANRTTHCTYCFTEFKKIKKLCRNKDCKWEVLYCSLNCQQQHWLT 114

Query: 62  LHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPS--TTTDNYSLVEALVA 119
            H+  C+  +  + +K      ++   L+ Y+  + +++  +P   +  D+Y   E L  
Sbjct: 115 EHKWFCRFPNLANTDK------NVLFALQGYIASRSKSEYTLPGLVSNIDSYK-PEDLEE 167

Query: 120 HMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIA-ENFSKLACNAHTICNSELRPLG 176
           +   I   +L  + +     + ++++   +I  N  A + F  +      I   E   LG
Sbjct: 168 YRKKIGNSELSKVFHLTEGVIESMVVIMAQIRCNTFAVKQF--VISKTLEIEERETITLG 225

Query: 177 TGLYPVISIINHSCLPNAVLVFE-----GRLAVVRAVQHVPKGAEVLISY--IETAGSTM 229
             +Y   S  NHSC PNA+++F        L V        K AE+ ISY  + T  S  
Sbjct: 226 RAIYLTASKFNHSCNPNALVLFGTNNVGSHLKVHFIGDEAKKNAEINISYGPVSTRHSKK 285

Query: 230 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 289
            R+K LK++Y F C C  C +L  +D I  ++I +  RC+         R SD K   C 
Sbjct: 286 QRKKKLKDEYFFDCNCSSC-QLS-YDPI--TSIYKCQRCRKG-------RLSDIKS-QCD 333

Query: 290 QCGLVRSKEEIKKIASEVNILSKK 313
            C   ++   I KI  E+  L  K
Sbjct: 334 YCDFDQNYLIIPKIEREIKTLKDK 357


>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 300

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 187 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 246

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 247 RHKILRENYLFNCSCPKC--LAEADD 270


>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
           africana]
          Length = 417

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 303

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
           paniscus]
          Length = 304

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
 gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
          Length = 391

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 178 GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 229
           G+YP  S+ NH CLPNA   F+            VVR +  VP+G E+ +SY     +  
Sbjct: 220 GIYPKASLFNHDCLPNACR-FDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYS 278

Query: 230 TRQKALKEQYLFTCTCPRC 248
           TR+K L E Y FTC C RC
Sbjct: 279 TRRKRLLEDYGFTCDCDRC 297


>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
 gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
 gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
          Length = 302

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
           troglodytes]
          Length = 299

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266


>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
          Length = 400

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 287 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 346

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 347 RHKILRENYLFNCSCPKC--LAEADD 370


>gi|321477118|gb|EFX88077.1| hypothetical protein DAPPUDRAFT_305694 [Daphnia pulex]
          Length = 651

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 101/251 (40%), Gaps = 42/251 (16%)

Query: 42  CSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEK-RKSVTPSIRLMLKLYLRRKL-- 97
           C  C  V +C + C+   W   H+ EC +   L      +    ++R +LK+  +R +  
Sbjct: 305 CHQCSQVGFCSTLCRDEAWASYHQSECGLTDSLHGTNVGRHGLLAVRTVLKVGRQRIMDV 364

Query: 98  --QNDNVIPS-----TTTDNYSLVEALVAHMSDIDEKQLLLYAQIA-NLVNLILQW---- 145
             ++D   P+      +TD    +  LV + +      L   A +A  L +LI Q     
Sbjct: 365 ASEDDGCNPAGGELYDSTDYGRTIHRLVGNTARRSVADLFRRAVMAVYLTSLIQQQRDGK 424

Query: 146 ----PEISINEIAENFSKLACNAHTICN------------------SELR--PLGTGLYP 181
                EI    + +      CNAH I +                   ++R   +G    P
Sbjct: 425 EDDPDEILATAVLQLIQSYPCNAHEISHLAFPLPGTPSGPDLPSTLQQIRLCEIGAAAMP 484

Query: 182 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL--ISYIETAGSTMTRQKALKEQY 239
           V+S+INHSC PN V    G +  V+A++ + +G E+L    Y         RQ  L +QY
Sbjct: 485 VLSLINHSCDPNVVRDCYGDVIAVKAIRRIARGDEILDNYGYHYATHDKKERQLKLSQQY 544

Query: 240 LFTCTCPRCIK 250
            F C C  C +
Sbjct: 545 YFRCNCLACAQ 555


>gi|392558999|gb|EIW52184.1| hypothetical protein TRAVEDRAFT_75273 [Trametes versicolor
           FP-101664 SS1]
          Length = 721

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/72 (50%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 183 ISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 240
           IS INHSC PNAV  FE       VRA+  +P GA+V ISYI+ A     RQ+AL   Y 
Sbjct: 179 ISRINHSCSPNAVYRFEPATLTFEVRALSPIPSGAQVFISYIDPALPRAKRQEALS-SYG 237

Query: 241 FTCTCPRCIKLG 252
           FTC C  C   G
Sbjct: 238 FTCACTACALTG 249


>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
          Length = 391

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GLY + S +NHSC+PNAV+ F       V++A++++  G E+  SY++      S  +
Sbjct: 295 GSGLYVLQSSVNHSCIPNAVVEFPYSNNTLVLKAIRNIKVGEEICTSYLDECQLERSRHS 354

Query: 231 RQKALKEQYLFTCTCPRC 248
           RQ+AL   YLF C C +C
Sbjct: 355 RQQALSSLYLFVCHCDKC 372


>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1240

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 117/277 (42%), Gaps = 30/277 (10%)

Query: 175  LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 233
            +   ++PV S++NHSC  N +L + G    +++++++ K  E+   Y   A     R++ 
Sbjct: 893  IAYAIFPVSSLLNHSCDNNTILQYNGSSITIKSLKNISKNDEITGCYGPHAFHLELRERL 952

Query: 234  -ALKEQYLFTCTCPRC-IKLGQFDDIQESAILE--GYRCK-----------------DDG 272
              LKE+Y F C C  C  K+G       +  LE  G +CK                 D  
Sbjct: 953  ECLKEEYFFICRCHACNAKIGPNLLRCPTQSLESGGEQCKGTLLERVDTNSLGQEEDDSY 1012

Query: 273  CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV----NILSKKTLALTSCGNHQEVVS 328
              G    D + + F C  C L   K +   + S+      I S  T  L    +HQ    
Sbjct: 1013 PEGASYDDYNGRKFVCSICSLELGKMDCFMLTSQSVMSGAIFSNATKLLIDTEHHQPPRE 1072

Query: 329  TYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYP 384
              +M+ K   ++K++YH  S  + +  + L +  +  + +KEA+ Y +  I  ++ R+  
Sbjct: 1073 VEEMLFKCLEIRKQIYHVKSRKIGEVYDALSRYYIAKDKYKEAVTYAERAINNIFLRLGH 1132

Query: 385  QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 421
                 L  ++     +    G+   AIK++  +  +L
Sbjct: 1133 SHSTELAREWAKLANIYINAGEPAKAIKAINTSESLL 1169


>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           impatiens]
          Length = 386

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 230
           G+GLY   S INHSC+PNAV+ F     + V++A++ +    E+ ISY++      S  +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           RQKAL   YLF C C +C  L Q +D
Sbjct: 347 RQKALSSLYLFHCYCNKC--LSQVND 370


>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
 gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
          Length = 417

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERSRHS 363

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387


>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
          Length = 656

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 177 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           TGL+   S +NH+C+ NAV  F G L V RA +H+  G E+  +Y E A     RQ AL+
Sbjct: 509 TGLWVRASYVNHACVANAVKDFAGDLMVFRAARHIRAGEEIFHAYDENA-DYDARQAALR 567

Query: 237 EQYLFTCTCPRC 248
           + + F CTC  C
Sbjct: 568 KTWGFACTCALC 579


>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
          Length = 358

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 120/277 (43%), Gaps = 36/277 (12%)

Query: 7   SGEVIISQEPYVCVP-----NNSSSISRCDGC------FASSNLKKCSACQV-VWYCGSN 54
           +GE++++  PY  +P         SI    GC       A++++   + C + +W+C  +
Sbjct: 74  AGELLMADLPYAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVTCETGCSLDIWWCNES 133

Query: 55  CQKLDWKLHRLECQVLSRLDKEKRKSVTPSIRLMLKLYLR-----------RKLQNDNVI 103
           C+  D   H +EC  L +     R+ +      ML + +R             L     +
Sbjct: 134 CKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAARNSHSKIASLAAGKGV 193

Query: 104 PSTTTDNYSLVEALVAHMSDIDEKQLLLYA------QIANLVNLILQWPEISINEIAENF 157
               ++  S   + + H S + EK LL  +       +  LV+LI    E ++ E+    
Sbjct: 194 QLLRSNRDSWPTSQLQHWSTLAEKYLLQSSLLPETLDLDTLVDLICA-EETNVFELCPGP 252

Query: 158 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGA 215
           +++  +         +  G  ++  I++ NHSC PN     +  GR+ +V A++ +  G 
Sbjct: 253 TEIIPHQSPGVERGTQ-YGLAVFLRITLANHSCAPNVTHQADDRGRM-MVTALRDIAPGE 310

Query: 216 EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 250
           E   SY + +       R+K  +E + FTCTCPRC++
Sbjct: 311 ECCTSYFDLSEYVDLQARRKKTQELFTFTCTCPRCLQ 347


>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 435

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 322 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 381

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 382 RHKILRENYLFICSCPKC--LAEADD 405


>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 359

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 84/174 (48%), Gaps = 32/174 (18%)

Query: 106 TTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVNLIL-----QWPEISINE 152
           +TT+N +  + L   M+ +        +E  L   A+ + L+  +L     + P  + + 
Sbjct: 160 STTENDTTDDFLATGMTLLGRILRFTQEECTLFTVARWSELLGAVLLNGQERSPPSNYDR 219

Query: 153 IAENFSKLACNAHTI---------CNSELRPL-----GTGLYPVISIINHSCLPNAVLVF 198
           + E   +L C   T+           +EL+ L     G G+Y V  + NHSC PN  +V+
Sbjct: 220 LKELVRRLPCGESTMDAFEQEVQMAGNELQHLRQSSRGQGVYTVGCLFNHSCEPNLQVVY 279

Query: 199 ----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 248
               +  L+VV A++ +  G E+ ISYI+ + S   RQ+ L E YLF C CP+C
Sbjct: 280 SESGDETLSVV-ALRDIEPGEELCISYIDESLSYPERQQELYEHYLFFCQCPKC 332


>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
 gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
          Length = 507

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 151/386 (39%), Gaps = 58/386 (15%)

Query: 10  VIISQEPYVCVPNNSSSISRCDGCFASSNLKK---CSACQVVW-YCGSNCQKLDWKLHRL 65
           ++I + P+   P  +  +  C GC+ +    +   CS C   W  C    Q+ D    +L
Sbjct: 35  LLIEELPFAIGPKCNGPVV-CLGCYETDPDPEEELCSECG--WPLCVECAQQADNAHFQL 91

Query: 66  ECQVLSRLDKEKRKSVTPS-------------IRLMLKLYLRRKLQNDNVIPSTTTDNYS 112
           EC+ L   D   R    PS             +R++L      +  ++ V P        
Sbjct: 92  ECRELR--DARARFFRLPSGSSHCPQLDCIMPLRVLLAKEANPERWDNEVAPMEHHKEER 149

Query: 113 LVEALVAHMSDIDEKQLLLYA-QIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 171
             +A V H   ++  Q L    Q+AN           S   I +    L  NA    + +
Sbjct: 150 QKDADVWHADRVNIAQYLRGPCQLAN---------RFSEELIMQVVGVLEVNAFEARSPK 200

Query: 172 LRPLGTGLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 228
             PL   L+P   I+ H+C+PN   ++   EG    +RA+  + +G  +  SY  T   T
Sbjct: 201 GYPLRC-LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGT 259

Query: 229 MTRQKALKEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 284
             RQK LK+   FTC C RC+   +LG  F  ++     EG++               D 
Sbjct: 260 AQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDT 312

Query: 285 GFTCQQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 340
            ++C  CG   S  +    ++ + SEVN +    +A       + ++  Y       K L
Sbjct: 313 SWSCASCGSETSNVDALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSL 365

Query: 341 YHPFSVNLMQTREKLIKILMELEDWK 366
            HP        R+ LI++   ++ ++
Sbjct: 366 LHPLHFIATGLRQLLIEMYGRVQGYE 391


>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
 gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
          Length = 378

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 138/329 (41%), Gaps = 66/329 (20%)

Query: 7   SGEVIISQE-PYVCVPN-----NSSSISRCDGCFA---------------SSNLKKCSAC 45
           +G+ I+ +E PY C  N     N   I  C  CF                + N   C+ C
Sbjct: 31  AGDFIVFREKPYSCCINFKIEENEKKI--CHYCFTINNNNNLNNNIHNMHTDNFIICNNC 88

Query: 46  QVVWYCGSNCQKLDWKL-HRLECQV------LSRLDKEKRKSVTPSIRLMLKLYLRRKLQ 98
           ++ WYC   C+  D++  H+ EC++       S+ + E + ++   I++++  Y   K+ 
Sbjct: 89  KLNWYCSEKCKNEDYQTSHQYECKIWKKIQSASQFNNECKTTIKLLIKILILQYNEIKIN 148

Query: 99  NDNVIPSTTTD----NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------WPE 147
           N+N             +  +  L ++ +    +++  + +I+  +   +        +P 
Sbjct: 149 NNNDNNDNNNSYKEIKFKDIIGLESNSNKFSTQRITDFMKISKFIEKSVDKEIIDTIYPT 208

Query: 148 IS-INEIAENFSKL----ACNAHTI-----CNSELR-----PLGTGLYPVISIINHSCLP 192
           +  ++   +N  KL     CN+H I      NS +       +G G+Y   S+ NHSC P
Sbjct: 209 VKEMSLFQKNMIKLMCILECNSHDISFPFRTNSSITTFNYYSIGIGIYFNSSLFNHSCQP 268

Query: 193 NAVLVFE------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 246
           N   V E      G  ++V A++ + +  E+  +YI+   +   R + L+  Y FTC CP
Sbjct: 269 NICKVIESCKNNFGCHSMV-AIRDIKENEEISFNYIQLNQNKQERIEKLQSSYYFTCNCP 327

Query: 247 RCIKLGQFDDIQESAILEGYRCKDDGCSG 275
            C      +       L  Y+CK   CSG
Sbjct: 328 SCCTN---NPNPHKEFLNKYKCKKLNCSG 353


>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
 gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 416

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 362

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|307183932|gb|EFN70520.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 421

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 64/145 (44%), Gaps = 18/145 (12%)

Query: 163 NAHTICNSEL---------RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 211
           NAH    + L         RP  LG  +YP ++  NH C P     F GR  V+RA++ +
Sbjct: 199 NAHEFFETRLSAGHRFRGSRPVYLGVAIYPTVARFNHDCYPAVTRYFVGRHIVIRAIRGL 258

Query: 212 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 268
             G  +  +Y  I T  +   RQ+ L  +Y F C+C  C +   +F+ +   ++    RC
Sbjct: 259 RPGDVIAENYGPIFTKRTLAERQRTLAGRYWFRCSCRACQEDWPRFETLTNDSVR--LRC 316

Query: 269 KDDGCSGFLLRDSD--DKGFTCQQC 291
              GC+G   R     DK   C  C
Sbjct: 317 PTTGCNGLHSRPQQRPDKPIKCSAC 341


>gi|195383960|ref|XP_002050693.1| GJ20068 [Drosophila virilis]
 gi|194145490|gb|EDW61886.1| GJ20068 [Drosophila virilis]
          Length = 515

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 69/291 (23%), Positives = 114/291 (39%), Gaps = 21/291 (7%)

Query: 9   EVIISQEPYVCVPNNSSS--ISRCDGCFASSNLKKCSACQVVW-YCGSNCQKLDWKLHRL 65
           E++I + P +  P+  S+     C  C   ++   C  C   W  CG  C  LD   HR 
Sbjct: 65  EIVIKEAPLMRGPSQISAPVCMGCLNCIEPNDHITCDKCG--WPLCGPECHALD--EHRA 120

Query: 66  ECQVLS-RLDKEKRKSVTPSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI 124
           ECQ+   R  K   +       L   +   R L      P+     +  +E L       
Sbjct: 121 ECQLTQERGQKVNVQEFNGPHPLYTCVSTVRCLLIGETNPANAA-KFKQLETLEQTRRGS 179

Query: 125 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS 184
           ++ +  L A I   +    +  + S  EI      L  N H +  S+  P    ++   S
Sbjct: 180 NQWKADL-ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTAS 236

Query: 185 IINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 243
              +SC+PN    F      ++ A + + K + + I Y +    T  RQ+ L +  LF C
Sbjct: 237 FTENSCVPNLAKSFNKHGHCMLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKC 296

Query: 244 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD--KGFTCQQC 291
           +C RC+ + +             +C+D  C G LL   +DD    + C++C
Sbjct: 297 SCERCLDVTEL-----GTNYSALKCEDRKCPGLLLPIKADDWHGSWRCREC 342


>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 421

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 95/205 (46%), Gaps = 18/205 (8%)

Query: 178 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 236
           GLYP+ ++ NH C+PN    F+ +  + V AV  +  G E+ +SY +    T +R++ LK
Sbjct: 147 GLYPMGALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLK 206

Query: 237 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD---SDDKGFTCQQCGL 293
               F+C C RC      D ++  + L    C  D CSG LL     + +  + C +C +
Sbjct: 207 VTKRFSCNCNRC-----SDPLEFGSQLSTLLCAKDNCSGCLLPRNPLNHESSWICDKCQI 261

Query: 294 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 352
            V  ++ I+ I S +N      +  T     +E++   +  E +  +L    +  +M  +
Sbjct: 262 SVNYRQVIECIHSGLNTFVSDAMYKTP----REIL---RFTEAILSRLVPATNYIMMDMK 314

Query: 353 EKLIKILMELEDWK-EALAYCQLTI 376
            ++I     + D K E L   +L I
Sbjct: 315 YRIISYFGRVPDLKWENLTDTELRI 339


>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
          Length = 416

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIE---TAGSTMT 230
           G+GL+ + S  NHSC+PNA   F     V  V A++ +  G E+ ISY++      S  +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERSRHS 362

Query: 231 RQKALKEQYLFTCTCPRCIKLGQFDD 256
           R K L+E YLF C+CP+C  L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386


>gi|449550207|gb|EMD41172.1| hypothetical protein CERSUDRAFT_91931 [Ceriporiopsis subvermispora
           B]
          Length = 329

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 177 TGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 234
           +G +P++S INH C P+A     FE     + A + +P G E+ I+Y       + RQ  
Sbjct: 138 SGCFPILSRINHDCSPSANYFFAFEPYCGQLWAARDIPDGGEITIAYTTLPAPRVDRQAH 197

Query: 235 LKEQYLFTCTCPRC 248
           L E+Y FTCTC  C
Sbjct: 198 LSERYFFTCTCVAC 211


>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 499

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 120/313 (38%), Gaps = 36/313 (11%)

Query: 9   EVIISQEPYVCVPNNSSSISR--CDGCFAS---SNLKKCSACQVVWYCGSNCQKLDWKLH 63
           +V + + P +  P   S ++R  C GC  S    +  +C  C       SNC   +W  H
Sbjct: 28  QVALVERPLIRGP---SQVTRPVCLGCLKSITADSAVRCERCGWPMCPDSNCSDDEW--H 82

Query: 64  RLECQ--VLSRLDKEKRKSVT---PSIRLMLKLYLRRKLQNDNVIPSTTTDNYSLVEALV 118
           + EC   V  R  K + K  T   PS + +  +    +  N+  + +  T   S   +  
Sbjct: 83  KAECDWTVTRRKQKIEIKDFTNPHPSYQSITVIRCLYQKHNNPEVWAKLTKLESHCASRR 142

Query: 119 AHMSDIDEKQLLLYAQIANLVNL-ILQWPEISINEIAENFSKLACNAHTICNSELRPLGT 177
              S  +  ++ +   +     L   +WP   ++EI      +  N H +  ++  P   
Sbjct: 143 G-GSKYETDRIWIADHLQRFFKLDPTEWP---VDEILRVCGIVQVNGHEVPLTD--PPYV 196

Query: 178 GLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 235
            +Y   S++ HSC+PN    F  +G L +  A   V  G  + ISY +    T  R   L
Sbjct: 197 AIYDAGSMLEHSCVPNCSKSFTRDGHLLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHL 256

Query: 236 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-------RDSDDKGFTC 288
            +   F C CPRC      D  +      G +C  + C G+ L        D  D  + C
Sbjct: 257 ADTKFFVCKCPRCS-----DPTELGTYFSGVKCATEDCMGYSLPNVHPSTNDPFDVEWMC 311

Query: 289 QQCGLVRSKEEIK 301
             C  V     I+
Sbjct: 312 NFCFTVADPNRIE 324


>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
          Length = 386

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)

Query: 167 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI-- 222
           +  S L   G+GLY + S +NHSC PNA+  F       V++A++ +  G E+  SY+  
Sbjct: 278 VVGSFLNNEGSGLYILQSSVNHSCTPNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDE 337

Query: 223 -ETAGSTMTRQKALKEQYLFTCTCPRC 248
            E   S  +RQKAL   YLF C C +C
Sbjct: 338 CELERSRYSRQKALSSLYLFVCHCDKC 364


>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
           [Strongylocentrotus purpuratus]
          Length = 396

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 7/79 (8%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETA---GSTM 229
           G+ LY + S  NHSC+PNA + F      L++V A+  + +G EVLISY++      S  
Sbjct: 288 GSALYKLQSCCNHSCVPNAEISFLHNNSTLSLV-ALTDITEGQEVLISYLDECCKERSRH 346

Query: 230 TRQKALKEQYLFTCTCPRC 248
           +RQK L+E YLF+C C +C
Sbjct: 347 SRQKELRENYLFSCDCSKC 365


>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 419

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
          Length = 395

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYI---ETAGSTMT 230
           G+ LY + S  NHSC+PNA + F    A V   A+Q + +  E+ ISY+   +   S  +
Sbjct: 295 GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQENEEICISYLGECDIGRSRHS 354

Query: 231 RQKALKEQYLFTCTCPRC 248
           RQK L+E YLF C C +C
Sbjct: 355 RQKILRENYLFNCNCMKC 372


>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
          Length = 374

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 232
           G  LYP  +  NHSCLPN     +G+  +V  A + + KG E +I+Y +  T     +RQ
Sbjct: 261 GMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQ 320

Query: 233 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 285
              +EQ+ F CTC RC+K    ++ +E+   +    K + C       SD++G
Sbjct: 321 NHTQEQFQFKCTCERCLK----EEAEENIEFQELELKLEPCPISSATTSDNEG 369


>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
          Length = 420

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)

Query: 176 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 230
           G+GL+ + S  NHSC+PNA   F     L  V A++ +  G E+ ISY++      S  +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERSRHS 363

Query: 231 RQKALKEQYLFTCTCPRCI 249
           R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382


>gi|307202234|gb|EFN81718.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 343

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 95/242 (39%), Gaps = 46/242 (19%)

Query: 42  CSACQVVWYCGSNCQKLDW-KLHRLECQVLSRLDKEKRKSVTP-SIRLMLKLY------- 92
           C  C    YC   C+ ++W K H +EC + S +  EK  S+   S+R+++          
Sbjct: 17  CKYCTYAMYCSEQCRDIEWEKYHDVECAIFSVMVAEKFLSLDLFSLRILMCAIKEAGDIQ 76

Query: 93  -LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISIN 151
            LR  L+  +      T  +S      +      EK + LY  + N      Q+ +I   
Sbjct: 77  GLRTMLERADGSDDPRTMGFSPDGKFHS------EKYISLYGLMTNT-----QYTKIFGT 125

Query: 152 EIAENFSKLA--------------------CNAHTICNSEL---RPLGTGLYPVISIINH 188
           ++  NF  LA                     NAHTI   +       G  + P  S+ NH
Sbjct: 126 KLPGNFKTLANNDNATFLGGLILRHQLIISTNAHTIYEEQYLCAEERGNAILPFCSLFNH 185

Query: 189 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCP 246
           SC PN   V   +  V+  +  + KG ++L +Y       S + RQ  L + Y FTC C 
Sbjct: 186 SCNPNVFRVSRSQHTVLYTLYPIRKGEQLLDNYGSHFAMESKIVRQNMLLKHYHFTCKCI 245

Query: 247 RC 248
            C
Sbjct: 246 PC 247


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.134    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,587,023,827
Number of Sequences: 23463169
Number of extensions: 259040598
Number of successful extensions: 675278
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2536
Number of HSP's successfully gapped in prelim test: 2212
Number of HSP's that attempted gapping in prelim test: 668215
Number of HSP's gapped (non-prelim): 6624
length of query: 455
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 309
effective length of database: 8,933,572,693
effective search space: 2760473962137
effective search space used: 2760473962137
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)