BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017649
(368 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 615 bits (1587), Expect = e-174, Method: Compositional matrix adjust.
Identities = 283/368 (76%), Positives = 335/368 (91%), Gaps = 1/368 (0%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M+KLY+RRKLQ++ ++P+T DNY+LVEALV+H++DIDEKQL+LYAQ+ANLVNLILQWP+
Sbjct: 114 MVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPD 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIINHSCLPN+VLVFE RLAVVRA
Sbjct: 174 INVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVLVFEERLAVVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
VQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPRC ++GQ+DDIQESAILEGYR
Sbjct: 234 VQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+ LS K ++S ++ E
Sbjct: 294 CKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASELKPLSDKA-TMSSSSHYVEAT 352
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ AL YC+LTIPVYQRVYP FH
Sbjct: 353 SIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRAALTYCKLTIPVYQRVYPGFH 412
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGTN+PFMKEL+ KLEEA+AEAS
Sbjct: 413 PLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGTNTPFMKELLFKLEEARAEAS 472
Query: 361 YKLSSKDE 368
+K SSKD+
Sbjct: 473 HKRSSKDK 480
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 283/367 (77%), Positives = 329/367 (89%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L +RRKLQ++ +I ST TDNY LVEALVAHM DIDEKQL+LYAQ+ANLVNLIL+WP+
Sbjct: 114 MVRLLIRRKLQSEKIISSTATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWPD 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
++I EIAENFSKLACNAHTIC+SELRPLGTGLYPV+SIINHSCLPNAVLVF+GRLAVV
Sbjct: 174 VNIKEIAENFSKLACNAHTICDSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLAVVHT 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
VQH+PKG+EVLISYIETAGSTMTRQKALK+QY FTCTCPRCIK+G DDIQESAILEGYR
Sbjct: 234 VQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRCIKMGLLDDIQESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CKD+ C+GFLLRDSDD+GF CQQCGL+RSKEE+KK A+E+ S K S GN QE V
Sbjct: 294 CKDNRCNGFLLRDSDDRGFICQQCGLLRSKEEVKKSAAEIKATSDKASKSISSGNLQEAV 353
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK+IEKLQ+KL HPFS +LMQTREKL+K+LMELEDW EAL+YC+LTIPVYQRVYP+FH
Sbjct: 354 SIYKLIEKLQRKLCHPFSTSLMQTREKLLKMLMELEDWGEALSYCKLTIPVYQRVYPEFH 413
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQY++CGKLEW LGDTE AIKS+T+A+++LRITHGT +PFMKEL++KLEEA+AE S
Sbjct: 414 PLLGLQYFSCGKLEWLLGDTEAAIKSLTKALDVLRITHGTKTPFMKELMMKLEEARAEVS 473
Query: 361 YKLSSKD 367
Y +S D
Sbjct: 474 YLRASND 480
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 611 bits (1575), Expect = e-172, Method: Compositional matrix adjust.
Identities = 282/368 (76%), Positives = 332/368 (90%), Gaps = 4/368 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M+KLY+RRKLQ++ ++P+T DNY+LVEALV+H++DIDEKQL+LYAQ+ANLVNLILQWP+
Sbjct: 114 MVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPD 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPVISIINHSCLPN+VLVFE RLAVVRA
Sbjct: 174 INVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVLVFEERLAVVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
VQH+PKG EVLISYIETAGST+TRQKALKEQYLFTCTCPRC ++GQ+DDIQESAILEGYR
Sbjct: 234 VQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CKDD C GFLLRDSDD GF CQQCGLVR+KEEIK++ASE+ LS K +S +
Sbjct: 294 CKDDRCDGFLLRDSDDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSS----SQAT 349
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YKMIEKLQ KL+HPFS+NLM+TRE ++KILME++DW+ AL YC+LTIPVYQRVYP FH
Sbjct: 350 SIYKMIEKLQTKLFHPFSINLMRTREAILKILMEMKDWRAALTYCKLTIPVYQRVYPGFH 409
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQYYTCGKLEW LG+TE+A+KS+T+A +IL+ITHGTN+PFMKEL+ KLEEA+AEAS
Sbjct: 410 PLLGLQYYTCGKLEWLLGETEDAVKSLTKAADILQITHGTNTPFMKELLFKLEEARAEAS 469
Query: 361 YKLSSKDE 368
+K SSKD+
Sbjct: 470 HKRSSKDK 477
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 266/357 (74%), Positives = 309/357 (86%), Gaps = 1/357 (0%)
Query: 11 QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF 70
QND +IPST DNY+LVEALVAHMSDI E+QL+LYAQ+ANLVN IL+WP I+I EIAENF
Sbjct: 129 QNDKIIPSTAMDNYNLVEALVAHMSDITEEQLVLYAQMANLVNSILEWPGINIKEIAENF 188
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
SK ACNAHTIC+SELRP+GTGLYPVISIINHSCLPN+VLVFEG A+VRAVQH+P G EV
Sbjct: 189 SKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGSSALVRAVQHIPSGTEV 248
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
LISYIETA STMTRQKALKEQYLFTCTCPRC K+GQ+DDIQESAILEGY+CK + C GFL
Sbjct: 249 LISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDDIQESAILEGYKCKSEKCGGFL 308
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
LR +D KGF CQ CGL+R KEEIK+I +E+ +LS+ + S +QE +S YK IEKLQ
Sbjct: 309 LRTTDGKGFQCQGCGLIRDKEEIKRITTEIKLLSEDA-SKPSATYYQEAISIYKRIEKLQ 367
Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
+L+HP S+NLM TREK++K LMELE W EALAYC+LTIP YQRVYP HPL GLQYYTC
Sbjct: 368 TELFHPLSINLMHTREKILKSLMELEHWTEALAYCKLTIPFYQRVYPAVHPLPGLQYYTC 427
Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
GKLEW+LGDTE A+KS+T+AV+ILRITHGTN+PFMK+L++KLEEA+ EASYK SSK+
Sbjct: 428 GKLEWYLGDTEEAVKSLTKAVDILRITHGTNTPFMKDLLMKLEEARTEASYKFSSKE 484
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 566 bits (1458), Expect = e-159, Method: Compositional matrix adjust.
Identities = 266/368 (72%), Positives = 313/368 (85%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M+KL++RRKLQ++ VIP TDNY LVE L+ HMS IDEKQL+LYAQ+ LVN ILQWP
Sbjct: 114 MIKLFIRRKLQSEKVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLVNFILQWPG 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+++ EIA NFSKLACNAHTIC+SELRPLGTGLYPVISIINHSCLPNAVLVFEGR AVVRA
Sbjct: 174 MNVKEIAVNFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLPNAVLVFEGRTAVVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
VQH+P GAEV ISYIETAGSTMTRQK LKE YLFTCTC RC+K+ Q D+I+ESAILEGYR
Sbjct: 234 VQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRCVKVAQEDEIKESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C++D C GFLLR+SDD GFTCQQCGLVRSKE+IK IAS++ +S + S ++ E +
Sbjct: 294 CRNDQCDGFLLRNSDDTGFTCQQCGLVRSKEDIKNIASKIKSISDEASTSLSSQSYAEAL 353
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
Y+ +EKLQ+ L HP+S++LMQTREKL+KI MELE+W +AL YC+LTI VYQ++YP H
Sbjct: 354 FMYEKVEKLQRILCHPYSISLMQTREKLLKISMELENWTKALTYCKLTISVYQKLYPGIH 413
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQ+YTCGKLEW LG TE+AIKS T+A +ILRITHGTNS FMKEL+LKLEEA+AEAS
Sbjct: 414 PLLGLQFYTCGKLEWLLGHTEDAIKSYTKAFDILRITHGTNSSFMKELLLKLEEARAEAS 473
Query: 361 YKLSSKDE 368
YKLSS D+
Sbjct: 474 YKLSSTDD 481
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 266/368 (72%), Positives = 312/368 (84%), Gaps = 23/368 (6%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++LYLRRKLQN+ D+DEKQL+LYAQ+ANLV+ ILQWPE
Sbjct: 114 MVRLYLRRKLQNE---------------------MDLDEKQLVLYAQMANLVHFILQWPE 152
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
I++ EIAENFSKLACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL FEG+ +VVRA
Sbjct: 153 INLKEIAENFSKLACNAHTICDCELRPLGTGLYPVVSIINHSCMPNAVLTFEGKSSVVRA 212
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
V+H+P+GAEV I+YI+TAGSTMTRQKALKEQY FTCTCPRCIK+ +DDIQESAILEGYR
Sbjct: 213 VEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRCIKV--YDDIQESAILEGYR 270
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CKDD C+GFLLRDS+DKGF CQ CGL RSKEE+K+I E+ +S K L TS GNH+EV+
Sbjct: 271 CKDDRCNGFLLRDSEDKGFICQTCGLRRSKEEVKRIVCEITAISDKKLKSTSPGNHEEVI 330
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YKMIEKLQ +L HPFS++LM+T+E+L+KILMEL DW+EALAYC+LTI YQRVYP+ H
Sbjct: 331 SLYKMIEKLQMELCHPFSISLMRTQEELLKILMELGDWREALAYCRLTITGYQRVYPEPH 390
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQYYTCGK+EW LG TE+AIKS+T AV+ILRITHGTNSPFMKEL++KL+EA AEAS
Sbjct: 391 PLLGLQYYTCGKIEWLLGYTEDAIKSLTRAVDILRITHGTNSPFMKELMMKLDEAHAEAS 450
Query: 361 YKLSSKDE 368
Y LSSKDE
Sbjct: 451 YNLSSKDE 458
>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR1-like [Glycine max]
Length = 401
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/357 (72%), Positives = 305/357 (85%), Gaps = 3/357 (0%)
Query: 11 QNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF 70
QND ++PST DNY+LVEALVA+MSDI E+QL+LYA++ANLVN ILQWPEI+I EIAENF
Sbjct: 48 QNDKIVPSTAMDNYNLVEALVAYMSDITEEQLVLYAKMANLVNSILQWPEINIKEIAENF 107
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
SK ACNAHTIC+SELRP+GTGLYPVISIINHSCLPN+VLVFEGR A+V VQH+P G EV
Sbjct: 108 SKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEV 167
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
ISYIETAGSTMT+Q ALK FTCTCPRC K+G++DDIQESAILEGY+CK + C GFL
Sbjct: 168 PISYIETAGSTMTQQNALKSSTXFTCTCPRCSKVGEYDDIQESAILEGYKCKSEKCGGFL 227
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
L KGF CQ CGLVR KEEIK+I +E+ +LS++ + +S N+QE +S YK IEKLQ
Sbjct: 228 LY---GKGFQCQGCGLVRDKEEIKRITTEIKLLSEEKIISSSTCNYQEAISIYKWIEKLQ 284
Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
+LYHP SVNLMQ REK++K LMELE W EALAYC+LTIP YQRVYP HPLLGLQYYTC
Sbjct: 285 TELYHPLSVNLMQNREKILKSLMELEHWAEALAYCKLTIPFYQRVYPAVHPLLGLQYYTC 344
Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
GKLEW+LGDT+ A+KS+ +AV+ILRITHGTN+PFMK+L++KLEEA+AEASY+LS K+
Sbjct: 345 GKLEWYLGDTDEAVKSLIKAVDILRITHGTNTPFMKDLLMKLEEARAEASYRLSPKE 401
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 543 bits (1400), Expect = e-152, Method: Compositional matrix adjust.
Identities = 262/398 (65%), Positives = 313/398 (78%), Gaps = 38/398 (9%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--------------------------- 33
MLKLYLRRKLQ+ +IPST DNY LVEALVA
Sbjct: 110 MLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVI 169
Query: 34 ----------MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA FSK ACNAHT+C+S
Sbjct: 170 VLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDS 229
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
ELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A VRA+QH+PKG EVLISYIETAGST+T
Sbjct: 230 ELRPLGTGLYPVVSIINHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVT 289
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
RQKAL+EQYLF C CP C K+GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+
Sbjct: 290 RQKALREQYLFQCVCPLCSKVGQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQE 349
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
CGLVR KEEIK+IA+E+ L ++ +S +H E +S +KMIEKLQ KLYHPFS+NLMQ
Sbjct: 350 CGLVRDKEEIKQIATEIKFLLEEASKPSSNDSH-EAISIHKMIEKLQTKLYHPFSINLMQ 408
Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
TRE ++K LM+LE W+EALAYC+LTIP+YQRVYP HPLLGLQYYTCGKLEW+LGDTE A
Sbjct: 409 TRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEA 468
Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
IKS+T+AV+ILRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 469 IKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEASF 506
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 241/364 (66%), Positives = 307/364 (84%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M+KLY++R LQN+ V+P T+TDNYSLVEALV+HMS+IDEKQ+LLYAQ+ANLVNLILQ+P
Sbjct: 114 MVKLYIKRNLQNEKVLPITSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPS 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIAENFSK +CNAH+IC+SELRP G GL+P++SIINHSC PNAVLVFE ++AVVRA
Sbjct: 174 VDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C RC +G+ DI+ESAILEGYR
Sbjct: 234 MDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNVGKPHDIEESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++ +S+K A S N Q +
Sbjct: 294 CSNEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDLKTVSEKAHASPSAENKQAAI 353
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
YK IEKLQ KLYH FS+ LM+TREKL+K+LM+LE W+EAL YC+L +PVYQRVYP H
Sbjct: 354 ELYKTIEKLQVKLYHSFSITLMRTREKLLKMLMDLEIWREALNYCRLIVPVYQRVYPATH 413
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HGT++PFMKEL KLEEA+AEAS
Sbjct: 414 PLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGTSTPFMKELSAKLEEARAEAS 473
Query: 361 YKLS 364
YKL+
Sbjct: 474 YKLA 477
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 236/362 (65%), Positives = 302/362 (83%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++LY++R LQN+ V+P TTTDNYSLVEALV+HMS+IDEKQ+LLYAQ+ANLVNLILQ+P
Sbjct: 114 MVRLYIKRNLQNEKVLPITTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPS 173
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIAENFSK +CNAH+IC+SELRP G GL+P++SIINHSC PNAVLVFE ++AVVRA
Sbjct: 174 VDLREIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ ++ K +E+ ISYIETAGST+TRQK+LKEQYLF C C RC G+ DI+ESAILEGYR
Sbjct: 234 MDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNFGKPHDIEESAILEGYR 293
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C ++ C+GFLLRD ++KGF CQ+C L+RSKEE+KK+AS++ +S+K S + Q +
Sbjct: 294 CANEKCTGFLLRDPEEKGFVCQKCLLLRSKEEVKKLASDLKTVSEKAPTSPSAEDKQAAI 353
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
YK IEKLQ KLYH FS+ LM+TREKL+K+LM++E W+EAL YC+L +PVYQRVYP H
Sbjct: 354 ELYKTIEKLQVKLYHSFSIPLMRTREKLLKMLMDVEIWREALNYCRLIVPVYQRVYPATH 413
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PL+GLQ+YT GKLEW LG+T+ A+ S+ +A +ILRI+HG ++PFMKEL KLEEA+AEAS
Sbjct: 414 PLIGLQFYTQGKLEWLLGETKEAVSSLIKAFDILRISHGISTPFMKELSAKLEEARAEAS 473
Query: 361 YK 362
YK
Sbjct: 474 YK 475
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 230/368 (62%), Positives = 290/368 (78%), Gaps = 6/368 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA +TM RQ LK Y FTCTCPRC+K D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C D C GFLL ++ +KG+TCQ+C R EE++K+AS+V +LS K +L S GN+ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP FH
Sbjct: 352 SMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPFH 411
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P++GLQ+YTCGKLEW L TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEAS
Sbjct: 412 PMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEAS 471
Query: 361 YKLSSKDE 368
++LS+ DE
Sbjct: 472 FRLSAGDE 479
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/369 (62%), Positives = 290/369 (78%), Gaps = 7/369 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA +TM RQ LK Y FTCTCPRC+K D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG-NHQEV 239
C D C GFLL ++ +KG+TCQ+C R EE++K+AS+V +LS K +L S G ++ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEV 351
Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP F
Sbjct: 352 GSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPF 411
Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 359
HP++GLQ+YTCGKLEW L TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEA
Sbjct: 412 HPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEA 471
Query: 360 SYKLSSKDE 368
S++LS+ DE
Sbjct: 472 SFRLSAGDE 480
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 476 bits (1225), Expect = e-132, Method: Compositional matrix adjust.
Identities = 228/369 (61%), Positives = 289/369 (78%), Gaps = 7/369 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELR LGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRSLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA +TM RQ LK Y FTCTCPRC+K D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG-NHQEV 239
C D C GFLL ++ +KG+TCQ+C R EE++K+AS+V +LS K +L S G ++ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEV 351
Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP F
Sbjct: 352 GSMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPF 411
Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEA 359
HP++GLQ+YTCGKLEW L TE+A+ S+T A +ILRITHGT S FMKEL+ KLEE +AEA
Sbjct: 412 HPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKELLGKLEEVRAEA 471
Query: 360 SYKLSSKDE 368
S++LS+ DE
Sbjct: 472 SFRLSAGDE 480
>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
Length = 477
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 235/398 (59%), Positives = 283/398 (71%), Gaps = 72/398 (18%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAH--------------------------- 33
MLKLYLRRKLQ+ +IPST DNY LVEALVA
Sbjct: 110 MLKLYLRRKLQDQKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVI 169
Query: 34 ----------MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
MSDI E+QL+LYAQ+ANLV+LILQWPEI+I EIA FSK ACNAHT+C+S
Sbjct: 170 VLFNTYFHEDMSDIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDS 229
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
ELRPLGTGLYPV+SIINHSCLPN+VLVF+GR A+VRA+QH+PKG E
Sbjct: 230 ELRPLGTGLYPVVSIINHSCLPNSVLVFDGREALVRALQHIPKGTE-------------- 275
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
GQ++D++E+AILEGYRCK++ C GFLLR +D K F CQ+
Sbjct: 276 ---------------------GQYEDVRENAILEGYRCKNETCDGFLLRTTDGKAFQCQE 314
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
CGLVR KEEIK+IA+E+ L ++ +S GN E +S +KMIEKLQ KLYHPFS+NLMQ
Sbjct: 315 CGLVRDKEEIKQIATEIKFLLEEASKPSSNGNSHEAISIHKMIEKLQTKLYHPFSINLMQ 374
Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
TRE ++K LM+LE W+EALAYC+LTIP+YQRVYP HPLLGLQYYTCGKLEW+LGDTE A
Sbjct: 375 TRETILKSLMKLEYWREALAYCKLTIPIYQRVYPAVHPLLGLQYYTCGKLEWYLGDTEEA 434
Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
IKS+T+AV+ILRITHGT +PF+KEL + L EA+AEAS+
Sbjct: 435 IKSLTKAVDILRITHGTKTPFVKELSMMLVEARAEASF 472
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 224/368 (60%), Positives = 283/368 (76%), Gaps = 6/368 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQN+ VIPST TDNY LV+AL +H+S++D+KQL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLILRRKLQNEKVIPSTGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLILPAIE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + E A FSK +CNAHTIC+ ELRP+GTGL+P IS INHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTINHSCVPNAVLLFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + EV ISYIETA +T+ R LK Y FTCTCPRCIK +E +LEGYR
Sbjct: 238 LQPLSNNTEVSISYIETAATTLKRHNDLK-HYFFTCTCPRCIK-----GSEEDPLLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CKD C GFLL DS K +TCQ+C + R EE+KK++SE+ +LS K + S GN+ E
Sbjct: 292 CKDQNCDGFLLPDSGKKAYTCQKCSMCRDGEEVKKLSSEILLLSDKASSFVSSGNNNEAG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK+IE+L++KLYH FS+ L+ TRE L+K+ MEL+DW+ AL YC+ TIPVY+RVYP FH
Sbjct: 352 SVYKIIEQLERKLYHSFSITLLHTRETLLKLYMELQDWRSALMYCRSTIPVYERVYPPFH 411
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P++GLQ+YTCGKLEW L TE+A+KS+T A ++LRITHGTNS FMKEL KLEEA+AE S
Sbjct: 412 PMVGLQFYTCGKLEWLLEYTEDALKSLTRAADVLRITHGTNSQFMKELFGKLEEARAEVS 471
Query: 361 YKLSSKDE 368
++LS +E
Sbjct: 472 FRLSPGNE 479
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 220/366 (60%), Positives = 280/366 (76%), Gaps = 6/366 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L L+RKLQN+ VIPS++ DNY+LV+AL +H+ +DE L+LYAQ+ANLV+LIL E
Sbjct: 118 MVRLVLKRKLQNEKVIPSSSIDNYNLVDALESHIWKVDENHLVLYAQMANLVSLILPLIE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIA FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA T R LK QY FTC+CPRC+K +E A+LEG+R
Sbjct: 238 LQPIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTCSCPRCVK-----GSEEDALLEGFR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CK+ C GFLL DS K +TCQ+C + R +EEI+K+ SE+ LS K + S GN E
Sbjct: 292 CKNQTCDGFLLPDSGKKAYTCQKCSVSRDEEEIQKMRSEILQLSDKASSFLSSGNKAEAG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK IE+L++ LYH FS L+ T E L+KI MEL+DW+ ALAYC+LT+PVY+RVYP FH
Sbjct: 352 SVYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQDWRTALAYCRLTVPVYERVYPPFH 411
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P++GLQ+YTCGKLEW L TE+A+KS+T A +IL++THGT S FMKEL KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLECTEDALKSLTRATDILKVTHGTKSQFMKELFGKLEEARAEVS 471
Query: 361 YKLSSK 366
+++SS+
Sbjct: 472 FRISSR 477
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 223/366 (60%), Positives = 280/366 (76%), Gaps = 6/366 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA T R LK QY FTCTCPRC+K FD E A+LEG+R
Sbjct: 238 LQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CK+ C GFLL +S K +TCQ+CG R EEIK + SE+ LS K + S GN E
Sbjct: 292 CKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK+IE+L++ LYH FS L+ T E L+KI +EL+DW AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P++GLQ+YTCGKLEW L TE+A+KS+T A +IL++THG S FMKEL KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLEFTEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVS 471
Query: 361 YKLSSK 366
++LSS+
Sbjct: 472 FRLSSR 477
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 459 bits (1181), Expect = e-127, Method: Compositional matrix adjust.
Identities = 223/366 (60%), Positives = 279/366 (76%), Gaps = 6/366 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L L+RKLQN+ VIPS++ DNY+LV+AL +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDALESHISKVDENQLVLYAQMANLVSLILPFIE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA T R LK QY FTCTCPRC+K FD E +LEG+R
Sbjct: 238 LQPIDKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDPLLEGFR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CK+ C GFLL S K +TCQ+CG R EEIK + SE+ LS K + S GN E
Sbjct: 292 CKNQTCDGFLLPSSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK+IE+L++ LYH FS L+ T E L+KI +EL+DW AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P++GLQ+YTCGKLEW L TE+A+KS+T A +IL++THG S FMKEL KLEEA+AE S
Sbjct: 412 PMIGLQFYTCGKLEWLLEFTEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVS 471
Query: 361 YKLSSK 366
++LSS+
Sbjct: 472 FRLSSR 477
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 196/314 (62%), Positives = 247/314 (78%), Gaps = 6/314 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA +TM RQ LK Y FTCTCPRC+K D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C D C GFLL ++ +KG+TCQ+C R EE++K+AS+V +LS K +L S GN+ EV
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK IE+L++KLYHP S+ L+ TRE L+KI MEL+DW+ AL YC+LTIPVY+R+YP FH
Sbjct: 352 SMYKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYERIYPPFH 411
Query: 301 PLLGLQYYTCGKLE 314
P++GLQ+YTCGKLE
Sbjct: 412 PMIGLQFYTCGKLE 425
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/316 (61%), Positives = 241/316 (76%), Gaps = 6/316 (1%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EIA+ FSK ACNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA
Sbjct: 178 LDLKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA T R LK QY FTCTCPRC+K FD E A+LEG+R
Sbjct: 238 LQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
CK+ C GFLL +S K +TCQ+CG R EEIK + SE+ LS K + S GN E
Sbjct: 292 CKNQACDGFLLPNSGKKAYTCQKCGASRDVEEIKNMRSEILQLSDKASSFLSSGNKAEAG 351
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
S YK+IE+L++ LYH FS L+ T E L+KI +EL+DW AL YC+LTIPVY+RVYP FH
Sbjct: 352 SIYKIIEQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYERVYPPFH 411
Query: 301 PLLGLQYYTCGKLEWF 316
P++GLQ+YTCGKLEW+
Sbjct: 412 PMIGLQFYTCGKLEWY 427
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 165/290 (56%), Positives = 216/290 (74%), Gaps = 1/290 (0%)
Query: 73 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
+CNAHTIC+SELRP+GTGLYPVISIINHSC PNAVL+FEGR AVVRAV+ + +G+E+ +
Sbjct: 142 FSCNAHTICDSELRPMGTGLYPVISIINHSCFPNAVLLFEGRQAVVRAVEPIREGSELTV 201
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
SYIE A ST +R+K+LKEQY F C C RC+K+ D + E AILEG+RC D C GFLL
Sbjct: 202 SYIEIAASTASRKKSLKEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDHCEGFLLH 261
Query: 193 DSDD-KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D DD + CQ CG R++EE KK A +V+ L K+ L S GN+ E S Y+ I++LQ
Sbjct: 262 DPDDAQSLVCQLCGCGRNEEETKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQT 321
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
+L+HP+SV L++T + L+KI MEL DWK+AL YC+LTIP Y+R YP HP++GLQYY CG
Sbjct: 322 QLWHPYSVILLRTGDTLLKICMELYDWKQALKYCRLTIPAYERAYPTCHPMMGLQYYACG 381
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
KLEWFL +T A+ +A +IL +THG NS F+ +L +++EA AEA++
Sbjct: 382 KLEWFLENTLEALNFFEKAAKILTVTHGRNSEFLTQLFDRIQEAHAEAAH 431
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 162/364 (44%), Positives = 239/364 (65%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
+++ ++R+LQ + V DNY + E L HMS+ E +L++YAQ+A++V ++ E
Sbjct: 121 VMRFLIKRRLQAERVQLKAAVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMAPDE 180
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+++ E+ + + ACNAHTIC+ E+RPLGTGLYPVISI+NHSC+PNAVL F+G A +RA
Sbjct: 181 VNVKEVTQMICRFACNAHTICDEEVRPLGTGLYPVISIVNHSCVPNAVLHFDGNRAALRA 240
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
++ +G E+ ISY+E A ST TR+KAL++QY F C C RC +L + +E A LEGY
Sbjct: 241 LEDTQEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRCSRLVTREGSREDAFLEGYG 300
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C + C+G L+ D C+ CGL R ++ K A EV + + L + G +
Sbjct: 301 CVNSDCNGPLIEDPGSDKVICEVCGLKREVQQTKSAAKEVELDVLEASNLYAAGKLESAR 360
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
Y +E Q+KL+HP+SV L++T + L+KI M++EDW AL +CQ TIP Y+R YP F
Sbjct: 361 RLYSEVEAKQRKLWHPYSVPLLRTHDALLKICMDMEDWASALEFCQSTIPAYERAYPPFS 420
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
PLLGLQY+T GKL+W LGD+ A+ ++ +A ++++THG+ S + L L+EAQAE +
Sbjct: 421 PLLGLQYFTLGKLQWLLGDSAKAVNTLRKAYTVIQVTHGSKSELLHGLTSTLQEAQAEVA 480
Query: 361 YKLS 364
YK S
Sbjct: 481 YKRS 484
>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
Length = 453
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/359 (40%), Positives = 228/359 (63%), Gaps = 7/359 (1%)
Query: 6 LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL--QWPEISI 63
++R+LQ V+P T DNY +V AL H S+ +++L++YAQ+A L+ IL ++ E +
Sbjct: 87 VKRELQATGVLPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAE-DV 145
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
EI ++ +++CN HTIC+ ELRP+G GL+PV+S+INHSC N++L+F+G+ AVVRA+
Sbjct: 146 KEITKDICRISCNGHTICDDELRPVGIGLFPVVSVINHSCSSNSLLLFDGKHAVVRALGT 205
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCK 182
+ +G EV +SYIE ST +R++AL +QY F C CPRC + Q + + +LE C
Sbjct: 206 ISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTDVSPQAHGLYKDDVLEAVACL 265
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D C F+ + + C CG R E+ K+++E + +K L + G+
Sbjct: 266 DPACESFM--RLSNGSWRCMSCGSSREPNEVNKLSTEAEGMIEKANRLQAAGDLHGARMA 323
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP-QFHP 301
++ EKLQ +L++P SV LM+TR+ L+++ + LEDW AL C+LT+P Y+ YP HP
Sbjct: 324 FQQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLEDWDLALHVCRLTLPAYETAYPGSKHP 383
Query: 302 LLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
LLGLQYYT GKLE G A+++ +A +IL +THG+ F+++L +L++A+ AS
Sbjct: 384 LLGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILSVTHGSRGEFVRKLRNELDQAKMVAS 442
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 169/357 (47%), Positives = 216/357 (60%), Gaps = 55/357 (15%)
Query: 15 VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLA 74
+ + T D ++ + M + K+ L ++ANLV+LIL + E+ + EIA+ FSK A
Sbjct: 99 AMAALTEDRKKMLTPTIRLMVRLALKRKLQNEKMANLVSLILPFIELDLKEIAQTFSKFA 158
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CNAHTIC+ ELRPLGTGLYPVISIINHSC+PNAVL+F+GR A VRA+Q + K EV ISY
Sbjct: 159 CNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISY 218
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
IETA T R LK QY FTCTCPRC+K FD E A+LEG+RCK+ C GFLL +S
Sbjct: 219 IETATVTKKRNNDLK-QYFFTCTCPRCVK--GFD---EDALLEGFRCKNQACDGFLLPNS 272
Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
D E S YK+IE+L++ LY
Sbjct: 273 DK----------------------------------------AEAGSIYKIIEQLERNLY 292
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR-----VYPQFHPLLGLQYYT 309
H FS L+ T E L+KI +EL+DW AL YC+LTIPVY+ P P + Y
Sbjct: 293 HAFSTTLLHTCETLLKIYLELQDWWTALTYCRLTIPVYESKAWRLKIPFPRPRTVREAYG 352
Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 366
G LE+ TE+A+KS+T A +IL++THG S FMKEL KLEEA+AE S++LSS+
Sbjct: 353 TGLLEF----TEDALKSLTRAADILKVTHGVKSQFMKELFGKLEEARAEVSFRLSSR 405
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/358 (40%), Positives = 227/358 (63%), Gaps = 7/358 (1%)
Query: 6 LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL--QWPEISI 63
++R+LQ V+P T DNY +V AL H S+ +++L++YAQ+A L+ IL ++ E +
Sbjct: 120 VKRELQATGVLPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAE-DV 178
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
EI ++ +++CN HTIC+ ELRP+G GL+PV+SIINHSC N++L+F+G+ AVVRA+
Sbjct: 179 KEITKDICRISCNGHTICDDELRPVGIGLFPVVSIINHSCSSNSLLLFDGKHAVVRALGT 238
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+ +G EV +SYIE ST +R++AL +QY F C CPRC + + + +LE C D
Sbjct: 239 ISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTDDSEA-GLYKDDVLEAVACLD 297
Query: 184 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
C F+ + + C CG R E+ K+++E + +K L + G+ +
Sbjct: 298 PACESFM--RLSNGSWRCMSCGSSREPNEVNKLSTEAEGMIEKANRLQAAGDLHGARMAF 355
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP-QFHPL 302
+ EKLQ +L++P SV LM+TR+ L+++ + LEDW AL C+LT+P Y+ YP HPL
Sbjct: 356 QQAEKLQTELWNPRSVKLMRTRDLLLRVYLSLEDWDLALHVCRLTLPAYETAYPGSKHPL 415
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
LGLQYYT GKLE G A+++ +A +IL +THG+ F+++L +L++A+ AS
Sbjct: 416 LGLQYYTLGKLEMHAGSVTEAVRAYAKAFDILSVTHGSRGEFVRKLRNELDQAKMVAS 473
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 143/235 (60%), Positives = 180/235 (76%), Gaps = 6/235 (2%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K EV ISYIETA +TM RQ LK Y FTCTCPRC+K D +E A+LEGYR
Sbjct: 238 LQPISKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVK-----DSEEDALLEGYR 291
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
C D C GFLL ++ +KG+TCQ+C R EE++K+AS+V +LS K +L S G+
Sbjct: 292 CNDQKCDGFLLPNAGNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGS 346
>gi|222625640|gb|EEE59772.1| hypothetical protein OsJ_12271 [Oryza sativa Japonica Group]
Length = 384
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 103/200 (51%), Positives = 128/200 (64%), Gaps = 39/200 (19%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 130 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 189
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 190 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 249
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+Q + K E D +E A+LEGYR
Sbjct: 250 LQPISKNEE---------------------------------------DSEEDALLEGYR 270
Query: 181 CKDDGCSGFLLRDSDDKGFT 200
C D C GFLL ++D G T
Sbjct: 271 CNDQKCDGFLLPNADIYGVT 290
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 53/80 (66%), Positives = 67/80 (83%)
Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
PVY+R+YP FHP++GLQ+YTCGKLEW L TE+A+ S+T A +ILRITHGT S FMKEL
Sbjct: 304 FPVYERIYPPFHPMIGLQFYTCGKLEWLLEYTEDALMSLTRAADILRITHGTKSEFMKEL 363
Query: 349 ILKLEEAQAEASYKLSSKDE 368
+ KLEE +AEAS++LS+ DE
Sbjct: 364 LGKLEEVRAEASFRLSAGDE 383
>gi|326496881|dbj|BAJ98467.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 153
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 86/140 (61%), Positives = 111/140 (79%)
Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 288
L S ++ E S YK+IE+L++K YH FS+ L+ TRE L+KI MEL+DW+ AL YC+LT
Sbjct: 11 TLISDTDNSEAGSVYKIIEQLEQKHYHSFSITLLHTRETLLKIYMELQDWQTALMYCRLT 70
Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
IPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+K +T A ++LRITHGT S FMKEL
Sbjct: 71 IPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKLLTRAADVLRITHGTQSQFMKEL 130
Query: 349 ILKLEEAQAEASYKLSSKDE 368
KLEEA+AE S++LSS +E
Sbjct: 131 FGKLEEARAEVSFRLSSGEE 150
>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 282
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 86/131 (65%), Positives = 108/131 (82%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L LRRKLQ+D IPS+ TDNY+LV+AL +H+S++D+ QL+LYAQ+ANLV LIL E
Sbjct: 118 MVRLVLRRKLQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFE 177
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + EI FSK ACNAHTIC+ ELRPLGTGLYPV+SIINHSC+PNAVL+FEGR A VRA
Sbjct: 178 LDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRA 237
Query: 121 VQHVPKGAEVL 131
+Q + K EV+
Sbjct: 238 LQPISKNEEVM 248
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 182/346 (52%), Gaps = 21/346 (6%)
Query: 15 VIPSTTTDNYSL----VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS-INEIAEN 69
+I T TD+ + ++ L +++ ++ E ++AQ+A ++ + + + EI E
Sbjct: 107 IINKTKTDSPGVPGNSIDELQSNLREMPENVKEMFAQLAVVLRMYVGKDVMDDAREIFEL 166
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F ++ CN +IC+ E++ +G G+YP +S+ NHSC PN V VF G VRA+Q++ G E
Sbjct: 167 FGRMTCNTFSICDPEMQYIGIGIYPKMSLFNHSCEPNCVAVFNGLRMEVRAIQNIQPGEE 226
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
+LISY+E + R++ L +QY FTC CPRC D + ++ +C + C
Sbjct: 227 LLISYVEMLAMSSVRKQQLLQQYYFTCKCPRC------QDQTKDGMMMAVKCGNINCKKV 280
Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT----LALTSCGNHQEVVSTYKM 245
+++ D TC+ C E+ K EVN +++ + A+TS QE ++
Sbjct: 281 IIQ-VDGAYETCKVC--CHDNEKDPKFWKEVNKVTQFSEEMLAAITSADQRQEPKEGLRL 337
Query: 246 IEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L Q+++ H + +++ +K + + L W AL TI Y+ +P +HP
Sbjct: 338 AERLLERQQQVLHNNHLFVLKCLDKALDQAVVLRKWNRALRCALQTIEPYKVHFPAYHPS 397
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
LG+QY GKL +L A++++ A IL +THG + P KEL
Sbjct: 398 LGIQYMRIGKLLLYLEKRLAALEALQMAEHILNVTHGKDHPINKEL 443
>gi|414864678|tpg|DAA43235.1| TPA: hypothetical protein ZEAMMB73_073593 [Zea mays]
Length = 342
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 104/157 (66%), Gaps = 6/157 (3%)
Query: 102 SCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
SC+PNAVL+F+GR VRA+Q + K EV ISYIETA T R LK QY FT TCPRC
Sbjct: 182 SCVPNAVLIFDGRTTYVRALQPIDKDEEVSISYIETAAVTKKRNNYLK-QYFFTYTCPRC 240
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+K FDD A+LEG+RCK+ C GFLL +S K +TCQ+CG R EEI K+ SE+
Sbjct: 241 VK--GFDD---DALLEGFRCKNQTCDGFLLPNSGKKAYTCQKCGASRDVEEINKMRSEIL 295
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 258
LS K + S GN E S YK+IE+L++ LYH FS
Sbjct: 296 QLSDKASSFLSSGNKAEAGSIYKIIEQLERNLYHAFS 332
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 182/361 (50%), Gaps = 52/361 (14%)
Query: 18 STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEI----------SINE 65
S++ + Y+L E +H++ + E++ +Q+A+++ L LQ P++ S E
Sbjct: 118 SSSEELYTLEEH-ESHLTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEMGSALPPSCRE 176
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+K+ CN TI + EL+ +G GLYP +S++NH C PN V+VFEG +RAV+ +
Sbjct: 177 PLSLIAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDID 236
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
E+ ISYIET T RQ+ L+EQY FTC C RC R KD
Sbjct: 237 PEDELTISYIETLSLTEDRQRQLEEQYHFTCHCQRC----------------DSRDKD-- 278
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
G +L + K ++ L R E +K ++ +L + L+S G + YK+
Sbjct: 279 --GLMLSGEEGKWRLLKE-ALPR-LEGLKAESNWETLLESCSHLLSSVGGEVPEENLYKL 334
Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
+ + + + L W+EA++Y T+P Y++ YP HP+ G+
Sbjct: 335 -----------------RITDMALDASVHLGRWEEAMSYGVTTLPAYRQYYPDPHPVHGI 377
Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSS 365
Q GKL+ +L + E+A+ + +A EI++ THG + P EL +K++E +E + SS
Sbjct: 378 QLMRVGKLQHYLENIEDALDTFRQAYEIVKFTHGEDHPLTTELTMKMKECHSEMDHHSSS 437
Query: 366 K 366
+
Sbjct: 438 R 438
>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
Length = 441
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 93/336 (27%), Positives = 169/336 (50%), Gaps = 33/336 (9%)
Query: 30 LVAHMSDIDEKQ------LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
L++H+ ++ E Q ++L + + ++ P S+ E+ F ++ N+ +IC+
Sbjct: 113 LISHVDEVQEDQIRLQQFMVLMTTLKSFSENVMSMP--SVEELFVIFGRVCVNSFSICDP 170
Query: 84 ELRPLGTGLY----PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
E+ P+G G+Y S+++HSC PNAV VFEG +R V+ V ++ ISYI+T
Sbjct: 171 EMNPIGVGVYIRQIKPCSVLDHSCRPNAVAVFEGTTLRIRCVEPVDSEQDLRISYIDTLD 230
Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
T TR++ L++QY F C C G+ D ++ I + C C G + + D F
Sbjct: 231 DTTTRRRNLQQQYYFNCLC------GECKDSEKDLIKFSFNCTSVECKGHVTQVPADDRF 284
Query: 200 TCQQCG-------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
+C CG L ++ E+ +KI +N L K+ H+ + +
Sbjct: 285 SCDICGTTVDDVNLNQAAEKAQKIIKHLNELKKQR-------EHESIRMKGLSCVSEMRA 337
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
+ HP++V+ ++ + + +E W +AL +LT+ Y+ Y + HP +G+ + GK
Sbjct: 338 ILHPYNVHFIKLCDLSLDACIETSQWDDALELGRLTVVAYRFYYGELHPSVGILLFKIGK 397
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ +L D E A++ + +A +LR+THG + K+L
Sbjct: 398 ILSYLADRE-ALQFLKQAATVLRVTHGDDHSLYKDL 432
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/318 (31%), Positives = 159/318 (50%), Gaps = 39/318 (12%)
Query: 41 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
+L + +I +L + +S E +K+ CN TI + EL+ +G GLYP +S++N
Sbjct: 209 ELYMKTEIPDLKQRMTSELPLSCREALSIIAKVTCNCFTISDGELQEIGVGLYPSLSLLN 268
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
H C PN V+VF G +RAVQ + G E+ ISYIET T RQK L+EQY FTC C R
Sbjct: 269 HDCRPNCVMVFNGTELQLRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQR 328
Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
C Q+ +L G ++ + KE + ++ +
Sbjct: 329 C-----SSKEQDGVMLSG-----------------------REAAWRQLKEALPRLEG-L 359
Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 280
+ S L SC N + T E + LY ++ + + + L W+E
Sbjct: 360 QLESNWLELLESCSN----LLTSAGEEVPDENLYK------LKITDMALDASVHLGRWEE 409
Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 340
AL++ T+P Y+ Y HP+ G+Q GKL+ LG E+A++S EA +IL++THG
Sbjct: 410 ALSFGLKTLPSYRLYYADPHPVHGVQLMRVGKLQHHLGCIEDALESFKEAFKILKLTHGE 469
Query: 341 NSPFMKELILKLEEAQAE 358
+ P + +L++K+EE ++E
Sbjct: 470 DHPLIVDLMMKMEECRSE 487
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 152/293 (51%), Gaps = 39/293 (13%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E+F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG +RA++++
Sbjct: 170 DIFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAIRNI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
P G E+ I Y++ T RQK LKEQY F C CP C + D +L G
Sbjct: 230 PLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDAD-----MLAGEEQAWK 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
G L++ D ++S+E+ +++ + L + +CGN + Y+
Sbjct: 285 EIQGSLIKIED-----------LQSQEKWEQVLAMCQTL------INNCGNRLPDRNIYQ 327
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ E + + L W++AL Y T+ Y+ YP FHP+ G
Sbjct: 328 -----------------LKMLECAMDACINLSLWEDALLYGSRTLEPYRLYYPGFHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
+Q GKL+ G A++++ +A E++++THG + +++L+L L + +A
Sbjct: 371 VQVMKVGKLQQHQGLYPQALETLKQAFELIKVTHGRDHSLIEDLMLLLGDCEA 423
>gi|413955565|gb|AFW88214.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 529
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)
Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
VL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTCTCPRC+K F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
D E A+LEG+RCK C GFLL +SD
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
E S YK+I++L++ LYH FS L+ T E L+KI +EL+DW AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344
Query: 288 TIPVYQRVY 296
TIPVY++++
Sbjct: 345 TIPVYEKIF 353
>gi|413955566|gb|AFW88215.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 484
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 46/189 (24%)
Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
VL+F+GR A VRA+Q + K EV ISYIETA T R LK QY FTCTCPRC+K F
Sbjct: 211 VLIFDGRTAYVRALQPINKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVK--GF 267
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
D E A+LEG+RCK C GFLL +SD
Sbjct: 268 D---EDALLEGFRCKSQACDGFLLPNSD-------------------------------- 292
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
E S YK+I++L++ LYH FS L+ T E L+KI +EL+DW AL YC+L
Sbjct: 293 --------KAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQDWWTALTYCRL 344
Query: 288 TIPVYQRVY 296
TIPVY++++
Sbjct: 345 TIPVYEKIF 353
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 169/337 (50%), Gaps = 49/337 (14%)
Query: 32 AHMSDIDEKQLLLYAQIANLVNLIL--QWPEI--------SINEIAENFSKLACNAHTIC 81
+H++ + E++ +Q+A ++ L L + P++ S + +K+ CN TI
Sbjct: 130 SHLTSLSEQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSCRDALSLIAKVTCNCFTIS 189
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
+ EL+ +G GLYP +S++NH C PN V+VFEG ++RAV+ + G E+ ISYIET
Sbjct: 190 DGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLLLRAVRGLSPGEELTISYIETLSLN 249
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
RQ+ L++QY FTC C +C D + S +R +L+++
Sbjct: 250 EDRQQRLEDQYCFTCHC-QCCNSPDNDKLMLSGEESSWR---------VLKEA------- 292
Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 261
+ + EE+K ++ ++ L++ G + YK
Sbjct: 293 -----LPTLEELKAKSNWQTLVESCHRLLSTEGKRVPEENLYK----------------- 330
Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
++ + + ++L W EA+ Y Q T+PVY YP HP+ G+Q GKL+ +L E
Sbjct: 331 LRVTDMALDASIQLGLWTEAVEYGQKTLPVYHLYYPDPHPVHGVQLVRVGKLQHYLAHIE 390
Query: 322 NAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
A+ + +A IL++THG ++ +L++K+EE +AE
Sbjct: 391 EALDTFKQAYRILKVTHGNDNSITTDLLMKMEECRAE 427
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 149/283 (52%), Gaps = 17/283 (6%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
S++ CN TICNSE++ +G G+YP +S++NHSC PN F G+ +R +++ G E+
Sbjct: 172 SRVTCNTFTICNSEMQTVGIGVYPGLSLVNHSCSPNCSATFRGKQMQLRIIENTKIGDEL 231
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSG 188
LISYI+ +RQ L+ QY F C C RCI D ++S +++ RC C
Sbjct: 232 LISYIDPMQVLSSRQNQLQSQYCFKCICERCI-----DTTKDSCNNLMDSVRCPKKICKA 286
Query: 189 FLLRDSDDKGFTCQQCGLVRSKE---EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
DS C +CG + + EI+ +++N KT++L G Q + K
Sbjct: 287 ASSLDSLLANKLCPECGSIVDQSFFAEIENFQAQIN----KTISL---GYQQAKLEDLKK 339
Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
+ K+ ++ ++ E L+ +E + ++EAL Y + Y R+YP+++P+ G+
Sbjct: 340 LFIEGKQRLGECNMLYIRIIENLMDAYIESQRYEEALEYARRLEEPYCRLYPRYYPVTGV 399
Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
GKLE +LG A+KS+ +A EIL ++HG + E+
Sbjct: 400 HLMKQGKLECYLGKFGEAVKSLGKAKEILLVSHGKECGLIHEM 442
>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
guttata]
Length = 289
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 154/303 (50%), Gaps = 50/303 (16%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
P I +I F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG ++
Sbjct: 28 PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPNCVIVFEGYQLLL 84
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
+V+ + G E+ ISY+E+ T RQK L QY F C CP C Q D ++ A E
Sbjct: 85 HSVRDIQIGEELTISYVESLMPTRERQKQLMRQYCFECDCPLC--QNQEKDAEKLAG-EV 141
Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
+ K+ ++D+ + + G +SKEE K ++L++ L S H
Sbjct: 142 HAWKE-------VKDAVN------EVGYPKSKEEWK------HVLARCQNLLRSNKGHLP 182
Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LEDWKEALAYCQLTIPVYQR 294
+ Y++ K++ M+ LE W+EAL Y T+ Y+
Sbjct: 183 DTNIYQL---------------------KMLDCAMDACINLESWEEALYYGSRTLEPYRL 221
Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
YP FHPL +Q GKL++ G A++++ +A I+++THGT+ MK L+ EE
Sbjct: 222 YYPGFHPLRAVQLMRVGKLQYSQGMVPQALETLKQAYNIMKVTHGTDHSLMKVLMEMKEE 281
Query: 355 AQA 357
+A
Sbjct: 282 CEA 284
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 165/327 (50%), Gaps = 55/327 (16%)
Query: 41 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
QL L A+I + +L P I +I F+K+ CN TI N E++ +G GLYP +S++N
Sbjct: 152 QLYLRAEIQDASHLP---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 205
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
HSC PN V++FEG ++R+++ + G E+ ISYIE+ T RQK LK QY F C C
Sbjct: 206 HSCDPNCVIIFEGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDC-- 263
Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
C+ Q D ++ A E ++ D + +SKEE +K
Sbjct: 264 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEK----- 304
Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----LE 276
+L+K L+S +H + Y++ KL+ M+ LE
Sbjct: 305 -VLAKCQHLLSSHTSHLPDTNIYQL---------------------KLLDCAMDACINLE 342
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
W++AL Y T+ Y+ YP FHPL +Q GKL++ G A+ ++ +A +I+++
Sbjct: 343 AWEQALCYGSRTLGPYRLYYPDFHPLRAVQLMRVGKLQYSQGMLPQALGTLKQAYDIMKV 402
Query: 337 THGTNSPFMKELILKLEEAQAEASYKL 363
THGT+ M+ L + L+E Q EA KL
Sbjct: 403 THGTDHSLMQAL-MDLKE-QCEAMMKL 427
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 166/333 (49%), Gaps = 7/333 (2%)
Query: 25 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNAHTICN 82
+L+ L A+ +I + + A ++ LQ I+IN+I K +CN+ I N
Sbjct: 121 TLINNLYANKGNISNARKEAFFTFAAVLVEYLQDVNININDIDIYGLMCKASCNSFAITN 180
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
+EL LGTG++ S+ NHSC PN V F GR +RA++ + +G E+++SYI ++
Sbjct: 181 AELNSLGTGIFSSASLFNHSCDPNCVATFNGRDISIRAIKPIAEGEELMLSYISILATSD 240
Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 202
RQ L+E Y+FTC C C + D + +S +C C FL+ + +CQ
Sbjct: 241 VRQLELRESYMFTCKCTVCSRKEVNDSLMKSVKCSQPQC---LCMKFLITAPESNKCSCQ 297
Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
+ S + I K ++ L +++ ++ S ++I ++ L P ++ L+
Sbjct: 298 K-NCEASNDYILKANDCMDKLQALYNSISVVPTIEQQKSLTQLIRCGEEILCSP-NIALL 355
Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
Q E + +E DWK A Y Y+ ++HP LGL Y+ GKL + D N
Sbjct: 356 QCYEVAMDGCIESGDWKGAFQYGIKLECSYKNYLSEYHPTLGLHYFKLGKLALQIEDLRN 415
Query: 323 AIKSMTEAVEILRITHGTNSPFMKELILKLEEA 355
I + + ++L ITHG++ F+++L L+EA
Sbjct: 416 GITYLEKGYKVLSITHGSSCHFVQKLKSYLDEA 448
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 161/322 (50%), Gaps = 55/322 (17%)
Query: 41 QLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
QL L A+I + +L P I +I F+K+ CN TI N E++ +G GLYP +S++N
Sbjct: 151 QLYLRAEIQDASHLS---PAIDFFQI---FTKVTCNCFTISNGEMQDVGVGLYPSMSLLN 204
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
HSC PN V++FEG ++R+V+ + G E+ +SYIE+ T RQ+ LK QY F C C
Sbjct: 205 HSCDPNCVIIFEGYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDC-- 262
Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
C+ Q D ++ A E ++ D + +SKEE +K+ ++
Sbjct: 263 CLCQDQEKDAKKLAGEEP-----------AWKEVKD---AVNEVKYPKSKEEWEKVLAKC 308
Query: 221 -NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME----L 275
++LS T L N+ Q KL+ M+ L
Sbjct: 309 QHLLSSHTSRLP--------------------------DTNIYQL--KLLDCAMDACINL 340
Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
E W++AL Y T+ Y+ YP FHPL +Q GKL++ G A++++ +A +I++
Sbjct: 341 EAWEQALCYGSRTLGPYRLYYPNFHPLRAVQLMRVGKLQYSQGMLPQALETLKQAYDIMK 400
Query: 336 ITHGTNSPFMKELILKLEEAQA 357
+THGT+ M+ L+ E+ +A
Sbjct: 401 VTHGTDHSLMQALMDLKEQCEA 422
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 164/350 (46%), Gaps = 46/350 (13%)
Query: 17 PSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAEN----F 70
PS T+ +E +H+ D+ E++ +Q+++++ L +Q P I+ A +
Sbjct: 114 PSQTSPGLCSLEEHESHLCDMGEERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLL 173
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+KL CN TI + ELR +G GLYP +S++NH C P+ V++FEG +RAV+ + E+
Sbjct: 174 AKLTCNCFTISDGELREIGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEEL 233
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
ISYI T T R+ L+EQY FTC C RC +
Sbjct: 234 TISYIGTLAPTRDRRTQLEEQYHFTCQC--------------------QRCTTADMDPVM 273
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
L CG ++ +++ I + + +C N + + +++ L
Sbjct: 274 L------------CGEEKAWTPLREA-----IPTLENWQTQACWN-ELLYECSSLLDPLD 315
Query: 251 KKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 308
+ N+ + R + + + L W ALAY +T+ Y++ YP HP +Q
Sbjct: 316 GDVPAVPDCNVYRLRVTDLALDACINLAAWDTALAYGTITLEPYRQYYPDPHPAHAIQLM 375
Query: 309 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
KL+ FL +A ++ A +I+++THG P +LI KLEE +AE
Sbjct: 376 RVAKLQHFLVHLGDAQHTLRLAYDIMKVTHGNQHPLTSDLIRKLEECRAE 425
>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
Length = 455
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 175/375 (46%), Gaps = 26/375 (6%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ 57
ML +RR + + TD Y L+ H DI D K++ + + ++ +L
Sbjct: 86 MLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIRKDVKRIEHFHTLNVVLQRLLD 145
Query: 58 WPEISI-NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
P I +E+ F K+ N+ +C+ E+ +GTG+Y SI++HSC PNAV F G
Sbjct: 146 EPAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGASILDHSCRPNAVATFVGEQL 205
Query: 117 VVRAVQHVPKGAE-----VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
+R ++ G E + ISYI+ + TR++ L E+Y F C C RC D
Sbjct: 206 QLRLLEDF-AGPELDFSRIFISYIDLIDPSDTRREQLSERYYFRCECVRC------RDEA 258
Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
E ++ C++ C +RD C+ + + EV ++ LA
Sbjct: 259 ERELMGAAACQNRKCDE-PIRDGQTLCSACE----APFDQSARDRFDEVTSFTRDRLAEM 313
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
+ +V ++ + Q + HP + ++T + ++LE W+ AL Y +
Sbjct: 314 KDVAYFDVC---RLCLEKQSGVLHPLNAQHIKTLDYAFDSAIKLEKWEAALRYGAGAVAG 370
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
Y+R P +PLLGL GK++ +LGD + A+ S+ EA +ILR+THG K+ ++
Sbjct: 371 YRRYSPS-NPLLGLMLANIGKIQLYLGDAKTALSSLHEADKILRVTHGEQHDLYKDQLVP 429
Query: 352 LEEAQAEASYKLSSK 366
L A Y+LS +
Sbjct: 430 L-LCDAAQQYELSQR 443
>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
Length = 453
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 180/378 (47%), Gaps = 21/378 (5%)
Query: 1 MLKLYLRRKLQN-DNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQI-ANLVNLIL 56
M+ +RR L+ D T+ Y L+ H +I D K++ + + L L+
Sbjct: 86 MIARIVRRLLKGGDTHKGYYTSKQYRKFCDLMPHEENIRADSKRMEHFGTLYVVLQRLLD 145
Query: 57 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
+ + E+ + K+ N I ++E+ +GTG+Y SII+HSC PN V+ F+G
Sbjct: 146 EASRPTKAELLRIYGKMCINTFNILDAEMSTIGTGMYIGASIIDHSCRPNVVVSFDGETL 205
Query: 117 VVRAVQHVPKG----AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
+R ++ P+ ++ ISYI+ + RQ+ L E+Y F C C RC D QE
Sbjct: 206 RMRLLEDYPEQELDFGKLFISYIDLIDTAEVRQEQLAERYYFHCACERC------RDEQE 259
Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
+ C + C L ++ C CG + + + A E++ ++ LA
Sbjct: 260 QKRMNAAACPNTTCHEPLDFSDSEQLNQCPACGTAVTHSDREAFA-EISSFTRDHLAQMK 318
Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
+ +V ++ + Q + H ++V+ ++T + ++ + LE W EA Y + +
Sbjct: 319 SVAYLDVS---RLCLEKQANVLHRYNVHHIKTLDNAMESALNLEKWTEATGYGLRLLDGF 375
Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
++ Y +HPLLGL Y GKL+ + A+K + +A +ILR+THG K +++ L
Sbjct: 376 RQYYSTYHPLLGLTYLKVGKLQLYQCQFAEALKQLQQAAKILRVTHGEQDDLYKRVLVPL 435
Query: 353 --EEAQAEASYKLSSKDE 368
+ AQ + + L+ DE
Sbjct: 436 LCDAAQGDLGH-LAIADE 452
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 164/324 (50%), Gaps = 26/324 (8%)
Query: 23 NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHT 79
+Y + L++H S+I DEK++ + + N+++ L+ P S E+ + ++ N+ +
Sbjct: 110 SYRKFKDLMSHCSEIEKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFS 169
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIET 137
I NS++ +G G+Y SI++HSC PNAV FEG V+A++ +P +++ I YI+
Sbjct: 170 IFNSDMN-IGVGIYLGPSILDHSCKPNAVATFEGTTITVKAIEDLPSLDLSQIRIPYIDV 228
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
+T R+ L+ Y F C C RC K + E C + C+ ++D
Sbjct: 229 IKTTRDRRAELQSSYYFWCDCERCEK--------PEPMAEAAACPNKLCTYPCAPNAD-- 278
Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYH 255
C++C + E K+I E++ + L N + + + KM Q+ + H
Sbjct: 279 --LCEKCN-TKFPENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILH 330
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
P +V +QT + + L+ W+EA +Y + I Y Y + HP G+ Y + GKLE
Sbjct: 331 PLNVQYVQTLQTAFDSSINLQHWEEAESYAKKLINGYLTYYGEIHPSTGILYLSIGKLEV 390
Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
+L + AIK++ +A IL ITHG
Sbjct: 391 YLKKLKQAIKTLRKASSILTITHG 414
>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Ornithorhynchus anatinus]
Length = 415
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 136/274 (49%), Gaps = 39/274 (14%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
I E+F+K+ CN TI N E++ +G GLYP +S++NHSC PN V+VFEG +RAV+ +
Sbjct: 145 IFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGTSLFLRAVREIQ 204
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
KG E+ I Y++ + RQK LKEQY F C C RC Q D+ A E +
Sbjct: 205 KGEELTICYLDVLLPSQERQKQLKEQYCFACDCIRCKT--QDKDVDMLAGEEPF------ 256
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
K++ V+ + + L S ++V++T +
Sbjct: 257 ---------------------------WKEVKDAVDTVEE----LQSQKKWEQVLATCQA 285
Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
+ K ++ ++ + + + L W++AL Y + T+ Y+ YP FHP+ +
Sbjct: 286 LINNHKDRIPDRNIYQLKMLDCAMDACINLSLWEDALLYGRRTLDPYRLYYPGFHPIKAI 345
Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
Q GKL+ G A++++ EA ++L++THG
Sbjct: 346 QIMKIGKLQQHQGMFREALETLREAFDLLKVTHG 379
>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 445
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 185/380 (48%), Gaps = 33/380 (8%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDN-YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ 57
M+ +R+ L+ +V+ TD Y L+ H DI DEK++ + + ++ ++
Sbjct: 85 MMSRIIRKLLKGGDVMKGYYTDKCYRRFWDLMPHEEDIKKDEKRMEHFQSLTVVLRSLID 144
Query: 58 WPEISIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
+ N E+ F K+ N+ I + E+ +GTG+Y SI++HSC PNAV +F+G
Sbjct: 145 EAAMPGNQELLRIFGKMCINSFNILDDEMNSIGTGMYLGASIMDHSCRPNAVAIFDGCNL 204
Query: 117 VVRAVQHVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VR ++ GAE+ ISYI+ T R+ L+++Y F C C RC D Q
Sbjct: 205 NVRLLEDY-HGAEIDFSKIFISYIDLLNPTDVRRDMLRKRYYFECGCERC------RDEQ 257
Query: 172 ESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 230
E ++ G C + C + + D+ C +++ ++I++ +
Sbjct: 258 ELKLMNGAACANAQCDEPISMTARQDRCPGCSTAIKQTERDKFREISA---------FTM 308
Query: 231 TSCGNHQEVVSTYKMIEKL----QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
T ++V TY I +L Q+K++HPF+V ++T + + +E+E W++A+ Y
Sbjct: 309 TQLEQMKDV--TYLDICQLCLSKQEKVFHPFNVWYLKTLDLAFESAIEMEKWEDAIDYGS 366
Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK 346
++R FHPL L GK+E +L + A+ ++ E+ +ILRITHG K
Sbjct: 367 RLKDGFKRFNGAFHPLYALLLLKLGKIEIYLKHGKEALMNVNESEKILRITHGEEHDLYK 426
Query: 347 ELILKLEEAQAEASYKLSSK 366
+ ++ L QA Y+ K
Sbjct: 427 KQLIPL-LCQAATEYEQMDK 445
>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
Length = 520
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 56/82 (68%), Positives = 70/82 (85%)
Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 327
L++I MEL+DW+ AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW L TE+A+KS+
Sbjct: 367 LLQIYMELQDWQTALMYCRLTIPVYERVYPPFHPMVGLQFYTCGKLEWLLEYTEDALKSL 426
Query: 328 TEAVEILRITHGTNSPFMKELI 349
T A ++LRITHGT S FMKEL
Sbjct: 427 TRAADVLRITHGTQSQFMKELF 448
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 64/118 (54%), Gaps = 17/118 (14%)
Query: 1 MLKLYLRRKLQNDN-------VIPSTTTD----NYSLVEALVAHMSDIDEKQLL-LYAQI 48
M+KL LRRKLQ++ ++PS D ++ ++ V I +L + A I
Sbjct: 118 MVKLILRRKLQSEKMANLVQLILPSIELDLKEIAHTFSKSFVCTEVGIAYCHVLEILAVI 177
Query: 49 ANLVN--LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
+L L L I+++ F ACNAHTIC+ ELRPLGTGL+P ISIINH L
Sbjct: 178 GSLRPNPLGLGSAVIAVHPTQVQF---ACNAHTICDPELRPLGTGLFPAISIINHRIL 232
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 39/293 (13%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E+F+K+ CN I N E++ +G GLYP +S++NHSC PN V+VFEG +RAV+ +
Sbjct: 183 DIFESFAKVICNGFAISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVFEGPSLFLRAVRDI 242
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
+G E+ I Y++ + RQK LKEQY F C CP C
Sbjct: 243 QQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCET--------------------- 281
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
R DD + ++ ++ +KK+ ++ K L C Q +++ Y
Sbjct: 282 -------RSKDDDMLSGEEQAWKEVQDSLKKV-EDLRAQEKWEQILAIC---QTLINNYG 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++L + ++ ++ + + + L W++AL Y T+ Y+ Y FHP+ G
Sbjct: 331 --DRLPDR-----NIYQLKMLDCAMDACINLCLWEDALLYGSRTLEPYRLYYSGFHPISG 383
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
+Q GKL+ G A++++ +A E++++THG + +++L+L L + +A
Sbjct: 384 VQVMRVGKLQHHQGLYPQALETLKQAFELMKVTHGRDHSLVEDLLLLLGDCEA 436
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E++ K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
+Q GKL+ G A++++ A +I+R+THG ++LIL LEE A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMQNLRLAFDIIRVTHGREHSLTEDLILLLEECDA 423
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 146/293 (49%), Gaps = 39/293 (13%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPNINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
+Q GKL+ G A+K++ A +I+++THG +++LI+ LEE A
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHGREHSLIEDLIVLLEECDA 423
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 164/324 (50%), Gaps = 27/324 (8%)
Query: 24 YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAH 78
Y + L++H SDI DEK++ + + ++ L ++SI AE + ++ N+
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSF 168
Query: 79 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIE 136
I + ++ +G G+Y SI++HSC PNAV FEG ++R + +P +++ ISYI+
Sbjct: 169 NISDLDMNNIGAGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYID 228
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSD 195
+T R++ L+ Y F C C +C +ES ++E C + C+ D+D
Sbjct: 229 VIKTTKDRREELQSSYYFWCNCKKC---------EESEPMVEAAACPNKFCTYPCSLDAD 279
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
C+ C + E K+ E++ L+ L + +V + M K Q+ + H
Sbjct: 280 ----MCENCN-TKFPENFKETFYEISDLTAYHLQNMKNIAYLDVST---MCLKKQEGVLH 331
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
P ++ +QT + + L+ W+EA +Y + I Y Y +FHP G+ Y + GKL+
Sbjct: 332 PLNIQHVQTLQSAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQV 391
Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
+L + AI+++ +A IL ITHG
Sbjct: 392 YLKKLKQAIETLRKASAILTITHG 415
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 146/295 (49%), Gaps = 43/295 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TIC++E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C +RC+
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDC--------------------FRCR-- 267
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 ------TQDKDADMLTGNEQAWKEVQESLKKI-EELKAHRKWEQVLAMC---QTILSSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 318 --EQLP-------DTNIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
G+Q GKL+ G A+K++ A +I+R+THG +++LI LEE A
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREHSLIEDLIRLLEECDA 423
>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
Length = 429
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 162/357 (45%), Gaps = 48/357 (13%)
Query: 15 VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---- 70
+ PS T E +H+ D+ E++ +Q+++++ L L+ + I + +F
Sbjct: 112 LCPSQMTPQLYSFEEHESHLCDMGEEKREGLSQLSSMLQLYLKQEQPDIIQKVPSFDPIS 171
Query: 71 --SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
+K+ CN TI ++EL+ +G GLYP +S++NH C P+ V++F+G+ +RA++ +
Sbjct: 172 LLAKVTCNCFTISDAELQEIGVGLYPSMSLLNHDCRPSCVMLFQGKTLQLRAIRDIQPTE 231
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
EV ISYI T RQ L EQY F+C C C + D +
Sbjct: 232 EVTISYIGVLLPTRERQTQLMEQYHFSCQCGLC-STAELDPLMF---------------- 274
Query: 189 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS-TYKMIE 247
CG+ + +K+ + IL GN +E++ ++
Sbjct: 275 ---------------CGVKEAWTPMKEAIPRLEILQTN-------GNWEELLQECSSLLA 312
Query: 248 KLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
+ + N+ + R + + L W+ ALAY T+ Y++ YP HP +
Sbjct: 313 PVGGAVPAVPDSNVYRLRVTDLAFDACINLARWETALAYGLKTLGPYRQYYPDPHPAHAI 372
Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYK 362
Q KL+ FL E+A ++ A +I++ITHG +LI KLEE AE K
Sbjct: 373 QLMRVAKLQHFLVHLEDAQHTLRLAYDIMKITHGNEHSLTSDLIRKLEECLAEMDCK 429
>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
Length = 391
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 178/372 (47%), Gaps = 27/372 (7%)
Query: 7 RRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R +++ + T ++ V+ +H+ +D EK+ L+ + I+ L + + E E
Sbjct: 20 RSRMRKKSHPERTPSEKLLTVKEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDQE 79
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 80 SLVALFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 139
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---------IKLGQFDDIQES-A 174
G EV SYI+ T R L++ Y FTC C C +K+ + +D ++ A
Sbjct: 140 GPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAKVKIRKLNDPPKAEA 199
Query: 175 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKT----- 227
I + R + F R + + CG R E+ + + V++L KT
Sbjct: 200 IRDMVRYARNVIEEF--RRAKHYKYILDSCGRSPARGAGELGQELNVVDVLLSKTTENGS 257
Query: 228 ---LALTSCGNHQEVVSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEA 281
L L H ++ E Q+K+ F +V ++ + + + + ++DW+ A
Sbjct: 258 LKVLILGHYFGHTPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYIQDWEGA 317
Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
L Y Q I Y + YP + + + G+L L + K++ +A+ I+ + HG +
Sbjct: 318 LRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRAAGEKALRKAMAIMEVAHGKD 377
Query: 342 SPFMKELILKLE 353
P++ E+ ++E
Sbjct: 378 HPYISEIKQEIE 389
>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
terrestris]
Length = 439
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 93/325 (28%), Positives = 160/325 (49%), Gaps = 26/325 (8%)
Query: 23 NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQ-WPEISINEIAENFSKLACNAHT 79
NY + L++H S+I DEK++ + + N+++ L+ P S E+ + ++ N+ +
Sbjct: 110 NYRKFKDLMSHCSEIKKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFS 169
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG--AEVLISYIET 137
I N ++ +G G+Y SI++HSC PNAV FEG V+A++ +P +++ I YI+
Sbjct: 170 IFNLDMN-IGVGIYLGPSILDHSCKPNAVATFEGTTINVKAIEDLPSLDLSQIRIPYIDV 228
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
+ R+ L+ Y F C C +C K + E C + C+ D
Sbjct: 229 IKTAGDRRAELQSSYYFWCDCEKCEK--------PEPMAEAAACPNKLCT----YPCDPN 276
Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--VSTYKMIEKLQKKLYH 255
C++C + E K+I E++ + L N + + + KM Q+ + H
Sbjct: 277 ADLCEKCN-TKFPENFKEIFDEISEFTAYHLE-----NMKNIAYLDVSKMCLSRQEGILH 330
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
P +V +QT + + L+ W+EA +Y + I Y Y + HP G+ Y + GKLE
Sbjct: 331 PLNVQYVQTLQTAFDSSINLQHWEEAESYAKKLINGYLAYYGEIHPSTGILYLSIGKLEV 390
Query: 316 FLGDTENAIKSMTEAVEILRITHGT 340
+L + AIK++ +A IL ITHG
Sbjct: 391 YLKKLKQAIKTLRKASLILTITHGA 415
>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
Length = 441
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 166/344 (48%), Gaps = 37/344 (10%)
Query: 20 TTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACN 76
T N+ + L++H SDI D K++ + + +++ L + +I E+ + ++ N
Sbjct: 103 TEKNFRRFKDLMSHYSDIKVDVKRMEHFTMLCGVLSQFLDETFMPNIAELMGIYGRICTN 162
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--------GA 128
+ I + + +G G+Y S+I+HSC PN ++VFEG +VR + +P
Sbjct: 163 SFNILDINMNTIGVGIYLGASVIDHSCKPNVIVVFEGTTIIVRTLTDLPSLDWSQASIDK 222
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
++ ISY++ S R++ L Y F C C RC K E ++E C + C
Sbjct: 223 DIRISYVDLLNSNKDRREELHSSYYFWCDCERCKK--------EEPMVEAAACPNLLCDS 274
Query: 189 FLLRDSDDKGFTCQQCG---LVRSKEEIKKIAS-EVNILSK-KTLALTSCGNHQEVVSTY 243
++D+ C++C V KE +++ ++ L K KT+A +
Sbjct: 275 PCSIEADE----CEKCNKEISVEFKETFREVVDFTIHHLEKMKTMAY---------LDVS 321
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
K+ K QK + H F++ ++T E M L+ W++A Y + +P Y Y + HPL
Sbjct: 322 KICLKKQKGIMHKFNIQHVRTLEMAHIAAMNLKCWEDAEFYGKELVPGYLLYYGEIHPLT 381
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
GL Y T GK++ L + A +++T+A +L ITHG ++E
Sbjct: 382 GLLYLTVGKIQLHLEKPKEAFQALTKANTVLTITHGDKHSIVEE 425
>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 378
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 142/285 (49%), Gaps = 30/285 (10%)
Query: 73 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEV 130
+ N I +SE++ +G+GLY SII+HSC PNAV VF+G +R ++ +P ++V
Sbjct: 108 MVINTFNILDSEMQSIGSGLYLASSIIDHSCSPNAVAVFKGTTIFIRTLEDIPIMDWSKV 167
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
ISYIE RQ+ L Y F C C +C D ++ C+++ C FL
Sbjct: 168 FISYIELLNLPEIRQQELLSSYYFLCQCSKCT------DSDNLNFMKSIYCQNEKCKNFL 221
Query: 191 LRDSDDKGFTCQQCGLVRSKE---EIKKI----ASEVNILSKKTLALTSCGNHQEVVSTY 243
+ + + C +CG SKE EIK+I SE+N + + +
Sbjct: 222 MPNEVE----CHKCGQSISKEDEDEIKEIIQYTESELNKME-----------NMAYLDIC 266
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
++ + Q+K P ++ ++ + ++ + LE+W+ A+ Y + + + + Y ++HPL
Sbjct: 267 RICLQKQEKYLTPLNIYKVKILDLAMESSIALENWESAVFYGRQLLEPFLKYYGEYHPLR 326
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G+ GK+ + L + + A K EA IL +THG N F +EL
Sbjct: 327 GIFLMKLGKILFLLNNIDEAKKYFKEANLILSVTHGKNHCFSREL 371
>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
Length = 468
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 178/369 (48%), Gaps = 26/369 (7%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++ +
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 174
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V A++ + +++ ISYI+ + R+ LKE Y F C C +C D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + D ++ C +C S + ++ +E L++ L
Sbjct: 289 KEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 339
Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
N ++V + K+ Q ++HP +V ++T + + +E+ W +AL Y Q +P
Sbjct: 340 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 398
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
+++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG + + E L +
Sbjct: 399 FRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDHRLLTEQLYM 458
Query: 351 KLEEAQAEA 359
+ +A+ EA
Sbjct: 459 LVLQARQEA 467
>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
Length = 439
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 181/368 (49%), Gaps = 22/368 (5%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
+ +L LR + D + T L++H ++I D K+L L + A L +++
Sbjct: 86 LCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPKRLEHLDSLHAVLTDMMAD 145
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N + ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 146 SPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 205
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ A++ +P +++ ISYI+ + R++ LK+ Y F C C +C D +E+
Sbjct: 206 LHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC------RDPKEA 259
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + CS + + ++ C++C + S + ++ +E+ L+K L
Sbjct: 260 RQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALTKHNLEAMKD 314
Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
+ +V K+ Q L HP +V ++T + + +E+ W+EAL Y Q +P +Q
Sbjct: 315 VAYLDVC---KVCLDKQAGLMHPLNVWHVKTLDAAFEAAIEVGKWEEALDYGQQLLPGFQ 371
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL- 352
+ Q++PL+GL + GK++ F + A + EA IL +THG + + E + L
Sbjct: 372 MYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRLLVEQLYPLI 431
Query: 353 EEAQAEAS 360
+A+ EA+
Sbjct: 432 FQARQEAN 439
>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
rotundata]
Length = 438
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 168/353 (47%), Gaps = 23/353 (6%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQIAN-LVNLILQ 57
M ++ ++ D+ + T + + L++H SDI DEK++ + + L +
Sbjct: 88 MARIIIKLSQGGDDEVEYYTKTKFRRFKDLMSHYSDIKKDEKKMEHFMFLCGVLFGFLGD 147
Query: 58 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
P + E+ + ++ N+ I + ++ +G G+Y S+++HSC+PNAV FEG
Sbjct: 148 TPMPNSAELMGIYGRICINSFNIFDLDMNSIGVGIYLAPSVVDHSCVPNAVATFEGITLN 207
Query: 118 VRAVQHVPK--GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
+R ++ +P +++ ISYI+ +T R+ L+ Y F C C +C +E +
Sbjct: 208 IRTIEDLPSLDWSQIRISYIDVLKTTKERRSELQSSYYFWCNCKKC---------EEPEL 258
Query: 176 L-EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
+ E C + C+ D + +C +C ++ E K+I EV+ + L
Sbjct: 259 MAEAAACSNKNCTNPCSPDME----SCPECN-IKLLENFKEIFDEVSSFTAHHLQNMK-- 311
Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
+ + KM K Q+ + H ++ +QT + + L+ W+EA Y + I Y
Sbjct: 312 -NMAYLDVSKMCLKKQEGVLHSLNIQHVQTLQTAFDSSISLQHWEEAEFYAKKLIKGYLV 370
Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
Y + HPL G+ GK++ +L + A++++ +A IL ITHG ++E
Sbjct: 371 YYGEVHPLTGILLLMTGKIQLYLEKPKQALEALRKANSILAITHGEQHTLVRE 423
>gi|21430850|gb|AAM51103.1| SD20045p [Drosophila melanogaster]
Length = 382
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 178/369 (48%), Gaps = 26/369 (7%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++ +
Sbjct: 29 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 88
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 89 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 148
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V A++ + +++ ISYI+ + R+ LKE Y F C C +C D +ES
Sbjct: 149 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 202
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + D ++ C +C S + ++ +E L++ L
Sbjct: 203 KEMLAALCPNRNCGAGISVDRNN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 253
Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
N ++V + K+ Q ++HP +V ++T + + +E+ W +AL Y Q +P
Sbjct: 254 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 312
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
+++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG + + E L +
Sbjct: 313 FRKYHGPWNPLLGLLHMKLGKIQLYEGHSKEALHHLEEAQRILTVTHGRDHRLLTEQLYV 372
Query: 351 KLEEAQAEA 359
+ +A+ EA
Sbjct: 373 LVLQARQEA 381
>gi|224033847|gb|ACN35999.1| unknown [Zea mays]
gi|413955568|gb|AFW88217.1| hypothetical protein ZEAMMB73_999877 [Zea mays]
Length = 129
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 56/98 (57%), Positives = 73/98 (74%)
Query: 218 SEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 277
S++ LS K + S GN E S YK+I++L++ LYH FS L+ T E L+KI +EL+D
Sbjct: 3 SKILQLSDKASSFLSSGNKAEAGSIYKIIKQLERNLYHAFSTTLLHTCETLLKIYLELQD 62
Query: 278 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
W AL YC+LTIPVY+RVYP FHP++GLQ+YTCGKLEW
Sbjct: 63 WWTALTYCRLTIPVYERVYPPFHPMIGLQFYTCGKLEW 100
>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
Length = 390
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 163/357 (45%), Gaps = 26/357 (7%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPE 60
++L R + N +T ++ V+ +H+ +D EK+ L+ + I+ L + + E
Sbjct: 53 VRLTARILAKQKNHPETTPSEKLLAVKDFESHLDKLDNEKKDLIQSDISALHHFYSKHLE 112
Query: 61 ISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
N+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VR
Sbjct: 113 FPDNKSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVR 172
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
AVQ + G EV SYI+ T R L++ Y FTC C C
Sbjct: 173 AVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT---------------- 216
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
K + +R D C +VR + + E L C
Sbjct: 217 --KHKDKAKVKIRKLSDPPKAEAICDMVRY---ARNVIEEFRRAKHYKYILYRCPGVAPP 271
Query: 240 VSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
++ E Q+K+ F +V ++ + + + + ++DW+ AL Y Q I Y + Y
Sbjct: 272 SELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHY 331
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
P + + + G+L L + K++ +AVEI+ + HG + P++ E+ ++E
Sbjct: 332 PLYSLNVASMWLKLGRLYMGLENKAAGEKALRKAVEIMEVAHGKDHPYISEIKQEIE 388
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 137/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K LK+QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----ETQDKDADMLTGDEQVWRQVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q R + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINVYQLRVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
Length = 442
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 151/301 (50%), Gaps = 24/301 (7%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
E+ + ++ N+ I + EL+ +GTG+Y S+I+HSC PNAV +F+G + +RA+Q
Sbjct: 154 ELMGMYGRMCINSFNIIDQELQCIGTGMYLGASVIDHSCSPNAVAIFDGPILSIRALQTF 213
Query: 125 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+++ ISYI+ +T RQ L+ Y F C CP+C++ E + C
Sbjct: 214 QYLDWSQIKISYIDILNTTKDRQSELEAAYYFLCKCPKCLE-------PEPPEINAAACP 266
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIA-----SEVNILSKKTLALTSCGNHQ 237
++ C + + G C +C V S+ +K+ +++++ + K LA
Sbjct: 267 NEKCDNHIDTEIITPGDKCAKCDTVVSETFLKRFKEVIEFTDLHLQNMKQLA-------- 318
Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
++ K Q+ + H F++ ++T + + ++ + W A + + + + Y
Sbjct: 319 -YFDVCEICLKKQEGVLHKFNIKHVKTLDLAFQSSIDFQKWDFARKFALELVDAFYKYYG 377
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK-ELILKLEEAQ 356
HP+ GL + GK+ F + + A++ +T+A +IL+I HG S K EL+ L++A+
Sbjct: 378 HVHPITGLLHLKLGKILLFEENDQLALEHLTKAYQILKIIHGVGSHLFKDELVPLLQQAR 437
Query: 357 A 357
A
Sbjct: 438 A 438
>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
Length = 386
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 161/339 (47%), Gaps = 32/339 (9%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
T+++ V+ +H+ +D EK+ L+ + IA L + + E N + F+++ CN
Sbjct: 76 TSSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHLEYPDNCSLVTLFAQVNCN 135
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+
Sbjct: 136 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYID 195
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
T R LK+ Y FTC C C K ++ + E + +D +RD
Sbjct: 196 LLYPTEDRNDRLKDSYFFTCECRECTTKAKDKAKVEIRKLSEPPKAED-------IRD-- 246
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
+V+ + + + L C QE +S+ L+
Sbjct: 247 ----------MVKYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------LFE 286
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 287 DSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYM 346
Query: 316 FLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + +K++ +A+ I+ + HG + P++ E+ ++E+
Sbjct: 347 GLEHKASGVKALKKAIVIMEVAHGKDHPYISEIKKEIED 385
>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
Length = 352
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 94 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 153
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 154 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 190
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 191 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 241
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 242 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 292
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 293 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 329
>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
Length = 466
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 176/369 (47%), Gaps = 26/369 (7%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++ +
Sbjct: 113 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLSDMMAE 172
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 173 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 232
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V A++ + +++ ISYI+ + R+ LKE Y F C C +C D +ES
Sbjct: 233 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 286
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + D + C +C S + ++ +E L++ L
Sbjct: 287 KEMLAALCPNRNCGVGISVDRTN----CPRCDAGISPK-LRNAFNEAMTLTRHNLE---- 337
Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
N ++V + K+ Q ++HP +V ++T + + +E+ W +AL Y Q +P
Sbjct: 338 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 396
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
+++ + ++PLLGL + GK++ + G + A+ + EA IL +THG + + E L L
Sbjct: 397 FRKYHGPWNPLLGLLHMKLGKIQLYEGHAKEALHHLEEAQRILTVTHGRDHRLLTEQLYL 456
Query: 351 KLEEAQAEA 359
+ +A+ EA
Sbjct: 457 LVLQARQEA 465
>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
Length = 390
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 32/278 (11%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
+ K CN I N +LR +G GLYP ++INHSC N V F G +RA+ + G E
Sbjct: 106 YGKTTCNCFAIHNLDLREIGVGLYPQAAMINHSCKSNCVSTFRGPTLQIRALVDIQPGEE 165
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
V SY E T R+ L+ +Y F C CP C+ D AI++ +C C G
Sbjct: 166 VCYSYTEKGNVTHERRDELR-KYFFECQCPHCL------DTDRDAIMKSVKCP--SCQGQ 216
Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKK-----IASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ S D+ C CG E + I E ++L ++
Sbjct: 217 VKPTSSDRYEKCSSCGFTDFTTEFYEDLEIYIHVEFDLLFRENC---------------- 260
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++E K+ HP ++++++ ++LE+W +A+ Y + + P P G
Sbjct: 261 LVE--LDKILHPDNIHVVRILVGAFAASVKLEEWTKAIDYGKRLDRAFGLYLPPNEPDTG 318
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
L YY GK + L D ENA+ S+ +A +L I +G +S
Sbjct: 319 LLYYKMGKAYYHLDDIENAVTSLRKAKTLLSIAYGRDS 356
>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
[Equus caballus]
Length = 358
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 100 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCEPNCSIVFNGPHLLLRAVRDI 159
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 160 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 196
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 197 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 247
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 248 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 298
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 299 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 335
>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
Length = 287
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 29 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 88
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 89 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQT--------------------- 127
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 128 -------QDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 176
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 177 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 227
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 228 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 264
>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
Length = 391
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 161/350 (46%), Gaps = 32/350 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 68 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 127
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 128 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 187
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 188 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------KDK 229
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
+ +R D +VR ++ I++ + S + L C QE +S+
Sbjct: 230 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPPSELLEICELSQEKMSS- 288
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 289 ---------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNV 339
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 340 ASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 389
>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
Length = 386
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 163/357 (45%), Gaps = 41/357 (11%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L++P+
Sbjct: 58 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 116
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 117 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 175
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C K
Sbjct: 176 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT------------------TK 217
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSK----EEIKKIASEVNIL--SKKTLALTSCGNH 236
D + +R D +VR EE ++ IL S + L C
Sbjct: 218 DKDKAKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKYILYNSPPSELLEICELS 277
Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
QE + + ++ +V ++ + + + + ++DW+ AL Y Q I Y + Y
Sbjct: 278 QEKMGS----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHY 327
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
P + + + G+L L D K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 328 PLYSLNVASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 384
>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
Length = 469
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 178/371 (47%), Gaps = 26/371 (7%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAD 174
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V A++ + +++ ISYI+ + R+ LKE Y F C C +C D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + D + C +C S + ++ ++ L++ L
Sbjct: 289 KEMLSALCPNRNCGVGITVDRTN----CPRCDAGISPK-LRNAFNDAMTLTRHNLE---- 339
Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
N ++V + K+ Q ++HP +V ++T + + +E+ W +AL Y Q +P
Sbjct: 340 -NMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWTDALDYGQRLLPG 398
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
+++ + ++PLLGL + GK++ + G ++ A+ + EA IL +THG + + E L +
Sbjct: 399 FRKYHGPWNPLLGLLHMKLGKIQLYEGRSKEALHHLEEAQRILTVTHGRDHRLLTEQLYV 458
Query: 351 KLEEAQAEASY 361
+ +A+ EA++
Sbjct: 459 LILQARQEAAH 469
>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
[Ailuropoda melanoleuca]
Length = 385
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 162/351 (46%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L++P+
Sbjct: 63 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 121
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 122 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 180
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 181 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTTKD-----KDKAKVE-IRKL 234
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D +RD +VR + + + L C QE + +
Sbjct: 235 SDPPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS 282
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 283 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 332
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L D K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 333 VASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 383
>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
Length = 369
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 257
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 258 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 309
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
Length = 432
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 161/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + +A L + L++P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLEKLDNEKRDLIQSDVAALHHFYSKHLEFPDT- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECT-------------------- 268
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+D D ++ E I+++ + ++ + E++
Sbjct: 269 --------TKDKDKAKVEIRKLSDPPKAETIREMVRYARNVIEEFRRAKHYKSPSELLEI 320
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 321 CELSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L D K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYMGLEDKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
leucogenys]
Length = 369
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRC----------------------- 207
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYATRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 182 DVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 241
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 242 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 278
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 279 -----QTQDKDADMLTGDEHVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSNS 329
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 330 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGSHPV 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 381 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 417
>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
Length = 349
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 27 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 86
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 87 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 146
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 147 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 188
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R D +VR + + + L C QE +S+
Sbjct: 189 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 246
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 247 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 298
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 299 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 347
>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
Length = 433
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ I HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431
>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
Length = 373
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 115 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 174
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 175 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 211
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 212 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 262
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 263 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 313
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 314 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 350
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QTIISSN- 316
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGMRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 157/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKELIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 231 NPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------KDK 272
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R +D +VR + + + L C QE +S+
Sbjct: 273 DKAKVEIRKLNDPPKAETVRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
aries]
Length = 369
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 170
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 207
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSNA 258
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 309
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
Length = 369
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 170
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 171 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 207
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 208 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 258
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 259 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 309
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 310 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLIVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ I HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431
>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
gorilla]
Length = 384
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 62 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 121
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 122 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 181
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 182 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 223
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R D +VR + + + L C QE +S+
Sbjct: 224 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 281
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 282 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 333
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 334 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382
>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
Length = 380
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 58 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 117
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 118 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 177
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 178 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 219
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R D +VR + + + L C QE +S+
Sbjct: 220 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 277
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 278 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 329
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 330 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QSIISSN- 316
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ YP HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFYPGNHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
Length = 433
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLIVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
++RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEVIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 172 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 231
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 232 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 268
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 269 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 319
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 320 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 370
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 407
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 231 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 267
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 318
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 369
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 370 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 406
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 206 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 265
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 266 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 302
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 303 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 353
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 354 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 404
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 405 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 441
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
Length = 384
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 158/337 (46%), Gaps = 30/337 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
T ++ V+ +H+ +D EK+ L+ + IA L + + E N+ + F+++ CN
Sbjct: 74 TPSEKLLAVKEFESHLEKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCN 133
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+
Sbjct: 134 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGEEVFTSYID 193
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R L++ Y FTC C C ++ A +E R D +RD
Sbjct: 194 LLYPTEDRNDRLRDSYFFTCECQECTTRD-----KDKAKVE-IRKLTDPPKAEAIRD--- 244
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
+VR + + + L C QE +S+ ++
Sbjct: 245 ---------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFED 285
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 286 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 345
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 346 LENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382
>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
boliviensis]
Length = 433
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSERLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
Length = 433
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
Length = 380
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 58 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 117
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 118 SLIVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 177
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 178 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 219
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R D +VR + + + L C QE +S+
Sbjct: 220 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 277
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 278 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 329
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 330 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 378
>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
Length = 433
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSRHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALLYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
mutus]
Length = 383
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 166/350 (47%), Gaps = 32/350 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 60 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 119
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 120 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 179
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R +D
Sbjct: 180 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 233
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
+RD +VR ++ I++ + S + L C QE +S
Sbjct: 234 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPPSELLEICELSQEKMSC- 280
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 281 ---------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNV 331
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L + +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 332 ASMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 381
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
mulatta]
Length = 433
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
Length = 433
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 231 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 285 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
Length = 385
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 165/341 (48%), Gaps = 29/341 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
T ++ V+ +H+ +D EK+ L+ IA L + L++P+ + + F+++
Sbjct: 68 TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHLEYPDNAA--LVVLFAQVN 125
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ + G EV SY
Sbjct: 126 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEIEPGEEVFTSY 185
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
I+ T R L++ Y F C C C+ + D ++ I R +D S +RD
Sbjct: 186 IDLLYPTEDRNDRLRDSYFFNCDCRECVTKEK--DKEKLEI----RKLNDPPSAETVRD- 238
Query: 195 DDKGFTCQQCGLVRSK-EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
R+ EE ++ IL TLA E++ ++ +
Sbjct: 239 --------MIKYARNVIEEFRRAKHYKYILCLTTLAC-------ELLEICELSLDKMGAV 283
Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
+ +V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 284 FEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVASMWLKLGRL 343
Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + +K++ A+ I+ + HG + P++ E+ +LE+
Sbjct: 344 YMALENRAAGVKALKRAIAIMEVAHGKDHPYISEIKKELED 384
>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
Length = 384
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 62 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFLDND 121
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 122 SLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 181
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C KD
Sbjct: 182 KPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TKDK 223
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ +R D +VR + + + L C QE +S+
Sbjct: 224 DKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 281
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 282 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 333
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 334 SMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 382
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSNRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTQD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 143/277 (51%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
D++ + Q E +KKI E+ K L C Q ++S+
Sbjct: 279 ----------DEQVWKGVQ-------ESLKKI-EELKTHWKWEQVLAMC---QSIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
familiaris]
Length = 523
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/349 (23%), Positives = 158/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E +E
Sbjct: 201 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDSE 260
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 261 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 320
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C
Sbjct: 321 SPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTT--------------------- 359
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D ++ E I+ + + ++ + E++ +
Sbjct: 360 -------KDKDKAKVEIRKLSDPPKAETIRDMVRYARNVIEEFRRAKHYKSPSELLEICE 412
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
+ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 413 LSQEKMGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVA 472
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L D +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 473 SMWLKLGRLYMGLEDKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 521
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 136/277 (49%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ T+CN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 318 --ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 369
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 111 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDI 170
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++T ++ R+K L++QY F C C RC Q D
Sbjct: 171 EVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRC----QTQD--------------- 211
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E++ K EV K L + ++V++ +
Sbjct: 212 -------KDAD----------MLTGDEQVWK---EVQKSLKTIEELKAHWKWEQVLAMCQ 251
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I + ++ ++ + + + L ++AL Y T+ Y+ +P HP+ G
Sbjct: 252 AIISSNSERLPDINIYQLKVLDCAMDACIHLGLLEKALFYGTRTMEPYRIFFPGSHPVRG 311
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 312 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 346
>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
Length = 433
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 153/337 (45%), Gaps = 30/337 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
T ++ V+ +H+ +D EK+ L+ + I+ L + E NE + F+++ CN
Sbjct: 123 TPSEKLLAVKEFESHLDKLDNEKKDLIQSDISALHRFYSKHLEFPDNESLVTLFAQVNCN 182
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+
Sbjct: 183 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQAISPGEEVFTSYID 242
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R L++ Y FTC C C KD + +R +D
Sbjct: 243 LLYPTEDRNDRLRDSYFFTCECQECTT------------------KDKDKAKVKIRKLND 284
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
+VR + + + L C QE +S+ ++
Sbjct: 285 PPKADAVRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS----------VFED 334
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + +DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 335 SNVYMLHMMYQAMGVCLYTQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 394
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 395 LENRAAGEKALRKAMAIMEVAHGKDHPYISEIKQEIE 431
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 138/275 (50%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC + Q+
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC-------ETQD------------ 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 TIISSNSERLPDINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 135/277 (48%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ + Y++ ++ R+K L+ QY F C C RC
Sbjct: 230 ETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC----------------------- 266
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 267 -----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSN- 316
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 317 -AERLP-------DINIYQLKVLDCAMDACIHLGLLEEALFYGLRTMEPYRIFFPGSHPV 368
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 369 RGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
Length = 461
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 168/361 (46%), Gaps = 25/361 (6%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
+ +L LR + D + T L++H ++I D K+L L + A L ++++
Sbjct: 102 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPKRLEHLESLHAVLTDMMVD 161
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N I + E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 162 SPSTVPNKTELMSIYGRLITNGFNILDGEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 221
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ A++ +P +++ ISYI+ + R+ LKE Y F C C +C D QE
Sbjct: 222 LHIYAIEDMPCLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DRQEQ 275
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQC--GLVRSKEEIKKIASEVNILSKKTLALT 231
+E C C + + K C+ C G+ +++ +EV ++K L
Sbjct: 276 REMEAAVCPFRKCGSSI----NVKWTHCRNCQAGIT---PKLRNTYNEVMAMTKHHLESM 328
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
+ +V K+ Q + HP +V ++T + + +E+ W EAL Y Q +
Sbjct: 329 KDVAYLDVC---KVCLDKQTGVLHPLNVWHVKTLDAAFEAAIEVGKWPEALNYGQQLLQG 385
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
+++ +HPLLGL + GK++ + G + A+ EA IL +THG + E +L
Sbjct: 386 FRKYNGPWHPLLGLLHMKLGKIQLYEGQCKEAMHHFEEARRILCVTHGREHRLIGEQLLP 445
Query: 352 L 352
L
Sbjct: 446 L 446
>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
cuniculus]
Length = 394
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 159/339 (46%), Gaps = 34/339 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
T ++ VE +H+ +D EK+ L+ + I+ L + L++P+ + + F+++
Sbjct: 84 TPSEKLLAVEEFESHLDKLDNEKKDLIQSDISALHHFYSKHLEFPDH--DSLVVLFAQVN 141
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SY
Sbjct: 142 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSY 201
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
I+ T R + L++ Y FTC C C ++ A +E R D +RD
Sbjct: 202 IDLLYPTEDRNERLRDSYFFTCECLECTTKD-----KDKAKVEIRRLSDPP-KAEAIRD- 254
Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
+VR + + L C QE + + ++
Sbjct: 255 -----------MVRYARNVIEEFRRAKHYKSPPELLEICELSQEKMGS----------VF 293
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 294 EDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLY 353
Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
L + K++ +A+ I+ + HG + P++ E+ +LE
Sbjct: 354 LGLENKAAGEKALRKAIAIMEVAHGKDHPYISEIKQELE 392
>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
Length = 536
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 160/349 (45%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 214 RILAKQKIYPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDND 273
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 274 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 333
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R D
Sbjct: 334 KTGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLSD 387
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S+
Sbjct: 388 PPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS-- 433
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 434 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLNVA 485
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 486 SMWLKLGRLYMGLEYKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 534
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/351 (24%), Positives = 159/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L L++P+
Sbjct: 111 RILAKQKIHPERTPSEMLLAVKEFESHLDKLDNEKKDLIQSDIAMLHQFYSKHLEFPDH- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
V G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT------------------- 269
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
RD D ++ E I+ + + ++ + E++
Sbjct: 270 ---------RDKDKAKVEIRKLSNPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ ++ ++ +V ++ + + + + L+DW+ AL Y Q I Y R YP +
Sbjct: 321 CELSQEKMSAVFEDSNVYMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSRHYPLYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYLGLENRVAGERALRKALAIMEVAHGKDHPYISEIKREIE 431
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 170
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 171 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R +D
Sbjct: 231 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 284
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S
Sbjct: 285 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC-- 330
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 331 --------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNVA 382
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L + +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 383 SMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 141/275 (51%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDV 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 230 EVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD-KDADMLTG------ 278
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
D++ + +E +K I E+ K L C Q ++S+
Sbjct: 279 ----------DERVWK-------EVQESLKHI-EELKAHWKWEQVLAMC---QAIISSNS 317
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
E+L ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 318 --ERLPD-----LNIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 405
>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
Length = 448
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 142/292 (48%), Gaps = 20/292 (6%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V A++ +
Sbjct: 164 ELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHAIEDL 223
Query: 125 P--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
P +++ ISYI+ + R+ LKE Y F C C +CI D QE + C
Sbjct: 224 PCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAAVCP 277
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV--V 240
+ C + D CQ C E++ +E+ L++ +L + +EV +
Sbjct: 278 NASCDASVNIDL----AKCQSCD-ASVTPELRSAYNEIMSLTQSSL-----DSMKEVAYL 327
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
K+ Q+ + HP +V ++T + + + + W +AL Y Q +P + + + ++
Sbjct: 328 DVCKVCLAKQRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALEYGQQLLPGFAKYHGAWN 387
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
PLLGL + GK++ + + + A+ + EA IL +THG + + E + L
Sbjct: 388 PLLGLLHMKLGKIQLYERNYKQAVHHLQEAQRILNVTHGRDHRLLLEQLRPL 439
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 158/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V +H+ +D EK+ L+ + IA L L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 268
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+D D ++ E I+ + + ++ + E++
Sbjct: 269 --------TKDKDKAKVEVRKLSSPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 321 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
Length = 395
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 161/349 (46%), Gaps = 31/349 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ + IA L + + E N+
Sbjct: 73 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDND 132
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ +
Sbjct: 133 SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEI 192
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C ++ A +E R +D
Sbjct: 193 HPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKLND 246
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+RD +VR + + + L C QE +S
Sbjct: 247 PPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC-- 292
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 293 --------VFEDTNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLNVA 344
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ G+L L + +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 345 SMWLKLGRLYMGLENKAAGERALKKAIAIMEVAHGKDHPYISEIKQEIE 393
>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
Length = 402
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 155/338 (45%), Gaps = 30/338 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
T+++ V+ +H+ +D EK+ L+ + I+ L + + E N + F+++ CN
Sbjct: 92 TSSEKLLAVKEFESHLDKLDNEKRELIQSDISALHHFYSKHIEYPDNASLVTLFAQVNCN 151
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+
Sbjct: 152 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEVFTSYID 211
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R LK+ Y FTC C C + D
Sbjct: 212 LLYPTEDRNDRLKDSYFFTCECRECT----------------------------TKAKDK 243
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
++ E I+ + + ++ E++ ++ ++ L+
Sbjct: 244 AKVEIRKLSEPPKAESIRDMVKYARNVIEEFRRAKHYKTPSELLEICELSQEKMGSLFED 303
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 304 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 363
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + +K++ +A+ I+ + HG + P++ E+ ++E+
Sbjct: 364 LEHKASGVKALKKAIAIMEVAHGKDHPYISEIKKEIED 401
>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
anatinus]
Length = 412
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 160/338 (47%), Gaps = 30/338 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACN 76
T ++ V+ +H+ +D EK+ L+ + IA L + + E N + F+++ CN
Sbjct: 102 TPSEKLLAVKEFESHLDKLDNEKRELIQSDIAALHHFYSKHIEYPDNASLVVLFAQVNCN 161
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G E+ SYI+
Sbjct: 162 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEINPGDEIFTSYID 221
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R L++ Y FTC C CI ++ A +E R +D +RD
Sbjct: 222 LLYPTEDRNDRLRDSYFFTCECRECITKE-----KDKAKVE-IRKLNDPPKAEAIRD--- 272
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
+++ + + + L C QE + + ++
Sbjct: 273 ---------MIKYARNVIEEFRRAKHYKSPSELLEICELSQEKMGS----------VFED 313
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 314 SNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMG 373
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + +K++ +A+ I+ + HG + P++ E+ ++E+
Sbjct: 374 LENKSAGVKALKKALTIMEVAHGKDHPYISEIKKEIED 411
>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
Length = 386
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 160/346 (46%), Gaps = 35/346 (10%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + +A L + L++P+
Sbjct: 64 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDVAALHHFYSKHLEFPDS- 122
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 123 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 181
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 182 EINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQECTTKD-----KDKAKVE-IRKL 235
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D +RD +VR + + + L C QE +S+
Sbjct: 236 SDPPKAETIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 283
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 284 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLN 333
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + G+L L + +++ +A+ I+ + HG + P++ E+
Sbjct: 334 VASMWLKLGRLHMGLENKAAGERALKKAIAIMEVAHGKDHPYISEI 379
>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
griseus]
Length = 404
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 159/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L L++P+ S
Sbjct: 82 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHQFYSRHLEFPDHS 141
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 142 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 199
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 200 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 239
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+D D ++ E ++ + + ++ + E++
Sbjct: 240 --------TKDKDKAKVEIRKLSSPPQAEAVRDMVRYARNVIEEFRRAKHYKSPSELLEI 291
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 292 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 351
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 352 VASMWLKLGRLYMGLENKAAGQKALKKAIAIMEVAHGKDHPYISEIKQEIE 402
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 137/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q+
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC-------PTQD------------ 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I + ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIINSNSERLPDINIYQLKVLDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 136/275 (49%), Gaps = 39/275 (14%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC----QTHD--------------- 270
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D+D ++ E+I K EV KK L + ++V++ +
Sbjct: 271 -------KDAD----------MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQ 310
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
I ++ ++ + + + L +EAL Y T+ Y+ +P HP+ G
Sbjct: 311 AIISSNSDRLPDINIYQLKVLDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRG 370
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+Q GKL+ G A+K++ A +I+++THG
Sbjct: 371 VQVMKVGKLQLHQGMFPQAMKNLRLAFDIMKVTHG 405
>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
Length = 433
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 155/339 (45%), Gaps = 34/339 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
T ++ V +H+ +D EK+ L+ + IA L L++P+ S + F+++
Sbjct: 123 TPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVN 180
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SY
Sbjct: 181 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSY 240
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
I+ T R L++ Y FTC C C +D
Sbjct: 241 IDLLYPTEDRNDRLRDSYFFTCECRECT----------------------------TKDK 272
Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
D ++ E I+ + + ++ + E++ ++ ++ ++
Sbjct: 273 DKAKVEIRKLSNPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVF 332
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 333 EDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLY 392
Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
L + K++ +A+ I+ I HG + P++ E+ ++E
Sbjct: 393 MGLENKAAGEKALKKAIAIMEIAHGKDHPYISEIKQEIE 431
>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
Length = 468
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 173/369 (46%), Gaps = 26/369 (7%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++
Sbjct: 115 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAD 174
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N + ++++ + T +Y +SI +HSC PNAV FEG
Sbjct: 175 SPSTVPNKTELMSIYGRLITNGFNVLDTDMNSIATAIYLGVSITDHSCQPNAVATFEGNE 234
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V A++ + +++ ISYI+ + R+ LKE Y F C C +C D +ES
Sbjct: 235 LHVHAIEDMECLDWSKIFISYIDLLNTPEQRRLDLKEHYYFLCVCSKCT------DAKES 288
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + D + C +C S K+ + N L L +
Sbjct: 289 KEMLAALCPNRNCGVGISVDRTN----CPRCDAGISP----KLRNAFN--DAMALTLHNL 338
Query: 234 GNHQEV--VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
N ++V + K+ Q ++HP +V ++T + + +E+ W +AL Y Q +P
Sbjct: 339 ENMKDVAYLDVCKVCLDKQTGVFHPLNVWYVKTLDAAFEAAIEVGKWSDALDYGQRLLPG 398
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE-LIL 350
+++ + ++PLLGL + GK++ + G + A+ + EA IL +THG + + E L L
Sbjct: 399 FRKYHGPWNPLLGLVHMKLGKIQLYTGQCKEALHHLEEAQRILTVTHGRDHRLLTEQLYL 458
Query: 351 KLEEAQAEA 359
+ +A+ +A
Sbjct: 459 LILQARQQA 467
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 161/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L +P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D +RD +VR + + + L C QE +S+
Sbjct: 283 SDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 330
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=HSKM-B; AltName: Full=Histone methyltransferase
SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
AltName: Full=SET and MYND domain-containing protein 2
gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
Length = 433
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 161/351 (45%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L +P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D +RD +VR + + + L C QE +S+
Sbjct: 283 SDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 330
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
Length = 417
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 157/351 (44%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L +P+
Sbjct: 95 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 153
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 154 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 212
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C K
Sbjct: 213 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT------------------TK 254
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D + +R D +VR + + + L C QE +S+
Sbjct: 255 DKDKAKVEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSS 314
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 315 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYSKHYPLYSLN 364
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 365 VASMWLKLGRLYMGLEHKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEME 415
>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
Length = 433
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 163/351 (46%), Gaps = 35/351 (9%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L++P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKRDLIQSDIAALHHFYSKHLEFPDS- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSXIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 229 EIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDKAKVE-IRKL 282
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+D +RD +VR + + + L C QE +S
Sbjct: 283 NDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSC 330
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 331 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLN 380
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + +++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 381 VASMWLKLGRLYMGLENKAAGERALRKAIAIMEVAHGKDHPYISEIKQEIE 431
>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
anubis]
Length = 258
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 134/274 (48%), Gaps = 39/274 (14%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+ E K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 1 MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
G E+ I Y++ ++ R+K L++QY F C C RC Q D
Sbjct: 61 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC----QTQD---------------- 100
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
+D+D ++ E++ K EV KK L + ++V++ +
Sbjct: 101 ------KDAD----------MLTGDEQVWK---EVQESLKKIEELKAHWKWEQVLAMCQA 141
Query: 246 IEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
I + +V ++ + + + L +EAL Y T+ Y+ +P HP+ G+
Sbjct: 142 IISSNSERLPDINVYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGV 201
Query: 306 QYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
Q GKL+ G A+K++ A +I+R+THG
Sbjct: 202 QVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/324 (27%), Positives = 158/324 (48%), Gaps = 53/324 (16%)
Query: 24 YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAE---NFSKLACNAH 78
Y + L++H SDI DEK++ + + ++ L ++SI AE + ++ N+
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFL--GDMSIPNSAELMGIYGRIYINSF 168
Query: 79 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLISYIE 136
I + ++ +G G+Y SI++HSC PNAV FEG ++R + +P +++ ISYI+
Sbjct: 169 NISDLDMNNIGAGIYLGPSILDHSCKPNAVATFEGTTIIIRTTEDLPCLDLSQIRISYID 228
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFLLRDSD 195
+T R++ L+ Y F C C +C +ES ++E C +
Sbjct: 229 VIKTTKDRREELQNSYYFWCNCKKC---------EESEPMVEAAACPN------------ 267
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
R+ EI + + ++ + K +A VST M K Q+ + H
Sbjct: 268 ------------RTFYEISDLTA-YHLQNMKNIAYLD-------VST--MCLKKQEGVLH 305
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
P ++ +QT + + L+ W+EA +Y + I Y Y +FHP G+ Y + GKL+
Sbjct: 306 PLNIQHVQTLQSAFDSSLTLQHWEEAESYAKRLINGYLTYYGEFHPSTGILYLSIGKLQV 365
Query: 316 FLGDTENAIKSMTEAVEILRITHG 339
+L + + AI+++ +A IL ITHG
Sbjct: 366 YLKNXKQAIETLRKASAILTITHG 389
>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
abelii]
gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
Length = 258
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 133/276 (48%), Gaps = 43/276 (15%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+ E K+ CN+ TICN+E++ +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 1 MEEEEEKVICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIE 60
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 61 VGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC------------------------ 96
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 97 ----QTQDKDADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS- 147
Query: 246 IEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
E+L +N+ Q + + + + L +EAL Y T+ Y+ +P HP+
Sbjct: 148 -ERLP-------DINIYQLKVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVR 199
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A+K++ A +I+R+THG
Sbjct: 200 GVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 235
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 131/277 (47%), Gaps = 43/277 (15%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++ E F+K+ CN+ TICN+E + +G GLYP IS++NHSC PN +VF G ++RAV+ +
Sbjct: 171 DLFEAFAKVICNSFTICNAEXQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDI 230
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 231 EVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRC----------------------- 267
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+D D T + +E +KKI E+ K L C Q ++S+
Sbjct: 268 -----QTQDKDADXLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAXC---QAIISSNS 318
Query: 245 MIEKLQKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
E+L +N+ Q + + + L +EAL Y T Y+ +P HP+
Sbjct: 319 --ERLP-------DINIYQLKVLDCAXDACINLGLLEEALFYGTRTXEPYRIFFPGSHPV 369
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
G+Q GKL+ G A K++ A +I R+THG
Sbjct: 370 RGVQVXKVGKLQLHQGXFPQAXKNLRLAFDIXRVTHG 406
>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
Length = 326
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 149/319 (46%), Gaps = 27/319 (8%)
Query: 38 DEKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYP 94
+EK+ L+ + IA L L++P+ S + F+++ CN TI + EL LG+ ++P
Sbjct: 30 NEKKDLIQSDIAALHQFYSRHLEFPDHS--SLVVLFAQVNCNGFTIEDEELSHLGSAIFP 87
Query: 95 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y F
Sbjct: 88 DVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFF 147
Query: 155 TCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK 214
TC C C KD + +R K + Q VR
Sbjct: 148 TCECRECT------------------TKDKDKAKVEIR----KLSSPPQAEAVRDMVRYA 185
Query: 215 KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILME 274
+ E +K S E++ ++ ++ ++ +V ++ + + + +
Sbjct: 186 RNVIEEFRRAKHYKYNLSVSPPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLY 245
Query: 275 LEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 334
++DW+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+
Sbjct: 246 MQDWEGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGQKALKKAIAIM 305
Query: 335 RITHGTNSPFMKELILKLE 353
+ HG + P++ E+ ++E
Sbjct: 306 EVAHGKDHPYISEIKQEIE 324
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 159/350 (45%), Gaps = 31/350 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ IA L + + E N
Sbjct: 114 RILAKQKIHPERTQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNA 173
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +
Sbjct: 174 ALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEI 233
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C + D ++ I R +D
Sbjct: 234 EPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTM--KEKDKEKLKI----RKLND 287
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
S +RD ++ K S +L L+L G
Sbjct: 288 PPSAEAVRDMIKYARN-----VIEEFRRAKHYKSPSELLEICELSLDKMG---------- 332
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 333 -------AVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVA 385
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ G+L L + K++ +A+ I+ + HG + P++ E+ +LE+
Sbjct: 386 SMWLKLGRLYMALENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 435
>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
Length = 448
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 150/295 (50%), Gaps = 20/295 (6%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
++NE+ + +L N I ++E+ + TG+Y +SI +HSC PNAV FEG V A+
Sbjct: 161 NMNELMSIYGRLITNGFNILDAEMNSIATGIYLGVSITDHSCQPNAVATFEGNELHVHAI 220
Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
+ +P +++ ISYI+ + R+ LKE Y F C C +CI D QE +
Sbjct: 221 EDLPCLDWSKIYISYIDLLNTPEQRRADLKEHYYFLCVCSKCI------DPQEMHEMTAA 274
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
C + C + D C+ CG S E+++ SEV L++ +L + +EV
Sbjct: 275 VCPNGSCDASVNIQLD----KCESCGTPVS-EQLRTDYSEVMALTQSSL-----DSMKEV 324
Query: 240 --VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
+ K+ + + HP +V ++T + + +++ W EAL Y + +P + + +
Sbjct: 325 AYLDVCKVCLAKHRGILHPLNVWHVKTLDAAFEASIQVGKWTEALDYGRQLLPGFAKYHG 384
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
+++PLLGL GK+E + + + A++ + EA IL +THG + + E + L
Sbjct: 385 EWNPLLGLLQLKLGKIELYERNYKQALQHLQEAQRILNVTHGRDHRLLLEQLRPL 439
>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
Length = 436
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/350 (23%), Positives = 157/350 (44%), Gaps = 31/350 (8%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE 65
R L + P T L V+ +H+ +D EK+ L+ IA L + + E N
Sbjct: 114 RILAKQKIHPERTQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNA 173
Query: 66 -IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ +
Sbjct: 174 ALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVKEI 233
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G EV SYI+ T R L++ Y FTC C C
Sbjct: 234 EPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECT---------------------- 271
Query: 185 GCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+++ D + ++ S E ++ + + ++ E++ +
Sbjct: 272 ------MKEKDKEKLKIRKLNDPPSAEAVRDMIKYARNVIEEFRRAKHYKPPSELLEICE 325
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
+ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP + +
Sbjct: 326 LSLDKMGAVFEDSNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKHYPSYSLNVA 385
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ G+L L + K++ +A+ I+ + HG + P++ E+ +LE+
Sbjct: 386 SMWLKLGRLYMALENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 435
>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
carolinensis]
Length = 437
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 152/329 (46%), Gaps = 30/329 (9%)
Query: 27 VEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACNAHTICNSE 84
V+ +H+ +D EK L+ + I+ L + + E N + F+++ CN TI + E
Sbjct: 135 VKEFESHLDKLDNEKMELIQSDISALHHFYSKHIEYPDNAALVVLFAQVNCNGFTIEDEE 194
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E+ SYI+ T R
Sbjct: 195 LSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIEPGDEIFTSYIDLLYPTEDR 254
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
LK+ Y FTC C CI + D E L ++ +RD
Sbjct: 255 NDRLKDSYFFTCDCRECITKVKDKDKLEIRKL------NEPPPAEAVRDMIKYARN---- 304
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
++ K S +L L+L G+ ++ +V ++
Sbjct: 305 -VIEEFRRAKHYKSPSELLEICELSLEKMGS-----------------VFAESNVYMLHM 346
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ + + + ++DW+ AL Y Q I Y + YP + + + G+L L + +
Sbjct: 347 MYQAMGVCLYMQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENRTAGV 406
Query: 325 KSMTEAVEILRITHGTNSPFMKELILKLE 353
K++ +A+ I+ I HG + P++ E+ +LE
Sbjct: 407 KALKKAIAIMEIAHGKDHPYVIEIKKELE 435
>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
Length = 251
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 129/267 (48%), Gaps = 43/267 (16%)
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y
Sbjct: 3 CNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICY 62
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
++ ++ R+K L++QY F C C RC +D
Sbjct: 63 LDMLMTSEERRKQLRDQYCFECDCFRCQT----------------------------QDK 94
Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
D T + +E +KKI E+ K L C Q V+S+ E+L
Sbjct: 95 DADMLTGDEQVWKEVQESLKKI-EELKAHWKWEQVLAMC---QAVISSNA--ERLP---- 144
Query: 255 HPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q GK
Sbjct: 145 ---DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMKVGK 201
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHG 339
L+ G A+K++ A +I+R+THG
Sbjct: 202 LQLHQGMFPQAMKNLRLAFDIMRVTHG 228
>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
gallopavo]
Length = 575
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
T ++ V+ +H+ +D EK+ L+ IA L + + E N + F+++ CN
Sbjct: 265 TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVVLFAQVNCN 324
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ + G EV SYI+
Sbjct: 325 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGILAEVRAVKEIEPGEEVFTSYID 384
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R L++ Y FTC C C + D ++ I R +D S +RD
Sbjct: 385 LLYPTEDRNDRLRDSYFFTCDCRECTM--KEKDKEKLKI----RKLNDPPSAEAVRD--- 435
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
++ K S +L L+L G ++
Sbjct: 436 --MIKYARNVIEEFRRAKHYKSPSELLEICELSLDKMG-----------------AVFED 476
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 477 SNVYMLHMMYQAMGVCLYVQDWEGALRYGQKIIRPYSKYYPSYSLNVASMWLKLGRLYMA 536
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + K++ +A+ I+ + HG + P++ E+ +LE+
Sbjct: 537 LENRPAGDKALKKAIAIMEVAHGKDHPYISEIKKELED 574
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 162/351 (46%), Gaps = 18/351 (5%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V +H+ +D EK+ L+ + IA L L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C ++ A +E R
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD-----KDKAKVE-VRKL 282
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+RD ++ K +++ + A E++
Sbjct: 283 SSPPQAEAIRDMVRYARN-----VIEEFRRAKHYKYNLSVWPARGTAPCLTDGVCELLEI 337
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 338 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVYSLN 397
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 398 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKQEIE 448
>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 152/329 (46%), Gaps = 41/329 (12%)
Query: 33 HMSDIDEKQLL--LYAQI-----ANLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 85
H+SD+ + Q L L+AQ+ A LI P S+N CN TI + EL
Sbjct: 179 HISDLPDDQALTQLFAQVRSRTGAGGEKLIRGSPPASVN----------CNGFTIEDEEL 228
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E+ SYI+ T R+
Sbjct: 229 SHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGDEIFNSYIDLLYPTEDRK 288
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ L + Y FTC C C + ++ +S + L + +
Sbjct: 289 ERLLDSYFFTCQCAECTTRSKDEEKMKSTKPKSS-----------LEPEEVQSMVVYARN 337
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
L+ EE ++ NILS L C +S KM ++ +V ++
Sbjct: 338 LI---EEFRRAKHYKNILSFPVSLLEMCE-----LSLDKM-----GSVFADTNVYMLHMM 384
Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
+ + + + ++DW A++Y + I Y YP + + Y G+L L T +K
Sbjct: 385 YQAMGVCLYMQDWDGAMSYGEKIIQPYSVHYPAYSLNVASMYLKLGRLYLGLEKTPQGVK 444
Query: 326 SMTEAVEILRITHGTNSPFMKELILKLEE 354
++ +A+ I+ + HG + ++ E+ ++EE
Sbjct: 445 ALKKALSIMEVAHGKDHYYVAEVKREIEE 473
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 159/341 (46%), Gaps = 36/341 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
T ++ V+ +H+ +D EK+ L+ IA L + L +P+ + + F+++
Sbjct: 126 TQSEKLLAVKEFESHLDKLDNEKRELIQNDIAALHHFYSKHLDYPDNAA--LVVLFAQVN 183
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAV+ + G E+ SY
Sbjct: 184 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVREIEPGEEIFSSY 243
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLRD 193
I+ T R L++ Y F+C C C + D ++ I CK +D S ++D
Sbjct: 244 IDLLYPTEDRNDRLRDSYFFSCDCRECTT--KEKDKEKLEI-----CKLNDPPSAETVKD 296
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
++ K S +L L+L G +
Sbjct: 297 MIKYARN-----VIEEFRRAKHYKSPSELLEICELSLDKMG-----------------AV 334
Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
+ +V ++ + + + + ++DW+ AL Y Q I Y + YP + + + G+L
Sbjct: 335 FEDSNVYMLHMMYQAMGVCLYVQDWEGALCYGQKIIRPYSKHYPSYSLNVASMWLKLGRL 394
Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
L + +K++ A+ I+ + HG + P++ E+ +LE+
Sbjct: 395 YMALKNRTAGVKALKRAIAIMEVAHGKDHPYISEIKKELED 435
>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
Length = 457
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 141/282 (50%), Gaps = 16/282 (5%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+I E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229
Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
+ +P +++ ISYI+ + R+ L+E Y F C C +CI D QE +
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
C + C + + + CQQC +++ +++ ++ +L + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338
Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
+ +EK Q+ + HP +V ++T + + + + W +ALA+ Q +P + + + +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395
Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
+PL+GL Y GK++ + + + A+ + EA IL +THG +
Sbjct: 396 NPLVGLLYMKLGKIQLYERNYKLAVHHLQEAQCILSVTHGHD 437
>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
Length = 420
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 176/368 (47%), Gaps = 41/368 (11%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--DEKQL-LLYAQIANLVNLILQ 57
+ +L LR + D + T L++H ++I D K+L L + A L +++
Sbjct: 86 LCRLILRLQHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDRKRLEHLDSLHAVLTDMMAD 145
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N + ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 146 SPSTVPNKSELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 205
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ A++ +P +++ ISYI+ + R++ LK+ Y F C C +C D +E+
Sbjct: 206 LHIHALEDMPCLDWSKIFISYIDLLNTPEQRRQDLKDHYYFLCVCSKC------RDPKEA 259
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + CS + + ++ C++C + S + ++ +E+ L+K L
Sbjct: 260 RQMTAAACPNRKCSASINIEWNN----CKRCSVGISPK-LRNAYNEIMALTKHNL----- 309
Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
E ++ Y +++ +T + + +E+ W+EAL Y Q +P +Q
Sbjct: 310 -------------EAMKDVAY----LDVCKTLDAACEAAIEVGKWEEALNYGQQLLPGFQ 352
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL- 352
+ Q++PL+GL + GK++ F + A + EA IL +THG + + E + L
Sbjct: 353 MYHGQWNPLVGLLHMKLGKIQLFERHHKEASHHLEEAQRILSVTHGRDHRLLVEQLYPLI 412
Query: 353 EEAQAEAS 360
+A+ EA+
Sbjct: 413 FQARQEAN 420
>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
Length = 432
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 154/347 (44%), Gaps = 62/347 (17%)
Query: 26 LVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNS 83
L+ + +H+ D D EK+ IA L + E+ S ++ FS++ACN TI +
Sbjct: 129 LIGEMQSHLEDEDNEKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIEDD 188
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
EL +GT +YP +++INHSCLP+ ++ F G LA VRAVQ + G EVLISYI+ T
Sbjct: 189 ELSHMGTAVYPDVALINHSCLPSVIVTFNGTLAQVRAVQDMKPGDEVLISYIDLLYPTDD 248
Query: 144 RQKALKEQYLFTCTCPRC-------IKL---GQFDDIQESAILEGYRCKDDGCSGF-LLR 192
R L+E Y FTC C C +KL Q D I+ I R + F ++
Sbjct: 249 RNSRLRESYYFTCNCLECQNKQKDKVKLKVRKQSDPIEPQVISNMTRYAKNTIREFRAMK 308
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
+ + C +S EE+ + S V +L A+ C Q+ + EK+
Sbjct: 309 STKTPSELLEMC--EQSLEEMGAVFEDSNVYVLHMMYQAMGVCIYMQDADGALRYGEKVA 366
Query: 251 K---KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ K YHP+S+N+ KL +I M
Sbjct: 367 RYYSKRYHPYSLNVSSLYLKLGRIYMA--------------------------------- 393
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
LG I ++T+A+ I+ + HG + ++K++ +L +
Sbjct: 394 ---------LGRHSAGINALTKAMAIMEVAHGKDHQYLKDIRKELSQ 431
>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
Length = 457
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 141/282 (50%), Gaps = 16/282 (5%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+I E+ + +L N I ++E+ + T +Y +SI +HSC PNAV FEG V +
Sbjct: 170 NITELMSIYGRLITNGFNILDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNELHVHVI 229
Query: 122 QHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
+ +P +++ ISYI+ + R+ L+E Y F C C +CI D QE +
Sbjct: 230 EDLPCLDWSKIYISYIDLLNTPEQRRADLREHYYFLCVCSKCI------DPQEMHEMTAA 283
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
C + C + + + CQQC +++ +++ ++ +L + +V
Sbjct: 284 VCPNASCDASV----NIELAKCQQCD-ASVTPQLRTAYNDIMAFTQSSLDSMKEVAYLDV 338
Query: 240 VSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
+ +EK Q+ + HP +V ++T + + + + W +ALA+ Q +P + + + +
Sbjct: 339 CNV--CLEK-QRGVLHPLNVWHVKTLDAAFEAAINVGKWTDALAFGQQLLPGFGKYHGAW 395
Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
+PL+GL + GK++ + + + A+ + EA IL +THG +
Sbjct: 396 NPLVGLLHMKLGKIQLYERNYKLAVHHLQEAQRILSVTHGHD 437
>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
glaber]
Length = 252
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 138/270 (51%), Gaps = 43/270 (15%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
++ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+
Sbjct: 1 QVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFSGPHLLLRAVRDIEVGEELT 60
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
I Y++ ++ R+K L++QY F C C RC Q D +++ +L G
Sbjct: 61 ICYLDMLMTSEERRKQLRDQYCFDCDCFRC----QTQD-KDADMLTG------------- 102
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D++ + Q E +KKI E+ K L C Q ++S+ E+L
Sbjct: 103 ---DEQVWKGVQ-------ESLKKI-EELKAHWKWEQVLAMC---QSIISSNS--ERLP- 145
Query: 252 KLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q
Sbjct: 146 ------DINIYQLKVLDCAMDACINLGLLEEALFYGIRTMEPYRIFFPGSHPVRGVQVMK 199
Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHG 339
GKL+ G A+K++ A +I+R+THG
Sbjct: 200 VGKLQLHQGMFPQAMKNLRLAFDIMRVTHG 229
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 140/273 (51%), Gaps = 33/273 (12%)
Query: 70 FSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
F ++CN +I + L+ LG ++P +++INH C PN V V G VRA++ + +G
Sbjct: 279 FGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNVVAVSCGPNIFVRAIKPIKEGD 338
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
E+ ISYI+T+ ++ TR+ LK+QY F CTC C + G+ D+++ + ++ + S
Sbjct: 339 ELFISYIDTSATSETRKNILKDQYYFDCTCKMC-ESGEKDELKSAYVMPKEDVSEKR-ST 396
Query: 189 FLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN--HQEVVSTYKMI 246
++ +++D +KKI S + + + +A + G QE + +
Sbjct: 397 YIEKNTDIM---------------MKKIESSKKVQAWERVAAQAGGCLLQQENLFDDTHL 441
Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
+KL ++QT ++ IL +D A Y + + Y+++YP++ +G+Q
Sbjct: 442 KKLM----------ILQTCSEVSAILNHYDD---AAQYAERVLAAYEQLYPEYSTQIGMQ 488
Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
Y G W L E+AIK + A++ L ITHG
Sbjct: 489 AYRLGVHYWHLQRVEDAIKMLGRALKHLEITHG 521
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 168/369 (45%), Gaps = 23/369 (6%)
Query: 8 RKLQ--NDNVIPSTTTDNYSLVEALVAHMSDI--DEKQLLLYAQI-ANLVNLILQWPEIS 62
RKLQ D T Y L+ H DI D ++ + + L +L+ + S
Sbjct: 91 RKLQKGGDYQKGYYTAKFYRRFHDLMTHEDDIKNDAPRMEHFQSLLVVLRSLVEEAAMPS 150
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
E+ + F K+ N+ I + E+ +GTG+Y ISII+HSC PNA+ F+G +R ++
Sbjct: 151 KAELLQIFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNALATFDGTTIHLRLLE 210
Query: 123 HVPKGAEV-----LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
G+EV ISYI+ R+ L+ QY F C C RC D QE ++
Sbjct: 211 DY-HGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERC------RDEQEQQLMI 263
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
C + C L D K C +C K +++ EV + LA +
Sbjct: 264 AGACPNVDCDE-PLSMLDRKVEKCPRCDTA-IKHSAREMFREVTDFTLAQLAKMKDITYM 321
Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP 297
++ ++ K Q+ + H ++V ++T + + + ++ W++A+ Y +++
Sbjct: 322 DIC---QLCLKKQENILHYYNVWYLKTLDLAFESAINMQKWEDAIGYGLRLKDGFKKYNG 378
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
FHPL GL GK++ + A++++ ++ +ILR+THG K ++ L QA
Sbjct: 379 PFHPLYGLLLLKIGKIQLYTKHVVEALQNINDSEKILRVTHGEEHDLYKSQLIPL-LCQA 437
Query: 358 EASYKLSSK 366
Y +K
Sbjct: 438 AGEYDALNK 446
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 157/341 (46%), Gaps = 43/341 (12%)
Query: 18 STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
ST +D + ++ L H++D+ E +L L I N ++ WP S +++I+ F
Sbjct: 114 STVSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFG 170
Query: 72 KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+ CN ++ + L+ +G GL+P + ++NH C PN ++ +R++ + +G E+
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEEL 230
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
++Y++ + RQ+ LK QY F CTC C + I++ L G
Sbjct: 231 TVAYVDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLGG------------ 273
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKL 249
R+ D G+ S+E++K+ + +K G++ EVV + IEK
Sbjct: 274 -REVD---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYHEVVKICRECIEKT 323
Query: 250 QKKLY--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ L H + + + T ++ L L D A Y + + Y ++YP + LG+
Sbjct: 324 EPVLAETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTKLYPPNNATLGMAA 380
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G + W GD E A + +A IL +THG P K+L
Sbjct: 381 MRAGVIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 421
>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
Length = 425
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 160/350 (45%), Gaps = 39/350 (11%)
Query: 20 TTDNYSLVEALVAHMSDI--DEKQL----LLYAQIANLVNLILQWPEISINEIAENFSKL 73
T +NY L++H SD+ D K + LY +A ++ L P SI E+ + K+
Sbjct: 103 TKENYRKFTDLMSHYSDMRADAKHMEPFIKLYRFLAEFLDETLM-P--SIAELISIYGKM 159
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
+ N I + L +G G+Y S++ HSC PNAV FEG ++ + IS
Sbjct: 160 STNRLNIQDEWLNVIGFGIYLGASVVGHSCKPNAVATFEGTTIII-----------IKIS 208
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
Y++ S R++ L Y F C C RC K E ++ E C + C D
Sbjct: 209 YVDLIKSKKDRREELYSLYHFWCDCERCEK--------EESLTEAAACPNSSCDSPCSID 260
Query: 194 SDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
+D C++C V+ KE +++ + + +++ N + S K I + Q
Sbjct: 261 ADK----CEKCNTRISVKFKETFQEVVDLTDYHFEHIISILF-FNDLDTFSWIKEILEKQ 315
Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
K + H F+V+L+ T K+ L+ WK+ Y + Y Y + +P +G Y T
Sbjct: 316 KNVMHKFNVHLICTLRKIHCAAWNLKCWKDVEFYGKKLEAGYLFYYGKVYPPIGTLYLTL 375
Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
G LG + A K++ +A +L+ITHG +KE+ LE + +AS
Sbjct: 376 GIALLHLGKPKEAFKTLNKAYTVLKITHGNKHFLVKEI---LEPLRFDAS 422
>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
Length = 459
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 173/368 (47%), Gaps = 22/368 (5%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLY---AQIANLVNLILQ 57
+ +L LR + D + T L++H ++I + L + A L +++ +
Sbjct: 106 LCRLILRLEHGGDLIRGYYTEHGSRKFRDLMSHYAEIKNDPMRLEHLDSLHAVLTDMMAE 165
Query: 58 WPEISIN--EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
P N E+ + +L N + ++E+ + T +Y +SI +HSC PNAV FEG
Sbjct: 166 SPSTVPNKTELMSIYGRLITNGFNVLDAEMNSIATAIYLGVSITDHSCQPNAVATFEGNE 225
Query: 116 AVVRAVQHVP--KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ A++ + +++ ISYI+ + R+ LKE Y F C C +CI D QE+
Sbjct: 226 LHIHAIEDMECLDWSKIFISYIDLLNTPEERRLDLKEHYYFLCCCSKCI------DPQET 279
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
+ C + C + + + C++C S + ++ + V ++ L
Sbjct: 280 KEMMAALCPNRNCGIGI----NPERTNCKRCDTGISPK-LRNAYNSVMAFTRTNLEAMKD 334
Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
+ +V K+ Q + HP +V ++T + + +++ W +AL Y Q +P Y+
Sbjct: 335 VAYLDVC---KVCLNKQTGVLHPLNVWHVKTLDAAFEAAIDVGKWTDALDYGQRLLPGYR 391
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + ++PLLGL + GK++ + A+ + +A+ IL +THG + + E + L
Sbjct: 392 KYHGAWNPLLGLLHMKLGKIQLHEQRAKEALPHLEDALRILTVTHGRDHRLLAEQLYPLL 451
Query: 354 -EAQAEAS 360
+A+ EA+
Sbjct: 452 IQARHEAN 459
>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 157/336 (46%), Gaps = 55/336 (16%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
LV + D+D+++LLL A +A +NL+ + S++E+A S+ N +IC+ L LG
Sbjct: 132 LVWYEEDMDDQELLLLAALAQKINLVDE--SFSMDEMARMLSRFRNNNFSICDELLLELG 189
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKA 147
G +P+ ++INHSC PN + F + + RA++ + G E+ +Y++ A RQ+
Sbjct: 190 AGCFPLGAMINHSCDPNCAVTFVPKTLDMEFRAMKPIKSGEEITQTYVDIALPRRERQQR 249
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
L+ +Y FTC CPRC + +QE L+ Y D+D G ++
Sbjct: 250 LQRKYHFTCGCPRCSQ-----PLQEPGSLDAY------------LDADIDGVPQERWTEE 292
Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
R +E + + + + Q + H SV +QT
Sbjct: 293 RQQE--------------------------QCIDALQNLADRQSTILHCHSVARLQTLAT 326
Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL-----EWFLGDTEN 322
+ ME +EA+ Y + + Y+RVY HP+ GL +T G L + G ++
Sbjct: 327 IFSAEMERGSVEEAIGYGERMLEFYRRVYNPNHPMTGLHVFTLGDLYGQQAQAGTGAEDS 386
Query: 323 AIKS---MTEAVEILRITHGTNSPFMKELILKLEEA 355
KS + EA IL+ITHG + F+K L +LE A
Sbjct: 387 KQKSAEYLAEAQRILQITHGKDHRFVKMLGDRLETA 422
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 164/361 (45%), Gaps = 41/361 (11%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQ 57
++L R L+ + T ++ + L AH+ +D EK + IA L + L
Sbjct: 106 VRLVARIILKQKHQTERTPSERVLTLRELEAHLDKLDNEKNEMNDTDIAALHHFYSRHLD 165
Query: 58 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
+P+ + + E +++ CN TI + EL LG+ L+P ++++NHSC PN ++ ++G +A
Sbjct: 166 FPDNAA--LTELIAQVNCNGFTIEDEELSHLGSALFPDVALMNHSCSPNVIVTYKGTVAE 223
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
VRAVQ + E+ SYI+ T R + LK+ Y F C C C
Sbjct: 224 VRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTS-------------- 269
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
+ KD+ K Q+ + +EEIK++ I ++ + H
Sbjct: 270 --KSKDEA-----------KMEIRQKLSIPPEEEEIKQMV----IYARNVIEEFRRAKHY 312
Query: 238 EVVSTYKMIEKLQKK----LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
+ S I +L + ++ +V ++ + + + + ++DW A+ Y + I Y
Sbjct: 313 KTPSELLEICELSMEKMGAIFAETNVYMLHMMYQAMGVCLYMQDWDGAMKYGEKIIHPYS 372
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
YP + + Y G+L L +K++ +A+ I+ I HG + P++ E+ ++E
Sbjct: 373 VHYPPYSLNVASMYLKLGRLYLGLEKRTQGVKALKKALAIMDIAHGKDHPYIDEIKKEME 432
Query: 354 E 354
E
Sbjct: 433 E 433
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 148/336 (44%), Gaps = 55/336 (16%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
LV + D+++++L+L A +A + L+ S++E+ S+ N +IC+ L G
Sbjct: 130 LVWYEEDMEDQELMLLAALAQKLELVD--GSYSMDEMLRMLSRFRNNNFSICDELLLEQG 187
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGSTMTRQKA 147
G +P+ ++INHSC PN + F + + RA++ + G E+ +Y++ A R +
Sbjct: 188 AGCFPLGAMINHSCDPNCAITFVPKTLEMEFRAMRPIKAGEEITQTYVDVALPRRERHER 247
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
L+ +Y F C C RC +QES L+ + D+D G +Q
Sbjct: 248 LQRKYHFNCACSRCSV-----PLQESGSLDAF------------LDADIDGVPKEQWSQE 290
Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
R E + + + + + Q + H S+ +QT
Sbjct: 291 RKDE--------------------------QCIDALQKLAERQSNILHRDSIARLQTLAT 324
Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL-----GDTEN 322
+ ME +EA+ Y + + YQRVY HP+ GL +T G L L G +
Sbjct: 325 IFSAEMERGSVEEAVVYGEKMLEFYQRVYNANHPMTGLHLFTLGDLYAQLAQVGTGPENS 384
Query: 323 AIKS---MTEAVEILRITHGTNSPFMKELILKLEEA 355
KS +TEA IL+IT G F+K L +L+ A
Sbjct: 385 KAKSSEYLTEARRILQITQGKEHRFVKMLADRLQAA 420
>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 442
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/307 (26%), Positives = 144/307 (46%), Gaps = 28/307 (9%)
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
+E+ F K+ N TI N +L +G GLY +S+++HSC P+A ++F G AV+R ++
Sbjct: 142 SELLTIFGKVMVNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDAFVLFNGTKAVLRPLKQ 201
Query: 124 --VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
+ + I+Y + T R+ LK+Q+ FTC C C+ D++ RC
Sbjct: 202 YITAYDSSLRIAYCDLLDLTSMRRNQLKQQFFFTCECSACL------DLEREKTARSVRC 255
Query: 182 KDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEI---KKIASEVNILSKKTLALTSCGN 235
+ G+ D ++ C QCG V E + +++ EV L ++
Sbjct: 256 R-HCVDGYCPLDVNENSLVCWQCGATSEVHVDEAVHLMQQVEWEVKRLQER--------- 305
Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
+ + T K + + ++ L Q +++ ++ ++ A Y + TI ++R
Sbjct: 306 RNDSICTAKDLYNEASIILSSLNIPLCQLADRIASWGIDNGNFSLAADYIEKTISCFKRF 365
Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDT-ENAIKSMTEAVEILRITHGTN---SPFMKELILK 351
YP HP L LQY+ GKL T E A + + +A+E L +HG S EL+ K
Sbjct: 366 YPTAHPSLSLQYFKAGKLHSLNDLTLELAAERLEQAIESLHASHGVQHSLSAEAHELLCK 425
Query: 352 LEEAQAE 358
+++ + +
Sbjct: 426 VQQQRRD 432
>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
Length = 430
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 122/257 (47%), Gaps = 53/257 (20%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR----LAVVRAVQHVPKG 127
KL N +ICN E+ +G+G+Y S+ NHSC+PN ++F+ + ++ + +G
Sbjct: 182 KLQRNTFSICNEEMNAIGSGIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEG 241
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
+ I+Y++ T RQK LKEQY FTCTCPRC L +++ E+
Sbjct: 242 TPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC--LNSNEEVNENV------------- 286
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMI 246
+++ IA+E +K L N+ Q +VS +
Sbjct: 287 -----------------------DKMISIANE----HRKNQNLQEAVNYFQRIVSKKRSS 319
Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
L ++Y ++N L + ++L D+K+A + ++ +++ YP F+PLLGLQ
Sbjct: 320 PSLFHEIYMGIALN------SLTHLNVDLSDFKKAYEFGIESLEYFEKYYPPFYPLLGLQ 373
Query: 307 YYTCGKLEWFLGDTENA 323
Y TC KL +L + A
Sbjct: 374 YLTCAKLASYLEENGQA 390
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 162/358 (45%), Gaps = 42/358 (11%)
Query: 3 KLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI 61
++ ++K+Q D ++ L+ + +H D+D E++ + A +A L + EI
Sbjct: 110 RILTKKKMQKDRC----ASEKLLLLGEMQSHTEDMDNERRETMEADVAGLHQFFSKHLEI 165
Query: 62 -SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
++ FS++ACN TI + EL LGT +YP +++INHSC P+ ++ + G A VRA
Sbjct: 166 PGHKDLLTLFSQVACNGFTIEDEELSHLGTAVYPDVALINHSCRPSVIVTYSGTSAHVRA 225
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
++ + G EVLISYI+ T R L+E Y FTC C C ES
Sbjct: 226 LRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEEC----------ES------- 268
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-EIKKIASEVNILSKKTLALTSCGNHQEV 239
R D ++ RS+ E + I++ V K N
Sbjct: 269 -----------RSMDQAKLKARK----RSEPIEPEAISNMVRYARKCIREFRVFKNTSPA 313
Query: 240 VSTYKMIEKLQKKLYHPF---SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
+ +M E+ ++ F +V ++ + + + + ++D A+ Y + Y+++Y
Sbjct: 314 STLLEMCEQSLDEMGAVFDDSNVYMLHMMYQAMGVCLYMQDPDGAIRYGEKIAKYYRKLY 373
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
P + L Y G+L + + I + +A I+ +THG ++ EL +++E
Sbjct: 374 PAYSLNLSSLYLKLGRLYFGMERNAECIGMLKKAKAIMEVTHGKEHFYLTELDRQMKE 431
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 158/339 (46%), Gaps = 34/339 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINEIAEN-FSKLACN 76
T ++ + V+ +H+S +D EK+ L+ I+ L + N E F+++ CN
Sbjct: 121 TASERFMSVKEFESHLSKLDNEKKELIENDISALHRFYSKNVHNCDNAALEFLFAQVNCN 180
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G EV SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEEVFTSYID 240
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R LK+ Y F+C C C ++ A LE R D S +RD
Sbjct: 241 LLYPTEDRNDRLKDSYFFSCDCRECST-----KQKDPAKLE-LRKLSDPPSPQTVRD--- 291
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
+V K + +L L+L G+ ++
Sbjct: 292 --MITYARNVVEEFRRAKHYKTPSELLEICELSLDKMGS-----------------VFVD 332
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + ++DW+ AL Y + I Y + YP + + + G+L +
Sbjct: 333 SNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRL--Y 390
Query: 317 LGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 353
+G +N I K++ +A+ I+ I HG + ++ E+ +LE
Sbjct: 391 MGLEKNTIGTKALKKALAIMEIAHGPDHYYIAEIKKELE 429
>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
niloticus]
Length = 440
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 160/356 (44%), Gaps = 50/356 (14%)
Query: 3 KLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEI 61
++ ++K+Q + I + L+ + +H D D EK+ + A IA L + ++
Sbjct: 118 RILAKKKMQKERCI----CEKMLLIGEMQSHTEDEDNEKREMTEADIAGLHRFYSKHLDV 173
Query: 62 SIN-EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ E+ FS++ACN TI + EL LGT +YP +++INHSCLP+ ++ F+G A VRA
Sbjct: 174 PDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPDMALINHSCLPSVIVTFKGTSAEVRA 233
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
VQ + G EVLISYI+ T R L+E Y F C C C
Sbjct: 234 VQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKS----------------- 276
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV--NILSKKTLALTSCGNHQE 238
S DK ++ +++ I +V N++ A+ +
Sbjct: 277 ------------QSKDK-------AKLKVRKQRDSIEPDVINNMVRYARKAIREFREFKH 317
Query: 239 VVSTYKMIEKLQKKL------YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
+ +++E ++ L + +V ++ + + I M ++D A+ Y + + Y
Sbjct: 318 TKTPSELLEMCEQSLEEMGAVFDDSNVYMLHMMYQAMGICMYMQDLDGAIRYGEKLLKPY 377
Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+YP + + Y G+L L ++ +A+ I+ + HG + +++EL
Sbjct: 378 SHLYPPYSLNVSSVYLKLGRLYLGLERQSACTSALKKAMAIMEVAHGKDHFYIEEL 433
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 155/337 (45%), Gaps = 30/337 (8%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
T ++ + V+ +H+S +D EK+ L+ + IA L + + N + F+++ CN
Sbjct: 121 TPSETFLSVKEFESHLSKLDNEKKELIESDIAALHRFYSKNLHYTDNAALVFLFAQVNCN 180
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ F+G +A +RAVQ + G EV SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAGDEVFTSYID 240
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
T R L + Y FTC C C ++ A LE R D S ++D
Sbjct: 241 LLYPTEDRNDRLMDSYFFTCDCRECST-----KQKDPAKLE-IRKLSDPPSHQTVKDMIK 294
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHP 256
+V K + +L L+L G+ ++
Sbjct: 295 YARN-----IVEEFRRAKHYKTPSELLEMCELSLDKMGS-----------------VFVD 332
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+V ++ + + + + L++W AL Y + I Y + YP + + + G+L
Sbjct: 333 SNVYMLHMMYQAMGVCLYLQEWDGALKYGEKIIKPYSKHYPAYSLNVASMWLKLGRLYMG 392
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
L T K++ +A+ I++I HG + ++ E+ +LE
Sbjct: 393 LEKTTIGTKALKKALAIMQIAHGPDHHYIAEIKKELE 429
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 160/354 (45%), Gaps = 56/354 (15%)
Query: 18 STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
ST +D + ++ L H++D+ E +L L I N ++ WP S +++I+ F
Sbjct: 114 STVSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDF---WPRTSKQHTVDDISHIFG 170
Query: 72 KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV----------- 118
+ CN ++ + L+ +G GL+P + ++NH C PN V++ G + V
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIE 230
Query: 119 -RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
R++ + +G E+ ++Y++ + RQ+ LK QY F CTC C + I++ L
Sbjct: 231 LRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCK-----NKIKDDIKLG 285
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
G R+ D G+ S+E++K+ + +K G++
Sbjct: 286 G-------------REVD---------GVKPSEEQVKEATDYCYQMLEKMDKARLNGDYH 323
Query: 238 EVVSTYK-MIEKLQKKLY--HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
EVV + IEK + L H + + + T ++ L L D A Y + + Y +
Sbjct: 324 EVVKICRECIEKTEPVLAETHIYLLRMWSTMSEVQAYLQYLND---AAEYARKMVEGYTK 380
Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+YP + LG+ G + W GD E A + +A IL +THG P K+L
Sbjct: 381 LYPPNNATLGMAAMRAGVIHWQAGDIEVAHGMVCKAYAILMVTHGPTHPITKDL 434
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/360 (21%), Positives = 159/360 (44%), Gaps = 35/360 (9%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----IL 56
++L R ++ T ++ L++ AH+ +D EK+ + IA L + I
Sbjct: 106 VRLVARIIMKQRATTERTPSERLLLLKEFEAHLDKMDSEKEEMNQTDIAALHHFYSRHIS 165
Query: 57 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
P+ + E F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A
Sbjct: 166 NLPDEQA--LTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVA 223
Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
VRAV+ + G E+ SYI+ T R++ L + Y FTC C C
Sbjct: 224 EVRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECTS------------- 270
Query: 177 EGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 236
R D ++ EEI+ + + ++
Sbjct: 271 ---------------RSKDKAKMEIRKLNPPPEPEEIRSMVRYAKNVIEEFRRAKHYKTP 315
Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
E++ ++ + ++ +V ++ + + + + ++DW A++Y + + Y Y
Sbjct: 316 SELLEMCELSLEKMGAIFADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVQPYSVHY 375
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
P + + Y G+L L +K++ +A+ ++ + HG + ++ E+ ++EE +
Sbjct: 376 PAYSLNVASMYLKLGRLYLGLEKKTQGVKALKKALAVMEVAHGKDHHYVAEVKREIEEQK 435
>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 732
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 133/315 (42%), Gaps = 36/315 (11%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW--PEISINEIAENFSKL 73
+ +T + V+AL + D+ ++ Y +AN VN +L + ++ +AE SK+
Sbjct: 322 VRATAPGLFDAVKALDSPFRDVSAEKRATYVGMANAVNSMLPPGPARMDVDALAEVMSKV 381
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 111
N+H + ++ R LGTG+YP ++ NHSC PNAV+ F
Sbjct: 382 HVNSHGVVDAAGRALGTGVYPPAALFNHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGS 441
Query: 112 --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
G VR V V +G E+ I+Y E + TR++AL E+ F C C RC
Sbjct: 442 TNNGARLTVRCVSPVEEGEEICIAYAEVYATRETRREALWEKKAFACECRRCADAAS--- 498
Query: 170 IQESAILEGYRCKD-DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
+ A L G+RC C G + D G +C CG ++ K+ A+E + +
Sbjct: 499 ARRDAPLGGWRCPRLSACDGVV----PDAGDSCLTCGAKCARARSKRDAAERRWIGRHAA 554
Query: 229 ALTSC--GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
ALT+ +H VS + + V + R L+ L+ W+ A +
Sbjct: 555 ALTALRENDHATAVSIACEVVRESDDALCDAHVIRHECRLVLMDAHAALKQWRHAADVSR 614
Query: 287 LTIPVYQRVYPQFHP 301
+R HP
Sbjct: 615 DVADAMRRFVNVNHP 629
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 152/343 (44%), Gaps = 35/343 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKL 73
T ++ L++ +H+ +D EK + A IA L I P+ E+ E F+++
Sbjct: 123 TPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQV 180
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E+ S
Sbjct: 181 NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNS 240
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
YI+ T R++ L + Y F C C C +
Sbjct: 241 YIDLLYPTEDRKERLLDSYFFGCQCTECT----------------------------TKS 272
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
D ++ EEIK + + ++ E++ ++ + +
Sbjct: 273 KDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAI 332
Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
+ +V ++ + + + + ++DW A++Y + + Y YP + + Y G+L
Sbjct: 333 FADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRL 392
Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
L +K++ +A+ I+ I HG + ++ E+ ++EE +
Sbjct: 393 YLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 435
>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 126/264 (47%), Gaps = 45/264 (17%)
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
+S++NHSC PN V+VFEG ++R V+ +PKG E+ ISYI+ T R+ L+ QY F
Sbjct: 1 MSLLNHSCDPNCVIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60
Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK 215
C C RC LLRD D+ L E ++
Sbjct: 61 CDCQRC----------------------------LLRDKDEDM-------LAGDAEASRE 85
Query: 216 IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR--EKLIKILM 273
+ S V+ L + L S +E ++ + + L + Y P N+ Q + + + +
Sbjct: 86 VESSVSRLEE----LLSQNTAEEALN---LCKTLMNRYYLP-DKNIYQLKILDCAMDASI 137
Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
+L W+EAL + T+ Y Y +HP+ +Q GKL+ + G A+K++ +A +I
Sbjct: 138 DLGLWEEALHFGLRTLEPYSLYYSNYHPVRAVQLMKVGKLQNYQGLFAEAMKTLKQAFDI 197
Query: 334 LRITHGTNSPFMKELILKLEEAQA 357
+++THG + ++L + + +A
Sbjct: 198 MKVTHGRDHGQTQQLAELMNDCEA 221
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 152/343 (44%), Gaps = 35/343 (10%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNL----ILQWPEISINEIAENFSKL 73
T ++ L++ +H+ +D EK + A IA L I P+ E+ E F+++
Sbjct: 112 TPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQV 169
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G E+ S
Sbjct: 170 NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFNS 229
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
YI+ T R++ L + Y F C C C +
Sbjct: 230 YIDLLYPTEDRKERLLDSYFFGCQCTECT----------------------------TKS 261
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
D ++ EEIK + + ++ E++ ++ + +
Sbjct: 262 KDKAKMEIRKLSSPPEPEEIKSMVHYAKNVIEEFRKAKHYKTPSELLEMCELSLEKMGAI 321
Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
+ +V ++ + + + + ++DW A++Y + + Y YP + + Y G+L
Sbjct: 322 FADTNVYMLHMMYQAMGVCLYMQDWDGAMSYGEKIVYPYSVHYPPYSLNVASMYLKLGRL 381
Query: 314 EWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
L +K++ +A+ I+ I HG + ++ E+ ++EE +
Sbjct: 382 YLGLEKKTQGVKALKKALAIMEIAHGKDHHYVAEVKREIEEQK 424
>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 454
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 158/366 (43%), Gaps = 51/366 (13%)
Query: 23 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-------NEIAENFSKLAC 75
++ +V L +H+ D K N+ + + P+ ++ + IA ++++
Sbjct: 76 DWEMVCRLPSHVDDFKRK-----GTYGNIEMMAMGAPQFALPPTMFDKDFIAAMYARVMS 130
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
NA TI L PLG L P + +NHSC PNA ++ +G +R ++ + K E+ ISYI
Sbjct: 131 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIRKDKEIFISYI 190
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
+T RQ+ L+ ++ FTC C +C QE A L+ S F+L
Sbjct: 191 DTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLLPVDSKFVLDAEA 241
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
K Q EE+ K+++E I K + L SC + + Y
Sbjct: 242 KKAMAQTQEQTFAVYEELHKLSTEHVIYGLKQI-LASC---------------YESRFYP 285
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLLGLQYYTCGKL 313
+ + R+ LI L+ + +++A A C + ++YP FHP+ +Q + L
Sbjct: 286 IYRQPYAEARDVLIVNLLSMGKFQDAWAQCAKRYKYILPKLYPVPFHPVRVVQTWQMAML 345
Query: 314 EWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
+L TE + + + + +++ ++HG N+ F K + K EE E
Sbjct: 346 AAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKGKAEEMIEELKR 405
Query: 362 KLSSKD 367
+ + D
Sbjct: 406 SVGNPD 411
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/328 (25%), Positives = 150/328 (45%), Gaps = 49/328 (14%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
++ LK+QY F CTC C K + KDD FL
Sbjct: 250 KRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL-------------- 276
Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G+ + + +++ E+ SK TL S G + EVV + + Q+ ++ ++
Sbjct: 277 GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIY 336
Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
+++ + ++L L+ ++EA Y + + Y ++Y + LG+ G W G+
Sbjct: 337 MLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 396
Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
E + +A IL +THG + P K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 146/323 (45%), Gaps = 39/323 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
V H + ++K+L A++ + WP S+ I+ F + CN T+ + L
Sbjct: 136 VDHFGEEEQKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
+ +G G++P ++++NH C PN ++F +RA+ + +G E+ +SYI+ + R+
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERR 250
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ LK+QY F CTC C K G DD+ FL + K
Sbjct: 251 RQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDNPKP------- 284
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
S+E +K++ +K S G + EVV + + Q+ ++ ++ ++
Sbjct: 285 ---SQEVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRML 341
Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
+ ++L L+ + EA Y + + Y ++Y + LG+ G W G+ E
Sbjct: 342 SIVSEVLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHG 401
Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
+ +A IL +THG + P K+L
Sbjct: 402 MICKAYAILLVTHGPSHPITKDL 424
>gi|384246814|gb|EIE20303.1| hypothetical protein COCSUDRAFT_48675 [Coccomyxa subellipsoidea
C-169]
Length = 377
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 43/90 (47%), Positives = 59/90 (65%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
++A ++ ACN HTIC+ ELRP+G G+YP +++NHSC PN + F G V RAVQH+
Sbjct: 22 DVAYLLARFACNNHTICDDELRPIGVGIYPQGAMLNHSCTPNTMQSFHGSRIVFRAVQHI 81
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLF 154
P GAE I+Y+E A + R AL+ Y F
Sbjct: 82 PAGAEATIAYVELAATRAERSAALRANYFF 111
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 82/328 (25%), Positives = 147/328 (44%), Gaps = 49/328 (14%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + G E+ +SYI+ + R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISVGEELTVSYIDFLNVSEER 249
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
++ LK+QY F CTC C K G DD+ FL
Sbjct: 250 KRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL-------------- 276
Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G+ + +++ E+ SK TL S G + EVV + + Q+ ++ ++
Sbjct: 277 GVKEDPKPSQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 336
Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
++ + ++L L+ + EA Y + + Y ++Y + LG+ G W G+
Sbjct: 337 TLRMLSVVSEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNI 396
Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
E + +A IL +THG + P K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 92/168 (54%), Gaps = 6/168 (3%)
Query: 28 EALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 87
++LV+++ ID + + ++N L EIS E E FSK++CN+ IC+ E++
Sbjct: 121 DSLVSNVEKIDSDAKEDFVSVLMVLNEYLG-SEISPPEGLELFSKISCNSFAICDGEMQA 179
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
+GTG++P +NHSC PN+V VF G ++A++ +P G E+ ISYI+ TRQ+
Sbjct: 180 IGTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEE 239
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
L+ Q+ F C C RC+ D + +L C + C + + D
Sbjct: 240 LQTQFCFYCQCHRCL-----DASDNNKMLTSLICPNKSCEAIVYQTFD 282
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG-DTENAIKSMTEAVEIL 334
+DW +AL Y + VY YP++HP LG+ Y GKL D E A+K + +A IL
Sbjct: 288 QDWTKALEYATRNLEVYTWFYPKYHPCLGVHLYKIGKLLAVTHQDLELAVKRLEQARRIL 347
Query: 335 RITHGTNSPFMKELILKLEEAQAE 358
+THG + P ++EL L +A E
Sbjct: 348 EVTHGQSHPLVQELCEYLCQASEE 371
>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
Length = 382
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 145/330 (43%), Gaps = 36/330 (10%)
Query: 25 SLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHT 79
+ VE L H++D+ E +L L I N ++ WP S I++I+ F + CN T
Sbjct: 28 TTVEELEDHIADLQEDELKELKVDIHNFLDY---WPRNSKQHTIDDISHLFGVINCNGFT 84
Query: 80 ICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
+ + L+ +G GL+P + ++NH C PN ++ +R++ + +G E+ ++Y++
Sbjct: 85 VSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFL 144
Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
+ R++ LK QY F C C C R KDD R+ D
Sbjct: 145 NLSEERRRLLKTQYFFDCQCEHCRN----------------RTKDD--LKLAGREVD--- 183
Query: 199 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFS 258
GL S+E++K+ + +K G++ EVV + + +
Sbjct: 184 ------GLKPSEEQVKEATDYCFQMLEKMDKARLDGDYHEVVKICRDAIDRTEPVLADTH 237
Query: 259 VNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLG 318
+ L++ L ++ L+ + +A Y + + Y ++YP + LG+ G W G
Sbjct: 238 IYLLRMWSTLSEVQAYLQYFNDAATYSRKMVEGYMKLYPPNNAALGMAAMRAGVNHWQAG 297
Query: 319 DTENAIKSMTEAVEILRITHGTNSPFMKEL 348
E + +A IL +THG P K+L
Sbjct: 298 LIEVGHGMVCKAYAILLVTHGPTHPITKDL 327
>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
Length = 265
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 74/123 (60%), Gaps = 1/123 (0%)
Query: 33 HMSDIDEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTICNSELRPLGTG 91
H ++D+++ LYAQ+A + + +W +A+ S L+CN HT+C+ ELRPLG
Sbjct: 143 HWDELDDRRKQLYAQMAVVTWWVARWGTWPGFRTVAQLLSLLSCNCHTVCDEELRPLGVA 202
Query: 92 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 151
LYP +++NHSC P+ V F G +RA++ + G E+ I+YIE A + R++ L +
Sbjct: 203 LYPTGALVNHSCSPSTVQTFHGSTLELRALRQLAPGDEITIAYIELAATRQERRETLADS 262
Query: 152 YLF 154
Y F
Sbjct: 263 YFF 265
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 131 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 244
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
++ LK+QY F C+C C K G DD+ FL D K
Sbjct: 245 RRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP------ 279
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
S+E +K++ +K S G + EVV + + Q+ ++ ++ +++
Sbjct: 280 ----SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRL 335
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
++L L+ ++EA Y + + Y ++Y + LG+ G W G E
Sbjct: 336 LSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGH 395
Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
+ +A IL +THG + P K+L
Sbjct: 396 GMICKAYAILLVTHGPSHPITKDL 419
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 157/351 (44%), Gaps = 45/351 (12%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICN 82
++ L H+ DE + L + ++ + WP + + I+ FS ++CN T+ +
Sbjct: 124 IDDLQNHIDKFDEAEKGLLME--DVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSD 181
Query: 83 SE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
L+ +G G++P + + NH C PN ++F +RA+ + KG E+ +SY++ T
Sbjct: 182 QRGLQAVGVGIFPNLCLANHDCWPNCTVIFNNGKIELRALGKINKGEELTVSYVDFLNLT 241
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
R+ LK+QY F CTC C K + A+L ++D +DK
Sbjct: 242 EDRKAQLKKQYYFDCTCEHCTKKTK------DALL------------LAVKDGEDK---- 279
Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
EE ++ EV SK T+ S G + +VV + K Q+ ++
Sbjct: 280 --------PEE--RVVKEVIQYSKDTMEKIEKARSEGLYNDVVKLCRDCLKRQEPIFADT 329
Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
++ +++ ++L L+ + +A + + Y ++Y Q + LG+ G W
Sbjct: 330 NIYMLRILSIYSEVLSYLQMFDDAAENAKKMVDGYLKIYHQNNAQLGMAVMRAGVTHWHA 389
Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
G E + +A IL ITHG P K+ L++ AQ E ++ ++E
Sbjct: 390 GMIEVGHGMICKAFAILLITHGPLHPITKD--LEVMRAQTEMELRMFKENE 438
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 150/328 (45%), Gaps = 49/328 (14%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 249
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
++ LK+QY F C+C C K G DD+ FL D K
Sbjct: 250 RQQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAVKEDPKP------ 284
Query: 205 GLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
S+E +K E+ SK TL S G + EVV + + Q+ ++ ++
Sbjct: 285 ----SQEVVK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLY 336
Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
+++ + ++L L+ ++EA Y + + Y ++Y + LG+ G W G
Sbjct: 337 VLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHI 396
Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
E + +A IL +THG + P K+L
Sbjct: 397 EVGHGMICKAYAILLVTHGPSHPITKDL 424
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLHLSEER 249
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
++ LK+QY F C+C C K G DD+ FL D K
Sbjct: 250 RRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAAKEDPKP------ 284
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
S+E +K++ +K S G + EVV + + Q+ ++ ++ +++
Sbjct: 285 ----SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNLYVLRL 340
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
++L L+ ++EA Y + + Y ++Y + LG+ G W G E
Sbjct: 341 LSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGHIEVGH 400
Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
+ +A IL +THG + P K+L
Sbjct: 401 GMICKAYAILLVTHGPSHPITKDL 424
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 152/339 (44%), Gaps = 39/339 (11%)
Query: 18 STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
ST +D + ++ L H++D+ E L L I N ++ WP S I++I+ F
Sbjct: 114 STVSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDY---WPHNSKQHTIDDISHIFG 170
Query: 72 KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+ CN ++ + L+ +G GL+P + ++NH C PN ++ +RA+ + +G E+
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIAEGEEL 230
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
++Y++ + RQ+ LK QY F CTC C R KDD G
Sbjct: 231 TVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN----------------RIKDDIKIGG- 273
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK-MIEKL 249
R+ D G+ S+E++K+ + +K G++ EVV + IEK
Sbjct: 274 -REED---------GVKPSEEQVKEATDYCFQMLEKMEKARLNGDYHEVVKICRECIEKT 323
Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
+ L + L++ + ++ L+ + +A Y + + Y ++Y + LG+
Sbjct: 324 EPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAADYARKMVEGYMKLYHPNNAALGMAAMR 382
Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G E + +A IL ITHG P K+L
Sbjct: 383 AGVTHWQAGQIEVGHGMICKAYAILMITHGPIHPITKDL 421
>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
Length = 241
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 43/260 (16%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F +L CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E
Sbjct: 1 FPQLICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIQAGEE 60
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
+ I Y++ ++ R++ L+ QY F C C RC
Sbjct: 61 LTICYLDMLMTSAERREQLRNQYCFDCDCARC---------------------------- 92
Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
L +D D T + +E +KKI E+ K L C Q ++S+ E+L
Sbjct: 93 LTQDKDADMLTGDEKVWKEVQESLKKI-EELKAEWKWEQVLALC---QAIISSNS--ERL 146
Query: 250 QKKLYHPFSVNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+N+ Q + + + + L +EAL Y T+ Y+ +P HP+ G+Q
Sbjct: 147 P-------DINIYQLKVLDCAMDACIRLGLLEEALFYGIRTMEPYRIFFPGCHPVRGVQV 199
Query: 308 YTCGKLEWFLGDTENAIKSM 327
GKL+ G A++++
Sbjct: 200 MKVGKLQLHQGMFPQAMRNL 219
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 144/337 (42%), Gaps = 35/337 (10%)
Query: 18 STTTDN-YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSK 72
S +DN + +E L H+ DI E L + ++ N + WP S ++ ++
Sbjct: 114 SVVSDNQLTTLEDLEDHICDISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGV 171
Query: 73 LACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
+ CN + + L+ +G GL+P + ++NH C PN ++ +RA+ + G EV
Sbjct: 172 INCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGKIELRALGKISAGEEVT 231
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
++Y++ + RQ+ LK+QY F CTC C + + D A ++G + +
Sbjct: 232 VAYVDYLNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE-------- 283
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
E++K++ +K N+ EVV + + Q+
Sbjct: 284 -------------------EQVKEVTEFSRQKLEKMEKARIEANYNEVVKICRECVEKQE 324
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
+ + ++ L ++L L+ + EA Y + + Y ++Y + LG+ G
Sbjct: 325 NVLADTHIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHPNNAQLGMATMRAG 384
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G E A + +A IL ITHG P K+L
Sbjct: 385 VTHWQAGFIEVAHGMICKAFAILMITHGPTHPITKDL 421
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 155/352 (44%), Gaps = 47/352 (13%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
++ L H+ + DE++ L + + + WP + + I+ F + CNA T+
Sbjct: 129 IDDLQNHVENFDEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ L+ +G G++P + NH C PN ++F +RA+ + G E+ +SY++
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNV 245
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
+ RQK LK+QY F CTC C K + KDD +++ D K
Sbjct: 246 SEERQKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP-- 285
Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHP 256
S E +K EV SK TL + G + EVV + K Q+ +
Sbjct: 286 --------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGD 333
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
++ L++ ++L L+ ++EA Y + + Y ++Y + LG+ G W
Sbjct: 334 TNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWH 393
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
G E + +A IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 394 AGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443
>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
Length = 437
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 151/331 (45%), Gaps = 39/331 (11%)
Query: 26 LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
L+ + AH+ D+D EK+ + A IA L L +P+ + FS++ CN T+
Sbjct: 131 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 188
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
+ EL LG ++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T
Sbjct: 189 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 248
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC 201
R + L++ Y F+C C C D ++ S R + D ++D
Sbjct: 249 ADRLERLRDMYYFSCDCKECTT-KSMDVVKMSV-----RKRSDEIGEKEIKD-------- 294
Query: 202 QQCGLVR----SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
+VR S E ++ + NIL + L C + +ST ++
Sbjct: 295 ----MVRYARNSMENFRRAKQDKNILC-GSFILEMCELSIDKMST----------VFDDS 339
Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
+V ++ + + I + ED++ A+ Y + I + +YP + + + G+L L
Sbjct: 340 NVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLGRLYIAL 399
Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
I + +A+ I+ + HG + ++ EL
Sbjct: 400 DRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 430
>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
mutus]
Length = 221
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/258 (28%), Positives = 123/258 (47%), Gaps = 39/258 (15%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F ++ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ V G E
Sbjct: 1 FLQVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDVEAGEE 60
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
+ I Y++ ++ R+K L+ QY F C C RC Q D
Sbjct: 61 LTICYLDMLMTSEERRKQLRGQYCFDCDCFRC----QTQD-------------------- 96
Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
+D+D ++ E++ K EV KK L + ++V++ + I
Sbjct: 97 --KDAD----------MLTGDEQVWK---EVQESLKKIEDLKAHWKWEQVLAMCQSIISS 141
Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
+ ++ ++ + + + L +EAL Y T+ Y+ YP HP+ G+Q
Sbjct: 142 NAERLPDINIYQLKVLDCAMDACLNLGLLEEALFYGIRTMEPYRIFYPGNHPVRGVQIMK 201
Query: 310 CGKLEWFLGDTENAIKSM 327
GKL+ G A+K++
Sbjct: 202 VGKLQLHQGMFPQAMKNL 219
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 151/345 (43%), Gaps = 59/345 (17%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
V+ L H+ + +E+Q L + + +N WP + S+ I+ F + CNA T+
Sbjct: 129 VDDLQNHVDNFGEEEQKDLRVDVDSFLNF---WPAQSQQFSMQYISHIFGVINCNAFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + ++NH C PN ++F G V+++ H + +G
Sbjct: 186 DQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + R+K LK+QY F CTC C K G DD+
Sbjct: 246 EELTVSYVDFLNVSQERKKQLKKQYYFDCTCEHCEK-GIKDDL----------------- 287
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTY 243
FL +D K + + EV SK+TL S G + EVV
Sbjct: 288 -FLAVKADPK--------------PSQDVVKEVTQFSKETLEKIDKARSEGLYHEVVKLC 332
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ + Q+ + ++++++ ++L L+ + EA Y + + Y ++Y + L
Sbjct: 333 RECLQKQEPVLADTNIHVLRILSIASEVLSYLQSFHEASDYAKRMVDGYMKLYHPNNAQL 392
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G+ G W G E + +A IL +THG P K+L
Sbjct: 393 GMAVMRAGLTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 153/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L L V++ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRLD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P ++++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNVSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 QKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAI 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 67/97 (69%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ +
Sbjct: 170 DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDI 229
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G E+ I Y++ ++ R+K L++QY F C C RC
Sbjct: 230 EAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 47/352 (13%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
++ L H+ DE++ L + + + WP + + I+ F + CNA T+
Sbjct: 124 IDDLQNHVESFDEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ L+ +G G++P + NH C PN ++F +RA+ + G E+ +SY++
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGKIELRALSKISPGDELTVSYVDFLNV 240
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
+ R+K LK+QY F CTC C K + KDD +++ D K
Sbjct: 241 SEERRKQLKKQYYFDCTCEHCKK----------------KIKDD--LMLAVKEGDKKP-- 280
Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTYKMIEKLQKKLYHP 256
S E +K EV SK TL + G + EVV + K Q+ +
Sbjct: 281 --------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLCRECLKKQEPVLGD 328
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
++ L++ ++L L+ ++EA Y + + Y ++Y + LG+ G W
Sbjct: 329 TNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQLGMAVMRAGVTHWH 388
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
G E + +A IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 389 AGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 438
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 150/345 (43%), Gaps = 59/345 (17%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
V+ L H+ + +E+Q L + + +N WP + S+ I+ F + CNA T+
Sbjct: 129 VDDLQNHVDNFGEEEQKDLRMDVDSFLNF---WPPQSQQFSMQYISHIFGVINCNAFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + ++NH C PN ++F G V+++ H + +G
Sbjct: 186 DQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SYI+ + R+K LK+QY F CTC C K G DD+
Sbjct: 246 EELTVSYIDFLNISQERKKQLKKQYYFDCTCEHCEK-GIKDDL----------------- 287
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTY 243
FL D K + + EV SK TL S G + EVV
Sbjct: 288 -FLAVKEDPK--------------PSQDVVKEVTQFSKDTLEKIDKARSEGLYHEVVKLC 332
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ + Q+ + ++++++ ++L L+ ++EA Y + + Y ++Y + L
Sbjct: 333 RECLQKQEPVLADTNIHVLRILSIASEVLSYLQSFQEASDYAKRMVDGYMKLYHPNNAQL 392
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G+ G W G E + +A IL +THG P K+L
Sbjct: 393 GMAVMRAGLTHWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 437
>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
Length = 909
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 23/299 (7%)
Query: 66 IAENFSKLACNAHTIC------------NSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
+ + +L CNA I + ++R L T ++P +++NHSC PN + F+G
Sbjct: 606 LCHHMQQLRCNAQAITTLQEQDSVSLLEDKQVR-LATAVFPTEALLNHSCRPNVFVSFQG 664
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
+ +VRAV H+ G E+L Y AG + RQ ALKEQY F+C+C C + Q +
Sbjct: 665 KTLIVRAVSHIKPGEELLHCYGPHAGRMVYGERQAALKEQYFFSCSCDACQE--QVGNPN 722
Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ--QCGLVRSKEEIKKIASEVNILSKKTLA 229
+ Y+C + L DK C C ++IK + ++ L ++ A
Sbjct: 723 TVDMFSAYKCPVCNNAAKL----QDKKLVCTSPNCDAQGDTDDIKTTSKKIQDLFVQSSA 778
Query: 230 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 289
G QE VS K LQ K+ HP + ++ +T + L + D K+++ + + ++
Sbjct: 779 FLEEGQIQEAVSGLKQCLVLQWKILHPSNKDIARTHDALARCYATSGDCKKSVEHLKKSL 838
Query: 290 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + L + + +L++ + ++ A+ I +G + P KEL
Sbjct: 839 STVELQFGACSVELAHELHKLAQLQFNGQQVAECLDTIERALTIFGCHYGDHHPKSKEL 897
>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
Length = 374
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 122/254 (48%), Gaps = 18/254 (7%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+A S+++ N+ TI + + LG GLY S ++HSC PNA FEG VR ++ +
Sbjct: 95 MAFYLSQISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANFEGTKVCVRCIEPIE 154
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
++ ISY+ T TR+K L ++YLF CTC C+ DI + + ++C G
Sbjct: 155 DEKDLRISYVSPLDDTATRRKNLLQKYLFECTCEACL------DIDQDQMKFSFKCVRGG 208
Query: 186 CSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNI------LSKKTLALTSCGNHQE 238
C G + R D++ F C CG + +++EI+ + V +SKK L + G + E
Sbjct: 209 CKGHMTRMQDNR-FRCDYCGEKQVTEDEIQTLNVAVEKTRRMMDISKK---LKAQGRYAE 264
Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRVYP 297
++ K H ++ +++ R+ L+ ++W C + + Y+ Y
Sbjct: 265 IMVAGFACLKDAVDFLHSDNLLIVELRDCLLNACRMTQNWSVEAIQCGIANLEPYRSFYG 324
Query: 298 QFHPLLGLQYYTCG 311
+ HP L + Y G
Sbjct: 325 RHHPALAILLYDIG 338
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 154/341 (45%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F ++CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
D K S+E +K++ LSK TL S G + EVV +
Sbjct: 291 VKDDPKP----------SQEAVKEMIQ----LSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 QKQEPVFADTNLYTLRMLSTVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 131/300 (43%), Gaps = 42/300 (14%)
Query: 58 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
+RA+ + +G E+ +SYI+ + R+K LK+QY F CTC C K G DD+
Sbjct: 218 NGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCEK-GIKDDL-- 274
Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL---- 228
FL G+ + + + EV SK L
Sbjct: 275 ----------------FL--------------GVKDEPKPSQDVVKEVIQFSKDALEKID 304
Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT 288
S G + EVV + + Q+ + ++ L++ ++L L+ ++EA Y +
Sbjct: 305 KARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYLQSFEEASDYAKKM 364
Query: 289 IPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ Y ++Y + LG+ G W G E + +A IL +THG P K+L
Sbjct: 365 VDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILLVTHGPTHPITKDL 424
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 151/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L L V+ LQ WP + S+ I+ F ++CN T+ +
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNLSEDRKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 PVFADTNLYTLRMLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L L V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P ++++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNVSEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 QKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAI 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
Length = 374
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/324 (25%), Positives = 144/324 (44%), Gaps = 44/324 (13%)
Query: 30 LVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIA-ENFSKLACNAHTICNSELR 86
LVAH+ +I D K++ + ++ L + E E F K+ N +TI N E
Sbjct: 71 LVAHVEEIKRDAKRVAQFQRLWMTAGAFLGDKYLPPAETGLEIFGKMVVNTYTITNYEEC 130
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA--EVLISYIETAGSTMTR 144
P+GTGLY SI++HSC PNA VF+GR +RA + + + +SYI+ TR
Sbjct: 131 PIGTGLYIGPSILDHSCTPNAQAVFDGRTLRLRAAADIECDSVDGIRVSYIDLKELRRTR 190
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+ L+E+Y F+C CPRC D+ QE T +
Sbjct: 191 IEELRERYYFSCNCPRC----SADEAQEY-------------------------LTEKSP 221
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
L +E+ + + + +++ L G + ++S++K+ + +
Sbjct: 222 SLTAKAKELLREFMQAEVHAREDLVRLR-GQAEALLSSHKLPATDAAR---------SEA 271
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
E L K +E+ D+ AL Y Y+R Y + P+ G+ +T KL + A+
Sbjct: 272 LELLSKSCLEMGDFDAALPYYLAKEETYRRCYGPYSPVYGVLLFTIAKLYHHKVLLQEAL 331
Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
+ +A E++ ++HG N ++L
Sbjct: 332 QYFEKASEVVTVSHGRNHRLYRDL 355
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 145/316 (45%), Gaps = 35/316 (11%)
Query: 58 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
WP + + I+ FS ++CN T+ + L+ +G ++P + + NH C PN ++F
Sbjct: 153 WPSQSQQFGMQYISHIFSVISCNGFTLSDQRGLQAVGVAIFPNLCLTNHDCWPNCTVIFN 212
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
+RA+ + KG E+ +SY++ T R + LK+QY F CTC C + D
Sbjct: 213 NGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCT--NKTKDALL 270
Query: 173 SAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
A+ +G +D +++ S D + E+I+K SE
Sbjct: 271 LAVNDGESKPEDRVVKEVIQYSKD------------TMEKIEKARSE------------- 305
Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
G++ +VV + K Q+ ++ ++ +++ ++L L+ +++A + I Y
Sbjct: 306 -GSYHQVVKLCRECLKRQEPIFADTNIYMLRILSIYSEVLSYLQMFQDAAENAKKMIDGY 364
Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKL 352
++Y + LG+ G +W G E + +A IL ITHG + P K+ L++
Sbjct: 365 MKIYHPNNAQLGMAIMRAGVTQWHAGMIEVGHGMICKAFAILLITHGPSHPITKD--LEM 422
Query: 353 EEAQAEASYKLSSKDE 368
+Q E ++ ++E
Sbjct: 423 MRSQTEMELRMFKENE 438
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 146/323 (45%), Gaps = 39/323 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
V H + ++K+L + L + Q + S+ I+ F + CN T+ + L+ +G
Sbjct: 136 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 194
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R++ LK
Sbjct: 195 VGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLK 254
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRS 209
+QY F C C C K + KDD FL G+ +
Sbjct: 255 KQYYFDCACEHCQK----------------KLKDD---LFL--------------GVKDN 281
Query: 210 KEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
+ +++ E+ SK TL S G + EVV + + Q+ ++ ++ ++
Sbjct: 282 PKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNIYTLRML 341
Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
+ ++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E
Sbjct: 342 SIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHG 401
Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
+ +A IL +THG + P K+L
Sbjct: 402 MICKAYAILLVTHGPSHPITKDL 424
>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
Length = 532
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/366 (23%), Positives = 157/366 (42%), Gaps = 51/366 (13%)
Query: 23 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI-------NEIAENFSKLAC 75
++ +V L +H+ D N+ + + P+ ++ + IA ++++
Sbjct: 154 DWEMVCRLPSHVDDFKRN-----GTYGNIEMMAMGAPQFALPPNMFDRDFIAAMYARVMS 208
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
NA TI L PLG L P + +NHSC PNA ++ +G +R ++ + K E+ ISYI
Sbjct: 209 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAFIMMDGPSVSIRTLRPIRKDKEIFISYI 268
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
+T RQ+ L+ ++ FTC C +C QE A L+ S F+L
Sbjct: 269 DTTYPYHKRQEELQTRWFFTCRCAKC---------QEKATLQEDNWLVPADSKFVLDPEA 319
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
+ Q E++K+++E I K + L SC + + Y
Sbjct: 320 KQAMAQTQEQTFALYGELQKLSTEHVIHGLKQI-LASC---------------YESRFYP 363
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP-QFHPLLGLQYYTCGKL 313
+ + R+ L+ L+ + +++A A C + ++YP FHP+ +Q + L
Sbjct: 364 MYRQPYAEARDVLVVNLLSVGKFQDAWAQCAKRYKYILPKLYPVPFHPVRVVQTWQMAML 423
Query: 314 EWFLGDTENAIKS------------MTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
+L TE + + + + +++ ++HG N+ F K + K EE E
Sbjct: 424 AAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAHLSHGPNNAFTKSVKEKAEEMIEELKR 483
Query: 362 KLSSKD 367
+ + D
Sbjct: 484 SVGNPD 489
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 150/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L L V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRLD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P ++++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNISEERKKQLKRQYYFDCTCEHCQK-GLKDDL------------------FLG 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 PVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAIMRAG 400
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 131/297 (44%), Gaps = 35/297 (11%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V +H+ +D EK+ L+ + IA L L++P+ S
Sbjct: 111 RILAKQKIHPERTPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKYLEFPDHS 170
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 171 --SLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT-------------------- 268
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+D D ++ E I+ + + ++ + E++
Sbjct: 269 --------TKDKDKAKVEVRKLSSPPQAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEI 320
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF 299
++ ++ ++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 321 CELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGQKIIKPYSKHYPVY 377
>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 528
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 79/146 (54%), Gaps = 12/146 (8%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQ 122
+I + F + CN TICN+ PLG GLYP +++NHSC PN ++ F+ + AV+ ++
Sbjct: 202 QITKLFCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIE 261
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY--- 179
+ G E+ ISY++ T R+K L+ Y F CTC RC QE A LEG+
Sbjct: 262 TIHSGQELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCT------SSQEDAFLEGFLCT 315
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCG 205
+C++ S +L D + F C C
Sbjct: 316 QCRNTSESARIL-DEKKRIFRCCVCA 340
>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
Length = 452
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 138/321 (42%), Gaps = 69/321 (21%)
Query: 38 DEKQLLLYAQIANLVNLILQWPEISINEIAEN-------FSKLACNAHTICNSELRPLGT 90
D K++ + I ++N L NEI N + ++ N+ I N E++ +GT
Sbjct: 180 DTKRMEYFMTICGVLNEYLS------NEILPNSVELLGIYGRMCINSFNILNGEMQAIGT 233
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETAGSTMTRQKAL 148
G+Y SI++HSC PNAV F+G ++ Q +PK + ISYI+ S R+K L
Sbjct: 234 GIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLMNSKSHRKKEL 293
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
K++Y F C CPRC + DDI C + K TCQ
Sbjct: 294 KDRYYFDCDCPRC----KNDDID--------------CYHYAA-----KCPTCQ------ 324
Query: 209 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 268
K + ++V++ K+ Q+ L+H + + +
Sbjct: 325 -----KPVIAKVDL------------------DVCKLCIAKQENLFHDLCLVRSKIMDAA 361
Query: 269 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI--KS 326
++L+ W+ A+ Y I Y+ Y + HPL + K+ + T +++ K
Sbjct: 362 FDSAIQLQLWEAAIQYGIPLIQAYKLWYGEEHPLTAILLLKLFKITLLISTTNDSVAQKY 421
Query: 327 MTEAVEILRITHGTNSPFMKE 347
EA +IL ITHG +S F ++
Sbjct: 422 YEEAAKILEITHGRDSSFYRQ 442
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 151/338 (44%), Gaps = 56/338 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WPE----ISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP SI I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+A+ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL- 190
+SYI+ T R+K LK+QY F C C C K + KDD FL
Sbjct: 250 VSYIDFLNVTEERRKQLKKQYYFDCACEHCQK----------------KLKDD---LFLA 290
Query: 191 LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
++D+ + + SKE + KIA S G + EVV + + Q
Sbjct: 291 VKDNPKPSQEVVKEMIQFSKETLDKIAK-----------ARSEGLYHEVVKLCRECLQKQ 339
Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 340 EPVFADTNLYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAVMRA 399
Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 400 GLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 156/365 (42%), Gaps = 57/365 (15%)
Query: 6 LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE----I 61
L R ++ ++ + ++E ++ M+ D K+L A++ WP+ +
Sbjct: 111 LWRIQKHTGLVSDSQLTTLDMLEDHLSRMTPEDLKELK-----ADVKTFYTYWPKKSKAV 165
Query: 62 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF--------- 111
+ ++ F ++CN T+ + L+ +G GL+P + ++NH C PN ++
Sbjct: 166 GEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDCWPNCTVILNHGDQSALD 225
Query: 112 ----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
R +RA++ + G E+ +SY++ + RQ+ L++QY F C C C+ G
Sbjct: 226 ASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHCVN-GTK 284
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
D++ + + +DG K+ + ++ S +
Sbjct: 285 DEL-----MTAVKPTEDG------------------------KQPSADVVKQLTDFSLQA 315
Query: 228 L----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 283
L A + GN EV+ + + Q ++ +V++++ ++L L+ ++EA
Sbjct: 316 LVKIEAARAQGNFHEVIRICRECLEKQDPVFADTNVHVLRVLSTASEVLSFLQQFQEAAG 375
Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
Y Q + Y ++Y + LG+ G W G E A + A IL ITHG + P
Sbjct: 376 YAQRMVDGYMKLYHPNNAQLGMAIMRAGVTHWHAGLIEAAHGLICRAYGILMITHGPHHP 435
Query: 344 FMKEL 348
++L
Sbjct: 436 ISRDL 440
>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 471
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 168/362 (46%), Gaps = 52/362 (14%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLILQW-----PEISINEIAENFSKLACNAHTIC 81
VE L H++D+ E L +I V++ LQ+ + S++EIA F + CN T+
Sbjct: 129 VEELEDHVADLPEDHL---KRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNGFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVF-------------EGRLAVVRAVQHVPKG 127
+ L+ +G GL+P + ++NH C PN +V R +RA++ + +G
Sbjct: 186 DQRGLKAVGVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIELRALRKICEG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ ++ RQ+ LKE + F CTC C + + DD+ +A DG
Sbjct: 246 EELTVSYVDFLDTSAERQRKLKEHFYFECTCEHCRQHIK-DDLMTAAAA-------DGPG 297
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV-VSTYKMI 246
G + S D+ Q S+E ++KI E +++ K + QEV + K +
Sbjct: 298 G---KPSADQ----VQEVTAFSQECLEKI--ERSLMDK---------DFQEVRMLCSKCL 339
Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
EK QK L ++ ++ +++L L + EA AY + + Y ++Y LG+
Sbjct: 340 EK-QKNLLADTHLHHLRVLSAAVEVLSYLRCFSEAAAYARRMVQGYTKLYHPNSAQLGIA 398
Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSK 366
G G E A + + +A IL +THG N ++ L+ Q E KL +
Sbjct: 399 VMRAGVTHLQAGMIEMAHELICKAYRILLVTHGPNHSVTRD--LEAMRRQTEVELKLLKQ 456
Query: 367 DE 368
DE
Sbjct: 457 DE 458
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 153/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 QKQEPVFADTNIYMLRILSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
Length = 230
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 83/145 (57%), Gaps = 2/145 (1%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLILQWPEISINE-IAENFSKLACN 76
T ++ V+ +H+ +D EK+ L+ + IA L + + E N+ + F+++ CN
Sbjct: 39 TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCN 98
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SYI+
Sbjct: 99 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKPGEEVFTSYID 158
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC 161
T R L++ Y FTC C C
Sbjct: 159 LLYPTEDRNDRLRDSYFFTCECQEC 183
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+
Sbjct: 136 VEHFGEEEQKELRMD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH+C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNFSEERKKQLKKQYYFDCTCEHCQK-GLKDDL------------------FL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSKGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYTLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAI 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHSMICKAYAILLVTHGPSHPITKDL 437
>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 2-like [Xenopus (Silurana) tropicalis]
Length = 430
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 158/346 (45%), Gaps = 48/346 (13%)
Query: 19 TTTDNYSLVEALVAHMSDIDEKQL-LLYAQIANLVNLILQWPEISINEIAEN-FSKLACN 76
T ++ + V+ +H+S +D ++L L+ IA L + S N F+++ CN
Sbjct: 121 TASERFLSVKDFESHLSKLDNEKLELIQNDIAALHRFYSKNLHYSDNAAQVFLFAQVNCN 180
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
TI + EL LG+ ++P ++++NHSC PN ++ ++G +A VRAVQ + G EV+ SYI+
Sbjct: 181 GFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDEVITSYID 240
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC-------IKLGQFDDIQESAILEGYRCKDDGCSGF 189
T R L + C C C +K+ + + + ++ K+ CS
Sbjct: 241 LLYPTEDRNDRLIDSXFCNCDCRECSTKQKGRLKIAEISLKIHPPLDKXWKNKERSCSVP 300
Query: 190 LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
LL L T+A S E++ ++ +
Sbjct: 301 LL------------------------------YLFPLTIAPPS-----ELLEICELSLEK 325
Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
++ +V ++ + + + + ++DW+ AL Y + I Y + YP + + +
Sbjct: 326 MGSVFEDSNVYMLHMMYQAMGVCLYMQDWEGALKYGEKIIKPYSKHYPPYSLNVASMWLK 385
Query: 310 CGKLEWFLGDTENAI--KSMTEAVEILRITHGTNSPFMKELILKLE 353
G+L ++G + AI K++ +A+ I+ I HG + ++ E+ +LE
Sbjct: 386 LGRL--YMGLEKKAIGTKALKKAIAIMDIAHGPDHHYIAEIKKELE 429
>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
Length = 239
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 43/257 (16%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP IS++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+K L++QY F C C RC +D D T +
Sbjct: 61 RKQLRDQYCFECDCFRCQT----------------------------QDKDADMLTGDEQ 92
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
+E +KKI E+ K L C Q ++S+ E+L +N+ Q
Sbjct: 93 VWKEVQESLKKI-EELKAHWKWEQVLAMC---QAIISSNS--ERLP-------DINIYQL 139
Query: 265 R--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
+ + + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G
Sbjct: 140 KVLDCAMDACINLGLLEEALFYGTRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQ 199
Query: 323 AIKSMTEAVEILRITHG 339
A+K++ A +I+R+THG
Sbjct: 200 AMKNLRLAFDIMRVTHG 216
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 151/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH+C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFLG 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 291 VKDDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 PVFADTNLYTLRLLSTVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 QKQEPVFADTNIYTLRMLSIVSEVLSYLQAFEEASDYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 88/159 (55%), Gaps = 7/159 (4%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L++P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDS- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
Length = 239
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSNRLPDINIYQLKV 141
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 325 KSMTEAVEILRITHG 339
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 130 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 183
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 184 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 243
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 244 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 283
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 284 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 330
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 331 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 390
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 391 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 431
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 155/352 (44%), Gaps = 52/352 (14%)
Query: 18 STTTD-NYSLVEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFS 71
ST +D + ++ L H++D+ E L L I N ++ WP S I++I+ F
Sbjct: 114 STVSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDY---WPHNSKQHTIDDISHIFG 170
Query: 72 KLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV----------- 118
+ CN ++ + L+ +G GL+P + ++NH C PN V++ G + V
Sbjct: 171 VINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIE 230
Query: 119 -RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
RA+ + +G E+ ++Y++ + RQ+ LK QY F CTC C
Sbjct: 231 LRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN-------------- 276
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
R KDD G R+ D G+ S+E++K+ + +K G++
Sbjct: 277 --RIKDDIKIGG--REED---------GVKPSEEQVKEATDYCFQMLEKMEKARLNGDYH 323
Query: 238 EVVSTYK-MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
EVV + IEK + L + L++ + ++ L+ + +A Y + + Y ++Y
Sbjct: 324 EVVKICRECIEKTEPVLADT-HIYLLRMWSTMSEVQAYLQYFDDAADYARKMVEGYMKLY 382
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ LG+ G W G E + +A IL ITHG P K+L
Sbjct: 383 HPNNAALGMAAMRAGVTHWQAGQIEVGHGMICKAYAILMITHGPIHPITKDL 434
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
Length = 239
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTQD----------------------KDAD--------- 85
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSDRLPDINIYQLKV 141
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 325 KSMTEAVEILRITHG 339
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 148/336 (44%), Gaps = 52/336 (15%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
V H + ++K+L A++ + WP S+ I+ F + CN T+ + L
Sbjct: 136 VDHFGEEEQKELR-----ADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
+ +G G++P ++++NH C PN ++F G V+++ H + +G E+ +
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
SYI+ + R++ LK+QY F CTC C K G DD+ FL
Sbjct: 251 SYIDFLNVSEERRRQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGV 291
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
+ K S+E +K++ +K S G + EVV + + Q+
Sbjct: 292 KDNPKP----------SQEVVKEMTEFCKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 341
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
++ ++ ++ + ++L L+ + EA Y + + Y ++Y + LG+ G
Sbjct: 342 VFADTNLYTLRMLSIVSEVLSYLQAFGEAADYAKRMVDGYMKLYHPNNAQLGMAVMRAGL 401
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 402 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
dendrobatidis JAM81]
Length = 503
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 144/315 (45%), Gaps = 50/315 (15%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+ ++++ + + CN TI +S L P+G G YP+ S+ NH C PN + +F+G V++
Sbjct: 228 NLPVSDLIHHLGRFRCNNFTIIDSNLFPVGEGTYPLASLFNHDCWPNCIAIFDGSRVVIQ 287
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
++ + KG E+ ISYI+ +R+ +L+ +Y F C C C+ + L
Sbjct: 288 TIRDIAKGDELCISYIDPILDHDSRRMSLETKYCFNCQCSVCMS-------ESCTPLHSP 340
Query: 180 RCKDDGCSGFLLRDSDDKGF-------TCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
K D L + D G+ TC L +S +E ++ A + I KT+AL S
Sbjct: 341 TTKKDAL--LSLDEKSDVGWRLLKWFHTC----LKKSDDEFER-AQQDQI--SKTIALVS 391
Query: 233 CGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
+ VV+ ++ F +LM+ +DW++A + Y
Sbjct: 392 ---KRAVVAEIGNATQM-------FQSHLMR------------QDWEQAYKSGLHLLGQY 429
Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMK--ELIL 350
YP+++PL+ + + W G T+ I + + L IT+G+ MK LI
Sbjct: 430 LLRYPRYYPLVSQHMFFVAQCMWNSGQTDETIFLLGVIKKCLEITYGS---VMKSHHLIG 486
Query: 351 KLEEAQAEASYKLSS 365
+++ A+A Y+ S+
Sbjct: 487 QVDSLLAQALYEQSA 501
>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
Length = 434
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 148/337 (43%), Gaps = 54/337 (16%)
Query: 26 LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
L+ + AH+ D+D EK+ + A IA L L +P+ + FS++ CN T+
Sbjct: 131 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 188
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
+ EL LG ++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T
Sbjct: 189 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 248
Query: 142 MTRQKALKEQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLL 191
R + L++ Y F+C C C +K+ + D+I E I + R + F
Sbjct: 249 ADRLERLRDMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR- 307
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
R+K++ T L C + +ST
Sbjct: 308 ----------------RAKQD-----------KSPTELLEMCELSIDKMST--------- 331
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ +V ++ + + I + ED++ A+ Y + I + +YP + + + G
Sbjct: 332 -VFDDSNVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLG 390
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+L L I + +A+ I+ + HG + ++ EL
Sbjct: 391 RLYIALDRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 427
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 149/341 (43%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK-GLKDDL------------------FL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + +++ E+ SK TL S G + EVV +
Sbjct: 290 -------------GVKEDPKPSQEVVKEMTQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ ++ + ++L L+ + EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNLYTLRMLSVVSEVLSYLQAFGEAAHYARRMVDGYMKLYHPNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
Length = 423
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 148/337 (43%), Gaps = 54/337 (16%)
Query: 26 LVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLACNAHTIC 81
L+ + AH+ D+D EK+ + A IA L L +P+ + FS++ CN T+
Sbjct: 120 LLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDH--QALLTLFSQVHCNGFTVE 177
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
+ EL LG ++P I+++NHSC PN ++ + G A VRAV+ + G E+ SYI+ T
Sbjct: 178 DEELSNLGLAIFPDIALLNHSCSPNVIVTYRGINAEVRAVKDISPGQEIYTSYIDLLYPT 237
Query: 142 MTRQKALKEQYLFTCTCPRC-------IKLG---QFDDIQESAILEGYRCKDDGCSGFLL 191
R + L++ Y F+C C C +K+ + D+I E I + R + F
Sbjct: 238 ADRLERLRDMYYFSCDCKECTTKSMDVVKMSVRKRSDEIGEKEIKDMVRYARNSMENFR- 296
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
R+K++ T L C + +ST
Sbjct: 297 ----------------RAKQD-----------KSPTELLEMCELSIDKMST--------- 320
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ +V ++ + + I + ED++ A+ Y + I + +YP + + + G
Sbjct: 321 -VFDDSNVYILHMMYQAMGICLFTEDYEGAVRYGEKVIKPFTVLYPAYSMNVASMFLKLG 379
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+L L I + +A+ I+ + HG + ++ EL
Sbjct: 380 RLYIALDRKLAGIDAFQKALTIMEVVHGKDHTYVTEL 416
>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
carolinensis]
Length = 334
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 94/169 (55%), Gaps = 11/169 (6%)
Query: 9 KLQNDNVIPSTTTDNYSLVEALVAHMSD-IDE------KQLLLYAQIANLVNLILQWPEI 61
KL + PS ++S +++ V +S+ + E K L LY ++ + + Q P
Sbjct: 141 KLLRQSTCPSEELYSFSDLQSNVEKLSEEMKEGLGHLAKTLQLYLKVE--IQDVSQLP-- 196
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
++ E F+K+ACN +I N E++ +G GLYP +S++N+SC PN +VFEG +R++
Sbjct: 197 PALDVFETFAKVACNCFSISNGEMQDVGVGLYPSMSLLNNSCDPNCAIVFEGPELHLRSI 256
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI 170
+ + +G E+ I Y+ET T RQ+ LK QY F C C C + DI
Sbjct: 257 REIQEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDI 305
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 150/339 (44%), Gaps = 61/339 (17%)
Query: 27 VEALVAHMSDI---DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHT 79
V+ L H++D+ D KQL A++ + W + S++ I+ F + CN T
Sbjct: 129 VDQLQEHVADLAAEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFT 183
Query: 80 ICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
+ + L+ +G GL+P + ++NH C PN ++ +RA+ +P+G E+ +SY++
Sbjct: 184 LSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIPEGEELTVSYVDFL 243
Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
+ RQK LKE++ F CTC C K KDD + +D K
Sbjct: 244 NLSADRQKKLKERFHFDCTCEHCSK----------------HIKDD----LMTAVADSKP 283
Query: 199 FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NHQEVVS-TYKMIEKLQKKL 253
S +++K EV SK++L ++ EV+ Y+ ++K + L
Sbjct: 284 ----------SADKVK----EVTAFSKESLEKIEKSRVERDYNEVLKLCYECLQKQENVL 329
Query: 254 YHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPVYQRVYPQFHPLLGLQYYT 309
NL + R ++ I E+ ++ + A Y + Y+++Y + LG+
Sbjct: 330 A---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLYHPNNAQLGMAIMR 384
Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G E + +A IL +THG N K+L
Sbjct: 385 AGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 423
>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
Length = 614
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 16/236 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
GL+P +SI+NHSC PN EG + VRAVQ +PKGAE+ +SYI S R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 151 QYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQQ 203
F CTC RC+ + + LEG C GC G LR + +D+ + C
Sbjct: 356 TKHFDCTCDRCV---EPLNTSIDRFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECDL 412
Query: 204 CGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPF 257
C V E ++ +++K + +A + + K++E + HP
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472
Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
V L L+ + D +E L C+L + ++V P L Y+ G++
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 142/320 (44%), Gaps = 56/320 (17%)
Query: 52 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 105
V+ LQ WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210
Query: 106 NAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQY 152
N ++F G V+++ H + +G E+ +SYI+ + R++ LK+QY
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQY 270
Query: 153 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 212
F CTC C K G DD+ FL G+ +
Sbjct: 271 YFDCTCEHCQK-GLKDDL------------------FL--------------GVKDDPKP 297
Query: 213 IKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKL 268
+ + EV SK TL + G + EVV + + Q+ ++ ++ +++ +
Sbjct: 298 SQDVVKEVVQFSKDTLEKIDKARTEGLYHEVVKLCRECLQKQEPVFADTNLYMLRILSIV 357
Query: 269 IKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMT 328
++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E +
Sbjct: 358 SEVLSYLQAFEEASYYARKMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMIC 417
Query: 329 EAVEILRITHGTNSPFMKEL 348
+A IL +THG + P K+L
Sbjct: 418 KAYAILLVTHGPSHPITKDL 437
>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
Length = 239
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 120/255 (47%), Gaps = 39/255 (15%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+K L++QY F C C RC Q D +D+D
Sbjct: 61 RKQLRDQYCFECDCIRC----QTHD----------------------KDAD--------- 85
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
++ E+I K EV KK L + ++V++ + I ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIISSNSDRLPDINIYQLKV 141
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALLYALRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 325 KSMTEAVEILRITHG 339
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 148/336 (44%), Gaps = 52/336 (15%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
V H + ++K+L + ++ + WP + S+ I+ F + CN T+ + L
Sbjct: 136 VEHFGEEEQKELRV-----DVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
+ +G G++P + ++NH C PN ++F G V+++ H + +G E+ +
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
SYI+ + R++ LK+QY F C+C C K G DD+ FL
Sbjct: 251 SYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLAA 291
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 292 KEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEP 341
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
++ ++ +++ ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 342 VFADTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAGL 401
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G E + +A IL +THG + P K+L
Sbjct: 402 TNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 152/341 (44%), Gaps = 62/341 (18%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F C+C C K G DD+ FL
Sbjct: 250 VSYIDFLHLSEERRQQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
D K S+E +K E+ SK TL S G + EVV +
Sbjct: 291 VKEDPKP----------SQEVVK----EMTQFSKDTLEKIDKARSEGLYHEVVKLCRECL 336
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ ++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 337 EKQEPVFADTNLYVLRLLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHHNNAQLGMAV 396
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G E + +A IL +THG + P K+L
Sbjct: 397 MRAGLTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
Length = 614
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 106/237 (44%), Gaps = 18/237 (7%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
GL+P +SI+NHSC PN EG + VRAVQ +PKGAE+ +SYI S R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 151 QYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKG----FTCQ 202
F CTC RC++ L D LEG C GC G LR + +D+ + C
Sbjct: 356 TKHFDCTCDRCVEPLNTSID----RFLEGCVCHVRGCGGVFLRTAALHEDQASSTTWECD 411
Query: 203 QCGLVRSKEEIKKIASEVNILSK---KTLALTSCGNHQEVVSTYKMIEKLQKKL---YHP 256
C + E ++ +++K + +A + + K++E + HP
Sbjct: 412 LCSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHP 471
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
V L L+ + D +E L C+L + ++V P L Y+ G++
Sbjct: 472 LHVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLPGPSLELANFYFCLGEM 528
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 149/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 131 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 244
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F C+C C K G DD+ FL
Sbjct: 245 VSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 285
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 286 AKEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 335
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ +++ ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 336 PVFADTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAG 395
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G E + +A IL +THG + P K+L
Sbjct: 396 LTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 432
>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
norvegicus]
Length = 239
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 119/255 (46%), Gaps = 39/255 (15%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+K L++QY F C C RC +D+D
Sbjct: 61 RKQLRDQYCFECDCIRCPTQD--------------------------KDAD--------- 85
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
++ E+I K EV KK L + ++V++ + I + ++ ++
Sbjct: 86 -MLTGDEQIWK---EVQESLKKIEELKAHWKWEQVLALCQAIINSNSERLPDINIYQLKV 141
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ + + L +EAL Y T+ Y+ +P HP+ G+Q GKL+ G A+
Sbjct: 142 LDCAMDACINLGMLEEALFYAMRTMEPYRIFFPGSHPVRGVQVMKVGKLQLHQGMFPQAM 201
Query: 325 KSMTEAVEILRITHG 339
K++ A +I+++THG
Sbjct: 202 KNLRLAFDIMKVTHG 216
>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
Length = 456
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 149/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 102 VEHFGEEEQKELRVD------VDTFLQYWPLQSQQFSMQYISHIFGVINCNGFTLSDQRG 155
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 156 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 215
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F C+C C K G DD+ FL
Sbjct: 216 VSYIDFLHLSEERRRQLKKQYYFDCSCEHCQK-GLKDDL------------------FLA 256
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 257 AKEDPKP----------SQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 306
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ +++ ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 307 PVFTDTNLYVLRLLSIASEVLSYLQAYEEASHYARRMVDGYMKLYHHNNAQLGMAVMRAG 366
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G E + +A IL +THG + P K+L
Sbjct: 367 LTNWHAGHIEVGHGMICKAYAILLVTHGPSHPITKDL 403
>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
Length = 500
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/373 (24%), Positives = 156/373 (41%), Gaps = 64/373 (17%)
Query: 10 LQNDNVIPSTTTDNY----SLVEALVAHMSDIDEKQLLLYAQIANLVNLI---------- 55
L + + + ++D+Y V+A++ HM+ + + +A +V +
Sbjct: 162 LMSGKAVNAMSSDDYVQDHDDVKAMIDHMAQMRKSNAQEFAGNQEIVRIAEHLLDMLQAR 221
Query: 56 ----LQWP-EISINEIAENFSKLACN-----AHTICNSELRPLGTGLYPVISIINHSCLP 105
L W + + E+ + K ACN A I + + G G+YP+ +I+NHSC P
Sbjct: 222 SPKGLDWSLKPTEEELLKVLCKFACNNFSHAARQIWDDLIVSHGMGVYPLGAILNHSCKP 281
Query: 106 NAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
N V+ + E R ++ + G ++ SYI+ A + TR++ L+ Y F C C +C K
Sbjct: 282 NCVIYYHPETHEQEFRCIEDIQVGEDICHSYIDLAAVSKTRKEKLQSTYYFDCDC-QCCK 340
Query: 164 LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 223
+ D + A G + D G S+ EI+ S + L
Sbjct: 341 FPEELDNKLGAR-----------DGKVTEKCDRAAELLAAAG---SRTEIEHALSRLKDL 386
Query: 224 SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA 283
+ LA GN ++ + + K+++ +EL A+
Sbjct: 387 DENVLA--DRGN---------------------VDLDRLSVKSKMLQASIELGMMDSAIR 423
Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
C+ + Y+ +YP HPLLGLQ YT G L + G E A + E +IL TH S
Sbjct: 424 ACKQVVEGYRGIYPPLHPLLGLQLYTLGNLLFDDGRGEEAADVLQEGQKILLATHDRRST 483
Query: 344 FMKELILKLEEAQ 356
++ + L EA+
Sbjct: 484 MVQGITELLAEAR 496
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 157/365 (43%), Gaps = 60/365 (16%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
++ L H+ + DE++ L + + + WP + + I+ F + CNA T+
Sbjct: 124 IDDLQNHVENFDEEEKKDLRIDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + NH C PN ++F G VR++ H + G
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPG 240
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + RQK LK+QY F CTC C K + KDD
Sbjct: 241 DELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCKK----------------KIKDD--L 282
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
+++ D K S E +K EV SK TL + G + EVV
Sbjct: 283 MLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLC 328
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ K Q+ + ++ L++ ++L L+ ++EA Y + + Y ++Y + L
Sbjct: 329 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVDGYLKIYHPNNAQL 388
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
G+ G W G E + +A IL ITHG + P K+ L++ Q E ++
Sbjct: 389 GMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 446
Query: 364 SSKDE 368
++E
Sbjct: 447 FQQNE 451
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 94/196 (47%), Gaps = 35/196 (17%)
Query: 26 LVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
+V ALV ++ +DE IA LV+L+ S+ NA T+ +
Sbjct: 250 MVHALVRYLGLESLDEFSSYGIDSIAGLVDLV---------------SRFTTNAFTVASP 294
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGR---------LAVVRAVQHVPKGAEVLISY 134
L P+G + P+I++ NHSC PNAV VF +A V ++ +P EV+ISY
Sbjct: 295 TLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISY 354
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDIQESAILEGYRCKDDGCSGF-LL 191
I+T + RQKALKE Y FTC+C C K G + D +ES C + C G L
Sbjct: 355 IDTTLTKRERQKALKETYYFTCSCSLCEKPRDGAYVDSRES-----LYCPKN-CGGLCWL 408
Query: 192 RDSDDKGFTCQQCGLV 207
DD C +CG V
Sbjct: 409 PTEDDPLARCAKCGTV 424
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 83/147 (56%), Gaps = 6/147 (4%)
Query: 19 TTTDNYSLVEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEISINEIAENFSKLA 74
T ++ V +H+ +D EK+ L+ + IA L L++P+ S + F+++
Sbjct: 123 TPSEKLLAVREFESHLDKLDNEKKDLIQSDIAALHQFYSKHLEFPDHS--SLVVLFAQVN 180
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ + G EV SY
Sbjct: 181 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSY 240
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
I+ T R L++ Y FTC C C
Sbjct: 241 IDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 65/108 (60%)
Query: 56 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
+Q I +E+ + N T+ + E+ +G G YPV S+ NH+C PNAV+VF+G L
Sbjct: 178 VQQTSIDEDELVHYLNVFKSNNFTLDDQEMFAVGEGTYPVASLFNHTCRPNAVIVFDGAL 237
Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A +RA+ + E+ ISYI+ A + R++AL+E+Y F C C RC +
Sbjct: 238 AEIRAIDTIEPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTR 285
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/350 (23%), Positives = 147/350 (42%), Gaps = 48/350 (13%)
Query: 18 STTTDN-YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS----INEIAENFSK 72
S +DN + +E L H+ DI E L + ++ N + WP S ++ ++
Sbjct: 114 SVVSDNQLTTLEDLEDHICDISEDDLKDFK--VDIHNFLDYWPRNSKPHTVDSVSHILGV 171
Query: 73 LACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVL------------VFEGRLAV-V 118
+ CN + + L+ +G GL+P + ++NH C PN + VF + + +
Sbjct: 172 INCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHSQKRIEL 231
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
RA+ + G EV ++Y++ + RQ+ LK+QY F CTC C + + D A ++G
Sbjct: 232 RALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTEKIKDDLKMAGAEVDG 291
Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
+ + E++K++ +K N+ E
Sbjct: 292 VKVPE---------------------------EQVKEVTEFSRQKLEKMEKARIEANYNE 324
Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
VV + + Q+ + + ++ L ++L L+ + EA Y + + Y ++Y
Sbjct: 325 VVKICRECVEKQENVLADTHIYYLRVCCTLSEVLSYLQFFDEASEYARKMVDGYLKLYHP 384
Query: 299 FHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ LG+ G W G E A + +A IL ITHG P K+L
Sbjct: 385 NNAQLGMATMRAGVTHWQAGFIEVAHGMICKAFAILMITHGPTHPITKDL 434
>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
leucogenys]
Length = 512
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 150/336 (44%), Gaps = 52/336 (15%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
V H + ++K+L + L + Q + S+ I+ F + CN T+ + L+ +G
Sbjct: 158 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 216
Query: 90 TGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIE 136
G++P + ++NH C PN ++F G V+++ H + +G E+ +SYI+
Sbjct: 217 VGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYID 276
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
+ R++ LK+QY F CTC C K + KDD FL
Sbjct: 277 FLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------ 311
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKK 252
G+ + + +++ E+ SK TL S G + EVV + + Q+
Sbjct: 312 --------GVKDNPKPSQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 363
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 364 VFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGL 423
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 424 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 459
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 87/159 (54%), Gaps = 7/159 (4%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L +P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDN- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ + F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G LA VRAVQ
Sbjct: 170 -DSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQ 228
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ G EV SYI+ T R L++ Y FTC C C
Sbjct: 229 EIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 41/286 (14%)
Query: 77 AHTIC--NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
+HT+ + EL LGT +YP +++INHSCLP+ ++ + G A VRAV+ + G EVLISY
Sbjct: 109 SHTLFSEDEELSHLGTAVYPDVALINHSCLPSVIVTYNGTSADVRAVRDMNPGDEVLISY 168
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS 194
I+ T R L+E Y FTC C C G + D + LR
Sbjct: 169 IDVLYPTEDRNTRLRESYYFTCQCQEC----------------GSQSSDQ--AKLKLRKR 210
Query: 195 DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL- 253
D E + I + V K N ST ++E ++ L
Sbjct: 211 SDPA-------------EAEVINTMVRYARKCIREFRVFKNSNTPAST--LLEMCEQSLD 255
Query: 254 -----YHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYY 308
+ +V ++ + + + + ++D AL Y + Y+++YP + L Y
Sbjct: 256 EMGAVFDESNVYMLHMMYQAMGVCLYMQDPDGALRYGEKIAKYYRKLYPAYSLNLSSLYL 315
Query: 309 TCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
G+L + + I + +A I+ +THG N ++ EL +++E
Sbjct: 316 KLGRLYFGMERNSECISVLKKAKAIMEVTHGKNHFYVTELDRQMKE 361
>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 643
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 93/196 (47%), Gaps = 35/196 (17%)
Query: 26 LVEALVAHMS--DIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
+V ALV ++ +DE IA LV+L+ S+ NA T+ +
Sbjct: 217 MVHALVRYLGLESLDELSSYGIDSIAGLVDLV---------------SRFTTNAFTVASP 261
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGR---------LAVVRAVQHVPKGAEVLISY 134
L P+G + P+I++ NHSC PNAV VF +A V ++ +P EV+ISY
Sbjct: 262 TLTPIGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISY 321
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKL--GQFDDIQESAILEGYRCKDDGCSGF-LL 191
I+T + RQKALKE Y FTC C C K G + D +ES C + C G L
Sbjct: 322 IDTTLTKRERQKALKETYYFTCNCSLCEKPRDGAYVDSRES-----LYCPKN-CGGLCWL 375
Query: 192 RDSDDKGFTCQQCGLV 207
DD C +CG V
Sbjct: 376 PTEDDPLARCAKCGTV 391
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 156/348 (44%), Gaps = 39/348 (11%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
+E L H+ +E++ L A + + + WP + + I+ F ++CN T+
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ L+ +G G++P + NH C PN +VF +RA+ + G E+ +SY++
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGKIELRALNKISPGDELTVSYVDFLSL 245
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
+ R++ LK+QY F CTC C K + KDD +S T
Sbjct: 246 SEERRRQLKKQYYFDCTCEHCKK----------------QLKDDLMLAVKAGESKPSADT 289
Query: 201 CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
++ + SK+ ++KI +K + G++ EVV + + Q+ + ++
Sbjct: 290 VKEV-IQLSKDTLEKI-------NKARME----GHYHEVVKLCRDCLQKQEPVLGDTNIY 337
Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
L++ ++L L+ ++EA Y + + Y ++Y + LG+ G W G
Sbjct: 338 LLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAVMRAGVTHWHAGLI 397
Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKDE 368
E + +A IL ITHG + P K+ L++ Q E ++ ++E
Sbjct: 398 EVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQNE 443
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 124/283 (43%), Gaps = 19/283 (6%)
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
+ N+ I E LG GLY +S INHSC PN + F G A VR+V +P E+ I+
Sbjct: 191 SANSFGISGVEGNNLGVGLYDTVSYINHSCAPNCSITFSGVYARVRSVHDLPPNQELTIA 250
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 189
YI+ R+ LK Q++F C C RC + + DD + C GC
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRCER--ERDDDPLLTLC----CPGSGCDQIIKYQ 304
Query: 190 LLRDSDDKGF-TCQQC--GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 246
L DD F ++C G + + +V K+ L + N M
Sbjct: 305 LTAPPDDTAFEVTKKCERGHLDDPATFDSWSRKV----KRVLGIIGEANKARRPVQGDMA 360
Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
++L P + + ++ + ++EAL + + I VY + PL G++
Sbjct: 361 SF--ERLLPPTNYLMHIAYTGVLDQAIGTGQFQEALQWARKVILVYDALNSPMDPLRGIE 418
Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
KL+ D ++++ +AV+++R+THG +S ++++
Sbjct: 419 LLRVAKLQALARDGRGFLETIQKAVDVIRLTHGEDSVVYQQVL 461
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 146/323 (45%), Gaps = 39/323 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
V H + ++K+L L +L + + WP + ++ I+ + CN T+ + L
Sbjct: 131 VEHFGEEEKKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
+ +G G++P + ++NH C PN ++F +RA+ + G E+ +SY++ R+
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIFNNGKIELRALGKISIGEELTVSYVDFLNVCEDRR 245
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ LK+QY F CTC C K G DD+ A+ E ++ S +++D
Sbjct: 246 QQLKKQYYFDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVKDV----------- 287
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
+ SKE I+KI S G ++EVV + + Q+ + ++ +++
Sbjct: 288 IQFSKEAIEKIDK-----------ARSEGLYKEVVKLCRACLEKQEPVLGDTNIYMLRIL 336
Query: 266 EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIK 325
L ++L L+ EA Y + Y ++Y + LG+ G W G E
Sbjct: 337 SILSEVLSYLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGVTHWHAGLIEAGHG 396
Query: 326 SMTEAVEILRITHGTNSPFMKEL 348
+ +A IL +THG P K+L
Sbjct: 397 MICKAYAILLVTHGPTHPITKDL 419
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 156/365 (42%), Gaps = 60/365 (16%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
++ L H+ DE++ L + + + WP + + I+ F + CNA T+
Sbjct: 124 IDDLQNHVESFDEEEKKDLRVDVESFLEF---WPAQSQQFGMQYISHIFGVINCNAFTLS 180
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + NH C PN ++F G VR++ H + G
Sbjct: 181 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPG 240
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + R+K LK+QY F CTC C K + KDD
Sbjct: 241 DELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK----------------KIKDD--L 282
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
+++ D K S E +K EV SK TL + G + EVV
Sbjct: 283 MLAVKEGDKKP----------SAETVK----EVIQFSKDTLEKINKARLEGTYHEVVKLC 328
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ K Q+ + ++ L++ ++L L+ ++EA Y + + Y ++Y + L
Sbjct: 329 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAAEYAKRMVEGYMKIYHPNNAQL 388
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
G+ G W G E + +A IL ITHG + P K+ L++ Q E ++
Sbjct: 389 GMAVMRAGVTHWHAGLIEAGHSMICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 446
Query: 364 SSKDE 368
++E
Sbjct: 447 FQQNE 451
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 149/337 (44%), Gaps = 54/337 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ L++QY F C+C C K G DD+ F
Sbjct: 250 VSYIDFLHLSEERRRQLRKQYYFDCSCEHCQK-GLKDDL------------------FQA 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S E +K++ +K S G + EVV + + Q+
Sbjct: 291 VKEDPKP----------SPEVVKEMVQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 SVFADTNLYVLRLLSIVSEVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRAG 400
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 149/336 (44%), Gaps = 52/336 (15%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE-LRPLG 89
V H + ++K+L + L + Q + S+ I+ F + CN T+ + L+ +G
Sbjct: 136 VEHFGEEEQKELRVDVDTF-LQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVG 194
Query: 90 TGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLISYIE 136
G++P + ++NH C PN ++F G V+++ H + +G E+ +SYI+
Sbjct: 195 VGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYID 254
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
+ R++ LK+QY F CTC C K + KDD FL
Sbjct: 255 FLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDD---LFL------ 289
Query: 197 KGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKK 252
G+ + + +++ E+ SK TL S G + EVV + + Q+
Sbjct: 290 --------GVKDNPKPSQEVVKEMIRFSKDTLEKIDKARSEGLYHEVVKLCRECLQKQEP 341
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
++ ++ ++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 342 VFADTNIYTLRMLSIVSEVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGL 401
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 402 TNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 437
>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 765
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 58 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
WP E+A+ ++LA N HT+C+ ELRPLGT LYP ++ NHSC P+ V VF GR
Sbjct: 94 WP--GFKEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVFRGRTLQ 151
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
+RA++ + G EV + Y+E A + R++AL + Y F
Sbjct: 152 LRALRPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 146/308 (47%), Gaps = 30/308 (9%)
Query: 62 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
S E+A F+ + CNA + + ++ +G G+YP IS++NH C PN V + G VRA
Sbjct: 182 SDEEMAHLFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNNGPRLEVRA 241
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
++ + G E+ ISYI++ +T R++ LK QY F C C C K + ++++ + + E +
Sbjct: 242 LRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEELENLKHALVSEDIK 301
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
+ ++ + S D ++ + + +++ ++ + TLA C V+
Sbjct: 302 ---EESVKYINQFSKD---MLKRIHKTKQNQNWERMCNQ----TLGTLAQQDC-----VI 346
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
+ +++ L H V R++ E + A AY +L PV H
Sbjct: 347 ADTNVLKIAM--LNHAVEVQSFLRRQEPALEYAE----RVAAAYEKLLPPV--------H 392
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEAS 360
P LG+ G ++W + TE A+ ++ A I+ THG + KEL+ + + + EA
Sbjct: 393 PTLGMYLMRLGVIQWQIQKTEAAVATLGRAASIISKTHGDDHGMFKELLGLIHQCKMEAH 452
Query: 361 YKLSSKDE 368
++ E
Sbjct: 453 MSKGAQRE 460
>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
Length = 491
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 157/365 (43%), Gaps = 60/365 (16%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
++ L H+ DE++ L A + + + WP + + I+ F ++CN T+
Sbjct: 129 IDDLQNHVDSFDEEEKKELRADVESFLEF---WPPHSQQFGMQYISHIFGVISCNGFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + NH C PN ++F G VR++ H + G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + R+K LK+QY F CTC C K + KDD
Sbjct: 246 DELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK----------------QIKDD--- 286
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
+L + +G S E +K EV SK TL S G + EVV
Sbjct: 287 -LMLAVKEGEGKP--------SAETVK----EVIQFSKDTLEKISKARMEGLYHEVVKLC 333
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ K Q+ + ++ L++ ++L L+ ++EA Y + + Y ++Y + L
Sbjct: 334 RECLKKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQL 393
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
G+ G W G E + +A IL ITHG + P K+ L++ Q E ++
Sbjct: 394 GMAVMRAGVTHWHAGLIEAGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 451
Query: 364 SSKDE 368
++E
Sbjct: 452 FQQNE 456
>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 497
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 89/171 (52%), Gaps = 8/171 (4%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP- 59
+L++ L +K Q D D ++L +H+++ + + + I ++
Sbjct: 123 VLRILLLKKHQEDQ------GDRLQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSK 176
Query: 60 -EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
E+S+ E+ + F KL NA T+ +G + P+ S+ NHSC PNA + F+ +
Sbjct: 177 TELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWL 236
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
RA+QH+ +G ++ +SY+E + + RQ L ++Y F C CPRCI+ + D
Sbjct: 237 RALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEIGD 287
>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
Length = 497
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 89/171 (52%), Gaps = 8/171 (4%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP- 59
+L++ L +K Q D D ++L +H+++ + + + I ++
Sbjct: 123 VLRILLLKKHQEDQ------GDRMQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHEYSK 176
Query: 60 -EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
E+S+ E+ + F KL NA T+ +G + P+ S+ NHSC PNA + F+ +
Sbjct: 177 TELSLQEVVDCFGKLDINAFTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDFDKGKTWL 236
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
RA+QH+ +G ++ +SY+E + + RQ L ++Y F C CPRCI+ + D
Sbjct: 237 RALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRCIREKEVGD 287
>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
Length = 433
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 92/168 (54%), Gaps = 11/168 (6%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYAQIANLVNLILQWPE 60
++L R L+ T ++ L++ +H+ +D K + A IA L + ++
Sbjct: 106 VRLVARIILKQKVTTEQTPSERLLLLKDFESHLDKMDSIKDEMNQADIAALHHFYSRY-- 163
Query: 61 ISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 114
IN+I ++ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G
Sbjct: 164 --INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCNPNVIVTYKGT 221
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+A VRAVQ + G E+L SYI+ T R++ L + Y FTC C C
Sbjct: 222 VAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCSECF 269
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 143/315 (45%), Gaps = 35/315 (11%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLIL-QWPEISINEIAENFSKLACNAHTICNSE 84
V+ L H+ + +E+Q L + + L Q + S+ I+ F + CN T+ +
Sbjct: 129 VDDLQNHVENFGEEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQR 188
Query: 85 -LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ +
Sbjct: 189 GLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEE 248
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
R+K LK+QY F CTC C K G DD+ FL D K
Sbjct: 249 RKKQLKKQYYFDCTCEHCQK-GLKDDL------------------FLGVKDDPKP----- 284
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
S++ +K+I +K S G + EVV + + Q ++ ++ +++
Sbjct: 285 -----SQDVVKEIIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQDSVFADTNIYMLR 339
Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
+ ++L L+ ++EA Y + + Y ++Y + LG+ G W G+ E A
Sbjct: 340 VLSIVSEVLSYLQAFEEASYYARRMVDGYVKLYHPNNAQLGMAVMRAGLTNWHAGNIE-A 398
Query: 324 IKSMTEAVEILRITH 338
++ TE LR+ H
Sbjct: 399 MRMQTEME--LRLFH 411
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 55/313 (17%)
Query: 58 WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFE 112
WP + S+ I+ F + CN T+ + L+ +G G++P + ++NH C PN ++F
Sbjct: 158 WPPQSQQFSMQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLCLVNHDCWPNCTVIFN 217
Query: 113 -GRLAVVRAVQH------------VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
G V+++ H + +G E+ +SYI+ + R+K LK+QY F CTC
Sbjct: 218 NGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCE 277
Query: 160 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
C K G DD+ FL G+ + + + E
Sbjct: 278 HCEK-GIKDDL------------------FL--------------GVKDEPKPSQDVVKE 304
Query: 220 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
V SK L S G + EVV + + Q+ + ++ L++ ++L L
Sbjct: 305 VIQFSKDALEKIDKARSEGVYHEVVKLCRECLQKQEPVLADTNIYLLRILSVASEVLSYL 364
Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
+ ++EA Y + + Y ++Y + LG+ G W G E + +A IL
Sbjct: 365 QSFEEASDYAKKMVDGYMKLYHPNNAQLGMAVMRAGVTHWHAGLIEVGHGMICKAYAILL 424
Query: 336 ITHGTNSPFMKEL 348
+THG P K+L
Sbjct: 425 VTHGPTHPITKDL 437
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 130/293 (44%), Gaps = 32/293 (10%)
Query: 62 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + VR
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGKIEVRT 225
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ + +G E+ ISY++ + RQK LKEQ+ F C+C C + DD+ +A
Sbjct: 226 LGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS-DDLMMAAA----- 279
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKKTLALTSC----GN 235
DG D F L R S E+I+ EV+ SK +L G+
Sbjct: 280 ---DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKDSLEKIEKSRLDGD 319
Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
+ EV+ + + Q+ + ++ ++ ++L L + EA + + Y ++
Sbjct: 320 YSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARKMVEGYMKL 379
Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
Y + LLG+ G W G E + +A IL +THG N ++L
Sbjct: 380 YHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRDL 432
>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
Length = 489
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 114/260 (43%), Gaps = 38/260 (14%)
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
E+ + + N+ T+ P+G P+ + INH C+PNAV++F GR+ +RA++
Sbjct: 170 GEVRRLYCAVLVNSMTLVTETFDPIGIAFDPLTASINHDCIPNAVMLFNGRILQIRALED 229
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+P AEVLISYI+ RQ L E+Y FTCTC RC + ++G C
Sbjct: 230 IPANAEVLISYIDNTVPRERRQLELSEKYFFTCTCSRCTG--------PAGPMDGLLC-- 279
Query: 184 DGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
C G + + C +C ++V IL + + + ++S
Sbjct: 280 -ACGGVGM--PSREAVLCNRCA------RALPATADVEILEQTAWSALDSDKSEAMISAL 330
Query: 244 KMIEK-----LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV--- 295
+ + K L ++ + L+Q + +W+ AL + L +Y R+
Sbjct: 331 RHLHKSPMWPLPRQPLAALHLALVQGE------YIPSGNWERALLHHLL---IYLRLDPV 381
Query: 296 -YPQ-FHPLLGLQYYTCGKL 313
YPQ FHP+ + +T L
Sbjct: 382 FYPQRFHPMRVVHGFTLANL 401
>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
rubripes]
Length = 434
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 92/168 (54%), Gaps = 11/168 (6%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYAQIANLVNLILQWPE 60
++L R L+ T ++ L++ +H+ +D K + A IA L + ++
Sbjct: 106 VRLVARIILKQKVTTEQTPSERLLLLKDFESHLDKMDSIKDEMNQADIAALHHFYSRY-- 163
Query: 61 ISINEIAEN------FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR 114
IN+I ++ F+++ CN TI + EL LG+ ++P ++++NHSC PN ++ ++G
Sbjct: 164 --INDIPDDQSLTELFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCNPNVIVTYKGT 221
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+A VRAVQ + G E+L SYI+ T R++ L + Y FTC C C
Sbjct: 222 VAEVRAVQELNPGDEILNSYIDLLYPTEDRKERLLDSYFFTCLCSECF 269
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 21/99 (21%), Positives = 49/99 (49%)
Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
+V ++ + + + + ++DW+ A+ Y + I Y YP + + Y G L L
Sbjct: 336 NVYMLHMMYQAMGVCLYMQDWEGAMRYGEKIIHPYSFHYPAYSLNVASMYLKLGHLYLGL 395
Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQ 356
+K++ +A+ I+ + HG + ++ E+ ++EE +
Sbjct: 396 EKRTQGVKALKKALSIMEVAHGKDHYYVAEVKREIEEQK 434
>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 353
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 138/326 (42%), Gaps = 40/326 (12%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPV-ISIINHSCLPNAVLVFEGRLAVVRAV 121
+ + E F K++ N I ++EL P G G+Y I NHSCL N + +F+ + +R +
Sbjct: 19 LKNVIEKFIKVSKNTLQIIDNELEPCGLGIYKKPIPYFNHSCLSNCITIFKNQRLYIRTL 78
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP---------------------- 159
+ G E+ ISY++ A R +QY FTCTC
Sbjct: 79 MDIYPGEELTISYLDVAFDRNARLAICADQYFFTCTCKLCKVNIASECHNIFNNDFICTQ 138
Query: 160 --RCIKLGQFDDIQESAILE---GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK---- 210
C K + +I + LE Y K + +L+ S +K T +C L +++
Sbjct: 139 SENCKKFINYMEIVLMSELERKISYLNKSHFKTFPILKKSTEKNETVWKCMLCKNETNDS 198
Query: 211 ------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK--KLYHPFSVNLM 262
E+ K+ EV L S N + S K+ K+ YH +L
Sbjct: 199 VIKALIEKEKETIKEVEYLDTLFAEKYSYDNKNVLQSLTKIKSKIDDLTDFYHHSKYSLQ 258
Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
+ R K++ + ++L+D+K A + + + Y ++ P+ G + GKL FL
Sbjct: 259 KMRAKILYVSIQLQDFKLAYSIATQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKSA 318
Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +A + + T+G SP K+L
Sbjct: 319 GLSLIHKAKKNIIKTYGPESPIYKDL 344
>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
ND90Pr]
Length = 529
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 141/323 (43%), Gaps = 50/323 (15%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+A + ++ NA T+ L PLG + P + +NHSC PNA ++ +G L +R ++ +
Sbjct: 199 VAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLISIRTLRPIR 258
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
K E+ ISYI+T RQ L+ ++ FTC C +C Q+ A +
Sbjct: 259 KDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQ-------- 301
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
+D V SKEE++ +AS + + N + V+ K
Sbjct: 302 ---------EDNWLVPANSDFVASKEELEALASNQEQMFTLYEQVQGTPNAEAVIPVLKE 352
Query: 246 IEKLQKK-----LY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIP-VYQRVYP- 297
K ++ LY P++ R++LI L+ + ++EA A+ + ++YP
Sbjct: 353 SLKACRESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAHSAKRYKYILPKLYPV 408
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTNSPFM 345
FHP+ +Q + L +L TE + + + + +++ ++HG ++ F
Sbjct: 409 PFHPVRVVQTWQMAMLAAYLAGTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHGQDNAFT 468
Query: 346 KELILKLEEAQAEASYKLSSKDE 368
K ++ K E E + + D+
Sbjct: 469 KSVMEKTREMVEELKRSVGNPDK 491
>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
Length = 527
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/274 (25%), Positives = 120/274 (43%), Gaps = 34/274 (12%)
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
++++NHSC PN ++ + G LA VRAVQ + G EV SYI+ T R L++ Y FT
Sbjct: 270 VALMNHSCCPNVIVTYRGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 329
Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 211
C C C G Q+ A LE R D +RD +VR E
Sbjct: 330 CECRECSTKG-----QDQAKLE-IRKLSDPPRAEAVRD------------MVRYARGVIE 371
Query: 212 EIKKIASEVNILSKKTLALTSCGNH------------QEVVSTYKMIEKLQKKLYHPFSV 259
E ++ IL H E++ ++ ++ ++ +V
Sbjct: 372 EFRRAKHYKYILGCSVQGPMPRSPHCEGREAGPGSPPSELLEICELSQEQMSSVFEDSNV 431
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
++ + + + + L+DW+ AL Y Q I Y + YP + + + G+L L +
Sbjct: 432 YMLHMMYQAMGVCLYLQDWEGALRYGQKIIKPYSKHYPLYSLNVASMWLKLGRLYLGLEN 491
Query: 320 TENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 492 RAAGEKALKKALAIMEVAHGKDHPYISEIRREIE 525
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 154/345 (44%), Gaps = 57/345 (16%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
VE L H++D+ E ++ L I N ++ WP S I++I+ F + CN T+
Sbjct: 124 VEELEDHVADMQEDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVS 180
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVPKG 127
+ L+ +G GL+P + ++NH+C PN V++ G + V R++ + +G
Sbjct: 181 DQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEG 240
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ ++Y++ + R++ LK QY F C C C K G DD++ L G
Sbjct: 241 EELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYC-KNGTKDDLK----LAG--------- 286
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS----EVNILSKKTLALTSCGNHQEVVSTY 243
R+ D G+ S++++K+ ++ + K L G++ +VV
Sbjct: 287 ----REVD---------GVKPSEQQVKEATDYCFQKLETMDKARLD----GDYHQVVKIC 329
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ + + + + L++ L ++ L+ + +A Y + + Y ++Y + L
Sbjct: 330 RDVIDRTEPVLADTHIYLLRMWSTLSEVQAYLQYFNDAAEYSRKMVEGYMKLYHPNNAAL 389
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G+ G W G E + +A IL +THG P K+L
Sbjct: 390 GMAAMRAGVNHWQAGLIEVGHGMVCKAYAILLVTHGPTHPITKDL 434
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 2/125 (1%)
Query: 65 EIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
EI + F ++ CNA ++ N +G GLYP ++ NH C PN V+ F+GR VR V+
Sbjct: 173 EILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGREMQVRVVRD 232
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+ E+ +SY+E ST R++ LKE Y F C C RC K E L+G RC +
Sbjct: 233 IEVDEELTVSYVELLQSTKARRRELKESYFFDCECKRC-KAATNGQSNEDWYLDGLRCSN 291
Query: 184 DGCSG 188
C+
Sbjct: 292 KKCAS 296
>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 517
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 43/327 (13%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
+ I E F K++ N I ++EL P G G Y PV NHSCL N V +F+ + +R
Sbjct: 184 LRNIIEKFIKISKNTLQIIDNELEPAGLGFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ V G E+ ISY++ A TR +QY FTCTC C K+ + Q E
Sbjct: 243 LMDVYPGEELTISYVDIAFDKNTRLAICMDQYFFTCTCKLC-KVNVVSECQNVFNTEFVC 301
Query: 181 CKDDGCSGF------------------------------LLRDSDDKGFTCQQCGLVRSK 210
+ + C + L + ++ + C C
Sbjct: 302 TQSESCKKYVNHMELILISELERKSCYVNKSQFKNSYPVLKKANEQHVWKCSICKNEVHD 361
Query: 211 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNL 261
IK + + K+T L S N + +++ L K YH +L
Sbjct: 362 NVIKSLLEKEKETVKETAHLESLFNEKYTYDNKSVLQSLNKIKSKIDYLTTFYHHTRYSL 421
Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
+ R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 422 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 481
Query: 322 NAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +A + + T+G +SP K+L
Sbjct: 482 EGLSLIHKAKKNIIKTYGPDSPIYKDL 508
>gi|413933332|gb|AFW67883.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 190
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 42/73 (57%), Positives = 60/73 (82%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
M++L L+RKLQN+ VIPS++ DNY+LV+ L +H+S +DE QL+LYAQ+ANLV+LIL + E
Sbjct: 118 MVRLALKRKLQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLILPFIE 177
Query: 61 ISINEIAENFSKL 73
+ + EIA+ FSK
Sbjct: 178 LDLKEIAQTFSKF 190
>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 153/352 (43%), Gaps = 74/352 (21%)
Query: 27 VEALVAHMSDI---DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHT 79
V+ L H++D+ D KQL A++ + W + S++ I+ F + CN T
Sbjct: 129 VDQLQEHVADLAAEDFKQLR-----ADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGFT 183
Query: 80 ICNSE-LRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVV------------RAVQHVP 125
+ + L+ +G GL+P + ++NH C PN V++ G + V RA+ +P
Sbjct: 184 LSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVSSALHSQRRIELRALGKIP 243
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
+G E+ +SY++ + RQK LKE++ F CTC C K KDD
Sbjct: 244 EGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCSK----------------HIKDD- 286
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG----NHQEVVS 241
+ +D K S +++K EV SK++L ++ EV+
Sbjct: 287 ---LMTAVADSKP----------SADKVK----EVTAFSKESLEKIEKSRVERDYNEVLK 329
Query: 242 -TYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALA----YCQLTIPVYQRVY 296
Y+ ++K + L NL + R ++ I E+ ++ + A Y + Y+++Y
Sbjct: 330 LCYECLQKQENVLA---DTNLYKLR--VLSIASEVLSYQRSFAKAANYAHRMVEGYRKLY 384
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ LG+ G W G E + +A IL +THG N K+L
Sbjct: 385 HPNNAQLGMAIMRAGVTHWHAGQIEEGHSLICQAYRILMVTHGPNHAITKDL 436
>gi|312076846|ref|XP_003141043.1| MYND finger family protein [Loa loa]
gi|307763792|gb|EFO23026.1| MYND finger family protein [Loa loa]
Length = 439
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 167/351 (47%), Gaps = 32/351 (9%)
Query: 23 NYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISIN-EIAENFSKLACNAHT 79
N + +L +H DI DE++ + I +++ L + ++ N EI + F K+ NA
Sbjct: 107 NGRVFASLKSHTVDIEKDEEKRNGFIAIIHVIKDYLPFDKMPSNTEIFDIFCKILINALV 166
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIET 137
I +S L +G +Y +S ++HSC P+A ++F G A++R++ + I Y +
Sbjct: 167 ITDSCLNRIGLAVYLGLSALDHSCKPDAFIIFSGAKAILRSLNENITEYNDNLHIPYCDL 226
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
R +AL++Q+ F C C C FD ++ + + +C D C S D
Sbjct: 227 LDLRSARCEALQKQHNFVCNCDIC---QDFDLDRQKSSVRCTKCTDGFCPY-----SPDD 278
Query: 198 GFTCQQCGL-----VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
T ++C + V + + ++K+ ++ T + N E++ Y +E +
Sbjct: 279 DHTVKRCKVCHEISVFNSDHVQKLYQQL------TAPRPAEKNLNELIDLYHELE----E 328
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
++ P++V L + E ++ + E + EA+ Y + T+ Y+ YP+ HP ++ + K
Sbjct: 329 VFSPYNVPLCKLAESIMISALNNEKYDEAVEYAEKTLLCYRTYYPKGHPSPSVRMFEYAK 388
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPF---MKELILKLEEAQAEAS 360
L L ++ + +A++++ ++G+ S F L+ LE+ + AS
Sbjct: 389 L-LMLQHNRESLPVLRKALKMICESYGSESSFAFNAATLLSDLEKCVSTAS 438
>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
heterostrophus C5]
Length = 508
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/387 (21%), Positives = 164/387 (42%), Gaps = 54/387 (13%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI 61
++L RRK +IP ++ L+ L +H+ D ++ + P+
Sbjct: 118 MELLTRRK---HGLIPDK---DWELLCQLPSHIDDFKRNATYQNIEMMAMGAAQFSLPQN 171
Query: 62 SINE--IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
++ +A + ++ NA T+ L PLG + P + +NHSC PNA ++ +G L +R
Sbjct: 172 VFDKDFVAAMYGRVMSNALTLITPTLDPLGIIVDPTLCQMNHSCDPNAYIMMDGPLVSIR 231
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
++ + K E+ ISYI+T RQ L+ ++ FTC C +C Q+ A +
Sbjct: 232 TLRPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC---------QDKATHQED 282
Query: 180 RCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEV 239
G S F+ + Q + E+++ + + ++ AL +C
Sbjct: 283 NWLVPGNSDFVASKEQVQALASNQEQMFTLYEQVQGMPNAEAVIPVLKEALKACH----- 337
Query: 240 VSTYKMIEKLQKKLY-HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
E LY P++ R++LI L+ + ++EA A+ + Y+ + P+
Sbjct: 338 -------ESGNWPLYRQPYA----SIRDELIVNLLSVGKYQEAWAH---SAKRYKYILPK 383
Query: 299 -----FHPLLGLQYYTCGKLEWFLGDTENAIKS------------MTEAVEILRITHGTN 341
FHP+ +Q + L +L TE + + + + +++ ++H +
Sbjct: 384 LYSVPFHPVRVVQTWQMAMLAAYLASTEEGVGAPGVNMGLIAMMLVKQVLDVAPMSHSQD 443
Query: 342 SPFMKELILKLEEAQAEASYKLSSKDE 368
+ F K ++ K E E + + D+
Sbjct: 444 NAFTKSVVEKTREMVEELKRSVGNPDK 470
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 144/323 (44%), Gaps = 54/323 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++KQL Q+ V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VMHFGEEEQKQL----QVD--VDTFLQYWPPKSQQFSMQYISNIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K G DD+ L
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK-GLKDDL-------------------FL 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
R DD S++ +K++ N +K S G + EVV + + Q+
Sbjct: 290 RVKDDPK---------PSQDVVKEMIQFSNDTLEKIDKARSEGLYHEVVKLCRECLEKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 SVFADTNLYMLRMLSIVSEVLSYLQAFEEASYYARRMVDGYMKLYHPNNAQLGMAVMRAG 400
Query: 312 KLEWFLGDTENAIKSMTEAVEIL 334
W G+ E + +A IL
Sbjct: 401 LTNWHAGNIEVGHGMICKAYAIL 423
>gi|444714585|gb|ELW55465.1| N-lysine methyltransferase SMYD2 [Tupaia chinensis]
Length = 494
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 116/264 (43%), Gaps = 34/264 (12%)
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
++++NHSC PN ++ + G LA VRAV + G EV SYI+ T R L++ Y FT
Sbjct: 257 VALMNHSCCPNVIVTYRGTLAEVRAVHEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFT 316
Query: 156 CTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK----E 211
C C C KD + +R +D +VR E
Sbjct: 317 CECHECTT------------------KDKDKAKVEIRKLNDPPKAEAIRDMVRYARNVIE 358
Query: 212 EIKKIASEVNIL--SKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLI 269
E ++ IL S + L C QE +S+ ++ +V ++ + +
Sbjct: 359 EFRRAKHYKYILYNSPPSELLEICELSQEKMSS----------VFEDSNVYMLHMMYQAM 408
Query: 270 KILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTE 329
+ + ++DW+ AL Y Q I Y + YP + + + G+L L + +++ +
Sbjct: 409 GVCLYMQDWEGALRYGQRIIKPYSKHYPLYSLNVASMWLKLGRLYMGLENKAAGERALKK 468
Query: 330 AVEILRITHGTNSPFMKELILKLE 353
A+ I+ + HG + P++ E+ ++E
Sbjct: 469 AIAIMEVAHGKDHPYISEIKQEIE 492
>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
Length = 456
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 126/291 (43%), Gaps = 31/291 (10%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQH 123
+ + + CN +I + L P G G+YP +I+NHSC PN V+ + G L V+A++
Sbjct: 183 VMDVLQRADCNNFSIWDELLLPRGAGVYPWGAILNHSCEPNCVMTYRGPLHAQAVKALRD 242
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK--LGQFDDIQESAILEGYRC 181
+ G E+ SYI+ T R L +QY F C C + L + D + E E
Sbjct: 243 IAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDCALYLDGALAELDALPEVTAAEMR-- 300
Query: 182 KDDGCSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
G +L + + + Q+ L R+ E +K A +E+V
Sbjct: 301 ----LPGKMLHELECEAEERQRLLSLYRAAETLKDYAWST--------------PEEELV 342
Query: 241 STYKMIEKLQKKLYHPFSVNLMQTREKLIKILME---LEDWKEALAYCQLTIPVYQRVYP 297
+ L ++L HP +++L +L + E LED AL Q Y VYP
Sbjct: 343 CLLQGYNIL-RRLAHPANISLTSIMTRLQNVATECGRLEDI--ALPVGQHLALAYDHVYP 399
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ +PL GLQYY G + A+ A ++LR TH + + +L
Sbjct: 400 EHNPLCGLQYYRLGDVANLAQKQALALLWHRRAYQVLRTTHDADHWLLTQL 450
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 6/123 (4%)
Query: 65 EIAENFSKLACNAHTICNSEL-RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
EI + F +++CN +I N P+G GL+ S+ NH C PN V+ F+G+ V ++
Sbjct: 172 EILKLFGQVSCNGFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKD 231
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
V +G E+ ISY+E ST RQK LK+ Y F C C RC + + L+G +C +
Sbjct: 232 VKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTT-----ETTDDWYLDGLQCGN 286
Query: 184 DGC 186
GC
Sbjct: 287 KGC 289
>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
Length = 229
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/255 (21%), Positives = 110/255 (43%), Gaps = 28/255 (10%)
Query: 99 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
+NHSC PN ++ ++G LA VRAVQ + G EV SYI+ T R L++ Y FTC C
Sbjct: 1 MNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 60
Query: 159 PRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIAS 218
C +D D ++ E I+ +
Sbjct: 61 RECT----------------------------TKDKDKAKVEVRKLSSPPQAEAIRDMVR 92
Query: 219 EVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
+ ++ + E++ ++ ++ ++ +V ++ + + + + ++DW
Sbjct: 93 YARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFEDSNVYMLHMMYQAMGVCLYMQDW 152
Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
+ AL Y Q I Y + YP + + + G+L L + K++ +A+ I+ + H
Sbjct: 153 EGALKYGQKIIKPYSKHYPVYSLNVASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAH 212
Query: 339 GTNSPFMKELILKLE 353
G + P++ E+ ++E
Sbjct: 213 GKDHPYISEIKQEIE 227
>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 365
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 72/126 (57%), Gaps = 3/126 (2%)
Query: 23 NYSLVEALVAHMSDIDEKQLLLYAQIANLV--NLILQWP-EISINEIAENFSKLACNAHT 79
N+ L+ L H+ ++ + +A L+ +L P +I+ EI F ACN T
Sbjct: 59 NFDLIRYLTTHIDAFPSEKKEHFGVMAALIKKSLPATLPTDITPQEIMHLFCLFACNNFT 118
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
+ + EL+PLG G+YP ++INHSC PN V++FEGR VR+++ + K E+ +Y+E
Sbjct: 119 VSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQCTVRSLRDITKDEEITFNYVEVGD 178
Query: 140 STMTRQ 145
T TR+
Sbjct: 179 PTETRR 184
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 56/85 (65%)
Query: 278 WKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRIT 337
W+EAL YC ++ V++R+YP PL+GLQY GKL ++L TE A++SM A+ IL IT
Sbjct: 269 WREALLYCSASLAVFERIYPPRWPLVGLQYLIHGKLSFYLKYTEAALESMQRALPILTIT 328
Query: 338 HGTNSPFMKELILKLEEAQAEASYK 362
H + P ++ L L EA AE +Y+
Sbjct: 329 HSSGHPLVRTLHTMLAEATAEWNYE 353
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 157/365 (43%), Gaps = 61/365 (16%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
+E L H+ +E++ L A + + + WP + + I+ F ++CN T+
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + NH C PN +VF G VR++ H + G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + R++ LK+QY F CTC C K + KDD
Sbjct: 246 DELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDD--- 286
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC----GNHQEVVSTY 243
+L D S + +K EV LSK TL + G++ EVV
Sbjct: 287 -LMLAVKDSP---------TPSADTVK----EVIQLSKDTLEKINKARMEGHYHEVVKLC 332
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLL 303
+ + Q+ + ++ L++ ++L L+ ++EA Y + + Y ++Y + L
Sbjct: 333 RDCLQKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQL 392
Query: 304 GLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKL 363
G+ G W G E + +A IL ITHG + P K+ L++ Q E ++
Sbjct: 393 GMAVMRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRM 450
Query: 364 SSKDE 368
++E
Sbjct: 451 FQQNE 455
>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 666
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 96/195 (49%), Gaps = 15/195 (7%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N + + +S+ +G GLYP +++ NHSC P V F G +VRA + + KG V +Y
Sbjct: 449 NKNILDDSKSLFVGGGLYPTLALFNHSCDPGIVRYFIGTQVIVRAAKPIKKGEIVAENYG 508
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKL-GQFDDIQESAILEGYRC-------KDDG 185
I + + RQ+ LK QY FTCTC C++L FD++ I +RC K +
Sbjct: 509 PIYSQMKKLERQEKLKSQYWFTCTCTPCLELWPTFDELDTKTI--RFRCSGYGTDEKREN 566
Query: 186 CSGFLL--RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
C L+ ++DD C QC + + K + +IL K+ + GN Q + Y
Sbjct: 567 CKNILIVPVNTDDFMIKCSQCNQHTNLFKGLKALQDTDILYKQAIKYAQTGNFQNALDIY 626
Query: 244 -KMIEKLQKKLYHPF 257
++++ L L PF
Sbjct: 627 LEILKTLHGILAPPF 641
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 141/337 (41%), Gaps = 85/337 (25%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ LK+QY F CTC C K + KDD G +
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK----------------KLKDDLFLG--V 291
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
+D+ C++C K+E + ++ NI + L++ S
Sbjct: 292 KDNPKVVKLCREC---LQKQE--PVFADTNIYMLRMLSIVS------------------- 327
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 328 ------------------EVLSYLQAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAG 369
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G+ E + +A IL +THG + P K+L
Sbjct: 370 LTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 406
>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 489
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 132/306 (43%), Gaps = 45/306 (14%)
Query: 62 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 119
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + G + V
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225
Query: 120 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
+ H P+ G E+ ISY++ + RQK LKEQ+ F C+C C +
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQHIS- 284
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR-SKEEIKKIASEVNILSKK 226
DD+ +A DG D F L R S E+I+ EV+ SK
Sbjct: 285 DDLMMAAA--------DG--------QDSPWFI-----LCRPSAEQIQ----EVSAFSKD 319
Query: 227 TLALTSC----GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEAL 282
+L G++ EV+ + + Q+ + ++ ++ ++L L + EA
Sbjct: 320 SLEKIEKSRLDGDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAA 379
Query: 283 AYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
+ + Y ++Y + LLG+ G W G E + +A IL +THG N
Sbjct: 380 DCARKMVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNH 439
Query: 343 PFMKEL 348
++L
Sbjct: 440 SVTRDL 445
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 148/336 (44%), Gaps = 52/336 (15%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTICNSE-L 85
V H + ++K+L L +L + + WP + ++ I+ + CN T+ + L
Sbjct: 131 VEHFGEEEKKELRL-----DLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVLI 132
+ +G G++P + ++NH C PN ++F G VR++ H + G E+ +
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALGKISIGEELTV 245
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
SY++ R++ LK+QY F CTC C K G DD+ A+ E ++ S +++
Sbjct: 246 SYVDFLNVCEDRRQQLKKQYYFDCTCEHCQK-GIKDDLM-LAVKE-----EEKPSPEVVK 298
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
D + SKE I+KI S G ++EVV + + Q+
Sbjct: 299 DV-----------IQFSKEAIEKIDK-----------ARSEGLYKEVVKLCRACLEKQEP 336
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
+ ++ +++ L ++L L+ EA Y + Y ++Y + LG+ G
Sbjct: 337 VLGDTNIYMLRILSILSEVLSYLQHVDEAAGYSKRMADGYLKLYHPNNAQLGMALMRAGV 396
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
W G E + +A IL +THG P K+L
Sbjct: 397 THWHAGLIEAGHGMICKAYAILLVTHGPTHPITKDL 432
>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 770
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 114/247 (46%), Gaps = 24/247 (9%)
Query: 69 NFSKLACNAHTICNSELRP--------------LGTGLYPVISIINHSCLPNAVLVFE-G 113
+ +L CNAH I + E + + T +YP S++NHSC PN F G
Sbjct: 455 HIQQLVCNAHAITSLESKTSQEDDVVMTTEQVRIATAIYPSASLMNHSCDPNIFSSFRCG 514
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VVRA++ + +G EVL Y S RQ+ L+EQY F C+C C +D +
Sbjct: 515 STLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQLLQEQYFFVCSCTAC---SSGEDAE 571
Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+ L+ +C + C G L + D TC CG + ++ A ++ L + + L
Sbjct: 572 QR--LQALKC--EYCEGPLKQPDDSGKATCLDCGTTQECLGSEQKAFRMHDLYVQGVQLA 627
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
GNH E + K ++K+ + ++ L++ ++ + K L L D++ A + +
Sbjct: 628 EKGNHVEALQRLNKCLKTREKVMYKHNLKLLEVKDMVAKCLCSLGDFRSASGVLKPAVDA 687
Query: 292 YQRVYPQ 298
+ +Y +
Sbjct: 688 VRVIYGE 694
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 146/307 (47%), Gaps = 34/307 (11%)
Query: 71 SKLACNAHTICNS--ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPK 126
+K A NA TI +S E +G GLY ++ +NHSC PN + VF +L VR A++ + +
Sbjct: 133 AKFATNALTIQDSLFETDGIGAGLYEEVNYMNHSCTPNVICVF-NKLPQVRVIAIRDIEQ 191
Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
G E++ SYI+T R++ LK+ Y F C C RCIK +++ + RC+ C
Sbjct: 192 GEEIMNSYIDTKKDLDFRRRFLKQNYFFLCECKRCIK-------EQNEGVSFVRCQK--C 242
Query: 187 SGFLLRDSDDKGFTCQQCGL--VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
+ + K C C +++E ++I +++ + K+ TS +++ + +
Sbjct: 243 MKGRI---NSKTLNCSDCNTQGQMTEQEYQRIITQMEEIQKQ----TSQSTYEKTIELTE 295
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----- 299
EKL L P+ ++ +R I + ++ +K+A Y + I + + Y +
Sbjct: 296 TAEKLC-HLEAPYFNDIYNSR---ILATLGVQQYKKANKYLKNFIELSKHWYEEIIGNTL 351
Query: 300 -HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS-PFMKELILKLEEAQA 357
P G + KL + L A K A++IL + S P + EL L+L + ++
Sbjct: 352 EMPFFGWKLNELSKLNFHLDKCLEAEKYSERALKILTQYYTKESCPELNELFLRLNDIKS 411
Query: 358 EASYKLS 364
E K +
Sbjct: 412 EIKMKYN 418
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/361 (23%), Positives = 158/361 (43%), Gaps = 52/361 (14%)
Query: 27 VEALVAHMSDI-DEKQLLLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
+E L H+ +E++ L A + + + WP + + I+ F ++CN T+
Sbjct: 129 IEELQNHVDSFGEEEKKDLRADVESFLEF---WPPQCQQFGMQLISHIFGVISCNGFTLS 185
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKG 127
+ L+ +G G++P + NH C PN +VF G VR++ H + G
Sbjct: 186 DQRGLQAVGVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPG 245
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ + R++ LK+QY F CTC C K + KDD
Sbjct: 246 DELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK----------------QLKDDLML 289
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 247
+S T ++ + SK+ ++KI +K + G++ EVV +
Sbjct: 290 AVKAGESKPSADTVKEV-IQLSKDTLEKI-------NKARME----GHYHEVVKLCRDCL 337
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ + ++ L++ ++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 338 QKQEPVLGDTNIYLLRILSIASEVLSYLQMFEEAADYAKRMVDGYLKIYHPNNAQLGMAV 397
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
G W G E + +A IL ITHG + P K+ L++ Q E ++ ++
Sbjct: 398 MRAGVTHWHAGLIEVGHGLICKAYAILLITHGPSHPITKD--LEVMRVQTEMELRMFQQN 455
Query: 368 E 368
E
Sbjct: 456 E 456
>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
Length = 509
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 84/326 (25%), Positives = 137/326 (42%), Gaps = 42/326 (12%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
+ I E F K++ N+ I ++EL P G +Y PV NHSCL N V VF + +R
Sbjct: 177 LKNIIEKFIKISKNSLQIIDNELEPAGLAIYKKPV-PFFNHSCLSNCVTVFRNQKLYIRT 235
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP--------------------- 159
+ V G E+ ISYI+ A TR +QY FTC+C
Sbjct: 236 LMDVYPGEELTISYIDIAFDRCTRLSICMDQYFFTCSCKLCKVNIASECHNIFNTEIVCT 295
Query: 160 ---RCIKLGQFDDIQESAILEGYRCKDDGCSGF----LLRDSD-DKGFTCQQCGLVRSKE 211
C K + +I + LE +C + S F +LR S+ D + C C
Sbjct: 296 NSENCKKFLGYMEIVLISELERKQCYINK-SSFKTYPVLRKSNIDNVWRCMLCKAEVGDN 354
Query: 212 EIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK---------KLYHPFSVNLM 262
IK + + K+T+ L + N + +++ L K YH +L
Sbjct: 355 IIKGLVDKEKETFKETVYLETLFNEKYTYDNKSVLQSLNKIKSKIDYLTNYYHHAKYSLQ 414
Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
+ R K++ I ++L ++K A + + Y ++ P+ G + GKL FL
Sbjct: 415 KMRAKILYISIQLHEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKNE 474
Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +A + + T+G +S K+L
Sbjct: 475 GLGLIHKAKKNIIKTYGPDSLIYKDL 500
>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
CCMP2712]
Length = 98
Score = 87.8 bits (216), Expect = 8e-15, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 58/96 (60%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
IA SK N H I + + + G GLYP+ S INHSC PNA++ F+G VVRA++++P
Sbjct: 3 IASMISKAQRNMHGIVDLKGQNFGHGLYPLASFINHSCEPNAIISFDGNKLVVRALENIP 62
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+G E+ I+Y+E R+ AL + F C C RC
Sbjct: 63 RGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98
>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
africana]
Length = 800
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 130/318 (40%), Gaps = 75/318 (23%)
Query: 59 PEISINEIA--ENFSKLACNAHTIC--------------NSELRPLGTGLYPVISIINHS 102
PE++I + + +L CNA I + E+R L TG++PV+S++NHS
Sbjct: 481 PELNIWGVVMLRHMLQLQCNAQAITTIQHTGSKESIITESREIR-LATGIFPVVSLLNHS 539
Query: 103 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPR 160
C PN + F +A +RA Q + KG E+L Y E+ + RQ+ L+ QY F CTCP
Sbjct: 540 CRPNTSVSFTSTVATIRASQQIAKGQEILHCYGPHESRMAVAERQQKLRSQYFFECTCPA 599
Query: 161 CIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
C K E G R F C C V E++
Sbjct: 600 CHK--------EPRATAGPRW---------------AAFCCHSCRAVMQGEDV------- 629
Query: 221 NILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
SCGN E VS ++ +LQ +L Q K+L +LE
Sbjct: 630 ----------LSCGNRSCTESVSRSHLVSRLQ---------DLQQQVGMAQKLLRSGKLE 670
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
+ L CQ + P H ++G + LGD E + + ++++++ +
Sbjct: 671 QAVQQLLGCQHAAESF---LPAEHTVVGEIDDHLAQAYAALGDWEKSATHLQKSLQVVEV 727
Query: 337 THGTNSPFMKELILKLEE 354
HG +S M + KL +
Sbjct: 728 RHGPSSIEMGRELFKLAQ 745
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 142/318 (44%), Gaps = 55/318 (17%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K G DD+ FL
Sbjct: 250 VSYIDFLNLSEERRKQLKKQYYFDCTCEHCQK-GLKDDL------------------FLG 290
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
D K S+E +K++ +K S G + EVV + + Q+
Sbjct: 291 VKDDPKP----------SQEAVKEMTQFSKDTLEKIEKARSEGLYHEVVKLCRECLQKQE 340
Query: 252 KLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
++ ++ +++ + ++L L+ ++EA Y + + Y ++Y + LG+ G
Sbjct: 341 PVFADTNLYVLRMLSIVSEVLSYLQAFEEASHYARRMVDGYMKLYHPNNAQLGMAVMRAG 400
Query: 312 KLEWFLGDTENAIKSMTE 329
W G+ E A++ TE
Sbjct: 401 LTNWHAGNIE-AMRVQTE 417
>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 404
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 87/193 (45%), Gaps = 12/193 (6%)
Query: 25 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNAHTI 80
+L AL + + +L Y Q+A +V + P + ++ +L CNA +
Sbjct: 192 ALTFALEPNTHALSPTRLAQYEQMAEVVAASISLPRGARKVTKDDVVRLLCIQQCNAFGL 251
Query: 81 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIE 136
N E G LYP +S+ NHSC+PN V +G R+ ++ + VP G E+ ISYI+
Sbjct: 252 SNGEGEMTGVALYPALSLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYID 311
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
+ RQ L+E Y F CTC RC D +A L C C G L+
Sbjct: 312 LDLTRELRQDKLEESYAFRCTCARCRAPDADDSPAVAAHLRRITCGS--CGGSLIPAPPG 369
Query: 197 KGFT--CQQCGLV 207
+ T C C LV
Sbjct: 370 QELTVACNSCPLV 382
>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
Length = 648
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 116/260 (44%), Gaps = 30/260 (11%)
Query: 14 NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-------- 65
N ++D+Y V LV H D + L++ +A ++N IL+ + E
Sbjct: 351 NFTDKLSSDDYRKVFNLVTHSDKRDPEDYLVWTLMAAMLNTILRHGNYTNTEQPDDGFIG 410
Query: 66 --IAENFSKLACNAHTICN-SELRP--------LGTGLYPVISIINHSCLPNAVLVFEGR 114
+ N + +AH + RP +G LYP++++ NHSC P V F G
Sbjct: 411 YLLLHNLQIVNYSAHDVAELQRKRPNEAGKSVAIGAALYPLLALFNHSCDPGIVRYFSGT 470
Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDIQ 171
VR ++++ G + +Y + T R+++L Y F C C C + Q + +
Sbjct: 471 TVHVRTIKNIAAGQIIAENYGPLYTKMERTERRQSLAANYKFQCNCDACRQDWQLYTHMD 530
Query: 172 ESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCG-LVRSKEEIKKIASEVNILSKKTL 228
ES I +RC C G LL D S+ G C CG LV +E I ++ E N+LS+
Sbjct: 531 ESVIR--FRCTGPACQGALLFDLTSECYGMQCDDCGELVDIRERI-RLLQEANMLSRFNE 587
Query: 229 A--LTSCGNHQEVVSTYKMI 246
A L + G + +S Y I
Sbjct: 588 AAHLYAIGLFEHALSKYAAI 607
>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 478
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 127/301 (42%), Gaps = 46/301 (15%)
Query: 62 SINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVR 119
S+ I+ F + CN T+ + L+ +G GL+P ++++NH C PN + G + V
Sbjct: 166 SVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNCTAILNHGNHSAVS 225
Query: 120 AVQHVPK------------GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
+ H P+ G E+ ISY++ + RQK LKEQ+ F C+C C +
Sbjct: 226 SALHSPRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCSQ---- 281
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
DD ++ +D + Q+ SK+ ++KI K
Sbjct: 282 ------------HISDD----LMMAAADGQDSPIQEVSAF-SKDSLEKI-------EKSR 317
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
L G++ EV+ + + Q+ + ++ ++ ++L L + EA +
Sbjct: 318 LD----GDYSEVLRLCRECLQKQRGVLADTHLHRLRVLSIAAEVLSSLRQFSEAADCARK 373
Query: 288 TIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
+ Y ++Y + LLG+ G W G E + +A IL +THG N ++
Sbjct: 374 MVEGYMKLYHPNNALLGMAIMRAGVTLWHAGQIEEGHSLICKAYSILMVTHGPNHSVTRD 433
Query: 348 L 348
L
Sbjct: 434 L 434
>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ER-3]
Length = 519
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 60/104 (57%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+I ++ +A F+KL NA T+ N +G P S INHSC PNA + F+G + ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A+Q + ++ ISYI+ TRQ L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284
>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 477
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 138/323 (42%), Gaps = 25/323 (7%)
Query: 30 LVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRP 87
L+ H DI D+ + + +I N +++ + I ++ F K + N+H+I S
Sbjct: 123 LLGHEEDIKADQAKYEHFEKIFNKMSIFRKDEMIDKDKFYIIFCKTSINSHSIHTSAGTE 182
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE---VLISYIETAGSTMTR 144
+G L +S +HSC PN LVF+G A +R + ++ ISYI+ S R
Sbjct: 183 VGMALDLGVSAYDHSCRPNCSLVFDGYQACLRPLTPQTNASDPRTSFISYIDIGRSRYQR 242
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKGFTCQ 202
+K LK ++ F C C RC IL RC D C L+ D++ TC
Sbjct: 243 RKELKAKWYFDCACERCCDPA-------DNILTSIRCSDASCDEPLITAEDAEATAITCP 295
Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
+CG V +E ++ + L K ++ + QE++ + QK +Y + +
Sbjct: 296 KCGQVADEEYVRAAQQMMLRLPAKFSPESNPDSLQELLDEASNVLH-QKNIY----ITRL 350
Query: 263 QTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
QT ++ I L+D L + Q + Y+ +P +G Q K G +
Sbjct: 351 QT--AIMHITGTLKD---NLPFIQHQVYENYKMCFPYADRHIGYQLLQIVKAYIEKGQRK 405
Query: 322 NAIKSMTEAVEILRITHGTNSPF 344
A+ EA+ I I G P+
Sbjct: 406 EAVSYAFEAMNIFEICFGLQHPY 428
>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ATCC 18188]
Length = 519
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 60/104 (57%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+I ++ +A F+KL NA T+ N +G P S INHSC PNA + F+G + ++
Sbjct: 181 DIDLSTLAAYFAKLETNAFTLTNQYFDRIGLCFLPFASYINHSCQPNAYIGFDGPVMYLK 240
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A+Q + ++ ISYI+ TRQ L+ QY F C CP+C++
Sbjct: 241 ALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFFECKCPKCLE 284
>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
Length = 518
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 85/165 (51%), Gaps = 11/165 (6%)
Query: 6 LRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQW 58
LR Q DN + D ++++ L H S +DE+ ++ + A+ ++
Sbjct: 123 LRIISQPDNKVSKEIYDTHHAVFRTLGHHFSKMDERGGEQAHRITVSAEALKAISNT--- 179
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
E+ ++ + F+KL NA T+ N +G L P + INHSC PNA + F+G++ +
Sbjct: 180 -EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYL 238
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+A+Q + E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 239 KALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283
>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
Length = 517
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 128/283 (45%), Gaps = 54/283 (19%)
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN I ++++ +GTGLY +++NHSC PN +++F G+ + + + E+ I Y
Sbjct: 243 CNGFGIADNQMLRIGTGLYYPSNLLNHSCDPNCMVLFRGQTQFIVTCRPIEADEEITICY 302
Query: 135 IETAGS-TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
I+ S + RQ+ L+EQY F C C RC+K I EG K+
Sbjct: 303 IDNGISERIIRQQYLQEQYHFNCMCARCLK----------QIGEGTELKE---------- 342
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
K +I + + E IL+K+ L S GN + +S + +L K +
Sbjct: 343 ---------------QKVQI-QFSEEQQILNKQALHEESLGNFDKSLS---QLSQLYKSV 383
Query: 254 YHPFSVNLMQTR------EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF-------- 299
S + Q E+L+ + + +D+K A+ + Q + + + +Y +
Sbjct: 384 ETNKSTQIFQNSMIKELLERLVYLSVMNKDYKFAVKFQQKLVDLCEIMYIKEVKREDELP 443
Query: 300 HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
HPL+ L YY GKL + AI ++ +A+ ++ +G S
Sbjct: 444 HPLIALHYYQLGKLLSQRKKYQEAIDNLQKAMTLVSAYYGLKS 486
>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
Length = 511
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 135/327 (41%), Gaps = 42/327 (12%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + E F K++ N I ++EL P G G Y PV NHSCL N + +F+ + +R
Sbjct: 177 LKNVIEKFIKVSKNTLQIIDNELEPCGLGFYKKPV-PYFNHSCLSNCITIFKNQKLYIRT 235
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI------------------ 162
+ + G E+ ISY++ A TR +QY FTCTC C
Sbjct: 236 LMDIYPGEELTISYLDIAFDRNTRLAICTDQYFFTCTCKLCKVNIPSECHNMFNNDFICT 295
Query: 163 ---KLGQFDDIQESAILE------GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSK--- 210
+F + E ++ Y K + +L+ S DK +C L + +
Sbjct: 296 RSENCKKFVNYMEMVLISELERKLNYHNKLHFKAFPILKKSTDKNENIWKCMLCKHETND 355
Query: 211 -------EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHPFSVNL 261
E+ K+ EV L S N + S K+ K+ YH +L
Sbjct: 356 SVIKGVIEKEKETVKEVEYLETLFAEKYSYDNKIILQSLNKIKSKIDYLTTFYHHSRYSL 415
Query: 262 MQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTE 321
+ R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 416 QKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALFLDLKS 475
Query: 322 NAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +A + + T+G SP K+L
Sbjct: 476 AGLNLIHKAKKNIIKTYGPESPIYKDL 502
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 87/338 (25%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R++ L++QY F C+C C K G DD+ F
Sbjct: 250 VSYIDFLHLSEERRRQLRKQYYFDCSCEHCQK-GLKDDL------------------FQA 290
Query: 192 RDSDDKGFT-CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ 250
D K C++C E+ + + ++ N+ + L++ S
Sbjct: 291 VKEDPKVVKLCREC-----LEKQESVFADTNLYVLRLLSIVS------------------ 327
Query: 251 KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTC 310
++L L+ ++EA Y + + Y ++Y + LG+
Sbjct: 328 -------------------EVLSYLQAFEEAAHYARRMVDGYMKLYHHNNAQLGMAVMRA 368
Query: 311 GKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 369 GLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 406
>gi|389584498|dbj|GAB67230.1| hypothetical protein PCYB_112510 [Plasmodium cynomolgi strain B]
Length = 501
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 141/332 (42%), Gaps = 52/332 (15%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
+ I E F K++ N+ I ++EL P G Y PV NHSCL N V +F+ + +R
Sbjct: 167 LKNIIEKFVKISKNSLQIIDNELEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 225
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ V G E+ ISYI+ A TR +QY FTC+C C K+ + Q E
Sbjct: 226 LVDVYPGEELTISYIDIAFDKNTRLAICMDQYFFTCSCKLC-KVNVVSECQNIFNTEFVC 284
Query: 181 CKDDGC------------------------SGF-----LLRDSDDKG--FTCQQCG---- 205
+ C S F +L+ S+++ + C C
Sbjct: 285 THSESCKKCVNNMELILISELERKNCYMNKSQFKNSYPILKKSNEQQHMWKCALCKNEVH 344
Query: 206 -------LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--KKLYHP 256
+ + KE +K+ A ++ S+K S N + S K+ K+ YH
Sbjct: 345 DNVIKSLMEKEKETVKETAHLESLFSEKY----SYDNKSVLQSLNKIKSKIDYLTTFYHH 400
Query: 257 FSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+L + R K++ I ++L+++K A + + Y ++ P+ G + GKL F
Sbjct: 401 TRYSLQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVSYGKYSPIYGYYIFLTGKLALF 460
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
L + + +A + + T+G +SP K+L
Sbjct: 461 LDLKSEGLSLIHKAKKNIIKTYGPDSPIYKDL 492
>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
AFUA_2G10080) [Aspergillus nidulans FGSC A4]
Length = 497
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 88/164 (53%), Gaps = 8/164 (4%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPE 60
+L++ LR + + + + T + L + L H+ DI + +IA + ++ +
Sbjct: 125 LLRMVLRTEARKN----AYTEEELVLFQTLETHIDDILNRNAPQAERIALTSRAVKEYSK 180
Query: 61 ISINE--IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+ E I ++L N+ + N + +G L+P ++INHSC NAV+ F+G V
Sbjct: 181 TDMEEEKIVAYHARLDLNSFNLTNDD--DIGIYLHPYAALINHSCDYNAVVGFDGSEIFV 238
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+A++ + G ++ ISYI+T T RQK L+E+Y FTC C +C+
Sbjct: 239 KAIRPIATGEQIFISYIDTTYPTRIRQKELQERYFFTCNCAKCL 282
>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
Length = 518
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 85/165 (51%), Gaps = 11/165 (6%)
Query: 6 LRRKLQNDNVIPSTTTD-NYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQW 58
LR Q DN + D ++++ L H S +DE+ ++ + A+ ++
Sbjct: 123 LRIISQPDNKVLKEIYDTHHAVFRTLGHHFSRMDERGGEQAHRITVSAEALKAISNT--- 179
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
E+ ++ + F+KL NA T+ N +G L P + INHSC PNA + F+G++ +
Sbjct: 180 -EVELSTLVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYL 238
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+A+Q + E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 239 KALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 283
>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
Length = 479
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 82/158 (51%), Gaps = 10/158 (6%)
Query: 12 NDNVIPSTTTDNYSLVEALVAHMSDIDEK------QLLLYAQIANLVNLILQWPEISINE 65
++ V+ ++++ L H S +DE+ ++ + A+ ++ E+ ++
Sbjct: 91 DNKVLKEIYDTHHAVFRTLGHHFSRMDERGGEQAHRITVSAEALKAISNT----EVELST 146
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+ F+KL NA T+ N +G L P + INHSC PNA + F+G++ ++A+Q +
Sbjct: 147 LVVYFAKLETNAFTLTNQYFDRIGLCLLPFAAYINHSCEPNAYIGFDGQVMYLKALQDIA 206
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
E+ ISY + TRQ L+ +Y F C CP+C+K
Sbjct: 207 PDEEIFISYTDNTEPLKTRQTELQLRYFFECKCPKCLK 244
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 72/141 (51%), Gaps = 9/141 (6%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
++ L H+SD+ E L L I N ++ WP S +++I+ F + CN T+
Sbjct: 131 LDDLENHISDMPEDDLKELKVDIHNFLDY---WPRNSKQHRVDDISHIFGVINCNGFTVS 187
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ L +G GL+P + ++NH C PN ++ +RA+ + G EV +SY++
Sbjct: 188 DQRGLNAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIEPGEEVTVSYVDFLNV 247
Query: 141 TMTRQKALKEQYLFTCTCPRC 161
T RQ+ LK Y F CTC C
Sbjct: 248 TEDRQRQLKMHYFFDCTCEHC 268
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLK 301
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCARC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 82/143 (57%), Gaps = 5/143 (3%)
Query: 24 YSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEI-SINEIAENFSKLACNAHTI 80
Y + L++H +DI D K++ + + ++ ++ + + EI + ++ N++ I
Sbjct: 106 YRKFKDLMSHYTDIKNDPKRIEHFVSLCQVLEDFMEGTTLPNSAEILGLYGRICVNSYNI 165
Query: 81 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK--GAEVLISYIETA 138
+ ++ +G G+Y S+I+HSC PNAV VFEG ++RA++ +P+ +++ ISYI+
Sbjct: 166 LDPDMNSIGVGIYLGPSVIDHSCKPNAVAVFEGTTILIRALEDIPRLDWSQIHISYIDVL 225
Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
+T TR L+ Y F C C RC
Sbjct: 226 NTTSTRCTELQNTYYFLCECERC 248
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
Y + HPL G+ Y GK++ L ++A+ +T+A +ILRITHG KE
Sbjct: 325 YGEIHPLTGILYLMLGKIQLHLDKPKSALDMLTKADKILRITHGEKHSLFKE 376
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA F+ GR+AVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 546 GLFPLTAIMNHECTPNASHYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 605
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C CPRC D+ + L C++ GC G ++
Sbjct: 606 MTKNFMCDCPRC-----NDNTENGTYLSALFCREQGCKGLVI 642
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA FE G VV A + +PKG E+ ++Y + ST+ R+ L
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281
Query: 150 EQYLFTCTCPRC 161
F C CPRC
Sbjct: 282 MTKHFICQCPRC 293
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 96/174 (55%), Gaps = 15/174 (8%)
Query: 20 TTDNYSLVEALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAEN---FSKLA 74
T ++ + + L++H SD+ D+K++ ++ ++ ++ L+ +IS+ A+ + ++
Sbjct: 110 TPTSFRVWKDLMSHYSDLKSDKKRMDHFSTLSMVLFEYLK--DISLPNTADLMGLYGRMV 167
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLI 132
N+ TI + E+ +GTG+Y S+I+HSC PNAV VF+G+ +RA++ + ++ I
Sbjct: 168 INSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVFDGKTINIRALKDMNCLDWKKIRI 227
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
SYI+ + RQ L++ Y F C C RC+ D + +C DGC
Sbjct: 228 SYIDLMKTPYERQMELRQSYYFLCQCDRCL------DENRIKYVHAAKCLKDGC 275
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 59/102 (57%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L +V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLIIVK 238
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280
>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
Length = 464
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 56/107 (52%), Gaps = 9/107 (8%)
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV----- 118
N +A L NAHTI +SEL LG G +P S +NH C PN V +F G V
Sbjct: 168 NFVASLIRMLEMNAHTIYDSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSVSGKPI 227
Query: 119 ----RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
R ++ + G E++ISY++ S M R + LKE Y F C C RC
Sbjct: 228 SIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRC 274
>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
SS1]
Length = 531
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 26/273 (9%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF------EGRL 115
SI ++ S+ N+ ++ +L +G + P+ ++I+HSC+PNAV+VF G L
Sbjct: 182 SIRDLTNLLSRFQTNSFSLTTPDLTNVGVAISPLAALISHSCMPNAVVVFPTGLGRRGGL 241
Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
V+ A++ + G EVL SY++ A R K LK++YLF C C C K + I
Sbjct: 242 EVI-ALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLFDCECVLCEKHHDHEWIDPR-- 298
Query: 176 LEGYRCKDDGCSGFLLRDSD--DKGFTCQQCGL-----VRSKEEIKKIASEVNILSKKTL 228
E RC GC G + + TC CG S +E K+A +V ++
Sbjct: 299 -EALRCSKKGCKGKMGTPTSLGTTEITCSICGTKLSVSTASLQEHVKLAEDVLHKAEAIQ 357
Query: 229 ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED--------WKE 280
S +VS + + Y P L ++ LI + +L ++
Sbjct: 358 FSDSDKARYLLVSILRDFDPFALSAY-PLLQILQLLQQLLITAVFDLPSTGTERDGLLQD 416
Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
A + L + + VYP HP G+QY T KL
Sbjct: 417 AASCSALVLSGMEDVYPYGHPSKGIQYVTSAKL 449
>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
Length = 390
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 17/149 (11%)
Query: 67 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--- 119
A+ +L C NA T+ ++ L PLG + P ++INHSC PNAV+VF R+ +
Sbjct: 54 ADALMELVCQHETNAFTLADAHLNPLGVCIEPTFALINHSCDPNAVIVFPDRVRGMPSKM 113
Query: 120 ---AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
A++ + G EV ISY++ A RQ L E+Y F C C C ++G D
Sbjct: 114 HLVAIRAISAGEEVRISYVDVASCQAERQATLLERYCFACECRLCKRMGWRDPRM----- 168
Query: 177 EGYRCKDDGCSGFLLRDSDDKGFT-CQQC 204
C C+G+++ + +T C QC
Sbjct: 169 -ALWCPQPKCTGWVVFQNAASSWTACTQC 196
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 58/102 (56%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280
>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 556
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 83/187 (44%), Gaps = 26/187 (13%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLI-------------LQWPEIS-------INEI 66
V+ LV H D +Q Y +AN V + L WP ++
Sbjct: 127 VDELVHHNDDRSPEQKEEYMLMANFVARLCLAGCGDSKKGSALLWPSAQGRGLPGLVDAA 186
Query: 67 AENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
KL+CN +I S L +G GLY + NHSC PNA F G+ +R + +
Sbjct: 187 YAVLGKLSCNVFSIAESALNGEVGCGLYLEAAAANHSCNPNAAQSFSGKTLSLRCTRPIR 246
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDD 184
KG E+ I + R+++L++ Y F C C RC G+ +D++ LE + C D
Sbjct: 247 KGEEITIGITQIQKPGPARRESLRKTYFFECRCERCESPEGRAEDMR----LEAFACPDS 302
Query: 185 GCSGFLL 191
CSGF L
Sbjct: 303 ECSGFCL 309
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 58/102 (56%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E+ + IAE +++ N+ T + PLG + P +NHSC PNAV+ F+G L V+
Sbjct: 179 ELDVETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDGGLITVK 238
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 239 ALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 240 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 299
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 300 MTKYFVCNCDRC-----NDNSENGTYLSALFCREQGCKGLVI 336
>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
Length = 527
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 55/93 (59%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F KL+ N+ T+ ++ + G L+P + NHSC PNA F+ +RA + + KG +
Sbjct: 195 FGKLSANSFTLTSAFGKRRGVYLHPAAARFNHSCNPNASYSFDKGKCYIRATKPIAKGEQ 254
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+ I YI+T S TR+ LKE+Y F C CPRC+
Sbjct: 255 IFIPYIDTTYSVGTRRHELKERYKFDCQCPRCL 287
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 2/105 (1%)
Query: 59 PEISINE-IAENFSKLACNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGRLA 116
P+++ N+ + E +++ N+ ICN EL+ +GTG+Y S INHSC PN V F R
Sbjct: 162 PQVTDNKMLLELCARIKNNSFAICNEELQSDVGTGVYLNCSFINHSCEPNCVAEFNMRTL 221
Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+RAV+++ G EVLISY++ ++ RQ+ L Y F CTC C
Sbjct: 222 KIRAVKNITAGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSC 266
>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
Length = 600
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 380 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLK 439
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 440 MTKYFVCNCDRCN-----DNSENGTYLSALFCREQGCKGLVI 476
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
+ L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+
Sbjct: 210 VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKM 269
Query: 147 ALKEQYLFTCTCPRCIKLGQFDDI 170
L F C CPRC Q D +
Sbjct: 270 FLGMTKHFMCQCPRCQDPTQTDAV 293
>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
caballus]
Length = 802
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 17/120 (14%)
Query: 59 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 103
PE SI +A + +L CNA I +S L TG++PV+S++NHSC
Sbjct: 482 PESSIWGVAMLRHMLQLQCNAQAITAIQQTGSKENIITDSRQVRLATGIFPVVSLLNHSC 541
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A VRA QH+ KG E+L Y E+ RQ+ L+ QY F CTCP C
Sbjct: 542 SPNTSMSFVSTVATVRASQHIGKGQEILHCYGPHESRMGVAERQQKLRSQYFFDCTCPAC 601
>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
Length = 611
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + S +TR LK
Sbjct: 391 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARAIPKGGEITTTYTKILWSNLTRGIFLK 450
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 451 MTKYFVCNCDRCN-----DNSENGTYLAALFCREQGCKGLVI 487
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
+ L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+
Sbjct: 222 VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKM 281
Query: 147 ALKEQYLFTCTCPRC 161
L F C CPRC
Sbjct: 282 FLGMTKHFMCRCPRC 296
>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
Length = 302
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 85 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 144
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 145 MTKHFACDCVRCN-----DNTENGTYLSALFCREQGCRGLVI 181
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 242 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 301
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 302 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 338
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 275 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 334
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 335 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 371
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA FE GRLAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 771 GLFPLTAIMNHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLK 830
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 831 MTKHFACDCVRC-----HDNTENGTYLSALFCREQGCRGLVI 867
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+ L
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287
Query: 150 EQYLFTCTCPRC 161
F C C RC
Sbjct: 288 MTKHFICKCVRC 299
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
NA TI N L+ + GLY +S +NHSC PN VL+F G +R ++ V GAE+ ISY+
Sbjct: 188 NAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDGAELFISYV 247
Query: 136 ETAGSTMTRQ-KALKEQYLFTCTCPRCIKLGQF-DDIQ 171
+ S ++ + L++QY F C+C RC + ++ DD+
Sbjct: 248 DVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVD 285
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 56/93 (60%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F CN+ +IC+++L + G+Y S++NHSC PN V+VF+ R +R V+ V +G
Sbjct: 180 FGATICNSFSICDNDLNGIAVGIYLRASMLNHSCYPNCVVVFDERKLQLRTVRDVKEGDV 239
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
ISY++ RQ L+E+Y F+C C +CI
Sbjct: 240 CTISYVDVINPAKERQTELEERYHFSCNCVKCI 272
>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 401
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 8/148 (5%)
Query: 59 PEISINEIA-ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
P ++ ++A + +++ N T C+ P G GL+P ++INHSC PN + G
Sbjct: 20 PSAALLDLARKELGRVSLNGITTCDG-TGPTGLGLFPSGAMINHSCSPNCQAWWRGSQLE 78
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
+R + V G E+ +SYI + R+ L+ + F C C RC+ Q+D A L
Sbjct: 79 IRCTKPVATGEELCLSYIPIDQPSTVRRAQLRHSWFFACRCRRCVSR-QWD-----AELV 132
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
G RC GC+G + G C CG
Sbjct: 133 GLRCPTKGCAGAVPPPPWRGGHACSHCG 160
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 105/243 (43%), Gaps = 37/243 (15%)
Query: 18 STTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLI--------LQWPEIS----- 62
T D + LVE + S+ K+L+ + IANL NL+ L PE+
Sbjct: 168 DTLDDIFDLVENTIDDGSNKAAKELIDIITDYIANLFNLVITSSDNHDLPTPELKKEAVE 227
Query: 63 --INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
I+ I E K+ CN I + + + P S NHSC+PN + + +G +A
Sbjct: 228 KVISVIRELIHKVRCNQFGIWTKNDKCIAVAISPSSSFFNHSCIPNCINIRDGNKMTFKA 287
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ V KG + ISY++ +R++ LK Y F C CPRC + D+ ++ I + Y
Sbjct: 288 LYPVKKGEPLAISYLDLDLPVESRKEYLKYGYYFDCGCPRCDEKTNQDECMDNWISKFY- 346
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVV 240
C+ C GL SK ++ +VN K TL + C N +
Sbjct: 347 CQRKKCV-----------------GLYYSKTKVN--IDQVNKNDKITLNCSDCNNEFIIN 387
Query: 241 STY 243
STY
Sbjct: 388 STY 390
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 82/151 (54%), Gaps = 10/151 (6%)
Query: 27 VEALVAHMSDIDEKQLL-LYAQIANLVNLILQWPEIS----INEIAENFSKLACNAHTIC 81
VE L H++D+ E ++ L I N ++ WP S I++I+ F + CN T+
Sbjct: 110 VEELEDHVADMQEDEIKELKVDIHNFLDY---WPRNSKQHTIDDISHIFGVINCNGFTVS 166
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ L+ +G GL+P + ++NH+C PN ++ +R++ + +G E+ ++Y++
Sbjct: 167 DQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGKIELRSLGKIAEGEELTVAYVDFLNL 226
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
+ R++ LK QY F C C C K G DD++
Sbjct: 227 SEERRRLLKTQYFFDCQCDYC-KNGTKDDLK 256
>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 465
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 134/317 (42%), Gaps = 28/317 (8%)
Query: 10 LQNDNVIPSTTTDNYSLVEALVAHMSDI-DEKQLL-LYAQIANLVNLILQWPEISINEIA 67
L D ++P + H D+ +E L+ L AQ LV + ++
Sbjct: 122 LAKDGILPDKANATSVFPMDSMWHSVDLTNETHLISLLAQKLGLVR-----ESWHVRDLQ 176
Query: 68 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVP 125
E ++ CN I + +G G YP+ +++NHSC PN V +F A + A++ +
Sbjct: 177 EMLARFQCNNFCILDENFFEVGAGCYPLGAMVNHSCDPNCVTIFARGSAQLELWAMKSIG 236
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG 185
K EV ISY++ A R+K L+++Y F C C RC + +A ++ + D+
Sbjct: 237 KDEEVTISYVDPANCMNKRRKYLQKRYHFDCRCQRC-----YCKASHAATIDTFLVADEN 291
Query: 186 ---CSGFLLRDSDDKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS 241
+ L ++ +G Q G + +K+ + IL+ + V +
Sbjct: 292 GIPSEKWSLNEATKRGTVLSQITGKIEEFRVCEKLTRKCAILADIDSDFV-----RNVFT 346
Query: 242 TYKMIE----KLQKKLYHPFSVNLMQTREKLIKILMELEDWKE-ALAYCQLTIPVYQRVY 296
M +LQ +L+ ++ + +++ + ELE ++ AL Y Y VY
Sbjct: 347 HLHMYNIDAFRLQARLFSDLTIMIAESQSGNSREFAELEFVQQKALVYGYRISKFYAFVY 406
Query: 297 PQFHPLLGLQYYTCGKL 313
P HPL L G+L
Sbjct: 407 PALHPLSNLHNIRMGEL 423
>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
Length = 799
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 120/312 (38%), Gaps = 54/312 (17%)
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 139
N E +G +Y V S+ NHSC PN F R +RA +HV G + +SY G
Sbjct: 478 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 537
Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-------- 191
RQK LK++Y F C C C +L D +L +RC + C G +L
Sbjct: 538 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVLDSCVIKYE 592
Query: 192 ---------------------------RDSDDKGFT----------CQQCGLVRSKEEIK 214
R+ + F C CG R E
Sbjct: 593 NKKFERFQGVPQDCISEPHLQLKNDGIREVAHQAFANSSFRAAPGYCLHCGAYRDLEASH 652
Query: 215 KIASEVNI-LSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 273
E I +S+ A+ S S L K H ++ + + + + +
Sbjct: 653 ATVGEAGIYISRLQEAIVSKEVPATTFSDALRSLDLLKSTLHAYNKGIAEAEDWIAQAFC 712
Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
+ + + A+ +C+ +I + +++Y H ++G + ++ LGDT A+KS++ I
Sbjct: 713 MIGELQPAMHHCKASIEILEKLYGSNHIVIGYELMKLSSIQLSLGDTA-AMKSISRLAAI 771
Query: 334 LRITHGTNSPFM 345
+G ++ M
Sbjct: 772 FSWYYGPHADMM 783
>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
glaber]
Length = 793
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 120/274 (43%), Gaps = 29/274 (10%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
L TG++PV+S++NHSC PN + F G +A +RA Q + KG E+L Y ++ S RQ
Sbjct: 520 LATGIFPVVSLLNHSCSPNTSISFSGTVATIRAAQQIGKGQEILHCYGPHKSWMSIAERQ 579
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
+ L+ QY F C CP C ++ ++ +A E +RC + C + +G
Sbjct: 580 QKLRSQYFFDCNCPAC----HYEKLRTAAGPRWEAFRCNN--CRALM------QGDDVLS 627
Query: 204 CGLVRSKEEIKK---------IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLY 254
CG E I + + +V I SK L G + + + +
Sbjct: 628 CGNRACLEPISRTHLVSQLQDLQQQVGIASK----LLRNGKLDQAIPRLLRCRQDAESFL 683
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
+ + L + L DW+++ + Q ++ V + + +G + + ++
Sbjct: 684 AAEHAVAGEIEDNLARAYAALGDWQKSATHLQKSLQVVEARHGPSSVEMGHELFKLAQVL 743
Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ A+ ++ +A E+L + G S ++EL
Sbjct: 744 FNGFAVPEALNAIQKAEEVLLVHCGPQSDEVQEL 777
>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
Length = 576
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 127/284 (44%), Gaps = 29/284 (10%)
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 135
+ + NS L TG++PV+S++NHSC PN + F +A VRA + + KG E+L Y
Sbjct: 296 NNVTNSRQVRLATGIFPVVSLLNHSCSPNTSMSFVSTVATVRASEKIAKGQEILHCYGPH 355
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
+ + RQ+ L+ QY F C+CP C K + E + C G G LL+ D
Sbjct: 356 HSRMAVAERQQKLRAQYFFDCSCPVCQKEKHRATV--GPRWEAFCC---GRCGALLQGGD 410
Query: 196 DKGFTCQQCGLVRSKEEIKK--IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ--- 250
CG E + + + S V L ++ G Q+++ ++ + +Q
Sbjct: 411 -----VLSCGGTSCMESVSRDHLISRVQDLQRQV------GGAQKLLGNGQLEQAIQLLL 459
Query: 251 ------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
+ P + + + L + L DW+++ A+ Q ++ V + + +G
Sbjct: 460 GCRHDAENFLSPEHSVVGEMEDALAQAYAALGDWQKSAAHLQKSLQVVEAQHGPSSVEMG 519
Query: 305 LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +L + A+ ++ A ++L + +G S ++EL
Sbjct: 520 HELFKLAQLFFNGFAIPEALNTIERAEKVLLVHYGPCSDEIQEL 563
>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 356
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 59/107 (55%), Gaps = 4/107 (3%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F CN+ +IC+++L + G+Y S++NHSC PN V +GR + V+ V +G E
Sbjct: 171 FGATICNSISICDNDLIDIAVGIYLRASMLNHSCDPNCAWVCDGRKLQIMTVKDVKEGDE 230
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQE 172
ISY++ RQ LKE Y FTC C +CI+ LG D + E
Sbjct: 231 CTISYVDAMKPAKVRQADLKESYHFTCKCVKCIEEINALGPDDGLGE 277
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%)
Query: 98 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 157
+ NHSC N ++VF+ R +R ++ V G E ISY++ RQ L+E+Y FTC
Sbjct: 1 MANHSCDYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCK 60
Query: 158 CPRCIK 163
C +C++
Sbjct: 61 CVKCVE 66
>gi|412986118|emb|CCO17318.1| SET and MYND domain-containing protein 3 [Bathycoccus prasinos]
Length = 734
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 55/93 (59%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
S+ ++A+ KL N HT+C+ ELRP G G+YPV +++NHS PN F G+ +VR +
Sbjct: 280 SVADLAKLIGKLRLNCHTLCDDELRPYGIGVYPVAAMMNHSENPNCFATFRGKKMIVRCL 339
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
+ V G E+ ISY E R K+LK Y F
Sbjct: 340 RDVLPGEELTISYDELMKPKRERAKSLKSNYGF 372
>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
Length = 287
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 54/86 (62%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
N+ I ++ +G G P++ + NHSC PNA L F+GR AV+ A++H+ +G E+ ISYI
Sbjct: 188 NSFRITDASGDEIGIGFDPLLGMANHSCAPNASLKFDGRCAVLTALKHIEEGEEITISYI 247
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRC 161
+T RQ L++ Y FTCTC C
Sbjct: 248 DTTLPRAARQAFLQKHYYFTCTCAAC 273
>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
Length = 495
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 115/261 (44%), Gaps = 42/261 (16%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVV 118
++ I IAE +K+ N T PLG + P +NHSC PNAV+ F EGR+ V
Sbjct: 176 DLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPNAVVGFDEGRI-TV 234
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
+A++ + +V ISYI+ RQK L E+Y FTC C +C+ Q D +E +
Sbjct: 235 KALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKCL---QGDKAREDQFIPA 291
Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
+D + + +E +K A E+ L S N +
Sbjct: 292 SPPSND----------------------IETLKEAEKQAQEL---------LASAKNARP 320
Query: 239 VVSTYKMIEKLQ----KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQ 293
V + ++ ++ K++ Q R +LI L+++E + AL + + + V
Sbjct: 321 VFAVRQLKAAIKILHDTKMWPITRQPYPQIRSELIVALLQIESFWPALQHSAVRYLFVDP 380
Query: 294 RVYPQ-FHPLLGLQYYTCGKL 313
+YPQ +HP+ Y +L
Sbjct: 381 VLYPQEWHPIRNNHAYILSQL 401
>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
familiaris]
Length = 794
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 6/112 (5%)
Query: 54 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
L LQ +I I E SK + +CNS L TG++PV+S++NHSC PN + F
Sbjct: 488 LQLQCNAQAITTIQETGSK----ENNVCNSRQIRLATGIFPVVSLLNHSCSPNTSVSFSS 543
Query: 114 RLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIK 163
+A +RA Q + KG E+L Y + RQ+ L+ QY F C CP C K
Sbjct: 544 TVATIRASQQIGKGQEILHCYGPHYSRMGVADRQQKLRSQYFFDCGCPACQK 595
>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 518
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 137/328 (41%), Gaps = 44/328 (13%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRA 120
+ I E F K++ N+ I +++L P G Y PV NHSCL N V +F+ + +R
Sbjct: 184 LKNIIEKFVKISKNSLQIIDNDLEPAGLAFYKKPV-PYFNHSCLSNCVTIFKNQKLFIRT 242
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP--------------------- 159
+ V G E+ ISY++ A TR +QY FTC+C
Sbjct: 243 LVDVYPGEELTISYVDIAFDRNTRLAICMDQYFFTCSCKLCKVNMVSECQHIFNTELLCT 302
Query: 160 ------RCIKLGQFDDIQESAILEGYRCKDDGCSGF-LLRDSDDKG--FTCQQC------ 204
+C+ + I E Y K+ S + + + ++++ + C C
Sbjct: 303 QSESCKKCVNHMELLLISELERKSCYMNKNQFKSAYPVFKKTNEQQHVWKCSICKGEVHD 362
Query: 205 ----GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
LV ++E K AS + L + + Q + I+ L YH +
Sbjct: 363 NVIKSLVEKEKETVKEASHLESLFSEKYTYDNKSVLQALNKIKSKIDYLTT-FYHHTKYS 421
Query: 261 LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT 320
L + R K++ I ++L+++K A + + Y ++ P+ G + GKL FL
Sbjct: 422 LQKMRAKILYISIQLQEFKLAYNIANQYLKSIEVTYGKYSPIYGYYIFLTGKLALFLDLK 481
Query: 321 ENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + +A + + T+G +SP K+L
Sbjct: 482 SEGLSLIHKAKKNIIKTYGPDSPIYKDL 509
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 121/266 (45%), Gaps = 41/266 (15%)
Query: 73 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEV 130
+ CN I N L G+G+YPV +++NH+C+ N VL ++ + +RA++ + G E+
Sbjct: 254 MQCNNFAIHNDILFARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEI 313
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGF 189
++ + A T+ R+ LK Y F C C RC D +E A L R D + F
Sbjct: 314 THAFTDAASPTVVRKAHLKSLYAFDCNCSRC---NDSDAAKELDAELVATRPIDTIPAYF 370
Query: 190 LLRDSDDKGFTCQQCG-----LVRSKEEIKKIAS--EVNILSKKTLALTSCGNHQEVVST 242
K F ++ L R+ ++ K++S +V L + + + G + +
Sbjct: 371 -------KRFRLERLAGLPSLLERATRDVPKVSSPADVEALLQASRSWLQEGMDPRLPTA 423
Query: 243 YKMI-----EKLQKKLYHPFSVNLMQTREKLIKILM-----ELEDWKEALAY-------- 284
++ +++ L +++ L ++ K+ + + L + + LA
Sbjct: 424 RAIVLLVTAWAVRQALLGDYNLELFESNVKIFGMALLWREEALSERQRDLALVSGLRNVL 483
Query: 285 ---CQLTIPVYQRVYPQFHPLLGLQY 307
+ I VYQR+YP HPLLGLQ+
Sbjct: 484 VDSARHVIRVYQRIYPANHPLLGLQW 509
>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
Length = 801
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 19/269 (7%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
L TG++PVIS++NHSC PN + F G +A VRA QH+ +G E+L Y RQ
Sbjct: 525 LATGVFPVISLLNHSCSPNTSVSFVGTVATVRASQHIRRGQEILHCYGPHVSRLGIAARQ 584
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG--FLLRDSDDKGFTCQQ 203
+ LK QY F C+C C + A R + CS LL+ D +
Sbjct: 585 QKLKSQYFFNCSCLAC-------QTEAHATAAAPRWEAFCCSSCRALLQGGDVLSCSSGS 637
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK----LYHPFSV 259
C S++ +A+ + L ++T E+ + + Q+ L+ SV
Sbjct: 638 CTEAVSRDH---LAARLRDLQRQTGLARELLRSGELEKAIQQLSGCQRDAESFLWAEHSV 694
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
L + + L + L DWK++ A+ Q ++ V + + +G + + ++ + +
Sbjct: 695 -LGEIEDDLAQAYAALGDWKKSAAHLQKSLTVVEARHGPSSVEMGHELFKLAQILFNGFE 753
Query: 320 TENAIKSMTEAVEILRITHGTNSPFMKEL 348
A+ ++ +A +L + G S ++EL
Sbjct: 754 VPEALSTIQKAETVLAVHCGPCSDEIQEL 782
>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Nomascus leucogenys]
Length = 703
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 132/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 382 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 441
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 442 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 501
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 502 -------QTEAHRMAAGPRW---------------EAFCCNGCGAPMQGDDVLRCGSR-- 537
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 538 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAI 576
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 577 QRLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQRSLRVVEVRHG 633
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 634 PSSVEMGHELFKLAQ 648
>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
Length = 273
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 113/253 (44%), Gaps = 37/253 (14%)
Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
NH C PN ++F +RA+ + +G E+ +SYI+ + R++ LK+QY F CTC
Sbjct: 1 NHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60
Query: 160 RCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
C K + KDD FL G+ + + +++ E
Sbjct: 61 HCQK----------------KLKDD---LFL--------------GVKDNPKPSQEVVKE 87
Query: 220 VNILSKKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
+ SK TL S G + EVV + + Q+ ++ ++ +++ + ++L L
Sbjct: 88 MIQFSKDTLEKIDKARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYL 147
Query: 276 EDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILR 335
+ ++EA Y + + Y ++Y + LG+ G W G+ E + +A IL
Sbjct: 148 QAFEEASFYARRMVDGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILL 207
Query: 336 ITHGTNSPFMKEL 348
+THG + P K+L
Sbjct: 208 VTHGPSHPITKDL 220
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 12/139 (8%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + + I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L+ +G G++P + ++NH C PN ++F +RA+ + +G E+ +SYI+ + R
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 145 QKALKEQYLFTCTCPRCIK 163
++ LK+QY F CTC C K
Sbjct: 250 KRQLKKQYYFDCTCEHCQK 268
>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
porcellus]
Length = 800
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 15/105 (14%)
Query: 72 KLACNA-------HT------ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+L CNA HT I +S L TG++PV+S++NHSC PN + F G +A +
Sbjct: 491 QLQCNAQAITAIQHTGSKESIITDSRQVRLATGIFPVVSLLNHSCSPNTSVSFTGTIATI 550
Query: 119 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
RA Q + KG E+L Y E+ S RQ+ L+ QY F C CP C
Sbjct: 551 RAAQQIRKGQEILHCYGPHESHMSVAERQQKLRSQYFFDCCCPAC 595
>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
Length = 463
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 11/150 (7%)
Query: 24 YSLVEALVAHMSDIDEKQ-----LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
++ + L +H S+I + +L I N N E++ + + F K+ N+
Sbjct: 136 WNTIMDLESHRSEIMSSEKGDVIWMLSKGIQNFTN------EMNETFVLDLFCKIMINSF 189
Query: 79 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
++ S +GT + P+IS INHSC PN LVF+ +R++Q + E+ +SYI+
Sbjct: 190 SLMTSSYDTIGTAIDPIISRINHSCYPNTALVFDRNTVALRSLQKILPNQEITVSYIDIY 249
Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
+ R L +Y F+C C RCI F+
Sbjct: 250 NTQKNRHDELLSRYYFSCKCTRCIVSDGFE 279
>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
Length = 409
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 47 QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 105
Q +VNL+ + PE + E+ +L+ N + N P+G L P+++ NHSC P
Sbjct: 58 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 116
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
NA + F+G+ A +RA+ + KG ++ ISYI+ R++AL + + F C C RC+
Sbjct: 117 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 173
>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
Length = 623
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 54/99 (54%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ E+A ++ CN+HTI + EL+PL G++P+ ++ NH C PN + F G V RA
Sbjct: 155 VGPRELALLIARFGCNSHTISDDELQPLAVGIFPLGAMANHDCRPNTLHAFRGGRMVFRA 214
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
V+ + G EV SY E A R+ L + +LF P
Sbjct: 215 VRAIQPGEEVTTSYTELAAPRWERRAVLLQHHLFDIDAP 253
>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
CBS 8904]
Length = 573
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 100/233 (42%), Gaps = 19/233 (8%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 133
NA T+ + +L+P+G + P++++ NHSC PNA +VF G+ V A + G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
Y++ + TRQ L+ +Y F C CP C D + + C C G
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASE-VNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
K TC V S ++ + +E ++++ + V +EK+
Sbjct: 397 ETVKCDTCGASWTV-SPNALRDLVNEGLSLVEADDQDQLDPDDLPRVAEVIARLEKIM-- 453
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGL 305
P S + + K L EALAY + +VYP+ HP + L
Sbjct: 454 ---PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVAL 503
>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 838
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 3/106 (2%)
Query: 50 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVL 109
N V+ + P+ ++A + ACN HTI + EL P+G G+YP +++NH+C P
Sbjct: 235 NFVDPLAGGPDP--RDVARLLAAFACNVHTIHDDELNPVGMGVYPTAAMMNHACAPTVAQ 292
Query: 110 VF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
F EG A R V+ V KG E+ ISY+E + R+ L Y F
Sbjct: 293 TFGEGTTATFRCVRDVRKGEEIAISYVELLATRAERRAELLNHYRF 338
>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
Length = 544
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 76/140 (54%), Gaps = 2/140 (1%)
Query: 26 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
L L +HM++I + + +I+ + ++ +++E I +KL N++ +
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMICHYGAKLELNSYNFHSP 246
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
LG L+P ++INHSC N+V+ +G V+A++ + KG E+L+SYI+ T
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKT 306
Query: 144 RQKALKEQYLFTCTCPRCIK 163
RQ L E+Y F C CP+C K
Sbjct: 307 RQTELSERYYFKCRCPKCAK 326
>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
Length = 612
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 2/117 (1%)
Query: 47 QIANLVNLILQWPEISINEIAENFSKLACNAHTI-CNSELRPLGTGLYPVISIINHSCLP 105
Q +VNL+ + PE + E+ +L+ N + N P+G L P+++ NHSC P
Sbjct: 221 QAHAVVNLLQRKPEF-VKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARANHSCRP 279
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
NA + F+G+ A +RA+ + KG ++ ISYI+ R++AL + + F C C RC+
Sbjct: 280 NAAITFDGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRCV 336
>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
Length = 481
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 126/279 (45%), Gaps = 40/279 (14%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E+S+++I E F KL NA T+ +G + P+ ++ NHSC PNA F+ VR
Sbjct: 162 ELSLHKIIEYFCKLDINAFTLTTPFYDHVGAAIEPLAALCNHSCSPNAATDFDKGKIWVR 221
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
A++ + KG +V +SYIET RQ L ++Y F C C +C +I+++A +
Sbjct: 222 ALRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKC-------EIEKNAPDTHF 274
Query: 180 RCKDDGCSGFLLRDSDDKG---FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNH 236
++++ + F + G S + IK + S ++ L + TL
Sbjct: 275 LRAITAVDSKTIQNAQQEAMELFEMVKPG-APSTDSIKNLRSAMSALRRTTLW------- 326
Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-IPVYQRV 295
L ++ Y ++ R +LI LM ++ A +C + + V V
Sbjct: 327 -----------PLTRQPY-------VRLRGELIASLMGARQFQSAFVHCVIRHLRVNPVV 368
Query: 296 YP-QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
YP ++HP+ + + KL +L T+ + E V++
Sbjct: 369 YPNRWHPISSMHKWVFVKLMRYL--TQAGDLGVAEGVDL 405
>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
grubii H99]
Length = 447
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVR 119
++E+ S + NA T+ + L P+G + P++++ NHSC PNA+ VF GR +
Sbjct: 110 GVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIFLV 169
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI---- 175
A+ +P G E+L SYI+T+ RQ L +Y F C C C K + ++S +
Sbjct: 170 ALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVDPRW 229
Query: 176 -LEGYRCKDDGCSGFLLRDSDDK-GFTCQQCGLVRSKEEIKKIASEVNILSKKTLAL 230
+ C +G + L ++ DK C C KE+ K A +V L ++ ++
Sbjct: 230 CVRHRECSKEGKARMPLENATDKVEVRCIAC-----KEDFKVDADKVRKLVQEGMSF 281
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 1/104 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E ++ E AE KL NA +++L +G L P +++ NHSCLPNA + F GR AV+R
Sbjct: 167 EGAVREAAEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLR 226
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A Q + G E+ ISY + S RQ AL Y F C C RC +
Sbjct: 227 AEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFECRCRRCTQ 269
>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
CBS 2479]
Length = 582
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 26/241 (10%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLIS 133
NA T+ + +L+P+G + P++++ NHSC PNA +VF G+ V A + G EVL +
Sbjct: 287 NAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTT 346
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF---- 189
Y++ + TRQ L+ +Y F C CP C D + + C C G
Sbjct: 347 YVDISDDKETRQGDLQSRYGFECECPACT----LDAV------DPRNCLLHECGGLARMP 396
Query: 190 --LLRDSDDKGFTCQQCG--LVRSKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYK 244
L + SD + C CG S ++ + ++++ + V
Sbjct: 397 ADLRQMSDSETVKCDTCGASWTVSPNALRDLVNGGLSLVEADDQDQLDPDDLPRVAEVIA 456
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLG 304
+E++ P S + + K L EALAY + +VYP+ HP +
Sbjct: 457 RLEEIM-----PPSSRPLLRLHTIAKNLHVPAAQPEALAYAHKALAGAYKVYPRNHPSVA 511
Query: 305 L 305
L
Sbjct: 512 L 512
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
+ + IAE +++ N+ T S PLG + P +NHSC PNAV+ F+ L V+A
Sbjct: 180 LDLETIAEYLARIEVNSFTFTTSFGDPLGLCIQPFACYMNHSCDPNAVVGFDEGLITVKA 239
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
++ + +V ISYI+ RQK L E+Y FTC C +C++
Sbjct: 240 LRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282
>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 656
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 80/174 (45%), Gaps = 16/174 (9%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S ++ + S+ N+ T+ L P+G + P+I++INHSC PNAV+VF
Sbjct: 239 SARDLVDLTSRFTSNSFTLATPALAPIGVSVSPLIALINHSCDPNAVVVFPRSSTASTPA 298
Query: 112 --EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
E ++ VV A++ + E+L SYI+T RQ ALKE Y FTC C C +
Sbjct: 299 KLEPQMQVV-AIKDIEPDTEILTSYIDTTLPQPIRQSALKETYCFTCACTLCARPQDATT 357
Query: 170 IQ-ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 222
I SA+ RC G + + C CG S + I +N+
Sbjct: 358 IDPRSAVWCAQRC--GGVASVYSAGDELVVGRCNTCGFSASMTTAESILDAINV 409
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/343 (23%), Positives = 137/343 (39%), Gaps = 38/343 (11%)
Query: 13 DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEI----SINEIAE 68
D+ P+ D++ V L +H +D Q NL + P S ++ +
Sbjct: 116 DSDEPNNKIDSFIHVLQLESHFDRLDPSQQEDLGIFLMKFNLYWKKPIPGSLRSSRKLLD 175
Query: 69 NFSKLACNAHTICNSE--LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
+ + N IC+ E + +G+ LY S+INHSC PNA VF G V++A++ +
Sbjct: 176 LVAAIKNNQFAICDEESSICDIGSALYLNHSLINHSCKPNAFPVFNGTNLVIKALEKIAP 235
Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
G E+ I+Y +T R+ L + Y F C C C + D R K
Sbjct: 236 GEEIKIAYTDTKAVIQDRRDYLNDIYRFVCQCQGCTNDDEVD-----------RKKHLDK 284
Query: 187 SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMI 246
G ++R SD + + +VR EE KK E N++ +E +
Sbjct: 285 KGNVIRRSDAIWQSAET--MVRDMEEFKK-NKEWNLM-------------KEAAQGW--- 325
Query: 247 EKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
L +K ++ ++ E +E ++W L I Y+ +Y HP LG+
Sbjct: 326 --LARKFLPDENIFWIRLNEFAFDAGIETQEWSLCLETGASLIVNYKEIYGPMHPTLGIH 383
Query: 307 YYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
K+ + A + A I+ + + S K+L+
Sbjct: 384 LMKFAKILLHIEKPNEAEEYFRRAFAIMSLFYEPESAVRKQLL 426
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA F+ GRLA+VRA + +PKG E+ +Y + +TR LK
Sbjct: 567 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 626
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 627 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 663
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA F +G +V A + + GAE+ +SY + ST+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 150 EQYLFTCTCPRC 161
F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308
>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
Length = 802
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R K F C C + + +
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632
Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 673 RAVQLLSRCQRDAESF--LWAE-HAMVGAIADGLARACAALGDWQKSATHLQKSLRVVEV 729
Query: 337 THGTNSPFMKELILKLEE 354
HG +S M + KL +
Sbjct: 730 RHGPSSIEMGHELFKLAQ 747
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA F+ GRLA+VRA + +PKG E+ +Y + +TR LK
Sbjct: 624 GLFPLTAIMNHECTPNASHYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 683
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 684 MTKHFMCDCTRC-----NDNTENGTYLSALFCREQGCRGLVI 720
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA F +G +V A + + GAE+ +SY + ST+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 150 EQYLFTCTCPRC 161
F C CPRC
Sbjct: 297 MTKGFMCQCPRC 308
>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
CIRAD86]
Length = 616
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 49/82 (59%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG G P+++ NHSC PNA +VF V+RA++ + KG E+ + YI+ + RQ
Sbjct: 284 LGIGFDPILNSSNHSCDPNAAVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAE 343
Query: 148 LKEQYLFTCTCPRCIKLGQFDD 169
LKE Y F+C C +C K FD+
Sbjct: 344 LKESYFFSCRCSKCKKGAIFDE 365
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 11/147 (7%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKL----ACNA 77
D SLVE +++ E Q N+ L P +++ + E ++L NA
Sbjct: 291 DGRSLVEHHDRLAAEVREG---FRQQARNVRWLAAHGPGLALPDADETVARLFAVAQANA 347
Query: 78 HTICN-SELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISY 134
T+ + S LRP+G GLY +++NHSCLPNA + EGRL VRAV+ + G EV ++Y
Sbjct: 348 LTLTDPSSLRPIGQGLYASAALLNHSCLPNANWSVDGEGRL-CVRAVRPIEAGEEVTVAY 406
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
++ RQ+AL++ + F C C +C
Sbjct: 407 VDPTLPYHARQQALQDHFFFACRCLQC 433
>gi|255072485|ref|XP_002499917.1| predicted protein [Micromonas sp. RCC299]
gi|226515179|gb|ACO61175.1| predicted protein [Micromonas sp. RCC299]
Length = 676
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 55/89 (61%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+A +K++CNAHT+C+ EL +G G+YP +++NH P A F+G+ V+RA + +
Sbjct: 228 VARLLAKISCNAHTVCDEELNAIGVGVYPAAAMVNHGDAPTAAQSFKGKKIVLRATRDLK 287
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLF 154
+G EV ++Y+E + R+ AL Y F
Sbjct: 288 RGDEVTMAYVELLATRQERRAALHAGYNF 316
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 259 VNLMQTR--EKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
++L++ R + + + + D++ A +P Y+ YP HP LGL K+E
Sbjct: 549 IHLLRVRVLDAMQRAAVTAGDFESARDAAYAVLPAYRLSYPPCHPPLGLHLALIAKIEAH 608
Query: 317 LGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAEASY 361
L D A++ EA+ L ++HG NS ++ + L E++AE Y
Sbjct: 609 LVDLNAAVQFAREAISCLNVSHGRNSDVVRMMETLLGESEAELRY 653
>gi|308499663|ref|XP_003112017.1| CRE-SET-18 protein [Caenorhabditis remanei]
gi|308268498|gb|EFP12451.1| CRE-SET-18 protein [Caenorhabditis remanei]
Length = 507
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 138/325 (42%), Gaps = 26/325 (8%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H DI DE+++ + ++ + L I + + F K + N+H+I +
Sbjct: 139 SLVDHKVDIREDEEKMKHFEKVFTKMGLFRGDEMIEKGQFFDVFCKASINSHSIHTNAGN 198
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S+ NHSC P +VF+G +R + VP + ISYI+ S
Sbjct: 199 EVGMALDLGVSMYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAESTEQAFISYIDVGRSK 256
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R+K LK ++ F C C RC+ D ++ A L RC + C +L ++
Sbjct: 257 YVRRKDLKSRWYFDCECTRCM------DPEDDA-LTAIRCANPACDAPVLTSETEEPMNI 309
Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
C +C + + ++KK + L Q+++S +++ HP +V
Sbjct: 310 ACDKCKTIIEENDVKKAQEFMKSLPASFDPQCPADILQDLLSKAEVV-------LHPSNV 362
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
+ + R L + +L + + Q+ Y+ +P+ +G Q K +
Sbjct: 363 YVARLRTALFHVTGQLTMDNLSTMHTQI-YNNYKLCFPKADRHVGFQLLHIVKALIEKDE 421
Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
+ A+ +A+ I + G + P+
Sbjct: 422 RDEAMPYAFDAMNIFEVCFGLDHPY 446
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 82/164 (50%), Gaps = 15/164 (9%)
Query: 4 LYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDE-KQLLLYA----QIANLVNLILQW 58
L LRR D +IP +++ +L + + E K LL A + +NL +L +W
Sbjct: 152 LMLRRA---DKIIPEVYSESLALTYPKIYDCVESQETKDHLLMAKALREYSNLTDLDNKW 208
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+A F +L N+ + ++ R G +P + NHSC PNA F +
Sbjct: 209 -------VASLFGRLNANSFCLTSAFGRRRGVYFHPGPARFNHSCDPNASYSFAKGKCYI 261
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
RA++ + K ++ ISY++T S TR+ L+E+Y F C CP+C+
Sbjct: 262 RAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCL 305
>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
Length = 650
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 34/277 (12%)
Query: 10 LQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---- 65
LQND + S D+Y V V H + + L + ++ L+N +L SI+
Sbjct: 352 LQND-FVDSLFFDDYRKVYNFVTHGQQRNAEDYLKWTVMSVLLNTVLVAAGFSISGSLKG 410
Query: 66 -----IAENFSKLACNAHTICNSEL---RP--------LGTGLYPVISIINHSCLPNAVL 109
+ N + N+H I SEL +P +G GLYP + + NHSC P
Sbjct: 411 FLGKILLHNLQIVTYNSHEI--SELQRKKPKDSGFSVCIGAGLYPTLVLFNHSCDPGITR 468
Query: 110 VFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
F G +R ++++P G+ V +Y + T R+K L E Y F C C C + +
Sbjct: 469 YFVGNAVYIRTIKNIPAGSMVAENYGQLYTRAGRRERRKLLAENYKFDCCCQACEE--DW 526
Query: 168 DDIQE-SAILEGYRC-KDDGCSGFLLR--DSDDKGFTCQQCGLVRSKEEIKKIASEVNIL 223
+ E + ++ ++C ++GC LL +S + C +CG + + +V+ L
Sbjct: 527 PSLHEMNPMIRRFKCGANEGCGNELLFKLNSTENEMECNKCGGLTEVNASFETLKQVDFL 586
Query: 224 SKKTLA--LTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++ A L S G+ + +S Y +I L + L P+
Sbjct: 587 NRYNDAARLYSQGDFERALSKYAALINSLDEILVPPY 623
>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Ailuropoda melanoleuca]
gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
Length = 793
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 128/307 (41%), Gaps = 61/307 (19%)
Query: 54 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
L LQ +I I + SK + I +S L L TG++PV+S++NHSC PN + F
Sbjct: 487 LQLQCNAQAITTIQQTGSK----ENNITDSRLVRLATGIFPVVSLLNHSCSPNTSMSFVS 542
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
+A +RA + + KG E+L Y + RQ+ L+ QY F C CP C K +
Sbjct: 543 TVATIRASEKIGKGQEILHCYGPHHSRMGVADRQQKLRSQYFFDCGCPACQK-----EKH 597
Query: 172 ESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+A+ + + F C +CG V ++ S T
Sbjct: 598 RAAVGPRW-----------------EAFCCSRCGAVLQGGDVLSCGS------------T 628
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
SC E VS ++ +LQ ++ R + L+ ++A+ QL + V
Sbjct: 629 SC---TESVSREHLVSRLQD----------LRQRVGSARNLLSNGQLEQAI---QLLL-V 671
Query: 292 YQRVYPQF----HPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
Y+ F H ++G + LGD + + + +++ ++ + HG S M
Sbjct: 672 YRHDAENFLSAEHSMVGEMEDALAQAYAALGDWQKSATHLQKSLRVVEVQHGPASVEMGH 731
Query: 348 LILKLEE 354
+ KL +
Sbjct: 732 ELFKLAQ 738
>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
Length = 776
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 59/282 (20%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 137
I NS+ L TG++PV+S++NHSC PN + F +A VRA + KG E+L Y E+
Sbjct: 492 ITNSKQVRLATGIFPVVSLLNHSCSPNTSVSFISTVATVRASVQIGKGQEILHCYGPHES 551
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK 197
RQ+ L+ QY F C CP C E +R S +
Sbjct: 552 RMGAAERQQKLRSQYFFDCNCPACQN-------------EKHR---------TTTGSKWE 589
Query: 198 GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ--EVVSTYKMIEKLQKKLYH 255
F C +C + +++ SCGN E VS +++ +LQ
Sbjct: 590 AFCCNRCRTLMQGDDV-----------------LSCGNTSCTEAVSRDRLVSQLQD---- 628
Query: 256 PFSVNLMQTREKLIKILM---ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
+Q + K+ + L+ +LE + L C+ + P H ++G +
Sbjct: 629 ------LQQQIKMAQKLLKNGKLEQAIQLLLGCRYDAESF---LPAEHSVVGEIEDDLAQ 679
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
LGD + + + +++ ++ + HG +S M + KL +
Sbjct: 680 AYAALGDWQKSAAHLQKSLHVVEVRHGPSSVEMGHELFKLAQ 721
>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
Length = 170
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 51/77 (66%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
++ +G GLYP IS++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R
Sbjct: 1 MQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEER 60
Query: 145 QKALKEQYLFTCTCPRC 161
+K L++QY F C C RC
Sbjct: 61 RKQLRDQYCFECDCFRC 77
>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
Length = 806
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R K F C C + + +
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636
Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733
Query: 337 THGTNSPFMKELILKLEE 354
HG +S M + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751
>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 677
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 18/201 (8%)
Query: 66 IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
I N + NAH I N++ + +G GLYP IS+ NHSC P + F G
Sbjct: 446 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 505
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VVRA++ + G E+ +Y I T R++ L+ QY F C C C + +
Sbjct: 506 TTMVVRAIRSISSGEEISENYGQIFTTTPESERKRKLRLQYFFDCNCEACREHWPLLEEI 565
Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
+ IL ++C+ G +L DS++ C +CG + + K E + + K
Sbjct: 566 DPTILR-FKCETGKECGNVLPVRTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 624
Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
GNH+E + Y I KL
Sbjct: 625 RYLEQGNHREALKNYLKILKL 645
>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
mulatta]
Length = 802
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 481 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 540
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 541 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 600
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R K F C C + + +
Sbjct: 601 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 632
Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 633 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 672
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 673 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 729
Query: 337 THGTNSPFMKELILKLEE 354
HG +S M + KL +
Sbjct: 730 RHGPSSVEMGHELFKLAQ 747
>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
Length = 806
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 73/318 (22%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 485 PDVTIWGVAMLRHMLQLQCNAQAITTVQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 544
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 545 SPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 604
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R K F C C + + +
Sbjct: 605 -------QTEAHRMAAGPRW---------------KAFCCNSCRAPMQGDGVLR------ 636
Query: 222 ILSKKTLALTSCGNH---QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELE 276
CGN + VVS ++ +LQ +L Q K+L ELE
Sbjct: 637 -----------CGNRSCAESVVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELE 676
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
+ L+ CQ + ++ + H ++G + LGD + + + +++ ++ +
Sbjct: 677 RAVQLLSGCQRDAESF--LWAE-HAMVGEIADGLARACAALGDWQKSATHLQKSLRVVEV 733
Query: 337 THGTNSPFMKELILKLEE 354
HG +S M + KL +
Sbjct: 734 RHGPSSVEMGHELFKLAQ 751
>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 645
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 6/104 (5%)
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGR-LAV 117
+++ + S+ N TI + L PLG + P +++INHSC PNA +VF EG L
Sbjct: 264 SDLVDLISRFTTNTFTITSPTLAPLGACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQ 323
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
V A++++ E+L +YI+T T RQ+ALKE Y F C CP C
Sbjct: 324 VVALKYIGPDEEILTAYIDTTLPTGLRQQALKETYHFVCECPLC 367
>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 635
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 117
SI+E+ + SK N++T+ L P+G + P+ ++NHSC PN +VF L+
Sbjct: 261 SISELVDLISKFTLNSYTLTTPSLSPIGVSVSPLAGLLNHSCDPNVSVVFPRILSADKEP 320
Query: 118 ---VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ A+Q +P +E+L SY++ + RQK LKE Y F C+C C
Sbjct: 321 ALHIVAIQDIPADSELLTSYVDVTLPVLQRQKDLKETYSFNCSCHSC 367
>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Meleagris gallopavo]
Length = 706
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 112/264 (42%), Gaps = 9/264 (3%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 146
L T +PV+S++NHSC PN + F G +A VRA Q +P G E+ Y + ++
Sbjct: 431 LATAFFPVLSLLNHSCSPNISVSFSGTVATVRASQPIPSGQEIFHCYGPHRCRMRVAERQ 490
Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC--QQC 204
L QY F C C C L + + ES + C C ++ C + C
Sbjct: 491 QLLSQYFFECRCQAC--LDELESDVESVSMRNSFC----CPSCQASMQGEEMLCCSNEAC 544
Query: 205 GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
L S+E + + ++ +K L L N E + + + P + + +
Sbjct: 545 ALSVSRERLSRRLLDLQQQMEKALELLRDSNADEAIRMLLKCQMDARNFLSPEHLLMGEL 604
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ L +I L W+EA + + +I + + + +G + + ++ + A+
Sbjct: 605 EDHLAQIYATLGKWQEAARHLERSIQLVEMHHGPSSVEMGHELFKLAQILFNGFAVSEAL 664
Query: 325 KSMTEAVEILRITHGTNSPFMKEL 348
++ A EIL + G S ++EL
Sbjct: 665 STIQRAEEILSVHCGPQSTQIQEL 688
>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 678
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 91/201 (45%), Gaps = 18/201 (8%)
Query: 66 IAENFSKLACNAHTI------------CNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
I N + NAH I N++ + +G GLYP IS+ NHSC P + F G
Sbjct: 447 ILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLYPTISLFNHSCNPGIIRYFIG 506
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VVRA++ + G E+ +Y I T R++ L+ QY F C C C + +
Sbjct: 507 TTMVVRAIRSISSGEEISENYGQIFTTTPEKERKRKLRLQYFFDCNCEACREHWPLLEEI 566
Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
+ IL ++C+ G +L DS++ C +CG + + K E + + K
Sbjct: 567 DPTILR-FKCETGKECGNVLPVKTDSNEFMIECSKCGKCMNIFKGLKALQETDAIFKIAS 625
Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
GNH+E + +Y I KL
Sbjct: 626 RNLEQGNHREALKSYLKILKL 646
>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
lacrymans S7.3]
gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
lacrymans S7.9]
Length = 530
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 61/114 (53%), Gaps = 9/114 (7%)
Query: 64 NEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGR 114
++ + SK N T+ +S L LG + P++++INHSC PNAV+V+ E
Sbjct: 180 GDLVDIISKFITNTFTLTSSSLSALGVSVSPLVALINHSCDPNAVIVYPRCSNEPSTEEP 239
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
L V A++ + E+L +YI+T +RQK LKE Y F C CP C K D
Sbjct: 240 LMQVVAIRDIEVDEEILTAYIDTTLPRFSRQKFLKETYNFDCQCPSCTKYSGVD 293
>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
Length = 750
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 138/339 (40%), Gaps = 64/339 (18%)
Query: 50 NLVNLILQWPEISINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV 108
LV IL E +++E I + N + +S P+G +YP +++NHSC PN +
Sbjct: 437 GLVLRILGCKEKTLDEAIRHTLTAFQQNNFGVTDSLYAPIGEAVYPHAALLNHSCSPNCI 496
Query: 109 LVFE-------GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
L ++ +L +V A + + G E++ SY++ T TR+ L++ + F C C RC
Sbjct: 497 LRYKIGLESSPPQLEIV-ACKDISSGEELVHSYVDLVLPTETRRNRLRDTHGFICECKRC 555
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASE 219
GQ C+ L RD T ++ G + S ++I E
Sbjct: 556 A--GQ-------------------CTVELPRDRQSWMLWPTKERLGEIPSSTPTQQIDIE 594
Query: 220 VNILSKKTLALTSCGNHQEVVSTYKM-----------------IEKLQKKL--------Y 254
I + A H ++V+T ++ I+ LQ+ + +
Sbjct: 595 EAIGGRSGEA-----EHLQIVNTSRLLQDQATRAMVNGNNDLEIQLLQQAVGVFLLSGGF 649
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP--QFHPLLGLQYYTCGK 312
PF L R L+ E + C+ + + HPLLGLQ YT G
Sbjct: 650 SPFHNELYTARCAYFSALLANGKIDEGIEQCEHIVSSLSVCLGNVKHHPLLGLQLYTLGD 709
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
L G + A + A EI+ ++HG P ++ LI K
Sbjct: 710 LCSGGGCGQKAANAYRWAYEIMSVSHGAQDPMVQALIRK 748
>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 496
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 15/141 (10%)
Query: 52 VNLILQWPEISINEIAENFSKLACNAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVL 109
+N +LQ I+ + + + CNA TI C S + LGTGLYP ++ INHSC PN +
Sbjct: 206 INQVLQS---EIDVLYDALCSINCNAFTIIDCVSGVL-LGTGLYPGLTFINHSCDPNLQV 261
Query: 110 VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC--IKLGQF 167
G++ ++AV+ + KG E ISYI+ + R + L E + F C C +C ++ +
Sbjct: 262 TQIGKILTLKAVRPIKKGEEFTISYIDRTEGAIQRNEELMETFFFECQCHKCASVRSEKR 321
Query: 168 DDIQESAILEGYRCKDDGCSG 188
+D + CKD C G
Sbjct: 322 NDFY-------FVCKDPSCKG 335
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 7/143 (4%)
Query: 50 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--GTGLYPVISIINHSCLPNA 107
+L+ L+ +W + E+ + ++ NAH + + L G G++P+ ++INH+C PN
Sbjct: 252 DLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACRPNC 311
Query: 108 VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
V+ G VR ++ V GAE+ + YI+ ST R++ L F C C RC
Sbjct: 312 TFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENPSSM 371
Query: 168 DDIQESAILEGYRCKDDGCSGFL 190
DD L+G C D G G L
Sbjct: 372 DD-----YLDGVCCTDCGERGCL 389
>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
Length = 804
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
Length = 804
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLYVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
Length = 801
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IET 137
I NS L TG++PV+S++NHSC PN + F +A +RA Q + KG E+L Y E+
Sbjct: 517 ITNSRQVRLATGVFPVVSLLNHSCSPNTSVSFTSTVATIRAAQQIAKGQEILHCYGPHES 576
Query: 138 AGSTMTRQKALKEQYLFTCTCPRC 161
RQ+ L QY F C+CP C
Sbjct: 577 RMGVAERQQRLSSQYFFDCSCPAC 600
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 132/341 (38%), Gaps = 110/341 (32%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKELRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
+SYI+ + R+K LK+QY F CTC C K + KDD FL
Sbjct: 250 VSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK----------------KLKDD---LFL- 289
Query: 192 RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL----ALTSCGNHQEVVSTYKMIE 247
G+ + + +++ E+ SK TL S G + EVV
Sbjct: 290 -------------GVKDNPKASQEVVKEMIQFSKDTLEKIDKARSEGLYHEVVKL----- 331
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
++KLYHP + LG+
Sbjct: 332 -CREKLYHPNNAQ------------------------------------------LGMAV 348
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G W G+ E + +A IL +THG + P K+L
Sbjct: 349 MRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDL 389
>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
distachyon]
Length = 791
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/311 (22%), Positives = 135/311 (43%), Gaps = 52/311 (16%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
+C+ E + +Y S+ NHSC PN F R +R+ +++ G+ + +SY AG
Sbjct: 467 MCSVEQIRVAQAIYMSGSLFNHSCRPNVHTYFHSRTLFLRSTEYIESGSPLELSYGPQAG 526
Query: 140 ST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD- 184
+ RQK+L+E Y F+C C C +L D + +S A+ E YR +++
Sbjct: 527 EMDLLERQKSLQENYKFSCRCSSCSELNLSDLVIDSFCCPRSFCLGAVSESTCYRSEENF 586
Query: 185 --------------------------GCSGFLLRDSD-----DKGF--TCQ-QCGLVRSK 210
R+ D D G+ +C+ Q L
Sbjct: 587 VHVSLNGSAVCKLSLPDVSKVDEDMEKVGKLFFRNDDADLKIDPGYCMSCRSQLDLSSVV 646
Query: 211 EEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 270
+ + AS++N K L + + + + + +E++ KKL HP+S L Q +K+ +
Sbjct: 647 ATLDRAASKINRF--KELIVIDDVSEVLITNALQSLEQI-KKLRHPYSKALAQAEDKIAE 703
Query: 271 ILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
++ D ++A +C+ +I + +++Y H + + ++ +GD A ++T A
Sbjct: 704 AFAKVGDQEQARKHCEASIKILEKLYHPKHIAIAHELIKLISIKLCMGDKAGAAATVTRA 763
Query: 331 VEILRITHGTN 341
I + +G +
Sbjct: 764 EAIFLLYYGPD 774
>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
Length = 800
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 57/283 (20%)
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--I 135
+++ NS L TG++PVIS++NHSC PN + F +A VRA Q + KG E+L Y
Sbjct: 506 NSVTNSRQVRLATGIFPVISLLNHSCSPNTSVSFISTVATVRASQQIGKGQEILHCYGPH 565
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
++ RQ+ L+ QY F C+C C ++ + G R
Sbjct: 566 QSRMRAAERQQKLRSQYFFDCSCAACQN-------EKHSAATGPRW-------------- 604
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
+ F C+ CG + +++ S TSC E VS +I +LQ
Sbjct: 605 -EAFCCRSCGALMQGDDVLICGS------------TSC---TESVSRDHLISRLQD---- 644
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQF----HPLLGLQYYTCG 311
+Q + + + L+ + + A+ QL + QR F H L+G
Sbjct: 645 ------LQQQVGMARKLLRNDKPERAI---QLLLGC-QRDAQSFLSGEHSLVGEMEDDLA 694
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ LGD + + + ++++++ + HG +S M + KL +
Sbjct: 695 QAYAALGDWQKSATHLQKSLQVVEVRHGPSSVEMGHELFKLAQ 737
>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4-like [Ornithorhynchus anatinus]
Length = 815
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
A H I +S L TGL+PV+S++NHSC PN + F GR+AVV+A Q + +G E+L
Sbjct: 508 ASEDHVIASSRQVRLATGLFPVVSLLNHSCSPNTSVSFSGRVAVVQATQPIGRGQEILHC 567
Query: 134 YI--ETAGSTMTRQKALKEQYLFTCTCPRCIK 163
Y R++ LK QY F C C C++
Sbjct: 568 YGPHRCRMDVAERRQKLKAQYFFDCRCQACLE 599
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+P+ +I+NH C PNA F+ G LAVVRA + +PKG E+ +Y + +TR LK
Sbjct: 225 GLFPLTAIMNHECTPNASHYFDNGTLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLK 284
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D+ + L C++ GC G ++
Sbjct: 285 MTKNFNCDCVRC-----NDNTENGTYLSALFCREQGCRGLVI 321
>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 115/294 (39%), Gaps = 58/294 (19%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
+ CN+ I +++L + +Y S+ NHSC N ++VF+ R +R V+ V +G E
Sbjct: 241 YGATNCNSFGIFDNDLIVISDAIYLRASMANHSCDYNCIVVFDERKLQLRTVKDVQEGEE 300
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK----LGQFDDIQESAILEGYRCKDDG 185
I Y++ R+ L+E+Y FTC C +CI+ LG DDG
Sbjct: 301 CTIGYVDVIHPAKERRAELEEKYHFTCKCVKCIEEINALG----------------PDDG 344
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKM 245
+ E+ L K + N + ++
Sbjct: 345 ------------------------------LGEELRDLKKSLEQIVDAENSHDWAKVIQL 374
Query: 246 IEKLQKKLYHPFSVN-----LMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFH 300
E K + S+ L+ R+ ++ + WK+A QL I Y Y +++
Sbjct: 375 CEPYLKPMDSSSSLPANHQLLVMLRDTAFFACIQSQSWKKAAEMGQLNIESYIYHYGRYN 434
Query: 301 PLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFM---KELILK 351
P +G+ G++ L A K EA + ++THG M K+L+ K
Sbjct: 435 PNVGMYLLKIGEVLLNLDRLREARKCFKEAESVFKVTHGLQHSLMASVKKLLFK 488
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%)
Query: 98 IINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCT 157
+ NHSC N VF+G +R ++ V +G E ISY++ RQ L+E+Y FTC
Sbjct: 1 MANHSCDYNCAGVFDGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCK 60
Query: 158 CPRCIK 163
C +C++
Sbjct: 61 CVKCVE 66
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
++ I IAE +++ N+ T + PLG + P +NHSC PNAV+ F+ L V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A+ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
Length = 804
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E +I E K+ NA + +L +G L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNIRRAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A + + G E+ ISY + TR++AL EQY F CTC RC
Sbjct: 216 AERPIHAGDEIEISYTDYTMPLSTRREAL-EQYSFECTCARC 256
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
++ I IAE +++ N+ T + PLG + P +NHSC PNAV+ F+ L V+
Sbjct: 180 DLDIETIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGFDEGLITVK 239
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A+ + +V ISYI+ RQK L E+Y FTC C +C
Sbjct: 240 ALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 823
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 114/269 (42%), Gaps = 35/269 (13%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+I E+A++ + H +R + +Y S+ NHSC PN VL +EGR+ +RA
Sbjct: 513 AIYEVADD------DEHGGGTKPIRRVAEAMYATGSLFNHSCRPNTVLNYEGRVLTIRAC 566
Query: 122 QHVPKGAEVLISY---IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
+ + KG EVL Y + S RQKAL+ QY FTC C C S++ +G
Sbjct: 567 EDIAKGKEVLNCYGPHVAHIPSRDERQKALRHQYFFTCRCDAC----------SSSVEQG 616
Query: 179 YR------CKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
+ C +GC+ L + + Q+ + + EI K+A E +++ L
Sbjct: 617 GQDFDYLVCGSEGCTRELQINPEQ---LRQEVDIRVVQREI-KLAREFYERARRALDGGG 672
Query: 233 CGNH-----QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
+ + +L+ K HP L +T + L + L ++ A + +
Sbjct: 673 GDGSGDGDVERAAQSLAHALRLRAKYLHPQHRELGRTHDALAEAHARLGRYEAAATHLER 732
Query: 288 TIPVYQRVYPQFHPLLGLQYYTCGKLEWF 316
+ V + + F L QY + WF
Sbjct: 733 AVAVLESRFAPFSFELADQYAKLAQA-WF 760
>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
A + I ++++ +Q+ V L+ P+++I +A + +L CNA HT
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGP 514
Query: 80 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
+ +S L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
++ RQ+ L+ QY F C CP C + + G R
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
+ F C CG +++ + S SC + VS ++ +LQ
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655
Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
+L Q K+L ELE + L+ CQ + ++ + H ++G
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ LGD + + + ++ ++ + HG +S M + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749
>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
gorilla gorilla]
Length = 804
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P++++ +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTVWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 CPNTSVSFISTVATIRASQRIQKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
troglodytes]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
A + I ++++ +Q+ V L+ P+++I +A + +L CNA HT
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514
Query: 80 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
+ +S L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
++ RQ+ L+ QY F C CP C + + G R
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
+ F C CG +++ + S SC + VS ++ +LQ
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655
Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
+L Q K+L ELE + L+ CQ + ++ + H ++G
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ LGD + + + ++ ++ + HG +S M + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749
>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 131/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PVIS++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 603 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 678 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
Length = 835
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 144/343 (41%), Gaps = 68/343 (19%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA--ENFSKLACNA-------HT-- 79
A + I ++++ +Q+ V L+ P+++I +A + +L CNA HT
Sbjct: 456 AASLQAIPTERIVNSSQLKAAVTPELR-PDVTIWGVAMLRHMLQLQCNAQAVTTIQHTGP 514
Query: 80 ----ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
+ +S L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y
Sbjct: 515 KGSIVTDSRQVRLATGIFPVISLLNHSCNPNTSVSFISTVATIRASQRIRKGQEILHCYG 574
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
++ RQ+ L+ QY F C CP C + + G R
Sbjct: 575 PHKSRMGVAERQQKLRSQYFFDCACPAC-------QTEAHRMAAGPRW------------ 615
Query: 194 SDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL 253
+ F C CG +++ + S SC + VS ++ +LQ
Sbjct: 616 ---EAFCCNSCGAPMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ--- 655
Query: 254 YHPFSVNLMQTREKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCG 311
+L Q K+L ELE + L+ CQ + ++ + H ++G
Sbjct: 656 ------DLQQQVRVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLA 706
Query: 312 KLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ LGD + + + ++ ++ + HG +S M + KL +
Sbjct: 707 RAFAALGDWQKSATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 749
>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
Length = 799
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
Length = 799
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 570
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
garnettii]
Length = 749
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 17/120 (14%)
Query: 59 PEISINEIA--ENFSKLACNAHTIC-------------NSELRPLGTGLYPVISIINHSC 103
P++ I +A + +L CNA I NS L TG++PV+S++NHSC
Sbjct: 483 PDLDIWGVAMLRHMLQLQCNAQAITTIQQTGSEENIITNSRQVRLATGVFPVVSLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y E+ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSISFCSTVATIRASQQIGKGQEILHCYGPHESRMGVAKRQQKLRSQYFFDCNCPAC 602
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 130/307 (42%), Gaps = 27/307 (8%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-EISINEIAENFSKLACNAHTICNSEL 85
V A++ H + ++ + + A LV +L P +S E + +++ N+H + +
Sbjct: 188 VMAMMGHEDECPQQWVDSITETAKLVMPLLPKPARLSAREFVKVCARINTNSHRQHHMFV 247
Query: 86 --RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
R LG GLYP+ S+INHSC PN G +R + V +G E+ SYI+ S
Sbjct: 248 PQRILGVGLYPLASLINHSCQPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSK 307
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDG---CSGFL-LRDSD-- 195
R+ L E F C C RC I +S L G++C + C G L L D +
Sbjct: 308 RKAELLETKHFDCLCNRCSP-----PITDSVDRYLSGFQCPNKAKTSCDGLLVLPDGEGL 362
Query: 196 --DKGFTCQQ-CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
+K +C CG ++ ++ K I+ + +L + + + M++KL
Sbjct: 363 ASEKPVSCTAGCGETKTAGDLLKAQKRAEIMLTASQSLYY--DQKRTSDSKTMLDKLLTG 420
Query: 253 LY------HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
Y HP+ + Q I + L ++ +C+ + + V P H
Sbjct: 421 GYSPDVQLHPYHPTVFQAHVDSINVSDALGYTHMSIDHCEAVLRCAEAVLPMNHLETSNY 480
Query: 307 YYTCGKL 313
YY G L
Sbjct: 481 YYYQGML 487
>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
boliviensis boliviensis]
Length = 800
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 32/312 (10%)
Query: 59 PEISINEIA--ENFSKLACNA-------HTIC------NSELRPLGTGLYPVISIINHSC 103
P++++ +A + +L CNA HT C S L TG++PV+S++NHSC
Sbjct: 480 PDVTVWGVAMLRHMLQLQCNAQAITTIQHTGCKEGIVTGSRQVRLATGIFPVVSLLNHSC 539
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E++ Y ++ RQ+ L+ QY F CTCP C
Sbjct: 540 SPNTSVSFISTVATIRASQRIGKGQEIVHCYGPHKSRMGVAERQQKLRSQYFFDCTCPAC 599
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK--IASE 219
G + E + C C + +G CG E +++ +AS
Sbjct: 600 HSEGH--GMAARPRWEAFCCSS--CRAPM------QGDDVLSCGRRGCAESVRRDHLASR 649
Query: 220 VNILSKK---TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELE 276
+ L ++ L S G + + ++ + L + + L + L
Sbjct: 650 LQDLRQQVGMAQKLLSDGELEPAIQQLSGCQRDAESFLSAEHAVLGEIEDGLAQACAALG 709
Query: 277 DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRI 336
DW+++ + Q ++ V + + +G + + ++ + A+ ++ +A E+L +
Sbjct: 710 DWQKSATHLQNSLRVVEARHGPSSVEMGHELFKLAQIFFNGFAVPEALSTIQKAEEVLSL 769
Query: 337 THGTNSPFMKEL 348
G ++EL
Sbjct: 770 HCGPWDDEIQEL 781
>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
harrisii]
Length = 802
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 117/276 (42%), Gaps = 32/276 (11%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
L TGL+P IS++NHSC PN L F G + V A Q + +G E+L Y G TRQ
Sbjct: 525 LATGLFPAISLMNHSCRPNTSLSFRGSVGSVHASQLIARGQEILHCYGPHEGRMDAATRQ 584
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILE---GYRCKDD----------GCSGFLLR 192
+ L+ QY F C C C Q +++ + + G+RC GCS R
Sbjct: 585 QKLRSQYFFDCRCQAC----QEEEVHGAGVTPKQGGFRCPTCRAALQGLDPLGCSSGSCR 640
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKK 252
+ Q G ++ + E+ + ++ L +C E T +
Sbjct: 641 AQVSRAHLLGQLGDLQQRVEMAGELLRGDRTARAVQLLLACRREAEDFLTSEH------- 693
Query: 253 LYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGK 312
V + + ++L + L DW ++ A+ Q ++ V + + +G + + +
Sbjct: 694 ------VLMGEIEDRLAQAYASLGDWPKSAAHLQNSLQVVKAQHGPASVEMGHELFKLAQ 747
Query: 313 LEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ + A+ + +A ++L + +G + ++EL
Sbjct: 748 VLFNGFAVTEAVSVIEKAEKVLSLHYGPENEQVQEL 783
>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Metaseiulus occidentalis]
Length = 775
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 105/249 (42%), Gaps = 29/249 (11%)
Query: 69 NFSKLACNAHTICN-----------------SELRPLGTGLYPVISIINHSCLPNAVLVF 111
+ +L CNAH I E + T +YP S++NHSC PN + F
Sbjct: 455 HIQQLICNAHAITTIQQPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHSCNPNIISGF 514
Query: 112 E-GRLAVVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIKLGQF 167
G VV++V+ + G EV Y MT RQ AL+EQY F C C C K G
Sbjct: 515 RSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQTALQEQYFFRCDCTACQK-GDL 572
Query: 168 DDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
DD I RC + C G L C QC + E ++ ++ L +
Sbjct: 573 DD----QISMALRC--EYCEGPLSAVQSSGKADCLQCSTTQDCLEKEQKVFRMHDLFVQG 626
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
L L +H E + + Q+KL + + LM TR+ + K L L +++A+ +
Sbjct: 627 LQLAEMDSHGEALERLQKCLASQEKLLYRHNKQLMITRDMVAKSLCALGRFRDAVGVLKS 686
Query: 288 TIPVYQRVY 296
+ + +Y
Sbjct: 687 AVESVRHMY 695
>gi|405969486|gb|EKC34455.1| SET and MYND domain-containing protein 4 [Crassostrea gigas]
Length = 324
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 76/147 (51%), Gaps = 19/147 (12%)
Query: 73 LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
L CNAH I +P +G+G Y ++S+INHSC P+ V G + VVRA++
Sbjct: 103 LPCNAHEISEILWKPGDPTVTNSIEIGSGAYALLSLINHSCDPSVVRHNYGNICVVRAIK 162
Query: 123 HVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGY 179
+ KG E+L +Y T+ R+ L+ QY F C C C ++L + DI + + +
Sbjct: 163 PIKKGEEILDNYGALYPLTIREERRAKLRPQYFFDCNCDACQLELPLYFDIPDDVPV--F 220
Query: 180 RCKDDGCSG--FLLRDSDDKGFTCQQC 204
+CKD CSG F+ +D D C C
Sbjct: 221 KCKD--CSGPIFISQDKDLAEAECSSC 245
>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
Length = 680
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 78/152 (51%), Gaps = 14/152 (9%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLV----------NLILQWPEISINEIAENFS 71
D Y +E L + +D +Q+ + + A+ + + IL++ I++NEI E +S
Sbjct: 332 DEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAVISKLRGEDAILKY--ITLNEIIELYS 389
Query: 72 KLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAE 129
+ N H + + R G G++P S +NHSC PNA E + V R+++ + KG E
Sbjct: 390 IVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVFRSLRPLKKGEE 449
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+L SY + R+K L +QY F C C +C
Sbjct: 450 LLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQC 481
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 78/152 (51%), Gaps = 25/152 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K+L + V++ LQ WP + S+ I+ F + CN T+ +
Sbjct: 131 VEHFGEEEQKELRVD------VDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 184
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 185 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 244
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+SYI+ + R++ LK+QY F CTC C K
Sbjct: 245 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 276
>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
Length = 575
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 59/124 (47%), Gaps = 11/124 (8%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-----EGRLAVVRAVQHV 124
S+L N TI SE PLG G+YP S+INHSC PNAV F + + + V
Sbjct: 257 MSRLIMNGFTISTSEQLPLGVGVYPAASMINHSCRPNAVPTFWFSTPSPPMLQITMCKSV 316
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD 184
G E+ ISY + + R++ L + Y F C C C D + + G +C+
Sbjct: 317 RVGDEIAISYCDVSAPRYVRREGLIKNYKFACDCSHC------GDFETDGDMIGLKCQAS 370
Query: 185 GCSG 188
GC G
Sbjct: 371 GCIG 374
>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
Length = 828
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 540 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 599
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 600 PHESRMGVAERQQRLSSQYFFDCRCGAC 627
>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
Length = 352
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/74 (43%), Positives = 49/74 (66%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
+G GLYP +S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R+K
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQ 63
Query: 148 LKEQYLFTCTCPRC 161
L++QY F C C RC
Sbjct: 64 LRDQYCFECDCSRC 77
>gi|344296426|ref|XP_003419908.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Loxodonta
africana]
Length = 427
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/351 (22%), Positives = 149/351 (42%), Gaps = 41/351 (11%)
Query: 8 RKLQNDNVIPSTTTDNYSL-VEALVAHMSDID-EKQLLLYAQIANLVNLI---LQWPEIS 62
R L + P T L V+ +H+ +D EK+ L+ + IA L + L++P+
Sbjct: 111 RILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDT- 169
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ F+++ CN TI + EL +G+ ++P + S +P ++ VF+ L +
Sbjct: 170 -ESLVVLFAQVNCNGFTIEDEELSHVGSAIFPDPVV---SGVPGSLTVFQAHLF---SQM 222
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK 182
V SYI+ T R L++ Y FTC C C +++A +E R
Sbjct: 223 FASLCGMVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD-----KDNAKVE-IRKL 276
Query: 183 DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
D +RD +VR + + + L C QE +S+
Sbjct: 277 TDPPKAEAIRD------------MVRYARNVIEEFRRAKHYKSPSELLEMCALSQEKMSS 324
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ +V ++ + + + + ++DW+ AL Y Q I Y + YP +
Sbjct: 325 ----------VFEDSNVYMLHMMYQAMGVCLYMQDWEGALRYGQKIIQPYSKHYPLYSLN 374
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+ + G+L L + K++ +A+ I+ + HG + P++ E+ ++E
Sbjct: 375 VASMWLKLGRLYMGLENKAAGEKALKKAIAIMEVAHGKDHPYISEIKHEIE 425
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 59/107 (55%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+N F KL+ NA ++ +GT + ++ INHSC PN V + EG A + A+Q
Sbjct: 155 VNAFLSLFCKLSINAMSLVTPAFDAIGTCMDSTLARINHSCQPNCVFMVEGATARLVALQ 214
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
+ KG V ISY++T R L+++Y FTCTC +C++ + D
Sbjct: 215 TLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSD 261
>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
Length = 706
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 116/265 (43%), Gaps = 10/265 (3%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQK 146
L T +PV+S++NHSC PN + F G A VRA Q +P G E+ Y + ++
Sbjct: 430 LATAFFPVLSLLNHSCCPNTSVSFSGTTATVRASQLIPSGQEIFHCYGPHRCRMRVAERQ 489
Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
L QY F C C C + D ++ + C GC G ++ D + + C +
Sbjct: 490 QLLSQYFFECRCQACCDELESDVKSVVSLRNTFCCP--GCRG-PMQGEDMLCCSNEACAI 546
Query: 207 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ---KKLYHPFSVNLMQ 263
S+E + ++ K+ L L ++ KM+ K Q + P + + +
Sbjct: 547 SVSRESLSCRLQDLQQRIKRALELL---RDRKADQAIKMLLKCQVDAECFLSPEHLLMGE 603
Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
+ L ++ L W+EA + + +I + + + +G + + ++ + A
Sbjct: 604 LEDHLAQVYATLGKWQEAARHLERSIEIVEMHHGPSSIEIGHELFKLAQILFNGFAVSEA 663
Query: 324 IKSMTEAVEILRITHGTNSPFMKEL 348
+ ++ A EIL + G SP ++EL
Sbjct: 664 LSTIQRAEEILSVHCGPQSPQIQEL 688
>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 396
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 51/87 (58%), Gaps = 5/87 (5%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI---ETAGSTMT 143
G+GLY + S NHSCLPNA + F + V A + +P G E+ ISY+ + + S +
Sbjct: 296 GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSGQEICISYLSECDLSRSRHS 355
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
RQ LKE YLFTC CP+C+ D+
Sbjct: 356 RQNILKENYLFTCDCPKCLSEADEPDV 382
>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
Length = 743
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 455 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 514
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 515 PHESRMGVAERQQRLSSQYFFDCRCGAC 542
>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
Length = 777
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/299 (23%), Positives = 129/299 (43%), Gaps = 15/299 (5%)
Query: 54 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
L LQ +I I + SK + + +S L TGL+PV+S++NHSC PN + F
Sbjct: 471 LQLQCNAQAITTIQQTGSK----ENMVTDSRQVRLATGLFPVVSLLNHSCSPNTSVSFVS 526
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
+A VRA Q + +G E+L Y E+ RQ+ L+ QY F C C C Q +
Sbjct: 527 TVATVRASQQIGEGQEILHCYGPHESRMGVAERQQKLRSQYFFDCDCLAC----QNGKCR 582
Query: 172 ESA--ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 229
+A E +RC C G L++ D C S++ + ++ K
Sbjct: 583 AAAGPRWEAFRCH---CCGELMQGDDVLSCGSASCTESASRDRLVSRLQDLRQRVGKAQT 639
Query: 230 LTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI 289
L G + + ++ + + + + L + L DW ++ A+ Q ++
Sbjct: 640 LLRNGKVERAIQLLLGCQRDAESFLSAEHSLVGEIEDALAQAYAALGDWHKSAAHLQRSL 699
Query: 290 PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
V + + +G + + ++ + A+ ++ +A ++L + +G + ++EL
Sbjct: 700 RVVEVRHGPASVEMGHELFKLAQIFFNGFAIPEALNTIQKAEKVLLVHYGRGNEEIQEL 758
>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
Length = 716
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 428 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 487
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 488 PHESRMGVAERQQRLSSQYFFDCRCGAC 515
>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
Length = 666
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465
>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
Length = 666
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A VRA Q + KG E+L Y
Sbjct: 378 NESIITNSRQIRLATGIFPVVSLLNHSCRPNTSVSFTGTVATVRAAQRIAKGQEILHCYG 437
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 438 PHESRMGVAERQQRLSSQYFFDCRCGAC 465
>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
Length = 423
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 58/105 (55%), Gaps = 6/105 (5%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
PE+ + E + K+ N++ ICN E +GTGLY SI++HSC PNA V+EG +
Sbjct: 151 PEVGL----EIYGKMIINSYCICNDEHTAIGTGLYMGPSILDHSCSPNAHAVYEGHKLHL 206
Query: 119 RAVQHV--PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
RA + + + +SYI+ RQ+ L+ QY F C C +C
Sbjct: 207 RAAEDINCSNFDGIRVSYIDVMAPKKVRQEELRSQYYFECNCMKC 251
>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
Length = 804
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 130/315 (41%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNA-------HT------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA HT + +S L TG++PV+S++NHSC
Sbjct: 483 PDVTIWGVAMLRHMLQLQCNAQAMTTIQHTGSKGSIVTDSRQVRLATGIFPVVSLLNHSC 542
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A ++A Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 543 SPNTSMSFISTVATIQASQRIRKGQEILHCYGPHKSRMGVAERQQELRSQYFFDCACPAC 602
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
E +R + + + F C CG +++ S
Sbjct: 603 QT-------------EAHR---------MAAEPRWEAFCCNSCGAPMQGDDVLHCGSR-- 638
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 639 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVGVAQKLLRDGELERAV 677
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L CQ + ++ + H L+G + LGD + A + ++ ++ + HG
Sbjct: 678 QQLLGCQRDAESF--LWAE-HALVGEIADGLARACAALGDWQKAATHLQRSLRVVEVRHG 734
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 735 PSSVEMGHELFKLAQ 749
>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
Length = 626
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 16/120 (13%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVL----------VFEGRLAVVRAVQHVPKGAEVLI 132
++ R +GTGL+P++S+ +HSC PN + G + +V+ V+ V KG E+ I
Sbjct: 287 ADFRDIGTGLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCI 346
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS-GFLL 191
SYI+ T R++ L F C C RC+ + +E+ ++ Y C + C+ G+L+
Sbjct: 347 SYIDIMNPTCIRRRELWYSKYFVCRCSRCMS-----ETEENRMVRAYSCGESECTEGYLV 401
>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
Length = 451
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 121/285 (42%), Gaps = 26/285 (9%)
Query: 76 NAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 131
N H I + L +G GLY + +HSC PNA+ EG +A ++A+ ++
Sbjct: 168 NRHAIGDQLYLNEIGKGLYLDLCAYDHSCRPNAIYTCEGFVARLKALDDSVNLSDRKRTH 227
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
YI S R++ LKE + F C C RC +L +C + CS +
Sbjct: 228 TCYITPLDSRAERRRLLKESWYFDCFCERCTD-------PNDHLLTALKC--NNCSTAVP 278
Query: 192 RDS--------DDKGFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVST 242
+ DD FTC +C S+ ++++ ++ + K+ S + + +
Sbjct: 279 IHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIEKQLAKTGSETLSKSELDS 338
Query: 243 YKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ K + F L+Q + +LI + D K+ L +P + YP+ HP
Sbjct: 339 MLLVCKSVLADENVFFAKLLQRKLQLI----DCNDSKQLLETSLQALPCLRVCYPEMHPT 394
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
L Y T G L D + A++ + +A+ R T G+N K+
Sbjct: 395 LAYHYMTIGIYYKNLADRKTAMQYLNDALRRFRFTLGSNHLLTKQ 439
>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
Length = 799
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
N I NS L TG++PV+S++NHSC PN + F G +A +RA Q + KG E+L Y
Sbjct: 511 NESIITNSRQIRLATGVFPVVSLLNHSCRPNTSVSFTGTVATIRAAQRIAKGQEILHCYG 570
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ RQ+ L QY F C C C
Sbjct: 571 PHESRMGVAERQQRLSSQYFFDCRCGAC 598
>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
Length = 582
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/325 (21%), Positives = 138/325 (42%), Gaps = 26/325 (8%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H +DI DE+++ + ++ + L + I E + F K + N+H+I +
Sbjct: 200 SLVDHKTDIREDEEKMTHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGN 259
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S NHSC P +VF+G +R + VP + ISYI+ S
Sbjct: 260 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSK 317
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R++ LK ++ F C C RC+ D ++ A L RC + C +L ++
Sbjct: 318 FVRRRDLKSRWYFNCECTRCM------DPEDDA-LTAIRCANPSCDAPILTSETEEPMNI 370
Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
C +C + + ++K + +L + + + + + +++ HP +V
Sbjct: 371 ACDKCKTIVEENDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNV 423
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
+ + R L + L + + Q+ Y+ +P+ +G Q K +
Sbjct: 424 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 482
Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
+ A+ +A+ I + G + P+
Sbjct: 483 RDEAMPYAFDAMNIFEVCFGLDHPY 507
>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 508
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 54/94 (57%), Gaps = 1/94 (1%)
Query: 73 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
L NA T+ N +G + P S NHSC PNA + F+G + ++A+Q + ++ I
Sbjct: 187 LETNAFTLTNRYFDRIGLCMLPFASYANHSCEPNAYIGFDGPVIYLKALQDIALDEQIFI 246
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
SYI+ RQ LK++Y FTC CP+C++ GQ
Sbjct: 247 SYIDNTEPWEKRQSELKKRYFFTCKCPKCVQ-GQ 279
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRC-----SDPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382
Query: 249 LQK 251
LQ+
Sbjct: 383 LQQ 385
>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 311
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 64/119 (53%)
Query: 43 LLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
+ Y I + N + EI+ +E+ + F+K++ N+ I N + G+ L S I+HS
Sbjct: 132 MFYRYIGIIKNKLGGALEINESEVLDIFTKMSINSVFILNGKFLNFGSCLSLEFSAIDHS 191
Query: 103 CLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
C PNAV +F GR VV+A+ + +V ++Y + R+ LK+QY F CTC C
Sbjct: 192 CRPNAVYMFIGRTLVVKALCDIANFDDVRVTYTDITQPRRMRRNFLKDQYFFDCTCEEC 250
>gi|428180198|gb|EKX49066.1| hypothetical protein GUITHDRAFT_136248 [Guillardia theta CCMP2712]
Length = 719
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 57/112 (50%), Gaps = 15/112 (13%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
EI +L TIC+ E RP+GTGLYP+ S+ NH C PN ++ F G +RA + +
Sbjct: 230 EILMTICRLYNYGFTICDEEERPIGTGLYPLASLCNHDCAPNCLVTFCGPSLQLRATEEI 289
Query: 125 PKG---------------AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G E+ ISY+++A R+K L ++YL C+C C
Sbjct: 290 EMGQVQMARVDWRLRSGLQEITISYVDSAEPHWLRRKTLHKRYLINCSCSLC 341
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 13/138 (9%)
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPV 291
S E++ + + + + + H +++L++ RE + +E + WK AL+ +
Sbjct: 572 SADRANELLQMFILAVQKAESILHDNNLSLLRLREAMQSQAVEAKSWKFALSTSYKVLRR 631
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFL------------GDT-ENAIKSMTEAVEILRITH 338
++YP+ P +GLQ GKL FL GD + A+K + EA IL TH
Sbjct: 632 LYKIYPRCSPYVGLQEAFLGKLCMFLSGEPDKTLCHGFGDPLQEAVKHLREAKLILSGTH 691
Query: 339 GTNSPFMKELILKLEEAQ 356
G+ ++++ L+EA
Sbjct: 692 GSEHVLVQDINRLLQEAH 709
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 91/175 (52%), Gaps = 21/175 (12%)
Query: 40 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
+ L+LY +I + L ++ S ++ + SK N+ T+ ++ L +G P+ ++I
Sbjct: 194 QSLVLYMKIESPEGL-REYGIESGKDLVDLMSKFTTNSFTLTSTFLNAIGVATAPIPALI 252
Query: 100 NHSCLPNAVLVFEGR-------LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQY 152
NHSC PNAV+VF L V+ A+Q + +G EVL +Y++ RQK+LKE Y
Sbjct: 253 NHSCQPNAVVVFPATRKGAPPTLDVI-AIQPIRRGEEVLAAYVDITLPREIRQKSLKETY 311
Query: 153 LFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCG 205
F C+C C KL D++E +C + GFL S KG + C +CG
Sbjct: 312 AFECSCTLC-KLPGTLDLRECVFCP--KCNE----GFL---SSHKGQSAPCSKCG 356
>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
Length = 338
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 57/94 (60%), Gaps = 1/94 (1%)
Query: 69 NFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
+F+KL N C++ ++ G +Y +S NHSC PNA +V+ G AV+R+++ +P+G
Sbjct: 109 SFNKLMPNTFGWCDTFDMSSFGAVVYAELSRANHSCQPNAAVVYNGAAAVLRSMRDIPEG 168
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
EV ISY++ + R++ L + Y F C C RC
Sbjct: 169 EEVCISYVDPTLARDVRRRELVQSYGFACDCARC 202
>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
Length = 800
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 66/127 (51%), Gaps = 14/127 (11%)
Query: 68 ENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVL-----VFEGRLAVVRAV 121
E +++ N + I +S L G G+YP+ S + NHSC+PNAV FE V A+
Sbjct: 531 EAYTRFG-NNNFIIHSHLNTFGHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVAL 589
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------QESA 174
+ + +G E+ I Y++ A TRQ+AL+ Y FTCTCP C G D E A
Sbjct: 590 RDIAEGEEITIPYLDPALPLHTRQEALRLNYGFTCTCPLCSWAGHLDPAPPPARGSAELA 649
Query: 175 ILEGYRC 181
ILE C
Sbjct: 650 ILESQLC 656
>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 570
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 51/91 (56%)
Query: 73 LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
L+ NA I + P G P +++ NHSC PNA++V R +RA++ + K ++ I
Sbjct: 194 LSTNAFRITLPDNTPFGMCFSPTLALANHSCKPNAIIVSNSRSISLRALRPIKKNEQIFI 253
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
SYI+ +RQ LKE+Y FTC C C K
Sbjct: 254 SYIDPTEDLPSRQSKLKERYFFTCKCDSCEK 284
>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
Length = 558
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 129/315 (40%), Gaps = 67/315 (21%)
Query: 59 PEISINEIA--ENFSKLACNAHT-------------ICNSELRPLGTGLYPVISIINHSC 103
P+++I +A + +L CNA + +S L TG++PVIS++NHSC
Sbjct: 287 PDVTIWGVAMLRHMLQLQCNAQAMTTIQRTGPKGSIVTDSRQVRLATGIFPVISLLNHSC 346
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
PN + F +A +RA Q + KG E+L Y ++ RQ+ L+ QY F C CP C
Sbjct: 347 SPNTSVSFISTVATIRASQWIRKGQEILHCYGPHKSRMGVAERQQKLRSQYFFDCACPAC 406
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ + G R + F C CG +++ + S
Sbjct: 407 -------QTEAHRMAAGPRW---------------EAFCCNSCGAPMQGDDVLRCGSR-- 442
Query: 222 ILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM--ELEDWK 279
SC + VS ++ +LQ +L Q K+L ELE
Sbjct: 443 ----------SCA--ESAVSRDHLVSRLQ---------DLQQQVRVAQKLLRDGELERAV 481
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
+ L+ CQ + ++ + H ++G + LGD + + + ++ ++ + HG
Sbjct: 482 QRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKSATHLQRSLCVVEVRHG 538
Query: 340 TNSPFMKELILKLEE 354
+S M + KL +
Sbjct: 539 PSSVEMGHELFKLAQ 553
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E S+ + E KL NA +++L +G L P +++ NHSCLPNA + F GR AV+R
Sbjct: 166 EESVRQAVEILCKLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLR 225
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A + KG E+ ISY + S++ ++KA Y F C C RC +
Sbjct: 226 AESRIQKGDEIEISYTDYT-SSLGKRKAALAPYNFECRCRRCTQ 268
>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 499
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 72/155 (46%), Gaps = 19/155 (12%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S ++ + S+ N T+ ++ L P+G + P ++ NHSC PNAV+VF
Sbjct: 148 SAGDLVDLISRFTTNTFTLTSASLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASK 207
Query: 112 EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
E L +V A++ + G E+ ISY++T RQK L E Y FTC C C K D
Sbjct: 208 EPSLNLV-ALRDIAPGKEIRISYVDTTLPKDLRQKELTEVYNFTCQCKLCTKPPAVDP-- 264
Query: 172 ESAILEGYRCKDDGCSGFL-LRDSDDKGFTCQQCG 205
E C C G L +D F C +CG
Sbjct: 265 ----RESLWCPKK-CGGTCPLPTEEDNLFRCAKCG 294
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 27/183 (14%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKANSNSLRTLVQYLNEKSDSFIPSSNYVVIDAKMSVLQR 382
Query: 249 LQK 251
LQ+
Sbjct: 383 LQQ 385
>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
castaneum]
Length = 393
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 150/368 (40%), Gaps = 71/368 (19%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE-- 65
+NY L++++ K L Y +A++V LQ W P+ +NE
Sbjct: 28 NNYRYFNRLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELV 85
Query: 66 ------IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAV 108
I ++ ++LACN+ TI C+S L + +G++P +SI+NHSC PN
Sbjct: 86 KLVGGFITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVT 145
Query: 109 LVFEGRLAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQ 166
F VV+A++ + + E+ Y I+ G RQ A KE Y F C C C
Sbjct: 146 NYFMSDTIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS---- 201
Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 226
D E +L+ Y C C G + + C CG + +KI E ++K
Sbjct: 202 -DPAHELDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQK 254
Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
+ S + ++ + K+ EK+ K + F + +L ++L D + Y
Sbjct: 255 YMESDSSNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFH 310
Query: 287 LTI----------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
L + +Y+ H L G Q C L+ FL + E IK EA
Sbjct: 311 LWLENEKARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EA 367
Query: 331 VEILRITH 338
IL + +
Sbjct: 368 KAILNLYY 375
>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 798
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 586
Query: 206 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 236
L V+ +E++ EV ++LSK + +L CG +H
Sbjct: 587 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 646
Query: 237 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 273
EV + + +++ + H ++ ++ +K+ +
Sbjct: 647 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 706
Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
+ +A +C+ +I + +R+Y H ++G + ++ GD+ A + + +I
Sbjct: 707 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 766
Query: 334 LRITHGTNSPFMKELILKLEEAQAEA 359
+G+++ + + L++ A+A
Sbjct: 767 FSKYYGSHAETLFSYLPCLKQETAKA 792
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PN+ F+ + VVRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC CPRC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAFISGLYCRDTNCTGLVV 305
>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
Length = 781
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 135/326 (41%), Gaps = 62/326 (19%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 458 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 517
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 518 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCESEK 569
Query: 206 L-------------VRSKEEIKKIASEV--NILSKKTLALT-------SCG-------NH 236
L V+ +E++ EV ++LSK + +L CG +H
Sbjct: 570 LNHFFTAPRNVDQQVQMREKVYADVGEVASSLLSKPSGSLHIEPEICLKCGSRCDIENSH 629
Query: 237 QEVVSTYKMIEKLQ-----------------------KKLYHPFSVNLMQTREKLIKILM 273
EV + + +++ + H ++ ++ +K+ +
Sbjct: 630 AEVNKAWNHMRRVEELMNSGRANYSVLSDCSRSIAVLRTFLHMYNKDIADAEDKVAQACY 689
Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
+ +A +C+ +I + +R+Y H ++G + ++ GD+ A + + +I
Sbjct: 690 LAGELVDARKHCEASIKILKRLYEDEHVVIGNEMVKLASIQLASGDSSGAWDTTKRSSQI 749
Query: 334 LRITHGTNSPFMKELILKLEEAQAEA 359
+G+++ + + L++ A+A
Sbjct: 750 FSKYYGSHAETLFSYLPCLKQETAKA 775
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305
>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
Length = 732
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/307 (24%), Positives = 137/307 (44%), Gaps = 43/307 (14%)
Query: 72 KLACNAHTIC--------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
++ NAH I N + T +YP S++NH+C P + F+G +VRA+++
Sbjct: 426 QMVSNAHAITELCLIDENNERQERIATAIYPSASLMNHNCDPTVINSFQGNTLIVRAIRN 485
Query: 124 VPKGAEVLISYIETAGSTMTRQK------ALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
V +G EV Y G R + AL+ QY FTCTC C+ +D Q+ ++
Sbjct: 486 VRQGDEVFNCY----GPHYRRMRRSERVEALEAQYSFTCTCDSCLD-KNTEDFQD--VIY 538
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE-------EIKKIASEVNILSKKTLAL 230
+ C C G L+ + + Q L RS + ++K V++ ++ T A+
Sbjct: 539 SFSCP--SCQGSLINPTGNNSSAQNQMALCRSCKTPQSYFTQLKADLEAVSLDARGTEAM 596
Query: 231 TSCGNHQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQL 287
+ +++ K++ K L+ K +L ++ +KL + + ++E ++
Sbjct: 597 ----DQGDIMGAIKLLTKCVQLRSKALFKGHPDLGKSADKLAQCYAFIGKYEECEKMLRI 652
Query: 288 TIPVYQRVYPQFHPLLG--LQYYTCGKLEWFLGDTENA----IKSMTEAVEILRITHGTN 341
++ + Y ++ + LQ +T +E G + A + + EA+ I RI +G
Sbjct: 653 SLTAVEHRYGRYSIEMANELQKFTDVLMELASGRNQRARDELVNYLEEAMLIYRIHYGPW 712
Query: 342 SPFMKEL 348
S KEL
Sbjct: 713 STSYKEL 719
>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
Length = 510
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 125/303 (41%), Gaps = 35/303 (11%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
+ ++ +E F + C I E+R + GLYP ++++NHSC+ N ++ +
Sbjct: 179 LKLDRFSEEFIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDY 238
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
R+ + R VP G E+ SY + TM R++ L E F C CPRC D +
Sbjct: 239 RIRL-RTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRC-----SDPTELG 292
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+ +C L DS D T C C S + ++KI + A++
Sbjct: 293 THMSSLKCNKCDNGVILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKIIQAEVDTVEAIS 352
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
++ +I+K + L HP L R L ++ ++++ +
Sbjct: 353 GADGADAILERENVIKKYRSVL-HPHHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 411
Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVE 332
+ C+L + V V P + + G+ Y K +W G + A +KS M EA
Sbjct: 412 VDMCRLLLQVLDIVEPGYSRIRGMTLYELHAPLLFLAKGQWNAGVIDEAGLKSKMIEAAT 471
Query: 333 ILR 335
IL+
Sbjct: 472 ILK 474
>gi|16553701|dbj|BAB71564.1| unnamed protein product [Homo sapiens]
Length = 313
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 113/271 (41%), Gaps = 52/271 (19%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y ++ RQ
Sbjct: 36 LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIRKGQEILHCYGPHKSRMGVAERQ 95
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG 205
+ L+ QY F C CP C + + G R + F C CG
Sbjct: 96 QKLRSQYFFDCACPAC-------QTEAHRMAAGPRW---------------EAFCCNSCG 133
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
+++ + S SC + VS ++ +LQ +L Q
Sbjct: 134 APMQGDDVLRCGSR------------SCA--ESAVSRDHLVSRLQ---------DLQQQV 170
Query: 266 EKLIKILM--ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
K+L ELE + L+ CQ + ++ + H ++G + LGD + +
Sbjct: 171 RVAQKLLRDGELERAVQRLSGCQRDAESF--LWAE-HAVVGEIADGLARACAALGDWQKS 227
Query: 324 IKSMTEAVEILRITHGTNSPFMKELILKLEE 354
+ ++ ++ + HG +S M + KL +
Sbjct: 228 ATHLQRSLCVVEVRHGPSSVEMGHELFKLAQ 258
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCAGLVV 305
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVV 305
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PN+ FE + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC CPRC D ++ A + G C+D C+G ++
Sbjct: 268 KMKKNFTCKCPRC-----SDPTEKGAYISGLYCRDTSCNGLVV 305
>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
jacchus]
Length = 798
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 78/158 (49%), Gaps = 9/158 (5%)
Query: 6 LRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE 65
L R+L+ + P TT + S ++A VA D + ++ + L LQ +I
Sbjct: 447 LCRQLEAASFQPVTTGASSSQLKAAVAPELCPD---MTVWGVVMLRHMLQLQCNAQAITT 503
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
I SK + S L TG++PVIS++NHSC PN + F +A +RA Q +
Sbjct: 504 ILHTGSKEG----IVTGSRQVRLATGIFPVISLLNHSCSPNTSVSFVSTVATIRASQRIG 559
Query: 126 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
KG E+L Y ++ RQ+ L+ QY F CTCP C
Sbjct: 560 KGQEILHCYGPHKSRIGVAERQQKLRSQYFFDCTCPAC 597
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 64/123 (52%), Gaps = 7/123 (5%)
Query: 65 EIAENFSKLACNAHTICNSELRPL-GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQH 123
E+ + + KL N+ ++ + R + G LY S+ NHSC PN VFEG VRA++
Sbjct: 178 EVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFVFEGSRLSVRAIKR 237
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+ G E ISY+ + + R++ L+ Y FTC CPRC+ D ++ +C +
Sbjct: 238 IEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRCL------DSARDNLMLCVKCPN 291
Query: 184 DGC 186
+ C
Sbjct: 292 ESC 294
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 91/209 (43%), Gaps = 27/209 (12%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PNA FE + +VRA +P+G EV +Y + + R L
Sbjct: 208 ALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD---KGFTCQQC- 204
K + FTC C RC D ++ A + G C+D C+G ++ + + C +C
Sbjct: 268 KMKKSFTCKCSRCS-----DPTEKGAFISGLYCRDTNCTGLVVPEISGLPHPNWNCLECK 322
Query: 205 ----------------GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
G + +K + + V L++K+ + N+ + + ++++
Sbjct: 323 QKSTHAQMMKSQDFASGAINAKVNSNSLRTLVQYLNEKSDSFIPNSNYVVIDAKMSVLQR 382
Query: 249 LQKKLYHPFSVNLMQTREKLIKILMELED 277
LQ+ TR + + + +L D
Sbjct: 383 LQQGTRGLLRGAGAHTRLRYSRDITQLMD 411
>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 386
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 79/341 (23%), Positives = 139/341 (40%), Gaps = 53/341 (15%)
Query: 27 VEALVAHMSDIDEKQL-LLYAQIANLVNLILQWP----EISINEIAENFSKLACNAHTIC 81
VE L H++D+ ++QL L+A + W S +IA + N T+
Sbjct: 79 VEELQDHVADLPQEQLQRLHADVRTFQEF---WSCGRSRPSREDIAHVLGIIKTNGFTLS 135
Query: 82 NSE-LRPLGTGLYPVISIINHSCLPNAVLVFEG-------------RLAVVRAVQHVPKG 127
+ L+ +G GL+P + ++NH C PN +V R +RA+ V +G
Sbjct: 136 DQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHGNQSATNPALHSQRRVELRALGKVCQG 195
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E+ +SY++ ++ RQ+ L E + F CTC RC + + D + +A
Sbjct: 196 EELTVSYVDFLDTSAERQRKLSELFFFHCTCQRCSQHLKDDAMTAAADGA---------- 245
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIE 247
+E+ + E L K LT ++QEV++
Sbjct: 246 ------------------DAAEVQEVTAFSQEC--LEKIDRCLTG-RDYQEVLALCSACL 284
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
+ Q+ L + ++ ++ L L + EA AY + Y+++YP H LG+
Sbjct: 285 QKQRGLLADTHLCHLRVLSAAVEALSYLRCFPEAAAYALRLLQGYRKLYPPNHAQLGVAA 344
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
G G E A + + +A L ++HG N ++L
Sbjct: 345 MRAGVAHLQAGMPEAAHELICQAYRTLLVSHGPNHSTTRDL 385
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ +++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC CPRC+ D ++ + + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRCL-----DPTEKGSFISGLYCRDTNCSGLVV 305
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 76/152 (50%), Gaps = 25/152 (16%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQ-WP----EISINEIAENFSKLACNAHTICNSE- 84
V H + ++K L + V+ LQ WP + S+ I+ F + CN T+ +
Sbjct: 136 VEHFGEEEQKDLRVD------VDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRG 189
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQH------------VPKGAEVL 131
L+ +G G++P + ++NH C PN ++F G V+++ H + +G E+
Sbjct: 190 LQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+SYI+ + R++ LK+QY F CTC C K
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Monodelphis domestica]
Length = 845
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 12/266 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
L TG +PVIS++NHSC PN L F G + +++A + + +G E+L Y E TRQ
Sbjct: 513 LATGFFPVISLLNHSCRPNTSLSFRGSVGIIQASRLIAQGEEILHCYGPHECRMDVTTRQ 572
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTC---Q 202
+ L+ QY F C C C + + + + G+RC C L D F C +
Sbjct: 573 QKLRSQYFFDCHCQAC-QNEEVCPVSTAPKPSGFRCPS--CKAALQGDD----FLCCDNR 625
Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
C S+ +++ ++ K L L + + ++ + + +
Sbjct: 626 TCTASASRAQLEHQVQDLQQQVKAALELLRDDRSAQAIQLLLGCQQDAENFLSSEHMLMG 685
Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTEN 322
+ + L + L DW + A+ Q ++ V + + LG + + ++ +
Sbjct: 686 EIEDHLAQAYASLGDWSRSAAHLQKSLLVVEARHGPASVELGHELFKLAQVLFNGFAVSK 745
Query: 323 AIKSMTEAVEILRITHGTNSPFMKEL 348
A+ ++ +A ++L I + + M+EL
Sbjct: 746 ALCAIEKAEKVLSIHYDPENEQMQEL 771
>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
Length = 391
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 47/65 (72%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDV 229
Query: 125 PKGAE 129
G E
Sbjct: 230 EAGEE 234
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%)
Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
+EAL Y T+ Y+ YP HP+ G+Q GKL+ G A+K++ A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367
Query: 339 G 339
G
Sbjct: 368 G 368
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E ++ + E K+ NA + +L +G L P +++ NHSC+PNAV+ F GR A++R
Sbjct: 156 ESNVRKAVELLCKIQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILR 215
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A + + G E+ ISY + R++AL EQY F CTC RC
Sbjct: 216 AERPIQAGDEIEISYTDYTMPLSARREAL-EQYSFECTCARC 256
>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
Length = 544
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 75/140 (53%), Gaps = 2/140 (1%)
Query: 26 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
L L +HM++I + + +I+ + ++ +++E + +KL N++ +
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSYNFHSP 246
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
LG L+P ++INHSC N+V+ +G V+A++ + KG E+L+SYI+
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKI 306
Query: 144 RQKALKEQYLFTCTCPRCIK 163
RQ L E+Y F C CP+C K
Sbjct: 307 RQTELLERYYFKCRCPKCGK 326
>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
Length = 558
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 78/153 (50%), Gaps = 19/153 (12%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP-----------EISINEIAENF 70
+ ++ VEAL+ D E + +A + ++ W E++ + E
Sbjct: 135 EKFAAVEALLDGNVDRFEGRAEAWAGMEVPAKVMGGWVAAGAKGKGKAVEVTERRVVELL 194
Query: 71 SKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
K+ NA T + G GL+ ++++NHSC+PNA++ F GR A +RA++ + +G
Sbjct: 195 CKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAFLRALRDIKEGE 249
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ ISYI+ S R+KAL E Y F C C RC
Sbjct: 250 EIEISYIDCTQSLEHRRKAL-ELYFFQCVCTRC 281
>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
Length = 713
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 150/368 (40%), Gaps = 71/368 (19%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ-------W-------PEISINE-- 65
+NY L++++ K L Y +A++V LQ W P+ +NE
Sbjct: 348 NNYRYFNRLLSNIHK--NKNLAPYIIMASVVTTYLQKYTDFFTWFLTQPSCPKEGLNELV 405
Query: 66 ------IAENFSKLACNAHTI----CNSE-------LRPLGTGLYPVISIINHSCLPNAV 108
I ++ ++LACN+ TI C+S L + +G++P +SI+NHSC PN
Sbjct: 406 KLVGGFITKHIAQLACNSSTIEQWTCSSSDLLFPDVLITIASGIFPSVSIMNHSCRPNVT 465
Query: 109 LVFEGRLAVVRAVQHVPKGAEVLISY-IETAG-STMTRQKALKEQYLFTCTCPRCIKLGQ 166
F VV+A++ + + E+ Y I+ G RQ A KE Y F C C C
Sbjct: 466 NYFMSDTIVVKALEDIAQNEEIFNCYGIDYRGMEREQRQYACKELYHFECKCVICS---- 521
Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKK 226
D E +L+ Y C C G + + C CG + +KI E ++K
Sbjct: 522 -DPAHELDMLDSYLCPK--CKGLVPEIKNTILSFCVNCGEKYHLKPFRKINYE----AQK 574
Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
+ S + ++ + K+ EK+ K + F + +L ++L D + Y
Sbjct: 575 YMESDSSNQLELLIKSLKIREKILYKHHKDFE----EVYYRLYSFYVKLGDAENMFKYFH 630
Query: 287 LTI----------------PVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA 330
L + +Y+ H L G Q C L+ FL + E IK EA
Sbjct: 631 LWLENEKARRGCNSRVVGTKLYEAALAILHCLKGGQQKNCTNLKDFLQNVEYMIK---EA 687
Query: 331 VEILRITH 338
IL + +
Sbjct: 688 KAILNLYY 695
>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 6 LRRKLQNDNVIPSTTTDNYSLVEALVAHM-----SDIDEKQLLLYAQIANLVNLILQWPE 60
LRR L+ + P +D + + + D+DE + + A P
Sbjct: 149 LRRLLKLQSHEPEVDSDEREWLSGIARNTLRLMEQDVDEGVKVFGEKDA---------PG 199
Query: 61 ISINEIAENFSKLACNAHTICNSE---LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
+ E+ + CN+HT+ SE P+GT +Y S NHSCLP+A EG
Sbjct: 200 YGVEELVRLMCCVRCNSHTLYASEEWPSEPVGTAVYLKGSAFNHSCLPSAEFYNEGTSLR 259
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPR 160
VR+V+ + G EV ISY+ + R++AL QY F C C R
Sbjct: 260 VRSVRDISAGEEVTISYVPVTETLWDRRQALWRQYKFDCECDR 302
>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
24927]
Length = 526
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 76/140 (54%), Gaps = 7/140 (5%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
++++ + + M+++ K +++++ ++ PE + + + KL N+ ++
Sbjct: 155 EDFAKIGGDIWEMANLTAKAAAEFSKVSKQISF---EPEF----LRDMYCKLLINSASVV 207
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST 141
P+G L ++ NHSC PNAV++F+GR +R+++ + K E+ ISYI+ S
Sbjct: 208 TQTFDPIGLCLAYQSAMFNHSCDPNAVMMFDGRQLSIRSLKEITKDTEITISYIDNLASR 267
Query: 142 MTRQKALKEQYLFTCTCPRC 161
R++ LK +Y F C+C C
Sbjct: 268 KERKEELKSRYFFDCSCSLC 287
>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
carolinensis]
Length = 683
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 111/265 (41%), Gaps = 11/265 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
L T L+PV+S++NHSC PN + F+GR A RA + +P+G E+L Y R+
Sbjct: 413 LATALFPVLSLLNHSCDPNTSVTFDGRTATARASRAIPRGQEILHCYGPHRCRMKPSERR 472
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDSDDKGFTCQQ 203
+ L QY F C C C + S + C+ G FL S
Sbjct: 473 QRLLAQYFFECRCSACTDETGPESASGSTLFCCPTCRAPMQGEGLFLCCSSAS------- 525
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
C + KE + E+ + + L + G V + ++L P + + +
Sbjct: 526 CKAMLPKEGFQHQLRELRQHMEAAMDLLAQGKADRSVEQLRKCHLEAQQLLSPGHLVVGE 585
Query: 264 TREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA 323
+ L ++ +W EA A+ + +I V + Y +G + + ++ + A
Sbjct: 586 IEDHLAQVHATQGNWPEAAAHLRRSIEVVEAHYGAGSIEVGQELFKLAQVLFNGRFVPEA 645
Query: 324 IKSMTEAVEILRITHGTNSPFMKEL 348
+++ +A +L + G P ++EL
Sbjct: 646 LQAAEKAEAVLSVQLGPRGPEVQEL 670
>gi|380801673|gb|AFE72712.1| SET and MYND domain-containing protein 4, partial [Macaca mulatta]
Length = 212
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
L TG++PVIS++NHSC PN + F +A +RA Q + KG E+L Y ++ RQ
Sbjct: 26 LATGIFPVISLLNHSCSPNTSVSFISTVATIRASQRIGKGQEILHCYGPHKSRMGVAERQ 85
Query: 146 KALKEQYLFTCTCPRC 161
+ L+ QY F CTCP C
Sbjct: 86 QKLRSQYFFDCTCPAC 101
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 75/142 (52%), Gaps = 3/142 (2%)
Query: 40 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
K L A A + +LQ E ++ + E K+ NA +++ G L+P +S++
Sbjct: 171 KDFELQATGAVVYAGLLQSDE-TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMV 229
Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
NHSC+PNA + FE R A ++A + + G E+LISYI+ RQ++L+ Y F C C
Sbjct: 230 NHSCVPNAYITFEKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLR-LYHFQCNCV 288
Query: 160 RC-IKLGQFDDIQESAILEGYR 180
RC L ++ IQ + G +
Sbjct: 289 RCKDDLNVYEVIQACGVSPGVK 310
>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
Length = 646
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 72/325 (22%), Positives = 136/325 (41%), Gaps = 54/325 (16%)
Query: 13 DNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWP------------- 59
D ++P ++Y V L+ ++ + K L YA A+L+ L L+
Sbjct: 282 DGIVP-CENNSYHNVYGLITNLEKYEGKHLFQYALTASLLALYLKTHTSYFSNESNDDDD 340
Query: 60 ---EISINEIAENFSKLACNAHTICNSEL-------------RPLGTGLYPVISIINHSC 103
EI + + ++ +L CN H I + + +GT +YP S++NHSC
Sbjct: 341 NNLEIVASTLFKHILQLICNGHAITDVITDDSNDSNAVDLIEKRIGTAIYPSASMMNHSC 400
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRC 161
P+ + F +VRA + + G E+ Y A RQ+ L++QY F C C C
Sbjct: 401 EPSIITSFSNNHLIVRASKPIKAGEEIFNCYGPHAKRMKKEERQEILRKQYFFNCHCMAC 460
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVN 221
+ Q+D ++ Y C + C G L+ +D C C ++ +
Sbjct: 461 HQ--QYDIMER---FMAYMCFE--CKGPLIIINDSNNGYCLDC-------------NKNS 500
Query: 222 ILSKKTL--ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 279
I+S+ + + G + + + + +K+ + F+ +L + ++ +LED+
Sbjct: 501 IISQDVIRESFKEQGKILDALKKFLNFFIIGRKILYKFNKDLCYSADQAASCFSKLEDYD 560
Query: 280 EALAYCQLTIPVYQRVYPQFHPLLG 304
+ L + TI ++ + F LG
Sbjct: 561 KCLVILKCTILSEEKRFGSFSIELG 585
>gi|32564211|ref|NP_871849.1| Protein SET-18, isoform b [Caenorhabditis elegans]
gi|25005018|emb|CAD56600.1| Protein SET-18, isoform b [Caenorhabditis elegans]
Length = 507
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 138/330 (41%), Gaps = 30/330 (9%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H DI DE+++ + +I + I E + F K N+H+I +
Sbjct: 135 SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGN 194
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S NHSC P +VF+G +R + VP E ISYI+ S
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSK 252
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R++ L ++ F C C RC+ D ++ A L RC + C +L ++
Sbjct: 253 YIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNI 305
Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALT---SCGNHQEVVSTYKMIEKLQK--KLY 254
C++C + ++ +K + KTL + C E + K+ E L K ++
Sbjct: 306 ACEKCKTIVEEDTVKAAQEYM-----KTLPASFDPKCPAEIEALPG-KLKELLAKAEQIL 359
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
HP +V + + R L + L + + Q+ Y+ +P+ +G Q K
Sbjct: 360 HPSNVYVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKAL 418
Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPF 344
+ + A+ +A+ I + G + P+
Sbjct: 419 IEKDERDEAMPYAFDAMNIFEVCFGLDHPY 448
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 58/123 (47%), Gaps = 5/123 (4%)
Query: 76 NAHTI--CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
NAH + NS GLYP +S++NHSCLPN V G VRA++ V G ++ ++
Sbjct: 306 NAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVT 365
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
YI R + L + F C C RC+ + LE +C GC G+LL
Sbjct: 366 YINIMEPRRIRARELMDTKHFACACERCV---SPLETHPDRFLEAVKCGAGGCDGWLLEQ 422
Query: 194 SDD 196
D
Sbjct: 423 RAD 425
>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
africana]
Length = 478
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 44/308 (14%)
Query: 52 VNLILQ-WP----EISINEIAENFSKLACNAHTICNSE-LRPLGTGLYPVISIINHSCLP 105
VN L+ WP + S I+ F + CN T+ + L+ +G G++P + ++NH C P
Sbjct: 151 VNTFLEYWPPQNQQFSTQYISHIFGVINCNGFTLSDQRGLQAVGVGVFPNLGLVNHDCWP 210
Query: 106 NAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
N ++F G V+++ H +S + G T Q +K L TC C+ L
Sbjct: 211 NCTVIFNNGNHEAVKSMFHTQ------MSKAQDPGKT-GPQFVIKLVGLTTCFFS-CLAL 262
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 224
G A+ EG KDD L DD + +E+ K E+ LS
Sbjct: 263 GV------EALPEGL--KDD----LFLGVKDDP----------KPSQEVVK---EMIQLS 297
Query: 225 KKTL----ALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKE 280
K TL S G + EVV + + Q+ ++ ++ ++ + ++L L+ ++E
Sbjct: 298 KDTLEKIDKARSEGLYHEVVKLCRECLQKQEPVFADTNLYTLRMLSIVSEVLSYLQAFEE 357
Query: 281 ALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGT 340
A Y + + Y ++Y + LG+ G W G+ E + +A IL +THG
Sbjct: 358 AADYAKRMVDGYMKLYHHNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGP 417
Query: 341 NSPFMKEL 348
+ P K+L
Sbjct: 418 SHPITKDL 425
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ +++NH C+PN+ FE + + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + G C+D CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ +++NH C+PN+ FE + + +VRA +P+G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + G C+D CSG ++
Sbjct: 268 KIKRGFTCKCSRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
Length = 584
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/313 (21%), Positives = 128/313 (40%), Gaps = 61/313 (19%)
Query: 62 SINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
S+++ + F KL + +S E P+G +Y S+ NHSCLPN F R +R
Sbjct: 240 SVDDPPDQFRKLTSVGDALTSSLEQVPVGQAIYKAASLFNHSCLPNIHAYFLSRTLFIRT 299
Query: 121 VQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
++V G + +SY G + R + L ++Y F C C C +L D +L
Sbjct: 300 TEYVSTGCPLELSYGPQVGQSDCEDRLRYLADKYSFRCQCRGCSQLNLSD-----LVLNA 354
Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASE--------VNILSKKTLAL 230
+ C + C+G +L + G T + R+ E+ +K S V+I +LAL
Sbjct: 355 FCCVNHNCAGVVLESTIINGETRKLNNFPRAPEK-QKFDSHLQGHKLNIVDINDVASLAL 413
Query: 231 T--------------SCGNHQEVVSTYKMIEK---------------------------- 248
CG+H+++ ++++ I K
Sbjct: 414 KFNNSSLHIHPGFCLHCGSHRDLDASHEAINKAWSYIKRLQEAIISKDISGTTLLDASRA 473
Query: 249 --LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
+ + H ++ ++ + + L + + D++ A +C+ +I + Q +Y H ++G +
Sbjct: 474 LGILRSTLHAYNKSVAEAEDNLAQAFCLVRDFQSAREHCKESIKILQTLYDPDHIVIGYE 533
Query: 307 YYTCGKLEWFLGD 319
++ L D
Sbjct: 534 LVKLASIQLSLDD 546
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 2/120 (1%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
++ + E K+ NA +++ G L+P +S++NHSC+PNA + FE R AV++A
Sbjct: 172 TLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYITFEKRKAVLKAE 231
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYR 180
+ + G E+LISYI+ RQ++L+ Y F C C RC L ++ IQ + G +
Sbjct: 232 RDLEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 290
>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 422
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
G +Y S+ NHSC PNA V GR + A++ + E+ ISY+ + R+
Sbjct: 251 FGLAIYLEASLFNHSCYPNAARVQRGRSIDIIAIRDIEPNEEICISYLNITNGSHERKDH 310
Query: 148 LKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
LK YLF C C RC + ++I S I R K CSG+L +G +QC
Sbjct: 311 LKNNYLFDCVCIRCTQTNPDIENIVRSFICRNPRVK---CSGYLYL---PQGSDSRQCNF 364
Query: 207 VRSKE--EIKKI 216
+ KE EIK+I
Sbjct: 365 CQWKESSEIKRI 376
>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 740
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 79/156 (50%), Gaps = 13/156 (8%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIAN-LVNLI--LQWPE-----ISINEIAENFSKL 73
D+Y + +L +S I+ KQL + N + N ++ P+ + EI + + K+
Sbjct: 400 DSYEDLASLTTDVSKIESKQLQSFTVATNSITNFFAAVKGPKYLSDPFTKQEIFDLYPKI 459
Query: 74 ACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVL 131
NAH + +G G+YP S INH+CLPN + L + R+ + + KG E+
Sbjct: 460 LFNAHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHGLILYRSSRDILKGEEIT 519
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCI--KLG 165
SY++ + R+K LK QY F C C RC+ KLG
Sbjct: 520 TSYLDILKPKLQRRKDLK-QYSFVCQCERCLNEKLG 554
>gi|443926535|gb|ELU45157.1| zf-MYND domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 584
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 46/264 (17%)
Query: 64 NEIAENFSK------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----- 112
N + F K + NAHT+ L P+G L PV ++INHSC+PN V+VF
Sbjct: 189 NSVKAGFGKRYRGSIIEANAHTLSTPSLTPIGVALSPVAALINHSCVPNCVVVFPKASET 248
Query: 113 ---GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC-----IKL 164
++ A+ + G EV I + T G K L+E+Y F C CP C +K
Sbjct: 249 KKVPNEMLIIAIGKILPGEEVQIPH--TLGK-FADNKILQERYFFKCHCPNCEMTSLVKT 305
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILS 224
D + +RC C GFL S +Q N+ S
Sbjct: 306 PYVDSRR------AFRCASKTCEGFLPMPSMQSQLGQKQ---------------RFNVYS 344
Query: 225 KKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV--NLMQTREKLIKILMELEDWKEAL 282
+ AL++ Q + K K ++ S + T + + + L D + A
Sbjct: 345 AISCALSNPRRSQMQSDSEKKGSKGRRNFNSLVSYFRSPGLTYHQSLTLCAGLLDPERAF 404
Query: 283 AYCQLTIPVYQRVY-PQFHPLLGL 305
Y IP+ R + P HPLL L
Sbjct: 405 KYTSNLIPMVSRFFHPSAHPLLAL 428
>gi|388854233|emb|CCF52152.1| uncharacterized protein [Ustilago hordei]
Length = 938
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 16/118 (13%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S E+ + + A N+ T+ +S+L PLG ++P +++INH+C PNA +VF
Sbjct: 481 SAKELLDLVCQFASNSFTLTDSDLNPLGVCMHPSMAMINHACTPNAAVVFPFGGAAKNGQ 540
Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
E R+ + A++ + G E+LISY++ A + R+ LK++Y F C C C K
Sbjct: 541 QKWNDGEDRIMQLVALRAIEPGEELLISYVDIADTYQDRRNYLKKRYCFDCRCELCRK 598
>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
Length = 666
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 16/139 (11%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQ 145
+G GL+P +++NHSC P A++ + G AVVRA++ + K E+ I+Y T ++R+
Sbjct: 472 IGIGLFPTAALLNHSCNPEAIVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDEELSRR 531
Query: 146 KALKEQYLFTCTCPRCIKLG----QFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGF 199
LKE + F CTC C++ + D Q + E C LL D D+K
Sbjct: 532 HQLKETHFFHCTCKACLEGWPMWLEMDQNQPDWLCE-------ACGSILLSDKIEDNKFA 584
Query: 200 TCQQCGLVRSKEE-IKKIA 217
C++C ++ E+ I K+A
Sbjct: 585 KCKKCSHRQNLEDAINKLA 603
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 75/142 (52%), Gaps = 4/142 (2%)
Query: 26 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
+ + L H+++I E Q L +I + + ++E +A +KL N+ + S
Sbjct: 140 VFDGLETHINEISESQGQL-DRINLTARAVKNYSGTEMDEGTVASYAAKLDLNSFNLTTS 198
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
+G ++P +INHSC N+ + F+G V+A++ + KG ++ ISYI+T
Sbjct: 199 MYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVKAMRPIKKGEQIFISYIDTTTPYDI 258
Query: 144 RQKALKEQYLFTCTCPRCIKLG 165
R+ LKE+Y F C C +C K+G
Sbjct: 259 RRNELKERYFFDCQCTKC-KMG 279
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 74/140 (52%), Gaps = 2/140 (1%)
Query: 26 LVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNS 83
L L +HM++I + + +I+ + ++ +++E + +KL N+ +
Sbjct: 187 LFRKLESHMAEIQAQNWEQWQRISLTAKAVKKYSGCALDEEMLCHYGAKLELNSFNFHSP 246
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
LG L+P ++INHSC N+V+ +G V+A++ + KG E+L+SYI+
Sbjct: 247 LADRLGFYLHPYAALINHSCDYNSVVGSDGDALYVKALRPIRKGEEILVSYIDATNPYKI 306
Query: 144 RQKALKEQYLFTCTCPRCIK 163
RQ L E+Y F C CP+C K
Sbjct: 307 RQTELLERYYFKCRCPKCGK 326
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ +++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC CPRC D ++ A + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTKCSGLVV 305
>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 564
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 8/104 (7%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLY--PVISIINHSCLPNAVLVFEGRLAV 117
E++ + E K+ NA T + G GL+ ++++NHSC+PNA++ F GR A
Sbjct: 190 EVTERRVVELLCKIKTNAFTRSEA-----GEGLFLDTTLAMVNHSCVPNAIVAFSGRRAF 244
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+RA++ + +G E+ ISYI+ S R+KAL E Y F C C RC
Sbjct: 245 LRALRDIKEGEEIEISYIDCTQSLEHRRKAL-ELYFFQCVCMRC 287
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 403 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 458
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 459 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 505
>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Taeniopygia guttata]
Length = 797
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 120/294 (40%), Gaps = 22/294 (7%)
Query: 72 KLACNAHTI-CNSELRP------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+L CNA I EL P L T +PV+S++NHSC PN + F G A V
Sbjct: 491 QLQCNAQAITVMQELGPGDGAVVDKKPVRLATAFFPVLSLLNHSCCPNTSMSFSGTAATV 550
Query: 119 RAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
RA Q + G EVL Y + ++ L QY F C CP C++ + +I
Sbjct: 551 RASQPISSGQEVLHCYGPHWCRMRVAERQQLLRQYFFECRCPACLEELESGVKSVVSIRN 610
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
+ C C + + D + + C S+ + ++ KK L + G
Sbjct: 611 SFCCPK--CQAQMQGEEDTLCCSNEACATSASRNHLSGRLQDLQQQIKKALGMLRVGKAD 668
Query: 238 EVVSTYKMIEKLQ---KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
+ + KM+ K Q P + + + + L ++ L W+EA + + +I + +
Sbjct: 669 QAI---KMLLKCQMDAGTFLSPEHLLMGEMEDHLAQVYATLGKWQEAARHLKKSIEIVEM 725
Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
+ G + + ++ + A+ ++ A IL + G + ++EL
Sbjct: 726 HHGPSSVETGHELFKLAQILFNGFAVSEALSTIQRAEGILSVHFGPQNAQIQEL 779
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
++ + E K+ NA +++ G L+P +S++NHSC+PNA + FE R A ++A
Sbjct: 192 TLKQAMEVLCKMQTNAFNRFDADTGQAGIYLHPSLSMVNHSCVPNAYIAFEKRKAFLKAE 251
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGYR 180
+ + G E+LISYI+ RQ++L+ Y F C C RC L ++ IQ + G +
Sbjct: 252 RDIEPGDEILISYIDHTMPRRARQESLR-LYHFQCNCIRCKDDLNAYEVIQACGVSPGVK 310
>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 593
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 11/112 (9%)
Query: 64 NEIAENFSK---LACNAHTICNSELR--------PLGTGLYPVISIINHSCLPNAVLVFE 112
+++ +N SK L AH + + R P+G L P+++ NH C PNA + F+
Sbjct: 201 DDLEKNNSKWDDLLLQAHALSTNAFRVESNVGNGPIGLCLDPLLARANHCCYPNAAITFD 260
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
G+ A +RA+ + G ++ ISYI+ RQ AL+E + F C C RC
Sbjct: 261 GKRATLRALFPIKNGEQIFISYIDETQRQEVRQAALEETWFFKCRCSRCTNF 312
>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
Y34]
gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
P131]
Length = 542
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ N+ +++ GT L PV+++ NHSC+PNAV++F R A +RA + +G+E+
Sbjct: 188 KIQTNSFDRFDADTGASGTFLDPVLAMANHSCVPNAVVLFWRRKAYLRAEMPIKQGSEIS 247
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
ISYI+ RQ+ L Y FTC CPRC
Sbjct: 248 ISYIDYTKPVRFRQEDL-WLYHFTCKCPRC 276
>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
aries]
Length = 391
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 46/65 (70%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
+I E F+K+ CN+ TICN+E++ +G GLYP +S++NHSC PN +VF G ++R V+ V
Sbjct: 170 DIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDV 229
Query: 125 PKGAE 129
G E
Sbjct: 230 EAGEE 234
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%)
Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
+EAL Y T+ Y+ YP HP+ G+Q GKL+ G A+K++ A +I+R+TH
Sbjct: 308 EEALFYGIRTMEPYRIFYPGNHPVRGVQIMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTH 367
Query: 339 G 339
G
Sbjct: 368 G 368
>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
Length = 403
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 110 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 165
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 166 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 212
>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
scrofa]
Length = 788
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 59/108 (54%), Gaps = 6/108 (5%)
Query: 58 WPEISINEIA--ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
+PE +I +A + +L CNA I + ++ G ++P++++ HSC PNA + F +
Sbjct: 481 YPEWNIWGVAMLRHMLQLQCNAQAI--TAIQQTGDSIFPIVNVFXHSCSPNASVSFISTV 538
Query: 116 AVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
A VRA Q + KG EVL Y E+ R + L+ QY F C CP C
Sbjct: 539 ATVRASQPIRKGQEVLHCYGPHESRMGVAERHQKLRSQYFFDCNCPPC 586
>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
Length = 73
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 46/73 (63%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
GTG++P +NHSC PN+V VF G ++A++ +P G E+ ISYI+ TRQ+ L
Sbjct: 1 GTGIFPNAVCLNHSCAPNSVAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEEL 60
Query: 149 KEQYLFTCTCPRC 161
+ Q+ F C C RC
Sbjct: 61 QTQFCFYCQCHRC 73
>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
Length = 675
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 85/179 (47%), Gaps = 6/179 (3%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY- 134
A T+ ++ +G G+YP ++++NHSC P V F G +VRA++ + G E+ +Y
Sbjct: 466 GAKTLAKAKSMFIGGGVYPTVAMLNHSCNPGVVRYFIGTTMIVRAIRTISVGEEISENYG 525
Query: 135 -IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK-DDGCSGFLLR 192
I T + R++ L+ QY F C C C D + IL ++C+ C LL
Sbjct: 526 PIFTTTPEIERKRKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLLV 584
Query: 193 DSDDKGFT--CQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
SD K F C +CG + + K+ + + L + G +++ + Y I KL
Sbjct: 585 RSDTKEFMIGCAKCGKSTNILKGLKVLQDTDALFRVASTNLEEGRNEQALKAYLEILKL 643
>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 675
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 78/168 (46%), Gaps = 8/168 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G GL+P I++ NHSC P + F G VVRA++ + G E+ +Y I T R+
Sbjct: 478 IGGGLFPTIALFNHSCNPGVIRYFIGTTMVVRAIRSIAAGEEISENYGPIFTMSPEAERK 537
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDG--CSGFLLRDSDDKGFT--C 201
+ L+ QY F C C C+ D + IL ++C D G C L +D F C
Sbjct: 538 RTLRWQYWFDCNCEACVGHWPLLDDIDPTILR-FKC-DTGRECGNILPVKTDTNEFMIRC 595
Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKL 249
+CG + + K + + L + G H E +S+Y I KL
Sbjct: 596 PKCGKNTNILKGLKALQDTDSLFRVASNNLEDGKHVEALSSYLKILKL 643
>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
Length = 495
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 73/143 (51%), Gaps = 3/143 (2%)
Query: 21 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAH 78
++ + + + L HM++I Q L +I + ++ ++E +A +KL N+
Sbjct: 135 SEEFKVFDGLETHMNEISGSQGQL-DRINLTARAVKRYSGTDVDEGTVASYAAKLDLNSF 193
Query: 79 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA 138
+ S +G ++P +INHSC N+ + F+G V+A+ + KG ++ ISYI+T
Sbjct: 194 NLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGFDGEELYVKAMHPIKKGEQIFISYIDTT 253
Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
R LKE+Y F C C +C
Sbjct: 254 TPYDIRCNELKERYFFDCQCTKC 276
>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
Length = 306
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 2/87 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LGT L S +NHSC P A + FEGR VRA+ +P G E+ +Y++ +GS +RQ
Sbjct: 9 LGTTLDLAASFMNHSCNPGAFVFFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQAT 68
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESA 174
+ +Y F C C RC +D+Q+ A
Sbjct: 69 TEAEYFFQCHCVRCED--DLEDLQQIA 93
>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 448
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ CN I + + + LG + P+ S NHSC PN V V L +A+ +PKG+++
Sbjct: 272 KIRCNQFGIWSKKDKCLGVSVTPIASYFNHSCCPNIVDVRGTTLLEFKALHFIPKGSQLC 331
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG-FL 190
ISY++ +T +RQ L Y F C C RC G D ++ I + Y CSG +
Sbjct: 332 ISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKG---DSIDNWISQFYCQGTKKCSGTYF 388
Query: 191 LRDSD 195
L D +
Sbjct: 389 LEDEE 393
>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
1558]
Length = 885
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 1/102 (0%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
T ++P +S+INHSCLPN V + G +AVVRA+ + G E+ ISY TR K
Sbjct: 553 TAIWPTVSMINHSCLPNTVHIAWGDVAVVRALNDMSAGTELTISYFSREQPYETRAAKAK 612
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
QYLFTC C C + I+ ++E + LL
Sbjct: 613 -QYLFTCDCNLCAADRMDESIRRGKLMEKFDMAKQASRNILL 653
>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 389
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 62/119 (52%)
Query: 45 YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
+ + + VN L + + + ++L NA T+ N +G + P S NHSC
Sbjct: 73 FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 132
Query: 105 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
PNA + F+G + ++A+Q + ++ ISYI+ RQ L+++Y FTC CP+C +
Sbjct: 133 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWGKRQSELQKRYFFTCKCPKCAQ 191
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 74/145 (51%), Gaps = 12/145 (8%)
Query: 26 LVEALVAHMSDI-------DEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACN 76
L + L +HM D+ D+ Q + +I + + +++E IAE +KL N
Sbjct: 187 LFKGLESHMKDVCIRNANGDKDQ---WERILLTAKAVKNYSGCAMDESLIAEYAAKLDVN 243
Query: 77 AHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE 136
+ N LG ++P +++NHSC NA + F+ V+A+ + KG ++ I+Y++
Sbjct: 244 SFNFDNVMDERLGIYMHPYAALMNHSCDYNATVAFDDDRLHVKALHPIKKGEQIFITYVD 303
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRC 161
T RQK L ++Y FTC C +C
Sbjct: 304 TTNPYKIRQKELSDRYYFTCRCSKC 328
>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
Length = 781
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/364 (19%), Positives = 153/364 (42%), Gaps = 52/364 (14%)
Query: 40 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS-ELRPLGTGLYPVISI 98
+ ++L +Q+ +++ + + +++ KL+ + + +S E P+G +Y S+
Sbjct: 423 QAIILVSQVRVNAMAVVRMKSVDAHCPSDHLVKLSHSGDALTSSVEQVPVGQAIYTAGSL 482
Query: 99 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTC 156
NHSC PN F R +R +H+ G + +SY G R K L+++Y F C
Sbjct: 483 FNHSCQPNVHAYFLSRTLFIRTTEHLATGCPLELSYGPQVGQRDCKDRLKFLQDKYSFRC 542
Query: 157 TCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIK-K 215
C C + D +Q + +RC D C G +L S + R+ E +
Sbjct: 543 HCNGCSIVNLSDLVQNA-----FRCIDLNCVGVVLDRSVINSEIKKLKNFPRAPERQRLD 597
Query: 216 IASEVNILSKKTLALT------------SCGN-------HQEVVSTYKMIEKLQ------ 250
+ +V+ L+ L L+ +CG+ H+ + + + ++LQ
Sbjct: 598 LCLQVDDLAHLALELSNGPLHIQPGCCLNCGSYCDLEAVHEGMRTAWIYFKRLQDAIVVK 657
Query: 251 -----------------KKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ 293
+ + H ++ ++ + + L + +ED++ A +C+ +I + +
Sbjct: 658 KISTTVITDASRALGALRSILHAYNKHIAEAEDILAQAFCLVEDFQSARDHCRASIKILE 717
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
+Y H ++G + ++ +GD +A+ S+ I G+++ F+ + L
Sbjct: 718 MLYGPDHIVIGYELIKLSTIQLSMGDL-SAVDSINRLGAIFERYFGSHADFIFPYLQTLR 776
Query: 354 EAQA 357
E A
Sbjct: 777 EKLA 780
>gi|17509133|ref|NP_492772.1| Protein SET-18, isoform a [Caenorhabditis elegans]
gi|3924878|emb|CAB03382.1| Protein SET-18, isoform a [Caenorhabditis elegans]
Length = 501
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/325 (22%), Positives = 132/325 (40%), Gaps = 26/325 (8%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H DI DE+++ + +I + I E + F K N+H+I +
Sbjct: 135 SLVDHKVDIRDDEEKMAHFEKIFEKMGAFRGEEMIEKGEFFDVFCKATINSHSIHTNAGN 194
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S NHSC P +VF+G +R + VP E ISYI+ S
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEEAFISYIDVGRSK 252
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R++ L ++ F C C RC+ D ++ A L RC + C +L ++
Sbjct: 253 YIRRRDLNSRWYFNCECTRCM------DPEDDA-LTAIRCANPACDAPILTSETEEPMNI 305
Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
C++C + ++ +K + L +E+++ + I HP +V
Sbjct: 306 ACEKCKTIVEEDTVKAAQEYMKTLPASFDPKCPAEILKELLAKAEQI-------LHPSNV 358
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
+ + R L + L + + Q+ Y+ +P+ +G Q K +
Sbjct: 359 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 417
Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
+ A+ +A+ I + G + P+
Sbjct: 418 RDEAMPYAFDAMNIFEVCFGLDHPY 442
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 131/320 (40%), Gaps = 97/320 (30%)
Query: 7 RRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISIN 64
R+ LQ PST + L A+V ++ QL Y A+LV+LI
Sbjct: 160 RKDLQ-----PSTYESHTHLAHAVVKYLGLDAPGQLGAYGITTAADLVDLI--------- 205
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----------EGR 114
S+ N+ T+ L PLG + P++++INHSC PNAV+V+ E +
Sbjct: 206 ------SRFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRSTSDPSQQEPQ 259
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 174
+ VV A++++ E+ +YI+T RQ AL Y FTC C C + D
Sbjct: 260 MQVV-AIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDGMD------ 312
Query: 175 ILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
+ E C + C G TC + +E N LS+ CG
Sbjct: 313 LREAIWCPKN-CGG-----------TC-------------PLPTEENTLSR-------CG 340
Query: 235 NHQEVV-STYKMIEKL---QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLT-- 288
+ +V ST +++ L Q+ L S+ L +T + L QLT
Sbjct: 341 KCKAIVSSTDSVLDALRVGQEALQKATSLQLTETAKAL-----------------QLTTN 383
Query: 289 -IPVY--QRVYPQFHPLLGL 305
IP+ ++ P HPLL L
Sbjct: 384 IIPILTSSKLTPSSHPLLAL 403
>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 704
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/382 (20%), Positives = 158/382 (41%), Gaps = 76/382 (19%)
Query: 23 NYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFS----------- 71
NY + L AH + K+L+ YA + L+ + E+ + + E FS
Sbjct: 335 NYECIVGLEAHWFNHSHKELVQYAVTSILLAKCF-YRELVLPKTCETFSEEELITEIASL 393
Query: 72 ------KLACNAHTICNSELRP----------------------LGTGLYPVISIINHSC 103
+L N+H I +E+R + T +YP +S++NH+C
Sbjct: 394 LLLHTRQLKSNSHAI--TEVRSSEGENTAGESVGGSVQQISQGRIATAVYPTVSLMNHAC 451
Query: 104 LPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRC 161
PN + F + VRA++ + +G E+ Y G +T RQ+AL QY FTC C C
Sbjct: 452 QPNVIASFRKGIISVRAIEKIMRGDEIQHCYGPQVGHMTTSDRQQALLNQYCFTCRCRAC 511
Query: 162 IKLGQFDDIQESAILEGYRCKD---------DGCS-----GFLLRDSDDKGFTCQQCGLV 207
+ + D +E ++ +C C+ G L+ + + G T L+
Sbjct: 512 TRKPRTFDKEEDLCIKCPQCGQPLNIQTSMCGKCAERIDVGVLIHELTNAGTT-----LI 566
Query: 208 RSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREK 267
+E ++ ++ +EV+S K + +++ P + L +
Sbjct: 567 GLEEMFSAAVNDDTLM-------------REVISKTKSCIDVLERIIIPPDMQLATAYDD 613
Query: 268 LIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSM 327
+ K +EL+++KEA + P + + + + + Y ++ + + A++++
Sbjct: 614 MAKCHVELDEFKEAALWLAKATPSIESRFGRDSIEVAHELYKLAQIYFNGKEIAPAMETI 673
Query: 328 TEAVEILRITHGTNSPFMKELI 349
A+E+ +G ++ + EL+
Sbjct: 674 DRALELFIRHYGNSNEEVLELV 695
>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe]
Length = 483
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
F +LA NA + S LG L ++ +NHSC PN ++F+G + + + + + K +
Sbjct: 163 FCRLAVNAMNLVTSSFDSLGMCLDTILCRLNHSCDPNCQIIFDGAIVQLVSKRDIKKDEQ 222
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ ISYI+ RQK L ++Y F+C CPRC
Sbjct: 223 LFISYIDIRLPKSIRQKQLLKKYFFSCYCPRC 254
>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
Length = 650
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 109/267 (40%), Gaps = 27/267 (10%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLIL---------QWPEISINEI 66
I D+Y V LV H S + +A L+N L Q +
Sbjct: 358 IDKLPVDDYRKVYKLVTHESTRSPEDFFQRTLMATLLNACLTLGGYGACPQEQNFIGGLL 417
Query: 67 AENFSKLACNAHTICN---------SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
N L NAH + + +G GLYP +++ NHSC P + G
Sbjct: 418 VHNLQLLQFNAHEVSEMIRETAEDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQVC 477
Query: 118 VRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
VR V+++P + V +Y + T R+ L QY FTC C C++ + +
Sbjct: 478 VRTVKNIPADSMVAENYGPLFTQVRRDERRDTLLHQYRFTCQCVPCVENWPLFTEMDPGV 537
Query: 176 LEGYRCKDDG--CSGFLLRDS--DDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+ +RC D G CS LL + +D C +CG + + K + ++L K L
Sbjct: 538 IR-FRC-DSGKICSNVLLIPAAVNDFMVKCTECGEHTNIMKGLKSLQDTDMLFKTATRLH 595
Query: 232 SCGNHQEVVSTY-KMIEKLQKKLYHPF 257
S G ++ + Y +M+E + + L P+
Sbjct: 596 SAGEYEAALRKYIEMMETMSEVLVPPY 622
>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 377
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 64/122 (52%), Gaps = 1/122 (0%)
Query: 45 YAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCL 104
+ + + VN L + + + ++L NA T+ N +G + P S NHSC
Sbjct: 62 FVTVGDEVNPGLTLKACTGCRVVKYCNELETNAFTLTNRYFDRIGLCMLPFASYANHSCE 121
Query: 105 PNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
PNA + F+G + ++A+Q + ++ ISYI+ RQ L+++Y FTC CP+C +
Sbjct: 122 PNAYIGFDGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELQKRYFFTCKCPKCAQ- 180
Query: 165 GQ 166
GQ
Sbjct: 181 GQ 182
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP+ ++NH C+PN+ FE + +VRA + +G E+ +Y + + R L
Sbjct: 208 ALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC CPRC D ++ A + G C+D CSG ++
Sbjct: 268 KMKKGFTCKCPRC-----SDPTEKGAYISGLYCRDTNCSGLVV 305
>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
Length = 649
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 112/272 (41%), Gaps = 35/272 (12%)
Query: 15 VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---- 70
V D+Y V LV H + +A L+N L ++ +E+F
Sbjct: 356 VTDKLNNDDYRKVYKLVTHEDKRSPEDYFQRTLMATLLNACLSLGGYYKSKESESFIGGL 415
Query: 71 -----SKLACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGR 114
L NAH I SEL+ +G GLYP +++ NHSC P + G
Sbjct: 416 LLHNLQFLQYNAHEI--SELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRYYRGN 473
Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQ 171
VR V+ +P G+ V +Y + T S R+ L QY FTC C C +F D+
Sbjct: 474 SVCVRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACAGDWPKFADMD 533
Query: 172 ESAILEGYRCKDDG---CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKK 226
+ + R K DG CS L ++ F C +CG + + K + ++L K
Sbjct: 534 SNVL----RFKCDGGKNCSNVLEIPAEINEFMVQCTECGEHTNIMKGLKSLQDTDMLFKS 589
Query: 227 TLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
L G ++ + Y +M+ L + L P+
Sbjct: 590 ATKLHEAGEYEFALKKYVEMMNTLDEVLVPPY 621
>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
Length = 405
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 45/73 (61%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 148
GL+P S +NHSCLPNAV G + VV A + + +G+EVLI+Y+ A RQ L
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVVGGIMVVVAARKIRQGSEVLINYLGRASLRPVGERQGQL 181
Query: 149 KEQYLFTCTCPRC 161
E Y F+C CPRC
Sbjct: 182 AEGYHFSCDCPRC 194
>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
Length = 333
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 10/107 (9%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL-----GTGLYPVISIINHSCLPNAVLVFEGR- 114
++I+EI E +S + N H + L PL G G++P S +NHSCLPNA + +
Sbjct: 40 VTIDEIIEIYSIVLSNGHEM----LHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQG 95
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ V R ++ + KG E+L SY + R+K L +QY F C C +C
Sbjct: 96 MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQC 142
>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
Length = 576
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 58 WP--EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
WP + S+ E K+ N+ + ++ GT L +++ NHSC+PNAV+ F GR
Sbjct: 206 WPATDASLAIAVEAMCKIHTNSFDRSDVDVAYTGTFLDAHLAMANHSCVPNAVVSFAGRK 265
Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A +RA Q + G EV ISYI+ RQ+ L + Y FTC C RC
Sbjct: 266 AFLRAEQAIRAGDEVTISYIDYTKPKSVRQRGL-DLYHFTCDCARC 310
>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
magnipapillata]
Length = 1044
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 73 LACNAHTIC----------NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
L CNAH + +SEL+ +G+ +Y +S++NHSC P+ V G V+RA++
Sbjct: 385 LPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTCVLRAIK 444
Query: 123 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
H+ +G+E++ +Y + S + RQ L EQY F C C C
Sbjct: 445 HIKEGSEIVDNYGFLYAVESKVIRQSHLMEQYYFACQCEAC 485
>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 426
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 71/137 (51%), Gaps = 7/137 (5%)
Query: 30 LVAHMSDI-----DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSE 84
L++H +I +K+ L Y I ++ ++ E + +E+ F+KL NA + N +
Sbjct: 116 LMSHKENILMNPEHKKRFLTYLDI--IIYMLGGSLETNESEVLNIFTKLMINASFMLNEK 173
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
L G L S I+HSC PNA+ +F G VV+A+ + +V ++Y++ + R
Sbjct: 174 LIDFGGCLCLEFSAIDHSCRPNAIYMFNGHTLVVKALCEIANFDDVRVAYVDMSQPRSIR 233
Query: 145 QKALKEQYLFTCTCPRC 161
Q+ LK Q+ F C C C
Sbjct: 234 QELLKNQFFFDCNCEEC 250
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 84/165 (50%), Gaps = 21/165 (12%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIAN-LVN-LILQWPE-------ISINEIAENFSKLACNA 77
++++ +H S + + +A +A+ LV L L PE S ++ + S+ NA
Sbjct: 163 IDSMQSHRSSLRPESFESHAYLAHALVRYLGLDAPEKLASFGLRSAGDVVDLISRFTTNA 222
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVPKGA 128
T+ L PLG + P ++++NHSC PNAV+VF E + V+ A++ +
Sbjct: 223 ITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQEPVMQVI-ALRDIHPDE 281
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
EV +YI+T RQK L++ Y FTC+C C + D +ES
Sbjct: 282 EVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCA--AEDPDPRES 324
>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
Length = 634
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 78/159 (49%), Gaps = 18/159 (11%)
Query: 69 NFSKLACNAH----------TICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+ L CNAH ++ SE +G G+Y +S+ NHSC PN F G VV
Sbjct: 408 HLQSLPCNAHEISELQLSLKSVATSEAAEIGAGIYGTLSLFNHSCEPNVTRFFYGDKCVV 467
Query: 119 RAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ-FDDI--QES 173
RA +P EV+ +Y + RQ++L+ QY F C C C++ + ++ Q+
Sbjct: 468 RAFSSIPCRGEVVDNYGILSALTPRKQRQESLQSQYYFKCNCHACLEDSPLYSELIKQDV 527
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEE 212
L+ C+ +G +L +D K C++CG+ +S E+
Sbjct: 528 PQLKCANCR-MALAGEIL--TDGKLVKCEKCGVPQSLED 563
>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
Length = 497
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 17/188 (9%)
Query: 91 GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GLYP + ++ HSC+PN V +G + R +P+G ++ Y T T TRQ A
Sbjct: 200 GLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAA 259
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS----DDKGFTCQQ 203
LK FTC CPRC+ D + +C+ C L+ S D+ + C
Sbjct: 260 LKSSKYFTCRCPRCL-----DPTELGTHFSSLKCQK--CDNGLIVSSNPLDDEAEWRCSN 312
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
C ++K + ++ + L+ N E ++ ++ + K + + HP
Sbjct: 313 CEFKTRGAAVQKA---IQVMQAEIDQLSCLENGPEKLTAFEKLYKKYRSILHPLHFIKTS 369
Query: 264 TREKLIKI 271
R LI++
Sbjct: 370 IRHSLIEL 377
>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 459
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+ SI E K+ NA + + G L PV+++ NHSC+P+A + F+ R AV+R
Sbjct: 89 DFSIPRAREILCKIQTNAFNRLDPDAGMTGIYLDPVLAMANHSCMPSAFVSFDQRNAVLR 148
Query: 120 AVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRC 161
A + + +G E+ I Y++ T + RQ+ALK Y F C CPRC
Sbjct: 149 AWRDIKEGDEITICYVDVTLPNKAARQEALK-LYHFECRCPRC 190
>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
Length = 435
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 2/146 (1%)
Query: 20 TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNA 77
T L L H+S+I ++ + +IA + + + E I+ +KL N+
Sbjct: 69 TNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNS 128
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
N +G L+P ++INHSC NA + F+ ++A + + KG ++ ISYI+
Sbjct: 129 FNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDA 188
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIK 163
R+ L+E+Y F C C +C K
Sbjct: 189 TNPVKLRRSELRERYYFDCHCAKCAK 214
>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
Length = 750
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 118/272 (43%), Gaps = 11/272 (4%)
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
E R + + +YP S++NH+C P+ ++ F + V RA ++ G+ + Y
Sbjct: 468 EQRRIASAIYPTASLLNHACDPDVLVSFVDGVLVARATHNIAPGSGITHCYGPHVNHMPR 527
Query: 144 --RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS--DDKGF 199
RQK L +QY FTC C C + ++ + + C C ++ S +
Sbjct: 528 EERQKLLYKQYFFTCQCSACTSDEEMENTR--LCFSAFACPRCKCP---MKTSPLEPSLA 582
Query: 200 TCQ--QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
CQ +C L +S EE + + +L K + QE + ++ + + ++ HP
Sbjct: 583 RCQNKKCTLEKSIEEELSHSRQAELLFFKAVRTMERIGVQEALGLFQECLRTRTQILHPH 642
Query: 258 SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFL 317
+L +T + L + + D+K A +C + ++ + + + +L +
Sbjct: 643 HKDLAETHDALARCYAMIGDFKLASQHCLQSSEAVEKAFGSTSVEYAHELHKLSQLLFND 702
Query: 318 GDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
+ A+ + +A +L +G N P ++EL+
Sbjct: 703 RQAKKALPMIDKAASLLATYYGRNHPDVQELV 734
>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
Length = 415
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 2/146 (1%)
Query: 20 TTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNA 77
T L L H+S+I ++ + +IA + + + E I+ +KL N+
Sbjct: 49 TNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYSGTDMKEETISAFGAKLELNS 108
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
N +G L+P ++INHSC NA + F+ ++A + + KG ++ ISYI+
Sbjct: 109 FNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQIFISYIDA 168
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIK 163
R+ L+E+Y F C C +C K
Sbjct: 169 TNPVKLRRSELRERYYFDCHCAKCAK 194
>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
Length = 492
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 3/109 (2%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
+ ++ N+ T+ S + PLG P+ NHSC PNA ++ +G RA++ + K E
Sbjct: 181 YGRVFTNSLTLIGSAMEPLGICFDPLACSANHSCDPNAFVIMDGAQLSFRALKPIAKDEE 240
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
+LISYI+ RQ L +Y FTC+C +C Q ++E L G
Sbjct: 241 ILISYIDGTLPFERRQTDLTRRYYFTCSCTKC---AQGPTLREDRWLPG 286
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 120/285 (42%), Gaps = 35/285 (12%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
G +Y +S +NHSC+PNA L F G +A + + + G ++ ISY++ S TRQ+
Sbjct: 183 GVAIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQ 242
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLV 207
LKEQY F CTC C + Q+ C S L S
Sbjct: 243 LKEQYYFDCTCRLCTANANGESRQQ--------CISQFDSAVDLYQS------------- 281
Query: 208 RSKEEIKKIASEVNILSK--KTLALTSCGNHQEVVS-TYKMIEKLQKKLYHPFSVNLMQT 264
+K++ ++ +IL++ ++L L C +H ++ +IE L + + + L
Sbjct: 282 -TKKQTNDASASQSILTRALESLELIVCKSHSVLIRLRTDLIEILVGQRNYAKVIELSND 340
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE-WFLGDTENA 323
+ + + WK + + + I YP + P+ + K W + D +
Sbjct: 341 QLDAFRCV-----WKASQSI-EKGISESDNAYPVW-PMESVSSVDIFKASMWTMDDDLDR 393
Query: 324 IKSM-TEAVEILRITHGTNSPFMKELILKLEEAQAEASYKLSSKD 367
+ + +A LRITHG P +L LE+ E + S D
Sbjct: 394 VAELGKQAAHNLRITHGEKHPMYLDLTEGLEDVLRELEARHSGVD 438
>gi|341883792|gb|EGT39727.1| CBN-SET-18 protein [Caenorhabditis brenneri]
Length = 499
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/325 (21%), Positives = 134/325 (41%), Gaps = 28/325 (8%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H +DI DE+++ + ++ + L + I E + F K + N+H+I +
Sbjct: 135 SLVDHKTDIKEDEEKMKHFEKVFTKMGLFRKDDMIEKGEFFDIFCKASINSHSIHTNAGN 194
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S NHSC P +VF+G +R + VP + ISYI+ S
Sbjct: 195 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAEDTEQAFISYIDVGRSK 252
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R++ LK + F C C RC+ E L RC + C +L ++
Sbjct: 253 FVRRRDLK--WYFNCECTRCMD-------PEDDALTAIRCANPSCDAPILTSETEEPMNI 303
Query: 200 TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSV 259
C +C + + ++K + +L + + + + + +++ HP +V
Sbjct: 304 ACDKCKTIVEENDVKAAQEYMK-------SLPASFDPKCPADILRDLLAKAEQVLHPSNV 356
Query: 260 NLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD 319
+ + R L + L + + Q+ Y+ +P+ +G Q K +
Sbjct: 357 YVARLRTALFHVTGTLTMDNLSSMHTQI-YNNYKMCFPKADRHVGFQLLHIVKALIEKDE 415
Query: 320 TENAIKSMTEAVEILRITHGTNSPF 344
+ A+ +A+ I + G + P+
Sbjct: 416 RDEAMPYAFDAMNIFEVCFGLDHPY 440
>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
Length = 802
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQ 145
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
+ L+ QY F C CP C + Q E +RC
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGR--EAFRC 619
>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 647
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 90/210 (42%), Gaps = 30/210 (14%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE---------IAENFSK 72
D+Y + V H D ++ L +A +A L+N +L + + N
Sbjct: 362 DDYRRIYNFVTHSEGRDTEESLKWAFMAALLNTVLSLGYFYRSRYLDSYIGTLLLHNLQV 421
Query: 73 LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+ N+H I SEL+ +G GLYP + + NHSC P F G VR V
Sbjct: 422 VTFNSHEI--SELQRKSPRDTGISVCIGAGLYPTLVLFNHSCDPGITRYFVGSAVFVRTV 479
Query: 122 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEG 178
+++P G+ V +Y + R+K+LK+ Y F C C C + F D+ S +
Sbjct: 480 RNIPAGSVVAENYGQLFVRAPRHERRKSLKKLYKFDCYCQACYEDWPTFFDMNPSVV--R 537
Query: 179 YRC-KDDGCSGFLLRDS--DDKGFTCQQCG 205
+RC +GC L+ D C++C
Sbjct: 538 FRCAATEGCENGLIYTERLDQNVMKCEKCN 567
>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
[Megachile rotundata]
Length = 523
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 95/210 (45%), Gaps = 17/210 (8%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW---PEISINEIAENFSKLACNAH 78
++Y+ + L +H I K+ ++ + +NLVN I ++ +I +I + L N H
Sbjct: 162 ESYNKLLTLESHSDKIKNKETFIFEESSNLVNFIKRFFKSEDIPEEDIRKIIGVLQVNGH 221
Query: 79 TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIE 136
+ ++ P +Y + S++ H+C N F G L ++ A + KG + I Y +
Sbjct: 222 EVPLTD--PPYVAVYELASLLEHNCKANCSKSFTDTGGL-IIHAATPITKGDHISICYTD 278
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS-- 194
TM R+ L E F CTC RC +F A+ RC + CSG++L +
Sbjct: 279 PLWGTMNRRHHLLETKYFECTCDRCRDPTEF-----GAMFNAIRCSEIDCSGYVLPKTFL 333
Query: 195 --DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
+ + C C V E I++ ++ I
Sbjct: 334 GDHREDYVCTNCTTVVPLEIIEETLEDIGI 363
>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 386
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 82/191 (42%), Gaps = 19/191 (9%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLILQWPEIS----------- 62
+ +T D + LVE V S+ K+ + + I+ L NL+L +
Sbjct: 157 LTNTLDDVFDLVENQVTEESNPAAKERIDSIVEFISELFNLVLLGSTTTKSIINNDDKIE 216
Query: 63 ----INEIAENF-SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
INE + + K CN I + +G + P S NHSC+PN V +G
Sbjct: 217 MIRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVAVSPSSSYFNHSCIPNCTDVRDGSNMT 276
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
+++ + KG ++ ISYIE R+ LK Y F C CPRC D ++ I +
Sbjct: 277 FKSLYPIKKGDQLTISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSIDSMDNWISK 336
Query: 178 GYRCKDDGCSG 188
Y C C+G
Sbjct: 337 FY-CSQKKCTG 346
>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
Length = 393
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQH 123
I E ++K+ A N+E G+GLY + S INHSC+PNA + + V++AV
Sbjct: 267 IDELYAKVGDFAGEFLNNE----GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKAVAP 322
Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ KG E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 323 IEKGDEICISYLDECQLERSRHSRHKTLRENYIFVCQCPKC 363
>gi|222613106|gb|EEE51238.1| hypothetical protein OsJ_32096 [Oryza sativa Japonica Group]
Length = 718
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 150/355 (42%), Gaps = 66/355 (18%)
Query: 42 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
+LL QI N + ++ + + + FS + + +C+ E + +Y S N
Sbjct: 352 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFSGD--VMCSVEQVRVAQAIYMSGSFFN 409
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 157
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 410 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 469
Query: 158 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 181
CP+ LG ++ +I E + C
Sbjct: 470 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 529
Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
+D G L SD D GF C C RS+ ++ + S++N L +
Sbjct: 530 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 582
Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
L S N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+
Sbjct: 583 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 640
Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
+I + +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 641 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 695
>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
Length = 802
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
+ L+ QY F C CP C + Q +RC+ DD CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636
>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
Length = 802
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
+ L+ QY F C CP C + Q +RC+ DD CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636
>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
Length = 802
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
L T L+PV+S++NHSC PN + F G +A +RA Q + G E+L Y E+ R+
Sbjct: 526 LATALFPVVSLLNHSCSPNTSVSFIGTIATIRASQLIRSGQEILHCYGPHESRMGVAERR 585
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCK-----DD--GCSG 188
+ L+ QY F C CP C + Q +RC+ DD CSG
Sbjct: 586 QKLRSQYFFDCDCPPCEREKQRPSAGPGRGAFRCHRCRALLQGDDVLSCSG 636
>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
morsitans]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 5/111 (4%)
Query: 56 LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 113
L PE N++ E + L G+GLY + S INHSCLPNA + F
Sbjct: 253 LPLPETDKNKLDEYIDDIYHKVGEFAGEFLNNEGSGLYLLQSKINHSCLPNAQVTFPYSN 312
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ V++A+Q + G E+ ISY++ S +RQK LKE Y+F C C +C
Sbjct: 313 DIVVLKALQPIQVGEEICISYLDEGQLERSRHSRQKILKENYIFVCECFKC 363
>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
Length = 562
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E ++ + E K+ NA +++ G L ++++NHSC+PNA + F+ R AV+R
Sbjct: 189 EETLEKAREVLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGFDRRTAVLR 248
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A + + +G E+ ISYI+ A RQ+AL+ Y F C CPRC
Sbjct: 249 AERPIQEGEEITISYIDNALPKSARQEALR-LYHFRCDCPRC 289
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 123/291 (42%), Gaps = 34/291 (11%)
Query: 57 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
+W E +EI + L N H + +E P +Y S++ HSC PN F +
Sbjct: 224 KWSE---DEILKIVGILQVNGHEVPLTE--PPSVAIYNNASMLEHSCRPNLSKSFTSKKE 278
Query: 117 VV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
+V A + +G + ISY + T RQ L++ LF CTC RC+ +F +
Sbjct: 279 IVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGTYLSALK 338
Query: 176 LEGYRCKDDGCSGFLLRDSDDK---GFTCQQC-GLVRSKEEIKKIASEVNILSKKTLALT 231
G++ KD CSG LL ++ G+ C +C GLV KE NIL + +
Sbjct: 339 CSGFK-KDSNCSGLLLPENLKNWYGGYICNKCRGLVDGKE-------ITNILDRARVDHE 390
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKIL----------MELEDWKEA 281
+ E +K I + L P L+ + L +I+ + ED
Sbjct: 391 AMQKDNE-QHCHKYIAHYGRWL-GPNHHLLVDVKISLSQIMGGGDPNAIQKISDEDLMTK 448
Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD-TENAIKSMTEAV 331
+ CQ I V+QRV P ++G + +L L + ++SM AV
Sbjct: 449 MKICQELIDVFQRVCPAEARVIGTTRF---ELHAALAEFARRGVESMNSAV 496
>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 669
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 61/123 (49%), Gaps = 15/123 (12%)
Query: 73 LACNAHTICNSELR----------PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ CNAH I EL +G+GLYP +S++NHSC P G VVRA++
Sbjct: 449 IPCNAHEISEYELWRSDITKCHFVEVGSGLYPTMSLVNHSCDPVVTRNCYGETCVVRAIR 508
Query: 123 HVPKGAEVL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI-KLGQFDDIQESAILEGY 179
++ KG E+ Y+ RQ LK QY F C C C+ + DI++ +L Y
Sbjct: 509 NIYKGEEITDNYGYLYPVHDKSERQTRLKWQYFFECKCDACVHDWPLYPDIED--LLPKY 566
Query: 180 RCK 182
+C+
Sbjct: 567 KCE 569
>gi|71019315|ref|XP_759888.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
gi|46099457|gb|EAK84690.1| hypothetical protein UM03741.1 [Ustilago maydis 521]
Length = 739
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 65/118 (55%), Gaps = 16/118 (13%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S E+ + + +CN+ T+ +S+L LG ++ ++++NH+C+PNA +VF
Sbjct: 423 SATELLDLVCQFSCNSFTLADSDLNALGVCMHASMAMLNHACIPNAAVVFPFGGAAKGGQ 482
Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+ ++ + A++ + G E+LISY++ + RQK LK++Y F C C C K
Sbjct: 483 QRWNDGDDKIMQLVALRAIEPGEELLISYVDLCDTVEERQKQLKQRYCFDCRCDLCCK 540
>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 793
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)
Query: 42 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
+LL QI N + ++ + + + FS + +C+ E + +Y S N
Sbjct: 427 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 484
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTCT- 157
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 485 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 544
Query: 158 -----------------CPRCIKLGQF---------DDIQESAILEGYRC---------- 181
CP+ LG ++ +I E + C
Sbjct: 545 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 604
Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
+D G L SD D GF C C RS+ ++ + S++N L +
Sbjct: 605 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 657
Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
L S N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+
Sbjct: 658 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 715
Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
+I + +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 716 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 770
>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 3/101 (2%)
Query: 65 EIAENFSKLACNAHTIC---NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
E+ + + CNAHT+ S L P+G +Y S NHSC+P+A G VR++
Sbjct: 162 ELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVPSAEFCNVGTSLTVRSL 221
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+ V G E+ +SY+ T + R++ L+ QY F C C RC+
Sbjct: 222 RRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARCV 262
>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
Length = 561
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 70/154 (45%), Gaps = 24/154 (15%)
Query: 21 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNL-----ILQWPEISINEIAENFSKLAC 75
+DN L+E L S+ Y +I L+N + E IN A+N K+
Sbjct: 419 SDNKILIEELEKLFSN------KFYDEILGLINYNSISTFVYGEETVINTGAKN-KKMQP 471
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV------QHVPKGAE 129
C LG+ L P+ + +NHSC PN + E R V +A+ Q + KG E
Sbjct: 472 KLRQYC------LGSALIPIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQE 525
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+L SYI+ + RQ L QY FTC C +C K
Sbjct: 526 LLTSYIDESNPFKDRQSILSSQYGFTCNCNKCSK 559
>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
Length = 687
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 149/355 (41%), Gaps = 66/355 (18%)
Query: 42 LLLYAQI-ANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIIN 100
+LL QI N + ++ + + + FS + +C+ E + +Y S N
Sbjct: 321 VLLICQIKVNSIAIVHMKSMDGVKALTKGFSGFS--GDVMCSVEQVRVAQAIYMSGSFFN 378
Query: 101 HSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYLFTC-- 156
HSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y F+C
Sbjct: 379 HSCRPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYYFSCGC 438
Query: 157 ----------------TCPRCIKLGQF---------DDIQESAILEGYRC---------- 181
CP+ LG ++ +I E + C
Sbjct: 439 SSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRHKENFVHVSIGESHVCTLSLPDVSKF 498
Query: 182 -KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNILSKKTL 228
+D G L SD D GF C C RS+ ++ + S++N L +
Sbjct: 499 DEDIVKVGKLFFKSDTMFNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRLKE--- 551
Query: 229 ALTSCGNHQEVV--STYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQ 286
L S N EV+ + +E+++ KL HP+S L Q+ + + + ++ D ++A +C+
Sbjct: 552 -LPSLDNVPEVLIAEALQSLERIE-KLRHPYSKTLAQSHDTIAEAFAKVGDQEQARKHCE 609
Query: 287 LTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTN 341
+I + +++Y H ++ + +E +GD +A + A I + +G +
Sbjct: 610 ASIKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYYGPD 664
>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 644
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 127/294 (43%), Gaps = 13/294 (4%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+++ + E +K + + ++ + +Y S++NHSC PN + ++G +RA
Sbjct: 346 NVHAVTEVATKTDSSTSFVATTQQIRIAVAVYGTASMLNHSCTPNVIAGYDGNQLTIRAT 405
Query: 122 QHVPKGAEVLISYIETAGSTMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
+ + KG EVL Y R K L++QY FTC C C + Q + I S L
Sbjct: 406 EMIKKGGEVLHCYGPRVSDMFRDERLKVLRDQYYFTCKCMFC-GIPQ-EAITPSGALRCP 463
Query: 180 RCKDDGCSGFLLRDSDDKG----FTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
RC D SG + + + G F C + +E KI + + +
Sbjct: 464 RCSD--MSG-VYQTPEVTGQCCNFDCHHSWKLGETDE--KIENARQLFRVGKTLIGDKNK 518
Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV 295
+E +ST K+Q+ + + + L +T + L ++ A Y +L+I +
Sbjct: 519 VKECLSTLHKAYKIQQSVLYKHNKELAETEDCLALCYASFGNYHYAAKYLKLSIKSNIIL 578
Query: 296 YPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELI 349
Y + +G + Y ++ + + + + +AV+I+ +G N +KEL+
Sbjct: 579 YGEDSIEIGHELYKLAQILFNCKNVTEGLGVIDKAVKIISDHYGENHSDVKELL 632
>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 572
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 73 LACNAHTICN---SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
L CNAH I S L +G G Y +S+INHSC PN V ++RA++ + KG E
Sbjct: 384 LPCNAHEISEYYCSNLLEIGAGAYATLSLINHSCDPNVVRHSCRNTVILRAIKPIKKGEE 443
Query: 130 VL--ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+ Y RQKAL +QY FTC C CI
Sbjct: 444 LFDNYGYHYATHELQERQKALLKQYYFTCQCKACI 478
>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
Length = 348
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 136/320 (42%), Gaps = 37/320 (11%)
Query: 61 ISINEIAENFSKLACNAHTICN-SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+S +E+ E + N H I + + +G GLY + +HSC PN + +A +R
Sbjct: 37 VSKDEVFELHCRNYINRHAISDCGYIEEIGKGLYLDLCAYDHSCRPNTIYTCNSFVATLR 96
Query: 120 AVQHVPKGAEVL------ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ G ++ SYI+ +T R+K LK+ + F C C RC DD+ S
Sbjct: 97 GLT---AGVDLRNLNSTHYSYIDLINTTQQRRKLLKDTWYFECHCTRC---DDPDDVLLS 150
Query: 174 AILEGYRCKDDGC----SGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSK--KT 227
+IL K C +D + + TC +C + S E + + + + + K +
Sbjct: 151 SILCPNCPKKRECLCIFGNVPYKDRNTQIITCPKCHNIVSPEYVVEAIAAMRFIDKIVEN 210
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLYHPFS-VNLMQTREKLIKILMELEDWKEALAYCQ 286
+ Q + + + L+K+ S VN+ K+I++L+ L D ++ CQ
Sbjct: 211 REVEKMSREQSI----RFLTDLKKRFSELLSKVNVFLC--KVIQLLIPLID----ISDCQ 260
Query: 287 LTIPVY-------QRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHG 339
L + ++ + +P HP +G+ Y + A+ +A E+++ T G
Sbjct: 261 LLLDLHLETEECVRFCFPYNHPAVGIHYRSIATFFLKCKQPHRALLYYKKAYEVVKFTLG 320
Query: 340 TNSPFMKELILKLEEAQAEA 359
+ P E L++A E
Sbjct: 321 SKHPMTIETDAMLKDASREV 340
>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
Length = 849
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 103/233 (44%), Gaps = 19/233 (8%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
AH IC E + L+ S NHSC PN + + R + +P + ISY
Sbjct: 391 GAH-ICGIEHVVVAQALFVRGSKFNHSCSPNVHVSYVKRTLRAHCTEALPAFCPLEISYG 449
Query: 136 ETAGSTMTRQKA--LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR- 192
G + Q+ L++ YLF C+C C +LG D+ A Y C GC G +L
Sbjct: 450 VQVGESEKSQRKIWLRDHYLFECSCEPC-RLGLVSDLLHIA----YYCCQPGCQGIVLDP 504
Query: 193 DSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA-LTSCG---NHQEVVSTYKMIEK 248
D+ D+ L ++ + NIL +K L L CG N E ++ + +
Sbjct: 505 DASDEDIL-----LPGGSGTVRFLIEHCNILPQKELEDLQICGSRLNGNEAIADGLRLLE 559
Query: 249 LQKKLYHPFSVNLMQTREKLIKILMEL-EDWKEALAYCQLTIPVYQRVYPQFH 300
+K +HPF+ L Q + + ++ E + A+ + + +I + +R+Y + H
Sbjct: 560 NGRKCFHPFNKQLAQMEDIVARLYCEAGQQPAAAIVHARRSIEILERIYGKDH 612
>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
Length = 380
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 68/128 (53%), Gaps = 7/128 (5%)
Query: 24 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF------SKLACNA 77
YS+ E +H++D+ E++ + + + L ++ + +++ CN
Sbjct: 124 YSIAEH-QSHLADMSEEKTEGLKHLCTTLQVYLAEENCDLSRLPSGLDPVSLLARVTCNC 182
Query: 78 HTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
+I + EL+ +G GLYP +S++NH C PN +++FEG+ +RAV+ + E+ ISY +
Sbjct: 183 FSISDGELQDVGVGLYPSMSLLNHDCQPNCIMMFEGKRLTLRAVRVIRSAEELTISYTDI 242
Query: 138 AGSTMTRQ 145
+ R+
Sbjct: 243 LAPSKDRR 250
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 44/86 (51%)
Query: 273 MELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVE 332
+ L+D++ AL Y + Y+ + HP ++ GKL+ FLG E A S T+A
Sbjct: 291 ISLDDYETALEYGNRALGPYKLYFSDPHPSRAVELLRVGKLQHFLGRLEEARGSFTQAYS 350
Query: 333 ILRITHGTNSPFMKELILKLEEAQAE 358
I+ +THG ++ +L E QAE
Sbjct: 351 IMSVTHGAAHALTNQVQRRLSECQAE 376
>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
Length = 256
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 61/106 (57%), Gaps = 10/106 (9%)
Query: 62 SINE-IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVV 118
S++E I E ++K+ T N+E G+ LY + S INHSC PNA +VF + +
Sbjct: 125 SVDELIDELYNKMDEVVGTFLNNE----GSALYAMQSKINHSCTPNAEIVFPKSNHVLAL 180
Query: 119 RAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
RA++ + G E+ ISY++ S +RQK L+E YLF C C RC
Sbjct: 181 RALRDLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERC 226
>gi|403414107|emb|CCM00807.1| predicted protein [Fibroporia radiculosa]
Length = 304
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 9/104 (8%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------E 112
S ++ + S+ NA T+ +S L +G + P ++++NHSC PN V+VF E
Sbjct: 10 SPGDLVDLISRFTTNAFTLTSSSLSAIGVAISPTVALVNHSCDPNVVIVFPRNPSTSHAE 69
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 156
+ V A++ + G E+L +Y++ RQK LKE Y FTC
Sbjct: 70 EPMMTVVAIKPILPGEEILSAYVDVTQPRELRQKELKETYNFTC 113
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
L H+ DI ++ + +I+ I + ++E I+ +KL N+ + N+
Sbjct: 136 LGTHIKDIRDQSASQWERISLSSKAIRAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 195
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG L+P +I NHSC NA + F+G ++A++ + K ++ I+YI+ RQ
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 255
Query: 148 LKEQYLFTCTCPRCIK 163
L+ +Y FTC C +C +
Sbjct: 256 LQSRYYFTCHCSKCTR 271
>gi|302838330|ref|XP_002950723.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
nagariensis]
gi|300263840|gb|EFJ48038.1| hypothetical protein VOLCADRAFT_91197 [Volvox carteri f.
nagariensis]
Length = 610
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI--ETAGSTMTRQKAL 148
GL+P + +NH+C PN V G VVRAVQ +P+G E+L+SY+ + RQ AL
Sbjct: 246 GLWPYFNQLNHACAPNCVHYVVGSTMVVRAVQVIPEGCELLVSYLGRDDLAPRQVRQAAL 305
Query: 149 KEQYLFTCTCPRC 161
K +Y F C C RC
Sbjct: 306 KARYGFICNCTRC 318
>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 676
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)
Query: 54 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
+++Q+ I+E+A + ++ + ++ + +G G+Y +S+ NHSC P + F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VVRA++ +P G E+ +Y I T R++ L+ QY F C C C +
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564
Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
+ IL ++C+ G +L D+++ C +CG + + K + + + +
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623
Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
G HQE + +Y I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644
>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
TFB-10046 SS5]
Length = 531
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 69/153 (45%), Gaps = 13/153 (8%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV---- 117
S ++ + S+ N+ T+ + L P+G + P +++ NHSC PNAV+VF A
Sbjct: 188 SAGDLVDLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVVVFPSAAAPLGDE 247
Query: 118 ----VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
+ A++ + EVL SY++ + R++ LKE Y FTC C C L DD
Sbjct: 248 TLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSACSSL---DDPSVG 304
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFT-CQQCG 205
C GC G G C +CG
Sbjct: 305 DPRYAVWCP-KGCGGVCRIPELGSGIVICLRCG 336
>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 676
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 95/201 (47%), Gaps = 11/201 (5%)
Query: 54 LILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG 113
+++Q+ I+E+A + ++ + ++ + +G G+Y +S+ NHSC P + F G
Sbjct: 450 MLIQFNAHEISELA-----VPKGSNILAKAKSKFIGGGVYSTVSLFNHSCNPGIIRYFIG 504
Query: 114 RLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
VVRA++ +P G E+ +Y I T R++ L+ QY F C C C +
Sbjct: 505 TTMVVRAIRTIPAGEEISENYGPIFTTTPEAERKRKLRVQYWFDCNCEACTAHWPTLEEI 564
Query: 172 ESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTL 228
+ IL ++C+ G +L D+++ C +CG + + K + + + +
Sbjct: 565 DPTILR-FKCETGKECGNVLPIKADTNEFMIRCSKCGKNTNIFKGLKALQDTDAIFRTAS 623
Query: 229 ALTSCGNHQEVVSTYKMIEKL 249
G HQE + +Y I KL
Sbjct: 624 RNLEEGKHQEALKSYLEILKL 644
>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
Length = 420
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 310 GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAGEEICISYLDCCQRERSRHS 369
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLFTC+CP+C L Q DD
Sbjct: 370 RNKILRENYLFTCSCPKC--LAQADD 393
>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
Length = 824
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 1/98 (1%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAV-LVFEGRLAVVRAVQH 123
EI E ++ N HT+ N + G G YP+ NHSC+PN V + + RA +
Sbjct: 509 EIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFNHSCVPNVTWYVDDNGMLAFRATRT 568
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ +G E+ ++Y + + R+ L +QY F C CP+C
Sbjct: 569 ISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCPQC 606
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 69/134 (51%), Gaps = 2/134 (1%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
L H+ DI ++ + +I+ I + ++E I+ +KL N+ + N+
Sbjct: 136 LETHIKDIRDESASHWERISLSSKAIKAYSGTEMSEEVISAMGAKLDLNSFNLTNAVYDR 195
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG L+P +I NHSC NA + F+G ++A++ + K ++ I+YI+ RQ
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKAIRPIQKDEQIFITYIDVTDPYPIRQHN 255
Query: 148 LKEQYLFTCTCPRC 161
L+ +Y FTC C +C
Sbjct: 256 LQSRYYFTCHCSKC 269
>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 236
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 60/105 (57%), Gaps = 14/105 (13%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEG---RLAVVR 119
I+ + ENF K N+E G+GLY + S INHSC PNA + F +LAVV
Sbjct: 122 IDAVYENFEK---GVGDFLNNE----GSGLYELQSTINHSCSPNAEVTFPHSNYQLAVV- 173
Query: 120 AVQHVPKGAEVLISYIET---AGSTMTRQKALKEQYLFTCTCPRC 161
A ++ G E+ ISY++ + S +RQK L+E YLFTC C +C
Sbjct: 174 ATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218
>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
Length = 383
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 273 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 332
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLFTC+CP+C L Q DD
Sbjct: 333 RNKILRENYLFTCSCPKC--LAQADD 356
>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
Length = 393
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 89 GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S INHSC+PNA + + V++A+ + KG E+ ISY++ S +
Sbjct: 286 GSGLYILQSKINHSCVPNAQSTFPYSNDIVVLKALAPIQKGDEICISYLDDCQLERSRHS 345
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
R K L+E Y+F C CP+C DI
Sbjct: 346 RHKTLRENYIFVCQCPKCRAQASDPDI 372
>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
Length = 415
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
L H+ DI ++ + +I+ I + ++E I+ +KL N+ + N+
Sbjct: 104 LGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 163
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG L+P +I NHSC NA + F+G ++A++ + K ++ I+YI+ RQ
Sbjct: 164 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 223
Query: 148 LKEQYLFTCTCPRCIK 163
L+ +Y FTC C +C +
Sbjct: 224 LQSRYYFTCHCSKCSR 239
>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Taeniopygia guttata]
Length = 404
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 294 GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHS 353
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLFTC+CP+C L Q DD
Sbjct: 354 RNKILRENYLFTCSCPKC--LAQADD 377
>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
Length = 324
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 50/93 (53%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+KL N+ N +G L+P ++INHSC NA + F+ ++A + + KG ++
Sbjct: 11 AKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQKGDQI 70
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
ISYI+ R+ L+E+Y F C C +C K
Sbjct: 71 FISYIDATNPVKLRRSELRERYYFDCHCAKCAK 103
>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
Length = 481
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 70/136 (51%), Gaps = 2/136 (1%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE--IAENFSKLACNAHTICNSELRP 87
L H+ DI ++ + +I+ I + ++E I+ +KL N+ + N+
Sbjct: 136 LGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAKLDLNSFNLTNAVYDR 195
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG L+P +I NHSC NA + F+G ++A++ + K ++ I+YI+ RQ
Sbjct: 196 LGVYLHPYAAIFNHSCDHNAAVSFDGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQAN 255
Query: 148 LKEQYLFTCTCPRCIK 163
L+ +Y FTC C +C +
Sbjct: 256 LQSRYYFTCHCSKCSR 271
>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 75/333 (22%), Positives = 138/333 (41%), Gaps = 20/333 (6%)
Query: 30 LVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLG 89
LV H S +D Y + N + PE + + F K++ N ++ NS +G
Sbjct: 111 LVDHGSFLDSGAETFYQEYLNYA--LPPHPEPDV--VKSVFKKISINGFSVSNSMGSAIG 166
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ--HVPKGAE-VLISYIETAGSTMTRQK 146
GL +S NHSC P + + R A++ + P E SYI+ S RQ
Sbjct: 167 HGLCIKLSAANHSCKPLTRVCYRNRTAMLVPIDPYKPPTSLEGACHSYIDELQSVSARQS 226
Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCG- 205
LK++Y F C C C+ D + + +E + C D C+ + + +D C C
Sbjct: 227 ELKKKYNFICICEGCM------DDERNESMEAWACGD--CANGKVPNVEDG--RCASCSW 276
Query: 206 -LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQT 264
+ R E+ + A E I +K + S K++E L H F+V+ +
Sbjct: 277 KMTRDHYELCRTAEESAIAAKPKIENESISLETRKALCEKLMELFDDTL-HEFNVHRVVP 335
Query: 265 REKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAI 324
+ L + ++D A + + + + R + P++ L+ + ++ G+ +
Sbjct: 336 QRLLFFASLAVQDVNGASKHGKALLAMMMRYQGKHDPIVMLKKWQIAQICHANGEYNKVV 395
Query: 325 KSMTEAVEILRITHGTNSPFMKELILKLEEAQA 357
K + E E L+ + +P + +E A++
Sbjct: 396 KLLEEIKEPLKKIYTAGAPISVNIRRLMEMARS 428
>gi|343424810|emb|CBQ68348.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 855
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 64/116 (55%), Gaps = 16/116 (13%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S E+ + + +CN+ T+ +S+L PLG ++ ++++NH+C PNA +VF
Sbjct: 415 SATELLDLVCQFSCNSFTLTDSDLNPLGVCMHASMAMLNHACTPNAAVVFPFGGAAKSGQ 474
Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
E ++ + A++ + G E+LISY++ A + R+ LK++Y F C C C
Sbjct: 475 QKWRDGEDKIMQLVALRAIEPGEELLISYVDIADTFEERRSYLKKRYCFDCRCELC 530
>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
Length = 660
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 99/219 (45%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
IA+LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 IASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y I T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGNTIHINSVRPIEAGLPINENYGPIYTQDERSERQARLKELYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC+ + C+ + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDII--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALTKFIDLIRIMYEVLAPPF 636
>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 323
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+A KL N+ N +G L+P ++INHSC NA + F+ ++A + +
Sbjct: 5 VANVLIKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTFDSDNLYIKATRPIQ 64
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
KG ++ ISYI+ R+ L+E+Y F C C +C K
Sbjct: 65 KGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAK 102
>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 413
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 59/122 (48%), Gaps = 6/122 (4%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ CN I + +G + P S NHSC+PN V +G +++ + KG ++
Sbjct: 248 KVRCNQFGIWTKNDKCIGMAVSPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQIN 307
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-----LGQFDDIQESAILEGYRCKDDGC 186
ISY+ ST R+ LK Y F C CPRC G+ +D ++ I + Y C C
Sbjct: 308 ISYLALDKSTKRRRDYLKFGYYFHCQCPRCNSTDIDPTGKLEDSLDNWISKFY-CHQKKC 366
Query: 187 SG 188
+G
Sbjct: 367 TG 368
>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
Length = 716
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 54/112 (48%), Gaps = 7/112 (6%)
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-- 139
N E +G +Y V S+ NHSC PN F R +RA +HV G + +SY G
Sbjct: 452 NMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQW 511
Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
RQK LK++Y F C C C +L D +L +RC + C G +L
Sbjct: 512 DCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDCFGTVL 558
>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
Length = 513
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)
Query: 81 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAG 139
C + LR GLYP+ ++ NH C+PN F+ + + V A + G E+ +SY +
Sbjct: 213 CTTSLR----GLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLPIAAGEEITMSYTDLLW 268
Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
T +R++ LK F+C C RC +F Q A+L C D CSG LL S+ F
Sbjct: 269 DTSSRRQFLKATKHFSCNCSRCSDPSEFGS-QLGALL----CAKDDCSGHLL-PSNPLNF 322
Query: 200 ----TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
C +C + + ++I+ I S++N L + +T E+V K IE KL
Sbjct: 323 ISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTP----HEIV---KFIETALSKLVP 375
Query: 256 PFSVNLMQTREKLI 269
+ L+ + ++I
Sbjct: 376 TSNFILIDVKFRII 389
>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
Length = 729
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 54/91 (59%), Gaps = 7/91 (7%)
Query: 76 NAHTICNSELRPLGT--GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
A + NSE PL + G++P +++INHSC PNAV V VVRA + +P+G EVL++
Sbjct: 601 GAAAVANSE--PLTSRMGVWPEVALINHSCGPNAVAVMLYDRLVVRATRRIPRGREVLLN 658
Query: 134 YIETAGS---TMTRQKALKEQYLFTCTCPRC 161
++ + G+ R+ L + Y F C C RC
Sbjct: 659 WLGSQGALAPARERRAQLSQMYDFACRCARC 689
>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
Length = 429
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 98/223 (43%), Gaps = 15/223 (6%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
P I F K++ N+ ++ NS +G L +S NHSC P+ + + GR A++
Sbjct: 136 PHSHPETIKSIFKKVSVNSFSLSNSTGNSIGIALCVKLSAANHSCKPSTRVCYRGRTAML 195
Query: 119 RAV-QHVPKGAE-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
V +P E SYI+ + TRQ LK++Y F C C C D + + +
Sbjct: 196 VPVDDRLPTSLEGACHSYIDELQTLSTRQATLKKKYKFDCACEGCT------DDERNGRM 249
Query: 177 EGYRCKDDGCSGFLLRDSDDKGFTCQQCG--LVRSKEEIKKIASEVNILSKKTLALTSCG 234
E + C + C +R+ + G +C CG L R + E+ + A E I S+ L S
Sbjct: 250 EAWAC--EICKTGWIRNVE--GASCNPCGYVLTRDQYELCRTAEEAAIASRPKLENDSIP 305
Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELED 277
K++E Q L H ++V+ + L + + D
Sbjct: 306 LETRRHLCEKLLELFQDTL-HAYNVHRIPVLRCLFVATLAIRD 347
>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 619
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G G+YP ++++NHSC P + F G +VRAV+ + G E+ +Y I T R+
Sbjct: 422 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMIVRAVRTINAGEEISENYGPIFTTMPESERK 481
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDD-GCSGFLLRDSDDKGFT--CQ 202
+ L+ QY F C C C D + IL ++C+ C L+ SD F C
Sbjct: 482 RKLRVQYWFDCNCEACSGHWPLLDELDPTILR-FKCETGPSCGNVLMVKSDTNEFMIGCA 540
Query: 203 QCG 205
+CG
Sbjct: 541 KCG 543
>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 493
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 3/104 (2%)
Query: 61 ISINEIAENFSKLACNAHT---ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
+S I +++ A NA T + L P GL+P+ ++INHSC+ NAV V + V
Sbjct: 188 LSYTNIEQSWRTTALNAPTPLDLSRLSLPPRLLGLFPLAAMINHSCVGNAVRVLVDDVMV 247
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
V A +P G E++ SY R L+ Y F C CPRC
Sbjct: 248 VHATTDLPAGTELVWSYGPPTTPFAQRNTRLRAHYGFVCDCPRC 291
>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
Length = 505
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 61/118 (51%), Gaps = 3/118 (2%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
I + F K+ N I + E P G L V S +NHSC PNA + FEG +R+++ +
Sbjct: 193 IQDLFCKVLTNVMCINHQEGSP-GITLDLVGSFVNHSCDPNAFVFFEGSQLRMRSLKPIN 251
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
G E+ ++Y E + R++ L Y F C C RC K + ++ E + EG+ +D
Sbjct: 252 AGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKK--EHKELHELSFREGFSIRD 307
>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
[Anolis carolinensis]
Length = 163
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F L + A++ + G E+ ISY++ S +
Sbjct: 45 GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPGEEICISYLDCCQRERSRHS 104
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L Q DD
Sbjct: 105 RHKVLRENYLFVCSCPKC--LAQADD 128
>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
antarctica T-34]
Length = 882
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 65/119 (54%), Gaps = 17/119 (14%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S E+ + + + N+ T+ +S+L PLG ++ ++++NHSC PNA +VF
Sbjct: 463 SATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAAVVFPFGGAAKSGQ 522
Query: 112 -------EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+ ++ + A++ + G E+LISY++ A + R+ ALK++Y F C C C K
Sbjct: 523 QNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKRYCFECRCELCRK 581
>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
Length = 354
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 55/98 (56%)
Query: 250 QKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT 309
QK + H F++ ++T + + LE W++A Y + +P Y Y + HPL+GL Y T
Sbjct: 234 QKDILHSFNIQHIRTLDTAFLAAVNLEYWEDAELYSKQLLPGYLLYYGEVHPLIGLLYLT 293
Query: 310 CGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
GK++ L + A++ + +A +L ITHG P M+E
Sbjct: 294 MGKIQLHLRKLKQALEILKKASAVLTITHGDKHPIMRE 331
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 55/99 (55%), Gaps = 3/99 (3%)
Query: 30 LVAHMSDI--DEKQLLLYAQIAN-LVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
L++H SDI D K++ + + L+ + + + E+ + ++ N+ I + ++
Sbjct: 117 LMSHYSDIKKDPKRMEHFISLCGVLLEFLGEALMPNTAELIGIYGRICINSFNILDLDMN 176
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+G G+Y S+I+HSC PNAV+ FEG +VR + +P
Sbjct: 177 SIGVGIYLGASVIDHSCKPNAVVTFEGTTIIVRTLIDLP 215
>gi|66800595|ref|XP_629223.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
gi|60462643|gb|EAL60846.1| hypothetical protein DDB_G0293294 [Dictyostelium discoideum AX4]
Length = 508
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVL---VFEGRLAV---VRAVQHVPKGAEVLISYIETAGSTM 142
G GL+P+ S +NHSC PN + + +G +V V+A +++P G+E+L SY + S
Sbjct: 428 GVGLFPIFSCMNHSCFPNVEISNEIIDGVTSVRMVVKAKKNIPAGSEILHSYCDETLSNK 487
Query: 143 TRQKALKEQYLFTCTCPRCIK 163
R+ L QY F CTC +C K
Sbjct: 488 ERKDILFSQYGFKCTCNKCSK 508
>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
Length = 896
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ NA +++L +G L P +++ NHSC+PNA++ F GR A++RA + + E+
Sbjct: 535 KIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGRKAILRAEKPIKVDEEIE 594
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
ISY + R+ AL Y F C CPRC K
Sbjct: 595 ISYTDYTFPRSKRKHALAP-YFFDCQCPRCEK 625
>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 697
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 51/202 (25%)
Query: 44 LYAQIANLVNLILQ--WPEISINEIAENFSKLACNAHTICNSELRP-------------- 87
+ Q+ +L+N++ + W I + +L CN H I SELR
Sbjct: 353 FFQQLGDLINVMSKENWSIYCGAVIFRHIGQLVCNGHAI--SELRGSFASENNCLEADSF 410
Query: 88 ---------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
+ TG++P IS+ NHSC PN F V A + +P G E+
Sbjct: 411 NIKAGFLHRYFESTRVFTGIFPQISMFNHSCEPNIRNTFNKNTLTVYAAKDIPVGGEIFN 470
Query: 133 SYIETAGSTMT------RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
Y G R+ AL++QY F C C RC D+ ES E Y+C C
Sbjct: 471 CY----GPNFKLMCKDERKSALRQQYGFDCKCTRCAMNN--DEAYES--FEHYKCPFAKC 522
Query: 187 SG-FLLRDSDD---KGFTCQQC 204
S F+L+++ D K C C
Sbjct: 523 SKYFMLKENADPFEKDIKCPMC 544
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 87/186 (46%), Gaps = 17/186 (9%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 148
GLYP S++NH C+ N VF+ +R A + +P G ++ +Y + +T+ R+ L
Sbjct: 225 GLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTLDRRLHL 284
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCG 205
K F C C RC D + +L +C GC G +L + + C CG
Sbjct: 285 KSTKHFWCQCSRCA-----DPRELGTLLSAVKCT--GCGGAVLSQNPLYQMSNWECSDCG 337
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
V+ E++K++ + K+ L N E++ + I K + HP S ++++ +
Sbjct: 338 SVQKVEQVKRVHDSARMELKQIAQLAR--NRPELLEDF--IRKYSGAI-HPDSCHVIEAK 392
Query: 266 EKLIKI 271
L+++
Sbjct: 393 YALVQL 398
>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
B]
Length = 604
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 74/163 (45%), Gaps = 26/163 (15%)
Query: 17 PSTTTDNYSLVEALVAHMSDIDEKQLLLYA--QIANLVNLILQWPEISINEIAENFSKLA 74
PS + + +LV ++ + +L + NLV+LI S+ A
Sbjct: 219 PSAVESHTHMAHSLVRYLGIANPAELAPFGLNSAGNLVDLI---------------SRFA 263
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAVQHVP 125
N T+ + L P+G + P I++ NHSC PNA +VF + L + A++ +
Sbjct: 264 TNTFTLTSFSLTPIGICICPSIALANHSCEPNAAIVFPRSSSLPQAQEPLMHLMAIRDIA 323
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
+V+ +Y++ RQKAL+E Y FTC C C K D
Sbjct: 324 PNEQVVAAYVDVTLPRELRQKALQETYSFTCKCKLCTKRMAVD 366
>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 541
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
+ ++N AE K+ N+ +++ GT L P+++++NHSC+PNAV++F R A +R
Sbjct: 173 DANLNLAAEILCKIQTNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLR 232
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
A V G ++ ISYI+ R++ L E Y F C C RC K
Sbjct: 233 AETPVKAGEDITISYIDYTKPLSFRRQDL-ELYHFQCGCLRCEK 275
>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
Length = 816
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 9/125 (7%)
Query: 69 NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGA 128
NFS A A T N E +G +Y V S+ NHSC PN F R +RA +HV G
Sbjct: 62 NFSP-AGGAFT-SNMEQVRVGQAIYSVASLFNHSCQPNIHAYFLSRTLFLRATEHVAVGC 119
Query: 129 EVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGC 186
+ +SY G RQK LK++Y F C C C +L D +L +RC + C
Sbjct: 120 PLELSYGPQVGQWDCKDRQKFLKDEYSFRCECSGCSELNVSD-----LVLNAFRCVNPDC 174
Query: 187 SGFLL 191
G +L
Sbjct: 175 FGTVL 179
>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
Length = 661
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 121/267 (45%), Gaps = 43/267 (16%)
Query: 21 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEIS--INE--------IAENF 70
T++Y + LV+H + ++ +L ++A +L+ EIS N+ I +N
Sbjct: 374 TEDYRNIYHLVSHENKRTKQDILHRTEMAIF---LLKLLEISGYFNDDAASFGCLILKNL 430
Query: 71 SKLACNAHTICNSE-LRP---------------LGTGLYPVISIINHSCLPNAVLVFEGR 114
L NAH + + L+P +G +YP +++ NHSC P V F G
Sbjct: 431 QVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPTLALFNHSCEPGIVRYFCGS 490
Query: 115 LAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQ 171
VV AV+++ KG EV +Y I T RQ LKEQY F C C C + +++D+
Sbjct: 491 RIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQSQLKEQYWFDCKCLPCEQNWPKYEDMT 550
Query: 172 ESAILEGYRCKDD-GCSGFLLRDSDDKGFTCQQCGLVRSKEEI-KKIAS--EVNILSKKT 227
E+ + ++C D CS + D F QCGL + I K + S + L K
Sbjct: 551 ENYLR--FKCDSDQPCSNVIPVPYDCMEFMV-QCGLCQQYTNILKGLKSLQDTETLYKLG 607
Query: 228 LALTSCGNHQEVVSTYKMIEKLQKKLY 254
A G + E + K IE L KLY
Sbjct: 608 RAAMGEGKYGEAIK--KFIETL--KLY 630
>gi|307182513|gb|EFN69720.1| Histone-lysine N-methyltransferase ASHR1 [Camponotus floridanus]
Length = 132
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 55/100 (55%)
Query: 248 KLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQY 307
K QK + H F++ ++T E M L+ W++A Y + IP Y Y + HPL+GL Y
Sbjct: 12 KKQKNILHRFNIQHIRTLEAAHIAAMNLKYWEDAELYGKELIPGYLLYYGEIHPLIGLLY 71
Query: 308 YTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKE 347
GK++ L + A+K + +A E+L ITHG M+E
Sbjct: 72 LMTGKIQLHLNKPKEALKVLEKASEVLMITHGDKHSLMRE 111
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 51/87 (58%)
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
CN+ I +++L + G+Y S++NHSC N ++VF+ R +R V+ V +G E I Y
Sbjct: 181 CNSFGIFDNDLIAISDGIYLRASMVNHSCDYNCIVVFDERKLQLRTVKDVKEGEECTIGY 240
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
++ R+ L+E+Y FTC C +C
Sbjct: 241 VDVIHPAKERRAELEEKYHFTCKCVKC 267
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 40/67 (59%)
Query: 97 SIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTC 156
S++NHSC PN V +GR + V+ V +G E I+Y++ RQ LKE+Y FTC
Sbjct: 364 SMLNHSCDPNCAWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYHFTC 423
Query: 157 TCPRCIK 163
C +CI+
Sbjct: 424 KCVKCIE 430
>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 501
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 66/135 (48%), Gaps = 11/135 (8%)
Query: 91 GLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+PV ++ NH C+PN ++ EG L V AV + +G E+ +SY + T R+ L
Sbjct: 208 ALFPVGALTNHQCVPNTRHIVNAEGELLVYAAVP-IAEGQEITMSYADVLWDTQMRRHFL 266
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCG 205
F C CPRC D + ++L C D C G LL ++ + +TC +CG
Sbjct: 267 LATKHFACQCPRCT-----DVTECGSLLGALSCAIDNCQGNLLPNTPLSLNCPWTCNECG 321
Query: 206 LVRSKEEIKKIASEV 220
L+ +I I + +
Sbjct: 322 LIIKGRQISSIRAGI 336
>gi|302834810|ref|XP_002948967.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
nagariensis]
gi|300265712|gb|EFJ49902.1| hypothetical protein VOLCADRAFT_89375 [Volvox carteri f.
nagariensis]
Length = 679
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 52/104 (50%), Gaps = 2/104 (1%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
++ N E+ A +A ++EL GL+P S+INHSC P A G + VVRA
Sbjct: 218 VTFNAYGESKPDPAVSAVRSLSAELLSGCVGLWPPFSLINHSCAPVASYGLVGDVMVVRA 277
Query: 121 VQHVPKGAEVLISYI--ETAGSTMTRQKALKEQYLFTCTCPRCI 162
+P G +V ISY G R+ L++ Y F C C RC+
Sbjct: 278 AADLPAGQQVTISYFGRRALGPLELRRAYLRQHYGFVCACERCV 321
>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 459
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 59/106 (55%), Gaps = 11/106 (10%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF----EGRLAVVRA 120
I E FS+ + N + I +S L +G G++P+ S + NHSCLPNA++ + EG VV+A
Sbjct: 189 IKEIFSRFS-NNNFIVHSHLSAVGHGIFPLASRLFNHSCLPNAIVTYSFTSEGIQMVVKA 247
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC-----PRC 161
+ + +G E+ I Y + A RQ + Y F CTC PRC
Sbjct: 248 LTPIKQGEEITIPYFDPALPYHQRQAICRYSYGFECTCSVCMFPRC 293
>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
Length = 1064
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 6/123 (4%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ CN I + + +G P S NHSC PN V + V +A+ + KG +
Sbjct: 910 KIRCNQFGIWSKRDKCIGVSCTPAASYFNHSCCPNLADVRGTTVVVFKALHFISKGTPIS 969
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES-AILEGYRCKDDGCSGFL 190
ISY++ T RQ LK Y FTC C RC D ES + + C CSG
Sbjct: 970 ISYLDLDQPTHERQSYLKTFYYFTCQCLRCK-----DQTDESDNWISRFYCDRFKCSGTY 1024
Query: 191 LRD 193
D
Sbjct: 1025 FLD 1027
>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
Length = 697
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 70/159 (44%), Gaps = 15/159 (9%)
Query: 15 VIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEISINEIAE 68
V P +D LV L D K Y + LIL W E ++ A
Sbjct: 487 VRPPKWSDTARLVTNLSVLSKDSRSKFRCAYRR---FTKLILPWLGEGGSTEANLTVTAT 543
Query: 69 NFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQH 123
F++L CN+ + N++ +G LYP S NHSC PN V + G LA A++
Sbjct: 544 FFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALRE 603
Query: 124 VPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
+ KG + I Y++ ST R++ L Y F C C RC
Sbjct: 604 IRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+P+ +++NH C+PN+ FE + V+RA +P+G E+ +Y + + R L
Sbjct: 208 ALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHLFL 267
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
K + FTC C RC D ++ A + C+D CSG +
Sbjct: 268 KMKKNFTCKCSRC-----SDPTEKGAFISAVYCRDTNCSGLAV 305
>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 278
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 40/76 (52%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
+YP +S NH C PN G + AV+ +P G E+ ISYI S RQK L E
Sbjct: 69 AVYPKVSRFNHDCRPNVHYRLSGLIHTTVAVRDIPAGTELTISYIYVRKSRAERQKELSE 128
Query: 151 QYLFTCTCPRCIKLGQ 166
+ FTCTC +C K +
Sbjct: 129 SWNFTCTCEQCSKSAE 144
>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 483
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/303 (21%), Positives = 125/303 (41%), Gaps = 35/303 (11%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
+ ++ +E + + C I E+R GLYP ++++NHSC+ N ++ +
Sbjct: 152 LKLDRFSEKYIQTICGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDY 211
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
R+ + R +P G E+ SY + T+ R++ L E F C CPRC D +
Sbjct: 212 RIRL-RTTLKIPAGGELYASYTHSLLPTILRREHLLEGKHFACACPRC-----SDPTELG 265
Query: 174 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+ +C D+G L + + C C + + ++KI + A++
Sbjct: 266 THMSSLKCNKCDNGIVLPLDSLDSESTWKCTHCDFSTNGQAVRKILRIIQAQVDAAEAIS 325
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
+ + + + + K + + HP L R L ++ ++++ +
Sbjct: 326 G-ADGADAIYKRETVMKKYRLVLHPHHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHK 384
Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTENA-IKS-MTEAVE 332
+ C+L + V V P + + G+ Y K +W G + A +KS M EA
Sbjct: 385 VDMCRLLLQVLDVVEPGYSRVRGMTLYELHAPLLFLAKDQWNAGVIDEAKLKSKMIEAAN 444
Query: 333 ILR 335
IL+
Sbjct: 445 ILK 447
>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
Length = 392
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 127
++K+A A N+E G+GLY + S INHSC+PNA F + V++A+ + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326
Query: 128 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 161
E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|281347298|gb|EFB22882.1| hypothetical protein PANDA_013199 [Ailuropoda melanoleuca]
Length = 1019
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 43/59 (72%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
S++ CNA TICN+E++ +G GLYP +S++NHSC PN +VF G ++RAV+ + G E
Sbjct: 130 SQVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEE 188
>gi|452846662|gb|EME48594.1| hypothetical protein DOTSEDRAFT_19117 [Dothistroma septosporum
NZE10]
Length = 199
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 16/91 (17%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
S L CN+ + PLG + +IS +NHSC+PNA +VFE E+
Sbjct: 39 SLLLCNSLILMTPANDPLGIVVELLISNVNHSCVPNACIVFE----------------EI 82
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+SY++ G RQ AL +Y FTCTCP+C
Sbjct: 83 FVSYVDQHGLFHARQAALWHRYYFTCTCPKC 113
>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 490
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV--RAVQHVPKGAEVLISYIETAGS 140
S+L G+ L+P+++ NH C PNAV+ + V R+++ +P EV ISYI+ S
Sbjct: 406 SDLCISGSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVS 465
Query: 141 TMTRQKALKEQYLFTCTCPRCI 162
R+K L+ +YLF C C RC+
Sbjct: 466 CSQRRKELQHKYLFNCKCTRCL 487
>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
Length = 391
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 9/97 (9%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKG 127
++K+A A N+E G+GLY + S INHSC+PNA F + V++A+ + +G
Sbjct: 271 YAKVAEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQG 326
Query: 128 AEVLISYIETA---GSTMTRQKALKEQYLFTCTCPRC 161
E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 327 EEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 662
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TGL+ S +NHSC+PN+V F G L VVRA+QH+ G E+ +Y + G RQ L+
Sbjct: 515 TGLWARASYLNHSCMPNSVKDFAGDLIVVRALQHIRTGDEITHAYQDN-GDYDARQALLQ 573
Query: 150 EQYLFTCTCPRCI 162
+ FTC C C+
Sbjct: 574 TTWGFTCRCKLCV 586
>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 420
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
RQK L+E YLF C+CP+C L Q DD
Sbjct: 363 RQKILRENYLFVCSCPKC--LAQADD 386
>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
Length = 816
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 83/363 (22%), Positives = 148/363 (40%), Gaps = 85/363 (23%)
Query: 50 NLVNLILQWPEISINEIA--------------ENFSKLACNAHTICNSELRPLGTGLYPV 95
+L L+L +I +N IA + FS + +C+ E + +Y
Sbjct: 445 SLSQLVLLICQIKVNSIAIVHMKSMDGGKVLTKGFSGFS--GDVMCSVEQVRVAQAIYMS 502
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGST--MTRQKALKEQYL 153
S NHSC PN F R ++R+ +++ G+ + +SY G RQK+L+E Y
Sbjct: 503 GSFFNHSCQPNIHAYFHSRTLILRSTEYIKAGSPIELSYGPQVGEMDLPERQKSLRENYY 562
Query: 154 FTCT------------------CPRCIKLGQF---------DDIQESAILEGYRC----- 181
F+C CP+ LG ++ +I E + C
Sbjct: 563 FSCGCSSCSVLSLSDLVMNSFCCPQSNCLGAVSELIHHRRKENFVHVSIGESHVCTLSLP 622
Query: 182 ------KDDGCSGFLLRDSD-----DKGFTCQQCGLVRSKEEIK-------KIASEVNIL 223
+D G L +SD D GF C C RS+ ++ + S++N L
Sbjct: 623 DVSKFDEDIVKVGKLFFNSDTMLNIDPGF-CMSC---RSQLDLSSAVAMSDRATSKINRL 678
Query: 224 SKKTLALTSCGNHQEVVSTYKMIEKLQ-----KKLYHPFSVNLMQTREKLIKILMELEDW 278
++ L S N EV+ + E LQ +KL HP+S L Q + + + ++ D
Sbjct: 679 NE----LPSPDNVPEVL----IAEALQSLEHIEKLRHPYSKALAQAHDTIAEAFAKVGDQ 730
Query: 279 KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITH 338
++A +C+ + + +++Y H ++ + +E +GD +A + A I + +
Sbjct: 731 EQARKHCEASFKILEKLYHPRHIIIAHELIKLVSIELSMGDGASAAAAFARADAIFSLYY 790
Query: 339 GTN 341
G +
Sbjct: 791 GPD 793
>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 305 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 364
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
RQK L+E YLF C+CP+C L Q DD
Sbjct: 365 RQKILRENYLFVCSCPKC--LAQADD 388
>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 72/165 (43%), Gaps = 15/165 (9%)
Query: 9 KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEIS 62
K + V P +D LV L D K Y + LIL W E +
Sbjct: 481 KEETHAVRPPRWSDTARLVTNLSMLSKDSRSKFRCAYRR---FTKLILPWLGEGGSTEAN 537
Query: 63 INEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAV 117
+ A F++L CN+ + N++ +G LYP S NHSC PN V + G LA
Sbjct: 538 LTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAA 597
Query: 118 VRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
A++ + KG + I Y++ ST R++ L Y F C C RC
Sbjct: 598 FHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
Length = 448
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+ L
Sbjct: 228 ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLG 287
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
F C C RC D + L C++ GC G ++
Sbjct: 288 MTKHFICKCVRCQ-----DPTENGTYLSALFCREQGCRGLVI 324
>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
NZE10]
Length = 549
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 40/74 (54%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG G P++ NHSC PNA +F ++RA++ + KG E+ + Y + RQ
Sbjct: 223 LGMGFDPIMCSANHSCDPNAAALFNQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAE 282
Query: 148 LKEQYLFTCTCPRC 161
LK Y FTC C RC
Sbjct: 283 LKGYYFFTCHCARC 296
>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
Length = 559
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
++ NA +++ G L P ++ +NHSC+PNA + F+ R A +RA + + +G E+
Sbjct: 199 QIQTNAFNRLDADTGMSGIFLDPALARVNHSCVPNAFIGFDKRTATLRAERPIKEGEEIT 258
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
ISYI R++ L+ Y F C CPRC+
Sbjct: 259 ISYIANDKPRSIRREGLR-LYYFECDCPRCV 288
>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
Length = 409
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 59/104 (56%), Gaps = 8/104 (7%)
Query: 67 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVV 118
A+ F+ L C N+++I + + +G LY +IS NHSC PNA ++FE R A +
Sbjct: 300 AQMFNYLVCILLTNSYSIEDKNGQEIGAALYSLISCCNHSCAPNAQVIFEDSEDAREATL 359
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
++ +G E+ ISYI G +++ ++ Q+ FTC C RC+
Sbjct: 360 VLLRPCVQGEELYISYITDLGRSVSERRRELAQWCFTCQCTRCL 403
>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
C-169]
Length = 329
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 61/149 (40%), Gaps = 18/149 (12%)
Query: 31 VAHMSDIDEKQLLLYAQIANLVNLILQWP---EISINEIAENFSKLACNAHTIC-----N 82
AH D L A L + Q +S E+A+ +K N + I
Sbjct: 77 AAHSRSKDSSDLFRRLSTATLSAQLPQLAGSRPLSPEEVADALAKEELNGYGIMAPSGPE 136
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLI 132
E R GT LYP S++NH CLPN V E +A+ ++P G E
Sbjct: 137 GERRIRGTALYPKASLLNHECLPNVARVDNFDEADVNAPENTAVHFKALHNLPAGEEFTQ 196
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRC 161
SY S RQ+ ++QY F C CPRC
Sbjct: 197 SYFPMHTSYHVRQQRCQDQYGFACNCPRC 225
>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 668
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 89/183 (48%), Gaps = 23/183 (12%)
Query: 66 IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRL 115
I + + CNA+ I + E R +G +YP +S+INHSC PN V + +
Sbjct: 442 ILHHLQAINCNAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI 501
Query: 116 AVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
VVR ++ V KG E+L Y + + ++R + L ++Y F CTC CI+ Q+ +
Sbjct: 502 VVVRTLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACIQNWQYPLPE-- 559
Query: 174 AILEGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLA 229
+ Y+CK CS + L + D + +QC ++ +KKI +++ KK L
Sbjct: 560 --IMKYKCK--TCSEIIGIIALNEKDMQNVLNKQCNCNKTI-NLKKIKNQLQNSIKKRLN 614
Query: 230 LTS 232
S
Sbjct: 615 AIS 617
>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 387
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYI---ETAGST 141
G+ L+ S INHSC PNA VF+G RL +R ++ G E+ ISYI E S
Sbjct: 289 GSALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIE---PGDEINISYIAPCELDHSR 345
Query: 142 MTRQKALKEQYLFTCTCPRC 161
TRQK L+E Y+FTC C +C
Sbjct: 346 HTRQKYLQENYVFTCRCIKC 365
>gi|307187924|gb|EFN72837.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 651
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 62/124 (50%), Gaps = 8/124 (6%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G G+YP ++++NHSC P + F G VVRA + + G E+ +Y I T R+
Sbjct: 454 IGGGVYPTVAMLNHSCNPGVIRYFIGTTMVVRAARTIRAGEEISENYGPIFTTTPENERK 513
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDD-GCSGFLLRDSDDKGFT--C 201
+ L+ QY F C C C G + ++E + ++C+ C LL SD F C
Sbjct: 514 RRLRVQYWFDCNCEACS--GHWPLLEELDPTILRFKCETGLSCGNVLLVRSDTNEFMIGC 571
Query: 202 QQCG 205
+CG
Sbjct: 572 AKCG 575
>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
Length = 697
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 2/93 (2%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAE 129
+ L CN+ + N++ +G LYP S NHSC PN V + G A A++ +PKG
Sbjct: 550 AALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEP 609
Query: 130 VLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
+ I Y++ ST R++ L Y F C C RC
Sbjct: 610 LTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 513
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/316 (22%), Positives = 133/316 (42%), Gaps = 41/316 (12%)
Query: 50 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 105
N+V+ I + ++ ++ +E AC I E+R LYP ++++NHSC+
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230
Query: 106 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
N ++ + R+ + R +P G E+ SY + T+ R++ L E F C CPRC
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRC- 288
Query: 163 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
D + + +C D+G L + + C C S ++K+ +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPESSWKCTHCEFTTSGSAVRKV---L 341
Query: 221 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
+I+ + A+ S + + + + + K + + HP L R L ++ ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401
Query: 279 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGD-T 320
+ + C+L + V + P + + G+ Y K +W G
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFIAKTQWNAGVID 461
Query: 321 ENAIKS-MTEAVEILR 335
E A+KS M EA IL+
Sbjct: 462 EAALKSKMIEAANILK 477
>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
Length = 421
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F ++ A++ + G E+ ISY++ S +
Sbjct: 303 GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHS 362
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
RQK L+E YLF C+CP+C+ DI
Sbjct: 363 RQKILRENYLFMCSCPKCLAQADEPDI 389
>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 514
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 126/308 (40%), Gaps = 43/308 (13%)
Query: 60 EISINEIAENFSKLACNAHTICNSELR-PLGTG---LYPVISIINHSCLPN---AVLVFE 112
++ ++ +E + AC I E+R G G LYP ++++NHSC+ N ++ +
Sbjct: 182 QLKLDRFSEELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTD 241
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
R+ + R HVP E+ SY + TM R++ L E F C C RC D +
Sbjct: 242 YRIRL-RTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC-----SDPTEL 295
Query: 173 SAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKK----IASEVNILSKK 226
+ +C D+G L D + C C S ++K I +EV+ + +
Sbjct: 296 GTHMSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQ- 354
Query: 227 TLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW-------- 278
S + + + + + K + + HP L R L ++ ++++
Sbjct: 355 ----ISGADGADAIQERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDV 410
Query: 279 --KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDT--------ENAIKS-M 327
+ + C+ + V + P + + G+ Y F+ T E A+KS M
Sbjct: 411 VLEHKIDMCRFLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTLWNAGVIDEAALKSKM 470
Query: 328 TEAVEILR 335
EA IL+
Sbjct: 471 IEASNILK 478
>gi|402590196|gb|EJW84127.1| hypothetical protein WUBG_04964 [Wuchereria bancrofti]
Length = 283
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 132/288 (45%), Gaps = 32/288 (11%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
S E+ + F K+ N I +S L G G+Y +S ++HSC P+A ++F G A++R++
Sbjct: 3 SSTEMFDIFCKIVINTVVITDSCLSRTGIGVYLGLSALDHSCKPDAFIIFNGTKAILRSL 62
Query: 122 QH--VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY 179
+ I Y + T TR K ++ Q+ F C C C + + D ++ L
Sbjct: 63 SKNITEYNNNLRIPYCDLLELTSTRCKYMQLQHNFICNCEICQNV-ELD--RQKCSLRCT 119
Query: 180 RCKDDGCSGFLLRDSDDKGFT-CQQCGLVR--SKEEIKKIASEVNILSKKTLALTSCGNH 236
+C D C D++ T C+ CG + + + ++K+ ++ T ++ N
Sbjct: 120 KCTDGFCP---YSPEDEQAETRCKVCGEISLFNFDHLQKLYQQL------TTDDSTEKNL 170
Query: 237 QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
E++ Y E +++ P++V L + EK++ M+ + EA + Y
Sbjct: 171 NELIDLYCESE----EVFSPYNVPLCKFAEKIMISAMKHHKYDEAA----------KTFY 216
Query: 297 PQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPF 344
P+ P L + KL L E + + EA++++R ++G+ S F
Sbjct: 217 PKGLPPLPTRMLEYAKL-LMLQHDEASFPILREALKMIRESYGSESNF 263
>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
Length = 369
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 84/195 (43%), Gaps = 18/195 (9%)
Query: 29 ALVAHMSDI--DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
+LV H DI DE+++ + +I + + I + F K + N+H+I +
Sbjct: 137 SLVDHKKDIREDEEKMKHFEKIFVKMGNFRKDDMIEKGHFFDVFCKASINSHSIHTNAGN 196
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-----KGAEVLISYIETAGST 141
+G L +S NHSC P +VF+G +R + VP + ISYI+ S
Sbjct: 197 EVGMALDLGVSKYNHSCRPTCSMVFDGYRVCLRPL--VPGVDAENTEQAFISYIDVGRSK 254
Query: 142 MTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK--GF 199
R++ LK ++ F C C RC+ E L +C + C +L ++
Sbjct: 255 FVRRRDLKSRWYFDCECTRCMD-------PEDDSLTAIKCANPACDAPILTSETEEPMNI 307
Query: 200 TCQQCGLVRSKEEIK 214
C +C V + +K
Sbjct: 308 ACDKCKTVVEESTVK 322
>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 423
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 314 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 372
Query: 143 TRQKALKEQYLFTCTCPRC 161
TR+K L YLF C CP+C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391
>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 72/165 (43%), Gaps = 15/165 (9%)
Query: 9 KLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQW------PEIS 62
K + V P +D LV L D K Y + LIL W E +
Sbjct: 481 KEETHAVRPPRWSDTARLVTNLSVLSKDSRSKFRCAYRR---FTKLILPWLGEGGSIEAN 537
Query: 63 INEIAENFSKLA----CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAV 117
+ A F++L CN+ + N++ +G LYP S NHSC PN V + G LA
Sbjct: 538 LTVTATFFNRLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAA 597
Query: 118 VRAVQHVPKGAEVLISYIETA-GSTMTRQKALKEQYLFTCTCPRC 161
A++ + KG + I Y++ ST R++ L Y F C C RC
Sbjct: 598 FHALREIRKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 429
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 320 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 378
Query: 143 TRQKALKEQYLFTCTCPRC 161
TR+K L YLF C CP+C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397
>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
Length = 393
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQGEEICISYLDECMLERSRHS 345
Query: 144 RQKALKEQYLFTCTCPRC 161
R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363
>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
TFB-10046 SS5]
Length = 451
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 49/75 (65%), Gaps = 3/75 (4%)
Query: 90 TGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETA-GSTMTRQK 146
+G++P S ++HSC PNA+ ++ G + + +RA +H+ +GAEV ISY+ T RQ+
Sbjct: 244 SGVFPTFSRVSHSCCPNAIPRWDSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQE 303
Query: 147 ALKEQYLFTCTCPRC 161
L++ Y F CTCP C
Sbjct: 304 FLRDGYHFECTCPAC 318
>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 697
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G G++P +++ NHSC P V F G VVRA++ +P GAE+ +Y I T R+
Sbjct: 499 IGGGVFPTVALFNHSCNPGVVRYFIGNTMVVRAIKTIPAGAEISENYGPIFTEEEENDRK 558
Query: 146 KALKEQYLFTCTCPRC 161
+ L+ QY F C C C
Sbjct: 559 RKLRLQYWFDCDCEAC 574
>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
max]
Length = 419
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 55/120 (45%), Gaps = 15/120 (12%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLA-----------VVRAVQHVPKGAEVLISYIETAG 139
G+YP S NH CLPNA F+ A ++R + VP+G E+ +SY
Sbjct: 231 GIYPYASFFNHDCLPNACR-FDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNE 289
Query: 140 STMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD 196
+RQK L E Y FTC C RC D ++++A E +D C D+DD
Sbjct: 290 KYSSRQKRLIEDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDD 349
>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 673
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 22/180 (12%)
Query: 69 NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 118
N + CNA+ I + E R +G +YP IS+INHSC PN V + V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRNIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509
Query: 119 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
R ++ + KG E+L Y + ++R + L ++Y F CTC C + QF + E+
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYCFLCTCEACTQNWQF-PLPETM-- 566
Query: 177 EGYRCKDDGCSGFL----LRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
Y+CK CS + L + + + +QC K ++KKI ++ +K L S
Sbjct: 567 -NYKCK--MCSEIIGTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623
>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 433
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIE---TAGSTM 142
G GLY S INHSC PNA++ F G RL+VV A+ + +G E+ ISY++ +
Sbjct: 324 GVGLYYYQSRINHSCSPNAIIRFSGVNSRLSVV-ALTSIQEGEEITISYLDHCLQSRGRH 382
Query: 143 TRQKALKEQYLFTCTCPRC 161
TR+K L YLF C CP+C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401
>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
Length = 633
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 67/131 (51%), Gaps = 12/131 (9%)
Query: 46 AQIANLVNLILQW----PEISINEIAENFSKLAC-NAH---TICNSE---LRPLGTGLYP 94
+QI +++L W E+ I E A + C NA + E +R TGL+
Sbjct: 430 SQIEKVMDLFGDWRSIGKEVLIGEDAFQVHDVVCRNAFGPGAVIAGEAENIRKASTGLWL 489
Query: 95 VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLF 154
+ + INHSCLPNA F G L VVRA +++ G+E+ SY +++ RQ+AL + + F
Sbjct: 490 MAAYINHSCLPNAEKSFLGDLMVVRATRNITAGSEITHSY-DSSSDYDARQEALMKTWGF 548
Query: 155 TCTCPRCIKLG 165
C C C G
Sbjct: 549 RCRCELCKADG 559
>gi|384252143|gb|EIE25620.1| hypothetical protein COCSUDRAFT_83627 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 136/308 (44%), Gaps = 39/308 (12%)
Query: 15 VIPSTTTDNYSLVEALVAHMSDIDEKQLL-------LYAQIANL-VNLILQ-WPEISINE 65
V+ T+ + LV L H+ D+ QL + A A L L+L+ + ++ IN
Sbjct: 99 VVAQVTSRD--LVAGLETHVGDVPSHQLFQEAVMCFIAAACAGLEPQLVLKAYRQVVINA 156
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
IA ++ A ++E R +G +YP +++NH+C PN F G + A +P
Sbjct: 157 IAITPAEHA-------SAEDR-MGLAIYPRAAMLNHACSPNVAAAFAGTRLHICATSDLP 208
Query: 126 KGAEVLISYIETAGSTM--TRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
G + Y AG ++ R + LKEQY F C C C + +QE+A++ G +C+
Sbjct: 209 PGTTLRFCYGPQAGESIREVRLRQLKEQYHFWCRCSAC--EASYSGVQEAAMI-GMKCRL 265
Query: 184 DGCSGFLLRDSDDKGFTCQQCGL-VRSKEEIKKIASEVNILSKKTLALTSCGNHQ----E 238
C+G ++ CG + S+E+I+++A + A S H E
Sbjct: 266 --CNGPVVPSIG--------CGAEMSSEEQIERMARLSQADAAFASACLSLAQHSAVSAE 315
Query: 239 VVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQ 298
+ + ++Q+ L + L +T + L + D + A+ + Q + + + Y
Sbjct: 316 TLQSMHFCLEVQEVLLVTSNQVLGRTHDALARAYGPRGDTENAMLHAQKALDIVRSNYGD 375
Query: 299 FHPLLGLQ 306
H ++ Q
Sbjct: 376 GHLVVAHQ 383
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 72 KLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFE-------GRLAVVRA 120
+L+C N+ T+ N+ +G L+P +++NHSC PNA + F+ G ++V A
Sbjct: 212 RLSCAVIVNSFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSISV-HA 270
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
++ + K E+ ISYI+T RQ+ L E+Y FTC C C
Sbjct: 271 LRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLC 311
>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
Length = 391
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
+ +G+GLY S NHSC PN V + G V+ A++ + +G E+ +YI+ + S
Sbjct: 268 KSIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEA 327
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
R K LKE Y F C CP C + L+ Y CK C+G +
Sbjct: 328 RIKKLKESYFFNCQCPGCK-----NAPSHKQFLDKYLCKVKQCNGVI 369
>gi|332025390|gb|EGI65557.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 401
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 7/124 (5%)
Query: 86 RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 141
RP LG +YP ++ NH C P F GR V+RA++ + G + +Y I T +
Sbjct: 200 RPVYLGVAIYPSVARFNHDCYPAVTRYFIGRHIVIRAIRGLGPGDVIAENYGPIFTKRTL 259
Query: 142 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
RQ+ L +Y F CTC C + F+++ ++ RC GC G LR K
Sbjct: 260 AERQRTLTGRYWFQCTCKACQEDWPCFENLTNDSV--KLRCPTVGCGGLHLRSRQGKPIK 317
Query: 201 CQQC 204
C C
Sbjct: 318 CPDC 321
>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
Length = 510
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/269 (22%), Positives = 111/269 (41%), Gaps = 31/269 (11%)
Query: 91 GLYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GLYP ++++NHSC+ N ++ + R+ + R +P G E+ SY + TM R++
Sbjct: 213 GLYPTVAMMNHSCVSNTSHSISPIDYRIRL-RTTLKIPAGGELYASYTHSLLPTMLRREH 271
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCG 205
L E F C CPRC D + + +C D+G L + + C C
Sbjct: 272 LLEGKHFACACPRC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDSESKWKCTHCD 326
Query: 206 LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
+ + ++K+ + A++ + + + + I K + + HP L R
Sbjct: 327 FSTTGQAVRKVFRIIQAEVDAAEAISG-ADGADAIHERETIMKKYRSVLHPQHAFLSMLR 385
Query: 266 EKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT------ 309
L ++ ++++ + + C+ + V V P + + G+ Y
Sbjct: 386 HSLTQMYGRVDEYLLDDLPDVVLEHKVDLCRSLLQVLNVVEPGYSRVRGMTLYELHAPLL 445
Query: 310 -CGKLEWFLGD-TENAIKS-MTEAVEILR 335
K +W G E +KS M EA IL+
Sbjct: 446 FLAKGQWNAGVIDETGLKSKMIEAANILK 474
>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
Length = 392
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 9/101 (8%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
I E ++K+ A N+E G+GLY + S INHSC+PN+ F + V++A+
Sbjct: 267 IDEIYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNSCSTFPYSNDIVVLKALTP 322
Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ +G E+ ISY++ S +R K L+E Y+F C CP+C
Sbjct: 323 IQEGDEICISYLDECQLERSRHSRHKVLRENYIFVCQCPKC 363
>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
Shintoku]
Length = 150
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 3/90 (3%)
Query: 79 TICNSELRP--LGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYI 135
TI EL P +G GL+ +S +NHSC PN + + + +A + + VP+G E ISYI
Sbjct: 56 TIQKKELYPETMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYI 115
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIKLG 165
+ S RQ+ L + Y F C C +CI G
Sbjct: 116 DEKDSVENRQEKLYKNYGFKCDCNKCILEG 145
>gi|330796724|ref|XP_003286415.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
gi|325083610|gb|EGC37058.1| hypothetical protein DICPUDRAFT_77304 [Dictyostelium purpureum]
Length = 499
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 6/79 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE-----GRLA-VVRAVQHVPKGAEVLISYIETAGSTM 142
G GL+P+ S +NHSCLPN + E R+ VV+A + +P G+E+L SY + T
Sbjct: 419 GVGLFPIFSCMNHSCLPNIEISNEIQDGVDRVKMVVKAKKLIPAGSEILHSYCDENLPTK 478
Query: 143 TRQKALKEQYLFTCTCPRC 161
R++ L QY F C CP+C
Sbjct: 479 ERKQLLLSQYGFKCLCPKC 497
>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
commune H4-8]
Length = 360
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G++ S INHSC PNAV F+ VVRA++H+ KG E+ ISY ET TR++
Sbjct: 175 GVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAATRRQNS 234
Query: 149 KEQYLFTCTCPRC 161
+ Y F C CP C
Sbjct: 235 LQDYGFRCACPAC 247
>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
Length = 497
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 91/206 (44%), Gaps = 26/206 (12%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
++ S+V+ L AH SD Q V+++ +++ E E F + C+
Sbjct: 141 EDKSVVKCLKAHKSD----------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSN 190
Query: 82 NSEL-------RPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLIS 133
E+ + GLYP+ S+ NHSC PN VF+ + VVRA + +P+G+E+ S
Sbjct: 191 AFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHS 250
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
Y ST R+ L F C C RC +F S + CK + +
Sbjct: 251 YSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTN 305
Query: 194 S--DDKGFTCQQCGLVRSKEEIKKIA 217
S DK + C+ CG + KEE+ I
Sbjct: 306 SLQTDK-WQCEVCGSLIKKEEVANIT 330
>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
Length = 701
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 24/135 (17%)
Query: 51 LVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV 110
L + + + +E+ + K N+H + + +LR LGTG+YP S+ NHSC PNA +
Sbjct: 246 LASAVRHAAALDPSEVEDMLGKTRENSHGVVDWKLRQLGTGIYPEASMFNHSCAPNAAVS 305
Query: 111 F----------------EGRLAVVRAVQHVP--------KGAEVLISYIETAGSTMTRQK 146
F E R V +VP KG EV I+Y E +R+
Sbjct: 306 FGAGGTLNTCCIGVDDYEARAMEVGPKGYVPLGRPRAIRKGEEVCIAYTELYRPAASRRL 365
Query: 147 ALKEQYLFTCTCPRC 161
L++ F C C RC
Sbjct: 366 TLEKSKGFVCECVRC 380
>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
Length = 402
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Query: 45 YAQIANLVNLILQ---WPEISINEIAENFSKLACNAHTI------CNSELRPLGTGLYPV 95
Y+ I + LI P ISI + + + NA I + E G G+YP
Sbjct: 271 YSNIYKFIRLIAPEEYLPFISIQAVRDIIGRNLTNAFGIWSPVTSSDEEREYFGFGVYPS 330
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY-IETAGSTMTRQKALKEQYLF 154
S NHSC PN + G A + +P GAE+ ISY I + RQ AL+E + F
Sbjct: 331 ASFFNHSCKPNVTKIRRGASYEYIAKEDIPVGAELCISYGIRESDGLKQRQDALRE-WFF 389
Query: 155 TCTCPRCIKLGQ 166
C C RC G+
Sbjct: 390 ECGCSRCTTEGK 401
>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
Length = 660
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARLKELYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAIIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYLKALAKFIDLIRIMYEVLAPPF 636
>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
Length = 431
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 46/80 (57%), Gaps = 9/80 (11%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-----GRLA----VVRAVQHVPKGAEVLISYIETAGST 141
G Y ++ NHSC PNA + RL V+RAV + +G EV ISYIE +T
Sbjct: 174 GFYLRMAFCNHSCRPNACQYIDPNSTRARLNSPSIVLRAVSDIAEGEEVCISYIELMDTT 233
Query: 142 MTRQKALKEQYLFTCTCPRC 161
R++AL+E Y FTC CPRC
Sbjct: 234 PERREALQELYYFTCQCPRC 253
>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
206040]
Length = 742
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TG++ S NHSCLPNA F G + +VRA++ +P G E+L+ Y+ RQ+ ++
Sbjct: 501 TGIWLQASYANHSCLPNATRAFIGDMMIVRALREIPVGGEILMQYVPQDKPFDKRQEVVE 560
Query: 150 EQYLFTCTCPRC 161
Y F C C C
Sbjct: 561 NHYGFKCDCDLC 572
>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
Length = 660
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + + + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALPKFIDLIRIMYEVLAPPF 636
>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
Length = 425
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 110/262 (41%), Gaps = 50/262 (19%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
PE +I+ +KL N+ + + +G L+P ++INHSC NAV+ F+G
Sbjct: 140 PEDTISAFG---AKLDVNSFNMTTALADRIGLYLHPYAALINHSCAYNAVIGFDGAELFA 196
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
A++ + + ++ ISY++ R+ L+E+Y F C C +C + D
Sbjct: 197 TALRPITRDEQIFISYVDATNPVAVRRNELRERYFFDCRCAKCA--AEMD---------- 244
Query: 179 YRCKDDGCSGFLLRDSDDKGFTCQQCGL---VRSKEEIKKIASEVNILSKKTLALTSCGN 235
DG D+ K +T + V E ++ + S + L + A
Sbjct: 245 ---APDGGRAPAAADAQRKAYTLMEVATGAGVDPAEAVRALESAMRTLRESEGA------ 295
Query: 236 HQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQ-- 293
++++ E+ L+ R++LI L+ ++ A+ Q I V +
Sbjct: 296 ------SWRITEQ-----------PLVSLRDELIASLLGAGRFQ--TAFVQAAIRVVRVD 336
Query: 294 -RVYP-QFHPLLGLQYYTCGKL 313
VYP + HP+ L +T +L
Sbjct: 337 CVVYPVRGHPIRQLHAWTLARL 358
>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
Length = 660
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G V +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHVNSVRPIEAGLPINENYGPMYTQDGRPERQARLKELYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC + C+ + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCAAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 636
>gi|159472132|ref|XP_001694205.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276868|gb|EDP02638.1| predicted protein [Chlamydomonas reinhardtii]
Length = 468
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKAL 148
GL+ S++NHSC PNAV G V A HV +G+E+LI+Y+ A RQ L
Sbjct: 139 GLWHSFSMLNHSCCPNAVNYVVGDSMCVFAAGHVEQGSELLINYLGAASLRPVEERQAGL 198
Query: 149 KEQYLFTCTCPRC 161
E Y F+C CPRC
Sbjct: 199 AEAYAFSCGCPRC 211
>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 513
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/316 (21%), Positives = 130/316 (41%), Gaps = 41/316 (12%)
Query: 50 NLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLP 105
N+V+ I + ++ ++ +E AC I E+R LYP ++++NHSC+
Sbjct: 173 NIVDYIRK--QLKLDRFSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCIS 230
Query: 106 N---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
N ++ + R+ + R +P G E+ SY + T+ R++ L E F C C RC
Sbjct: 231 NTCHSISPTDYRVRL-RTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRC- 288
Query: 163 KLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV 220
D + + +C D+G L D + C C S ++K+ +
Sbjct: 289 ----SDPTELGTHVSSLKCNKCDNGIVLSLDSLDPDSSWKCTHCEFTTSGSAVRKV---L 341
Query: 221 NILSKKTLAL--TSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW 278
+I+ + A+ S + + + + + K + + HP L R L ++ ++++
Sbjct: 342 HIIQAEVDAVETISGADGADAIQERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEY 401
Query: 279 ----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-------CGKLEWFLGDTE 321
+ + C+L + V + P + + G+ Y K +W G +
Sbjct: 402 LLDDLPDVVLEHKIDICRLLLQVLDVIEPGYSRIRGMTLYELHAPLLFVAKTQWNAGVID 461
Query: 322 NAI--KSMTEAVEILR 335
A+ M EA IL+
Sbjct: 462 EAVLKSKMIEAANILK 477
>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
Length = 636
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 3/78 (3%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
G GLYP ++ NHSC PN G L VV AV+ VP G+++ ISY++ RQ
Sbjct: 228 FGVGLYPAGAMFNHSCRPNCSWRTNRSGELCVV-AVEDVPAGSQLFISYVDILQPWPVRQ 286
Query: 146 KALKEQYLFTCTCPRCIK 163
L+ + F C CPRC +
Sbjct: 287 DLLRCHFFFECACPRCCR 304
>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 413
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 8/106 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
G GLYP+ S+ HSC PNA F + L++V A++ + G EV +SYI+ S
Sbjct: 309 GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLV-ALRDIRPGEEVTVSYIDECSLSRSRH 367
Query: 143 TRQKALKEQYLFTCTCPRCIKLG-QFDDIQESAILEGYRCKDDGCS 187
+R K L+E +LFTCTCPRC + Q D+ +E ++ CS
Sbjct: 368 SRIKMLRESHLFTCTCPRCEEEATQQPDVTSDEEMEDSSDAEEACS 413
>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
Length = 397
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 97/230 (42%), Gaps = 27/230 (11%)
Query: 14 NVIPSTTTDNYSLVEALVAHMS---DIDEKQLLLYAQIANL------VNLILQWPEISIN 64
N +P D+ L AL A + + D+K LL Q L + + Q+ +I ++
Sbjct: 118 NKVPIEEVDDTLLSRALTAIRALLLNEDQKYLLTSLQANKLPQYGSEIRDLKQYFDIPLH 177
Query: 65 EIAENFSKLA-CNAHTICNSELRPLGT------GLYPVISIINHSCLPNAVLVFEGRLAV 117
E E F L C +T P G GLYPV SI+NH+C+PN F G L +
Sbjct: 178 E--EEFMILVICILNTNAFQMATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQM 235
Query: 118 -VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
V+A + + G+E+ Y T R+ L + F C C RC D + +L
Sbjct: 236 TVKATKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHFLCDCERCA-----DPTERGTLL 290
Query: 177 EGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKIASEVNIL 223
+C C G LL K + C +CG+ + I I S + L
Sbjct: 291 AALKCFSTECQGSLLPIQPLKTTTAWRCLECGMRVPNDNICVIQSALGSL 340
>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
1558]
Length = 660
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 61/129 (47%), Gaps = 11/129 (8%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EG----RLA 116
S I + S N + + L +G L P +++ NHSC PNAV+VF EG R+
Sbjct: 355 STGSILQVISSFMINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEGADSMRIV 414
Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQES 173
++A++ G EVL Y++ A RQ L+ Y F C CP C K G +
Sbjct: 415 AIKAIE---AGEEVLTHYVDLALPYAQRQAELRRTYHFECKCPTCDNRTKDGDKPVLDPR 471
Query: 174 AILEGYRCK 182
+IL CK
Sbjct: 472 SILIHRECK 480
>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 440
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 95/213 (44%), Gaps = 23/213 (10%)
Query: 19 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
T ++Y + +L +H ++++ + L IA+ + + +IS EIA L N H
Sbjct: 158 TNPESYKKLISLESHCNEMNNSKEPL--NIAHFIKRFFKADDISEEEIATIIGILQVNGH 215
Query: 79 TICNSELRPLG----TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLI 132
+ PL +Y + S+I H+C N F G L ++RA + KG + I
Sbjct: 216 EV------PLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHISI 268
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
Y + T R+ L + F C C RC +F + +C + CSG++L
Sbjct: 269 CYTDPLWGTANRRHHLFKTKFFECICNRCQDPTEF-----QTMFNALKCNNINCSGYILP 323
Query: 193 DS---DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
+ ++ + C+ C V S EI+K+ ++ I
Sbjct: 324 KTFLEQEQDYICKICESVVSCTEIEKVLEDIGI 356
>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
Length = 393
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQGDEICISYLDECMLERSRHS 345
Query: 144 RQKALKEQYLFTCTCPRC 161
R K L+E Y+F C CP+C
Sbjct: 346 RHKVLRENYVFICQCPKC 363
>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 516
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 46/87 (52%), Gaps = 14/87 (16%)
Query: 91 GLYPVISIINHSCLPNAVLVF-------------EGR-LAVVRAVQHVPKGAEVLISYIE 136
GLYP+ ++INHSC PNAV VF +GR + +V A + KG E+ SY+
Sbjct: 244 GLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWSYLP 303
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIK 163
R+ L ++ FTC C RCIK
Sbjct: 304 PCSPFNERRAVLSSKFGFTCQCVRCIK 330
>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 735
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
+ TG+YP S++NHSC PN + +F G+ +VRA + + + + Y +T RQ
Sbjct: 456 VATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIDQSEGIFNCYGPHYRHMTTEDRQ 515
Query: 146 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
K LK QY FTC C C KL F I+ ++ +C C+ ++DS C
Sbjct: 516 KILKSQYCFTCKCKACTLPKLQYF--IERFNAVKCLKCNGPVCN---IKDS----IYCLD 566
Query: 204 CG 205
CG
Sbjct: 567 CG 568
>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
Length = 517
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 11/163 (6%)
Query: 46 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
A I + + + + +I L N H I S+ P G++ S +SCLP
Sbjct: 189 ASIGQFIPKFFKTQKFTEEQIMRAVGALQINGHEIPTSD--PPHVGVFYTASFTENSCLP 246
Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
N F + ++ A Q + K A + I Y + T RQ+ L + LF C C RC +
Sbjct: 247 NLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCADV 306
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQC 204
+ D +C+D C G +L D+ + C++C
Sbjct: 307 TELD-----TFYSALKCEDRQCGGLMLPSKSDEWNGSWRCREC 344
>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
Length = 646
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 94/216 (43%), Gaps = 26/216 (12%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y + T RQ LKE Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKELYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNI 222
+FDD+ I +RC D+ + C CG + + + K+ + +
Sbjct: 540 NWPKFDDLPRDVI--RFRC-----------DAPNNCSACVTCGEITNILKGLKVMQDTEM 586
Query: 223 LSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
+++ L G + + ++ + +I + + L PF
Sbjct: 587 MTRTAKRLYETGEYSKALAKFIDLIRIMYEVLAPPF 622
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 81/171 (47%), Gaps = 20/171 (11%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLY-AQIANLVNL--ILQWPE----ISINEIAENFSKLA 74
D+Y + L S+ID KQ+ + +++N+ + + P+ ++ +I + + K+
Sbjct: 858 DSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKSLEPLTNQQILDLYPKML 917
Query: 75 CNAHTICNSELRP-LGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLI 132
NAH + + G+YP + +NHSC PN V + + R+++ + G E+
Sbjct: 918 FNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIRDIKAGEEITT 977
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+YI+ TRQ L QY F C C RC Q+ A G++C D
Sbjct: 978 TYIDITKYKSTRQLNLLSQYAFLCQCARC---------QDRAT--GFKCLD 1017
>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 449
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 84/185 (45%), Gaps = 21/185 (11%)
Query: 19 TTTDNYSLVEALVAHMSDIDEKQLLLYAQ----IANLVNLILQWP----EISINEIAENF 70
T D+Y L + + +D++ + + + I++L++ + I+I+EI E F
Sbjct: 128 TPIDDYQDFMGLTSTLDKVDKEHMTKFKRGVTSISSLISSVRGVGYLKNTITIHEILECF 187
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAE 129
S + NAH + + +G G+ P NHSC+PN + + + + +V KG E
Sbjct: 188 SSVLTNAHQFSYATSKEIGRGVCPT-GYFNHSCMPNTTWSLDDQGMLLFSTSSNVKKGDE 246
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF 189
+ + Y+ R++ L + Y F C CP C + + L GY C + C
Sbjct: 247 LSLGYLANEYPLKNRRRELLDGYYFFCQCPLC---------EFQSNLSGYLC--EKCKEP 295
Query: 190 LLRDS 194
LL DS
Sbjct: 296 LLNDS 300
>gi|452988674|gb|EME88429.1| hypothetical protein MYCFIDRAFT_43814 [Pseudocercospora fijiensis
CIRAD86]
Length = 334
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 8/100 (8%)
Query: 69 NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA--VVRAV 121
NF I S PLG G++P++S +NHSCLPNA + RL +V +
Sbjct: 89 NFPGSPTPFSNIVRSNGYPLGPSSSIGGIFPLVSRLNHSCLPNAQHAYNERLGKMLVHII 148
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ + E+ +SYI G + RQ LK +LFTCTC C
Sbjct: 149 RPILPNEEITLSYI-PGGPSPQRQTELKSNFLFTCTCTLC 187
>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
Length = 597
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------EGRLAVVRAV 121
S+ A N++++ +L P+G + P++S++NHSC PNA VF + L V A+
Sbjct: 252 SRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTPSTDEPLISVVAI 311
Query: 122 QHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ + ++ SYI+T R++ L+E Y F C C C
Sbjct: 312 RDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLC 351
>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
Length = 775
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/311 (20%), Positives = 122/311 (39%), Gaps = 46/311 (14%)
Query: 81 CNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG- 139
CN E +G +Y S+ NHSC PN F R +RA + G + +SY G
Sbjct: 464 CNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRATVFMAVGCPLELSYGPQVGQ 523
Query: 140 -STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
R + LK++Y F C C C + D ++ + C + C G +L D
Sbjct: 524 LDCKDRLQLLKDEYSFNCQCSGCSTVHISD-----LVINAFCCINPNCRGVVL---DRSI 575
Query: 199 FTCQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK----- 248
F+C+ V + ++ + L CG++ ++ S+ ++K
Sbjct: 576 FSCENTKTKDFLTVNDQMILEPFMQTDSFLHAGPSHCLKCGSYCDIKSSRLTVDKAGIHF 635
Query: 249 --LQKKL-----------------------YHPFSVNLMQTREKLIKILMELEDWKEALA 283
LQ+++ H ++ + + + L + L + A
Sbjct: 636 TRLQQEINLNRVSETTVSDALGALISLKSTLHEYNRRIAEAEDNLSQAFSLLGKLELAAE 695
Query: 284 YCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSP 343
+C+ +I + +++Y + H +G + + +GD NA+ + +I R +G+N
Sbjct: 696 HCKASIRILEKLYGENHIAIGNELSKLSSILISVGD-HNAVDCIKRLSKIFRCYYGSNVN 754
Query: 344 FMKELILKLEE 354
M + LEE
Sbjct: 755 TMFPFLNILEE 765
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLY + S INHSC PNA + F A +R ++ + G E+ ISY +T+ + R+ L
Sbjct: 260 GLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLF 319
Query: 150 EQYLFTCTCPRCI 162
E Y F C CP+C+
Sbjct: 320 ENYGFNCECPKCL 332
>gi|149238794|ref|XP_001525273.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450766|gb|EDK45022.1| hypothetical protein LELG_03201 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 619
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 68 ENFSKLA-CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP- 125
ENF L N +T+ + + R +G L P S+INHSC+PN+ V E R + +P
Sbjct: 213 ENFVNLIFINCNTLVDYKFRIVGYNLDPQFSMINHSCMPNSCQV-ETEYGCYRLINTLPI 271
Query: 126 -KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G E+ ++YI + T RQ L +Y F C CP C
Sbjct: 272 NAGDEITVNYIASGIPTELRQVQLFSRYFFRCKCPLC 308
>gi|324507181|gb|ADY43048.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 614
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 127/296 (42%), Gaps = 21/296 (7%)
Query: 76 NAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ---HVPKGAEVL 131
N H I + L +G GLY + +HSC PN + +G +A +RA+ ++
Sbjct: 231 NRHAISDDGYLEEIGKGLYLDLCAYDHSCRPNTIYTCKGFVATLRALDLTVNLMDRTTTF 290
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGF-- 189
SYI+ +T R+K L++ + F C C RC+ D + S R K + F
Sbjct: 291 YSYIDLLSTTQERKKLLRDTWYFDCQCVRCV--DNDDHVLSSMFCPSCREKPERLCIFGE 348
Query: 190 -LLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVS--TYKMI 246
++ + TC +C + KE + + + + + + + Q++ ++ +
Sbjct: 349 SPYKNPVTQILTCPKCHIEVPKERVLEAVDAMRFIDR----IVAKNEIQQMAKEQAFRFL 404
Query: 247 EKLQKKLYHPF-SVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRV---YPQFHPL 302
+ ++ + SVN+ K+I++L+ L D ++ L + + V +P HP
Sbjct: 405 KGIKDRFSKILPSVNVYMC--KIIQLLIPLIDPTDSKTLLSLHLESEECVRFCFPPNHPA 462
Query: 303 LGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
+ + + G +G A + A +IL+ T G P + + LEEA E
Sbjct: 463 VAVHLSSIGLFYLRIGHPHRAELYLEMACDILKFTLGHTHPLTVDKLSILEEAHRE 518
>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 652
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 21/133 (15%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVF----------EGRLAVVRAVQHVPKGAEVLISY 134
L P+G + P +++ NHSC PNA LVF E L++V A+++V G E+ ISY
Sbjct: 314 LTPIGICVSPTVALANHSCDPNAALVFPRADGGSRVKEPLLSLV-ALRNVAPGKEIRISY 372
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG---FLL 191
++T RQ LKE Y F+C C C + D E C C G +
Sbjct: 373 VDTTLPNRLRQNELKEVYSFSCQCKLCCRTTAADP------REALWCPKS-CGGICPYPT 425
Query: 192 RDSDDKGFTCQQC 204
+S D+ C +C
Sbjct: 426 EESGDRATHCVKC 438
>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
Length = 598
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 127/300 (42%), Gaps = 28/300 (9%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
P +S + A S L N+H + G+GL+ + ++ HSC PN G V
Sbjct: 257 PTVSDEDAARMLSILNTNSHELGEWG----GSGLFLLGCLMEHSCAPNCNFSTYGTQLWV 312
Query: 119 RAVQHVPKGAEVLISYIE-TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
A+ + KG + I Y + T T R+K LK+ Y F C CPRC L D+ + I
Sbjct: 313 TALNSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTL---PDVMRAFICP 369
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQ 237
+CK C L S + C++C K ++ K + + + +K L + H
Sbjct: 370 VKQCKGKVCP--LGEGSKPSDWVCEKC-----KSQLDKKSRKRFLEIEKAL-MEEGPEHD 421
Query: 238 EVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----KEALAYCQLTIPVYQ 293
T+ ++ + ++L ++ + E+ + E W K A + IP+
Sbjct: 422 VFHETHHIVLEALRELGSVYAKD-----EEAVADGSAEEIWLRVLKGADQARSILIPL-- 474
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLE 353
V P+FHP + + +L + A +++ + E+ R G +P ++ + +E
Sbjct: 475 -VLPKFHPTKAIWWDNVAQLRLSRKNQAGAQEALKRSYEVTRRCCGDATPSTQDALRLVE 533
>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
Length = 739
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 64/295 (21%)
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAG 139
N +R L T ++PV+S++NHSC PN + F GR VRA + + + EV Y +
Sbjct: 465 NKSIR-LATAVFPVLSLLNHSCDPNTTVSFTGRFVTVRANRPIRRDEEVTHCYGPHKLRM 523
Query: 140 STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGF 199
RQ+ LK+QY F C C C + L+ GF
Sbjct: 524 DVAERQQLLKDQYFFVCQCKACTE--------------------------ELKGKKTHGF 557
Query: 200 TCQQCGLVRSKEEI--------KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQK 251
C C EE AS+ + S+ L GNH ++ +LQ
Sbjct: 558 FCPLCKAQLEGEEALYCTGARCTYTASQTQLTSR----LNQLGNHIQIAKV-----QLQD 608
Query: 252 KLYHPFSVNLM-----------QTREKLIKILMELE-------DWKEALAYCQLTIPVYQ 293
+ LM Q L +I+ L +WK A + + +I + +
Sbjct: 609 NKTDNAKMTLMSCLSEAECFLSQDHLLLGEIMDHLAQAEASDGNWKAAAGHLRKSISIVK 668
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
Y LG + + ++ + + +A++++ EA ++L + +G + ++EL
Sbjct: 669 VHYGSSSMELGHELFKLAQILFNGFEVPDAMRTIMEAQKVLSMHYGPDHNLVQEL 723
>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
rotundata]
Length = 388
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 52/86 (60%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSAVNHSCIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
RQKAL YLF C C +C L Q +D
Sbjct: 347 RQKALSSLYLFICCCSKC--LSQTND 370
>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 673
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 83/178 (46%), Gaps = 18/178 (10%)
Query: 69 NFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVV 118
N + CNA+ I + E R +G +YP IS+INHSC PN V + V+
Sbjct: 450 NLQAINCNAYEIVENIYDKKTHVWEPRHIGGAIYPSISLINHSCYPNVVRHSYPSGTVVL 509
Query: 119 RAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
R ++ + KG E+L Y + ++R + L ++Y F C C C + QF + E+
Sbjct: 510 RTLRFIGKGTEILDCYGPHWLSEKRLSRLEYLWKKYRFLCACEACTQNWQF-PLPETM-- 566
Query: 177 EGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTS 232
Y+CK + S L + + + +QC K ++KKI ++ +K L S
Sbjct: 567 -NYKCKMCSEIISTITLDEKHTQNVSSKQCCKCNKKTDLKKIKNQFRKSVEKRLDAIS 623
>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 628
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 65/300 (21%), Positives = 124/300 (41%), Gaps = 28/300 (9%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
+K+A + +C + + T +YP S++NHSC PN + F + +VRA++ + G E
Sbjct: 326 LNKIASDKDKLCVEQQDRIATAIYPSASMMNHSCDPNIITSFVDQYLIVRAMKDIQAGEE 385
Query: 130 VLISYIETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
V Y S RQ+ LK QY F C C C+ + ++++ E +C + C
Sbjct: 386 VSNCYGPNFRRMSREHRQEILKNQYGFECRCDPCV-MPEYENFMER--FHAIKCPE--CG 440
Query: 188 GFLLRDSDDKGFTCQQCG---LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYK 244
G L + C CG + + E+K + + +E V K
Sbjct: 441 G-ALEEFHSYSMHCLDCGATPYINCQIELKNAVTAFDAAQ----IYIELEREEEAVVKLK 495
Query: 245 MIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYP------- 297
+++ L + + +L T + L K+ + W E++++ + ++ + Y
Sbjct: 496 ECLSIRRNLLYKYHQDLTATLDILGKVYAIMGRWLESISHIEHSMAAIEERYGCSSMEVA 555
Query: 298 -QFHPLLG-----LQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILK 351
+ + L LQ +W+ + + + A EI+ T+G +E+I K
Sbjct: 556 NELNKLTDICIRYLQEEPNRTTKWYKNVLKKTRRYLDRAEEIVNFTYGPWHEIHQEIIEK 615
>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
Length = 338
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA + F V+A+ ++ G E+ ISY++ S +
Sbjct: 234 GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTGEEICISYLDECALERSRHS 293
Query: 144 RQKALKEQYLFTCTCPRC 161
R K L+E YLF C C +C
Sbjct: 294 RHKILRENYLFNCNCSKC 311
>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
familiaris]
Length = 419
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 524
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
G+GLY + NHSC PNA + E A +R++ +P G ++ ISYIE + TRQ
Sbjct: 444 GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQIEISYIEEDQPSQTRQS 503
Query: 147 ALKEQYLFTCTCPRC 161
L ++Y F C C +C
Sbjct: 504 ELIDKYKFKCHCQKC 518
>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 519
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 12/167 (7%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV- 118
+ S +EIA + N H + SE P+ +Y S + HSC N F +V
Sbjct: 198 DFSEDEIARAAGIMQINGHEVPISE--PVLIAVYDESSYVEHSCRANCSKSFTSSGGIVI 255
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEG 178
RA + KG ++I Y + T R+ L F CTCPRC +F ++
Sbjct: 256 RAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFFECTCPRCSDPAEF-----GSMFNA 310
Query: 179 YRCKDDGCSGFLLRDS----DDKGFTCQQCGLVRSKEEIKKIASEVN 221
C CSG +L S + + C +C RS + ++ ++
Sbjct: 311 IECSKSDCSGCMLPKSFTVQEIPDYRCNKCDNSRSSNSVDEMLEQIG 357
>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
Length = 567
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 334 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 386
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y + T RQ LK+ Y F C+C CI
Sbjct: 387 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 446
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 447 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 504
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + ++ + +I + + L PF
Sbjct: 505 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 543
>gi|391338176|ref|XP_003743437.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Metaseiulus
occidentalis]
Length = 409
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISI--NEIAENFSKLACNAHTICNS- 83
++ LV+H+ D+++ + A + IL +SI E+ E + K N H + +
Sbjct: 111 LDDLVSHLEDLNDDERQKVKAKAQQIKSILDRVGVSIERKELMEIYMKCKINCHLVIDHN 170
Query: 84 --ELRPLGTGLYPVISIINHSCLP--NAVLVFEGRLAVVRAVQH--VPKGAEVLISYIET 137
E G +Y S ++H+C+P N L+F+GR V+RA+++ + +++ I Y+ +
Sbjct: 171 EPEYLSRGRAIYLAASKLDHTCVPTENYTLMFDGRRLVIRAMKNFTLKDPSDLRIHYVPS 230
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQF----DDIQESAILE 177
S R+ L + Y F C+C +C+ +F DD +E + E
Sbjct: 231 NLSYNDRRDRLLKNYFFLCSCDQCVSQLRFPAADDDSREERVSE 274
>gi|328770867|gb|EGF80908.1| hypothetical protein BATDEDRAFT_88247 [Batrachochytrium
dendrobatidis JAM81]
Length = 428
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 47/265 (17%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQK 146
G+G+YP S+INH RA +++ KG + ISY E+AG ++ TRQ+
Sbjct: 153 GSGIYPTASLINH-----------------RASKNIRKGNAITISYGESAGRSLRATRQQ 195
Query: 147 ALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGL 206
LK + F C C CI D Q L G +C GC + L + + + C +C
Sbjct: 196 NLKAGWGFECRCSACISEDDTKDRQ----LAGLKCP--GCP-YPLYEGEPR---CPKC-- 243
Query: 207 VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQ------KKLYHPFSVN 260
I I+ + +++K C +H++ + +I+ LQ + + HP S
Sbjct: 244 ----HSIVDISRQQKVMAKIDAVWHECQDHRKHDNVASLIQTLQGIQTDLESIMHPSSSR 299
Query: 261 LMQTREKLIKILMELE---DWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYT-CGKLEWF 316
L + L + ++L D + L++C+ + R+ L ++ K+ +F
Sbjct: 300 LGCMYDNLASLNVQLAKQGDNRILLSWCEKLLESSIRIVEGIFGLTSIEAANEHNKMAYF 359
Query: 317 LG--DTENAIKSMTEAVEILRITHG 339
+ D E IK A + R +G
Sbjct: 360 MMAVDAEKGIKIARHAEGLFRTLYG 384
>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
Length = 387
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 91/205 (44%), Gaps = 26/205 (12%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC 81
++ S+V+ L AH SD Q V+++ +++ E E F + C+
Sbjct: 141 EDKSVVKCLKAHKSD----------QHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDSN 190
Query: 82 NSEL-------RPLGTGLYPVISIINHSCLPNAVLVFEGRL-AVVRAVQHVPKGAEVLIS 133
E+ + GLYP+ S+ NHSC PN VF+ + VVRA + +P+G+E+ S
Sbjct: 191 AFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFIPQGSEIFHS 250
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD 193
Y ST R+ L F C C RC +F S + CK + +
Sbjct: 251 YSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSIL-----CKVCKTGKVIPTN 305
Query: 194 S--DDKGFTCQQCGLVRSKEEIKKI 216
S DK + C+ CG + KEE+ I
Sbjct: 306 SLQTDK-WQCEVCGSLIKKEEVANI 329
>gi|116179288|ref|XP_001219493.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
gi|88184569|gb|EAQ92037.1| hypothetical protein CHGG_00272 [Chaetomium globosum CBS 148.51]
Length = 588
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 1/70 (1%)
Query: 92 LYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQ 151
L+P ++INHSC+PN+ F G L V+RA +++ KG E++ SY E+ G RQ+AL
Sbjct: 425 LWPWAALINHSCIPNSEREFVGDLMVIRATKNIAKGEEIVHSYDES-GVYDDRQRALMTT 483
Query: 152 YLFTCTCPRC 161
+ F C+C C
Sbjct: 484 WGFECSCALC 493
>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 654
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
LG +Y + NHSC PNA F G L V+RA + +G E+ I Y G+T R+
Sbjct: 439 LGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTERR 498
Query: 146 KALKEQYLFTCTCPRCI 162
+ L +QY F C C CI
Sbjct: 499 QHLYKQYRFNCDCRACI 515
>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
Length = 392
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S +NHSC+PNA + + V++AV + KG E+ ISY++ S +
Sbjct: 286 GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHS 345
Query: 144 RQKALKEQYLFTCTCPRC 161
R K L+E Y+F C C +C
Sbjct: 346 RHKILRENYIFVCQCSKC 363
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)
Query: 46 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
A I + + + S EI L N H + S+ P ++ S +SC+P
Sbjct: 187 ASIGQFIPKFFRTQKFSEEEIMRAVGALQINGHEVPTSD--PPHVAVFYTASFTENSCVP 244
Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
N F + ++ A + + KGA + I Y + T RQ+ L + LF CTC RC+ +
Sbjct: 245 NLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERCVDV 304
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
+ +C+D C+G LL D C +C
Sbjct: 305 TEL-----GTYYSALKCEDRKCTGLLLPIKADDWNGCWRC 339
>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
Length = 660
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y + T RQ LK+ Y F C+C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARLKDLYWFECSCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
+FDD+ I +RC + CS + +D C CG + + + K+ +
Sbjct: 540 NWPKFDDLPRDVI--RFRCDAPNNCSAVIEVPPSCNDFMVKCVTCGEITNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G + + ++ + +I + + L PF
Sbjct: 598 TEMMTRTAKRLYETGEYPKALAKFVDLIRIMYEVLAPPF 636
>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 681
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 64/144 (44%), Gaps = 17/144 (11%)
Query: 76 NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
NAH + + L RPL G +YP ++ NH C P F GR V+RA++ +
Sbjct: 454 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 513
Query: 125 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 181
G V +Y I T S R++ L +Y F C C C + +F+ + + RC
Sbjct: 514 RAGDAVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 571
Query: 182 KDDGCSGFLLRDSD-DKGFTCQQC 204
+GCS R D D+ C C
Sbjct: 572 PTEGCSKLHPRPRDPDRSIECSSC 595
>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
Length = 391
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 15/170 (8%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVA-HMSDIDEKQLLLYAQIANLVNLILQWP 59
M+++ L + +D + T + +S + ALV + I L LY V+ I +P
Sbjct: 207 MIRILLSEAMYDDRLEQWFTPEGFSSLFALVGTNGQGIGTSSLSLY------VHNIDSYP 260
Query: 60 EISINE---IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 114
+S +E I ++L + L G GLY + S NHSC PNA + F
Sbjct: 261 ALSDDERQAIDIFLNQLYEEMERVSGQFLNCEGAGLYALQSSCNHSCAPNAEVTFPKNNS 320
Query: 115 LAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
V++A+ + G E+ ISY+E S +R K L+E Y+F CTC +C
Sbjct: 321 TLVLKALHPIKNGEEICISYLEECQRERSRHSRLKYLRENYIFDCTCTKC 370
>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 2/101 (1%)
Query: 70 FSKLACNAHTICNSEL--RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
F+ + N IC+ E +G+ +Y +++NHSC PNA VF + +A++ + G
Sbjct: 30 FAAIKNNQFAICDEESGDYDVGSAVYIDHALVNHSCRPNAYPVFNKTNMIFKALRKIEPG 89
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
E+ +Y +T R++ L + + F C CP C K + D
Sbjct: 90 EEITHAYTDTISPIQERREYLNDVWRFMCNCPGCTKSNEID 130
>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 662
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
+ TG+YP S++NHSC PN + +F G+ +VRA + + + E+ Y +T RQ
Sbjct: 377 IATGIYPSASMMNHSCDPNIINIFVGQYLIVRASRDIGQSEEIFNCYGPHYRHMTTEDRQ 436
Query: 146 KALKEQYLFTCTCPRCI--KLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
K LK Y FTC C C KL F + +A + +C C+ ++DS C
Sbjct: 437 KILKSHYCFTCKCKACTLPKLQYFTERFNAA--KCLKCNGPVCN---IKDS----IYCLD 487
Query: 204 CG 205
CG
Sbjct: 488 CG 489
>gi|50552992|ref|XP_503906.1| YALI0E13552p [Yarrowia lipolytica]
gi|49649775|emb|CAG79499.1| YALI0E13552p [Yarrowia lipolytica CLIB122]
Length = 637
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 50/101 (49%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
+ +I E L N +T C+ R G P +++INHSC+PNA L+F GR + +
Sbjct: 179 LEQILEYAGMLFVNQYTRCDDLGRQGGYIFDPTLALINHSCVPNAYLLFRGRKVHLVCWK 238
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+ G EV +SY T R+ L + F C CP C+
Sbjct: 239 PINDGDEVFLSYTRFMHPTPERRTLLYMHFRFWCECPGCVS 279
>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
Length = 763
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G+GL+ + S INHSC N+ F G L +VRA + + +G E+ I+Y+E RQKA
Sbjct: 515 GSGLWIMPSYINHSCWQNSTRSFLGDLLIVRAARDLSEGEEITINYMENESGVQKRQKAF 574
Query: 149 KEQYLFTCTCPRC 161
++ F C C C
Sbjct: 575 LSEWGFECKCTMC 587
>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
Length = 159
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 46 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 105
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 106 RHKILRENYLFVCSCPKCL 124
>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
Length = 518
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 103/242 (42%), Gaps = 35/242 (14%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP-KGAEVL-ISYIETAGS 140
S++ L GLYP +++ HSC+PN +L +G + VR P K E+L ISY +
Sbjct: 190 SQMGNLARGLYPKTALMMHSCMPNTLLSVDG-YSNVRVFTSAPVKMGEILNISYTRSLFG 248
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKG 198
T RQ L++ F C+C RC D + L RC DDG F D+
Sbjct: 249 TYDRQTHLRKGKYFICSCRRCK-----DPTELGTHLSSIRCSECDDGLCSFY---PDEPR 300
Query: 199 FTCQQCGLVRSKEEIKKI--ASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL--Y 254
+ C +C + +E + +I A+ +I++ EV ++I K K L +
Sbjct: 301 WECNKCHKLLKREYVNEILCAARDDIMACPL----------EVRDLERVISKHSKILNPH 350
Query: 255 HPFSVNLMQTREKLIKILMELED--------WKEALAYCQLTIPVYQRVYPQFHPLLGLQ 306
H + QT +K + D + C+ +P+ + + P L+G+
Sbjct: 351 HSLVLEAKQTLMGELKAICMSTDPTAVPKPVLRRKFELCEEILPILRTLEPGLSRLVGIA 410
Query: 307 YY 308
Y
Sbjct: 411 MY 412
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 15/148 (10%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
+Y S++ H C+ NA+ F + +V RA +PKG ++ + Y E TM RQ+ L
Sbjct: 231 AVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLS 290
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD--DGCSGFLLRDS---DDKGFTCQQC 204
+ F C C RC + D G+ C+ +G LL ++ D + C+QC
Sbjct: 291 QTKFFQCVCERCKDPTELD-----TFTSGFYCQKCPQQSAGILLTENPLDDASDWICRQC 345
Query: 205 GLVR----SKEEIKKIASEVNILSKKTL 228
G + + E I+ + E+ L K ++
Sbjct: 346 GARQPANYAAEIIESVGKEIVALKKGSV 373
>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Oreochromis niloticus]
Length = 725
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 128/322 (39%), Gaps = 57/322 (17%)
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------------------- 111
A N+ N E+R + T ++P +S++NHSC PN LVF
Sbjct: 405 ATNSLVQSNREIR-IATAVFPALSLLNHSCCPNTSLVFNTGGIIEPSGSALSADFSEGEA 463
Query: 112 ------EGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCIK 163
+G VRA + + G E+L Y + +T R++ L+EQY F C C C +
Sbjct: 464 EWRHKAQGVTVTVRAAKLITPGQEILHCYGPHSSRMATQERRRLLQEQYYFLCQCEACSQ 523
Query: 164 LGQFDDIQESAILE-GYRCKDDG-----CSGFLLRDSDDK--GFTC--QQCGLVRSKEEI 213
Q D++E + G +D G C G L + S+D+ GF C C S ++
Sbjct: 524 HQQ--DVEEPRQQQPGAEGRDLGLLCIKCKGSLKKRSEDREAGFVCSLSSCSHCMSLSDV 581
Query: 214 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR-------E 266
E+ + +K + L E + +L KK V L +T +
Sbjct: 582 NNRLQEIRVNLEKAVELMETDRPGEAL-------RLLKKTLCQSGVILAETHPLQGELAD 634
Query: 267 KLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKS 326
+ + DW A ++ + + + + LG Q + +L + G A+
Sbjct: 635 ATARAYASMGDWNNAASHLERSTVAIASQFGKESVELGRQLFKLAQLHFNSGSRGPALSV 694
Query: 327 MTEAVEILRITHGTNSPFMKEL 348
+ + +L + G + ++EL
Sbjct: 695 IPKVRRLLCLHCGPHCHELQEL 716
>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
Length = 359
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 89/191 (46%), Gaps = 32/191 (16%)
Query: 2 LKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVN 53
L+L + ++TT+N + E L M+ + +E L A+ + L+
Sbjct: 143 LRLIRWEAFRKFRATHTSTTENDTTDEFLATGMTLLGRILRFTQEECTLFTVARWSELLG 202
Query: 54 LIL-----QWPEISINEIAENFSKLACNAHTI---------CNSELRPL-----GTGLYP 94
+L + P + + + E +L C T+ EL+ L G G+Y
Sbjct: 203 AVLLNGQERSPPSNYDRLKELVQRLPCGESTMDAFEQEVQMAGKELQHLLQSSRGQGVYT 262
Query: 95 VISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
V + NHSC PN +V+ + L+VV A++++ G E+ ISYI+ + S RQ+ L E
Sbjct: 263 VGCLFNHSCEPNLQVVYSESGDETLSVV-ALRNIELGEELCISYIDESLSYPERQQELYE 321
Query: 151 QYLFTCTCPRC 161
YLF C CP+C
Sbjct: 322 HYLFVCQCPKC 332
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GL+ + S NHSC PN + F + AV+ A++ + KG E+ ISY ++ RQ L
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338
Query: 150 EQYLFTCTCPRCIK 163
+ Y F C CP+C +
Sbjct: 339 DNYGFNCDCPKCTQ 352
>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
Length = 785
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 7/118 (5%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
+ H N E +G +Y V S+ NHSC PN F R +R Q V G ++ +SY
Sbjct: 462 SVHLTSNVEQVRVGKAIYKVGSLFNHSCQPNVHAYFLSRTLYLRTTQAVAAGCQLELSYG 521
Query: 136 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
G RQ LK++Y F C C C ++ D +L + C + CSG +L
Sbjct: 522 PQVGLWDCKDRQSFLKDEYAFHCQCTGCSEVNLSD-----IVLNAFHCVNPNCSGAVL 574
>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 481
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 75/156 (48%), Gaps = 12/156 (7%)
Query: 42 LLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL---GTGLYPVISI 98
L++ A + +LQ + ++EI E +L N ++ R G+ LY ++
Sbjct: 288 LIMEALSGRATSDVLQG--LDVDEIVEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAM 345
Query: 99 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
NHSC P+ V F+G+ +RA++ + G E+ ++YI T RQ L++ Y F C C
Sbjct: 346 FNHSCFPSVVRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRC 405
Query: 159 PRCIKLG-QFDDIQESAILEGYRCKDDG--CSGFLL 191
RC G + + E +L C +D C G L+
Sbjct: 406 QRCEARGAEVERYMEECLL----CPNDKRECDGLLV 437
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
K+ NA ++++ +G L P +++ NHSC+PNA + F GR A++RA + + E+
Sbjct: 169 KIQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADEEIE 228
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
ISY + R++AL Y FTC C RC
Sbjct: 229 ISYTDYNYPLSKRKEALAP-YFFTCECTRC 257
>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
Length = 372
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 46/79 (58%), Gaps = 7/79 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYI---ETAGSTM 142
G GLY + S NHSC PNA + F RLAV +A + + G E+ ISY+ + A
Sbjct: 281 GAGLYRIQSKCNHSCEPNAEVCFPNNNHRLAV-KACRDIAAGEEITISYLSQCQIARGCR 339
Query: 143 TRQKALKEQYLFTCTCPRC 161
+RQ+ LKE YLF C C +C
Sbjct: 340 SRQQYLKENYLFHCCCSKC 358
>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
Length = 392
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
I E +SK+ A N+E G+GLY + S INHSC+PNA F + V++A+
Sbjct: 267 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 322
Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ +G E+ ISY++ S +R K L+E Y+F C C +C
Sbjct: 323 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 363
>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
Length = 400
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 50/96 (52%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP 125
+ E + K+A N+ I + + +G G+YP S+INHSC N + +F G +RA + +
Sbjct: 170 MEEIYGKIASNSFAILDENMCSIGIGVYPQASMINHSCKSNCIGMFYGPQIQIRANEFIR 229
Query: 126 KGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G ++ YI T RQ+ L + Y F C C C
Sbjct: 230 PGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADC 265
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 52/104 (50%)
Query: 255 HPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLE 314
H +V +++ + L+++ +EL W++A+ Y + P Y+ ++ GL Y
Sbjct: 281 HSDNVYVIRILKDLVEVCVELGLWEKAVVYGKRIEPGYKMFLKRYDLDQGLLYQKMALAY 340
Query: 315 WFLGDTENAIKSMTEAVEILRITHGTNSPFMKELILKLEEAQAE 358
+ LGD A+ + +A L IT G +S ++E+ L E A+
Sbjct: 341 YRLGDQSAALPYLRQAKTTLTITSGEDSGLVQEVSDMLAECVAK 384
>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
Length = 399
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 46/98 (46%), Gaps = 17/98 (17%)
Query: 91 GLYPVISIINHSCLPNAV-----------------LVFEGRLAVVRAVQHVPKGAEVLIS 133
GL+P + INHSC PNA L EG ++R+V + +G + IS
Sbjct: 132 GLWPQAAAINHSCRPNATHYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITIS 191
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
Y+E RQ+AL+ Y F CTC RC + D Q
Sbjct: 192 YVELGDPWPVRQEALRTGYGFACTCIRCTEEAALDSEQ 229
>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
Length = 300
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 187 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 246
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 247 RHKILRENYLFNCSCPKC--LAEADD 270
>gi|414870819|tpg|DAA49376.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
Length = 489
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 129/308 (41%), Gaps = 56/308 (18%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
S++R + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229
Query: 143 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQES----------AILEG--YRCKDD---- 184
RQK+L+E Y F+C C C +L D + S AI E YR K++
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSDLVMNSFCCPQSNCLGAISESTYYRSKENFVNV 289
Query: 185 -----------------------GCSGFLLRDSD-----DKG--FTCQQC-GLVRSKEEI 213
+ LL++S D G +C+ C L +
Sbjct: 290 SLGGSYVCKLSLPDVSKVDMDMENVAKSLLQNSGVSLNIDHGCCMSCRSCIDLSSALATS 349
Query: 214 KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILM 273
+ S ++ L K T + + + +++L KKL HP+S L Q + + +
Sbjct: 350 HREESTIDRLKKHTFL-----DKTLITEALQSLKQL-KKLRHPYSKALAQAEDTIAEAFA 403
Query: 274 ELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEI 333
++ D + A +C+ +I + +++Y H ++ + + LGD +A + +A I
Sbjct: 404 KVGDQELARKHCEASIQILEKLYHPKHIIIAHELIKLVSILLSLGDGASAAATFAQAEAI 463
Query: 334 LRITHGTN 341
+ +G++
Sbjct: 464 FSLYYGSH 471
>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLYP+ S+ NH C+PN F+ + + VRA + G E+ +SY T R++ L
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 206
F+C C RC +F+ + SA+L C D CSG LL + + C +C +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNS-KLSALL----CASDKCSGELLPKNPLNMRTSWICNKCSI 328
Query: 207 VRSKEEIKKIAS 218
+ +I I S
Sbjct: 329 TVNHRQICSIRS 340
>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
vitripennis]
Length = 386
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 7/88 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
G+GLY + S INHSC PNA + F + RL VVRA++ + E+ I+Y++ S
Sbjct: 288 GSGLYVLQSAINHSCAPNASVEFPHSDSRL-VVRALRDIKPDEEICIAYLDECHLERSRH 346
Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDI 170
+RQKAL YLF C C +C + D+
Sbjct: 347 SRQKALSSLYLFVCKCDKCQQQASDPDV 374
>gi|428180005|gb|EKX48874.1| hypothetical protein GUITHDRAFT_105498 [Guillardia theta CCMP2712]
Length = 197
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 22/158 (13%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
NAH I N + +G G++P ++NH+C NAV F VP GAEV + I
Sbjct: 7 NAHPITNGSFKRIGLGIFPSACMLNHACNYNAVYSF------------VPGGAEVHANAI 54
Query: 136 E----TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGY-RCKDDGCSGFL 190
+ T+ + RQK ++E Y F C C C + S + + C C +
Sbjct: 55 DFGLNTSSQLIRRQKLIREGYFFQCACEWCAGTSGPERYFNSLVCNPWPDCDGHKCEAAI 114
Query: 191 LRDSD-DKGFTCQQCGLVRSKEEIKKIASEVNILSKKT 227
L D++ D+G C + + I +E+N L+K
Sbjct: 115 LPDANGDEGQAVVICDACKRRHPI----TEMNALNKSA 148
>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 467
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 93/213 (43%), Gaps = 23/213 (10%)
Query: 19 TTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAH 78
T ++Y + +L +H ++I+ + L IA+ + + +IS EI L N H
Sbjct: 158 TNPESYKKLISLESHCNEINNSKEPL--NIAHFIKRFFKADDISEEEIVTIIGILQVNGH 215
Query: 79 TICNSELRPLG----TGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLI 132
+ PL +Y + S+I H+C N F G L ++RA + KG + I
Sbjct: 216 EV------PLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGL-IIRAALPITKGDHISI 268
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR 192
Y + T R+ L + F C C RC +F + +C CSG++L
Sbjct: 269 CYTDPLWGTANRRHHLLKTKFFECICNRCQDPTEF-----QTMFNALKCNKINCSGYVLP 323
Query: 193 DS---DDKGFTCQQCGLVRSKEEIKKIASEVNI 222
+ ++ + C+ C + S EI+K+ ++ I
Sbjct: 324 KTFLEQEQDYVCKTCESIVSCTEIEKMLEDIGI 356
>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
Length = 311
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 19/159 (11%)
Query: 22 DNYSLVEALVAHMSDIDEKQL-----LLYAQIANLVNLILQWP-EISINEIAENFSKLAC 75
D Y +E L++ + D+ +L A++A +L S++E +L
Sbjct: 148 DGYDALEGLMSGVDGGDDGELPHDAVATIAEVAKQARFLLAASCRCSMDECVRTLGRLQL 207
Query: 76 NAHTICNSEL------------RPLGTGLYPVISIINHSCLPNAVLVFEGRLA-VVRAVQ 122
N + SE RP+G G++P S NHSC PN F+G VV +
Sbjct: 208 NGFEMTASEPEEGADEAEGGGHRPVGVGVFPSASYTNHSCAPNCAQRFDGHGCIVVETAR 267
Query: 123 HVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
V G E+ I Y++ R++ L++ + F C C RC
Sbjct: 268 DVRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306
>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 576
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEG--RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
+G GL+ +S+ NHSC PNA + + + A V A++ + + +VL+SYI+ +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478
Query: 146 KALKEQYLFTCTCPRC 161
+ LK Y FTC C RC
Sbjct: 479 RLLKRHYKFTCACVRC 494
>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 303
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
paniscus]
Length = 304
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
Length = 419
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY 134
C C+ ++ +G L V ++INHSC PNA + EG VR+++ + G E+ +SY
Sbjct: 103 CKIARYCSKDV--IGYALDMVTAMINHSCAPNAFVTLEGCQLRVRSLKPIAAGEEITVSY 160
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRC 161
+ RQK LKE Y F C C C
Sbjct: 161 ADPTLPVFNRQKFLKETYFFDCRCKGC 187
>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
Length = 625
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 57/106 (53%), Gaps = 11/106 (10%)
Query: 59 PEISINEIAENFSKLACNAHTICNSELRPLGT-GLYPVISIINHSCLPNAVLVFEGRLAV 117
PE+ E+ F++LA N + S +R + G+YP S +NHSC PNA + G
Sbjct: 282 PELPPQEV---FARLAAN--VVKTSAVRNANSVGVYPFPSFLNHSCAPNACKLMVGHTMF 336
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTR--QKALKEQYLFTCTCPRC 161
+RA + + G EV + Y + TM + + A+ +++ F C CPRC
Sbjct: 337 IRAARDLVAGEEVFMKYFDV---TMPKPERSAVAKRWGFECACPRC 379
>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
Length = 680
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 5/111 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
LG G+Y ++++NHSC P + F G + VVRA++ G V +Y I T + RQ
Sbjct: 476 LGAGVYSTVALLNHSCEPGVIRHFIGDVMVVRAIKSFQPGEMVNENYGPIFTQKRRVDRQ 535
Query: 146 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD 195
++LK++Y F C C C + ++ E A+ +RC D C L+ SD
Sbjct: 536 RSLKDRYWFDCRCNPCTENWPLIGEMTEEALC--FRCADRRCRKPLVVQSD 584
>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
mellifera]
Length = 387
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
RQKAL YLF C C +C L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371
>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
Length = 400
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 287 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 347 RHKILRENYLFNCSCPKC--LAEADD 370
>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
Length = 515
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 72/173 (41%), Gaps = 11/173 (6%)
Query: 48 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
I + + + S EI L N H + S+ P ++ V S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMRTVGALQINGHEVPTSD--PPHVAVFYVASFTENSCLPNL 246
Query: 108 VLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
F ++ A Q + K + + I Y + T RQ+ L + LF C C RC
Sbjct: 247 AKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRCQ---- 302
Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
D + +C+D CSG +L D G+ C++CG K+ ++ I
Sbjct: 303 -DPTELGTNYSAIKCEDRKCSGLMLPTKADDWHGGWQCKECGKQVPKQYVEGI 354
>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
Length = 604
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 10/117 (8%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQ 145
+G LY S+ NHSC PN L F R +++ + VP G + +SY G R
Sbjct: 475 VGQALYKTGSLFNHSCKPNIHLYFLSRGLIMQTTEFVPTGCPLELSYGPEVGKWDCKNRI 534
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQ 202
+ L+E+Y F C C C ++ D ++ GY C + C+G +L D TC+
Sbjct: 535 RFLEEEYFFHCRCRGCAQINISD-----LVINGYGCVNTNCTGVVL---DSNVATCE 583
>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
africana]
Length = 417
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387
>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
Length = 302
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
florea]
Length = 387
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 7/87 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S +NHSC+PNA++ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYILQSSVNHSCVPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
RQKAL YLF C C +C L Q +D+
Sbjct: 347 RQKALNSLYLFQCYCNKC--LSQANDL 371
>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
troglodytes]
Length = 299
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFVCSCPKCL 266
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 61/127 (48%), Gaps = 17/127 (13%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR-LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GL+P+ +INHSC PN G +A VRA++ + +G E+ +SYI+ R+K
Sbjct: 208 AVGLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKE 267
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESA--ILEGYRCKDDGCS------GFLLRDSDDKGF 199
L+E F C C RC + ES LEG+ C CS +LL +DK
Sbjct: 268 LRETKHFDCQCGRCAA-----PLSESVDRYLEGFCCP--RCSVMASGKDYLLAQVEDK-L 319
Query: 200 TCQQCGL 206
C C L
Sbjct: 320 VCSDCQL 326
>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
Length = 416
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHS 362
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386
>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 428
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
N+ + N E + +G GL+ S INHSC PN V + R ++++ + G E+ ISYI
Sbjct: 154 NSFCLTNGEEQVIGIGLFDYASFINHSCCPNCVPLQNKREMSIKSLTSIKSGEEIFISYI 213
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRC 161
+ S R+K L+ ++ F+C C C
Sbjct: 214 DITESFERREKELR-KWHFSCGCSLC 238
>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
Length = 530
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 22/181 (12%)
Query: 43 LLYAQIANLVNLIL---QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
+L A + +N +L WPE+ I IA + L N + R LYP ++I
Sbjct: 199 VLRANLITFINTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPEAAMI 255
Query: 100 NHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTC 158
+H C+PN F+ + +V A + + KG + ISY + ST+ R+ L++ F C+C
Sbjct: 256 SHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSC 315
Query: 159 PRCI---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEE 212
RC +LG F Q +CK +G ++ + + + CQ C RS +E
Sbjct: 316 ARCQDPEELGSFAGAQTC-----LKCK----AGKIISQNPLLNSAPWKCQLCNFKRSAKE 366
Query: 213 I 213
+
Sbjct: 367 V 367
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 2/122 (1%)
Query: 40 KQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISII 99
K + L A A + +LQ E+ + + E K+ NA +++ G L ++++
Sbjct: 181 KDMELQATAAVVYAGLLQGEEV-LEKAREILCKIQTNAFNRLDADTGMAGIFLDVGLAMV 239
Query: 100 NHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCP 159
NHSC+PNA + F+ R A++RA + + +G E+ ISYI+ R +AL+ Y F C C
Sbjct: 240 NHSCVPNAFIGFDKRTAILRAERPIQEGEEITISYIDNTLPKAARYEALR-LYHFQCDCV 298
Query: 160 RC 161
RC
Sbjct: 299 RC 300
>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 749
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/214 (21%), Positives = 89/214 (41%), Gaps = 10/214 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM--TRQ 145
+ T +YP S++NHSC PN + F G+ + +A + + G EV Y + RQ
Sbjct: 460 IATAIYPSASMMNHSCDPNIINSFLGQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQ 519
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQ 202
+ ++ QY F C C C +++DI + +C + CSG L + C
Sbjct: 520 EKMESQYCFKCNCAAC-SAPEYEDILKK--FTAKKCPE--CSGPLNDNCSVLHRSSIHCM 574
Query: 203 QCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLM 262
CG + + +K N+ E + K +L+K + + ++
Sbjct: 575 DCGTAVYENIYDFTLRQAQHYFEKAEICIENENYDEALDKLKKCLRLRKDALYKYHDDIA 634
Query: 263 QTREKLIKILMELEDWKEALAYCQLTIPVYQRVY 296
T + + K+ + W +++Y + +I + Y
Sbjct: 635 DTMDFIAKVYAIIGQWLNSISYLEHSIVAIEEKY 668
>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
Length = 379
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 4/75 (5%)
Query: 91 GLYPVISIINHSCLPNAV----LVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
G+YP ++ NH CLPNA + + ++R + VP+G E+ +SY TRQK
Sbjct: 209 GIYPKAALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQK 268
Query: 147 ALKEQYLFTCTCPRC 161
L+E Y F C C RC
Sbjct: 269 RLREDYGFICDCDRC 283
>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
Length = 391
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 137
+ S L G+ LY S INHSC PNA VF + +RA + + G E+ ISY++
Sbjct: 279 VVGSFLNNEGSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDE 338
Query: 138 AG---STMTRQKALKEQYLFTCTCPRC 161
S +RQK LK+ YLF C C +C
Sbjct: 339 CNLQRSRHSRQKTLKDYYLFICQCEKC 365
>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
Length = 746
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TG++ + NH+C+PNA F G + +VRA + +P GAE+ + Y A +R+ K
Sbjct: 503 TGMWLHAAYANHTCIPNATRAFIGDMMIVRAARDIPAGAEIFMGYASLAEPFESRRSKFK 562
Query: 150 EQYLFTCTCPRC 161
Y F C C C
Sbjct: 563 TSYGFECDCEMC 574
>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
VEG]
Length = 402
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
LG GL+ +S+ NHSC PNA + F A V A++ + + E+L+SYI+ + RQ
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQVTALRDIAEKEEILLSYIDESLPLAERQ 322
Query: 146 KALKEQYLFTCTCPRC 161
+ LK Y FTC C RC
Sbjct: 323 RLLKSHYKFTCGCQRC 338
>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
Length = 348
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKE 150
G++P +S NH C PN G AV+ +P G E+ ISYI S TRQ L+E
Sbjct: 173 GVFPEVSKFNHDCRPNVHYRLNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE 232
Query: 151 QYLFTCTCPRC 161
+ FTCTCP+C
Sbjct: 233 -WGFTCTCPQC 242
>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
Length = 417
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 364 RHKILRENYLFVCSCPKC--LAEADD 387
>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 679
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/267 (22%), Positives = 118/267 (44%), Gaps = 21/267 (7%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--S 140
S+ R + T +YP +S++NH+C+PN + F + VRA + + G E+ Y G +
Sbjct: 410 SQFR-VATAVYPTVSMMNHACIPNIIPSFRKGILCVRATKKIMHGDEIQHCYGPQVGHMT 468
Query: 141 TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFT 200
T RQ+AL QY FTC C C + + D + ++ +C + +
Sbjct: 469 TSDRQQALLNQYCFTCRCRACTRDPKSRD---NLCMKCPKCG---------QPLNAMTSI 516
Query: 201 CQQCG-----LVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
C +C LV KE + + ++ A+ ++V+S K + ++
Sbjct: 517 CGKCAVWIDVLVLLKEVDDTMITFAELVEMYPAAVKDVTLMRDVISKTKTCHDVLDRILI 576
Query: 256 PFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEW 315
P ++L + K + L+++KEA + IP + + + + + Y ++ +
Sbjct: 577 PPHMHLAVAYNFMAKCHIALQEFKEAAPWLAKAIPTIECRFGRDSIEVANELYKLAQIYF 636
Query: 316 FLGDTENAIKSMTEAVEILRITHGTNS 342
+ A++++ A+E+ I H NS
Sbjct: 637 NGKEIAPAMEAIDRALELF-IRHYGNS 662
>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
Length = 1624
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/373 (22%), Positives = 152/373 (40%), Gaps = 59/373 (15%)
Query: 1 MLKLYLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLL--LYAQIANLVNLIL-- 56
M ++ LR + + +N T ++ ++H+ +I ++ + ++ NL+ L
Sbjct: 1278 MARIILRWRRELNNPSSETLLGTRRQLKDCMSHLQEIKRDRVRGGAFVEMLNLLKQFLTA 1337
Query: 57 -QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRL 115
Q +S +I E F + N I N + G LY S+I+HSC PN F+G+
Sbjct: 1338 AQLEGVSDLDILEIFGIMCVNTIHISNDD-DSFGCALYLAPSLIDHSCYPNLTATFKGQK 1396
Query: 116 AVVRAVQHV-PKG-AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
V++ ++ PK A++ ++Y+ + R+K L+E+Y FTC C C
Sbjct: 1397 IVLKVLRPCEPKTVADLSLAYMPVCTTKERRRKTLREEYYFTCECEMC------------ 1444
Query: 174 AILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
G +L ++D K+ EV L K +L L+S
Sbjct: 1445 ----------SGKVPEVLSEAD------------------PKLTDEVLELEKLSLDLSSP 1476
Query: 234 GNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL--EDWKEALAYCQLTIPV 291
NH++ + + E L KL ++ + R L+ + + A Y ++PV
Sbjct: 1477 ENHRKALKGVE--ELLSTKLKDLDDSDVAKFRAILVAADASVCASSYDRAYNYYSRSLPV 1534
Query: 292 YQRVYPQFHPLLGLQYYTCGKLEWFLGDTENA-------IKSMTEAVEILRITHGTNSPF 344
+RV+ + + +L + T I + EAV + RI G +
Sbjct: 1535 MKRVFTENKAEYAYKLVRLARLSTIVVKTSQDPEVFGPLIALLQEAVRVTRIALGEDHSD 1594
Query: 345 MKELILKLEEAQA 357
K++ L EE +A
Sbjct: 1595 TKDVALLYEELRA 1607
>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 687
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G G+YP ++++NHSC P V F G ++RAV+ + G E+ +Y I T R+
Sbjct: 490 IGGGVYPTVAMLNHSCNPGVVRYFVGTTMILRAVRTISAGEEISENYGPIFTTTPENERK 549
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQE-SAILEGYRCKDDGCSGFLL---RDSDDKGFTC 201
+ L+ QY F C C C G + ++E + ++C+ G +L D+++ C
Sbjct: 550 RRLRVQYWFDCNCEACT--GHWPLLEELDPTVLRFKCETGPSCGNVLPIRSDTNEFMIGC 607
Query: 202 QQCG 205
+CG
Sbjct: 608 VKCG 611
>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Cricetulus griseus]
Length = 435
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 322 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 381
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 382 RHKILRENYLFICSCPKC--LAEADD 405
>gi|294886191|ref|XP_002771602.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239875308|gb|EER03418.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 265
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 9/98 (9%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
GT +YPV S+ NHSC PN G L R V +G E+ ISYI + KA+
Sbjct: 17 GTAVYPVASMFNHSCTPNVTRASLGDLTWFRTCTDVKRGQELTISYI--GSDLLCEPKAV 74
Query: 149 KEQYL---FTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
++++L F+C CP C K +D +S ++ + +D
Sbjct: 75 RQKHLARDFSCNCPACTK----EDDDDSVVIMPFTLQD 108
>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 650
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 107/257 (41%), Gaps = 36/257 (14%)
Query: 22 DNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF---------SK 72
D+Y + LV H + +A L+N L + AE+F
Sbjct: 364 DDYRKIYKLVTHEETRSAEDFFQRTLMATLLNACLTLGGFYKTKEAESFIGGLLLHNLQL 423
Query: 73 LACNAHTICNSELRP-----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
L NAH I SEL+ +G GLYP +++ NHSC P ++G VR+V
Sbjct: 424 LQFNAHEI--SELQREDDRDVGKSVFIGGGLYPTLALFNHSCEPGVTRYYKGNSVCVRSV 481
Query: 122 QHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEG 178
+ + G+ + +Y + T R+ L QY F+C C C + F ++ ++ +
Sbjct: 482 RSIAAGSMIGENYGPLFTQTPRDERRATLLNQYRFSCNCRACSENWPLFSEMDDTVL--- 538
Query: 179 YRCKDDG---CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALTSC 233
R K DG CS LL S+ F C CG + + K+ + + K + S
Sbjct: 539 -RFKCDGGKICSNVLLIPSEINEFMIKCTDCGEHTNIMKGLKLVQDTETMFKLATKMHSA 597
Query: 234 GNHQEVVSTYKMIEKLQ 250
G + + YK +E++
Sbjct: 598 GEIEAAL--YKYVEEMN 612
>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
Length = 565
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 38/199 (19%)
Query: 91 GLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GLYP+ ++ H C+PN F +G +A +++ KG + +Y + T R+
Sbjct: 263 GLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKGEHLTTTYTHSLWGTQQRRDH 322
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC---KDDGCSGFL-----LRDSDDKGF 199
LK F+C+C RC D + L +C + C+G+ L+++ D +
Sbjct: 323 LKANKYFSCSCARCA-----DPTELGTYLSALKCMGIDGNACAGYQLPIDPLKETSD--W 375
Query: 200 TCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL-- 253
C QC + +++ KI EV+ + E S+ K +E L KL
Sbjct: 376 KCNQCPITIESDQVNFLLSKIGEEVD-------------DAMERKSSVKQLEDLISKLLT 422
Query: 254 -YHPFSVNLMQTREKLIKI 271
HP L+Q + LI++
Sbjct: 423 FLHPNHHFLLQLKHSLIQM 441
>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
impatiens]
Length = 386
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY S INHSC+PNAV+ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
RQKAL YLF C C +C L Q +D
Sbjct: 347 RQKALSSLYLFHCYCNKC--LSQVND 370
>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 419
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 47/98 (47%), Gaps = 1/98 (1%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQH 123
E+ E SK+ N + + +L +G LY S NHSC PN EG+ A
Sbjct: 278 ELMEYISKIESNTFGMWSDKLVSMGMVLYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHD 337
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+P G EV I YI+ T +R+ L Y FTC CP C
Sbjct: 338 IPAGDEVCIRYIDVDKPTTSRRSELLSHYHFTCMCPLC 375
>gi|392558999|gb|EIW52184.1| hypothetical protein TRAVEDRAFT_75273 [Trametes versicolor
FP-101664 SS1]
Length = 721
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 96 ISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYL 153
IS INHSC PNAV FE VRA+ +P GA+V ISYI+ A RQ+AL Y
Sbjct: 179 ISRINHSCSPNAVYRFEPATLTFEVRALSPIPSGAQVFISYIDPALPRAKRQEALS-SYG 237
Query: 154 FTCTCPRCIKLG 165
FTC C C G
Sbjct: 238 FTCACTACALTG 249
>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
Length = 416
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V V A++ + G E+ ISY++ S +
Sbjct: 303 GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERSRHS 362
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 363 RHKILRENYLFNCSCPKC--LAEADD 386
>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
Length = 656
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TGL+ S +NH+C+ NAV F G L V RA +H+ G E+ +Y E A RQ AL+
Sbjct: 509 TGLWVRASYVNHACVANAVKDFAGDLMVFRAARHIRAGEEIFHAYDENADYD-ARQAALR 567
Query: 150 EQYLFTCTCPRC 161
+ + F CTC C
Sbjct: 568 KTWGFACTCALC 579
>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
Length = 420
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1240
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 117/277 (42%), Gaps = 30/277 (10%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 146
+ ++PV S++NHSC N +L + G +++++++ K E+ Y A R++
Sbjct: 893 IAYAIFPVSSLLNHSCDNNTILQYNGSSITIKSLKNISKNDEITGCYGPHAFHLELRERL 952
Query: 147 -ALKEQYLFTCTCPRC-IKLGQFDDIQESAILE--GYRCK-----------------DDG 185
LKE+Y F C C C K+G + LE G +CK D
Sbjct: 953 ECLKEEYFFICRCHACNAKIGPNLLRCPTQSLESGGEQCKGTLLERVDTNSLGQEEDDSY 1012
Query: 186 CSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEV----NILSKKTLALTSCGNHQEVVS 241
G D + + F C C L K + + S+ I S T L +HQ
Sbjct: 1013 PEGASYDDYNGRKFVCSICSLELGKMDCFMLTSQSVMSGAIFSNATKLLIDTEHHQPPRE 1072
Query: 242 TYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTI-PVYQRVYP 297
+M+ K ++K++YH S + + + L + + + +KEA+ Y + I ++ R+
Sbjct: 1073 VEEMLFKCLEIRKQIYHVKSRKIGEVYDALSRYYIAKDKYKEAVTYAERAINNIFLRLGH 1132
Query: 298 QFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEIL 334
L ++ + G+ AIK++ + +L
Sbjct: 1133 SHSTELAREWAKLANIYINAGEPAKAIKAINTSESLL 1169
>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
Length = 188
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 66 IAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQH 123
I E +SK+ A N+E G+GLY + S INHSC+PNA F + V++A+
Sbjct: 62 IDELYSKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAP 117
Query: 124 VPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ +G E+ ISY++ S +R K L+E Y+F C C +C
Sbjct: 118 IQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCLKC 158
>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
Length = 391
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S +NHSC+PNAV+ F V++A++++ G E+ SY++ S +
Sbjct: 295 GSGLYVLQSSVNHSCIPNAVVEFPYSNNTLVLKAIRNIKVGEEICTSYLDECQLERSRHS 354
Query: 144 RQKALKEQYLFTCTCPRC 161
RQ+AL YLF C C +C
Sbjct: 355 RQQALSSLYLFVCHCDKC 372
>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
angustifolius]
Length = 357
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 62/143 (43%), Gaps = 24/143 (16%)
Query: 91 GLYPVISIINHSCLPNAV---LVFEGRL--------AVVRAVQHVPKGAEVLISYIETAG 139
+YP S+ NH C+PNA V L V+R +Q +P+G EV ISY +
Sbjct: 212 AIYPKASMFNHDCIPNACRFDYVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISR 271
Query: 140 STMTRQKALKEQYLFTCTCPRCI-----------KLGQFDDIQESAILEGYRCKDDGCSG 188
TR++ L + Y F+C C RC + ++ D+ + Y C C+G
Sbjct: 272 DYCTRKRILMDDYGFSCECDRCKIEANWPHDCQNYVEEYSDLPHVRFIAKYVCDRKNCNG 331
Query: 189 FLL--RDSDDKGFTCQQCGLVRS 209
L D+ C CG ++S
Sbjct: 332 TLAPKDDAHTNVLECNFCGNLKS 354
>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 419
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
porcellus]
Length = 416
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
Length = 391
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 9/79 (11%)
Query: 91 GLYPVISIINHSCLPNAVLVFE--------GRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
G+YP S+ NH CLPNA F+ VVR + VP+G E+ +SY +
Sbjct: 220 GIYPKASLFNHDCLPNACR-FDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVNSNYS 278
Query: 143 TRQKALKEQYLFTCTCPRC 161
TR+K L E Y FTC C RC
Sbjct: 279 TRRKRLLEDYGFTCDCDRC 297
>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
Length = 415
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 419
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
Length = 421
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
Length = 394
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 16/171 (9%)
Query: 1 MLKLYLR--RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLV-NLILQ 57
M +LYL + Q ++ TT D + + ++ S +L +Q +V +L L
Sbjct: 199 MEQLYLAFCQAFQGEDFARFTTPDAFKTLMGIMGTNSQGIATSVL--SQWVTIVSDLPLS 256
Query: 58 WPE-ISINEIAEN-FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EG 113
PE + +++ +N ++K+ A N+E G+GLY + S INHSC+PNA F
Sbjct: 257 DPEKVQLDQEIDNLYAKVGEFAGEFLNNE----GSGLYLLQSKINHSCVPNACSTFPYSN 312
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAG---STMTRQKALKEQYLFTCTCPRC 161
+ V++A+ + +G E+ ISY++ S +R K L Y+F C CP+C
Sbjct: 313 DIVVLKALSPIQEGDEICISYLDECQLERSRHSRHKILCGNYIFICQCPKC 363
>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
paniscus]
Length = 420
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
Length = 419
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Nomascus leucogenys]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
Length = 419
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
Length = 648
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
+G+ +PV+S++NHSC PN V + G + VV+ + + +G E+L +Y + R++
Sbjct: 479 IGSAAFPVVSLMNHSCNPNVVHLCYGDVMVVKVIHRIARGEEILDNYGYHYATHEKRERQ 538
Query: 148 LK--EQYLFTCTCPRCIK 163
LK +QY F C C C++
Sbjct: 539 LKLCQQYYFRCRCQSCVE 556
>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
troglodytes]
Length = 415
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 5 [Callithrix jacchus]
Length = 431
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 320 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 379
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 380 RHKILRENYLFVCSCPKCL 398
>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
Length = 395
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYI---ETAGSTMT 143
G+ LY + S NHSC+PNA + F A V A+Q + + E+ ISY+ + S +
Sbjct: 295 GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQENEEICISYLGECDIGRSRHS 354
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
RQK L+E YLF C C +C Q DD
Sbjct: 355 RQKILRENYLFNCNCMKC--QSQCDD 378
>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
Length = 507
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 25/199 (12%)
Query: 92 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
LYP I+ H+C+PN ++ EG +RA+ + G + SY T T RQ+ L
Sbjct: 207 LYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLKDGQPLHHSYTYTLDGTAQRQRHL 266
Query: 149 KEQYLFTCTCPRC---IKLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
K+ F+CTC RC +LG F ++ EGY+ D + C C
Sbjct: 267 KQGKYFSCTCERCQDPTELGTHFSSLKCGQCTEGYQVPRQP-------TDSDSSWNCNNC 319
Query: 205 GLVRSKEEI----KKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G S+ E+ + + SEVN + + + ++ Y K L HP
Sbjct: 320 GASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEAERLLRKY-------KSLLHPLHFI 372
Query: 261 LMQTREKLIKILMELEDWK 279
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
Length = 128
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA FE G VV A + +PKG E+ ++Y + ST+ R+ L
Sbjct: 51 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110
Query: 150 EQYLFTCTCPRC 161
F C CPRC
Sbjct: 111 MTKHFICQCPRC 122
>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|449550207|gb|EMD41172.1| hypothetical protein CERSUDRAFT_91931 [Ceriporiopsis subvermispora
B]
Length = 329
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 90 TGLYPVISIINHSCLPNA--VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
+G +P++S INH C P+A FE + A + +P G E+ I+Y + RQ
Sbjct: 138 SGCFPILSRINHDCSPSANYFFAFEPYCGQLWAARDIPDGGEITIAYTTLPAPRVDRQAH 197
Query: 148 LKEQYLFTCTCPRC 161
L E+Y FTCTC C
Sbjct: 198 LSERYFFTCTCVAC 211
>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
Length = 415
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Ailuropoda melanoleuca]
gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
Length = 417
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
Length = 407
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 293 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 352
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 353 RHKILRENYLFVCSCPKCL 371
>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
mulatta]
gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 438
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 48/86 (55%), Gaps = 10/86 (11%)
Query: 82 NSELRPLGTGLYPVISIINHSCLPNA-VLVFEGRLAV----VRAVQHVPKGAEVLISYIE 136
N E R G G+Y S NHSC PN V E L V +RAV+ KG E+ ISYI+
Sbjct: 324 NGESR--GCGVYVRNSFFNHSCNPNVNYWVVENTLEVECSLMRAVR---KGEELCISYID 378
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCI 162
TA S R++ L E YLF C C +CI
Sbjct: 379 TAASLRDRREKLSEGYLFHCRCEKCI 404
>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
Length = 416
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
[Strongylocentrotus purpuratus]
Length = 396
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 7/79 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETA---GSTM 142
G+ LY + S NHSC+PNA + F L++V A+ + +G EVLISY++ S
Sbjct: 288 GSALYKLQSCCNHSCVPNAEISFLHNNSTLSLV-ALTDITEGQEVLISYLDECCKERSRH 346
Query: 143 TRQKALKEQYLFTCTCPRC 161
+RQK L+E YLF+C C +C
Sbjct: 347 SRQKELRENYLFSCDCSKC 365
>gi|307183932|gb|EFN70520.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 421
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 59/126 (46%), Gaps = 9/126 (7%)
Query: 86 RP--LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGST 141
RP LG +YP ++ NH C P F GR V+RA++ + G + +Y I T +
Sbjct: 218 RPVYLGVAIYPTVARFNHDCYPAVTRYFVGRHIVIRAIRGLRPGDVIAENYGPIFTKRTL 277
Query: 142 MTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCKDDGCSGFLLRDSD--DKG 198
RQ+ L +Y F C+C C + +F+ + ++ RC GC+G R DK
Sbjct: 278 AERQRTLAGRYWFRCSCRACQEDWPRFETLTNDSVR--LRCPTTGCNGLHSRPQQRPDKP 335
Query: 199 FTCQQC 204
C C
Sbjct: 336 IKCSAC 341
>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
Length = 513
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/339 (21%), Positives = 137/339 (40%), Gaps = 42/339 (12%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELR 86
V+ + AH +K+ Q+ N+V I + ++ ++ +E + C I E+R
Sbjct: 151 VKDMEAHNKIRSQKKQWKLDQV-NIVEYIRK--QLKLDRFSEKEIQTVCGILEINTFEVR 207
Query: 87 PL----GTGLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIET 137
+YP ++++NHSC+ N + ++ RL R +P G E+ SY +
Sbjct: 208 TAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPPGGELYGSYTHS 264
Query: 138 AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSD 195
TM R++ L E F C C RC D + + +C D+G L
Sbjct: 265 LLPTMLRREHLLEGKHFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDP 319
Query: 196 DKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYH 255
+ + C C S ++K+ +++ A++ + + + + + K + + H
Sbjct: 320 ESSWKCTHCEFSTSGSAVRKVLQIIHMEVDAVEAISG-ADGADAIQERETVAKKYRSVLH 378
Query: 256 PFSVNLMQTREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGL 305
P L R L ++ +ED+ + + C+L + V + P + + G+
Sbjct: 379 PRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCKLLLQVLDVIEPGYSRIRGM 438
Query: 306 QYYTCGKLEWFLGDTE--------NAIKS-MTEAVEILR 335
Y F T+ A+KS M EA +IL+
Sbjct: 439 TLYELHAPLLFTAKTQWNAEVIDKAALKSKMIEASKILK 477
>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
caballus]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
Length = 412
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 298 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 357
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 358 RHKILRENYLFVCSCPKCL 376
>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 501
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 10/91 (10%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVL----VFEGRLAVVRAVQHVPKGAEVLISYI 135
+ ++ P ++P+ + INHSC+PNA + + R+ VV A++ + G E+ ISYI
Sbjct: 408 VVEDKVAPEIVAVFPLTARINHSCVPNAQVQSQEFVDARIDVV-ALRDIAAGEEITISYI 466
Query: 136 ---ETAGSTMT--RQKALKEQYLFTCTCPRC 161
T+GS T R++ L +YLFTC CPRC
Sbjct: 467 GCGRTSGSKSTSRRRRELLAKYLFTCECPRC 497
>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
Length = 529
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 105/225 (46%), Gaps = 28/225 (12%)
Query: 19 TTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENFSKL 73
+ + ++ + AL H+ + E L +L A + + +L WPE+ I IA + L
Sbjct: 171 SNPEAFARLAALEDHLKERIETPLYQVLRANLITFIKTVLGQRDWPELEILRIA---AIL 227
Query: 74 ACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLI 132
NA + + R ++P ++I H C+PN F+ + ++ A + +PKGA + I
Sbjct: 228 DTNAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFLAKRPIPKGAILSI 287
Query: 133 SYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGCSGF 189
SY + ST+ R+ LK+ F C C RC +LG F A L G +CK G
Sbjct: 288 SYTQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTF----AGAHLCG-KCK----VGK 338
Query: 190 LLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
++ + + + CQ C + +S E+ ++ E+ L K T
Sbjct: 339 VISQNPLENAANWHCQLCNVKKSAREVLTQDARLQQEIESLDKTT 383
>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
terrestris]
Length = 386
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 5/87 (5%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY S INHSC+PNAV+ F + V++A++ + E+ ISY++ S +
Sbjct: 287 GSGLYIFQSSINHSCVPNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHS 346
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
RQK L YLF C C RC+ DI
Sbjct: 347 RQKVLSSLYLFHCYCNRCLSQVNNPDI 373
>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 359
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 84/174 (48%), Gaps = 32/174 (18%)
Query: 19 TTTDNYSLVEALVAHMSDI--------DEKQLLLYAQIANLVNLIL-----QWPEISINE 65
+TT+N + + L M+ + +E L A+ + L+ +L + P + +
Sbjct: 160 STTENDTTDDFLATGMTLLGRILRFTQEECTLFTVARWSELLGAVLLNGQERSPPSNYDR 219
Query: 66 IAENFSKLACNAHTI---------CNSELRPL-----GTGLYPVISIINHSCLPNAVLVF 111
+ E +L C T+ +EL+ L G G+Y V + NHSC PN +V+
Sbjct: 220 LKELVRRLPCGESTMDAFEQEVQMAGNELQHLRQSSRGQGVYTVGCLFNHSCEPNLQVVY 279
Query: 112 ----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ L+VV A++ + G E+ ISYI+ + S RQ+ L E YLF C CP+C
Sbjct: 280 SESGDETLSVV-ALRDIEPGEELCISYIDESLSYPERQQELYEHYLFFCQCPKC 332
>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
Length = 302
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
L+P+ ++NH C PNA FE G VV A + +P GAE+ +SY + ST+ R+ L
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285
Query: 150 EQYLFTCTCPRC 161
F C CPRC
Sbjct: 286 MTKHFMCQCPRC 297
>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 442
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG----RLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
G G+Y V + NHSC PN + + L VV A++ V G E+ ISYI+++ R
Sbjct: 357 GQGIYEVGCLFNHSCDPNLSVQYSSLNDETLTVV-ALRDVKAGEELTISYIDSSLPFAVR 415
Query: 145 QKALKEQYLFTCTCPRCIKLGQFD 168
Q+ L + YLF C CPRC+ G D
Sbjct: 416 QQQLLDHYLFECRCPRCVAEGTTD 439
>gi|443704468|gb|ELU01530.1| hypothetical protein CAPTEDRAFT_197951 [Capitella teleta]
Length = 614
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/370 (21%), Positives = 147/370 (39%), Gaps = 68/370 (18%)
Query: 21 TDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINE-----------IAEN 69
+ NY+ V L+ H + + L+ YA A L+ L E N + +
Sbjct: 260 SRNYTAVYHLLPHNESMLREDLIQYALTAALLLKCLDHTEYFKNHSPDFRFSIGGLLLRH 319
Query: 70 FSKLACNAHTI-------CNSELRP-----LGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
+L CNAH I C+ + + T +YP S++NHSC P +++ + L
Sbjct: 320 ICQLVCNAHAITRLEQGLCSQAVVECQQVRIATAIYPTTSLLNHSCDP-SIIARKNEL-F 377
Query: 118 VRAVQHVPKGAEVLISYIETAGS--TMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
VR V+ V G E+ Y RQ+ L+ QY F C C C ++ +
Sbjct: 378 VRLVKDVKAGEEIFNCYGPHYARMPKKERQEVLQSQYFFKCDCSECTAEEPLEN-----L 432
Query: 176 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
L+ YRC+ CS ++ + C +C + ++ ++++ S ++ C
Sbjct: 433 LKAYRCQK--CSHAIISTGTQEVLKCSKCVTMV---DMNQLSAAEKQSSDDFMSSLKCLQ 487
Query: 236 HQEVVSTYKMIEK---LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVY 292
+++ K ++ ++K +YH + K+L E +D ALA C +
Sbjct: 488 VEDIDGAIKKLKNSLDIRKMIYH-----------RNHKLLTEAKD---ALARCYCIKGQF 533
Query: 293 QRVYPQFHPLLGLQYYTCGKLEWFLGDT--------------ENAIKSMTEAVEILRITH 338
R P L G L + + A++++ ++++ILR+ +
Sbjct: 534 TRAVPLLKDCLLCVETAFGSRSIELANELHKYAEVLVNARKLDEALETVNQSIDILRLNY 593
Query: 339 GTNSPFMKEL 348
G KE+
Sbjct: 594 GPEYSTAKEM 603
>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
Length = 514
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 48 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
I + + + S EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFSEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCACERCVDVTE 306
Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 LDTFY-SAI----KCEDRQCGGLMLPTKTDDWNGNWRCRECHKQVQKHYVERI 354
>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
Length = 633
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 520 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 579
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+CP+C L + DD
Sbjct: 580 RHKILRENYLFICSCPKC--LAEADD 603
>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 421
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 90/194 (46%), Gaps = 17/194 (8%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLYP+ ++ NH C+PN F+ + + V AV + G E+ +SY + T +R++ LK
Sbjct: 147 GLYPMGALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLK 206
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRD---SDDKGFTCQQCGL 206
F+C C RC D ++ + L C D CSG LL + + + C +C +
Sbjct: 207 VTKRFSCNCNRC-----SDPLEFGSQLSTLLCAKDNCSGCLLPRNPLNHESSWICDKCQI 261
Query: 207 -VRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTR 265
V ++ I+ I S +N + T +E++ + E + +L + +M +
Sbjct: 262 SVNYRQVIECIHSGLNTFVSDAMYKTP----REIL---RFTEAILSRLVPATNYIMMDMK 314
Query: 266 EKLIKILMELEDWK 279
++I + D K
Sbjct: 315 YRIISYFGRVPDLK 328
>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
Length = 675
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 112/267 (41%), Gaps = 38/267 (14%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA-------- 67
I ++Y V LV H + + L +A L+ L + + ++ +
Sbjct: 373 IDRLPVNDYRKVYQLVTHEATRTPEDLFHRTLMATLLVSCLTLSDYAPDQASCEYLGGLM 432
Query: 68 -ENFSKLACNAHTICNSEL---RP--------LGTGLYPVISIINHSCLPNAVLVFEGRL 115
N L NAH I SE+ +P +G GLYP +++ NHSC P + G
Sbjct: 433 LHNLQLLQFNAHEI--SEMIREKPSDIGKSTFIGGGLYPTLALFNHSCDPGVTRYYRGNQ 490
Query: 116 AVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESA 174
VR V+++ G T R++ L QY FTC C C++ F D+ S
Sbjct: 491 VCVRTVKNIAAG---------TRVRREERRETLLNQYRFTCYCEPCMQNWPLFSDMDPSI 541
Query: 175 ILEGYRCKDDG-CSGFLLRDSDDKGF--TCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
I +RC+ CS LL S+ F C +CG + + K + ++L + L
Sbjct: 542 IR--FRCEGGKICSNVLLIPSEINDFMVKCTECGEHTNIMKGLKSLQDTDMLFQTATRLH 599
Query: 232 SCGNHQEVVSTY-KMIEKLQKKLYHPF 257
S G ++ + Y +M+ + + L P+
Sbjct: 600 SAGEYEAALMKYVEMMAIMSEVLVPPY 626
>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
HHB-10118-sp]
Length = 606
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 14/118 (11%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---------- 111
S ++ S+ N T+ + L P+G + P +++ NHSC PN V+VF
Sbjct: 248 SAGDLVGLISRFTTNTFTLTSYTLSPVGICISPAMALTNHSCDPNVVIVFPRSSKKVTKE 307
Query: 112 -EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFD 168
+ +L +R++ H E++ +Y++ RQ ALKE Y FTC C C G D
Sbjct: 308 PQMQLIALRSILHE---EEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLCKNTGPTD 362
>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
Length = 386
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYI-- 135
+ S L G+GLY + S +NHSC PNA+ F V++A++ + G E+ SY+
Sbjct: 278 VVGSFLNNEGSGLYILQSSVNHSCTPNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDE 337
Query: 136 -ETAGSTMTRQKALKEQYLFTCTCPRC 161
E S +RQKAL YLF C C +C
Sbjct: 338 CELERSRYSRQKALSSLYLFVCHCDKC 364
>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Otolemur garnettii]
Length = 300
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V A++ + G E+ ISY++ S +
Sbjct: 188 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 248 RHKILRENYLFICSCPKCL 266
>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 718
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 63/144 (43%), Gaps = 17/144 (11%)
Query: 76 NAHTICNSEL---------RPL--GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHV 124
NAH + + L RPL G +YP ++ NH C P F GR V+RA++ +
Sbjct: 491 NAHEVFETRLGNEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGRSIVIRAIRSL 550
Query: 125 PKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRC 181
G V +Y I T S R++ L +Y F C C C + +F+ + + RC
Sbjct: 551 RPGDTVAENYGPIFTKRSLEDRRRTLAARYWFRCECTACREDWPRFETLTNDMVR--LRC 608
Query: 182 KDDGCSGFLLRDSD-DKGFTCQQC 204
+GCS R D + C C
Sbjct: 609 PTEGCSKLHSRPRDPSRSIECSSC 632
>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
P131]
Length = 582
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299
Query: 148 LKEQYLFTCTCPRC 161
L++ F C C C
Sbjct: 300 LRQTQFFHCECTTC 313
>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
Y34]
Length = 581
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 42/74 (56%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 240 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 299
Query: 148 LKEQYLFTCTCPRC 161
L++ F C C C
Sbjct: 300 LRQTQFFHCECTTC 313
>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
Length = 385
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S +NHSC+PNA + F V+RA++ + E+ SY+E S +
Sbjct: 287 GSGLYILQSAVNHSCMPNASVEFPYSNNTLVLRAIRDIQPEEEICTSYLEECELERSRYS 346
Query: 144 RQKALKEQYLFTCTCPRC 161
RQKAL YLF C C +C
Sbjct: 347 RQKALSSLYLFVCHCDKC 364
>gi|281206858|gb|EFA81042.1| hypothetical protein PPL_05877 [Polysphondylium pallidum PN500]
Length = 425
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG------RLAVVRAVQHVPKGAEVLISYIETAGSTM 142
G GLYP+ S +NHSC PN + E V RA +++ G E+L +Y + T
Sbjct: 346 GIGLYPIFSCMNHSCQPNVEICNERTDGVTFNKVVFRAKKNIKAGQELLNNYCDVTLPTK 405
Query: 143 TRQKALKEQYLFTCTCPRC 161
RQ LK QY F C C +C
Sbjct: 406 ERQSQLKSQYDFICKCNKC 424
>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 643
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TGL+ + INHSC+PNA F G L V+R+++ + KG E+ SY E+ G RQ AL
Sbjct: 496 TGLWVRAAYINHSCVPNAKREFVGDLMVIRSLRKIKKGEEIFHSYDES-GDYEARQAALM 554
Query: 150 EQYLFTCTCPRC 161
+ F C C C
Sbjct: 555 TTWGFECGCALC 566
>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
[Oreochromis niloticus]
Length = 417
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 7/87 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L + A+ + +G E+ ISY++ S +
Sbjct: 303 GSGLFLLQSSCNHSCIPNAEASFPDNNFLLHLSALCDINQGEEICISYLDCCQRDRSRHS 362
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDI 170
R K L+E YLF C+CP+C+ Q D++
Sbjct: 363 RHKILRENYLFVCSCPKCV--SQMDEL 387
>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
Length = 392
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETA---GSTMT 143
G+GLY + S INHSC+PNA F + V++A+ + +G E+ ISY++ S +
Sbjct: 286 GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQEGDEICISYLDECMLERSRHS 345
Query: 144 RQKALKEQYLFTCTCPRC 161
R K L+E Y+F C C +C
Sbjct: 346 RHKVLRENYVFICQCSKC 363
>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 140
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 4/99 (4%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEV 130
+ CN I + + G+YP ++ NHSC PN + F G RA++ + G E+
Sbjct: 41 RFPCNNFAIVDDLWSGIAAGVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEI 100
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
SY+E + R+ L Y F C C RC G+ DD
Sbjct: 101 CHSYVELTLPSWKRRDVLLRDYEFLCECERC---GKRDD 136
>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
Length = 776
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/393 (21%), Positives = 158/393 (40%), Gaps = 73/393 (18%)
Query: 13 DNV-IPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ--------WPE--- 60
DN+ I T TD + + LV + + +D + YA A ++ L L+ P
Sbjct: 367 DNIQIYKTGTDPFYRMFNLVTNFNKMDNTDYIQYALTATMLTLYLENFTSFFDYLPSKMP 426
Query: 61 ISINE----------IAENFSKLACNAH-----------------TICNSELRPLGTGLY 93
S++E I + +L CN H TI E+R T +Y
Sbjct: 427 CSMSESQLKLFAAAVILRSMGQLVCNGHATLSLAVVEEDDGRNGKTITEKEVR-RATAIY 485
Query: 94 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQ 151
P +++NHSC PN + F +VR + +P G EV Y R+KALK Q
Sbjct: 486 PSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYGPHRARAPAAARRKALKAQ 545
Query: 152 YLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKE 211
Y+FTC C C + D + ++ Y C+ C G + C QC E
Sbjct: 546 YMFTCHCADCNDTERKDFV---SLFSAYLCQ--SCKGPVWAHCVRP--LCTQCRSALHLE 598
Query: 212 EIKKIASEVNILSKKTLALTSCGNH-QEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIK 270
+ + L+ + + S +++ ++Y +L+++++H +L ++L +
Sbjct: 599 RAHTLLDRADDLATQAEQVVSLEERCEKMAASY----RLKQQVWHRHHASLRMAADRLAR 654
Query: 271 ILMELEDWKEALAYCQLTIPVYQRVYPQF-----HPLLGLQYYTCGKL----------EW 315
+ + D+ +++ + I + + F H L L ++ EW
Sbjct: 655 LYADTGDFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSDVMLERILNSPQHLEYREW 714
Query: 316 FLGDTENAIKSMTEAVEILRITHGTNSPFMKEL 348
L A K + +A++++ + +G+ P + L
Sbjct: 715 CL----EAHKVVKKAIQLMELNYGSWEPLVSRL 743
>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 322
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTR 144
RP+G G+YP ++ NH C PNA F+ V V + V KG E+ I Y++ R
Sbjct: 241 RPIGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREER 300
Query: 145 QKALKEQYLFTCTCPRC 161
+ L++ + F C C RC
Sbjct: 301 RGKLRKNFAFECACARC 317
>gi|395329994|gb|EJF62379.1| hypothetical protein DICSQDRAFT_104880 [Dichomitus squalens
LYAD-421 SS1]
Length = 810
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
TGLYP+ S+ NHSC+ NA+ G + ++RA + VP G E+ I Y S RQ LK
Sbjct: 556 TGLYPLASLFNHSCVANAIWYCIGDVMIIRAAEPVPAGTEITIPY-SVEESYFARQSVLK 614
Query: 150 EQYLFTCTCPRC 161
+ L CTC C
Sbjct: 615 KHMLEHCTCWLC 626
>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
Length = 125
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 43/66 (65%)
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
+S++NHSC PN +VF G ++RAV+ + G E+ I Y++ ++ R+K L++QY F
Sbjct: 16 MSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75
Query: 156 CTCPRC 161
C C RC
Sbjct: 76 CDCFRC 81
>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
Length = 359
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 47/77 (61%), Gaps = 5/77 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF----EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
G G+Y V + NHSC PN +V+ + L+VV A++ + G E+ ISYI+ + S R
Sbjct: 257 GQGVYTVGCLFNHSCEPNLQVVYSESGDETLSVV-ALRDIEPGEELCISYIDESLSYPER 315
Query: 145 QKALKEQYLFTCTCPRC 161
Q+ L E YLF C CP+C
Sbjct: 316 QQELYEHYLFVCHCPKC 332
>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
Length = 389
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S +NHSC+PNAV+ F V++A++ + G E+ SY++ S +
Sbjct: 294 GSGLYVLQSSVNHSCVPNAVVEFPYSNNTLVLKAIRDIKVGEEICTSYLDECQLERSRHS 353
Query: 144 RQKALKEQYLFTCTCPRC 161
RQ+AL YLF C C +C
Sbjct: 354 RQQALSSLYLFVCHCDKC 371
>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 181
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 1/75 (1%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEG-RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
LG+GLY +++ NHSC PNA F G R V++++ V G EV SYI+ + RQ
Sbjct: 91 LGSGLYRGVAVTNHSCSPNAEASFGGSRCLRVKSLRPVHAGEEVFQSYIDENLPLVERQS 150
Query: 147 ALKEQYLFTCTCPRC 161
L++ Y F C C +C
Sbjct: 151 KLRQAYGFACRCGKC 165
>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
harrisii]
Length = 609
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 496 GSGLFVLQSCCNHSCVPNAETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERSRYS 555
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 556 RHKILRENYLFVCSCPKCL 574
>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
Length = 1033
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 11/156 (7%)
Query: 65 EIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQH 123
EI + L N H + ++ P ++ S +SCLPN F + ++ A +
Sbjct: 725 EIMKAVGALQINGHEVPTTD--PPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPRE 782
Query: 124 VPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD 183
+ K A + I Y + T RQ+ L + LF C C RC+ + + D SAI +C+D
Sbjct: 783 IKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTEL-DTNYSAI----KCED 837
Query: 184 DGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
C G +L D + + C++C K +++I
Sbjct: 838 RQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 873
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 61/127 (48%), Gaps = 16/127 (12%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GL+ + +INHSC PN G + VRA++ + +G E+ +SYI+ RQK
Sbjct: 162 AVGLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKE 221
Query: 148 LKEQYLFTCTCPRC-IKLGQFDDIQESAILEGYRCKDDGCS-------GFLLRDSDDKGF 199
L++ F C C RC L + D +L+G+RC CS +LL +DK
Sbjct: 222 LRDTKHFDCQCERCSTPLSESVD----RVLDGFRCP--RCSVKASEEENYLLAQVEDK-L 274
Query: 200 TCQQCGL 206
C C L
Sbjct: 275 VCPDCQL 281
>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 742
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 80/183 (43%), Gaps = 31/183 (16%)
Query: 18 STTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ--------WPEISI----NE 65
S N + L+ H+ D+ +L Y+ A L+ + LQ P + + NE
Sbjct: 355 SQANSNNDQIYNLLTHIDDLKSLELYQYSLTATLLLIYLQKKTDFFEKHPNLVLDSVGNE 414
Query: 66 IAENFSKLACNAHTICN---------------SELRP-LGTGLYPVISIINHSCLPNAVL 109
+ + ++L CN + I E +P +GT ++P S++NHSC PN
Sbjct: 415 LLHHMTRLVCNGNAISTHMLSDYDSGSRTPIIDESQPRIGTAIFPTSSLLNHSCDPNIFS 474
Query: 110 VFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
+ V++A + + +G E+ Y + RQ +LK QY F C C C+ Q
Sbjct: 475 SNILKYVVIKASRDISEGEEITNCYGPNFLRMRVVDRQASLKNQYHFDCECNTCLD-PQA 533
Query: 168 DDI 170
DD+
Sbjct: 534 DDL 536
>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
Length = 429
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 14/164 (8%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAV-QHVPKGA 128
F K++ N+ + NS P+G GL +S NHSC P + + R A++ V +P
Sbjct: 147 FQKVSINSFVVGNSTGNPIGVGLCIKLSAANHSCKPLTRVCYRNRTAMLVPVSSELPSTL 206
Query: 129 E-VLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCS 187
E SYI+ R+ LK++Y F C C C+ D +A +E + C C
Sbjct: 207 EGACHSYIDELMPRDMRRDTLKKKYKFLCQCDGCL------DEDRNARMEAWTC--GICV 258
Query: 188 GFLLRDSDDKGFTCQQCGLVRSKE--EIKKIASEVNILSKKTLA 229
+R+ ++ C+ CG SK+ E+ + A E I ++ LA
Sbjct: 259 KGWMRNKENG--QCELCGWTMSKDHFELCRTAEEAGIAARSRLA 300
>gi|344303810|gb|EGW34059.1| hypothetical protein SPAPADRAFT_65242 [Spathaspora passalidarum
NRRL Y-27907]
Length = 619
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 2/90 (2%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVP--KGAEVLIS 133
N + + + + P+G GL P S+INHSC PN +L + + V VP K +E+ ++
Sbjct: 226 NHNILLDQDNEPIGIGLDPDFSLINHSCAPNCILKHKQNWSGFELVNTVPIVKNSELTVT 285
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
YI+T R L+ +Y F C C C K
Sbjct: 286 YIDTCFPKELRMLDLRSRYFFNCKCELCSK 315
>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
Length = 374
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 7/113 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
G LYP + NHSCLPN +G+ +V A + + KG E +I+Y + T +RQ
Sbjct: 261 GMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQ 320
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKG 198
+EQ+ F CTC RC+K ++ +E+ + K + C SD++G
Sbjct: 321 NHTQEQFQFKCTCERCLK----EEAEENIEFQELELKLEPCPISSATTSDNEG 369
>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 737
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/288 (21%), Positives = 108/288 (37%), Gaps = 66/288 (22%)
Query: 72 KLACNAHTICNSELRP-------------LGTGLYPVISIINHSCLPNAVLVFEGRLAVV 118
+L CN H I + + T +YP S++NHSC PN + F+ + +V
Sbjct: 422 QLICNGHAITRLNISDSESGNVVTEYQCRIATAIYPSASMMNHSCDPNIINSFKDQYLIV 481
Query: 119 RAVQHVPKGAEVLISYIE--TAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
+A + + EV Y RQ AL+ QY FTC C
Sbjct: 482 KATKDIAAKEEVFNCYGPHYRRMRKKDRQIALQNQYCFTCEC------------------ 523
Query: 177 EGYRCKDDGCSGFLLRDSDDK--GFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCG 234
+ C+ L++ DK F C++C V I+S ++ C
Sbjct: 524 -------EACTQRALQNFSDKFQRFNCEECN------------GPVEIISHSSMRCLDCE 564
Query: 235 NHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQR 294
++V + + + KL+ +NL + KEAL + + + +R
Sbjct: 565 TTFDLVKSQLLELEEANKLFEAAKINLKSQKV------------KEALENAKQCLEIRKR 612
Query: 295 VYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEAVEILRITHGTNS 342
+ ++H + L Y GK+ G ++I + ++ + G +S
Sbjct: 613 ILYEYHESVTLTYDLIGKIFAVTGRWLDSISHLEHSLAAVEERFGPDS 660
>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
dendrobatidis JAM81]
Length = 227
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 37/75 (49%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
+G ++P S NHSC PN + R ++ V KG + ISYI+T R+
Sbjct: 96 MGRAVFPAASYFNHSCFPNCQSIKHDHKMAFRTLKDVSKGEMLTISYIDTNMPVSARRAR 155
Query: 148 LKEQYLFTCTCPRCI 162
L + Y F C C RCI
Sbjct: 156 LMDDYFFECMCERCI 170
>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
B]
Length = 454
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 6/93 (6%)
Query: 76 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF-----EGRLAVVRAVQHVPKGAE 129
N + I +S L P G++P+ S NHSC+PNAV+ + E V A++ + +G E
Sbjct: 184 NNNFIVHSHLYPYAHGIFPLASRTFNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEE 243
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
++I Y++ A S R+ AL+ Y F C+C C+
Sbjct: 244 IVIPYLDPALSYAARRDALQTNYGFICSCALCV 276
>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
Length = 418
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+G + + S NHSC+PNA F L V A++ + G E+ ISY++ S +
Sbjct: 304 GSGFFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFVCSCPKCL 382
>gi|159465319|ref|XP_001690870.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279556|gb|EDP05316.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1470
Score = 61.6 bits (148), Expect = 5e-07, Method: Composition-based stats.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 19/126 (15%)
Query: 61 ISINEIAENFSKLACNAHTICNSELR---PLG-TGLYPVISIINHSCLPNAVLVFEGRLA 116
+++N + E+F LA +C ELR P G GL+P + HSC P A G
Sbjct: 1174 VNLNCMGEDFQDLA-----LC--ELRGEAPRGHIGLWPEAAFAAHSCAPTATAYSIGDRL 1226
Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMT-----RQKALKEQYLFTCTCPRCIKLGQFDDIQ 171
++RA +PKG EV ++++ GS +T R+ L+ QY FTC C RC +
Sbjct: 1227 LIRAAAEIPKGGEVSLNFL---GSLLTSPLSVRRAELRSQYGFTCGCSRCAAEARHTGTP 1283
Query: 172 ESAILE 177
+A++E
Sbjct: 1284 LAALVE 1289
>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
Length = 660
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y I T RQ +KE Y F C C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREDRQAKMKELYWFECCCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
F+D+ I +RC+ + C+ + +D C CG V + + K+ +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G++ + + + +I + L PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636
>gi|159110558|ref|XP_001705534.1| Hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
gi|157433620|gb|EDO77860.1| hypothetical protein GL50803_17036 [Giardia lamblia ATCC 50803]
Length = 409
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 8/104 (7%)
Query: 67 AENFSKLAC----NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAVV 118
A+ F+ C N+++I + + +G GLY +IS NHSC PNA ++F R A +
Sbjct: 300 AQMFNYFVCILLTNSYSIEDKTGQEIGAGLYSLISCCNHSCTPNAQVIFGDSENAREATL 359
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
++ + E+ ISYI G ++ ++ Q+ FTC C RC+
Sbjct: 360 VLLRPCAQKEELYISYITDLGRSVVERRRELAQWCFTCQCTRCL 403
>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
Length = 660
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 95/219 (43%), Gaps = 18/219 (8%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
I +LV LQ+ + + +E+AE FS +G +YP +++ NHSC P
Sbjct: 427 ICSLVLRSLQFIQFNTHEVAELHKFSSTGREKSIF-------IGGAIYPTLALFNHSCDP 479
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCI- 162
V F G + +V+ + G + +Y I T RQ +KE Y F C C CI
Sbjct: 480 GVVRYFRGTTIHINSVRPIEAGLPINENYGPIYTQDKREERQAKMKELYWFECCCDACID 539
Query: 163 KLGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTCQQCGLVRSKEEIKKIASE 219
F+D+ I +RC+ + C+ + +D C CG V + + K+ +
Sbjct: 540 NWPLFEDLPRDVI--RFRCEAPNNCAAVIEVPPSCNDFMVKCVTCGEVTNILKGLKVMQD 597
Query: 220 VNILSKKTLALTSCGNHQEVVSTY-KMIEKLQKKLYHPF 257
++++ L G++ + + + +I + L PF
Sbjct: 598 TEMMTRTGKRLYETGDYAKALQKFIDLIRIMYDVLAPPF 636
>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
Length = 575
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)
Query: 92 LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LYP ++ H CL N V F+ G VRA + KG + Y T R++
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 201
LKE F+CTC RC D + + L RC D C G+ L D+ + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366
Query: 202 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
+C + + E I +I EV+ + + + N ++S KM+ L YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421
Query: 258 SV 259
SV
Sbjct: 422 SV 423
>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
Length = 268
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 39/77 (50%), Gaps = 7/77 (9%)
Query: 92 LYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMTR 144
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY R
Sbjct: 79 VYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADR 138
Query: 145 QKALKEQYLFTCTCPRC 161
Q+ L E Y F C C RC
Sbjct: 139 QRRLLEDYGFRCECDRC 155
>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 627
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 5/100 (5%)
Query: 69 NFSKLACNAHTICNSELR-PLGTGLYPVISIINHSCLPNAVL--VFEGRLAVVRAVQHVP 125
N KL NA +I + + PL LYP IS+ NHSC N V R+ V++AVQ +P
Sbjct: 441 NARKLNINAPSISHQQFTFPLALTLYPTISLFNHSCDGNIKRSGVISDRIRVMKAVQPIP 500
Query: 126 KGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 163
KG ++ +Y + +RQ A +++ F C C CIK
Sbjct: 501 KGTQLCCNYGIMFKEHDKESRQSACNDRFNFNCYCDPCIK 540
>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Otolemur garnettii]
Length = 416
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F V A++ + G E+ ISY++ S +
Sbjct: 304 GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERSRHS 363
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 364 RHKILRENYLFICSCPKCL 382
>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
Length = 387
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIET 137
+ S L G+ LY + S +NHSC PNA F + + VQ + G E+ ISY++
Sbjct: 276 VVGSFLNNEGSALYSLQSKVNHSCAPNAECQFPHSNNVLALTTVQDIAAGEEICISYLDE 335
Query: 138 AG---STMTRQKALKEQYLFTCTCPRC 161
S +RQK L+E YLF C C +C
Sbjct: 336 CALERSRHSRQKMLRENYLFQCQCEKC 362
>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
Length = 666
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 8/194 (4%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQ 145
+G +YP +++ NHSC P V F G + V+ V G + +Y I T RQ
Sbjct: 468 IGGAIYPTLALFNHSCDPGVVRYFRGNTIHINTVRPVEAGLPINENYGPIYTQDKREDRQ 527
Query: 146 KALKEQYLFTCTCPRCIK-LGQFDDIQESAILEGYRCK-DDGCSGFL--LRDSDDKGFTC 201
LKE Y F C C C++ FDD+ I +RC+ + C+ + +D C
Sbjct: 528 ARLKELYWFECNCDACLENWPLFDDLPRDII--RFRCEAPNNCTAVIEVPPSCNDFMIKC 585
Query: 202 QQCGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNL 261
CG + + + K+ + ++++ L G++ + ++ + + ++ ++ P +
Sbjct: 586 VTCGELTNILKGLKVMQDTEMMTRTAKRLYDTGDYAKALNKFVDLLRIMYEVLAPPFPDF 645
Query: 262 MQTREKLIKILMEL 275
+ ++ L + L
Sbjct: 646 CECQQHLKDCFLNL 659
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 12/101 (11%)
Query: 73 LACNAHTICNSELRP----------LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQ 122
CNAH + L P LG G+Y +S+ NHSC P F G VVRA++
Sbjct: 377 FPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFYGDTCVVRAIK 436
Query: 123 HVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRC 161
+ KG +V +Y + + R L+ QY F+C C C
Sbjct: 437 TIRKGHQVSDNYGALYATNTLKERHDKLQPQYFFSCRCEPC 477
>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
intestinalis]
Length = 773
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 72/154 (46%), Gaps = 8/154 (5%)
Query: 34 MSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENF-SKLACNAHTICNSELRPLGTGL 92
M+ I+ QL ++ + L + ++ IN + +F ++ C+ T+ ++ + +
Sbjct: 440 MTKIEPGQLTSMGEVIEYL-LHRHYLQVPINGQSISFVTEELCDNVTVTRRDI--VASAF 496
Query: 93 YPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETA--GSTMTRQKALKE 150
+P +S++NHSC N +F G R+ Q +P GAE+ Y + S RQK LKE
Sbjct: 497 FPTMSMMNHSCDCNTDALFNGSTVTFRSNQFIPVGAEITHCYGPSVFHASFEERQKTLKE 556
Query: 151 QYLFTCTCPRCI--KLGQFDDIQESAILEGYRCK 182
Y F C C C K G D L +CK
Sbjct: 557 NYSFDCDCTPCAMHKSGNTDKEALHTQLRAMKCK 590
>gi|361129398|gb|EHL01305.1| putative SET domain-containing protein 5 [Glarea lozoyensis 74030]
Length = 286
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 11/103 (10%)
Query: 69 NFSKLACNAHTICNSELRPLGT-----GLYPVISIINHSCLPNAVLVFEGRLA-----VV 118
NF +I S PLG G++ IS INH CLPNAV + G L V
Sbjct: 49 NFPGQGNPLTSIIRSNGYPLGADAGCGGVFENISRINHGCLPNAVQNWNGLLGEEGEETV 108
Query: 119 RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++ + +G E+ SY+ + G++ R+ LK+ + F CTC C
Sbjct: 109 YAIKDIKEGEEITTSYL-SGGTSKERRAVLKQSFGFDCTCKLC 150
>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Length = 563
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 77/182 (42%), Gaps = 24/182 (13%)
Query: 92 LYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LYP ++ H CL N V F+ G VRA + KG + Y T R++
Sbjct: 252 LYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREH 311
Query: 148 LKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKD---DGCSGFLL---RDSDDKGFTC 201
LKE F+CTC RC D + + L RC D C G+ L D+ + C
Sbjct: 312 LKETKYFSCTCKRCK-----DPTEMGSYLSALRCLGTGVDSCDGYQLPADPTDDNTQWLC 366
Query: 202 QQCGLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPF 257
+C + + E I +I EV+ + + + N ++S KM+ L YH +
Sbjct: 367 NKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDN---LLS--KMLTFLHPNHYHVY 421
Query: 258 SV 259
SV
Sbjct: 422 SV 423
>gi|320168174|gb|EFW45073.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 291
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 46/76 (60%), Gaps = 4/76 (5%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR---LAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
G+GL+ + + +NHSC PN V + L+VV AV +P EV ISYI+T RQ
Sbjct: 213 GSGLFEIGNTMNHSCQPNVVSMTRATDFTLSVV-AVATIPVNTEVCISYIDTDLPKAKRQ 271
Query: 146 KALKEQYLFTCTCPRC 161
AL+E Y F+C+C +C
Sbjct: 272 AALEELYYFSCSCAKC 287
>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
Length = 666
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 9/235 (3%)
Query: 48 IANLVNLILQWPEISINEIAE--NFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
IA+L+ LQ + + +EI+E N ++ C+ + +G GLYP +++ NHSC P
Sbjct: 423 IASLILRHLQILQFNSHEISELRNLNEEMVTNGIQCHYKSEYIGAGLYPTLALFNHSCDP 482
Query: 106 NAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+ V G +VR ++ + G + +Y + T+ R+ L+ +Y F C C C +
Sbjct: 483 SIVRYNIGNRMIVRTIKPIKAGEIIYENYGPLYTSMDADERRVTLQNRYWFECYCTPCQQ 542
Query: 164 -LGQFDDIQESAILEGYRCKDDGCSGFLLRDSDD--KGFTCQQCGLVRSKEEIKKIASEV 220
F+ + + I G C+ + C DD F C C V K S++
Sbjct: 543 EWPLFEYMDPNQIKIG--CQKENCPFEFTLYKDDLCPYFQCDYCDGVTKIFPSLKGLSQL 600
Query: 221 NILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMEL 275
I+ K L S G +E + + + K P ++++ +++L + + L
Sbjct: 601 AIMLPKAEDLYSAGETREAMKLFMQSLDILYKYSRPPCPDMIKVQQRLKTLFVHL 655
>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
Length = 439
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 26/149 (17%)
Query: 73 LACNAH----TICNSELRPLGTGLYPVISIINHSCLPNAVLVFE----GRLAVVRAVQHV 124
L CN H T+ +G+ +Y S+ NHSC PN + G L ++ ++ +
Sbjct: 261 LECNTHEIGITVDEYNYCSIGSAVYEKASLFNHSCQPNVCRINRAGEWGALEMI-SLTDI 319
Query: 125 PKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDI-------------- 170
G E++ +YI+ + T RQ L E Y F C C C+
Sbjct: 320 AAGTELVYNYIQISLPTEDRQSKLSENYFFECKCNGCVNNTSISSRQKQQKQQQKQRQQK 379
Query: 171 ---QESAILEGYRCKDDGCSGFLLRDSDD 196
+ + L+ Y C D C+G L+ D++D
Sbjct: 380 QSNSQKSFLKKYLCTRDTCNGVLITDAND 408
>gi|406863858|gb|EKD16905.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 739
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 6/91 (6%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
+GL+ + S INHSC PN F G + +VRA + +P E+ +SYIET Q+ L
Sbjct: 508 SGLWILASYINHSCEPNCRRAFIGDVQIVRAARDMPADTEITLSYIETDDPAKMNQR-LF 566
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ + F CTC C+ DD + A ++ R
Sbjct: 567 DGWGFDCTCAMCV-----DDRETPAAVKNRR 592
>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
Length = 513
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 59/271 (21%), Positives = 109/271 (40%), Gaps = 35/271 (12%)
Query: 91 GLYPVISIINHSCLPN-----AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
+YP ++++NHSC+ N + ++ RL R +P G E+ SY + TM R+
Sbjct: 216 AIYPTVALMNHSCISNTCHSISPTDYKIRL---RTTLKIPVGGELYGSYTHSLLPTMLRR 272
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCK--DDGCSGFLLRDSDDKGFTCQQ 203
+ L E F C C RC D + + +C D+G L + + C
Sbjct: 273 EHLLEGKNFACACARC-----SDPTELGTHMSSLKCNKCDNGIVLSLDSLDPESSWKCTH 327
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
C S ++K+ + + A++ + + + + + K + + HP L
Sbjct: 328 CEFSTSGSAVRKVLQIIQMEVDAVEAISG-ADGADAIQERETVAKKYRSVLHPRHAFLTM 386
Query: 264 TREKLIKILMELEDW----------KEALAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKL 313
R L ++ +ED+ + + C+L + V + P + + G+ Y
Sbjct: 387 LRHSLTQMYGRVEDYLLEDLPDVVLEHKIDMCRLLLQVLDVIEPGYSRIRGMTLYELHAP 446
Query: 314 EWFLGDTE--------NAIKS-MTEAVEILR 335
F T+ A+KS M EA +IL+
Sbjct: 447 LLFTAKTQWNAEVIDKAALKSKMIEASKILK 477
>gi|391334364|ref|XP_003741575.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Metaseiulus occidentalis]
Length = 434
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 69/141 (48%), Gaps = 5/141 (3%)
Query: 27 VEALVAHMSDIDEKQLLLYAQIA-NLVNLILQWPEISINEIAENFSKLACNAHTICNSEL 85
+ L H+ D+ E L + A + +L+ + ++E+A +L N + +
Sbjct: 112 ISDLETHLEDLSEDDRLEVQEKAQRIASLMRAFTNTEVHEVALMLQRLRINMFQLSDHRA 171
Query: 86 RPLGTGLYPVISIINHSC--LPNAVLVFEGRLAVVRAVQH--VPKGAEVLISYIETAGST 141
G Y IS+++H+C N VL F+GR ++RA+++ V + I+YI+ T
Sbjct: 172 VTKGIACYLGISVVDHTCEDSGNFVLAFKGREIILRALKNFTVQDMGDFRINYIDATIPT 231
Query: 142 MTRQKALKEQYLFTCTCPRCI 162
R++AL Y F C C +C+
Sbjct: 232 EERRRALLAGYFFACQCAKCL 252
>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
Length = 690
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 25 SLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTIC--- 81
++V+ L+ + S + E+ L L + +++ E+ + + + + CNA +
Sbjct: 439 AVVQKLLNNPSQV-ERILGLQGDYKGIGQKLIELDEVPVIDTFQIHDIVQCNAFGLGQQT 497
Query: 82 -NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGS 140
+ ++ TGL+ S INHSC+PNA G L + RA + + G E+ +Y E+ S
Sbjct: 498 EDEDISNASTGLWVRASYINHSCIPNAKKDLIGDLILFRATRRIASGEEITHAYDEST-S 556
Query: 141 TMTRQKALKEQYLFTCTCPRCI 162
RQ A + + F C CP C+
Sbjct: 557 YEARQAAFRRTWNFECRCPLCL 578
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 11/173 (6%)
Query: 48 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
I + + + + EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
F + ++ A + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRCVDVTE 306
Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354
>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
Length = 382
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAG---STMT 143
G+GLY + S NHSC PNA F G + ++A++++ G E+ ISY++ S +
Sbjct: 283 GSGLYQLQSACNHSCAPNAESSFPYGNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHS 342
Query: 144 RQKALKEQYLFTCTCPRCI 162
RQ+ L E YLF C C RC+
Sbjct: 343 RQRELSENYLFVCCCERCM 361
>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
Length = 430
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 8/117 (6%)
Query: 75 CNAHTICNSE-LRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLIS 133
CN+ +I N + + P+G+GLY ++ NHSC + +VFEG +R Q E+ IS
Sbjct: 159 CNSFSIINEKRVEPIGSGLYVGVAKHNHSCASTSHVVFEGNQVFLRTNQE-EYSKELTIS 217
Query: 134 YIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
Y+ T R+K ++ + TC C C K + D I S+ +C+ C+GF+
Sbjct: 218 YVSRMLPTSERRKTIRGVHFLTCQCEMC-KNEELDLIGLSS-----KCRTKNCTGFV 268
>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
Length = 513
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 48 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
I + + + + EI + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEEIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306
Query: 167 FDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
D SAI +C+D C G +L D+ + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADEWNGSWRCRECHKQVQKHYVERI 354
>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
vinifera]
Length = 405
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294
>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 691
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 84 ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGST 141
++ + TG+YP S++NHSC PN + ++ + +VRAV+ + E+ SY+ T T
Sbjct: 403 QINIVATGIYPSASMMNHSCNPNIIKIYMDQYLIVRAVEDIFPTEEIFNSYVATYRYKKT 462
Query: 142 MTRQKALKEQYLFTCTCPRC 161
RQK L E Y F+C C C
Sbjct: 463 KARQKLL-ELYYFSCKCEAC 481
>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 712
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)
Query: 75 CNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLV-FEGRLAVVRAVQHVPKGAEVLIS 133
CN + +S+ +G ++P S NHSCLPN V + G +A A+Q + KG + I
Sbjct: 559 CNNFGLFSSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIAAFYALQSIRKGEPLTIC 618
Query: 134 YIET-AGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
Y++ ST R++ L Y F C C RC D ++ + E C G+L
Sbjct: 619 YVDVQEASTAERRRTLLTSYRFFCECRRCHGYSDEDGKGDTVMSEIRFCDTCDARGYL 676
>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
Length = 527
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 21/192 (10%)
Query: 49 ANLVNLI------LQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
ANL++ I +WP++ I IA + L NA + + R GL+P ++ +H
Sbjct: 200 ANLISFIKTVLGLREWPDLDILRIA---AILDTNAFEVRQAGDRIKVRGLFPGGAMFSHD 256
Query: 103 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
C+PN F+ + ++ A + + KG + ISY + ST+ R+ LK+ F C C RC
Sbjct: 257 CVPNMRHRFDDDMNIMFLAKRPIAKGEILSISYTQQLRSTIQRRLHLKQVKCFECACARC 316
Query: 162 IKLGQFDDIQESAILEGYRCKDDGCSGFLLRD--SDDKGFTCQQCGLVRSKEEI----KK 215
D + + C F+ D + + C+ C L+R +E K
Sbjct: 317 A-----DPTELGTYAGAHMCAKCKVGKFISMDPLQNAANWRCEVCNLIRPAKEYLINDAK 371
Query: 216 IASEVNILSKKT 227
I +E+ L K T
Sbjct: 372 IEAELESLDKTT 383
>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
Length = 417
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 91 GLYPVISIINHSCLPNAVLVFE-GRLA-VVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
++P S +NH+C PNA V + L+ VR + + +G E+ ISY T TRQ L
Sbjct: 234 AVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHL 293
Query: 149 KEQYLFTCTCPRC------IKLGQFDDIQ 171
+ + FTCTCPRC I L +D+Q
Sbjct: 294 AQGFHFTCTCPRCTSPITDITLSIINDLQ 322
>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 976
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 761 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKAGEEITFAYFDVL-SPLEKRKEM 819
Query: 149 KEQYLFTCTCPRC 161
E + F C C RC
Sbjct: 820 AESWGFRCGCSRC 832
>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
Length = 405
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 217 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 276
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y FTC C RC
Sbjct: 277 RQKRLLEDYGFTCYCDRC 294
>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
Length = 404
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 65/180 (36%), Gaps = 57/180 (31%)
Query: 91 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 143
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273
Query: 144 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 167
RQ+ L E Y F C C RC Q+
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEG 333
Query: 168 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 213
DD + Y C ++GC G L L S + F C CG +R +E++
Sbjct: 334 DGATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393
>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
Length = 704
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 80/174 (45%), Gaps = 27/174 (15%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQLL------LYAQIANLVNLILQWPEISINE---- 65
+ S + +Y V +L +H +D D L ++ + + +L PEI +E
Sbjct: 420 LASYESRDYKTVFSLESHCADADPADNLQRAIHSVFFAKSLIYSLNYFHPEIDNHERHLH 479
Query: 66 -----IAENFSKLACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-V 110
+ N + CNA+ I + E R +G +Y +S+ NHSC PN V
Sbjct: 480 ILAVALLHNMQAIKCNAYEIVENVRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRHS 539
Query: 111 FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCI 162
F VV +++++P+G+E+L Y + +R++ L E+Y F C C C+
Sbjct: 540 FPNGTVVVTSLRYIPEGSEILDCYGQHFLENKRDSRRRLLAEKYYFDCQCEPCL 593
>gi|145347416|ref|XP_001418163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578392|gb|ABO96456.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 639
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 42/67 (62%)
Query: 70 FSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAE 129
S+ N TI + +++ +G G+YP S+ NHS PNA ++F+G+ VV+ ++ + G E
Sbjct: 218 LSRFEINGFTIADDDMQRVGFGIYPEASLFNHSSTPNAQVMFKGKTLVVKTLREIAVGEE 277
Query: 130 VLISYIE 136
+ ISY E
Sbjct: 278 ITISYGE 284
>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
Length = 398
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAV--VRAVQHVPKGAEVLISYIETA-GSTMTRQ 145
G+GLY + S NH C PN + F A V+A++++ +G E+ ISYI++ + RQ
Sbjct: 298 GSGLYLLQSCCNHDCSPNVEINFLDNNATLTVKAIRNISEGQELCISYIDSDIKNWKKRQ 357
Query: 146 KALKEQYLFTCTCPRCI 162
L E YLF CTC RC+
Sbjct: 358 AILMENYLFECTCNRCM 374
>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
Length = 548
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 1/90 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
KL NA + G L +++INHSC+PNA++ F GR A +R+ + G+E+
Sbjct: 183 KLQTNAFSRSEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRSASFIHPGSEIE 242
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
ISYI+ RQ L Y F C C +C
Sbjct: 243 ISYIDQTQPKSRRQHELS-LYHFECHCSKC 271
>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 499
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 19/167 (11%)
Query: 57 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF--EGR 114
+WP ++EI + N H + ++ P +Y S++ HSC+PN F +G
Sbjct: 168 EWP---VDEILRVCGIVQVNGHEVPLTD--PPYVAIYDAGSMLEHSCVPNCSKSFTRDGH 222
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESA 174
L + A V G + ISY + T R L + F C CPRC D +
Sbjct: 223 LLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHLADTKFFVCKCPRC-----SDPTELGT 277
Query: 175 ILEGYRCKDDGCSGFLL-------RDSDDKGFTCQQCGLVRSKEEIK 214
G +C + C G+ L D D + C C V I+
Sbjct: 278 YFSGVKCATEDCMGYSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIE 324
>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
Length = 490
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 9/145 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLYP+ ++ NH C+PN+ F+ + + V A + G E+ +SY + T TR++ L+
Sbjct: 216 GLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPIAAGEEITMSYTDLLWDTSTRRQFLR 275
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGL 206
F+C C RC D ++ + L C D C G LL + + + C +C
Sbjct: 276 ITKRFSCNCNRC-----SDPLEFGSRLSALLCAKDECLGHLLPRNPLNYESSWICDKCQT 330
Query: 207 VRSKEEIKKIASEVNILSKKTLALT 231
+ +I+ I S +N + T
Sbjct: 331 SVNHRQIECIHSGLNTFVSNVMYKT 355
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 214 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGREVCLSYFPVNMNYSS 273
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y F C C RC
Sbjct: 274 RQKRLLEDYGFKCDCDRC 291
>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 779
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 74 ACNAHTICNSELRPLGT-------GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
+C T+ N+ RP G G++P +I+NH+C+ N+ F G L + RA + +P
Sbjct: 459 SCPPLTVDNTRPRPEGVPEPPQTRGIWPHAAIMNHACVANSTRAFCGDLFITRASRDIPA 518
Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G E+ Y+ RQ L+ + F C C C
Sbjct: 519 GDEITQQYVPVRADWGERQAQLRHWWGFECACALC 553
>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 647
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 11/102 (10%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF----EGRLAV-------VR 119
S+ N+ T+ + L +G P + INHSC PNAV+VF EG + V
Sbjct: 277 SRFIDNSFTLTSIILDQIGVVFVPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVI 336
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++ + G E++ SYI++AG+ R+ L ++Y F C C C
Sbjct: 337 AIKPIEPGEEIVTSYIDSAGTRQERRNELVKRYKFVCDCQAC 378
>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
Length = 383
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
G+ Y + S++NHSC P+ + F +A A +H+ +G ++ +SY++ R++
Sbjct: 300 GSAAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQ 359
Query: 147 ALKEQYLFTCTCPRCIK 163
AL Y FTC CPRC++
Sbjct: 360 ALAWAYGFTCRCPRCVE 376
>gi|393233731|gb|EJD41300.1| hypothetical protein AURDEDRAFT_186645 [Auricularia delicata
TFB-10046 SS5]
Length = 293
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)
Query: 76 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 128
N + + +S L PL +YP S +NHSC NAV +F R+ VV V+ V G
Sbjct: 83 NNNFVLHSHLTPLAAAVYPAASRSLNHSCASNAVPLFVFAPATPPRMEVV-LVRDVAPGD 141
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
E+ I YI+ A + R++ L+ Y F C C RCI
Sbjct: 142 EITIPYIDPALAPSARRERLRASYGFECACARCI 175
>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
Length = 507
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)
Query: 92 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
K+ FTC C RC+ +LG F ++ EGY+ D ++C C
Sbjct: 267 KQGKFFTCQCDRCLDPTELGTHFSSLKCGQCAEGYQVPRQPTEA-------DTSWSCASC 319
Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G S E ++ + SEVN + + + ++ Y K L HP
Sbjct: 320 GSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEEVERLLRKY-------KSLLHPLHFI 372
Query: 261 LMQTREKLIKILMELEDWK 279
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Apis mellifera]
Length = 444
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE--TAGSTMTRQ 145
+ TG+YP SI+NHSC PN + +F + +VRA + + +G E+ Y +T RQ
Sbjct: 373 VATGIYPSASIMNHSCDPNIINIFVNQYLIVRASRDISQGEEIFNCYGPHYRHMTTENRQ 432
Query: 146 KALKEQYLFTC 156
K LK QY F C
Sbjct: 433 KILKNQYCFIC 443
>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
TFB-10046 SS5]
Length = 373
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 50/94 (53%), Gaps = 8/94 (8%)
Query: 76 NAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVRAVQHVPKGA 128
N + + +S L PL +YP S +NHSC NAV +F R+ VV V+ V G
Sbjct: 163 NNNFVLHSHLTPLAAAVYPAASRALNHSCATNAVPLFVFAPATPPRMEVV-LVRDVAPGD 221
Query: 129 EVLISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
E+ I YI+ A + R++ L+ Y F C C RCI
Sbjct: 222 EITIPYIDPALAPSARRERLRASYGFECACARCI 255
>gi|256071345|ref|XP_002572001.1| set and mynd domain containing [Schistosoma mansoni]
gi|350645143|emb|CCD60151.1| set and mynd domain containing, putative [Schistosoma mansoni]
Length = 527
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY---IE 136
IC + + TGL+P +S++NHSC PN FE ++R ++ + G EV Y
Sbjct: 384 ICKFQQIRIATGLFPCVSLLNHSCDPNTAHNFEESFLILRCLKPILPGTEVFHCYGPHYL 443
Query: 137 TAGSTMTRQKALKEQYLFTCTCPRCIK 163
S++ R L++QY F C C C K
Sbjct: 444 HYPSSVQRVTLLQQQYFFICDCEHCSK 470
>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 101/228 (44%), Gaps = 28/228 (12%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENF 70
+ S+ + ++ + AL H+ + L +L A + + IL W E+ I IA
Sbjct: 169 LKSSDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA--- 225
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAE 129
+ L N + R L+P ++I+H C PN F+ + ++ A + + KG
Sbjct: 226 AILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEI 285
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGC 186
+ ISY + ST+ R+ L++ F C C RC +LG F Q +CK
Sbjct: 286 LSISYTQPLRSTIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTCV-----KCK---- 336
Query: 187 SGFLLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
+G ++ + + + CQ C L RS +E+ K+ E+ L K T
Sbjct: 337 AGKIISVNPLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384
>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
Length = 404
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 65/179 (36%), Gaps = 57/179 (31%)
Query: 92 LYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMTR 144
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY R
Sbjct: 215 VYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADR 274
Query: 145 QKALKEQYLFTCTCPRCIKLGQF------------------------------------- 167
Q+ L E Y F C C RC Q+
Sbjct: 275 QRRLLEDYGFRCECDRCQVESQWKDDDDNNGDDGDDTMEEEDEDDAGDRGDDGIEEEEGD 334
Query: 168 -------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEEI 213
DD + Y C ++GC G L L S + F C CG +R +E++
Sbjct: 335 GATNGGDDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDV 393
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)
Query: 46 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
A I + + + S E+ L N H + S+ P ++ S HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244
Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
N F + ++ A + + K + + I Y + T RQ+ L + LF C C RC +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
+ SAI +C+D C G +L D + C++C K I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 11/175 (6%)
Query: 46 AQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLP 105
A I + + + S E+ L N H + S+ P ++ S HSC+P
Sbjct: 187 ASIGQFIPKFFKTEKFSEEEVMRAVGALQINGHEVPTSD--PPHVAVFYTASFTEHSCMP 244
Query: 106 NAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKL 164
N F + ++ A + + K + + I Y + T RQ+ L + LF C C RC +
Sbjct: 245 NLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDV 304
Query: 165 GQFDDIQESAILEGYRCKDDGCSGFLLRDSDDK---GFTCQQCGLVRSKEEIKKI 216
+ SAI +C+D C G +L D + C++C K I +I
Sbjct: 305 TEL-GTNYSAI----KCEDRRCDGLMLPSKTDDWNGSWKCRECQKQVQKHYIDQI 354
>gi|302836969|ref|XP_002950044.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
nagariensis]
gi|300264517|gb|EFJ48712.1| hypothetical protein VOLCADRAFT_104571 [Volvox carteri f.
nagariensis]
Length = 430
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
GL+P ++ NHSC PNAV V GR +R V G EV I+Y G R+ L
Sbjct: 286 GLWPAHAMFNHSCAPNAVAVVAGRELHIRCSGPVSAGEEVCITYSGALGLGPLPLRRALL 345
Query: 149 KEQYLFTCTCPRC 161
++ + F CTCPRC
Sbjct: 346 EKNHRFRCTCPRC 358
>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
Length = 786
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 84/309 (27%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
GTGLY + S NHSC+PNA F L V+RA Q + + E+ ISY A S +R+K L
Sbjct: 522 GTGLYLLASYFNHSCIPNAGHQFFSDLMVIRATQSIKQDEEITISYCSHA-SYASREKNL 580
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
K + C C CI DD +G R
Sbjct: 581 KP-WFDQCDCQLCI--------------------DDRLAGSNRRS--------------- 604
Query: 209 SKEEIKK-IASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKL---YHPFSVNLMQT 264
++E + K + + L + AL Q++VSTY E+++ +L YH FS L Q
Sbjct: 605 ARESLSKAVRNAATPLPQARDAL------QKMVSTYSSQERVRPELSTAYHAFSHRLQQM 658
Query: 265 REKLIKILMELE---------DW----------KEALAYCQL-TIPVYQRVYP------- 297
+ + E +W E + L T+P+ R P
Sbjct: 659 AHEFVSKAAATEAIEKEIQSLEWLGVEVTEKGITEVSDHSHLPTLPISTRRIPNTFGEPD 718
Query: 298 -QFHPLLGLQYYTCG----KLEWFLGDTENAIKSMTEAVEILRITHG----TNSPFMKEL 348
QF + G+ + G + WF + K++ + + + TH P + E
Sbjct: 719 MQFLMIFGM-FVLLGCPNRAMSWFRAAIWMSEKTIGGGIPMFKRTHSDIIRVLGPVLAEF 777
Query: 349 ILKLEEAQA 357
+ K+E +A
Sbjct: 778 LSKIESERA 786
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 1/92 (1%)
Query: 72 KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVL 131
KL NA + G L +++INHSC+PNA++ F GR A +RA + G E+
Sbjct: 183 KLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIE 242
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIK 163
ISYI+ R L + Y F C+C +C K
Sbjct: 243 ISYIDQTQPRGKRHGEL-DLYHFECSCYKCQK 273
>gi|281212379|gb|EFA86539.1| hypothetical protein PPL_00338 [Polysphondylium pallidum PN500]
Length = 468
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 3/75 (4%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE--GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
G G++ + ++INHSC+PN ++VFE LA ++A++ + G E+ SYIE + R +
Sbjct: 373 GFGVFGLQAMINHSCVPNCLVVFEQGSSLAYIKALRDIVPGEELFHSYIEESAPFEERSQ 432
Query: 147 ALKEQYLFTCTCPRC 161
L Y F C CP+C
Sbjct: 433 ELV-TYGFNCDCPKC 446
>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
Full=ASH1-related protein 2; AltName: Full=Protein SET
DOMAIN GROUP 39
gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 398
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296
>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
Length = 445
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 104/231 (45%), Gaps = 29/231 (12%)
Query: 65 EIAENFSKLACNAHTICNS-------ELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
E +N +A +CNS + P+G+G+Y ++ NHSC +VF+G AV
Sbjct: 163 ESMDNNKMIAIICSVLCNSFGIIAEKRVEPIGSGMYVGLATHNHSCASTCHVVFDGNQAV 222
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILE 177
+R+ + ISY+ T RQK+++ + TC C C+ +D+ + +
Sbjct: 223 LRS-RDRQYCKNTTISYVSRMLPTFERQKSIRNVHFITCRCEMCLN----EDLDLTGL-- 275
Query: 178 GYRCKDDGCSGFLLRDSDDKGFTCQQCGLV---RSKEEIKKIASEVNILSKKTLALTSCG 234
+C+ C GF ++ ++ TC + ++ +S + K+ + L K
Sbjct: 276 ASKCQTSKCQGF-VKGANGSCTTCGKPAMIPFEQSTQSTSKLLDTLENLHK--------S 326
Query: 235 NHQEVVSTYKMIEKLQK---KLYHPFSVNLMQTREKLIKILMELEDWKEAL 282
+ V Y+ ++ LQ+ ++ +V ++Q E+L E+++ E +
Sbjct: 327 QQLDTVQEYRHLQNLQEEYGRILADCNVAILQLDEQLAYCASEMKNVPEDM 377
>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
Length = 327
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNA-------VLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +R + VP+G+E+ +SY +
Sbjct: 148 GIYPKASFFNHDCLPNACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYAD 207
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y FTC C RC
Sbjct: 208 RQKRLLEDYGFTCYCDRC 225
>gi|224029645|gb|ACN33898.1| unknown [Zea mays]
Length = 369
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 51/111 (45%), Gaps = 7/111 (6%)
Query: 80 ICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG 139
+C+ E + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 122 MCSVEQVRVAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVG 181
Query: 140 STMT--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
RQK+L+E Y F+C C C +L D ++ + C C G
Sbjct: 182 EMHISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCLG 227
>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
Length = 786
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 7/118 (5%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
AH+ N E +G +Y S+ NHSC PN F R +R V G+++ +SY
Sbjct: 461 GAHSTSNVEQVRVGKAIYKAGSLFNHSCQPNVHAYFLSRALYLRTTNVVAAGSQLELSYG 520
Query: 136 ETAG--STMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
G R LK +Y F C C C ++ + D +L + C + CSG +L
Sbjct: 521 PQVGLWDCKDRLNFLKNEYAFHCLCTGCSEVNRSD-----LVLNAFHCVNPNCSGAVL 573
>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
Length = 338
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G++P +S NH C PN++ + L AVV V+ V +G E+ +SY+E + RQ+ L
Sbjct: 119 GIFPKVSRFNHGCRPNSMRSYHPVLDQAVVHVVKDVSEGEEITVSYVEPGLAFYLRQEQL 178
Query: 149 KEQYLFTCTCPRCI 162
KE++ F C C C+
Sbjct: 179 KEKFGFICGCNLCL 192
>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 466
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 79/170 (46%), Gaps = 28/170 (16%)
Query: 28 EALVAHMSDID--EKQLLLYAQIANLVNLILQWPE------ISINEIAENFSKLACNAHT 79
E L+A + + E+ L + + + L W E +S + + ++ NAH
Sbjct: 288 ELLIASGTSLSAVERHTLAFNEHQAVPAPALLWAEALVQVLLSFDTLRTLVGQMVLNAHA 347
Query: 80 ICNSELRPLGT-------------GLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHV 124
I + L P+G GLY +++ NHSC+PN AV +G +V + + +
Sbjct: 348 INDYVLLPVGAPSPSAFDWVLKGAGLYSLLACFNHSCVPNVAVSNVDGTHEIVLKTTRPI 407
Query: 125 PKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPRCIK-LGQFDDI 170
G + I+YI + M+ RQ+ L+ Y FTC CPRCIK G DI
Sbjct: 408 QAGEPLTITYIPLSAGAMSRAERQRQLR-NYFFTCHCPRCIKEAGVAGDI 456
>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 514
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/303 (20%), Positives = 122/303 (40%), Gaps = 35/303 (11%)
Query: 61 ISINEIAENFSKLACNAHTICNSELRPL----GTGLYPVISIINHSCLPN---AVLVFEG 113
+ ++ +E ++AC I E+R LYP ++++NHSC+ N + +
Sbjct: 183 LKLDRFSEECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDY 242
Query: 114 RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQES 173
R+ + R +P E+ SY + T+ R++ L E F C CPRC D + +
Sbjct: 243 RIRL-RTTLKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRC-----SDPTELA 296
Query: 174 AILEGYRCK--DDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALT 231
+ +C D+G L + + C C + + ++K+ + A++
Sbjct: 297 THMSSLKCNKCDNGIVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAIS 356
Query: 232 SCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDW----------KEA 281
+ + + + + + + + HP L R L ++ ++++ +
Sbjct: 357 G-ADGADAIHERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHK 415
Query: 282 LAYCQLTIPVYQRVYPQFHPLLGLQYYTCGKLEWFLGD--------TENAIKS-MTEAVE 332
+ C+L + + V P + + G+ Y FL E +KS MTEA
Sbjct: 416 VDMCRLLLQILDVVEPGYSRIRGMTLYELHAPLLFLAKGQWNADVIDEAGLKSKMTEAAN 475
Query: 333 ILR 335
IL+
Sbjct: 476 ILK 478
>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 558
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 38/76 (50%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
LG P++ NHSC PN ++F ++RA + K E+ + Y++ RQ
Sbjct: 221 LGIAFDPILCSANHSCEPNTAVIFNQPQLILRAQTKIKKDEEIFMKYVDITNPFSVRQAE 280
Query: 148 LKEQYLFTCTCPRCIK 163
L++ Y F C C +C K
Sbjct: 281 LRQSYFFGCRCTKCRK 296
>gi|440800514|gb|ELR21550.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 333
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
G GL+P+ +++NHSC PN + F R AVV A+ + +G E+ SYIE RQ
Sbjct: 254 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEGEELTHSYIENDRPLAERQAD 313
Query: 148 LKEQYLFTCTCPRCIK 163
L E Y F C C RC+K
Sbjct: 314 LLE-YNFVCQCVRCLK 328
>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
Length = 969
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 754 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 812
Query: 149 KEQYLFTCTCPRC 161
E + F C C RC
Sbjct: 813 AESWGFCCGCSRC 825
>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
Length = 383
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 7/88 (7%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG---STM 142
G L+ + S NHSC+PNA + +L++V AV+ + +G E+ ISY++ S
Sbjct: 284 GVALFTLQSACNHSCVPNAEPTYLHNNNKLSLV-AVRDIQEGEEICISYLDECNLHRSRH 342
Query: 143 TRQKALKEQYLFTCTCPRCIKLGQFDDI 170
+R+K L E YLF C CP+C + D+
Sbjct: 343 SRRKELMENYLFACNCPKCEEQADEPDV 370
>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
Length = 662
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
GL+ + +INHSC+PNA + G + VRA + + G EVL+SY+E A R+K +
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 504
Query: 149 KEQYLFTCTCPRC 161
+QY F C+C RC
Sbjct: 505 -QQYNFECSCERC 516
>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 438
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEG---RLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
G G+Y V S+ NHSC PN ++ VV A++ G E+ ISYI+ + RQ
Sbjct: 350 GQGVYAVGSLFNHSCEPNLQVLNSATGDETLVVEALRDCEPGEELYISYIDESLPYAIRQ 409
Query: 146 KALKEQYLFTCTCPRCIK 163
+ L+E YLF C C +C++
Sbjct: 410 QQLREHYLFECRCSKCVR 427
>gi|169606119|ref|XP_001796480.1| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
gi|160706916|gb|EAT87158.2| hypothetical protein SNOG_06094 [Phaeosphaeria nodorum SN15]
Length = 323
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 21/155 (13%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRL--AVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
GL+P I+ INHSC PNA + L +V A + + KG E+ +SYI + RQK L
Sbjct: 117 GLFPKIARINHSCRPNASYYWSQTLNKRIVYATRRIAKGDEIFVSYIPLLLTQEQRQKHL 176
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVR 208
++Y F CTC C QE A +E DD R + K F + L
Sbjct: 177 -DRYGFKCTCEAC--------AQEHAAME---ASDD------RRTTISKAFVDFEPQLTL 218
Query: 209 SKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTY 243
+ + KK + ++ +L L H+E ++ Y
Sbjct: 219 TSPKTKKATQQARKNARASLKLAEL-VHEEGLADY 252
>gi|325188967|emb|CCA23495.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 497
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GT L+ +I +NHSCLPN ++V+ A V A+Q + E+ I YI+T S RQ
Sbjct: 415 GTALFSLICTMNHSCLPNCIVVYGHNGEAHVHAIQAIQPKDELCIEYIDTDRSYDERQYE 474
Query: 148 LKEQYLFTCTCPRCIK 163
L+E Y F C C +C++
Sbjct: 475 LRE-YHFRCQCMKCMR 489
>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
Length = 427
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GT L V+S+INH C PNA + FEG VR+++ + G E+ +SY + + RQ+
Sbjct: 128 FGTSLDLVVSMINHDCSPNAHVFFEGSQVRVRSLKAIAAGGEITVSYCDPRLDVLLRQEI 187
Query: 148 LKEQYLFTCTC 158
L++ F C C
Sbjct: 188 LRQTQFFHCEC 198
>gi|328855353|gb|EGG04480.1| hypothetical protein MELLADRAFT_108480 [Melampsora larici-populina
98AG31]
Length = 398
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 9/108 (8%)
Query: 58 WPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGR--- 114
W S++ I+ L NA +I +++L ++P +S INH C + V+ R
Sbjct: 171 WQNQSLSNISNYIRILQTNAISIGSNQL-----AIFPSLSRINHGCAGASNSVYNWREKE 225
Query: 115 -LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ VV A + + G E+LISY ++ S RQ LK Y F CTC C
Sbjct: 226 GVEVVHATKLIEVGEEILISYWDSKRSRSDRQDYLKSNYGFQCTCQTC 273
>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
Length = 530
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 81/178 (45%), Gaps = 25/178 (14%)
Query: 49 ANLVNLIL------QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHS 102
ANL+ I WPE+ I IA + L N + R LYP ++I+H
Sbjct: 202 ANLITFIKTVLGMKDWPEMDILRIA---AILDTNTFEVRQPRERRKIRALYPGAAMISHD 258
Query: 103 CLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
C+PN F+ + +V A + + KG + ISY + ST+ R+ L++ F C+C RC
Sbjct: 259 CVPNMRHRFDDDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARC 318
Query: 162 I---KLGQFDDIQESAILEGYRCKDDGCSGFLLRDS---DDKGFTCQQCGLVRSKEEI 213
+LG F Q +CK +G ++ + + + CQ C RS +E+
Sbjct: 319 QDPEELGSFAGAQTC-----LKCK----AGKIISLNPLLNSAPWKCQLCNFKRSAKEV 367
>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
Length = 545
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G GL+ + S INHSC+PNA + G +V A + + G E+ +Y + S + ++K +
Sbjct: 330 GVGLWTLASFINHSCIPNARRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 388
Query: 149 KEQYLFTCTCPRC 161
E + F C C RC
Sbjct: 389 AESWGFCCGCSRC 401
>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 393
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 96 ISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFT 155
IS NHSC PN V V E + + ++ + KG E+ ISYI+ S R LKE Y F
Sbjct: 273 ISYFNHSCFPNCVRVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332
Query: 156 CTCPRCI--KLGQFDDIQESAILEGYRCKDDG--CSGFL 190
C C RC L ++ +E Y CK+ C+G L
Sbjct: 333 CKCKRCTLPPLSNL-SLEIEKTIENYTCKNQSIKCTGIL 370
>gi|426226456|ref|XP_004007359.1| PREDICTED: SET and MYND domain-containing protein 5, partial [Ovis
aries]
Length = 391
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 5/79 (6%)
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L V A++ + G + ISY++ S +
Sbjct: 278 GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPGERLGISYLDCCQRERSRHS 337
Query: 144 RQKALKEQYLFTCTCPRCI 162
R K L+E YLF C+CP+C+
Sbjct: 338 RHKILRENYLFVCSCPKCL 356
>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
Length = 664
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKAL 148
GL+ + +INHSC+PNA + G + VRA + + G EVL+SY+E A R+K +
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDVVFVRAAKDLKAGDEVLLSYLEPPLAPYAGYREKMI 506
Query: 149 KEQYLFTCTCPRC 161
+QY F C+C RC
Sbjct: 507 -QQYNFECSCERC 518
>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 341
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 91 GLYPVISIINHSCLPNAVL------VFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMT 143
G+YP S NH CLPNA +G ++ R + VP+G EV +SY + +
Sbjct: 219 GIYPKTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSS 278
Query: 144 RQKALKEQYLFTCTCPRC 161
RQK L E Y F C C RC
Sbjct: 279 RQKRLLEDYGFKCDCDRC 296
>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 10/183 (5%)
Query: 57 QWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLA 116
+W E +EI + N H + +E P +Y + S++ HSC PN F R
Sbjct: 200 KWDE---DEILRVVGIIQVNGHEVPMTE--PSSVAIYNMASMLEHSCRPNLAKSFTNRGE 254
Query: 117 VVR-AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAI 175
VV A + +G + I Y + +T R + L++ +F C C RC +++ +
Sbjct: 255 VVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCSDRTEYETYFSAVR 314
Query: 176 LEGYRCKDDGCSGFLLRDSDDKGFTCQQCGLVRSKEEIKKIASEVNILSKKTLALTSCGN 235
G++ KD C G+LL + + QC +R +E+ A+ IL + + L +
Sbjct: 315 CSGFQ-KDSKCKGYLLPVDNAQWTGEWQC--LRCSKEVSG-ATVCQILERARMDLEAMEK 370
Query: 236 HQE 238
H+E
Sbjct: 371 HRE 373
>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
Length = 257
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
R TGL+ + INHSC+ NA + G L V+RA+ + KG E+ SY E+A RQ
Sbjct: 104 RNASTGLWIWAAYINHSCIANAKKEYVGDLMVLRALGAIKKGEEIFHSYDESA-DYEARQ 162
Query: 146 KALKEQYLFTCTCPRCIKLGQFDD 169
KAL + F C C C DD
Sbjct: 163 KALLTTWGFECGCALCAAERTDDD 186
>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 551
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 61 ISINEIAENFS---KLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAV 117
I++ + E+F KL NA + G L +++INHSC+PNA++ F GR A
Sbjct: 171 ITLKGLEESFQVLCKLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNAMVQFGGRTAT 230
Query: 118 VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+R+ + G E+ ISY + RQ L + Y F C+C +C
Sbjct: 231 LRSATFINPGDEIEISYADHTQPKGKRQGEL-DLYHFECSCYKC 273
>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 494
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 9/142 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLYP+ ++ NH+C PN F E V A + G E+ +SY T R++ L
Sbjct: 214 GLYPLGALQNHNCAPNTRHHFDENYRLYVSATVPISAGEELTMSYTSLFWDTTLRRQFLS 273
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL-RDSDDKG--FTCQQCGL 206
F+CTC RC D + ++L C D CSG LL RD G + C CG
Sbjct: 274 ITKHFSCTCKRCS-----DPTESGSMLCALLCAFDNCSGNLLPRDPLRIGTPWVCDMCGK 328
Query: 207 VRSKEEIKKIASEVNILSKKTL 228
+ +I I S + ++++ L
Sbjct: 329 SINYRQICSIRSGLAAITEEAL 350
>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 9/91 (9%)
Query: 85 LRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYI---ETA 138
L G GLY S NHSC PNA + F RL+++ A++ + G E+ ISY+ +
Sbjct: 280 LNSEGVGLYARQSCANHSCDPNAEISFVHNNYRLSLI-ALKDIQPGEEICISYLGDCDNE 338
Query: 139 GSTMTRQKALKEQYLFTCTCPRCIKLGQFDD 169
S +R+K L E YLF C C +C L + DD
Sbjct: 339 RSRHSRRKTLMENYLFACECSKC--LAEIDD 367
>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
Length = 514
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 11/173 (6%)
Query: 48 IANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNA 107
I + + + + +I + L N H + ++ P ++ S +SCLPN
Sbjct: 189 IGQFIPKFFKTQKFTEEQIMKAVGALQINGHEVPTTD--PSHVAVFYTASFTENSCLPNL 246
Query: 108 VLVF-EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQ 166
F + ++ A + + K A + I Y + T RQ+ L + LF C C RC+ + +
Sbjct: 247 AKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTE 306
Query: 167 FDDIQESAILEGYRCKDDGCSGFLL---RDSDDKGFTCQQCGLVRSKEEIKKI 216
D SAI +C+D C G +L D + + C++C K +++I
Sbjct: 307 L-DTNYSAI----KCEDRQCGGLMLPTKADDWNGSWRCRECHKQVQKHYVERI 354
>gi|159471331|ref|XP_001693810.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283313|gb|EDP09064.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 484
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 49/103 (47%), Gaps = 10/103 (9%)
Query: 63 INEIAENFSKLACNAHTICNSELRPLG-TGLYPVISIINHSCLPNAVLVFEGRLAVVRAV 121
+ AE+ + AC P G GL+P +++NHSC PN V G VVRAV
Sbjct: 185 FGDDAEDLAACACRGE-------EPRGHIGLWPEFALLNHSCAPNTVNYVVGGSMVVRAV 237
Query: 122 QHVPKGAEVLISYIETAG--STMTRQKALKEQYLFTCTCPRCI 162
+ +G EV I Y+ R L + Y F C+CPRC+
Sbjct: 238 APISQGEEVTICYLGRPQLLPFNKRIAILADDYGFECSCPRCV 280
>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
Length = 589
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 24/121 (19%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLG-------TGLYPVISIINHSCLPNAVLVFE 112
E+S EI +L N HT+ + PLG G++P S++NHSC PNA F
Sbjct: 299 EVSAEEIQTVHRRLRRNFHTV---QPEPLGRLLPPTVVGVFPAASLVNHSCEPNA--CFH 353
Query: 113 GRLA---------VVRAVQHVPKGAEVLISYIETAGSTMT---RQKALKEQYLFTCTCPR 160
R A +R V G EV +SY+ T R++ L+ + F+C CPR
Sbjct: 354 SRRAGPEGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPR 413
Query: 161 C 161
C
Sbjct: 414 C 414
>gi|345570043|gb|EGX52868.1| hypothetical protein AOL_s00007g204 [Arthrobotrys oligospora ATCC
24927]
Length = 772
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%)
Query: 90 TGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
+GL+ + + NHSC+ NA F G + + RAV +PK EVLISY + RQ L+
Sbjct: 519 SGLWILPAYTNHSCISNAHRTFLGDMMIFRAVVDMPKDTEVLISYTDHNLPYEERQGILQ 578
Query: 150 EQYLFTCTCPRCI 162
+ FTC C C+
Sbjct: 579 TSWKFTCQCKLCV 591
>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
Length = 378
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 20/132 (15%)
Query: 73 LACNAHTIC-----NSELR-----PLGTGLYPVISIINHSCLPNAVLVFE------GRLA 116
L CN+H I NS + +G G+Y S+ NHSC PN V E G +
Sbjct: 226 LECNSHDISFPFRTNSSITTFNYYSIGIGIYFNSSLFNHSCQPNICKVIESCKNNFGCHS 285
Query: 117 VVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAIL 176
+V A++ + + E+ +YI+ + R + L+ Y FTC CP C + L
Sbjct: 286 MV-AIRDIKENEEISFNYIQLNQNKQERIEKLQSSYYFTCNCPSCCTN---NPNPHKEFL 341
Query: 177 EGYRCKDDGCSG 188
Y+CK CSG
Sbjct: 342 NKYKCKKLNCSG 353
>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
Length = 486
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 14/158 (8%)
Query: 87 PLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQ 145
P +Y S++ HSC PN F R +V A + +G + I Y + +T R
Sbjct: 189 PSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRL 248
Query: 146 KALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLR-DSDD---KGFTC 201
+ L++ +F C C RC +++ + G++ KD C G++L DSD+ ++C
Sbjct: 249 EHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQ-KDSKCKGYILPVDSDNWTTGDWSC 307
Query: 202 QQC-GLVRSKEEIKKIASEVNILSKKTLALTSCGNHQE 238
Q+C G+V A+ IL + + L + H+E
Sbjct: 308 QKCRGVVTG-------AAVGQILERARMDLEAMQKHRE 338
>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
Length = 388
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK- 146
G G +P S NHSC PN +GR RA + + KG E+ I+Y+ ++R K
Sbjct: 309 FGYGCWPAASYFNHSCHPNVEKQRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKR 368
Query: 147 --ALKEQYLFTCTCPRC 161
LK+ + F C+C RC
Sbjct: 369 MLRLKKTWGFDCSCERC 385
>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
Length = 404
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 64/179 (35%), Gaps = 57/179 (31%)
Query: 91 GLYPVISIINHSCLPNAV-LVFEGRLA------VVRAVQHVPKGAEVLISYIETAGSTMT 143
+YP S+ NH CLPNA + R VVRA+ +P+G EV ISY
Sbjct: 214 AVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYAD 273
Query: 144 RQKALKEQYLFTCTCPRCIKLGQF------------------------------------ 167
RQ+ L E Y F C C RC Q+
Sbjct: 274 RQRRLLEDYGFRCECDRCQVESQWKDDDDDNNGGDGDDTMEEEGEDDAGDRGDDGMEEEE 333
Query: 168 --------DDIQESAILEGYRCKDDGCSGFL--LRDSDDKG----FTCQQCGLVRSKEE 212
DD + Y C ++GC G L L S + F C CG +R +E+
Sbjct: 334 GDGATDGNDDFPHAYFFVRYLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEED 392
>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
Length = 325
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)
Query: 92 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 25 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 84
Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
K+ FTC C RC+ +LG F ++ EG++ D ++C C
Sbjct: 85 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQ-------PTEPDTSWSCAGC 137
Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 138 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 190
Query: 261 LMQTREKLIKILMELEDWK 279
R+ LI++ ++ ++
Sbjct: 191 ATGLRQLLIEMYGRVQGYE 209
>gi|51859341|gb|AAH81479.1| SMYD family member 5 [Danio rerio]
Length = 300
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGR--LAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+GL+ + S NHSC+PNA F L + A+ + G E+ ISY++ S +
Sbjct: 188 GSGLFLLQSSCNHSCVPNAEASFPDNNFLLHLTALGDIGPGEEICISYLDCCQRDRSRHS 247
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDD 169
R K L+E YLF C+C +C L Q DD
Sbjct: 248 RHKILRENYLFICSCQKC--LSQMDD 271
>gi|351737851|gb|AEQ60886.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
Length = 263
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+Y++ +T TR+ L QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203
>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
Length = 657
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
R TGL+ + INHSC+ NA + G L V+RA++ + +G E+ SY E+A TRQ
Sbjct: 503 RNASTGLWVWAAYINHSCVANAKKEYVGDLMVLRALRPIKEGEEIFHSYDESA-DYETRQ 561
Query: 146 KALKEQYLFTCTCPRC 161
+AL + F C+C C
Sbjct: 562 RALMTTWGFECSCALC 577
>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
Length = 532
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 101/228 (44%), Gaps = 28/228 (12%)
Query: 16 IPSTTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLIL---QWPEISINEIAENF 70
+ ++ + ++ + AL H+ + L +L A + + IL W E+ I IA
Sbjct: 169 LKASDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDILRIA--- 225
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAE 129
+ L N + R L+P ++I+H C PN F+ + ++ A + + KG
Sbjct: 226 AILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEI 285
Query: 130 VLISYIETAGSTMTRQKALKEQYLFTCTCPRC---IKLGQFDDIQESAILEGYRCKDDGC 186
+ ISY + ST+ R+ L++ F C C RC +LG F Q +CK
Sbjct: 286 LTISYTQPLRSTIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTCV-----KCK---- 336
Query: 187 SGFLLRDS---DDKGFTCQQCGLVRSKEEI----KKIASEVNILSKKT 227
+G ++ + + + CQ C L RS +E+ K+ E+ L K T
Sbjct: 337 AGKIISVNPLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTT 384
>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 450
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 60/107 (56%), Gaps = 10/107 (9%)
Query: 62 SINEIAENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGR 114
+I ++ + FS+ + N + +S L + G++P+ S + NHSCLPNA + R
Sbjct: 167 TIPDVNDLFSRFSNNNFAV-HSHLTTVAHGVFPLASRLFNHSCLPNAAARYILSEDAAPR 225
Query: 115 LAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ +V A+Q + G E+ + Y++ A +RQ+ + Y FTCTCP C
Sbjct: 226 MEIV-ALQDIGAGEEICVPYLDPA-LLQSRQQGFQLTYGFTCTCPSC 270
>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
Length = 584
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 40/234 (17%)
Query: 75 CNAHTICNSELRPLGTGLYP--VISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLI 132
CNA +C++ P+G L+ V S NHSCLPN + G + RA++ G E+ I
Sbjct: 327 CNALAVCDASGLPVGQALHAASVTSYFNHSCLPNCA-IEAGAIVTTRAIR---PGEELTI 382
Query: 133 SYI-ETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQ-ESAILEGYRCKDDGCSGFL 190
SY+ + R++ L E+Y F C RC++ G D ESA+ R
Sbjct: 383 SYLPQLYWPAWLRREELAERYFFDC---RCVRCGDGDRSPFESALSATLRLGGS------ 433
Query: 191 LRDSDDKGFTCQQ--CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEK 248
R+ + + + Q CG VR+K+ +V++ + L ++++
Sbjct: 434 -REKEREYISSVQILCGRVRAKD-----VGDVDVGDRDAL--------------LHLLQE 473
Query: 249 LQKKLYHPFSVNLMQTREKLIKILMELEDWKEALAYCQLTIPVYQRVYPQFHPL 302
++ L+ PF + L + L D + L C + +++ + P HP+
Sbjct: 474 CRQHLF-PFHYLCHELHNTLSFVYAVLGDTRACLCGCLRELVMWEAIVPGAHPV 526
>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
Length = 501
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 85/195 (43%), Gaps = 15/195 (7%)
Query: 92 LYPVISIINHSCLPNAVLVFE----GRL----AVVRAVQHVPKGAEVLISYIETAGSTMT 143
L+P ++I++H+C+ N + G L VRA +PKG E+ SY + T+
Sbjct: 198 LFPKLAILSHNCVSNIHHAVDCNGDGELKDCVVTVRAAVDIPKGGELFSSYTFSLWPTLV 257
Query: 144 RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQ 203
R++ L+E F CTCPRC + D S L+ RC D+G D + C
Sbjct: 258 RREFLRESKFFDCTCPRCSSPTELDTHMSS--LKCQRC-DNGVIISTNTADDACEWKCTH 314
Query: 204 CGLVRSKEEIKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQ 263
C + ++K+ + + + + G Q + + K + L HP + +
Sbjct: 315 CDYKTNGRAVRKVFAAIQ---NEIDQVEYIGGPQGIQQRETIFRKYRSVL-HPNNSYMTT 370
Query: 264 TREKLIKILMELEDW 278
R LI++ + E +
Sbjct: 371 LRSALIQLYGKTEGY 385
>gi|348668850|gb|EGZ08673.1| hypothetical protein PHYSODRAFT_352669 [Phytophthora sojae]
Length = 1354
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 70/140 (50%), Gaps = 20/140 (14%)
Query: 38 DEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNSELRPL--------- 88
DE Q ++ + V +L W E + ++ + +L N + +SE PL
Sbjct: 1167 DEDQTIIAS-----VRRVLGWEEERL-QVLQTQERLERNDGEL-DSEFDPLMPLGYPSIE 1219
Query: 89 GTGLYPVISIINHSCLPNAVLVF--EGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQK 146
GT L+P+I +NHSC PN +++ G VV A++ + KG E+ I YI+ R+
Sbjct: 1220 GTALFPIICTMNHSCDPNCTVLYTKNGDGHVV-AIRDIHKGEELCICYIDVDMDVQMREA 1278
Query: 147 ALKEQYLFTCTCPRCIKLGQ 166
L+E Y F C C RC++ Q
Sbjct: 1279 NLRE-YKFKCFCSRCVQERQ 1297
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 89 GTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKA 147
GT L+ I +NHSC PN VL + A V AVQ + +G E+ ISYI+ R +
Sbjct: 390 GTALFSGICTMNHSCDPNCTVLYTKDGAAHVFAVQDIQEGEELCISYIDVDQEVEERAEC 449
Query: 148 LK 149
L+
Sbjct: 450 LR 451
>gi|311978082|ref|YP_003987202.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|81999695|sp|Q5UNT8.1|YL678_MIMIV RecName: Full=Putative SET domain-containing protein L678
gi|55417289|gb|AAV50939.1| unknown [Acanthamoeba polyphaga mimivirus]
gi|308204535|gb|ADO18336.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|339061617|gb|AEJ34921.1| hypothetical protein MIMI_L678 [Acanthamoeba polyphaga mimivirus]
Length = 255
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 118 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 171
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+Y++ +T TR+ L QY F C C RCI
Sbjct: 172 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 203
>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
Length = 423
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 78/167 (46%), Gaps = 8/167 (4%)
Query: 18 STTTDNYSLVEALVAHMSDIDEKQL--LLYAQIANLVNLILQWPEISINEIAENFSKLAC 75
S + D Y+ ++ L +H+++ + L ++ + + + +L + S I E + L
Sbjct: 141 SHSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGLQQYSAETILEVCAMLDT 200
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISY 134
N I E R LYP+ ++++H C PN F+ L +V A + +GA + SY
Sbjct: 201 NCFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDALRMVLVATVDIERGATISTSY 260
Query: 135 IETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
+ T+ R+ ALK+ F C C RC D + L G+RC
Sbjct: 261 TQPLLGTLHRRLALKQSKHFDCQCERCT-----DPTELGTNLSGFRC 302
>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
Length = 507
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 85/199 (42%), Gaps = 25/199 (12%)
Query: 92 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTSRSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
K+ FTC C RC+ +LG F ++ EG++ D + C C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWNCANC 319
Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 320 GSDTSNADALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372
Query: 261 LMQTREKLIKILMELEDWK 279
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 76 NAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI 135
NA +++L +G L P +++ NHSC+PNA++ F GR A++RA + + E+ ISY
Sbjct: 163 NAFHRYDTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEISYT 222
Query: 136 ETAGSTMTRQKALKEQYLFTCTCPRCIK 163
+ R+ AL Y F C C RC K
Sbjct: 223 DYTFPLSKRKLALAP-YFFDCMCLRCEK 249
>gi|453080542|gb|EMF08593.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 350
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVVR--AVQHVPKGAEVLISYIETAGSTMTRQKAL 148
++ I+ INHSCLPNA + L +R +++ + G E+ +SYI G + TR+K L
Sbjct: 122 AIFLTIARINHSCLPNAQHSWNAPLQRMRIHSIKTLQPGEEITLSYILPCGPSSTRKKLL 181
Query: 149 KEQYLFTCTCPRC 161
KE + F C CP C
Sbjct: 182 KENFCFNCLCPLC 194
>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
Length = 504
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 33/234 (14%)
Query: 65 EIAENFSK---------LACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVF---E 112
++AE FS+ L NA S+ PL LYP I+ H+C+PN E
Sbjct: 169 KLAERFSEELIMQVVGVLEVNAFEARTSQGYPLRC-LYPYTGILAHNCVPNTARTIHPSE 227
Query: 113 GRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQE 172
G +RA+ + G ++ SY T T RQ L+E FTC+C RC+ +
Sbjct: 228 GYKIRLRAMVALEAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFS 287
Query: 173 SAILEGYRCKDDGCSGFLLRDSD---DKGFTCQQCGLVRSKEEIKKIA----SEVNILSK 225
S ++ +C + G+LL + + C+ C S EE+++I +EVN +
Sbjct: 288 S--MKCGQCTE----GWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQA 341
Query: 226 KTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVNLMQTREKLIKILMELEDWK 279
+ + ++ Y K L HP R+ LI++ ++ ++
Sbjct: 342 LDMGPKRLEESERLLRKY-------KSLLHPSHYIATSLRQLLIEMYGRVQGYE 388
>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
Length = 507
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 25/199 (12%)
Query: 92 LYPVISIINHSCLPN---AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
L+P I+ H+C+PN ++ EG +RA+ + +G + SY T T RQK L
Sbjct: 207 LFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHL 266
Query: 149 KEQYLFTCTCPRCI---KLG-QFDDIQESAILEGYRCKDDGCSGFLLRDSDDKGFTCQQC 204
K+ FTC C RC+ +LG F ++ EG++ D ++C C
Sbjct: 267 KQGKFFTCQCERCLDPTELGTHFSSLKCGQCAEGFQVPRQP-------TEPDTSWSCASC 319
Query: 205 GLVRSKEE----IKKIASEVNILSKKTLALTSCGNHQEVVSTYKMIEKLQKKLYHPFSVN 260
G S + ++ + SEVN + +A + ++ Y K L HP
Sbjct: 320 GSETSNVDALAMLQSLQSEVNAVQALPMAAKRLEEIERLLRKY-------KSLLHPLHFI 372
Query: 261 LMQTREKLIKILMELEDWK 279
R+ LI++ ++ ++
Sbjct: 373 ATGLRQLLIEMYGRVQGYE 391
>gi|321478979|gb|EFX89935.1| SET and MYND domain-containing 5 protein [Daphnia pulex]
Length = 387
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 89 GTGLYPVISIINHSCLPNAV--LVFEGRLAVVRAVQHVPKGAEVLISYIETAG---STMT 143
G+ L+ + S NHSC PN + F + A + + +G E+ ISY++ S +
Sbjct: 290 GSALFAIQSACNHSCEPNCISTFPFSNHTVALVASKDLEEGEEIFISYLDECAQSRSRHS 349
Query: 144 RQKALKEQYLFTCTCPRC 161
R+K LKE YLF C C RC
Sbjct: 350 RRKILKENYLFHCNCSRC 367
>gi|398257480|gb|EJN41088.1| hypothetical protein lvs_L585 [Acanthamoeba polyphaga
lentillevirus]
Length = 237
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 71 SKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEV 130
+K CNA N G+ + + NHSC+PN + V + V+++ G E+
Sbjct: 100 AKYMCNAFEFNN------GSAILINGAKFNHSCVPNVIFVSDENYMYFYTVRNIKTGEEL 153
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCI 162
+Y++ +T TR+ L QY F C C RCI
Sbjct: 154 TDNYVDIMSNTKTRKNRLFNQYGFDCQCERCI 185
>gi|300176601|emb|CBK24266.2| unnamed protein product [Blastocystis hominis]
Length = 167
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 15/134 (11%)
Query: 14 NVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQ---WPEISI---NEIA 67
+++ T +Y+ + L ++ I L Y QIA L++ I+Q W S+ + I
Sbjct: 28 SMVDGVVTGDYTDIIDLCDNIDIIQPSMLSSYEQIATLLHSIVQSEPWYSDSLCPLSTIT 87
Query: 68 ENFSKLACNAHTICNSELR-PLG--------TGLYPVISIINHSCLPNAVLVFEGRLAVV 118
KL N + +L PLG LYP++S+ NH C PNA +VF+G A +
Sbjct: 88 SCIGKLYSNRFAVTTIDLSAPLGRSFTQETAIALYPLLSLANHRCTPNATVVFDGLKATL 147
Query: 119 RAVQHVPKGAEVLI 132
RA+Q + KG E+ +
Sbjct: 148 RALQPIHKGEEITV 161
>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 420
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 49/82 (59%), Gaps = 4/82 (4%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVF---EGRLAVVRAVQHVPKGAEVLISYIETAG 139
S+ G GLY ++ +NHSC PN VL + +L+++ A++ + G E+ ISYI+ +
Sbjct: 337 SKFSVEGEGLYKYLNTLNHSCDPNCVLACTTDDFKLSLI-ALKDIKAGEELTISYIDNSM 395
Query: 140 STMTRQKALKEQYLFTCTCPRC 161
+ TR K L +QY F C C +C
Sbjct: 396 NKETRLKTLMDQYNFDCKCKKC 417
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 67 AENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPK 126
AE K+ N+ +SEL G L P +++ NHSC+PNA + F GR A++ A +
Sbjct: 168 AEMLCKIQNNSFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRA 227
Query: 127 GAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
G E+ I+Y R++AL Y FTC C RC
Sbjct: 228 GDEIEIAYTFYTDPLPKRREALA-HYKFTCQCLRC 261
>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
B]
Length = 410
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 88 LGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG-STMTRQK 146
LG +YP +S NH C PNA +GR + V +G E+ ISY G + R+K
Sbjct: 321 LGFAVYPRLSFFNHDCAPNARKERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRK 380
Query: 147 ALKEQYLFTCTCPRCI 162
L + + F C C RC+
Sbjct: 381 ELSDGWFFDCACGRCV 396
>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
Length = 510
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 19/121 (15%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE-----------------GRLAV-VRAVQHVPKGAEV 130
G+ +Y +S+ NH C PN + E RL++ + ++ VP+ +E+
Sbjct: 361 GSAVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSEL 420
Query: 131 LISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFL 190
LI+YI R LK ++F C C RC ++ E I CK+ CSG L
Sbjct: 421 LITYIPLNQKGHERYHNLKSNWMFPCDCLRCFHEKTNPELTE-KIFNDSSCKNSKCSGGL 479
Query: 191 L 191
L
Sbjct: 480 L 480
>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 631
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 12/107 (11%)
Query: 73 LACNAHTICNS---------ELRPLGTGLYPVISIINHSCLPNAVL-VFEGRLAVVRAVQ 122
+ CNA+ I + E R +G +Y +S++NHSC PN V + + VVRA++
Sbjct: 411 INCNAYEIVENVHEEVTRVWEPRNIGGAIYTTVSLVNHSCYPNVVRHSYPSGIVVVRALR 470
Query: 123 HVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRCIKLGQF 167
+ KG E+L Y + S M R++ L ++Y F C C C + +F
Sbjct: 471 FIGKGCEILDCYGPQFLSESRMARRELLWKKYRFLCECDACTQDWKF 517
>gi|302832792|ref|XP_002947960.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
nagariensis]
gi|300266762|gb|EFJ50948.1| hypothetical protein VOLCADRAFT_116676 [Volvox carteri f.
nagariensis]
Length = 1170
Score = 59.3 bits (142), Expect = 3e-06, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 40/76 (52%), Gaps = 13/76 (17%)
Query: 86 RPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQ 145
RP TGL+ +++NHSC+PN G +VR + V G EV R+
Sbjct: 703 RPGHTGLWTEHALMNHSCVPNVCTYVLGEAMLVRVSRPVQGGEEV-------------RR 749
Query: 146 KALKEQYLFTCTCPRC 161
KAL +QY FTC CPRC
Sbjct: 750 KALYDQYGFTCCCPRC 765
>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 12/114 (10%)
Query: 60 EISINEIAENFSKLACNAHTI----CNSELRPL-GTGLYPVISIINHSCLPNAV------ 108
E+S+ + ++ ACN + C E R + G ++ S+ NH CLPNA
Sbjct: 165 ELSLELTGKLLARDACNTFGLMAPSCAGEERKVRGYAMFAQASMFNHDCLPNACRFEYVD 224
Query: 109 LVFEGRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
+ +G V VRA+ + +G EV +SY RQ+ L+E+Y F CTC RC
Sbjct: 225 IDGDGNTDVIVRALHDMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARC 278
>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 480
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 56/102 (54%), Gaps = 8/102 (7%)
Query: 67 AENFSKLACNAHTICNSELRPLGTGLYPVIS-IINHSCLPNAVLVF------EGRLAVVR 119
+E +S+ N + + +S L P+ G++PV S + NHSC PNA + G VV
Sbjct: 189 SELYSRFGRN-NFVVHSHLSPIAHGIFPVASRLFNHSCTPNAAAKYIFGRDKPGVRMVVV 247
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A++++ E+ I Y++ A + RQ +L++ Y F CTC C
Sbjct: 248 ALRNISHNEEITIPYLDPALPFLERQLSLEQNYGFRCTCHVC 289
>gi|167525114|ref|XP_001746892.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774672|gb|EDQ88299.1| predicted protein [Monosiga brevicollis MX1]
Length = 500
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
Query: 86 RPLGTGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTR 144
R G +PVI+++NHSC PN AV+ G VR + V GAE+ +SY++ R
Sbjct: 233 RETAWGHFPVIALVNHSCHPNCAVVSRPGGELEVRTLGTVRAGAELFVSYVDLTLPRAER 292
Query: 145 QKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRC 181
Q L FTCTC RC F E+++ +C
Sbjct: 293 QAHLLASKEFTCTCYRCQHPDAFPHEHEASMPSCPQC 329
>gi|358342712|dbj|GAA50167.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
Length = 761
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 87/196 (44%), Gaps = 47/196 (23%)
Query: 5 YLRRKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQ----IANLVNLILQWPE 60
+LRRK N I S ++ LV+H + + + L YA +A L++L L E
Sbjct: 395 WLRRK--NHQQISSQQLT----IDRLVSHFDEFEVEDLFQYALTAWLLAKLISLSLP-DE 447
Query: 61 ISINEIAENF------SKLACNAHTICN---------------------SELRPLGTGLY 93
+I E+ + +L CNAH I + ++R + T L+
Sbjct: 448 GAITELIDGLWCFDTLRRLQCNAHAITDVQETFQLETKEDETHHAVPRLGQVR-VATALF 506
Query: 94 PVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIE---TAGSTMTRQKALKE 150
P +S++NH+C P+ FE + V+R + + EV Y S+ +R+ L++
Sbjct: 507 PCVSMLNHACEPSVSNSFENQFIVLRCAKPIHPSDEVYNCYGPHYLHDTSSSSRRAQLQK 566
Query: 151 QYLFTCTC-----PRC 161
QY F C+C PRC
Sbjct: 567 QYFFNCSCQHCTNPRC 582
>gi|406859672|gb|EKD12735.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 797
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 70/154 (45%), Gaps = 11/154 (7%)
Query: 8 RKLQNDNVIPSTTTDNYSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIA 67
+KLQN+ +T D YS +S +D + ++ I +V S+N +
Sbjct: 472 QKLQNNPSTAATVHDLYSGSYKRATQVS-VDGRPVVDSFLIERIV---------SLNVFS 521
Query: 68 ENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKG 127
+ + R TGL+ + INHSC+ N F G + +VRA + +P
Sbjct: 522 SPLTSREAYGQDRQADDPRFGSTGLWVYAAYINHSCIVNCYRTFIGDMMIVRAAKDMPAD 581
Query: 128 AEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
E+ SY + T+ ++L++ + FTCTCP C
Sbjct: 582 TELEWSYADPINRAKTK-RSLQDNWGFTCTCPVC 614
>gi|146263138|gb|ABQ15327.1| Mcg1p [Magnaporthe grisea]
Length = 456
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 90 TGLYPVISIINHSCLPN-AVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTMTRQKAL 148
G + +S +NH C PN A V + + A + + G E+ ISY+++ S RQ+ L
Sbjct: 258 AGTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERL 317
Query: 149 KEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
K + FTCTCP C G + +S + Y+
Sbjct: 318 KRNWGFTCTCPHCSLSGAMGKLSDSRLYAIYQ 349
>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 510
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 71/156 (45%), Gaps = 27/156 (17%)
Query: 32 AHMSDIDEKQLLL--YAQIANLVNLILQWPE------ISINEIAENFSKLACNAHTICNS 83
A +SD+ L + Q + L W E S + + ++ NAH I +
Sbjct: 340 APLSDVQRHTLTFDEHQQQQTVPAPTLLWAEALVQVIFSTDTLRTLVGQMVLNAHAINDY 399
Query: 84 ELRP-------------LGTGLYPVISIINHSCLPN-AVLVFEGRLAVV-RAVQHVPKGA 128
L P G GLY ++S NHSC+PN AV +G +V + + + G
Sbjct: 400 VLPPGEAPSSGVFDWVLKGAGLYSLLSCFNHSCVPNVAVSAVDGTHEIVLKTTRSIRAGE 459
Query: 129 EVLISYIE-TAGST--MTRQKALKEQYLFTCTCPRC 161
+ I+YI TAG+T RQ+ LK Y FTC CPRC
Sbjct: 460 PLAITYIPLTAGATSRAERQRQLK-NYFFTCHCPRC 494
>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
[Acyrthosiphon pisum]
gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
[Acyrthosiphon pisum]
Length = 645
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 19/125 (15%)
Query: 56 LQWPEISINEIAENFSKLAC----NAHTI-----CNSELR--------PLGTGLYPVISI 98
LQ P + + E+ + C NAH I C ++L+ P+ + L PV+S+
Sbjct: 423 LQIPNHQLGTVGESLVHIICAVSSNAHGITQPSDCKTQLKLSLDNRFVPVASLLMPVLSL 482
Query: 99 INHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISY--IETAGSTMTRQKALKEQYLFTC 156
+NH C PN V V+ A+Q + KG+++ +Y + +R + LK QY F+C
Sbjct: 483 LNHHCDPNVVRHNYNGTIVLTAIQPISKGSQLFDNYGLLYATHPKESRLQILKNQYYFSC 542
Query: 157 TCPRC 161
C C
Sbjct: 543 ECSSC 547
>gi|157115758|ref|XP_001652682.1| hypothetical protein AaeL_AAEL007346 [Aedes aegypti]
gi|108876750|gb|EAT40975.1| AAEL007346-PA [Aedes aegypti]
Length = 354
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 52/135 (38%), Gaps = 33/135 (24%)
Query: 58 WPEISINEIAENFSKLACNAHTICNSELR-----------------------------PL 88
W S I + +L CN H I SELR +
Sbjct: 42 WFLYSSAVILRHIGQLVCNGHAI--SELRVCTASENNCLEADSFNIKAGFLHRCFESTRV 99
Query: 89 GTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAG--STMTRQK 146
TG++P IS+ NHSC PN F V A + V G E+ Y S RQ
Sbjct: 100 FTGIFPQISMFNHSCDPNIRNCFSKSTLTVYATRDVEAGGEIFNCYGPNFKLMSREDRQS 159
Query: 147 ALKEQYLFTCTCPRC 161
ALK+QY F C C RC
Sbjct: 160 ALKQQYCFDCDCIRC 174
>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
206040]
Length = 543
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 1/102 (0%)
Query: 60 EISINEIAENFSKLACNAHTICNSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVR 119
E I + AE K+ N+ +S+L +G L P +++ NHSC+PNA + F GR A++
Sbjct: 166 EEDIRKAAEMLCKIQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLI 225
Query: 120 AVQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRC 161
A + G E+ ++Y R++AL Y F C C RC
Sbjct: 226 AENPIRAGDEMELAYTFYTDPLPKRKEALA-HYKFVCQCLRC 266
>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
Length = 458
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/364 (20%), Positives = 142/364 (39%), Gaps = 65/364 (17%)
Query: 24 YSLVEALVAHMSDIDEKQLLLYAQIANLVNLILQWPEISINEIAENFSKLACNAHTICNS 83
+ L+ + +H D + N+V +I Q + ++E +E+ C T+
Sbjct: 123 WKLISGMESHTEDQTREYRWGMEMGRNVVRVIRQ--TLGMDEFSEHEIMKVCGILTVNAF 180
Query: 84 EL-----------RPLGTGLYPVISIINHSCLPNAVLVFEGRLAV-VRAVQHVPKGAEVL 131
EL P +YP ++NH CL N + + + + VR V KG +
Sbjct: 181 ELFTTGPNKDRDSTPRAWAVYPTTYLMNHDCLANTLTSIDSKNKMHVRTRMPVKKGEALT 240
Query: 132 ISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL 191
Y E T R+ L F C+C RC D + + + +RC + C G ++
Sbjct: 241 AEYSECLWGTEIRRHQLHRYKYFWCSCQRC-----RDPTELGSFISSHRCTN--CGGNVV 293
Query: 192 R----DSDDKGFTCQQCGLVRSKEEIK----KIASEVNILSKKTLALTSCGNHQEVVSTY 243
D + GF C+ C ++ E +K ++ EV L K L + + Y
Sbjct: 294 ATKPLDFNSTGFRCESCRVIIRGEVLKGTNHQVEEEVFELESKEL--------ETPIEEY 345
Query: 244 KMIEKLQKKLYHPFSVNLMQTREKLIKI--------LMELEDWK--EALAYCQLTIPVYQ 293
+ + + + + HP +++ ++ L L EL + + + + C+ I +
Sbjct: 346 EDLVERHESILHPNHYAMLRCKKFLAHAYGRQPGYSLAELSEDRLNKKMEMCRTLIDTLE 405
Query: 294 RVYPQFHPLLGLQYYTCGKLEWFLGDTENAIKSMTEA-VEILRITHGTNSPFMKELILKL 352
V P+F+ DT + + + A VE+ R H +E+++ L
Sbjct: 406 IVSPRFN-----------------DDTAHFLYELHAALVEVARRRHDLGLISNQEMVVAL 448
Query: 353 EEAQ 356
EA+
Sbjct: 449 TEAK 452
>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
Length = 311
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 89 GTGLYPVISIINHSCLPNAVLVFE-GRLAV-VRAVQHVPKGAEVLISYIETAGSTMTRQK 146
GT +Y S+ NHSC PNA + +E G A+ +R ++ V G E I+Y++ + +R+
Sbjct: 232 GTAVYFTASLFNHSCAPNAHVSWENGDAAITIRTLRPVRAGEEFNITYVDANERSASRRA 291
Query: 147 ALKEQYLFTCTCPRC 161
LKE Y F C C RC
Sbjct: 292 RLKEWYGFDCACERC 306
>gi|226508960|ref|NP_001146688.1| uncharacterized protein LOC100280288 [Zea mays]
gi|219888317|gb|ACL54533.1| unknown [Zea mays]
Length = 318
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 83 SELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYIETAGSTM 142
S++R + LY S+ NHSC PN F R V+R + + G+ V +SY G
Sbjct: 171 SQVR-VAQALYVSGSLFNHSCQPNVHAYFLSRAFVLRTTEFIKSGSPVELSYGPQVGEMH 229
Query: 143 T--RQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSG 188
RQK+L+E Y F+C C C +L D ++ + C C G
Sbjct: 230 ISERQKSLQENYYFSCQCSSCSELNLSD-----LVMNSFCCPQSNCHG 272
>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 478
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 61/139 (43%), Gaps = 9/139 (6%)
Query: 91 GLYPVISIINHSCLPNAVLVFEGRLAVV-RAVQHVPKGAEVLISYIETAGSTMTRQKALK 149
GLYPV + +NH C+PN + F +L ++ +A + KG E+ SY + T RQ L
Sbjct: 216 GLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQEITTSYTYSIWPTSLRQNHLL 275
Query: 150 EQYLFTCTCPRCIKLGQFDDIQESAILEGYRCKDDGCSGFLL--RDSDDKG-FTCQQCGL 206
F CTC RC +F L C C G +L D K + C+ C
Sbjct: 276 TSKQFICTCSRCCDTEEF-----GTKLAALNCIVKNCDGRILPINPLDKKSIWQCKMCTK 330
Query: 207 VRSKEEIKKIASEVNILSK 225
+ S E+ +V L K
Sbjct: 331 LVSSTEMICFYKDVETLMK 349
>gi|310795375|gb|EFQ30836.1| hypothetical protein GLRG_05980 [Glomerella graminicola M1.001]
Length = 411
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 82 NSELRPLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRAVQHVPKGAEVLISYI---ETA 138
SEL G G +P S NHSC PN EGR RA + V KG ++ I+Y+ E A
Sbjct: 321 GSEL--FGHGCWPAASYFNHSCGPNVDKRREGRAWEFRASRDVGKGEQLCITYLGGEEKA 378
Query: 139 GSTMTRQKALKEQYLFTCTCPRC 161
TR L+ + F C C RC
Sbjct: 379 MPRETRMSTLRRNWGFECGCKRC 401
>gi|341876422|gb|EGT32357.1| CBN-SET-10 protein [Caenorhabditis brenneri]
Length = 428
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 69/145 (47%), Gaps = 19/145 (13%)
Query: 68 ENFSKLACNAHTICNS-----ELR--PLGTGLYPVISIINHSCLPNAVLVFEGRLAVVRA 120
EN +A +CNS E R P+G+G+Y ++ NHSC + +VF+ ++RA
Sbjct: 145 ENNRMIAMICSVLCNSFGIIDEKRGEPIGSGMYIGLAKHNHSCASTSHVVFDKNQVLLRA 204
Query: 121 VQHVPKGAEVLISYIETAGSTMTRQKALKEQYLFTCTCPRCIKLGQFDDIQESAILEGYR 180
+ ISY+ T RQK+++ + TC C C Q +D+ I R
Sbjct: 205 RKE-EYCRNTTISYVSRMLPTAERQKSIRSVHFITCRCEMC----QNEDLD--LIGLASR 257
Query: 181 CKDDGCSGFLLRDSDDKGFTCQQCG 205
C+ DGC+GF+ S +C CG
Sbjct: 258 CQTDGCNGFVKGSS-----SCGSCG 277
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 65 EIAENFSKLACNAHTICN--------------SELRPLGTGLYPVISIINHSCLPNAVLV 110
E A ++ L CN H I E + T +YP S++NHSC PN +
Sbjct: 187 EDAFSYGILVCNGHAITKINVTADDHENKLLIEEQIRIATAIYPSASMMNHSCDPNIINS 246
Query: 111 FEGRLAVVRAVQHVPKGAEVLISYIET--AGSTMTRQKALKEQYLFTCTCPRC 161
F + +V+A+Q + +G E+L Y RQ++L+ QY FTC C C
Sbjct: 247 FLDQTLIVKAIQDIKEGEEILHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,281,502,673
Number of Sequences: 23463169
Number of extensions: 207196425
Number of successful extensions: 560448
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1239
Number of HSP's successfully gapped in prelim test: 1462
Number of HSP's that attempted gapping in prelim test: 556083
Number of HSP's gapped (non-prelim): 3803
length of query: 368
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 224
effective length of database: 8,980,499,031
effective search space: 2011631782944
effective search space used: 2011631782944
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)