BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046120
(381 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
Length = 391
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 238/382 (62%), Positives = 281/382 (73%), Gaps = 20/382 (5%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS----FCHNCFRKTMHS 63
V V EI+GRGRGLVSTQ L+ GQIVL DSPILLYSALP +C CF KT+ S
Sbjct: 11 VRVEEIQGRGRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCF-KTIQS 69
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
+S CP+CS FCSP C +AA +SSH+P+VCQ+L+RL + + ++RQ+QA+FL
Sbjct: 70 ASVSCPTCSHQRFCSPTCLSAALASSHTPWVCQSLSRLRDCQD-FLQHHSVERQIQAQFL 128
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL----SSLCPPPA----STTIELTAA 174
+AAYNLA V+PS FQILL+ QG D D + L SSLCPPP S ++EL AA
Sbjct: 129 VAAYNLAFVSPSDFQILLSLQGRAEDEDPAIVQSLHSVISSLCPPPPIEGFSFSLELIAA 188
Query: 175 LLAKDRLNAFGLMEPYI--EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
L+AKDR NAFGLMEP E G+RSVRAYGIY KAS FNHDCLPNACRFDYVD
Sbjct: 189 LVAKDRFNAFGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVDTNNSG 248
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-DND 291
NTDI+VRMIHDVPQGREICLSYFPVN +YSTR+KRLL DY F CDCDRCKVEA WS D
Sbjct: 249 NTDIVVRMIHDVPQGREICLSYFPVNSNYSTRRKRLLEDYGFTCDCDRCKVEATWSDDEG 308
Query: 292 NDDENNEEVMDEDQDEQMVASDDDAEVHGD--TNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
+ D+N+ EVM+ED DE M A D E+ D T+FPHAYFF+RYMC+R+NCWGTLAP PP
Sbjct: 309 DGDDNDNEVMEEDVDEPMEAESDGEEIGNDNSTDFPHAYFFLRYMCNRNNCWGTLAPFPP 368
Query: 350 SDATPSTVMECNVCGNLKNDAI 371
SDA PS ++ECN CG++KND +
Sbjct: 369 SDAKPSNLLECNACGDIKNDEV 390
>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
max]
Length = 419
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 240/420 (57%), Positives = 290/420 (69%), Gaps = 42/420 (10%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--------- 51
M+A S + V EI+GRGRG+V++Q LKAGQIVLRDSPILLYSALP + S
Sbjct: 3 MAAPSSLLKVEEIQGRGRGMVASQPLKAGQIVLRDSPILLYSALPLVRQSLSSSSSSAST 62
Query: 52 --FCHNCFR-------KTMHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLL 101
FC +CFR SS+ +CP+C FC+ C + A +SSHS +VCQAL+ L
Sbjct: 63 SCFCDHCFRILSPSLQGDSSSSTVLCPNCRHHCFCNSNCLSNALNSSHSSWVCQALSHL- 121
Query: 102 NLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL---- 157
NS PL+ Q+Q FL+AAYNLA ++PS FQI+L+ QG+ D+ +AA +L
Sbjct: 122 RANSL-LLEQPLEHQVQVNFLVAAYNLANISPSDFQIMLSLQGSPDDSTIAAAQFLHPLI 180
Query: 158 SSLCP-----PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFF 212
SSLC P ++ELT+A+LAKD+LNAFG+M+P+ E D +RSVRAYGIY ASFF
Sbjct: 181 SSLCSLALIGPQNGFSLELTSAILAKDKLNAFGIMQPFSE-HDDQRSVRAYGIYPYASFF 239
Query: 213 NHDCLPNACRFDYVDAAAEN---NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
NHDCLPNACRFDYVDA + NTD I+RMIHDVPQGREICLSYFPVN YS+RQKRL+
Sbjct: 240 NHDCLPNACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEKYSSRQKRLI 299
Query: 270 YDYDFACDCDRCKVEANWSDNDNDDEN---NEEVMDEDQDEQMVASDDDAEVHGDTN-FP 325
DY F C+CDRC VE+NWSDND+ ++N EEVMDEDQ E M ASD D H D N FP
Sbjct: 300 EDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDDHPHEDNNDFP 359
Query: 326 HAYFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDAI----GREDTVGMDD 381
HAYFF++YMCDR NCWGTLAPLPP TP VMECNVCG LK+D +D V M+D
Sbjct: 360 HAYFFLKYMCDRTNCWGTLAPLPPQGDTPCNVMECNVCGKLKSDNTFDIDEGQDEVSMED 419
>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
vinifera]
Length = 405
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 236/397 (59%), Positives = 282/397 (71%), Gaps = 22/397 (5%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
S V EIEGRGR LV++QSL+ GQI+L DSPILLYSA P +SS ++C NCFR
Sbjct: 11 SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70
Query: 64 SSSI-CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
S+ + C SC FCSP C T A SSSHSP+ C L+ L SP + +RQ+QARFL
Sbjct: 71 STLVSCSSCPCLFCSPDCLTHALSSSHSPWACLTLSLLRA--SPSLSLSHSERQVQARFL 128
Query: 123 IAAYNLAVVNPSQFQILLAFQGTV-----TDNDTSAAHYLSSLCPPP--ASTTIELTAAL 175
+AAYNLA+V+PS F ILL+ QG +D T LSSL PP A ++ELT AL
Sbjct: 129 VAAYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTAL 188
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
LAKD+LNAFGLMEP G RSVRAYGIY KASFFNHDCLPNACRFDYVD A+ +NTD
Sbjct: 189 LAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTD 248
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
I +R+IHDVP+G EICLSYFPVN Y+ RQKRLL DY F C CDRC+VEANW D+D +E
Sbjct: 249 ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308
Query: 296 NNE---EVMDEDQDEQMVASDDDAEV---HGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
+ +VMDEDQDEQM+ S+++ E+ G+ +FPHAYFF+RYMC R+NCWGTLAPLPP
Sbjct: 309 EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGENDFPHAYFFLRYMCTRENCWGTLAPLPP 368
Query: 350 --SDATPSTVMECNVCGNLKN---DAIGREDTVGMDD 381
SDA+PS +MECNVCGN K D ED + +DD
Sbjct: 369 SDSDASPSNLMECNVCGNSKKSDEDFNSDEDRLSVDD 405
>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
Length = 405
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 236/397 (59%), Positives = 281/397 (70%), Gaps = 22/397 (5%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
S V EIEGRGR LV++QSL+ GQI+L DSPILLYSA P +SS ++C NCFR
Sbjct: 11 SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70
Query: 64 SSSI-CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
S+ + C SC FCSP C T A SSSHSP+ C L+ L SP + +RQ+QARFL
Sbjct: 71 STLVSCSSCPCLFCSPDCLTXALSSSHSPWACLTLSLLRA--SPSLSLSHSERQVQARFL 128
Query: 123 IAAYNLAVVNPSQFQILLAFQGTV-----TDNDTSAAHYLSSLCPPP--ASTTIELTAAL 175
+AAYNLA+V+PS F ILL+ QG +D T LSSL PP A ++ELT AL
Sbjct: 129 VAAYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTAL 188
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
LAKD+LNAFGLMEP G RSVRAYGIY KASFFNHDCLPNACRFDYVD A+ +NTD
Sbjct: 189 LAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTD 248
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
I +R+IHDVP+G EICLSYFPVN Y+ RQKRLL DY F C CDRC+VEANW D+D +E
Sbjct: 249 ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308
Query: 296 NNE---EVMDEDQDEQMVASDDDAEV---HGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
+ +VMDEDQDEQM+ S+++ E+ G +FPHAYFF+RYMC R+NCWGTLAPLPP
Sbjct: 309 EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGXNDFPHAYFFLRYMCTRENCWGTLAPLPP 368
Query: 350 --SDATPSTVMECNVCGNLKN---DAIGREDTVGMDD 381
SDA+PS +MECNVCGN K D ED + +DD
Sbjct: 369 SDSDASPSNLMECNVCGNSKKXDEDFNSDEDRLSVDD 405
>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
Full=ASH1-related protein 2; AltName: Full=Protein SET
DOMAIN GROUP 39
gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 398
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 215/389 (55%), Positives = 287/389 (73%), Gaps = 30/389 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
+ VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS +C +CFR S+
Sbjct: 13 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72
Query: 66 SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
C SCS V+FCSP C +SH+P++C++L RL +S ++ P DRQ+QARFL++
Sbjct: 73 QKCQSCSLVSFCSPNCF-----ASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLS 127
Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAHYL----SSLCPP-PASTTIEL 171
AYNLA +PS FQILL+ QG+ + N D++AA +L SS+CP P S + +L
Sbjct: 128 AYNLAAASPSDFQILLSLQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDL 187
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
TAALL+KD++NAFGLMEP + +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++
Sbjct: 188 TAALLSKDKVNAFGLMEP-CSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASD 246
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
NTDII+RMIHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE +WS+ +
Sbjct: 247 GNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVEFSWSEGE 306
Query: 292 NDDENNEEVMDEDQDEQMV---ASDDDAEVHG-----DTNFPHAYFFMRYMCDRDNCWGT 343
D+ E M++ +++ + +++ EV G ++NFPHAYFF+RYMC+++NC+GT
Sbjct: 307 EDENEIMEEMEDQDEQEEMEDSVGENEEEVCGNGVDDESNFPHAYFFVRYMCEKENCFGT 366
Query: 344 LAPLPPSDATPSTVMECNVCGNLKNDAIG 372
LAPLPP S V+ECNVCG++K D +G
Sbjct: 367 LAPLPPKTHDASRVLECNVCGSVKEDEVG 395
>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
Length = 379
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 226/377 (59%), Positives = 273/377 (72%), Gaps = 30/377 (7%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---------FCHNCFRKTMHSSSSI 67
GRG+VS Q L+ G +V+RDSP+LLYSA P SS +C NC+RK +HS+
Sbjct: 12 GRGVVSYQPLRGGDVVVRDSPLLLYSAFPLSRSSITSTPTSTSVYCDNCYRK-IHSAVIC 70
Query: 68 CPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
CP+CS FCSP C +A SS++P+VCQAL+RL + +S + PL+RQ+QARFLIAAY
Sbjct: 71 CPTCSHHKFCSPNCVSA---SSNAPWVCQALSRLHDCSSL-VVHQPLERQVQARFLIAAY 126
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYL----SSLCPPPAST--TIELTAALLAKDR 180
NL +V+PS FQILL+ QG D A +L SSLCPP ++ELT+ALLAKD+
Sbjct: 127 NLFLVSPSNFQILLSLQGQGGGGDDEDAQFLHSLISSLCPPTGGVPFSLELTSALLAKDK 186
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
LNAFGLMEP+ + DG+RSVRAYGIY KA+ FNHDCLPNACRFDYVD +TD+I+RM
Sbjct: 187 LNAFGLMEPF-DINDGKRSVRAYGIYPKAALFNHDCLPNACRFDYVDT---QDTDLIIRM 242
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN----DDEN 296
IHDVPQGREICLSYFPVNYDYSTRQKRL DY F CDCDRCKVEANWSD ++ DDEN
Sbjct: 243 IHDVPQGREICLSYFPVNYDYSTRQKRLREDYGFICDCDRCKVEANWSDQEHDDADDDEN 302
Query: 297 NEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPST 356
E M+ED +E M DDA D++FPHAYFF+RYMCD NCWGTLAPL S+ T
Sbjct: 303 ENEAMEEDSEEAMADEKDDAPSDNDSDFPHAYFFLRYMCDGTNCWGTLAPLAQSNVN-IT 361
Query: 357 VMECNVCGNLKNDAIGR 373
++ECNVCG +K D +G
Sbjct: 362 LLECNVCGKIKRDEVGH 378
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 206/373 (55%), Positives = 269/373 (72%), Gaps = 28/373 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
+ VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS +C +CFR S+
Sbjct: 12 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 71
Query: 66 SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
C SCS V+FCSP C +SH+P++C++L RL + +S ++ P DRQ+QARFL++
Sbjct: 72 QKCQSCSLVSFCSPNCF-----ASHTPWLCESLLRL-HQSSSAFSDQPSDRQVQARFLLS 125
Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAH-YLSSLCPP-PASTTIELTAA 174
AYNLA +PS FQILL+ QG+ N D+ H LS++CPP P + ELTAA
Sbjct: 126 AYNLAAASPSDFQILLSLQGSGCSNGDPSSSATDSGFLHSLLSAVCPPLPVCISPELTAA 185
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
LLAKD++NAFGLMEP+ D +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++ NT
Sbjct: 186 LLAKDKVNAFGLMEPFSVSND-KRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDGNT 244
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
DII+R IHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE++WS+ + D+
Sbjct: 245 DIIIRTIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVESSWSEGEEDE 304
Query: 295 ENNEEVMDEDQDEQMVASDDDAEVH--------GDTNFPHAYFFMRYMCDRDNCWGTLAP 346
E M + DE+ + + V ++NFPHAYFF+RYMC++++C+GTLAP
Sbjct: 305 NEVMEEMGNEDDEEEMEDSEGENVEEVCGNGVDDESNFPHAYFFVRYMCEKESCFGTLAP 364
Query: 347 LPPSDATPSTVME 359
LPP S V+E
Sbjct: 365 LPPKTHDASRVLE 377
>gi|357484667|ref|XP_003612621.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
gi|355513956|gb|AES95579.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
Length = 384
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 183/384 (47%), Positives = 245/384 (63%), Gaps = 45/384 (11%)
Query: 1 MSAVSET--VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNC 56
M+A + T + + I +GRGL++TQ LKAGQI+L +SP+LLYSA P + S +CHNC
Sbjct: 28 MAAQTSTSLLRLEHIPNKGRGLIATQDLKAGQIILTESPLLLYSATPLFSHTPSPYCHNC 87
Query: 57 FRKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
FR S + CPSCS FCS KC + A +SSHS + CQ L+ L N SP + P +
Sbjct: 88 FRTLPPSQTFQCPSCSNYLFCSQKCLSIALNSSHSSWTCQTLSHLQNPTSPLSEK-PCEL 146
Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA--AHYLSSL----CPPPASTTI 169
Q+QAR ++AAYNLA+ PS+ Q LL+ G D D+ A +L SL C P + +
Sbjct: 147 QVQARLIVAAYNLAIHTPSKLQTLLSLHGNPNDQDSIVDNAKFLHSLISPFCSPHMNFSA 206
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
EL A ++AK+RLN+F LMEPY Q G +RS++AYGIY KA+FFNH C+PNACRFDYV++
Sbjct: 207 ELAAKIIAKERLNSFCLMEPY--SQKGPQRSIKAYGIYQKATFFNHGCIPNACRFDYVES 264
Query: 229 AA---ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
E+NTDI++R+I DV G EIC+SYF +N DYSTR+K L+ DY F C+CDRCK+EA
Sbjct: 265 GEPGDEHNTDIVIRLIKDVGVGSEICISYFRINKDYSTRKKILMEDYGFTCECDRCKIEA 324
Query: 286 NWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLA 345
+W+D +N ++ PH F +++CD++NC GT+A
Sbjct: 325 SWNDGENKY---------------------------SDLPHVIFLSKFVCDKENCAGTMA 357
Query: 346 PLPPSDATPSTVMECNVCGNLKND 369
PLPP D S V+ECN CGNLK D
Sbjct: 358 PLPPKDGEKSNVLECNFCGNLKVD 381
>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 341
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 176/299 (58%), Positives = 226/299 (75%), Gaps = 22/299 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
+ VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS +C +CFR S+
Sbjct: 13 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72
Query: 66 SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
C SCS V+FCSP C +SH+P++C++L RL +S ++ P DRQ+QARFL++
Sbjct: 73 QKCQSCSLVSFCSPNCF-----ASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLS 127
Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAHYL----SSLCPP-PASTTIEL 171
AYNLA +PS FQILL+ QG+ + N D++AA +L SS+CP P S + +L
Sbjct: 128 AYNLAAASPSDFQILLSLQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDL 187
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
TAALL+KD++NAFGLMEP + +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++
Sbjct: 188 TAALLSKDKVNAFGLMEP-CSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASD 246
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
NTDII+RMIHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE +WS+
Sbjct: 247 GNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVEFSWSEG 305
>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
angustifolius]
Length = 357
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 178/381 (46%), Positives = 247/381 (64%), Gaps = 47/381 (12%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCHNCFR 58
S + + I G+G+GLV+ QSLKAGQ++LR+SP+++YS+ P I++SS +C +CFR
Sbjct: 6 SSLLRLDHIAGKGKGLVAAQSLKAGQVILRESPLIIYSSSPLISTSSPSSLFPYCDHCFR 65
Query: 59 KTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
T+ +++ CPSCS FCS KC + A ++ HS VC AL+ L + S P +RQ+
Sbjct: 66 -TLPTNTIPCPSCSYHHFCSHKCFSTAFNTFHSSLVCHALSHLKD--SESLQQQPYERQV 122
Query: 118 QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC----PPPASTTIELTA 173
QARF++AAYNLA+++PS L+ GT D AA +L SL PP + +++LTA
Sbjct: 123 QARFVVAAYNLAIISPSGIHAFLSLHGTPDDTIVEAAKFLHSLISPLFPPSVNISVDLTA 182
Query: 174 ALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA--- 229
LLAKDR+N+F LM PY DG +RS++AY IY KAS FNHDC+PNACRFDYVD+A
Sbjct: 183 QLLAKDRINSFCLMNPY--SPDGPQRSIKAYAIYPKASMFNHDCIPNACRFDYVDSADLD 240
Query: 230 -AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
NNTD+++RMI D+P+GRE+C+SYF ++ DY TR++ L+ DY F+C+CDRCK+EANW
Sbjct: 241 DEHNNTDMVIRMIQDLPEGREVCISYFRISRDYCTRKRILMDDYGFSCECDRCKIEANWP 300
Query: 289 DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLP 348
+D +N E ++ PH F +Y+CDR NC GTLA
Sbjct: 301 ---HDCQNYVEEY--------------------SDLPHVRFIAKYVCDRKNCNGTLA--- 334
Query: 349 PSDATPSTVMECNVCGNLKND 369
P D + V+ECN CGNLK+D
Sbjct: 335 PKDDAHTNVLECNFCGNLKSD 355
>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
[Brachypodium distachyon]
gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
[Brachypodium distachyon]
Length = 389
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 159/388 (40%), Positives = 225/388 (57%), Gaps = 39/388 (10%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ + VA++ GRGRGLV+ + + G+++L + PILLY + + S+C CFR +
Sbjct: 2 AGDALRVADLPGRGRGLVAARDILEGEVLLSEPPILLYPS-SLASLPSYCSACFRCLPQA 60
Query: 64 S-SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
++ CPSC + AFCSP C A++SH +C AL+RL A P Q Q F
Sbjct: 61 PHAAPCPSCRAAAFCSPAC----AAASHPRLLCAALSRL--------AAAPESHQEQLLF 108
Query: 122 LIAAYNLAVVNPSQFQILLAF----QGTVTDNDTSAAHYLSSLCPP---PASTTIELTAA 174
L++AY+L F LL+ QGT + S +SSL PP PA + +LTAA
Sbjct: 109 LLSAYSL---QEPAFHALLSLSSAPQGTQQQDAASLHAMVSSLAPPHMLPAGFSPDLTAA 165
Query: 175 LLAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
LL+KDR N+F +MEPY + G R RAY +Y +AS NHDCLPNAC FDY D N
Sbjct: 166 LLSKDRTNSFSIMEPYRPDVAQGLRKARAYAVYHRASLLNHDCLPNACHFDYPDRPGPGN 225
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS----- 288
TDI++R +H + G E+ +SYF N+ Y+ RQ+RLL DY F C+C+RC++E+ W
Sbjct: 226 TDIVLRALHGITAGMEVRISYFAANWRYADRQRRLLEDYGFRCECERCQIESKWKFDDDE 285
Query: 289 -------DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCW 341
D+ ++E+++E + DE M + + + +FPHA+FF+R+MCDR++C
Sbjct: 286 NDGGDGDDDTMEEEHDKEDVGNGGDEGMEQEEGSDGDNDNDDFPHAFFFVRFMCDREDCH 345
Query: 342 GTLAPLPP-SDATPSTVMECNVCGNLKN 368
G LAPLPP S V ECNVC LK
Sbjct: 346 GMLAPLPPLPSGELSHVFECNVCEQLKK 373
>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
Length = 404
Score = 278 bits (710), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/400 (41%), Positives = 234/400 (58%), Gaps = 28/400 (7%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M+A + + VA++ GRGRGLV+ ++++ G+++L + P+LLY + + S+C CFR
Sbjct: 10 MAAAGDVLRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPST-LSSLRSYCSACFRSL 68
Query: 61 MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQ 118
+++ C SC + AFCSP C A++SH +C AL+ AA P + Q
Sbjct: 69 PPAATIPCVSCRAAAFCSPAC----AAASHPHLLCAALSHGGGAGL--AAAAPTEAIQEP 122
Query: 119 ARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTA 173
FL++AY+L S + A D + H + P P + +LTA
Sbjct: 123 LLFLLSAYSLPESTMRSMLSLSSAPPPPPGAQDAAGLHAAVAALAPPHMLPQGLSPDLTA 182
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
ALLAKDR N+F +MEPY G RAY +Y +AS FNHDCLPNAC FDY D
Sbjct: 183 ALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPG 242
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
NTDI+VR +HD+P+GRE+C+SYF N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+D+
Sbjct: 243 NTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDDDD 302
Query: 293 DDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
++ ++ + E++ E V D A GD +FPHAYFF+RY+CD + CWG
Sbjct: 303 NNGDDGDDTMEEEGEDDVGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVRYLCDNEGCWG 362
Query: 343 TLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
LAPLPPS + S V ECN+CG L+ D I E T GM
Sbjct: 363 MLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 402
>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
Length = 404
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 165/402 (41%), Positives = 234/402 (58%), Gaps = 34/402 (8%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
+A + + VA++ GRGRGLV+ ++++ G+++L + P+LLY + + +S+C CFR
Sbjct: 11 AAAGDALRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPST-LSSLASYCSACFRSLP 69
Query: 62 HSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQA 119
+++ C SC + AFCSP C A++SH +C AL+ AA P + Q
Sbjct: 70 AAATVPCASCRAAAFCSPAC----AAASHPRLLCAALSHGGGAGV--AAAAPTEAIQEPL 123
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIEL 171
FL++AY+L S +L+ + A L+ P + +L
Sbjct: 124 LFLLSAYSLPE---STIHSILSLSSAPPPPPGAQDAAGLHAAVAALAPPHMLPQGFSPDL 180
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
TAALLAKDR N+F +MEPY G RAY +Y +AS FNHDCLPNAC FDY D
Sbjct: 181 TAALLAKDRANSFAIMEPYRVGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPG 240
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
NTDI+VR +HD+P+GRE+C+SYF N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+
Sbjct: 241 PGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDD 300
Query: 291 DND----------DENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNC 340
D+D +E E+ + D+ M + D G+ +FPHAYFF+RY+CD + C
Sbjct: 301 DDDNNGGDGDDTMEEEGEDDAGDRGDDGMEEEEGDGATDGNDDFPHAYFFVRYLCDNEGC 360
Query: 341 WGTLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
WG LAPLPPS + S V ECN+CG L+ DA+ E T GM
Sbjct: 361 WGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDAMPDEGTSGM 402
>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
Length = 392
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 160/384 (41%), Positives = 227/384 (59%), Gaps = 26/384 (6%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ + A++ GRGRGL++ +S++ G+++L + P+LLY A + SFC CFR ++ +
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPAS-LASLPSFCSACFR-SLSA 59
Query: 64 SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
++S CPSC A FCSP C A++SH +C AL+ + +A P L FL
Sbjct: 60 AASPCPSCRAAGFCSPSC----AAASHPRLLCTALSGGGGNGNLASAAEPHQEPLL--FL 113
Query: 123 IAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPP---PASTTIELTAALLA 177
++AY+L + A +D D + H + ++L PP P + +LTAALL+
Sbjct: 114 LSAYSLPEPSLRVLLSLSSAATPPPSDQDPGSLHAMVAALVPPQMLPPGFSPDLTAALLS 173
Query: 178 KDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
KDR N+F +MEPY E R RAY +Y +AS NHDCLPNAC FDY D NTDI
Sbjct: 174 KDRTNSFSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDI 233
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
+VR +HD+ +GRE+CLSYF N+ Y RQ+RLL DY F C+C+RC+VE+ W +D+ D
Sbjct: 234 VVRALHDITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGG 293
Query: 297 NEEVM----------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAP 346
+ + E D+ M + D D +FPH+YFF+RY+C+ C+G LAP
Sbjct: 294 DGDDTMEEEEEDGNGGEGGDDGMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAP 353
Query: 347 LPP-SDATPSTVMECNVCGNLKND 369
LPP + PS V ECNVCGNLKN+
Sbjct: 354 LPPLPNGEPSHVFECNVCGNLKNE 377
>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
Length = 392
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 160/384 (41%), Positives = 227/384 (59%), Gaps = 26/384 (6%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ + A++ GRGRGL++ +S++ G+++L + P+LLY A + SFC CFR ++ +
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPAS-LSSLPSFCSACFR-SLSA 59
Query: 64 SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
++S CPSC A FCSP C A++SH +C AL+ + +A P L FL
Sbjct: 60 AASPCPSCRAAGFCSPSC----AAASHPRLLCTALSGGGGNGNLASAAEPHQEPLL--FL 113
Query: 123 IAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPP---PASTTIELTAALLA 177
++AY+L + A D D + H + ++L PP P + +LTAALL+
Sbjct: 114 LSAYSLPEPSLRVLLSLSSAATPPPRDQDPGSLHAMVTALVPPQMLPPGFSPDLTAALLS 173
Query: 178 KDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
KDR N+F +MEPY E R RAY +Y +AS NHDCLPNAC FDY D NTDI
Sbjct: 174 KDRTNSFSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDI 233
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
+VR +HD+ +GRE+CLSYF N+ Y RQ+RLL DY F C+C+RC+VE+ W +D+ D
Sbjct: 234 VVRALHDITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGG 293
Query: 297 NEEVM----------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAP 346
+ + E D++M + D D +FPH+YFF+RY+C+ C+G LAP
Sbjct: 294 DGDDTMEEEEEDGNGGEGGDDEMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAP 353
Query: 347 LPP-SDATPSTVMECNVCGNLKND 369
LPP + PS V ECNVCGNLKN+
Sbjct: 354 LPPLPNGEPSHVFECNVCGNLKNE 377
>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
Length = 404
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 166/400 (41%), Positives = 232/400 (58%), Gaps = 28/400 (7%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M+A + + VA++ GRGRGLV+ + ++ G+++ + P+LLY + + S+C CFR
Sbjct: 10 MAAAGDVLRVADLPGRGRGLVAARIVREGEVLFSEPPVLLYPST-LSSLRSYCSACFRSL 68
Query: 61 MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQ 118
+++ C SC + AFCSP C A++SH +C AL+ AA P + Q
Sbjct: 69 TPAATIPCVSCRAAAFCSPAC----AAASHPHLLCAALSHGGGAGL--AAAAPTEAIQEP 122
Query: 119 ARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTA 173
FL++AY+L S + A D + H + P P + +LTA
Sbjct: 123 LLFLLSAYSLPESTMRSMLSLSSAPPPXPGAQDAAGLHAAVAALAPPHMLPQGLSPDLTA 182
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
ALLAKDR N+F +MEPY G RAY +Y +AS FNHDCLPNAC FDY D
Sbjct: 183 ALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPG 242
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
NTDI+VR +HD+P+GRE+C+SYF N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+D+
Sbjct: 243 NTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDDDD 302
Query: 293 DDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
++ ++ + E++DE D A GD +FPHAYFF+RY+CD + CWG
Sbjct: 303 NNGDDGDDTMEEEDEDDAGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVRYLCDNEGCWG 362
Query: 343 TLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
LAPLPPS + S V ECN+CG L+ D I E T GM
Sbjct: 363 MLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 402
>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 151/377 (40%), Positives = 214/377 (56%), Gaps = 26/377 (6%)
Query: 13 IEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSI-CPS 70
I GRGRG+V+T+ L+AG ++LR+SP+LLY S++FC NC R ++ + C +
Sbjct: 6 IAGRGRGMVATRMLRAGAVLLRESPLLLYQEAESAESTAFCSNCMRVLSLPGREKVPCSA 65
Query: 71 CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
C A FC C +A SH+ VC + L S D L+ + ARFLIAAYNLA
Sbjct: 66 CDGATFCGKSCFESARVGSHNTRVCSGIAMLQK--SKD--KFSLEHRTLARFLIAAYNLA 121
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHY-----LSSLCPPPAST--TIELTAALLAKDRLN 182
V +P F LL +G D + + + + AS ++ELT LLA+D N
Sbjct: 122 VEDPLAFAKLLELEGQECDEEVVKSLHDFVVEIFGKLDGGASLELSLELTGKLLARDACN 181
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
FGLM P G++ R VR Y ++ +AS FNHDCLPNACRF+YVD + NTD+IVR +H
Sbjct: 182 TFGLMAPSCAGEE--RKVRGYAMFAQASMFNHDCLPNACRFEYVDIDGDGNTDVIVRALH 239
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-------E 295
D+ +G E+CLSYFPV++ Y RQ++L +Y F C C RC VE+ W D D +
Sbjct: 240 DMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARCNVESKWKDEDEEQDEEDEGYS 299
Query: 296 NNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPS 355
N E + + + D + + +F HA FF++Y+C + C GT+APLPP +
Sbjct: 300 NGELELGKGEATLEDVESIDDDDDDEDDFGHAMFFVKYLCPVEGCGGTMAPLPPGELDHQ 359
Query: 356 TV---MECNVCGNLKND 369
+ MECN CG +++D
Sbjct: 360 GLEGSMECNFCGCVRSD 376
>gi|255644430|gb|ACU22720.1| unknown [Glycine max]
Length = 381
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 145/295 (49%), Positives = 200/295 (67%), Gaps = 22/295 (7%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS----FCHNCFR---KTMH 62
+ +I G+GRGLV++Q LKAGQIVL +SP++LYSA P + SS +C +CFR T +
Sbjct: 1 MEQIPGKGRGLVASQPLKAGQIVLTESPLILYSASPLLTPSSSPYTYCDHCFRILPLTHN 60
Query: 63 SSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL-NLNSPDAANLPLDRQLQAR 120
S++ CPSCS +FCS KC + A SSHS +VC+AL L + NS P +RQ+QAR
Sbjct: 61 STTVTCPSCSNHSFCSQKCFSLALKSSHSTWVCKALMSLQQHPNSTLLQQHPQERQVQAR 120
Query: 121 FLIAAYNLAVVN--PSQFQILLAFQGTVTDNDTSAAHYLSSLC----PPPASTTIELTAA 174
++A++ L + N PS+ L+ GT D AA++L SL PP A +++L A
Sbjct: 121 LIVASHKLFLHNHTPSELDTFLSLHGTPDDAILDAANFLHSLISPLFPPQAQLSVDLIAQ 180
Query: 175 LLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA---- 229
LLAKDRLN+FGLM+PY DG +RS++AY IY KA+FFNHDC+PNACRFDYVD+
Sbjct: 181 LLAKDRLNSFGLMDPY--SPDGPQRSIKAYAIYPKATFFNHDCVPNACRFDYVDSTNDDY 238
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
N+TDI++R+I DV +G+E+C+SYF + DY TR++ L+ DY F C CDRC ++
Sbjct: 239 ERNSTDIVIRLIEDVDEGKEVCISYFRIGRDYCTRKRILMEDYGFTCGCDRCTLK 293
>gi|326511411|dbj|BAJ87719.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 160/395 (40%), Positives = 228/395 (57%), Gaps = 35/395 (8%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
+T+ VAE+ GRGR L++ + + G+++L +SPILLY + + SS+C CFR
Sbjct: 3 GDTLRVAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPP 61
Query: 65 SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
+ CPSC + AFCSP C A++SH +C AL+ L +P+A PL FL+
Sbjct: 62 HTPCPSCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLL 110
Query: 124 AAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALL 176
+AY+L + S L + QG AA ++S+ PP PA +++LTAALL
Sbjct: 111 SAYSLQEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALL 170
Query: 177 AKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+KDR N+F ++EPY R R +Y +AS NHDCLPNAC FDY D NTD
Sbjct: 171 SKDRTNSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTD 230
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS------- 288
++VR +H + +G E+C+SYF N+ Y+ RQ+RLL DY F C+CDRC++E+ W
Sbjct: 231 MVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDG 290
Query: 289 ----DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
D D+E+ +E ++ DE M ++ D +FPHA+FF+RY+CDRD+C+G L
Sbjct: 291 ADGDDTMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGML 348
Query: 345 APLPP-SDATPSTVMECNVCGNLKNDAIGREDTVG 378
APLPP + S V ECN CG LK + E G
Sbjct: 349 APLPPLPNGELSHVFECNACGQLKKEEDEDEPDAG 383
>gi|326511387|dbj|BAJ87707.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 160/395 (40%), Positives = 228/395 (57%), Gaps = 35/395 (8%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
+T+ VAE+ GRGR L++ + + G+++L +SPILLY + + SS+C CFR
Sbjct: 3 GDTLRVAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPP 61
Query: 65 SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
+ CPSC + AFCSP C A++SH +C AL+ L +P+A PL FL+
Sbjct: 62 HTPCPSCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLL 110
Query: 124 AAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALL 176
+AY+L + S L + QG AA ++S+ PP PA +++LTAALL
Sbjct: 111 SAYSLQEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALL 170
Query: 177 AKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+KDR N+F ++EPY R R +Y +AS NHDCLPNAC FDY D NTD
Sbjct: 171 SKDRTNSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTD 230
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS------- 288
++VR +H + +G E+C+SYF N+ Y+ RQ+RLL DY F C+CDRC++E+ W
Sbjct: 231 MVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDG 290
Query: 289 ----DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
D D+E+ +E ++ DE M ++ D +FPHA+FF+RY+CDRD+C+G L
Sbjct: 291 ADGDDTMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGML 348
Query: 345 APLPP-SDATPSTVMECNVCGNLKNDAIGREDTVG 378
APLPP + S V ECN CG LK + E G
Sbjct: 349 APLPPLPNGELSHVFECNACGQLKKEEDEDEPDAG 383
>gi|326508652|dbj|BAJ95848.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 159/390 (40%), Positives = 225/390 (57%), Gaps = 35/390 (8%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
VAE+ GRGR L++ + + G+++L +SPILLY + + SS+C CFR + CP
Sbjct: 2 VAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPPHTPCP 60
Query: 70 SC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
SC + AFCSP C A++SH +C AL+ L +P+A PL FL++AY+L
Sbjct: 61 SCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLLSAYSL 109
Query: 129 AVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALLAKDRL 181
+ S L + QG AA ++S+ PP PA +++LTAALL+KDR
Sbjct: 110 QEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALLSKDRT 169
Query: 182 NAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N+F ++EPY R R +Y +AS NHDCLPNAC FDY D NTD++VR
Sbjct: 170 NSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTDMVVRA 229
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-----------D 289
+H + +G E+C+SYF N+ Y+ RQ+RLL DY F C+CDRC++E+ W D
Sbjct: 230 LHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDGADGDD 289
Query: 290 NDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
D+E+ +E ++ DE M ++ D +FPHA+FF+RY+CDRD+C+G LAPLPP
Sbjct: 290 TMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGMLAPLPP 347
Query: 350 -SDATPSTVMECNVCGNLKNDAIGREDTVG 378
+ S V ECN CG LK + E G
Sbjct: 348 LPNGELSHVFECNACGQLKKEEDEDEPDAG 377
>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
Length = 268
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 116/230 (50%), Positives = 150/230 (65%), Gaps = 14/230 (6%)
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACR 222
P + +LTAALLAKDR N+F +MEPY G RAY +Y +AS FNHDCLPNAC
Sbjct: 37 PQGLSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACH 96
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
FDY D NTDI+VR +HD+P+GRE+C+SYF N+ Y+ RQ+RLL DY F C+CDRC+
Sbjct: 97 FDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQ 156
Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMR 332
VE+ W D+D+++ ++ + E++ E V D A GD +FPHAYFF+R
Sbjct: 157 VESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVR 216
Query: 333 YMCDRDNCWGTLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
Y+CD + CWG LAPLPPS + S V ECN+CG L+ D I E T GM
Sbjct: 217 YLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 266
>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
Length = 331
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 140/378 (37%), Positives = 195/378 (51%), Gaps = 75/378 (19%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ + A++ GRGRGL++ +S++ G+++L + P+LL + ++ S
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLL----------------YPASLAS 45
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
S C +C + +AAAS S R SP A R +
Sbjct: 46 LPSFCSACFRSL------SAAASPCPS-------CRAAGFCSPSCAAASHPR-------L 85
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
A A+V P Q+L P + +LTAALL+KDR N+
Sbjct: 86 HAMVAALVPP---QML------------------------PPGFSPDLTAALLSKDRTNS 118
Query: 184 FGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
F +MEPY E R RAY +Y +AS NHDCLPNAC FDY D NTDI+VR +H
Sbjct: 119 FSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALH 178
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM- 301
D+ +GRE+CLSYF N+ Y RQ+RLL DY F C+C+RC+VE+ W +D+ D + +
Sbjct: 179 DITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGGDGDDTM 238
Query: 302 ---------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP-SD 351
E D+ M + D D +FPH+YFF+RY+C+ C+G LAPLPP +
Sbjct: 239 EEEEEDGNGGEGGDDGMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAPLPPLPN 298
Query: 352 ATPSTVMECNVCGNLKND 369
PS V ECNVCGNLKN+
Sbjct: 299 GEPSHVFECNVCGNLKNE 316
>gi|302774507|ref|XP_002970670.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
gi|300161381|gb|EFJ27996.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
Length = 292
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/262 (45%), Positives = 168/262 (64%), Gaps = 16/262 (6%)
Query: 118 QARFLIAAYNLAVVNPSQFQ--ILLAFQGTVTDNDTSAAH-------YLSSLCPPPA-ST 167
QARFL+AA +LA N FQ +LL Q +V ND A L + C A
Sbjct: 20 QARFLLAAVSLAARNAEDFQRLLLLDGQASVFANDPRPAGVAELVLALLRTHCFELAKGL 79
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
++E+ LLAKD++NAFGLM P +R VRAY Y +AS NHDCLPNACRF+Y+D
Sbjct: 80 SLEMFLNLLAKDKINAFGLMAP--SSGTAQRKVRAYARYAQASMLNHDCLPNACRFEYLD 137
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+NTDI +R++HDVPQG EIC+SYFPVN++Y R++RL+ DY F C+C+RC+VE W
Sbjct: 138 KPGASNTDIYIRLLHDVPQGSEICISYFPVNWNYKERRERLVEDYGFECNCERCRVEQTW 197
Query: 288 SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPL 347
SD D + + E + + D +++ + GD +FPHA FF++Y+C ++C GT+APL
Sbjct: 198 SDGDGEGDEEEAMEHDQTDGDGEDEEEEMDKDGDGDFPHAMFFVKYLCPEESCGGTMAPL 257
Query: 348 PPSDATPSTVMECNVCGNLKND 369
P + + +MECNVCG L+++
Sbjct: 258 PGN----ANLMECNVCGRLRSE 275
>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
Length = 327
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 93/128 (72%), Positives = 103/128 (80%), Gaps = 3/128 (2%)
Query: 163 PP---ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
PP A ++ELT ALLAKD+LNAFGLMEP G RSVRAYGIY KASFFNHDCLPN
Sbjct: 104 PPQGVAGFSVELTTALLAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPN 163
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
ACRFDYVD A+ +NTDI +R+IHDVP+G EICLSYFPVN Y+ RQKRLL DY F C CD
Sbjct: 164 ACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCD 223
Query: 280 RCKVEANW 287
RC+VEANW
Sbjct: 224 RCRVEANW 231
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/69 (56%), Positives = 48/69 (69%), Gaps = 2/69 (2%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
S V EIEGRGR LV++QSL+ GQI+L DSPILLYSA P +SS ++C NCFR
Sbjct: 11 SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70
Query: 64 SSSI-CPSC 71
S+ + C SC
Sbjct: 71 STLVSCSSC 79
>gi|302771838|ref|XP_002969337.1| hypothetical protein SELMODRAFT_410322 [Selaginella moellendorffii]
gi|300162813|gb|EFJ29425.1| hypothetical protein SELMODRAFT_410322 [Selaginella moellendorffii]
Length = 172
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 114/159 (71%), Gaps = 4/159 (2%)
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
NHDCLPNACRF+Y+D +NTDI +R++HDVPQG EIC+SYFPVN++Y R++RL+
Sbjct: 1 MLNHDCLPNACRFEYLDKPGASNTDIYIRLLHDVPQGSEICVSYFPVNWNYKERRERLVE 60
Query: 271 DYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFF 330
DY F C+C+RC+VE WSD D + + E + + D V +++ + GD +FPHA FF
Sbjct: 61 DYGFECNCERCRVEQTWSDGDGEGDEEEAMEHDQTDGDGVDEEEEMDKDGDGDFPHAMFF 120
Query: 331 MRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
++Y+C ++C GT+APLP + + +MECNVCG L+++
Sbjct: 121 VKYLCPEESCGGTMAPLPGN----ANLMECNVCGRLRSE 155
>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
C-169]
Length = 329
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/201 (37%), Positives = 114/201 (56%), Gaps = 14/201 (6%)
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR---FDYV 226
E A LAK+ LN +G+M P G +G R +R +Y KAS NH+CLPN R FD
Sbjct: 113 EEVADALAKEELNGYGIMAP--SGPEGERRIRGTALYPKASLLNHECLPNVARVDNFDEA 170
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
D A NT + + +H++P G E SYFP++ Y RQ+R Y FAC+C RCK E+
Sbjct: 171 DVNAPENTAVHFKALHNLPAGEEFTQSYFPMHTSYHVRQQRCQDQYGFACNCPRCKEEST 230
Query: 287 W-SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPH--AYFFMRYMCDRDNCWGT 343
W SD++ D+ +EE M+ +++ + ++AE GD F ++Y+C + +C+GT
Sbjct: 231 WPSDDEADNLGDEEQMEVEREAGSMG--EEAEQDGDRADAGYIQVFLLKYVCPQTDCFGT 288
Query: 344 LAPLPPSDATPSTVMECNVCG 364
+ P+ S++ CN+CG
Sbjct: 289 MVPIQAG----SSLHVCNMCG 305
>gi|307105392|gb|EFN53641.1| hypothetical protein CHLNCDRAFT_136351 [Chlorella variabilis]
Length = 430
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 112/223 (50%), Gaps = 31/223 (13%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-- 232
LL K+ N +G+M P DG R +R +Y +AS NH+CLPN RFD DA++
Sbjct: 202 LLRKEATNGYGVMAP--SAPDGERRIRGTALYAQASLINHECLPNVARFDRFDASSAPAS 259
Query: 233 -------NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
NT + R +HD+P G E+ SYFP+ + + RQ+R DY F C C RCK EA
Sbjct: 260 SAPHPGANTAVEFRALHDIPAGEELTQSYFPLPWPLADRQQRCQEDYGFQCTCPRCKEEA 319
Query: 286 NWSDNDNDDENNEEVM----------------DEDQDEQMVASDDDAEVHGDTNFPH--A 327
WSD++ + + + + Q + AE G + P +
Sbjct: 320 TWSDSEWEADGDAGMEDALAAEGAADGAACGHHSHQHDAPGGGPAAAEQEGGAD-PACIS 378
Query: 328 YFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDA 370
F ++Y+C + C+GTLAPLPP A ECN+CG +++A
Sbjct: 379 LFLLKYVCPAEGCYGTLAPLPPG-AGAGDQYECNMCGRRRSEA 420
>gi|302835433|ref|XP_002949278.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
nagariensis]
gi|300265580|gb|EFJ49771.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
nagariensis]
Length = 467
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 140/357 (39%), Gaps = 97/357 (27%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
+GR L +T+ +K G+ VL ++P+LL A + S C C R C C A
Sbjct: 23 KGRSLYTTRLVKGGETVLDEAPLLLLVAPE--TAQSTCIACLRAVRPQECIECSVCRQAC 80
Query: 75 FCSPKCSTAAASSS--HSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
FCS C A + HS +C A RL LP ++Q Q RFL+ A L
Sbjct: 81 FCSATCQQTAQGTPWLHSQPLCSAYARLAMFG------LPPEQQSQLRFLLHALALRHAG 134
Query: 133 PSQ---------------------------FQILLAFQGTVTDNDTSAAHYLSSLCPP-- 163
++ F L G T+ + + A L+ L
Sbjct: 135 TAEAGELGAGPQPLGKTPIGRHNQEKGQRRFAALSCLVGEATEAERAVAARLAPLLVEAL 194
Query: 164 ----------PASTTIELTAALLAKDRLNAFGLME-PYI--------------------- 191
P +E A LL K++LN++G++ PY
Sbjct: 195 RDLQGVGGSLPIPWDVEELARLLRKEQLNSYGVLAAPYQYQHHHHHHHHGGTAAPGPGCS 254
Query: 192 -EGQDGR------------------------RSVRAYGIYLKASFFNHDCLPNACRFDYV 226
EG +G+ R +R +Y +AS NH+CLPN RFD
Sbjct: 255 GEGGNGQTAPKSSCGAGCSGGGSGGEAGDGERRLRGSALYAQASLINHECLPNVARFDDF 314
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
D+ + T + R +HD+P G E+ SY P+++D + RQ + Y FAC C RC+
Sbjct: 315 DSDLPDRTHVTFRALHDLPPGTELAQSYVPLHWDLAERQAQCREVYGFACTCPRCQT 371
>gi|255083056|ref|XP_002504514.1| set domain protein [Micromonas sp. RCC299]
gi|226519782|gb|ACO65772.1| set domain protein [Micromonas sp. RCC299]
Length = 462
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 120/461 (26%), Positives = 177/461 (38%), Gaps = 113/461 (24%)
Query: 7 TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS 66
V EI GRG GLV+ + + AG+ V+ + +L+ + F C +C H +++
Sbjct: 5 AVAQREIPGRGVGLVAARDIDAGETVVAEHALLVGVSEEF--RRVCCASCL---AHGAAT 59
Query: 67 ICPSCSVAFCSPKCSTAAA----SSSHSPYVCQALTRLLN-----LNSPDAANLPL---- 113
C C C A+ + H VC A R+ + L++PD L
Sbjct: 60 PCSGCGEVVLCDTCRDPASPWHHGAGHGAVVCAA-ERVASAWRPPLSAPDRERLRFLGAC 118
Query: 114 -----------DRQLQARFLIAAYNLAVVNPSQFQI---------------LLAFQGTV- 146
D + +AR V P+ I A Q T+
Sbjct: 119 AELRRAHVVNGDARARARLDAVLRLCPDVGPAGEGIDPREASAAERVRPALEAAVQATIA 178
Query: 147 -TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI--------EGQDGR 197
T+ +T+ L ++ A IE AALLAK+ NAFG+M +G D
Sbjct: 179 TTEANTAGGGLLRAI----AGDAIE-NAALLAKETKNAFGVMAATRGGDGSHPNDGSDDD 233
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE-----------------NNTDIIVRM 240
R VR +Y AS NH C PN RFD D T++ +
Sbjct: 234 RRVRGGAVYELASRVNHACFPNVARFDNFDGVLSSPSHEFYVPHRDGPNPPGPTELRLVA 293
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
I +P G E+ +SY PV+ + R++RL + F C+C+RC +E W+ D +E
Sbjct: 294 IDKIPAGCEVLMSYLPVSESCARRRRRLRNTFGFGCECERCVIECAWATEDGAATGDEGA 353
Query: 301 MDEDQDEQMVASDDDAEVHGDT------------------------------NFP--HAY 328
+ A D+ AE+ D P +A
Sbjct: 354 YRDASRRAREARDEAAELEADAIADALDRECDEKEEVANQSLDAAELTRRADRIPPTYAL 413
Query: 329 FFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
+F+R MC D C GTLAP P + A M CN CG ++D
Sbjct: 414 WFVRNMCPVDGCGGTLAP-PHARADH---MTCNYCGERRSD 450
>gi|357438875|ref|XP_003589714.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
gi|355478762|gb|AES59965.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
Length = 130
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/129 (49%), Positives = 81/129 (62%), Gaps = 15/129 (11%)
Query: 211 FFNHDCLPNACRFDYVD-------AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
FF HDC PNACRFD+V+ AA+ NNTDII+RM HDV QG EICLSYF VN YS+
Sbjct: 4 FFKHDCFPNACRFDHVNVNPPHHAAASPNNTDIIIRMTHDVRQGMEICLSYFVVNKTYSS 63
Query: 264 RQKR-LLYDYDFACDCD-RCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGD 321
RQK+ LL DY F C+C+ C +A +N +DE VMD++++EQ +H
Sbjct: 64 RQKKILLEDYCFKCNCESNCSDDARVEENVEEDEQEVMVMDDEEEEQC------ENIHQT 117
Query: 322 TNFPHAYFF 330
PH +FF
Sbjct: 118 LTIPHIFFF 126
>gi|303281140|ref|XP_003059862.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458517|gb|EEH55814.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 116/515 (22%), Positives = 175/515 (33%), Gaps = 173/515 (33%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS-SSS 66
+ +A+I GRG+G+V+T+ ++ G+ VL + + + + + ++ C C R+ +
Sbjct: 17 IALADIPGRGKGVVATREIRPGECVLEERATM--TGVSESHRATTCAGCLRRAARCVRTR 74
Query: 67 ICPSCSVAFCSPKCSTAA-----ASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQ 118
C C C AA A + A TR+ A+ L DR+ +
Sbjct: 75 ACERCGEVLLCDDCHRAAELAFGADDARLAMAPNAHTRVKCRGEAVGASAALSTGDRE-R 133
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP----------PPAS-- 166
RFL+A L + + + T D ++SLCP P AS
Sbjct: 134 FRFLVACAELRAMKTMRAE-------TQAREDDRGFDDVASLCPREDDDGGGRWPIASHV 186
Query: 167 -----------------------------------------TTIELTAALLAKDRLNAFG 185
TI A LLAK+ NAFG
Sbjct: 187 DAATRLHPLLFATMHDADIERGVLKVNGASLSSDDFERMYVGTIRENAMLLAKESKNAFG 246
Query: 186 LMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---- 235
+M + RSVR +Y AS NH C PN RFD D ++ +
Sbjct: 247 VMAAAAADERDENDFGAARSVRGGALYRDASRVNHGCFPNVARFDNFDGDFQDPSKAFSS 306
Query: 236 ------------IIVRMIHDVPQGREICL------------------------------- 252
+ + I +P G E+ +
Sbjct: 307 PSSLSALCEPSTLRLLAIDVIPAGAEVLMRTFSPGVSLRPGSLAFNPRPRRLSTPKSDAF 366
Query: 253 -------------SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEE 299
SY PVN R+ RL + FACDC RC +E W+ D + +EE
Sbjct: 367 QLHPDVASYGTTLSYLPVNEPVGARRARLRRGFGFACDCARCALEVRWALEDGERTGDEE 426
Query: 300 VMDEDQD-------------------------EQMVASDDDAEVHGDTNFPHAYFFMRYM 334
+ D E+ D A V +A +FM+ M
Sbjct: 427 PTETYDDATTAADASEAAEEARDEATAAALTEEEQTRRADRAPVE------YALWFMKNM 480
Query: 335 CDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
C ++C GTLAP P+ +T CN CG ++ D
Sbjct: 481 CAEEDCGGTLAPPNPT----ATHCACNYCGRVRTD 511
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 109/291 (37%), Gaps = 48/291 (16%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M+ V E E +GRGL + + LK+G +L + P++ L + C C K
Sbjct: 1 MAPKPSVFEVFETESKGRGLRAAKPLKSGDTILSEQPVVY--MLSNMLRGQRCDFCLEKL 58
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
S C C A +C C AA S C+ L R+ P L + R Q
Sbjct: 59 --SDLQRCSRCKFARYCGASCQRAAWRIHKSE--CERLKRVFP-RVPTDLVLLMFRVWQL 113
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---------AHYLSSLCPPPASTTIE 170
+ Y+ V N + D+D YL S PP
Sbjct: 114 KSQNGWYDSLVSNVEKI-----------DSDAKEDFVSVLMVLNEYLGSEISPPEGLE-- 160
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
L +K N+F + DG GI+ A NH C PN+ A
Sbjct: 161 ----LFSKISCNSFAIC-------DGEMQAIGTGIFPNAVCLNHSCAPNS-------VAV 202
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N T+I ++ + ++P G E+ +SY + TRQ+ L + F C C RC
Sbjct: 203 FNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQCHRC 253
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 112/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLLKSWGPNEPDVANSVIIRLL---CVARALNLS---KEQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKIIEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ + +Y NHDC+PNA + + D ++IVR D+P+G E+ +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNA-YYTFED----KTNNMIVRAAVDIPEGFEVTTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/317 (22%), Positives = 125/317 (39%), Gaps = 55/317 (17%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRD--SPILLYSALPFINSSSFCHNCFRKT 60
++ E + +A+ + RGR +V+T + G+++L + + + Y L S +C C +
Sbjct: 2 SLHEGLKIAQSDDRGRHVVATTDIARGELLLSEECTVAMTYRDL----ESPYCSQCVKPM 57
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPY-VCQALTRLLN------LNSPDAANLP 112
+ + C C + +CS C+ AA S H+ C ++ + D +
Sbjct: 58 VDETPVECEGCFLERYCSKSCAEAAEGSHHARTGECATFATAVDAEAATLIEGTDHSEFE 117
Query: 113 LDRQLQARFLIAAYNLA---------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP 163
D Q RF+I A + + + LL Q + D+ +LS +
Sbjct: 118 FDDAPQ-RFIIRVLCQAGGWRERGGGITGEATLRRLLKLQSHEPEVDSDEREWLSGI--- 173
Query: 164 PASTTIELT-------AALLAKDRLNAFGLME-------------PYIEGQDGRRSVRAY 203
A T+ L + + +G+ E ++
Sbjct: 174 -ARNTLRLMEQDVDEGVKVFGEKDAPGYGVEELVRLMCCVRCNSHTLYASEEWPSEPVGT 232
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+YLK S FNH CLP+A ++ T + VR + D+ G E+ +SY PV
Sbjct: 233 AVYLKGSAFNHSCLPSAEFYN-------EGTSLRVRSVRDISAGEEVTISYVPVTETLWD 285
Query: 264 RQKRLLYDYDFACDCDR 280
R++ L Y F C+CDR
Sbjct: 286 RRQALWRQYKFDCECDR 302
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 110/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ + +Y NHDC+PNA E ++IVR D+P+G E+ +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNA-----YYTFEEKTNNMIVRAAVDIPEGFEVTTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 111/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ + +Y NHDC+PNA Y NN +IVR D+P+G E+ +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 111/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVVIRLL---CVARAINLS---KEQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ + +Y NHDC+PNA Y NN +IVR D+P+G E+ +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 110/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLMGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLT---KEQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ + +Y NHDC+PNA Y NN +IVR D+P+G E+ +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|340513811|gb|EGR44092.1| predicted protein [Trichoderma reesei QM6a]
Length = 549
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 122/311 (39%), Gaps = 55/311 (17%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTM 61
A S ++ + I G GL + + ++ G+++ P++L + + + + FC R+
Sbjct: 49 AGSLSIKQSPILNAGSGLYTNRDVEVGELIFTSVPLVLCAEVGDGMEACDFCFQQRRRVF 108
Query: 62 HSSSS-------------ICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
H S S +C C + +CS C A + H Y C L+
Sbjct: 109 HPSESRFLGPGEMMPPLHVCNGCHMYQYCSKSCWQRAWDTGHL-YECGLLS--------- 158
Query: 108 AANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
A L+ + R LI V+ P Q Q L + + + + A +
Sbjct: 159 GATTDLETRTLYRLLILMRK-KVLLPEQVQGLSRLESEMANFERRAKKTWPKV------- 210
Query: 168 TIELTAALLAKDRLNA-FGLMEPYI-----------EGQDGRRSVRAYGIYLKASFFNHD 215
L AL AK+R + G+ E I Q R + + L + NHD
Sbjct: 211 ---LELALEAKERTKSELGIYEILILYGIIRCNALPVDQTYRNAPLGIALDLGGALLNHD 267
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
C PN A NNT + VR + + G E+ Y + YD++ R R+ Y F
Sbjct: 268 CDPNV-------AIVFNNTQVQVRALRKLKAGEELLHCYRDIAYDFTFRNPRITSRYQFR 320
Query: 276 CDCDRCKVEAN 286
C C+RC+ E++
Sbjct: 321 CHCERCRQESD 331
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 112/270 (41%), Gaps = 28/270 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+V+T+SLK G+I+ RDSP+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H L + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140
Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
++ Q + +N + AA + P IE+ +A R N F +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLIEIMNRTVAVLRTNGF---DKTT 195
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ + + +Y NHDC+PNA E ++IVR D+P+G E+
Sbjct: 196 DRTNDNQEFNYRALYPLFGVVNHDCIPNA-----YYTFEEKTNNMIVRAAVDIPEGFEVT 250
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y + R L F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKSFTCKCSRC 280
>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
Length = 311
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 133/322 (41%), Gaps = 62/322 (19%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSS 64
+ VT +EG+G L +T+ L G +LR +P+ A+P+ + C CF+
Sbjct: 11 DAVTTDRVEGKGEALRATRDLPKGARLLRVAPLF---AVPYAAELTRLCGGCFQP----R 63
Query: 65 SSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL------LNLNSPDAANLPLDRQLQ 118
++C SC A +C AA + H C AL RL L L D L ++
Sbjct: 64 GAVCASCGSARLCSRCGAGAAGTLHG-LECHALARLRDGEEGLTLAHSDLRLLLRALAVR 122
Query: 119 A--RFLIAAYNLAVVNPSQFQILL-----AFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
+ R + A + A + ++ ++ A +G ++ D L P A TI
Sbjct: 123 SMRRAVDAGGDPAAIAAAEDGDVIVDGYDALEGLMSGVDGGDDGEL----PHDAVATIAE 178
Query: 172 TAA-------------------LLAKDRLNAFGLM--EPYIEGQD-----GRRSVRAYGI 205
A L + +LN F + EP EG D G R V G+
Sbjct: 179 VAKQARFLLAASCRCSMDECVRTLGRLQLNGFEMTASEPE-EGADEAEGGGHRPV-GVGV 236
Query: 206 YLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+ AS+ NH C PN A RFD + I+V DV G E+ + Y V R
Sbjct: 237 FPSASYTNHSCAPNCAQRFD-------GHGCIVVETARDVRGGEELTIPYVDVRLGRRER 289
Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
++RL ++ F C C+RC EA+
Sbjct: 290 RERLRKNFAFDCACERCAAEAD 311
>gi|440801098|gb|ELR22123.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 397
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 118/321 (36%), Gaps = 55/321 (17%)
Query: 12 EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPS 70
+ E GRG+ + + KAG ++L +SP+ + + C C RK S++++ CP
Sbjct: 71 QAETMGRGMFALRDFKAGDVILEESPLAIAPRDCVVAQMLHCSQCLRKLAASTAAVACPH 130
Query: 71 CSV-AFCSPKCSTAAASSSHSPYVCQA-----------------------------LTRL 100
C + +CS +C+ A A SH +C+A + RL
Sbjct: 131 CRLDPYCSDRCA-AEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARL 189
Query: 101 LNLNSPDAANLPLDRQLQARF--------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
+ S D A L D L+ + + A + P F + +
Sbjct: 190 IAQLSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPEFSMLDYIFGQTNL 249
Query: 153 AAHYLSS-------LCPPPASTTIELTAALLAKDRLNAFGLM---EPYIEGQDGRRSVRA 202
A+H LS+ L P L N FGL E EG R
Sbjct: 250 ASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYTDRREDG 309
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
++L AS+FNH C PN D + + I D+ +G ++ + Y
Sbjct: 310 VALFLHASYFNHSCTPNV---DRCNRHGDKRVAFIA--CADIKKGEQLFIEYVDTRAMVD 364
Query: 263 TRQKRLLYDYDFACDCDRCKV 283
R++ L Y F C C +C V
Sbjct: 365 ERRQELAQRYGFLCSCPKCSV 385
>gi|440793165|gb|ELR14357.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 387
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 117/321 (36%), Gaps = 55/321 (17%)
Query: 12 EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPS 70
+ E GRG+ + + KAG ++L +SP+ + C C RK S++++ CP
Sbjct: 62 QAETMGRGMFALRDFKAGDVILEESPLAFAPRDCVVAQMLHCSQCLRKLAASTAAVTCPH 121
Query: 71 CSV-AFCSPKCSTAAASSSHSPYVCQA-----------------------------LTRL 100
C + +CS +C+ A A SH +C+A + RL
Sbjct: 122 CRLDPYCSDRCA-AEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARL 180
Query: 101 LNLNSPDAANLPLDRQLQARF--------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
+ S D A L D L+ + + A + P F + +
Sbjct: 181 IAQLSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPEFSMLDYIFGQTNL 240
Query: 153 AAHYLSS-------LCPPPASTTIELTAALLAKDRLNAFGLM---EPYIEGQDGRRSVRA 202
A+H LS+ L P L N FGL E EG R
Sbjct: 241 ASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYTDRREDG 300
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
++L AS+FNH C PN D + + I D+ +G ++ + Y
Sbjct: 301 VALFLHASYFNHSCTPNV---DRCNRHGDKRVAFIA--CADIKKGEQLFIEYVDTRAPVD 355
Query: 263 TRQKRLLYDYDFACDCDRCKV 283
R++ L Y F C C +C V
Sbjct: 356 ERRQELAQRYGFLCSCPKCSV 376
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 106/267 (39%), Gaps = 22/267 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GRG+V+T+ LK G+I+ RDSP+L+ A +S + C C + C +
Sbjct: 34 GRGVVATRHLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCMILLPDTRFMCRQGCGL--- 90
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P CS A H L R N P+ AN + R L + A NL N Q
Sbjct: 91 -PVCSLCAKKKQHKTDC--DLFRSWGPNEPEVANSVIIRLL---CVARAINL---NKDQR 141
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
++ Q + +N + + P IE+ +A R N F + +
Sbjct: 142 DLIYCLQANLDNNHRTEVRNAAKCFKKFPTDKKVIEIMNRTVAVIRTNGF---DKTTDRT 198
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
+ + +Y NHDC+PN+ E ++IVR D+P+G EI +Y
Sbjct: 199 NDNQEFNYRALYPLFGVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDIPEGFEITTTY 253
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ R L +F C C RC
Sbjct: 254 TKLFTGNIARHLFLKMKKNFTCKCPRC 280
>gi|224088154|ref|XP_002308346.1| SET domain protein [Populus trichocarpa]
gi|222854322|gb|EEE91869.1| SET domain protein [Populus trichocarpa]
Length = 283
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 125/291 (42%), Gaps = 32/291 (10%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ VA E GRG+ ST+ + AG ++ PIL + +L IN+ C+ C +K +S+
Sbjct: 4 IRVALTESAGRGVFSTRKICAGDLIHTAKPILAHPSLSTINT--VCYFCLKKL---TSTE 58
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIA 124
VAFCS +C A Y + L + PL + A +I+
Sbjct: 59 FHGKGVAFCSQECKENAKVF----YEVETKADWLAFDDYCRNKGLKYPLLVKRLACMVIS 114
Query: 125 ----AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAALLAKD 179
A +L ++ PS + + + ++++ A T E ++LA+
Sbjct: 115 GAASAESLDILQPSNLSHEMVLEMEEGYGLLKSGFAMANISDEQMAFLTKEWYNSVLARI 174
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAA 230
R+NAF + +D S A +Y+ S +NHDC PNA ++D
Sbjct: 175 RINAFRIELAVGSYEDLLSSAAACIEAEAAVGNAVYMLPSLYNHDCDPNA-HIVWID--- 230
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N D ++ + DV + E+ + Y + D+ RQ LL + F C+C RC
Sbjct: 231 --NADAQLKALCDVEEDEELRICYIDASMDHDARQSFLLQGFGFKCNCLRC 279
>gi|317038856|ref|XP_001402319.2| set and mynd domain containing protein [Aspergillus niger CBS
513.88]
gi|350631791|gb|EHA20161.1| hypothetical protein ASPNIDRAFT_195107 [Aspergillus niger ATCC
1015]
Length = 357
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 126/305 (41%), Gaps = 44/305 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSAL--PFINSSSFCHN--CFRKTMHSSSSI-CPS 70
+GR L ++Q ++ G++++ D P L + P + S C N C R+T HSS I CP+
Sbjct: 45 KGRQLRASQPIRKGELLMIDVPYALIPVVDDPASSDSLLCSNPTCSRQTQHSSGRISCPN 104
Query: 71 ---CSVAFCSPKCSTAAASSSHSPYVCQALTRLLN-----LNSPDAANLPL------DRQ 116
V +CS C A + C L R + D + L RQ
Sbjct: 105 RCLADVVWCSSTCQEA--DQLRHEFECTWLQRYASPIRAKWGEYDFGMMWLIVRILASRQ 162
Query: 117 LQARFLIAAYNLAVVN-PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPAST 167
+ + L+ A N ++ + +F G++ + YL S P
Sbjct: 163 TERQQLLDANNETTQRFKGGWEAIQSFCGSIESWSHAQVRSWTALVKKYLRSSPILPHDL 222
Query: 168 TIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
+++ AL+ ++ N+FGL + P E R +Y +A+ NH C PN
Sbjct: 223 SVDEIVALICREEANSFGLYPRETGVYPVPEPPVDRGEQFGAAVYPRAAIANHSCSPN-- 280
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDFACDCD 279
+ +++ ++ D+ G E C+SYF ++ D +R+ L + F C CD
Sbjct: 281 ----IMHKPDHHGRMVFTASKDIAAGEECCISYFDLSKRVDLKSRRDHLQGLFRFVCGCD 336
Query: 280 RCKVE 284
RC E
Sbjct: 337 RCTAE 341
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 115/303 (37%), Gaps = 49/303 (16%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+G GL +T ++ G++V P L + S + CH CF T H + C C +A
Sbjct: 12 GKGNGLRATVRIEEGELVYVTEP--LAYCVSQKQSRNVCHQCF--TRHETLLRCSQCKMA 67
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVVN 132
+CS C A S C+ L LL +P D +L AR + A + +
Sbjct: 68 RYCSATCQRRAWSDHKRE--CKCLQSLL-------PRIPTDSVRLAARLIFAMLSSCSSS 118
Query: 133 PSQFQILLAFQGTVTDNDTSAAHYLSSLCP----------------PPASTTIELTAALL 176
+ L + +T LS L P+ ++ +L+
Sbjct: 119 SEELYTLEEHESHLTSLSEQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSCRDALSLI 178
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
AK N F + DG G+Y S NHDC PN C + T +
Sbjct: 179 AKVTCNCFTI-------SDGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------EGTKL 224
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
++R + + G E+ +SY RQ+RL Y F C C C N DND +
Sbjct: 225 LLRAVRGLSPGEELTISYIETLSLNEDRQQRLEDQYCFTCHCQCC----NSPDNDKLMLS 280
Query: 297 NEE 299
EE
Sbjct: 281 GEE 283
>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 471
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 120/285 (42%), Gaps = 39/285 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
+GRGL +T+ + G++V + YSA+ F + +S CH+CFR + C C A
Sbjct: 17 KGRGLRATKEISTGEVVFAEPS---YSAVVFDSFASQVCHSCFRHQ--AQLHRCGQCKFA 71
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAY-NLAVV 131
+C+ C TA CQA+ +L +P D+ +L AR + + + V
Sbjct: 72 HYCNRTCQTACWEEHKQE--CQAIRKL--------GTVPGDKVRLAARVMWRIHKDTGVA 121
Query: 132 NPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
+ SQ + + V D DT +L +++ A + + N
Sbjct: 122 SDSQLLSVEELEDHVADLPEDHLKRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNG 181
Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNTDII 237
F L + Q G ++V G++ NHDC PN ++A + I
Sbjct: 182 FTLSD-----QRGLKAV-GVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIE 235
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+R + + +G E+ +SY + RQ++L + F C C+ C+
Sbjct: 236 LRALRKICEGEELTVSYVDFLDTSAERQRKLKEHFYFECTCEHCR 280
>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
Length = 431
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 12/112 (10%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA----AA 230
LLA LN+F L G + YG YL+ +F NH C PNAC+ Y+D A
Sbjct: 151 LLAVIHLNSFALSSERFPG------MTTYGFYLRMAFCNHSCRPNACQ--YIDPNSTRAR 202
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
N+ I++R + D+ +G E+C+SY + R++ L Y F C C RC+
Sbjct: 203 LNSPSIVLRAVSDIAEGEEVCISYIELMDTTPERREALQELYYFTCQCPRCE 254
>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
206040]
Length = 742
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 60/113 (53%), Gaps = 16/113 (14%)
Query: 181 LNAFGLMEPYIEGQDGRR-SVR------AYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
LN FG P ++ +D ++ R + GI+L+AS+ NH CLPNA R D
Sbjct: 474 LNGFGC--PRVKSRDKKQPKARNLGFNDSTGIWLQASYANHSCLPNATRAFIGDM----- 526
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+IVR + ++P G EI + Y P + + RQ+ + Y F CDCD C+ EA
Sbjct: 527 --MIVRALREIPVGGEILMQYVPQDKPFDKRQEVVENHYGFKCDCDLCRAEAK 577
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 107/276 (38%), Gaps = 28/276 (10%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYS-ALPFINSSSFCHNCFRKTMHSSSSICPSCS 72
E GRGL + ++L G VL P Y+ L +C CF+K C C
Sbjct: 11 EKSGRGLKAKRNLNPGSTVLSSEP---YAYLLSKKQKGVYCDFCFKK--QDGLLQCSGCK 65
Query: 73 -VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
+ +C+ C A + H C AL ++ PD L L R L ++ L
Sbjct: 66 YMKYCNRNCQKMAWNEHHKAE-CPALKNVMPKRPPDFVIL-LGRLLWNMQQYSSAKLPEK 123
Query: 132 N-----PSQFQILLAFQGTVTDNDTSAAHYLSSLCP-PPASTTIELTAALLAKDRLNAFG 185
N S + L Q N H S P PP T ++ L A+ + N+F
Sbjct: 124 NSILDLESNYNKLSQNQKEALMNFLVILHTFWSPKPLPPQVTDNKMLLELCARIKNNSFA 183
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ ++ G G+YL SF NH C PN A N + +R + ++
Sbjct: 184 ICNEELQSDVGT------GVYLNCSFINHSCEPNC-------VAEFNMRTLKIRAVKNIT 230
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
G E+ +SY + RQ+ L+ Y F C C C
Sbjct: 231 AGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSC 266
>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
Length = 456
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 118/292 (40%), Gaps = 25/292 (8%)
Query: 10 VAEIEGRGRGLVST-QSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
V + + R R +V+ Q++K GQ++LR S L S LP + C C R+T + C
Sbjct: 9 VPDSQARSRAVVAPGQTVKPGQVILRSSS-LATSLLP-KSRGQRCDECCRQT---TVKAC 63
Query: 69 PSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
C AF C +C +AA S H C L+ P + P +Q LI Y
Sbjct: 64 SRCKEAFYCDTRCQSAAWKSHHR-TTCALLSNGYRAKHPYISQ-PEGKQADLDLLITLYG 121
Query: 128 L-AVVNPSQFQ---ILLAFQGTVTDND-----TSAAHYLSSLCPPPASTTIELTAALLAK 178
L A PS F+ + + T+ + ++ + + PP TT T A K
Sbjct: 122 LVATSQPSMFEQSNWAVRREPTLLPEEPLECFSTLLPHSGQVGPPYIPTTFGKTQAEFLK 181
Query: 179 DRLNAF---GLMEPYIEGQDGRRSVRAYGIYLKASF-FNHDCLPNACRFDYVDAAAENNT 234
+ F + I A+GI+ AS FNH C PNA ++ +
Sbjct: 182 EAWARFENNNFVLHNISSMVPNSGAYAHGIFPHASRGFNHSCSPNAWPAFVLE---QRQA 238
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ +R + + + EI + Y RQ RL Y F C C RC +E
Sbjct: 239 WLEIRALISIKESEEITIPYLDPALSLPERQARLKATYGFDCTCSRCDLEKK 290
>gi|302672815|ref|XP_003026095.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
gi|300099775|gb|EFI91192.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
Length = 528
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
P +T L A+LA+D NAFGL + + G + + +Y+ S+FNHDC PN +
Sbjct: 398 PYLSTPHLLRAILARDHANAFGLFD---MQETGESEMLGWAVYVAGSYFNHDCAPNVRKL 454
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ DV G E+C+SY ++R + ++F C C RC+
Sbjct: 455 -------RRGRALQFVTTRDVAPGEELCISYVDTQDTKASRAAQFAQHWNFVCGCGRCRG 507
Query: 284 E 284
E
Sbjct: 508 E 508
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 114/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267
>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
Length = 492
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 120/289 (41%), Gaps = 42/289 (14%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
VA+ E GR +V+++++K GQ++L+++P+LL P + C NC + CP
Sbjct: 10 VAKNEIFGRFIVASKNIKQGQLILKENPVLL---CPQVGGPIICFNCCAQL--KKLFFCP 64
Query: 70 SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL-----NLNSPDAANLPLDRQLQARFLI 123
C +A C+P C + H+ C AL L L++P+ PL L +F
Sbjct: 65 DCRIAVLCNPNCK----NQFHNLQECTALKSLPIKHDDVLSNPEIVT-PLRCLLWRQF-- 117
Query: 124 AAYNLAVVNPSQFQILLAFQGTVT---------DNDTSAAHYLSSL-CPPPASTTIELTA 173
N +Q + L + + N + L L + E+
Sbjct: 118 --------NKNQLENFLQLEAHLNFRRDSQIWRRNRVNIEDVLFKLKILDESDLKEEVVQ 169
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
+ +N F + +P ++ G+YL+A+ HDC+ N A +++
Sbjct: 170 KICGILDINTFDVRQPQRNRLGFNQAENLRGLYLRAALMAHDCVANT------HLAVDDD 223
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ V D+P+G I +Y V R++RLL F C C RC+
Sbjct: 224 FVLYVHAAVDIPEGSPIYFNYTDVLQGNDERKRRLLNAKHFECQCSRCR 272
>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
Length = 583
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 121/315 (38%), Gaps = 39/315 (12%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSAL-PFINSSSFCHNCFRKTMHS 63
S ++ + I G G+ +T+ ++ G+++ P++ + + P + + FC R+ H
Sbjct: 85 SLSIQESTIPNAGSGMFTTRDVEEGELIFTSVPLVTCAEVGPGMEACDFCFQQRRRVFHP 144
Query: 64 SS-------------SICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA 109
IC C + +CS CS A + H Y C LL S D
Sbjct: 145 VEDRFLQPGEVLPPLHICNGCRLYQYCSQSCSQRAWDTGHL-YECG----LLAGASADVE 199
Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
L R L+ V+ P Q + L + V + + +
Sbjct: 200 TRTLYR------LLILMRKKVLLPQQVKALARLENEVANFEKRTKKSWPRVLNLAREAKE 253
Query: 170 ELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
+ L + L +G++ Q R + + + + NH C PN
Sbjct: 254 RTKSELSIGEVLMLYGIVRCNSLPVDQTFRNAPLGIALDMGGALLNHCCDPNV------- 306
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
N+T + VR + + G E+ Y + YD++ R R+ Y F C CDRCK+E
Sbjct: 307 VIVFNSTQVQVRALRKIKDGEELLHCYRDIAYDFTFRNPRITARYQFNCQCDRCKME--- 363
Query: 288 SDND-NDDENNEEVM 301
SDN ++NE V+
Sbjct: 364 SDNHFKQAKSNENVI 378
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 120/290 (41%), Gaps = 43/290 (14%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ +T+TV GR L++ + LKAG ++L++ PI++ P + C C+
Sbjct: 46 IEQTLTV------GRHLIACRDLKAGDVILQEKPIVMG---PKHTAGQICLGCYSGV--D 94
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
+ C C C + A S H+ C + + P +LP+ + Q+ ++
Sbjct: 95 GRTRCSQCGWPMCG-RDDCHAHESDHAAE-CGVMA---SGGRPIVGSLPV-QAYQSVMVL 148
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTD---------NDTSAAHYLSSLC--PPPASTTIELT 172
L NP ++ L+ + V + + +A ++ P ++L
Sbjct: 149 RCLALRDQNPERWDELMQLEAHVQERRQKGMEDVDQATAVRFIRETLGLQIPEELILQLC 208
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
L+ +N+F +P ++G V +Y AS HDC+ NA + N
Sbjct: 209 GILM----VNSFE--QPPMKGNSQHGLV---AVYSTASLLEHDCVANAIK------TFTN 253
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
DI++R +P+G +I L Y + RQ+ L F C C+RCK
Sbjct: 254 KGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLSQTKFFQCVCERCK 303
>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
Length = 410
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/323 (22%), Positives = 130/323 (40%), Gaps = 43/323 (13%)
Query: 1 MSAVSETVTVAEIEG-RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNC 56
AVS+ V ++ G+GL +++ +K +I+ + P + + S C C
Sbjct: 88 FKAVSDCVVARMVDPVTGKGLFASRDIKQDEILFTEEPYVYFPPWEGFSLARSGHVCGLC 147
Query: 57 FRKTMHSS--SSICPSCSVAFCSPKCSTAAASSSHSPYVCQ---ALTRLLNLNSPDAANL 111
+ ++++ + C C + +CS C A H + A+T +N +
Sbjct: 148 CKPILYANRLTQHCKHCDMYYCSKSCRATAWDQFHQLECTRLNPAITPFMNFCEGERWQA 207
Query: 112 PLD-RQLQARFLIAAYN------------LAVVNPSQFQ-----ILLAFQGTVTDNDTSA 153
P+ ++ AR ++A A V+ + Q + Q T + +
Sbjct: 208 PMAVARIYARLILAHQREELNDVIGRLDAFATVSQEERQAKETEWIFMEQPTRELWNKAR 267
Query: 154 AHYLSSLCPPPASTTI--ELTAAL---LAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYL 207
+ PPA I +L AL L +D M + I Q+G G+YL
Sbjct: 268 DLLRKAYKQPPAKCKITQDLPEALKEKLFEDEETFLNFMGKFNINNQNG-------GMYL 320
Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
S NH+C PN DY + ++ I VR + D+ QG ++ +Y ++ TRQ
Sbjct: 321 VHSHINHNCYPNVS-IDYPNPKSQYK--IAVRAVRDIKQGEQLFETYVNPRWNKETRQTY 377
Query: 268 LLYDYDFACDCDRCKVEANWSDN 290
L Y F C C+RC+ + +D
Sbjct: 378 LDKSYLFNCKCERCEKDLELTDE 400
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 108/285 (37%), Gaps = 43/285 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+G GL + ++ G++V+ P L + + CH+CF T H S C C +A
Sbjct: 13 GKGNGLRAAGRIRKGELVISAEP--LACCVSNRLAKEVCHHCF--TRHESLLRCSQCKMA 68
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVVN 132
+C+ C A S C+ L LL LP D +L A+ + A + + +
Sbjct: 69 RYCNTTCQKQAWSGHKRE--CKCLQSLL-------PRLPTDSVRLAAKLIFALLSPSKSS 119
Query: 133 PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG------L 186
+ L + +T LS L ++ +EL A D L G
Sbjct: 120 SEELYTLEEHESHLTSMPEQKKQGLSQL-----ASMLELYLQQEAPDLLQEMGSALPPSC 174
Query: 187 MEPY----------IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
EP DG G+Y S NHDC PN C + T +
Sbjct: 175 REPLSLIAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------EGTKL 227
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+R + D+ E+ +SY RQ++L Y F C C RC
Sbjct: 228 ELRAVRDIDPEDELTISYIETLSLTEDRQRQLEEQYHFTCHCQRC 272
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 123/318 (38%), Gaps = 65/318 (20%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--SSICPSC 71
+G GR +++++ L G +VLR SP S L C CF+K S + C +C
Sbjct: 219 DGFGRCVLASEDLPKGTMVLRVSPFA--SVLEDHKIEKNCGFCFKKINKSIRINQTCKNC 276
Query: 72 SVAFCSPKCSTAAASSSHSPYVCQAL-----------TR--------LLNLNSPDAANLP 112
P+CS S ++ C L TR LLN+ D N
Sbjct: 277 KNHLLCPQCSVDEYSLNYHKDECDILNFLKQYYPSSQTRDFRFMFRVLLNV-IKDKKNKS 335
Query: 113 LDRQLQAR--------FLIAAYNLAV--------VNPSQFQILLAFQGTVTD-----NDT 151
++ Q++ F+ +Y + V P Q + AF+ + N
Sbjct: 336 FSKENQSKQWLNHQNPFIFDSYKYLINLSRTLDKVQPEQME---AFKRSAQSVIAIFNKL 392
Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
+ C IE+ + +L+ N ++ P GI+ S+
Sbjct: 393 RGPKFFDE-CGVTIDEIIEIYSIVLS----NGHEMLHPL------NCHTYGLGIFPTGSY 441
Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYD 271
NH CLPNA F Y D ++ R + + +G EI SY + + S R+K LL
Sbjct: 442 LNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQ 495
Query: 272 YDFACDCDRCKVEANWSD 289
Y F C C +CK ++ +D
Sbjct: 496 YFFFCQCQQCKFQSKLTD 513
>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 420
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 127/316 (40%), Gaps = 47/316 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
MS + E ++E G+G+ S + + +I+ ++P++ ++ + N + C C +
Sbjct: 114 MSTIFEVKEISET--LGKGVFSKKEHQVNEIIYTETPVI--ASAKYKNYKNCCLYCLKAL 169
Query: 61 MHSSSSICPS-------------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
+ S+ S C +CS KC VC L ++++ +
Sbjct: 170 FPNESNTYTSVFNTNVVKGVKCKCGYVYCSKKCQDLDVGHQ---LVCGKLNKVVSFCTER 226
Query: 108 AANLPLD-RQLQARFLIAAYNLAVVNPS---QFQILLAFQGTV-------------TDND 150
+ PL ++ A+ L A Y + P Q L+ F TV T D
Sbjct: 227 CVSHPLTITKMFAKVLSATYREPTIYPFVVFHAQPLVKFDMTVEKIFVEIFGNKLETFQD 286
Query: 151 TSAAHY----LSSLCPPPASTTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGI 205
Y +SS+ AST + L A +LA D E + + + SV G+
Sbjct: 287 FGGWKYVYRIMSSVLKYNASTILPLNAIQMLATDPQKKVLSKEEALNWEVSKFSVEGEGL 346
Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQ 265
Y + NH C PN V A ++ + + + D+ G E+ +SY + + TR
Sbjct: 347 YKYLNTLNHSCDPNC-----VLACTTDDFKLSLIALKDIKAGEELTISYIDNSMNKETRL 401
Query: 266 KRLLYDYDFACDCDRC 281
K L+ Y+F C C +C
Sbjct: 402 KTLMDQYNFDCKCKKC 417
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 122/307 (39%), Gaps = 38/307 (12%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
RGR + + +L AG + LR +P L LP + S CH CF + S C C+ AF
Sbjct: 17 RGRRVETETALAAGDLALR-APALATVLLPAL-WDSHCHKCF--ASGTRLSRCGRCNTAF 72
Query: 76 -CSPKCSTAAASSSHSPYVCQALTRLLNL---NSPDAANLPLDRQLQARFLIAAYNLAVV 131
CS C A H C+ L +L L N A L L R L+ + +
Sbjct: 73 YCSKACQQADWKPDHRKE-CKVLAQLAQLGLRNDQTADVLLLGRVLRRE------DAEGL 125
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
P + L+ ++ + D + L+ L S +++ +L++ R N F + +
Sbjct: 126 QPKE---LVWYEEDMEDQELMLLAALAQKLELVDGSYSMDEMLRMLSRFRNNNFSICDEL 182
Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
+ Q G + + NH C PN C +V E + R + + G EI
Sbjct: 183 LLEQGA-------GCFPLGAMINHSCDPN-CAITFVPKTLE----MEFRAMRPIKAGEEI 230
Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-------DNDNDDENNEEVMDE 303
+Y V R +RL Y F C C RC V S D D D E+ E
Sbjct: 231 TQTYVDVALPRRERHERLQRKYHFNCACSRCSVPLQESGSLDAFLDADIDGVPKEQWSQE 290
Query: 304 DQDEQMV 310
+DEQ +
Sbjct: 291 RKDEQCI 297
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 110/269 (40%), Gaps = 26/269 (9%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GRGL++T+S+K G+I+ +D P+L+ + +S + C C R+ + C +
Sbjct: 34 GRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRELPDTRF----MCRIGCG 89
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P CS + H L + N P+ AN + R L + A NL N Q
Sbjct: 90 LPVCSLCSKKKQHKTDC--DLLKSWGPNEPEVANSVIIRLL---CVARALNL---NKDQR 141
Query: 137 QILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
++ Q + +N +AA + P E+ +A R N F + +
Sbjct: 142 DLIYCLQANLDNNHRVEVRNAAKCFKNF--PTDKKIFEIMNRTVAVLRTNGF---DKTTD 196
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
+ + +Y + NHDC+PN+ Y + N +IVR D+P+G EI
Sbjct: 197 RTNDNQEFNYRALYPLFAVMNHDCIPNS----YYTFEEKTNI-MIVRAAVDIPEGFEITT 251
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y + R L F C C RC
Sbjct: 252 TYTKLFTGNIARHLFLKIKRGFTCKCSRC 280
>gi|157125398|ref|XP_001654321.1| hypothetical protein AaeL_AAEL001920 [Aedes aegypti]
gi|108882685|gb|EAT46910.1| AAEL001920-PA [Aedes aegypti]
Length = 527
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 112/280 (40%), Gaps = 36/280 (12%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
GR LV+ ++LKAG+ +++ P+ + P+ S C C+++ S I C C
Sbjct: 56 GRYLVAARNLKAGEKIIQVEPLAMG---PWAASDPVCLGCYKRFAKGSKIIRCSGCGWRI 112
Query: 76 CSPKCSTAAASSSHSPYVCQALTR-----LLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
CS CS A+ +H C L LL +S + L +A F + L
Sbjct: 113 CSHTCSGLTANGAHKRLECNPLKEHNVVNLLETSSTEQIKL----MYEAIFALRCLLLKK 168
Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
+P ++ LL + +SSL + + + R + F E +
Sbjct: 169 TDPERYATLLEMESL-----NELRQKISSLWKRNQEMIV---GRIRDEWRFDEFSEGEIH 220
Query: 191 -----IEG---QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII-VRMI 241
IE Q G+ V A +Y +A + HDC PN D + T ++ +R+
Sbjct: 221 TVCGIIEVNSFQIGQDDVYARALYPEAFYIMHDCTPNTTHTD------DPKTHVLSIRLT 274
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+D+ G I LSY R++ L F C C RC
Sbjct: 275 NDLKAGDPITLSYSYTLQGTLKRRQHLFESKFFWCQCKRC 314
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 114/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CLAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCACEHCQ 267
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 111/291 (38%), Gaps = 45/291 (15%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS------SFCHNC 56
V + V E E +GRG+ + Q +K G +L++ P+ A NS +C++
Sbjct: 2 VVFQVYEVFETEEKGRGVRALQDIKRGVEILKEEPL----ACILTNSKYRGIRCDYCYSE 57
Query: 57 FRKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
K + C C +A+C C A+ Y C+ LT+ PD L
Sbjct: 58 PEKLLK-----CSKCKFIAYCGKVCQ--ASDWKMHKYECKCLTKSAPKQPPDFCRLV--S 108
Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH-YLSSLCPPPASTTIEL--- 171
QL F N + N L A +G +++ A + + L I +
Sbjct: 109 QLIFNFYYNKKNTLINN------LYANKGNISNARKEAFFTFAAVLVEYLQDVNININDI 162
Query: 172 -TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
L+ K N+F + + + GI+ AS FNH C PN A
Sbjct: 163 DIYGLMCKASCNSFAIT-------NAELNSLGTGIFSSASLFNHSCDPNC-------VAT 208
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N DI +R I + +G E+ LSY + RQ L Y F C C C
Sbjct: 209 FNGRDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFTCKCTVC 259
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 122/271 (45%), Gaps = 30/271 (11%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+GR L++++S G +L +P +L +L ++S+ C +CF+ S S C + VA+
Sbjct: 14 KGRCLIASRSFTPGDTILTFTPTILIPSLS--HASTVCSHCFKPGEPRSCSRCHA--VAY 69
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
C C +AA + HS C+ L ++ P LP + + L+ A A + +
Sbjct: 70 CDASCQSAAWKAIHSKE-CKVLQKVSAQGRP---GLPTPVRAVVQALVKAEVGAALEDLE 125
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIEG 193
L +++G+ D +S +T ++ +LL K + NAF + I G
Sbjct: 126 GNDL-SWRGSDKWADMEMMAMGASAFAGLGTTQEAVKKALSLLCKIQTNAFHRYDADI-G 183
Query: 194 QDGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
Q GI+L + + NH C+PNA +V A I+R + EI
Sbjct: 184 Q--------VGIFLEPRLAMANHSCIPNAT-VQFVGRRA------ILRAERPIKADEEIE 228
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+SY NY S R K L Y F C+C RC+
Sbjct: 229 ISYTDYNYPLSKR-KEALAPYFFTCECTRCR 258
>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
Length = 661
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 132/330 (40%), Gaps = 53/330 (16%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSP---ILLYSALPFINSSSFCHNCFRK 59
A SETV +A E RGR + + ++AG+I+L + P +LL S S C NCF K
Sbjct: 240 AASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEF-----SKSHCQNCFNK 294
Query: 60 TMHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
+ CP C +V FCS KC A S H Y C L L ++ L Q
Sbjct: 295 CLIPLP--CPKCPNVIFCSEKCLDIALKSYHG-YECHILPLLWKSGCSVTCHIALRMITQ 351
Query: 119 AR---FLIAAYNLAVVNPSQFQI--------LLAFQGTVTDND----TSAAHYLSSLCPP 163
F+ +L ++ L++ + T D T A +L L
Sbjct: 352 NSKDYFMKIMQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDILHRTEMAIFLLKLLEI 411
Query: 164 P------ASTTIELTAALLAKDRLNAFGLMEPY-IEGQDGRRSVRAYG--------IYLK 208
A++ L L + NA + E ++ +DG R ++ G +Y
Sbjct: 412 SGYFNDDAASFGCLILKNLQVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPT 471
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQK 266
+ FNH C P R+ + I+V + ++ +G E+ +Y P+ RQ
Sbjct: 472 LALFNHSCEPGIVRYF-------CGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQS 524
Query: 267 RLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
+L Y F C C C E NW ++ EN
Sbjct: 525 QLKEQYWFDCKCLPC--EQNWPKYEDMTEN 552
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 111/268 (41%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+++T++LK G+I+ +DSP+L+ A ++ + C CF K + + +C C +
Sbjct: 34 GRGVMATRNLKRGEIIFQDSPLLIGLAAHEEDTLNACSVCF-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H + + N PD AN + R L + A NL+ Q
Sbjct: 91 --PVCSLCAKKKQHKTDC--DMFKAWGSNEPDVANSVIIRLL---CVARAINLS---KDQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P +E+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKIVEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ +Y + NHDC+PN+ Y NN +IVR D+ +G EI +
Sbjct: 198 TTDNQEFNYRALYPLFAVMNHDCIPNSY---YTFEEKTNN--MIVRAAVDIAEGEEITTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKGFTCKCPRC 280
>gi|194747113|ref|XP_001955997.1| GF24983 [Drosophila ananassae]
gi|190623279|gb|EDV38803.1| GF24983 [Drosophila ananassae]
Length = 663
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 131/314 (41%), Gaps = 44/314 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S S+ V + E + +GR +V+++ +K G +VL + P+ F ++ CH+CF++
Sbjct: 252 LSGASKVVKLVETKDKGRFVVASEGVKTGDVVLCEEPVAACLIPSFFGTN--CHHCFKR- 308
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H+ S C CS VAFCS +C A SS H + C+ + L+ + L QA
Sbjct: 309 LHTPVS-CLHCSGVAFCSAQCMGEACSSYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 366
Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
L +A NL + Q ++ F + + P
Sbjct: 367 NSLEDGLATANLLFEHLCSHEEVRQADDYLQRSLMAGFLMRILQKSLYFGRRKTEGVNP- 425
Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
T +EL A LL + NA + + + + DG +V G+Y S+FNH+
Sbjct: 426 --TAVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSNTVYVGAGLYGTGSYFNHE 483
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYD 273
C P+ A +++ + ++Y P + + RQ+ L Y
Sbjct: 484 CWPSV-------AGHFVGKKLVLTATKPHRPNEVVAVNYGPLFIKMNLKERQRTLRGRYS 536
Query: 274 FACDCDRCKVEANW 287
F+C+C C + NW
Sbjct: 537 FSCNCMAC--QENW 548
>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
Length = 397
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 108/267 (40%), Gaps = 27/267 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
GRGLV+TQ +KAG+++ D P L+Y I C C K + S C C++
Sbjct: 39 GRGLVATQDIKAGEVLFVDYP-LVYGPRSGIIVQRGCTVC--KNIDSDIFFKCSKCALIL 95
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL-PLDRQLQARFLIAAYNLAVVNPS 134
CS +C + S C ++ + P+ + +D L +R L A L ++N
Sbjct: 96 CSVQCQNSDFHSGD----CSIIS-----HWPNKVPIEEVDDTLLSRALTAIRAL-LLNED 145
Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
Q +L + Q S L P E ++ NAF + PY
Sbjct: 146 QKYLLTSLQANKLPQYGSEIRDLKQYFDIPLHEE-EFMILVICILNTNAFQMATPY---- 200
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
G++ + G+Y AS NH+C+PN + + V+ + G EI Y
Sbjct: 201 -GKKEMSLRGLYPVASILNHNCVPNT------RNCFNGDLQMTVKATKTINAGSEIFTCY 253
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + R+ L F CDC+RC
Sbjct: 254 SGMLWGTPARRLYLYKSKHFLCDCERC 280
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 69/307 (22%), Positives = 128/307 (41%), Gaps = 37/307 (12%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLK---------AGQIVLRDSPILLYSALPFINSSSFCHN 55
++ V + E + +GRGL TQ+ + I+ ++ P + Y ++ ++ C++
Sbjct: 32 NKLVEIKESDKKGRGLFYTQNTQLPVINKETNKPNILFKEEPFISYPSIIKLDHDKICNH 91
Query: 56 CFR-------KTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQA----LTRLLNLN 104
C + +++ + C C+V +CS +C A+ HS +C++ L +
Sbjct: 92 CLKSLENNNSNNNNNNHTECEGCNVKYCSNECKEKASLQYHS-VLCKSNGSGFHYLEKHS 150
Query: 105 SPDAANLPL-DRQLQARFLIAAYNLAVVNPSQFQI-LLAFQGT---VTDNDTSAAHYLSS 159
+ PL ++ AR L+ + N + F + +L F + D S
Sbjct: 151 QIEKRRFPLLAGKILARVLMGYHLEKSANKTWFPLQMLTFANKPAPLEWKDDYLIFSRSL 210
Query: 160 LCPPPASTTIELTAALLAKDR----LNAFGL-MEPYIEGQDGRRSVRAYGIYLKASFFNH 214
L S+ + K LN G+ ++P + G+Y SF NH
Sbjct: 211 LKGMKESSLKKFNYDWFVKVMQILYLNTIGIDIDPNQTTTKMSTPESSIGLYFLTSFINH 270
Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
C PNA YV ++ +I R++ + G EI +SY + D R+ +L +Y F
Sbjct: 271 SCDPNA----YVQFPNDHTAEI--RLLKPINPGEEITISYADTSKDIIDRRSQLFENYGF 324
Query: 275 ACDCDRC 281
C+C +C
Sbjct: 325 NCECPKC 331
>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
Length = 614
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 122/323 (37%), Gaps = 75/323 (23%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
VA ++ RGR V+T+++KAG++V+ + I + + CH C + C
Sbjct: 68 VATVKNRGRCAVATRAIKAGEVVVAEQAIAFVPRGQY--RKAICHVCGVNVAENGPKFEC 125
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
+C+ +C SH Y A+ R + A++ D A L +NL
Sbjct: 126 SACNYTIFCEECYKPP-QGSHENYC--AVYR-------EIASIARDENCDADLLRLVFNL 175
Query: 129 AVVNPS---------QFQIL---LAFQ---------------------GTVTDNDTSAAH 155
A+ Q ++L L FQ G T + +
Sbjct: 176 ALKKAGTAQARSSLLQLELLIETLPFQNSKDETCTVDGVIHSCFKDALGLETHRSKAPSS 235
Query: 156 YLSSL-----------C------PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
+L ++ C P P S +IE L N+ G+ Q+ +
Sbjct: 236 WLEAVRNGCTRLHKIACVGNGGKPAPVSCSIEELEMLATLINTNSHGM-----GAQNLQN 290
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
+ A G++ S NH C PN C A + + VR + D+P+G E+CLSY +
Sbjct: 291 THVALGLFPYVSILNHSCRPNCC-------FASEGSVMYVRAVQDIPKGAELCLSYINLY 343
Query: 259 YDYSTRQKRLLYDYDFACDCDRC 281
R+ L+ F C CDRC
Sbjct: 344 ESRRVRKTLLVATKHFDCTCDRC 366
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 115/278 (41%), Gaps = 28/278 (10%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
V V++I GRG L++T+S+K G+I+ +D P+L+ + +S + C C R+ +
Sbjct: 27 VGVSKIAGRG--LMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRELPDTRFMC 84
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
C + P CS + H L + N P+ AN + R L + A N
Sbjct: 85 RMGCGL----PVCSLCSKKKQHKTDC--DLLKSWGPNEPEVANSVIIRLL---CVARALN 135
Query: 128 LAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
L N Q ++ Q + +N +AA + P E+ +A R N
Sbjct: 136 L---NKDQRDLIYCLQANLDNNHRVEVRNAAKCFKNF--PTDKKIFEIMNRTVAVLRTNG 190
Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
F + + + + +Y + NHDC+PN+ Y + N +IVR D
Sbjct: 191 F---DKTTDRTNDNQEFNYRALYPLFAVMNHDCIPNS----YYTFEEKTNI-MIVRAAVD 242
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+P+G EI +Y + R L F C C RC
Sbjct: 243 IPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRC 280
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 117/284 (41%), Gaps = 30/284 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ R + P+ N+ L ++ R
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAARIMWRVER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSAA--HYLSSLCPPPASTTIELTAALLAK 178
L +V+ Q + G + A +L P +++ + +
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N F L + Q G ++V GI+ + NHDC PN C + NN I +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGIFPNLALVNHDCWPN-CTVIF------NNGKIEL 223
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 224 RALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQ 267
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 121/300 (40%), Gaps = 48/300 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
+TV+ +GR L+S + G++++ P Y ++P N+S S C CFR S+
Sbjct: 13 LTVSTTPEKGRCLLSIKDFSPGEVIISQEP---YVSVP--NNSAVHSRCEGCFRS---SN 64
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
C +C V +C C + C AL+RL L + L + R+
Sbjct: 65 LKKCSACHVVWYCGSTCQKSDWKLHRLE--CNALSRLEKERQKSLTPSIRLMVKLYMRRK 122
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-------TSAAHYLSSLCPPPASTTI 169
LQ+ + + + ++ A +TD D A+ ++ + P
Sbjct: 123 LQSEKI-----MPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPDINVK 177
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E+ A +K NA + DG G+Y S NH CLPN+ + +
Sbjct: 178 EI-AENFSKLACNAHTIC-------DGELRPLGTGLYPVISIINHSCLPNSV-LVFEERL 228
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
A +VR + +P+G E+ +SY TRQK L Y F C C RC+ + D
Sbjct: 229 A------VVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDD 282
>gi|194869827|ref|XP_001972529.1| GG13835 [Drosophila erecta]
gi|190654312|gb|EDV51555.1| GG13835 [Drosophila erecta]
Length = 665
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 133/312 (42%), Gaps = 40/312 (12%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ S+ V + E + +GR +V+ + L+AG ++L + P+ + L + CH+CF++
Sbjct: 251 LIGASKVVRLVETKDKGRFVVANEGLRAGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 308
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
S+ C CS VAFCS +C A+SS H + C+ + ++ + L QA
Sbjct: 309 --STPVSCLHCSGVAFCSAQCMGEASSSYHR-FECEYMDLMIGSGMSILCFIALRVFTQA 365
Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
L +A NL + P + G + + ++ T
Sbjct: 366 PSLEQGLATANLLFEHLCSHEEERQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 425
Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
+EL A LL + NA + + + + DG ++V A G+Y S+FNH+C P
Sbjct: 426 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 485
Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
+ AC F +++ + ++Y P + + RQ+ L Y F+
Sbjct: 486 STACHF--------VGKKLVLTATKPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 537
Query: 276 CDCDRCKVEANW 287
C+C C + NW
Sbjct: 538 CNCMAC--QENW 547
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 121/300 (40%), Gaps = 48/300 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
+TV+ +GR L+S + G++++ P Y ++P N+S S C CFR S+
Sbjct: 13 LTVSTTPEKGRCLLSIKDFSPGEVIISQEP---YVSVP--NNSAVHSRCEGCFRS---SN 64
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
C +C V +C C + C AL+RL L + L + R+
Sbjct: 65 LKKCSACHVVWYCGSTCQKSDWKLHRLE--CNALSRLEKERQKSLTPSIRLMVKLYMRRK 122
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-------TSAAHYLSSLCPPPASTTI 169
LQ+ + + + ++ A +TD D A+ ++ + P
Sbjct: 123 LQSEKI-----MPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPDINVK 177
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E+ A +K NA + DG G+Y S NH CLPN+ + +
Sbjct: 178 EI-AENFSKLACNAHTIC-------DGELRPLGTGLYPVISIINHSCLPNSV-LVFEERL 228
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
A +VR + +P+G E+ +SY TRQK L Y F C C RC+ + D
Sbjct: 229 A------VVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDD 282
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QER 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 267
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 110/270 (40%), Gaps = 28/270 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GRG+++T+SLK G+I+ +DSP+L+ A ++ + C C ++ + C + C
Sbjct: 34 GRGVMATRSLKRGEIIFKDSPLLIGLAAHEEDALNACTVCLKELPDTKFMCRQGCGLPVC 93
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P C A C + + N P+ AN + R L + A NL Q
Sbjct: 94 -PLC----AKKKQHKTDCD-MFKSWGPNEPEVANSVIIRLL---CIARAINLT---KDQR 141
Query: 137 QILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
++ Q + +N + AA + P IE+ +A R N F + +
Sbjct: 142 DLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLIEIMNRTVAVMRTNGF---DKTTD 196
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ + +Y NHDC+PN+ FD E ++VR D+P+G EI
Sbjct: 197 RTNDNQEFNYRALYPLFGVMNHDCIPNSYYTFD------EKTNHMVVRAAVDIPEGFEIT 250
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y + R L F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKGFTCKCPRC 280
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 111/282 (39%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWASDVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A C A+ R + N N AA + + +
Sbjct: 62 LHHCGQCKFARYCDRTCQKDAWVDHKKE--CLAVKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVENFGEEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 267
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 114/292 (39%), Gaps = 56/292 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCFR-----KTMHSSSSI 67
+GRGL + +++AG VL D P + +N C +C R T S
Sbjct: 14 KGRGLRAVSTIQAGTCVLEDEPYVY-----ILNEEQLQLRCGHCLRALSLGNTSEVESKN 68
Query: 68 CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA- 125
C SC V +C+ C A Y C +LL D L R L L +
Sbjct: 69 CKSCKRVVYCNKACKVAGRKEHR--YEC----KLLQGKEKDDVTL---RLLMKMILKSKN 119
Query: 126 YNLAVVN------PSQFQILLAFQGTVTDNDTSAAHY----------LSSLCPPPASTTI 169
Y L N PS ++ L +FQ + + + + L S P S +
Sbjct: 120 YELKKSNGTLCEFPSSYRDLCSFQLSTKAQEHLLSLFRCYEKGWRSKLRSHLSPFRSQEV 179
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
T L +N+F + + + R++ +Y++AS FNH C PN C F +
Sbjct: 180 LDTYQKLT---INSFSMYD------EMTRTIVGEALYIRASMFNHSCEPN-CTFVF---- 225
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + VR I + G E C+SY R+++L Y F C C RC
Sbjct: 226 --EGSRLSVRAIKRIEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRC 275
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 121/295 (41%), Gaps = 63/295 (21%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
+ E E GR LV+++ +KAG+++L++ P+++ P +N+ C C++ + + C
Sbjct: 43 IKESEAEGRYLVASRLIKAGEVILQELPLVVG---PKLNTLPLCLGCYKSI--TDTYRCS 97
Query: 70 SCSVAFCSPKCSTAA-------------------ASSSHSPYVCQALTRLLNLNSPDAAN 110
C+ CS C +A + + + V Q +T L L PD+
Sbjct: 98 RCNWPLCSAACEESALHKNGECRMIDPTLMTNHLSQGAINSQVFQCITPLRYLTLPDSDR 157
Query: 111 LPLDR---QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
LD L+ R + Y L N S F L ++ +T D S
Sbjct: 158 ERLDELVSHLEQRRGMDIYRLVEQNISSF---LRYRLLLTQYD---------------SE 199
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYV 226
+I+ +L N F E I+ GR SVR G+Y AS NHDC+ N FD
Sbjct: 200 SIQRVCGILET---NCF---EIRIQ---GRVSVR--GLYPTASLMNHDCVANTRHVFDPA 248
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
D I + D+P G +I +Y ++ R+ L F C C RC
Sbjct: 249 DF------RIRILATKDIPAGDKISATYTRSLWNTLDRRLHLKSTKHFWCQCSRC 297
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 113/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P ++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFGMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 112/270 (41%), Gaps = 28/270 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+ +T+ LK G I+LRD PIL+ A ++ + C CF K + + +C C++
Sbjct: 34 GRGVFTTRKLKRGDIILRDIPILIGLAARQEDTLNSCSICF-KVLPDTKFMCRQGCAL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H AL + PD AN + R LI ++ Q
Sbjct: 91 --PVCSVCAKKKQHKDDC--ALFKSWEPIEPDVANSVIIR------LICIARAINLSKEQ 140
Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
++ Q + +N + AA + P IE+ +A R N F +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFNKF--PTDKKLIEIMNRTVAVLRTNGF---DETA 195
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ + + ++ + NHDC+PN+ Y + N +++R D+P+G EI
Sbjct: 196 DRTNDNQEFFYRALFPLFALVNHDCVPNS----YYTFEEKTNY-MVLRASVDLPEGTEIT 250
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y + R L +F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKNFTCKCSRC 280
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 2 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ + + N N AA + + +
Sbjct: 57 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 114
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 115 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 220
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 221 LGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 262
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 115/289 (39%), Gaps = 32/289 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF----RKTM 61
+T+ + +G +++ + + AG +L + P + + L S CH+C R +
Sbjct: 56 KTIMCEKTPAKGTVVIAKKDIPAGTTILAE-PAIAWQPLED-RIMSVCHHCMTEVPRWAV 113
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTR-LLNLNSPDAANLPLDRQLQAR 120
+ +CSPKC A+ + + R + N D + L A
Sbjct: 114 GCGEGAGGCSGLGYCSPKCREASEALHRVEHKVFLQARDIANRTKADITLIKL-----AT 168
Query: 121 FLIAAYNLAVVNPSQFQI-LLAFQG-------TVTDNDTSAAHYLSSLCPPPASTTIELT 172
+IA +L+ + QF+ ++A G D+ T A + L P PA +
Sbjct: 169 RIIALRSLSEGHRVQFERGVMAMMGHEDECPQQWVDSITETAKLVMPLLPKPARLSAREF 228
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+ A+ N+ ++ + + G+Y AS NH C PN C F
Sbjct: 229 VKVCARINTNSHRQHHMFVP-----QRILGVGLYPLASLINHSCQPN-CGF------YNR 276
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +R + DV +G E+C SY + S R+ LL F C C+RC
Sbjct: 277 GPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRC 325
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 29/293 (9%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTM 61
A E V V EG+GRGL +T+ L A +V + YSA+ F + + CH CF++
Sbjct: 4 ASMEKVEVFTSEGKGRGLKTTKELWAADVVFAEPA---YSAVVFDSFTQVVCHTCFKR-- 58
Query: 62 HSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ + P + N+ L ++ R
Sbjct: 59 QEKLQRCGQCKFARYCGRPCQKDAWLNHKNE--CAAIKKH---GKPPSENIRLAARIMWR 113
Query: 121 FLIAAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAAL 175
L +V+ Q + G D +L+ P +++ + +
Sbjct: 114 IEREGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRMDVDSFLNFWPPQSQQFSMQYISHI 173
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAA 229
NAF L + Q G ++V GI+ NHDC PN + V +
Sbjct: 174 FGVINCNAFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSM 227
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 228 FHTQMRIELRALGKISEGEELTVSYIDFLNISQERKKQLKKQYYFDCTCEHCE 280
>gi|162458873|ref|NP_001105199.1| SET domain protein 123 [Zea mays]
gi|27466907|gb|AAO12860.1| SET domain protein 123 [Zea mays]
gi|194705040|gb|ACF86604.1| unknown [Zea mays]
gi|195624178|gb|ACG33919.1| SET domain protein 123 [Zea mays]
gi|414871456|tpg|DAA50013.1| TPA: SET domain protein 123 [Zea mays]
Length = 303
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 116/289 (40%), Gaps = 27/289 (9%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ E GRG+ +T+ + +G+++ P++ + + ++ C+NC R+ +
Sbjct: 23 IRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHE--VCYNCLRRKPGEGRA- 79
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
S FCS C A + +S D + ++L + A N
Sbjct: 80 -SSGGDYFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP------PASTTIELTAALLAKDRL 181
+N Q +L QGT+ + S+ TI+ +LA+ R+
Sbjct: 139 ADCLNILQPALL--HQGTLIEMKEEFELLESTFRKAGFQEEITTFLTIDWYINVLARIRI 196
Query: 182 NAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAAEN 232
NAF + +D S A +Y+ SF+NHDC PNA
Sbjct: 197 NAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNA------HIVWLQ 250
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N D ++ + D+ +G E+C+ Y + D RQK L + F C C RC
Sbjct: 251 NADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 299
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 121/290 (41%), Gaps = 44/290 (15%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSS 66
+ VA + +GR LV+T+ G++++ P Y +P +++ S C CF SSS+
Sbjct: 13 LRVANVTEKGRSLVTTKDFNPGEVIISQEP---YVCVPNNSATESRCDRCF-----SSSN 64
Query: 67 I--CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQ 116
+ C +C VA +C C C+AL++L +P L + R+
Sbjct: 65 VKKCSACQVAWYCGSSCQKLEWKLHRIE--CEALSKLDKDRRKYVTPSIRLMVRLLIRRK 122
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELT 172
LQ+ +I++ A N + L+A + + A+ ++ + P I+
Sbjct: 123 LQSEKIISST--ATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWP-DVNIKEI 179
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAE 231
A +K NA + D G+Y S NH CLPNA FD A
Sbjct: 180 AENFSKLACNAHTIC-------DSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLA--- 229
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+V + +P+G E+ +SY TRQK L Y F C C RC
Sbjct: 230 -----VVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRC 274
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ + + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F L + Q G ++V GI+ NHDC PN C + NN I +R
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 267
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 114/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVITSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRMDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L E Q G ++V GI+ NH+C PN + V + T
Sbjct: 179 CNGFTLSE-----QRGLQAV-GVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQT 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNFSEERKKQLKKQYYFDCTCEHCQ 280
>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 740
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 115/305 (37%), Gaps = 68/305 (22%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS---- 72
GR +++T+ +K G+++LR +P AL C +CFRK + S C SC
Sbjct: 272 GRCIMATRDIKQGELILRVAP--FGCALSDDQIDLNCGSCFRKIKYYKSEKCNSCKTNLL 329
Query: 73 ----------VAFCSPKCSTAAASSSHSPYV----------------CQALTRLLNLNSP 106
V+ +CS + H P L R+ +P
Sbjct: 330 CEICIKDPTIVSMHQEECSLISFLQKHYPNAQTRDFRFMIRVLLSGRANKLGRITKEKTP 389
Query: 107 DAANLPLDRQLQARFLIAAY--------NLAVVNPSQFQILLAFQGTVTD--NDTSAAHY 156
N Q F+ +Y +++ + Q Q ++T+ Y
Sbjct: 390 QLWN-------QQPFIYDSYEDLASLTTDVSKIESKQLQSFTVATNSITNFFAAVKGPKY 442
Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
LS P +L +L NA ++P + GR GIY AS+ NH C
Sbjct: 443 LSD--PFTKQEIFDLYPKIL----FNAHEYIDPLTHYEIGR------GIYPTASYINHTC 490
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
LPN ++ +++ I+ R D+ +G EI SY + R+K L Y F C
Sbjct: 491 LPNTTWYN------DDHGLILYRSSRDILKGEEITTSYLDILKPKLQRRKD-LKQYSFVC 543
Query: 277 DCDRC 281
C+RC
Sbjct: 544 QCERC 548
>gi|414871458|tpg|DAA50015.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 299
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 117/287 (40%), Gaps = 27/287 (9%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ E GRG+ +T+ + +G+++ P++ + + ++ C+NC R+ +
Sbjct: 23 IRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHE--VCYNCLRRKPGEGRA- 79
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
S FCS C A + +S D + ++L + A N
Sbjct: 80 -SSGGDYFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAKDRLNA 183
+N Q +L QGT+ + S+ A E+T +LA+ R+NA
Sbjct: 139 ADCLNILQPALL--HQGTLIEMKEEFELLESTF--RKAGFQEEITTYWYINVLARIRINA 194
Query: 184 FGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
F + +D S A +Y+ SF+NHDC PNA N
Sbjct: 195 FRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNA------HIVWLQNA 248
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
D ++ + D+ +G E+C+ Y + D RQK L + F C C RC
Sbjct: 249 DAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 295
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 109/287 (37%), Gaps = 60/287 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
+GRGL +T+ K G +V P A NS C CF + S C C
Sbjct: 13 KGRGLCATKVFKPGNLVRAADPY----AYVLCNSERGKRCDFCFARK--DDMSRCSGCKF 66
Query: 74 A-FCSPKCSTAAASSSHSPYVCQALT---------------RLLNLNSPDAANLPLDR-- 115
A +C KC AA + S C+++ R++N D+ +P +
Sbjct: 67 ARYCDGKCQKAAWTEHKSE--CKSIKTVKPETPTDSIRLIARIINKTKTDSPGVPGNSID 124
Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
+LQ+ N+ + +L + G +D A EL +
Sbjct: 125 ELQSNLREMPENVKEMFAQLAVVLRMYVGKDVMDD--------------AREIFELFGRM 170
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
N F + +P ++ GIY K S FNH C PN A N
Sbjct: 171 TC----NTFSICDPEMQ-------YIGIGIYPKMSLFNHSCEPNC-------VAVFNGLR 212
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ VR I ++ G E+ +SY + S R+++LL Y F C C RC+
Sbjct: 213 MEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRCQ 259
>gi|225713426|gb|ACO12559.1| msta, isoform A [Lepeophtheirus salmonis]
Length = 505
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 120/283 (42%), Gaps = 50/283 (17%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+T+ +KA +++L +SP+++ P+I+S++ C CF KT+ +S C SC+ FC
Sbjct: 59 GRHLVATRDIKALEVILEESPLVIG---PYISSNNQCLECF-KTLDTSPHSCISCNYPFC 114
Query: 77 SPKCS---------TAAASSSHSP-------YVCQALTRLLNL--NSPDAANLPLDRQLQ 118
S +CS +A H Y RLL+L + P+ N + L
Sbjct: 115 SQECSQGPWHKIECSAFQKYGHKRDENDNDFYSIITTLRLLSLQESKPEFWNTIMKDMLD 174
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
I Y P ++ T + + + SL A T + L+
Sbjct: 175 HNSDIQEY-----TPEKW----------TSYEETVVKKIQSLTKGNADT--DTLTRLIGV 217
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N L P+ +G+ G+ + G+YL S NH C+ N ++ D E +
Sbjct: 218 IDTNGADLNLPH-DGKHGKGT----GLYLIYSNLNHSCVCNTKTINFQDRRIE------M 266
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R ++ +G +IC Y N R+ + + F C C RC
Sbjct: 267 RASVNISKGEQICNQYVRPNKCTLARRTTIKNKWYFDCCCKRC 309
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 1 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 55
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 56 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 113
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 114 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 172
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 173 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 226
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 227 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 274
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
dendrobatidis JAM81]
Length = 503
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 70/328 (21%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT---- 60
S ++++ + R +++T+S+ G VL + A+ FI+ + C C +
Sbjct: 29 SSGLSLSTSATKHRCILATKSIPVGSTVLSEP------AMAFISLGTLCEYCLSTSDIQG 82
Query: 61 MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYV-------CQALTRLLNLN-------- 104
H+ C C V +CS C A + H+ C A TR N N
Sbjct: 83 QHTQLQRCTGCRQVVYCSVVCQKADWIAGHNSACAVLKTIDCSATTRQSNRNDIAMLFKV 142
Query: 105 -----SPDAANLPLDRQLQARF----LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH 155
+P + LD ++ + N+ + P F L+ Q V D +T A
Sbjct: 143 VRIISNPSFQSTTLDDRMDLDCPMDNVALVRNIRSIQPLVF---LSLQSHVLDFETCQAS 199
Query: 156 Y--------------LSSLCPPPASTTIELTAA----LLAKDRLNAFGLMEPYIEGQDGR 197
Y + S P A + L + L + R N F ++ D
Sbjct: 200 YPHATNALNGHSMKQIESQLPSAAVKLMNLPVSDLIHHLGRFRCNNFTII-------DSN 252
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
G Y AS FNHDC PN A + + ++++ I D+ +G E+C+SY
Sbjct: 253 LFPVGEGTYPLASLFNHDCWPNC-------IAIFDGSRVVIQTIRDIAKGDELCISYIDP 305
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
D+ +R+ L Y F C C C E+
Sbjct: 306 ILDHDSRRMSLETKYCFNCQCSVCMSES 333
>gi|195493827|ref|XP_002094580.1| GE20125 [Drosophila yakuba]
gi|194180681|gb|EDW94292.1| GE20125 [Drosophila yakuba]
Length = 667
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 130/312 (41%), Gaps = 40/312 (12%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ S+ V + E + +GR +V+ + L+ G ++L + P+ + L + CH+CF++
Sbjct: 253 LIGASKVVRLVETKEKGRFVVANEGLRTGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 310
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
S C CS +AFCS +C A SS H + C+ + ++ + L QA
Sbjct: 311 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRVFTQA 367
Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
L +A NL + P + G + + ++ T
Sbjct: 368 SSLEQGLATANLLFEHLCSHEEERQPEDYLRRTLMSGFLLRILQKSLYFGRRKTEGVNPT 427
Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
+EL A LL + NA + + + + DG ++V A G+Y S+FNH+C P
Sbjct: 428 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 487
Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
+ AC F +++ + ++Y P + + RQ+ L Y F+
Sbjct: 488 STACHF--------VGKKLVLTATKPHRANESVAVNYGPIFIKNNLKERQRALRGRYSFS 539
Query: 276 CDCDRCKVEANW 287
C+C C + NW
Sbjct: 540 CNCMAC--QENW 549
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 111/280 (39%), Gaps = 22/280 (7%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V + EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEIFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCST-AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C A + H + ++ N N AA + + + L
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKHECSAIKRYGKVPNENVRLAARIMWRVEREGTGL 121
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
L V+ Q + + + +L P +++ + + N
Sbjct: 122 TEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN 180
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
F L + Q G ++V GI+ NHDC PN C + NN I +R +
Sbjct: 181 GFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRALG 227
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ G E+ +SY R+++L Y F C C+ C+
Sbjct: 228 KISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVPVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +I+ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQPFSIQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V A
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVTEERRKQLKKQYYFDCACEHCQ 280
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFITEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|21355139|ref|NP_648574.1| SET and MYND domain protein 4 [Drosophila melanogaster]
gi|7294583|gb|AAF49923.1| SET and MYND domain protein 4 [Drosophila melanogaster]
gi|17945748|gb|AAL48922.1| RE32936p [Drosophila melanogaster]
Length = 663
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/312 (23%), Positives = 132/312 (42%), Gaps = 40/312 (12%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ S+ V + E + +GR +V+ + L+ G ++L + P+ + L + CH+CF++
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLFEEPVA--ACLEPSYFGTHCHHCFKR- 305
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H+ S C CS +AFCS +C A SS H + C+ + ++ + L QA
Sbjct: 306 LHTPVS-CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363
Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
L +A NL + P + G + + ++ T
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 423
Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
+EL A LL + NA + + + + DG ++V A G+Y S+FNH+C P
Sbjct: 424 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 483
Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
+ AC F +++ + ++Y P + + RQ+ L Y F+
Sbjct: 484 STACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 535
Query: 276 CDCDRCKVEANW 287
C C C + NW
Sbjct: 536 CSCMAC--QENW 545
>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
Length = 529
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 24/273 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR +VS ++LK G ++L ++P+++ P C C+ KT+ SS +C C
Sbjct: 49 GRYMVSRKNLKPGDVILSEAPLVIG---PCTGCKVQCLGCY-KTLESSFCTKCTCGWPLC 104
Query: 77 SPKCSTAAASSSHSPYVCQAL-----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
SPKC HS C L ++ LN ++ + + + R L+ L
Sbjct: 105 SPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCLL----LKET 160
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLME-- 188
+ +Q L+ + + ++ I+ +++ +++ G++E
Sbjct: 161 DARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVN 220
Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
+ GQ G V G+Y A +HDC+PN D E+ + VR + G
Sbjct: 221 SFEIGQQG---VNIRGLYPSAFLMSHDCVPNTNHID-----EESTFRLTVRASTRIEPGE 272
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I LSY R++ LL + F C C RC
Sbjct: 273 MITLSYAYTLQSTLKRREHLLENKFFECQCRRC 305
>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
Length = 338
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+Y + S NH C PNA A N ++R + D+P+G E+C+SY R
Sbjct: 133 VYAELSRANHSCQPNA-------AVVYNGAAAVLRSMRDIPEGEEVCISYVDPTLARDVR 185
Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
++ L+ Y FACDC RC EA+
Sbjct: 186 RRELVQSYGFACDCARCATEAS 207
>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
Length = 528
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 24/273 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR +VS ++LK G ++L ++P+++ P C C+ KT+ SS +C C
Sbjct: 48 GRYMVSRKNLKPGDVILSEAPLVIG---PCTGCKVQCLGCY-KTLESSFCTKCTCGWPLC 103
Query: 77 SPKCSTAAASSSHSPYVCQAL-----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
SPKC HS C L ++ LN ++ + + + R L+ L
Sbjct: 104 SPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCLL----LKET 159
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLME-- 188
+ +Q L+ + + ++ I+ +++ +++ G++E
Sbjct: 160 DARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVN 219
Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
+ GQ G V G+Y A +HDC+PN D E+ + VR + G
Sbjct: 220 SFEIGQQG---VNIRGLYPSAFLMSHDCVPNTNHID-----EESTFRLTVRASTRIEPGE 271
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I LSY R++ LL + F C C RC
Sbjct: 272 MITLSYAYTLQSTLKRREHLLENKFFECQCRRC 304
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFIAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + D +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIRRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 114/288 (39%), Gaps = 44/288 (15%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
S+ V + E +GRG+ + L G V +++P + + SS+CH+C M S
Sbjct: 8 SDNVEIFETYEKGRGIRCKKQLAIGTSVGKENPFCHVVSQDML--SSYCHSCL--LMQSE 63
Query: 65 SSICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL----------PL 113
C C + +C C Y C+++TRL PD+ L
Sbjct: 64 LYKCSRCKIIMYCCKSCQ--KEDWQWHKYECKSITRLGPKVPPDSIRLLGRVAYTILQGQ 121
Query: 114 DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
DR Q +FL++ N ++ S+ + T+ D YLS+ +
Sbjct: 122 DRADQFKFLLS--NRELLEGSR-------KNTIVDGINLLKEYLSN----KVAINENEII 168
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++++ N F + ++ G+Y S NH C PN +A
Sbjct: 169 EIISRVTCNTFTICNSEMQ-------TVGIGVYPGLSLVNHSCSPNC-------SATFRG 214
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +R+I + G E+ +SY S+RQ +L Y F C C+RC
Sbjct: 215 KQMQLRIIENTKIGDELLISYIDPMQVLSSRQNQLQSQYCFKCICERC 262
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 111/283 (39%), Gaps = 30/283 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
E V V + G+GRGL +T+ AG ++ + P ++++ F +SS CH+CFR+
Sbjct: 2 EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
C C A +C C A C A+ P + N+ L ++ R
Sbjct: 57 LQRCGQCRFAQYCDKTCQRAGWEEHKLE--CAAIK---TYGKPPSENVRLAARILWRMDK 111
Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
+++ L + + I + + D ++L T++ + +L
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N F + Q G ++V G++ NHDC PN NN I +
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNNGKIEL 218
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R + + G E+ ++Y + RQ+ L Y F C C C
Sbjct: 219 RALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHC 261
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280
>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
Length = 701
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/314 (23%), Positives = 124/314 (39%), Gaps = 44/314 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS-CSVA 74
+GR + + + +K G+ +LR P + + C CF KT+ SS C C V
Sbjct: 119 KGRCVRARREIKPGETILRCDPFV--HVVHDRRREDHCAGCF-KTLQSSCVECGDFCGVK 175
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
+C+ C + + A ++ N+ A+L AR + + +P+
Sbjct: 176 YCADDCRESDVAHV-------AECSMVRTNAGTGADL-----RGARMCLRLIHKRQTDPA 223
Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
+F ++A + + AA L+S A+ +L K R N+ G++
Sbjct: 224 RFAEVMAALRCEKKSPSPAATKLASAVRHAAALDPSEVEDMLGKTRENSHGVV------- 276
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRF---------------DYVDAAAENNTDIIVR 239
D + GIY +AS FNH C PNA DY A E V
Sbjct: 277 DWKLRQLGTGIYPEASMFNHSCAPNAAVSFGAGGTLNTCCIGVDDYEARAMEVGPKGYVP 336
Query: 240 MIHD--VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND---- 293
+ + +G E+C++Y + ++R+ L F C+C RC + D +
Sbjct: 337 LGRPRAIRKGEEVCIAYTELYRPAASRRLTLEKSKGFVCECVRCANASVLRALDTELHAA 396
Query: 294 DENNEEVMDEDQDE 307
+ NEEV DE E
Sbjct: 397 TDGNEEVADEKAKE 410
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/329 (21%), Positives = 133/329 (40%), Gaps = 35/329 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
ETV + + G+GRGL +T+ AG ++ ++ I +++ +L + CH+CFR+
Sbjct: 2 ETVAIFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSL----AERICHSCFRR--QE 55
Query: 64 SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQA 119
C C A +C C A A +A ++ N N A + LD++
Sbjct: 56 KLQRCGQCKFAHYCDRTCQRAGWAEHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGST 115
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
++ L ++ + I + + + ++L TI+ + +
Sbjct: 116 ---VSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVI 172
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
N F + Q G ++V G++ NHDC PN N+ I +R
Sbjct: 173 NCNGFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELR 219
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDNDND 293
+ + +G E+ ++Y RQ+ L Y F C C+ CK ++ + D
Sbjct: 220 ALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKNRIKDDIKIGGREEDGV 279
Query: 294 DENNEEVMD-EDQDEQMVASDDDAEVHGD 321
+ E+V + D QM+ + A ++GD
Sbjct: 280 KPSEEQVKEATDYCFQMLEKMEKARLNGD 308
>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
dendrobatidis JAM81]
Length = 227
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 12/116 (10%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++++ N FG+ +P +G+ + ++ AS+FNH C PN + +++
Sbjct: 75 IISRIESNCFGIWKP-----NGKEACMGRAVFPAASYFNHSCFPNC-------QSIKHDH 122
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+ R + DV +G + +SY N S R+ RL+ DY F C C+RC E+ N
Sbjct: 123 KMAFRTLKDVSKGEMLTISYIDTNMPVSARRARLMDDYFFECMCERCISESGMVSN 178
>gi|195327115|ref|XP_002030267.1| GM24661 [Drosophila sechellia]
gi|194119210|gb|EDW41253.1| GM24661 [Drosophila sechellia]
Length = 660
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 130/312 (41%), Gaps = 40/312 (12%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ S+ V + E + +GR +V+ + L+ G ++L + P+ + L + CH+CF++
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
S C CS +AFCS +C A SS H + C+ + ++ + L QA
Sbjct: 307 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363
Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
L +A NL + P + G + + ++ T
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 423
Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
+EL A LL + NA + + + + DG ++V A G+Y S+FNH+C P
Sbjct: 424 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVNLAAGLYGTGSYFNHECWP 483
Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
+ AC F +++ + ++Y P + + RQ+ L Y F+
Sbjct: 484 STACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 535
Query: 276 CDCDRCKVEANW 287
C+C C + NW
Sbjct: 536 CNCMAC--QENW 545
>gi|115481972|ref|NP_001064579.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|78708616|gb|ABB47591.1| SET domain protein 123, putative, expressed [Oryza sativa Japonica
Group]
gi|113639188|dbj|BAF26493.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|215767244|dbj|BAG99472.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767272|dbj|BAG99500.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612812|gb|EEE50944.1| hypothetical protein OsJ_31491 [Oryza sativa Japonica Group]
Length = 298
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 128/301 (42%), Gaps = 39/301 (12%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
+A + + VA E GRG+ +T+ + AG+++ P++ + + P I+ C++C R+
Sbjct: 12 TAAAPPIRVALTESSGRGVFATRPISAGEVLHSAQPLVSHPSPPLIHE--VCYSCLRRKS 69
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQAR 120
S SC FCS C A H + + S P + ++L
Sbjct: 70 GSGGGSSGSCY--FCSDACREHA-KGFHGVEKKADWSLFDDHCSSRGLKYPYMAKRLACM 126
Query: 121 FLIAAYN---LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-- 175
+ A + L ++ P++ QGT+T+ + A S+ A E+T L
Sbjct: 127 VISGAVSADCLDILQPARLH-----QGTLTEMEEEFALLDSTF--RKAGFQEEITTFLTK 179
Query: 176 ------LAKDRLNAFGL------MEPYIEGQDGRRSVRAY---GIYLKASFFNHDCLPNA 220
LA+ R+NAF + E + S A +Y+ SF+NHDC PN
Sbjct: 180 EWYINVLARIRINAFRIELVASSYENLLSSAVASVSCDAAVGNAVYMLPSFYNHDCDPNT 239
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
++ +A D ++ + ++ +G E+ + Y + D RQ+ L + F C C R
Sbjct: 240 -HIVWLASA-----DARLKALRNIEEGEELRICYIDASMDVDARQRILAEGFGFECRCQR 293
Query: 281 C 281
C
Sbjct: 294 C 294
>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 554
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 117/286 (40%), Gaps = 31/286 (10%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSSSSIC 68
+A E GR L +++ L+AG+++LR+ P+ + N F C + IC
Sbjct: 19 IAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCMKNPICFECLSLLPDIEQDFHYIC 78
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
C+V C C A HS Y C+ + + L+ + L F++ +
Sbjct: 79 SGCNVVTLCGFSCEERAY--YHSMYECEIIKNNMELSIENTDTLA-----AVLFVLRLWL 131
Query: 128 LAVVNPSQFQILLAFQGTVT---------DNDTSAAHYLSSL--CPPPASTTIELTAALL 176
L +P ++ +L+ + + D + + + L SL P + E+ L
Sbjct: 132 LKQKDPELWKRVLSLESHLNKRRNTIVWEDREINIVNVLKSLNFVPENDPSVSEIIQQLC 191
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
+N+F L P G DG + G+YL+AS HDC N V A++N +
Sbjct: 192 GILDVNSFELRSP--GGMDG---LLLRGLYLEASLMAHDCRGN------VHVTADDNFHL 240
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
V + +G I +Y + R++ L F C+CD CK
Sbjct: 241 TVYASIPIKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDLCK 286
>gi|330846003|ref|XP_003294847.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
gi|325074603|gb|EGC28624.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
Length = 428
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 134/318 (42%), Gaps = 71/318 (22%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF------RKTMHSSSSICPS 70
GR LV+T+ ++ ++LRD P A+ C +CF ++ + + +C
Sbjct: 78 GRYLVATKDIEEQTVILRDLPYTW--AVDHAACDIVCQHCFLEVPLNQQILPTDFYMCEG 135
Query: 71 CS-VAFCSPKCS--------------TAAASSSHSPYVCQALTRLLN------LNSPDAA 109
C+ V +CS C + +SP++ + L+ L +
Sbjct: 136 CNRVGYCSIHCKYIDYNQHRFECQIFKELDTEEYSPFLLSEIKLLVRTLSRKWLEESINS 195
Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-----TSAAHYLSSLCPPP 164
+ +D ++ I YN NPS L+ + ND ++ +Y SL
Sbjct: 196 SAGVDESEISK--INTYN-QYKNPSS---LIPQDNGLRYNDYADLVSNVENYNESLKESL 249
Query: 165 A----STTIELTAAL------------LAKDRLNAFGLMEPYIEGQ--DGRRS-VRAYGI 205
+ ++L A + L ++R NAF YI+G+ DG R G+
Sbjct: 250 SYWICKYIVKLAAKIDKEEDEDELLNILLRNRCNAF-----YIQGRPRDGTSGESRGCGV 304
Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
Y++ SFFNH C PN V+ NNT ++ ++ +V +G E+ +SY + + R
Sbjct: 305 YVRNSFFNHSCDPN------VNYWVVNNTLEVECTLLKNVKEGEELTISYIDTSAPLNKR 358
Query: 265 QKRLLYDYDFACDCDRCK 282
+++LL Y F C C +CK
Sbjct: 359 REKLLEGYLFTCLCTKCK 376
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNLSEERRKQLKKQYYFDCTCEHCQ 280
>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
Length = 464
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 114/297 (38%), Gaps = 48/297 (16%)
Query: 19 GLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSCSVA-FC 76
GL +T+ + G +L + YS + F N + CH CF+ S C C A +C
Sbjct: 2 GLFATKPYEVGNSILVEEA---YSWVTFRDNGPNICHTCFK--FSKSLKKCSQCKFARYC 56
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN-----LAVV 131
C A + + C+A+ S A +P +L A FL A N + +V
Sbjct: 57 GLDCQKKAWKENSHRWECKAIA------SSVEAKIPSIVRLAALFLFRALNSNDKDIRLV 110
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSS---------------LCPPPASTTIEL----- 171
+ + ++ + +D + LS L P EL
Sbjct: 111 DRCR-SFPYSYSNMDSRSDENDVDVLSRQRVDCLVEDLLRFIRLSHYPQHEVSELFEDNF 169
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN--ACRFDYVDAA 229
A+L+ +NA + D + G + KASF NHDC PN A +
Sbjct: 170 VASLIRMLEMNAHTIY-------DSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSV 222
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ I +R I + G EI +SY V + R + L Y F C C RCK E++
Sbjct: 223 SGKPISIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRCKEESS 279
>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
Length = 402
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
+I+ ++ ++ NAFG+ P + D R +G+Y ASFFNH C PN + +
Sbjct: 291 SIQAVRDIIGRNLTNAFGIWSP-VTSSDEEREYFGFGVYPSASFFNHSCKPNVTK---IR 346
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
A + + I + D+P G E+C+SY D +++ L ++ F C C RC E
Sbjct: 347 RGA--SYEYIAK--EDIPVGAELCISYGIRESDGLKQRQDALREWFFECGCSRCTTEGK 401
>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
Length = 487
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 110/280 (39%), Gaps = 29/280 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA-F 75
G +V+++ LK G+I+L++ P++L P INS+ C CFR+ + C C +A
Sbjct: 3 GTFMVASKDLKPGEIILKEQPLILG---PSINSNILCLGCFRRILFEKFKFCRECQIAPV 59
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA--------ANLPLDRQLQARFLIAAYN 127
C CS S HS Y C+ +L + D LPL FL +
Sbjct: 60 CGEDCSL---KSQHSKYECKYYKKLFENDKIDKNFIVSNFQIVLPLKCFSLKLFLFPDDD 116
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL------ 181
+ I + N+ +Y ++ S I L ++
Sbjct: 117 DDDDHLWDNFINMESHVDERQNEPIWNYYQENIIKVLNSVNILNDEECLLMQKICGILDV 176
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N+F + P E +R GIY +A+F +HDC+ N + + ++ +
Sbjct: 177 NSFEMRCPDDEQISAGNQLR--GIYREAAFMSHDCIGNT------HVSLNDFFEMTIHAS 228
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ G+ I +Y + RQ+RL F C C RC
Sbjct: 229 VPIKSGQAIFFNYANPFECTAARQQRLKKGKYFNCFCKRC 268
>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 481
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 126/324 (38%), Gaps = 63/324 (19%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS- 66
V V GRGR V+ +++ AG +VLR+ A P CH CFR ++
Sbjct: 106 VEVRYARGRGRYAVAARAVAAGAVVLRERAFAAAVAAP--QQGLACHACFRYAPNAGPGG 163
Query: 67 -----ICPSCS-VAFCSPKCSTAAASSS--HSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
C C V+FCS C+T S H C+AL RL + + +D +
Sbjct: 164 AALRHTCARCGQVSFCSAACATPTPSGRFVHPKAECEALQRLHT--TCNLKGFSVDEVAE 221
Query: 119 ARFLIAAYNLAVVNPSQFQILL-AFQGTVTDN----DTSAAHYLSSLCPPPASTTIELTA 173
R LI L + + + G +D +TS AH++ + S +
Sbjct: 222 VRLLIRILVLRWIAANSANVEEDGNNGEGSDTSSAVETSFAHFMELM-----SNKHSFGS 276
Query: 174 ALLAKDRLNAFGLMEPY--------IEGQD--------GR-------------RSVRAYG 204
A + + R AF +ME ++G D GR R YG
Sbjct: 277 AAINRFRQRAFLIMEALSGRATSDVLQGLDVDEIVEYIGRLESNTFARYSTTKRKTVNYG 336
Query: 205 --IYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
+Y A+ FNH C P+ R FD + +R + + G E+ ++Y P+ D
Sbjct: 337 SALYNCAAMFNHSCFPSVVRQFD--------GQHLTLRALRPLNPGDELTMTYIPLRDDT 388
Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
RQ L Y FAC C RC+
Sbjct: 389 PERQDELQQHYHFACRCQRCEARG 412
>gi|358391549|gb|EHK40953.1| hypothetical protein TRIATDRAFT_29913 [Trichoderma atroviride IMI
206040]
Length = 554
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 71/292 (24%), Positives = 114/292 (39%), Gaps = 46/292 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSS------- 66
G GL +T+ L AG+++ P++L + + +S C CF R+ +H
Sbjct: 66 GSGLFTTRDLAAGELIFTSVPLVLCAEVG--DSKEACDFCFQQRRRAIHPVEDRLADPGE 123
Query: 67 ------ICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
C C++ +CS C A + H Y C L AN P + +++
Sbjct: 124 TLPDVYKCMGCNLYQYCSESCWQRAWDTGHL-YECGLL-----------ANAPYELEIRM 171
Query: 120 RF-LIAAYNLAVVNPSQFQIL--LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
+ ++ V+ P Q Q L LA + + +S + + A + + L
Sbjct: 172 LYRILILLRKKVLLPEQVQALARLAHEQDKYEQLSSDWQGVKDIA---AEAKRRMKSELD 228
Query: 177 AKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
D L + L+ Q R S I L A+ NH+C PN N+T
Sbjct: 229 VADVLKLYCLIRCNAVPVDQTFRNSPLGSAIDLGAAMLNHNCEPNI-------VIVFNST 281
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ R + + G E+ Y + YD + R R+ Y F C CDRC E N
Sbjct: 282 RVEARAVRSIKAGEELQHCYRDIAYDCTFRSPRIAARYQFKCQCDRCIRETN 333
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ R + P+ N+ L ++ R
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAARIMWRVER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSAA--HYLSSLCPPPASTTIELTAALLAK 178
L +V+ Q + G + A +L P +++ + +
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
N F L + Q G ++V GI+ + NHDC PN + V +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGIFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHT 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQ 280
>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
Length = 614
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 121/323 (37%), Gaps = 75/323 (23%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
VA ++ RGR V+T+++KAG++V+ + I + + CH C + C
Sbjct: 68 VATVKNRGRCAVATRAIKAGEVVVAEQAIAFVPRGQY--RKAICHVCGVNVAENGPKFEC 125
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
+C+ +C SH Y A+ R + A++ D A L +NL
Sbjct: 126 SACNYTIFCEECYKPP-QGSHENYC--AVYR-------EIASIARDENCDADLLRLVFNL 175
Query: 129 AVVNPS---------QFQIL---LAFQ---------------------GTVTDNDTSAAH 155
A+ Q ++L L FQ G T + +
Sbjct: 176 ALKKAGTAQARSSFLQLELLIETLPFQNSKDETCTVDGVILSCFKDALGLETHRSKAPSS 235
Query: 156 YLSSL---CP--------------PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
+L ++ C P S +IE L N+ G+ Q+ +
Sbjct: 236 WLEAVRNGCTRLHKIACVGNGGKLAPISCSIEELEMLATLINTNSHGM-----GAQNLQN 290
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
+ A G++ S NH C PN C A + + VR + D+P+G E+CLSY +
Sbjct: 291 THVALGLFPYVSILNHSCRPNCC-------FASEGSVMYVRAVQDIPKGAELCLSYINLY 343
Query: 259 YDYSTRQKRLLYDYDFACDCDRC 281
R+ L+ F C CDRC
Sbjct: 344 ESRRVRKTLLVATKHFDCTCDRC 366
>gi|302896962|ref|XP_003047360.1| hypothetical protein NECHADRAFT_123247 [Nectria haematococca mpVI
77-13-4]
gi|256728290|gb|EEU41647.1| hypothetical protein NECHADRAFT_123247 [Nectria haematococca mpVI
77-13-4]
Length = 354
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 131/330 (39%), Gaps = 73/330 (22%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---FCHN--CFRKTMHSSSSICPS 70
+GR L ++ +++ G ++L D+P Y+ +P +NS C N C R+ +
Sbjct: 41 KGRLLRASAAVREGTVLLIDTP---YAIVPSVNSDKEDLICSNLSCSRRVPRDGHGV--Q 95
Query: 71 CS------VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--- 121
C V +C+ C AA + C L + L DR+ Q F
Sbjct: 96 CHRNCFQDVVWCNTACR--AADHVRHGFECLWLK--------EKGELIRDREGQYNFVTI 145
Query: 122 -----LIAAYNLAVVN---------PSQFQILLAFQGT--------------VTDNDTSA 153
L+A +N+ + + P + ++ + A
Sbjct: 146 WHVVRLLATWNVELKSGASISRHRYPRDDEFSRGWKAVEMCCSYLDSWPESQINHWKRLA 205
Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG---QDGRRSVR--AYGI--Y 206
YLS P + + +L+ K+ N FGL P + G R + R +YG+ Y
Sbjct: 206 EEYLSDASVLPCVLSPDEMLSLICKEETNTFGLY-PKVTGPLCMIDRPTARGDSYGLSLY 264
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTR 264
+A+ FNH CLPN V + ++ D+ QG E ++YF V+ D ++R
Sbjct: 265 PRAAMFNHSCLPN------VTHKPDAQGRMVYTAARDIAQGEECMITYFDLTVHEDVASR 318
Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
QK + + F C C+RC E + +N D
Sbjct: 319 QKHVQEQFQFKCTCERCLSEEAEENVENMD 348
>gi|50550485|ref|XP_502715.1| YALI0D11792p [Yarrowia lipolytica]
gi|74634629|sp|Q6C9E7.1|SET5_YARLI RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49648583|emb|CAG80903.1| YALI0D11792p [Yarrowia lipolytica CLIB122]
Length = 438
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 112/305 (36%), Gaps = 48/305 (15%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSFCHNCFRKTMH 62
E VT+ + RG+GL + + +K + + + LL L + C C R
Sbjct: 107 EKVTLNIAKNRGKGLYAKKDIKKDEELWNEQAFLLVPPLEHVGIMRKGMGCAFCSRPFQS 166
Query: 63 SSSSICPSCSVAFCSPKCS------TAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
+ CPSC+ +C +C A S V + R + ++
Sbjct: 167 HNGVECPSCAAKWCDNQCKKKDVTHVAMWHESRHGKVTRVEWRAFE-------DFCVENG 219
Query: 117 LQARFLIAAYNLAVV-NPSQFQI---LLAFQGTVTDNDTSAAHYLSSLCPPPASTTI--- 169
A + + L V+ +P + ++ ++AF D A +SL S +
Sbjct: 220 WMALYALGLLWLRVIRDPKKDEVQKQMMAFARVGQDERHKAVEQSNSLFASEQSEVLWKK 279
Query: 170 --ELTAALL-------AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
E+ LL KD L G+ I DG +YL S NH C PN
Sbjct: 280 GYEMLDKLLLDTEFSYEKDFLPGLGMFN--INNVDG-------NMYLTQSHLNHSCEPN- 329
Query: 221 CRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
VD T I VR D+ G E+ +Y + R+ L ++ F C+C
Sbjct: 330 -----VDVKNVGRTQGISVRAKRDIKTGEELFTTYVNPEHQLDDRRYNLRVNWGFNCNCT 384
Query: 280 RCKVE 284
RCK E
Sbjct: 385 RCKRE 389
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVAVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIS 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 112/302 (37%), Gaps = 52/302 (17%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR +++++ + G+++LR +P +AL + C +CFR + +C C
Sbjct: 728 GRKIIASRDIAKGELLLRVAP--YGAALVDDTLLTHCTSCFRNISYYKHHLCQKCKQCIL 785
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN---- 132
+C++ + C L L N P A D + R ++ + +A++N
Sbjct: 786 CEECNSDVDLVNEHNEECDILV-FLKQNVPGADTR--DFRFMVRVMLKS--IAILNGKLS 840
Query: 133 -------------PSQFQILLAFQGTVTDND-------------TSAAHYLSSLCPPPAS 166
P F TD T + + L P S
Sbjct: 841 KEQSPKCWSKNGVPFIFDSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKS 900
Query: 167 T---TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
T + L K NA ++P + R GIY A++ NH C PN
Sbjct: 901 LEPLTNQQILDLYPKMLFNAHEYIDPLYHSEVAR------GIYPTAAYLNHSCEPNTVWH 954
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ +NN I R I D+ G EI +Y + STRQ LL Y F C C RC+
Sbjct: 955 N------DNNGMIAYRSIRDIKAGEEITTTYIDITKYKSTRQLNLLSQYAFLCQCARCQD 1008
Query: 284 EA 285
A
Sbjct: 1009 RA 1010
>gi|354543298|emb|CCE40016.1| hypothetical protein CPAR2_100550 [Candida parapsilosis]
Length = 477
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 141/360 (39%), Gaps = 50/360 (13%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRKTMHSS 64
V V RG+ L + + +K G+++ ++P+ L + S C NC + S
Sbjct: 110 VQVVMTSKRGKCLYAKKPIKEGELIWSETPLFFVPPLANVHLVEHGSACANCGKLLQQSE 169
Query: 65 SSI------CPSCSVAFCSPKC----STAAASSSHSPYVCQALT-RLLNLNS-PDAANLP 112
C C+ +CS KC S ++ H+ + + ++++ N+ +
Sbjct: 170 MKTVLKGLDCDVCAEVWCSKKCKQIDSQLHGATKHNVFNPNSKKPKIIDSNAFLELQEYC 229
Query: 113 LDRQLQARFLIA------AYNLAVVNPSQFQIL-----------LAFQGTVTDNDTSAAH 155
L A + IA + + V QF + L G DN A
Sbjct: 230 LKESWNALYAIALIYANIVLDKSGVKQKQFNAMARVSQDVRYKALDSGGGTFDNMNGGAL 289
Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
++ E + K+ ++ + Y+ G ++ + +YL S NH+
Sbjct: 290 FVQEQQETLWKEGYEKFLRVFPKNTISYKEFL--YMMGTYNINNLDS-SVYLTQSHLNHN 346
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
C PN VD A++ + + V D+ +G E+ +Y ++ RQ+ L ++ F
Sbjct: 347 CDPNTT----VDTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTLHQRQRELRVNWGFI 402
Query: 276 CDCDRCKVEANW-------SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAY 328
C CD+CK +AN S + D N M ++ EQ+ +D E+ D PH +
Sbjct: 403 CACDKCKSDANATGRRKSSSASKKQDAKNIRNMLKETKEQV---GEDGEIELDIP-PHEF 458
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 115/277 (41%), Gaps = 38/277 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
+GRGL + + L G+++ + ++A+ F + ++ CH+CFR ++ C C A
Sbjct: 17 KGRGLRAAKELNTGEVIFSEPS---FAAVVFDSLATQVCHSCFRH--QANLHRCAQCKFA 71
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY-NLAVVN 132
+C C TA + C A+ +L S + N+ +L AR L + + + +
Sbjct: 72 HYCDRTCQTACWNEHKQE--CGAIKKL---GSAPSENI----RLAARVLWRMHKDTGIAS 122
Query: 133 PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
SQ + Q V D +L +++ + + + N F
Sbjct: 123 DSQLISVDQLQEHVADLAAEDFKQLRADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGF 182
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
L + Q G ++V G++ NHDC PN N+ I +R + +
Sbjct: 183 TLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELRALGKI 229
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
P+G E+ +SY + RQK+L + F C C+ C
Sbjct: 230 PEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHC 266
>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
Length = 505
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 111/295 (37%), Gaps = 51/295 (17%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
+V+ + + I G GL + +++ G + R +P++ NC R H
Sbjct: 34 SVALEILPSRIHNNGTGLFAMEAIPEGSEIFRSTPLV---------------NCVRSAKH 78
Query: 63 SSS--------SICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
+ S+C C V +CS C A H C+AL R SP A
Sbjct: 79 AKGEVMVLPKISLCTQCGVCGYCSKDCQAKALEKYHKSE-CEALKR-----SPSA----- 127
Query: 114 DRQLQARFL---IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
L R L +A + + ++Q L G N++ + A T+I+
Sbjct: 128 --DLWTRMLYRVLAMHKQGTYSEQEWQALHHLCGNQRQNNSVFDSHKGIDISKGAKTSIQ 185
Query: 171 LTAALLAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
A L L I Q+G + + L SF NH C PNA F
Sbjct: 186 SKLGNEAIQDLFCKVLTNVMCINHQEGSPGIT---LDLVGSFVNHSCDPNAFVFF----- 237
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ + +R + + G EI L+Y + R+++L Y F C C+RCK E
Sbjct: 238 --EGSQLRMRSLKPINAGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKKE 290
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/268 (24%), Positives = 107/268 (39%), Gaps = 24/268 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+++T+ LK G+I+ +DSP+L+ A ++ + C C K + + +C C +
Sbjct: 34 GRGVMATRQLKRGEIIFQDSPLLIGLAAQEEDTLNACSICL-KMLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS A H + + N PD AN + R L + A NL Q
Sbjct: 91 --PICSLCAKKKQHKSDC--DMFKSWGPNEPDVANSVIIRLL---CVARAINLT---KDQ 140
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
++ Q + +N + + P +E+ +A R N F + +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLVEIMNRTVAVLRTNGF---DKTTDR 197
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ +Y + NHDC+PN+ E ++IVR D+ +G EI +
Sbjct: 198 TTDNQEFSYRALYPLFAVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDIAEGEEITTT 252
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + R L F C C RC
Sbjct: 253 YTKLFTGNIARHLYLKMKKGFTCKCPRC 280
>gi|342182557|emb|CCC92036.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 387
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 63/150 (42%), Gaps = 27/150 (18%)
Query: 163 PP----ASTTIELTAALLAKDRLNAF---------GL----MEPYIEGQDGRRS-----V 200
PP AS +E+ LL ++ LN F GL ++P + G S +
Sbjct: 240 PPVSSDASWALEIVDGLLREETLNTFLGILALNSQGLNGFVIDPPVNGDTSSHSSIEWVL 299
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
+ GIY S FNH C PN F DI +R + DV G E+ +SY PV
Sbjct: 300 KGGGIYSLQSNFNHSCQPNVAVFTEC-----GTHDITLRTLRDVQAGEELTISYIPVENT 354
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+ ++L Y F C C C+ E N D
Sbjct: 355 NRAERHKMLEGYFFTCQCALCEYEKNLVDT 384
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 2 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 57 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 114
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 115 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDMFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 227
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 228 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 275
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 114/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QER 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 280
>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
Length = 495
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 131/339 (38%), Gaps = 72/339 (21%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------- 67
G G GL +T ++K GQ V D P + L + C NCF +++ +
Sbjct: 15 GAGTGLFATSTVKTGQAVF-DLPATFSTVLSTERLADACSNCFANAPFAANVMADVGIKL 73
Query: 68 --CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNS---PD 107
C C V +C C T ++ H P +A+ R+L L S PD
Sbjct: 74 RFCTGCRVVKYCGKGCQTENWAAIHKHECATYKRLYPKVLPVNSRAVLRILKLRSSKQPD 133
Query: 108 AA-NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
+L + R L++ F +A N Q++ ++ + +H +
Sbjct: 134 VQQDLGMFRSLRSHF----KEIAETNKEQYERIM----LCAKAEKEYSH---------SD 176
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
IE A LAK +N F P+ G ++ + A F NH C PNA
Sbjct: 177 LDIETIAEYLAKIEVNGFTFTTPF--GDPLGLCIQPF-----ACFVNHSCDPNA------ 223
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ I V+ + + ++ +SY + RQK L Y F C C +C ++ +
Sbjct: 224 -VVGFDEGRITVKALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKC-LQGD 281
Query: 287 WSDND---------NDDENNEEVMDEDQDEQMVASDDDA 316
+ D ND E +E E Q ++++AS +A
Sbjct: 282 KAREDQFIPASPPSNDIETLKEA--EKQAQELLASAKNA 318
>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
Length = 611
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 115/289 (39%), Gaps = 30/289 (10%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ A S + VA+ GRG+ + + + G+++ R+ +L+ + C C+ +
Sbjct: 32 LRAESPSYRVADSPISGRGIFALRDIAKGELIFREHTLLVGPTAHRGRNLRTCTQCYGQP 91
Query: 61 MHSS--SSICPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
S S++CP+ C + P C A+S+ H+P CQ + + A P
Sbjct: 92 PGDSDASALCPAGCGL----PVCGNCASSTRHTPE-CQLFRKWQPKEATKAGIEP----- 141
Query: 118 QARFLIAAYNLAVVNPSQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTA 173
QA +++ +N Q ++L A Q AA P +E
Sbjct: 142 QALRILSVVRCFFLNEPQRKLLYAMQANADRYYIREVERAAECFEQF--PRDPELLEYFY 199
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
+ NAF +I+GQ+ V A ++ A NH C PNA E+
Sbjct: 200 RTVCAFNTNAFE-SRFHIDGQE----VVARALFPLAGLLNHQCTPNAAHH------FEDG 248
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I+V +P G EI +SY + + R+ L F C C RC+
Sbjct: 249 ETIVVTATERIPMGAEITMSYAKLLWSTLARKMFLGMTKHFMCRCPRCQ 297
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 14/91 (15%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR +P+G EI +Y + + T
Sbjct: 391 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARAIPKGGEITTTYTKILWSNLT 444
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
R L F C+CDRC NDN +
Sbjct: 445 RGIFLKMTKYFVCNCDRC--------NDNSE 467
>gi|255081676|ref|XP_002508060.1| predicted protein [Micromonas sp. RCC299]
gi|226523336|gb|ACO69318.1| predicted protein [Micromonas sp. RCC299]
Length = 541
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 113/309 (36%), Gaps = 44/309 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
V V + + RGR LV+ + K G +VL ++P LP + C CF K +
Sbjct: 24 VDVRDDKKRGRCLVAAKDFKRGDLVLANAPY----GLP-VPMGDRCAGCFMKEDDDRVKM 78
Query: 68 -CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRL-------------------LNLN-- 104
C +C V FCS +C A H+ C A+ R L+ N
Sbjct: 79 RCKACDMVWFCSERCMDGAMDGGHA-VECAAMRRYFADDPDDAIRENGFVQFSALHRNCG 137
Query: 105 ---SPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQIL--LAFQGTVTDNDTSAAHYLSS 159
P+ + PLD L AA + + G T+ A
Sbjct: 138 KAAHPEDTDEPLDDHLPYLVPTAADKADTIARWALHGFPWREYCGDFAAEATAGARRCGL 197
Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
+ P+S T E + + R+N F + + + G + A Y + NH C P
Sbjct: 198 I---PSSATDETIKKEVVRSRMNDFFIQQ----WRGGDIDLMAGATYPLGALLNHSCAPT 250
Query: 220 AC---RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
R D DA R + DV +G E+ +Y + + R+ LL Y F C
Sbjct: 251 CVCSYRLDISDAVQYPTWIQEFRCVADVGKGTELTHAYVDASDWSNHRRAELLDRYGFVC 310
Query: 277 DCDRCKVEA 285
DC RC A
Sbjct: 311 DCPRCPTNA 319
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFNSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQRELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V G++ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHHCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVAVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQGQQFSMQYISHIFGVIS 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNLSEDRKKQLKKQYYFDCTCEHCQ 280
>gi|412986494|emb|CCO14920.1| SET and MYND domain-containing protein 5 [Bathycoccus prasinos]
Length = 336
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 132/337 (39%), Gaps = 79/337 (23%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRK------T 60
++ E +GR + + + K+G IV ++P++ + ++ +CH CF K +
Sbjct: 16 ISWCERKGRRVTTRKETKSGTIVSDEAPLVWLAGCEKDDDDDADEYCHRCFEKITKPNAS 75
Query: 61 MHSSSSI---CPSCSV-AFCSPKCSTAAASSSHSPY-VCQALTRLLNLNSPDAANLPLDR 115
+ SSSS C C+V +CS C AAS H C A + D P D
Sbjct: 76 LKSSSSFVVECEYCNVEKYCSVDCRRLAASEYHRKSGECFAFETGKDAEKSDPTRYPFD- 134
Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN------------------DTSAAHYL 157
+ RF I AY + + + L + V++N D + ++L
Sbjct: 135 DVPNRFAIRAY-AKLGGWTGLEKTLEEKKKVSENLRKHLNECVAHVPNDYHHDDNKKNFL 193
Query: 158 SSLCPPPASTTIELTAALLAKD----------------RLNAFGLMEPYIEGQDGRRSVR 201
S+LC T AL+ D R NA L + + D +
Sbjct: 194 STLCSN--------TRALMGADEEIVKEEDFVRAAWAMRANAHTLYDYNYDDDDNNNNNN 245
Query: 202 ----------AYGIYLKASF-FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
G++ + S +NH C PNA F+ V + VR+I DV G EI
Sbjct: 246 NKKKRRRKKIGVGLFPEVSRSYNHSCEPNAF-FENV------RLRLRVRLIKDVASGEEI 298
Query: 251 CLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVE 284
+SY + RQ R L Y+F C+C +C+ E
Sbjct: 299 TISYLNDDEKGEKRQHRRELLKKQYNFDCECAKCERE 335
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/287 (21%), Positives = 112/287 (39%), Gaps = 28/287 (9%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK----------TMHSSSS 66
GRGL + + K G + ++ P + Y +L ++ S+ C++C ++
Sbjct: 76 GRGLFAITAFKKGSQIFQEEPFVSYPSLD-VDKSTICNHCLKQLNNNGNNNNDNKSKQQQ 134
Query: 67 ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS---PDAANLPL--DRQLQARF 121
C C +CS KC ++A H C + + N+ + PL R L
Sbjct: 135 QCTKCGERYCSEKCR-SSAEVQHHLASCPTSSNIDNITRYSLVEKRRFPLLAARILSRIL 193
Query: 122 LIAAYNLAVVNPSQFQIL-LAFQGTVTDNDTSAAHYLSSLCPPPASTT---IELTAALLA 177
L ++ + N Q+L A + + + SL ++ ++
Sbjct: 194 LENHFDKTMHNWENLQVLSFAKKDAPLEWKDDYDVFKKSLLTRESNQKRFDFNWFVRIMQ 253
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
LN G+ + + G++ +SFFNH C PN V A ++ +
Sbjct: 254 ILYLNTLGI-DVGSTKPSISSPTSSIGLFFLSSFFNHSCDPN------VYMAFPHDKTAV 306
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ + D+ +G E+ +SY D RQ L +Y F CDC +C E
Sbjct: 307 ITALRDIKKGEELFISYGDSEKDMFDRQTHLFDNYGFNCDCPKCTQE 353
>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 438
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 56/110 (50%), Gaps = 13/110 (11%)
Query: 175 LLAKDRLNAFGLMEPYIEGQ---DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+L ++R NAF YI+G+ G R G+Y++ SFFNH C PN + E
Sbjct: 304 VLLRNRCNAF-----YIQGRPKNGGNGESRGCGVYVRNSFFNHSCNPNVNYW-----VVE 353
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N ++ ++ V +G E+C+SY R+++L Y F C C++C
Sbjct: 354 NTLEVECSLMRAVRKGEELCISYIDTAASLRDRREKLSEGYLFHCRCEKC 403
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 113/306 (36%), Gaps = 57/306 (18%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKA-GQIVLRDSPILLYSALPFINSSSFCHNCFRK----- 59
E V RG GL + + A Q++ P+L+ AL + C+ CFR+
Sbjct: 33 ERVYTKRTADRGNGLFAVDDIAAESQLLFVARPLLV--ALENAKLPTNCYYCFREPGGQG 90
Query: 60 ------TMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
S C C VA FC KC T A S H C+ +RL P
Sbjct: 91 LWQMTERKEDSLKTCSRCKVAKFCDQKCQTEAWSQYHR-LECKLFSRLYPRVLPSTVRAV 149
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT---- 168
+ L+ + ++ P +++ LLA Q D + L
Sbjct: 150 IR-------LLKQHKAGILPPGEWEQLLALQSHQQDLANAGGQRWQDLLIMSQGIKGYSG 202
Query: 169 --------IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL--KASFFNHDCLP 218
+ L+ A++ +N+F L + + G+ L K + NH C P
Sbjct: 203 TDEDDDLILRLSCAVI----VNSFTLSNATFD---------SIGVILHPKPALLNHSCDP 249
Query: 219 NA-CRFDYVDAAAENNT--DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
NA RFD +E +T I V + D+ + EI +SY RQ++L Y F
Sbjct: 250 NAYVRFD----VSETDTLGSISVHALRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFT 305
Query: 276 CDCDRC 281
C C C
Sbjct: 306 CQCHLC 311
>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 117/283 (41%), Gaps = 37/283 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
+GRGL + + L G+++ + ++A+ F + ++ CH+CFR ++ C C A
Sbjct: 17 KGRGLRAAKELNTGEVIFSEPS---FAAVVFDSLATQVCHSCFRH--QANLHRCAQCKFA 71
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY-NLAVVN 132
+C C TA + C A+ +L S + N+ +L AR L + + + +
Sbjct: 72 HYCDRTCQTACWNEHKQE--CGAIKKL---GSAPSENI----RLAARVLWRMHKDTGIAS 122
Query: 133 PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
SQ + Q V D +L +++ + + + N F
Sbjct: 123 DSQLISVDQLQEHVADLAAEDFKQLRADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGF 182
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDIIV 238
L + Q G ++V G++ NHDC PN V +A + I +
Sbjct: 183 TLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVSSALHSQRRIEL 236
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R + +P+G E+ +SY + RQK+L + F C C+ C
Sbjct: 237 RALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHC 279
>gi|195435832|ref|XP_002065883.1| GK20590 [Drosophila willistoni]
gi|194161968|gb|EDW76869.1| GK20590 [Drosophila willistoni]
Length = 661
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/328 (22%), Positives = 135/328 (41%), Gaps = 72/328 (21%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S ++ V + E + +GR +V+ LK G +VL ++P+ F ++ CH+CF++
Sbjct: 250 LSGAAQGVKLVETKEKGRFVVANGGLKTGDVVLCENPVAACLLPNFFGTN--CHHCFKRL 307
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNS 105
+ + C CS VAFCS +C A+ S H +C R+ S
Sbjct: 308 RNPVA--CLHCSGVAFCSAQCMGEASESYHRFECEFMDLLIGSGMSILCFIALRVFTQAS 365
Query: 106 P-----DAANLPLD--------RQ----LQARFLIAAYNLAVVNPSQFQILLAFQGTVTD 148
+ ANL + RQ LQ R L+A + L ++ +Q+ F T+
Sbjct: 366 SLEDGLNTANLLFEHLCSHEDVRQADDYLQ-RSLMAGFLLRILQKAQY-----FGRRKTE 419
Query: 149 NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI------EGQDGRRSVR- 201
P + +++ +LL ++ + E Y +G +++
Sbjct: 420 GVN------------PTAVELQVATSLLGLLQVLQYNAHEIYQTLVTDDHRFEGSKTIHV 467
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NY 259
G+Y S+FNH+C P+ + +V +++R + ++Y P+
Sbjct: 468 GAGLYGTGSYFNHECWPSVASY-FV------GKKLVLRTTKPHRPNEVVAVNYGPIFTTM 520
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANW 287
RQ+ L Y F+C+C C + NW
Sbjct: 521 HLKERQRSLRGRYAFSCNCMAC--QENW 546
>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
Length = 389
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 22/193 (11%)
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+Q+ L Y P F+ LLA GT +++ + ++A
Sbjct: 211 QQMMKEALNTEYTAHWFTPDGFRSLLALVGTNGQGIGTSSF---------SRWVKNVSAL 261
Query: 175 LLAKD-RLNAFGLMEPYIEGQDGR----RSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
L KD R+ L+E + D + G+Y+ S NH C+PNA V+
Sbjct: 262 ELPKDERIYIDKLIEKIYDAMDEAVGPFLNNEGSGLYVLQSSVNHSCVPNA----VVEFP 317
Query: 230 AENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
NNT ++++ I D+ G EIC SY + +RQ+ L Y F C CD+C+ +AN
Sbjct: 318 YSNNT-LVLKAIRDIKVGEEICTSYLDECQLERSRHSRQQALSSLYLFVCHCDKCQAQAN 376
Query: 287 WSDNDNDDENNEE 299
D +D+E++E+
Sbjct: 377 DPDVTSDEESDED 389
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 75/183 (40%), Gaps = 46/183 (25%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFR---------KTMH 62
+G+GL S + + G ++L + P++ F+ +S F C +C R + +
Sbjct: 21 KGKGLFSNRVFREGDVILEERPVV---CCQFLWNSEFGYVACDHCLRPLETAEENVRRLT 77
Query: 63 SSSSI-----------------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS 105
+SS CP C V +CS +C T A HS QA R+ + +
Sbjct: 78 GNSSFVVPHKECCEVNKSLITSCPLCGVKYCSVECQTEAFQRYHSTLCLQA--RVKDESH 135
Query: 106 PDAANLPLDRQLQ-----ARFLIAAYNLAVVNPS--QFQILLAF----QGTVTDNDTSAA 154
P +Q+ A ++ A +AVVN + + + L F TV+++
Sbjct: 136 PLNHLCETWKQMHYPPESASIMLLARMVAVVNQANDKEEALAMFSQFCHRTVSNDTHEIV 195
Query: 155 HYL 157
HYL
Sbjct: 196 HYL 198
>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
Length = 548
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 118/291 (40%), Gaps = 31/291 (10%)
Query: 3 AVSETVTVAEIEGR---GRGLVSTQSLKAGQIVL-RDSPILLYSALPFINSSSFCHNCFR 58
AV++ + V +G GR + +T+ K G ++ + P ++ P + +C++C +
Sbjct: 2 AVNDGIEVRGTKGGDKGGRSIHATRRFKPGDVIACFEEPAVVLP--PGHRALEYCNHCLK 59
Query: 59 KTMHSSSSI--CPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPLD 114
K + C C +VA+C P C A S H C+A+ RL + D A +P
Sbjct: 60 KQQPGGPKLRACTGCKTVAYCGPACQRANWSLVHK-LECKAIQRLHEIKPADEPAWVPTP 118
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS-TTIELTA 173
+ A+ ++ + LA + + + + TD + + L + +E
Sbjct: 119 IRAAAQVMLRPHVLAQFGELEGHVEQWRKKSETDLQLQSHGVVRCLGLDMGTFEALEAAF 178
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAE 231
+L K + NAF E Y E G++L + NH C+PNA V
Sbjct: 179 QVLCKLQTNAFSRSEEYYE---------TGGVFLDTTLAMINHSCVPNA----LVQFGGR 225
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
T IH G EI +SY S RQ L Y F C C +C+
Sbjct: 226 TATLRSASFIHP---GSEIEISYIDQTQPKSRRQHELSL-YHFECHCSKCQ 272
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/288 (22%), Positives = 114/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 2 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKRQ--EK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ + + N N AA + + +
Sbjct: 57 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 114
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 115 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 227
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 228 RIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 275
>gi|392590224|gb|EIW79553.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 410
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 24/148 (16%)
Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLME-------------PYIEGQDGR----R 198
+L + PPP + + LLA++ NAFG+ E E +D R R
Sbjct: 270 FLGAALPPPLRAHVGVVRQLLARETGNAFGIWEGDQLPEGRISIGDSEKEAEDTRDMRER 329
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
+ +G+++ AS FNH C PN + + + + DV +G E+ +SY +
Sbjct: 330 EMFGWGVWVSASLFNHSCAPNVRK-------KRSGRSLEFFTLRDVKEGEELRISYASTD 382
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
R++ L + F C C RC E++
Sbjct: 383 KPVGERREALRTSWFFDCMCSRCVRESS 410
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/288 (22%), Positives = 114/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKRQ--EK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ + + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 280
>gi|170056265|ref|XP_001863952.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876021|gb|EDS39404.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 526
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 115/281 (40%), Gaps = 37/281 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
GR L+++++LKAG+++++ P+ + P+ S C C R + ++ CP+C+
Sbjct: 56 GRYLIASRNLKAGEVIIQVEPL---AVGPWAESDPVCLGCHRTFEVGAKTVRCPTCNWRI 112
Query: 76 CSPKCSTAAASSSHSPYVC-----QALTRLLNL-NSPDAANLPLDRQLQARFLIAAYNLA 129
CSP C+ ++ HS C + +LL +S L + L R L+ L
Sbjct: 113 CSPTCT--GLTTHHSRLECIPLKEHGVAKLLETCSSATQIKLAYEAILALRCLL----LK 166
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
P +++ LL D ++SL ++ + + + F E
Sbjct: 167 RTAPDRYEKLLEM-----DAMNERRQQITSLWKRNQEAIVQ---RIRDEWKFEEFSEAEV 218
Query: 190 Y-----IEG---QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
+ IE Q G V A +Y +A + HDC PN D + ++ VR
Sbjct: 219 HTVCGIIEVNAFQIGPTEVHARALYPEAFYIMHDCTPNTTHTD-----QPGSHELTVRPT 273
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
D+ G + LSY R++ L F C C RC+
Sbjct: 274 RDLKAGEPLTLSYSYTLQGTLKRRQHLCDSKFFWCQCPRCE 314
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 114/300 (38%), Gaps = 66/300 (22%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSC-SVA 74
+GR V++ +L+AG VLR + A+ SSS C CF + S C C V
Sbjct: 12 KGRCNVASVALRAGSCVLRTPA---FCAV----SSSSCGWCFAPQL--SLQRCTGCRQVQ 62
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-----------------LDRQL 117
+CS +C S CQA + P A LP +D++
Sbjct: 63 YCSRRCQKLDWSQHRRE--CQAWRSI-----PVDATLPTVLLVCRLAAKLFLSSKVDQED 115
Query: 118 QARFLIAAYNLA-VVNPSQFQ----------ILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
+ R L ++L P Q Q +L ++ + S A SL P
Sbjct: 116 KNRILNLRHHLDDHTAPKQQQFCETTPLVHLLLSEYKVDKQEQTPSFAELQESLKPE--- 172
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
L + N F +M G G++L+ S FNHDC PN C +
Sbjct: 173 -----ILKLFGQVSCNGFSIMNGVTNEPVG------IGLFLQGSMFNHDCDPN-CVVSF- 219
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
++ V +I DV +G+E+ +SY V RQK L Y F C C RC E
Sbjct: 220 -----QGQEMNVHVIKDVKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTTETT 274
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 114/283 (40%), Gaps = 28/283 (9%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
E V + + +GRGL +++ G++V + YSA+ F +S +F CH CFR+
Sbjct: 6 KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
+ C C A +C+ C TA C A+ + S AA L L R +
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
L + L V+ Q + Q + T ++E + +
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
+ N F L + Q G ++V G++ + NHDC PN A N+ I V
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNC-------TAILNHGKIEV 223
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R + + +G E+ +SY + RQK+L + F C C C
Sbjct: 224 RTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 266
>gi|395331801|gb|EJF64181.1| hypothetical protein DICSQDRAFT_178674 [Dichomitus squalens
LYAD-421 SS1]
Length = 488
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 58/119 (48%), Gaps = 13/119 (10%)
Query: 168 TIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
T+ L D N+FG+ E P ++ + + +Y ASFFNH C PN V
Sbjct: 374 TVAGVRTALGIDPGNSFGVWEVPLMDESE----CLGFAVYPIASFFNHHCSPN------V 423
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
E T V V +G E+C+SY V D++TRQ+ LL + F+C C RCK E
Sbjct: 424 RKEREGRTLRFV-TTRTVEEGEELCISYGHVEGMDWATRQQELLEGWYFSCRCSRCKSE 481
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/335 (21%), Positives = 134/335 (40%), Gaps = 34/335 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
ETV + + G+GRGL +T+ AG ++ ++ I +++ +L + CH+CFR+
Sbjct: 2 ETVAIFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSL----AERICHSCFRR--QE 55
Query: 64 SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQA 119
C C A +C C A A +A ++ N N A + LD++
Sbjct: 56 KLQRCGQCKFAHYCDRTCQRAGWAEHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGST 115
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
++ L ++ + I + + + ++L TI+ + +
Sbjct: 116 ---VSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVI 172
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENN 233
N F + Q G ++V G++ NHDC PN V+ +
Sbjct: 173 NCNGFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQ 226
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANW 287
I +R + + +G E+ ++Y RQ+ L Y F C C+ CK ++
Sbjct: 227 RRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKNRIKDDIKIGG 286
Query: 288 SDNDNDDENNEEVMD-EDQDEQMVASDDDAEVHGD 321
+ D + E+V + D QM+ + A ++GD
Sbjct: 287 REEDGVKPSEEQVKEATDYCFQMLEKMEKARLNGD 321
>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
Length = 485
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/304 (21%), Positives = 116/304 (38%), Gaps = 50/304 (16%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI----- 67
+ RGL +T +LKAG ++ D P+++ L + S C C HS +I
Sbjct: 118 KQRGLFATNALKAGDMIFEDLPLVVVPPMDKLVLMQSGKACPLCGSSVSHSQHTIIMNGL 177
Query: 68 -CPSCSVAFCSPKCST---AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C +CS C S H+ C R +NLN + + +
Sbjct: 178 DCGECGAVWCSKNCKKNDLTHTSLRHTRSRC----RQINLNGWKK----YESFCKQYVFV 229
Query: 124 AAYNLAVVNPS-------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
AAY++ ++ S +FQ+L + + + + + +L + T +
Sbjct: 230 AAYSIGIIYASMILDKNGKNGVHPKFQLLADVSQRIRKDSSDSTNLGGTLDASSGAFTSD 289
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG----------IYLKASFFNHDCLPNA 220
+ + L+ F P ++ D + G I+ SF NHDC PN
Sbjct: 290 DPEPMWTQ-ALDLFKEAFPELDDMDMEAYLNCIGKFNINQVSGQIFPVYSFINHDCEPNV 348
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
R++ D + + + +G E+ +Y + R++ L ++ F C CDR
Sbjct: 349 -RYEIDDKLR-----LKLYARKHIKKGEELLTTYVNPLHGVKLRRRELRVNWGFLCHCDR 402
Query: 281 CKVE 284
C E
Sbjct: 403 CNRE 406
>gi|344234777|gb|EGV66645.1| SET domain-containing protein [Candida tenuis ATCC 10573]
Length = 355
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 135/355 (38%), Gaps = 87/355 (24%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFR-- 58
+S +S V+E + G+G S +S+ G +L P+ + F CH CF
Sbjct: 18 LSQISPFFQVSETKYGGKGCFSYESIPRGTDILVSKPLGFTISKDF--KKEVCHKCFNYD 75
Query: 59 --KTMHSSSSI--CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS--------- 105
KTM S+ + S+ FCS KC +A S + ++L L L +
Sbjct: 76 NGKTMKFRLSLPTFKNQSICFCSEKCLSAFKSMKYYEVYLRSLLNLDKLYTTFCTEEMAE 135
Query: 106 -PDAANLPLDRQLQARF-LIAAYNLAVVN--PSQFQILL-----------AFQGTVTDND 150
P A ++ ++ + ++ +V+ P+++Q L+ + V N
Sbjct: 136 VPVAGVSEFIHTIKGKWEQVEQWDASVLKTKPTKWQKLIPKITETEYLEIKYVLGVLHNQ 195
Query: 151 TSAAHYLSSL--------------------------------CPPPASTTIE-LTAALLA 177
T A + S L PP I L L+
Sbjct: 196 TDAFVFESRLFDLLQSNEISKVLKFKNLLVSYITIFKFIRLTSPPQFQKFINPLDIRLII 255
Query: 178 KDRL-NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
+ L NAFG+ +G + +G+Y SFFNH C PN + N ++
Sbjct: 256 GNNLTNAFGIWTL----NEGDKDFLGFGVYPSGSFFNHSCTPNIIK-------RRINNEL 304
Query: 237 IVRMIHDVPQGREICLSYF-----PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ D+ + E+C+ Y PVN +++ L ++ F C C RC++EA
Sbjct: 305 HFLTLADIQKNDELCIDYGNYLEEPVNI-----RRKELSEWFFICTCKRCEMEAQ 354
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 112/289 (38%), Gaps = 29/289 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
E V V + G+GRGL +T+ AG ++ + P ++++ F +SS CH+CFR+
Sbjct: 2 EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
C C A +C C A C A+ P + N+ L ++ R
Sbjct: 57 LQRCGQCRFAQYCDKTCQRAGWEEHKLE--CAAIK---TYGKPPSENVRLAARILWRMDK 111
Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
+++ L + + I + + D ++L T++ + +L
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
N F + Q G ++V G++ NHDC PN +D +
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHS 225
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I +R + + G E+ ++Y + RQ+ L Y F C C C
Sbjct: 226 QKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHC 274
>gi|170096156|ref|XP_001879298.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645666|gb|EDR09913.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 367
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 57/122 (46%), Gaps = 11/122 (9%)
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
+T E A+LA+D N FG+ + + D + +G+Y+ S+FNHDC PN +
Sbjct: 246 STPETVRAILARDHGNVFGIWDTAPD--DEESEMLGWGLYVSGSYFNHDCSPNLKK---- 299
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ I HDV E+C+SY V R+ L D+ F C C RC E
Sbjct: 300 ---RRHKRGINFYTTHDVNADEELCISYVDVCPGASVINRKTALENDWFFVCQCARCARE 356
Query: 285 AN 286
+N
Sbjct: 357 SN 358
>gi|195160613|ref|XP_002021169.1| GL24960 [Drosophila persimilis]
gi|194118282|gb|EDW40325.1| GL24960 [Drosophila persimilis]
Length = 661
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 133/315 (42%), Gaps = 46/315 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S + V + E + +GR +V+++ LK G +VL ++P+ F + CH+CF++
Sbjct: 250 LSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPSFFGTH--CHHCFKR- 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H+ + C CS +AFCS +C A S H + C+ + L+ + L QA
Sbjct: 307 LHTPVA-CLHCSGIAFCSAQCMGEACESYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 364
Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
+ +A NL N Q ++ F + + P
Sbjct: 365 ASVDDGLATANLLFENLCSHEDVRQAEDYLQRSLMAGFLMRILQKAQYFGQRKTEGVNP- 423
Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
T +EL A LL + NA + + + + +G ++V G+Y S+FNH+
Sbjct: 424 --TAVELQVATALLGLLQVLQYNAHEIYQTQVTDEHRFEGSKTVHVGAGLYGTGSYFNHE 481
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRLLYDY 272
C P+ + +V +++ + ++Y P+ + Y+ RQ+ L Y
Sbjct: 482 CWPSVASY-FV------GKKLVLAATKPHRPNEVVAVNYGPI-FTYTNLKERQRSLRGRY 533
Query: 273 DFACDCDRCKVEANW 287
F+C C C + NW
Sbjct: 534 SFSCSCMAC--QENW 546
>gi|125978717|ref|XP_001353391.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
gi|54642149|gb|EAL30898.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
Length = 661
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 133/315 (42%), Gaps = 46/315 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S + V + E + +GR +V+++ LK G +VL ++P+ F + CH+CF++
Sbjct: 250 LSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPSFFGTH--CHHCFKR- 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H+ + C CS +AFCS +C A S H + C+ + L+ + L QA
Sbjct: 307 LHTPVA-CLHCSGIAFCSAQCMGEACESYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 364
Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
+ +A NL N Q ++ F + + P
Sbjct: 365 ASVDDGLATANLLFENLCSHEDVRQAEDYLQRSLMAGFLMRILQKAQYFGQRKTEGVNP- 423
Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
T +EL A LL + NA + + + + +G ++V G+Y S+FNH+
Sbjct: 424 --TAVELQVATALLGLLQVLQYNAHEIYQTQVTDEHRFEGSKTVHVGAGLYGTGSYFNHE 481
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRLLYDY 272
C P+ + +V +++ + ++Y P+ + Y+ RQ+ L Y
Sbjct: 482 CWPSVASY-FV------GKKLVLAATKPHRPNEVVAVNYGPI-FTYTNLKERQRSLRGRY 533
Query: 273 DFACDCDRCKVEANW 287
F+C C C + NW
Sbjct: 534 SFSCSCMAC--QENW 546
>gi|241948289|ref|XP_002416867.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223640205|emb|CAX44454.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 378
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 64/131 (48%), Gaps = 10/131 (7%)
Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
++ C P T I+ T +++ K+ NAFG+ + + + +G+Y ASFFN
Sbjct: 253 FIKLTCTPELQTFIDSTTIRSIIGKNLSNAFGIWSEVTNPSEDKEFL-GFGVYPSASFFN 311
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
H C PN + NNT+++ D+ G E+C++Y + +++ L ++
Sbjct: 312 HSCSPNIVK-------TRNNTEMVFTTSKDIEVGEELCINYGNYTDEPVELRQQQLKEWF 364
Query: 274 FACDCDRCKVE 284
F C C +C+ E
Sbjct: 365 FDCACTKCQTE 375
>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
Length = 391
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 88/388 (22%), Positives = 145/388 (37%), Gaps = 117/388 (30%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFR---------KTMH 62
RGR L +++ K G + + P++ L++A S + C +C R + +
Sbjct: 17 RGRALFASRDFKEGDTIFEEDPLVCSQFLWNA---AYSYTACDHCMRSLETAQDMARRLS 73
Query: 63 SSSSI--------------------CPSCSVAFCSPKCSTAAASSSH--------SPYVC 94
S+ ++ CP C+VA+CS C A H +P
Sbjct: 74 SNPTLELPYSAECCAVTKAGEPISYCPQCNVAYCSENCRIKALDQYHRILCLGTSTPDPN 133
Query: 95 QALTRLL----NLN-SPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN 149
L +L N++ P+ AN+ L ++ A L A + V F F V +
Sbjct: 134 HPLVKLQETWKNIHYPPETANIMLIARIMATILQATN--SDVKKGSFS---HFCSNVVNK 188
Query: 150 DTSAAHYLSSLCPPPASTTIE-LTAALLAKDRLNAFGLMEPY--------IEGQ------ 194
+ AH L L I L + + DRL + E + GQ
Sbjct: 189 EQQIAHKLLGLHFQEQLDMIRILLSEAMYDDRLEQWFTPEGFSSLFALVGTNGQGIGTSS 248
Query: 195 ----------------DGRRSVRAY---------------------GIYLKASFFNHDCL 217
D R+++ + G+Y S NH C
Sbjct: 249 LSLYVHNIDSYPALSDDERQAIDIFLNQLYEEMERVSGQFLNCEGAGLYALQSSCNHSCA 308
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY---DYDF 274
PNA +NN+ ++++ +H + G EIC+SY +R RL Y +Y F
Sbjct: 309 PNA-----EVTFPKNNSTLVLKALHPIKNGEEICISYLEECQRERSRHSRLKYLRENYIF 363
Query: 275 ACDCDRCKVEANWSD---NDNDDENNEE 299
C C +C+++AN D +D DDE+++E
Sbjct: 364 DCTCTKCELQANEPDESSSDGDDEHDDE 391
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 114/292 (39%), Gaps = 33/292 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF---INSSSFCHNCFRKTMH 62
E V V EG+GRGL +T+ A +V + YSA+ F IN CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVVFAERA---YSAVVFDSLINV--VCHTCFKR--Q 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A +C C A + + C A+ R + P+ N+ L ++ R
Sbjct: 60 EKLHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAGRIMWRV 114
Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALL 176
L +V+ Q + G + +L P +++ + +
Sbjct: 115 EREGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 175 GVINCNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMF 228
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 229 HTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHCQ 280
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 119/286 (41%), Gaps = 34/286 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
E V + EG+GRGL + + AG ++ + Y+A+ F + + CH CF++ +H
Sbjct: 7 ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 63
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A +C C AA S + C A+ R + +P N+ L ++ +
Sbjct: 64 R----CGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKM 114
Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALL 176
L+ +V + Q + G D A +L P ++L + +
Sbjct: 115 EREGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIF 174
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
N F L + Q G ++V GI+ NHDC PN C + NN I
Sbjct: 175 GVISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVVF------NNGKI 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+R ++ + G E+ +SY R+++L Y F C C+ CK
Sbjct: 222 ELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 267
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/326 (21%), Positives = 130/326 (39%), Gaps = 29/326 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + + G+GRGL + + AG ++ ++ + +A+ F + + CH+CFR+
Sbjct: 2 ENVVIFDSPGKGRGLKAAKEFWAGDVIFSEASL---AAVVFDSLAERVCHSCFRR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C A A +A ++ N N A + L R +
Sbjct: 57 LQRCGQCKFAHYCDRTCQRAGWAEHKQECSAIKAYGKVPNENIRLVARI-LWRLDKDGST 115
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
++ L ++ + I + + + ++L T++ + + N
Sbjct: 116 VSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDISHIFGVINCN 175
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
F + Q G ++V G++ NHDC PN N+ I +R +
Sbjct: 176 GFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELRSLG 222
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDNDNDDEN 296
+ +G E+ ++Y RQ+ L Y F C C+ CK ++ + D +
Sbjct: 223 KIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREVDGVKPS 282
Query: 297 NEEVMD-EDQDEQMVASDDDAEVHGD 321
E+V + D QM+ D A ++GD
Sbjct: 283 EEQVKEATDYCYQMLEKMDKARLNGD 308
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 114/292 (39%), Gaps = 33/292 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF---INSSSFCHNCFRKTMH 62
E V V EG+GRGL +T+ A +V + YSA+ F IN CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVVFAERA---YSAVVFDSLINV--VCHTCFKR--Q 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A +C C A + + C A+ R + P+ N+ L ++ R
Sbjct: 60 EKLHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAGRIMWRV 114
Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALL 176
L +V+ Q + G + +L P +++ + +
Sbjct: 115 EREGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 175 GVINCNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMF 228
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+++L Y F C C+ C+
Sbjct: 229 HTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHCQ 280
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 110/286 (38%), Gaps = 21/286 (7%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V + EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEIFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCST-AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C A + H + ++ N N AA + + + L
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKHECSAIKRYGKVPNENVRLAARIMWRVEREGTGL 121
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
L V+ Q + + + +L P +++ + + N
Sbjct: 122 TEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN 180
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNTDI 236
F L + Q G ++V GI+ NHDC PN + V + I
Sbjct: 181 GFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRI 234
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+R + + G E+ +SY R+++L Y F C C+ C+
Sbjct: 235 ELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|402219261|gb|EJT99335.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 445
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 125/314 (39%), Gaps = 55/314 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI---CP 69
+G+GLV+ + G+ + R+ P + + S + C C + + ++ CP
Sbjct: 110 KGKGLVAKLPIAEGEALWREDPFVHAADWDVFDLQQSGNACQYCALPFIPAPPTLAVSCP 169
Query: 70 S----CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
+ C+ AFCS C T A HS L R N S D + Q +A +A
Sbjct: 170 AISHGCNAAFCSRLCLTRAQQGGHS-----LLCRGRNPPSKDLLDFARKEQWRAVIALAV 224
Query: 126 YNLAVVNP-----SQFQILLAFQGTVT--DNDTSAA---------------------HYL 157
++ S+ Q +L GT D + A H++
Sbjct: 225 AVAKIMTAFERSDSEGQDMLEAWGTFATLDQEDRAKTLPEWEFTQERHRALWNMDYDHFV 284
Query: 158 SSLCPP----PASTTI-ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA---YGIYLKA 209
++LCPP A T + ++ L++D ++ G GR S+ G++L
Sbjct: 285 TTLCPPETDKAAQTRLRKIVKKTLSEDVRTELFSYRGFLRGL-GRVSLNLDSHGGLFLLH 343
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMI--HDVPQGREICLSYFPVNYDYSTRQKR 267
S NH C+PN A A+ + ++ D+ G E+ ++Y + Y R+K
Sbjct: 344 SHMNHSCIPNMAVKHPPAAQAQKYPSSRIALVANRDIQPGEELTVTYVNPTWPYR-RRKE 402
Query: 268 LLYDYDFACDCDRC 281
L ++ CDC+RC
Sbjct: 403 ELKEWGVECDCERC 416
>gi|302676884|ref|XP_003028125.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
gi|300101813|gb|EFI93222.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
Length = 427
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 118/316 (37%), Gaps = 66/316 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI-CPS- 70
+G+GLV+ +K G+++ ++ P ++ ++A + C C H+ + C +
Sbjct: 91 KGKGLVAVDPIKEGEVIWKEDPFVMAPEWNAFEMQIQGTVCWQCCTPMQHTGREVSCRAH 150
Query: 71 ------CSVAFCSPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLD--RQLQ 118
C FC+ C S P +C A ++LL + D + L QL
Sbjct: 151 TSSANYCPARFCNRLC--LVRSDKTHPLLCPAQNPASSQLLRY-AHDKQWMALHGYAQLV 207
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ-------------GTVTDNDTSAA---HYLSSLCP 162
AR ++A S+ I+ AF G D +T A H L++
Sbjct: 208 ARIILAYQGDEETFRSELDIMRAFAEIDVETRHRNTNPGLEPDKETWKAGHKHLLNTFVD 267
Query: 163 PPAS-------------TTIELTAALLAKDR--LNAFGLMEPYIEGQDGRRSVRAYGIYL 207
PP + T EL L D L G M E R+ G+Y+
Sbjct: 268 PPTADGKKKLKKIIRKPLTAELEKELFDYDTAFLKGLGRMVLNFE--------RSGGLYV 319
Query: 208 KASFFNHDCLPNAC--RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQ 265
S NH C+PN FD + N I + D+ G E+ +SY Y RQ
Sbjct: 320 LHSHMNHSCVPNISVRHFD----KSTNWARITMVAKKDLAPGEELMISYVDPEAPYKARQ 375
Query: 266 KRLLYDYDFACDCDRC 281
L + F C C RC
Sbjct: 376 AE-LEQWGFKCVCPRC 390
>gi|170042501|ref|XP_001848962.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167866038|gb|EDS29421.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 667
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 118/317 (37%), Gaps = 70/317 (22%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS- 72
+GR +V+ Q LK + ++ + + LY+ S C+ CF K + + CP C+
Sbjct: 271 KGRYIVAAQDLKPAEPIVVEPAVGACLYTKF----FGSHCNACFAKLV--APVACPECAG 324
Query: 73 VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLPLDRQLQ 118
VAFCSP+C A S H +C RL+ + L +L+
Sbjct: 325 VAFCSPECRDRACSGYHRFECHYLDLLIGSGMSVLCHLALRLVTQAGTPEKAIELGAELK 384
Query: 119 ----------------ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
R L+AA+ L + ++F T AA P
Sbjct: 385 QTLCAHEDRREPSDYFQRTLMAAFLLRCLQKAEF---------FGRRKTEAAE------P 429
Query: 163 PPASTTI-ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHDCL 217
P + + ALL + NA + E I G+ S + G+Y ++ FNH+C
Sbjct: 430 TPLEAQVGGVILALLQSLQFNAHEIYETKISGEHRIDSAKVQYVGVGVYRTSAMFNHECY 489
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY--FPVNYDYSTRQKRLLYDYDFA 275
P R T + + P G I +Y + + + RQ+ L Y F
Sbjct: 490 PGVSRTFL-------GTTMCLYTSRPFPAGATIPENYGMHFIRHPAAVRQRTLRSRYWFG 542
Query: 276 CDCDRCKVEANWSDNDN 292
C+C C + NW D
Sbjct: 543 CECRAC--QENWPLMDK 557
>gi|297739785|emb|CBI29967.3| unnamed protein product [Vitis vinifera]
Length = 660
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/294 (22%), Positives = 123/294 (41%), Gaps = 37/294 (12%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ E GRG+ +T+ + +G ++ P++ + +L I+S C+ C RK +SS
Sbjct: 380 IRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHS--VCYFCLRKLKPVTSS- 436
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLDRQLQARF 121
C+V FCS +C + V A+ R + ++ D PL + A
Sbjct: 437 -EDCNVRFCSQECEEQSK-------VFVAVERKADWSAYDDYCRTRGLKYPLLVKRLACM 488
Query: 122 LIAAYN----LAVVNPSQFQI-LLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
+++ L ++ P+ +++ G SA + A T + +L
Sbjct: 489 VVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQWYINVL 548
Query: 177 AKDRLNAFGL---------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
A+ R+N+F + + + +Y+ SF+NHDC PN ++D
Sbjct: 549 ARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNV-HIIWID 607
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N + ++ + ++ G E+ + Y + D+ RQ L + F C C RC
Sbjct: 608 -----NVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRC 656
>gi|195012821|ref|XP_001983754.1| GH16068 [Drosophila grimshawi]
gi|193897236|gb|EDV96102.1| GH16068 [Drosophila grimshawi]
Length = 662
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 130/314 (41%), Gaps = 44/314 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S + V + E + +GR +V+ + LK G +VL ++P+ F ++ CH+CF++
Sbjct: 251 LSGAANVVKLVETKDKGRFVVANEGLKTGDVVLSENPVAACLLPNFFGTN--CHHCFKR- 307
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H+ + C CS +AFCS +C A + H + CQ + + + L QA
Sbjct: 308 LHTPVA-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCYIALRIFTQA 365
Query: 120 RFL---IAAYNLA---------VVNPS---QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
L + NL V P Q ++ F + H + P
Sbjct: 366 ASLEDGLQTSNLLFEHLVSHQDVRQPDDYLQRSLMAGFLMRILQKAQYFGHRKTEGVNP- 424
Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
T +EL A LL + NA + + + +G + + G+Y S+FNH+
Sbjct: 425 --TAVELQVATSLLGLLQVLQYNAHEIYHTMVTDEHCFEGCKVIYLGAGLYGTGSYFNHE 482
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYD 273
C P+ + YV +++ + ++Y P+ + RQ+ L Y
Sbjct: 483 CWPSTAGY-YV------GKRLVMAATKPHRPNEIVAVNYGPIFSKMNLKERQRSLRGRYA 535
Query: 274 FACDCDRCKVEANW 287
F+C+C C + NW
Sbjct: 536 FSCNCMTC--QENW 547
>gi|238879311|gb|EEQ42949.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 379
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 16/134 (11%)
Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
++ C P I+ T +++ K+ NAFG+ + + + + +G+Y ASFFN
Sbjct: 253 FIKLTCTPELQAFIDSTTIRSIIGKNLSNAFGIWSEVTDPSEDKEFL-GFGVYPSASFFN 311
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLY 270
H C PN + NN++++ D+ G E+C+SY NY RQK+L
Sbjct: 312 HSCSPNIVK-------TRNNSEMVFTTSKDIEIGEELCISY--GNYTDEPVELRQKQLK- 361
Query: 271 DYDFACDCDRCKVE 284
++ F C C +C+ E
Sbjct: 362 EWFFDCACTKCQTE 375
>gi|68484599|ref|XP_713759.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|68484668|ref|XP_713725.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46435236|gb|EAK94622.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46435271|gb|EAK94656.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
Length = 379
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 16/134 (11%)
Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
++ C P I+ T +++ K+ NAFG+ + + + + +G+Y ASFFN
Sbjct: 253 FIKLTCTPELQAFIDSTTIRSIIGKNLSNAFGIWSEVTDPSEDKEFL-GFGVYPSASFFN 311
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLY 270
H C PN + NN++++ D+ G E+C+SY NY RQK+L
Sbjct: 312 HSCSPNIVK-------TRNNSEMVFTTSKDIEIGEELCISY--GNYTDEPVELRQKQLK- 361
Query: 271 DYDFACDCDRCKVE 284
++ F C C +C+ E
Sbjct: 362 EWFFDCACTKCQTE 375
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 129/312 (41%), Gaps = 40/312 (12%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + E EG+GRGL + + AG I+ + YSA+ F N S S CH+CF++
Sbjct: 2 ENVEIFESEGKGRGLRAIRESWAGDIIFAEPA---YSAVVFDNLSHSVCHSCFKR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C + ++ + C A+ + +P+ N+ L ++ R
Sbjct: 57 LLRCGQCKFAHYCDRTCQKESWANHKNE--CVAIKKA--GKAPN-ENIRLAARILWRIER 111
Query: 124 AAYNLA---VVNPSQFQILL-----AFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
L +V+ Q + A +G + ++ Y S I ++
Sbjct: 112 EGSGLTEGCLVSIDDLQNHIDKFDEAEKGLLMEDVQKFLEYWPSQSQQFGMQYISHIFSV 171
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
++ N F L + Q G ++V GI+ NHDC PN C + NN
Sbjct: 172 IS---CNGFTLSD-----QRGLQAV-GVGIFPNLCLANHDCWPN-CTVIF------NNGK 215
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN----WSDND 291
I +R + + +G E+ +SY R+ +L Y F C C+ C + + D
Sbjct: 216 IELRALGKINKGEELTVSYVDFLNLTEDRKAQLKKQYYFDCTCEHCTKKTKDALLLAVKD 275
Query: 292 NDDENNEEVMDE 303
+D+ E V+ E
Sbjct: 276 GEDKPEERVVKE 287
>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
Length = 589
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 110/296 (37%), Gaps = 43/296 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
RGRG+ + +++ AG VL +ALP +FC C S C C
Sbjct: 134 RGRGIFAVEAVPAGTEVLVAKQ---AAALPRDRYRPAFCRRCLTLLDKSLLIKCRRCEDR 190
Query: 75 FCSPKCSTAAASSSHSPYVCQALT-----RLLNLNSPD-AANLPLDRQLQARFLIAAYNL 128
FC +C AAA C + +L L D AA D L
Sbjct: 191 FCGKECVIAAAGEGTHEVTCGLVEGLGEEQLSELGRGDNAAATETD----------ILRL 240
Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
V S+ L+ + + D T+E A+ RLN GL
Sbjct: 241 TVECMSRRGAGLSDEEEWIEIDDLVLPEDEGGGEGSGGGTLEPDVIREAQARLNDGGLEV 300
Query: 189 PYIEGQDGRRSVR------------------AYGIYLKASFFNHDCLPNACRFDYVDAAA 230
E Q R +R G++ AS NH C PNAC F A
Sbjct: 301 SAEEIQTVHRRLRRNFHTVQPEPLGRLLPPTVVGVFPAASLVNHSCEPNAC-FHSRRAGP 359
Query: 231 EN-NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYD-YDFACDCDRCK 282
E ++ +R DV G E+C+SY D +T++ R LL + + F+CDC RC+
Sbjct: 360 EGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPRCE 415
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 121/290 (41%), Gaps = 37/290 (12%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
S ++V+ +GR L++ ++ G+++L+ P + AL + + C C+R++ S+
Sbjct: 16 SHNLSVSVSPAKGRCLIANRAFHRGEVILQQEPYV--CALDDSSQALRCDRCYRQS--SN 71
Query: 65 SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLN--LNSPD-----AANLPLDRQ 116
C +C F C C + + C L RLL SP + R+
Sbjct: 72 LKRCSACKTVFYCCANCQRSGWGLH--KFECAILVRLLKEKKRSPTPSLRLVMRFLIKRR 129
Query: 117 LQA-RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP---PPASTTIELT 172
LQA R + A AV N ++L +++ ++S+ P ++
Sbjct: 130 LQAERVQLKA---AVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMAPDEVNVKEV 186
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAE 231
++ + NA + + + G+Y S NH C+PNA FD AA
Sbjct: 187 TQMICRFACNAHTICDEEVRPL-------GTGLYPVISIVNHSCVPNAVLHFDGNRAA-- 237
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+R + D +G EI +SY + +TR+K L Y F C+C RC
Sbjct: 238 ------LRALEDTQEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRC 281
>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 428
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 62/273 (22%), Positives = 102/273 (37%), Gaps = 39/273 (14%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
V E G+GRG+ + V+ L+ L ++ +C CF K + +S I
Sbjct: 6 VKESTGKGRGVFLNTDITEKTEVVHSK--LIGGCLNYVEWKFYCEYCF-KELKENSKISC 62
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
C +CS +C A H C + L + PD L L+ ++
Sbjct: 63 GCGFNYCSMECQKKAYDEYHKN-ECSIIKSLREI--PDGIGEIL--------LLYRCSIK 111
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
V + L N+ A Y L +S + + A N+F L
Sbjct: 112 TVKWEDYCSLPNI------NNYKAVEYCVLLFKSNSSLLKTIWEVIQA----NSFCLT-- 159
Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
+G V G++ ASF NH C PN +N ++ ++ + + G E
Sbjct: 160 -----NGEEQVIGIGLFDYASFINHSCCPNC-------VPLQNKREMSIKSLTSIKSGEE 207
Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +SY + + R+K L + F+C C C+
Sbjct: 208 IFISYIDITESFERREKELR-KWHFSCGCSLCE 239
>gi|225441557|ref|XP_002276611.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Vitis
vinifera]
Length = 327
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/294 (22%), Positives = 123/294 (41%), Gaps = 37/294 (12%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ E GRG+ +T+ + +G ++ P++ + +L I+S C+ C RK +SS
Sbjct: 47 IRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHS--VCYFCLRKLKPVTSS- 103
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLDRQLQARF 121
C+V FCS +C + V A+ R + ++ D PL + A
Sbjct: 104 -EDCNVRFCSQECEEQSK-------VFVAVERKADWSAYDDYCRTRGLKYPLLVKRLACM 155
Query: 122 LIAAYN----LAVVNPSQFQI-LLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
+++ L ++ P+ +++ G SA + A T + +L
Sbjct: 156 VVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQWYINVL 215
Query: 177 AKDRLNAFGL---------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
A+ R+N+F + + + +Y+ SF+NHDC PN ++D
Sbjct: 216 ARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNV-HIIWID 274
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N + ++ + ++ G E+ + Y + D+ RQ L + F C C RC
Sbjct: 275 -----NVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRC 323
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 115/290 (39%), Gaps = 33/290 (11%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M+A+++ V G GR +++T+ L+ G + + P + +N + C CFR
Sbjct: 1 MTAINDAAQVENRNGYGRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQT--CSGCFRLA 58
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
H C SC V +CS C + S H P C+ + P L L R +
Sbjct: 59 AHMQQ--CSSCKVVQYCSQTCQRSDWSI-HKPE-CEGFKAVQPRIPPSPVRL-LGRMMFK 113
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA---STTIELTAALL 176
R VV Q + + + +A ++S PPA STT ++ A L
Sbjct: 114 RAKDCNEFERVVG--QLESHRDKRASKDIEHIAAMLQMASGFIPPALLLSTTADMIA-LC 170
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
K ++N + R IY + S NH C+PNAC + A
Sbjct: 171 CKIQVNTMT-------------TERGVAIYDRLSTVNHSCVPNACLTFGIGGIAR----- 212
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ + + G +I +SY V TRQ++L Y F C C C AN
Sbjct: 213 -LSPMTAIASGDQINISYVDVFQSCETRQRQLKEQYYFDCTCRLCTANAN 261
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 113/290 (38%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + + CH CF++
Sbjct: 7 EKVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSFTQVVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ + P + N+ L ++ R
Sbjct: 62 LQRCGQCKFAHYCDRTCQKDAWLNH--KHECAAIKKH---GKPPSENIRLAARIMWRIER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALLAK 178
L +V+ Q + G D +L+ +++ + +
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRVDVDSFLNFWPAQSQQFSMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
NAF L + Q G ++V GI+ NHDC PN + V +
Sbjct: 177 INCNAFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHT 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYVDFLNVSQERKKQLKKQYYFDCTCEHCE 280
>gi|195589810|ref|XP_002084642.1| GD12724 [Drosophila simulans]
gi|194196651|gb|EDX10227.1| GD12724 [Drosophila simulans]
Length = 663
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 136/329 (41%), Gaps = 74/329 (22%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ S+ V + E + +GR +V+ + L+ G ++L + P+ + L + CH+CF++
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVA--ACLEPSYFGTHCHHCFKRL 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLN--------------LNS 105
S C CS +AFCS +C A SS H + C+ + ++ +
Sbjct: 307 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363
Query: 106 PD------AANLPL--------DRQ---LQARFLIAAYNLAVVNPSQFQILLAFQGTVTD 148
P ANL DRQ R L++ + L ++ S L F T+
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKS-----LYFGRRKTE 418
Query: 149 NDTSAAHYLSSLCPPPASTTIELTAALLAKDRL---NAFGLMEPYIEGQD---GRRSVR- 201
+ P + +++ ALL ++ NA + + + + G ++V
Sbjct: 419 GVS------------PTAVEVQVATALLGLMQVLQYNAHQIYQTQVTEEHRFAGSKTVYL 466
Query: 202 AYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VN 258
A G+Y S+FNH+C P+ AC F +++ + ++Y P +
Sbjct: 467 AAGLYGTGSYFNHECWPSTACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIK 518
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+ RQ+ L Y F+C+C C + NW
Sbjct: 519 NNLKERQRSLRGRYSFSCNCMAC--QENW 545
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + + CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V G++ + NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNISEERKKQLKRQYYFDCTCEHCQ 280
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 113/286 (39%), Gaps = 40/286 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
G GRGL ++ + G+ VL P+ + L + C CF + I C
Sbjct: 13 GLGRGLFASTDIAVGEDVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRG 71
Query: 71 CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
C V +C C HS + C +L LP++ + R ++ +
Sbjct: 72 CGVVKYCDKTCQAKDWKLGHS-FECTIYQKL------KPRILPINARAVLRMVLRSERQK 124
Query: 130 VVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTTI---ELTAALLAKDRL 181
+ + L + + D D SA+H+ LSS S T E+ +A+ AK L
Sbjct: 125 YSD-EELDQFLQLETHIKDIRDESASHWERISLSSKAIKAYSGTEMSEEVISAMGAKLDL 183
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
N+F L + G+YL A+ FNH C NA A + + ++ ++
Sbjct: 184 NSFNLTNAVYD---------RLGVYLHPYAAIFNHSCDHNA-------AVSFDGPNLHIK 227
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
I + + +I ++Y V Y RQ L Y F C C +C EA
Sbjct: 228 AIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHCSKCTSEA 273
>gi|303280161|ref|XP_003059373.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459209|gb|EEH56505.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 740
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 3/99 (3%)
Query: 198 RSVRAYG-IYLKASFFNHDCLPNACRFDYVD-AAAENNTDIIVRMIHDVPQGREICLSYF 255
R R G +Y K S FNH C PN C + + +A++ + V+ I + +G E + Y
Sbjct: 328 REERVGGAVYAKLSRFNHSCAPN-CTWHFAPPGSADSPVSVTVKAIRPIKKGDEATVCYL 386
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+ R++RL +Y FAC+CDRC+ + + DD
Sbjct: 387 DASGGRDARRRRLWAEYRFACECDRCQRLIRQGEENPDD 425
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/288 (22%), Positives = 112/288 (38%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V EG+GRGL +T+ A ++ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQAR 120
C C A +C C A + + C A+ R + S AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPSESIRLAARILWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEG-CLVAVDELQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHVFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NH+C PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+++L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + + CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDMFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V G++ + NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQ 280
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 112/288 (38%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V +G+GR L +T+ A I+ + YSA+ F + +F CH CF++
Sbjct: 7 ENVEVFISDGKGRSLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61
Query: 65 SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A+ C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHHCGQCKFAYYCDRTCQKDAWLNHKNE--CLAIKRYGKVPNENIRLAARIMWRVEREGT 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q ++ + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVMHFGEEEQKQLQVDVDTFLQYWPPKSQQFSMQYISNIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280
>gi|452987883|gb|EME87638.1| hypothetical protein MYCFIDRAFT_27819 [Pseudocercospora fijiensis
CIRAD86]
Length = 351
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 64/149 (42%), Gaps = 18/149 (12%)
Query: 145 TVTDNDTSAAHYL--SSLCPPPASTTI--ELTAALLAKDRLNAFGLMEPYIEGQDGRRSV 200
T D D+S +L +S+ P + EL AL+ D NAFG+ G +
Sbjct: 213 TKQDLDSSCRAFLQLTSILPETILPALGAELCLALVQADNHNAFGIRG----GGEDSEEY 268
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PV 257
YG+Y AS+FNH C PN A + D+ G E C++Y
Sbjct: 269 MGYGLYPCASYFNHSCDPNI-------AKHRRGRSWEFHALRDISAGEECCITYLGGDER 321
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ D R +RL +DF C C RCK EA
Sbjct: 322 DMDVLERLRRLRDVWDFVCVCQRCKAEAG 350
>gi|403218380|emb|CCK72870.1| hypothetical protein KNAG_0M00170 [Kazachstania naganishii CBS
8797]
Length = 491
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 127/323 (39%), Gaps = 52/323 (16%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRD-SPILL---YSALPFINSSSFCHNCF 57
+A+ + + + GRGL S + AG ++++D PI L L + + C C
Sbjct: 107 NALGNQLEIKDTPHMGRGLFSLKDFNAGDLLIKDLEPICLIPPMEKLTLMQNGKACSLCG 166
Query: 58 RKTMHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVC-QALTRLLNLNSPDAA 109
+ + +S C C +CS KC H+P + +L+N
Sbjct: 167 QSIANINSHFIVINGLDCNDCQAVWCSKKCKLKDII--HAPLTHNKGKNKLVN-----GG 219
Query: 110 NL-PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
N ++ +AAY+L ++ S++ + + + + ++S S +
Sbjct: 220 NWRKFEKFCSDNVFVAAYSLGIMF-SKYLLTKSKEAQTIKKKFDSLAFVSQRTRAQYSDS 278
Query: 169 IELTAALLAKDRLNAFGLMEP-YIEGQD----------------------GRRSVRAYG- 204
+ L A D +N EP + EG GR ++ A
Sbjct: 279 SNIGGTLDASDGVNTTADPEPKWKEGYQLFIDTFPSLKEEVDYERFLEYIGRFNLNAIAE 338
Query: 205 -IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+Y SF NH+C PN RF+ ++ I D+ +G +I ++Y ++ +
Sbjct: 339 QLYYIPSFINHNCEPNV-RFE-----KDSRLHINFYARKDIKKGEQIFMTYCNPLHEVNL 392
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R++ L +Y F C CDRC+ E
Sbjct: 393 RRRELRVNYGFLCFCDRCRKEVK 415
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V V EG+GRGL +T+ A I+ + YSA+ F + + CH CF++
Sbjct: 7 ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
C C A +C C A + + C A+ R + N N AA + + +
Sbjct: 62 LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
L L V+ Q + + + +L P +++ + +
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
N F L + Q G ++V G++ + NHDC PN + V +
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY R+K+L Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQ 280
>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
Length = 554
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 123/295 (41%), Gaps = 48/295 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFRKTMHS 63
V + + +GR V+++ L G + + L +A F+ S F CH C +
Sbjct: 61 VHLKNTKTKGRHAVASEELPEG------ATVSLETATAFVVRSEFIDQNCHFCLADL--T 112
Query: 64 SSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR-- 120
IC +CS+++ CS CS+ + H P VC A +++ + A ++ +D
Sbjct: 113 QKVICSNCSLSYYCSETCSSKD-HTLHQP-VCNAFSQVQAI--AHATDVDVDLLRLMLLL 168
Query: 121 ----FLIAAYNLAVVNPSQ-FQILL--------AFQGTVTDNDTSAAHYLSSLCPPPAST 167
+ + N V P + LL AF+ VT +A+ L S P
Sbjct: 169 LARRHMDSQSNQEDVTPFWCVEDLLSHIESAPAAFKQVVT----AASERLLSELPESVQI 224
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY-LKASFFNHDCLPNACRFDYV 226
+E +L + NA GL + + R + A G++ L A FFNH C PN +V
Sbjct: 225 PVEDMVSLACRINSNAHGLGDNH-----SRNTDVALGLFPLGALFFNHGCNPNTA---FV 276
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N + R I V + E+ +SY + D R++ LL F C C RC
Sbjct: 277 GLP---NGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQELLTTKHFWCKCKRC 328
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 116/284 (40%), Gaps = 30/284 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
ETV V + +G+GRGL + + AG ++ + Y+A+ F + + CH CF++ H+
Sbjct: 2 ETVEVFDTDGKGRGLKAAKEFWAGDVIFAERA---YAAVVFDSLTHLVCHTCFKR--HAK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ + + +P+ N+ L ++ R
Sbjct: 57 LHRCGQCKFAHYCDRTCQKDAWVNHKNE--CAAIKK--HGKAPN-ENIRLAARIMWRIER 111
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
L V+ Q + G + +L ++ + +L
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLESFLEFWPHDGKQFAMQYISHILGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N F L + Q G ++V GI+ NHDC PN C + NN I +
Sbjct: 172 INCNGFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPN-CTVIF------NNGKIEL 218
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
R + + G E+ +SY R+++L Y F C C+ C+
Sbjct: 219 RALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 262
>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 732
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 66/146 (45%), Gaps = 25/146 (17%)
Query: 154 AHYLSSLCPP-PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFF 212
A+ ++S+ PP PA ++ A +++K +N+ G++ D G+Y A+ F
Sbjct: 355 ANAVNSMLPPGPARMDVDALAEVMSKVHVNSHGVV-------DAAGRALGTGVYPPAALF 407
Query: 213 NHDCLPNAC-RFDYV----------------DAAAENNTDIIVRMIHDVPQGREICLSYF 255
NH C PNA F + + N + VR + V +G EIC++Y
Sbjct: 408 NHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGSTNNGARLTVRCVSPVEEGEEICIAYA 467
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
V TR++ L FAC+C RC
Sbjct: 468 EVYATRETRREALWEKKAFACECRRC 493
>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
Length = 391
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 89/205 (43%), Gaps = 22/205 (10%)
Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
L + + +Q+ L Y P F+ LLA GT +++
Sbjct: 200 LGEKFVGQIDILQQMMKEALNTEYTAHWFTPDGFRSLLALVGTNGQGIGTSSF------- 252
Query: 163 PPASTTIELTAALLAKD-RLNAFGLMEPYIEGQDGR----RSVRAYGIYLKASFFNHDCL 217
+ ++A L KD R+ L+E + D + G+Y+ S NH C+
Sbjct: 253 --SRWVKNVSALELPKDERIYIDKLIERIYDAMDEAVGPFLNNEGSGLYVLQSSVNHSCI 310
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDF 274
PNA V+ NNT ++++ I ++ G EIC SY + +RQ+ L Y F
Sbjct: 311 PNA----VVEFPYSNNT-LVLKAIRNIKVGEEICTSYLDECQLERSRHSRQQALSSLYLF 365
Query: 275 ACDCDRCKVEANWSDNDNDDENNEE 299
C CD+C+ +AN D +D+E ++E
Sbjct: 366 VCHCDKCQAQANDPDVTSDEEESDE 390
>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
Length = 398
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 50/83 (60%), Gaps = 6/83 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY-DYS 262
G+YL S NHDC PN +++D NN + V+ I ++ +G+E+C+SY + ++
Sbjct: 300 GLYLLQSCCNHDCSPNV-EINFLD----NNATLTVKAIRNISEGQELCISYIDSDIKNWK 354
Query: 263 TRQKRLLYDYDFACDCDRCKVEA 285
RQ L+ +Y F C C+RC VE+
Sbjct: 355 KRQAILMENYLFECTCNRCMVES 377
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 114/309 (36%), Gaps = 57/309 (18%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHS 63
E+V V + GR LV T AG+ +L + P A N FC C ++ S
Sbjct: 9 ESVRVVKSATCGRSLVFTSKFAAGKCLLEELPY----AYVLCNKKRGLFCDFCLKQC--S 62
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQA 119
+ C SC V +CS C H C+ L R+ PD A L + +Q Q+
Sbjct: 63 TLLRCSSCKYVRYCSRPCQKGDWKRCHKQD-CKTLKRIHPRVPPDLAQLLAQIIRKQRQS 121
Query: 120 ----------------RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP 163
+ L+ F+ L + D P
Sbjct: 122 PPCTQDDEDCFPTTVDQLESHHEKLSDARKDHFESLWFVLQQCIEEDVL----------P 171
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
S+ +++ A + N+F + + + G A GIYL+AS NH C PN
Sbjct: 172 QPSSLVKMFGATIC----NSFSICDNDLNGI-------AVGIYLRASMLNHSCYPNC--- 217
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ + +R + DV +G +SY V RQ L Y F+C+C +C
Sbjct: 218 ----VVVFDERKLQLRTVRDVKEGDVCTISYVDVINPAKERQTELEERYHFSCNCVKCID 273
Query: 284 EANWSDNDN 292
E N + D+
Sbjct: 274 EINELEPDD 282
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 108/276 (39%), Gaps = 37/276 (13%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSS 66
+ V+ + +GR L++T++ G+++LR P Y +P +S+ S C CF S
Sbjct: 13 LAVSNLPEKGRCLLTTKNFNPGEVILRQEP---YVCVPNNSSTVSRCDGCF---ASESLK 66
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C +C V +C C + C AL+RL A P R L R +
Sbjct: 67 KCSACQVVWYCGSTCQKSEWKLHRLE--CNALSRLEK--EKRKAVTPSIR-LMVRLYLRR 121
Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
++ + Q++L Q A+ + + P E+ A +K NA
Sbjct: 122 KLQNEMDLDEKQLVLYAQ---------MANLVHFILQWPEINLKEI-AENFSKLACNAHT 171
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ D G+Y S NH C+PNA +VR + +P
Sbjct: 172 IC-------DCELRPLGTGLYPVVSIINHSCMPNA-------VLTFEGKSSVVRAVEHIP 217
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+G E+ ++Y TRQK L Y F C C RC
Sbjct: 218 EGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRC 253
>gi|365758076|gb|EHM99938.1| Set6p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 374
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 72/156 (46%), Gaps = 23/156 (14%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P TI L +L + NAFGL +
Sbjct: 232 SKFPVLLHFQKLVFQT-------LYILLPSHTRKMLTISLLRHILGTEYGNAFGLWQ--- 281
Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDVPQGRE 249
EG+ R Y ++ +AS+FNH C PN ++ + N NTDI + +
Sbjct: 282 EGEAAESREYFGYWVFPEASYFNHSCTPNVTKYRKGNIMLFNVNTDI--------KKNEQ 333
Query: 250 ICLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
IC+ Y V + ++++ L D + F C C+RCK E
Sbjct: 334 ICIDYSGVLELPTVKRRKFLADSWFFDCACERCKSE 369
>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 140
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T + LL + N F +++ G A G+Y A+ FNH C PN V
Sbjct: 31 TFDAILGLLVRFPCNNFAIVDDLWSGI-------AAGVYPNAALFNHSCHPN------VI 77
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
A + + + R I D+ G EIC SY + R+ LL DY+F C+C+RC
Sbjct: 78 PAFGHGSTLSFRAIRDISPGEEICHSYVELTLPSWKRRDVLLRDYEFLCECERC 131
>gi|401841590|gb|EJT43957.1| SET6-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 374
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 72/156 (46%), Gaps = 23/156 (14%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P TI L +L + NAFGL +
Sbjct: 232 SKFPVLLHFQKLVFQT-------LYILLPSHTGKMLTIPLLRHILGTEYGNAFGLWQ--- 281
Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDVPQGRE 249
EG+ R Y ++ +AS+FNH C PN ++ + N NTDI + +
Sbjct: 282 EGEAAESREYFGYWVFPEASYFNHSCTPNVTKYRKGNIMLFNVNTDI--------KKNEQ 333
Query: 250 ICLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
IC+ Y V + ++++ L D + F C C+RCK E
Sbjct: 334 ICIDYSGVLELPTVKRRKFLADSWFFDCACERCKSE 369
>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
Length = 399
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 21/135 (15%)
Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
PP ++ EL RL A + ++ + + G++ +A+ NH C PNA
Sbjct: 98 PPVQSSGEL-------QRLVAICMYNCFLTDHERQPGFDVLGLWPQAAAINHSCRPNAT- 149
Query: 223 FDYVDAAA-----ENNTDI-------IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
Y+DA+A E+ D+ I+R + D+ +G I +SY + + RQ+ L
Sbjct: 150 -HYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITISYVELGDPWPVRQEALRT 208
Query: 271 DYDFACDCDRCKVEA 285
Y FAC C RC EA
Sbjct: 209 GYGFACTCIRCTEEA 223
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 99/268 (36%), Gaps = 50/268 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSCS-V 73
RGR L +T+++K G+ +LRD Y + N + C C R + C CS V
Sbjct: 16 RGRLLRATRNVKMGESILRDCA---YGNVVLSGNRAELCAVCLRAA--DADICCDDCSKV 70
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
FCS +C + C+AL + + S + +L D + R A
Sbjct: 71 FFCSEECREKLRDIHENE--CEALEEVDLIASKTSVDLHSDEEGNVRSSYA--------- 119
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
N H L P A R A ++E +
Sbjct: 120 ---------------NVKDLVHVLDKEAGPWADHV-----------RAGAKRILEDLPDE 153
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
V A G++ NH C PN + D+ E VR + D+ +G EI LS
Sbjct: 154 CHLPHLVAAVGLFSICGLINHSCQPNCTWSNAGDSVME------VRALRDIEEGEEITLS 207
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y ++ + S RQK L F C C+RC
Sbjct: 208 YIDIDKERSERQKELRDTKHFDCQCERC 235
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/291 (25%), Positives = 116/291 (39%), Gaps = 35/291 (12%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS-FCHNCFRKTMHSS 64
E+V V + GR LV T GQ +L + P Y+ N FC C +K S+
Sbjct: 7 ESVRVVKSATCGRSLVFTSKFARGQCILEELP---YAYTLHDNKRGLFCDFCLKKC--ST 61
Query: 65 SSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C SC+ V++C+ C + H C+ L ++ D A L L + ++ R
Sbjct: 62 LKKCSSCNYVSYCNKSCQKKDWARCHKQD-CKTLKKIHPRVPSDLAQL-LSQIIRKRRKS 119
Query: 124 A--AYNLAVVNPSQFQILLAFQGTVT-DNDTSAAHYLSSL-----CPPPASTTIELTAAL 175
A + + P+ L + ++ ++ SA H L P + +++ A
Sbjct: 120 APCSQDDEDCFPTTVDQLESHHEKLSIESSVSALHKLKHCIEEEDVPTDPRSLLKMYGAT 179
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
N+FG+ D + GIYL+AS NH C DY +
Sbjct: 180 ----NCNSFGIF-------DNDLIAISDGIYLRASMVNHSC-------DYNCIVVFDERK 221
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ +R + DV +G E + Y V + R+ L Y F C C +C E N
Sbjct: 222 LQLRTVKDVKEGEECTIGYVDVIHPAKERRAELEEKYHFTCKCVKCNEEIN 272
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 11/91 (12%)
Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
G+ ++ +Y KAS NH C PN A + I + + DV +G E ++Y
Sbjct: 354 GQLNIESY----KASMLNHSCDPNC-------AWVSDGRKIQIMTVKDVKEGEECTITYV 402
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
V RQ L Y F C C +C E N
Sbjct: 403 DVMDPAKVRQADLKERYHFTCKCVKCIEEIN 433
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 113/298 (37%), Gaps = 51/298 (17%)
Query: 16 RGRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSS 65
RGR L ST+ AG+ I + D+P+L ALP N + C+ C F + +
Sbjct: 22 RGRSLFSTERFGAGETIAIIDNPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVEAGP 78
Query: 66 SICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C +C+ +C+ +C A H C+ R+ D P+ Q
Sbjct: 79 RTCKACTGCKAAVYCNAECQRAHWKLVHKAE-CKMFKRIKERTGKDWLPTPVRAVAQVML 137
Query: 122 LIAAYNLAVV-----------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
L+ A + V N F+ G T A Y L + T++
Sbjct: 138 LLKAGDEETVKAFGPGGTLESNVEGFKTDEGLWGDFELQATGAVVYAGLL---QSDETLK 194
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDA 228
+L K + NAF + G+ GIYL S NH C+PNA Y+
Sbjct: 195 QAMEVLCKMQTNAFN----RFDADTGQ-----AGIYLHPSLSMVNHSCVPNA----YI-- 239
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
A ++ D+ G EI +SY RQ+ L Y F C+C RCK + N
Sbjct: 240 -AFEKRKAFLKAERDIEPGDEILISYIDHTMPRRARQESLRL-YHFQCNCIRCKDDLN 295
>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
Length = 597
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 113/300 (37%), Gaps = 27/300 (9%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ + E + + GRGL + Q +K G +++ P + S L N +S+C NCF +
Sbjct: 64 LHGIPEYLYIRSSPSEGRGLWAKQPIKRGTVIIAVKPHV--SVLTTQNLASYCSNCFEEA 121
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ C C V +C+ +C ++ C AL ++P D
Sbjct: 122 PEAGLKRCAHCRVVHYCNSECQNKDWATHK--RECTALQEWAKHAPAAEVSVPSDAVRCL 179
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
L+ ++ + Q + A Q + S + + L S ++ L + D
Sbjct: 180 GRLLWKRQKKGLDSTWAQQIAAMQSSRKSLQPSGYEFHTHL-----SHSLVRYLGLSSPD 234
Query: 180 RLNAFGLMEPY-IEGQDGRRSVRAYGIYLK------------ASFFNHDCLPNACRFDYV 226
L FG+ P + R + +Y + S NH C PNA +
Sbjct: 235 GLQDFGISSPGDLLDLTSRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATV-FP 293
Query: 227 DAAAENNTD---IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
A+ +TD I V I D+ +I SY + R++ L Y+F C C CK
Sbjct: 294 RASKTPSTDEPLISVVAIRDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLCKT 353
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 118/295 (40%), Gaps = 33/295 (11%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
++++ + +GR L++T+ G++++ P + ++ C CF ++ S
Sbjct: 13 LSLSTLPEKGRSLLATRDFYPGEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVLSR 72
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYV--CQALTRL-------LNLNSPDAANLPLDRQLQ 118
C C +AF C TA S + C+ L+ L L + L L R+LQ
Sbjct: 73 CSRCQLAF---YCGTACQRSEWKLHRLECEVLSSLHKYKRKSLTPSIRLMLRLYLRRKLQ 129
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAA 174
+I + A+ N + + L+A +T+ A+ ++S+ P E+ A
Sbjct: 130 NDKIIPS--TAMDNYNLVEALVAHMSDITEEQLVLYAQMANLVNSILEWPGINIKEI-AE 186
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+K NA + D G+Y S NH CLPN+ +
Sbjct: 187 NFSKFACNAHTIC-------DSELRPVGTGLYPVISIINHSCLPNS-------VLVFEGS 232
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
+VR + +P G E+ +SY TRQK L Y F C C RC + D
Sbjct: 233 SALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDD 287
>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
Length = 385
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 91/198 (45%), Gaps = 27/198 (13%)
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
R++ + L Y P F+ LLA G + ++ P S ++ +A
Sbjct: 204 REMMKKALNIEYTAHWFTPDGFKSLLALVGMNSQGIGTS----------PFSRWVKNVSA 253
Query: 175 L-LAKD-RLNAFGLMEP-YIEGQDGRRSV---RAYGIYLKASFFNHDCLPNACRFDYVDA 228
L + KD R+ L++ Y E ++ S G+Y+ S NH C+PNA V+
Sbjct: 254 LDMHKDQRIFIDKLIDRIYDEMEEVVGSFLDNEGSGLYILQSAVNHSCMPNAS----VEF 309
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
NNT +++R I D+ EIC SY + +RQK L Y F C CD+C+ +A
Sbjct: 310 PYSNNT-LVLRAIRDIQPEEEICTSYLEECELERSRYSRQKALSSLYLFVCHCDKCQSQA 368
Query: 286 NWSDNDNDDENNEEVMDE 303
N D +D+ EEV DE
Sbjct: 369 NDPDMTSDE---EEVEDE 383
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 115/282 (40%), Gaps = 44/282 (15%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+GRG+ ST+S G ++L +P +L +L IN + C +CF++ + S C +V++
Sbjct: 14 KGRGIFSTKSFAPGDVILPFTPTILIPSLSHIN--TICSHCFKQAEVRACSRCH--AVSY 69
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
C C A ++ HS C+ L ++ P LP I A A+V P
Sbjct: 70 CDAACQAANWTAVHSKE-CKVLRKVTEQGRP---GLPTP--------IRAVVQALVKPEI 117
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAST---------TIELTAALLAKDRLNAFGL 186
+ +G S + AS ++ +L K + NAF
Sbjct: 118 GNAIEDLEGNAASWRKSEKWADMEMMAMGASAFAGLGTGQEDVQKALTILCKIQTNAFHR 177
Query: 187 MEPYIEGQDGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ + GQ GI+L K + NH C+PNA ++ A I+R +
Sbjct: 178 YDTDL-GQ--------VGIFLGTKLAMANHSCIPNAM-VQFIGRRA------ILRAEKPI 221
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
EI +SY + S R K L Y F C C RC+ + N
Sbjct: 222 KIDDEIEISYTDYTFPLSNR-KLALAPYFFDCMCLRCEKDLN 262
>gi|452982319|gb|EME82078.1| hypothetical protein MYCFIDRAFT_77623 [Pseudocercospora fijiensis
CIRAD86]
Length = 709
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 11/125 (8%)
Query: 186 LMEPYIEGQD-GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ +PY G+ +V + G+++K S+ NH C+PNA D ++V+ ++
Sbjct: 456 IAKPYPFGRSYAGDAVGSSGLWIKTSYINHSCIPNAEPSVIGDL-------MVVKATRNI 508
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
QG EI +SYF + DY TR +++ + F C C+ C EA S D E ++ D
Sbjct: 509 TQGEEITISYFN-DIDYQTRSRKIKSSWGFDCTCELCTAEAATSPADM--EERIQLRDLI 565
Query: 305 QDEQM 309
++QM
Sbjct: 566 TEQQM 570
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 114/311 (36%), Gaps = 56/311 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRALAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
VA +CS KC A C+ L PD+ L L R L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVVKLMEE 115
Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAALL--------- 176
PS+ + L +F +T+ L+ I+ + LL
Sbjct: 116 TPSESEKLYSFYDLESNISKLTEEKKEGIRQLAMTFQHFMREEIQDVSQLLPAFDIFEAF 175
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
AK NAF + ++G G+Y S NH C PN + N +
Sbjct: 176 AKVFCNAFTICNAEMQGV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPHL 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-E 295
++R + D+ G E+ + Y + R+K L Y F CDC RC+ + D D D
Sbjct: 222 LLRAVRDIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRCQTQ----DKDADMLT 277
Query: 296 NNEEVMDEDQD 306
+E+V E Q+
Sbjct: 278 GDEQVWKEVQE 288
>gi|297806681|ref|XP_002871224.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
gi|297317061|gb|EFH47483.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
Length = 325
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 121/308 (39%), Gaps = 62/308 (20%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V E GR + +T+ + AG ++ P++ +L + S C+ C +K M S+
Sbjct: 44 IRVGLTESAGRAVFATRRIGAGDLIHTAKPVVACPSL--LKLDSVCYLCLKKLMGSAKF- 100
Query: 68 CPSCSVAFCSPKCSTAAAS-------SSHSPYVCQALTRLLNLNSPDAAN---------- 110
V++CS +C + + S +V R N P
Sbjct: 101 -EDRGVSYCSQECQENSKGFLDVERRADWSSFV--DYCRTHNFKYPLMVKRLCCMIISGV 157
Query: 111 LPLD-RQLQARFLIAAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHYLSSLCPPPASTT 168
LP D + +++ ++ + + AF + + D+D S T
Sbjct: 158 LPADCLDILQPAALSSEMISKIEDGYGLLWNAFRKANLKDDDVSF-------------LT 204
Query: 169 IELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKASFFNH 214
+ A+LA+ R+NAF L +EG+ + +Y+ SF+NH
Sbjct: 205 KQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPSFYNH 260
Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
DC PNA +A A NT + DV +G E+ + Y + Y RQ L + F
Sbjct: 261 DCDPNAHIIWLQNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQGFGF 314
Query: 275 ACDCDRCK 282
C+C RC+
Sbjct: 315 LCNCLRCQ 322
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 114/282 (40%), Gaps = 43/282 (15%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
+ R L +TQ K QI+ P++L+ +L + +S C +C R S C C A
Sbjct: 12 KHRALHATQPFKPSQIIHTFHPLILHPSLSQL--ASVCTHCLRP---GSPRACSRCHAAY 66
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
+C+ C AA ++ HS C+AL + L S A+LP + A ++N
Sbjct: 67 YCNTSCQQAAWTAVHSKE-CKALQQ-RKLGSRTGADLPTP--------VRALLQTLLNKE 116
Query: 135 QFQILLAFQGTVTDNDTSAA--------HYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
+ A G + + + + + I LL K + NAF
Sbjct: 117 TEDGIAALDGHTEERRKTKSWPDLEMMAMAACAFAGRQGESNIRRAVELLCKIQTNAFHR 176
Query: 187 MEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ + GQ G++L+ + NH C+PNA +V A I+R +
Sbjct: 177 WDVDL-GQ--------VGVFLEPTLAMANHSCVPNAV-VQFVGRKA------ILRAERPI 220
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
G EI +SY STR + L Y F C C RCK + N
Sbjct: 221 HAGDEIEISYTDYTMPLSTR-REALEQYSFECTCARCKGDLN 261
>gi|440638927|gb|ELR08846.1| hypothetical protein GMDG_03520 [Geomyces destructans 20631-21]
Length = 539
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 117/281 (41%), Gaps = 27/281 (9%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G G +T+S+ VL DS + ++ S+ C C + C CSV
Sbjct: 269 GDVLGAFATKSIAPSDTVLVDSTVAGVTS-----SAERCPTCCGGITEDILNDC--CSVP 321
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP----LDRQLQARFLIAAYNLAV 130
+CSP CS A + H+ VC L ++ ++A L LD L R L +
Sbjct: 322 YCSPDCSKTALDTFHAS-VCGKDFGFLT-SAAESAELTTDLSLDSVLLLRVLALSMQEPT 379
Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
++P + +L T ++D ++ + P EL + A + + L
Sbjct: 380 LHPLKTSLLNRLTPTYKNDDLIIFNFSDHIVTP-IRVLGELGIDVFANAAYDTWVLHTIR 438
Query: 191 IEGQDGRRSVRAYGIYLKA-----SFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDV 244
Q+ + G A S FNH C PN VD E+ N+ + + + DV
Sbjct: 439 CRLQNNKHGQTLDGFPGTAVNPLYSMFNHSCAPN------VDWEHEDSNSTLRMFALRDV 492
Query: 245 PQGREICLSYF-PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+G E+ +SY P+ DY+ RQ+ LL C+C+RCK E
Sbjct: 493 NEGEELFISYIRPLTMDYAERQESLLPWLGMECECERCKAE 533
>gi|409042906|gb|EKM52389.1| hypothetical protein PHACADRAFT_148986 [Phanerochaete carnosa
HHB-10118-sp]
Length = 436
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 127/310 (40%), Gaps = 48/310 (15%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMH-------SSS 65
+G+GLV+T+ + G++V ++ P +L +S +S C +C + S S
Sbjct: 105 KGKGLVATERIAEGEVVWKEDPFILAPEWSLYDLQTASVACTHCSTPLNNSPLAVPCSGS 164
Query: 66 SICPSCSVAFCSPKCSTAAASSSHSPYVC-----QALTRLLNLNSPDAANLPLDRQLQAR 120
+ PSCS FC+ C + AA + P +C AL L + L Q AR
Sbjct: 165 AKVPSCSARFCNRLCLSRAART--HPLLCPAQNPAALPLLEFARRHEWMALHALAQCTAR 222
Query: 121 FLIAAYNLAVVNPSQFQIL--LAFQG------------TVTDNDTSAAHY---LSSLCPP 163
L+A + + +L LA G D DT Y + + P
Sbjct: 223 VLLAEQQDEAAFSADWAVLRGLAQLGMEERAKGGWLGSAEPDRDTWKKAYELFVRAFKDP 282
Query: 164 PASTTIELTAALLAKDRLNA-----FGLMEPYIEGQDGRRSVRAY---GIYLKASFFNHD 215
+ + A LL K FG + ++ G GR S+ G+Y+ S NH
Sbjct: 283 GTESEKKRLARLLRKPTRKEVDDEIFG-YDAFLRGL-GRMSLNLEAHGGLYVLHSHVNHS 340
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
C PN +A + ++VR D+ G E+ +SY + R+++LL ++ F
Sbjct: 341 CTPNLSARHLDQRSALSRITVVVR--RDIEVGEELFISYVNPDLPLEGRRRQLL-EWGFG 397
Query: 276 -CDCDRCKVE 284
C C RC +E
Sbjct: 398 TCQCPRCVME 407
>gi|195126158|ref|XP_002007541.1| GI12333 [Drosophila mojavensis]
gi|193919150|gb|EDW18017.1| GI12333 [Drosophila mojavensis]
Length = 661
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 131/331 (39%), Gaps = 78/331 (23%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S ++ V + E + +GR +V+ Q LK G +VL + P+ F ++ CH+CF++
Sbjct: 250 LSGAAKVVKLVETKDKGRFVVADQGLKTGDVVLAEDPVAACLLPNFFGTN--CHHCFKR- 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQ----------------ALTRLLNL 103
+H+ C CS +AFCS +C A + H + CQ AL
Sbjct: 307 LHTPVP-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCFIALRIFTQA 364
Query: 104 NSPD----AANLPL------------DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVT 147
S D ANL D LQ R L+A + + V+ +Q+ F T
Sbjct: 365 PSLDDGLATANLLFEHLCSHQDVREADDHLQ-RSLMAGFLMRVLQKAQY-----FGRRKT 418
Query: 148 DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY---------IEGQDGRR 198
+ P + +++ A+L +L + E Y EG +
Sbjct: 419 EGVN------------PTAVELQVATAMLGLLQLLQYNAHEIYHTMVTDEHCFEG--CKV 464
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV- 257
G+Y S+FNH+C P+ + +V +++ + ++Y P+
Sbjct: 465 VYVGAGLYGTGSYFNHECWPSVAGY-FV------GKKLVMSATKPHRPNEIVAVNYGPIF 517
Query: 258 -NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+ RQ+ L Y F C+C C + NW
Sbjct: 518 TKMNLKERQRSLRGRYAFNCNCMTC--QENW 546
>gi|408392117|gb|EKJ71478.1| hypothetical protein FPSE_08348 [Fusarium pseudograminearum CS3096]
Length = 358
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 63/155 (40%), Gaps = 22/155 (14%)
Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKA 209
YLS P T + LL K+ N FGL Q R + +Y +A
Sbjct: 212 YLSDASVLPCMLTADEMLLLLCKEETNTFGLYPKATGTQFVNDNATSRGESYGFALYPRA 271
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKR 267
+ FNH CLPN V + ++ D+ +G E ++YF D S+RQK
Sbjct: 272 AQFNHSCLPN------VSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQKY 325
Query: 268 LLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMD 302
+ F C C+RC E + E N ++MD
Sbjct: 326 AQTQFQFKCTCNRCLEE--------EAEENMDIMD 352
>gi|118781832|ref|XP_311885.3| AGAP002999-PA [Anopheles gambiae str. PEST]
gi|116129279|gb|EAA07926.3| AGAP002999-PA [Anopheles gambiae str. PEST]
Length = 664
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 118/319 (36%), Gaps = 74/319 (23%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSA-LPFINSSSFCHNCFRKTMHSSSSICPSC-SV 73
RGR +V+ L G+++L + Y+A L + C CF + + + CP C V
Sbjct: 267 RGRYVVAAADLGPGEVILTEPA---YAACLHAKYYGTHCSACFSRLI--APVACPDCCGV 321
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-------------- 119
AFCS C A ++ H + CQ L ++ ++ L QA
Sbjct: 322 AFCSVACRDKACATYHR-FECQYLDLMIGSGMSILCHVALRMVTQAGTPEKVLEEGKMLR 380
Query: 120 -----------------RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
R L+ A+ L + ++F F T+ P
Sbjct: 381 DTFCAHTEHRDPEDHFKRTLMTAFLLRCLQKAEF-----FGRRTTE------------AP 423
Query: 163 PPASTTIELTAALLA---KDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHD 215
P +E+ A LL+ + NA + E I G+ + + GIY AS FNH+
Sbjct: 424 EPTEQELEVGAVLLSALQSLQFNAHEVYETRITGEHRFDTAKVQYIGVGIYRGASMFNHE 483
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYD 273
C P R T +I+ +P G + +Y P + + RQ+ L Y
Sbjct: 484 CYPGVTRTFL-------GTAMILHTSRPIPAGAVVPENYGPHFMRQPKAIRQRNLRSRYW 536
Query: 274 FACDCDRCKVEANWSDNDN 292
F CDC C +W D
Sbjct: 537 FKCDCRACA--EDWPQMDK 553
>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 489
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 115/289 (39%), Gaps = 27/289 (9%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
E V + + +GRGL +++ G++V + YSA+ F +S +F CH CFR+
Sbjct: 6 KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
+ C C A +C+ C TA C A+ + S AA L L R +
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
L + L V+ Q + Q + T ++E + +
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
+ N F L + Q G ++V G++ + NHDC PN V +A +
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNCTAILNHGNHSAVSSALHS 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I VR + + +G E+ +SY + RQK+L + F C C C
Sbjct: 231 PRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 279
>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 662
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 117/308 (37%), Gaps = 90/308 (29%)
Query: 9 TVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
TV G GRGL + + KAG++++
Sbjct: 381 TVKSSPGAGRGLFAARDFKAGELIM----------------------------------- 405
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
C AFC T ++ SP V ALT N+ D + R A +
Sbjct: 406 --CEKAFC-----TVSSKDKASPAVT-ALT----------VNIGQDYSI--RVFPAGLHR 445
Query: 129 AVV-----NPSQFQILLA-----FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
AVV NPSQ +L +QG +T A S P T ++ +
Sbjct: 446 AVVQKLLNNPSQASKVLGLDSGDYQGV---GETGA-----STAEGPVVDTFQVHNIV--- 494
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
+ NAFGL + D S G++ +AS+ NH C+PN+ + D A + I+V
Sbjct: 495 -QRNAFGLGP---QSADEDVSNATTGLWARASYLNHSCMPNSVK----DFAGDL---IVV 543
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNE 298
R + + G EI +Y N DY RQ L + F C C C EA +D D
Sbjct: 544 RALQHIRTGDEITHAYQD-NGDYDARQALLQTTWGFTCRCKLCVAEA--ADGDEVRVKRR 600
Query: 299 EVMDEDQD 306
E+M E ++
Sbjct: 601 ELMKEAEE 608
>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
Length = 490
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/279 (24%), Positives = 112/279 (40%), Gaps = 41/279 (14%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP------S 70
GRGL +T+ ++AG+++ D P+L+ + C C++ CP
Sbjct: 36 GRGLFATRDIQAGELIFTDVPLLM-GPRCYNKHLPMCVVCYKNN-------CPLFPCDYG 87
Query: 71 CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
C + CS +C +A H CQ L +P + L LI L+
Sbjct: 88 CGLPICSAECENSAM---HVQGECQFLREW----APTCGSTWSKDLLLVVVLIRGLALS- 139
Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYL--------SSLCPPPASTTIELTAALLAKDRLN 182
Q +L AF+ N H ++ P+ ++L + N
Sbjct: 140 --EEQRTLLYAFECHANLNRNYEVHQFILKIDLLKRNVANLPSEEQMKLMQRVCDVFNTN 197
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
+F ++ P D S+R G+Y + NH C+PN+ R + D + V
Sbjct: 198 SFEVVVPS-RNNDHTMSLR--GLYPMGALQNHCCVPNS-RHHFDDQQR-----LHVSAAL 248
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ G EI +SY + +D STR++ L F+C+C+RC
Sbjct: 249 PIAAGEEITMSYTDLLWDTSTRRQFLRITKRFSCNCNRC 287
>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 478
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 115/289 (39%), Gaps = 27/289 (9%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
E V + + +GRGL +++ G++V + YSA+ F +S +F CH CFR+
Sbjct: 6 KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
+ C C A +C+ C TA C A+ + S AA L L R +
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
L + L V+ Q + Q + T ++E + +
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
+ N F L + Q G ++V G++ + NHDC PN V +A +
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNCTAILNHGNHSAVSSALHS 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I VR + + +G E+ +SY + RQK+L + F C C C
Sbjct: 231 PRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 279
>gi|356533089|ref|XP_003535101.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Glycine
max]
Length = 313
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 69/304 (22%), Positives = 119/304 (39%), Gaps = 60/304 (19%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ VA E GR + +T+ + +G ++ +P + + + ++ + C++C HS S
Sbjct: 36 IRVALTESVGRAVFATRPIASGDLIHTATPTVCHPSSS--SARAACYSCLAALPHSQSQG 93
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
P FCS +C S Y + + N + R L+ FL+
Sbjct: 94 IP-----FCSQRCH----QRSKGYYDVEMKANWVAFN-----DYCWTRGLKYPFLVKRLV 139
Query: 127 -----------NLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
L ++ P+ + L + T + H A
Sbjct: 140 CMVISGDARSDTLDILQPANLTPEMVLKMEEEFLLLRNAFTKALIADEHI--------AF 191
Query: 167 TTIELTAALLAKDRLNAF------GLMEPYIEGQDGRRSVRAY---GIYLKASFFNHDCL 217
T + +LA+ R+NAF GL E + A +YL SF+NHDC
Sbjct: 192 LTKQWYINILARIRINAFRIELAGGLYEDLLASAVASVEAEAAVGNAVYLLPSFYNHDCD 251
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
PNA ++D N D ++ + D+ +G E+ + Y + D + RQ+ L + F C+
Sbjct: 252 PNA-HIIWID-----NADAKLKALRDIVEGEELRICYIDASLDRNARQELLSRGFGFQCN 305
Query: 278 CDRC 281
C RC
Sbjct: 306 CSRC 309
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/288 (20%), Positives = 111/288 (38%), Gaps = 31/288 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
+ V V + G+GRGL +T+ L +G ++ + P ++A+ F + + CH+CFR+
Sbjct: 2 DNVEVFDSPGKGRGLRATKELLSGDVLFSEPP---FAAVVFDSLADKICHSCFRR--QDK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDAANLPLDRQLQA 119
C C A +C C +A C A+ N N+ L ++
Sbjct: 57 LQRCGQCKFAQYCDKTCQSAGWKEH--KLECGAIKAFGAIGDNFGKAPNENIRLAARIMW 114
Query: 120 RF-----LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
R +++ + ++ + I + + + ++L ++ +
Sbjct: 115 RLDKEGGVMSDMQMTSLDDLENHISDMPEDDLKELKVDIHNFLDYWPRNSKQHRVDDISH 174
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+ N F + Q G +V G++ NHDC PN N+
Sbjct: 175 IFGVINCNGF-----TVSDQRGLNAV-GVGLFPNLCLVNHDCWPNC-------TVILNHG 221
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + G E+ +SY RQ++L Y F C C+ CK
Sbjct: 222 KIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHCK 269
>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 442
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 46/96 (47%), Gaps = 4/96 (4%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
D RS R GIY FNH C PN V ++ N+ + V + DV G E+ +SY
Sbjct: 350 DLLRSCRGQGIYEVGCLFNHSCDPNLS----VQYSSLNDETLTVVALRDVKAGEELTISY 405
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+ ++ RQ++LL Y F C C RC E N
Sbjct: 406 IDSSLPFAVRQQQLLDHYLFECRCPRCVAEGTTDAN 441
>gi|254583928|ref|XP_002497532.1| ZYRO0F07722p [Zygosaccharomyces rouxii]
gi|238940425|emb|CAR28599.1| ZYRO0F07722p [Zygosaccharomyces rouxii]
Length = 386
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 72/158 (45%), Gaps = 27/158 (17%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYI 191
S+F ILL+FQ V L L PP + + + +L + NAFG+ + +
Sbjct: 244 SRFPILLSFQIAVYQT-------LYILLPPTLHSQLSIPKFRHILGAEYGNAFGIWQ--M 294
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
G R Y ++ +ASFFNH C PN VD + + + DV G ++C
Sbjct: 295 GESVGDREYLGYWLFPRASFFNHSCDPN------VDKVRKGRKMCFI-LNRDVTSGTQLC 347
Query: 252 LSY-----FPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
++Y PV S RQ+ + ++ F C CDRC E
Sbjct: 348 IAYNCDLTLPV----SGRQQTMKDNWFFECMCDRCSSE 381
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 120/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + +L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTALRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGTPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 111/271 (40%), Gaps = 30/271 (11%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
GRG+++T++LK G+I+ +D P+L+ A +S + C C K + + +C C +
Sbjct: 34 GRGVMATRNLKRGEIIFQDMPLLIGLAAQEEDSLNACSICL-KLLPDTRFMCRQGCGL-- 90
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
P CS H + + N PD AN + R L + A NL Q
Sbjct: 91 --PICSLCGKKRQHKTDC--DMFKSWGPNEPDVANSVIIRLL---CVARAINLT---KDQ 140
Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLMEPY 190
++ Q + +N + AA + P +E+ +A R N F +
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLVEIMNRTVAVLRTNGFDKTTDRC 198
Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
+ Q+ S RA +Y NHDC+PN+ E ++IVR D+ +G EI
Sbjct: 199 TDNQEF--SYRA--LYPLFGVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDILEGEEI 249
Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y + R L F C C RC
Sbjct: 250 TTTYTKLFTGNIARHLFLKMKKGFTCKCPRC 280
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/332 (21%), Positives = 131/332 (39%), Gaps = 28/332 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + + G+GRGL + + AG ++ ++ + +A+ F + + CH+CFR+
Sbjct: 2 ENVVIFDSPGKGRGLKAAKEFWAGDVIFSEASL---AAVVFDSLAERVCHSCFRR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C A A +A ++ N N A + L R +
Sbjct: 57 LQRCGQCKFAHYCDRTCQRAGWAEHKQECSAIKAYGKVPNENIRLVARI-LWRLDKDGST 115
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
++ L ++ + I + + + ++L T++ + + N
Sbjct: 116 VSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDISHIFGVINCN 175
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDI 236
F + Q G ++V G++ NHDC PN V+ + I
Sbjct: 176 GFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRI 229
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDN 290
+R + + +G E+ ++Y RQ+ L Y F C C+ CK ++ +
Sbjct: 230 ELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREV 289
Query: 291 DNDDENNEEVMD-EDQDEQMVASDDDAEVHGD 321
D + E+V + D QM+ D A ++GD
Sbjct: 290 DGVKPSEEQVKEATDYCYQMLEKMDKARLNGD 321
>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
Length = 532
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 126/297 (42%), Gaps = 52/297 (17%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS--- 66
VA E GR L +T+ +K G+ +LR++P+LL P + S+ C C R +
Sbjct: 50 VATNETLGRHLRATRDVKIGEEILREAPLLLG---PKVASAPICLGCHRNLLAPQKQRGN 106
Query: 67 --ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--- 121
C SCS C +C SSH CQ ++ N S D + ++ +
Sbjct: 107 YYKCSSCSWPLCGRECE----ESSHHRAECQLMSG-SNFQSKINYTPGEDERKESAYCVI 161
Query: 122 -LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH---------YLSSLCPPPASTTIEL 171
L+ L +P F L A + + + + + ++ ++ + +++
Sbjct: 162 MLLRCMQLKASDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDI 221
Query: 172 --TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFDYVDA 228
AA+L N F L +P RR VRA ++ A+ +HDC PN RFD
Sbjct: 222 LRIAAILDT---NTFELRQPR-----ERRKVRA--LFPGAAMISHDCAPNMRHRFD---- 267
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYD--FACDCDRCK 282
++ +II + +G + +SY P+ ST Q+RL F C CDRC+
Sbjct: 268 ---DDMNIIFLAKRPIAKGEILTISYTQPLR---STIQRRLHLRQAKCFDCACDRCQ 318
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/320 (22%), Positives = 114/320 (35%), Gaps = 50/320 (15%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
RG GL ++ +L+ G ++ R P L Y+ + + C C C C VA
Sbjct: 14 RGNGLRASAALRPGDLLFRSDP-LAYTVCKG-SRGAVCDRCL--LGKEKLLRCSQCRVAK 69
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------------PLDRQLQARFL 122
+CS KC A C+ L PD+ L P + + F
Sbjct: 70 YCSAKCQKKAWPDHKRE--CKCLKNCRPRYPPDSVRLLGRVVCKLMEGTPSESEKLYSFY 127
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
N+ + + + L T D S L PP+ E A ++ N
Sbjct: 128 DLESNINKLTEDKKEGLRQLAMTFQDFMREEIQDASQL--PPSFDIFEAFAKVIC----N 181
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
+F + D G+Y S NH C PN + N +++R +
Sbjct: 182 SFTIC-------DAEMQEVGVGLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRAVR 227
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-----------EANWSDND 291
D+ G E+ + Y + R+K+L Y F CDC RC+ E W +
Sbjct: 228 DIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCRTQDKDADMLTGNEQAWKEVQ 287
Query: 292 NDDENNEEVMDEDQDEQMVA 311
+ EE+ + EQ++A
Sbjct: 288 ESLKKIEELKAHRKWEQVLA 307
>gi|449524736|ref|XP_004169377.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 346
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 117/292 (40%), Gaps = 22/292 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ + GRG+ +T+ + AG+++ P++ + +L I+ C+ C +K ++
Sbjct: 60 IRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHH--VCNFCLQKLQRYANVD 117
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
+ +FCS +C + H + + N PL + A +I+
Sbjct: 118 SDARRASFCSEECEQHS-KVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGAM 176
Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA-STTIELTAALLAKDRLN 182
+L ++ PS+ + + + A +++ T E +LA+ R+N
Sbjct: 177 SSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRIN 236
Query: 183 AFGL--------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
AF + + + +Y+ SF+NHDC PN NN
Sbjct: 237 AFRIELAGGYEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPN------THIIWINNA 290
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ ++ + DV E+ + Y + DY RQ L + F C+C RC N
Sbjct: 291 NAKLKALRDVDPDEELRICYIDTSMDYDARQTLLHRGFGFICNCARCSYGIN 342
>gi|348688194|gb|EGZ28008.1| putative SET-domain transcriptional regulator protein, Zn-binding
site [Phytophthora sojae]
Length = 457
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 2/80 (2%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYS 262
G++L + FNHDC PN +F DA AE VR+ + +G ++ +SY +P
Sbjct: 205 GVFLYNAMFNHDCNPNCVKFTPEDAGAEGGVS-EVRVARPIAKGEQLTISYLYPREQSRE 263
Query: 263 TRQKRLLYDYDFACDCDRCK 282
RQK L + F C C+ CK
Sbjct: 264 NRQKNLWEQFGFECSCELCK 283
>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 367
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 62/301 (20%), Positives = 117/301 (38%), Gaps = 53/301 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK---------------T 60
RGRGL + +++GQ++ + P + Y +L I++ + C++C +
Sbjct: 68 RGRGLFTKLDVQSGQVIYTEKPFVSYQSLE-IDNKTICNHCLKSLDKDKIEKSKQKKDDD 126
Query: 61 MHSSSSICPSCSVAFCSPKCSTAAASSSH--SPYVCQALTRLLNLNSPDAANLP-LDRQL 117
S C CS +CS +C + + H S + +L + + P L ++
Sbjct: 127 FDDLFSKCVKCSTRYCSEQCKSESDIQYHLASCPSTSGFSEILKYTAVEKRRFPILAAKI 186
Query: 118 QARFLIAAY---------NLAVVNPSQFQILLA-------FQGTVTDNDTSAAHYLSSLC 161
AR L+ + NL V++ ++ L FQ + +++ +
Sbjct: 187 LARILLGYHFQKNMEHWENLQVLSFAKRDPPLEWKDDYNLFQRALLTKESNKKRF----- 241
Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
+ ++ +N G+ E + S G++ SF NH C PN
Sbjct: 242 ------NYDWFVRVMQILYINTLGIEVGATEPKAASTS-SGIGLFYLTSFINHSCDPNCY 294
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
D A + + + G E+ ++Y N DY RQ L +Y F+C+C +C
Sbjct: 295 LAFPTDHTAH------LTALKPLKAGDELLIAYGDPNKDYIDRQSHLFDNYGFSCNCSKC 348
Query: 282 K 282
+
Sbjct: 349 Q 349
>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
nagariensis]
Length = 729
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 18/110 (16%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + + NH C PNA D ++VR +P+GRE+ L++ +
Sbjct: 616 GVWPEVALINHSCGPNAVAVMLYD-------RLVVRATRRIPRGREVLLNWLGSQGALAP 668
Query: 264 ---RQKRLLYDYDFACDCDRCKVEAN--------WSDNDNDDENNEEVMD 302
R+ +L YDFAC C RC++E + +SD + E E M+
Sbjct: 669 ARERRAQLSQMYDFACRCARCRLEGSEAVSLPFLYSDAERGSEEALEAME 718
>gi|72392603|ref|XP_847102.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175616|gb|AAX69749.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803132|gb|AAZ13036.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 399
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/354 (22%), Positives = 125/354 (35%), Gaps = 88/354 (24%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRK--------- 59
VA +G+ +++T + A V ++P++ + + ++ FC C R+
Sbjct: 54 VANTVEKGKHILATADMPAQYSVFEEAPVVSWPSQGYLTLGVPFCLFCLRQRNNADIADE 113
Query: 60 TMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL------ 113
H+ +S C C FCS C A+A H +C L L D P+
Sbjct: 114 GEHNWNS-CAGCGSLFCSESC-LASAEMPHR-VLCGVLQELRKRECTDGCCRPITKESLA 170
Query: 114 ---------------DRQLQARFLIAAY--NLAVVNPSQFQILLAFQGTVTDNDTSA--- 153
RQ + L + N ++ F ++ A D +A
Sbjct: 171 NCVAWVVGRIAGAIKQRQFSGKLLEENHRENANSISRQLFHVVTAPFNRFLDAPKNAEFA 230
Query: 154 --------------------AHYLSSLCPPP---ASTTIELTAALLAKDRLNAF-GLMEP 189
A L S PP A +E+ LL D L F G++
Sbjct: 231 DVDANSWYEEIDKLLREPCRAVLLQSAAAPPSIGADWALEIVDGLLRHDTLERFLGILTL 290
Query: 190 YIEGQDGRRSV------------------RAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+G +G +V + GIY S FNH C+PN V AE
Sbjct: 291 NSQGLNGFVAVPGAAGDMDSPSLPITWVLKGGGIYSLQSAFNHSCVPN------VAVLAE 344
Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
T DI +R + + G E+ ++Y PV ++ L Y F C C C+ E
Sbjct: 345 GGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQMKLEGYFFTCRCPLCEEE 398
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRALARLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVIFKLMEETPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q++L FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQL------PPSFDIFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 115/300 (38%), Gaps = 51/300 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVA 74
RG GL + L+ G+++ R P L Y+ ++ S C R + + C C VA
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYT----VSKGSRGVVCDRCLLGKEKLMRCSQCRVA 68
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLIA 124
+CS KC A C+ L PD+ L L R + ++ L +
Sbjct: 69 KYCSSKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEGSPSESEKLYS 125
Query: 125 AYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
Y+L Q+++ FQ + + A+ PPA E A ++
Sbjct: 126 FYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDIFEAFAKVI 179
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
N+F + ++ G+Y S NH C PN + N +
Sbjct: 180 C----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SVVFNGPHL 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
++R + DV G E+ + Y + R+K+L Y F CDC RC+ + +D DE
Sbjct: 222 LLRAVRDVEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGDER 281
>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 359
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 48/94 (51%), Gaps = 4/94 (4%)
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
E Q R+S R G+Y FNH C PN + Y ++ E + + +R D+ G E+C
Sbjct: 247 ELQHLRQSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---DIEPGEELC 302
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+SY + Y RQ+ L Y F C C +C+ EA
Sbjct: 303 ISYIDESLSYPERQQELYEHYLFFCQCPKCEREA 336
>gi|350632678|gb|EHA21045.1| hypothetical protein ASPNIDRAFT_45712 [Aspergillus niger ATCC 1015]
Length = 367
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/320 (22%), Positives = 121/320 (37%), Gaps = 70/320 (21%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI----NSSSFCHN--CFRK-TMHSSSSIC 68
+GR L + ++AG++++ D P Y+ LP + S C N C R+ + H+++S+
Sbjct: 62 KGRTLSTNGPIRAGELLMADLP---YAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118
Query: 69 --PSCS--VAFCSPKCS---------TAAASSSHSPYVCQAL----------------TR 99
CS + +C+ C A +SP + Q L R
Sbjct: 119 CETGCSLDIWWCNESCKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAAR 178
Query: 100 LLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN-------PSQFQILLAFQGTVTDNDTS 152
L L N+ Q RF+ + ++ SQ Q A
Sbjct: 179 YLELQGTTPTNVHQQLTFQDRFVSGWKGVQLLRSNRDSWPTSQLQHWSAL---------- 228
Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIY 206
A YL P + ++ L+ + N F L + P+ R + ++
Sbjct: 229 AEKYLLQSSLLPETLDLDTLVDLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVF 288
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTR 264
L+ + NH C PN V A++ ++V + D+ G E C SYF ++ D R
Sbjct: 289 LRITLANHSCAPN------VTHQADDRGRMMVTALRDIAPGEECCTSYFDLSEYVDLQAR 342
Query: 265 QKRLLYDYDFACDCDRCKVE 284
+K+ + F C C RC E
Sbjct: 343 RKKTQELFTFTCTCPRCLQE 362
>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 521
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 15/111 (13%)
Query: 175 LLAKDRLNAFGLMEPYIEGQ--DGRRS-VRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+L ++R NAF YI+G+ DG R G+Y++ SFFNH C PN V+
Sbjct: 333 ILLRNRCNAF-----YIQGRPRDGSSGESRGCGVYVRNSFFNHSCDPN------VNYWVV 381
Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
NNT ++ ++ +V +G E+ +SY + R+++LL Y F C C +C
Sbjct: 382 NNTLEVECTLLKNVKEGDELTISYIDTTSPLNKRREKLLEGYLFNCLCTKC 432
>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
Length = 333
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 6/86 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ S+ NH CLPNA F Y D ++ R + + +G EI SY + + S
Sbjct: 71 GIFPTGSYLNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSE 124
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
R+K LL Y F C C +CK + +D
Sbjct: 125 RRKHLLKQYFFFCQCQQCKFQKKLAD 150
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 109/300 (36%), Gaps = 49/300 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
V + + +GRGL +T+ + GQ VL+ P I+ CH C + +
Sbjct: 6 VELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDV--VCHYCLCAPGQPGAPV 63
Query: 68 -----CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A +C+ +C A C A+ R+ D L + R L R
Sbjct: 64 EDLHRCTGCKFAQYCTKECQKKAWPEHKQE--CAAIKRITPGKPVDQTRL-VGRILWRRK 120
Query: 122 LIAAYN-------------LAVVNPSQFQILLAFQGT-----VTDNDTSAAHYLSSLCPP 163
N + +V + + L+ + + + S Y + P
Sbjct: 121 REENLNGEKKDGKENDEKKVELVKIEELEDHLSKRNAEEKEAIDEKVYSFGDYFTYDEMP 180
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR--AYGIYLKASFFNHDCLPNAC 221
+ E A L A NA GL + R V+ GIY S NHDC PN
Sbjct: 181 DSD---EEMAHLFAIIDCNAIGL--------NDHRGVQTIGVGIYPGISMLNHDCSPNC- 228
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
A N + VR + + G E+C+SY R+++L Y F C+CD C
Sbjct: 229 ------VAMNNGPRLEVRALRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTC 282
>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
Length = 513
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 38/273 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCH-NCFRKTMHSSSSICP------ 69
GRG+ +T+ ++AG+++ D P+L+ C+ CF + + CP
Sbjct: 48 GRGMFATRDIQAGELIFTDVPLLI---------GPRCYRKCFPMCVVCYKNNCPLFPCDH 98
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
C + CS +C + S RLL P + + L A I L+
Sbjct: 99 DCGLPICSTECENSTMHSQEC--------RLLREWMPTCGSSWSNELLLAVIPIRGLTLS 150
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLME 188
Q ++L AF+ + T L ++ P +EL + N+F ++
Sbjct: 151 ---KEQRKLLYAFECHLELTRTHEIDLLKRNVKKFPNKEQLELMKRICGVFNTNSFEVL- 206
Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
+D S+R G+Y + NH C+PN R + D + T + + G
Sbjct: 207 -VASNKDCTTSLR--GLYPLGALQNHCCVPNT-RHHFDDQQQLHVTATL-----PIAAGE 257
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
EI +SY + +D S+R++ L F+C+C RC
Sbjct: 258 EITMSYTDLLWDTSSRRQFLKATKHFSCNCSRC 290
>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
Length = 456
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 108/288 (37%), Gaps = 33/288 (11%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--RKTMHSSSSI 67
V + GRGR + +T++L GQ VL + P L + + C +CF K H ++
Sbjct: 4 VVGVPGRGRAVRATKALARGQTVLLNPP-LAFVLRHEERVARRCEDCFVSEKPEHRLANC 62
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSP----DAANLPLDR--QLQARF 121
+ A+CS C T +H +VC+ L L P DAA + + L+ R
Sbjct: 63 SLCHTAAYCSKPCQTRNWKRAHK-HVCKLLQTLPENPQPPHIIDAAAMTVATLVALERRA 121
Query: 122 LIAAYNLAVVNP-----------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
+ +P + F + T+ + L ++ P ST +
Sbjct: 122 KLEDKESEQASPDPGSAVRQPRCADFWAMAQHTPTLNSEELDDVLQLVAVTQCPGSTDKQ 181
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
+L + N F + + + R G+Y + NH C PN C Y
Sbjct: 182 RVMDVLQRADCNNFSIWDELL-------LPRGAGVYPWGAILNHSCEPN-CVMTYRGPLH 233
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
V+ + D+ G E+C SY + R L Y F CDC
Sbjct: 234 AQ----AVKALRDIAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDC 277
>gi|392580063|gb|EIW73190.1| hypothetical protein TREMEDRAFT_73008 [Tremella mesenterica DSM
1558]
Length = 593
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/383 (21%), Positives = 140/383 (36%), Gaps = 76/383 (19%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSS-----S 65
+GRG+GLV+ Q + G+++ ++ P ++ + PF+ S C CF M S S
Sbjct: 179 DGRGKGLVAKQRILQGEVLWQEEPWVVTADANLYPFLLSQRMCTQCF--AMFSRPNPPLS 236
Query: 66 SICPSCSVA-FCSPKC-STAAASSSHSPYVC----------------------QALTRLL 101
C SC +A FC+ C S A AS +H +C +A++R++
Sbjct: 237 VPCGSCDMAHFCNRLCLSKARASGAHHDLLCPGQNPAAMELLTLIHKQGSRPLEAVSRII 296
Query: 102 N--LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL-- 157
+ L +++ R ++ + F G + + AH L
Sbjct: 297 AKWRGEREWGGLEKGEEVRKRVWGGMARVSELEKESQMREWEFIGELRLKEWHHAHELLI 356
Query: 158 SSLCPPP-------------ASTTIELTAALLAKDRLNAFGLMEPYIE--GQDGRRSVRA 202
L PP + +L A L+ + + + ++E G G +
Sbjct: 357 KVLNPPSDDPGYKAFHKLLTSRRPKKLQAVPLSAEEAAQWFSFDSFLELLGLVGLNQEDS 416
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI-------------------HD 243
GIY + FNH C PN + + + + R I +
Sbjct: 417 GGIYALHAHFNHSCDPNIQARNLPKSFTPPTSSELPREIPPPNAQGVRGTNKLTMLARRN 476
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
+ G E+ LSY + R++ L Y F C C RC E N N + DE
Sbjct: 477 INPGEELTLSYVNIALPKWHRRRLLREQYGFWCSCPRCGREKE-EPPQNKMMNGGKSRDE 535
Query: 304 DQDEQMVASDDD---AEVHGDTN 323
+ E +V + AE G T+
Sbjct: 536 VEAEDIVRRIQEMSMAESQGGTD 558
>gi|261330291|emb|CBH13275.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 399
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 80/354 (22%), Positives = 125/354 (35%), Gaps = 88/354 (24%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRK--------- 59
VA +G+ +++T + A V ++P++ + + ++ FC C R+
Sbjct: 54 VANTVEKGKHILATADMPAQYSVFEEAPVVSWPSQGYLTLGVPFCLFCLRQRNNADIADE 113
Query: 60 TMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL------ 113
H+ +S C C FCS C A+A H +C L L D P+
Sbjct: 114 GEHNWNS-CAGCGSLFCSESC-LASAEMPHR-VLCGVLQELRKRECTDGCCRPITKESLA 170
Query: 114 ---------------DRQLQARFLIAAY--NLAVVNPSQFQILLAFQGTVTDNDTSA--- 153
RQ + L + N ++ F ++ A D +A
Sbjct: 171 NCVAWVVGRIAGAIKQRQFSGKLLEENHRENANSISRQLFHVVTAPFNRFLDAPKNAEFA 230
Query: 154 --------------------AHYLSSLCPPP---ASTTIELTAALLAKDRLNAF-GLMEP 189
A L S PP A +E+ LL D L F G++
Sbjct: 231 DVDANSWYEEINKLLREPCRAVLLQSAAAPPSIGADWALEIVDGLLRHDTLERFLGILTL 290
Query: 190 YIEGQDGRRSV------------------RAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+G +G +V + GIY S FNH C+PN V AE
Sbjct: 291 NSQGLNGFVAVPGAAGDMDSPSLPITWVLKGGGIYSLQSAFNHSCVPN------VAVLAE 344
Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
T DI +R + + G E+ ++Y PV ++ L Y F C C C+ E
Sbjct: 345 GGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQMKLEGYFFTCRCPLCEEE 398
>gi|383866011|ref|XP_003708465.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 636
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 127/335 (37%), Gaps = 73/335 (21%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKT 60
A S+ V++A E GR LV+T+ K G IV + P A S F CH+C ++
Sbjct: 240 AASDGVSIAFSEKYGRHLVATKEFKPGDIVTIEDPF----AFTIYMQSYFTHCHHCLARS 295
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ C C VA +CS C A +H C L + NL + D + + ++
Sbjct: 296 FNLIP--CLKCPVAQYCSETCRKLAWEMAHQ-IECPILALVGNLLNVDKNKIRMLTKI-V 351
Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
RFLI A +A NP +G + + +A L++ L
Sbjct: 352 RFLIVATAKGKKIQELLEDMKVAESNPDNRTAGFTDEGILDSTSSRSALSLATNMTTRPL 411
Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
A I AA+L + N F + +D G RA
Sbjct: 412 IGISAFACISALAAILLATQTNFFCKKYEVDQLRDINDLPEIKFCGSIMFRACVIMCSNC 471
Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
G+Y+ S +NH C PN R F+ +I R + + G +
Sbjct: 472 FLVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHFE--------GMTMITRALEPIFPGDQ 523
Query: 250 ICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRC 281
I +Y +Y Y TR +R ++ DY F CDC C
Sbjct: 524 IFTNYC-ASYAYMTRSERREKIMQDYFFECDCIAC 557
>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
Length = 515
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 119/306 (38%), Gaps = 53/306 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDS-PILL---YSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+GRGL +T+ ++ G+++ + PI + L I S C C +SS
Sbjct: 126 KGRGLYATRDIQQGELLFHEKVPIAMVPPMDKLKLIRSGKSCSMCGVSLSNSSHFTMLHG 185
Query: 68 --CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C C+ +CS C + PY+ ++ N+ P N+ +R Q +AA
Sbjct: 186 LDCNGCNSIWCSTNCKQKDITH---PYLKHFGSKNKNIR-PMDWNM-FERHCQENIFVAA 240
Query: 126 YNLAVVNPS-------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
Y++ V++ + QF+ L V + + + + S E
Sbjct: 241 YSIGVIHAASLIDKAQSDEINQQFEALAKISQRVRYESSDSNNIGGTFDAEIGSLKEENP 300
Query: 173 AALLAKDRLNAFGLM------------EPYIEGQDGRRSVRAYG--IYLKASFFNHDCLP 218
L K AF L E ++E GR + +Y SF NH+C P
Sbjct: 301 EPLWKK----AFDLFIRTFPDTAEMGYEVFLEYL-GRFHINQLSGQLYFLYSFLNHNCEP 355
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
N R+D N ++ V + + E+ +Y + S R++ L ++ F C+C
Sbjct: 356 NV-RYD-----INNKLELKVYARKFIKKDEELVTTYVNPLHGVSLRRRELRVNWGFICNC 409
Query: 279 DRCKVE 284
DRC E
Sbjct: 410 DRCAKE 415
>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
Length = 746
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 16/109 (14%)
Query: 181 LNAFGLMEPYIEGQDGRRS-------VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
LN FG P + +D R + G++L A++ NH C+PNA R D
Sbjct: 476 LNGFGC--PRLRSRDKERMNVQEGGPESSTGMWLHAAYANHTCIPNATRAFIGDM----- 528
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+IVR D+P G EI + Y + + +R+ + Y F CDC+ C+
Sbjct: 529 --MIVRAARDIPAGAEIFMGYASLAEPFESRRSKFKTSYGFECDCEMCR 575
>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 448
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 14/122 (11%)
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
I + + K R N FG+ +D V I AS+FNH C PN VD
Sbjct: 263 ISMIRPITCKIRCNQFGIWSK----KDKCLGVSVTPI---ASYFNHSCCPNI-----VDV 310
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
T + + +H +P+G ++C+SY ++ +RQ L+Y Y F C C RC + +
Sbjct: 311 RG--TTLLEFKALHFIPKGSQLCISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKGDSI 368
Query: 289 DN 290
DN
Sbjct: 369 DN 370
>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
CCMP2712]
Length = 98
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 16/113 (14%)
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDA 228
E+ A++++K + N G+++ ++GQ+ +G+Y ASF NH C PNA FD
Sbjct: 1 EVIASMISKAQRNMHGIVD--LKGQN-----FGHGLYPLASFINHSCEPNAIISFD---- 49
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
++VR + ++P+G EI ++Y + R+ LL F C C RC
Sbjct: 50 ----GNKLVVRALENIPRGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98
>gi|414871461|tpg|DAA50018.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 173
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 15/123 (12%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLP 218
TI+ +LA+ R+NAF + +D S A +Y+ SF+NHDC P
Sbjct: 53 TIDWYINVLARIRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDP 112
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
NA N D ++ + D+ +G E+C+ Y + D RQK L + F C C
Sbjct: 113 NA------HIVWLQNADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRC 166
Query: 279 DRC 281
RC
Sbjct: 167 LRC 169
>gi|134074941|emb|CAK47970.1| unnamed protein product [Aspergillus niger]
Length = 380
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 44/297 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSAL--PFINSSSFCHN--CFRKTMHSSSSI-CPS 70
+GR L ++Q ++ G++++ D P L + P + S C N C R+T HSS I CP+
Sbjct: 45 KGRQLRASQPIRKGELLMIDVPYALIPVVDDPASSDSLLCSNPTCSRQTQHSSGRISCPN 104
Query: 71 ---CSVAFCSPKCSTAAASSSHSPYVCQALTRLLN-----LNSPDAANLPL------DRQ 116
V +CS C A + C L R + D + L RQ
Sbjct: 105 RCLADVVWCSSTCQEA--DQLRHEFECTWLQRYASPIRAKWGEYDFGMMWLIVRILASRQ 162
Query: 117 LQARFLIAAYNLAVVN-PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPAST 167
+ + L+ A N ++ + +F G++ + YL S P
Sbjct: 163 TERQQLLDANNETTQRFKGGWEAIQSFCGSIESWSHAQVRSWTALVKKYLRSSPILPHDL 222
Query: 168 TIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
+++ AL+ ++ N+FGL + P E R +Y +A+ NH C PN
Sbjct: 223 SVDEIVALICREEANSFGLYPRETGVYPVPEPPVDRGEQFGAAVYPRAAIANHSCSPN-- 280
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDFAC 276
+ +++ ++ D+ G E C+SYF ++ D +R+ L + F C
Sbjct: 281 ----IMHKPDHHGRMVFTASKDIAAGEECCISYFDLSKRVDLKSRRDHLQGLFRFVC 333
>gi|134108020|ref|XP_777392.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819406|sp|P0CR43.1|SET5_CRYNB RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|50260082|gb|EAL22745.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 449
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 14/104 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
G+G+GLV+ + LK G+++ ++ P ++ S + S C CF S I C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPISVPC 162
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
P C+ A FC+ C T + SSSH P +C L +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199
>gi|190347709|gb|EDK40035.2| hypothetical protein PGUG_04133 [Meyerozyma guilliermondii ATCC
6260]
Length = 338
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)
Query: 151 TSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK 208
T YL C P TI+ ++ NAFG+ + G+D + +Y
Sbjct: 211 TRMYKYLKLTCGPQLQPFITIDNIRRIIGTSLTNAFGIWSSTVPGED--KEYFGCALYPS 268
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKR 267
ASFFNH C N R + I H V QG E+C+ Y DY TRQK
Sbjct: 269 ASFFNHSCSANVSR-------TRHGRSISFVTSHPVLQGEELCIQYGNHTTEDYHTRQKD 321
Query: 268 LLYDYDFACDCDRCKV 283
L ++ F C C +C++
Sbjct: 322 LK-EWFFECGCKKCEM 336
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 114/274 (41%), Gaps = 38/274 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
RGRG+ ST+S G ++L +P +L +L IN + C +CF++ + S C +V++
Sbjct: 14 RGRGIFSTKSFVPGDVILPFTPRILIPSLSHIN--NICSHCFKQAEVRACSRCH--AVSY 69
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
C C A ++ HS C+ L ++ P LP I A A+V P
Sbjct: 70 CDAACQAANWTAVHSKE-CKVLRKVTEQGRP---GLPTP--------IRAVVQALVKPEI 117
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD-RLNAFGLMEPYIEGQ 194
+ +G T S + AS L +D + NAF + + GQ
Sbjct: 118 GNAIEDLEGNGTSWRKSEKWADMEMMAMGASAFAGLGTG--QEDIQTNAFHRYDTDL-GQ 174
Query: 195 DGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
GI+L K + NH C+PNA ++ A I+R + EI +
Sbjct: 175 --------VGIFLEPKLAMANHSCIPNAM-VQFIGRKA------ILRAEKPIKIDDEIEI 219
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
SY + S R K L Y F C C RC+ + N
Sbjct: 220 SYTDYTFPLSKR-KLALAPYFFDCMCLRCEKDLN 252
>gi|392566265|gb|EIW59441.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 452
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 202 AYGIY-LKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
A+G++ L + FNH C+PNA C+ Y+ A+E +V + D+ +G EI + Y
Sbjct: 198 AHGVFPLASRLFNHSCIPNAACK--YIIRASEPVAMQVV-ALRDIAEGEEITIPYLDPAL 254
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
Y TRQ+ L +Y F CDC C+ ++ + E + +
Sbjct: 255 PYQTRQEALEVNYSFNCDCRLCRFQSGIHPVNAPPERGSDAL 296
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 15 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 67
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 68 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 124
Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L + +N Q+++ FQ + + A+ PPA E A
Sbjct: 125 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 178
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 179 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 220
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 221 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 276
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 277 LTGDEQVWKEVQE 289
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|46115820|ref|XP_383928.1| hypothetical protein FG03752.1 [Gibberella zeae PH-1]
Length = 358
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 72/327 (22%), Positives = 122/327 (37%), Gaps = 59/327 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---FCHN--CFRKTMHSSSSI-CP 69
+GR L +T ++ G I+L D+ Y+ +P + S + C N C R+ + ++ C
Sbjct: 45 KGRLLRATGPIREGTILLVDTA---YAIVPSVTSDAEPLICSNLSCSRRVPQNGRAVRCE 101
Query: 70 SC---SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
+ V +C+ C A+ Y C L + + + L+A +
Sbjct: 102 NACFKDVVWCNIACR--ASDKLRHDYECAWLKKNADSMRRKESEYNFATLWHVVRLLATW 159
Query: 127 NLAVVN---------PSQFQILLAFQGT--------------VTDNDTSAAHYLSSLCPP 163
N + + P + L ++ +T YLS
Sbjct: 160 NAELHSGNAVAQHRYPWEAHFLRGWKAVDMCCAYLDSWPEEQLTHWKRLVNEYLSDTSVL 219
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCL 217
P T + LL K+ N FGL Q R + +Y +A+ FNH CL
Sbjct: 220 PCMLTADDMLLLLCKEETNTFGLYPKATGTQFVNDNATSRGESYGFALYPRAAQFNHSCL 279
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFA 275
PN V + ++ D+ +G E ++YF + D S+RQK + F
Sbjct: 280 PN------VSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQKYAQTQFQFK 333
Query: 276 CDCDRCKVEANWSDNDNDDENNEEVMD 302
C C+RC E + + N ++MD
Sbjct: 334 CTCNRCLEE--------EADENMDIMD 352
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123
Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L + +N Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 120/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 16 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 68
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 69 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 125
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+++ FQ + + A+ PPA E A
Sbjct: 126 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 179
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 180 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPN-CSIVF------NGP 221
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 222 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 277
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 278 LTGDEQVWKEVQE 290
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 123/313 (39%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 50 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 102
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 103 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 159
Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L + +N Q+++ FQ + + A+ PPA E A
Sbjct: 160 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 213
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 214 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPN-CSIVF------NGP 255
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 256 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 311
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 312 LTGDEQVWKEVQE 324
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + EG+GRGL + + AG ++ + Y+A+ F + + CH CF++
Sbjct: 7 ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKR--QER 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C AA S + C A+ R + +P N+ L ++ +
Sbjct: 62 LHRCGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKMER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALLAK 178
L+ +V + Q + G D A +L P ++L + +
Sbjct: 117 EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 177 ISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHT 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R ++ + G E+ +SY R+++L Y F C C+ CK
Sbjct: 231 QMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 280
>gi|403165497|ref|XP_003325490.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165747|gb|EFP81071.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 691
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 6/104 (5%)
Query: 203 YGIYLKASFFNHDCLPNACRFDYV---DAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
+G+YL S FNH C PNA + D + I V D+ + EI LSY +
Sbjct: 539 HGLYLLHSRFNHSCDPNAVVLKHPGSEDIHKSAPSTIYVMARKDIAKDEEITLSYINPDL 598
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
R+++L DY F+C C RC+ E + + + ++ +E+MD+
Sbjct: 599 SLDARRRKLFEDYLFSCFCSRCRKEL---EQEEERKHYQELMDK 639
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 116/303 (38%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K MH C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMH-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+ + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|170067842|ref|XP_001868639.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863902|gb|EDS27285.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 504
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 37/286 (12%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSI 67
+A EG GRGLV+T+ + +++ D PIL+ P +N+ FC +C R + ++
Sbjct: 45 IAVFEGYGRGLVATRDIAVNELIFLDRPILVG---PRVNNYDVIFCASCCR--IQKRLTL 99
Query: 68 CP-SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
C C + C+ C +AA + C +T + P D++ + ++ A
Sbjct: 100 CSGGCRLPICA-DCDRSAAPETPHTAECTVIT----------SWQPKDQKRYCKTILYAL 148
Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAA-------HYLSSLCPPPASTTIELTAALL 176
++N + +I+ +G D ++L P S + ++
Sbjct: 149 TSIRAMLLNDLESRIVFNMEGHAPRKDMITEIDRLLKDQIFANL--PEGSANLTFLRRIV 206
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
NAF + ++ + G+Y+ + NH CL N V + N +
Sbjct: 207 NVLNTNAFETFRIVQDEENNDHEIILRGLYILGALMNHHCLAN------VRYVFDENQVM 260
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ +G +I +Y V + R L + F C+C+RC+
Sbjct: 261 WCHASRPIRKGEQIFNNYSKVLWGTQHRIIHLWFSKHFLCECERCR 306
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 69/283 (24%), Positives = 112/283 (39%), Gaps = 37/283 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMHSSSSICPSCSV 73
+GRGL T+ L AG++V ++ A ++S S CH+CFR+ ++ C C
Sbjct: 17 KGRGLRGTRDLSAGEVVFAEASF----AAVVLDSLSLQVCHSCFRRQVNPHR--CAQCKF 70
Query: 74 A-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
A +C C AA C A+ N+ N+ L ++ R + +V+
Sbjct: 71 AHYCDRTCQRAAWDEHRKE--CSAIR---NIGKAPNENVRLVARILWRI---QKHTGLVS 122
Query: 133 PSQFQILLAFQ---GTVTDNDTSA----AHYLSSLCPPPASTTIE-LTAALLAKDRLNAF 184
SQ L + +T D + P + E + L N F
Sbjct: 123 DSQLTTLDMLEDHLSRMTPEDLKELKADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGF 182
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDIIV 238
L + Q G +SV G++ NHDC PN +DA+ ++ I +
Sbjct: 183 TLSD-----QRGLQSV-GIGLFPNLCLVNHDCWPNCTVILNHGDQSALDASFHSSRRIEL 236
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R + + G+E+ +SY + RQ+ L Y F C C+ C
Sbjct: 237 RALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHC 279
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 114/284 (40%), Gaps = 30/284 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V E +GRGL +T+ A ++ + YSA+ F + + CH CF++
Sbjct: 7 EKVEVFTSESKGRGLKATKEFWAADLIFSERA---YSAVVFDSLTHLVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ + + +P+ N+ L ++ R
Sbjct: 62 LQRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKK--HGKAPNE-NIRLAARIMWRIER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
L +V+ Q + G + +L P +++ + +
Sbjct: 117 EGTGLTEGCLVSIDDLQNHVDSFGEEEQKELRLDVESFLHFWPPQSQQFSMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N F L + Q G ++V G++ NHDC PN C + NN I +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGVFPNLCLVNHDCWPN-CTVIF------NNGKIEL 223
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
R + + +G E+ +SY + R+K L Y F C C+ C+
Sbjct: 224 RALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCE 267
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123
Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L + +N Q+++ FQ + + A+ PPA E A
Sbjct: 124 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275
Query: 295 -ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 276 LTGDEQVWKEVQE 288
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + EG+GRGL + + AG ++ + Y+A+ F + + CH CF++
Sbjct: 7 ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKR--QER 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C AA S + C A+ R + +P N+ L ++ +
Sbjct: 62 LHRCGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKMER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALLAK 178
L+ +V + Q + G D A +L P ++L + +
Sbjct: 117 EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 177 ISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHT 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R ++ + G E+ +SY R+++L Y F C C+ CK
Sbjct: 231 QMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 280
>gi|380491929|emb|CCF34973.1| hypothetical protein CH063_00009 [Colletotrichum higginsianum]
Length = 412
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 14/123 (11%)
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
TT E L ++D NAFG+ EG + + +G + AS+FNH C PN + V
Sbjct: 297 TTPESLFLLSSRDSHNAFGIRSLEDEGSE----LFGFGCWPAASYFNHSCGPNVEK-KRV 351
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRCKV 283
A E R ++V +G E+C++Y +R+ R+L ++ F C C RCK
Sbjct: 352 GRAWE------FRAANNVEKGGELCITYLGGEERKLSREARMLTLRRNWGFECQCKRCKE 405
Query: 284 EAN 286
E
Sbjct: 406 EGG 408
>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
Length = 403
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 6/86 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ S+ NH CLPNA F Y D ++ R + + +G EI SY + + S
Sbjct: 141 GIFPTGSYLNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSE 194
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
R+K LL Y F C C +CK + +D
Sbjct: 195 RRKHLLKQYFFFCQCQQCKFQKKLAD 220
>gi|195376269|ref|XP_002046919.1| GJ12224 [Drosophila virilis]
gi|194154077|gb|EDW69261.1| GJ12224 [Drosophila virilis]
Length = 661
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/314 (22%), Positives = 131/314 (41%), Gaps = 44/314 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+S + V + E + +GR +V+ + LK G +VL ++P+ F ++ CH+CF++
Sbjct: 250 LSGAANVVKLVETKEKGRFVVANEGLKTGDVVLCENPVAACLLPNFFGTN--CHHCFKR- 306
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+H C CS +AFCS +C A + H + CQ + + + L QA
Sbjct: 307 LHIPVP-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCFVALRVFTQA 364
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPP------------ 163
L A + F+ L + Q +D + A +L +
Sbjct: 365 ASLEDGLTTANL---LFEHLCSHQDVREADDYLQRSLMAGFLMRILQKALYFGRRKTEGV 421
Query: 164 -PASTTIELTAALLAKDRL---NAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
P + +++ ALL ++ NA + + + DG + V G+Y S+FNH+
Sbjct: 422 NPTAVELQVATALLGLLQVLQYNAHEIYHTMVTDEHCFDGCKVVYVGAGLYGTGSYFNHE 481
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYD 273
C P+ + YV +++ + ++Y P+ + RQ+ L Y
Sbjct: 482 CWPSVAGY-YV------GKKLVMSATKPHRPNEIVAVNYGPIFTKMNLKERQRTLRGRYA 534
Query: 274 FACDCDRCKVEANW 287
F+C+C C + NW
Sbjct: 535 FSCNCLTC--QENW 546
>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
niloticus]
Length = 440
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 102/281 (36%), Gaps = 33/281 (11%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL T+ K G+++ S L +C CF +T H + C C A
Sbjct: 23 GKGRGLRVTRPFKVGELLF--SCPAFSHVLSVKERGCYCEFCFTRTQHLAR--CGKCKKA 78
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-----AANLPLDRQLQARFLIAAYNL 128
F C+ KC + C A+ P A + +++Q I L
Sbjct: 79 FYCNVKCQKG--DWAMHKLECSAMVAFGENWCPSELSRLVARILAKKKMQKERCICEKML 136
Query: 129 AVVN-PSQFQILLAFQGTVTDNDTSAAH--YLSSLCPPPASTTIELTAALLAKDRLNAFG 185
+ S + + +T+ D + H Y L P + L ++ N F
Sbjct: 137 LIGEMQSHTEDEDNEKREMTEADIAGLHRFYSKHLDVPDHKELL----TLFSQVACNGFT 192
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ +D S +Y + NH CLP+ T VR + D+
Sbjct: 193 I-------EDDELSHLGTAVYPDMALINHSCLPSV-------IVTFKGTSAEVRAVQDMK 238
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
G E+ +SY + Y R RL Y F CDC CK ++
Sbjct: 239 PGDEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKSQSK 279
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 115/297 (38%), Gaps = 58/297 (19%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
+TV + +GR L +T++ + G+ ++ P Y +P N+S S C CF +
Sbjct: 13 LTVTILPEKGRSLFTTRNFRPGEEIISQEP---YVCVP--NNSPVESRCDRCFASI---N 64
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
C +C VA +CS C + C AL RL + + L + R+
Sbjct: 65 LKKCSACKVAWYCSSVCQKSEWKLHRLE--CDALARLDKDRHKSVTPSIRLMIKLFIRRK 122
Query: 117 LQARFLI---AAYNLAVVNP---------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
LQ+ +I A N +V + +L A GT+ + + P
Sbjct: 123 LQSEKVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLVN----------FILQWP 172
Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
E+ A +K NA + D G+Y S NH CLPNA
Sbjct: 173 GMNVKEI-AVNFSKLACNAHTIC-------DSELRPLGTGLYPVISIINHSCLPNAV--- 221
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
E T + VR + +P G E+ +SY TRQK L +Y F C C RC
Sbjct: 222 ---LVFEGRTAV-VRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRC 274
>gi|449437548|ref|XP_004136554.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 341
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 60/287 (20%), Positives = 115/287 (40%), Gaps = 22/287 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V+ + GRG+ +T+ + AG+++ P++ + +L I+ C+ C +K ++
Sbjct: 60 IRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHH--VCNFCLQKLQRYANVD 117
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
+ +FCS +C + H + + N PL + A +I+
Sbjct: 118 SDARRASFCSEECEQHS-KVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGAM 176
Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA-STTIELTAALLAKDRLN 182
+L ++ PS+ + + + A +++ T E +LA+ R+N
Sbjct: 177 SSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRIN 236
Query: 183 AFGL--------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
AF + + + +Y+ SF+NHDC PN NN
Sbjct: 237 AFRIELAGGYEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPN------THIIWINNA 290
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ ++ + DV E+ + Y + DY RQ L + F C C RC
Sbjct: 291 NAKLKALRDVDPDEELRICYIDASMDYDARQTLLHRGFGFICKCARC 337
>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
Length = 429
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 108/292 (36%), Gaps = 64/292 (21%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCHNCFRKTMHSSSSIC 68
G+G GL + Q + AG+ LLY+A P S CH+CF + H + C
Sbjct: 12 GKGNGLRANQRVLAGE--------LLYTAEPLACCVSSQCRNVVCHSCFSR--HETLLRC 61
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAY 126
C VA +C C A S C LL+L+ +P D +L AR +
Sbjct: 62 SQCKVARYCDVTCQKRAWSIHKRECKC-----LLSLH----PRIPTDSVRLAARIIFRLL 112
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL--------------- 171
+ + P Q + + D LS L S+ ++L
Sbjct: 113 CPSQMTP-QLYSFEEHESHLCDMGEEKREGLSQL-----SSMLQLYLKQEQPDIIQKVPS 166
Query: 172 --TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
+LLAK N F + D G+Y S NHDC P +C +
Sbjct: 167 FDPISLLAKVTCNCFTI-------SDAELQEIGVGLYPSMSLLNHDCRP-SCVMLF---- 214
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +R I D+ E+ +SY V RQ +L+ Y F+C C C
Sbjct: 215 --QGKTLQLRAIRDIQPTEEVTISYIGVLLPTRERQTQLMEQYHFSCQCGLC 264
>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 669
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 132/337 (39%), Gaps = 86/337 (25%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMH 62
+S VT+ +GR V+T+ +K G +++ ++P YS++ + S CH+C+ + +
Sbjct: 240 MSSAVTIESSPDKGRYAVATRDVKVGDVLIVENP---YSSVGLQPCNVSHCHHCYIRVL- 295
Query: 63 SSSSICPSCS-VAFCSPKCSTAAASSSHS----------------------PYVCQALTR 99
+S C C+ + +CS +C A+ H+ + L
Sbjct: 296 -ASIPCLQCAGIVYCSKECRNASWEMYHNLECHHLDLIQELGLGMGHLALRTIIRTGLAF 354
Query: 100 LL-------NLNSPDAA--NLPLDRQLQARFLIAAYNL------------------AVVN 132
LL N+N PD + +D + ++ + + YNL AV
Sbjct: 355 LLKFREQSANVNIPDESFHGCTVDGEYESNYY-SVYNLVGHSEDRKPGDLFKRVVKAVCL 413
Query: 133 PSQFQILLAFQGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGLM 187
Q FQ DN+ A H L+ L P + E++ L + + +
Sbjct: 414 LRCLQQTNFFQSVGADNEEDVAIFIGGHMLTHLQTIPCNAH-EISEYELWRSDITKCHFV 472
Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
E G+Y S NH C P R Y + +VR I ++ +G
Sbjct: 473 EV------------GSGLYPTMSLVNHSCDPVVTRNCYGETC-------VVRAIRNIYKG 513
Query: 248 REICLSY---FPVNYDYSTRQKRLLYDYDFACDCDRC 281
EI +Y +PV +D S RQ RL + Y F C CD C
Sbjct: 514 EEITDNYGYLYPV-HDKSERQTRLKWQYFFECKCDAC 549
>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 674
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 138/362 (38%), Gaps = 96/362 (26%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--R 58
+ + S+ VT+ E RGR LV+T+++KAG I++ +SP + N+ +F NC
Sbjct: 218 LESCSDAVTLQFDEKRGRHLVATRNIKAGYILIVESPFAFST-----NNEAFDRNCLNCH 272
Query: 59 KTMHSSSSICPSC----SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD 114
T+ + S+ C +V+FCS KC A H Y C + ++ + + P
Sbjct: 273 VTLKLNDSVKIPCYFCRTVSFCSEKCRNEAWKLYHR-YECS----IFDIFCGNDSEQP-- 325
Query: 115 RQLQARFLIAAYNLAV-------------VNPSQFQIL----LAFQGTVTDNDTSAAHYL 157
Q Q+ +L+ AY + + ++ ++ +L L + GT T+ + S
Sbjct: 326 -QRQSSYLLLAYRMTIAGCLLSNTDKVNNMDKTEIPVLNDNFLRYHGTNTNQECSDLGIN 384
Query: 158 SSLCPPPASTTIELT-----------------AALLAK------DRLNAFGLMEPYIEGQ 194
P T ++L A L K +++ L E +I
Sbjct: 385 EVYSPRDYRTILKLKTHCEKIEPNINLIRAIEAIFLTKCFTFVLSKMDVVCLKETFISLA 444
Query: 195 ----------------------DGRRSV---RAYG--IYLKASFFNHDCLPNACRFDYVD 227
D + V R G IY S NH C PN R Y
Sbjct: 445 VAMLHHLQAINCNAYEIVENLYDKKAHVWEPRYVGGAIYPSVSLVNHSCYPNVVRHSYPS 504
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
++VR + + +G EI Y P ++ R++ L Y F C C+ C
Sbjct: 505 GV------VVVRSLRFIGKGTEIVDCYGPHWLSEGRLPRREYLWKKYRFLCACEAC--TQ 556
Query: 286 NW 287
NW
Sbjct: 557 NW 558
>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
Length = 382
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 90/195 (46%), Gaps = 18/195 (9%)
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
R++ A + Y +NP F L+A GT ++ L ++ +LT +
Sbjct: 200 REMTANVIRGEYIDEFLNPEGFSSLMALVGTNGQGIGTSPLALW------VNSVSQLTMS 253
Query: 175 LLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAE 231
+ +L+ F + Y+E + G+ + G+Y S NH C PNA F Y
Sbjct: 254 DDERQQLDVFIDKIYQYVEEESGQFLNTEGSGLYQLQSACNHSCAPNAESSFPY------ 307
Query: 232 NNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
N I ++ I ++ G EI +SY + +RQ+ L +Y F C C+RC ++N
Sbjct: 308 GNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHSRQRELSENYLFVCCCERCMSQSNEP 367
Query: 289 DNDNDDENNEEVMDE 303
D +D+E +++ +D+
Sbjct: 368 DCTSDEEMSDDDLDD 382
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 119/314 (37%), Gaps = 45/314 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+ R L +TQ + Q + P++L+ +L + +S C +C R S C C A+
Sbjct: 12 KHRALHATQPFRPSQTIHTFHPLILHPSLSHL--ASVCTHCLRP---GSPRACSRCHAAY 66
Query: 76 -CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD-RQLQARFLIAAYNLAVVNP 133
C+ C AA ++ HS C+AL + L S A+LP R L LI V
Sbjct: 67 YCNTSCQQAAWTAVHSKE-CKALQQR-KLGSRTGADLPTPVRALLQTLLIKEIEDGVA-- 122
Query: 134 SQFQILLAFQGTVTDNDTSA--------AHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
A G + + A + + + LL K + NAF
Sbjct: 123 -------ALDGHTEERRKTKSWPDLEMMALAACAFAGRQGESNVRKAVELLCKIQTNAFH 175
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFF--NHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
+ + GQ G++L+ + NH C+PNA +V A I+R
Sbjct: 176 RWDVDL-GQ--------VGVFLEPTLAMANHSCVPNAV-VQFVGRKA------ILRAERP 219
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
+ G EI +SY S R + L Y F C C RCK + N N V D
Sbjct: 220 IQAGDEIEISYTDYTMPLSAR-REALEQYSFECTCARCKGDLNVYQVRTACPNTYSVADV 278
Query: 304 DQDEQMVASDDDAE 317
D +D A+
Sbjct: 279 SGDGHPALTDKAAQ 292
>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
Length = 374
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 116/308 (37%), Gaps = 50/308 (16%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHN--CFRKTMHSSSSI-CP- 69
+GR L + ++ G ++L D+P Y+ +P + S+ C N C R+ + ++ C
Sbjct: 43 KGRLLRAAGPIREGTVLLVDTP---YAIVPSVTSNEPLICSNLSCSRRVPQNGRAVRCEH 99
Query: 70 SCS--VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
+CS VA+C+ C A + + C L + L+A +N
Sbjct: 100 ACSKDVAWCNVACR--ATDKARHDFECSWLKKHAEPLRQKEGEYNFATVWHVVRLLATWN 157
Query: 128 L------AVV---NPSQFQILLAFQGT--------------VTDNDTSAAHYLSSLCPPP 164
A+V +P L ++ + YLS P
Sbjct: 158 AELHSSNALVQQRHPWGAHFLRGWKAVDMCCAYLDSWPEVQIIHWKRLVHEYLSDATVLP 217
Query: 165 ASTTIELTAALLAKDRLNAFGLM------EPYIEGQDGRRSVRAYGIYLKASFFNHDCLP 218
+ E LL K+ N FGL +P + R +Y +A+ FNH CLP
Sbjct: 218 PLLSAEQMLLLLCKEETNTFGLYPRATGSQPVNDNAAPRGESYGMALYPRAAQFNHSCLP 277
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKRLLYDYDFAC 276
N V + ++ D+ +G E ++YF + D ++RQ + F C
Sbjct: 278 N------VTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQNHTQEQFQFKC 331
Query: 277 DCDRCKVE 284
C+RC E
Sbjct: 332 TCERCLKE 339
>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Ornithorhynchus anatinus]
Length = 415
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 55/129 (42%), Gaps = 18/129 (13%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
+G G+Y S NH C PN T + +R + ++ +G E+ + Y
Sbjct: 161 NGEMQEVGVGLYPSMSLLNHSCDPNC-------VIVFEGTSLFLRAVREIQKGEELTICY 213
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDE 303
V RQK+L Y FACDC RCK E W + + + EE+ +
Sbjct: 214 LDVLLPSQERQKQLKEQYCFACDCIRCKTQDKDVDMLAGEEPFWKEVKDAVDTVEELQSQ 273
Query: 304 DQDEQMVAS 312
+ EQ++A+
Sbjct: 274 KKWEQVLAT 282
>gi|392565527|gb|EIW58704.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 402
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 66/156 (42%), Gaps = 22/156 (14%)
Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAS---TTIELTAALLAKDRLNAFGLME-PYI 191
+Q+L G+ AA + C P TI +L+ D NAFG+ E P +
Sbjct: 261 YQLLKGLLGS------RAAERSNGACEEPHLADLVTIANVRTILSVDPGNAFGIWEVPLM 314
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
E + + +Y SFFNH C N + A T +V G E+C
Sbjct: 315 EESE----CLGFALYPVPSFFNHHCAANVRKERDGRALRFVTTRAVV-------AGEELC 363
Query: 252 LSYFPVN-YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+SY V D++TRQK LL + F C C RC E
Sbjct: 364 ISYGHVEAMDWATRQKELLEGWYFECRCGRCTAEGE 399
>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 522
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 58/142 (40%), Gaps = 25/142 (17%)
Query: 170 ELTAALLAKDRL--------NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
EL A KD L N FGL DG+ A ++ ASFFNH C P+
Sbjct: 314 ELMAGFPLKDVLKLVSQIESNGFGLW-------DGKGECYARALFPSASFFNHSCDPSCD 366
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R+ ++ + + + G E+ +SY VN TRQ LL Y F C C RC
Sbjct: 367 RY-------QDKFLLSIATRRPIAAGEELSISYIDVNAPCRTRQHELLDSYHFQCSCTRC 419
Query: 282 KVEA---NWSDNDNDDENNEEV 300
E + D E E++
Sbjct: 420 VRELAAHSAGSKSRDKEEREKI 441
>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 493
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 40/84 (47%), Gaps = 7/84 (8%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R G++ A+ NH C+ NA R D ++V D+P G E+ SY P
Sbjct: 218 RLLGLFPLAAMINHSCVGNAVRVLVDDV-------MVVHATTDLPAGTELVWSYGPPTTP 270
Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
++ R RL Y F CDC RC+ E
Sbjct: 271 FAQRNTRLRAHYGFVCDCPRCQRE 294
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 109/300 (36%), Gaps = 51/300 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL ++ L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRASAQLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
VA +CS KC A C+ L PD+ L L R + L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVFKLMEE 115
Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
PS+ + L +F +T++ L I+ + L
Sbjct: 116 TPSESEKLYSFYDLESNINKLTEDKKDGLRQLVMTFQHFMREEIQDASQLPSSFDIFEAF 175
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
AK NAF + ++ G+Y S NH C PN + N +
Sbjct: 176 AKVICNAFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPHL 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + +D DE
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRCETQDKDADMLTGDEQ 281
>gi|58264108|ref|XP_569210.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|338819407|sp|P0CR42.1|SET5_CRYNJ RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|57223860|gb|AAW41903.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 449
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 14/104 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
G+G+GLV+ + LK G+++ ++ P ++ S + S C CF S + C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPLSVPC 162
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
P C+ A FC+ C T + SSSH P +C L +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 111/300 (37%), Gaps = 51/300 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + +L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVVALRPGELLFRSDP-LAYTVCKG-SRGIVCDRCLLGKEKLMR-----CSQCQ 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
VA +CS KC A C+ L PD+ L L R + L
Sbjct: 67 VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VIFKLMEE 115
Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
PS+ + L +F +T++ L+ I+ + L
Sbjct: 116 TPSESEKLYSFYDLESNIKKLTEDKKEGLRQLAMTFQHFMREEIQDVSQLPPSFDVFEAF 175
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
AK N+F + ++ G+Y S NH C PN C + N +
Sbjct: 176 AKVICNSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGPHL 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + +D DE
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQTQDKDADMLTGDEQ 281
>gi|406608054|emb|CCH40488.1| SET and MYND domain-containing protein 5 [Wickerhamomyces ciferrii]
Length = 370
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 56/118 (47%), Gaps = 8/118 (6%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
TIE + ++ N+FG + + + Y ++ +ASFFNH C PN ++ V+
Sbjct: 244 TIETIRLIFGREYGNSFGTWQLVDDKSSENKEFLGYSLFPEASFFNHSCSPNLKKYRKVN 303
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
T D+ +G ++C+ YF ++ RQ+ L ++ F C CDRC E
Sbjct: 304 RMHFQTT-------ADIAKGEQMCIDYFHMLDESLMVRQEVLSKNWFFECACDRCDEE 354
>gi|198467928|ref|XP_001354556.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
gi|198146172|gb|EAL31609.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/305 (21%), Positives = 112/305 (36%), Gaps = 55/305 (18%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHS 63
S+ +A+ E GR LV+++ L+AG++++ + P+ + P ++ + C C++ +
Sbjct: 43 SDAFHIAQSEIYGRYLVASRQLEAGELLITEEPLAIG---PCVSGDAVCLGCYQPVGLGR 99
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--- 120
CP C C C H+ C L P AA L DR R
Sbjct: 100 EQYRCPGCGWPLCGASCPGLGRRIGHTTEECA----LYGERRPAAAELLNDRTSPPRVRD 155
Query: 121 -----FLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSS 159
++ L P Q+ ++ + + N+ +HY L
Sbjct: 156 LYELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDD 215
Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
L + + +L +N F + G++ +A +Y A HDC PN
Sbjct: 216 LAAEEEAQAVHEICGILD---VNCFEI---------GQKGAKARTLYPSAFLLAHDCSPN 263
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FAC 276
D + A I++R V + + LSY Y KR + Y+ F C
Sbjct: 264 TAHTDDPRSYA-----ILLRTSRRVREQEALTLSY---AYTLQGTLKRRSFMYEGKLFWC 315
Query: 277 DCDRC 281
C RC
Sbjct: 316 RCQRC 320
>gi|321248842|ref|XP_003191260.1| hypothetical protein CGB_A2330W [Cryptococcus gattii WM276]
gi|317457727|gb|ADV19473.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 449
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 14/104 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
G+G+GLV+ + LK G+++ ++ P ++ S + S C CF S + C
Sbjct: 103 GKGKGLVAKEGLKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPLSVPC 162
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
P C+ A FC+ C T + SSSH P +C L +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199
>gi|392561695|gb|EIW54876.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 342
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 48/103 (46%), Gaps = 5/103 (4%)
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
L A G+ E RS R G++L A+ NH C PN R + E + + R
Sbjct: 123 LPALGIFETNALPCGKARSGRKAGVFLTAARLNHSCRPNLAR-----SWDEASQQMTFRA 177
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ DV +G E+CL+Y V + R + L Y F C C+ C +
Sbjct: 178 LRDVAEGEELCLNYVDVIGTRAQRTEELQSAYGFECVCEACAL 220
>gi|195169991|ref|XP_002025797.1| GL18252 [Drosophila persimilis]
gi|194110650|gb|EDW32693.1| GL18252 [Drosophila persimilis]
Length = 533
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/305 (21%), Positives = 112/305 (36%), Gaps = 55/305 (18%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHS 63
S+ +A+ E GR LV+++ L+AG++++ + P+ + P ++ + C C++ +
Sbjct: 43 SDAFHIAQSEIYGRYLVASRQLEAGELLITEEPLAIG---PCVSGDAVCLGCYQPVGLGR 99
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--- 120
CP C C C H+ C L P AA L DR R
Sbjct: 100 EQYRCPGCGWPLCGASCPGLGRRIGHTTEECA----LYGERRPAAAELLNDRTSPPRVRD 155
Query: 121 -----FLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSS 159
++ L P Q+ ++ + + N+ +HY L
Sbjct: 156 LYELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDD 215
Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
L + + +L +N F + G++ +A +Y A HDC PN
Sbjct: 216 LAAEEEAQAVHEICGILD---VNCFEI---------GQKGAKARTLYPSAFLLAHDCSPN 263
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FAC 276
D + A I++R V + + LSY Y KR + Y+ F C
Sbjct: 264 TAHTDDPRSYA-----ILLRTSRRVREQEALTLSY---AYTLQGTLKRRSFMYEGKLFWC 315
Query: 277 DCDRC 281
C RC
Sbjct: 316 RCQRC 320
>gi|146414868|ref|XP_001483404.1| hypothetical protein PGUG_04133 [Meyerozyma guilliermondii ATCC
6260]
Length = 338
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)
Query: 151 TSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK 208
T YL C P TI+ ++ NAFG+ + G+D + +Y
Sbjct: 211 TRMYKYLKLTCGPQLQPFITIDNIRRIIGTSLTNAFGIWSSTVPGED--KEYFGCALYPS 268
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKR 267
ASFFNH C N R + I H V QG E+C+ Y DY TRQK
Sbjct: 269 ASFFNHSCSANVSR-------TRHGRLISFVTSHPVLQGEELCIQYGNHTTEDYHTRQKD 321
Query: 268 LLYDYDFACDCDRCKV 283
L ++ F C C +C++
Sbjct: 322 LK-EWFFECGCKKCEM 336
>gi|398404870|ref|XP_003853901.1| hypothetical protein MYCGRDRAFT_39064 [Zymoseptoria tritici IPO323]
gi|339473784|gb|EGP88877.1| hypothetical protein MYCGRDRAFT_39064 [Zymoseptoria tritici IPO323]
Length = 757
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 72/174 (41%), Gaps = 23/174 (13%)
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
LI+ N + NP+Q + S + + P T ++ AAL +
Sbjct: 436 LISVVNKLLHNPNQAALFSELNDGGGYTPQSKIKLVDGVVP---VDTFQVQAAL----EI 488
Query: 182 NAFGLMEPYIEGQDGRR-------SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
N FG P + G R S A G+++ ASF NHDC+ NA R D
Sbjct: 489 NGFGC--PEMATAIGTRPEEKEEASDTATGVWITASFANHDCIGNAHRSFIGDF------ 540
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
+I+R D+ + EI +Y N D+ QK L +DF C C C E+ S
Sbjct: 541 -MIIRATKDIAKDEEILHTYVEANDDHEQFQKTLQRSWDFKCRCRLCVAESKTS 593
>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
Length = 515
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 120/288 (41%), Gaps = 37/288 (12%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
V + E GR LV+ + +KAG+ +L D P +L P ++S C NC+ ++ +C
Sbjct: 7 VEKSEKLGRYLVAAKDIKAGERILSDQPFVLG---PSSDTSLVCFNCYLPLINKFL-VCK 62
Query: 70 SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
+C+VA C + H+ C R L L + + + + + + + ++ A
Sbjct: 63 NCAVAPLCPGDGCPDEFTKYHNRQECDVF-RNLKL-TKGISPMTMVQNVGSLSVLRALLK 120
Query: 129 AVVNPSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDR 180
N ++++ + + + S ++ SL +L + A
Sbjct: 121 KETNLLEWKLFMELETHLERRRESNVWQYYDNTVKFIQSLGLLENGQNQDLVQKICAAID 180
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
+N+F + P I +R G+YL+A+ +HDC+ N + ++N ++
Sbjct: 181 VNSFEVRGPPIPAIGCAEILR--GVYLQAALLSHDCIANT------HMSIDDNNMLVCHA 232
Query: 241 IHDVPQGREICLSYFPVNYDYS-------TRQKRLLYDYDFACDCDRC 281
D+ +G I NY+Y+ RQ+ L+ F C C RC
Sbjct: 233 SVDIKKGESI-------NYNYTDPLKGTIPRQQHLIVGKYFKCTCTRC 273
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 120/322 (37%), Gaps = 59/322 (18%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+TV+ I +GR L +T+ G +++ P Y +P + C CF T S S
Sbjct: 13 LTVSTISEKGRSLFTTRDFHPGDVIISQEP---YVCVP---TQKRCDGCFSTTNLSKCSR 66
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA--------NLPLDRQLQA 119
C V +C C + CQAL+R L+ N + L L R+LQ
Sbjct: 67 CQV--VWYCGTPCQKSEWKLHRLE--CQALSR-LDSNKRKSVTPSIRLMLKLYLRRKLQD 121
Query: 120 RFLIAA-----YNL--AVVNPSQFQIL------LAFQGTVTDNDTSAAHYLSSLCPPPAS 166
+ +I + Y L A+V F I+ + + ++ N + ++ S
Sbjct: 122 QKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMS 181
Query: 167 TTIELTAALLAK-DRLNAFGLMEPYIEGQ-----------------DGRRSVRAYGIYLK 208
E L A+ L L P I + D G+Y
Sbjct: 182 DIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPV 241
Query: 209 ASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
S NH CLPN+ FD +A+ VR + +P+G E+ +SY TRQK
Sbjct: 242 VSIINHSCLPNSVLVFDGREAS--------VRALQHIPKGTEVLISYIETAGSTVTRQKA 293
Query: 268 LLYDYDFACDCDRCKVEANWSD 289
L Y F C C C + D
Sbjct: 294 LREQYLFQCVCPLCSKVGQYED 315
>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
Length = 648
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 127/330 (38%), Gaps = 66/330 (20%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ + S +V + RGR LV+T+ +K G+ V+ + S L + S CH+C T
Sbjct: 242 LPSASASVELINTPDRGRCLVATEDIKIGETVIVEKAHA--SILQYEFKESHCHHCLHWT 299
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSH------SPYVCQALTRLLNLNSPDAANLPL 113
C CS V FCS +C A S H + ++C+ TR ++N+ L L
Sbjct: 300 --PGPVPCHKCSQVGFCSTQCRDEAWDSYHQFECGLTDFLCRT-TR--DVNTGQHGLLAL 354
Query: 114 DRQLQA-RFLIAAYNLAVVNPSQFQILL---------------AFQGTVTDNDTSA--AH 155
L+A R LI N +P F + + Q + TD A A
Sbjct: 355 RTVLKADRRLIIIANEQEKSPESFASQVFDSANYDTVHRLVDNSSQRSTTDIFRRAVMAV 414
Query: 156 YLSSLCP------PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG----RRSVRAYGI 205
YL+SL P LL NA + I G +S++ I
Sbjct: 415 YLTSLIQIRDGKDRPDEVLATAVLRLLHSYPCNAHEISHMAIPVPSGFCAQSKSLQLQQI 474
Query: 206 ---------YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
+ S NH C PN Y D ++V++IH + +G EI +Y
Sbjct: 475 QSCEIGSAAFPVVSLMNHSCNPNVVHLCYGDV-------MVVKVIHRIARGEEILDNY-- 525
Query: 257 VNYDYSTRQKR-----LLYDYDFACDCDRC 281
Y Y+T +KR L Y F C C C
Sbjct: 526 -GYHYATHEKRERQLKLCQQYYFRCRCQSC 554
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/323 (20%), Positives = 110/323 (34%), Gaps = 54/323 (16%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
GRG GL + L+ G+++ R P L Y+ + C C + C C VA
Sbjct: 13 GRGNGLRAVAPLRPGELLFRSEP-LAYTVCKG-SRGVVCERCLCRK--EKLLRCSQCKVA 68
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
+C +C A C L S + P +L + + +
Sbjct: 69 RYCGSECQKQAWQDHKRECKC--------LKSSEPKFPPDSVRLLGKVVFKLLRESSCAS 120
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN-------AFGL 186
+ + + + L L +T L K+ + AF +
Sbjct: 121 EKLYSFFDLESNIKNLSEEKKEGLRHL---------AMTLQLYLKEEIQNASQLPPAFDI 171
Query: 187 MEPYIEG-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
E + + +G G+Y S NH C PN + +R
Sbjct: 172 FESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNC-------VIVFEGPSLFLR 224
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN-----------WS 288
I ++P G E+ + Y V + RQK+L Y F CDC CK ++ W
Sbjct: 225 AIRNIPLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDADMLAGEEQAWK 284
Query: 289 DNDNDDENNEEVMDEDQDEQMVA 311
+ E++ +++ EQ++A
Sbjct: 285 EIQGSLIKIEDLQSQEKWEQVLA 307
>gi|79507571|ref|NP_196280.2| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
gi|229488103|sp|Q9FG08.2|ATXR4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR4; AltName:
Full=Protein SET DOMAIN GROUP 38; AltName:
Full=Trithorax-related protein 4; Short=TRX-related
protein 4; Flags: Precursor
gi|332003660|gb|AED91043.1| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
Length = 325
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 24/132 (18%)
Query: 165 ASTTIELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKAS 210
A T + A+LA+ R+NAF L +EG+ + +Y+ S
Sbjct: 201 AFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPS 256
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
F+NHDC PNA +A A NT + DV +G E+ + Y + Y RQ L
Sbjct: 257 FYNHDCDPNAHIIWLHNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQ 310
Query: 271 DYDFACDCDRCK 282
+ F C+C RC+
Sbjct: 311 GFGFLCNCLRCQ 322
>gi|449435328|ref|XP_004135447.1| PREDICTED: uncharacterized protein LOC101202892 [Cucumis sativus]
gi|449522881|ref|XP_004168454.1| PREDICTED: uncharacterized protein LOC101228219 [Cucumis sativus]
Length = 540
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 8/98 (8%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R G+++ SF NH C+PNA R D I+V DV G EI +YF
Sbjct: 316 RDYYGVGLWVLPSFINHSCIPNARRLHIGD-------HILVHASRDVKAGEEITFAYFDP 368
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
+ R KR+ + F C+C RC+ E S+ + E
Sbjct: 369 LSSWKDR-KRMSETWGFNCNCKRCRFEEEISNKEEMKE 405
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 106/296 (35%), Gaps = 43/296 (14%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYS-ALPFINSSSFCHNCFRK 59
M+ + E V + G+GRGL T++ + G+++L YS L C +CF
Sbjct: 1 MTGIMEGVERFDSPGKGRGLRVTRAFQVGELLLSSQA---YSYVLSVKERGEHCESCF-- 55
Query: 60 TMHSSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
T S + C C AF C KC + C A+ P +L
Sbjct: 56 TRKKSLARCGKCKKAFYCDVKCQKGDWAMHRLE--CSAMNAFGEKWCPSEIT-----RLV 108
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDN-----------DTSAAH-YLSSLCPPPAS 166
AR L S+ +LL + T++ D + H + S P
Sbjct: 109 ARILTKKKMQKDRCASEKLLLLGEMQSHTEDMDNERRETMEADVAGLHQFFSKHLEIPGH 168
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
+ L ++ N F + +D S +Y + NH C P+
Sbjct: 169 KDL---LTLFSQVACNGFTI-------EDEELSHLGTAVYPDVALINHSCRPSV------ 212
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ T VR + D+ G E+ +SY V Y R RL Y F C C+ C+
Sbjct: 213 -IVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEECE 267
>gi|159485954|ref|XP_001701009.1| hypothetical protein CHLREDRAFT_142739 [Chlamydomonas reinhardtii]
gi|158281508|gb|EDP07263.1| predicted protein, partial [Chlamydomonas reinhardtii]
Length = 504
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ AS+FNH C+ NA R+ D + VR D+P G E+ ++Y +
Sbjct: 415 ATGLWGLASYFNHACVANAHRYFLGDF-------LFVRASRDLPAGAEVTITYLNPTLTW 467
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
R +LL FAC C+ C E W D+
Sbjct: 468 EQRSAKLLR-RGFACGCELCMDETEWRRGHPDE 499
>gi|10178118|dbj|BAB11411.1| unnamed protein product [Arabidopsis thaliana]
Length = 258
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 24/129 (18%)
Query: 168 TIELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
T + A+LA+ R+NAF L +EG+ + +Y+ SF+N
Sbjct: 137 TKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPSFYN 192
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
HDC PNA +A A NT + DV +G E+ + Y + Y RQ L +
Sbjct: 193 HDCDPNAHIIWLHNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQGFG 246
Query: 274 FACDCDRCK 282
F C+C RC+
Sbjct: 247 FLCNCLRCQ 255
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/311 (24%), Positives = 114/311 (36%), Gaps = 56/311 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRALAQLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
VA +CS KC A C+ L PD+ L L R + L
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVFKLMEE 115
Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
PS+ + L +F +T+ L I+ + L
Sbjct: 116 TPSESEKLYSFYDLESNINKLTEEKKEGLRQLVMTFQHFMREEIQDASQLPLPFDIFEAF 175
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
AK NAF + ++ G+Y S NH C PN C + N +
Sbjct: 176 AKVICNAFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGPHL 221
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-E 295
++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADMLT 277
Query: 296 NNEEVMDEDQD 306
+E+V E Q+
Sbjct: 278 GDEQVWKEVQE 288
>gi|401623295|gb|EJS41399.1| set6p [Saccharomyces arboricola H-6]
Length = 373
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P TI L +L + NAFG+ +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLRKMLTIPLLRHILGTEYGNAFGIWQ--- 280
Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN + ++ ++ D+ +G +I
Sbjct: 281 EGEAAESREYFGYWVFPEASYFNHSCNPNVTK-------HRKGNSMLFKVNTDIKKGEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + ++ L D + F C C+RC +E
Sbjct: 334 CIDYSGVLSLPTLERRNFLADSWFFDCACERCVLE 368
>gi|66363372|ref|XP_628652.1| SET domain protein with MYND insert (Skm/BOP family)
[Cryptosporidium parvum Iowa II]
gi|46229836|gb|EAK90654.1| SET domain protein with MYND insert (Skm/BOP family)
[Cryptosporidium parvum Iowa II]
Length = 587
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/346 (21%), Positives = 130/346 (37%), Gaps = 74/346 (21%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCF-- 57
M + E + + E E +GR +++ + ++ G+ +L + P Y + F N C CF
Sbjct: 14 MWNLPEKIEIKEDERKGRSIIAKEEIQIGESILMEEP---YCRILFSDNIEEICDTCFNY 70
Query: 58 -RKTMHSSSSI--CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
R S I C C + FCS KC + + H + C L + + +
Sbjct: 71 LRSEGSYSECILECQECKKIKFCSKKCMEESKTIHH--FECGILKLDILQMISNKVGVSF 128
Query: 114 DR-QLQARFLIAAY---------------NLAVVNPSQFQILL----AFQGTVTDNDTSA 153
DR +L RF+I + + N +Q L+ F ++ D
Sbjct: 129 DRSRLLTRFVIKLILELNDKNKSQREQNNSYLISNLNQINSLVDNQEKFLQSIKDVYQEL 188
Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRL----------NAFGL----MEPYIEGQDGRR- 198
A L L P I+ + + D+L N+FG+ ++ + G
Sbjct: 189 A--LEILKIPKLKAEIDKLESDIITDKLLVKISCIIDSNSFGIPKFPLKCNVNGDSVSEL 246
Query: 199 ---------------------SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
S+ +G++ +S FNH C PN C F V+ N + +
Sbjct: 247 VNPRQKAPNSSLELSNSLLNPSILGWGLFSYSSLFNHSCDPN-CDFIGVNPIP-NQSSVT 304
Query: 238 VRMI--HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +I + + EI ++Y + R K LL F C C+RC
Sbjct: 305 INLIANRKIQKDEEITINYVEIYDTRRNRIKNLLKTKHFICHCERC 350
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 112/302 (37%), Gaps = 55/302 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L + + C +C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDHCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
VA +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 VAKYCSAKCQKKAWQDHKQE--CKCLKSCKPRYPPDSVRL-LGRVVIKLMEKTPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDIFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + +D D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGD 279
Query: 295 EN 296
E
Sbjct: 280 EQ 281
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 110/274 (40%), Gaps = 44/274 (16%)
Query: 26 LKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAAA 85
+K G ++ P L + + C CF+ S C V +C C +
Sbjct: 3 IKKGDCLISSKPFAY--VLSSKHKDNHCDYCFKSGKLLKCSGCQY--VYYCDRSCQKESW 58
Query: 86 SSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN--------LAVVNPSQFQ 137
S S C L R+ PDAA +L AR ++ A +F+
Sbjct: 59 SVHKSE--CINLKRIAPRTIPDAA------RLMARIIVKLQKGGGDEKDYYAKNAYRKFK 110
Query: 138 ILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTAALLA---KDRLNAFGLMEPY 190
L++ + ++ H++S LC TT+ +A +L + +N++ +++P
Sbjct: 111 DLMSHYTDIKNDPKRIEHFVS-LCQVLEDFMEGTTLPNSAEILGLYGRICVNSYNILDPD 169
Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--GR 248
+ GIYL S +H C PNA A T I++R + D+P+
Sbjct: 170 MNSI-------GVGIYLGPSVIDHSCKPNA-------VAVFEGTTILIRALEDIPRLDWS 215
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+I +SY V STR L Y F C+C+RCK
Sbjct: 216 QIHISYIDVLNTTSTRCTELQNTYYFLCECERCK 249
>gi|340055687|emb|CCC50008.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 374
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 112/296 (37%), Gaps = 65/296 (21%)
Query: 17 GRGLVSTQSLKAGQIVLRDS-PILLYSALPFINSS--SFCHNCFRKTMHSSSSICPSCSV 73
GRGLV+ +++ G++V+R + P L Y +N ++C + + ++ P C
Sbjct: 113 GRGLVAKENIMQGKVVVRRADPWLRYP----VNDRMCAYCAGALPERFFTCAN--PKCHE 166
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
+CS C T A S HS R + A L + Q++A I N A
Sbjct: 167 EYCSRDCRTMALSLYHS--------RTCSCEGLQALELDMYSQMKASESITEKNAAAAQL 218
Query: 134 SQFQIL-LAFQGTVTDNDTSAAHYLSS----------------------LCPPPASTTIE 170
++L ++ Q + S LS C S T E
Sbjct: 219 LMLRVLSVSLQQQIVPTALSEMRILSGRLLFHPKTLAGCFLEIYERFTRACGTATSITYE 278
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
+LA+ N F E +E + L S NH C N A
Sbjct: 279 EMVGILARVTANCFH-RENCVE------------LCLPRSMLNHSCEANV---------A 316
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
E+ ++ DVP+GRE+ ++Y+P + DY R + L F C C RCK E
Sbjct: 317 EDVQTGDLKTTQDVPRGRELTINYYPQLKHLDYLERSQE-LKRRGFTCLCHRCKRE 371
>gi|320166460|gb|EFW43359.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 443
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 133/373 (35%), Gaps = 73/373 (19%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSC-SVA 74
G GLV+T+ ++ G++V+R+ I L S P N S C R C C V
Sbjct: 103 GYGLVATRPIRRGEVVVREC-IFLESQPPLEGNDSDPCALTVR---------CEKCVQVY 152
Query: 75 FCSPKCS-TAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
FCS C + A HS C AL L + P+ L RQ I A L ++
Sbjct: 153 FCSTACRDESKARDLHSEIECNALAFLGTDAIHPNDVISDLLRQAIRILSIRAKRLQLLP 212
Query: 133 -PSQFQIL---------LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAK 178
P + Q+ LA+ T+ D++ ++ + + + +
Sbjct: 213 LPGEPQLRNSYENIAKNLAYSPTLADDNDHTLKFVVKFANELVPAAVRIPGPEFLNIFHR 272
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA-SFFNHDCLPNACRFDYV--DAAAENNTD 235
+ NAF ++ P GR YG Y NH C P R + +
Sbjct: 273 HQCNAFSMLGP------GRMDT-MYGAYTACWHLMNHSCAPVLMREHHAFHRPLVDGLPH 325
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
R + DV +G EI Y + + R++ L Y F C+C RC
Sbjct: 326 FEARAVIDVAEGTEITWCYSNIRNPKAERREHLRQFYGFLCECPRCT------------- 372
Query: 296 NNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPS 355
DE+ V FP R +C R NC G + P PS
Sbjct: 373 --------GPDERRVLE----------GFPE---HRRMLCRRHNC-GFVMPGDPSAEDDG 410
Query: 356 TVMECNVCGNLKN 368
+ CN CG ++
Sbjct: 411 ILGYCNQCGLVRR 423
>gi|242019692|ref|XP_002430293.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212515408|gb|EEB17555.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 501
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/289 (21%), Positives = 109/289 (37%), Gaps = 21/289 (7%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M +++ + + + GR V+ Q + G +++++SP ++ P +S C C RK
Sbjct: 1 MLNLNKNFDILKNDKFGRYAVALQDYEPGDLIIQESP---FTYGPKSDSPPVCLGCHRKV 57
Query: 61 MHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQAL-TRLLNLNSPDAANLPLDRQLQA 119
+C +C C KC S + + ++ R + +P +
Sbjct: 58 --DGKILCSTCGWPVCDEKCEKEPIHSENECIIFSSVKVRFTKVKNPQGNCPQYECITPL 115
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELTA 173
R L+ + SQ L + D + +L +C + +L
Sbjct: 116 RVLLGKEKFPEIWNSQISKLESHNEERQKTDIWKVESINTVGFLHKVCKLQERFSSDLIH 175
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
NAF P G D R +Y K+S F H C+PN + + ++
Sbjct: 176 FACGILETNAFS--SPSSYGYDMRY------LYPKSSLFLHSCIPNVGHYIFKNSNDGER 227
Query: 234 TDII-VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+++I VR + +G+E LSY + R+ L F C C RC
Sbjct: 228 SNVIHVRASLKIKKGQEFTLSYTQTLWPTLLRRSHLKEGKFFDCCCQRC 276
>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 404
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 10/108 (9%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
LL + NAFGL +G + +Y S FNH C+PN VD +
Sbjct: 240 LLCIQQCNAFGL-------SNGEGEMTGVALYPALSLFNHSCMPNCA---AVDDGTGSKR 289
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
++ + VP G E+ +SY ++ RQ +L Y F C C RC+
Sbjct: 290 VCAIKTLVAVPPGEELTISYIDLDLTRELRQDKLEESYAFRCTCARCR 337
>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 635
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 130/335 (38%), Gaps = 73/335 (21%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKT 60
A S+ ++++ E GR LV+T+ K G +V ++P ++Y+ + + CH+C ++
Sbjct: 239 AASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYY----THCHHCLSRS 294
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ C C VA +CS KC A +H C + + NL + D + + ++
Sbjct: 295 YNLIP--CSHCPVAQYCSEKCRKLAWEMAHQ-IECPIMALVGNLLNVDKDKIRMLTKI-I 350
Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
RFLI LA NP +G + +A L++ L
Sbjct: 351 RFLIVITSKGKKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPL 410
Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
A I AA+L + N F + +D G RA
Sbjct: 411 IGISAFACISALAAILLATQTNFFCNKYEVNQLKDISNYPKIIFSGSLMFRACVIMSSNC 470
Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
G+Y+ S +NH C PN R F+ +I R + + G +
Sbjct: 471 FSVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHFE--------ELTMITRALRPIYPGDQ 522
Query: 250 ICLSYFPVNYDYST---RQKRLLYDYDFACDCDRC 281
I +Y Y Y T R+K+++ DY F CDC C
Sbjct: 523 IFTNY-GAAYAYMTKSERRKKIIQDYFFECDCIAC 556
>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
Length = 425
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 104/283 (36%), Gaps = 58/283 (20%)
Query: 26 LKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAFCSPKCSTAA 84
+K G +L P + L ++ C NCF+ C SC V +C+ C +
Sbjct: 1 VKKGTTILTGRPFVF--VLRSKYRTTRCDNCFKSEQLVKLLKCSSCQYVYYCNRNCQKQS 58
Query: 85 ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI-----AAYNLAVVNPSQFQIL 139
VC L ++L + PD+ +L AR +I A + ++
Sbjct: 59 WRIHKRECVC--LKKILPVILPDSV------RLVARIIIKLKQGGADEMDYYTKENYRKF 110
Query: 140 LAFQGTVTDNDTSAAH-------------YLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
+D A H +L P + I + + + +RLN
Sbjct: 111 TDLMSHYSDMRADAKHMEPFIKLYRFLAEFLDETLMPSIAELISIYGKM-STNRLNI--- 166
Query: 187 MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
QD +V +GIYL AS H C PNA A E T II++
Sbjct: 167 -------QDEWLNVIGFGIYLGASVVGHSCKPNAV------ATFEGTTIIIIK------- 206
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
+SY + R++ L Y F CDC+RC+ E + ++
Sbjct: 207 -----ISYVDLIKSKKDRREELYSLYHFWCDCERCEKEESLTE 244
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 111/302 (36%), Gaps = 55/302 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L + + C C K M C C
Sbjct: 14 RGNGLRALAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMQETPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+ L FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLALTFQHFMREEIQDASQL------PPSFDIFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+++R + DV G E+ + Y + R+K+L Y F CDC RC+ + +D D
Sbjct: 220 HLLLRTVRDVEAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRCQTQDKDADMLTGD 279
Query: 295 EN 296
E
Sbjct: 280 EQ 281
>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
morsitans]
Length = 395
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH CLPNA F Y +N ++++ + + G EIC+SY +
Sbjct: 288 GLYLLQSKINHSCLPNAQVTFPY------SNDIVVLKALQPIQVGEEICISYLDEGQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L +Y F C+C +C+ EAN
Sbjct: 342 SRHSRQKILKENYIFVCECFKCQREAN 368
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 127/312 (40%), Gaps = 26/312 (8%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSC-SVA 74
RGR + ST ++ G +++ P+ + IN +C+ CFRK C C SV
Sbjct: 20 RGRYIKSTATIPKGTVIITSQPLGTVALPQTINE--YCNYCFRKQTRPPLQRCSRCKSVY 77
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ---ARFLIAAYNLAVV 131
FC C + A S H +VC R+ + ++ + +L + ++ +R+ A
Sbjct: 78 FCDMGCFSNAWLSYHQ-FVCDP-ARVRHEDAENELDLEMLEKVALNVSRYRKRAKTETEA 135
Query: 132 NPSQFQILL-AFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLAKDRL----NAF 184
++ + AF + + A L+S A + + +D L N F
Sbjct: 136 EGETVEVTMEAFFSLMGHDALQAGRVLASHRRLAREALKRAHVQQTSIDEDELVHYLNVF 195
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
++ Q+ G Y AS FNH C PNA + A AE +R I +
Sbjct: 196 KSNNFTLDDQE--MFAVGEGTYPVASLFNHTCRPNAV-IVFDGALAE------IRAIDTI 246
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN--DDENNEEVMD 302
EI +SY + + R++ L Y F C C RC + + DD EE D
Sbjct: 247 EPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTRQEGPKAYLSLVDDLLGEEEDD 306
Query: 303 EDQDEQMVASDD 314
D+ +Q++ + D
Sbjct: 307 WDRAQQLLGNQD 318
>gi|326510051|dbj|BAJ87242.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 293
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 117/295 (39%), Gaps = 39/295 (13%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
V VA + GRG+ +T+ + G+++ P++ + + + C+ C ++
Sbjct: 13 VRVALTDSAGRGVFATRPIATGEVLHSAQPLVTHPSRSLFHE--VCYRCLKRKAGKGDDS 70
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQARFL---I 123
C FCS C A H + L + S P + ++L + +
Sbjct: 71 RGDCY--FCSDAC-WEHAEGFHGIEKKADWSLLDDHCSSRGLKYPYMVKRLACMVISGDV 127
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-------- 175
A L ++ P+Q Q T+ + + S+ A ELT L
Sbjct: 128 GADCLDILQPAQLH-----QDTIIEMEEEFELLKSTF--RKAGFQEELTTFLTKQWYINV 180
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYV 226
LA+ R+NAF + +D S A +Y+ SF+NHDC PN
Sbjct: 181 LARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWLE 240
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+A A+ NT + D+ +G E+ + Y + + + RQK L + F C C RC
Sbjct: 241 NADAKLNT------LRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRC 289
>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 422
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 47/107 (43%), Gaps = 14/107 (13%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
LL N FGL + Y E IYL+AS FNH C PNA R +
Sbjct: 232 LLTMSTRNCFGLWKNYEE-------CFGLAIYLEASLFNHSCYPNAAR-----VQRGRSI 279
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
DII I D+ EIC+SY + R+ L +Y F C C RC
Sbjct: 280 DIIA--IRDIEPNEEICISYLNITNGSHERKDHLKNNYLFDCVCIRC 324
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 115/290 (39%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
ETV V + +G+GRGL + + AG ++ + Y+A+ F + + CH CF++ H+
Sbjct: 2 ETVEVFDTDGKGRGLKAAKEFWAGDVIFAERA---YAAVVFDSLTHLVCHTCFKR--HAK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ + + +P+ N+ L ++ R
Sbjct: 57 LHRCGQCKFAHYCDRTCQKDAWVNHKNE--CAAIKK--HGKAPN-ENIRLAARIMWRIER 111
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
L V+ Q + G + +L ++ + +L
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLESFLEFWPHDGKQFAMQYISHILGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 172 INCNGFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHT 225
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + G E+ +SY R+++L Y F C C+ C+
Sbjct: 226 QMRIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 275
>gi|270011320|gb|EFA07768.1| hypothetical protein TcasGA2_TC005322 [Tribolium castaneum]
Length = 1086
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/352 (20%), Positives = 132/352 (37%), Gaps = 61/352 (17%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ + S + + E R + + + ++ G+I+ + P + A + CH C++
Sbjct: 719 IPSASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFTLAAADLYH---CHECYQ-- 773
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ + C CS +C +C A H Y C L L N+ A L ++
Sbjct: 774 LCYNPIPCEICSQTLYCGEECRDKAREKYHQ-YECPILISLKNIVGKHKAFL---LAIKM 829
Query: 120 RFLIA----------------------AYNLAVVNPSQFQILLAFQGTVTDNDTSA---- 153
F+I+ + A++ + ++ + G ++D A
Sbjct: 830 SFMISDENDVPEVYALVENLSRDNNDEVFTTALITALMYHLVKTYTGKFPEDDLEAENKF 889
Query: 154 AHYLSS---LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
H+L + +C A+ EL +++ + EP E + +Y +
Sbjct: 890 KHFLMTHLRICLTHAAGIDELYPNQVSEGQ-------EPGQELLSFKSETVGCALYPFYA 942
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST--RQKRL 268
F H C PN A + T ++R + + +G+E +SY P ++S R+ RL
Sbjct: 943 LFRHACCPNV-------FAHHHGTQRVLRAVRTIHEGQECFVSYGPYYVEHSKQERKSRL 995
Query: 269 LYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
L Y F C C C E +W D N + QD VA +A +
Sbjct: 996 LSQYHFTCKCRAC--EEDWPQLD----LNPYYLTSKQDLDFVAKIRNANIEA 1041
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 57/300 (19%), Positives = 123/300 (41%), Gaps = 31/300 (10%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHS 63
+ +V + E + +V+T+ ++ GQ++ ++P + +AL IN F CH+C+ +
Sbjct: 202 TNSVEIKVDEKLRKRVVATKDIQIGQVIAVETPCV--AAL--INVVLFHCHDCY--ILCY 255
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPY--VCQALTRLLNLNSPDAANLPLDRQLQAR 120
+ C +C+ V +CS C A + H + ++ +L+ +++ L + +Q +
Sbjct: 256 NPIPCKTCTEVVYCSEACRENAFAKYHQKECPIYLSMRKLVGIDTHFQWALKMTLLVQTQ 315
Query: 121 FLIAAYNLAVVNPSQFQILLA--FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
+ + + L+A F + + T L+ + + + +
Sbjct: 316 ADKCCDRIDTDDDRTRKTLMATLFYHLIKNCTTV----LAGVDEAGVKNFKRVLMSYMHI 371
Query: 179 DRLNAFGLMEPYIEGQ----DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+ + E ++ G + ++ A +Y + H C PN +
Sbjct: 372 CDYHVSDIDEIFVHGGSRDLELKQETFAKAMYPFSDKLRHSCCPNV-------MGWYHGV 424
Query: 235 DIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
++R I + +G E SY P+ N + RQ + + Y+F C C CK NW ++
Sbjct: 425 TRVLRAIRTIKKGEECFFSYGPLYTNIEKEERQNYIFFIYNFKCACRACK--QNWPQTES 482
>gi|451845079|gb|EMD58393.1| hypothetical protein COCSADRAFT_350740 [Cochliobolus sativus
ND90Pr]
Length = 739
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 107/287 (37%), Gaps = 80/287 (27%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
VTV + +G+GRGL +T S+KAG +VL
Sbjct: 355 VTVQKAKGKGRGLFTTASMKAGDLVL---------------------------------- 380
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
C AFC + S++ +T L+N+ + D A + QA +
Sbjct: 381 ---CEKAFCHAHVDDSGESNA-------KMTLLMNVET-DTAFMGG----QAHLIQLITQ 425
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFG- 185
NPS + A + D +S P T +E T AL N FG
Sbjct: 426 KLYKNPS----MAAAFTELHHGDYKTVDKMSVDGQPVVDTFLVERTMAL------NVFGC 475
Query: 186 ----------LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+M + +DG + + GI++KAS+ NH CL N R D
Sbjct: 476 PVTSLKSHNDVMSNKVSKRDG--TFHSCGIWIKASYINHSCLGNVRRSFIGDM------- 526
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+IVR D+ G E+ Y DY+ + ++ ++ F C C C+
Sbjct: 527 MIVRACKDLEAGTELAFPYVAPEGDYTPKVEQKFKNWGFICRCALCE 573
>gi|320591240|gb|EFX03679.1| oxysterol-binding protein [Grosmannia clavigera kw1407]
Length = 850
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 44/92 (47%), Gaps = 21/92 (22%)
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI----HDVPQGRE 249
QDGR IY KA+ NHDC PN NTD+ R + D+ G E
Sbjct: 203 QDGR-----VAIYPKAALVNHDCRPNVF-----------NTDVAGRRVIMATRDIAVGEE 246
Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +Y P+ D +TRQ+RL+ Y F C C C
Sbjct: 247 LLTTYVPLLADTTTRQRRLI-QYGFHCSCVAC 277
>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
Length = 338
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 9/86 (10%)
Query: 203 YGIYLKASFFNHDCLPNACR--FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
YGI+ K S FNH C PN+ R +D A +V ++ DV +G EI +SY
Sbjct: 118 YGIFPKVSRFNHGCRPNSMRSYHPVLDQA-------VVHVVKDVSEGEEITVSYVEPGLA 170
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+ RQ++L + F C C+ C + N
Sbjct: 171 FYLRQEQLKEKFGFICGCNLCLMPEN 196
>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
Length = 391
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + QG EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALGPIQQGEEICISYLDECMLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368
>gi|157169545|ref|XP_001657892.1| hypothetical protein AaeL_AAEL001050 [Aedes aegypti]
gi|108883672|gb|EAT47897.1| AAEL001050-PA [Aedes aegypti]
Length = 662
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 119/316 (37%), Gaps = 66/316 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSC-SV 73
RGR +V+ + LK+G+ VL + Y A + + C+ CF + + +S C C V
Sbjct: 272 RGRYIVAAEDLKSGETVLVEPA---YGACLYPKYFGTHCNQCFTRLV--ASVPCEQCCGV 326
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLN----------------LNSPDAA-----NLP 112
AFCSP+C A + H + CQ L ++ SP+ A NL
Sbjct: 327 AFCSPECRDRACAGYHR-FECQYLDLMIGSGMSILCHLAFRLVTQAGSPEKAAEITKNLA 385
Query: 113 LDRQLQAR--------FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
L + R L+AA+ L + S+F T AA P P
Sbjct: 386 LCSHAEKRSHEDYFRRTLMAAFLLRCLQKSEF---------FGRRKTEAAE------PTP 430
Query: 165 ASTTIE-LTAALLAKDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHDCLPN 219
+ + LL + NA + E I G S + G+Y + NHDC
Sbjct: 431 IEAKVGGVILELLQALQFNAHEIYEVKIAGDHRVDSAKVQYIGVGVYKTGAMLNHDCHSG 490
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQ--GREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
R +V + +T+ + VP+ G PV RQK L Y F CD
Sbjct: 491 VSR-TFVKSTMILHTNRPLTKGSLVPENYGMHFLRQPLPV------RQKVLRSRYWFKCD 543
Query: 278 CDRCKVEANWSDNDND 293
C C + D ND
Sbjct: 544 CKACFEDWPVFDKMND 559
>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
Length = 393
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + QG EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQQGEEICISYLDECMLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/287 (20%), Positives = 111/287 (38%), Gaps = 23/287 (8%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
E V + + G+GRGL +T+ AG ++ + + +++ +L + CH+CFR+
Sbjct: 2 ENVAIFDSPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSL----AERICHSCFRR--QE 55
Query: 64 SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A +C C A A +A + N N A L + R +
Sbjct: 56 KLQKCSQCKFAHYCDRTCQRAGWAEHKQECGAIKAYGKAPNENIRLVARL-MWRLDKEGS 114
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
+++ L V + + + + + ++L TI+ + +
Sbjct: 115 VVSDMQLITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISHIFGVINC 174
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTD 235
N F + Q G ++V G++ NH+C PN V+ +
Sbjct: 175 NGFT-----VSDQRGLQAV-GVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRR 228
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ ++Y R++ L Y F C CD CK
Sbjct: 229 IELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCK 275
>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
Length = 392
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + QG EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALGPIQQGEEICISYLDECMLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368
>gi|255588343|ref|XP_002534573.1| protein with unknown function [Ricinus communis]
gi|223524997|gb|EEF27811.1| protein with unknown function [Ricinus communis]
Length = 319
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/298 (22%), Positives = 118/298 (39%), Gaps = 48/298 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ V E GRG+ ST+ + G+++ PI+ Y + ++++ C+ C +K + +
Sbjct: 42 IRVGVTESAGRGVFSTRRISGGELIHNAKPIVSYPSRS--STNTVCYFCLKKLASTENR- 98
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIA 124
SVAF CS ++ Y + + PL + A +I+
Sbjct: 99 ----SVAF----CSQECKQNAKVFYDVETKADWSGFDDYCRTQGLKYPLMVKRLACMVIS 150
Query: 125 AYN----LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL----- 175
L ++ P+ + + + L C A+ + A L
Sbjct: 151 GAATVECLDILQPANLSPEMILE-------MEEGYDLLRSCFTKANIADDRLAFLTRQWY 203
Query: 176 ---LAKDRLNAF------GLMEPYIEGQDGRRSVRAY---GIYLKASFFNHDCLPNACRF 223
LA+ R+NAF GL E + A +Y+ SFFNHDC PNA
Sbjct: 204 INQLARIRINAFRIELAVGLYEDLLSSAAACIEAEAAVGNSVYMLPSFFNHDCDPNA-HI 262
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+++ N D ++ + D+ E+ + Y + D+ RQ LL + F C+C RC
Sbjct: 263 IWIE-----NADARLKALRDIDPDEELRICYIDASMDHGARQTILLQGFGFKCNCLRC 315
>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
Length = 359
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 4/88 (4%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
+S R G+Y FNH C PN + Y ++ E + + +R D+ G E+C+SY
Sbjct: 253 QSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---DIEPGEELCISYIDE 308
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ Y RQ+ L Y F C C +C+ EA
Sbjct: 309 SLSYPERQQELYEHYLFVCHCPKCEREA 336
>gi|357120626|ref|XP_003562026.1| PREDICTED: uncharacterized protein LOC100843629 [Brachypodium
distachyon]
Length = 542
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 8/84 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ SF NH C PNACR D A IV ++ G EI YF V
Sbjct: 318 GVGLWVLPSFINHSCHPNACRTHVGDHA-------IVHASREIKAGEEITFPYFDVLVPV 370
Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
RQ+ + F C CDRC+ EA
Sbjct: 371 GKRQE-AARAWGFECRCDRCRFEA 393
>gi|390601664|gb|EIN11058.1| hypothetical protein PUNSTDRAFT_101047 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 241
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 13/115 (11%)
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
A+ ++D N+FG+ + D + +GI+ ASFFNH C PN D + A E
Sbjct: 130 AIASRDVHNSFGI---WSHPGDSTAEMLGFGIWPDASFFNHSCAPNMGTKDRLGRAWE-- 184
Query: 234 TDIIVRMIHDVPQGREICLSYFPV----NYDYSTRQKRLLYDYDFACDCDRCKVE 284
R + + E+ +SY + D R++RL + F C C RC++E
Sbjct: 185 ----FRSSKQIQKDEELFISYLSLEELEEMDAFARRERLQNVWGFECSCHRCRIE 235
>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
Length = 554
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 118/287 (41%), Gaps = 41/287 (14%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM-HSSSSICPSCSVA 74
+ R + + Q+ AG + P ++ + + + C +C + T C SC +A
Sbjct: 26 KKRSVHARQAFAAGDTLFETRPP--FAVVGNSDILTRCFHCLKPTSPERELKRCSSCQLA 83
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPL-DRQLQARFLIAAYNLAVV 131
FC P C A + Y C A R ++P + A LPL + +L AR + A +
Sbjct: 84 RFCGPACVKAGWTPEGHKYECAAFKRCGFKDTPASLAKLPLTEARLLARMIWAQKH---- 139
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT----------AALL--AKD 179
+PSQ + L DN A L + AS + + AALL AK
Sbjct: 140 DPSQHRYDLHGDVDCFDNKEKA---LWARRIQEASVLVGQSLFNEVLGNAKAALLQMAKV 196
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPN--ACRFDYVDAAAENNTD 235
++NA L P+ E A G+ L +SF NH C PN A + D E
Sbjct: 197 QMNAHTLCTPFGE---------ACGVSLSSSFALINHSCAPNTFAMSSHWPD---EKPKY 244
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ V + G EI ++Y V + R++ + Y F CDC C+
Sbjct: 245 LRVAACRPIKAGDEITIAYVDVEEENLQRRQTIKATYGFDCDCRLCE 291
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 138/371 (37%), Gaps = 84/371 (22%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ +A I G+GRGLV+T+ G+++L P Y++ P I S C +CF S+
Sbjct: 17 LAIASIPGKGRGLVTTRRFFPGEVILCGEP---YASTPNKILVGSNCDHCF---TSSNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A++ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSVCQKEEWKLHQ--LECRAISALTDDRKKMLTPTIRLMVRLILRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTD-NDTSAAHY-----LSSLCPPPASTTIELT 172
+I + + ++ A + +++ +D Y L L P ++ T
Sbjct: 129 NEKVIPS-----TGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLILPAIELDLKET 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
A + +K NA + +P + R V G++ S NH C+PNA E
Sbjct: 184 AHIFSKFSCNAHTICDPEL------RPV-GTGLFPAISTINHSCVPNAVLL------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
T VR + + E+ +SY R L + Y F C C RC
Sbjct: 231 RTA-YVRALQPLSNNTEVSISYIETAATTLKRHNDLKH-YFFTCTCPRC----------- 277
Query: 293 DDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDA 352
+ S++D + G Y C NC G L LP S
Sbjct: 278 ----------------IKGSEEDPLLEG------------YRCKDQNCDGFL--LPDSGK 307
Query: 353 TPSTVMECNVC 363
T +C++C
Sbjct: 308 KAYTCQKCSMC 318
>gi|91089699|ref|XP_974834.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
gi|270011321|gb|EFA07769.1| hypothetical protein TcasGA2_TC005323 [Tribolium castaneum]
Length = 543
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 120/323 (37%), Gaps = 69/323 (21%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ + S + V E E GR +V+T+ +K G+++ + P++ L S+ CH C +
Sbjct: 180 IQSASACIAVREEENWGRFVVATRDIKVGEVLAVEKPLV---TLVVNELSNHCHECV--S 234
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ + C +C+ A +CS C A H Y C L L L L LQ
Sbjct: 235 LCYNLIPCKTCTQAMYCSESCRDYAFDMYHK-YECSILATLRFLQFDKLKLFALKISLQY 293
Query: 120 RFLIAAYNLA------------VVNPSQFQILLAFQ-----GTVTDNDTSAAHYLSSL-- 160
L +L V N ++ + F+ + D + ++ S+
Sbjct: 294 NDLGETSDLLYRSDRYREIHNLVTNTTKRSVPDLFERATAAALIYDLVKTHTNFFSAFNQ 353
Query: 161 -------------CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL 207
P +EL N+ G+ EP A G +
Sbjct: 354 NNFKEILLLHMQTGPSNFHEIVELVP--------NSRGIYEP---------EEIASGAFA 396
Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY---DYSTR 264
S NH C PN RF Y + +++R I ++ +G E C + ++ D S R
Sbjct: 397 FLSLLNHSCCPNVARFSY-------GSTLVLRAIQNIQEG-EQCFDNYGYHFALMDKSER 448
Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
+K L Y F C C C E NW
Sbjct: 449 KKHLQSQYYFNCVCQAC--EKNW 469
>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
Length = 430
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/311 (24%), Positives = 127/311 (40%), Gaps = 75/311 (24%)
Query: 15 GRGRGLVSTQSLKA-GQIVLRDSPILLYSALPFI----NSSSFCHNCFRKTMHSSSSICP 69
GR +V++ S + G+++L + P FI + C+ CF+ + S C
Sbjct: 23 GRYYEVVASSSTTSFGKLILHEKPFC------FIIDDRERTHCCNYCFKANIKLSR--CK 74
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
+C V S CS S H C+AL P N R L+ + L+
Sbjct: 75 ACKV---SHYCSMNCYHSDHHLKECKALA-----THPKVTN-------SIRLLLKCF-LS 118
Query: 130 VVNPSQFQIL---------------------LAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
+ QF+++ + F V D D S++ +L +
Sbjct: 119 CSDEKQFEMIQNLAPSSSSDEKVEDNLIKLAILFADYVKDVDLSSS-FLDN-----RKED 172
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVD 227
++ LL K + N F + + GIYLKAS FNH C+PN A FD
Sbjct: 173 LDFIYLLLLKLQRNTFSICNEEMNAIGS-------GIYLKASMFNHSCVPNCAILFD--- 222
Query: 228 AAAENNTDIIVRMIHD---VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
++ ++ VR+++ + +G + ++Y + + RQK+L Y F C C RC +
Sbjct: 223 ----SDKNLYVRILNPSSLLEEGTPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC-LN 277
Query: 285 ANWSDNDNDDE 295
+N N+N D+
Sbjct: 278 SNEEVNENVDK 288
>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
Length = 600
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 101/270 (37%), Gaps = 25/270 (9%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GRG+ + + + G+++ R+ +L+ + C C+R+ S C+
Sbjct: 37 GRGIFAVRDIAQGELLFRERTLLVGPTAHRGRNLRTCTQCYRQPAGEDSDAAALCTAGCG 96
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P C AS H+ CQ L R P + R L+ +++ +N +Q
Sbjct: 97 LPVCGGCVASPRHA-VECQ-LFRKWQPKDPSRID---PRALR---ILSVVRCFFLNETQR 148
Query: 137 QILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
++L A Q AA P +E + NAF E +
Sbjct: 149 KLLYAMQANADRYYMREVERAAECFEHF--PREQDMLEYFYRTVCAFNTNAF---ESRCQ 203
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
DGR V A ++ A NH C PNA E+ I+V +P G EI +
Sbjct: 204 -VDGREVV-ARALFPLAGMLNHQCTPNAAHH------FEDGETIVVTATERIPLGAEITM 255
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
SY + + R+ L F C C RC+
Sbjct: 256 SYAKLLWSTLARKMFLGMTKHFMCQCPRCQ 285
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 380 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWSNLT 433
Query: 264 RQKRLLYDYDFACDCDRC 281
R L F C+CDRC
Sbjct: 434 RGIFLKMTKYFVCNCDRC 451
>gi|448122372|ref|XP_004204432.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
gi|358349971|emb|CCE73250.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
Length = 485
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/326 (19%), Positives = 118/326 (36%), Gaps = 60/326 (18%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI------NSSSFCHNCFRK 59
+ + V + +G+GL + +++K G+ + + P+ AL + + S+C +
Sbjct: 108 DKIKVLQTSKKGKGLYAKKNVKKGEFLWEEKPLFYVPALANMTLIKNGKACSYCGKLLTQ 167
Query: 60 TMHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
SS + C CS +CS +C + AL + N+ +P +N
Sbjct: 168 GTRGSSGVSYLRGLDCDVCSEIWCSKECKNTDSK-------LHALLKH-NIYNPSKSNTR 219
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT---- 168
+ A + Y L + + I + V+D S++ T
Sbjct: 220 KNINANAYLALQDYCLKEQWNALYAITRIYADIVSDKSGVKQKQFSAMARISQETRYKAL 279
Query: 169 ----------------IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG-------- 204
++ L K+ + F + P I DG+ + +
Sbjct: 280 DSSAGSFDNFQGGALFVQEQQESLWKEGFSLFSNVFP-ISVDDGKINYEEFMYMLGTYNI 338
Query: 205 ------IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
I+L S NH+C P+ V ++ + V D+ G E+ SY +
Sbjct: 339 NNLDSIIFLLQSHLNHNCEPSTT----VKLSSNRTEGLKVYAARDIKAGEELTTSYVNPS 394
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVE 284
+ RQ+ L ++ F C C++CK E
Sbjct: 395 HTVQQRQRELRVNWGFRCACEKCKNE 420
>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
Length = 532
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/326 (22%), Positives = 125/326 (38%), Gaps = 49/326 (15%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------ 67
E G G+ +++ AG+ V L+ S L C NC+ T SS+
Sbjct: 29 ESNGNGVFASRRFGAGEEVASFKRPLVGS-LETERLLDTCANCYVWTEGSSTGTRLYVPE 87
Query: 68 ------CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
C +C+ +CS C AA + H + C+ L + P A L
Sbjct: 88 GVKVDKCAACARFRYCSKACQKAAWNRGHK-HECKVLKPMAGRGLPKAF-------LACI 139
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELT 172
L+ +++ ++++ V D N A PP +
Sbjct: 140 ELLTRRKHGLISDQDWEMVCRLPSHVDDFKRNGTYGNIEMMAMGAPQFALPPNMFDRDFI 199
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAA 230
AA+ A+ NA ++ P ++ GI L + NH C PNA F +D +
Sbjct: 200 AAMYARVMSNALTIITPTLD---------PLGIILDPTLCSLNHSCDPNA--FIMMDGPS 248
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-D 289
+ +R + + + +EI +SY Y Y RQ+ L + F C C +C+ +A D
Sbjct: 249 -----VSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKCQEKATLQED 303
Query: 290 NDNDDENNEEVMDEDQDEQMVASDDD 315
N +++ V+D + + M + +
Sbjct: 304 NWLVPADSKFVLDPEAKQAMAQTQEQ 329
>gi|6325092|ref|NP_015160.1| Set6p [Saccharomyces cerevisiae S288c]
gi|74583857|sp|Q12529.1|SET6_YEAST RecName: Full=Potential protein lysine methyltransferase SET6;
AltName: Full=SET domain-containing protein 6
gi|1370351|emb|CAA97872.1| unnamed protein product [Saccharomyces cerevisiae]
gi|1403544|emb|CAA65556.1| P2545 protein [Saccharomyces cerevisiae]
gi|285815377|tpg|DAA11269.1| TPA: Set6p [Saccharomyces cerevisiae S288c]
Length = 373
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ M D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTMNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
Length = 545
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 10/107 (9%)
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
L+ L+E + G+ ++ YG+ L ASF NH C+PNA R D +IV
Sbjct: 307 LDVNSLVEDAVSGKVMGKNKEYYGVGLWTLASFINHSCIPNARRLHVGDY-------VIV 359
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
D+ G EI +YF V R K + + F C C RCK E+
Sbjct: 360 HASRDIKTGEEISFAYFDVLSPLEKR-KEMAESWGFCCGCSRCKFES 405
>gi|299739226|ref|XP_001835144.2| hypothetical protein CC1G_07286 [Coprinopsis cinerea okayama7#130]
gi|298403677|gb|EAU86628.2| hypothetical protein CC1G_07286 [Coprinopsis cinerea okayama7#130]
Length = 620
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
A+ A+D N FG+ E E +D + +G+Y+ ++FNHDC PN + +
Sbjct: 509 AITARDHSNVFGIWEQAEEEEDADCEMLGWGMYISGAYFNHDCSPNLKK-------RRSG 561
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRCKVE 284
+ I DV G E+C +Y + + S R + L ++ F C C RC E
Sbjct: 562 RSMQFFTIRDVQAGEELCTNYIDIGEECSVEMRNEELEREWFFRCACGRCVRE 614
>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
Length = 393
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N I+++ + + QG EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIIVLKALAPIQQGDEICISYLDECMLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368
>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
Length = 969
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 10/107 (9%)
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
L+ L+E + G+ ++ YG+ L ASF NH C+PNA R D +IV
Sbjct: 731 LDVNSLVEDAVSGKVMGKNKEYYGVGLWTLASFINHSCIPNARRLHVGDY-------VIV 783
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
D+ G EI +YF V R K + + F C C RCK E+
Sbjct: 784 HASRDIKTGEEISFAYFDVLSPLEKR-KEMAESWGFCCGCSRCKFES 829
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 111/276 (40%), Gaps = 32/276 (11%)
Query: 17 GRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--CPSC-S 72
GR + +T+ K G ++ R D+P ++ P + +C++C +K + + C C +
Sbjct: 19 GRSIYATRRFKPGDVIARFDNPAVVLP--PGHRALEYCNHCVKKQRPAGVKLRACTGCKT 76
Query: 73 VAFCSPKCSTAAASSSHSPYVCQALTRLLN---LNSPDAANLPLDRQLQARFLIAAYNLA 129
VA+C P C A S H C+A+ RL + PD P+ + A+ ++ LA
Sbjct: 77 VAYCGPACQRANWSLVHK-LECKAIQRLHEAKPAHQPDWVPTPI--RAAAQVMLRPQVLA 133
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS-TTIELTAALLAKDRLNAFGLME 188
+ + + TD + + L + +E + +L K + NAF E
Sbjct: 134 RFEELEGHVEQWRKKDGTDLQLQSHGVVKCLGLDTVTFERLETSFQVLCKLQTNAFSRTE 193
Query: 189 PYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
Y E G++L + NH C+PNA V T +R +
Sbjct: 194 EYYE---------TGGVFLDTTLAMINHSCVPNAL----VQFGGRTAT---LRATSFLDP 237
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
G EI +SY R L Y F C C +C+
Sbjct: 238 GDEIEISYIDQTQPRGKRHGELDL-YHFECSCYKCQ 272
>gi|440471791|gb|ELQ40736.1| ankyrin repeat protein [Magnaporthe oryzae Y34]
gi|440479502|gb|ELQ60265.1| ankyrin repeat protein [Magnaporthe oryzae P131]
Length = 2018
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 197 RRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
RR V + +++ S NH+C PN A FD + + + I D+ G E+ +SY
Sbjct: 1675 RRGVDFHSTFVEVSRMNHECSPNCAAYFDPMTMSQR------IYAIRDIMPGEELTVSYI 1728
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
RQ RL D+ F C C RC EA++
Sbjct: 1729 DPVQTREARQNRLRKDWGFGCSCQRCTSEAHFG 1761
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 40/87 (45%), Gaps = 7/87 (8%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
DG G+Y S NHDC PN C + N T++ +R + D+ G E+ +SY
Sbjct: 250 DGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------NGTELQLRAVQDINPGEELRISY 302
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
RQK+L Y F C C RC
Sbjct: 303 IETLSLTEDRQKQLEEQYHFTCHCQRC 329
>gi|194889892|ref|XP_001977181.1| GG18383 [Drosophila erecta]
gi|190648830|gb|EDV46108.1| GG18383 [Drosophila erecta]
Length = 402
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 111/282 (39%), Gaps = 37/282 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+++ L+AG+ ++R+ P+ + P ++ C C++ ++ S CP C+
Sbjct: 57 GRYLVASRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKSDQYRCPGCAWPL 113
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
C C+ H+ CQ L A L +R A ++
Sbjct: 114 CGSACAGLKQRHGHTETECQ----LYGERRAVAGELLTERAGPAEVRDLYELIMIVRILL 169
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
L +P QF ++ + + +A HY + T +L L A+
Sbjct: 170 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-HRLRVTWQLE-DLEAEQVHEVC 227
Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
G+++ + GQ+G +A +Y A HDC PN D + +I++R
Sbjct: 228 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 279
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
V + + LSY Y KR + ++ F C C RC
Sbjct: 280 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 318
>gi|91089701|ref|XP_974846.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 561
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/352 (20%), Positives = 131/352 (37%), Gaps = 61/352 (17%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+ + S + + E R + + + ++ G+I+ + P + A + CH C++
Sbjct: 194 IPSASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFTLAAADLYH---CHECYQ-- 248
Query: 61 MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ + C CS +C +C A H Y C L L N+ A L ++
Sbjct: 249 LCYNPIPCEICSQTLYCGEECRDKAREKYHQ-YECPILISLKNIVGKHKAFL---LAIKM 304
Query: 120 RFLIA----------------------AYNLAVVNPSQFQILLAFQGTVTDNDTSA---- 153
F+I+ + A++ + ++ + G ++D A
Sbjct: 305 SFMISDENDVPEVYALVENLSRDNNDEVFTTALITALMYHLVKTYTGKFPEDDLEAENKF 364
Query: 154 AHYLSS---LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
H+L + +C A+ EL +++ + EP E + +Y +
Sbjct: 365 KHFLMTHLRICLTHAAGIDELYPNQVSEGQ-------EPGQELLSFKSETVGCALYPFYA 417
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRL 268
F H C PN A + T ++R + + +G+E +SY P V + R+ RL
Sbjct: 418 LFRHACCPNV-------FAHHHGTQRVLRAVRTIHEGQECFVSYGPYYVEHSKQERKSRL 470
Query: 269 LYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
L Y F C C C E +W D N + QD VA +A +
Sbjct: 471 LSQYHFTCKCRAC--EEDWPQLD----LNPYYLTSKQDLDFVAKIRNANIEA 516
>gi|303282313|ref|XP_003060448.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457919|gb|EEH55217.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 516
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R+ ++ G+Y SF NH C PNAC+ + R D+ G E+C+ YF V
Sbjct: 195 RNEKSVGVYPLMSFANHSCAPNACKLLI-------GHTMFTRAARDLVAGEEVCVKYFDV 247
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
S R + + F C C RC +EA
Sbjct: 248 TAPKSERNA-VAKRWGFECACARCGMEA 274
>gi|366998786|ref|XP_003684129.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
gi|357522425|emb|CCE61695.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
Length = 523
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/310 (20%), Positives = 123/310 (39%), Gaps = 48/310 (15%)
Query: 11 AEIEGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI 67
++ +GRGL + ++ K +++ ++ PI++ L I + C C R S
Sbjct: 136 KDVLSKGRGLFAKKNFKKDELIFKEEPIVIVPPIERLNLIEAGKACALCGRLMNEISEQF 195
Query: 68 -------CPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
C SC + +CS +C + + + ++ T+L +NS N ++ +
Sbjct: 196 IIKNGLDCESCINTVWCSKQCLKRDFTHKYLKHPFKSSTKL-KVNSKKWKNF--EKCCKE 252
Query: 120 RFLIAAYNLAVV-------NPSQFQILLAFQ--GTVTDNDTSAAHYLSSLCPPPASTTIE 170
I Y + + + ++F I F ++ D + L++ T
Sbjct: 253 SVFIEGYIIGHIYAASMIDHKNEFDIKKKFDYLAYISQKDRYQGNNLTNFEKSLEDTNDS 312
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQD----------GR-----RSVRAYGIYLKASFFNHD 215
++ ++ +F + D GR +S + Y IY S NH+
Sbjct: 313 ISDEHPSEIWEKSFNYFKEAFPESDIDYEEYLNYIGRYNLNHKSAQLYSIY---SLVNHN 369
Query: 216 CLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
C PN DY + ++ + + + E+ +Y +D R+K LL +Y F
Sbjct: 370 CEPNVRVEVDY------HTKELKLYARKSISKNSELLTTYINPLHDVELRRKILLINYGF 423
Query: 275 ACDCDRCKVE 284
AC+C+RC+ E
Sbjct: 424 ACNCERCENE 433
>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
Length = 664
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 8/82 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYS 262
G++ NH C+PNA + D + VR D+ G E+ LSY P Y+
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDV-------VFVRAAKDLKAGDEVLLSYLEPPLAPYA 499
Query: 263 TRQKRLLYDYDFACDCDRCKVE 284
+++++ Y+F C C+RCK+E
Sbjct: 500 GYREKMIQQYNFECSCERCKLE 521
>gi|255713868|ref|XP_002553216.1| KLTH0D11638p [Lachancea thermotolerans]
gi|238934596|emb|CAR22778.1| KLTH0D11638p [Lachancea thermotolerans CBS 6340]
Length = 353
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 19/154 (12%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYI 191
S+F +LL Q V L L PP + ++ +L + N+FG+ +
Sbjct: 212 SRFPVLLDLQKRVFQT-------LYILLPPFLKSGFDIPTFRHILGSEYGNSFGIWQNE- 263
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
E D R + Y + +ASFFNH C PN + + +I + DV + +E+C
Sbjct: 264 EASDSREYL-GYWVLPEASFFNHSCAPNLIKHRF-------GNKMIFTLNSDVTKDQELC 315
Query: 252 LSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ Y + + R+ L ++ F+C+C RCK+E
Sbjct: 316 IDYKDILDLKVDERRHILKENWFFSCECPRCKIE 349
>gi|67524199|ref|XP_660161.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
gi|40745506|gb|EAA64662.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
gi|259487990|tpe|CBF87094.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 638
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 60/122 (49%), Gaps = 13/122 (10%)
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQD---GRRSVRAYGI--YLKASFFNHDCLPNACRFD 224
E +L+ K+ N+FGL D G++ YG+ Y +A+ NH C+PN
Sbjct: 519 EELLSLICKEESNSFGLYHGATGPPDSLQGQQRGPQYGLACYPRATLCNHSCVPN----- 573
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + + +++ D+ G E C+SYF V+ D + R+KR + F+C C+RC
Sbjct: 574 -LKHGPDEQSRMVLTATRDIAAGEECCISYFDLTVHVDLNARRKRTRELFTFSCTCERCL 632
Query: 283 VE 284
E
Sbjct: 633 RE 634
>gi|367032576|ref|XP_003665571.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
42464]
gi|347012842|gb|AEO60326.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
42464]
Length = 607
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 8/80 (10%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
++++A++FNH C+PN R D I+VR + D+ G E+ SY V DY R
Sbjct: 460 LWIRAAYFNHSCVPNTEREFIGDL-------IVVRALRDIAAGEELVQSY-DVTGDYEGR 511
Query: 265 QKRLLYDYDFACDCDRCKVE 284
++ L+ + F C+C C+ E
Sbjct: 512 REALMTTWGFECNCALCEAE 531
>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 976
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 39/82 (47%), Gaps = 8/82 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ ASF NH C+PNA R D +IV D+ G EI +YF V
Sbjct: 763 GLWTLASFINHSCIPNARRLHVGDY-------VIVHASRDIKAGEEITFAYFDVLSPLEK 815
Query: 264 RQKRLLYDYDFACDCDRCKVEA 285
R K + + F C C RCK E+
Sbjct: 816 R-KEMAESWGFRCGCSRCKFES 836
>gi|151942634|gb|EDN60980.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 373
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 23/156 (14%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKVE 284
C+ Y V D T ++R L ++ F C C+RCK E
Sbjct: 334 CIDYSGV-LDLPTVKRRAFLADNWFFVCACERCKSE 368
>gi|320168918|gb|EFW45817.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 644
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G + A+FFNH C PN F + + T V ++ DV G E+ LSY P Y
Sbjct: 333 GFFPLAAFFNHGCRPNCSGFVEI---TDQGTFFSVVLLEDVYLGDELVLSYLPPELLYKP 389
Query: 264 RQKR---LLYDYDFACDCDRC 281
R++R LL ++ F C C RC
Sbjct: 390 REQRRAHLLQNFHFECGCSRC 410
>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
Length = 662
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 8/82 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYS 262
G++ NH C+PNA + D + VR D+ G E+ LSY P Y+
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDV-------VFVRAAKDLKAGDEVLLSYLEPPLAPYA 497
Query: 263 TRQKRLLYDYDFACDCDRCKVE 284
+++++ Y+F C C+RCK+E
Sbjct: 498 GYREKMIQQYNFECSCERCKLE 519
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L + + C C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVVKLMDGKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+ + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
B]
Length = 410
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 13/116 (11%)
Query: 168 TIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
TIE L D N+FG+ E P +E + + +Y + SFFNHDC PNA +
Sbjct: 291 TIENVRTALGVDPGNSFGIWEVPIMEESE----CLGFAVYPRLSFFNHDCAPNARK---- 342
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRC 281
+ + +V +G E+C+SY V+ + R+K L + F C C RC
Sbjct: 343 ---ERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRKELSDGWFFDCACGRC 395
>gi|350424241|ref|XP_003493732.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 498
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 111/282 (39%), Gaps = 55/282 (19%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSICP---S 70
GRGL +T+ +K +++ DSP+L+ Y F C +C++ + ++ P
Sbjct: 42 GRGLFATRDIKQNELIFIDSPLLVGPKYIGKYF----EMCVSCYK----NECALFPCDRG 93
Query: 71 CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-AANLPLDRQLQARFLIAAYNLA 129
C + CS +C S H Y C+ L L+ D + NL L +
Sbjct: 94 CGLPVCSTQCEN---SPKHVNYECEYLKSLVPTCGTDWSPNLLLA--------VVPIRAL 142
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP----PPASTTIELTAALLAKDRLNAFG 185
+ Q + L Q +D + + + L P+ +EL + N+F
Sbjct: 143 FLTEQQRKCLATLQ---SDKNLTCYPEIEQLKKNVTNSPSEEDMELMKHMCRILNTNSF- 198
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFD-----YVDAAAENNTDIIVR 239
I D SV G+Y ASF NH C+PN FD YV AA
Sbjct: 199 ---ETIMVHDKEHSVSLRGLYSIASFQNHCCVPNTRHHFDGEFRMYVSAAL--------- 246
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ G EI +Y + +D + R+K L F+C C RC
Sbjct: 247 ---PIAAGEEITSTYTSLFWDTTLRRKFLSITKHFSCMCKRC 285
>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
Length = 400
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 111/282 (39%), Gaps = 42/282 (14%)
Query: 15 GRGRGL----VSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS 70
G+GRG+ + ++ G +++++ P Y+ + CH C ++ +S S C +
Sbjct: 13 GQGRGIRCNKKGSSGIEPGTLIVKEEP-YSYTLTDGELLRTRCHYCLKRLENSVS--CDA 69
Query: 71 CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY--- 126
C A +C+ +C AA H+P C+ +RL+NL P ++ R L +
Sbjct: 70 CRTAKYCNEECKKAA-KFHHTPE-CRGYSRLMNL--------PEHLRVMGRILYKMHARK 119
Query: 127 -NLAVVNP-----SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
++ + P S + L + +T D S LS A + K
Sbjct: 120 TDMGALGPLSSLVSNVETLKNCEEGITSLD-SKMECLSQHMEKDALPDRAFMEEIYGKIA 178
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N+F ++ D G+Y +AS NH C N Y I +R
Sbjct: 179 SNSFAIL-------DENMCSIGIGVYPQASMINHSCKSNCIGMFY-------GPQIQIRA 224
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ G +I Y P + RQ++LL Y F C C C+
Sbjct: 225 NEFIRPGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADCR 266
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 117/291 (40%), Gaps = 46/291 (15%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCFRKTMHSS 64
++V+ + +GR L +T+ ++ G+++L P Y +P N++S C CF+ ++
Sbjct: 13 LSVSNLPHKGRSLFTTRDIRPGEVILSQKP---YICVP--NNTSLESRCDGCFKT---NN 64
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQ 116
C C V +C C + + C+ALTRL +P L + R
Sbjct: 65 LKKCSGCQVVWYCGSFCQKSEWKLHR--HECKALTRLEKEKRKFVTPTIRLMVKLYIKRN 122
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY------LSSLCPPPASTTIE 170
LQ + L + + + ++ A +++ D L +L S +
Sbjct: 123 LQNEKV-----LPITSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLR 177
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
A +K NA + + + Q G++ S NH C PNA + + A
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQ-------GIGLFPLVSIINHSCSPNAV-LVFEEQMA 229
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+VR + ++ + EI +SY TRQK L Y F C C RC
Sbjct: 230 ------VVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARC 274
>gi|429859827|gb|ELA34590.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
Nara gc5]
Length = 648
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
R GI+L+AS NH C NA D + V + D+ G EI +SY +
Sbjct: 424 GAREGGIFLEASRINHSCRHNAQNTWNAD-----RGQLTVHALRDIDDGEEITISYLGAS 478
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKV 283
+YS+RQ RL + F C C+ C +
Sbjct: 479 ENYSSRQSRLKESFGFTCACELCTL 503
>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
Length = 188
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + +G EIC+SY +
Sbjct: 83 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGEEICISYLDECQLER 136
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 137 SRHSRHKVLRENYIFVCQCLKCQAQAS 163
>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 278
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 7/109 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+Y K S FNHDC PN + + +T + VR D+P G E+ +SY V +
Sbjct: 69 AVYPKVSRFNHDCRPNV----HYRLSGLIHTTVAVR---DIPAGTELTISYIYVRKSRAE 121
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVAS 312
RQK L ++F C C++C A + + ++ED + +++ S
Sbjct: 122 RQKELSESWNFTCTCEQCSKSAEEIAASDARMRRIKALEEDIERKVIES 170
>gi|189240077|ref|XP_971167.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 637
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 124/320 (38%), Gaps = 65/320 (20%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCF 57
S S +T+ GR +VS + G+I++ + +A+ +N F C +CF
Sbjct: 225 FSHASSKITLKTSPDVGRYVVSNVDIATGEILVAEP-----AAVACLNPEKFGTHCQHCF 279
Query: 58 RKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDAANLP 112
+ + CP CS VAFCSPKC A ++ H Y C+ LL ++ S A +
Sbjct: 280 ARLLAPVG--CPHCSSVAFCSPKCRDDAITTYHK-YECKFFDLLLGSGMSVLSLMALRII 336
Query: 113 LDRQLQARFLI-------AAYNLAVVNPSQFQ--------ILLAF-------QGTVTDND 150
+ L I A YNL N S+ Q ++ AF G +N
Sbjct: 337 TQQSLTQTLEIYDKKNTNALYNLC-TNESKRQNSDFLQRSLMAAFLLRCLQKSGYFGENG 395
Query: 151 TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA-------Y 203
T P T ++ LL ++ F E Y S+++
Sbjct: 396 TVV----------PTQTEHKVGEMLLHYLQILQFNAHEIYETLYSEDHSLKSAKMINIGV 445
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDY 261
+Y + FNH+C P+ R+ +V I++ I + I +Y P+
Sbjct: 446 AVYPTVALFNHECYPSVTRY-FVGKT------IVIASIRPLTPNTPISENYGPIFTRIKL 498
Query: 262 STRQKRLLYDYDFACDCDRC 281
+ RQ+ LL Y F C C C
Sbjct: 499 AERQRTLLGRYWFNCQCQAC 518
>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 643
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 12/103 (11%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFG + ++ G S + G++++A++ NH C+PNA R D +++R +
Sbjct: 479 NAFGADDSHLGGV----SKPSTGLWVRAAYINHSCVPNAKREFVGDL-------MVIRSL 527
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ +G EI SY + DY RQ L+ + F C C C E
Sbjct: 528 RKIKKGEEIFHSY-DESGDYEARQAALMTTWGFECGCALCAAE 569
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 110/301 (36%), Gaps = 53/301 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVVPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------------PLDRQLQA 119
VA +CS KC A C+ L PD+ L P + +
Sbjct: 67 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRLLGRVVFKLMEEIPSESEKLY 124
Query: 120 RFLIAAYNLAVVNPSQ----FQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
F N+ + + Q+ + FQ + + A+ PP+ E A +
Sbjct: 125 TFYDLESNINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDIFEAFAKV 178
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+ N+F + ++ G+Y S NH C PN + N
Sbjct: 179 IC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPH 220
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + +D DE
Sbjct: 221 LLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCETQDKDADMLTGDE 280
Query: 296 N 296
Sbjct: 281 Q 281
>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
Length = 657
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFG Y G++G R+ + G+++ A++ NH C+ NA + +YV +++R +
Sbjct: 489 NAFGPGGQY--GEEGARNA-STGLWVWAAYINHSCVANAKK-EYVGDL------MVLRAL 538
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ +G EI SY + DY TRQ+ L+ + F C C C E
Sbjct: 539 RPIKEGEEIFHSY-DESADYETRQRALMTTWGFECSCALCTAE 580
>gi|168035501|ref|XP_001770248.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678465|gb|EDQ64923.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 691
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 62/132 (46%), Gaps = 13/132 (9%)
Query: 162 PPPASTTIEL--TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY--GIYLKASFFNHDCL 217
P PA+T A L + NAFG + + ++ R R++ G+++ SF NH C
Sbjct: 379 PVPATTAFPQNEIARLCGIVQCNAFGHTQ--VTRKNARDITRSWSCGLWMLPSFMNHSCT 436
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
P+ V N IIV D+ G E+ ++YF + R+ +L+ ++F C
Sbjct: 437 PS------VATVVIGNAMIIV-AARDLKCGDELTVAYFDIFRPLQERRASMLHSWNFMCS 489
Query: 278 CDRCKVEANWSD 289
C RC +EA D
Sbjct: 490 CPRCALEARMED 501
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+ + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
Length = 386
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 14/105 (13%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y+ S NH C PNA F Y +N ++++ I D+ G EIC SY +
Sbjct: 289 GLYILQSSVNHSCTPNAIAEFPY------SNNTLVLKAIRDIQVGEEICTSYLDECELER 342
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
+RQK L Y F C CD+C+ + ND + ++EE+ D+D
Sbjct: 343 SRYSRQKALSSLYLFVCHCDKCQTQI----NDPNVTSDEELDDDD 383
>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 676
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 84/210 (40%), Gaps = 64/210 (30%)
Query: 89 HSPYVCQALTRLLNLN--------SPDAAN-LPLDRQLQARFLIAAYNLAVVNPSQFQIL 139
H Y+ L RLL N +PD+A +P D +L LI +NL ++ QF
Sbjct: 402 HRTYIATWLLRLLKRNPYFPEWVKTPDSAEAIPSDGELYIGSLIL-HNLMLI---QFN-- 455
Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
AH +S L P S +LAK + +I G
Sbjct: 456 --------------AHEISELAVPKGSN-------ILAKAK-------SKFIGG------ 481
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-- 257
G+Y S FNH C P R+ T ++VR I +P G EI +Y P+
Sbjct: 482 ----GVYSTVSLFNHSCNPGIIRYFI-------GTTMVVRAIRTIPAGEEISENYGPIFT 530
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+ R+++L Y F C+C+ C A+W
Sbjct: 531 TTPEAERKRKLRVQYWFDCNCEACT--AHW 558
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 118/295 (40%), Gaps = 60/295 (20%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-----NSSSFCHNCF 57
+ S ++ V G+GR +V+ +++K G+ + + A F+ ++ CHNC
Sbjct: 49 SASASIEVKHAPGKGRYVVANRNIKKGE------TLFVEDAFTFVLVNNNKDNTHCHNCC 102
Query: 58 RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
+ + FC+ KC A S H R L L A+ + L RQ+
Sbjct: 103 KSYLDVPVPCTECVDTLFCNMKCWDEACLSYH---------RWLCL----ASQMGLLRQI 149
Query: 118 QARFLIAAYNLAVV-----NPSQF---QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
L A L V N +F Q L+ +T D A Y +C A T I
Sbjct: 150 GITHL--ALKLLCVCATTTNNEKFNDVQQLVTNFNKLTPED--AFSYGILVCNGHAITKI 205
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
+TA D N L+E I A IY AS NH C PN ++D
Sbjct: 206 NVTA----DDHENKL-LIEEQIR--------IATAIYPSASMMNHSCDPNIIN-SFLDQT 251
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLYDYDFACDCDRC 281
+IV+ I D+ +G EI L + V++ + RQ+ L Y F C+C C
Sbjct: 252 ------LIVKAIQDIKEGEEI-LHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299
>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
Length = 481
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 50/110 (45%), Gaps = 14/110 (12%)
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
+NAF L P+ + I A+ NH C PNA A + I VR
Sbjct: 177 INAFTLTTPFYDHVGA-------AIEPLAALCNHSCSPNA-------ATDFDKGKIWVRA 222
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+ D+ +G ++ +SY Y+ RQ LL Y F C C++C++E N D
Sbjct: 223 LRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKCEIEKNAPDT 272
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 118/310 (38%), Gaps = 54/310 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
RG GL + L+ G+++ R P L Y+ + C C C C VA
Sbjct: 15 RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCL--LGKEKLXRCSQCRVAK 70
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLIAA 125
+CS KC A C+ L PD+ L L R + ++ L +
Sbjct: 71 YCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLXDGAPSESEKLYSF 127
Query: 126 YNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
Y+L + +N Q++ FQ + A+ PPA E A ++
Sbjct: 128 YDLESNINKLTEDRKEGLRQLVXTFQHFXREEIQDASQL------PPAFDLFEAFAKVIC 181
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
N+F + E Q+ G+Y S NH C PN + N ++
Sbjct: 182 ----NSFTICNA--EXQEV-----GVGLYPSISLLNHSCDPNC-------SIVFNGPHLL 223
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-EN 296
+R + D+ G E+ + Y R+K+L Y F CDC RC+ + D D D
Sbjct: 224 LRAVRDIEVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRCQTQ----DKDADXLTG 279
Query: 297 NEEVMDEDQD 306
+E+V E Q+
Sbjct: 280 DEQVWKEVQE 289
>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
Length = 497
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/293 (23%), Positives = 111/293 (37%), Gaps = 35/293 (11%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+SE V + GR VS + LKAG ++L + P ++ P +NSS C C S
Sbjct: 1 MSEPYKVLDNPELGRYAVSAKDLKAGDVILEEIP---FAIGPKVNSSPLCLECCCPVDGS 57
Query: 64 SSS-ICPSCSVAFCSPKCSTAAASSSHSPYVC----QALTRLLNLNSPDAANLPLDRQLQ 118
++ C C C +C+ + + C + + N+ A+ + LD
Sbjct: 58 ANGPKCSDCGWPMCE-ECNEDKDNVVYHKKECALFVKGKAKFQNVEDSTASCMQLDCITP 116
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ---------GTVTDNDT-SAAHYLSSLCPPPASTT 168
R L+A + +P ++ L+ GT D + YL C +
Sbjct: 117 LRMLLAKED----HPERWDAELSVMEYHNDARKDGTTWHQDQQNIVGYLRGPCGLKDRFS 172
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
E+ ++ +NAF + + A G+Y K H C+PN
Sbjct: 173 EEVIQQVIGILDINAFEVRT--------SKGYSARGLYPKLGVMAHSCVPNVVH----SI 220
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +I R+ D+P+G ++ Y STRQ L F C C RC
Sbjct: 221 HPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAALKSSKYFTCRCPRC 273
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+ + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+ + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
Length = 383
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 82/366 (22%), Positives = 131/366 (35%), Gaps = 100/366 (27%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFR------------- 58
+G+GL S + K+G I+ + P++ + F ++++ C +C R
Sbjct: 12 KGKGLFSKKCFKSGAIIFEEEPLV---SCQFSWNAAYQYKACDHCLRPLETAEENARRLT 68
Query: 59 -------------KTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTR-----L 100
T S + C C +CS +C T A + H Q R L
Sbjct: 69 GKPELELPFPECCTTDKSKFTSCTLCGTDYCSVECQTTAYNQYHRILCLQTTERNNFHPL 128
Query: 101 LNLNS--------PDAANLPLDRQLQARFLIAA-YNLAVVNPSQF---------QILLAF 142
LN P+ + L +L AR + + +LA+ QF ++
Sbjct: 129 EQLNEAWKHVHYPPETNTIMLIVRLLARIIQSPNRDLAIEQTLQFCHRTVNEDAELAHKL 188
Query: 143 QGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAKDRLNAFGLMEPYIEG----- 193
G N S H L C P LT A LLA N G+ I
Sbjct: 189 LGEKFANQLSLLHSLLLQCVPHEGIEQFLTPAGFQGLLALIGTNGQGVGTSTISQWVTRT 248
Query: 194 -----QDGRRSV----------------------RAYGIYLKASFFNHDCLPNACRFDYV 226
D R+V ++ S NH C+PNA
Sbjct: 249 SDLAISDDERAVLDKFIDKLYEDMDSHSGNFLNNEGVALFTLQSACNHSCVPNA-----E 303
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKV 283
NN + + + D+ +G EIC+SY ++ +R+K L+ +Y FAC+C +C+
Sbjct: 304 PTYLHNNNKLSLVAVRDIQEGEEICISYLDECNLHRSRHSRRKELMENYLFACNCPKCEE 363
Query: 284 EANWSD 289
+A+ D
Sbjct: 364 QADEPD 369
>gi|115402861|ref|XP_001217507.1| predicted protein [Aspergillus terreus NIH2624]
gi|114189353|gb|EAU31053.1| predicted protein [Aspergillus terreus NIH2624]
Length = 349
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/311 (22%), Positives = 122/311 (39%), Gaps = 51/311 (16%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSAL-PFINSSSFCHNCFRKTMHSSSSICPSC- 71
+ RGR + S++AG +V D+P L + P S C N + C +C
Sbjct: 32 QHRGRYSRVSASVRAGTVVFADAPYALIPTVDPTSKGSLICSNLMCRRQVKWDLECVTCP 91
Query: 72 -----SVAFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAA 125
V +C+ C + + C L + + L + L R L+AA
Sbjct: 92 NDCIRDVVWCNSACRIQ--DQARHDFECSWLKKHGVTLRQREGEYDFCMLWLVVR-LVAA 148
Query: 126 YNLAV-VNPSQ-------------FQILLAFQ--------GTVTDNDTSAAHYLSSLCPP 163
L + +P+ +Q + F T+ + YL +
Sbjct: 149 RRLEMEYDPTSHERYGWEDRFKRGWQAIEEFSTNRHLWPDATIQHWEYLIQTYLRNFPGF 208
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEG------QDGRRSVRAYGIYLKASFFNHDCL 217
P T + L+ ++ +N+FGL P I G Q R G Y +A+ NH C+
Sbjct: 209 PGPTEL---LGLICREEINSFGLY-PGITGTLPPGQQRKRGQQYGLGCYPRATMLNHSCV 264
Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFA 275
PN ++ A+++ +++ D+ +E +SYF + + D RQ+ + F+
Sbjct: 265 PN------LNRASDDRGRMVITANQDIAADKECTISYFDLVEHADLEDRQRLTHEMFLFS 318
Query: 276 CDCDRCKVEAN 286
C C RC VEA+
Sbjct: 319 CTCQRCLVEAD 329
>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
Length = 359
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 4/94 (4%)
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
E Q +S R G+Y FNH C PN + Y ++ E + + +R ++ G E+C
Sbjct: 247 ELQHLLQSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---NIELGEELC 302
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+SY + Y RQ+ L Y F C C +C+ EA
Sbjct: 303 ISYIDESLSYPERQQELYEHYLFVCQCPKCEREA 336
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 110/289 (38%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA I G+GRGL++T++ G ++L P Y++ P I S C +CF S
Sbjct: 17 LAVASIPGKGRGLIATRTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73
Query: 67 ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
+C +CS C C+A+ ++L L L R+LQ
Sbjct: 74 MCQV--TWYCSTNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLVLKRKLQN 129
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELTA 173
+I + ++ + ++ A + + D + L SL P ++ A
Sbjct: 130 EKVIPSSSI-----DNYNLVDALESHIWKVDENHLVLYAQMANLVSLILPLIELDLKEIA 184
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAEN 232
+K NA + +P + G+Y S NH C+PNA FD A
Sbjct: 185 HTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA---- 233
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
VR + + + E+ +SY R L Y F C C RC
Sbjct: 234 ----YVRALQPIGKNEEVSISYIETAAVTKKRHND-LKQYFFTCSCPRC 277
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 110/281 (39%), Gaps = 26/281 (9%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E V + + EG+GRGL +T+ AG ++ + YSA+ F N S CH+CF++
Sbjct: 2 ENVEIFDSEGKGRGLRATRESWAGDVIFAEPA---YSAVVFDNLSHCVCHSCFKR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C + + H + + N N AA + + + L
Sbjct: 57 LLRCGQCKFAHYCDRTCQKESWVNHKHECLAIKKAGKAPNENIRLAARILWRIEREGGGL 116
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
L ++ Q + G + ++ ++ + + + N
Sbjct: 117 TEGC-LVSIDDLQNHLDKFGDGEKSSLMEDVQKFMDFWPSQSQQFGMQYISHIFSVISCN 175
Query: 183 AFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
F L + +R ++A G I+ NHDC PN C + NN I +R
Sbjct: 176 GFTLSD--------QRGLQAVGVAIFPNLCLTNHDCWPN-CTVIF------NNGKIELRA 220
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + +G E+ +SY R ++L Y F C C+ C
Sbjct: 221 LGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHC 261
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 107/287 (37%), Gaps = 49/287 (17%)
Query: 16 RGRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSS 65
RGR L ST+S AG+ I + D+P+L ALP N + C+ C F + +
Sbjct: 22 RGRSLFSTESFGAGETIAIIDNPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVEAGP 78
Query: 66 SICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C +C+ +C+ +C A H C+ R+ D P
Sbjct: 79 RTCKACTGCKAAVYCNAECQRAHWKLVHKAE-CKMFKRIKERTGKDWLPTPA-------- 129
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
N F+ G T A Y L + T++ +L K +
Sbjct: 130 -FGPGGTLESNVEGFKTDEGLWGDFELQATGAVVYAGLL---QSDETLKQAMEVLCKIQT 185
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVR 239
NAF + G+ GIYL S NH C+PNA Y+ +++
Sbjct: 186 NAFN----RFDADTGQA-----GIYLHPSLSMVNHSCVPNA----YITF---EKRKAVLK 229
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
D+ G EI +SY RQ+ L Y F C+C RCK + N
Sbjct: 230 AERDLEPGDEILISYIDHTMPRRARQESLRL-YHFQCNCIRCKDDLN 275
>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 499
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/320 (21%), Positives = 131/320 (40%), Gaps = 56/320 (17%)
Query: 16 RGRGLVSTQSLKAGQIVLRDS-PILLYSA---LPFINSSSFCHNCFRKTMHSSSSI---- 67
+GRGL + + ++ ++S PI++ + L + C C H S
Sbjct: 124 KGRGLYALKDFVKDDLIFKESYPIVVVPSMERLTLMKMGKACSLCGGLLSHLSKHFIVVH 183
Query: 68 ---CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPLDRQLQARFLI 123
C +C +CS +C + ++ ++ +N D+ A ++ +
Sbjct: 184 SLDCDNCGGVWCSNECKNIDIGHNTLKHLHKS-----KINGIDSVAWGKFEKYCNDNVFV 238
Query: 124 AAYNLAVVNP-------------SQFQILLAFQGTVTDNDTSAAHY----------LSSL 160
AAY++ V+ +F +L + + N++ + + +S+
Sbjct: 239 AAYSIGVIMALALIDKKNTDKIRKKFNLLSSVSQGIRINESDSTNIGGTFDASSGAMSNK 298
Query: 161 CPPPA-STTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDC 216
P P T+ EL A+ D L+ ME Y+ GR ++ ++ S NH+C
Sbjct: 299 DPEPIWKTSYELLKNAIPTTDELD----METYL-SYIGRYNINQISDQMFFLPSLINHNC 353
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
PN RF+ V +N +I V ++ G+E+ +Y + R++ L +Y F C
Sbjct: 354 EPNV-RFEVV-----SNKEIRVYARKNISAGQELLTNYINPLHGVKLRRRELRVNYGFLC 407
Query: 277 DCDRCKVEANWSDNDNDDEN 296
CDRC E +ND ++EN
Sbjct: 408 HCDRCIKEIK-RNNDVENEN 426
>gi|448518203|ref|XP_003867935.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis Co 90-125]
gi|380352274|emb|CCG22499.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis]
Length = 372
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 8/117 (6%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T + +L+ K+ NAFG+ +D + + +Y ASFFNH C PN +
Sbjct: 260 TTDTVRSLIGKNLSNAFGIWSN-TTNKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRM-- 316
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
D+ + D+ +G E+C++Y + +++ L ++ F C C +C +E
Sbjct: 317 -----RNDMCFETLRDIAEGEELCINYGNFQNEDVNKRQLELQEWFFDCGCTKCHLE 368
>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
Length = 625
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 8/93 (8%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R+ + G+Y SF NH C PNAC+ + +R D+ G E+ + YF V
Sbjct: 304 RNANSVGVYPFPSFLNHSCAPNACKLMV-------GHTMFIRAARDLVAGEEVFMKYFDV 356
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
R + + F C C RCK+EA D
Sbjct: 357 TMPKPERSA-VAKRWGFECACPRCKLEAVGEDE 388
>gi|317144906|ref|XP_003189640.1| set and mynd domain containing protein [Aspergillus oryzae RIB40]
Length = 372
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/319 (22%), Positives = 120/319 (37%), Gaps = 69/319 (21%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN----------SSSFCHNCFRKTMHSSS 65
+GR L +Q +K G+++L D P Y+ +P ++ S+ CH +T+ S
Sbjct: 44 KGRQLRVSQPVKKGELLLVDLP---YALIPVVDHPEQSDDVRCSNPACHRRVARTVERVS 100
Query: 66 SICP---SCSVAFCSPKCSTA--------------AASSSHSPY------VCQALTRLLN 102
CP S V +C+ C A A+S S + + + R+L
Sbjct: 101 --CPNRCSAEVVWCNSSCRDADNLRHEFECTWLKKYATSIRSKWGEYDFGMLWLIVRILA 158
Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQI----LLAFQGTVTDNDTSAAHYLS 158
+ N P + + + PS F+ + +F G+ S + +
Sbjct: 159 SRHVEFRNTPSSDKTSG-----SKTGSDSKPSSFETGWNAIRSFCGSQDSWTHSQVRHWT 213
Query: 159 SLCPP--------PASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYG 204
L P T L+ ++ N+FGL P E R A G
Sbjct: 214 MLVKKYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAG 273
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+Y A+ NH C PN + ++ + ++ D+ G E C+SYF ++ +
Sbjct: 274 VYPTAALANHSCSPN------IIHKPDDQSRMVFVASKDIATGEECCISYFDLSKKVELK 327
Query: 265 QKR--LLYDYDFACDCDRC 281
+R L + F C CDRC
Sbjct: 328 DRRDHLQGSFRFVCKCDRC 346
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 51/293 (17%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCF--------- 57
+TVA I G+GRGL++T + G ++L P Y++ P I S C +CF
Sbjct: 17 LTVASIPGKGRGLIATFTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73
Query: 58 --RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
R T + SS+ C +C AA + + + RL+ L L R
Sbjct: 74 MCRVTWYCSSN-CQKEEWKLHQLECRAMAALTENRKKMLTPTIRLM-------VRLALKR 125
Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTI 169
+LQ +I + ++ + ++ A + ++ D + L SL P +
Sbjct: 126 KLQNEKVIPSSSI-----DNYNLVDALESHISKVDENQLVLYAQMANLVSLILPFIELDL 180
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDA 228
+ A +K NA + +P + G+Y S NH C+PNA FD A
Sbjct: 181 KEIAQTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA 233
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
VR + + + E+ +SY + ++ L Y F C C RC
Sbjct: 234 --------YVRALQPIDKDEEVSISYIETA-AVTKKRNNDLKQYFFTCTCPRC 277
>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 551
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 52/304 (17%)
Query: 1 MSAVSETVTVAEIEGR---GRGLVSTQSLKAGQIV-LRDSPILLYSALPFINSSSFCHNC 56
M AV++ + V +G GR + +T+ K G ++ L D P ++ P + +C++C
Sbjct: 1 MPAVNDGIEVRGSKGGAKGGRSIHATRRFKPGDVIALFDEPAVVLP--PGHRALEYCNHC 58
Query: 57 FRKTMHSSSSICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNL---NSPDAA 109
++ H+ +C+ VA+C C A + H C+A+ L + + PD
Sbjct: 59 LKRQGHAGVGKLRACTGCKTVAYCGAACQKANWTLIHK-VECKAIQFLHEVKPAHQPDWV 117
Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTS----AAHYLSSLCP 162
P I A ++ P + A +G V + ND AH + C
Sbjct: 118 PTP----------IRAGAQVMLRPKLKESFDALEGHVEEWRKNDEMDLQLQAHGVLRCCG 167
Query: 163 PPASTT--IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLP 218
A T +E + +L K + NAF E Y E G++L + NH C+P
Sbjct: 168 LDAITLKGLEESFQVLCKLQTNAFSRTEEYYE---------TGGVFLDTTLAMINHSCVP 218
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
NA V T +R + G EI +SY RQ L Y F C C
Sbjct: 219 NA----MVQFGGRTAT---LRSATFINPGDEIEISYADHTQPKGKRQGELDL-YHFECSC 270
Query: 279 DRCK 282
+C+
Sbjct: 271 YKCE 274
>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+YL+ S FNH C+P+A ++ + T + VR + V G EI +SY P + R
Sbjct: 192 VYLQGSAFNHSCVPSA---EFCNVG----TSLTVRSLRRVQAGEEITVSYVPTTMNLKER 244
Query: 265 QKRLLYDYDFACDCDRCKVE 284
++ L Y F C C RC E
Sbjct: 245 RRCLEGQYKFVCACARCVAE 264
>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
Length = 392
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGEEICISYLDECQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYIFVCQCLKCQAQAS 368
>gi|224101385|ref|XP_002312257.1| SET domain protein [Populus trichocarpa]
gi|222852077|gb|EEE89624.1| SET domain protein [Populus trichocarpa]
Length = 542
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 56/131 (42%), Gaps = 13/131 (9%)
Query: 159 SLCPPPASTTIELTAALLAKDRLNAF---GLMEPYIEGQDGRRSVRAYGI--YLKASFFN 213
SL P A EL L LN L+E + + R+ YG+ ++ ASF N
Sbjct: 279 SLFRPEAEEIGELNEKLDKVKILNVLDVNSLVEDSVSAKVLGRNSDYYGVGLWVLASFIN 338
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
H C PNA R D ++V DV G EI +YF V S R + + +
Sbjct: 339 HSCNPNARRLHVGD-------HVLVHASRDVKAGEEITFAYFDVLSPLSKRNE-MSKTWG 390
Query: 274 FACDCDRCKVE 284
F C C RCK E
Sbjct: 391 FHCSCKRCKFE 401
>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
Length = 311
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 205 IYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+Y AS FNH C PNA ++ DAA I +R + V G E ++Y N ++
Sbjct: 235 VYFTASLFNHSCAPNAHVSWENGDAA------ITIRTLRPVRAGEEFNITYVDANERSAS 288
Query: 264 RQKRLLYDYDFACDCDRC 281
R+ RL Y F C C+RC
Sbjct: 289 RRARLKEWYGFDCACERC 306
>gi|301089299|ref|XP_002894963.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104481|gb|EEY62533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 457
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYS 262
G++L + FNHDC PN +F DA E VR+ + +G ++ +SY +P
Sbjct: 205 GVFLYNAMFNHDCNPNCVKFTPEDAGPEGGVS-EVRVARPIAKGEQLTISYLYPREQSRG 263
Query: 263 TRQKRLLYDYDFACDCDRCK 282
RQK L + F C C+ CK
Sbjct: 264 NRQKNLSEQFGFECCCELCK 283
>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
Length = 510
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 204 GIYLKASFFNHDCLPNACRF-----------DYVDAAAENNTDIIVRMIHDVPQGREICL 252
+YLK S FNH C PN Y ++N I + + DVPQ E+ +
Sbjct: 363 AVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSELLI 422
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+Y P+N R L ++ F CDC RC
Sbjct: 423 TYIPLNQKGHERYHNLKSNWMFPCDCLRC 451
>gi|451995963|gb|EMD88430.1| hypothetical protein COCHEDRAFT_1205676 [Cochliobolus
heterostrophus C5]
Length = 388
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 84/212 (39%), Gaps = 37/212 (17%)
Query: 84 AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQ 143
A S H V +AL + +PD + F ++ +P Q++ L A
Sbjct: 199 AISKQHKKAVSKALQGRI---TPDVMS----------FCVSGILWRHHHPEQWERLEALA 245
Query: 144 GTVT------DNDTSAAHYLSSLCPPPAS----TTIELTAALLAKDRLNAFGLMEPYIEG 193
T D D YL L P + T E L +D N+FG+ EG
Sbjct: 246 NDNTPYHSSDDLDAFTRAYLHLLVILPVALLPLVTAETLLTLSKRDSHNSFGIRSLEDEG 305
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ YG + AS+FNH C PN ++ E R ++ +G+E+C++
Sbjct: 306 SE----FFGYGCWPAASYFNHSCGPN------IEKKREGRV-WYFRTGREIEKGQELCIT 354
Query: 254 YFPVNYDYSTRQKRLL---YDYDFACDCDRCK 282
Y +R +R+L + F C C+RC+
Sbjct: 355 YLSGEERKLSRGQRMLRLKKTWGFDCGCERCE 386
>gi|366991437|ref|XP_003675484.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
gi|342301349|emb|CCC69117.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
Length = 374
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 82/196 (41%), Gaps = 28/196 (14%)
Query: 92 YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
+VC+ L RL ++ ++ N LQ+ V +F ILL FQ V
Sbjct: 199 FVCETLFRLKYMDQ-NSINFKSFYSLQSN--------EVSKMERFPILLHFQILVFKT-- 247
Query: 152 SAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA 209
L L P +T+ +L + NAFG+ E E D R Y I+ +A
Sbjct: 248 -----LYILLPSFLRDRFSVTSFRHILGSEYGNAFGIWEVG-ESSDSREYF-GYWIFPRA 300
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
S+FNH C PN + + + D+ G E+ + Y V +++ L
Sbjct: 301 SYFNHSCDPNLTK-------TRKGRTMFFTLNRDIDVGSELDIDYSGVLSLPVKERRKFL 353
Query: 270 YD-YDFACDCDRCKVE 284
+D + F C CDRCK+E
Sbjct: 354 HDSWFFDCQCDRCKLE 369
>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 113/284 (39%), Gaps = 33/284 (11%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSI 67
+A EG GRGLV+T+ + ++V D P+L+ P +N+ FC +C ++ +
Sbjct: 64 IATFEGYGRGLVATRDIAVNELVFVDRPVLVG---PRVNNYEVIFCASCC--SILRRLQL 118
Query: 68 CPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
C C + CS + ASS H+ A L+ P A L A I +
Sbjct: 119 CTGGCRLPICSRCDYSVGASSPHA-----AECALIQSWQPKDAGRYSKNILYALTSIRGF 173
Query: 127 NLAVVNPSQFQILLAFQG--------TVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
L+ + S I+L +G T + ++ + PA + +L
Sbjct: 174 LLSERDRS---IVLQMEGHPPRKEMTTEIERLLKDGYFWNLDGEGPAVRYLRQVVNILNT 230
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
NAF + ++ + + G+Y+ + NH C PN V + + V
Sbjct: 231 ---NAFETSRIVADEENNQHEIILRGLYILGALMNHCCRPN------VRYVFDGELRMRV 281
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ +G +I +Y + + R L + +F C CDRCK
Sbjct: 282 HASRPIKKGEQIMNNYSKILWGSQHRIIHLCFSKNFLCCCDRCK 325
>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 635
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 130/335 (38%), Gaps = 73/335 (21%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKT 60
A S+ ++++ E GR LV+T+ K G +V ++P ++Y+ + + CH+C ++
Sbjct: 239 AASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYY----THCHHCLSRS 294
Query: 61 MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
+ C C VA +CS KC A +H C + + NL + D + + ++
Sbjct: 295 YNLIP--CSYCPVAQYCSEKCRKLAWEMAHQ-IECPIMALVGNLLNVDKDKIRMLTKI-I 350
Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
RFLI LA NP +G + +A L++ L
Sbjct: 351 RFLIVVTSKGKKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPL 410
Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
A I AA+L + N F + +D G RA
Sbjct: 411 IGISAFACISALAAILLATQTNFFCNKYEVDQLKDISNYPKIIFSGSLMFRACVIMSSNC 470
Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
G+Y+ S +NH C PN R F+ +I R + + G +
Sbjct: 471 FSVQQEPGIKTGSGLYVTHSLYNHSCAPNTFRHFE--------ELTMITRALRPIYPGDQ 522
Query: 250 ICLSYFPVNYDYST---RQKRLLYDYDFACDCDRC 281
I +Y Y Y T R+++++ DY F CDC C
Sbjct: 523 IFTNY-GAAYAYMTKSERREKIIQDYFFECDCIAC 556
>gi|118776986|ref|XP_307207.3| AGAP011523-PA [Anopheles gambiae str. PEST]
gi|116133074|gb|EAA03232.3| AGAP011523-PA [Anopheles gambiae str. PEST]
Length = 473
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 103/277 (37%), Gaps = 32/277 (11%)
Query: 18 RGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCS 77
R +V+ + LKAG +++ P ++ CH+ F+ S C C+ CS
Sbjct: 1 RYIVAARDLKAGDVIIETPPFVVGPCAETEPVCLGCHSAFQPG--SVVYRCEICNWRICS 58
Query: 78 PKCSTAAASSSHSPYVCQALTRLL---NLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
P C + SH C L +L + AA + L +A + L V+
Sbjct: 59 PSCQGLVPAGSHRQLECIPLRDKAVHNHLQTSPAAQVKL--MYEAILTLRCMLLKTVDRP 116
Query: 135 QFQILLAFQ----------GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
Q+ LL N + + + +EL + +NAF
Sbjct: 117 QYDRLLGMDPLNDVRQQIPKLWNRNQKEIVERIRNQWGFAEYSELEL-HTICGIIEVNAF 175
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ G+ +A ++ +A HDC PN D A + VR++ DV
Sbjct: 176 ---------EVGQEPTKARALFPEAYLLMHDCTPNTGHTD-----APQTHHLTVRVLRDV 221
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
G + L+Y + R++ L + F+C+C RC
Sbjct: 222 KAGEPLTLTYAHILQGTLKRRQHLREEKFFSCNCRRC 258
>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 401
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G++ + NH C PN C+ A + + +R V G E+CLSY P++
Sbjct: 50 GLGLFPSGAMINHSCSPN-CQ------AWWRGSQLEIRCTKPVATGEELCLSYIPIDQPS 102
Query: 262 STRQKRLLYDYDFACDCDRCKVEANW 287
+ R+ +L + + FAC C RC V W
Sbjct: 103 TVRRAQLRHSWFFACRCRRC-VSRQW 127
>gi|409077404|gb|EKM77770.1| hypothetical protein AGABI1DRAFT_76762 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 373
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
T ALL ++ N FG+ + +E + + +G+Y S+FNHDC P R ++ A E
Sbjct: 269 TRALLGREHGNVFGIWDMALEEMNS--EMLGWGLYSFGSYFNHDCSPT-LRKEHKGRAIE 325
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
I D+ G E+C+SY + ++K L ++ F C C C E
Sbjct: 326 YYA------IKDIDTGEELCISYVETD-QVVEKRKEALKEWFFDCGCGTCVAE 371
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 112/303 (36%), Gaps = 61/303 (20%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCH 54
M+A E AE +G GL +T + AG+ LLY+A P S CH
Sbjct: 1 MAAKFERFASAE---KGNGLRATHEIHAGE--------LLYTAEPLAYCVSNTCAKDLCH 49
Query: 55 NCF--RKTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
+CF RKT+ C C VA +C C A S C+ L L +
Sbjct: 50 SCFSRRKTLLR----CSQCKVARYCDVTCQKQAWSDHKRE--CKCLRSL-------HPRI 96
Query: 112 PLDR-QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC--------P 162
P D +L AR + + + +P L + + D LS L
Sbjct: 97 PTDSVRLAARIIFRLLSPSQTSPG-LCSLEEHESHLCDMGEERREGLSQLSSMLQLYIQQ 155
Query: 163 PPASTTIELTA----ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLP 218
+ T EL A +LLAK N F + DG G+Y S NHDC P
Sbjct: 156 EVPNITQELPAVDPLSLLAKLTCNCFTI-------SDGELREIGVGLYPSMSLLNHDCRP 208
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
+ E T + +R + D+ E+ +SY R+ +L Y F C C
Sbjct: 209 SCVML------FEGET-LHLRAVRDMQPAEELTISYIGTLAPTRDRRTQLEEQYHFTCQC 261
Query: 279 DRC 281
RC
Sbjct: 262 QRC 264
>gi|157869732|ref|XP_001683417.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126482|emb|CAJ04456.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 513
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 7/99 (7%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
++ G+Y S FNH C+PNA VD E I+++ + G + ++Y P+
Sbjct: 418 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLAA 472
Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
++R ++R L +Y F C C RC EA S D N
Sbjct: 473 GTASRAERQRQLRNYFFTCHCPRCDTEAAASAAMTGDGN 511
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 57/280 (20%), Positives = 105/280 (37%), Gaps = 36/280 (12%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
E V + + G+GRGL +T+ AG ++ + + +++ +L + CH+CFR+
Sbjct: 2 ENVAIFDSPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSL----AERICHSCFRR--QE 55
Query: 64 SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C C A +C C A + C A+ N+ + +Q
Sbjct: 56 KLQKCSQCKFAHYCDRTCQRAGWAEHKQE--CGAIK---AYGKAPNENIRVVSHMQ---- 106
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
L V + + + + + ++L TI+ + + N
Sbjct: 107 -----LITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISHIFGVINCN 161
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
F + Q G ++V G++ NH+C PN N+ I +R +
Sbjct: 162 GFT-----VSDQRGLQAV-GVGLFPNLCMVNHNCWPNC-------TVILNHGKIELRSLG 208
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ +G E+ ++Y R++ L Y F C CD CK
Sbjct: 209 KIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCK 248
>gi|259149991|emb|CAY86794.1| Set6p [Saccharomyces cerevisiae EC1118]
gi|323346145|gb|EGA80435.1| Set6p [Saccharomyces cerevisiae Lalvin QA23]
gi|365762745|gb|EHN04278.1| Set6p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 373
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 30/197 (15%)
Query: 92 YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
+VC++L L +N P LQ+ L S+F +LL FQ V
Sbjct: 198 FVCESLFNLKYMN-PQCITYRAFNMLQSNELSKI--------SKFPVLLHFQKLVFQT-- 246
Query: 152 SAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLK 208
L L P +I L +L + NAFGL + EG+ R Y ++ +
Sbjct: 247 -----LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ---EGEASDSREYFGYWVFPE 298
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
AS+FNH C PN ++ ++ + D+ + +IC+ Y V + +++
Sbjct: 299 ASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQICIDYSGVLDLPTVKRRAF 351
Query: 269 LYD-YDFACDCDRCKVE 284
L D + F C C+RCK E
Sbjct: 352 LADSWFFDCACERCKSE 368
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 114/303 (37%), Gaps = 57/303 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L + + C C K M C C
Sbjct: 14 RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVVKLMDGKPSESEKL 123
Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ Y+L N S+ Q+ + FQ + + A+ PP+ E A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+++R + ++ G E+ + Y + R+K+L Y F CDC RC + +D
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCPTQDKDADMLTG 278
Query: 294 DEN 296
DE
Sbjct: 279 DEQ 281
>gi|323331171|gb|EGA72589.1| Set6p [Saccharomyces cerevisiae AWRI796]
Length = 373
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCSPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
Length = 497
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 49/286 (17%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSFCHNCFRKTMHSSSSIC 68
+ + E G G+ +T++++ G+++ +D P++L A P S C +C+ + +C
Sbjct: 30 IKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLS--CVSCYSR---RDLRLC 84
Query: 69 -PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAA 125
C + CS KC + V Q ++ ++ + ++ L R L L+
Sbjct: 85 GKKCGLLVCSEKCEKSLGHQKECKVVRQWQSKPIS----EDLSVQLVRVLSPIRSLLLGE 140
Query: 126 YNLAVV--------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
+ +VV + F++ + + ++ +C S
Sbjct: 141 EDKSVVKCLKAHKSDQHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDS----------- 189
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI 236
NAF ++ + +D + SV+ G+Y S NH C PN C FD + +
Sbjct: 190 ----NAFEVLVGF---EDNQASVK--GLYPLGSLANHSCCPNTCHVFD-------DKQHM 233
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+VR +PQG EI SY + + S R+ L F C C RC+
Sbjct: 234 VVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCE 279
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 7/89 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y + + FNHDC PN C + ++ VR++ D+ E+ +SY +
Sbjct: 200 GLYPEGALFNHDCDPN-CVVSF------KGREMQVRVVRDIEVDEELTVSYVELLQSTKA 252
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDN 292
R++ L Y F C+C RCK N N++
Sbjct: 253 RRRELKESYFFDCECKRCKAATNGQSNED 281
>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
Length = 481
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 111/289 (38%), Gaps = 46/289 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
G GRGL ++ + G+ VL P+ + L + C CF + I C
Sbjct: 13 GLGRGLFASTDIAVGEEVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRG 71
Query: 71 CS-VAFCSPKCSTAAASSSHSPY--VCQALT-RLLNLNSPDAANLPLDRQLQARFLIAAY 126
C V +C C HS + Q L R+L +N+ + L + Q
Sbjct: 72 CGIVKYCDKTCQAKNWKLGHSLECSIYQNLKPRVLPINARAVLRMVLRSERQK------- 124
Query: 127 NLAVVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTT---IELTAALLAK 178
+ + L + D D SA+ + LSS S T E+ +A+ AK
Sbjct: 125 ----YSSEELDQFLQLGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAK 180
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDI 236
LN+F L + G+YL A+ FNH C NA A + + ++
Sbjct: 181 LDLNSFNLTNAVYD---------RLGVYLHPYAAIFNHSCDHNA-------AVSFDGPNL 224
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
++ + + + +I ++Y V Y RQ L Y F C C +C EA
Sbjct: 225 HIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCSREA 273
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 113/290 (38%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
E V V E +GRGL +T+ A ++ + YSA+ F + + CH CF++
Sbjct: 7 EKVEVFTSESKGRGLKATKEFWAADLIFSERA---YSAVVFDSLTHLVCHTCFKR--QEK 61
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A +C C A + + C A+ + + +P+ N+ L ++ R
Sbjct: 62 LQRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKK--HGKAPNE-NIRLAARIMWRIER 116
Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
L +V+ Q + G + +L P +++ + +
Sbjct: 117 EGTGLTEGCLVSIDDLQNHVDSFGEEEQKELRLDVESFLHFWPPQSQQFSMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
N F L + Q G ++V G++ NHDC PN + V +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGVFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHT 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + +G E+ +SY + R+K L Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCE 280
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 112/284 (39%), Gaps = 30/284 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E+V V EG+GRGL + + G ++ + Y+A+ F + + CH CF++
Sbjct: 7 ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVVCHTCFKR--QEK 61
Query: 65 SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C A+ C C A + + C A+ + + +P N+ L ++ R
Sbjct: 62 LHRCGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRIER 116
Query: 124 AAYNLA-----VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
L+ ++ Q + + D +L ++ + +
Sbjct: 117 EGSGLSENCLVSIDDLQNHVENFDEEEKKDLRIDVESFLEFWPAQSQQFGMQYISHIFGV 176
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
NAF L + Q G ++V GI+ NHDC PN C + NN I +
Sbjct: 177 INCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVIF------NNGKIEL 223
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
R + + G E+ +SY RQK+L Y F C C+ CK
Sbjct: 224 RALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCK 267
>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
Length = 392
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGDEICISYLDECMLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCSKCRAQAS 368
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 123/318 (38%), Gaps = 54/318 (16%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
++S +VA E GR + + +++ G+IVL + P S L N S+ CH+C + T+
Sbjct: 175 SLSSACSVASTENEGRFITAVRNIAPGEIVLIEKPFA--SVLLRANYSNHCHHCLKHTLE 232
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPDAANLPLDRQL 117
C +C A FCS C A + H Y C L L +N A +
Sbjct: 233 GIP--CRTCPDARFCSEACRDTAMQTYHQ-YECSVLNTLHHSQINKFGCLAFRAITKQSY 289
Query: 118 QARFLIAAYNL--------AVVNPSQFQILLAFQGTVTD-------NDTSAAHYLSSLCP 162
Q+ I A +L + P + ++ D + T A YL L
Sbjct: 290 QSLKDIRAQDLPLNGCHSDGLYRPQDYNTIIQLVTHAKDRPVQDLFHRTVMAVYLLKLLQ 349
Query: 163 PPA----STTIELTAAL-------LAKDRLNAFGLMEPYIEGQDGRRSV---RAYGIYLK 208
+ +E+ A + L NA + E Y++ S+ GIY
Sbjct: 350 QTSYFNGEEDVEMQAYIAGLFLSHLQSFPCNAHEVPELYLDPNAIDLSMPNELGAGIYST 409
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST----- 263
S FNH C P R Y D +VR I + +G ++ +Y + Y+T
Sbjct: 410 LSLFNHSCDPGVNRNFYGDTC-------VVRAIKTIRKGHQVSDNYGAL---YATNTLKE 459
Query: 264 RQKRLLYDYDFACDCDRC 281
R +L Y F+C C+ C
Sbjct: 460 RHDKLQPQYFFSCRCEPC 477
>gi|302144054|emb|CBI23159.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ +F NH C PNA R D ++IV DV G EI +YF V +
Sbjct: 293 GLWILPAFINHSCNPNARRLHVGD-------NVIVHTSRDVKAGEEITFAYFDVLSPWRK 345
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
R K + + F C+C RCK E
Sbjct: 346 R-KDMAKTWGFQCNCKRCKFE 365
>gi|225444416|ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253788 [Vitis vinifera]
Length = 550
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ +F NH C PNA R D ++IV DV G EI +YF V +
Sbjct: 340 GLWILPAFINHSCNPNARRLHVGD-------NVIVHTSRDVKAGEEITFAYFDVLSPWRK 392
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
R K + + F C+C RCK E
Sbjct: 393 R-KDMAKTWGFQCNCKRCKFE 412
>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
Length = 650
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 116/302 (38%), Gaps = 45/302 (14%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
GR + LK I+L + P + S L S C +CF++ S CP C+ V F
Sbjct: 240 GRFARTNTDLKPNTILLLERPHV--SVLLEDYSLDHCTHCFKRV--SVPIACPLCADVVF 295
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQARFLIAAYNLAVVN 132
CS +C T A ++ H Y C L L + ++ L ++ + FL LA +
Sbjct: 296 CSDECETKANATYHR-YECGFLPILWGSGASITCHMALRMITQKSEEYFLKLKPELAGLT 354
Query: 133 PSQFQIL-----------LAFQGTVTDNDTSAAHYLSSL------------CPPPASTTI 169
Q L + + T + D +++L CP +
Sbjct: 355 NEQIDKLPVDDYRKVYKLVTHESTRSPEDFFQRTLMATLLNACLTLGGYGACPQEQNFIG 414
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQD--GRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
L L + NA + E E + G+ + G+Y + FNH C P R+
Sbjct: 415 GLLVHNLQLLQFNAHEVSEMIRETAEDIGKSTFIGGGLYPTLALFNHSCDPGVTRY---- 470
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKVEA 285
+ VR + ++P + +Y P+ ++R LL+ Y F C C C VE
Sbjct: 471 ---YRGNQVCVRTVKNIPADSMVAENYGPLFTQVRRDERRDTLLHQYRFTCQCVPC-VE- 525
Query: 286 NW 287
NW
Sbjct: 526 NW 527
>gi|238485368|ref|XP_002373922.1| set and mynd domain containing protein, putative [Aspergillus
flavus NRRL3357]
gi|220698801|gb|EED55140.1| set and mynd domain containing protein, putative [Aspergillus
flavus NRRL3357]
Length = 372
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/315 (21%), Positives = 118/315 (37%), Gaps = 61/315 (19%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN----------SSSFCHNCFRKTMHSSS 65
+GR L Q +K G+++L D P Y+ +P ++ S+ CH +T+ S
Sbjct: 44 KGRQLRVCQPVKKGELLLVDLP---YALIPVVDHPEQSDDVRCSNPACHRRVARTVERVS 100
Query: 66 SICP---SCSVAFCSPKCSTA--------------AASSSHSPY------VCQALTRLLN 102
CP S V +C+ C A A+S S + + + R+L
Sbjct: 101 --CPNRCSAEVVWCNSSCRDADNLRHEFECTWLKKYATSIRSKWGEYDFGMLWLIVRILA 158
Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
+ N P + + L+ + + + +F G+ S + + L
Sbjct: 159 SRHVEFRNTPSSDKTSGSKTGSDSKLSSFE-TGWNAIRSFCGSQDSWTHSQVRHWTMLVK 217
Query: 163 P--------PASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLK 208
P T L+ ++ N+FGL P E R A G+Y
Sbjct: 218 KYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAGVYPT 277
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR- 267
A+ NH C PN + ++ + ++ D+ G E C+SYF ++ + +R
Sbjct: 278 AALANHSCSPN------IIHKPDDQSRMVFVASKDIATGEECCISYFDLSKKVELKDRRD 331
Query: 268 -LLYDYDFACDCDRC 281
L + F C CDRC
Sbjct: 332 HLQGSFRFVCKCDRC 346
>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 115/311 (36%), Gaps = 80/311 (25%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS--- 66
VA E GR L +T+ +K G+ +LR++P+LL P + S+ C C R +
Sbjct: 50 VATNETLGRHLRATRDVKIGEEILREAPLLLG---PKVASAPICLGCHRNLLAPQKQRGN 106
Query: 67 --ICPSCSVAFCSPKCSTAAA------------------------SSSHSPYVCQALTRL 100
C SCS C +C + S Y L R
Sbjct: 107 YYKCSSCSWPLCGRECEESPHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRC 166
Query: 101 LNLNSPDAANL----PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTD-NDTSAAH 155
+ L S D L+ L+ R Y + N L+ F T+ D S
Sbjct: 167 MQLKSSDPEGFARLSALEDHLEERLATPLYQVLRAN------LITFIKTILGMRDWSEVD 220
Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
L AA+L N F L +P RR VRA ++ A+ +HD
Sbjct: 221 ILR-------------IAAILDT---NTFELRQPR-----ERRKVRA--LFPGAAMISHD 257
Query: 216 CLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYD 273
C PN RFD ++ +II + +G + +SY P+ ST Q+RL
Sbjct: 258 CAPNMRHRFD-------DDMNIIFLAKRPIAKGEILSISYTQPLR---STIQRRLHLRQA 307
Query: 274 --FACDCDRCK 282
F C C RC+
Sbjct: 308 KCFDCACARCQ 318
>gi|342888668|gb|EGU87905.1| hypothetical protein FOXB_01591 [Fusarium oxysporum Fo5176]
Length = 655
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 71/154 (46%), Gaps = 19/154 (12%)
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
NPSQ + +L G + L + P T ++ + + NAFGL +
Sbjct: 447 NPSQVEKVLRLHG----DYHGLGRKLQEVDDSPVIDTFQVHDII----QRNAFGLGQ--- 495
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ +D S + G++++AS+ NH C+PNA + D+V II R + + G EI
Sbjct: 496 QTEDEDVSNASTGLWIRASYINHSCIPNAKK-DFVGDL------IIFRAMRRIVTGEEIT 548
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
SY + DY R+ + ++F C C C VE
Sbjct: 549 HSY-DESSDYEARKTNIRRTWNFECCCQLCLVEG 581
>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 467
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/287 (20%), Positives = 110/287 (38%), Gaps = 33/287 (11%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
S V ++ +AE GR V+T+++K G+I+LRD L+ + NS C C+
Sbjct: 39 SKVCKSFKLAENPSLGRHYVATRNIKVGEIILRDDQPLITGLM--YNSVPVCLQCYTMLN 96
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
+ C C C C+ S + + P + Q
Sbjct: 97 QEIAIPCEKCGWPLCQ-NCNEHGLECKFSCSRRDSKISITEFGYPHPS-------YQCIN 148
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA-----AHYLSSLCPPPASTTIELTAALL 176
+I A +L NP ++ L++ + + + S AH++ + E+ +
Sbjct: 149 IIRALSLKDTNPESYKKLISLESHCNEINNSKEPLNIAHFIKRFFKADDISEEEIVTII- 207
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
G+++ + G + + Y +Y AS H+C N C + D
Sbjct: 208 --------GILQ--VNGHEVPLTDSPYVAVYEMASLIEHNCRAN-CSKSFTDMGG----- 251
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+I+R + +G I + Y + + R+ LL F C C+RC+
Sbjct: 252 LIIRAALPITKGDHISICYTDPLWGTANRRHHLLKTKFFECICNRCQ 298
>gi|207340657|gb|EDZ68942.1| YPL165Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 373
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|242206607|ref|XP_002469159.1| predicted protein [Postia placenta Mad-698-R]
gi|220731830|gb|EED85671.1| predicted protein [Postia placenta Mad-698-R]
Length = 447
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 55/122 (45%), Gaps = 13/122 (10%)
Query: 165 ASTTIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
++ TI+ A+ + D N+FG+ E P E + + +Y + SFFNH C PN +
Sbjct: 329 SAITIDNVRAIFSVDPGNSFGIWEVPLTEESE----CLGFAVYPRPSFFNHHCSPNVNKE 384
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCK 282
Y A T V G E+C+SY + RQK L + F C C +C
Sbjct: 385 RYGRGLAFVTT-------RSVRAGEELCISYGHAEGMGWRQRQKELREGWFFECSCGKCI 437
Query: 283 VE 284
V+
Sbjct: 438 VD 439
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 68/287 (23%), Positives = 111/287 (38%), Gaps = 42/287 (14%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSSIC 68
V+ + +GR L + + + G+++L P Y +P SS S C CF+ ++ C
Sbjct: 15 VSNLPQKGRSLFTARDFRPGEVILSQKP---YICVPNNTSSESRCDGCFKT---NNLKKC 68
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQLQAR 120
+C V +C C + C+ALTRL +P L + R LQ
Sbjct: 69 SACQVVWYCGSSCQKSEWKLHRDE--CKALTRLEKEKRKFVTPTIRLMVRLYIKRNLQNE 126
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY------LSSLCPPPASTTIELTAA 174
+ L + + ++ A +++ D L +L S + A
Sbjct: 127 KV-----LPITTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAE 181
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+K NA + + + Q G++ S NH C PNA + + A
Sbjct: 182 NFSKFSCNAHSICDSELRPQ-------GIGLFPLVSIINHSCSPNAV-LVFEEQMA---- 229
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+VR + ++ + EI +SY TRQK L Y F C C RC
Sbjct: 230 --VVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARC 274
>gi|323335307|gb|EGA76596.1| Set6p [Saccharomyces cerevisiae Vin13]
Length = 373
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe]
Length = 483
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 69/288 (23%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFR-----------KTMHSSS 65
G+G V+T ++ G+I++R +L +L N + C C K +H S
Sbjct: 15 GKGTVATDNIPIGKIIIRKRVDIL--SLDSANLTRTCSTCTEEKVKTQRCAACKIIHYCS 72
Query: 66 SICPSCSVAFCSPKCSTAAASSSHS--PYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C F +C AS + P VC+ L RL +L+
Sbjct: 73 KGCQKADWPFHKLECKALQASKQNGILPSVCRLLIRL--------------------YLL 112
Query: 124 AAYNLAVVNP-----SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
N A++ P ++FQ + + ++A+HY EL L +
Sbjct: 113 WQKNPAIIEPMEGHQNEFQAVSSSWSDAELIASAASHY-------TQIYQAELFQKLFCR 165
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDI 236
+NA L+ S + G+ L NH C PN C+ + D A
Sbjct: 166 LAVNAMNLVTS---------SFDSLGMCLDTILCRLNHSCDPN-CQIIF-DGA------- 207
Query: 237 IVRMI--HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
IV+++ D+ + ++ +SY + S RQK+LL Y F+C C RC+
Sbjct: 208 IVQLVSKRDIKKDEQLFISYIDIRLPKSIRQKQLLKKYFFSCYCPRCE 255
>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
Length = 516
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 113/286 (39%), Gaps = 46/286 (16%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM-------HSSSSICP 69
GR + +T+ +K G+++L++ P ++ P + C +C R+ + C
Sbjct: 34 GRFMTATRDIKQGEVILKEKPAVIG---PRMYCKVQCLSCGRQLQPIPNDDGNMDFIRCS 90
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAAYNL 128
SC+ C C A + Q+ + + PD A A +IA +
Sbjct: 91 SCNWPVCGIDCEKAEVHREECKIMVQSKYKCDIKYECPDKAE-------AAYCVIAPLRV 143
Query: 129 AVV---NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL---- 181
++ NP F+ ++ + V D + + P I + D++
Sbjct: 144 LLMKDSNPRMFESIMNLESHV---DQRINTPIYQVLRPNLIMFIRQVLGMNFDDKIILEI 200
Query: 182 ------NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
N+F + D + +RA IYL AS NH C PN R Y+ E+ T
Sbjct: 201 SSIFDNNSFD-----VRSADKTKRLRA--IYLLASMMNHSCRPNT-RHIYL---GEDKTL 249
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
++ +H + +G EI +Y + R+K L + F C C+RC
Sbjct: 250 ALIATVH-IAKGEEITATYTQPLWGTLDRRKFLKTNKYFDCKCERC 294
>gi|349581653|dbj|GAA26810.1| K7_Set6p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 373
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
Length = 387
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 49/286 (17%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSFCHNCFRKTMHSSSSIC 68
+ + E G G+ +T++++ G+++ +D P++L A P S C +C+ + +C
Sbjct: 30 IKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLS--CVSCYSR---RDLRLC 84
Query: 69 -PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAA 125
C + CS KC + V Q ++ ++ + ++ L R L L+
Sbjct: 85 GKKCGLLVCSEKCEKSLGHQKECKVVRQWQSKPIS----EDLSVQLVRVLSPIRSLLLGE 140
Query: 126 YNLAVV--------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
+ +VV + F++ + + ++ +C S
Sbjct: 141 EDKSVVKCLKAHKSDQHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDS----------- 189
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI 236
NAF ++ + +D + SV+ G+Y S NH C PN C FD + +
Sbjct: 190 ----NAFEVLVGF---EDNQASVK--GLYPLGSLANHSCCPNTCHVFD-------DKQHM 233
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+VR +PQG EI SY + + S R+ L F C C RC+
Sbjct: 234 VVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCE 279
>gi|195479534|ref|XP_002100923.1| GE15901 [Drosophila yakuba]
gi|194188447|gb|EDX02031.1| GE15901 [Drosophila yakuba]
Length = 532
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 111/282 (39%), Gaps = 37/282 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+++ L+AG+ ++R+ P+ + P ++ C C++ ++ CP C+
Sbjct: 58 GRYLVASRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKPDQYRCPGCAWPL 114
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
C C+ H+ CQ L A L +R A ++
Sbjct: 115 CGITCTGLKHRHGHTETECQ----LFGERRAVAGELLTERAGPAEVRDLYELVMIVRILL 170
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
L +P QF ++ + + +A HY + T +L L A+ +
Sbjct: 171 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQL-EDLEAEQVHDVC 228
Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
G+++ + GQ+G +A +Y A HDC PN D + +I++R
Sbjct: 229 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 280
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
V + + LSY Y KR + ++ F C C RC
Sbjct: 281 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 112/294 (38%), Gaps = 53/294 (18%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+TVA I G+GRGL++T + G ++L P Y++ P I S C +CF S
Sbjct: 17 LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73
Query: 67 ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
+C +CS C C+A+ ++L L L R+LQ
Sbjct: 74 MCRV--TWYCSSNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLALKRKLQN 129
Query: 120 RFLIAA-----YNLA------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
+I + YNL + + Q++L Q L SL P
Sbjct: 130 EKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQ----------MANLVSLILPFIELD 179
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVD 227
++ A +K NA + +P + G+Y S NH C+PNA FD
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRT 232
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
A VR + + + E+ +SY + ++ L Y F C C RC
Sbjct: 233 A--------YVRALQPINKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 277
>gi|169613520|ref|XP_001800177.1| hypothetical protein SNOG_09891 [Phaeosphaeria nodorum SN15]
gi|111062036|gb|EAT83156.1| hypothetical protein SNOG_09891 [Phaeosphaeria nodorum SN15]
Length = 379
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 71/163 (43%), Gaps = 24/163 (14%)
Query: 133 PSQFQILLAFQGTVT---DNDTSAAH-----YLSSLCPPP--ASTTIELTAALLAKDRLN 182
P Q+ LLA T D AA +L S+ PP TT + L ++D N
Sbjct: 224 PEQYPSLLALADDETPYHSTDDLAAFTRTYLHLVSVLPPELLPITTAGVLFTLSSRDSHN 283
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
+FG+ +G + YG + AS+FNH C PN + V A E R
Sbjct: 284 SFGIRSLEDDGSE----FFGYGCWPAASYFNHSCAPNVEK-RRVGRAWE------FRSGK 332
Query: 243 DVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
DV G E+C++Y +R KR L ++ F C C+RC+
Sbjct: 333 DVKCGEELCITYLSGGERKLSRDKRMETLKKNWSFQCGCERCE 375
>gi|448825163|ref|YP_007418094.1| putative set domain-containing protein [Megavirus lba]
gi|444236348|gb|AGD92118.1| putative set domain-containing protein [Megavirus lba]
Length = 107
Score = 54.3 bits (129), Expect = 9e-05, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
I FNH CLPN + N ++ + D+ +G E+ SY + +D T
Sbjct: 12 AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 64
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ RL Y F CDC RC
Sbjct: 65 RQSRLWNQYRFHCDCQRC 82
>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
Length = 468
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/276 (20%), Positives = 112/276 (40%), Gaps = 20/276 (7%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
V E GR +V+T+ +++G+++ P+++ P + S C C +K + S C
Sbjct: 13 VKRDEVLGRCVVATRDIRSGEVIAESLPLIVG---PKMASPPLCLGCHKKLALAESRYDC 69
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAY 126
C C C ++ + +A + + N NS P+ R L+
Sbjct: 70 SKCFWPLCGQACESSPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPI---APLRALLLKK 126
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
N +P +F +LL FQ + D++ + + + P T + L + L +
Sbjct: 127 N----SPERFNLLLDFQSHLQDHEKTQVYQVLKKSLVPFFTQL-LKLDTNESEILTICSI 181
Query: 187 MEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ E +D +R V G+Y SF +H +C+ + +N +++ +
Sbjct: 182 FDTNCFEVRDTQRLVNIRGLYPTISFLSH-----SCKHNTKHCFNGDNFRLVLTATTPIK 236
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+G + +Y + +R+ L F C C+RC
Sbjct: 237 KGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERC 272
>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 438
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R G+Y S FNH C PN +++A + T ++V + D G E+ +SY +
Sbjct: 349 RGQGVYAVGSLFNHSCEPN---LQVLNSATGDET-LVVEALRDCEPGEELYISYIDESLP 404
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
Y+ RQ++L Y F C C +C E+
Sbjct: 405 YAIRQQQLREHYLFECRCSKCVRES 429
>gi|330915865|ref|XP_003297203.1| hypothetical protein PTT_07522 [Pyrenophora teres f. teres 0-1]
gi|311330265|gb|EFQ94703.1| hypothetical protein PTT_07522 [Pyrenophora teres f. teres 0-1]
Length = 388
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 31/190 (16%)
Query: 113 LDRQLQAR-------FLIAAYNLAVVNPSQFQILLAFQGTVT---DNDTSAAH-----YL 157
L + LQ R F ++ ++P Q++ + A T +D AA +L
Sbjct: 208 LQKALQERITPDVMSFCVSGIVWRYLHPEQWERVQALAPDKTPYHSSDDLAAFTRTYLHL 267
Query: 158 SSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
++ P P T E L ++D N+FG+ EG + YG + AS+FNH
Sbjct: 268 LAILPNPLVPLITAETMMMLSSRDSHNSFGIRSLDDEGSE----FFGYGCWPAASYFNHS 323
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK---RLLYDY 272
C PN ++ E R D+ G+E+C++Y +R K RL +
Sbjct: 324 CGPN------IEKNREGRA-WYFRAGQDIKHGQELCITYLSGEERKLSRGKRKLRLKKTW 376
Query: 273 DFACDCDRCK 282
F C C+RC+
Sbjct: 377 GFDCGCERCE 386
>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 510
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 7/88 (7%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
++ G+Y S FNH C+PN +A + +I+++ + G + ++Y P+
Sbjct: 416 LKGAGLYSLLSCFNHSCVPNV-----AVSAVDGTHEIVLKTTRSIRAGEPLAITYIPLTA 470
Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEA 285
++R ++R L +Y F C C RC +EA
Sbjct: 471 GATSRAERQRQLKNYFFTCHCPRCDMEA 498
>gi|258576329|ref|XP_002542346.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902612|gb|EEP77013.1| predicted protein [Uncinocarpus reesii 1704]
Length = 273
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 62/133 (46%), Gaps = 14/133 (10%)
Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKAS 210
S H LSSL L A K++L AF G+ D + +G++L+A+
Sbjct: 75 SIGHQLSSLAKEQQREFFALANA--HKEKLGAFLGIAVTNAIIMD--YETKEHGVFLQAA 130
Query: 211 FFNHDCLPNACRFDY--VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
FNH C PNA R + +D +++ + DV +G EI +SY + +S R + L
Sbjct: 131 RFNHACRPNAMRTFHPILD-------QVVIHVAKDVSEGEEITVSYIEPAHVFSLRVQLL 183
Query: 269 LYDYDFACDCDRC 281
+ F C C+ C
Sbjct: 184 KAKFGFTCVCELC 196
>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
Length = 394
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PNAC F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALSPIQEGDEICISYLDECQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKILCGNYIFICQCPKCRAQAS 368
>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
rotundata]
Length = 388
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 89/195 (45%), Gaps = 28/195 (14%)
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPPASTTIELT 172
RQ+ + L + P F+ LLA GT +S + ++ ++ + +EL
Sbjct: 204 RQMMQKSLNTEFTEHWFTPDGFRSLLALVGTNGQGIGTSSFSRWVKNV------SALELP 257
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAY------GIYLKASFFNHDCLPNACRFDYV 226
++R+ F L++ + D +V ++ G+Y+ S NH C+PNA V
Sbjct: 258 E----EERIYVFKLIDRIYD--DMEEAVGSFLNNEGSGLYILQSAVNHSCIPNA----IV 307
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ NN ++++ I D+ EIC+SY + +RQK L Y F C C +C
Sbjct: 308 EFPYSNNV-LVLKAIRDIHPEEEICISYLDECDLERSRHSRQKALSSLYLFICCCSKCLS 366
Query: 284 EANWSDNDNDDENNE 298
+ N D ++DE +E
Sbjct: 367 QTNDPDLTSEDEIDE 381
>gi|170029518|ref|XP_001842639.1| msta [Culex quinquefasciatus]
gi|167863223|gb|EDS26606.1| msta [Culex quinquefasciatus]
Length = 495
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 106/288 (36%), Gaps = 30/288 (10%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH-S 63
+E V E GR V+++ L+AG+I D+P ++ P + S+ C C +
Sbjct: 3 AELYKVLECPEMGRYGVASRDLRAGEIAYEDTP---FAVGPSVGSAPLCLECACPVDGCA 59
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLDRQLQA 119
+ CP C C KC +S + C+ ++ N+ A + LD
Sbjct: 60 GGARCPRCGWPLCE-KCGAEIEASVYHKAECELFAKHGSKFQNVEDSSEACVQLDCITPL 118
Query: 120 RFLIAA------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
R L+A +N +V + + ++ +L C + EL
Sbjct: 119 RVLLAKEANPDWWNAEIVMMEDHRAERDGNAFWKADQSNVVAFLRDSCGLKDRCSEELIQ 178
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
++ +NAF + S+R G+Y K H C+ N +
Sbjct: 179 KVIGILDVNAF---------EAHTCSLR--GLYPKMGIMAHSCVTNLAHTVH----PSKG 223
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+I R D+ +G +C +Y + TRQ L F C C RC
Sbjct: 224 YRLIARAAVDIEEGAMLCTTYTHLLAGTRTRQAELQRTKYFTCQCKRC 271
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 112/289 (38%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA + G+GRGL + +S G++V+ P Y++ P I+ S C NCF +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A+ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
I + + ++ A + +++ D + L L P ++
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+K NA + +P + RS+ G+Y S NH C+PNA E
Sbjct: 184 THTFSKFACNAHTICDPEL------RSL-GTGLYPVLSIINHSCVPNAVLI------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T VR + + + E+ +SY RQ L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|354543914|emb|CCE40636.1| hypothetical protein CPAR2_106710 [Candida parapsilosis]
Length = 370
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 8/117 (6%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T ++ +L+ K+ NAFG+ +D + + +Y ASFFNH C PN +
Sbjct: 258 TTDIVRSLIGKNLSNAFGIWSN-TTAKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRV-- 314
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
D+ + D+ G E+C++Y + +++ L ++ F C C +C+ E
Sbjct: 315 -----RNDMRFETLRDIAIGEELCINYGNFQNEDVEKRQLELQEWFFDCGCTKCQSE 366
>gi|367009358|ref|XP_003679180.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
gi|359746837|emb|CCE89969.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
Length = 369
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 70/154 (45%), Gaps = 19/154 (12%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYI 191
++F ILL FQ + L + P + +++L +L + NAFG+ E
Sbjct: 227 ARFPILLDFQILIFKT-------LYIILPGYTRQNLSVKLFRHILGSEYGNAFGIWEQ-- 277
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
E R Y +Y +AS+FNH C PN + + ++ DVP+G +C
Sbjct: 278 EELSENREFLGYAVYPRASYFNHSCDPNLTK-------SRIKGTMVFTANKDVPKGEPLC 330
Query: 252 LSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
+ Y + ++++LL + + F C C RC+ E
Sbjct: 331 IDYSGLLSHTMLKRRQLLKENWFFECCCTRCESE 364
>gi|189209411|ref|XP_001941038.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977131|gb|EDU43757.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 388
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 16/132 (12%)
Query: 156 YLSSLCPPP--ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
+L ++ P P T E L ++D N+FG+ EG + YG + AS+FN
Sbjct: 266 HLLAILPQPLLPLVTAETLIMLSSRDSHNSFGIRSLDDEGSE----FFGYGCWPAASYFN 321
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---Y 270
H C PN ++ E R D+ G+E+C++Y +R KR+L
Sbjct: 322 HSCGPN------IEKNREGRA-WYFRAGQDIKLGQELCITYLSGEERKLSRGKRMLRLQK 374
Query: 271 DYDFACDCDRCK 282
+ F C C+RC+
Sbjct: 375 TWGFDCGCERCE 386
>gi|398398565|ref|XP_003852740.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
IPO323]
gi|339472621|gb|EGP87716.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
IPO323]
Length = 383
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 14/120 (11%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T +L ++ D NAFG+ G + + +Y AS+FNH C PN + V
Sbjct: 272 TPKLCLTMVRADNHNAFGIRG----GGEDSEEYMGWAVYPSASYFNHSCSPNLAK-RRVG 326
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
E T DV G E C++Y D + RQ+RL + F C C+RC+ E
Sbjct: 327 REWEFTT------ARDVDAGEECCITYLGGDEKGMDRAERQRRLKEVWGFDCGCERCRDE 380
>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
Length = 392
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+YL S NH C+PN+C F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYLLQSKINHSCVPNSCSTFPY------SNDIVVLKALTPIQEGDEICISYLDECQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYIFVCQCPKCRAQAS 368
>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
Length = 690
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 70/159 (44%), Gaps = 19/159 (11%)
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
NPSQ + +L QG + L L P T ++ + + NAFGL +
Sbjct: 447 NPSQVERILGLQG----DYKGIGQKLIELDEVPVIDTFQIHDIV----QCNAFGLGQ--- 495
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ +D S + G++++AS+ NH C+PNA + D I+ R + G EI
Sbjct: 496 QTEDEDISNASTGLWVRASYINHSCIPNAKKDLIGDL-------ILFRATRRIASGEEIT 548
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+Y + Y RQ ++F C C C V+ + SD
Sbjct: 549 HAY-DESTSYEARQAAFRRTWNFECRCPLCLVQMDESDT 586
>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
Length = 400
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 132/367 (35%), Gaps = 109/367 (29%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 16 KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 75
Query: 68 -------------------CPSCSVAFCSPKCSTAAASS---------SHSPY------- 92
CP C V +CS +C AAA SH P
Sbjct: 76 SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQILCPGPSHDPRHPLNKLQ 135
Query: 93 ------------------------VCQA-----LTRLLNLNSPDAANLP---LDRQLQAR 120
V QA RL N AN + + L+ +
Sbjct: 136 EAWRSVHYPPETASIMLMARMVATVKQAKDKDHWVRLFNHFCSRTANQEQAIVHKLLKGK 195
Query: 121 F------LIAAYNLAV--------VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPP 164
F L+ + A+ P F+ L A GT +S + ++
Sbjct: 196 FKDQLELLLGLFKEALYEEALSLWFTPEGFRSLFALVGTNGQGIGTSSLSQWVH------ 249
Query: 165 ASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACR 222
A +ELT +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 250 ACDALELTPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-- 305
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCD 279
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 306 ---ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFNCSCP 362
Query: 280 RCKVEAN 286
+C EA+
Sbjct: 363 KCLAEAD 369
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 112/294 (38%), Gaps = 53/294 (18%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+TVA I G+GRGL++T + G ++L P Y++ P I S C +CF S
Sbjct: 17 LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73
Query: 67 ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
+C +CS C C+A+ ++L L L R+LQ
Sbjct: 74 MCRV--TWYCSSNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLALKRKLQN 129
Query: 120 RFLIAA-----YNLA------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
+I + YNL + + Q++L Q L SL P
Sbjct: 130 EKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQ----------MANLVSLILPFIELD 179
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVD 227
++ A +K NA + +P + G+Y S NH C+PNA FD
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRP-------LGTGLYPVISIINHSCVPNAVLIFDGRT 232
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
A VR + + + E+ +SY + ++ L Y F C C RC
Sbjct: 233 A--------YVRALQPINKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 277
>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
Length = 498
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/276 (20%), Positives = 112/276 (40%), Gaps = 20/276 (7%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
V E GR +V+T+ +++G+++ P+++ P + S C C +K + S C
Sbjct: 13 VKRDEVLGRCVVATRDIRSGEVIAESLPLIVG---PKMASPPLCLGCHKKLALAESRYDC 69
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAY 126
C C C ++ + +A + + N NS P+ R L+
Sbjct: 70 SKCFWPLCGQACESSPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIA---PLRALLLKK 126
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
N +P +F +LL FQ + D++ + + + P T + L + L +
Sbjct: 127 N----SPERFNLLLDFQSHLQDHEKTQVYQVLKKSLVPFFTQL-LKLDTNESEILTICSI 181
Query: 187 MEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ E +D +R V G+Y SF +H +C+ + +N +++ +
Sbjct: 182 FDTNCFEVRDTQRLVNIRGLYPTISFLSH-----SCKHNTKHCFNGDNFRLVLTATTPIK 236
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+G + +Y + +R+ L F C C+RC
Sbjct: 237 KGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERC 272
>gi|125542564|gb|EAY88703.1| hypothetical protein OsI_10178 [Oryza sativa Indica Group]
Length = 536
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 8/85 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ +F NH C PNA R D A IV D+ G EI +YF V
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 363
Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
S R++ + F C CDRC+ EA+
Sbjct: 364 SKRRE-AARAWGFECQCDRCRFEAS 387
>gi|194762638|ref|XP_001963441.1| GF20287 [Drosophila ananassae]
gi|190629100|gb|EDV44517.1| GF20287 [Drosophila ananassae]
Length = 528
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 109/293 (37%), Gaps = 59/293 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+++ L+AG++++ + P+ + P ++ C C++ ++ S+ CP C
Sbjct: 54 GRYLVASRQLEAGELLITEEPLAIG---PCVSGDPVCLGCYQPVSLDSTQYRCPGCQWPL 110
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
C C H+ CQ L AA L DR A ++
Sbjct: 111 CRDSCPGIHQPHGHTSEECQ----LYGERRAVAAGLLTDRTGPAEVRDLYELVMIVRILL 166
Query: 128 LAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSSLCPPPASTTIEL 171
L +P QF + + + N HY L L P I
Sbjct: 167 LRQRDPEQFAAIERMESHTEERRKNQVLWQHYEQKVVQRLREVWQLEDLQP----EQIHQ 222
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+L +N F + GQ+G +A +Y A HDC PN D+ ++ A
Sbjct: 223 VCGILD---VNCFEI------GQNG---AKARTLYPSAFLLAHDCSPNTAHTDHPESYA- 269
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
II+R V + + LSY Y KR + ++ F C C RC
Sbjct: 270 ----IILRTSRRVREREALTLSY---AYTLQGTLKRRSFMHEGKLFWCVCQRC 315
>gi|426193275|gb|EKV43209.1| hypothetical protein AGABI2DRAFT_211045 [Agaricus bisporus var.
bisporus H97]
Length = 371
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 11/130 (8%)
Query: 155 HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNH 214
+L L T +E T LL ++ N FG+ + +E + + +G+Y S+FNH
Sbjct: 251 RHLVYLLKKSVDTPLE-TRVLLGREHGNVFGIWDMALEEMNS--EMLGWGLYSFGSYFNH 307
Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
DC P R ++ A E I D+ G E+C+SY + ++K L ++ F
Sbjct: 308 DCSPT-LRKEHKGRAIEYYA------IKDIDTGEELCISYVETD-QVVEKRKEALKEWFF 359
Query: 275 ACDCDRCKVE 284
C C C E
Sbjct: 360 DCGCGTCVAE 369
>gi|256274267|gb|EEU09175.1| Set6p [Saccharomyces cerevisiae JAY291]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
S+F +LL FQ V L L P +I L +L + NAFGL +
Sbjct: 231 SKFPVLLHFQKLV-------FQILYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280
Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
EG+ R Y ++ +AS+FNH C PN ++ ++ + D+ + +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333
Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
C+ Y V + +++ L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368
>gi|242216833|ref|XP_002474221.1| predicted protein [Postia placenta Mad-698-R]
gi|220726639|gb|EED80582.1| predicted protein [Postia placenta Mad-698-R]
Length = 152
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 55/122 (45%), Gaps = 13/122 (10%)
Query: 165 ASTTIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
++ TI+ A+ + D N+FG+ E P E + + +Y + SFFNH C PN +
Sbjct: 41 SAITIDNVRAIFSVDPGNSFGIWEVPLTEESE----CLGFAVYPRPSFFNHHCSPNVNKE 96
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCK 282
Y A T V G E+C+SY + RQK L + F C C +C
Sbjct: 97 RYGRGLAFVTT-------RSVRAGEELCISYGHAEGMGWRQRQKELREGWFFECSCGKCI 149
Query: 283 VE 284
V+
Sbjct: 150 VD 151
>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
Length = 697
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 14/108 (12%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
L A + N+FGL DG + +Y +AS+FNH C PN CR Y A
Sbjct: 548 LCAALQCNSFGLFNA-----DG--NCIGVALYPEASYFNHSCCPNICRVTYRGLFA---- 596
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY-DYDFACDCDRC 281
+ ++P+G + + Y V + ++R L+ Y F C+C RC
Sbjct: 597 --AFHALREIPKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|157124069|ref|XP_001660316.1| hypothetical protein AaeL_AAEL009751 [Aedes aegypti]
gi|108874117|gb|EAT38342.1| AAEL009751-PA [Aedes aegypti]
Length = 500
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 113/296 (38%), Gaps = 41/296 (13%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
M+ E + E + GR LV+ + L GQ +L + PI++ P+ +S C NCFR
Sbjct: 1 MARFEENCEILESDELGRYLVAKRDLARGQQILVEQPIVVG---PYWDSGIRCLNCFR-- 55
Query: 61 MHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
+S +IC C A C+T H+ C+ LNS N D
Sbjct: 56 --NSCTICRKCKRAPLCFDCTT------HNETECE-----FYLNSGLNINFLFD-HFNVV 101
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVT---DNDTSAAHYLSSLCP-----------PPAS 166
+ LA N ++F ++ + D D AH + P
Sbjct: 102 TPVRCLLLASSNRAKFDEMMTMEAHTEERRDTDIWNAHEKKVVRPLLENKAFDGKMENLK 161
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
T EL + +N+F + + Q +R G+Y KA+ H+C+ N
Sbjct: 162 VTSELIQRICGFFDVNSFEIRGNMDDAQQEESLIR--GLYPKAALMAHNCVSNTL----- 214
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + + ++ + + V +G + +Y + R+ L F C C RC+
Sbjct: 215 -ISVDGDANLRLYVTTPVKKGEMLFYNYTRSLFGTFERRTHLKLGKYFVCTCSRCE 269
>gi|451897716|emb|CCT61066.1| hypothetical protein [Leptosphaeria maculans JN3]
Length = 726
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 9/105 (8%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
S+ G+++ +S+ NH CLPN+ R D + +R D+ Q EI Y
Sbjct: 485 SIVRNGVWIHSSYMNHSCLPNSVRSFIGDM-------LFLRAARDIEQDEEITTQYLAPE 537
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
+Y TRQ++ + F C+C C + S ++ E+ +E
Sbjct: 538 LEYETRQQKFETTWGFRCNCTLCTFDQ--SVGQAKEKQRMEIFEE 580
>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
Length = 416
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 132/367 (35%), Gaps = 109/367 (29%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 32 KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 91
Query: 68 -------------------CPSCSVAFCSPKCSTAAASS---------SHSPY------- 92
CP C V +CS +C AAA SH P
Sbjct: 92 SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQILCPGPSHDPRHPLNKLQ 151
Query: 93 ------------------------VCQA-----LTRLLNLNSPDAANLP---LDRQLQAR 120
V QA RL N AN + + L+ +
Sbjct: 152 EAWRSVHYPPETASIMLMARMVATVKQAKDKDHWVRLFNHFCSRTANQEQAIVHKLLKGK 211
Query: 121 F------LIAAYNLAV--------VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPP 164
F L+ + A+ P F+ L A GT +S + ++
Sbjct: 212 FKDQLELLLGLFKEALYEEALSLWFTPEGFRSLFALVGTNGQGIGTSSLSQWVH------ 265
Query: 165 ASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACR 222
A +ELT +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 266 ACDALELTPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-- 321
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCD 279
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 322 ---ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFNCSCP 378
Query: 280 RCKVEAN 286
+C EA+
Sbjct: 379 KCLAEAD 385
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 115/286 (40%), Gaps = 34/286 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
E+V V EG+GRGL + + G ++ + Y+A+ F + + CH CF++ +H
Sbjct: 2 ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 58
Query: 63 SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A+ C C A + + C A+ + + +P N+ L ++ R
Sbjct: 59 R----CGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRI 109
Query: 122 L-----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
++ L ++ Q + + D +L ++ + +
Sbjct: 110 EREGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPAQSQQFGMQYISHIF 169
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
NAF L + Q G ++V GI+ NHDC PN C + NN I
Sbjct: 170 GVINCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVIF------NNGKI 216
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+R + + G E+ +SY R+K+L Y F C C+ CK
Sbjct: 217 ELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 262
>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
Length = 257
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 11/103 (10%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFG + G++G R+ + G+++ A++ NH C+ NA + +YV +++R +
Sbjct: 90 NAFGPGGQF--GEEGARNA-STGLWIWAAYINHSCIANAKK-EYV------GDLMVLRAL 139
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ +G EI SY + DY RQK LL + F C C C E
Sbjct: 140 GAIKKGEEIFHSY-DESADYEARQKALLTTWGFECGCALCAAE 181
>gi|358391269|gb|EHK40673.1| hypothetical protein TRIATDRAFT_286297 [Trichoderma atroviride IMI
206040]
Length = 347
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L A+ NH C PNA + + EN + V + D+ +G+EI +SY Y
Sbjct: 115 GLFLDAARINHSCRPNAHK-----SWNENLQRLTVHAVRDIERGQEITISYLGETLSYIE 169
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ L + F C CD C
Sbjct: 170 RQAILKQRFRFDCGCDLC 187
>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
VEG]
Length = 402
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 7/85 (8%)
Query: 202 AYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
G++ S NH C PNA + ++ AA+ V + D+ + EI LSY +
Sbjct: 264 GVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQ------VTALRDIAEKEEILLSYIDESLP 317
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
+ RQ+ L Y F C C RC+VEA
Sbjct: 318 LAERQRLLKSHYKFTCGCQRCQVEA 342
>gi|255947380|ref|XP_002564457.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591474|emb|CAP97706.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 351
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 14/137 (10%)
Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKA 209
YL S P + + L+ ++ N+FGL P + R A +Y A
Sbjct: 205 YLRSSPALPHGMSADRVLHLICQEEANSFGLYPRETGHFPLPKDPVDRGEQFAAAVYPTA 264
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DYSTRQKR 267
+ NH CLPN V A++ ++ D+ G E C+SYF + D ++R++
Sbjct: 265 AIANHSCLPN------VIHKADDKGRMVFTASRDIFPGEECCISYFDLTQYTDLASRREH 318
Query: 268 LLYDYDFACDCDRCKVE 284
L + F C C+RC E
Sbjct: 319 LRKSFRFVCQCERCVSE 335
>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 779
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 37/81 (45%), Gaps = 7/81 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ A+ NH C+ N+ R D I R D+P G EI Y PV D+
Sbjct: 483 GIWPHAAIMNHACVANSTRAFCGDL-------FITRASRDIPAGDEITQQYVPVRADWGE 535
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ +L + + F C C C E
Sbjct: 536 RQAQLRHWWGFECACALCAAE 556
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/337 (21%), Positives = 121/337 (35%), Gaps = 71/337 (21%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPI---LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS 72
RG GL + SL+ G+++ R P+ + +L + C K + C C
Sbjct: 14 RGNGLRALASLRPGELLFRSEPLAYTVCKESLGVVCERCLCRK--EKLLR-----CSQCK 66
Query: 73 VA-FCSPKCSTAAASSS---------------HSPY-----------VCQALTRLLNLNS 105
VA +C C ++ S H P VC L + +
Sbjct: 67 VARYCGSACQADSSEVSFVDLFGRQSPTSTFPHFPSEPPKHLANSGSVCINLIVYVYIPR 126
Query: 106 PDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA 165
A LPL + F I + ++ ++ + + L T+ S L PPA
Sbjct: 127 KYNAMLPL---MIGPFPIFSPDIKNLSEEKKKGLGHLAVTLQLYLKEEIQDASQL--PPA 181
Query: 166 STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
E A ++ N F + +G G+Y S NH C PN
Sbjct: 182 FDIFESFAKVIC----NGFAI-------SNGEMQEVGVGLYPSMSLLNHSCDPNC----- 225
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-- 283
+ +R + D+ QG E+ + Y V + RQK+L Y F CDC C+
Sbjct: 226 --VIVFEGPSLFLRAVRDIQQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCETRS 283
Query: 284 ---------EANWSDNDNDDENNEEVMDEDQDEQMVA 311
E W + + + E++ +++ EQ++A
Sbjct: 284 KDDDMLSGEEQAWKEVQDSLKKVEDLRAQEKWEQILA 320
>gi|242036801|ref|XP_002465795.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
gi|241919649|gb|EER92793.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
Length = 543
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 14/105 (13%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ SF NH C PNA R D A IV D+ G EI YF V
Sbjct: 320 GVGLWVLPSFINHSCHPNARRTHIGDHA-------IVHASRDIKAGEEITFPYFDVLVPV 372
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQD 306
S R++ + F C CDRC+ EA D +E++ ++D
Sbjct: 373 SKRRE-ASRAWGFECKCDRCRFEA------EDSILRQEILKSEKD 410
>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
Length = 348
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 9/118 (7%)
Query: 195 DGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
DG R A+ G++ + S FNHDC PN + +T I VR D+P G E+ +S
Sbjct: 163 DGDRKHSAHLGVFPEVSKFNHDCRPNV----HYRLNGLKHTTIAVR---DIPAGDELTIS 215
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVA 311
Y ++TRQ + L ++ F C C +C + A + ++ E ++ + + M A
Sbjct: 216 YIYGRASHATRQSQ-LREWGFTCTCPQCTLNATETGASDNRLRQIEALETEIERIMGA 272
>gi|307211936|gb|EFN87850.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 331
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 109/274 (39%), Gaps = 39/274 (14%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSSSICP----- 69
GRG+ + + ++ G+++ D P+LL P +S C C++ CP
Sbjct: 36 GRGMFAMRDIQPGELIFMDVPLLLG---PRCHSKYLPMCVVCYKND-------CPLFPCD 85
Query: 70 -SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
C + CS KC + + +A R+L P + L LI A L
Sbjct: 86 HGCGLPICSTKCENSLMHA-------RAECRILRELEPTCGSTWSKDLLLTVVLIRALAL 138
Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLM 187
+ Q ++L AF+ + L ++ P + L + N+F +
Sbjct: 139 S---SEQRRLLFAFECHLNLAPNYEIDLLKRNITHLPDEEQLALMKRICGIFNTNSFEV- 194
Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
+ +D R V GIY + NH C+PN R + D + + + G
Sbjct: 195 ---VAVRDKDRVVSLRGIYPIGALQNHCCVPNT-RHHFDD-----QQRLYMSAALPIAAG 245
Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+EI +SY + +D +R+ L F+CDC+RC
Sbjct: 246 QEITMSYTDLLWDTRSRRCFLRVTKHFSCDCNRC 279
>gi|449539909|gb|EMD30911.1| hypothetical protein CERSUDRAFT_163648 [Ceriporiopsis subvermispora
B]
Length = 433
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/344 (21%), Positives = 127/344 (36%), Gaps = 70/344 (20%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI-CP 69
+G+G+GLV+ + + G+IV ++ P +L + SS C C +S+ I CP
Sbjct: 100 KGKGKGLVAKEYVAEGEIVWKEDPFILAPEWDIYDLQTSSLSCTYCSTPLQNSTLVIPCP 159
Query: 70 S------CSVAFCSPKCSTAAASSS--------------------------HSPYVCQAL 97
+ C+ FCS C + +A + H+ C A
Sbjct: 160 ASTSHTPCTARFCSRLCLSRSARTHPLLCPAQNPASAPLLAFARKAQWMALHALAQCTAR 219
Query: 98 TRLLNLNSPDA-----------ANLPLDRQLQARFLIAAYNLAVVNPSQFQILL-AFQGT 145
L N P A A L ++ + + ++ A +Q+ + AF+
Sbjct: 220 ILLTNQQDPSALQADLEVVRALAQLGMEERAKGEWMKGAEPDRATWKRAYQLFVQAFR-- 277
Query: 146 VTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR-LNAFGLMEPYIEGQDGRRSVRAYG 204
+ T AH + ++ L D L G M +E G
Sbjct: 278 ---DPTDDAHKKKLAKLLKKPLSKDIADELFDYDAFLRGLGRMSLNLEAHGG-------- 326
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+Y+ S NH C PNA A + +I + D+ G E+ ++Y R
Sbjct: 327 LYVLHSHLNHSCAPNASIRHLDQRTALSRITVIAK--RDIAPGEELLITYVNPALPLPQR 384
Query: 265 QKRLLYDYDFA-CDCDRCKVEANWSDNDNDDENNEEVMDEDQDE 307
++ ++ ++ F C+C+RC E + D+ E D D+ E
Sbjct: 385 RREVM-EWGFGKCNCERCLEE----EKDHAKEGETGAADVDELE 423
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/291 (20%), Positives = 108/291 (37%), Gaps = 61/291 (20%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
E GR LV+T+++K +IVL+++P++ P S+ C C C C
Sbjct: 49 EQLGRHLVATRTIKPYEIVLKEAPLV---RGPAQISAPVCLGCLNGIEAEDHIECEQCGW 105
Query: 74 AFCSPKCSTAAASSS--------------------HSPYVCQALTR--LLNLNSPDAANL 111
C P+C + + H Y C + R L+ SP+ A
Sbjct: 106 PLCGPECKSLDEHKAECRLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKA-- 163
Query: 112 PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
S+FQ L + + T ++ A +S P
Sbjct: 164 ----------------------SKFQELESLESTRRGSNQWKADLVSIGQFIPKFFK--- 198
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAA 230
T ++ + A G ++ I G + + ++ ++ ASF + CLPN + +
Sbjct: 199 TQKFTEEEIMKAVGALQ--INGHEVPTTDPSHVAVFYTASFTENSCLPNLAK------SF 250
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
N I+ H++ + + + Y + + RQ+ L+ F C CDRC
Sbjct: 251 NKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRC 301
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 102/281 (36%), Gaps = 45/281 (16%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+GRGL + + G++V P Y L + C +CF T C C A+
Sbjct: 17 KGRGLRALRDFAVGELVFA-CPAYSY-VLTVNERGAHCEHCF--TRREDLFKCGKCKQAY 72
Query: 76 -CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C+ C A +H C + TR+ +P + L L ++ +
Sbjct: 73 YCNVDCQRGDWPMHKLECVAMCTHGENWCPSETRVTTERTP-SERLLLLKEFE------- 124
Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
S + + + + D +A HY S ELT L A+ N F
Sbjct: 125 --------SHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDDQELTE-LFAQVNCNGFT 175
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ +D S ++ + NH C PN Y AE VR + ++
Sbjct: 176 I-------EDEELSHLGSAVFPDVALMNHSCSPNVI-VTYKGTVAE------VRAVQEIN 221
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
G EI SY + Y R++RLL Y F C C C ++
Sbjct: 222 PGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECTTKSK 262
>gi|50305365|ref|XP_452642.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641775|emb|CAH01493.1| KLLA0C09944p [Kluyveromyces lactis]
Length = 367
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 18/149 (12%)
Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFGLMEPYIEG 193
+F +LL FQ V ++ +L P T + +L + N+FG+ E
Sbjct: 231 RFPVLLEFQIKVFK-------WVYTLAPQYRYIMTTSVFRHILGSEYGNSFGIWET--SE 281
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
D R Y ++ +AS+FNH C+PN + E I++ DV +G E+C+
Sbjct: 282 SDDDREYLGYMVHPEASYFNHSCVPNVEK-----KRVERVFQYILQT--DVQKGEELCID 334
Query: 254 YFPVNYDYSTRQKRLLYD-YDFACDCDRC 281
Y + + R++ +L D + F C C RC
Sbjct: 335 YKGILHLDVNRRRAILKDNWFFECKCSRC 363
>gi|24641164|ref|NP_727478.1| CG11160, isoform B [Drosophila melanogaster]
gi|22832071|gb|AAN09278.1| CG11160, isoform B [Drosophila melanogaster]
Length = 403
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 112/278 (40%), Gaps = 29/278 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+ + L+AG+ ++R+ P+ + P ++ C C+ ++ + CP C+
Sbjct: 58 GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114
Query: 76 CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
C C+ H+ CQ A R + L + A + + ++ L
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174
Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
+P QF ++ + + +A HY + T +L L A+ G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232
Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
+ GQ+G +A +Y A HDC PN D ++ +I++R V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
+ LSY Y KR + ++ F C C RC
Sbjct: 285 REALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319
>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
Length = 358
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/310 (22%), Positives = 122/310 (39%), Gaps = 64/310 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI----NSSSFCHN--CFRK-TMHSSSSIC 68
+GR L + ++AG++++ D P Y+ LP + S C N C R+ + H+++S+
Sbjct: 62 KGRTLSTNGPIRAGELLMADLP---YAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118
Query: 69 --PSCS--VAFCSPKCS---------TAAASSSHSP-----------YVCQALTRLLNLN 104
CS + +C+ C A +SP Y+ + RLL
Sbjct: 119 CETGCSLDIWWCNESCKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAAR 178
Query: 105 SPDA--ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
+ + A+L + +Q + N SQ Q T A YL
Sbjct: 179 NSHSKIASLAAGKGVQ----LLRSNRDSWPTSQLQ----------HWSTLAEKYLLQSSL 224
Query: 163 PPASTTIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
P + ++ L+ + N F L + P+ R + ++L+ + NH C
Sbjct: 225 LPETLDLDTLVDLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSC 284
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDF 274
PN V A++ ++V + D+ G E C SYF ++ D R+K+ + F
Sbjct: 285 APN------VTHQADDRGRMMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTF 338
Query: 275 ACDCDRCKVE 284
C C RC E
Sbjct: 339 TCTCPRCLQE 348
>gi|50294041|ref|XP_449432.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528746|emb|CAG62408.1| unnamed protein product [Candida glabrata]
Length = 374
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 83/197 (42%), Gaps = 36/197 (18%)
Query: 92 YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
+V + L R+ +L+ P + L LQ+ + +F +LL FQ V
Sbjct: 199 FVSETLFRIKHLD-PGSDTLEAFHNLQSN--------ELSKIQKFPVLLNFQVLVFKT-- 247
Query: 152 SAAHYLSSLCPP--PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA 209
L L P S T++ +L + NAFG+ + E D R Y ++ KA
Sbjct: 248 -----LFVLLPDNLKNSLTVQAFRHILGSEYGNAFGIWQD-TEAVDSREYF-GYWVFPKA 300
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-----FPVNYDYSTR 264
S+FNH C PN + + ++ + DV G E+ + Y PV+ R
Sbjct: 301 SYFNHSCDPNITK-------TRIDRKMVFTLNRDVACGEELNIDYSGVLDLPVD----RR 349
Query: 265 QKRLLYDYDFACDCDRC 281
QK L ++ F C C+RC
Sbjct: 350 QKFLFENWFFVCGCNRC 366
>gi|169613454|ref|XP_001800144.1| hypothetical protein SNOG_09858 [Phaeosphaeria nodorum SN15]
gi|160702729|gb|EAT83123.2| hypothetical protein SNOG_09858 [Phaeosphaeria nodorum SN15]
Length = 645
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 55/105 (52%), Gaps = 11/105 (10%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFG + G++ R+ + G+++ A++ NH C+PNA + +Y+ +I+R
Sbjct: 478 NAFGPGNQF--GEENARNA-STGLWVYAAYINHSCIPNATK-EYI------GDMMILRAT 527
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ G EI +Y V+ DY RQ L+ + F C C C+ E+
Sbjct: 528 RAIAAGEEIFHAY-DVSSDYDARQASLMTTWGFKCACKLCEAESK 571
>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 13/107 (12%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
G+Y + S NH C PNA +V NN + + + D+ G EIC+SY
Sbjct: 286 GLYARQSCANHSCDPNA-EISFV----HNNYRLSLIALKDIQPGEEICISYLGDCDNERS 340
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDE 307
+R+K L+ +Y FAC+C +C E D+ D +EE M ++D+
Sbjct: 341 RHSRRKTLMENYLFACECSKCLAEI-----DDPDVTSEEEMSAEEDD 382
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 11/138 (7%)
Query: 144 GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY 203
G D+ + A + P +E + A+ N++ + + D + V A
Sbjct: 155 GAWADHVRAGAKKILEDLPDECHLPVEEILVIAAQINENSYSM-----DALDEKHLVAAV 209
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ NH C PN + D AE VR + D+ +G EI LSY ++ + +
Sbjct: 210 GLFPICGLINHSCQPNCTWSNAGDGIAE------VRALRDIKEGDEITLSYIDIDKERAE 263
Query: 264 RQKRLLYDYDFACDCDRC 281
R+K L F C C RC
Sbjct: 264 RRKELRETKHFDCQCGRC 281
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G+Y S NH CLPN C F A +D+ VR I V G ++ ++Y +
Sbjct: 321 ALGLYPALSMLNHSCLPN-CVF------ASCGSDMHVRAIRPVAAGEQLTVTYINIMEPR 373
Query: 262 STRQKRLLYDYDFACDCDRC 281
R + L+ FAC C+RC
Sbjct: 374 RIRARELMDTKHFACACERC 393
>gi|389636919|ref|XP_003716103.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
gi|351641922|gb|EHA49784.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
Length = 449
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 197 RRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
RR V + +++ S NH+C PN A FD + + + I D+ G E+ +SY
Sbjct: 241 RRGVDFHSTFVEVSRMNHECSPNCAAYFDPMTMSQR------IYAIRDIMPGEELTVSYI 294
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
RQ RL D+ F C C RC EA++
Sbjct: 295 DPVQTREARQNRLRKDWGFGCSCQRCTSEAHFG 327
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 63/277 (22%), Positives = 107/277 (38%), Gaps = 30/277 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP-SCSVAF 75
GRG+ +T+ + AG+++ R+ +++ + + C C R + + +CP C++
Sbjct: 53 GRGVFATRDIAAGELIFRERALVVGPTARKGSQLNTCVCCHR-LLGAKQFLCPHRCTL-- 109
Query: 76 CSPKCSTAAASSSH-------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
P C A S +H + + L+ SP + + L R L A +
Sbjct: 110 --PVCGECADSEAHHNECAHFQRWQPKDLSEECKELSPTGEPVDVVNPLSLRILTAV-RV 166
Query: 129 AVVNPSQFQILLAFQGTVT----DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
++ Q ++ A Q AA P + + LN
Sbjct: 167 FYLSKEQRALVDAMQANAERGYRQEIIKAAQCFRKF---PTTDKPFMDQLFRVVGVLNTN 223
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
P DG S+ G++ + NH+C PNA + EN +VR D+
Sbjct: 224 AFEAPC--RVDGHESL-LRGLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDI 274
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
P+G EI +Y + + TR L F C+CDRC
Sbjct: 275 PKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRC 311
>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 654
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 37/77 (48%), Gaps = 5/77 (6%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
IY A FNH C PNA R ++ I++R D+ +G EI + YF + R
Sbjct: 443 IYEIAGRFNHSCTPNAAR-----GFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTER 497
Query: 265 QKRLLYDYDFACDCDRC 281
++ L Y F CDC C
Sbjct: 498 RQHLYKQYRFNCDCRAC 514
>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 459
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 9/109 (8%)
Query: 199 SVRAYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
S +GI+ L + FNH CLPNA + ++V+ + + QG EI + YF
Sbjct: 208 SAVGHGIFPLASRLFNHSCLPNAI---VTYSFTSEGIQMVVKALTPIKQGEEITIPYFDP 264
Query: 258 NYDYSTRQKRLLYDYDFACDCD-----RCKVEANWSDNDNDDENNEEVM 301
Y RQ Y Y F C C RC V+A ++ + E M
Sbjct: 265 ALPYHQRQAICRYSYGFECTCSVCMFPRCTVDAREPPSEPGERERIENM 313
>gi|328769434|gb|EGF79478.1| hypothetical protein BATDEDRAFT_35395 [Batrachochytrium
dendrobatidis JAM81]
Length = 545
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 72/199 (36%), Gaps = 38/199 (19%)
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------- 223
L L+ K+ N+FG+ G R G+Y A +FNH C P+
Sbjct: 315 LMMLLICKEECNSFGVYTFNYTGSKTERQSYGLGVYPSAVYFNHSCKPSVGHVARSLPDT 374
Query: 224 DYVDAAAENN-------------TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
D + + N + I+ ++ Q E ++Y P+ RQK +
Sbjct: 375 DVTRSGSATNLAKLTDTGMSMFGSTILFFATRNLEQNEEAMIAYLPLEGTLEHRQKAIKD 434
Query: 271 DYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFP-HAYF 329
+ F C+C+RC EE +++++ V +D A D P H
Sbjct: 435 IFYFECNCERCL---------------EETLEKNKSPTAVQTDITASAMKDQKSPLHQTI 479
Query: 330 --FMRYMCDRDNCWGTLAP 346
F +C C G L P
Sbjct: 480 ETFGLCLCREIGCHGWLVP 498
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 107/278 (38%), Gaps = 54/278 (19%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+TVA I G+GRGL++T + G ++L P Y++ P I S C +CF +
Sbjct: 17 LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCF---TSGNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C C V +CS C +L L A L DR+
Sbjct: 71 KCSMCRVTWYCSSNCQKEEW-------------KLHQLECRAMAALTEDRK--------- 108
Query: 126 YNLAVVNPS-QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
++ P+ + + LA + + + + L SL P ++ A +K NA
Sbjct: 109 ---KMLTPTIRLMVRLALKRKLQNEKMAN---LVSLILPFIELDLKEIAQTFSKFACNAH 162
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHD 243
+ +P + G+Y S NH C+PNA FD A VR +
Sbjct: 163 TICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA--------YVRALQP 207
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + E+ +SY + ++ L Y F C C RC
Sbjct: 208 INKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 244
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/328 (22%), Positives = 120/328 (36%), Gaps = 66/328 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
RG GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 RGNGLRAVVPLRPGELLFRCDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
+A +CS KC A C+ L PD+ L L R + ++ L
Sbjct: 67 IAKYCSAKCQKKAWPDHKQE--CKCLKSCKPRYPPDSVRL-LARVIVKLMDEKPSESEKL 123
Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
+ Y+L Q+ L FQ + A+ PP+ E A
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLALTFQHFTREEIQDASQL------PPSFDLFEAFAK 177
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
++ N+F + ++ G+Y S NH C PN C + N
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGP 219
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV----------- 283
+++R + ++ G E+ + Y + R+K+L Y F CDC RC+
Sbjct: 220 HLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTHDKDADMLTGD 279
Query: 284 EANWSDNDNDDENNEEVMDEDQDEQMVA 311
E W + + EE+ + EQ++A
Sbjct: 280 EQIWKEVQESLKKIEELKAHWKWEQVLA 307
>gi|195566137|ref|XP_002106647.1| GD16997 [Drosophila simulans]
gi|194204029|gb|EDX17605.1| GD16997 [Drosophila simulans]
Length = 532
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 109/282 (38%), Gaps = 37/282 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+ + L+AG+ ++R+ P+ + P ++ C C++ ++ CP C+
Sbjct: 58 GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKPDQYRCPGCAWPL 114
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
C C+ H+ CQ L A L +R A ++
Sbjct: 115 CGSTCAGLKHRHGHTETECQ----LYGERRAVAGKLLTERAGPAEVRDLYELVMIVRILL 170
Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
L +P QF ++ + + +A HY + T +L L A+
Sbjct: 171 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVC 228
Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
G+++ + GQ+G +A +Y A HDC PN D + +I++R
Sbjct: 229 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 280
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
V + + LSY Y KR + ++ F C C RC
Sbjct: 281 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319
>gi|449298403|gb|EMC94418.1| hypothetical protein BAUCODRAFT_149577 [Baudoinia compniacensis
UAMH 10762]
Length = 354
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 14/125 (11%)
Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
+S T E+ L + NAFG+ +G++ Y IY AS+FNH C PN +
Sbjct: 229 SSCTTEVCHTLASIGSHNAFGIRAGSEDGEE----YMGYAIYPDASYFNHSCSPNLLKRR 284
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRC 281
A T ++ +G + C++Y D + R+ RL + F C C+RC
Sbjct: 285 MGRCWAFWTT-------REIKKGEQCCITYLGGDEKELDVADRRARLKRVWAFDCMCERC 337
Query: 282 KVEAN 286
K+E
Sbjct: 338 KLEVR 342
>gi|321473703|gb|EFX84670.1| hypothetical protein DAPPUDRAFT_314752 [Daphnia pulex]
Length = 497
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/257 (23%), Positives = 104/257 (40%), Gaps = 30/257 (11%)
Query: 9 TVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
T+ + G+ LV+T++++A +I++ L + P + C NC+R+ S +C
Sbjct: 55 TIGKSSDIGKFLVATRNIRADEIIIDAEEAL--AVGPKQATHPVCLNCYRRV--DGSYLC 110
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA---RFLIAA 125
P C CS C+ + +H Y CQ L + P +L+ R
Sbjct: 111 PLCQWPLCSEVCAQ-MENGAHQKYECQLLA--------NREAYPTKEELEQEDCRLYDCI 161
Query: 126 YNLAVV-----NPSQFQILLAFQ-GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
L +V NP + ++ L + + H P +L ++
Sbjct: 162 APLRIVLDNHRNPERSRLWLGLEDHRLYRQKIGIWHVDRKTVVEPVLNKWQLGENTNEEE 221
Query: 180 RLNAFGLMEP-YIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDII 237
A G++E E D +++ A +Y A H+C+PN AC V+ A + T
Sbjct: 222 LQRACGILEVNAFEVCDDDKTISARAVYSHACLMAHNCVPNTAC---CVNPATQRMT--- 275
Query: 238 VRMIHDVPQGREICLSY 254
VR D+P G+ I SY
Sbjct: 276 VRAAVDIPAGQMITTSY 292
>gi|307111681|gb|EFN59915.1| hypothetical protein CHLNCDRAFT_132958 [Chlorella variabilis]
Length = 853
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 115/313 (36%), Gaps = 58/313 (18%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
++A + V + E GRGL + + L AG+ VL + P AL + S C C
Sbjct: 324 VAAAGPALVVQQTEDAGRGLSAAEELPAGRDVLAEQPFAF--ALTKLGRRSVCSTCLAPL 381
Query: 61 MHSSSSI-CPSCSV-AFCSPKCSTAAASSSH---SPYVCQALTRLLNLNSP--------- 106
+ + C C + A+C+P C AA H P + T LL +
Sbjct: 382 ADAPAPYYCRRCPMPAYCTPACR--AADPFHQPGGPECGRPWTVLLPAEAVAALRLARRL 439
Query: 107 --DAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLA--------FQGTVTDNDTSAAHY 156
A P R + + + A ++P++ L A ++ V + A
Sbjct: 440 RFGGAGSPAARHVAS----LGTHFAELDPTEVVQLAALAAVTHATWRQAVAEARGGGAPA 495
Query: 157 LSSLCPPPA------------STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG 204
+ P A T E L + ++N ++ P G D R A
Sbjct: 496 EAGPGSPAAVEGQAADEEARGGVTAEGVLEALCRLQINGLAVVPPQRRGSDDRL---ALA 552
Query: 205 IYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+Y S NH CLPN + RF+ ++VR + VP G + Y P + +
Sbjct: 553 LYPVGSLMNHSCLPNVSTRFE--------GGKLVVRTVEAVPAGEPLLHCYGPQAGEMTA 604
Query: 264 RQKRLLY--DYDF 274
Q+ LL Y F
Sbjct: 605 AQRGLLLLQQYHF 617
>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 556
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 83/196 (42%), Gaps = 29/196 (14%)
Query: 87 SSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTV 146
SSH+P V +A+ L++ N + + L A F +A LA S+ +G+
Sbjct: 117 SSHAPSVREAVDELVHHNDDRSPEQKEEYMLMANF-VARLCLAGCGDSK-------KGSA 168
Query: 147 TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY 206
++ L L ++ A+L K N F + E + G+ G G+Y
Sbjct: 169 LLWPSAQGRGLPGL--------VDAAYAVLGKLSCNVFSIAESALNGEVG------CGLY 214
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
L+A+ NH C PNA A + + + +R + +G EI + + R++
Sbjct: 215 LEAAAANHSCNPNA-------AQSFSGKTLSLRCTRPIRKGEEITIGITQIQKPGPARRE 267
Query: 267 RLLYDYDFACDCDRCK 282
L Y F C C+RC+
Sbjct: 268 SLRKTYFFECRCERCE 283
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 16/115 (13%)
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR--AYGIYLKASFFNHDCLPNACRFDYVD 227
E A L A + NA L +P S+R G+Y A+ NH CLPNA +
Sbjct: 334 ETVARLFAVAQANALTLTDPS--------SLRPIGQGLYASAALLNHSCLPNA------N 379
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + + VR + + G E+ ++Y Y RQ+ L + FAC C +C+
Sbjct: 380 WSVDGEGRLCVRAVRPIEAGEEVTVAYVDPTLPYHARQQALQDHFFFACRCLQCR 434
>gi|24641162|ref|NP_572675.1| CG11160, isoform A [Drosophila melanogaster]
gi|22832070|gb|AAF47987.2| CG11160, isoform A [Drosophila melanogaster]
Length = 532
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 112/278 (40%), Gaps = 29/278 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+ + L+AG+ ++R+ P+ + P ++ C C+ ++ + CP C+
Sbjct: 58 GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114
Query: 76 CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
C C+ H+ CQ A R + L + A + + ++ L
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174
Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
+P QF ++ + + +A HY + T +L L A+ G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232
Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
+ GQ+G +A +Y A HDC PN D ++ +I++R V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
+ LSY Y KR + ++ F C C RC
Sbjct: 285 REALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319
>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 676
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 83/210 (39%), Gaps = 64/210 (30%)
Query: 89 HSPYVCQALTRLLN--------LNSPDAAN-LPLDRQLQARFLIAAYNLAVVNPSQFQIL 139
H Y+ L RLL + +PD+A P D +L LI +NL ++ QF
Sbjct: 402 HRTYIAIWLLRLLKRSPYFPEWVKTPDSAEATPSDGELYIGSLIL-HNLMLI---QFN-- 455
Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
AH +S L P S +LAK + +I G
Sbjct: 456 --------------AHEISELAVPKGSN-------ILAKAK-------SKFIGG------ 481
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-- 257
G+Y S FNH C P R+ T ++VR I +P G EI +Y P+
Sbjct: 482 ----GVYSTVSLFNHSCNPGIIRYFI-------GTTMVVRAIRTIPAGEEISENYGPIFT 530
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+ R+++L Y F C+C+ C A+W
Sbjct: 531 TTPEAERKRKLRVQYWFDCNCEACT--AHW 558
>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
Length = 352
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 5 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEAGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
R+K+L Y F CDC RC+ + D D D +E+V E Q+
Sbjct: 58 EERRKQLRDQYCFECDCSRCQTQ----DKDADMLTGDEQVWKEVQE 99
>gi|409048092|gb|EKM57570.1| hypothetical protein PHACADRAFT_192717 [Phanerochaete carnosa
HHB-10118-sp]
Length = 309
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+L + FNH C+PN ++ DAA ++ R + D+ G E+CL Y +
Sbjct: 105 GIFLAGARFNHSCVPNVN--NHWDAA---RGQLVFRALRDIEAGEELCLGYGRLLAKRGE 159
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R+ L + F C C+ C +E
Sbjct: 160 RRAELSAKFGFDCACEACSLEGK 182
>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 419
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 6/77 (7%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+Y + S+FNH C PN C + A + HD+P G E+C+ Y V+ ++R
Sbjct: 305 LYAEGSYFNHSCAPN-CGTRTGEGQA-----VQFVATHDIPAGDEVCIRYIDVDKPTTSR 358
Query: 265 QKRLLYDYDFACDCDRC 281
+ LL Y F C C C
Sbjct: 359 RSELLSHYHFTCMCPLC 375
>gi|365991467|ref|XP_003672562.1| hypothetical protein NDAI_0K01280 [Naumovozyma dairenensis CBS 421]
gi|343771338|emb|CCD27319.1| hypothetical protein NDAI_0K01280 [Naumovozyma dairenensis CBS 421]
Length = 383
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 90/200 (45%), Gaps = 32/200 (16%)
Query: 90 SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN 149
+ +VC++L RL ++S D+ LQ+ L + +F +LL FQ V
Sbjct: 210 AKFVCESLFRLKYIDS-DSITKKCFNGLQSNELSKIH--------EFPVLLHFQELVFKT 260
Query: 150 DTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIY 206
L L P T TIE ++ + NAFG+ E G+ R + +
Sbjct: 261 -------LYILLPDYLRTQLTIENFRHIVGSEYGNAFGIWET---GETPESREWFGFEVL 310
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
+AS+FNH C PN + E T + + +V +G EIC+ Y V D T+++
Sbjct: 311 PEASYFNHSCKPN------ITKTREGRT-MEFTLNSNVAKGTEICIDYSGV-LDLPTKKR 362
Query: 267 R-LLYD-YDFACDCDRCKVE 284
R L+D + F C C RCK E
Sbjct: 363 RKFLHDTWFFDCLCVRCKSE 382
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 111/293 (37%), Gaps = 60/293 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
GRGL +T ++K G+ V P + L C NCF M S +
Sbjct: 20 GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNVMASVDMKLWA 78
Query: 71 CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
CS V +C KC +A H P +A+ R++ L S + ++
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRSSNEDHVK 138
Query: 113 LDRQLQARFLIAAYNLAVV---NPSQFQ-ILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
D + FL +L + N Q++ I+L + + + S +
Sbjct: 139 SDLNM---FLTLRSHLDEITSSNQEQYERIMLCAKA---EKEYSGSEL-----------D 181
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
+E A LA+ +N+F + G ++ + Y+ NH C PNA
Sbjct: 182 VETIAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------V 227
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ IIV+ + ++ ++ +SY Y RQK+L Y F C C +C
Sbjct: 228 VGFDGGLIIVKALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280
>gi|395328144|gb|EJF60538.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 384
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 12/106 (11%)
Query: 193 GQDGRRSVRA---YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
G+ GR S GI+L+A+ NH C PN R V A ++ R + DV G E
Sbjct: 155 GKGGRPSTGGGSREGIFLQAARLNHSCRPNVARTWDVGA-----QEMAFRALRDVAPGEE 209
Query: 250 ICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVEANWSDNDN 292
+C+SY V+ TR++R + + F C C+ C ++ + +
Sbjct: 210 LCMSYVDVDI-LGTREERGAEIEGAFGFVCACEACMLDGREGEESD 254
>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
1558]
Length = 885
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 50/111 (45%), Gaps = 16/111 (14%)
Query: 178 KDRLNAFGLMEPYIEG-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
K +N+FGL +G +DG + I+ S NH CLPN + D A
Sbjct: 522 KIAINSFGLSPITAKGFHSAPMRKDGEVD-KDTAIWPTVSMINHSCLPNTVHIAWGDVA- 579
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+VR ++D+ G E+ +SYF Y TR + Y F CDC+ C
Sbjct: 580 ------VVRALNDMSAGTELTISYFSREQPYETRAAK-AKQYLFTCDCNLC 623
>gi|328868687|gb|EGG17065.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 333
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 114/295 (38%), Gaps = 50/295 (16%)
Query: 7 TVTVAEIEGRGRGLVSTQSLKAGQIVL---RDSPILLYSALPFINSSSFCHNCFRK-TMH 62
++++ + E GR ++S ++++ G V+ R S +Y+ L CH C + T+
Sbjct: 18 SISIQQGESTGRFILSRETIQRGSKVISLPRTSCHGIYNHLV----RQVCHTCLKHSTLG 73
Query: 63 SSSSICPSCS-VAFCSPKCSTAAASSS-HSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
+ C +C V +CS +C AA H+ C L L + R +
Sbjct: 74 NLKLSCQTCKQVYYCSRECQDAAERGLFHNHLECLVLRWLRVPAGFTMHQVAEIRMVVNH 133
Query: 121 FLIAAYNL-------AVVNPSQF-----QILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
F AY V ++F ++++ V + LC S
Sbjct: 134 FSKVAYGTDDKAIKQTVGEFNRFPKEDSKLIIKMSAVVKRA------FAGYLCDLDDSHI 187
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGI--YLKASFFNHDCLPNACRFDYV 226
E+ LA+ N+FGL + S YG+ Y ASF NH C PN R
Sbjct: 188 CEV----LAQSNRNSFGLWKS---------SDEQYGLAMYATASFLNHSCFPNCAR---- 230
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ N I + I D+ + EI + Y + + R+ L Y F C C RC
Sbjct: 231 ---VQRNAGIDIVAIRDIEENDEITICYINARDNDTARRMILKGCYYFDCQCIRC 282
>gi|212276262|ref|NP_001130593.1| uncharacterized protein LOC100191692 [Zea mays]
gi|194689580|gb|ACF78874.1| unknown [Zea mays]
gi|414865032|tpg|DAA43589.1| TPA: hypothetical protein ZEAMMB73_306282 [Zea mays]
Length = 544
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 39/85 (45%), Gaps = 8/85 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ SF NH C PNA R D A IV D+ G EI YF V
Sbjct: 321 GVGLWVLPSFINHSCHPNARRTHIGDHA-------IVHASRDIKAGEEITFPYFDVLVPV 373
Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
S R++ + F C CDRC+ EA
Sbjct: 374 SKRRE-ASRAWGFECKCDRCRFEAE 397
>gi|336372877|gb|EGO01216.1| hypothetical protein SERLA73DRAFT_179334 [Serpula lacrymans var.
lacrymans S7.3]
gi|336385718|gb|EGO26865.1| hypothetical protein SERLADRAFT_464417 [Serpula lacrymans var.
lacrymans S7.9]
Length = 167
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 63/143 (44%), Gaps = 36/143 (25%)
Query: 168 TIELTAALLAKDRLNAFGLME------PYIEGQDGR-------------RSVRAYGIYLK 208
T E+ A+LA+D NAFG+ E Y++G + R + + +GI+ +
Sbjct: 22 TSEMVRAVLARDPGNAFGIWELGQSVRSYLDGVEARDERKEYEGSTMDSKEMLGWGIWPE 81
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST----- 263
AS+FNH C PN V N I + V +G E+C++Y + +T
Sbjct: 82 ASYFNHSCNPNV-----VKKRIARNLVFITTSV--VQEGDELCIAYVDCDEIQTTEIDEF 134
Query: 264 -----RQKRLLYDYDFACDCDRC 281
R++ L + F C C RC
Sbjct: 135 AAGEKRRELLRKWWFFGCACSRC 157
>gi|367053083|ref|XP_003656920.1| hypothetical protein THITE_2057091 [Thielavia terrestris NRRL 8126]
gi|347004185|gb|AEO70584.1| hypothetical protein THITE_2057091 [Thielavia terrestris NRRL 8126]
Length = 429
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 56/109 (51%), Gaps = 11/109 (10%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
G++ S FNHDC PN R + + +T I VR D+P+G E+ +SY S
Sbjct: 235 GMFPDVSKFNHDCRPNVHYRINNL-----THTAIAVR---DIPRGEELTISYIYPLAPLS 286
Query: 263 TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVA 311
TRQ + L D+DF C C +C + A + +D + ED+ E ++A
Sbjct: 287 TRQTQ-LRDWDFTCTCAQCTLPAT-ASAQSDARIRQIAALEDEIEAIMA 333
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 14/38 (36%), Positives = 28/38 (73%)
Query: 11 AEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN 48
AE+ G+GRGL++ ++++ G+I+++ P+LL P +N
Sbjct: 145 AEVPGKGRGLIANRTIRRGEIIMQRLPVLLIQMTPHVN 182
>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
Length = 633
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 9/105 (8%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFG I G+ + G++L A++ NH CLPNA + D ++VR
Sbjct: 465 NAFG-PGAVIAGEAENIRKASTGLWLMAAYINHSCLPNAEKSFLGDL-------MVVRAT 516
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
++ G EI SY + DY RQ+ L+ + F C C+ CK +
Sbjct: 517 RNITAGSEITHSY-DSSSDYDARQEALMKTWGFRCRCELCKADGG 560
>gi|225717680|gb|ACO14686.1| msta, isoform A [Caligus clemensi]
Length = 521
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/284 (21%), Positives = 113/284 (39%), Gaps = 40/284 (14%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+++ + G+I+ ++ P ++ P + SS C C+++ SS CP C C
Sbjct: 65 GRFLVASRKILRGEIIFQELPGIV-GPNPVV-SSPICVACYKQVGPRSS--CPKCGFPLC 120
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV----- 131
C ++C+ L R +L + + L A L ++
Sbjct: 121 EESCERDHG------FICEHLHRF-------GIDLTPEEMCDSSILHAILPLKLLLSKES 167
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL-NAFGLMEPY 190
NP + L D + YL + P ++E + + + D L A G++E
Sbjct: 168 NPHVYNCLSILMDHREDR-LNDPEYLEGI-QTPILKSLERSLSTYSTDELLKALGIVEVN 225
Query: 191 ------IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ G G R G++ S +HDC+P R + E + + ++
Sbjct: 226 SYEVYNLTGHSGFR-----GLFCLTSLLSHDCVPK--RRPII--GHETPYGLTMISTRNI 276
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
+G + ++Y R++ L ++ F C C RC+ EA +
Sbjct: 277 NEGEILSINYVHTQKPNRIRRRTLKDNWYFECSCKRCEDEAEFG 320
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)
Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
A L SL +E + AK NA + + E DG + G+Y + ++
Sbjct: 108 QALMLLKSLEEEVIPEKMEFLISAQAKFATNALTIQDSLFET-DGIGA----GLYEEVNY 162
Query: 212 FNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
NH C PN C F+ + + V I D+ QG EI SY D R++ L
Sbjct: 163 MNHSCTPNVICVFNKL-------PQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRRRFLKQ 215
Query: 271 DYDFACDCDRCKVEAN 286
+Y F C+C RC E N
Sbjct: 216 NYFFLCECKRCIKEQN 231
>gi|328785466|ref|XP_001121590.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 526
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/278 (23%), Positives = 107/278 (38%), Gaps = 44/278 (15%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSSSSIC 68
VA E GR L +++ L+AG+++LR+ P+ + + F C + + +C
Sbjct: 19 VAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCVKDPICFECLSILPNIEEDVNYVC 78
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAAN--LPLDRQLQARFLIAA 125
C+V C C + + LL P+ L L+ L R
Sbjct: 79 SGCNVVTLCGVTCEERG--------IYHSALWLLKQKDPELWKRVLSLESHLNKR----- 125
Query: 126 YNLAVVNPSQFQILLAFQG-TVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
N V + I+ + +ND S + + LC +L +N+F
Sbjct: 126 RNTIVWEDREINIVNVLKSLNFLENDPSVSEMIQQLC------------GILD---VNSF 170
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
L P G DG + G+YL+AS HDC N V A++N + V +
Sbjct: 171 ELRSP--GGMDG---LLLRGLYLEASMMAHDCRGN------VHVTADDNFHLTVYASIPI 219
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+G I +Y + R++ L F C+CD CK
Sbjct: 220 KEGDTIFFNYTSSLLGTTGRREYLRTGKYFECECDLCK 257
>gi|150864533|ref|XP_001383385.2| hypothetical protein PICST_19605 [Scheffersomyces stipitis CBS
6054]
gi|149385789|gb|ABN65356.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
Length = 375
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 10/129 (7%)
Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
L++L P T +L + + ++ NAFG+ ++ R +G+Y AS+FNH C
Sbjct: 254 LTALPPLQPFITSDLVRSSIGRNISNAFGIWSDSSSSEEEREYF-GFGVYPSASYFNHSC 312
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFA 275
PN V EN + + D+ G E+C+ Y +N RQK L ++ F
Sbjct: 313 GPNV-----VKKRIENK--LTFTTLRDISAGEELCIDYGNYINEPVEVRQKE-LSEWFFN 364
Query: 276 CDCDRCKVE 284
C CD+C E
Sbjct: 365 CGCDKCVAE 373
>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
Length = 660
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 121/324 (37%), Gaps = 49/324 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR ++ +K G+ +L + P + S L + + C NCF +T+
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
+ CP C+ V +CS +C A+ H Y C + + ++N+ A A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351
Query: 115 ----------RQLQARFLIA------------AYNLAVVNPSQFQILLAFQGTVTDNDTS 152
QL LI+ + PS F + +T +
Sbjct: 352 YFLKLKPTIDEQLTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTHCLRA 411
Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
++ S P S L L + N + E + GR G IY +
Sbjct: 412 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
FNH C P R+ T I + + + G I +Y P+ + S RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 524
Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
Y F C CD C NW D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546
>gi|391347157|ref|XP_003747831.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Metaseiulus occidentalis]
Length = 684
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 83/189 (43%), Gaps = 19/189 (10%)
Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
A + S+ P T +E +L + R+N + ++ D R +YL AS N
Sbjct: 380 AETILSIVSPFVFTQLEEVMDILQRCRINCHSI----VDHNDFHFFSRGRAVYLAASKTN 435
Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHD--VPQGREICLSYFPVNYDYSTRQKRLLYD 271
H C+ + +YV N I++R I D + ++ + Y P Y +R +R L +
Sbjct: 436 HSCVTSN---EYVQIF--NGRKILLRAIQDFRIDDPLQMTIHYMPPTLPYESRLRRCLNN 490
Query: 272 YDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQD-EQMVASDDDAEVHGD-----TNFP 325
Y F C C +C +A + + D E V + +Q E+ +SD+ E+ N P
Sbjct: 491 YYFICSCRKCVWQAR--NPEPDQAAGETVAEIEQAFEEYHSSDEWYEIGTKLLEKVKNLP 548
Query: 326 HAYFFMRYM 334
F++ ++
Sbjct: 549 EKNFYVYWL 557
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 106/290 (36%), Gaps = 54/290 (18%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
GRGL +T ++ G+ V P + L S C NCF M S +
Sbjct: 21 GRGLFATSTINMGKEVF-SMPATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLRA 79
Query: 71 CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
CS V +C KC +A H P +A+ R++ L + + ++
Sbjct: 80 CSGCKVVKYCDEKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEEHVK 139
Query: 113 LDRQLQARFLIAAYNL-AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
D + FL +L + N +Q Q + + S + IE
Sbjct: 140 SDLTM---FLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGS-----------DLDIET 185
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
A LA+ +N+F + G ++ + Y+ NH C PNA
Sbjct: 186 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 231
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ I V+ + + ++ +SY Y + RQK+L Y F C C +C
Sbjct: 232 DEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|402077091|gb|EJT72440.1| hypothetical protein GGTG_09306 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 455
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+L+AS NH CL NA + E + + I D+ QG EI + Y + +
Sbjct: 105 GIFLEASRINHACLQNAQ-----NTWNEGLQQLTIHAIRDINQGEEITIMYIEDRANRAA 159
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ+ L D+ F C C C
Sbjct: 160 RQRTLQRDFRFTCSCQLC 177
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 106/290 (36%), Gaps = 54/290 (18%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
GRGL +T ++ G+ V P + L S C NCF M S +
Sbjct: 21 GRGLFATSTINMGKEVF-SMPATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLRA 79
Query: 71 CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
CS V +C KC +A H P +A+ R++ L + + ++
Sbjct: 80 CSGCKVVKYCDEKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEEHVK 139
Query: 113 LDRQLQARFLIAAYNL-AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
D + FL +L + N +Q Q + + S + IE
Sbjct: 140 SDLTM---FLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGS-----------DLDIET 185
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
A LA+ +N+F + G ++ + Y+ NH C PNA
Sbjct: 186 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 231
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ I V+ + + ++ +SY Y + RQK+L Y F C C +C
Sbjct: 232 DEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281
>gi|310795375|gb|EFQ30836.1| hypothetical protein GLRG_05980 [Glomerella graminicola M1.001]
Length = 411
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 14/119 (11%)
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
TT + L ++D NAFG+ EG + + +G + AS+FNH C PN V
Sbjct: 295 TTPQSLLTLSSRDSHNAFGIRSLEDEGSE----LFGHGCWPAASYFNHSCGPN------V 344
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
D E R DV +G ++C++Y R+ R L ++ F C C RC+
Sbjct: 345 DKRREGRA-WEFRASRDVGKGEQLCITYLGGEEKAMPRETRMSTLRRNWGFECGCKRCE 402
>gi|156544373|ref|XP_001607398.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 507
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/279 (22%), Positives = 104/279 (37%), Gaps = 31/279 (11%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+++ L+ G+ +L + P ++ C++ + T S +C CS C
Sbjct: 21 GRFLVASKDLEPGEQILTELPFVVGPKAATYPVCLSCYSVWPATEDDSKPLCSRCSWPVC 80
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA-------NLP-LDRQLQARFLIAAYNL 128
P+C + H Y C S D A +P L+ R L+AA
Sbjct: 81 GPECEN---NPQHKDYECPIFEAAKEKFSIDVALSEEHQNGVPQLECITPLRLLLAAEKD 137
Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN------ 182
S+ + + A + +++ +E L DR +
Sbjct: 138 PERWKSEIKDMEAHNKKRAQKNQWHIDHVN---------IVEYIRKRLKLDRFSEEDIQT 188
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
A G+++ + A G+Y K + NH+C+ N + N+ I +R
Sbjct: 189 ACGILDVNSHEIRTAKGFLARGLYPKVAIMNHNCVSNTAH-----SIDPNDYRIFLRSAV 243
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
VP E+ SY R++ LL FAC C+RC
Sbjct: 244 KVPAAGELFGSYTHALLPTLLRREHLLESKHFACACNRC 282
>gi|440801197|gb|ELR22218.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 343
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 9/84 (10%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY---FPVNYD 260
G+Y+ +SF+NH C+PN R Y ENNT V + G+EI +SY N
Sbjct: 255 GLYILSSFYNHSCVPNT-RSSY----PENNT-WHVYASKPIEAGQEIFISYVDHMKSNKT 308
Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
TR++ L Y F C+C RC++E
Sbjct: 309 VETRRQHLYNHYGFWCECPRCRLE 332
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA + G+GRGL + +S G++V+ P Y++ P I+ S C NCF +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A+ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSACQREEWKLHQ--LECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
I + + ++ A + +++ D + L L P ++
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+K NA + +P + G+Y S NH C+PNA E
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T VR + + + E+ +SY RQ L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 386
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 35/246 (14%)
Query: 53 CHNCFRKTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
CH+CFR+ + C C A +C C TA CQA+ RL +
Sbjct: 2 CHSCFRQQ--AQLHRCAQCRFAHYCDRTCQTACWEEHRQE--CQAIRRL--------GEV 49
Query: 112 PLDR-QLQARFLIAAY-NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--------LC 161
P DR +L AR + + + V + SQ + Q V D L +
Sbjct: 50 PADRIRLAARVMWRMHKDTGVASDSQLLPVEELQDHVADLPQEQLQRLHADVRTFQEFWS 109
Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-A 220
+ + E A +L + N F L + Q G ++V G++ NHDC PN +
Sbjct: 110 CGRSRPSREDIAHVLGIIKTNGFTLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNCS 163
Query: 221 CRFDYVDAAAEN-----NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
++ + +A N + +R + V QG E+ +SY + RQ++L + F
Sbjct: 164 VVLNHGNQSATNPALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFH 223
Query: 276 CDCDRC 281
C C RC
Sbjct: 224 CTCQRC 229
>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
nagariensis]
Length = 405
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 17/87 (19%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF------PV 257
G++ SF NH CLPNA F ++V + QG E+ ++Y PV
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVV-------GGIMVVVAARKIRQGSEVLINYLGRASLRPV 174
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
RQ +L Y F+CDC RC+ E
Sbjct: 175 G----ERQGQLAEGYHFSCDCPRCRTE 197
>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 396
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 45/90 (50%), Gaps = 10/90 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNY 259
G+Y S NH CLPNA F Y NN + V +P G+EIC+SY ++
Sbjct: 298 GLYSLQSTCNHSCLPNAEITFPY------NNNVMAVVAKEKIPSGQEICISYLSECDLSR 351
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
+RQ L +Y F CDC +C EA+ D
Sbjct: 352 SRHSRQNILKENYLFTCDCPKCLSEADEPD 381
>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 528
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
G+Y KA+ NH C PN RFD A ++ I + G+E+ +SY + +
Sbjct: 228 GLYPKAAMLNHSCTPNCIVRFDLTRKQA------VIHTIETIHSGQELTISYVDIFDPTT 281
Query: 263 TRQKRLLYDYDFACDCDRC 281
R+KRL Y F C C RC
Sbjct: 282 IRKKRLRSSYFFDCTCQRC 300
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/272 (22%), Positives = 105/272 (38%), Gaps = 31/272 (11%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP-SCSVAF 75
GRG+ +T+ + AG+++ ++ ++ SS C +T+ + +C C++
Sbjct: 66 GRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCI--CCHETLPQTGFLCRHRCTL-- 121
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
P C T A S H C+ R ++++ PL ++ + A + +
Sbjct: 122 --PVCETCADSEEHQAE-CEHFRRWQPKDVDAEQEQVNPLSLRI-----LTAVRVFHLGK 173
Query: 134 SQFQILLAFQGTVTD----NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
Q ++ A Q AA + P + + + LN P
Sbjct: 174 EQRHLVDAMQANAERAYRREIIQAAQCFRNF---PTTDRVFMDQLFRIVGVLNTNAFEAP 230
Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
+ G G++ + NH+C PNA + EN +VR D+P+G E
Sbjct: 231 C---RSGGHETLLRGLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGE 281
Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I +Y + + TR L FACDC RC
Sbjct: 282 ITTTYTKILWSNLTRNIFLKMTKHFACDCARC 313
>gi|397601249|gb|EJK57855.1| hypothetical protein THAOC_22059 [Thalassiosira oceanica]
Length = 572
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 12/101 (11%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII--------VRMIH---DVPQGRE 249
R G+Y A+ NH C PNA R +E T +I V ++H + +G E
Sbjct: 306 RILGVYPLAAMINHSCCPNAIRVFGTVPKSEIGTTVIDEECTGKDVMIVHASTKISRGEE 365
Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
I SY P R++ +L Y F C+C RC E +D
Sbjct: 366 IVWSYIPPCGPVQ-RRRHMLKKYGFVCECTRCAKELEVADK 405
>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Taeniopygia guttata]
Length = 404
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 80/195 (41%), Gaps = 26/195 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPP 163
L L R L L Y P F+ L A QG T + + H +L P
Sbjct: 205 GQLELLRLLFTEALYDEYLSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP 264
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
+L ++ L+AF + IE + G + G+Y+ S NH C+PNA
Sbjct: 265 -----------MLQREELDAFIDQLYKDIEKESGEFLNCEGSGLYVLQSCCNHSCIPNA- 312
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDC 278
+ ENN + + + D+ G EIC+SY +R R L +Y F C C
Sbjct: 313 ----ETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSC 368
Query: 279 DRCKVEANWSDNDND 293
+C +A+ +D +D
Sbjct: 369 PKCLAQADDADVTSD 383
>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 524
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 5/82 (6%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
V G+Y+ A+ NH C PNA + N +R I D+P G +I +SY +
Sbjct: 442 VHGSGLYVYANSCNHSCSPNAAITN-----ESTNFSATIRSITDIPNGNQIEISYIEEDQ 496
Query: 260 DYSTRQKRLLYDYDFACDCDRC 281
TRQ L+ Y F C C +C
Sbjct: 497 PSQTRQSELIDKYKFKCHCQKC 518
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA + G+GRGL + +S G++V+ P Y++ P I+ S C NCF +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A+ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
I + + ++ A + +++ D + L L P ++
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+K NA + +P + G+Y S NH C+PNA E
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T VR + + + E+ +SY RQ L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|342182930|emb|CCC92410.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 372
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 112/288 (38%), Gaps = 55/288 (19%)
Query: 17 GRGLVSTQSLKAGQIVLRDS-PILLYSALPFINSSSFCHNCFRKTMHSSSSIC--PSCSV 73
GRG+V+ ++LK G +V+R S P L Y + C C K + C P C
Sbjct: 113 GRGIVARENLKQGTVVVRRSEPWLRYP-----TADGLCALCA-KPLPERFFTCTNPQCHE 166
Query: 74 AFCSPKCSTAAASSSHSPYVCQ-------ALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
+CS C T A S HS C+ L + S +AA+ DR A L+
Sbjct: 167 EYCSRDCRTMALSLYHS-RTCRIGGLQAIELDLYTQMKSTEAAS---DRNSAAAQLLMLR 222
Query: 127 NLAV-----VNPSQFQILLAFQGTVTDNDTSAAHYLSSL-------CPPPASTTIELTAA 174
LAV + PS + G + + T+ A +L ++ C S E
Sbjct: 223 VLAVSLQQQIVPSALSEVRILSGRLLFDPTALAGHLLTIYERFTRACCTSTSIPYEEMIG 282
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+LA+ N F + Y+E + L S NH C N AE+
Sbjct: 283 VLARVTANCFH-RDTYVE------------LNLPRSMLNHSCDAN---------VAEDGE 320
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYS-TRQKRLLYDYDFACDCDRC 281
+R DV + E+ ++YFP S T + R L F C C RC
Sbjct: 321 TGEMRTTQDVGREAELTINYFPHLKGLSYTERSRELDRRGFVCHCLRC 368
>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
carolinensis]
Length = 334
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/307 (21%), Positives = 100/307 (32%), Gaps = 47/307 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
GRG GL ++++KAG ++ R+ P A+ + C C R+ C C VA
Sbjct: 43 GRGNGLRLSKAVKAGALLYREEPFAY--AVTKKRLGAACERCLRRKERLLR--CSQCKVA 98
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
+C +C A C + S + P +L R + +
Sbjct: 99 RYCDARCQKEAWEDHKRECKC--------IKSVEPNFPPDSVRLVGRIIFKLLRQSTCPS 150
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCP----------------PPASTTIELTAALLA 177
+ Q V L L PPA E A
Sbjct: 151 EELYSFSDLQSNVEKLSEEMKEGLGHLAKTLQLYLKVEIQDVSQLPPALDVFET----FA 206
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
K N F + +G G+Y S N+ C PN A ++
Sbjct: 207 KVACNCFSI-------SNGEMQDVGVGLYPSMSLLNNSCDPNC-------AIVFEGPELH 252
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENN 297
+R I ++ +G E+ + Y RQ+RL Y F CDC C ++ D
Sbjct: 253 LRSIREIQEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDIMKVTHGT 312
Query: 298 EEVMDED 304
E + +D
Sbjct: 313 EHSLVQD 319
>gi|403169846|ref|XP_003329264.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168447|gb|EFP84845.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 416
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query: 201 RAYG-IYLKASFFNHDCLPNACRFDYVDAAAENNTDII-VRMIHDVPQGREICLSYFPVN 258
R +G + L+ S FNH C PN F ++ T ++ +R + G E+ +SY P+
Sbjct: 245 RHFGALVLQGSRFNHACRPNVVYF------IDHETQLMNLRAFESISSGEELTISYRPLE 298
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
D +R+K L Y F C C C++ A
Sbjct: 299 MDRESRRKELQETYGFRCTCPHCQMSAE 326
>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
Length = 626
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNT----DIIVRMIHDVPQGREICLSYFPVNY 259
G++ S F+H C PN C F D N+ I+V+ + V +G E+C+SY +
Sbjct: 295 GLFPMVSMFDHSCSPN-CSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCISYIDIMN 353
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEA 285
R++ L Y F C C RC E
Sbjct: 354 PTCIRRRELWYSKYFVCRCSRCMSET 379
>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
Length = 660
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 126/324 (38%), Gaps = 49/324 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR + +KAG+ +L + P + F + + C NCF ++
Sbjct: 237 VSPLVRIDRNRQEGRFARAAADVKAGEELLVERPFVAVLLEKF--AKTHCENCFVRS--G 292
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
C C+ V +CS +C AA+ H Y C + L ++N+ A A+ PLD
Sbjct: 293 VPVACSRCADVIYCSEQCREEAANKFHK-YECGIVPILWRSGASINNHIALRIIASKPLD 351
Query: 115 RQLQARFLI----AAYNLAVVNPSQFQILLAFQGTVTDNDTS-------AAHYLSS---- 159
L+ + I + L + F+ + + + S A +L+
Sbjct: 352 YFLKLKPTIDEELSPEKLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTRCLKA 411
Query: 160 ---LCPPPASTTIELTAALLAKD----RLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
P P I++ A+L+ + + N + E + GR G IY +
Sbjct: 412 GGYFGPEPKQDQIQVIASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
FNH C P R+ NT I + + + G I +Y P+ + S RQ RL
Sbjct: 472 LFNHSCDPGVVRY------FRGNT-IHINSVRPIEAGLPINENYGPIYTQDERSERQARL 524
Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
Y F C CD C NW D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546
>gi|440792951|gb|ELR14156.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 627
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 5/84 (5%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G+++ A+ NH C PN + + N+ I V I+++ +G E+C SY +
Sbjct: 511 ASGLFVFANSMNHSCAPNV-----IVVSCFNSFLIRVIAINEIKKGDELCFSYIDEEAPF 565
Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
RQ++L Y F C C++C +EA
Sbjct: 566 EQRQRQLEKLYLFECRCEKCAIEA 589
>gi|147810952|emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]
Length = 588
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ +F NH C PNA R D +IV DV G EI +YF V +
Sbjct: 378 GLWILPAFINHSCNPNARRLHVGD-------HVIVHASRDVKAGEEITFAYFDVLSPWRK 430
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
R + + F C+C RCK E
Sbjct: 431 RTD-MAKTWGFQCNCKRCKFE 450
>gi|167342138|gb|ABZ69017.1| SET domain-containing protein C [Pinus sylvestris]
Length = 141
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + I + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAIFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CKNWGFECKCRRCIVE 84
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA + G+GRGL + +S G++V+ P Y++ P I+ S C NCF +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A+ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
I + + ++ A + +++ D + L L P ++
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+K NA + +P + G+Y S NH C+PNA E
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T VR + + + E+ +SY RQ L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 29/143 (20%)
Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
L G++ N T A + + LC AAL + N+FGL DG +
Sbjct: 528 LGEGGSIEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
+Y +AS+FNH C PN CR Y A + ++ +G + + Y V
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619
Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
+ ++R L+ Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642
>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 322
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 40/84 (47%), Gaps = 8/84 (9%)
Query: 204 GIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
G+Y A+ FNHDC PNA RFD + V V +G E+ + Y V
Sbjct: 246 GVYPSAAMFNHDCAPNAAQRFDAFGC-------VRVETTRRVRKGEELTIPYVDVMLGRE 298
Query: 263 TRQKRLLYDYDFACDCDRCKVEAN 286
R+ +L ++ F C C RC+ EA
Sbjct: 299 ERRGKLRKNFAFECACARCEREAG 322
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSCSVAF 75
G + +++ L G +++R +P ++A+P+ + CH CFR + C +C VA
Sbjct: 21 GEAMRASRDLPPGTLLVRVAP---FAAVPYPDEMRRSCHGCFRAC--GAERECGACGVAR 75
Query: 76 CSPKCSTAAASSSHSPYVCQALTRL 100
C+ + ++ Y C AL RL
Sbjct: 76 LCASCAGCETTKAYHAYECHALARL 100
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 771 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 824
Query: 264 RQKRLLYDYDFACDCDRC 281
R L FACDC RC
Sbjct: 825 RNIFLKMTKHFACDCVRC 842
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 42/278 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS--SSSICPS-CSV 73
GRG+ +T+ + AG+ + R+ +L+ + C C+R S S+++CP+ C +
Sbjct: 53 GRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSAALCPAGCGL 112
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
P CS S+ H C+ + L S + +A +++ ++
Sbjct: 113 ----PVCSECRDSTRHD-LECKLFRKWKPLESQRI-------EPRALRILSVVRCFFLDE 160
Query: 134 SQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
+ ++L A Q + AA P ++ + NAF
Sbjct: 161 ASRKLLYAMQANMDRYYMQEVQRAADCFEHF--PREQDMLDYFYRTICAFNTNAF----- 213
Query: 190 YIEGQDGRRSVRAYGIYLKASF-----FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ R +V + + ++A F NH C PNA EN I+V +
Sbjct: 214 -----ESRSNVDGHEVLVRALFPLAGLLNHQCTPNAAHH------FENGETIVVCATERI 262
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
P G EI +SY + + R+ L F C C RC+
Sbjct: 263 PAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQ 300
>gi|20152089|gb|AAM11404.1| RE25548p [Drosophila melanogaster]
Length = 403
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 111/278 (39%), Gaps = 29/278 (10%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
GR LV+ + L+AG+ ++R+ P+ + P ++ C C+ ++ + CP C+
Sbjct: 58 GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114
Query: 76 CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
C C+ H+ CQ A R + L + A + + ++ L
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174
Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
+P QF ++ + + +A HY + T +L L A+ G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232
Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
+ GQ+G +A +Y A HDC PN D ++ +I++R V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
+ L Y Y KR + ++ F C C RC
Sbjct: 285 REALTLGY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319
>gi|363540350|ref|YP_004894267.1| mg216 gene product [Megavirus chiliensis]
gi|350611515|gb|AEQ32959.1| putative set domain-containing protein [Megavirus chiliensis]
Length = 227
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
I FNH CLPN + N ++ + D+ +G E+ SY + +D T
Sbjct: 132 AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 184
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ RL Y F CDC RC
Sbjct: 185 RQSRLWNQYRFHCDCQRC 202
>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
Length = 519
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 99/284 (34%), Gaps = 39/284 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------C 68
GR LV+T+ +K G+I+ RD P Y+ P I + C C R M + C
Sbjct: 55 GRHLVATRHIKQGEIIYRDEP---YAVGPKIANVPLCLGCNRNLMAGWDATRGLDRFHEC 111
Query: 69 PSCSVAFCSPKCSTAA---------ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
C C P C A A S + P + + S +PL L
Sbjct: 112 SRCGWPLCGPGCEEVAQHRPECSVLAGSGYRPNIRPNPSNPEQRESAYCVIVPLRVLLLE 171
Query: 120 RFLIAAYNLAVVNPSQFQILLA--FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
R Y S LA G + N + L T ++L+A L
Sbjct: 172 RIAPERYATVQGFESHLDERLASPLYGVLRSNLVPFLRQVLRLQQYSEQTVLKLSAIL-- 229
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
N + + P + V+ G+Y + +HDC PN Y D + ++
Sbjct: 230 --DTNCYEIRLP-------EQHVKVRGLYPLGAMLSHDCRPNT--KHYFD----DRLHMV 274
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ D+P G I SY R+ L F C C+RC
Sbjct: 275 LVATVDIPAGGVIHASYTQPLLGTVQRRLALRQAKCFDCCCERC 318
>gi|167342086|gb|ABZ68991.1| SET domain-containing protein C [Pinus taeda]
gi|167342098|gb|ABZ68997.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ+R ++ F C C RC VE
Sbjct: 65 RQRR-CENWGFECKCRRCIVE 84
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 111/290 (38%), Gaps = 46/290 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
G GRGL ++ + G+ VL P+ + L + C CF + I C
Sbjct: 13 GLGRGLFASTDIAVGEEVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGITLKGCRG 71
Query: 71 CS-VAFCSPKCSTAAASSSHSPY--VCQALT-RLLNLNSPDAANLPLDRQLQARFLIAAY 126
C V +C C HS + Q L R+L +N+ + L + Q
Sbjct: 72 CGIVKYCDKTCQAKDWKLGHSLECSIYQNLKPRVLPINARAVLRMVLRSERQK------- 124
Query: 127 NLAVVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTT---IELTAALLAK 178
+ + L + D D SA+ + LSS S T E+ +A+ AK
Sbjct: 125 ----YSSEELDQFLQLGTHIKDIRDQSASQWERISLSSKAIRAYSGTDMSEEVISAMGAK 180
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDI 236
LN+F L + G+YL A+ FNH C D+ A + + ++
Sbjct: 181 LDLNSFNLTNAVYD---------RLGVYLHPYAAIFNHSC-------DHNAAVSFDGPNL 224
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
++ + + + +I ++Y V Y RQ L Y F C C +C EA
Sbjct: 225 HIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCTREAG 274
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 275 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 328
Query: 264 RQKRLLYDYDFACDCDRC 281
R L FACDC RC
Sbjct: 329 RNIFLKMTKHFACDCVRC 346
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 102/286 (35%), Gaps = 39/286 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
GRGL + K G +L P + + F+ NC T + C C VA+
Sbjct: 57 GRGLFTQTDAKPGTRLLSVRPHIHAVSARFLED-----NCTLCTSEENVRRCTRCKKVAY 111
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
CS +C TA CQ+L R + D+ P D L+ N+ +
Sbjct: 112 CSTECQTADWGIHKQE--CQSLRRWAEASGSDST--PADSIRAISRLLWMRNIKGADSIW 167
Query: 136 FQILLAFQGT----VTDNDTSAAHYLSSLC------PPPASTTIELTAA-----LLAKDR 180
++ + A Q + S H SL P + + L++K
Sbjct: 168 WRQIAAMQSNREHLSSSTQESYTHLAQSLVLYMKIESPEGLREYGIESGKDLVDLMSKFT 227
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACR-FDYVDAAAENNTDII 237
N+F L ++ A G+ + NH C PNA F A D+I
Sbjct: 228 TNSFTLTSTFLN---------AIGVATAPIPALINHSCQPNAVVVFPATRKGAPPTLDVI 278
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
I + +G E+ +Y + RQK L Y F C C CK+
Sbjct: 279 A--IQPIRRGEEVLAAYVDITLPREIRQKSLKETYAFECSCTLCKL 322
>gi|168069006|ref|XP_001786289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162661736|gb|EDQ48899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1117
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R V +G++ SF NH CLP++ A + V D+ G E+ +YF +
Sbjct: 878 RKVDHWGLWWLPSFMNHSCLPSSSPIRVGKA-------LFVFASRDLRAGDEVTRAYFDI 930
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
R++ + +DFAC C RCK+E
Sbjct: 931 FLPLDQRKELSMKGWDFACHCPRCKLE 957
>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
Length = 527
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 117/308 (37%), Gaps = 76/308 (24%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-- 67
+A E GR L +T+ +K G++++R++P++L P + S C C R +
Sbjct: 49 LASNELLGRHLRATRDIKMGELIMREAPLVLG---PKVASMPICLGCHRNLLPPQKPALN 105
Query: 68 ---CPSCSVAFCSPKCSTAA---------ASS---------------SHSPYVCQALTRL 100
C +CS C P+C +A A+S S Y L R
Sbjct: 106 YYKCSACSWPLCGPECERSAFHVDECRLMAASKFQSKINYNPGQPVGKESAYCVIMLLRC 165
Query: 101 LNLNSPDAANLP----LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY 156
+ L A L+ L+ R Y + N L++F TV
Sbjct: 166 MQLKESKPAAFARLCELEDHLKERLETPLYQVLRAN------LISFIKTVL--------- 210
Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
L P + + A L NAF + Q G R ++ G++ + F+HDC
Sbjct: 211 --GLREWPDLDILRIAAIL----DTNAFEVR------QAGDR-IKVRGLFPGGAMFSHDC 257
Query: 217 LPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD-- 273
+PN RFD ++ +I+ + +G + +SY ST Q+RL
Sbjct: 258 VPNMRHRFD-------DDMNIMFLAKRPIAKGEILSISY--TQQLRSTIQRRLHLKQVKC 308
Query: 274 FACDCDRC 281
F C C RC
Sbjct: 309 FECACARC 316
>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
Length = 491
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 116/292 (39%), Gaps = 33/292 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
E+V V EG+GRGL + + G ++ + Y+A+ F + + CH CF++ +H
Sbjct: 7 ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 63
Query: 63 SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A+ C C AA + + C A+ R + +P N+ L ++ +
Sbjct: 64 R----CGQCKFAYYCDRTCQRAAWLNHKNE--CSAIKR--HGKAP-TENIRLAARIMWKI 114
Query: 122 LIAAYNLA---VVNPSQFQILL-AFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALL 176
L+ +V+ Q + +F + PP + ++ + +
Sbjct: 115 EREGSGLSEGCLVSIDDLQNHVDSFDEEEKKELRADVESFLEFWPPHSQQFGMQYISHIF 174
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
N F L + Q G ++V GI+ NHDC PN + V +
Sbjct: 175 GVISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMF 228
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + G E+ +SY R+K+L Y F C C+ CK
Sbjct: 229 HTQMRIELRALTKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 280
>gi|408392130|gb|EKJ71490.1| hypothetical protein FPSE_08303 [Fusarium pseudograminearum CS3096]
Length = 543
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 62/142 (43%), Gaps = 26/142 (18%)
Query: 205 IYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
I L S NH C PNA R++ + I V + D+ QG+EI ++Y Y
Sbjct: 102 ILLATSRINHSCKPNAQTRWN------QGLGKITVHAVEDIEQGQEITITYLGNPEVYEE 155
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV--MDEDQDEQ------------- 308
RQK+L + F C C C + ++ D DD+ +E+ + ED + +
Sbjct: 156 RQKKLTNAFGFDCHCRLCSLSP--AERDLDDKLIKEIDYLQEDLENEDSILEFPTRCLHR 213
Query: 309 --MVASDDDAEVHGDTNFPHAY 328
V S +AE G +N P Y
Sbjct: 214 IYKVVSQLEAEGVGTSNVPKLY 235
>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 712
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 62/172 (36%), Gaps = 48/172 (27%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N FGL + S I+ +AS+FNH CLPN CR Y A +
Sbjct: 560 NNFGLF-------SSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIA------AFYAL 606
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKR-LLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
+ +G + + Y V + ++R LL Y F C+C RC +SD D
Sbjct: 607 QSIRKGEPLTICYVDVQEASTAERRRTLLTSYRFFCECRRCH---GYSDEDG-------- 655
Query: 301 MDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDA 352
GDT F CD + G L PL P DA
Sbjct: 656 ------------------KGDTVMSEIRF-----CDTCDARGYLRPLAPEDA 684
>gi|170115757|ref|XP_001889072.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164636020|gb|EDR00320.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 353
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 45/94 (47%), Gaps = 12/94 (12%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS- 262
GI L AS FNH C PNA RF + N + + + D+P G EIC++Y Y
Sbjct: 167 GIGLTASRFNHSCSPNA-RFSF----NPTNGQLRIFALSDIPVGEEICVAYLSSRRLYGQ 221
Query: 263 TRQKR---LLYDYDFACDCDRC---KVEANWSDN 290
TRQ R L Y F C C C K E SD+
Sbjct: 222 TRQHRQGILRSRYHFTCSCSVCSLPKAEMILSDS 255
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 62/289 (21%), Positives = 105/289 (36%), Gaps = 52/289 (17%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
GRGL +T ++K G+ V P + L C NCF M S +
Sbjct: 20 GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNVMASVDMKLWA 78
Query: 71 CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
CS V +C KC +A H P +A+ R++ L + ++
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKVLPVNSRAVLRIVKLRDSNEDHVK 138
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
D L + ++ + + N +Q Q + + S + +E
Sbjct: 139 SD--LNMFLTLRSHLVEITNSNQEQYERIMLCAKAEKEYSGSEL-----------DVETI 185
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
A LA+ +N+F + G ++ + Y+ NH C PNA +
Sbjct: 186 AEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGFD 231
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I V+ + ++ ++ +SY Y RQK+L Y F C C +C
Sbjct: 232 GGLITVKALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280
>gi|407394360|gb|EKF26913.1| hypothetical protein MOQ_009381 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 72/353 (20%), Positives = 129/353 (36%), Gaps = 93/353 (26%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSS-------- 66
+GR +++T + A + ++PI+ + A +I FC +C R+ +S ++
Sbjct: 30 KGRHVIATADIPAQSGLFEEAPIVSWPAQGYIALDIPFCSHCLRQKANSDAANTQNDSLW 89
Query: 67 -ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL--------LNLNSPDAANLPLDRQL 117
C C FCS C ++ + H +C AL L N+ ++L + ++
Sbjct: 90 RRCDGCGSYFCSDNCEFSSKIAHH--ILCGALRELREEEDAGSFYTNTGKCSDLTITKES 147
Query: 118 QARFL--IAAYNLAVVNPSQF-----------------------------QILLAFQGTV 146
AR + + A + QF +++ A GT
Sbjct: 148 LARCVAWVVARIATSIKQQQFSGELLERHHTENEESISRQLFHIACAPFNRLIDAPNGTE 207
Query: 147 TDNDTSAAHY---------------LSSLCPPPA---STTIELTAALLAKDRLNAF-GLM 187
+ + + Y L + PP + + +E+ ALL D L F G M
Sbjct: 208 FGDVDALSWYRTVDRLLREPCRIALLEATLPPRSENDAWALEIVDALLRHDTLETFLGQM 267
Query: 188 EP--------YIEGQDGRRS----------VRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
I+ G ++ ++ +Y S FNH C PN +
Sbjct: 268 SLNSQAINGFIIQCPSGEKAFSAPPLVEWVLKGGAVYTLQSVFNHSCDPNV-----TVSN 322
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ DI +R + V G E+ ++Y P+ ++ L Y F C C RC+
Sbjct: 323 VDGTHDITLRTLRPVKCGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRCQ 375
>gi|367019314|ref|XP_003658942.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
42464]
gi|347006209|gb|AEO53697.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
42464]
Length = 414
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 19/114 (16%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ S FNHDC PN + + +T + VR D+P G E+ +SY Y
Sbjct: 181 GLFPDVSKFNHDCRPNV----HYRISDLTHTTVAVR---DIPAGEELTISYI---YGLKP 230
Query: 264 RQKRL--LYDYDFACDCDRCKV---EANWSDNDNDDENNEEVMDEDQDEQMVAS 312
R +RL L ++ F C C +C + EA SDN + M ED+ E++VA
Sbjct: 231 RAERLEQLSEWGFMCTCPQCALSEPEAGASDN----RIRQIKMLEDEIEELVAK 280
>gi|363749425|ref|XP_003644930.1| hypothetical protein Ecym_2380 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888563|gb|AET38113.1| Hypothetical protein Ecym_2380 [Eremothecium cymbalariae
DBVPG#7215]
Length = 366
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 67/153 (43%), Gaps = 19/153 (12%)
Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIE 192
+F +LL FQ T +L P + T EL +L + N+FG+ + E
Sbjct: 225 EFPVLLKFQ-------TEVYMFLYVTLPENLKSLMTRELFRKILGSEYGNSFGIWQEN-E 276
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
D R + Y + + SFFNH C PN + N + ++ DV G E+C+
Sbjct: 277 AFDNREYL-GYWMLPEGSFFNHSCAPNLSK-----KRIGNVMNFVLN--KDVKMGDELCI 328
Query: 253 SYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
Y + + R+K L ++ F C C RC +E
Sbjct: 329 DYKGILDLPIHQRRKILKTNWFFECQCSRCSLE 361
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 101/279 (36%), Gaps = 39/279 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL +T+S G+++ P Y L + C CF T S C C A
Sbjct: 14 GKGRGLKATRSFALGELLF-SCPAYTY-VLTDNERGNHCDFCF--TRKEGLSKCGKCKQA 69
Query: 75 F-CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL-I 123
F C+ C ++ S C + T L +R RF+ +
Sbjct: 70 FYCNVDCQKGDWPMHKLECSSMCSSGQNWCPSETVRLTARILAKQKTQTERTASERFMSV 129
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH-YLSSLCPPPASTTIELTAALLAKDRLN 182
+ S L + + +ND SA H + S + +E L A+ N
Sbjct: 130 KEFE------SHLSKLDNEKKELIENDISALHRFYSKNVHNCDNAALEF---LFAQVNCN 180
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
F + +D S I+ + NH C PN Y AE VR +
Sbjct: 181 GFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTVAE------VRAVQ 226
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
++ G E+ SY + Y R RL Y F+CDC C
Sbjct: 227 EIHAGEEVFTSYIDLLYPTEDRNDRLKDSYFFSCDCREC 265
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 118/312 (37%), Gaps = 58/312 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--RKTMHSSSSICPSCSV 73
RG GL + L+ G+++ R P L Y+ + C C RK + C C +
Sbjct: 14 RGNGLRALVQLRPGELLFRSDP-LAYTVCKG-SRGVVCVRCIIGRKYL----LYCSRCRL 67
Query: 74 -AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLI 123
A C KC A C+ L PD+ L L R + ++ L
Sbjct: 68 LAQCGAKCRKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEETPSESEKLY 124
Query: 124 AAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
+ Y+L Q+++ FQ + + A+ PP+ E A +
Sbjct: 125 SFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPSFDIFEAFAKV 178
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+ N+F + ++ G+Y S NH C PN + N
Sbjct: 179 IC----NSFTVCNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPH 220
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD- 294
+++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D D D
Sbjct: 221 LLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQ----DKDADML 276
Query: 295 ENNEEVMDEDQD 306
+E+V E Q+
Sbjct: 277 TGDEQVWKEVQE 288
>gi|389604014|ref|XP_003723152.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504894|emb|CBZ14681.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 623
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 98/231 (42%), Gaps = 40/231 (17%)
Query: 68 CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA--RFLIA 124
C +C+ VA+CSP C+ A H + C +TRL ++ + R+L A +F
Sbjct: 262 CTTCNMVAYCSPACAEAH-REQHEKFECSRMTRLSDMMESIES-----RRLDAPEQFFEL 315
Query: 125 AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP-----------PPASTTIELTA 173
AY+ + + + V + + SL P +TT+ +
Sbjct: 316 AYH-CITTLAGMKAKRQGHENVLKLTSHVDEVIQSLHPVGPLMCDLFKGEEETTTLYIII 374
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY-LKASFFNHDCLPNACRFDYVDAAAEN 232
+L R NA + +P G +A + ASFFNH C+PN C D V A
Sbjct: 375 GVL---RCNALEVTDP-----SGLGVAQALHVGNTIASFFNHSCMPN-CAIDTVRHAI-- 423
Query: 233 NTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
+ R+I+ G E+ ++Y P + + R++RL Y F C C RC+
Sbjct: 424 ---VTTRIIY---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCHCPRCQ 468
>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
Length = 1064
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 51/119 (42%), Gaps = 14/119 (11%)
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
T +L+ K R N FG+ R AS+FNH C PN A
Sbjct: 904 TRSLICKIRCNQFGIWSK-------RDKCIGVSCTPAASYFNHSCCPNL-------ADVR 949
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
T ++ + +H + +G I +SY ++ RQ L Y F C C RCK + + SDN
Sbjct: 950 GTTVVVFKALHFISKGTPISISYLDLDQPTHERQSYLKTFYYFTCQCLRCKDQTDESDN 1008
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 15/120 (12%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
GR LV+++ L+ G+IVL L S + CHNC +S I
Sbjct: 646 GRYLVASKDLEIGEIVLVARSFL--SIESDLLRKQTCHNCLEYQNTKASDIDSALKSSVM 703
Query: 68 --CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
C C+ V +CS C + S+ H Y CQ +L + N+ ++ + D + + ++
Sbjct: 704 QRCVGCNEVYWCSESC-YSIGSAIHQRYECQYFKKLRSYNNSNSKAITGDNMTEVKMIVG 762
>gi|449302416|gb|EMC98425.1| hypothetical protein BAUCODRAFT_574506 [Baudoinia compniacensis
UAMH 10762]
Length = 657
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 12/110 (10%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ AS+ NHDC+ NA R D +IVR + G EI LSY P Y T
Sbjct: 430 GMWITASYMNHDCIGNAKRSFIGDM-------LIVRATKAIAAGEEITLSYVPTEDAYET 482
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASD 313
R + L ++ C C C E + D + +Q E+++A++
Sbjct: 483 RTE-TLQAWNIRCRCRLCVAE----EQDTIVARKQRAGLIEQAERLMAAN 527
>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
Length = 338
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 8/89 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+YL S NH C+PNA NN+ + V+ + ++ G EIC+SY +
Sbjct: 236 GLYLLQSTCNHSCMPNA-----EITFPHNNSTLAVKALSNIKTGEEICISYLDECALERS 290
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSD 289
+R K L +Y F C+C +C+ + + +D
Sbjct: 291 RHSRHKILRENYLFNCNCSKCESQLDEAD 319
>gi|270011901|gb|EFA08349.1| hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
Length = 803
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 60/305 (19%), Positives = 113/305 (37%), Gaps = 46/305 (15%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
+V + + E GR LV+T++LK G++V++++P++ P NS C C +
Sbjct: 40 KSVCQPFKICEDAVLGRHLVATRALKPGEVVIQEAPLIWG---PSYNSIPVCLGCGKAVD 96
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQ----ALTRLLNLNSPDAANLPLDRQL 117
+ S C C CS C + + Y Q ++ + P + + R L
Sbjct: 97 ETCSRPCQKCGWPVCSDLCEKSPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCL 156
Query: 118 -QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAAL 175
Q +FL P ++ L Q + ++ ++ + + +L+A
Sbjct: 157 YQKQFL----------PKNWKKLETLQNHLEEHKSTTNYESDRVTIAEFIRRFFKLSATF 206
Query: 176 LAKDRLNAFG----------LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
+D + G L EP+ IY AS H C PN +
Sbjct: 207 SEEDIMKVHGSTLVNGHEVPLTEPF-----------HVAIYSSASMLEHSCGPNCTK--- 252
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ I++ + +G + + Y + +R+ L F C C+RC+ +
Sbjct: 253 ---SFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPS 309
Query: 286 NWSDN 290
+ N
Sbjct: 310 EFGTN 314
>gi|194743442|ref|XP_001954209.1| GF16861 [Drosophila ananassae]
gi|190627246|gb|EDV42770.1| GF16861 [Drosophila ananassae]
Length = 550
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 110/297 (37%), Gaps = 50/297 (16%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILL-----YSALPFINSSSFCHNCFRKTMHSS 64
VAE E GR LV+TQ++ A QI + P+++ +S I CH R H
Sbjct: 48 VAEDEQLGRFLVATQNIAAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCRLGKHQ- 106
Query: 65 SSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
C SC CS C+ A S +L+L A +R L F
Sbjct: 107 ---CRSCHWPVCSAACADEALECS-----------VLSLGPTPGARAD-NRTLNDYFRGD 151
Query: 125 AYNLAVV------NPSQFQILLAFQGTVTDNDTSAAH--------------YLSSLCPPP 164
A + NP+++ LL Q + + H +L L
Sbjct: 152 ALLVLRCLLLQRQNPAKWTALLEMQSHEEERQGTELHVEAEMRVVDFLQQRFLKRLKQTK 211
Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
+ + LL + G++E + V G++ +A H C PN C F
Sbjct: 212 PDLLADCGSELLHR----ICGIIETNYMVIELPSGVELSGLFRQACMMEHACQPN-CYFQ 266
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ DA + I VR D+ +G + ++Y + + RQ L F+C C RC
Sbjct: 267 F-DAFTQQ---ITVRAGCDLQKGDHLRIAYTNILWGTQLRQHHLRLTKHFSCHCARC 319
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
+ VA + G+GRGL + +S G++V+ P Y++ P I+ S C NCF +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70
Query: 67 ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
C C VA +C C C+A+ ++L L L R+LQ
Sbjct: 71 KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
I + + ++ A + +++ D + L L P ++
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
+K NA + +P + G+Y S NH C+PNA E
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T VR + + + E+ +SY RQ L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 494
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 107/280 (38%), Gaps = 51/280 (18%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP------S 70
GRG+ +T+ + A +++ D+P L+ C C++ S CP
Sbjct: 42 GRGMFATRDIAANELIFIDAP-LIVGPRCVGKHLEMCVCCYK-------SECPLFPCDRG 93
Query: 71 CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-AANLPLD----RQL----QARF 121
C + CS +C S H Y C L L+ D + NL L R L Q R
Sbjct: 94 CGLPICSLECEN---SPKHVNYECNYLRSLVPTCGTDWSLNLLLAVIPIRALFMTEQQRK 150
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
+AA SQ++I L + + + +C S + E T ++L KD
Sbjct: 151 CLAALQCDQTLTSQYEINLLQRNVEKSPSEADVELMKRVCGAFNSNSFE-TVSVLDKDH- 208
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
S G+Y + NH+C PN R + + N + V
Sbjct: 209 -----------------STSLRGLYPLGALQNHNCAPNT-RHHF-----DENYRLYVSAT 245
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ G E+ +SY + +D + R++ L F+C C RC
Sbjct: 246 VPISAGEELTMSYTSLFWDTTLRRQFLSITKHFSCTCKRC 285
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/341 (20%), Positives = 121/341 (35%), Gaps = 89/341 (26%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFR--------KTMHSS 64
E GR +++T+ + G+ +L+ Y ++P I C NC + K
Sbjct: 14 ETEGRYIIATRDIDVGENLLKCKA---YFSMPCEIYKKITCGNCVKIIKRPQQEKKKKVK 70
Query: 65 SSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD--------- 114
C C+ + +CS C + H+ Y C+ ++ +P + +D
Sbjct: 71 EYKCNVCNEIWYCSEFCK-KESQPIHAHYECKFFKKI---KAPKLSEWEIDPDTFTEVRM 126
Query: 115 ------RQLQARFLIAAYNLA----------VVNPSQFQ-------ILLAFQGTVTDNDT 151
R Q R L +NL+ ++N + I + T+ D
Sbjct: 127 MVGVISRFYQERVLNKKFNLSNYLKEQQEKRLLNQEELSSEDTLDDIFDLVENTIDDGSN 186
Query: 152 SAAH--------YLSSL------------CPPP------ASTTIELTAALLAKDRLNAFG 185
AA Y+++L P P I + L+ K R N FG
Sbjct: 187 KAAKELIDIITDYIANLFNLVITSSDNHDLPTPELKKEAVEKVISVIRELIHKVRCNQFG 246
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ A I +SFFNH C+PN + + + ++ V
Sbjct: 247 IW-------TKNDKCIAVAISPSSSFFNHSCIPNCINI-------RDGNKMTFKALYPVK 292
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+G + +SY ++ +R++ L Y Y F C C RC + N
Sbjct: 293 KGEPLAISYLDLDLPVESRKEYLKYGYYFDCGCPRCDEKTN 333
>gi|451850767|gb|EMD64068.1| hypothetical protein COCSADRAFT_320770 [Cochliobolus sativus
ND90Pr]
Length = 388
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 84/216 (38%), Gaps = 38/216 (17%)
Query: 80 CSTAAA-SSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQI 138
C A A S H V +AL + +PD + F ++ +P Q++
Sbjct: 194 CQAAVAISKQHKKAVSKALQGRI---TPDVMS----------FCVSGILWRHHHPEQWER 240
Query: 139 LLAFQGTVT------DNDTSAAHYLSSLCPPPAS----TTIELTAALLAKDRLNAFGLME 188
L A T D D YL + P + T E L +D N+FG+
Sbjct: 241 LEALANDNTPYHSSDDLDAFTRAYLHLVAILPVALLPLVTAETLLTLSKRDSHNSFGIRS 300
Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
EG + YG + AS+FNH C N ++ E R ++ +GR
Sbjct: 301 LEDEGSE----FFGYGCWPAASYFNHSCGQN------IEKKREGRI-WYFRAGRNIEKGR 349
Query: 249 EICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRC 281
E+C++Y +R +R+L + F C C+RC
Sbjct: 350 ELCITYLSGEERMLSRGQRMLRLKKTWGFDCGCERC 385
>gi|125585065|gb|EAZ25729.1| hypothetical protein OsJ_09564 [Oryza sativa Japonica Group]
Length = 509
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 8/85 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ +F NH C PNA R D A IV D+ G EI +YF V
Sbjct: 284 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 336
Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
S R++ + C CDRC+ EA+
Sbjct: 337 SKRRE-AARAWGLECQCDRCRFEAS 360
>gi|115450985|ref|NP_001049093.1| Os03g0168700 [Oryza sativa Japonica Group]
gi|19071649|gb|AAL84316.1|AC073556_33 hypothetical protein [Oryza sativa Japonica Group]
gi|108706390|gb|ABF94185.1| TPR Domain containing protein [Oryza sativa Japonica Group]
gi|113547564|dbj|BAF11007.1| Os03g0168700 [Oryza sativa Japonica Group]
Length = 536
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 8/85 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+++ +F NH C PNA R D A IV D+ G EI +YF V
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 363
Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
S R++ + C CDRC+ EA+
Sbjct: 364 SKRRE-AARAWGLECQCDRCRFEAS 387
>gi|66805955|ref|XP_636699.1| hypothetical protein DDB_G0288495 [Dictyostelium discoideum AX4]
gi|74852593|sp|Q54IV4.1|Y8495_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0288495
gi|60465095|gb|EAL63196.1| hypothetical protein DDB_G0288495 [Dictyostelium discoideum AX4]
Length = 532
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 18/100 (18%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAE------------------NNTDIIVRMIHDVP 245
G+YLK S FNH C PN + + N I + + D+
Sbjct: 379 GVYLKLSLFNHSCFPNCTTLIEYNINKKNSNNNNGNNGNNNSYGDTNQLTISIITLRDIE 438
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ +E+ ++Y P+N + R K L ++ F CDC RC E
Sbjct: 439 ENQELLITYIPLNQKINDRVKSLKSNWLFQCDCKRCHFEK 478
>gi|452984368|gb|EME84125.1| hypothetical protein MYCFIDRAFT_173174 [Pseudocercospora fijiensis
CIRAD86]
Length = 816
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 7/83 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + FNH CLPN C + ++ IVR + G E+ L+Y P + D
Sbjct: 489 GLWYTGAMFNHSCLPN-CNWSWIGDM------FIVRANRVITMGEELTLAYIPSSNDSEK 541
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R+ L Y F C C C+ +A
Sbjct: 542 RRNTLRSQYGFECGCRICQADAT 564
>gi|344301790|gb|EGW32095.1| hypothetical protein SPAPADRAFT_61174 [Spathaspora passalidarum
NRRL Y-27907]
Length = 415
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 64/313 (20%), Positives = 113/313 (36%), Gaps = 51/313 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSFCHNCFRKTMHSS 64
V + RG+GL + ++ G+++ + P+ L IN S + C +C R S
Sbjct: 113 VQIVMTSKRGKGLYAKHNIAKGELIWEEEPLFFIPPLANINLIKSGTACSHCGRLLQQSE 172
Query: 65 SSI------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
+ C C+ +C KC V L + N+ +P+A L +
Sbjct: 173 GNTVLRGLDCQVCAEVWCGTKCKKLDG-------VLHGLLKH-NVYNPNAKRSKL-FDPE 223
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC-------------PPPA 165
A + Y L + + I L + + D + A ++ A
Sbjct: 224 AFLHMQDYCLEDQWNALYAITLIYADILNDKSGTKAKQFKAMARVSQDIRYKALNSSAGA 283
Query: 166 STTIELTAALLAKDRLNAFGL-MEPY--------IEGQDGRRSVRAYGI-------YLKA 209
T+ A + + + N + L E + I+ Q+ + Y I +L
Sbjct: 284 FDTLSGGALFVQEQQENLWKLGYEKFLRIFPSNPIDYQEFMYMMGTYNINNLDSNVFLIQ 343
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
S NH+C PN V+ + V D+ G E+ +Y RQ+ L
Sbjct: 344 SHLNHNCNPNTS----VETELLRTDGLKVYAARDIRAGEELTTTYVNPKNTVQQRQRELR 399
Query: 270 YDYDFACDCDRCK 282
++ F C C++CK
Sbjct: 400 VNWGFLCGCEKCK 412
>gi|393212472|gb|EJC97972.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 444
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 124/335 (37%), Gaps = 95/335 (28%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------CP 69
+G+GL +TQ +K G+ V ++ P ++ + + H CF T +SS CP
Sbjct: 89 KGKGLEATQEIKEGEDVWKEDPFVISPSWDIYDLQIASHACFHCTTPFNSSTLPHSLPCP 148
Query: 70 S-----------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
S C FC+ C T A S H P +C A + A++PL +
Sbjct: 149 SSTNPALSQNHPCPARFCNRLCITRAMSRQH-PLLCPAT---------NPASVPLLNFAR 198
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ--------------------------------GTV 146
+R +AA+ +A ++L+AF+ G
Sbjct: 199 SRGWLAAHAVAQCI---ARVLMAFEQGRKQDLEEDLRFIRSLAQMSMDERWKIIETSGME 255
Query: 147 TDNDT-SAAHYLS--SLCPPPASTTIELTAALLAKDR--------------LNAFGLMEP 189
D+ T AH L+ + P +L + ++ K L+A M
Sbjct: 256 PDHQTWKTAHTLTLQAFHEPSNPRDKKLLSKIIRKPLPEDLVKTLFDYDAFLHALSRMGL 315
Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC--RFDYVDAAAENNTDIIVRMIHDVPQG 247
+E G +Y+ S NH C PN FD A + I +R D+ G
Sbjct: 316 NLEAHGG--------LYILHSHMNHSCEPNISVRHFDQRTALSR----ITMRARRDIQPG 363
Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFA-CDCDRC 281
E+ ++Y S R +R L + F C CDRC
Sbjct: 364 EELTVAYVDPALLLSAR-RRALIPWAFGTCMCDRC 397
>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 485
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 111/278 (39%), Gaps = 48/278 (17%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP--SCSVA 74
GRG+ +T+ +K +++ D+P+++ + C C++ CP C
Sbjct: 41 GRGMFATRDIKQNELIFIDAPLIVGPKCSN-KHTKMCVCCYKNE-------CPLFPCDKG 92
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLD----RQLQARFLIA 124
P CST +S + C+ L L+ + D A +P+ ++Q + L
Sbjct: 93 CGLPVCSTQCENSPNHVNECEYLRSLVPICGTDWCLNLLLATIPIRGLFMTKMQRKCLAT 152
Query: 125 -AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
YN + + + ++L + ++S P+ IE+ + N+
Sbjct: 153 LQYNKNLFSKYEIELL-------------KKNVMNS----PSDEDIEMMERVCRAFNTNS 195
Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
F I D S G+Y S NH C+PN Y D + + VR
Sbjct: 196 F----ETIRVHDKDHSTNLRGLYPLGSLQNHCCIPNT--RHYFDEEFQ----LYVRAALP 245
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ G EI +SY + +D + R++ L +F+C C RC
Sbjct: 246 ISTGEEITMSYTSLFWDTTLRRQFLNVTKNFSCMCKRC 283
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
+ G V G++ + NH+C PNA + +N T +VR D+P+G EI +
Sbjct: 215 RSGGHEVLLRGLFPLTAIMNHECTPNASHY------FDNGTLAVVRAARDIPKGGEITTT 268
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y + + TR L +F CDC RC
Sbjct: 269 YTKILWGNLTRGIFLKMTKNFNCDCVRC 296
>gi|336364311|gb|EGN92671.1| hypothetical protein SERLA73DRAFT_116951 [Serpula lacrymans var.
lacrymans S7.3]
gi|336378310|gb|EGO19468.1| hypothetical protein SERLADRAFT_478912 [Serpula lacrymans var.
lacrymans S7.9]
Length = 302
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREI 250
+G G R+ A G++L S FN C+PN R+D EN + R + D+ G E+
Sbjct: 93 DGLSGHRASTA-GVFLCGSRFNSSCVPNVNNRWD------ENKQAVEFRALKDIVSGEEL 145
Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
C+SY + R+K L + F C C+ C +
Sbjct: 146 CISYGSLLETGEMRRKELAAKFKFHCRCETCSL 178
>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 440
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/287 (19%), Positives = 109/287 (37%), Gaps = 33/287 (11%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
S + ++ +AE GR ++T+++K G+I+LRD L+ + N+ C C+
Sbjct: 39 SKICKSFKLAENPSLGRHYIATRNIKVGEIILRDDQPLITGLM--YNTVPVCLQCYTVLN 96
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
+ C C C C+ S + + P + Q
Sbjct: 97 QEIAIPCEKCGWPLCQ-NCNEHGLECKFSSSRRDSKISITEFGYPHPS-------YQCIN 148
Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA-----AHYLSSLCPPPASTTIELTAALL 176
+I A +L NP ++ L++ + + + S AH++ + E+ +
Sbjct: 149 VIRALSLKDTNPESYKKLISLESHCNEMNNSKEPLNIAHFIKRFFKADDISEEEIATII- 207
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
G+++ + G + + Y +Y AS H+C N C + D
Sbjct: 208 --------GILQ--VNGHEVPLTDSPYVAVYEMASLIEHNCRAN-CSKSFTDMGG----- 251
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+I+R + +G I + Y + + R+ L F C C+RC+
Sbjct: 252 LIIRAALPITKGDHISICYTDPLWGTANRRHHLFKTKFFECICNRCQ 298
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 105/279 (37%), Gaps = 43/279 (15%)
Query: 20 LVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPK 79
L ++K G ++L + P L S C C K S C V +C+
Sbjct: 4 LKHINAVKTGDLILSEEPFAY--VLSSKEKGSRCDFCLEKGKVLKCSGCQF--VHYCNRS 59
Query: 80 CSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAAYNLAVVNPSQFQ 137
C A + C L R+ PDAA L L R L R AY A P+ F+
Sbjct: 60 CQKDAWEDH--KWECANLKRIAPKTIPDAARL-LARILNRLQRGNGGAYK-AFYTPTSFR 115
Query: 138 I---LLAFQGTVTDNDTSAAH----------YLSSLCPPPASTTIELTAALLAKDRLNAF 184
+ L++ + + H YL + P + + L ++ +N+F
Sbjct: 116 VWKDLMSHYSDLKSDKKRMDHFSTLSMVLFEYLKDISLPNTADLMGLYGRMV----INSF 171
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
++ D + GIYL +S +H C PNA A + I +R + D+
Sbjct: 172 TIL-------DIEMNSIGTGIYLASSVIDHSCNPNA-------VAVFDGKTINIRALKDM 217
Query: 245 P--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
++I +SY + RQ L Y F C CDRC
Sbjct: 218 NCLDWKKIRISYIDLMKTPYERQMELRQSYYFLCQCDRC 256
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 242 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 295
Query: 264 RQKRLLYDYDFACDCDRC 281
R L FACDC RC
Sbjct: 296 RNIFLKMTKHFACDCVRC 313
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 242 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 295
Query: 264 RQKRLLYDYDFACDCDRC 281
R L FACDC RC
Sbjct: 296 RNIFLKMTKHFACDCVRC 313
>gi|448532414|ref|XP_003870424.1| Set5 protein [Candida orthopsilosis Co 90-125]
gi|380354779|emb|CCG24294.1| Set5 protein [Candida orthopsilosis]
Length = 499
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
++L S NH+C PN VD A++ + + V D+ +G E+ +Y ++ + R
Sbjct: 360 VFLTQSHLNHNCDPNTT----VDTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTLNQR 415
Query: 265 QKRLLYDYDFACDCDRCKVEA 285
Q+ L ++ F C C++CK +A
Sbjct: 416 QRELRVNWGFICACEKCKSDA 436
>gi|159485956|ref|XP_001701010.1| hypothetical protein CHLREDRAFT_142740 [Chlamydomonas reinhardtii]
gi|158281509|gb|EDP07264.1| predicted protein, partial [Chlamydomonas reinhardtii]
Length = 486
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 8/84 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ A++FNH CL N R+ D + VR D+P G E+ ++Y +
Sbjct: 394 GLWTLAAYFNHACLDNTERYFLGD-------FLFVRASRDLPAGAEVTITYLNPLMSFRE 446
Query: 264 RQKRLLYDYDFACDCDRCKVEANW 287
R R L FAC C+ C E W
Sbjct: 447 R-ARALEKRGFACGCELCSGEIEW 469
>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
Length = 660
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 121/324 (37%), Gaps = 49/324 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR ++ +K G+ +L + P + S L + + C NCF +T+
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
+ CP C+ V +CS +C A+ H Y C + + ++N+ A A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351
Query: 115 RQLQARFLI----------------------AAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
L+ + I + PS F + +T+ +
Sbjct: 352 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTNCLRA 411
Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
++ S P S L L + N + E + GR G IY +
Sbjct: 412 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
FNH C P R+ T I + + + G I +Y P+ + S RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 524
Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
Y F C CD C NW D+
Sbjct: 525 KDLYWFECSCDACI--DNWPKFDD 546
>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
Length = 73
Score = 52.0 bits (123), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ A NH C PN+ A N T+I ++ + ++P G E+ +SY + T
Sbjct: 3 GIFPNAVCLNHSCAPNS-------VAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRET 55
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ+ L + F C C RC
Sbjct: 56 RQEELQTQFCFYCQCHRC 73
>gi|167342096|gb|ABZ68996.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
+RS G++L SF NH CLPN+ R E + + + + +G EI YF
Sbjct: 5 QRSQDFSGLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFD 57
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ RQ R ++ F C C RC VE
Sbjct: 58 ILLPLPQRQGR-CENWGFECKCRRCIVE 84
>gi|440790098|gb|ELR11386.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 307
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 210 SFFNHDCLPNACRFDYVDAAAENNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
S NH C PN A A ++ + + DV E+C+SY P+++ +TRQ L
Sbjct: 222 SSLNHSCQPNVKSTLRSQAQARGDSCRVSLIAKQDVEPNEELCISYVPLSWTRATRQTTL 281
Query: 269 LYDYDFACDCDRCKVEA 285
L +Y F C C+RC EA
Sbjct: 282 LKNYFFHCQCERCVSEA 298
>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
Length = 392
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y+ S NH C+PNA F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYILQSKVNHSCVPNAQSTFPY------SNDIVVLKAVAPIQKGEEICISYLDECQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKILRENYIFVCQCSKCRAQAS 368
>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
ND90Pr]
Length = 529
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 68/325 (20%), Positives = 123/325 (37%), Gaps = 48/325 (14%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS-------- 65
E G GL + Q AG+ + R L+ S L C NC+ T +S+
Sbjct: 29 ESVGNGLFANQRFGAGEEIARFKRPLVGS-LETERLLDTCANCYLWTEGASAGTRLYVPE 87
Query: 66 ----SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
S C +C +CS C A + H + C+ L + P A L +
Sbjct: 88 GAEVSKCAACQRFRYCSKACQKEAWNRGHR-HECKILKPMAGRGLPKAF-------LASM 139
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELT 172
L+ ++ ++++L + D N A + P +
Sbjct: 140 ELLTRRKHGLIPDKEWELLCQLPSHIDDFKRNGTYQNIEMMAMGAAQFALPQNIFDKDFV 199
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAA 230
AA+ + NA L+ P ++ GI + + NH C PNA + +D
Sbjct: 200 AAMYGRVMSNALTLITPTLD---------PLGIIVDPTLCQMNHSCDPNA--YIMMDGPL 248
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
I +R + + + +EI +SY + Y RQ L + F C C +C+ +A ++
Sbjct: 249 -----ISIRTLRPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKCQDKATHQED 303
Query: 291 DNDDENNEEVMDEDQDEQMVASDDD 315
+ N + + ++ + +AS+ +
Sbjct: 304 NWLVPANSDFVASKEELEALASNQE 328
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
DGR ++ G++ + NH+C PNA Y D N IVR D+P+G EI +Y
Sbjct: 559 DGRETL-LRGLFPLTAIMNHECTPNAS--HYFD----NGRLAIVRAARDIPKGGEITTTY 611
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + TR L F CDC RC
Sbjct: 612 TKILWGNLTRGIFLKMTKHFMCDCTRC 638
>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
Length = 544
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/283 (21%), Positives = 117/283 (41%), Gaps = 36/283 (12%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
+ E+ G GR L +T+++KAG ++++++P++ P + C C+ ++ C
Sbjct: 48 IKELPGIGRYLEATRTIKAGDVIMKEAPLI---TGPAQITPPVCLGCYVLLEEGKTTRCD 104
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPDAANLPLDRQLQARFLIAA 125
C FCS KC+ ++ Y Q ++ + P+ + + R L R
Sbjct: 105 KCGWPFCSEKCTHSSVHEPECFYTQQRGEKVSISTYGIPHPNYQCVTVLRCLYQR----D 160
Query: 126 YNLAVVNPSQFQILLAF----QGT-VTDNDTS-AAHYLSSLCPPPASTTIELTAALLAKD 179
+N+ + N + Q L +GT +ND A ++ S +L ++
Sbjct: 161 HNMKLWN--KLQALETHTEDRKGTDKWENDRKMVAQFIWSF--------FKLERLFNEEE 210
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
+ G+++ I G + Y ++ K S H+C N + + +N +II+
Sbjct: 211 IMKCCGIIQ--INGHEVPLLEPEYVAVFDKISMVEHNCRANC------NKSFTSNGEIIL 262
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+P+G I + Y + R+ L F C C+RC
Sbjct: 263 SAGVAIPRGSHISVCYTDPLWGTEARRHHLSDSKFFECSCERC 305
>gi|149244448|ref|XP_001526767.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449161|gb|EDK43417.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 381
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 59/131 (45%), Gaps = 18/131 (13%)
Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR--RSVRAYGIYLKASFFNH 214
L PP +I + ++ NAFGL + E QD + + +Y ASFFNH
Sbjct: 261 LQPFITPPTVRSI------IGRNLSNAFGL---WSETQDSNEDKEFLGFAVYPSASFFNH 311
Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
C PN + +N + + ++ G E+C++Y + +K+ L ++ F
Sbjct: 312 SCEPNIKKI-----RVKNELKFVT--LREIAPGEELCINYGNFQNENVKERKKQLSEWFF 364
Query: 275 ACDCDRCKVEA 285
C C +C+ ++
Sbjct: 365 DCRCTKCEADS 375
>gi|371943475|gb|AEX61304.1| putative SET domain-containing protein [Megavirus courdo7]
Length = 157
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
I FNH CLPN + N ++ + D+ +G E+ SY + +D T
Sbjct: 62 AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 114
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ RL Y F CDC RC
Sbjct: 115 RQSRLWNQYRFHCDCQRC 132
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
DGR ++ G++ + NH+C PNA + +N IVR D+P+G EI +Y
Sbjct: 616 DGRETL-LRGLFPLTAIMNHECTPNASHY------FDNGRLAIVRAARDIPKGGEITTTY 668
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + TR L F CDC RC
Sbjct: 669 TKILWGNLTRGIFLKMTKHFMCDCTRC 695
>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
Length = 567
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 121/324 (37%), Gaps = 49/324 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR ++ +K G+ +L + P + S L + + C NCF +T+
Sbjct: 144 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 201
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
+ CP C+ V +CS +C A+ H Y C + + ++N+ A A+ PLD
Sbjct: 202 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 258
Query: 115 RQLQARFLI----------------------AAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
L+ + I + PS F + +T+ +
Sbjct: 259 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTNCLRA 318
Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
++ S P S L L + N + E + GR G IY +
Sbjct: 319 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 378
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DYSTRQKRL 268
FNH C P R+ T I + + + G I +Y P+ + S RQ RL
Sbjct: 379 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 431
Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
Y F C CD C NW D+
Sbjct: 432 KDLYWFECSCDACI--DNWPKFDD 453
>gi|357603991|gb|EHJ63998.1| hypothetical protein KGM_04190 [Danaus plexippus]
Length = 564
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 101/276 (36%), Gaps = 33/276 (11%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR +V+T+ ++ G++++ +P L+ P C C+R+ S + C C C
Sbjct: 90 GRYIVATRDIEPGEVII-TAPALVVG--PCSGCGLICLGCYRELEESHLTKCSGCKWPLC 146
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQARFLIAAYNLAVVNPSQ 135
S C + HS Y C+ L ++ PD N+ L A + L +PS+
Sbjct: 147 SNSCFGLGKYTGHSKYECETLKKI----PPDYKNMEDLRDSYHAITPLRCLLLKKEDPSK 202
Query: 136 FQILL------AFQGTVTD----NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
+ L A + + D ND + L + +L +NAF
Sbjct: 203 WCALSEMESHNAIRRSRGDIYPMNDRNVVQRLKKWGLGYDDDEVHTVCGIL---EVNAF- 258
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ G A +Y +A HDC P+ D+ T I I VP
Sbjct: 259 --------EVGASGANARALYSEAYLLAHDCTPSTSHTDHERRPGRPLT--IRAAIRHVP 308
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
G I L Y R++ L F C C RC
Sbjct: 309 -GDVISLCYAYTLQGTLKRREHLAVSKFFMCSCVRC 343
>gi|448124691|ref|XP_004204990.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
gi|358249623|emb|CCE72689.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
Length = 484
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 64/332 (19%), Positives = 116/332 (34%), Gaps = 72/332 (21%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--------SFCHNCF 57
+ + V + +G+GL + +++K G + + P L+ P N + S+C
Sbjct: 108 DKIKVLQTSKKGKGLYAKKNVKKGDFLWEEKP--LFYVPPLANMTLIKNGKACSYCGKLL 165
Query: 58 RKTMHSSSSI-------CPSCSVAFCSPKC----STAAASSSHSPYVCQALTRLLNLNSP 106
+ +S + C CS +CS +C S AS H N+ P
Sbjct: 166 TQGTRGNSGVSYLRGLDCDVCSEIWCSKECKNTDSKIHASLKH------------NVYHP 213
Query: 107 DAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
+N + A + Y L + + I + V+D S++
Sbjct: 214 SKSNTRKNINANAYLALQDYCLKEQWNALYSITRIYADIVSDKSGVKERQFSAMARISQE 273
Query: 167 TT--------------------IELTAALLAKDRLNAFGLMEP--------------YIE 192
T ++ L K+ + F + P Y+
Sbjct: 274 TRYKALNSSAGSFDNFQGGALFVQEQQESLWKEGFSLFSNVFPNSVDEGKINYDEFMYML 333
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
G ++ + I+L S NH+C PN V ++ + V D+ G E+
Sbjct: 334 GTFNINNLDSV-IFLIQSHLNHNCEPNTT----VKLSSNRTEGLKVYAARDIKAGEELTT 388
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
SY ++ RQ+ L ++ F C C++CK E
Sbjct: 389 SYVNPSHTVQQRQRELRVNWGFRCACEKCKSE 420
>gi|302686770|ref|XP_003033065.1| hypothetical protein SCHCODRAFT_107558 [Schizophyllum commune H4-8]
gi|300106759|gb|EFI98162.1| hypothetical protein SCHCODRAFT_107558, partial [Schizophyllum
commune H4-8]
Length = 422
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
L+ + L G P EG DGR +Y S NH C PNA F + + +
Sbjct: 225 LIRANGLTVNGYTLPETEGADGR----YVAVYETLSRANHSCRPNA-HFAF----HKPSF 275
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +R + D+ G EI +SY P Y+ RQ+ L + Y +C C C
Sbjct: 276 SVRLRALRDIKAGEEILISYVPPEAPYAQRQEELAH-YGLSCACGVC 321
>gi|384247293|gb|EIE20780.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 453
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 50/111 (45%), Gaps = 11/111 (9%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
L+ + L++ G E Y G +G Y S NH C PNA F + +
Sbjct: 346 LITRPLLDSLG--ESYGAGCEGS------AFYALQSCCNHSCAPNAHAFKRAHIDTDGSA 397
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
I+ R + G E+CLSY + Y R+ L DY F C+CD+C EA
Sbjct: 398 TILARK--PIAPGEEVCLSYIDEDAPYHDRRAALA-DYGFTCECDKCVSEA 445
>gi|261330740|emb|CBH13725.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 372
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 107/285 (37%), Gaps = 43/285 (15%)
Query: 17 GRGLVSTQSLKAGQIVL-RDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC--PSCSV 73
GRG+V+ ++LK G +V+ R P + Y + C C K + C P C
Sbjct: 113 GRGIVARENLKKGTVVIQRAEPWMRYPV-----ADGLCALCT-KPLPERFFTCTNPRCHE 166
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
+CS C T A S HS R + A L L Q+++ N A
Sbjct: 167 EYCSRDCRTIALSRYHS--------RTCRIEGLQAIELDLFSQMKSSEAATDKNSAAAQL 218
Query: 134 SQFQILLA-FQGTVTDNDTSAAHYLSS-LCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
++L A Q + + S LS L PA+ + R + PY
Sbjct: 219 LMLRVLAASLQQQIVPSALSEVRILSGRLLFSPAALAGKFLEIYERFTRACSTSTSIPYE 278
Query: 192 E--GQDGRRSVRAYG------IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
E G R + + + L S NH C N AE+ +R D
Sbjct: 279 EMIGVLARVTTNCFHRENCVELSLPRSLLNHSCDAN---------VAEDAETGEMRTTQD 329
Query: 244 VPQGREICLSYFP----VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
V +G E+ ++Y+P +NY +R+ L F C C RC+ E
Sbjct: 330 VARGTELTINYYPQLKELNYKSRSRE---LERRGFVCCCSRCRKE 371
>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
grubii H99]
Length = 447
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 23/119 (19%)
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAENNTDII 237
++NAF L P S+ G+ + + NH C PNA + + DI
Sbjct: 122 QVNAFTLTSP---------SLAPIGMSISPLLALANHSCEPNA-----IAVFPKGGRDIF 167
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-------EANWSD 289
+ ++D+P G EI SY + Y RQ LL Y F C C C+ E +W D
Sbjct: 168 LVALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVD 226
>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
Length = 763
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 7/81 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ S+ NH C N+ R D +IVR D+ +G EI ++Y
Sbjct: 517 GLWIMPSYINHSCWQNSTRSFLGDL-------LIVRAARDLSEGEEITINYMENESGVQK 569
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQK L ++ F C C C++E
Sbjct: 570 RQKAFLSEWGFECKCTMCEIE 590
>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
mellifera]
Length = 387
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 10/96 (10%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNY 259
G+Y+ S NH C+PNA F Y +N ++++ I D+ EIC+SY +
Sbjct: 289 GLYILQSSVNHSCVPNAIVEFPY------SNNVLVLKAIRDIHPEEEICISYLDECCLER 342
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
+RQK L Y F C C++C +AN D ++DE
Sbjct: 343 SRHSRQKALNSLYLFQCYCNKCLSQANDLDLTSEDE 378
>gi|339897286|ref|XP_003392344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399152|emb|CBZ08492.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 622
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 97/235 (41%), Gaps = 31/235 (13%)
Query: 58 RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ 116
+K HS+ + C V +CSP C+ A H + C +TRL ++ +A L + Q
Sbjct: 254 KKEKHSTCTTCDM--VVYCSPACADAH-REQHERFECARMTRLSDMMEGIEARKLDVPEQ 310
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTI---EL 171
F AY+ + + + A V + SL P P S E
Sbjct: 311 ----FFELAYH-CITTLAGIKAKRAGHERVLTLTAHVDEVIQSLHPIGPLMSDLFKGEEE 365
Query: 172 TAALLAKDRL---NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
TA L + NA + +P G +++ A ASFFNH C PN C D V
Sbjct: 366 TATLYTIIGVLCCNALEVTDP--SGLGVAQALHAGNSI--ASFFNHSCTPN-CAIDTVRH 420
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
A I R IH G E+ ++Y P + + R++RL Y F C C RC+
Sbjct: 421 AI-----ITTRTIH---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQ 467
>gi|302850072|ref|XP_002956564.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
nagariensis]
gi|300258091|gb|EFJ42331.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
nagariensis]
Length = 1124
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 7/73 (9%)
Query: 205 IYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+YL AS FNH C P F +DAAA D+ G E+C+SY V Y+
Sbjct: 497 VYLVASLFNHSCEPTLELSFPDLDAAA------CFVAARDIAPGEELCVSYLDVGLPYNV 550
Query: 264 RQKRLLYDYDFAC 276
RQ+ L + Y F C
Sbjct: 551 RQRHLKWSYGFVC 563
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 35/63 (55%), Gaps = 9/63 (14%)
Query: 1 MSAVSETVTVA-----EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCH 54
+ ET T+ ++ GRGRGLV+ S+ GQ+V R+ P++ A P + ++ C+
Sbjct: 186 FTVAQETATLGIQTNPQLPGRGRGLVAVGSIPRGQVVHREVPLV---AFPELGATHGVCY 242
Query: 55 NCF 57
+C
Sbjct: 243 HCL 245
>gi|189242482|ref|XP_001810915.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
Length = 538
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 117/301 (38%), Gaps = 44/301 (14%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ + + E GR +V+T++++ G+I+ + P Y+++ + S +CH C + +
Sbjct: 88 IEIMQNERYGRHVVATRNIEMGEILAIEKP---YASIVTDSVSVYCHECLKLCYNMIP-- 142
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
C C+ A +CS C A S H Y C L L + L L L +R A
Sbjct: 143 CDKCTKALYCSDNCKDKAYESYHK-YECPIHLSLDPLLIDSSKRLALRIALISRNEWAG- 200
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK-------- 178
+L +P + F+ + + L TTI L K
Sbjct: 201 SLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDL---FGRTTIACGVFYLIKKYTTFLQE 257
Query: 179 ---DRLNAFGLMEPYIEGQDGRR------SVRAYGI-------YLKASFFNHDCLPNACR 222
DR L I + R ++ Y + Y S FNH C PN CR
Sbjct: 258 YDEDRFKEILLSLLLICSTNTVRVNEVSSTLGEYDVCGFACSHYPFFSMFNHSCWPNVCR 317
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDR 280
+ + + +++R I + +G + ++Y P ++ + RQ L + Y F C C
Sbjct: 318 -------SYHGSQMVLRAIRTIKKGEQCFVTYGPSYLSDNIVGRQAFLFFHYFFNCGCKA 370
Query: 281 C 281
C
Sbjct: 371 C 371
>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
Length = 393
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y+ S NH C+PNA F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYILQSKINHSCVPNAQSTFPY------SNDIVVLKAVAPIEKGDEICISYLDECQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKTLRENYIFVCQCPKCRAQAS 368
>gi|255562926|ref|XP_002522468.1| protein with unknown function [Ricinus communis]
gi|223538353|gb|EEF39960.1| protein with unknown function [Ricinus communis]
Length = 538
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 41/96 (42%), Gaps = 12/96 (12%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
+G+++ ASF NH C PNA R D ++V D+ G EI YF V
Sbjct: 326 GFGLWVLASFINHSCNPNARRLHVGD-------HVLVHASRDIKAGEEITFPYFDV---L 375
Query: 262 STRQKRLLYD--YDFACDCDRCKVEANWSDNDNDDE 295
S KR+ + F C C RCK E D E
Sbjct: 376 SPLYKRMAMSKAWGFRCQCKRCKFEEQVCSKDEVKE 411
>gi|408398201|gb|EKJ77334.1| hypothetical protein FPSE_02412 [Fusarium pseudograminearum CS3096]
Length = 543
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 54/102 (52%), Gaps = 6/102 (5%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L+AS NH C NA + DY + + V + D+ +G EI ++Y + + T
Sbjct: 104 GVFLQASRINHACDNNAQK-DYNEGIKRHT----VHALRDIEEGEEITITYLGILKNRRT 158
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQ 305
RQ+ L + F C C+ C + + SD ++D +E + ED+
Sbjct: 159 RQQALRTKFMFTCTCNLCSLPEDLSD-ESDRRLDEILALEDR 199
>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
Length = 816
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 127/332 (38%), Gaps = 87/332 (26%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV-A 74
+GRG+ S + ++ + P+ + P + CH CF + + CPSC++
Sbjct: 250 KGRGMTSPNDISPASLIHAEDPLAVIIMKPC--RDTHCHYCFSEA-PADVVFCPSCTIPI 306
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARFLI 123
+CS +C A C TRL N N D A L P +Q+ + R
Sbjct: 307 YCSNRCQEKAIGQ----MSCNQNTRLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRHEC 362
Query: 124 AAYNLAVVNPSQF-------------QILLAFQGTVT----------DNDTSAAHYLSS- 159
+ A V P+ Q+L + T++ D D+SA+ + S
Sbjct: 363 GGVHWAAVLPADIVLAGRIMAQYIEKQLLAGKRSTISGPNLDLVHHYDQDSSASKFESHI 422
Query: 160 ------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR------- 198
LC A ++ L+ + ++N+ ++ +++ DG +
Sbjct: 423 YATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGGKVLTKGFS 480
Query: 199 -----------SVR-AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
VR A IY+ SFFNH C PN + A + T +I+R +
Sbjct: 481 GFSGDVMCSVEQVRVAQAIYMSGSFFNHSCQPN------IHAYFHSRT-LILRSTEYIKA 533
Query: 247 GREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
G I LSY P D RQK L +Y F+C
Sbjct: 534 GSPIELSYGPQVGEMDLPERQKSLRENYYFSC 565
>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
florea]
Length = 387
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
G+Y+ S NH C+PNA V+ NN ++++ I D+ EIC+SY +
Sbjct: 289 GLYILQSSVNHSCVPNA----IVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
+RQK L Y F C C++C +AN D ++DE
Sbjct: 344 RHSRQKALNSLYLFQCYCNKCLSQANDLDLTSEDE 378
>gi|223634690|sp|A5DQN2.2|SET5_PICGU RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|190348931|gb|EDK41485.2| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
I+L S NH C PN D + A + V D+ G E+ SY ++ R
Sbjct: 342 IFLTQSHLNHSCHPNT---DVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQR 398
Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
Q+ L ++ F C C RCK EA
Sbjct: 399 QRELRVNWGFICSCQRCKDEAK 420
>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
vitripennis]
Length = 386
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y+ S NH C PNA F + D+ ++VR + D+ EIC++Y +
Sbjct: 290 GLYVLQSAINHSCAPNASVEFPHSDSR------LVVRALRDIKPDEEICIAYLDECHLER 343
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L Y F C CD+C+ +A+
Sbjct: 344 SRHSRQKALSSLYLFVCKCDKCQQQAS 370
>gi|307102606|gb|EFN50876.1| hypothetical protein CHLNCDRAFT_141673 [Chlorella variabilis]
Length = 510
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 5/112 (4%)
Query: 174 ALLAKDRLNAFGLMEPYIEG-QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
AL A ++ AF P+++ D Y S FNH C PNA F + +
Sbjct: 394 ALPAGEQAAAFQAAGPFVDALPDELPGCEGNAFYALHSCFNHSCDPNAEAFKRDE---DE 450
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ ++ + D+ G E+ LSY R+++L DY F C CD+C+ E
Sbjct: 451 DGGAVILALRDIQAGEEVTLSYIDEEAPLEERRQQL-ADYGFRCACDKCQAE 501
>gi|146413184|ref|XP_001482563.1| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
I+L S NH C PN D + A + V D+ G E+ SY ++ R
Sbjct: 342 IFLTQSHLNHSCHPNT---DVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQR 398
Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
Q+ L ++ F C C RCK EA
Sbjct: 399 QRELRVNWGFICSCQRCKDEAK 420
>gi|157117568|ref|XP_001658830.1| hypothetical protein AaeL_AAEL008032 [Aedes aegypti]
gi|108876001|gb|EAT40226.1| AAEL008032-PA [Aedes aegypti]
Length = 527
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 111/280 (39%), Gaps = 35/280 (12%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSICP-S 70
EG GRG+V+T+ + +++ D PIL+ P +N+ FC +C R + ++C
Sbjct: 73 EGSGRGMVATRDIAENELIFMDRPILVG---PRVNNYDVIFCASCCR--LQKKLTLCSGG 127
Query: 71 CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
C + CS +AA +SH+ A R +N P N L A I + L
Sbjct: 128 CRLPVCSECDISAAPETSHT-----AECRRINSWEPKDQNRYCKTILYALTSIRGFLLDD 182
Query: 131 VNPSQFQILLAFQGTVTDNDTSAA-------HYLSSLCPPPASTTIELTAALLAKDRLNA 183
++ S I+L Q D + + L P + ++ ++ NA
Sbjct: 183 LSRS---IVLNMQAHPPRKDMTTEVDRILKDQVFNKL--PEDGSDVQYLRKVIDVLNTNA 237
Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
F + + + G+Y+ + NH C PN R+ + + +++
Sbjct: 238 FETFRMVKDEDNNDHEIVLRGLYILGALMNHCCRPNI-RYVF-------DEKLVMHCYAS 289
Query: 244 VP--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
P +G +I +Y + R L + F C CDRC
Sbjct: 290 KPIRKGEQIFNNYSKSLWGTQHRIIHLCFSKHFLCGCDRC 329
>gi|167342074|gb|ABZ68985.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFNILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
Length = 302
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + EN +VR D+P+G EI +Y + + T
Sbjct: 85 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 138
Query: 264 RQKRLLYDYDFACDCDRC 281
R L FACDC RC
Sbjct: 139 RNIFLKMTKHFACDCVRC 156
>gi|66828109|ref|XP_647409.1| hypothetical protein DDB_G0268386 [Dictyostelium discoideum AX4]
gi|60475711|gb|EAL73646.1| hypothetical protein DDB_G0268386 [Dictyostelium discoideum AX4]
Length = 676
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/305 (22%), Positives = 125/305 (40%), Gaps = 40/305 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VA 74
+GR L+S + +K G+ +LR +P ++ S FC C++K + C C A
Sbjct: 175 KGRYLISKRLIKRGEEILRIAPFSFSLNGKYV--SKFCGVCYQKKSIDNLLKCKDCDDFA 232
Query: 75 FCSP-KCSTAAASSSHSPYVCQALTRLLN----LNSPDAAN-------LPLDRQLQARFL 122
C K S H +C L +L N +N+ + AN L +R+L +
Sbjct: 233 ICKECKAPNEPLSKYHPNDLC-VLYKLNNEYYKMNNFEIANEDNKERGLTDERRLYINTI 291
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTD---------NDTSAAHYLSSLCPPPASTTIELT- 172
+ N + ++ A +V D N T+ + S + +EL
Sbjct: 292 YNIFKNKTNNKAVVEVDNAISSSVIDVLNLSKKLLNHTNESQRAYSDSTKMKTDILELIF 351
Query: 173 -----AALLAKDRL-NAFGLMEPYIEGQDG--RRSVRAYGIYLKASFFNHDCLPNACRFD 224
A L KD + N +++ G + +A G+Y + + NH C+PN+
Sbjct: 352 KSFYGANTLKKDEISNIIRIVDCNSHGYSDVISQKKKASGLYPFSCYTNHSCIPNS---- 407
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
D + + +++ D+ + I +SY R+K LL ++F C CD+C +
Sbjct: 408 --DYFIDKHGVMVLYASSDIQESEAITISYIYFLNRVEKRRKDLLNGHNFFCTCDQCSFQ 465
Query: 285 ANWSD 289
++ ++
Sbjct: 466 SSLTE 470
>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 576
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G++ S NH C PNA VD + T + + D+ Q ++ LSY
Sbjct: 420 GIGLFRAVSMTNHSCWPNA----EVDYPSLTKTAQVT-ALRDIAQDEQVLLSYIDETLPL 474
Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
++RQ+ L Y F C C RC+VEA
Sbjct: 475 ASRQRLLKRHYKFTCACVRCQVEA 498
>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 691
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 8/104 (7%)
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
+NA G++ + D + ++ A GIY AS NH C PN + Y+D +IVR
Sbjct: 387 VNAAGIIHSNVIEDDVQINIVATGIYPSASMMNHSCNPNIIKI-YMDQY------LIVRA 439
Query: 241 IHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKV 283
+ D+ EI SY Y + +++LL Y F+C C+ C V
Sbjct: 440 VEDIFPTEEIFNSYVATYRYKKTKARQKLLELYYFSCKCEACTV 483
>gi|242016107|ref|XP_002428677.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212513348|gb|EEB15939.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 697
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 126/319 (39%), Gaps = 64/319 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
GR + ++ G ++ + P LP +NSS CH+CF + + C +CS VAF
Sbjct: 274 GRFAAAANKIEIGDTIVVEEP-QTSVLLPKMNSSH-CHHCFTRLF--APIPCKTCSGVAF 329
Query: 76 CSPKCSTAAASSSHSPYVCQ----------ALTRLLNLNSPDAANLPLDRQLQARF---- 121
CS +C +A+++ H Y CQ +L L+L L +QL ++
Sbjct: 330 CSVQCRDSASNTYHR-YECQLLGFFMGSGMSLISYLSLRMVTQEGLIFFKQLHSKLKQMG 388
Query: 122 ---------------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
+ YNL N F TV L++ P +
Sbjct: 389 KCPEDSCDLTKREKNYLNVYNLT-TNCKLRTSKDFFTRTVMAIFLVKCLKLTNYFESPEN 447
Query: 167 TTIE-------LTAALLAKD----RLNAFGLMEPYIE----GQDGRRSVRAYGIYLKASF 211
T + LTA+++ + NA + E +E +D + A G+Y +
Sbjct: 448 KTTDDFNEDEILTASIILHHLELLQFNAHEIYETRVEELYKTEDSKTVFIAIGVYPTVAL 507
Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRL 268
FNH+C P+ R+ + +II++ + + + +Y P +Y T R + L
Sbjct: 508 FNHECSPSVTRYF-------SGKNIIIKAVRPLATNDILSDNYGP-HYGKKTLLERTREL 559
Query: 269 LYDYDFACDCDRCKVEANW 287
Y F C C C +E NW
Sbjct: 560 TSRYWFRCRCQAC-IE-NW 576
>gi|383142259|gb|AFG52489.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
gi|383142264|gb|AFG52494.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
Length = 155
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 14 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 67 RQGR-CENWGFECKCRRCIVE 86
>gi|189239848|ref|XP_973542.2| PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
Length = 829
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/297 (19%), Positives = 115/297 (38%), Gaps = 27/297 (9%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
+V + + E GR LV+T++LK G++V++++P++ P NS C C +
Sbjct: 40 KSVCQPFKICEDAVLGRHLVATRALKPGEVVIQEAPLIWG---PSYNSIPVCLGCGKAVD 96
Query: 62 HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQ----ALTRLLNLNSPDAANLPLDRQL 117
+ S C C CS C + + Y Q ++ + P + + R L
Sbjct: 97 ETCSRPCQKCGWPVCSDLCEKSPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCL 156
Query: 118 -QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAAL 175
Q +FL P ++ L Q + ++ ++ + + +L+A
Sbjct: 157 YQKQFL----------PKNWKKLETLQNHLEEHKSTTNYESDRVTIAEFIRRFFKLSATF 206
Query: 176 LAKDRLNAFGLMEPY-IEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENN 233
+D + G + + G + + + IY AS H C PN + +
Sbjct: 207 SEEDIMKVHGSTLIFQVNGHEVPLTEPFHVAIYSSASMLEHSCGPNCTK------SFTKQ 260
Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
I++ + +G + + Y + +R+ L F C C+RC+ + + N
Sbjct: 261 GHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPSEFGTN 317
>gi|167342090|gb|ABZ68993.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|167342070|gb|ABZ68983.1| SET domain-containing protein C [Pinus taeda]
gi|167342072|gb|ABZ68984.1| SET domain-containing protein C [Pinus taeda]
gi|167342076|gb|ABZ68986.1| SET domain-containing protein C [Pinus taeda]
gi|167342078|gb|ABZ68987.1| SET domain-containing protein C [Pinus taeda]
gi|167342080|gb|ABZ68988.1| SET domain-containing protein C [Pinus taeda]
gi|167342082|gb|ABZ68989.1| SET domain-containing protein C [Pinus taeda]
gi|167342084|gb|ABZ68990.1| SET domain-containing protein C [Pinus taeda]
gi|167342100|gb|ABZ68998.1| SET domain-containing protein C [Pinus taeda]
gi|167342106|gb|ABZ69001.1| SET domain-containing protein C [Pinus taeda]
gi|167342108|gb|ABZ69002.1| SET domain-containing protein C [Pinus taeda]
gi|167342110|gb|ABZ69003.1| SET domain-containing protein C [Pinus taeda]
gi|167342112|gb|ABZ69004.1| SET domain-containing protein C [Pinus taeda]
gi|167342114|gb|ABZ69005.1| SET domain-containing protein C [Pinus taeda]
gi|167342118|gb|ABZ69007.1| SET domain-containing protein C [Pinus taeda]
gi|167342120|gb|ABZ69008.1| SET domain-containing protein C [Pinus taeda]
gi|167342122|gb|ABZ69009.1| SET domain-containing protein C [Pinus taeda]
gi|167342126|gb|ABZ69011.1| SET domain-containing protein C [Pinus taeda]
gi|167342130|gb|ABZ69013.1| SET domain-containing protein C [Pinus taeda]
gi|167342132|gb|ABZ69014.1| SET domain-containing protein C [Pinus taeda]
gi|167342144|gb|ABZ69020.1| SET domain-containing protein C [Pinus elliottii]
Length = 141
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
Length = 388
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 14/119 (11%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T E+ L ++D N+FG+ +G + YG + AS+FNH C PN +
Sbjct: 280 TSEMIFTLSSRDSHNSFGVRSLEDDGSE----FFGYGCWPAASYFNHSCHPNVEK----- 330
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRCKV 283
+ R + +G E+C++Y +R KR+L + F C C+RC+
Sbjct: 331 --QRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKRMLRLKKTWGFDCSCERCEA 387
>gi|344231392|gb|EGV63274.1| hypothetical protein CANTEDRAFT_105840 [Candida tenuis ATCC 10573]
Length = 478
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 71/326 (21%), Positives = 108/326 (33%), Gaps = 69/326 (21%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFR--KT 60
E V V RG+GL + ++ G ++ + P + L I S C C + +
Sbjct: 107 EKVKVIMTTKRGKGLYAKSNIAKGTLIWEEKPFFFIPPLANLKLIKSGKACTYCGKLLTS 166
Query: 61 MHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
SSS + C C +CS C + A + L+ A N
Sbjct: 167 ARSSSGVSMLRGLDCTGCPELWCSMSCKKMNGT-------LHASLKHLSKGRKSALNPEA 219
Query: 114 DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--- 170
L+ + +N + F I L F V D Y+ S+ +T +
Sbjct: 220 FLDLEEYCIKEQWN------ALFAITLMFADIVEDKSGEKLKYVKSMARVSQATRYKALE 273
Query: 171 --------LTAALL---------AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA---- 209
LT L K+ + F + P DG Y L +
Sbjct: 274 SSAGSFDSLTGGALFVKEQQENLWKEGFSKFSKVFPKATS-DGTVDYNEYLTLLGSYNIN 332
Query: 210 ----------SFFNHDCLPNACRFDYVDAAAENNT---DIIVRMIHDVPQGREICLSYFP 256
S NH C PN VD N+ I V D+ G E+ +Y
Sbjct: 333 NLDSCVFRIQSHLNHTCNPN------VDVETSPNSRYEGIKVFAAKDIKAGEELSTTYVN 386
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCK 282
N+ RQ+ L ++ F C C++CK
Sbjct: 387 PNHTVLQRQRELRANWGFTCSCNKCK 412
>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
Length = 697
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 29/143 (20%)
Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
L G+ N T A + + LC AAL + N+FGL DG +
Sbjct: 528 LGEGGSTEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
+Y +AS+FNH C PN CR Y A + ++ +G + + Y V
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619
Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
+ ++R L+ Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642
>gi|384490543|gb|EIE81765.1| hypothetical protein RO3G_06470 [Rhizopus delemar RA 99-880]
Length = 400
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 119/284 (41%), Gaps = 37/284 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS----CS 72
G +V+ LK G+ +L + P + L N ++ C CF + +C +
Sbjct: 43 GERVVANAKLKQGRTILEEDPFI--RQLNDANRTTHCTYCFTE-FKKIKKLCRNKDCKWE 99
Query: 73 VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL---- 128
V +CS C + H + C R NL + D L + A + Y L
Sbjct: 100 VLYCSLNCQQQHWLTEHK-WFC----RFPNLANTDKNVLFALQGYIASRSKSEYTLPGLV 154
Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
+ ++ + + L ++ + +++ S +L+ IE ++A+ R N F + +
Sbjct: 155 SNIDSYKPEDLEEYRKKIGNSELSKVFHLTE-------GVIESMVVIMAQIRCNTFAVKQ 207
Query: 189 PYI----EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN--TDIIVRMIH 242
I E ++ IYL AS FNH C PNA + NN + + V I
Sbjct: 208 FVISKTLEIEERETITLGRAIYLTASKFNHSCNPNA-----LVLFGTNNVGSHLKVHFIG 262
Query: 243 D-VPQGREICLSYFPVNYDYSTRQKRLLY--DYDFACDCDRCKV 283
D + EI +SY PV+ +S +Q++ +Y F C+C C++
Sbjct: 263 DEAKKNAEINISYGPVSTRHSKKQRKKKLKDEYFFDCNCSSCQL 306
>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 677
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 73/323 (22%), Positives = 117/323 (36%), Gaps = 67/323 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-V 73
GR ++T+ ++ G+I+ + P L + IN C CF K ++C +CS V
Sbjct: 252 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLIN----CFYCFTKIFVPIPAVCQTCSCV 307
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--LIAAYNLAVV 131
A+CS C A + C L L + L L +Q F L ++
Sbjct: 308 AYCSISCRDKDAKIHENE--CSILPTLWASKTSINCFLALRIIVQQSFEKLYKLKDVKEN 365
Query: 132 NPSQFQI----------------LLAFQGTVTDNDTSAAHYLSSLC----------PPPA 165
+ +F++ L+ + T T D Y++S P
Sbjct: 366 SKDKFEVSASEPYRSNDFKIMFRLVTHEDTRTVEDLFHRTYIASWLLRLLKKGPYFPKHV 425
Query: 166 STTIELTAAL--------------LAKDRLNAFGLMEPYIEGQD-----GRRSVRAYGIY 206
T + A L L + NA + E I D + G+Y
Sbjct: 426 KTPDTIEAKLSDGELYIGGLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLY 485
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTR 264
S FNH C P R+ T ++VR I + G EI +Y + S R
Sbjct: 486 PTISLFNHSCNPGIIRYFI-------GTTMVVRAIRSISSGEEISENYGQIFTTTPESER 538
Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
+++L Y F C+C+ C+ +W
Sbjct: 539 KRKLRLQYFFDCNCEACR--EHW 559
>gi|167342142|gb|ABZ69019.1| SET domain-containing protein C [Pinus elliottii]
Length = 141
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 29/143 (20%)
Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
L G+ N T A + + LC AAL + N+FGL DG +
Sbjct: 528 LGEGGSTEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
+Y +AS+FNH C PN CR Y A + ++ +G + + Y V
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619
Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
+ ++R L+ Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642
>gi|425701087|gb|AFX92249.1| putative set domain-containing protein [Megavirus courdo11]
Length = 157
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
I FNH CLPN + D I VR D+ +G E+ SY + +D T
Sbjct: 62 AILFNGRIFNHSCLPNIVFYRCNDVMCF----ITVR---DIYKGEELLDSYVNITHDKKT 114
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ RL Y F CDC RC
Sbjct: 115 RQSRLWNQYRFHCDCQRC 132
>gi|294656447|ref|XP_458717.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
gi|218511783|sp|Q6BSV3.2|SET5_DEBHA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|199431481|emb|CAG86859.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
Length = 493
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 69/318 (21%), Positives = 114/318 (35%), Gaps = 47/318 (14%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRKTMH 62
E V + RG+GL + + G ++ +S + + L I C C +
Sbjct: 116 EKVHIVMTAKRGKGLYAKNKISKGDLIWEESQLFYIAPLANVNLIKGGKACSYCGKLLTQ 175
Query: 63 SSSSI---------CPSCSVAFCSPKCSTAAAS-------SSHSPYVCQALTRLLNLNS- 105
SSS+ C CS +CS C + + ++P +L+N N+
Sbjct: 176 SSSNAGISVLKGLDCNVCSEIWCSQTCKKIDSQLHGMLKHNLYNPNSSGGSKKLINANAY 235
Query: 106 PDAANLPLDRQLQARFLIAAYNLAVV------NPSQFQILLAFQGTVTDND-TSAAHYLS 158
+ + L Q A + I ++ QF+ + V S+A
Sbjct: 236 LELQDYCLKEQWNALYAITIIYANIITDKSGIKGKQFRAMARVSQDVRYKALNSSAGAFD 295
Query: 159 SLCPPPASTTIELTAALLAKDRLNAFGLMEPY------IEGQDGRRSVRAYGI------- 205
+L + + L KD N F + P I Q+ + Y I
Sbjct: 296 NL--QGGALFVGEQQEKLWKDGYNRFKNVFPKSVDNGDISYQEFMYMMGTYNINNLDSCI 353
Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYSTR 264
YL S NH+C PN V+ + T+ + V D+ G E+ +Y Y R
Sbjct: 354 YLTQSHLNHNCDPNT----NVETSTTVRTNGLKVFAARDIRAGEELTTTYVNPAYTVQQR 409
Query: 265 QKRLLYDYDFACDCDRCK 282
Q+ L ++ F C C +CK
Sbjct: 410 QRELRVNWGFMCGCQKCK 427
>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
Length = 387
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
+Y S NH C PNA C+F + +N + + + D+ G EIC+SY +
Sbjct: 287 ALYSLQSKVNHSCAPNAECQFPH------SNNVLALTTVQDIAAGEEICISYLDECALER 340
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L +Y F C C++C+ + N
Sbjct: 341 SRHSRQKMLRENYLFQCQCEKCESQVN 367
>gi|383142258|gb|AFG52488.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
gi|383142260|gb|AFG52490.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
gi|383142261|gb|AFG52491.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
gi|383142262|gb|AFG52492.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
gi|383142263|gb|AFG52493.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
Length = 155
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 14 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 67 RQGR-CENWGFECKCRRCIVE 86
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 112/297 (37%), Gaps = 51/297 (17%)
Query: 17 GRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSSS 66
GR + ST+ AG+ I + ++P+L ALP N + C+ C F + +
Sbjct: 23 GRTVFSTERFGAGETIAILENPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVKAGPR 79
Query: 67 ICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
C +C+ +C+ +C H C+ R+ D P+ Q L
Sbjct: 80 TCKACTGCKAAVYCNAECQRQHWKLVHKAE-CKMFKRIRERTGKDWVPTPVRAVAQVMLL 138
Query: 123 IAAYNLAVVNPSQFQILLA--FQGTVTDND---------TSAAHYLSSLCPPPASTTIEL 171
+ A + V L +G TD + T A Y L + T++
Sbjct: 139 LKAGDEEVTKAFGPGGTLESNVEGFKTDEELWKDFELQATGAVVYAGLL---QSDETLKQ 195
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAA 229
+L K + NAF + GQ G IYL S NH C+PNA Y+
Sbjct: 196 AMEVLCKIQTNAFNRFDADT-GQAG--------IYLHPSLSMVNHSCVPNA----YITF- 241
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
++ D+ G EI +SY RQ+ L Y F C+C RCK + N
Sbjct: 242 --EKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLRL-YHFQCNCVRCKDDLN 295
>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
Length = 530
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 118/299 (39%), Gaps = 59/299 (19%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFR---KTMHSSS 65
VA E GR L + ++L+AG+++LR+ PI + P +S+ + C C R K +
Sbjct: 17 VACSEKFGRYLQAAKNLQAGEVILREKPI---AVGPITSSNDYLCFACLRLLPKLKKGTQ 73
Query: 66 SICPSCSVAFCSPKCSTAAAS--SSHSPYVCQALTRLLNLNSPDAAN-----LPL----- 113
+C C++A P C TA H+ Y C+ NL++ + LPL
Sbjct: 74 YVCSKCNIA---PLCGTACQEQPKHHTSYECEIFKTNENLSADNIETLIGILLPLRLWLV 130
Query: 114 ---DRQLQARF------LIAAYNLAVVNPSQFQILLAFQG--TVTDNDTSAAHYLSSLCP 162
D +L R + N +V + ++ + + ++DTS YL LC
Sbjct: 131 KQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVMKALHLIPEDDTSIPEYLQQLC- 189
Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
+L +N F L P G + G+Y++ASF HDC N
Sbjct: 190 -----------GILD---VNTFELRSP-----GGLDGLLLRGLYVEASFMAHDCRGNT-- 228
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+++ + V + +G I +Y R++ L F C+C C
Sbjct: 229 ----HLTVDDHFQLTVYASIPIKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLC 283
>gi|383142266|gb|AFG52496.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
Length = 155
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 14 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 67 RQGR-CENWGFECKCRRCIVE 86
>gi|401422457|ref|XP_003875716.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491955|emb|CBZ27228.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 872
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 74/181 (40%), Gaps = 35/181 (19%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
L A + N+FG+ D + +GIY +AS+FNH C+PN CR +
Sbjct: 672 LCAAAQCNSFGVY-------DTQEHCIGFGIYPEASYFNHSCVPNLCRV--MHHGGRIAA 722
Query: 235 DIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+R I PQ + + Y V + + R++ LL Y F C C+RC +A
Sbjct: 723 FYALRAI--APQ-EPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKA-------- 771
Query: 294 DENNEEVMDEDQDEQMVASDDDAEVHGDTNF-----PHAYFFMRYMCDRDNCWGTLAPLP 348
E + A+ + A V G + P A+ +C + G L PLP
Sbjct: 772 ---------EGPQMAIAAAGNRASVSGPRSGCVEVSPAAFENPLLLCAECSIHGYLRPLP 822
Query: 349 P 349
P
Sbjct: 823 P 823
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 112/290 (38%), Gaps = 29/290 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
E+V V EG+GRGL + + G ++ + Y+A+ F + + CH CF++
Sbjct: 2 ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVVCHTCFKR--QEK 56
Query: 65 SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL- 122
C C A+ C C A + + C A+ + + +P N+ L ++ R
Sbjct: 57 LHRCGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRIER 111
Query: 123 ----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
++ L ++ Q + + D +L ++ + +
Sbjct: 112 EGSGLSENCLVSIDDLQNHVENFDEEEKKDLRIDVESFLEFWPAQSQQFGMQYISHIFGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
NAF L + Q G ++V GI+ NHDC PN + V +
Sbjct: 172 INCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHT 225
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + G E+ +SY RQK+L Y F C C+ CK
Sbjct: 226 QMRIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCK 275
>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
Length = 800
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
+GIY L + FNH C+PNA Y+ + E+ + V + D+ +G EI + Y
Sbjct: 551 GHGIYPLASRLFNHSCVPNAVA-KYIISPFES-VQMEVVALRDIAEGEEITIPYLDPALP 608
Query: 261 YSTRQKRLLYDYDFACDCDRC 281
TRQ+ L +Y F C C C
Sbjct: 609 LHTRQEALRLNYGFTCTCPLC 629
>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
Length = 369
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
C C VA +CS KC A C+ L PD+ L L R +
Sbjct: 3 CSQCRVAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVIFKLMEETPS 59
Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
++ L + Y+L Q++L FQ + + A+ PP+
Sbjct: 60 ESEKLYSFYDLESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQL------PPSFDIF 113
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E A ++ N+F + ++ G+Y S NH C PN +
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SI 155
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
N +++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D
Sbjct: 156 VFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 211
Query: 290 NDNDD-ENNEEVMDEDQD 306
D D +E+V E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229
>gi|378731988|gb|EHY58447.1| hypothetical protein HMPREF1120_06457 [Exophiala dermatitidis
NIH/UT8656]
Length = 545
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD-YSTRQKR 267
AS NH C+PN +F Y E ++ R I D+ +G+EIC +Y V ++ S RQ++
Sbjct: 239 ASRLNHSCVPN-VQFSY----DETTNEMQFRAIRDIARGKEICSNYDKVVFETRSKRQRK 293
Query: 268 LLYDYDFACDCDRCKVEAN-WSDND 291
Y F C C+ C+ W+ +D
Sbjct: 294 QQIYYGFVCTCEACEPHTEFWARSD 318
>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
Length = 378
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 54/143 (37%), Gaps = 39/143 (27%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GIY +S FNH C PN C+ +++ N + I D+ + EI +Y +N +
Sbjct: 254 GIYFNSSLFNHSCQPNICKV--IESCKNNFGCHSMVAIRDIKENEEISFNYIQLNQNKQE 311
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTN 323
R ++L Y F C+C C ++N N
Sbjct: 312 RIEKLQSSYYFTCNCPSC-----CTNNPN------------------------------- 335
Query: 324 FPHAYFFMRYMCDRDNCWGTLAP 346
PH F +Y C + NC G P
Sbjct: 336 -PHKEFLNKYKCKKLNCSGLEYP 357
>gi|255730767|ref|XP_002550308.1| hypothetical protein CTRG_04606 [Candida tropicalis MYA-3404]
gi|240132265|gb|EER31823.1| hypothetical protein CTRG_04606 [Candida tropicalis MYA-3404]
Length = 409
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
+ A++ K+ NAFG+ + + + + YG+Y AS+FNH C PN +
Sbjct: 300 IVRAIIGKNLSNAFGIWSEVSDPSEDKEFL-GYGVYPSASYFNHSCEPNIVK-------T 351
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ + D+ G E+C++Y + ++ L ++ F C C +C E N
Sbjct: 352 RKGATLEFTTLRDINPGEELCINYGNYTNEPVEIRRAQLSEWFFDCGCTKCIRELN 407
>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
Length = 485
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/302 (21%), Positives = 108/302 (35%), Gaps = 70/302 (23%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPSCS 72
GR L +T+ ++ G+++ DSP+LL + I CH M+S + C CS
Sbjct: 57 GRHLTATRPIRRGELIFNDSPLLLGPKIASIPVCLGCHRNLPPLMNSHLQVDFYHCRHCS 116
Query: 73 VAFCSPKCSTAAASSSHSPYVCQAL--------------TR--------------LLNLN 104
C P C TA + P + ++ TR LL +
Sbjct: 117 WPLCGPSCETATSHKDECPLLARSSKSIGSRLQFCFSDPTRKESVYCAVVPLRALLLKRS 176
Query: 105 SPDAAN---LPLDRQLQARFLIAAYNLAVVNPSQF-QILLAFQGTVTDNDTSAAHYLSSL 160
+P+ L L+ L+ R Y N F + +L G VT+ +
Sbjct: 177 NPERFRDGFLTLESHLETRIKTPLYAALRSNLVPFVRDILGMGGEVTERE---------- 226
Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
+E+ A L N + + P ++ V+ +Y + +H C PN
Sbjct: 227 -------LLEIAAIL----DTNCYDVRLPELD-------VKVRALYELGAMMSHHCRPNT 268
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
Y D ++V D+P+ I +SY R+ + F C CDR
Sbjct: 269 --KHYFD----ERLRLVVVATVDIPKDAVISISYTQPLLSTIQRRYAIQQSKCFECCCDR 322
Query: 281 CK 282
C+
Sbjct: 323 CR 324
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/266 (20%), Positives = 107/266 (40%), Gaps = 17/266 (6%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+T+++K +I+L+++P++ P + C C C C C
Sbjct: 52 GRHLVATRTIKPYEIILKEAPLV---RGPAQITPPVCMGCLNSIEPEDHIDCDLCGWPLC 108
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P+C + + + Q + +N++ + ++ PL + + + S+F
Sbjct: 109 GPECKSLGEHRAEC-QLTQERGQKVNVHEFNGSH-PLYTCVSTVRCLLIGETSPEKASKF 166
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
Q L + + T ++ A S P E ++ + A G ++ I G +
Sbjct: 167 QELESLESTRRGSNQWKADLASIGQFIPKFFKTE---KFSEEEVMRAVGALQ--INGHEV 221
Query: 197 RRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
S + ++ ASF H C+PN + + N I+ ++ + + + Y
Sbjct: 222 PTSDPPHVAVFYTASFTEHSCMPNLAK------SFNKNGHCILWAPREIKKNSHLSICYS 275
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + RQ+ L+ F C CDRC
Sbjct: 276 DAMWGTADRQRHLVQTKLFKCACDRC 301
>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
Length = 649
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 111/313 (35%), Gaps = 49/313 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
GR + LK I+L + P + S L S S C CF++ + CP CS V F
Sbjct: 239 GRFAKTNMDLKPNTIILLEKPHV--SVLLEEYSLSHCSTCFKRV--TIPVCCPKCSDVIF 294
Query: 76 CSPKCSTAAASS--------------SHSPYVCQALTRLLNLNSP----------DAANL 111
CS C A SS S + C R++ + D
Sbjct: 295 CSEACEKKANSSYHRYECGFLPIFWKSGASITCHMALRMITQKTEEYFVGLRHELDGLTS 354
Query: 112 PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
+ +L Y L V + + FQ T+ +A L S +
Sbjct: 355 EVTDKLNNDDYRKVYKL-VTHEDKRSPEDYFQRTLMATLLNACLSLGGYYKSKESES--F 411
Query: 172 TAALLAKD----RLNAFGLMEPYIEGQD--GRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
LL + + NA + E E + G+ + G+Y + FNH C P R+
Sbjct: 412 IGGLLLHNLQFLQYNAHEISELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRY-- 469
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKV 283
+ VR + +P G + +Y P+ S ++R LL Y F C+C C
Sbjct: 470 -----YRGNSVCVRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACA- 523
Query: 284 EANWSDNDNDDEN 296
+W + D N
Sbjct: 524 -GDWPKFADMDSN 535
>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
Length = 302
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 99 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 151 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 263 PKCLAEAD 270
>gi|195060014|ref|XP_001995739.1| GH17917 [Drosophila grimshawi]
gi|193896525|gb|EDV95391.1| GH17917 [Drosophila grimshawi]
Length = 545
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 108/286 (37%), Gaps = 44/286 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHSSSSICPSCSVAF 75
GR LV+ + L+AG++++ + P+ + P ++ C C++ + S CP C+
Sbjct: 70 GRYLVANRRLEAGELLISEEPLAIG---PCVSCEPVCLGCYQPVQLDSQQYRCPDCNWPL 126
Query: 76 CSPKCSTAAASSSHSPYVCQ--ALTRLLNLNSPDA-ANLPLDRQLQARFLIAAYNLAVVN 132
C P C + HS C A R L + A A L R L LI L +
Sbjct: 127 CGPTCRGIRQRTGHSAEECALYAERRALLADVLTANATLEQRRDLYELVLIVRIVLLKQH 186
Query: 133 PSQ----FQILLAFQGTVTDNDTSAAHY-------------LSSLCPPPASTTIELTAAL 175
++ + + + +N T HY L+++C +EL
Sbjct: 187 SAEQYANIRRMESHTAERRENATLWQHYEQKVVQRLRNDWQLANVC-----DELELHEIC 241
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
D +N F + G+R +A +Y A HDC PN D D A
Sbjct: 242 GILD-VNCFEI---------GQRGAKARTLYPSAFLLAHDCTPNTAHTD--DPA---TFA 286
Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I++R V + + LSY R+ + F C C RC
Sbjct: 287 ILLRTSRRVLEQEPLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRC 332
>gi|429849921|gb|ELA25247.1| hypothetical protein CGGC5_13535 [Colletotrichum gloeosporioides
Nara gc5]
Length = 380
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 4/74 (5%)
Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
+ + NHDC PNA Y+ + + +V + QG EI +SY PV R++
Sbjct: 210 RIARINHDCRPNA----YMRLPTDGLSGTVVAGEQGIAQGEEITISYLPVELARERRRRN 265
Query: 268 LLYDYDFACDCDRC 281
L D+ F C C C
Sbjct: 266 LQKDWGFECGCKLC 279
>gi|403217768|emb|CCK72261.1| hypothetical protein KNAG_0J01800 [Kazachstania naganishii CBS
8797]
Length = 380
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 69/154 (44%), Gaps = 19/154 (12%)
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
++ +LL Q TS L L P T +IEL + + NAFGL +
Sbjct: 238 KKYPVLLHLQ-------TSVFKILYILLPDILKTHFSIELFRHIAGSEYANAFGLWQE-D 289
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
EG D R + Y ++ AS+ NH C+PN + + + ++I D+ +G EIC
Sbjct: 290 EGVDSREFL-GYSVFPCASYANHSCVPNITK-------SRHGREMIFTANKDISEGAEIC 341
Query: 252 LSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ V + R+K L + F C C RC +E
Sbjct: 342 IDCSGVLSLAVEKRRKFLKETWFFDCCCGRCLLE 375
>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
Length = 486
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/318 (20%), Positives = 106/318 (33%), Gaps = 77/318 (24%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL----YSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+GRGL + + ++ G+++L +S L L I + C C HS +
Sbjct: 120 KGRGLFAKKEIRQGELILGESGPLAAIPPIDKLSLIQAGKVCSLCGNSLGHSDHFVMMNG 179
Query: 68 --CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C C +C+ C + + +V + L N + ++ +AA
Sbjct: 180 LDCNGCGAVWCNRNCKKTDVTHASLKHV-KGKGSLTNA----SGWAKFEKSCMENVFVAA 234
Query: 126 YNLAVV-------NPSQFQILLAFQ---------------------------GTVTDND- 150
Y++ V+ + +L F+ G VT D
Sbjct: 235 YSVGVIYATSAVSKADENHVLQHFRSLAQVSQRARKQVSDSTNVGGTFDASSGAVTSEDP 294
Query: 151 ----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY 206
A Y P E + K LN I GQ Y IY
Sbjct: 295 EPMWQQAYEYFKEAFPKSEDIDFESFLTYVGKFNLNQ-------ISGQ-------IYPIY 340
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
SF NHDC PN R++ D + V + G E+ +Y + R++
Sbjct: 341 ---SFINHDCEPNV-RYEIDD-----KLRLKVFARKPIKAGEELLTTYVNPLHGVKLRRR 391
Query: 267 RLLYDYDFACDCDRCKVE 284
L ++ F C C+RC E
Sbjct: 392 ELRVNWGFLCQCERCTKE 409
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/290 (20%), Positives = 107/290 (36%), Gaps = 54/290 (18%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
GRGL +T ++K G+ V P + L C NCF + +
Sbjct: 20 GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNIMANVDMKLWA 78
Query: 68 CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
C C V +C KC +A H P +A+ R++ L + + ++
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEDHVK 138
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQ-ILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
D + + N Q++ I+L+ + + + S + +E
Sbjct: 139 SDLIMFLSLRSHLDEITKSNQEQYERIMLSAKA---EKEYSGSEL-----------DVET 184
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
A LA+ +N+F + G ++ + Y+ NH C PNA
Sbjct: 185 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 230
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ I V+ + ++ ++ +SY Y + RQK+L Y F C C +C
Sbjct: 231 DGGLITVKALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/266 (20%), Positives = 107/266 (40%), Gaps = 17/266 (6%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+T+++K +I+L+++P++ P + C C C C C
Sbjct: 52 GRHLVATRTIKPYEIILKEAPLV---RGPAQITPPVCMGCLNSIEPEDHIDCDLCGWPLC 108
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P+C + + + Q + +N++ + ++ PL + + + S+F
Sbjct: 109 GPECKSLGEHRAEC-QLTQERGQKVNVHEFNGSH-PLYTCVSTVRCLLIGETSPEKASKF 166
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
Q L + + T ++ A S P E ++ + A G ++ I G +
Sbjct: 167 QELESLESTRRGSNQWKADLASIGQFIPKFFKTE---KFSEEEVMRAVGALQ--INGHEV 221
Query: 197 RRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
S + ++ ASF H C+PN + + N I+ ++ + + + Y
Sbjct: 222 PTSDPPHVAVFYTASFTEHSCMPNLAK------SFNKNGHCILWAPREIKKNSHLSICYS 275
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + RQ+ L+ F C CDRC
Sbjct: 276 DAMWGTADRQRHLVQTKLFKCACDRC 301
>gi|452847305|gb|EME49237.1| hypothetical protein DOTSEDRAFT_143030 [Dothistroma septosporum
NZE10]
Length = 352
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 56/124 (45%), Gaps = 14/124 (11%)
Query: 166 STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
S T EL + D NAFG+ G+D + YG+Y +S+FNH C PN +
Sbjct: 239 SLTPELCRTMTMADNHNAFGIRAG---GEDCEEYM-GYGVYPSSSYFNHSCSPNVNK-RR 293
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
V + E + D+ G E C++Y + D RQ RL + F C C+ C
Sbjct: 294 VGRSWEFSA------ARDIAAGEECCITYLGGDEKDLDRRARQSRLHEVWGFHCYCNLCT 347
Query: 283 VEAN 286
+A+
Sbjct: 348 AKAS 351
>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
Length = 656
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 109/290 (37%), Gaps = 89/290 (30%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
TV + G GRGL +T+ KAG++++
Sbjct: 373 TTVRDSPGAGRGLFATRDFKAGELIM---------------------------------- 398
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
C +FC+ SS P C A+T L + LD R A +
Sbjct: 399 ---CEKSFCT-------VSSLDKP--CTAVTAL---------TVDLDDDCAIRVFPAGLH 437
Query: 128 LAVV-----NPSQFQILLA-----FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
AVV NPSQ +L ++G DT A S P ++ +
Sbjct: 438 RAVVQKLLDNPSQAPAVLGLDSGPYKGL---GDTGAV----STAEGPVVDAFQVHHIV-- 488
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
+ NAFGL + D S G++++AS+ NH C+ NA + D+ D++
Sbjct: 489 --QRNAFGLGP---QSPDEDVSNATTGLWVRASYVNHACVANAVK-DFAG-------DLM 535
Query: 238 V-RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
V R + G EI +Y N DY RQ L + FAC C C EA+
Sbjct: 536 VFRAARHIRAGEEIFHAY-DENADYDARQAALRKTWGFACTCALCAAEAH 584
>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
Length = 527
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/307 (22%), Positives = 121/307 (39%), Gaps = 49/307 (15%)
Query: 17 GRGLVSTQSLKAG-QIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--------SSI 67
G G+ +T ++K G +++ D+P++ + + + C CF + +S
Sbjct: 27 GLGVFATANIKKGADVLVIDTPMV--ALVEEKQLKNICSGCFDTSKAASIDSRRPGLVKA 84
Query: 68 CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
C C V + C C + HS C ++L P LP+ L+ +I +
Sbjct: 85 CVRCKVVYYCDKNCQRKDWKAGHS-VECAIYSKLY----PRILPLPVRAILR---VILSQ 136
Query: 127 NLAVVNPSQFQ--ILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIEL----TAALLA 177
+ V+P + + L++ + D+D + H+L + L L
Sbjct: 137 DKDKVSPYIHEEFLDLSYPKEIYDDDENEEANDHFLMAEAVLKYGDIEYLDEDYVEQLFG 196
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNAC-RFDYVDAAAENNT 234
K N+F L + G+R G+YL A+ FNH C PNA FD
Sbjct: 197 KLSANSFTLTSAF-----GKRR----GVYLHPAAARFNHSCNPNASYSFD--------KG 239
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
+R + +G +I + Y Y TR+ L Y F C C RC E +D +
Sbjct: 240 KCYIRATKPIAKGEQIFIPYIDTTYSVGTRRHELKERYKFDCQCPRCLNEIATTDPEEVK 299
Query: 295 ENNEEVM 301
+ +E+ +
Sbjct: 300 KRSEQEL 306
>gi|453088880|gb|EMF16920.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 364
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 49/120 (40%), Gaps = 14/120 (11%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T +L L+ D NAFG+ G + YG+Y AS+FNH C N +
Sbjct: 253 TADLCLNLVRADNHNAFGIRG----GGEDSEEYMGYGVYPSASYFNHSCDANIHK----- 303
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
D+ G E+C++Y + + R+ RL + F C C RCK E
Sbjct: 304 --KRTGRSWTFHAAQDIQPGDELCITYLGGDEKDLNLPARRGRLEEVWGFVCHCARCKKE 361
>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
Length = 646
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 125/325 (38%), Gaps = 51/325 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR ++ +K G+ +L + P + S L + S C NCF +T+
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKSHCENCFMRTVVP 294
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPL- 113
+ CP C+ V +CS +C A+ H Y C + + ++N+ A A+ PL
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351
Query: 114 ---------DRQLQARFLIAA-----YNLAVVN--------PSQFQILLAFQGTVTDNDT 151
D QL LI+ +A + P+ FQ L + +T
Sbjct: 352 YFLKLKPTIDEQLTPEQLISLPKDDFRRVAQLERHQGERQPPNFFQHALMAR-FLTHCLR 410
Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKA 209
+ ++ S P S L L + N + E + GR G IY
Sbjct: 411 AGGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTL 470
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKR 267
+ FNH C P R+ T I + + + G I +Y P+ + S RQ R
Sbjct: 471 ALFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQAR 523
Query: 268 LLYDYDFACDCDRCKVEANWSDNDN 292
L Y F C CD C NW D+
Sbjct: 524 LKELYWFECSCDACI--DNWPKFDD 546
>gi|242013801|ref|XP_002427589.1| protein msta, isoform B, putative [Pediculus humanus corporis]
gi|212512004|gb|EEB14851.1| protein msta, isoform B, putative [Pediculus humanus corporis]
Length = 472
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/279 (21%), Positives = 105/279 (37%), Gaps = 41/279 (14%)
Query: 20 LVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI---CPSCSVAFC 76
LV+++S+K G+I+L D PI++ P S C C+ + S+ C +C+ C
Sbjct: 2 LVASKSIKQGEIILEDEPIVVG---PCQESKPLCIGCYTDIDLNDSNKYFRCVNCTWPLC 58
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-RFLIAAYNLAVVNPSQ 135
S +C HS C+ ++ N + L R L+ L +NP +
Sbjct: 59 SNQCKGFFNKFGHSKEECETFSKSFVKNKEIKTFVNFYYALTPFRCLL----LKKINPEK 114
Query: 136 FQILLAFQGTVTDNDTSAAHY-------------LSSLCPPPASTTIELTAALLAKDRLN 182
++I+ + + + + L I +L +N
Sbjct: 115 WKIIKNMESHTNERKKDGKLWETNEKMAVDIIKNVWGLSDDDDDNDIHFVCGIL---EVN 171
Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
+F + +S+RA +Y A F+H+C+PN + N +I R
Sbjct: 172 SFEV------NFQNVQSLRA--LYSNAFLFSHNCIPNT------SHSNNNEFKLIFRTTT 217
Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + + +SY R+K L F C+C RC
Sbjct: 218 FIQKNSAVTISYAYTLQPTLLRRKHLEKTKFFQCNCQRC 256
>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 393
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/340 (20%), Positives = 129/340 (37%), Gaps = 82/340 (24%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKTMHSS 64
+ + + E G+ + S +K G+++++ SP+ +N FC NCF + +
Sbjct: 17 KKIEINETLESGKFIESKIDMKLGELIIKVQSPMCFSFHKHLVNQ--FCFNCFSNSHEIN 74
Query: 65 SSI---------------CPSCS-VAFCSPKC----------STAAASSSHSPYVCQALT 98
++ C +C + FCS +C + +++ H+P C L+
Sbjct: 75 NAKFNKFKVDINKNYIIRCNNCKLIYFCSDECFEKVMSIESFQDSTSTNIHTPLECLILS 134
Query: 99 RLLNLNSPDAANLPLDRQLQARFLI-----AAYNLAVVNPSQFQILLAFQGTV------- 146
+ N L Q + R +I AY + N ++F++LL +
Sbjct: 135 NYHDQTISPKIN-TLHDQTENRMIINYLSKIAY--STNNNNKFKLLLIEMNQLIGDFNND 191
Query: 147 TDNDTSAAHYLSSLCPPPASTTIE-------------LTAALLAKDRLNAFGLMEPYIEG 193
+N T + + + ++ S + + L AK + N+FGL +
Sbjct: 192 NNNQTLSLNEIKNI--KNKSFNLRKLFNNFFFNIDKVIIEELYAKSQRNSFGLWK----N 245
Query: 194 QDGRRSVRAYG---IY---------LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
D + YG IY + S+FNH C PN R + N I + +
Sbjct: 246 SDECFGLSMYGNQTIYNNNNDKDDNISISYFNHSCFPNCVR-------VQENQSISIYSL 298
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ +G E+ +SY + + R L Y F C C RC
Sbjct: 299 IPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFECKCKRC 338
>gi|294463315|gb|ADE77193.1| unknown [Picea sitchensis]
Length = 288
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 15/126 (11%)
Query: 168 TIELTAALLAKDRLNAF------GLMEPYIEGQDGRRSVR---AYGIYLKASFFNHDCLP 218
T E + +LA+ R N F G + SV +Y+ S +NHDC P
Sbjct: 168 TKEWYSGILARIRANVFRIELIGGYHTDLLAAASASISVETTVGSAVYMLPSMYNHDCDP 227
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
NA +A A ++ + ++ G E+ + Y + DY RQ L + F C C
Sbjct: 228 NAHILWIENAKAR------LKALRNIEPGEELRICYIDASMDYEARQSLLYQGFGFRCQC 281
Query: 279 DRCKVE 284
RCK +
Sbjct: 282 LRCKYK 287
>gi|167342128|gb|ABZ69012.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R + F C C RC VE
Sbjct: 65 RQGR-CESWGFECKCRRCIVE 84
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 99/281 (35%), Gaps = 34/281 (12%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+GRGL + + G++V P Y L + C +CF T C C A+
Sbjct: 17 KGRGLRALRDFAVGELVFA-CPAYSY-VLTVNERGAHCEHCF--TRREDLFKCGKCKQAY 72
Query: 76 -CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
C+ C A +H C + T L + +R R L
Sbjct: 73 YCNVDCQRGDWPMHKLECVAMCTHGENWCPSETVRLVSRIIMKQRVTTERTPSERLL--- 129
Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
L S + + + + D +A HY S ELT L A+ N F
Sbjct: 130 --LLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDDQELTE-LFAQVNCNGFT 186
Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ +D S ++ + NH C PN Y AE VR + ++
Sbjct: 187 I-------EDEELSHLGSAVFPDVALMNHSCSPNVI-VTYKGTVAE------VRAVQEIN 232
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
G EI SY + Y R++RLL Y F C C C ++
Sbjct: 233 PGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECTTKSK 273
>gi|407843612|gb|EKG01507.1| hypothetical protein TCSYLVIO_007495 [Trypanosoma cruzi]
Length = 382
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 89/243 (36%), Gaps = 40/243 (16%)
Query: 69 PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIAA 125
P+C +CS C T A S HS C + L L+ A+ +R A L+
Sbjct: 163 PNCHEEYCSRDCRTMALSLYHSRSCCNEGLQALELDMYTQMKASGTSTERNAAAAQLLML 222
Query: 126 YNLAV-----VNPSQFQILLAFQGTVTDNDTSAAH-------YLSSLCPPPASTTIELTA 173
LAV V PS L G + + AH + C S + E
Sbjct: 223 RVLAVSVQQQVIPSALSELRILSGRLLFSPRVLAHSFLEIYERFTRFCHTATSISYEEMI 282
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
+LA+ N F + +E + L S NH C+ N AAE
Sbjct: 283 GVLARVTANCFH-RDTCVE------------LNLSRSMLNHSCVAN---------AAEEL 320
Query: 234 TDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
++ DV +G E+ ++Y+P N + R K L F C C +C+ E D
Sbjct: 321 HSGEIKTTRDVARGEELTINYYPQLKNLSFEERSKE-LERRGFECHCSQCRREKGLERRD 379
Query: 292 NDD 294
++
Sbjct: 380 LNE 382
>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Length = 563
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 106/298 (35%), Gaps = 38/298 (12%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI--NSSSFCHNCFRKTMHS 63
E T E+ G+ LV+T+ L G +++ + P L+Y P + C C R +
Sbjct: 46 EITTTKEV---GKCLVATRDLSPGDVIISELP-LVYGPRPHMVEEGPVPCPGCCRLIICE 101
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ-ARFL 122
+S CP C C P+C H + C +L+L A N D Q
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHG-HEC----LILSLREIRAINGLHDFYRQDTLLA 156
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY---------------LSSLCPPPAST 167
+ L NP +F L+ + + Y + L A T
Sbjct: 157 LRCLLLQKKNPKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQART 216
Query: 168 TIELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FD 224
E+ + + G+++ QD SV +Y A HDCL N FD
Sbjct: 217 GKEVLQDISKETVHKICGIIDVNALEINQDAEISV----LYPTAYLMEHDCLCNTVHSFD 272
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
EN I VR + +G I Y + R++ L F+C C RCK
Sbjct: 273 ----TEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCK 326
>gi|225559380|gb|EEH07663.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 517
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPV 257
RA G L +++ NH CLPNA A+ N + + V + +G EI + Y +
Sbjct: 140 RALG--LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESL 197
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
D ++RQ+ LL +Y F C C C+ E
Sbjct: 198 YMDIASRQQFLLQEYGFECSCRLCEKE 224
>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
Length = 786
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 9/81 (11%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
+ G+YL AS+FNH C+PNA + D +++R + Q EI +SY +
Sbjct: 521 KGTGLYLLASYFNHSCIPNAGHQFFSDL-------MVIRATQSIKQDEEITISYCS-HAS 572
Query: 261 YSTRQKRLLYDYDFACDCDRC 281
Y++R+K L +D CDC C
Sbjct: 573 YASREKNLKPWFD-QCDCQLC 592
>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 59/142 (41%), Gaps = 19/142 (13%)
Query: 190 YIEGQDGR----RSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
Y G D R + G+Y+ S NH C+PN R NN +IV +
Sbjct: 430 YTTGLDARMCENKQCCTTGLYIAPITSLLNHSCIPNVKR------CFSNNYSVIVYAVQP 483
Query: 244 VPQGREI--CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK-----VEANWSDNDNDDEN 296
+ +G ++ C Y+ S RQK L Y+F CDC CK VE N +N
Sbjct: 484 IKKGSQLFDCYQQEFYEYNISPRQKHLKKTYNFNCDCKACKEKWDIVEYEVVSKKNLKKN 543
Query: 297 NEEVMDEDQDEQMVASDDDAEV 318
E D+ Q++ + D ++
Sbjct: 544 APEWRLADEYLQLIQAIHDKKI 565
>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
commune H4-8]
Length = 360
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDII 237
+ N GL + + D S AY G++ +AS NH C PNA FD + ++
Sbjct: 152 KTNGIGLGDDLVYPDDSGDS--AYVGVFERASRINHSCSPNAVYHFDL------QSFCLV 203
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
VR I + +G EI +SY +TR++ L DY F C C C +
Sbjct: 204 VRAIRHISKGEEIFISYSETLLQAATRRQNSLQDYGFRCACPACAM 249
>gi|156047862|ref|XP_001589898.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980]
gi|154693059|gb|EDN92797.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 545
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 108/277 (38%), Gaps = 25/277 (9%)
Query: 19 GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
G+V+ + + A + VL D P L ++S+ C C + + C C ++CSP
Sbjct: 272 GVVAIKDILAQEPVLYD-PTLA----AVVDSAGRCPACCGPVLDGVENSC--CRTSYCSP 324
Query: 79 KCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV----VNPS 134
CS A S H+ VC L P +++ + + ++ L+ L++ NP
Sbjct: 325 LCSQRALDSYHT-VVCGKDFDFLYETKPVSSSSSTESFIGSKLLLRVLALSIRENTANPL 383
Query: 135 QFQILLAFQGTVTDNDTSAAHY-LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
Q ++ N+ + + P EL + + + L +
Sbjct: 384 QSSLISRLTPAYNPNNPQLIAFNFTDHITTPIQILGELGVDVFTNGMYDTWVLHTIHCRL 443
Query: 194 QDGRR-----SVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQG 247
Q+ + + R I S FNH C PN R D+ N+ + + ++ +G
Sbjct: 444 QNNKHGQTFDNFRGTAINPLYSMFNHSCDPNVDWRHDH------ENSTVTMFAERNIKKG 497
Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
E+ +SY RQ++L+ + C C +C E
Sbjct: 498 EEMFISYIGKGDSLKDRQRKLMPWFGMECACHKCDEE 534
>gi|393227660|gb|EJD35329.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 352
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 74/179 (41%), Gaps = 24/179 (13%)
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---AHYLS-------SLCPPPASTTIE 170
FL++ A NPS F +L +G T ++A AH S P + T +
Sbjct: 183 FLLSGILAACTNPSAFALLRQLEGVSTPYRSTAQLHAHLASYQTLLALLPVPLLSHVTPD 242
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
+ L + D N+FG+ GR + YGI+ ASFFNH C P +
Sbjct: 243 ILRTLSSVDAHNSFGIWSD----APGRGEMLGYGIWPDASFFNHSCHPVIAQ-----RVR 293
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKR--LLYDYDFACDCDRCKVEA 285
E T V D E+ +SY + + + Q+R L ++ F C C RC E+
Sbjct: 294 EGRTWSFV-ASRDAAPDEEVTISYLSADELRELNVEQRRQCLQRNWGFVCMCARCVEES 351
>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 675
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 127/341 (37%), Gaps = 64/341 (18%)
Query: 5 SETVTVAEIEGR-GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
S+ VT+ G GR V+T++++ G+I++ + P + + + CH C K
Sbjct: 237 SKAVTIRNDGGDIGRHAVATRNIEPGEILVVEKPHCSFILAEY--RLTHCHLCMNKIFVP 294
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF- 121
S C CS +A+CS C A A S C L L + L L Q F
Sbjct: 295 IPSKCQVCSYLAYCSTSCRDADAKIHESE--CAVLPSLWMSKTSVTCFLALRAITQRSFE 352
Query: 122 -LIAAYNLAVVNPSQFQI----------LLAFQGTVTDND----------TSAAHYLSSL 160
I + +F+I A+ G VT D T A +L L
Sbjct: 353 EFIKLKDKLKSTKGRFEITQQRPYRSDDFEAYYGLVTHEDERTDEDMLHRTYIAVWLLRL 412
Query: 161 CP-----PPASTTIELTAALLAKDRL---------------NAFGLMEPYI-EGQD---- 195
P T + A ++D L NA + E I G+
Sbjct: 413 LKLSRYFPENVKTPDTAEAKPSEDELFIGSLILHGLMLLQFNAHEISELTIPRGEKTLAK 472
Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
+ + G++ + FNH C P R+ T ++VR I + G EI +Y
Sbjct: 473 AKSTFIGGGLFPTIALFNHSCNPGVIRYFI-------GTTMVVRAIRSIAAGEEISENYG 525
Query: 256 PV--NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
P+ + R++ L + Y F C+C+ C +W D+ D
Sbjct: 526 PIFTMSPEAERKRTLRWQYWFDCNCEACV--GHWPLLDDID 564
>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 517
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/317 (22%), Positives = 115/317 (36%), Gaps = 75/317 (23%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL------YSALPFINSSS------- 51
S+ V E +G+ +V+ ++AG ++ P + Y A I++ +
Sbjct: 6 SQMYKVDYKEDKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDTENKKTDYKT 65
Query: 52 --FCHNCFRKTMHSSSSICPSCS-VAFCSPKC----------STAAASSSHSPYVCQALT 98
C CF K S S CP+C V +CS C S+ C ++
Sbjct: 66 VNLCFYCFEK--FSKSFYCPNCKYVVYCSEVCLERGWKLHRDECEVFKSNIFDKFCPSII 123
Query: 99 RLLNLNS-----------PDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVT 147
+ +NS A+LP ++ F AY +AV S+ + + A
Sbjct: 124 MRMVINSYLSHFNFYEYCGSVADLPKEKY--EYFKYPAYIVAVALMSKKKKIFA---NFE 178
Query: 148 DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL 207
DN++ + + +T + + L GL G Y
Sbjct: 179 DNESILRNIIEKFIKISKNT------LQIIDNELEPAGL-----------------GFYK 215
Query: 208 K-ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
K +FNH CL N N + +R + DV G E+ +SY + +D +TR
Sbjct: 216 KPVPYFNHSCLSNC-------VTIFKNQKLFIRTLMDVYPGEELTISYVDIAFDKNTRLA 268
Query: 267 RLLYDYDFACDCDRCKV 283
+ Y F C C CKV
Sbjct: 269 ICMDQYFFTCTCKLCKV 285
>gi|146087265|ref|XP_001465774.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134069874|emb|CAM68201.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 455
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
++ G+Y S FNH C+PNA VD E I+++ + G + ++Y P+
Sbjct: 360 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLTA 414
Query: 260 DYSTR--QKRLLYDYDFACDCDRC 281
++R ++R L +Y F C C RC
Sbjct: 415 GAASRAERQRRLRNYFFTCHCPRC 438
>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
Length = 241
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 18/121 (14%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 20 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIQAGEELTICYLDMLMTS 72
Query: 262 STRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDEDQDEQMV 310
+ R+++L Y F CDC RC E W + + EE+ E + EQ++
Sbjct: 73 AERREQLRNQYCFDCDCARCLTQDKDADMLTGDEKVWKEVQESLKKIEELKAEWKWEQVL 132
Query: 311 A 311
A
Sbjct: 133 A 133
>gi|328853238|gb|EGG02378.1| hypothetical protein MELLADRAFT_91461 [Melampsora larici-populina
98AG31]
Length = 479
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 8/141 (5%)
Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
L P S ++L AA + + E G V Y + + S FNH+C PN
Sbjct: 238 LPSPTGSLLMKLDAAGHTRVERVHTAIARNAFETPHGSEEVLHYAVVPEPSVFNHECRPN 297
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
+ F + + + + D+ G EI ++Y + + RQ + + Y F C C
Sbjct: 298 SAFF-----FDKKTLRVYINAVRDIAMGEEITIAYRDMKASRAERQASIAH-YGFQCKCS 351
Query: 280 RCKVEANWSDNDNDDENNEEV 300
C + A ++ DE +E+
Sbjct: 352 HCSMSAE--ESKASDERIQEI 370
>gi|328850107|gb|EGF99276.1| hypothetical protein MELLADRAFT_94773 [Melampsora larici-populina
98AG31]
Length = 415
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 15/107 (14%)
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYG-IYLKASFFNHDCLPN-ACRFDYVDAAAENNT 234
A +R N + +E ++GQ+ R +G + + + NHDC PN A FD N
Sbjct: 199 AFERNNFYHRVE--VKGQE-----RPFGAVVFEPTRLNHDCRPNSAYHFD------TNTL 245
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ V + D+ G E+ +SY V+ TRQ L DY F C C C
Sbjct: 246 KLNVHALRDISPGEELTVSYIQVDVSKETRQGLLSRDYGFECGCSLC 292
>gi|336275943|ref|XP_003352725.1| hypothetical protein SMAC_01560 [Sordaria macrospora k-hell]
gi|380094615|emb|CCC07995.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 726
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ +S NH C+PN R D +I R DV +G E+ Y PV
Sbjct: 465 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVKEGEELFQQYVPVKTLV 517
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
R K + F C C C+ E S+
Sbjct: 518 DVRNKEFAEGWGFGCRCGLCEGERKSSEE 546
>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 420
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 75/184 (40%), Gaps = 29/184 (15%)
Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
QL+ F A Y V P F+ L A QG T + + H +L PP
Sbjct: 218 QLRRLFTDALYEERVSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRER 277
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
++ L KD IE G + G+YL S NH C+PNA
Sbjct: 278 EQLDSLIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 321
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCK 282
+ +NN + + + D+ G EIC+SY + +RQK L +Y F C C +C
Sbjct: 322 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKCL 381
Query: 283 VEAN 286
+A+
Sbjct: 382 AQAD 385
>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
Length = 418
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 267 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 379 PKCLAEAD 386
>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
Length = 393
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y+ S NH C+PNA F Y +N ++++ + + +G EIC+SY +
Sbjct: 288 GLYILQSKINHSCVPNAQSTFPY------SNDIVVLKALAPIQKGDEICISYLDDCQLER 341
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKTLRENYIFVCQCPKCRAQAS 368
>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 75/184 (40%), Gaps = 29/184 (15%)
Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
QL+ F A Y V P F+ L A QG T + + H +L PP
Sbjct: 220 QLRRLFTDALYEERVSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRER 279
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
++ L KD IE G + G+YL S NH C+PNA
Sbjct: 280 EQLDSLIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 323
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCK 282
+ +NN + + + D+ G EIC+SY + +RQK L +Y F C C +C
Sbjct: 324 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKCL 383
Query: 283 VEAN 286
+A+
Sbjct: 384 AQAD 387
>gi|398015620|ref|XP_003860999.1| hypothetical protein, conserved [Leishmania donovani]
gi|322499223|emb|CBZ34294.1| hypothetical protein, conserved [Leishmania donovani]
Length = 455
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
++ G+Y S FNH C+PNA VD E I+++ + G + ++Y P+
Sbjct: 360 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLTA 414
Query: 260 DYSTR--QKRLLYDYDFACDCDRC 281
++R ++R L +Y F C C RC
Sbjct: 415 GAASRAERQRRLRNYFFTCHCPRC 438
>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
Length = 418
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 267 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 379 PKCLAEAD 386
>gi|443897425|dbj|GAC74766.1| hypothetical protein PANT_12d00127 [Pseudozyma antarctica T-34]
Length = 500
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 124/335 (37%), Gaps = 70/335 (20%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLY---SALPFINSSSFCHNCFRKTMHSSSSI----C 68
+G+GLV+++S G++V + + A ++S C CF + + C
Sbjct: 120 KGKGLVASRSFAEGELVFTEEAYIATPPPEAFDQVSSGELCAQCFLPVSSAPVQLAIKNC 179
Query: 69 PSCSVAFCSPKCSTAAASSSHSPY-------------------------VCQALTRLLNL 103
C V FC+ C+ A ++ H+ V ++L RLL+
Sbjct: 180 GKCKVRFCTSACARTAMATHHTLLCTELNPAAKPLVQLVQSQKWQSLHCVARSLARLLST 239
Query: 104 NSPDAANLPLDRQLQAR-FLIAAY-----------NLAVV--------NPSQFQILLAFQ 143
+P N + +Q + + LIAAY + A V NP +F+
Sbjct: 240 LTPH--NNGMGKQSREQDDLIAAYGDFETVHARLSSFATVSELERRSRNPGWATEKASFE 297
Query: 144 GTVTDNDTSAAHYL--------SSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD 195
G + + T+ + L S+ P P ++L A+ K R + Y
Sbjct: 298 GILAEAYTALCNALDPYAEPRTSAPHPDPHHVQLDLIDAV--KTRKGQIKDLFTYASFLK 355
Query: 196 --GRRSV---RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
GR +V + G+Y SF NH C PN + + + + + + E+
Sbjct: 356 LLGRANVNMEKFGGLYSVHSFLNHSCAPNVQIRHVPERGILASMKVAALALRPIAKDEEL 415
Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFA-CDCDRCKVE 284
+SY RQ L DY F C C +C E
Sbjct: 416 VISYIDPTTRLGRRQLVLYRDYCFGPCTCPKCVQE 450
>gi|405118616|gb|AFR93390.1| hypothetical protein CNAG_03890 [Cryptococcus neoformans var.
grubii H99]
Length = 449
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 14/104 (13%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
G+G+GLV+ + LK G+++ ++ P ++ S P + S C CF H S + C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYPLLIQSMMCSQCFSLFAHPSPPLSVPC 162
Query: 69 PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
P C+ A FC+ C T + SSSHSP +C L +PDA +L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHSPLLCPGL-------NPDAGSL 199
>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
Length = 441
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 112/277 (40%), Gaps = 44/277 (15%)
Query: 25 SLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS-SICPSCSVA-FCSPKCST 82
LK G+ +++ P + L S C CFR+ + C SC V FC+ KC +
Sbjct: 2 KLKRGETWVKEDPFV--HVLSQTERDSRCDFCFRRPENDIQIPRCSSCKVVRFCNRKCQS 59
Query: 83 AAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIA----AYNLAVVNPSQ-F 136
+A S C++L R+ A +P D +L R LI ++ V + +
Sbjct: 60 SAWSVHKKE--CRSLKRV-------APKIPTDSVRLIFRLLIKLESDIEDVEVFGKKRNW 110
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPAST-------TIELTAALLAKDRLNAFGLMEP 189
L++ V ++ ++ + + + ++E + + +N+F + +P
Sbjct: 111 ADLISHVDEVQEDQIRLQQFMVLMTTLKSFSENVMSMPSVEELFVIFGRVCVNSFSICDP 170
Query: 190 YIEGQDGRRSVRAYGIYLKA----SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+ G+Y++ S +H C PNA A T + +R + V
Sbjct: 171 EMNPI-------GVGVYIRQIKPCSVLDHSCRPNA-------VAVFEGTTLRIRCVEPVD 216
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+++ +SY D +TR++ L Y F C C CK
Sbjct: 217 SEQDLRISYIDTLDDTTTRRRNLQQQYYFNCLCGECK 253
>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 650
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 114/305 (37%), Gaps = 51/305 (16%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
GR + LK IVL + P + S L S + C CF++ S CP CS V F
Sbjct: 240 GRFAKTNVDLKPNTIVLVEKPHV--SVLLEEYSKTHCSTCFKRV--SVPVCCPKCSDVVF 295
Query: 76 CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQARFLIAAYNLAVVN 132
CS C ++A S H Y C L + ++ L +Q + FL L +
Sbjct: 296 CSEDCESSANSGYHK-YECGFLPIFWKSGASITCHMALRIITQQSEEYFLQLRPELDGLT 354
Query: 133 PSQFQIL-----LAFQGTVTDNDTSAAH--YLSSLCPPPASTTIELTAALLAKD------ 179
Q L VT +T +A + +L + + L K+
Sbjct: 355 SEQTDKLKHDDYRKIYKLVTHEETRSAEDFFQRTLMATLLNACLTLGGFYKTKEAESFIG 414
Query: 180 ----------RLNAFGLMEPYIEGQD----GRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
+ NA + E ++ +D G+ G+Y + FNH C P R+
Sbjct: 415 GLLLHNLQLLQFNAHEISE--LQREDDRDVGKSVFIGGGLYPTLALFNHSCEPGVTRY-- 470
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
+ VR + + G I +Y P+ + + R +R LL Y F+C+C C
Sbjct: 471 -----YKGNSVCVRSVRSIAAGSMIGENYGPL-FTQTPRDERRATLLNQYRFSCNCRAC- 523
Query: 283 VEANW 287
NW
Sbjct: 524 -SENW 527
>gi|342184438|emb|CCC93920.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 471
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 36/85 (42%), Gaps = 4/85 (4%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R G+Y FNH C PN V + + + + D+ G E+ +SY
Sbjct: 371 RGQGVYTIGCLFNHSCEPNL----QVLYTVSGDETLSIEALRDIEPGEELNISYIDDTLP 426
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
Y RQ LL Y F C C RC EA
Sbjct: 427 YPRRQLSLLEHYLFQCACPRCTREA 451
>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
[Equus caballus]
Length = 358
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C PN + N +++R + D+ G E+ + Y +
Sbjct: 126 GLYPSMSLLNHSCEPNC-------SIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEE 178
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE+
Sbjct: 179 RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEH 211
>gi|260788416|ref|XP_002589246.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
gi|229274421|gb|EEN45257.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
Length = 778
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 3/83 (3%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C N D +DA + +++R + +G E+ +Y +
Sbjct: 646 GLYALQSSLNHSCDKNV---DVMDAVVDGKPGVVIRAKQPIKKGGELYTTYIDTSMQRPQ 702
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R+ L Y F C+C RCK E +
Sbjct: 703 RRAWLYRAYHFWCECQRCKYEGD 725
>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
Length = 575
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 106/298 (35%), Gaps = 38/298 (12%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI--NSSSFCHNCFRKTMHS 63
E T E+ G+ LV+T+ L G +++ + P L+Y P + C C R +
Sbjct: 46 EITTTKEV---GKCLVATRDLSPGDVIISELP-LVYGPRPHMVEEGPVPCPGCCRLIICE 101
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ-ARFL 122
+S CP C C P+C H + C +L+L A N D Q
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHG-HEC----LILSLREIRAINGLHDFYRQDTLLA 156
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY---------------LSSLCPPPAST 167
+ L NP +F L+ + + Y + L A T
Sbjct: 157 LRCLLLQKKNPKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQART 216
Query: 168 TIELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FD 224
E+ + + G+++ QD SV +Y A HDCL N FD
Sbjct: 217 GKEVLQDISKETVHKICGIIDVNALEINQDAEISV----LYPTAYLMEHDCLCNTVHSFD 272
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
EN I VR + +G I Y + R++ L F+C C RCK
Sbjct: 273 ----TEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCK 326
>gi|167342134|gb|ABZ69015.1| SET domain-containing protein C [Pinus radiata]
gi|167342136|gb|ABZ69016.1| SET domain-containing protein C [Pinus radiata]
Length = 141
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC +E
Sbjct: 65 RQGR-CENWGFECKCRRCILE 84
>gi|195023574|ref|XP_001985712.1| GH20950 [Drosophila grimshawi]
gi|193901712|gb|EDW00579.1| GH20950 [Drosophila grimshawi]
Length = 557
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 39/288 (13%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSF--CHNCFRKTMHSSSSICPSC 71
GR LV+TQ+++A QI+ + P+++ + S+S C C+ + +C SC
Sbjct: 56 GRYLVATQTIQAKQIIFVEEPLVVGPKWFLTDDEKSASIVPCVACY-TPCRINKHLCRSC 114
Query: 72 SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSP-----DAANLPLDRQLQARFLIAAY 126
CS C H C +L+L SP DA L + A ++
Sbjct: 115 RWPVCSISCE-------HEKLECS----ILSLGSPPSGKSDARGLHDYYRHDALLVLKCL 163
Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAH---------YLSS--LCPPPASTTIELTAAL 175
L +P ++ LL Q + + H YL L A+ E +
Sbjct: 164 LLQRQHPERWAALLEMQSHEEERLGTELHQEAEQRIVSYLQQRFLQRIKAAKNRESLLSE 223
Query: 176 LAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
+ + L+ G++E + V G++ +A H C PN C F + DAA +
Sbjct: 224 YSAELLHRLCGIIETNYMVIELATGVELSGVFRQACMMEHACQPN-CYFQF-DAATQR-- 279
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I VR D+ +G + ++Y + + RQ L F C C RC+
Sbjct: 280 -IAVRAGCDLKKGDHLKITYTNILWGTQMRQHHLRMTKHFNCRCSRCE 326
>gi|383142265|gb|AFG52495.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
Length = 155
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + +G EI YF +
Sbjct: 14 GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 66
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 67 RQGR-CENWGFECKCRRCIVE 86
>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 454
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 18/131 (13%)
Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNA 220
PP + AA+ A+ NA ++ P ++ GI L + NH C PNA
Sbjct: 112 PPTMFDKDFIAAMYARVMSNALTIITPTLD---------PLGIILDPTLCSLNHSCDPNA 162
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
F +D + + +R + + + +EI +SY Y Y RQ+ L + F C C +
Sbjct: 163 --FIMMDGPS-----VSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAK 215
Query: 281 CKVEANWSDND 291
C+ +A +++
Sbjct: 216 CQEKATLQEDN 226
>gi|167342102|gb|ABZ68999.1| SET domain-containing protein C [Pinus taeda]
gi|167342124|gb|ABZ69010.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 369
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 91/247 (36%), Gaps = 45/247 (18%)
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
C C VA +CS KC A C+ L PD+ L L R +
Sbjct: 3 CSQCRVAKYCSSKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEGSPS 59
Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
++ L + Y+L Q+++ FQ + + A+ PPA
Sbjct: 60 ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDIF 113
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E A ++ N+F + ++ G+Y S NH C PN +
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SV 155
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
N +++R + D+ G E+ + Y R+K+L Y F CDC RC+ + +D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRCQTQDKDAD 215
Query: 290 NDNDDEN 296
DE
Sbjct: 216 MLTGDEQ 222
>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
Length = 439
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+Y KAS FNH C PN CR ++ A E ++ + D+ G E+ +Y ++
Sbjct: 283 AVYEKASLFNHSCQPNVCR---INRAGEWGALEMISLT-DIAAGTELVYNYIQISLPTED 338
Query: 264 RQKRLLYDYDFACDCDRC 281
RQ +L +Y F C C+ C
Sbjct: 339 RQSKLSENYFFECKCNGC 356
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 115/295 (38%), Gaps = 41/295 (13%)
Query: 17 GRGLVSTQSLK--AGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
GRGL + + G + PI+ Y P + + + CH C C C A
Sbjct: 18 GRGLFAGEGSHYIPGDSIATYRPIV-YCVTPTM-AKACCHWCLNSDAPKYYQ-CSGCRYA 74
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL--IAAYNLAVV 131
+CS +C+ AA CQ + +L + A++ PL L A L + + AV
Sbjct: 75 VYCSKECAKAAYKLGTHRRECQLIKKL---PAEHASSAPLTTFLAAAKLHWLVQEDEAVR 131
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSS----------LCPPPASTTIELTAALLAKDRL 181
+ A + D+ + L S + PP ++L L R
Sbjct: 132 HSVDEMCRHADSSDTLEADSGSTAILLSRYLDGTQADLIYSPPVPVMLDLLRVL----RY 187
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NA + D + V A G+Y + S NH C PN + +++ +R I
Sbjct: 188 NAVTIT------NDSLQDV-ALGLYTEVSAMNHSCAPNVVLIF-------SGSEVTLRTI 233
Query: 242 HDVPQGREICLSYFPVNYD-YSTRQKRLLYDYDFACDCDRCKVEANWSDN-DNDD 294
V G E+ +SY V + R +RL Y F C C+RC E + D+ DN D
Sbjct: 234 RAVEDGAELFISYVDVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVDNGD 288
>gi|49533764|gb|AAT66763.1| Putative TPR domain containing protein, identical [Solanum
demissum]
Length = 438
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
V G+++ +SF NH C PN R D +++ D+ G+E+ +YF V
Sbjct: 281 VHGIGLWILSSFINHSCDPNVRRSHVGDH-------VMIHACRDIKAGKELTFAYFDVFT 333
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEA 285
+ R+++ ++ F C C RC +E
Sbjct: 334 PFRDREEK-AKNWGFVCKCKRCNLEK 358
>gi|429861968|gb|ELA36630.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 735
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DY 261
GI+ AS NH CL N CR D +IVR D+P G E+ Y ++ DY
Sbjct: 495 GIWTIASHINHSCLENCCRSFIGDV-------MIVRASRDMPAGTELVFCYQSIDIGDDY 547
Query: 262 STRQKRLLYDYDFACDCDRC 281
QKR + F C+C+ C
Sbjct: 548 KATQKR-FSRWGFTCECELC 566
>gi|303278994|ref|XP_003058790.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459950|gb|EEH57245.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 771
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
++R ++ AS NH C PN C YVD + ++V + G E+ ++Y V+
Sbjct: 400 AMRGAAVFPLASSLNHSCDPN-CEVAYVD-----DARVLVVARRTLKPGEELTIAYVDVD 453
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEA 285
D R+ L Y F C C+RC EA
Sbjct: 454 ADVGERRDELREVYGFECVCERCSREA 480
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 101/285 (35%), Gaps = 51/285 (17%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + + G+++L P + + L S C CF + S C C A
Sbjct: 19 GKGRGLRARRRFAVGELLL-SCPAYV-AVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74
Query: 75 F-CSPKCST--------------AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
F C+ +C + H + R+L A + Q+
Sbjct: 75 FYCNVECQKEDWPMHKLECASMCTFGQNWHPSETVRLTARIL------AKQKTHPERTQS 128
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALL 176
L+A S L + + ND +A H+ S P A+ + L
Sbjct: 129 EKLLAVKEF----ESHLDKLDNEKRELIQNDIAALHHFYSKHLDYPDNAALVV-----LF 179
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
A+ N F + +D S I+ + NH C PN Y AE
Sbjct: 180 AQVNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE----- 226
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
VR + ++ G EI SY + Y R RL Y F+CDC C
Sbjct: 227 -VRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCREC 270
>gi|113205454|gb|AAW28574.2| TPR domain containing protein, putative [Solanum demissum]
Length = 569
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 8/91 (8%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
V G+++ +SF NH C PN R D + + D+ G E+ +YF V
Sbjct: 318 VHGIGLWILSSFINHSCDPNVRRSHIGD-------HVTIHACRDIKAGEELTFAYFDVFT 370
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
+ R+++ ++ F C C RC +E S N
Sbjct: 371 PFRDREEK-AKNWGFVCKCKRCNLEKGVSSN 400
>gi|167342092|gb|ABZ68994.1| SET domain-containing protein C [Pinus taeda]
gi|167342094|gb|ABZ68995.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|325088450|gb|EGC41760.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 439
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 5/87 (5%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPV 257
RA G L +++ NH CLPNA A+ N + + V + +G EI + Y +
Sbjct: 111 RALG--LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESL 168
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
D + RQ+ LL +Y F C C C+ E
Sbjct: 169 YMDIAGRQQFLLQEYGFECSCRLCEKE 195
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 120/315 (38%), Gaps = 56/315 (17%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQ-IVLRDSPILLYSALPFINSSSFCHNCFRK 59
+ +V T G G GL +++ ++ G+ +VL P + + L S C C
Sbjct: 54 IPSVYTKATTGPSPGSGTGLFASKDIQPGETVVLIQKPFV--AVLDTAQLESKCSGCLGA 111
Query: 60 TMHSSSSI----CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRL 100
+ ++ C C V +C+ C HS P +A+ R+
Sbjct: 112 HANRQEAVELKACTGCRVVKYCNKTCQAKDWKLFHSLECRIFQNLKPRVLPNNARAILRM 171
Query: 101 LNLNSPD---AANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL 157
+ L+ A + L + L++ A + Q++ +L T+ A
Sbjct: 172 VMLSERGRYPAGEMELFKGLESHMKDVCIRNANGDKDQWERILL---------TAKAVKN 222
Query: 158 SSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHD 215
S C S L A AK +N+F + D R GIY+ A+ NH
Sbjct: 223 YSGCAMDES----LIAEYAAKLDVNSFN----FDNVMDER-----LGIYMHPYAALMNHS 269
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
C DY A ++ + V+ +H + +G +I ++Y Y RQK L Y F
Sbjct: 270 C-------DYNATVAFDDDRLHVKALHPIKKGEQIFITYVDTTNPYKIRQKELSDRYYFT 322
Query: 276 CDCDRCKVEANWSDN 290
C C +C+ AN ++
Sbjct: 323 CRCSKCQQGANGRED 337
>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
Length = 421
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 77/187 (41%), Gaps = 29/187 (15%)
Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
QL+ F+ A Y + P F+ L A QG T + + H +L PP
Sbjct: 218 QLRRLFVDALYEERMSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRDR 277
Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
++ L KD IE G + G+YL S NH C+PNA
Sbjct: 278 EKLDALIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 321
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ +NN + + + D+ G EIC+SY + +RQK L +Y F C C +C
Sbjct: 322 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKCL 381
Query: 283 VEANWSD 289
+A+ D
Sbjct: 382 AQADEPD 388
>gi|302686744|ref|XP_003033052.1| hypothetical protein SCHCODRAFT_107538 [Schizophyllum commune H4-8]
gi|300106746|gb|EFI98149.1| hypothetical protein SCHCODRAFT_107538, partial [Schizophyllum
commune H4-8]
Length = 404
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 21/154 (13%)
Query: 172 TAALLAKDRLNAFGLMEPY----IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
+ LL++ R NA+ + Y I+ QDG+ I+ + S NH C PNA + D
Sbjct: 188 SGPLLSRARTNAWHALGEYKFAGIDAQDGQ----YLAIFDELSRMNHSCRPNA--LYHWD 241
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
++ + + +R + D+ G EI +SY V+ YS R+ LL Y F CDC C
Sbjct: 242 SSTFSGS---LRAVRDIEPGEEITVSYCGEVDRPYSDRRA-LLAPYGFGCDCRAC----- 292
Query: 287 WSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
++ D D E+++ E D S D G
Sbjct: 293 -AEGDAADIRYEQIIAEYDDLPFPLSSDKRVEQG 325
>gi|303272359|ref|XP_003055541.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463515|gb|EEH60793.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 896
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 15/114 (13%)
Query: 213 NHDCLPNACRFDYVDAAAENNTD--------IIVRMIHDVPQGREICLSYFPVNYDYSTR 264
NH C+PN C+ + + D + + + DV +G EI +Y P N Y R
Sbjct: 527 NHSCVPN-CQIEARIVPGGGDGDDETCGGLRVTLVALRDVAEGEEITTAYVPSNQRYFRR 585
Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMD----EDQDEQMVASDD 314
++ L Y FACDC RC +EA S N N +V+D DQ ++ DD
Sbjct: 586 RRELRRRYGFACDCPRCALEA--SKTSNAPPPNLKVIDWLRLADQAQEEARYDD 637
Score = 37.7 bits (86), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 29/53 (54%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF 57
S V IEGRGRGLV+T+++ G ++ + P+ A+ +S C +C
Sbjct: 30 SGRVRCVAIEGRGRGLVATRAVYKGDVIAVEKPLAATQAMGNRSSVLACAHCL 82
>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
Length = 495
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 116/311 (37%), Gaps = 70/311 (22%)
Query: 1 MSAVSE----TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNC 56
M+AV+E V + E GR +V+ ++KAG+ VL + P+LL + N C NC
Sbjct: 1 MAAVTEDFAKKCEVKQNETLGRYVVAATNIKAGETVLLEQPVLLLAN----NGDRRCCNC 56
Query: 57 FRKTMHSSSSICPSCSVA-FCS-------PKCSTAA----------ASSSHSPYVCQALT 98
+ T SS C C + C+ P C A +H+ VC AL
Sbjct: 57 HQLT----SSFCGKCRLMPLCADCVEHHGPDCRRLAELQLQEQQVEQLQAHTE-VCSALK 111
Query: 99 RLLNLNSPDAAN-----LPLDRQLQARF---LIAAYNLAVVNPSQFQILLAFQGTVTDND 150
LL + + L L+ L R + +Y V+ P Q LL + D
Sbjct: 112 YLLLREHAEKRSHYEQLLQLEAHLARRRDTDIWRSYRAEVIEPLQASGLLLQLRNGAEID 171
Query: 151 TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
EL LLA +N F + P G ++R G+Y+ A+
Sbjct: 172 E------------------ELLQRLLAIADINGFEIRAPESGG-----AMR--GVYMTAA 206
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
F H CLPN + A + I + + G + Y + R++ L
Sbjct: 207 LFAHSCLPN------LVTAVDEQRRIKLYANRIIAAGEILYNCYTDILLGTDERRRILKT 260
Query: 271 DYDFACDCDRC 281
F C C RC
Sbjct: 261 GKCFDCQCARC 271
>gi|195056333|ref|XP_001995065.1| GH22837 [Drosophila grimshawi]
gi|193899271|gb|EDV98137.1| GH22837 [Drosophila grimshawi]
Length = 495
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 120/304 (39%), Gaps = 54/304 (17%)
Query: 1 MSAVSE----TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNC 56
M+AVSE V E E GR ++++ +++ G+ VL + PILL S N+ C +C
Sbjct: 1 MAAVSEDFAKKCEVKENETMGRYVIASTNIREGETVLCEQPILLLSN----NTDRRCSSC 56
Query: 57 FRKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
++ T +S C C + P C H C+ RL L + L Q
Sbjct: 57 YKLT----NSFCGKCRLL---PLCGECV---HHDARDCK---RLAELQLEEEQVEKLQAQ 103
Query: 117 LQARFLIAAYNLAVVNPSQ--FQILLAFQG--TVTDNDTSAAHYLSSLCPPPASTTI--- 169
+ + L +Q ++ LL + + N Y + P ST +
Sbjct: 104 AEVNNALTYLVLREHKETQPQYEQLLQMESHLALRRNTEIWRTYREQVVLPLQSTGLLQQ 163
Query: 170 ---------ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
EL LL +N F + P G ++R G+Y++A+ F H C PN
Sbjct: 164 LRNGAEIDEELLQHLLGTTDVNGFEIRAPESGG-----TLR--GLYIRAALFPHSCTPN- 215
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD--FACDC 278
+ A + I V +P G EI + + N T ++RL+ F C C
Sbjct: 216 -----LVTAIDEERRIKVYANRYIPAG-EILYNCY-TNILLGTEERRLILKQGKCFDCQC 268
Query: 279 DRCK 282
RC+
Sbjct: 269 ARCQ 272
>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
Length = 383
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 80/195 (41%), Gaps = 26/195 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPP 163
L L R L L + P F+ L A QG T + + H +L P
Sbjct: 184 GQLELLRLLFTEALYDEHLSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP 243
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
+L ++ L+AF + +E + G + G+Y+ S NH C+PNA
Sbjct: 244 -----------MLQREELDAFIDQLYKDMEKESGEFLNCEGSGLYVLQSCCNHSCIPNA- 291
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDC 278
+ ENN + + + D+ G EIC+SY +R R L +Y F C C
Sbjct: 292 ----ETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSC 347
Query: 279 DRCKVEANWSDNDND 293
+C +A+ +D +D
Sbjct: 348 PKCLAQADDADVTSD 362
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 117/294 (39%), Gaps = 45/294 (15%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
AV + V + RGR L +T+++ AG ++ SP++L +L + + C C R
Sbjct: 4 AVPDGVVIKPHSTRGRALHATKTVAAGGVLAVFSPLILVPSLSHL--TIVCSFCLRGV-- 59
Query: 63 SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-- 119
+ C C A +C +C AA SS H+ C ALT + P A R++ A
Sbjct: 60 -TPRPCSRCRAAYYCDAQCQAAAWSSGHAAE-CAALTGAI----PPARR---RREIPAPV 110
Query: 120 RFLIAAYNLAVVNPSQFQILL-AFQGTV---------TDNDTSAAHYLSSLCPPPASTTI 169
R L+ L + LL +G D + A + + +
Sbjct: 111 RALVQLLLLLRSGGGHTRDLLDGLEGHAAQRRRAPGWADMELMAMAGCAFAGMETSEGAV 170
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVD 227
A +L K + NAF + + G G I+L+ + NH CLPNA +V
Sbjct: 171 REAAEILCKLQTNAFHRFDADL-GHVG--------IFLEPTLAMANHSCLPNAF-VQFVG 220
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
A ++R + G EI +SY S RQ L Y F C C RC
Sbjct: 221 RTA------VLRAEQRIQSGDEIEISYTDYTSSLSKRQA-ALAPYHFECRCRRC 267
>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
C C VA +CS KC A C+ L PD+ L L R +
Sbjct: 7 CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGTPS 63
Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
++ L + Y+L Q+++ FQ + + A+ PPA
Sbjct: 64 ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 117
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E A ++ N+F + ++ G+Y S NH C PN +
Sbjct: 118 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 159
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
N +++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D
Sbjct: 160 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 215
Query: 290 NDNDD-ENNEEVMDEDQD 306
D D +E+V E Q+
Sbjct: 216 KDADMLTGDEQVWKEVQE 233
>gi|19075913|ref|NP_588413.1| histone lysine methyltransferase Set5 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626858|sp|O74467.1|SET5_SCHPO RecName: Full=SET domain-containing protein 5
gi|3581881|emb|CAA20779.1| histone lysine methyltransferase Set5 (predicted)
[Schizosaccharomyces pombe]
Length = 319
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 39/86 (45%), Gaps = 15/86 (17%)
Query: 204 GIYLKASFFNHDCLPNAC-----RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
G++L S NHDC PN R D V A + D+ G EI +Y ++
Sbjct: 95 GMFLLGSRMNHDCSPNVKHTWNPRLDQVTVHA----------VRDIEAGEEILTTYIDLH 144
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVE 284
++ RQK LL + F C C C VE
Sbjct: 145 KSHTERQKILLEHFGFKCYCSVCSVE 170
>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
mutus]
Length = 221
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 11/111 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C PN + N +++R + DV G E+ + Y +
Sbjct: 22 GLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRAVRDVEAGEELTICYLDMLMTSEE 74
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDD 314
R+K+L Y F CDC RC+ + D D D +E + ++ E + +D
Sbjct: 75 RRKQLRGQYCFDCDCFRCQTQ----DKDADMLTGDEQVWKEVQESLKKIED 121
>gi|294867794|ref|XP_002765240.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239865235|gb|EEQ97957.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 93
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 10/85 (11%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--GREICLSYFPVNY 259
A G Y + NH C PNA NNT + +I P G EI +SY P++
Sbjct: 11 ATGFYSTVARMNHSCSPNA-------KVIFNNTTNKMEVISLKPINIGDEIRISYVPIDL 63
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
D +TR+ R L DY F C+C RC E
Sbjct: 64 DLNTRRHR-LKDYGFLCNCQRCLTE 87
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/314 (21%), Positives = 114/314 (36%), Gaps = 67/314 (21%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
+G GL + L+ G+++ R P L Y+ + C C K M C C
Sbjct: 14 KGNGLRALVPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66
Query: 73 VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL-----------QAR 120
VA +CS KC A C+ L PD+ L L R + +
Sbjct: 67 VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEETPSESEKL 123
Query: 121 FLIAAYNLAVVNPSQF------------------QILLAFQGTVTDNDTSAAHYLSSLCP 162
+ + +++V + Q+ + FQ + + A+
Sbjct: 124 YHVLHNDISVDDGPHISWWAHINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------ 177
Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
PP+ E A ++ N+F + ++ G+Y S NH C PN
Sbjct: 178 PPSFDVFEAFAKVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-- 224
Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ N +++R + D+ G E+ + Y + R+K+L Y F CDC RC+
Sbjct: 225 -----SIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 279
Query: 283 VEANWSDNDNDDEN 296
+ +D DE+
Sbjct: 280 TQDKDADMLTGDEH 293
>gi|398012025|ref|XP_003859207.1| hypothetical protein, conserved [Leishmania donovani]
gi|322497420|emb|CBZ32495.1| hypothetical protein, conserved [Leishmania donovani]
Length = 622
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 95/228 (41%), Gaps = 30/228 (13%)
Query: 66 SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ---LQAR 120
S C +C+ VA+CSP C+ A H + C + RL ++ +A L + Q L
Sbjct: 259 STCTTCNMVAYCSPACADAH-REQHERFECARMRRLSDMMEGIEARKLDVPEQFFELAYH 317
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAK 178
+ + P ++L T+T + L + P + E TA L
Sbjct: 318 CITTLAGIKAKRPGHERVL-----TLTAHVDEVIQSLHPIGPLMSDLFKGEEETATLYTI 372
Query: 179 DRL---NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+ NA + +P G +++ A ASFFNH C PN C D V A
Sbjct: 373 IGVLCCNALEVTDP--SGLGVAQALHAGNSI--ASFFNHSCTPN-CAIDTVRHAI----- 422
Query: 236 IIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
I R IH G E+ ++Y P + + R++RL Y F C C RC+
Sbjct: 423 ITTRTIH---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQ 467
>gi|167342140|gb|ABZ69018.1| SET domain-containing protein C [Pinus sylvestris]
Length = 141
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC +E
Sbjct: 65 RQVR-GENWGFECQCRRCILE 84
>gi|428672307|gb|EKX73221.1| conserved hypothetical protein [Babesia equi]
Length = 402
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 49/98 (50%), Gaps = 6/98 (6%)
Query: 187 MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
+E IE + + GI+ S NH C PN DYVD +NT I ++ ++
Sbjct: 302 IEKCIESMELFPDIIGLGIFDNISKMNHSCRPNV-EIDYVD----DNTARI-NLLCNISA 355
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
G+E +SY + RQK+L +Y F CDC++C E
Sbjct: 356 GQEATISYINEEDVFEIRQKKLSTNYGFQCDCNKCLEE 393
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++ + NH+C PNA + +N +VR D+P+G EI +Y + + T
Sbjct: 546 GLFPLTAIMNHECTPNASHY------FDNGRMAVVRAARDIPKGGEITTTYTKILWSNLT 599
Query: 264 RQKRLLYDYDFACDCDRC 281
R L +F CDC RC
Sbjct: 600 RGIFLKMTKNFMCDCPRC 617
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 6/87 (6%)
Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
G + ++ A NH C PNA E+ I+V +P+G EI ++Y
Sbjct: 214 GGNEILVRALFPLAGLLNHHCTPNAAHH------FEDGETIVVSATERIPKGTEITMTYA 267
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + R+ L F C C RC+
Sbjct: 268 KLLWSTLARKLFLAMTKHFICQCPRCQ 294
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 102/283 (36%), Gaps = 43/283 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+G GL S + ++ G+++ R P + C C ++ H C C VA
Sbjct: 13 GKGSGLRSRRQVRPGELLYRAEPFAYVVTKEQLGG--VCEQCLQRNEHLHR--CSQCKVA 68
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------PLDRQ---LQARFLIA 124
+C C A C+ L + PD+ L L RQ L R
Sbjct: 69 KYCGKSCQKEAWLDHKRE--CKCLQNVKPNFPPDSVRLAGRIVFKLLRQSACLSERL--- 123
Query: 125 AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP-----PPASTTIELTAALLAKD 179
Y+ + + Q+ + + + YL + PPA ++ K
Sbjct: 124 -YSFKDLQSNAEQLSEEMKEGLGHLAHTLQLYLRAEIQDASHLPPAIDFFQI----FTKV 178
Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
N F + +G G+Y S NH C PN C + +++R
Sbjct: 179 TCNCFTI-------SNGEMQDVGVGLYPSMSLLNHSCDPN-CVIIF------EGYQLLLR 224
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I ++ G E+ +SY S RQK+L Y F CDC C+
Sbjct: 225 SIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDCCLCQ 267
>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
Length = 369
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
C C VA +CS KC A C+ L PD+ L L R +
Sbjct: 3 CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPS 59
Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
++ L + Y+L Q+++ FQ + + A+ PPA
Sbjct: 60 ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 113
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E A ++ N+F + ++ G+Y S NH C PN +
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 155
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
N +++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 211
Query: 290 NDNDD-ENNEEVMDEDQD 306
D D +E+V E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229
>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
Shintoku]
Length = 150
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 8/106 (7%)
Query: 182 NAFGL--MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
N GL + I+ ++ G++ S NH C PN + +Y +NN I
Sbjct: 46 NQLGLDDINETIQKKELYPETMGLGLFNYLSKMNHSCEPN-LQIEY----TKNNIAHIAP 100
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
++ DVP+G E +SY RQ++L +Y F CDC++C +E
Sbjct: 101 LV-DVPRGEEATISYIDEKDSVENRQEKLYKNYGFKCDCNKCILEG 145
>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
Length = 415
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 78/185 (42%), Gaps = 20/185 (10%)
Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPPAS 166
L L R+L A L P F+ L A GT +S + ++ A
Sbjct: 215 GQLELLRRLFAEALYEEVLSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH------AC 268
Query: 167 TTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFD 224
+EL + +++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 269 DALELKSQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA---- 322
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRC 281
+ ENN + V + D+ G EIC+SY +R K L +Y F C C +C
Sbjct: 323 -ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
Query: 282 KVEAN 286
EA+
Sbjct: 382 LAEAD 386
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/101 (22%), Positives = 41/101 (40%), Gaps = 27/101 (26%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 32 KGKGLFATQLIRKGETIFVERPLVAAQFLWNALYHYRACDHCLRALEKAEENAQRLTGKP 91
Query: 68 -------------------CPSCSVAFCSPKCSTAAASSSH 89
CP C V +CS +C AAA H
Sbjct: 92 GQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 132
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/292 (22%), Positives = 114/292 (39%), Gaps = 33/292 (11%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
E+V V EG+GRGL + + G ++ + Y+A+ F + + CH CF++ +H
Sbjct: 2 ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 58
Query: 63 SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
C C A+ C C A + + C A+ + + +P N+ L ++ R
Sbjct: 59 R----CGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRI 109
Query: 122 L-----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
++ L ++ Q + + D +L ++ + +
Sbjct: 110 EREGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPAQSQQFGMQYISHIF 169
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
NAF L + Q G ++V GI+ NHDC PN + V +
Sbjct: 170 GVINCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMF 223
Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
I +R + + G E+ +SY R+K+L Y F C C+ CK
Sbjct: 224 HTQMRIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 275
>gi|363755786|ref|XP_003648109.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
gi|356892145|gb|AET41292.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
Length = 487
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/308 (20%), Positives = 121/308 (39%), Gaps = 56/308 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRD-------SPILLYSALPFINSSSFC-----HNCFRKTMHS 63
+G+ L + + G+++L + +P+ Y+ + + C +C MH+
Sbjct: 121 KGKALCTKVKIAKGELILHEEEPLTFIAPLEKYTLMQLSKACGTCGTSLNQSCHYYIMHN 180
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C +C++ +CS KC T + S Y+ +++ NS + L ++ +
Sbjct: 181 LD--CDNCTIVWCSKKCKTIDTTHS---YLKHPMSKNKICNSQNW--LKFEQFCKENTWY 233
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-------- 175
+AY + ++ Q L+ QG + D ++ +S A+ + + L
Sbjct: 234 SAYAVGMIYARQN---LSDQGAIIDEQFNSLAKISQRIRLEAAESTNIGGTLDSGVDIIT 290
Query: 176 ----LAKDRLNAFGLMEPYIEGQD-------------GRRSVRAYG--IYLKASFFNHDC 216
A + L E + QD GR ++ IY S NH C
Sbjct: 291 TISSSAMWEEGFYLLCEAFPHLQDDNNFDLEAFLCDIGRFNLNQINGQIYPLFSHINHSC 350
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
PN F++ + I D+ G E+ +Y +D ++R++ L ++ F C
Sbjct: 351 EPNT-YFEF------DKHGIKAFARKDIAAGEELLTTYVNPLHDVNSRRRELCVNWGFLC 403
Query: 277 DCDRCKVE 284
+C RCK E
Sbjct: 404 NCRRCKKE 411
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 106/273 (38%), Gaps = 38/273 (13%)
Query: 27 KAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAA 84
K G ++L++ P +L+S + C CF+ S C V +C+ C A
Sbjct: 7 KKGSLILKEKPFACVLHSRY----RAERCDKCFKPGKVLKCSNC--LYVRYCNRFCQKEA 60
Query: 85 ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQARFLIAAYNLAVVNPSQFQILLAF 142
C L + PDAA + + R+LQ +F L+
Sbjct: 61 WPDHQDE--CGKLKAIGTRTVPDAALMISRIIRKLQKGGDYQKGYYTAKFYRRFHDLMTH 118
Query: 143 QGTVTDNDTSAAHY------LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
+ + ++ H+ L SL A + + K +N+F +++ D
Sbjct: 119 EDDIKNDAPRMEHFQSLLVVLRSLVEEAAMPSKAELLQIFGKMCINSFNILD------DE 172
Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE-----IC 251
S+ G+YL S +H C PNA A + T I +R++ D G E I
Sbjct: 173 MNSI-GTGMYLGISIIDHSCRPNA-------LATFDGTTIHLRLLEDY-HGSEVDFSKIF 223
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+SY + R+ RL Y F C+C+RC+ E
Sbjct: 224 ISYIDLMNPAEERKARLRAQYYFECNCERCRDE 256
>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 697
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 9/89 (10%)
Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
G+ G++ + FNH C P R+ ++ NT ++VR I +P G EI +Y
Sbjct: 494 GKSLFIGGGVFPTVALFNHSCNPGVVRY-FI-----GNT-MVVRAIKTIPAGAEISENYG 546
Query: 256 PV--NYDYSTRQKRLLYDYDFACDCDRCK 282
P+ + + R+++L Y F CDC+ CK
Sbjct: 547 PIFTEEEENDRKRKLRLQYWFDCDCEACK 575
>gi|403354753|gb|EJY76937.1| TPR repeat-containing protein [Oxytricha trifallax]
Length = 765
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/280 (20%), Positives = 103/280 (36%), Gaps = 66/280 (23%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+ + +EG+ RG+++ + +K GQ+VL + +F N M S+
Sbjct: 408 IEIGFVEGKNRGIIAKEDIKKGQLVLVEK--------------AFSTN----EMRQDSNF 449
Query: 68 CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
P + F + + ++H L +P+A L ++L N
Sbjct: 450 DPKQCLVFLTDFTNVPLFRNTHIQL----------LENPNAEKL-------VKYLYNGQN 492
Query: 128 ----LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD--RL 181
+ + +Q Q L Q ++T S + P +EL A K+ ++
Sbjct: 493 GKLQVDIQELAQKQNLSESQQSLTYQQMSKMLKYNFCGCPSLLRNLELLKAFERKEEEKI 552
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
++P + SFFNHDC N RF DAA I+ +
Sbjct: 553 EYINAIQPIL------------------SFFNHDCYANTSRFSIGDAAF-----IVAK-- 587
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
D+ +G E+ Y + + R++ + F C C+ C
Sbjct: 588 KDIKKGEELTQFYISLALPFDEREQLTQKAWGFECRCNSC 627
>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
leucogenys]
Length = 369
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)
Query: 68 CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
C C VA +CS KC A C+ L PD+ L L R +
Sbjct: 3 CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPS 59
Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
++ L + Y+L Q+++ FQ + + A+ PPA
Sbjct: 60 ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 113
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
E A ++ N+F + ++ G+Y S NH C PN +
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 155
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
N +++R + D+ G E+ + Y + R+K+L Y F CDC RC+ + D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQTQ----D 211
Query: 290 NDNDD-ENNEEVMDEDQD 306
D D +E+V E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229
>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 236
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 10/96 (10%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
G+Y S NH C PNA F + +N + V ++ G EIC+SY ++
Sbjct: 143 GLYELQSTINHSCSPNAEVTFPH------SNYQLAVVATDNINPGDEICISYLDMCSLSR 196
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
+RQK L +Y F C C +C+ E N D + DE
Sbjct: 197 SRHSRQKILQENYLFTCKCHKCEEEVNLPDETSSDE 232
>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
familiaris]
Length = 419
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+Y+ S NH C+PNA ENN + V + D+ G EIC+SY
Sbjct: 306 GLYVLQSCCNHSCVPNA-----ETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ V + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETVFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCAVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|409048091|gb|EKM57569.1| hypothetical protein PHACADRAFT_251260 [Phanerochaete carnosa
HHB-10118-sp]
Length = 314
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+L AS N C PN + + E ++ R + DV G E+C +Y V
Sbjct: 118 GIFLLASRINSSCTPNVSK-----SWDEIRNVMVFRTLRDVQDGEELCFNYCDVLSAQDE 172
Query: 264 RQKRLLYDYDFACDCDRCKV 283
R++ L+ ++ F C CD C++
Sbjct: 173 RKRTLMEEFGFDCTCDACRL 192
>gi|170115671|ref|XP_001889029.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164635977|gb|EDR00277.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 692
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 43/94 (45%), Gaps = 12/94 (12%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS- 262
GI L FNH C PNA RF + N + + D+P G EIC++Y Y
Sbjct: 490 GIGLTGCRFNHSCSPNA-RFSF----NPTNGHFRIFALSDIPVGEEICVAYLSSRKLYGQ 544
Query: 263 TRQKR---LLYDYDFACDCDRC---KVEANWSDN 290
TRQ+R L Y F C C C K E SD+
Sbjct: 545 TRQQRQGLLRSRYHFTCSCSVCSLPKAEMTLSDS 578
>gi|167342088|gb|ABZ68992.1| SET domain-containing protein C [Pinus taeda]
gi|167342104|gb|ABZ69000.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L SF NH CLPN+ R E + + + V +G EI YF +
Sbjct: 12 GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPVKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ ++ F C C RC VE
Sbjct: 65 RQG-WCENWGFECKCRRCIVE 84
>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
Length = 418
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 75/184 (40%), Gaps = 29/184 (15%)
Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPAST 167
LQ F A Y A+ P F+ L A QG T + + H +L P
Sbjct: 219 HLQRLFTEALYEEALSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDALELKPQD- 277
Query: 168 TIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
+++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 278 ----------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA----- 322
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ ENN + V + D+ G EIC+SY +R K L +Y F C C +C
Sbjct: 323 ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCL 382
Query: 283 VEAN 286
EA+
Sbjct: 383 AEAD 386
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/292 (22%), Positives = 108/292 (36%), Gaps = 49/292 (16%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
+TV +GR LV+ + AGQ VL P + + L + + C CF+++ +
Sbjct: 16 LTVINAAEKGRCLVADRDFYAGQTVLEQDPYV--AVLDGDSRGNRCDACFKQS--PALRR 71
Query: 68 CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNS----PDAANLPLDRQLQARFL 122
C +C F CSP C + C+ L RL L + R+LQA +
Sbjct: 72 CSACKFVFYCSPTCQRSQWKIHQEE--CKVLVRLSTEQKRPTLLLMLRLLVKRELQATGV 129
Query: 123 IAA-----YNLAVVNPSQF------QILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIE 170
+ Y + P+ F ++++ Q V A Y + I
Sbjct: 130 LPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRIS 189
Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAA 229
+ D L G+ G++ S NH C N+ FD A
Sbjct: 190 CNGHTICDDELRPVGI-----------------GLFPVVSIINHSCSSNSLLLFDGKHA- 231
Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+VR + + +G E+ +SY + ++R++ L Y F C C RC
Sbjct: 232 -------VVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRC 276
>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
Length = 420
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 73/172 (42%), Gaps = 26/172 (15%)
Query: 132 NPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-G 185
P F+ L A QG T + + H +L P +L ++ L+AF
Sbjct: 244 TPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP-----------MLQREELDAFID 292
Query: 186 LMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ IE + G + G+Y+ S NH C+PNA + +NN + + + D+
Sbjct: 293 QLYKDIEKESGEFLNCEGSGLYMLQSCCNHSCIPNA-----ETSFPDNNFLLYLTALEDI 347
Query: 245 PQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVEANWSDNDND 293
G EIC+SY +R R L +Y F C C +C +A+ D +D
Sbjct: 348 EAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSCPKCLAQADDPDVTSD 399
>gi|336463860|gb|EGO52100.1| hypothetical protein NEUTE1DRAFT_132845 [Neurospora tetrasperma
FGSC 2508]
Length = 722
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 41/100 (41%), Gaps = 8/100 (8%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ +S NH C+PN R D +I R DV +G E+ Y PV
Sbjct: 466 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGDELFQQYVPVKTQV 518
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
R K + F C C C+ E S + + E V+
Sbjct: 519 DVRNKEFEEGWGFECRCGLCEGERK-SSKERLKKRKEAVL 557
>gi|328871774|gb|EGG20144.1| hypothetical protein DFA_07264 [Dictyostelium fasciculatum]
Length = 535
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 57/145 (39%), Gaps = 28/145 (19%)
Query: 167 TTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF-- 223
T I+L LL K R N FGL Y G +YL S FNH CLPN
Sbjct: 339 TRIDLNILKLLGKVRANYFGLWN-YDNDVGGECYWAGAAVYLGLSLFNHSCLPNCSTIIQ 397
Query: 224 --DYVDAAAENNTD----------------------IIVRMIHDVPQGREICLSYFPVNY 259
D+ ++ NN + + + D+ E+ ++Y P++
Sbjct: 398 SEDFDRGSSANNYERMAVVAPDYMIDKELGRLNPLTFYIVALRDIQVDEELLITYIPLDQ 457
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
R+ +L + F CDC RCK E
Sbjct: 458 KLKERRNQLKSLWLFDCDCRRCKEE 482
>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 45/107 (42%), Gaps = 26/107 (24%)
Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
S NH C PN T +++R + ++P+G E+ +SY V R+ +L
Sbjct: 2 SLLNHSCDPNC-------VIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQ 54
Query: 270 YDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDA 316
Y F CDC RC + D+DE M+A D +A
Sbjct: 55 RQYCFLCDCQRCLL-------------------RDKDEDMLAGDAEA 82
>gi|326518712|dbj|BAJ92517.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 166
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 54/116 (46%), Gaps = 15/116 (12%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDY 225
+LA+ R+NAF + +D S A +Y+ SF+NHDC PN
Sbjct: 53 VLARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWL 112
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+A A+ NT + D+ +G E+ + Y + + + RQK L + F C C RC
Sbjct: 113 ENADAKLNT------LRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRC 162
>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 356
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 7/85 (8%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A GIYL+AS NH C PN A + + + + DV +G E +SY
Sbjct: 190 AVGIYLRASMLNHSCDPNC-------AWVCDGRKLQIMTVKDVKEGDECTISYVDAMKPA 242
Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
RQ L Y F C C +C E N
Sbjct: 243 KVRQADLKESYHFTCKCVKCIEEIN 267
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 7/77 (9%)
Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
NH C DY + + +R I DV G E +SY V RQ +L +Y
Sbjct: 3 NHSC-------DYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEY 55
Query: 273 DFACDCDRCKVEANWSD 289
F C C +C E N D
Sbjct: 56 HFTCKCVKCVEEINRGD 72
>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
Length = 251
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 17 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEAGEELTICYLDMLMTS 69
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
R+K+L Y F CDC RC+ + D D D +E+V E Q+
Sbjct: 70 EERRKQLRDQYCFECDCFRCQTQ----DKDADMLTGDEQVWKEVQE 111
>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
Length = 423
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 113/297 (38%), Gaps = 62/297 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS----------- 65
GR +V+T++++ G+++ RD+P ++ P + S C C R M +++
Sbjct: 21 GRYVVATRNIRQGEVIFRDAPAVVG---PKMASVPVCLGCNRDLMATATVRQQGDPNFRF 77
Query: 66 SICPSCSVAFCSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
C C C +C A A SS+ P +C A S +PL
Sbjct: 78 HQCSRCGWPVCGAECEAADQHRSECSIMAGSSYRPKICPAREEQARRESAYCVIVPL--- 134
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL--SSLCP----------PP 164
R L+ L +P + L + V + S + + S+L P
Sbjct: 135 ---RVLL----LQSHSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGLQQYS 187
Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
A T +E+ A L N F + P + R VRA +Y + +HDC PN
Sbjct: 188 AETILEVCAML----DTNCFEIRLP-----ERRTKVRA--LYPLGAMLSHDCRPNT--KH 234
Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
Y D A +++ D+ +G I SY R+ L F C C+RC
Sbjct: 235 YFDDALR----MVLVATVDIERGATISTSYTQPLLGTLHRRLALKQSKHFDCQCERC 287
>gi|85086843|ref|XP_957767.1| hypothetical protein NCU00296 [Neurospora crassa OR74A]
gi|28918862|gb|EAA28531.1| predicted protein [Neurospora crassa OR74A]
Length = 510
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 60/152 (39%), Gaps = 17/152 (11%)
Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
S NHDC PNA R+D+ ++ + D+ G EI +SY RQK L
Sbjct: 284 SRLNHDCRPNADYRWDWNSNKGGPGLVQVITAVKDILPGEEITISYINPLGSRKARQKLL 343
Query: 269 LYDYDFACDCDRCKV------EANWSD----------NDNDDENNEEVMDEDQDEQMVAS 312
+ F C C+ C EA+W + D +E E++ + S
Sbjct: 344 STAWGFECSCELCSRSGGRVEEADWRVRLIRKVWRLLKEGQDGGSEVERGEEEGGWKMGS 403
Query: 313 DDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
+ E G+ + A M + + + WGT+
Sbjct: 404 RERGEGKGEGDGTKAAELMVSLYEMEGLWGTI 435
>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
CIRAD86]
Length = 616
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 7/73 (9%)
Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
NH C PNA A N +++R + + +G EI + Y V+ +S RQ L Y
Sbjct: 296 NHSCDPNA-------AVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAELKESY 348
Query: 273 DFACDCDRCKVEA 285
F+C C +CK A
Sbjct: 349 FFSCRCSKCKKGA 361
>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
rerio]
Length = 231
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 88/221 (39%), Gaps = 23/221 (10%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
E V V + G+GRGL +T+ AG ++ + P ++++ F +SS CH+CFR+
Sbjct: 2 EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56
Query: 65 SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
C C A +C C A C A+ P + N+ L ++ R
Sbjct: 57 LQRCGQCRFAQYCDKTCQRAGWEE--HKLECAAIK---TYGKPPSENVRLAARILWRMDK 111
Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
+++ L + + I + + D ++L T++ + +L
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
N F + Q G ++V G++ NHDC PN
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPN 206
>gi|384248030|gb|EIE21515.1| hypothetical protein COCSUDRAFT_56730 [Coccomyxa subellipsoidea
C-169]
Length = 494
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+Y++AS NH C PN + T VR + D+ +G E+ +SY + S R
Sbjct: 426 LYVRASLLNHSCRPNC-------VVVRSMTSGSVRALRDIEEGEELTISYIDLGLPPSAR 478
Query: 265 QKRLLYDYDFACDCDR 280
L ++ F C CDR
Sbjct: 479 GDELRKNFFFECTCDR 494
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/307 (21%), Positives = 108/307 (35%), Gaps = 62/307 (20%)
Query: 2 SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKT 60
S ++ TV G G GL +T + G+ VL +P L P + + C CF K
Sbjct: 9 SVITRTVPQPAPNGMGNGLFATAPINPGEDVLHIKTPFLAVLDSPRLEDT--CAGCFGKR 66
Query: 61 MHSSSSICPSCS----VAFCSPKCSTAAASSSHS----------PYVCQALTRLLNLNSP 106
+ + +C+ V +C C + +HS P V R L
Sbjct: 67 QVETGNELKACTGCRVVKYCDRTCQSKDWKFAHSLECPIFKNVKPMVLPNNARALLRVVL 126
Query: 107 DAANLPLDRQLQARFLIAAYNLAVVNPSQFQI---------LLAFQGTVTDNDTSAAHYL 157
A D + F ++ ++ SQ Q+ + + GT D T A++
Sbjct: 127 RTARNKYDSEESKVFDGLETHINEISESQGQLDRINLTARAVKNYSGTEMDEGTVASY-- 184
Query: 158 SSLCPPPASTTIELTAALLAKDRLNAFGL-MEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
+ ++L + L + GL M PY A NH C
Sbjct: 185 --------AAKLDLNSFNLTTSMYDRIGLYMHPY------------------AGLINHSC 218
Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
DY + ++ V+ + + +G +I +SY Y R+ L Y F C
Sbjct: 219 -------DYNSTVGFDGEELYVKAMRPIKKGEQIFISYIDTTTPYDIRRNELKERYFFDC 271
Query: 277 DCDRCKV 283
C +CK+
Sbjct: 272 QCTKCKM 278
>gi|157866065|ref|XP_001681739.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68125037|emb|CAJ03122.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 622
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 97/242 (40%), Gaps = 44/242 (18%)
Query: 59 KTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ 116
K S C +C+ V +CSP C+ A H + C +TRL ++ +A L + Q
Sbjct: 252 KATKEKPSTCTACNMVVYCSPACADAH-REQHERFECARMTRLSDMMQGIEARKLDVPEQ 310
Query: 117 LQARFLIAAYN----LAVVNPSQ--FQILLAFQGTVTDNDTSAAHYLSSLC--------- 161
F AY+ LA + + + +L V D + H + L
Sbjct: 311 ----FFELAYHCITTLAGIKAKRAGHERVLTLTAHV-DEVIQSLHPIGPLMCDLFKGEER 365
Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
P T I + NA L +P G +++ A ASFFNH C PN C
Sbjct: 366 PATLYTIIGVLCC-------NALELADP--SGLGVAQALHAGNSI--ASFFNHSCTPN-C 413
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDR 280
D V A + R IH G E+ ++Y P + + R++RL Y F C C R
Sbjct: 414 AIDTVRHAI-----VTTRTIH---VGEELSIAYIPQLYWPTRLRRERLSESYYFVCRCQR 465
Query: 281 CK 282
C+
Sbjct: 466 CE 467
>gi|380471164|emb|CCF47414.1| TPR domain-containing protein [Colletotrichum higginsianum]
Length = 387
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 37/83 (44%), Gaps = 7/83 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ +++ NH CLPNA R D +IV D+P G EI Y P Y
Sbjct: 146 GLWVHSAYANHACLPNANRSFIGDM-------LIVTATVDIPAGAEITHIYLPPKAAYLL 198
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R + + F C C C EA
Sbjct: 199 RMPQFRSSWGFVCSCALCAGEAK 221
>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
Length = 434
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 104/285 (36%), Gaps = 43/285 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL +++ G+++ P Y L C CF T + C C A
Sbjct: 17 GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
F C+ C + CQA+ P +L AR + +P
Sbjct: 73 FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETV-----RLVARIIARLKAQKERSP 125
Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
S+ +LL + + D D + AH Y L P + L ++
Sbjct: 126 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 181
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F + +D S I+ + NH C PN Y AE VR
Sbjct: 182 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 227
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ D+ G+EI SY + Y + R +RL Y F+CDC C ++
Sbjct: 228 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 272
>gi|168051835|ref|XP_001778358.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670237|gb|EDQ56809.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1301
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R V +G++ SF NH CLP+ C A + V D+ G E+ YF +
Sbjct: 955 RKVEHWGLWWLPSFINHSCLPSCCPILVGKA-------LFVFASRDLRAGDEVTRPYFDI 1007
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ R++ + F C C RCK+E
Sbjct: 1008 FQPFDQRRESSTQGWGFVCYCPRCKLE 1034
>gi|390603364|gb|EIN12756.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 450
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 113/274 (41%), Gaps = 35/274 (12%)
Query: 22 STQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV-AFCSPKC 80
+T+S G ++L D L +A+ ++ C++C R T S C C+ +CS +C
Sbjct: 24 ATRSFAPGDVILVDE--ALTTAILPSDAGRRCNHCLRLT--SDLKRCAGCAAYHYCSTQC 79
Query: 81 STAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL-----IAAYNLAVVNP-- 133
+ S H +C+ ++ + SP L ++ A L I+ + + P
Sbjct: 80 QSQQWSIHHK-RICKHISSFIA--SPAYQGLHEHEKITAVLLSHLCTISQHGKDSLRPAL 136
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
S F+ LL + T +CPPP ST + A N ++ ++
Sbjct: 137 STFEDLLPGPESATP----------PICPPPFSTEEQQKATYYHSKFGNNNFVIHSHL-- 184
Query: 194 QDGRRSVRAYGIYLKASF-FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
+ A+GI+ +S FNH C PNA Y+ + ++ + H + G E+ +
Sbjct: 185 -----TSFAHGIFPMSSICFNHSCAPNAAA-RYILTPHQVPRMEVIALTH-IAAGTEVTI 237
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
Y + RQ+ Y F C C C ++A+
Sbjct: 238 PYLDPALPLANRQQITQITYGFICGCPLCTLQAS 271
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/291 (20%), Positives = 108/291 (37%), Gaps = 49/291 (16%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSI----- 67
E +GRG+ + + K G ++L+ P ++ + F + + CH+CF +
Sbjct: 109 EEKGRGIKAARDFKVGNLILKAEP---FAFVIFDHMAEHVCHHCFNMVVRDRQGQPTTQL 165
Query: 68 --CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL----------PLD 114
C SC A +CS +C A S C A+ R+ + D + +
Sbjct: 166 LRCSSCKFARYCSRECQKKAWSMHKKE--CMAIKRIAPRTASDEVRMVSQILWKQAERGE 223
Query: 115 RQLQARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
++ ++ L L +N F+ + + + Y P + I+
Sbjct: 224 KRAKSEELCRVEELCDHLNDMSFEDV----NKLEEQSKEIGDYFGYENLPDSDEYID--- 276
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAE 231
+ FG++ R ++ G+ + + NHDC PN A
Sbjct: 277 --------HLFGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNV-------VAVS 321
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+I VR I + +G E+ +SY + TR+ L Y F C C C+
Sbjct: 322 CGPNIFVRAIKPIKEGDELFISYIDTSATSETRKNILKDQYYFDCTCKMCE 372
>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
AFUA_2G10080) [Aspergillus nidulans FGSC A4]
Length = 497
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 9/80 (11%)
Query: 204 GIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
GIYL A+ NH C DY + ++I V+ I + G +I +SY Y
Sbjct: 209 GIYLHPYAALINHSC-------DYNAVVGFDGSEIFVKAIRPIATGEQIFISYIDTTYPT 261
Query: 262 STRQKRLLYDYDFACDCDRC 281
RQK L Y F C+C +C
Sbjct: 262 RIRQKELQERYFFTCNCAKC 281
>gi|350290486|gb|EGZ71700.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 488
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
S NHDC PNA RFD+ + ++ + D+ G EI +SY RQK L
Sbjct: 226 SRLNHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLL 285
Query: 269 LYDYDFACDCDRC 281
+ F C C+ C
Sbjct: 286 STAWGFECSCELC 298
>gi|261330294|emb|CBH13278.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 713
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 74/211 (35%), Gaps = 56/211 (26%)
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
P S + L + N FGL +P + + +AS+FNH CLPN CR
Sbjct: 542 PLSVSSSFFDRLCGAVQCNNFGLFDP-------KEDCIGVSVIPEASYFNHSCLPNLCRV 594
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRCK 282
A + ++ +G + + Y V + R++ LL Y F C C RC
Sbjct: 595 MCDGGIA------AFYALREIRKGEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC- 647
Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
N V+D+ D D A++ +C G
Sbjct: 648 -------------NGSSVVDDKTDA------DAAKLR--------------LCGACAARG 674
Query: 343 TLAPLPPSDAT--------PSTVMECNVCGN 365
L PLPP+DA EC VC N
Sbjct: 675 YLRPLPPTDAAILCRWTMDSIATGECTVCHN 705
>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
Length = 633
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 522 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 576
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 577 RHSRHKILRENYLFICSCPKCLAEAD 602
>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 303
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 99 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 263 PKCLAEAD 270
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 115/321 (35%), Gaps = 65/321 (20%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + + K G +V P Y L C CF T S C C A
Sbjct: 16 GKGRGLKAIKHFKVGDLVF-ACPAYAY-VLTVNERGGRCECCF--TRKEGLSKCGKCKQA 71
Query: 75 F-CSPKCSTA---------AASSSHSPYVC-----QALTRLLNLNSPDAANLPLDRQLQA 119
+ C+ +C +A ++ C + + R++ P +R L
Sbjct: 72 YYCNVECQRGDWPMHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTL 131
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALL 176
R L A L + + D D +A H+ S P A+ T L+
Sbjct: 132 RELEA----------HLDKLDNEKNEMNDTDIAALHHFYSRHLDFPDNAALT-----ELI 176
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
A+ N F + +D S ++ + NH C PN Y AE
Sbjct: 177 AQVNCNGFTI-------EDEELSHLGSALFPDVALMNHSCSPNVI-VTYKGTVAE----- 223
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
VR + ++ EI SY + Y R +RL Y F CDC C + + DE
Sbjct: 224 -VRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKEC-------TSKSKDEA 275
Query: 297 NEEVM-------DEDQDEQMV 310
E+ +E++ +QMV
Sbjct: 276 KMEIRQKLSIPPEEEEIKQMV 296
>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
paniscus]
Length = 304
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 99 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 263 PKCLAEAD 270
>gi|336469834|gb|EGO57996.1| hypothetical protein NEUTE1DRAFT_110161 [Neurospora tetrasperma
FGSC 2508]
Length = 449
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
S NHDC PNA RFD+ + ++ + D+ G EI +SY RQK L
Sbjct: 226 SRLNHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLL 285
Query: 269 LYDYDFACDCDRC 281
+ F C C+ C
Sbjct: 286 STAWGFECSCELC 298
>gi|449544639|gb|EMD35612.1| hypothetical protein CERSUDRAFT_124865 [Ceriporiopsis subvermispora
B]
Length = 406
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L S FNH C PN R DAAA+ + R++ V G E+C +Y V +
Sbjct: 209 GLFLLGSRFNHSCTPNVAR--TWDAAAKA---MAFRVLRAVEPGEELCTNYVDVLGTRAE 263
Query: 264 RQKRLLYDYDFACDCDRC 281
R L Y FAC C C
Sbjct: 264 RAAELEEKYAFACLCSVC 281
>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
troglodytes]
Length = 299
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 99 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 263 PKCLAEAD 270
>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 419
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271
Query: 161 CPPPASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++L+AF + IE G + G+++ S NH C+P
Sbjct: 272 ELKPQD-----------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVP 320
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 376 CSCPKCLAEAD 386
>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 678
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 115/323 (35%), Gaps = 67/323 (20%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-V 73
GR ++T+ ++ G+I+ + P L + IN C CF K ++C +CS V
Sbjct: 253 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLIN----CFYCFTKIFVPVPAVCQTCSCV 308
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--LIAAYNLAVV 131
A+CS C A + C L L + L L +Q F L +L
Sbjct: 309 AYCSISCRNKDAKIHENE--CPILPTLWLSKTSINCFLALRIIVQQSFDKLYKLKDLKTN 366
Query: 132 NPSQFQI----------------LLAFQGTVTDNDTSAAHYLSSLC----------PPPA 165
+ +F++ L+ + T D Y++S P
Sbjct: 367 SKDKFEVSASEPYRSDDFKIMFRLVTHEDKRTIEDLFHRTYIASWLLRLLKKGPYFPKHV 426
Query: 166 STTIELTAAL--------------LAKDRLNAFGLMEPYIEGQD-----GRRSVRAYGIY 206
T + A L L + NA + E I D + G+Y
Sbjct: 427 KTPDTVEAKLSDGELYIGGLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLY 486
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTR 264
S FNH C P R+ T ++VR I + G EI +Y + R
Sbjct: 487 PTISLFNHSCNPGIIRYFI-------GTTMVVRAIRSISSGEEISENYGQIFTTTPEKER 539
Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
+++L Y F C+C+ C+ +W
Sbjct: 540 KRKLRLQYFFDCNCEACR--EHW 560
>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Ailuropoda melanoleuca]
gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
Length = 417
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 75/183 (40%), Gaps = 29/183 (15%)
Query: 117 LQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPASTT 168
LQ F A Y A+ P F+ L A QG T + + H +L P
Sbjct: 220 LQRLFTEALYEEALSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDALELKPQD-- 277
Query: 169 IELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYV 226
+++L+AF + IE G + G+++ S NH C+PNA
Sbjct: 278 ---------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-----E 323
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCKV 283
+ ENN + V + D+ G EIC+SY +R K L +Y F C C +C
Sbjct: 324 TSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCLA 383
Query: 284 EAN 286
EA+
Sbjct: 384 EAD 386
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCAVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|72392607|ref|XP_847104.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175618|gb|AAX69751.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803134|gb|AAZ13038.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 713
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 74/211 (35%), Gaps = 56/211 (26%)
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
P S + L + N FGL +P + + +AS+FNH CLPN CR
Sbjct: 542 PLSVSSSFFDRLCGAVQCNNFGLFDP-------KEDCIGVSVIPEASYFNHSCLPNLCRV 594
Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRCK 282
A + ++ +G + + Y V + R++ LL Y F C C RC
Sbjct: 595 MCDGGIA------AFYALREIRKGEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC- 647
Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
N V+D+ D D A++ +C G
Sbjct: 648 -------------NGSSVVDDKTDA------DAAKLR--------------LCGACAARG 674
Query: 343 TLAPLPPSDAT--------PSTVMECNVCGN 365
L PLPP+DA EC VC N
Sbjct: 675 YLRPLPPTDAAILCRWTMDSIATGECTVCHN 705
>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 466
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 7/89 (7%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
++ G+Y + FNH C+PN VD E I+++ + G + ++Y P++
Sbjct: 368 LKGAGLYSLLACFNHSCVPNVA-VSNVDGTHE----IVLKTTRPIQAGEPLTITYIPLSA 422
Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEAN 286
+R ++R L +Y F C C RC EA
Sbjct: 423 GAMSRAERQRQLRNYFFTCHCPRCIKEAG 451
>gi|429855146|gb|ELA30117.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 742
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 7/83 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ +++ NH CLPN C +V + VR D+P+G EI Y P +
Sbjct: 501 GLWVYSAYANHSCLPN-CNRSFVGDM------LFVRATVDIPKGTEITHIYLPPKAAFLL 553
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R + + F C CD C EA
Sbjct: 554 RVPQFRRTWGFECSCDLCDGEAK 576
>gi|343427685|emb|CBQ71212.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 488
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 114/325 (35%), Gaps = 60/325 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDS---PILLYSALPFINSSSFCHNCFRKTMHSSSSI----C 68
+G+GL++T++ I+ + P AL I C +CF + ++ C
Sbjct: 114 KGKGLLATRAFARDDIIFTEDAYIPTPPPHALDQITRGELCAHCFLPISSAPVALAVKDC 173
Query: 69 PSCSVAFCSPKCSTAAASSSHSPY-------------------------VCQALTRLLNL 103
C FC+ C A ++ H+ V +ALTR+L+
Sbjct: 174 QRCKQRFCTTACFRAGMATHHALLCVGHNPGARGLMELVVREKWQSLHSVARALTRVLST 233
Query: 104 NSPDAANLPLDRQLQARFLIAAY-----------NLAVV--------NPSQFQILLAFQG 144
+ A L R +A L AY + A V NP +F
Sbjct: 234 LAYGA----LKRVGEAEELEQAYGDFETVYSRISSFATVSELERRARNPGWSTEKPSFDA 289
Query: 145 TVTDNDTSAAHYLS--SLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA 202
+ + L S PPPA +T T L K + L P + G+ +
Sbjct: 290 LLAQAHAALRTALDPYSPTPPPAISTPLDTQKPLIKHLFDPATL--PKLLGRANINMEKF 347
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
G+Y SF NH C PN + + + + + Q E+ +SY
Sbjct: 348 GGLYSLHSFLNHSCHPNVQIRHMPERGILASMKVAAIALRPIAQNDELVISYIDPTTSLG 407
Query: 263 TRQKRLLYDYDFA-CDCDRCKVEAN 286
RQ L DY F C CD+C E N
Sbjct: 408 RRQLLLYRDYCFGPCTCDKCSHELN 432
>gi|295656875|ref|XP_002789017.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285046|gb|EEH40612.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 442
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYV---DAAAENNTDIIVRMIHDVPQGREICLSYF 255
+VR + L +++ NH CLPNA + + E + V V QG EI + Y
Sbjct: 110 AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNNDKEERYFLTVYACKKVAQGEEITIPYD 169
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D ++RQ+ LL +Y F C C C E
Sbjct: 170 VLYMDRASRQQFLLQEYGFECACKLCGKE 198
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 115/298 (38%), Gaps = 51/298 (17%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFR-------- 58
V+V E G G GL T ++ G+ +L L +N + S+C + +
Sbjct: 5 VSVQEEPGAGLGLGVTADIERGECLLEKRLDLYVLNNELLNEACSYCCSQTKPTKRCAAC 64
Query: 59 KTMHSSSSICPSCSVAFCSPKCSTAAASSSHS--PYVCQALTRLLNLNSPDAANLPLDRQ 116
K +H S IC + +C +S++ P VC+ L RL + D + +
Sbjct: 65 KQVHYCSKICQKQDWSMHKLECKALRNASTNGLLPTVCRLLIRLYSQTQKDQSLFADCKD 124
Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
+A +N A+ + ++ ++A+HY + + +L
Sbjct: 125 HEAEI---QHNTALWSDAELI------------SSAASHY-------TEARDVNAFLSLF 162
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNT 234
K +NA L+ P + A G + ++ NH C PN C F A A
Sbjct: 163 CKLSINAMSLVTPAFD---------AIGTCMDSTLARINHSCQPN-CVFMVEGATAR--- 209
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
+ + + +G + +SY Y+ R L Y F C C +C EA SDN++
Sbjct: 210 ---LVALQTLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSDNEH 264
>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
Length = 420
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA A ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSH 89
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVHKELHQNCPHCQVMYCSAECRLAAAEQYH 132
>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
Length = 352
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 65/160 (40%), Gaps = 24/160 (15%)
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
Q+++ FQ + + A+ PPA E A ++ N+F + ++
Sbjct: 70 QLVMTFQHFMREEIQDASQL------PPAFDLFEAFAKVIC----NSFTICNAEMQEV-- 117
Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
G+Y S NH C PN + N +++R + D+ G E+ + Y
Sbjct: 118 -----GVGLYPSISLLNHSCDPNC-------SIVFNGPHLLLRAVRDIEVGEELTICYLD 165
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
+ R+K+L Y F CDC RC+ + +D DE
Sbjct: 166 MLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 205
>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
Length = 660
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 124/324 (38%), Gaps = 49/324 (15%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
VS V + GR ++ +K G +L + P + S L + + C NCF +T+
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGDELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294
Query: 64 SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
+ CP C+ V +CS +C A+ H Y C + + ++N+ A A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCRQEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351
Query: 115 RQLQARFLI----AAYNLAVVNPSQFQILLAFQGTVTDNDTS-------AAHYLSSLC-- 161
L+ + I L + F+ + + + S A +L+
Sbjct: 352 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTHCLRA 411
Query: 162 -------PPPASTTIELTAAL--LAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
P PA +I + L L + N + E + GR G IY +
Sbjct: 412 GGYFGAEPKPAEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471
Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
FNH C P R+ T I + + + G I +Y P+ + + RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARL 524
Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
Y F C CD C NW D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546
>gi|302918997|ref|XP_003052770.1| hypothetical protein NECHADRAFT_78092 [Nectria haematococca mpVI
77-13-4]
gi|256733710|gb|EEU47057.1| hypothetical protein NECHADRAFT_78092 [Nectria haematococca mpVI
77-13-4]
Length = 563
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 35/186 (18%)
Query: 142 FQGTVTDNDTSAAHYLS---SLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
F+ +TD+ A L+ SL P +T LT+ + A+G+ + +
Sbjct: 55 FRSAMTDDQAVKAELLNKVNSLHPAHKTTFFALTSLQKYEPDHPAWGIF-----CSNALK 109
Query: 199 SVRA-YGIYLKASFFNHDCLPNACRFDYVDAAAENNT--DIIVRMIHDVPQGREICLSYF 255
V+ YG+Y A+ NH C+PNA A N T + + + D+ +G EI + Y
Sbjct: 110 EVKGMYGLYPTAARINHSCVPNA-------HYARNTTLGKLTLHAVKDIAEGDEITIFYL 162
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDD 315
+ + RQK+L F C C C +E E + + DQ Q+ S D
Sbjct: 163 DIYNTRAERQKKLC---GFTCTCSLCSLEG------------ERLKESDQRLQL--SFDL 205
Query: 316 AEVHGD 321
E+ GD
Sbjct: 206 YELLGD 211
>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
Length = 287
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 55 GLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTSEE 107
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 108 RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 140
>gi|167521854|ref|XP_001745265.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776223|gb|EDQ89843.1| predicted protein [Monosiga brevicollis MX1]
Length = 630
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 16/125 (12%)
Query: 166 STTIELTAALLAKDRLNAFGLMEPYIE--GQDGR----RSVRAYGIYLKASFFNHDCLPN 219
+T L LL R N GL + ++ Q+G V G+++ S FNH C+PN
Sbjct: 345 TTVTGLRQHLLRVLRHNQHGLAQVELQPSEQNGEVGVVHQVYGAGLFVHGSLFNHSCVPN 404
Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLYDYDFAC 276
+ D +++ +P E+ +SY P+ ++ RQ +L ++FAC
Sbjct: 405 VHLHFHGD-------ELVATASKPIPANSELTISYGPLAVRDAWHAARQTQLRNTFNFAC 457
Query: 277 DCDRC 281
C C
Sbjct: 458 QCIAC 462
>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
aries]
Length = 369
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 7/93 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C PN + N +++R + DV G E+ + Y +
Sbjct: 137 GLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRTVRDVEAGEELTICYLDMLMTSEE 189
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 190 RRKQLRDQYCFDCDCFRCQTQDKDADMLTGDEQ 222
>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
Length = 239
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 5 GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
R+K+L Y F CDC RC+ + D D D +E+V E Q+
Sbjct: 58 EERRKQLRDQYCFECDCFRCQTQ----DKDADMLTGDEQVWKEVQE 99
>gi|240282313|gb|EER45816.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
Length = 437
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPVNYDYST 263
L +++ NH CLPNA A+ N + + V + +G EI + Y + D +
Sbjct: 115 LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESLYMDIAG 174
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ+ LL +Y F C C C+ E
Sbjct: 175 RQQFLLQEYGFECSCRLCEKE 195
>gi|225684199|gb|EEH22483.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 443
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 3/89 (3%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYF 255
+VR + L +++ NH CLPNA + ++ + + V V QG EI + Y
Sbjct: 110 AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNDDREERYFLTVYACKKVAQGEEITIPYD 169
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D ++RQ+ LL +Y F C C C E
Sbjct: 170 VLYMDRASRQQFLLQEYGFECACKLCGKE 198
>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
Length = 437
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 104/285 (36%), Gaps = 43/285 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL +++ G+++ P Y L C CF T + C C A
Sbjct: 17 GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
F C+ C + CQA+ P +L AR + +P
Sbjct: 73 FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETV-----RLVARIIARLKAQKERSP 125
Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
S+ +LL + + D D + AH Y L P + L ++
Sbjct: 126 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 181
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F + +D S I+ + NH C PN Y AE VR
Sbjct: 182 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 227
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ D+ G+EI SY + Y + R +RL Y F+CDC C ++
Sbjct: 228 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 272
>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 419
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 379 PKCLAEAD 386
>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
anubis]
Length = 258
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 7/95 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 24 GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 76
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 77 EERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 111
>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
abelii]
gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
Length = 258
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 7/95 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + D+ G E+ + Y +
Sbjct: 24 GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 76
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 77 EERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 111
>gi|310791379|gb|EFQ26906.1| TPR domain-containing protein [Glomerella graminicola M1.001]
Length = 749
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 40/93 (43%), Gaps = 7/93 (7%)
Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
G D + + G+++ A++ NH CLPN R D +I D+P G EI
Sbjct: 497 GWDIWKQGMSRGLWVHAAYANHACLPNTNRSFIGDM-------LIATATVDIPAGTEITH 549
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
Y P Y R + + + F C C C+ E
Sbjct: 550 IYLPPKAAYLLRMSQFRHSWGFVCSCALCEGEG 582
>gi|116181336|ref|XP_001220517.1| hypothetical protein CHGG_01296 [Chaetomium globosum CBS 148.51]
gi|88185593|gb|EAQ93061.1| hypothetical protein CHGG_01296 [Chaetomium globosum CBS 148.51]
Length = 703
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 7/83 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ AS+ NHDC+ N R D ++ R D+ +G E+ Y PV
Sbjct: 459 GLWVHASYMNHDCVANTMRSFVGDM-------LVSRATRDIEEGEELFQQYVPVKPLPDV 511
Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
R ++ + F C C C EA
Sbjct: 512 RNRQFKEGWGFECGCTLCTAEAR 534
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/277 (20%), Positives = 101/277 (36%), Gaps = 32/277 (11%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+G GL S + ++ G+++ R P + C CF++ H C C VA
Sbjct: 13 GKGSGLRSRRRVRPGELLYRAEPFAYVVTKEQLGG--VCEQCFQRNEHLHR--CSQCKVA 68
Query: 75 -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
+C C C L N+ N P +L R + + L +
Sbjct: 69 KYCGKSCQKEGWLDHKRECKC-----LRNVK----PNFPXSVRLAGRIV---FKLLRQSA 116
Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN-AFGLMEPYIE 192
+ L +F ++ + + L + + L A + L+ A + + +
Sbjct: 117 CLSEKLYSFSDLQSNAEQLSEEMKEGLRHLAHTLQLYLRAEIQDASHLSPAIDFFQIFTK 176
Query: 193 G-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
+G G+Y S NH C PN C + +++R + ++
Sbjct: 177 VTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPN-CVIIF------EGYQLLLRSVREIQ 229
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
G E+ +SY S RQ++L Y F CDC C+
Sbjct: 230 IGEELTVSYIESLMPTSERQEQLKRQYCFECDCCLCQ 266
>gi|346976798|gb|EGY20250.1| Mcg1p [Verticillium dahliae VdLs.17]
Length = 413
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 43/100 (43%), Gaps = 15/100 (15%)
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI--HDVPQGREICLSYFPVNYD 260
+G++ S NHDC PNA AA + T ++ R D+ G EI +SY
Sbjct: 229 FGVFPAISRLNHDCGPNA-------AAWTDKTGLVHRAFASKDIAAGEEISISYVDALAP 281
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
+ R+ R+ + F C C RC + D D E+
Sbjct: 282 RAERRARMAGSWGFECGCRRC------AAGDESDARVAEI 315
>gi|145502993|ref|XP_001437474.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404624|emb|CAK70077.1| unnamed protein product [Paramecium tetraurelia]
Length = 367
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 110/281 (39%), Gaps = 47/281 (16%)
Query: 27 KAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAAAS 86
K Q+V+R P+ + F ++ C+ C + + + SIC +CS C
Sbjct: 17 KQDQVVMRCDPLAIVLFKEF--RANHCNYCLQGSQTNRCSICKQ--YYYCSVSCQKNDWK 72
Query: 87 SSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTV 146
+ CQ L+++ +P + + R N+ + N F G V
Sbjct: 73 QHKNE--CQLLSKI-------TKEMPCNILILIRLF--QNNIDIQN---------FYGDV 112
Query: 147 ---TDNDTSA-----AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
D +T A Y+ + T +L ++ K LN F + + I G
Sbjct: 113 DKDIDQETYQQVFDCAAYIVKIAQLENETFAKL-LSIQVKIHLNTFTVTD--ING----- 164
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE--NNTDIIVRMIHDVPQG----REICL 252
IY A+F NH C+ + V + N +I+ + Q +E+C+
Sbjct: 165 DTLGIAIYTPANFLNHKCIKTSANQKNVANCSHFFNQRQLIITTNNSFVQNENDPQELCI 224
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
SY + ++ RQK L Y F CDCDRC E S+ ++D
Sbjct: 225 SYGNI-VNFKDRQKFLKAQYGFICDCDRCIQEQTNSEEESD 264
>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
Length = 302
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/269 (23%), Positives = 104/269 (38%), Gaps = 24/269 (8%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GRG+ + ++++ G+ + R+ +L+ + C +C+ + S CS
Sbjct: 51 GRGVFAARNIEKGEELFREHTLLVGPTAHRGRNLRTCIHCYHQPPGESEE-SALCSAGCG 109
Query: 77 SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
P C A S H+ CQ L P ++ R L+ +++ ++ Q
Sbjct: 110 LPVCPDCAPSERHA-VECQ----LFRKWQPKHLSIIEPRALR---ILSVVRCFFLSEPQR 161
Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCP---PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
Q+L A Q TD ++ C P +E + NAF +I+G
Sbjct: 162 QLLYAMQAN-TDRYYMREVQRAAECFEHFPRDPDMLEYFYRTVCAFNTNAFE-SRCHIDG 219
Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
Q+ V ++ A NH C PNA E+ I+V +P G EI +S
Sbjct: 220 QE----VVTRALFPLAGLLNHQCTPNAAHH------FEDGETIVVTATERIPAGAEITMS 269
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
Y + + R+ L F C C RC+
Sbjct: 270 YTKLLWSTLARKIFLGMTKHFMCQCPRCQ 298
>gi|167342116|gb|ABZ69006.1| SET domain-containing protein C [Pinus taeda]
Length = 141
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L S NH CLPN+ R E + + + + +G EI YF +
Sbjct: 12 GLWLLPSIINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64
Query: 264 RQKRLLYDYDFACDCDRCKVE 284
RQ R ++ F C C RC VE
Sbjct: 65 RQGR-CENWGFECKCRRCIVE 84
>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 450
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 4/83 (4%)
Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
A+G++ L + FNH CLPNA + A +I+ + D+ G EIC+ Y
Sbjct: 193 AHGVFPLASRLFNHSCLPNAAARYILSEDAAPRMEIVA--LQDIGAGEEICVPYLDPAL- 249
Query: 261 YSTRQKRLLYDYDFACDCDRCKV 283
+RQ+ Y F C C C V
Sbjct: 250 LQSRQQGFQLTYGFTCTCPSCSV 272
>gi|350295933|gb|EGZ76910.1| hypothetical protein NEUTE2DRAFT_153682 [Neurospora tetrasperma
FGSC 2509]
Length = 724
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ +S NH C+PN R D +I R DV +G E+ Y PV
Sbjct: 468 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGEELFQQYVPVKTLL 520
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
R K + F C C C+ E S+
Sbjct: 521 DVRNKEFEEGWGFECRCGLCEGERKSSEE 549
>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
Length = 680
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 57/120 (47%), Gaps = 20/120 (16%)
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYV 226
IEL + +L+ N ++ P + R+YG I+ S+ NH C PNA F Y
Sbjct: 385 IELYSIVLS----NGHEMLHPL--------TCRSYGMGIFPTGSYLNHSCSPNA--FWYN 430
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
D ++ R + + +G E+ SY V R+K L+ Y F C C++C+ ++N
Sbjct: 431 DEQGM----MVFRSLRPLKKGEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQCQYQSN 486
>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
Length = 419
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271
Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++ L KD IE G + G+++ S NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 376 CSCPKCLAEAD 386
>gi|261332900|emb|CBH15895.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 457
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 5/97 (5%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R GIY NH C PN V A + + + + D+ G E+ +SY
Sbjct: 365 RGQGIYTIGCLLNHSCEPNL----QVLYTAVGDETLSIEALRDIEPGEELNISYVDETLP 420
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA-NWSDNDNDDEN 296
Y RQ L Y F C C +C EA +W N E
Sbjct: 421 YPQRQLILYEHYFFICKCPKCTREAPDWERQVNGSEK 457
>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
Length = 418
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRQDLHQSCPHCRVTYCSAECRLAAAEQYHQ 133
>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
paniscus]
Length = 420
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 379 PKCLAEAD 386
>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
Length = 159
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 178 KDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
+++L+AF + IE G + G+++ S NH C+PNA + ENN
Sbjct: 20 REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-----ETSFPENNFL 74
Query: 236 IIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ V + D+ G EIC+SY +R K L +Y F C C +C EA+
Sbjct: 75 LHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCLAEAD 128
>gi|336272855|ref|XP_003351183.1| hypothetical protein SMAC_08199 [Sordaria macrospora k-hell]
gi|380087872|emb|CCC14032.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 320
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 15/109 (13%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
LA + N GL P I G G++++AS NH C PN ++ E +
Sbjct: 88 FLAISKTNMLGLGSPPIGG----------GLFIEASRINHACKPNTQ-----NSWNERIS 132
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
+ + D+ +G EI +SY Y RQ L + F C C+ C +
Sbjct: 133 RETIHAVRDIKKGEEITISYMGHFASYDERQAFLKDKFKFDCACEVCSL 181
>gi|71747970|ref|XP_823040.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832708|gb|EAN78212.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 457
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 5/97 (5%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R GIY NH C PN V A + + + + D+ G E+ +SY
Sbjct: 365 RGQGIYTIGCLLNHSCEPNL----QVLYTAVGDETLSIEALRDIEPGEELNISYVDETLP 420
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA-NWSDNDNDDEN 296
Y RQ L Y F C C +C EA +W N E
Sbjct: 421 YPQRQLILYEHYFFICKCPKCTREAPDWERQVNGSEK 457
>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
terrestris]
Length = 386
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
G+Y+ S NH C+PNA V+ NN ++++ I D+ EIC+SY +
Sbjct: 289 GLYIFQSSINHSCVPNA----VVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L Y F C C+RC + N
Sbjct: 344 RHSRQKVLSSLYLFHCYCNRCLSQVN 369
>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 517
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 113/284 (39%), Gaps = 38/284 (13%)
Query: 15 GR-GRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRK-TMHSSSSICPSC 71
GR GR LV+ +++ AG++++R+ PI + P C C R + +C C
Sbjct: 13 GRLGRYLVAAKNIAAGEVIIREEPI---AVGPMTYRKDRLCFACLRSLSKIGRQYVCSRC 69
Query: 72 SVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQARFLIAAYN 127
++A CS C T + H+ C L LL N DA + L +L +LI N
Sbjct: 70 NLAPLCSVACET--RTKHHTSDECLILGENRDLLAKNGIDATGILLALRL---WLIKCRN 124
Query: 128 LAVVNPSQFQ--ILLAFQGT--VTDNDTSAAHYLSSL-----CPPPASTTIELTAALLAK 178
AV L GT D + + + + L PP++ EL L
Sbjct: 125 PAVWEQVDHMEDHLHERMGTPVWKDREVNVVNVIRMLRVPDGIEPPSA---ELMQKLCGI 181
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
+N F L P DG + G+Y++A+ HDC N +++ + V
Sbjct: 182 LDVNTFELRSP--GALDG---LLLRGLYVEAALMAHDCRGNT------HLTVDDDFRLTV 230
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + I +Y + R++ L F C+CD CK
Sbjct: 231 YASVAIGENEPILFNYTSSLLGTADRREHLREGKYFECECDLCK 274
>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
harrisii]
Length = 609
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 498 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERS 552
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA
Sbjct: 553 RYSRHKILRENYLFVCSCPKCLAEAE 578
>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
troglodytes]
Length = 415
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
L L R+L F A Y AV P F+ L A GT +S + ++
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266
Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
A T+EL +++L+ F + IE G + G+++ S NH C+PNA
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
+ ENN + V + D+ G EIC+SY +R K L +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378
Query: 279 DRCKVEAN 286
+C EA+
Sbjct: 379 PKCLAEAD 386
>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
mulatta]
gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
Length = 418
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271
Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++ L KD IE G + G+++ S NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 376 CSCPKCLAEAD 386
>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
Length = 418
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRQDLHQSCPHCRVTYCSAECRLAAAEQYHQ 133
>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
Length = 421
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|85114970|ref|XP_964786.1| hypothetical protein NCU00870 [Neurospora crassa OR74A]
gi|28926580|gb|EAA35550.1| predicted protein [Neurospora crassa OR74A]
Length = 724
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ +S NH C+PN R D +I R DV +G E+ Y PV
Sbjct: 468 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGEELFQQYVPVKTLV 520
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
R K + F C C C+ E S+
Sbjct: 521 DVRNKEFEEGWGFECRCGLCEGERKSSEE 549
>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Nomascus leucogenys]
Length = 418
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 70/311 (22%), Positives = 113/311 (36%), Gaps = 50/311 (16%)
Query: 6 ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS 65
+T V E GR +++++ LKAG++VLR+ P + + +S C C ++
Sbjct: 72 DTFKVTEDSVSGRCVIASRDLKAGELVLREPPFV--KVVRRDCASRQCAYCCQQVTERGK 129
Query: 66 SICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
+CS C A + AL +L +++ ++ L R L + A
Sbjct: 130 IEADVPFAVYCSRACQ--AREDALRAAEASALGKLAGISAARDVDIDLLRMLLRLLITRA 187
Query: 126 YNLAVVNPS---------------------------QFQILLAF----QGTVTDNDT--- 151
L + PS Q++ L A + D +
Sbjct: 188 KALGLREPSGDSDSVSRGVDEEGEDGTMGEGLFLRQQWENLYALMHHREAMAPDWISVVR 247
Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
A L L P +E L + +NA GL + G V G++ +
Sbjct: 248 EAGEDLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRD-----DSGANLVIGVGMFPLTAM 302
Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYD 271
NH C PN C F Y E VR + V G E+ + Y + + R++ LL
Sbjct: 303 INHACRPN-CTFVYFGGNLE------VRTLEPVSAGAELSVYYIDLLQSTAARRQELLTS 355
Query: 272 YDFACDCDRCK 282
F C C RC+
Sbjct: 356 KHFLCKCSRCE 366
>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 386
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/325 (18%), Positives = 125/325 (38%), Gaps = 59/325 (18%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK--TMHSSSSICPSC 71
E GR L++T+ ++ G+ +L+ ++ +S C NC ++ ++ S C C
Sbjct: 14 ENEGRYLIATRDIQIGEDLLKCKSYFAVTSETLKTTS--CFNCIKQLPSVIKLSLKCNQC 71
Query: 72 S-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA----Y 126
+ + +C+ +C + H Y C+ +L + N ++ + R ++ Y
Sbjct: 72 NEIWYCNEQCKNENINK-HQHYECKFYKKLKSPKLKVYPNFDIETFTEIRMIVGLLSRYY 130
Query: 127 NLAVVNPSQFQILL-----------AFQGTVTDNDTSAAHYLSSLCPPPASTTIE----- 170
++N + L T+ D + ++ P A I+
Sbjct: 131 QDILLNNKFIEQQLNNNNNNNNDNEQLTNTLDDVFDLVENQVTEESNPAAKERIDSIVEF 190
Query: 171 --------LTAALLAKDRLNAFGLMEPYIEGQDGRRSV------RAYGIYLK-------- 208
L + K +N +E + + RS+ +GI+ K
Sbjct: 191 ISELFNLVLLGSTTTKSIINNDDKIEMIRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVA 250
Query: 209 ----ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+S+FNH C+PN + +++ + ++ + +G ++ +SY ++ R
Sbjct: 251 VSPSSSYFNHSCIPNC-------TDVRDGSNMTFKSLYPIKKGDQLTISYIELDQPIQDR 303
Query: 265 QKRLLYDYDFACDCDRCKVEANWSD 289
+ L Y Y F C C RC ++N D
Sbjct: 304 KDELKYGYYFDCICPRCNGDSNSID 328
>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
Length = 407
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 204 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 260
Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++ L KD IE G + G+++ S NH C+P
Sbjct: 261 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 309
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 310 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 364
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 365 CSCPKCLAEAD 375
>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
Length = 418
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271
Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++ L KD IE G + G+++ S NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 376 CSCPKCLAEAD 386
>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
Length = 419
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 5 [Callithrix jacchus]
Length = 431
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 322 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 376
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 377 RHSRHKILRENYLFVCSCPKCLAEAD 402
>gi|440487007|gb|ELQ66819.1| hypothetical protein OOW_P131scaffold00351g3 [Magnaporthe oryzae
P131]
Length = 782
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 7/109 (6%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ A+ NH C+ + R D I R + D+ G E+ Y PV D
Sbjct: 499 ARGLWPLAAAMNHACVASTVRAFVGDV-------FITRALRDIEPGEELTQQYVPVRADV 551
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
RQ + + F C C C E D + Q E+MV
Sbjct: 552 GARQGQYGQWWGFECGCVLCAAEKGREAGGGGDMVRRRRLLLKQIEKMV 600
>gi|347976375|ref|XP_003437517.1| unnamed protein product [Podospora anserina S mat+]
gi|170940375|emb|CAP65602.1| unnamed protein product [Podospora anserina S mat+]
Length = 715
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 23/135 (17%)
Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR--------RSVRAYGIYLKASFFNHD 215
P + LT LL LN+FG P G R + G++ AS NH
Sbjct: 428 PVIDSFLLTRILL----LNSFGCKFPEKAGALKRPVGIDRLGNLFPSLGLWTFASHMNHS 483
Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYDF 274
CL NA R + N +IVR I D+P EI + Y P N + Q++ + ++ F
Sbjct: 484 CLSNAFR-------SFNGDMMIVRAIADIPVDAEITIQYLNPFNPQH---QQKYIINWAF 533
Query: 275 ACDCDRCKVEANWSD 289
CDC C+ + + SD
Sbjct: 534 VCDCAMCEDDDHTSD 548
>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
caballus]
Length = 417
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/106 (24%), Positives = 42/106 (39%), Gaps = 33/106 (31%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCF------------- 57
+G+GL +TQ ++ G+ + + P++ L++AL + C
Sbjct: 31 AKGKGLFATQLIQKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 58 -------------RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHS 90
RK +H S CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRKDLHQS---CPHCQVMYCSTECRVAAAEQYHQ 133
>gi|389631385|ref|XP_003713345.1| hypothetical protein MGG_10443 [Magnaporthe oryzae 70-15]
gi|351645678|gb|EHA53538.1| hypothetical protein MGG_10443 [Magnaporthe oryzae 70-15]
Length = 770
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 7/109 (6%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G++ A+ NH C+ + R D I R + D+ G E+ Y PV D
Sbjct: 487 ARGLWPLAAAMNHACVASTVRAFVGDV-------FITRALRDIEPGEELTQQYVPVRADV 539
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
RQ + + F C C C E D + Q E+MV
Sbjct: 540 GARQGQYGQWWGFECGCVLCAAEKGREAGGGGDMVRRRRLLLKQIEKMV 588
>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
Length = 513
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 61/296 (20%), Positives = 110/296 (37%), Gaps = 42/296 (14%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT--MH 62
S T + + + GR +V+ + L+AG+ ++ + P ++ P + C +C+
Sbjct: 16 STTYKILQNDKVGRYMVANKELEAGEEIITEMPFIVG---PKAATYPLCLSCYVSWPPTL 72
Query: 63 SSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLP-LD 114
+ +C C CS +C H Y CQ + L + +P L+
Sbjct: 73 NEKPLCSKCGWPVCSTECEN---QPQHKDYECQIFAQANEKFNVQAALEETNENGVPQLE 129
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
R L+ + NP +++ + D A + + S S I +
Sbjct: 130 CITPLRLLLESEK----NPERWKNEV--------KDMEAHNKIRSQKKQWKSDQINIVEY 177
Query: 175 LLAKDRLNAF---------GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
+ + +L F G++E + A IY + NH C+ N C
Sbjct: 178 IRKQLKLERFSEEQIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCH--- 234
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + + I +R +P G E+ SY R++ LL +FAC C RC
Sbjct: 235 --SISPTDYKIRLRTTLKIPVGGELYGSYTHSLLPTMLRREHLLEGKNFACACARC 288
>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
Length = 416
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVEQPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCAVRKDLHQHCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
Length = 417
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVEQPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPDLCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
Length = 391
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/145 (22%), Positives = 52/145 (35%), Gaps = 46/145 (31%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y + S+FNH C PN + + ++++ + D+ QG E+ +Y ++
Sbjct: 273 GLYDQCSYFNHSCQPNIFKVNQTPGG-----ELVMVALRDIEQGEELFYNYIQISMSGEA 327
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTN 323
R K+L Y F C C CK N
Sbjct: 328 RIKKLKESYFFNCQCPGCK----------------------------------------N 347
Query: 324 FP-HAYFFMRYMCDRDNCWGTLAPL 347
P H F +Y+C C G + PL
Sbjct: 348 APSHKQFLDKYLCKVKQCNGVITPL 372
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 56/108 (51%), Gaps = 12/108 (11%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN--SSSFCHNCF----RKTM 61
+T+ + + RGR V++ +KAG +++R+ PI+ + +++ ++FCH C +
Sbjct: 2 ITINQYKDRGRCFVASVPIKAGTVIVRELPII--QTVNYMDPLQNAFCHQCLTYHHQHQQ 59
Query: 62 HSSSSI-CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
HSS + C C V +C+ C A+ + C + +RL + N D
Sbjct: 60 HSSPLLHCSKCKLVWYCNRDC--ASKDADQHELECPSYSRLHSTNLFD 105
>gi|310799799|gb|EFQ34692.1| hypothetical protein GLRG_09836 [Glomerella graminicola M1.001]
Length = 430
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 49/121 (40%), Gaps = 17/121 (14%)
Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
+G Y + S FNHDC PN Y+D +++ + D+ G E+ +SY
Sbjct: 241 FGNYPEVSRFNHDCRPNVAF--YID---DSDLRHYTHAVRDIQAGEELTISYVDSLSARV 295
Query: 263 TRQKRLLYDYDFACDCDRC------------KVEANWSDNDNDDENNEEVMDEDQDEQMV 310
RQ R ++ F C C C ++ W N E +DED E++V
Sbjct: 296 VRQDRARRNWGFGCGCAHCSLPEPFVRASDQRLVRMWEIEQQLSHWNGEAVDEDMIERLV 355
Query: 311 A 311
Sbjct: 356 G 356
>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
Length = 417
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.1 bits (87), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFIERPLVASQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSH 89
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCSVRKDLHQNCPRCQVTYCSAECRLAAAEQYH 132
>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
Length = 412
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 300 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 354
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 355 RHSRHKILRENYLFVCSCPKCLAEAD 380
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 70/305 (22%), Positives = 113/305 (37%), Gaps = 41/305 (13%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+ T+ + E GRGL +++ G +L P + + P++ SS C +C
Sbjct: 30 IPPTLDLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYL--SSHCSSCAAPASPE 87
Query: 64 SSSI--CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL------NLNSPDAANLPLD 114
+ C C V +CS KC A + H P C AL R +L P+ A L
Sbjct: 88 RPQLKRCAKCHVVHYCSQKCQNADWKA-HKPE-CDALQRWATAAPSPDLAVPEEAVRCLG 145
Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLS-------SLCPPPAST 167
R L + N + + + + + + ++ + YL+ L P
Sbjct: 146 RMLWQK----QRNPSSIWSREIDSMQSHRSSLRPESFESHAYLAHALVRYLGLDAPEKLA 201
Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNA 220
+ L +A L+++ NA L P S+ G+ + + NH C PNA
Sbjct: 202 SFGLRSAGDVVDLISRFTTNAITLAAP---------SLTPLGVSVSPAVALVNHSCAPNA 252
Query: 221 CR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
F V + + V + D+ E+ +Y RQK L Y F C C
Sbjct: 253 VVVFPRVSKTVDQEPVMQVIALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCS 312
Query: 280 RCKVE 284
C E
Sbjct: 313 LCAAE 317
>gi|367019312|ref|XP_003658941.1| hypothetical protein MYCTH_62625, partial [Myceliophthora
thermophila ATCC 42464]
gi|347006208|gb|AEO53696.1| hypothetical protein MYCTH_62625, partial [Myceliophthora
thermophila ATCC 42464]
Length = 735
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 8/95 (8%)
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+G RR + A G+++ +S NH C+PN R D +I R D+ G E+
Sbjct: 478 DGSPARRGM-AKGLWVHSSRVNHSCVPNTMRSFVGDM-------LISRAARDIKAGEELF 529
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
Y PV R +L + F C C+ C E+
Sbjct: 530 QQYVPVRTVVDVRNAQLRESWGFECACELCARESR 564
>gi|226293820|gb|EEH49240.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 398
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 3/89 (3%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYF 255
+VR + L +++ NH CLPNA + ++ + + V V QG EI + Y
Sbjct: 66 AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNDDREERYFLTVYACKKVAQGEEITIPYD 125
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D ++RQ+ LL +Y F C C C E
Sbjct: 126 VLYMDRASRQQFLLQEYGFECACKLCGKE 154
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/276 (20%), Positives = 103/276 (37%), Gaps = 30/276 (10%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
E GR +++T+ +KAG +VL++SP L+Y N C C + + C C
Sbjct: 71 ERYGRYMMATKDIKAGDVVLKESP-LVYGPAQITNP--VCVGCLQGLTENQFLECERCGW 127
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
C +C + + + ++ L + +P Q + LA
Sbjct: 128 PVCKRECQDHPSHKAECKFTIARGSKMQLRHFYNPHPV-------YQCLIPLRCLLLAER 180
Query: 132 NPSQFQILLAFQGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
+P+++Q L+ + S ++ L P + + L G+
Sbjct: 181 DPAKWQALIKLESHEEQRRGSEQWRNDREGVAKLIP----RFFKCENKWSEDEILKIVGI 236
Query: 187 MEPYIEGQD-GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
++ + G + + IY AS H C PN + + + +I+ + +
Sbjct: 237 LQ--VNGHEVPLTEPPSVAIYNNASMLEHSCRPN------LSKSFTSKKEIVFWAPNPIK 288
Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
QG + +SY V + + RQ L F C C RC
Sbjct: 289 QGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRC 324
>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
Length = 704
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 126/335 (37%), Gaps = 57/335 (17%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
+++ S+ + + E +GR L +T+ +K G +++ D P + + + + C + K
Sbjct: 271 LTSCSDALALGSDEAKGRKLYATRDIKPGTVLIVDRPFASTTDVAALCRNCLCCHASLKL 330
Query: 61 MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVC-----------QALTRLLNLNSPDA 108
S C C +V FCS C A + H Y C Q LL + A
Sbjct: 331 PESIRVPCEHCQAVQFCSEDCRRRAWNKFHK-YECRIFDYFYAKQAQKTNCLLAYRAVVA 389
Query: 109 ANLPLDRQ--LQARFLIAAYN---------LAVVNPSQFQILLAFQGTVTDNDTS----- 152
A L +++ L FL N LA ++ + + + D D +
Sbjct: 390 AALRNEKEFSLDQEFLKLHENDKDSLDIASLASYESRDYKTVFSLESHCADADPADNLQR 449
Query: 153 AAH---YLSSLCPP-----PASTTIELTAALLAKDRL--------NAFGLMEPYIEGQDG 196
A H + SL P E +LA L NA+ ++E + +
Sbjct: 450 AIHSVFFAKSLIYSLNYFHPEIDNHERHLHILAVALLHNMQAIKCNAYEIVENVRDDETK 509
Query: 197 RRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI--CL 252
R G IY S NH C PN R + N ++V + +P+G EI C
Sbjct: 510 ILEPRNVGGAIYTTVSLTNHSCYPNIVRHSF------PNGTVVVTSLRYIPEGSEILDCY 563
Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
+ +R++ L Y F C C+ C A+W
Sbjct: 564 GQHFLENKRDSRRRLLAEKYYFDCQCEPCL--ADW 596
>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 387
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 81/179 (45%), Gaps = 20/179 (11%)
Query: 131 VNPSQFQILLAFQGTVTDN-DTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFGLME 188
+ P+ F+ LLA GT TS H ++ C S +EL + +A+ ME
Sbjct: 222 LTPTGFKSLLALVGTNGQGVGTSVFHQWANNCKKHLSPDQMELFEQFID----SAYDAME 277
Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
+ G D + ++ S NH C PNA + + N + + I + G
Sbjct: 278 QVV-GVDFLDN-EGSALFQNHSSINHSCFPNAA------SVFDGNHVLRLVAIRMIEPGD 329
Query: 249 EICLSYF-PVNYDYS--TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
EI +SY P D+S TRQK L +Y F C C +C+ + N D +DD++ M+ED
Sbjct: 330 EINISYIAPCELDHSRHTRQKYLQENYVFTCRCIKCEEQINDPDITSDDDSE---MEED 385
>gi|113205415|gb|AAU90317.2| TPR domain containing protein, putative [Solanum demissum]
Length = 522
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 20/92 (21%)
Query: 200 VRAYGIYLKASFFNHDCLPNACR---FDYVDAAAENNTDIIVRMIH---DVPQGREICLS 253
V G+++ +SF NH C PN R DYV MIH D+ G E+ +
Sbjct: 304 VHGIGLWILSSFINHSCDPNVRRSHVGDYV-------------MIHACRDIKAGEELTFA 350
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
YF V + R+++ + F C C RC +E
Sbjct: 351 YFDVFSPFRDREEK-TKSWGFVCKCKRCNLEK 381
>gi|71664708|ref|XP_819332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884628|gb|EAN97481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 382
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 90/245 (36%), Gaps = 40/245 (16%)
Query: 67 ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLI 123
I P+C +CS C T A S HS C + L L+ A+ +R A L+
Sbjct: 161 INPNCHEEYCSRDCRTMALSLYHSRSCCNEGLQALELDMYTQMKASGTSTERNAAAAQLL 220
Query: 124 AAYNLAV-----VNPSQFQILLAFQGTVTDNDTSAAH-------YLSSLCPPPASTTIEL 171
LAV V PS L G + + AH + C S + E
Sbjct: 221 MLRVLAVSVQQQVIPSALSELRILSGRLLFSPRVLAHSFLEIYERFTRFCHTATSISYEE 280
Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
+LA+ N F + +E + L S NH C+ N AAE
Sbjct: 281 MIGVLARVTANCFH-RDTCVE------------LNLSRSMLNHSCVAN---------AAE 318
Query: 232 NNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
++ DV +G E+ ++Y+P + + R K L F C C +C+ E
Sbjct: 319 ELHSGEIKTTRDVARGEELTINYYPQLKHLSFEERSKE-LERRGFECHCSQCRREKGVER 377
Query: 290 NDNDD 294
D ++
Sbjct: 378 RDLNE 382
>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
Length = 687
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 77/334 (23%), Positives = 129/334 (38%), Gaps = 91/334 (27%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
+GRG+ S + ++ + P+ A+ + S + CH CF + + +CPSC++
Sbjct: 121 KGRGMSSPNDISPASLIHAEDPL----AVIIMKSCRDTHCHYCFSEAP-ADVVVCPSCTI 175
Query: 74 -AFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARF 121
+CS +C A C T L N N D A L P +Q+ + R
Sbjct: 176 PIYCSNRCQEKAIGQ----MSCNQNTHLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRH 231
Query: 122 LIAAYNLAVVNPSQF-------------QILLAFQGTVT----------DNDTSAAHYLS 158
A V P+ Q+L+ + T++ D D+SA+ + S
Sbjct: 232 ECGGACWAAVLPADIVLAGRIMAQYIEKQLLVGKRSTISGPNLDLVHHYDQDSSASKFES 291
Query: 159 S-------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR-- 201
LC A ++ L+ + ++N+ ++ +++ DG +++
Sbjct: 292 HIYATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGVKALTKG 349
Query: 202 -----------------AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
A IY+ SFFNH C PN + A + T +I+R +
Sbjct: 350 FSGFSGDVMCSVEQVRVAQAIYMSGSFFNHSCRPN------IHAYFHSRT-LILRSTEYI 402
Query: 245 PQGREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
G I LSY P D RQK L +Y F+C
Sbjct: 403 KAGSPIELSYGPQVGEMDLPERQKSLRENYYFSC 436
>gi|410082399|ref|XP_003958778.1| hypothetical protein KAFR_0H02340 [Kazachstania africana CBS 2517]
gi|372465367|emb|CCF59643.1| hypothetical protein KAFR_0H02340 [Kazachstania africana CBS 2517]
Length = 371
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 84/198 (42%), Gaps = 32/198 (16%)
Query: 92 YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
+VC +L RL N S D+ + + LQ+ + S+F ILL FQ +V +
Sbjct: 197 FVCSSLYRLSNF-SEDSLTMKSFQHLQSN--------ELEKISKFPILLHFQESVFNT-- 245
Query: 152 SAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL-- 207
L L P + + +L + N+FGL + + S YG ++
Sbjct: 246 -----LYILLPNNLKCHLSTDSFRHILGSEYANSFGLWQE----DESLESRECYGYWVLP 296
Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQK 266
+AS+FNH C PN + N + + DV G E+ ++Y +N + R++
Sbjct: 297 RASYFNHSCDPNITK-------TRNGRIMDFILNRDVKVGEELSINYAGTLNLPLTERRE 349
Query: 267 RLLYDYDFACDCDRCKVE 284
+ + F C C +C E
Sbjct: 350 FMRNGWFFDCGCQKCLKE 367
>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
porcellus]
Length = 416
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.5 bits (88), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSH 89
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 132
>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
africana]
Length = 417
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
>gi|328853242|gb|EGG02382.1| hypothetical protein MELLADRAFT_110239 [Melampsora larici-populina
98AG31]
Length = 561
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 21/122 (17%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRM 240
NAFG GR V Y + + S FNH+C PN A FD + +
Sbjct: 345 NAFG-------SHRGREEVLHYAVVPEPSVFNHECRPNSAFYFD------DKTLRVYAHA 391
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN--DDENNE 298
+ D+ G EI ++Y + + RQ + + Y F+C C C + SD ++ D+ E
Sbjct: 392 VRDIALGEEITIAYRAMKASRAGRQAAIAH-YGFSCTCSHCSM----SDEESRASDQRIE 446
Query: 299 EV 300
E+
Sbjct: 447 EI 448
>gi|239606334|gb|EEQ83321.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3]
Length = 447
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
RA G L +++ NH CLPNA + D +IV + +G EI + Y
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D RQ+ LL +Y F C C C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197
>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 516
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 201 RAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI------IVRMIHDVPQGREICLS 253
R G+Y A+ NH C PNA R F + A +N + IV + +G EI S
Sbjct: 241 RLLGLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWS 300
Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
Y P ++ R+ L + F C C RC E
Sbjct: 301 YLPPCSPFNERRAVLSSKFGFTCQCVRCIKE 331
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/304 (22%), Positives = 115/304 (37%), Gaps = 51/304 (16%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
E RGRG+ S + + G+I+ + + +AL ++ C CF + CP+C +
Sbjct: 95 ETRGRGIWSREKRRRGEIIFTE--VQQSAALSTGLLATHCSACFVEASGVPLKRCPTCRI 152
Query: 74 A-FCSPKCST--------------AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
+C +C + A+S SP + + P A L R L
Sbjct: 153 IHYCDSECQSRDWTLHKRECPALQKWATSVPSPSPETSEDKKQGKVIPSDAIRALGRMLW 212
Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---AHYLSSLCP--------PPAST 167
+ L+ Q + + + ++ N+ S+ H + +L +S
Sbjct: 213 RK---QKKGLSGDWAKQVDTMQSHRARLSSNEKSSQLHTHMVHALVRYLGLESLDEFSSY 269
Query: 168 TIELTAALL---AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNAC- 221
I+ A L+ ++ NAF + P ++ G+ + + FNH C PNA
Sbjct: 270 GIDSIAGLVDLVSRFTTNAFTVASP---------TLTPIGVSISPIIALFNHSCAPNAVP 320
Query: 222 ---RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
R + A E +I + D+P E+ +SY RQK L Y F C C
Sbjct: 321 VFPRAPHNAKANEPMASVIT--LRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSC 378
Query: 279 DRCK 282
C+
Sbjct: 379 SLCE 382
>gi|374105643|gb|AEY94554.1| FAAL097Cp [Ashbya gossypii FDAG1]
Length = 368
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 89/224 (39%), Gaps = 41/224 (18%)
Query: 77 SPKCSTAAASSSH-------------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
+PK S S H + +V + L RL +L + D QLQ+ L
Sbjct: 165 TPKLSAMKPSKRHLMIPAIDEEVYSCARFVTECLFRLKHLPNESTMYQAFD-QLQSNEL- 222
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--LTAALLAKDRL 181
++F +LL FQ V +L + P + L +L +
Sbjct: 223 -------QKITEFPVLLEFQLKVYK-------FLKVVLPSGLRDLLSSALFRKILGSEYG 268
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N+FG+ E EG R + Y + +AS+FNH C PN A ++ +
Sbjct: 269 NSFGIWEQN-EGNYSRDYL-GYWVLPEASYFNHSCAPNL-------AKKRVGREMYFVLT 319
Query: 242 HDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
DV G ++C+ Y + + R+ L ++ F C C+RC +E
Sbjct: 320 SDVAAGEQLCIDYKGILDLPVVERRNILHSNWFFDCACERCALE 363
>gi|261190070|ref|XP_002621445.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239591273|gb|EEQ73854.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
Length = 445
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
RA G L +++ NH CLPNA + D +IV + +G EI + Y
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D RQ+ LL +Y F C C C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197
>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
Length = 515
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/293 (20%), Positives = 112/293 (38%), Gaps = 57/293 (19%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
+A+ E GR LV+T+++K +IVL+++P++ A C NC H S C
Sbjct: 45 IAKNEVLGRHLVATRNIKPYEIVLKEAPLMRGPAQISAPVCMGCLNCIEANDHIS---CE 101
Query: 70 SCSVAFCSPKCSTAAASSS--------------------HSPYVCQALTRLLNLNSPDAA 109
C C P+C + + H Y C + R L LN+ + A
Sbjct: 102 KCGWPLCGPECPSLTEHRAECQLTEERGQKVNVTEFNGPHPLYTCVSTVRCLLLNTTNPA 161
Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
N QL+ + +Q++ LA G P T
Sbjct: 162 NAEKFNQLE------SLEQTRRGSNQWKADLASIGQFI---------------PKFFRTQ 200
Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDA 228
+ + ++ + A G+++ I G + S + ++ ASF + C+PN +
Sbjct: 201 KFS----EEEIMRAVGILQ--INGHEVPISDPPHVAVFYTASFTENSCVPNLAK------ 248
Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + ++ ++ + + + Y + + RQ+ L+ F C C+RC
Sbjct: 249 SFNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCACERC 301
>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
Length = 507
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 104/280 (37%), Gaps = 36/280 (12%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
GR LV+ + +++ + P+ + P N C C+ + +C C C
Sbjct: 21 GRYLVAKGATLGHGLLIEELPL---AVGPKCNGPVVCLGCYETDPNPEEELCSECGWPLC 77
Query: 77 SPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
+ +CS A ++H C+ L R L S + LD + R L+A N
Sbjct: 78 A-ECSEQA-DNAHFQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE----AN 131
Query: 133 PSQFQILLA-----FQGTVTDNDT------SAAHYLSSLCPPPASTTIELTAALLAKDRL 181
P ++ +A + D D + A YL C + + EL ++ +
Sbjct: 132 PERWDSEVAPMEHHEEERKKDADVWHADRVNIAQYLRGPCKLASRFSEELIMQVVGVLEV 191
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAF P + ++ H+C+PN R Y I +R +
Sbjct: 192 NAFEARSP--------KGYPLRCLFPYTGILAHNCVPNTSRSIY----PSEGYKIRLRAM 239
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
D+ +G+ + SY + RQK L F C CDRC
Sbjct: 240 VDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRC 279
>gi|171695572|ref|XP_001912710.1| hypothetical protein [Podospora anserina S mat+]
gi|170948028|emb|CAP60192.1| unnamed protein product [Podospora anserina S mat+]
Length = 444
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ S FNHDC PN + +T + VR IH G E+ +SY ST
Sbjct: 202 GVFVDVSGFNHDCRPNV----HYRITNTTHTTVAVRDIHP---GDELTISYIYGIARRST 254
Query: 264 RQKRLLYDYDFACDCDRC 281
RQK L D+ F C C +C
Sbjct: 255 RQKE-LQDWGFECTCSQC 271
>gi|46139505|ref|XP_391443.1| hypothetical protein FG11267.1 [Gibberella zeae PH-1]
Length = 593
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L+AS NH C NA + DY + + V + D+ +G EI ++Y + + T
Sbjct: 104 GVFLQASRINHACDNNAQK-DYNEGIKRHT----VHALRDIEEGEEITITYLGILKNRRT 158
Query: 264 RQKRLLYDYDFACDCDRCKV 283
RQ+ L + F C C+ C +
Sbjct: 159 RQQALRTKFMFTCTCNLCSL 178
>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
Length = 416
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY +
Sbjct: 305 GLFVLQSCCNHSCVPNA-----ETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERS 359
Query: 264 RQKR---LLYDYDFACDCDRCKVEAN 286
R R L +Y F C C +C EA+
Sbjct: 360 RHSRHKILRENYLFNCSCPKCLAEAD 385
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 32 KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 91
Query: 68 -------------------CPSCSVAFCSPKCSTAAASSSHS 90
CP C V +CS +C AAA H
Sbjct: 92 SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133
>gi|327353090|gb|EGE81947.1| hypothetical protein BDDG_04890 [Ajellomyces dermatitidis ATCC
18188]
Length = 447
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
RA G L +++ NH CLPNA + D +IV + +G EI + Y
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169
Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ D RQ+ LL +Y F C C C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197
>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 353
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 17/171 (9%)
Query: 204 GIYLK-ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
GIY K +FNH CL N N + +R + D+ G E+ +SY V +D +
Sbjct: 47 GIYKKPIPYFNHSCLSNC-------ITIFKNQRLYIRTLMDIYPGEELTISYLDVAFDRN 99
Query: 263 TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDT 322
R Y F C C CKV N + ++ NN+ + + ++ + + + + +
Sbjct: 100 ARLAICADQYFFTCTCKLCKV--NIASECHNIFNNDFICTQSENCKKFINYMEIVLMSEL 157
Query: 323 NFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDAIGR 373
+Y + + T L S TV +C +C N ND++ +
Sbjct: 158 ERKISYL-------NKSHFKTFPILKKSTEKNETVWKCMLCKNETNDSVIK 201
>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Cricetulus griseus]
Length = 435
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 324 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 378
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 379 RHSRHKILRENYLFICSCPKCLAEAD 404
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 25/107 (23%), Positives = 43/107 (40%), Gaps = 27/107 (25%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSS 65
VA + G+GL +TQ ++ G+ + + P++ L++AL + C K ++
Sbjct: 44 VASLGYIGKGLFATQLIRKGETIFIERPLVASQFLWNALYRYRACDHCLRALEKAEENAQ 103
Query: 66 SI-----------------------CPSCSVAFCSPKCSTAAASSSH 89
+ CP C V +CS +C AAA H
Sbjct: 104 RLTGKPGQVLPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 150
>gi|290976641|ref|XP_002671048.1| SET domain-containing protein [Naegleria gruberi]
gi|284084613|gb|EFC38304.1| SET domain-containing protein [Naegleria gruberi]
Length = 914
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/286 (20%), Positives = 117/286 (40%), Gaps = 40/286 (13%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
V E GRG+ ++ LK+GQ+VL + P L + +++ + C+NC K +
Sbjct: 281 VITTEQYGRGMYASSELKSGQVVLSEKPFLCVT----LDTENTCNNCLNK-LKKKIPCKN 335
Query: 70 SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
+C + +CS +C + A H + + + + N + +FL L
Sbjct: 336 NCGMEYYCSMECRSEAWDKYHRKLCGFDMRNINKMIVSGSTNSAKNHLYAIKFL--GMLL 393
Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
+V P QG + + ++ L +S + + D + G +
Sbjct: 394 QIVYP---------QGELKQQEIRYVQFIEKLEENNSSFKCLIAWPNILNDLI---GKLS 441
Query: 189 PYIE-GQDGRRSVR-----AYGIYLKAS------FFNHDCLPNACRFDYVDAAAENNTDI 236
++ + G S + + G+ L ++ + NH C+PN V+ EN
Sbjct: 442 YHVSLDKIGNNSFKYGDAMSNGVALSSTLPKTIVYTNHSCVPN------VEYRYENGKCS 495
Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+V + + +G ++ +SY + R + +L Y F C+C++C+
Sbjct: 496 LVAL-RKIKKGEQLFISYVNPSLSAQKRAESIL-PYIFVCECEKCE 539
>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
CCMP1335]
Length = 575
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y AS NH C PNA + + I M V G EI +SY V+
Sbjct: 278 GVYPAASMINHSCRPNAVPTFWFSTPSPPMLQIT--MCKSVRVGDEIAISYCDVSAPRYV 335
Query: 264 RQKRLLYDYDFACDCDRC 281
R++ L+ +Y FACDC C
Sbjct: 336 RREGLIKNYKFACDCSHC 353
>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
Length = 170
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 42/93 (45%), Gaps = 7/93 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G+Y S NH C PN + N +++R + D+ G E+ + Y +
Sbjct: 7 GLYPSISLLNHSCDPNC-------SIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEE 59
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 60 RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 92
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 61/267 (22%), Positives = 95/267 (35%), Gaps = 41/267 (15%)
Query: 33 LRDSPILLYSALPFIN------SSSFCHNCFRK--TMHSSSSICPSC-SVAFCSPKCSTA 83
LR ++ PF + + C CF + + C C ++ +CS C A
Sbjct: 41 LRKGTDVIKGKRPFAHVVEKAKMETVCQTCFVQCEQLRRPLQRCAGCKALRYCSAACQKA 100
Query: 84 AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQIL-LAF 142
H P C AL R+ P+ FL N + +L L
Sbjct: 101 DWKD-HKPE-CAALKRIS----------PVVPATFVMFLARILRKMERNTGEMDVLQLHM 148
Query: 143 QGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR 197
G +D +L P ++ +L N+FG I G +G
Sbjct: 149 PGEPSDPQQQRGLFAILEHLRHFLPDAEKHLLKSAYPVLRITSANSFG-----ISGVEGN 203
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
G+Y S+ NH C PN C + A VR +HD+P +E+ ++Y
Sbjct: 204 NL--GVGLYDTVSYINHSCAPN-CSITFSGVYAR------VRSVHDLPPNQELTIAYIDP 254
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ R+ L + F C+C RC+ E
Sbjct: 255 CDPRAKRRAHLKSQFMFDCECSRCERE 281
>gi|46122751|ref|XP_385929.1| hypothetical protein FG05753.1 [Gibberella zeae PH-1]
Length = 508
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 9/106 (8%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
++L S NH C PNA + + I V + D+ QG+EI ++Y Y R
Sbjct: 91 VFLATSRINHSCKPNA-----QNRWNQGLGKITVHAVEDIEQGQEITITYLGNPEVYEER 145
Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV--MDEDQDEQ 308
QK+L + F C C C + ++ D DD+ +E+ + ED + +
Sbjct: 146 QKKLKNAFGFDCCCRLCSLSP--AERDLDDKLIKEIDHLQEDLENE 189
>gi|408393969|gb|EKJ73226.1| hypothetical protein FPSE_06650 [Fusarium pseudograminearum CS3096]
Length = 620
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
NAFGL + + +D S + G++++AS+ NH C+PNA + D I+ R
Sbjct: 489 NAFGLGQ---QTEDEDISNASTGLWVRASYINHSCIPNAKKDLIGDL-------ILFRAT 538
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
+ G EI +Y + Y RQ ++F C C C V+ SD
Sbjct: 539 RRIASGEEITHAY-DESTSYEARQAAFCRTWNFECRCPLCLVQMKESD 585
>gi|367053077|ref|XP_003656917.1| hypothetical protein THITE_114734 [Thielavia terrestris NRRL 8126]
gi|347004182|gb|AEO70581.1| hypothetical protein THITE_114734 [Thielavia terrestris NRRL 8126]
Length = 699
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 7/83 (8%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
A G+++ AS NH C+PN R D +I R D+ +G E+ Y PV
Sbjct: 453 AKGLWVHASRMNHSCVPNTMRSFVGDM-------LISRATRDIKEGEELFQQYVPVKALP 505
Query: 262 STRQKRLLYDYDFACDCDRCKVE 284
R ++ + F C+C C E
Sbjct: 506 DVRNEQFKKGWGFECECALCASE 528
>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
Length = 555
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/303 (19%), Positives = 117/303 (38%), Gaps = 53/303 (17%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNC--FRKTMHSS 64
+ E +GR + +T LK G + ++ ++++P F+ +C NC ++ +
Sbjct: 9 LIEESDKGRYVEATDILKPGDSLFSENS---FASIPLENFLYKKDYCFNCCKYQISTEQQ 65
Query: 65 SSICPS------------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
+S P C++ FCS +C+ ++ + + L
Sbjct: 66 TSEQPKSTPGFPFQCKYGCNMWFCSNECADGMVHQLECSFISKFV------------ELS 113
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
L A + A + + N ++ G +T+ + S + ++
Sbjct: 114 LKHDCDASISLLALRVVMKNKVASEVFEQTVGKLTNQKSIFIQNNKSFLDKYSKLFDQVY 173
Query: 173 AALLAKD-RLNAFGLMEPYIE-------------GQDGRRSVRAYGIYLKASFFNHDCLP 218
+ +++ L++ E Y+E + R+ + G + KA+ NH C P
Sbjct: 174 LIVKSENSELSSIFNKEEYLEVVCSIFVNSFAGLSNNFNRTPISNGYFYKAALLNHSCEP 233
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
N + N+ + +R++ D+ +G +I SY + RQK LL +F CDC
Sbjct: 234 NT-------FFSIQNSTLEMRVVKDINKGEQIFDSYVDLLLPTIERQKILLKSKNFVCDC 286
Query: 279 DRC 281
RC
Sbjct: 287 SRC 289
>gi|428171215|gb|EKX40134.1| hypothetical protein GUITHDRAFT_113871 [Guillardia theta CCMP2712]
Length = 149
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 53/145 (36%), Gaps = 49/145 (33%)
Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---------------FACDCDRCKVEANWSDND 291
G EI +SY + D RQKRL +Y F C+C RCK+E ++
Sbjct: 7 GTEISISYTSIAADLVHRQKRLRSEYGVAIITSPRNVDFQSGFVCNCQRCKIEKEIGVHE 66
Query: 292 NDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPL---- 347
+E +E E+ D ++ + +Y C C GTLAP+
Sbjct: 67 GRNEGSE---SEEADALLL------------------YLAKYSCTDAECGGTLAPIIQLQ 105
Query: 348 ---------PPSDATPSTVMECNVC 363
+ VMECN C
Sbjct: 106 NEEQGNKRSSKNRKQDEVVMECNRC 130
>gi|429852375|gb|ELA27514.1| hypothetical protein CGGC5_11625 [Colletotrichum gloeosporioides
Nara gc5]
Length = 418
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 8/81 (9%)
Query: 199 SVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
+V YG++ + NHDC PNA R+ A E V D+ G E+ +SY P+
Sbjct: 229 NVSHYGLFTNIARINHDCRPNAVTRYSPRTLALE------VVAYRDIQPGEELSISYSPL 282
Query: 258 NYDYSTRQKRLLYDYDFACDC 278
N Y+ RQ R L ++ F C C
Sbjct: 283 NMLYADRQ-RTLQEWGFNCTC 302
>gi|401417428|ref|XP_003873207.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489435|emb|CBZ24694.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 622
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 10/75 (13%)
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKR 267
ASFFNH C PN C D V A + R IH G E+ ++Y P + + R++R
Sbjct: 402 ASFFNHSCAPN-CAIDTVRHAI-----VTTRTIH---VGEELSIAYIPQLYWPTRLRRER 452
Query: 268 LLYDYDFACDCDRCK 282
L Y F C C RC+
Sbjct: 453 LSEGYYFVCRCQRCQ 467
>gi|400593140|gb|EJP61138.1| SET domain-containing protein 5 [Beauveria bassiana ARSEF 2860]
Length = 574
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L+AS NH C NA + EN + + + D+ +G+EI +SY +Y+
Sbjct: 300 GLFLEASRINHSCRHNA-----QNTWNENIGRLTIHALCDIEEGQEITISYLSRRMEYTE 354
Query: 264 RQKRLLYDYDFACDCDRC 281
R++ L + F C+C+ C
Sbjct: 355 RRRLLQAKFCFDCECELC 372
>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
Length = 239
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 7/95 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + ++ G E+ + Y +
Sbjct: 5 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 58 EERRKQLRDQYCFECDCIRCQTQDKDADMLTGDEQ 92
>gi|356519848|ref|XP_003528581.1| PREDICTED: uncharacterized protein LOC100812233 [Glycine max]
Length = 527
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 8/84 (9%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G++L SF NH C+PNA R D ++V D+ G E+ +YF
Sbjct: 314 GVGLWLLPSFVNHSCVPNARRLHVGDY-------LVVHASKDLKAGEEVTFAYFDPLCGL 366
Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
S R K + ++ C C RC+ E
Sbjct: 367 SKR-KEMSVNWGIHCKCKRCRFEG 389
>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
Length = 239
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 7/95 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + ++ G E+ + Y +
Sbjct: 5 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC+ + +D DE
Sbjct: 58 EERRKQLRDQYCFECDCIRCQTQDKDADMLTGDEQ 92
>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Otolemur garnettii]
Length = 416
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFICSCPKCLAEAD 386
>gi|451845507|gb|EMD58819.1| hypothetical protein COCSADRAFT_348193 [Cochliobolus sativus
ND90Pr]
Length = 338
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 6/86 (6%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
+ ++ A+ FNH C PNAC F + +A+ T + +++DV G+EI +SY + +D
Sbjct: 133 KGAAVFPHAARFNHSCNPNAC-FSW-NASITKET---IHVMNDVAAGKEITISYCDMIHD 187
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R L + Y F CDC C + N
Sbjct: 188 KPSRAYELKH-YGFVCDCPACAGDEN 212
>gi|440635523|gb|ELR05442.1| hypothetical protein GMDG_01737 [Geomyces destructans 20631-21]
Length = 265
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 41/84 (48%), Gaps = 12/84 (14%)
Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
R+ + GI+L AS FNH C P R+ Y + +N ++I DV +G E+ + Y P
Sbjct: 170 RAGSSSGIFLVASRFNHSCNP-VIRYSYDN----SNRELIFATKRDVKEGEELTIMYTP- 223
Query: 258 NYDYSTRQKRLLYDYDFACDCDRC 281
+ L DY F CDC C
Sbjct: 224 ------NSQHLSMDYGFLCDCGVC 241
>gi|169604975|ref|XP_001795908.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
gi|111065447|gb|EAT86567.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
Length = 367
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 5/80 (6%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
G++L+A NH C NA F +N + + + D+ QG EI +SY + +
Sbjct: 135 GLFLQACRINHACDNNAQNF-----WNDNIDKLTIHAVRDIGQGEEITISYLSSSQNPQA 189
Query: 264 RQKRLLYDYDFACDCDRCKV 283
RQ+ L +++ F+C C C +
Sbjct: 190 RQQELQHNFKFSCACRLCSL 209
>gi|347831206|emb|CCD46903.1| similar to MYND domain protein [Botryotinia fuckeliana]
Length = 545
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 109/285 (38%), Gaps = 36/285 (12%)
Query: 19 GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
G+V+T+ + A + VL D P L ++S+ C C + + C C +CS
Sbjct: 272 GVVATRDILAQETVLVD-PTLA----AVVDSADRCPACCGPFLDKIENSC--CKTLYCSS 324
Query: 79 KCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQARFLIAAYNLAVVNPSQ 135
CS A S H+ + L LL S +N + +L R L + V +P +
Sbjct: 325 SCSQNALDSYHTIVCGKDLDFLLGTESESLSNSRESSMGSKLFLRVLALSLKEDVASPLK 384
Query: 136 FQILLAFQGTVTDNDTSAA------HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
++ N+ H ++ P EL + A + + L
Sbjct: 385 TSLISRLTPAYNPNNPQLVVLNFKDHIIT-----PIRILRELGIDVFANSAYDTWVLHTI 439
Query: 190 YIEGQDGRRS-----VRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHD 243
Y Q+ + + G+ S FNH C PN R D + N+ + + D
Sbjct: 440 YCRLQNNKHGQTFDDICGTGVNPLYSMFNHSCDPNIDWRHD------DENSTVTMFAERD 493
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC---KVEA 285
+ G E+ +SY R+++L+ + C C +C K+EA
Sbjct: 494 IKNGEEMFISYIGKGKGLEERRRKLMPWFGMDCACHKCDEEKLEA 538
>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 181
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
V G+Y + NH C PNA +A+ + + V+ + V G E+ SY N
Sbjct: 89 EVLGSGLYRGVAVTNHSCSPNA------EASFGGSRCLRVKSLRPVHAGEEVFQSYIDEN 142
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEA 285
RQ +L Y FAC C +C+ EA
Sbjct: 143 LPLVERQSKLRQAYGFACRCGKCRTEA 169
>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
[Megachile rotundata]
Length = 523
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/291 (20%), Positives = 106/291 (36%), Gaps = 52/291 (17%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
+AE GR ++ +++KAG+IVL+D L+ A P NS C C+ +++ C
Sbjct: 48 LAESSSLGRYYIAARNIKAGEIVLKDDRPLV--AGPMHNSVPVCLGCYMVLHENTAVPCT 105
Query: 70 SCSVAFC-----------------SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
C C S K S H Y C + R L+L +
Sbjct: 106 KCGWPLCQNCKEHGTECDFTSSRRSDKVSITKFGYPHPSYQCINVIRALSLKDNN----- 160
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
I +YN + S + + + + ++ +++ +
Sbjct: 161 ----------IESYNKLLTLESHSDKIKNKETFIFEESSNLVNFIKRFFK---------S 201
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAE 231
+ +D G+++ + G + + Y +Y AS H+C N C + D
Sbjct: 202 EDIPEEDIRKIIGVLQ--VNGHEVPLTDPPYVAVYELASLLEHNCKAN-CSKSFTDTGG- 257
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+I+ + +G I + Y + R+ LL F C CDRC+
Sbjct: 258 ----LIIHAATPITKGDHISICYTDPLWGTMNRRHHLLETKYFECTCDRCR 304
>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Otolemur garnettii]
Length = 300
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 190 GLFVLQSCCNHSCVPNA-----ETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERS 244
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 245 RHSRHKILRENYLFICSCPKCLAEAD 270
>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
Y34]
gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
P131]
Length = 542
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 104/278 (37%), Gaps = 32/278 (11%)
Query: 17 GRGLVSTQSLKAGQIVLR-DSPILLYSALP-FINSSSFCHNCFRKTMHSSSSICPSCSVA 74
GR L +T+ + G ++ D+P+ +A P +S+ CH+C + + +V
Sbjct: 28 GRSLRATRDFQPGDLIAEFDNPL---AAFPDAARASTTCHHCLDQNAKVFGCMGCEKAVK 84
Query: 75 FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV--- 131
+CS +C A HS C+ ++ D P+ +Q A V
Sbjct: 85 YCSSECQRANWKLVHSKE-CKVFRKVQTAVGKDWLPTPVRTLVQLLVRWAEVQQLVSQLE 143
Query: 132 -NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
N +F+ + + HY + L+ +L K + N+F +
Sbjct: 144 GNEDRFKERKQLWEDMKLQAYAGIHYAGR---KEDDANLFLSLDVLCKIQTNSFDRFDA- 199
Query: 191 IEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
A G +L + NH C+PNA + A +R + QG
Sbjct: 200 --------DTGASGTFLDPVLAMANHSCVPNAVVLFWRRKA-------YLRAEMPIKQGS 244
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
EI +SY RQ+ L + Y F C C RCK + N
Sbjct: 245 EISISYIDYTKPVRFRQEDL-WLYHFTCKCPRCKDDLN 281
>gi|45184727|ref|NP_982445.1| AAL097Cp [Ashbya gossypii ATCC 10895]
gi|44980073|gb|AAS50269.1| AAL097Cp [Ashbya gossypii ATCC 10895]
Length = 368
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 89/224 (39%), Gaps = 41/224 (18%)
Query: 77 SPKCSTAAASSSH-------------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
+PK S S H + +V + L RL +L + D QLQ+ L
Sbjct: 165 TPKLSAMKPSKRHLMIPAIDEEVYSCARFVTECLFRLKHLPNESTMYQAFD-QLQSNEL- 222
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--LTAALLAKDRL 181
++F +LL FQ V +L + P + L +L +
Sbjct: 223 -------QKITEFPVLLEFQLKVYK-------FLKVVLPIGLRDLLSSALFRKILGSEYG 268
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N+FG+ E EG R + Y + +AS+FNH C PN A ++ +
Sbjct: 269 NSFGIWEQN-EGNYSRDYL-GYWVLPEASYFNHSCAPNL-------AKKRVGREMYFVLT 319
Query: 242 HDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
DV G ++C+ Y + + R+ L ++ F C C+RC +E
Sbjct: 320 SDVAAGEQLCIDYKGILDLPVVERRNILHSNWFFDCACERCALE 363
>gi|116200772|ref|XP_001226198.1| hypothetical protein CHGG_10931 [Chaetomium globosum CBS 148.51]
gi|88175645|gb|EAQ83113.1| hypothetical protein CHGG_10931 [Chaetomium globosum CBS 148.51]
Length = 381
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 7/87 (8%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYST 263
+YL AS NH C+PNA D A++++T +++R D+ G EI SY +T
Sbjct: 155 LYLNASRLNHSCIPNA------DPASKDDTSYMVMRANRDINAGEEITSSYILRVVPRAT 208
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDN 290
R ++L + F C C C + +SD+
Sbjct: 209 RLQQLSKGWGFTCQCPACDPKNPFSDS 235
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/273 (21%), Positives = 108/273 (39%), Gaps = 17/273 (6%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
VA E GR LV+T+++K +IVL+++P++ P S C C + C
Sbjct: 45 VAHNELLGRHLVATRNIKPYEIVLKEAPLM---RGPAQISVPVCMGCLNAIEPNDHITCD 101
Query: 70 SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
C C P+C + + +A + +N+ + + PL + +
Sbjct: 102 KCGWPLCGPECQALDEHKAECQ-LTEARGQKVNVQEFNGPH-PLYTCVSTVRCLLLSETN 159
Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
N +FQ L + + T ++ A S P T ++ + A G ++
Sbjct: 160 PANAEKFQQLESLEQTRRGSNQWKADLASIGQFIPKFFR---TQKFSEEEIMRAVGALQ- 215
Query: 190 YIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
I G + S + ++ ASF + C+PN + + N I+ + +G
Sbjct: 216 -INGHEVPTSDPPHVAVFYTASFTENSCVPNLAK------SFTKNGHCILWAPKAIKKGA 268
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + Y + + RQ+ L+ F C C+RC
Sbjct: 269 NLSICYSDAVWGTADRQRHLMQTKLFKCTCERC 301
>gi|365760270|gb|EHN02002.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 524
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 5/80 (6%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
+YL SF NHDC PNA Y++ E+ ++ + + +G +IC++Y + R
Sbjct: 356 MYLWISFINHDCEPNA----YIEQTEEHE-ELKLHARKPIKKGEQICITYVNPLHGVRLR 410
Query: 265 QKRLLYDYDFACDCDRCKVE 284
++ L ++ F C CDRC+ E
Sbjct: 411 RRELRVNWGFLCQCDRCQNE 430
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 11/81 (13%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLY----SALPFINSSSFCHNCFRK----TMHSSSSI 67
RGRGL + + GQI+L+++ ++Y L FI++ C C + T H
Sbjct: 121 RGRGLFAEKDFAKGQIILKENKPIVYVPPLDKLFFISNGKACARCGKALYDLTQHKIMVH 180
Query: 68 ---CPSCSVAFCSPKCSTAAA 85
C C +CS KC A A
Sbjct: 181 YLDCEVCKAIWCSEKCKKAHA 201
>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
Length = 418
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 75/191 (39%), Gaps = 32/191 (16%)
Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
L L R+L F A Y AV P F+ L A QG T + + H +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271
Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
P +++ L KD IE G + G ++ S NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGFFVLQSCCNHSCVP 320
Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
NA + ENN + V + D+ G EIC+SY +R K L +Y F
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375
Query: 276 CDCDRCKVEAN 286
C C +C EA+
Sbjct: 376 CSCPKCLAEAD 386
>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
Length = 300
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
G+++ S NH C+PNA + ENN + V + D+ G EIC+SY
Sbjct: 189 GLFVLQSCCNHSCVPNA-----ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERS 243
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 244 RHSRHKILRENYLFNCSCPKCLAEAD 269
>gi|357474929|ref|XP_003607750.1| SET domain protein [Medicago truncatula]
gi|355508805|gb|AES89947.1| SET domain protein [Medicago truncatula]
Length = 540
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 39/90 (43%), Gaps = 16/90 (17%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF----PV 257
G++L SF NH C PNA R D +IV D+ G EI +Y P+
Sbjct: 321 GVGLWLLPSFVNHSCCPNARRLHVGDY-------LIVHASRDLKAGEEITFAYLDPLSPL 373
Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
N ++K L + C C RCK E +
Sbjct: 374 N-----KRKELSVTWGILCKCKRCKFEGEF 398
>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 793
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 77/334 (23%), Positives = 130/334 (38%), Gaps = 91/334 (27%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
+GRG+ S + ++ + P+ A+ + S + CH CF + + +CPSC++
Sbjct: 227 KGRGMSSPNDISPASLIHAEDPL----AVIIMKSCRDTHCHYCFSEAP-ADVVVCPSCTI 281
Query: 74 -AFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARF 121
+CS +C A C T L N N D A L P +Q+ + R
Sbjct: 282 PIYCSNRCQEKAIGQ----MSCNQNTHLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRH 337
Query: 122 LIAAYNLAVVNPS-------------QFQILLAFQGTVT----------DNDTSAAHYLS 158
A V P+ + Q+L+ + T++ D D+SA+ + S
Sbjct: 338 ECGGACWAAVLPADIVLAGRIMAQYIEKQLLVGKRSTISGPNLDLVHHYDQDSSASKFES 397
Query: 159 S-------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR-- 201
LC A ++ L+ + ++N+ ++ +++ DG +++
Sbjct: 398 HIYATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGVKALTKG 455
Query: 202 -----------------AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
A IY+ SFFNH C PN + A + T +I+R +
Sbjct: 456 FSGFSGDVMCSVEQVRVAQAIYMSGSFFNHSCRPN------IHAYFHSRT-LILRSTEYI 508
Query: 245 PQGREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
G I LSY P D RQK L +Y F+C
Sbjct: 509 KAGSPIELSYGPQVGEMDLPERQKSLRENYYFSC 542
>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 413
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 54/115 (46%), Gaps = 18/115 (15%)
Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYV 226
I+L L+ K R N FG+ ++ + G+ + +S+FNH C+PN
Sbjct: 239 IKLIRPLIQKVRCNQFGIWT---------KNDKCIGMAVSPSSSYFNHSCIPNC------ 283
Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + +D+ + + + +G +I +SY ++ R+ L + Y F C C RC
Sbjct: 284 -ESVRDGSDMTFKSLFPIKKGDQINISYLALDKSTKRRRDYLKFGYYFHCQCPRC 337
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 98/283 (34%), Gaps = 47/283 (16%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + Q + G ++ P Y L + C CF T S C C A
Sbjct: 16 GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71
Query: 75 F-CSPKCST-------------AAASSSHSPYVCQALT-RLLNLNSPDAANLPLDRQLQA 119
F C+ +C + +P LT R+L P ++ L
Sbjct: 72 FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPPPASTTIELTAALLAK 178
+ S L + + +D +A H+ S P S ++ L A+
Sbjct: 132 KEF----------ESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDSDSL---VVLFAQ 178
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
N F + +D S I+ + NH C PN Y AE V
Sbjct: 179 VNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------V 224
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
R + ++ G E+ SY + Y R RL Y F C+C C
Sbjct: 225 RAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|170040667|ref|XP_001848113.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864252|gb|EDS27635.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 560
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 74/314 (23%), Positives = 121/314 (38%), Gaps = 56/314 (17%)
Query: 4 VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
+++ + + + G GRGLV+ ++ KAG ++L + IL A+ F +C C HS
Sbjct: 175 LAKGIAMKQYSGSGRGLVAERNFKAGDVILNEKAILSVVAVAF--KYLYCSRCGIPNQHS 232
Query: 64 SSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP---------- 112
CP+C +CS +C + +H Y C ++ N+ + D +NL
Sbjct: 233 LIP-CPNCVHCMYCSEECLAEDSRMAHR-YECSFGAQMANI-AYDGSNLATKLFFYGLTL 289
Query: 113 ----LDRQL---QARFLIAA--YNLAVVNPSQFQILLAFQGT---------VTDNDTSAA 154
LD+ + + +I A + L N + F T V+ +AA
Sbjct: 290 FNNDLDQMMKYCEKNSMIGADPFKLDHRNYDPLEEFKIFHETKLHLSPYIEVSFRLCAAA 349
Query: 155 HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP---YIEGQDGRRS---VRAYGIYLK 208
Y L P + A + LN L + Y + R+S G+
Sbjct: 350 TYYVLLKQPSIKALVTTKAQ--KRSLLNCLYLYQRVAMYFAWE--RKSPPDCTVTGLLSY 405
Query: 209 ASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
NH C PNA +DY I +I V +G++I +S P + +Q
Sbjct: 406 GKLVNHSCNPNAMILYDY--------GQIKCILIRPVRKGQQITISLGPAWFVIGNQQAS 457
Query: 268 LLYDYDFACDCDRC 281
D F C C+ C
Sbjct: 458 ---DLSFTCQCEVC 468
>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
Length = 239
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 18/121 (14%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + ++ G E+ + Y +
Sbjct: 5 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDEDQDEQMV 310
R+K+L Y F CDC RC+ E W + + EE+ + EQ++
Sbjct: 58 EERRKQLRDQYCFECDCIRCQTHDKDADMLTGDEQIWKEVQESLKKIEELKAHWKWEQVL 117
Query: 311 A 311
A
Sbjct: 118 A 118
>gi|302886135|ref|XP_003041958.1| hypothetical protein NECHADRAFT_122942 [Nectria haematococca mpVI
77-13-4]
gi|256722865|gb|EEU36245.1| hypothetical protein NECHADRAFT_122942 [Nectria haematococca mpVI
77-13-4]
Length = 647
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 67/153 (43%), Gaps = 19/153 (12%)
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
NPSQ + +L G + + L + P T ++ + + NAFG P
Sbjct: 439 NPSQIERVLDLFG----DYSGLGKKLVEVDGKPVVDTFQIHDII----QRNAFG---PGQ 487
Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
+ +D S + G+++ AS+ NH C+PNA + DY+ +IVR + G EI
Sbjct: 488 QTEDEDISNASTGLWIMASYMNHSCVPNAKK-DYIGDL------MIVRATRRIIAGEEI- 539
Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
L + + D+ R L + F C C C E
Sbjct: 540 LQRYDESTDHDVRTASLQRTWGFRCKCSLCLAE 572
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 66/294 (22%), Positives = 111/294 (37%), Gaps = 56/294 (19%)
Query: 17 GRGLVSTQSLKAG-QIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--------SSI 67
G+G+ +T +K G +++ D+P++ + + + C C+ + S
Sbjct: 46 GQGVFATIDVKKGADVLVVDNPLI--ALVEEAQLQNICSGCYDTSKAGSIDNRRPDLVKA 103
Query: 68 CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL---- 122
C C V + C C + HS C+ L P LP+ L+ L
Sbjct: 104 CTRCKVVYYCDKNCQRKDWKAGHS-LECKTYAELY----PKILPLPVRAVLRILMLRRAD 158
Query: 123 -----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL----TA 173
+ + +LA+ P + + ++ + H L + S +L A
Sbjct: 159 KIIPEVYSESLALTYPKIYDCV--------ESQETKDHLLMAKALREYSNLTDLDNKWVA 210
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAE 231
+L + N+F L + GRR G+Y + FNH C PNA Y A +
Sbjct: 211 SLFGRLNANSFCLTSAF-----GRRR----GVYFHPGPARFNHSCDPNA---SYSFAKGK 258
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+R I + + +I +SY Y TR+ L Y F C C +C EA
Sbjct: 259 ----CYIRAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCLHEA 308
>gi|428162750|gb|EKX31864.1| hypothetical protein GUITHDRAFT_121938 [Guillardia theta CCMP2712]
Length = 368
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY--STRQK 266
AS FNH C PNA + + D+ VR + + G+EI + Y Y + R++
Sbjct: 118 ASAFNHSCKPNAAIY-----PCPTSDDVQVRALRQIRTGQEIAICYDSTLYALPKAMRRQ 172
Query: 267 RLLYDYDFACDCDRC 281
LL+ + F CDC RC
Sbjct: 173 ELLHRWGFECDCPRC 187
>gi|401625526|gb|EJS43529.1| set5p [Saccharomyces arboricola H-6]
Length = 526
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
IY SF NHDC PNA Y++ A E++ ++ + + +G +IC++Y + R
Sbjct: 358 IYHWHSFMNHDCEPNA----YIEQAEEHD-ELKLHARRPIKKGEQICVTYVNPLHGVRLR 412
Query: 265 QKRLLYDYDFACDCDRCKVE 284
++ L ++ F C CDRC+ E
Sbjct: 413 RRELRVNWGFLCQCDRCQNE 432
>gi|303312507|ref|XP_003066265.1| hypothetical protein CPC735_054900 [Coccidioides posadasii C735
delta SOWgp]
gi|240105927|gb|EER24120.1| hypothetical protein CPC735_054900 [Coccidioides posadasii C735
delta SOWgp]
gi|320033649|gb|EFW15596.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 544
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 92/221 (41%), Gaps = 21/221 (9%)
Query: 86 SSSHSPYVCQALTRLLNL-NSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQG 144
S+ SP++ + T L+ L SP + + Q L A + + ++ Q
Sbjct: 107 GSTPSPFLRENATPLIRLLASPGIGWTAAEFKKQGTGLAAMMEMPRCQIITHEPVMVVQD 166
Query: 145 TVTDNDTSAAHYLSSLCPP------PASTTIELTAALLAKDRLNAFGLMEPYIEGQ---- 194
T+ S +Y L P T T ++++L FG Y E
Sbjct: 167 TLESMGGSIKNYNKMLAEKIKEVNCPNFTHFFFTLKKDSEEKLGKFGA---YFEKNCIPC 223
Query: 195 --DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE--NNTDIIVRMIHDVPQGREI 250
+ VR G + +S NH C+PNA + ++A E +T + +R ++ +G EI
Sbjct: 224 TLPSGQKVRVLGHW--SSKINHSCVPNAQQ-TLIEANTEGVKSTFLNIRACREISRGEEI 280
Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
++Y ++ D + R++ + + F C C+ C+ + D D
Sbjct: 281 TVAYDEIHLDTTERKQHMKEHFGFECACEYCESQDRELDAD 321
>gi|353236058|emb|CCA68060.1| hypothetical protein PIIN_01927 [Piriformospora indica DSM 11827]
Length = 798
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 12/87 (13%)
Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
DG+ +V IYL S+FNH C NA R + DA I++R + + G EI +SY
Sbjct: 522 DGKSTV----IYLLPSYFNHSCDANATRHFFGDA-------IVIRALTQIRAGDEITISY 570
Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
N Y R L + CDC C
Sbjct: 571 IYANEPYHERAN-TLKKWFPTCDCSLC 596
>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
Length = 485
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 6/84 (7%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R I+ S+ NH+C PN V + T + + D+ +G E+ +Y +
Sbjct: 332 REGQIFTLYSYLNHNCEPN------VRYELDGKTGLKLFARKDIKKGEELLTTYVNPLHG 385
Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
+ R++ LL ++ F CDC+RC E
Sbjct: 386 VTLRRRELLVNWGFLCDCERCNKE 409
>gi|409084007|gb|EKM84364.1| hypothetical protein AGABI1DRAFT_117750 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 429
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 71/347 (20%), Positives = 127/347 (36%), Gaps = 79/347 (22%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHS-------SS 65
+G+GLV+T +K G+++ ++ P + + L S C C S S
Sbjct: 97 KGKGLVATHEIKEGEVIWKEDPFITSPEWGILDLQMKSVACAFCTTPLTGSPLILSCAGS 156
Query: 66 SICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
++ C+ FC+ C T +A P +C + + A++PL + ++ +A
Sbjct: 157 NMASPCTARFCNRLCFTRSAKLH--PLLCP---------TKNPASIPLIQFVRESRWMAF 205
Query: 126 YNLAVVNPS--------------QFQILLAF---------------QGTVTDNDTSAAH- 155
Y L + ++++ A G + + H
Sbjct: 206 YALVLCTSRLLLESQSEDKSFNDNWEMIHALAELGMEERHKYSYNLSGQYEPDRETWRHA 265
Query: 156 ---YLSSLCPPPASTTIELTAALLAKDRLN-----AFGLMEPYIEGQDGRRS--VRAYG- 204
++ + P ++ + A + K N F ++ G GR S + A+G
Sbjct: 266 YELFIQAFKEPSSTADAKKLAKIFRKPHPNNFQADIFDYNTGFLRGL-GRMSLNIEAHGG 324
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
IY S NH C PN A + +I + D+ G E+ ++Y Y TR
Sbjct: 325 IYTLHSHLNHSCDPNVSIRHLDQTTALSRITVIAK--RDIKVGEELLITYVDPQLPYQTR 382
Query: 265 QKRLLYDYDFA-CDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
Q L + F C C RC E + V +ED +++M
Sbjct: 383 QNE-LKGWGFGRCTCSRCLEEV------------KTVKEEDSEKEMT 416
>gi|308812131|ref|XP_003083373.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116055253|emb|CAL57649.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 190
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 204 GIYL--KASFFNHDCLPNACRFDYV-DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
G+ L AS NH C P +C ++ DA A V D+ +G EI +SY P ++
Sbjct: 95 GVVLLGSASNINHSCEP-SCEVAFIHDARAH------VIATRDITKGEEITISYVPGSWP 147
Query: 261 YSTRQKRLLYDYDFACDCDRCK 282
R+K LL Y FACDC C+
Sbjct: 148 LRRRRKELLDRYGFACDCALCE 169
>gi|119192884|ref|XP_001247048.1| hypothetical protein CIMG_00819 [Coccidioides immitis RS]
gi|392863720|gb|EAS35512.2| hypothetical protein CIMG_00819 [Coccidioides immitis RS]
Length = 545
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 11/124 (8%)
Query: 176 LAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
L KD +G Y E + VR G + +S NH C+PNA + ++A
Sbjct: 202 LKKDSEEKWGKFGAYFEKNCIPCTLPSGQKVRVLGHW--SSKINHSCVPNAQQ-TLIEAN 258
Query: 230 AE--NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
E +T + +R ++ +G EI ++Y ++ D + R++ + + F C C+ C+ +
Sbjct: 259 TEGVKSTFLNIRACREISRGEEITVAYDEIHLDTTERKQHMKEHFGFECACEYCESQDRE 318
Query: 288 SDND 291
D D
Sbjct: 319 LDAD 322
>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
impatiens]
Length = 386
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
G+Y+ S NH C+PNA V+ NN ++++ I D+ EIC+SY +
Sbjct: 289 GLYIFQSSINHSCVPNA----VVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L Y F C C++C + N
Sbjct: 344 RHSRQKALSSLYLFHCYCNKCLSQVN 369
>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
Length = 380
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 57/260 (21%), Positives = 96/260 (36%), Gaps = 51/260 (19%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
EG+G GL + + +K G+++ P A F+ ++ C +C ++ S S C C
Sbjct: 12 EGKGNGLRALREIKPGEVIYSCEPFAFCVARDFLKTA--CQSCLKRG--ESLSRCSQCKT 67
Query: 74 A-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVV 131
A +C+ +C A C+ L L +P D +L AR + + +
Sbjct: 68 ARYCNVQCQKQAWPDHKRE--CKCLKHL-------QPRIPTDSVRLVARIIFKLLSQSES 118
Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----------------- 174
+ + + Q + D L LC TT+++ A
Sbjct: 119 DQEELYSIAEHQSHLADMSEEKTEGLKHLC-----TTLQVYLAEENCDLSRLPSGLDPVS 173
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
LLA+ N F + DG G+Y S NHDC PN C +
Sbjct: 174 LLARVTCNCFSI-------SDGELQDVGVGLYPSMSLLNHDCQPN-CIMMF------EGK 219
Query: 235 DIIVRMIHDVPQGREICLSY 254
+ +R + + E+ +SY
Sbjct: 220 RLTLRAVRVIRSAEELTISY 239
>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 378
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 90/239 (37%), Gaps = 52/239 (21%)
Query: 68 CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
C C+ V +C C Y C L R+ PDAA +L +R +I
Sbjct: 3 CSGCAFVRYCDRTCQKNGWIIH--KYECHNLKRIAPRKLPDAA------RLLSRIIIKLN 54
Query: 127 N--------LAVVNPSQFQILLA---FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
N +F+ L++ F+ ++ N + L P TA L
Sbjct: 55 KGGADEKSYYTKSNYRKFKDLMSRKLFKHFMSLNSVLHEYLQDQLIPN--------TAEL 106
Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
L +N F ++ D G+YL +S +H C PNA A T
Sbjct: 107 LM--VINTFNIL-------DSEMQSIGSGLYLASSIIDHSCSPNA-------VAVFKGTT 150
Query: 236 IIVRMIHDVP--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
I +R + D+P ++ +SY + RQ+ LL Y F C C +C +D+DN
Sbjct: 151 IFIRTLEDIPIMDWSKVFISYIELLNLPEIRQQELLSSYYFLCQCSKC------TDSDN 203
>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 480
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 8/124 (6%)
Query: 202 AYGIYLKAS-FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
A+GI+ AS FNH C PNA Y+ + ++V + ++ EI + Y
Sbjct: 210 AHGIFPVASRLFNHSCTPNAAA-KYIFGRDKPGVRMVVVALRNISHNEEITIPYLDPALP 268
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
+ RQ L +Y F C C C + + N N D ++E ++A D H
Sbjct: 269 FLERQLSLEQNYGFRCTCHVCSFGSQLARNPN------YTGDAPREEDILAIRDTLLNHV 322
Query: 321 DTNF 324
N
Sbjct: 323 GLNL 326
>gi|254572041|ref|XP_002493130.1| Zinc-finger protein of unknown function [Komagataella pastoris
GS115]
gi|238032928|emb|CAY70951.1| Zinc-finger protein of unknown function [Komagataella pastoris
GS115]
gi|328352852|emb|CCA39250.1| Potential protein lysine methyltransferase SET5 [Komagataella
pastoris CBS 7435]
Length = 464
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/306 (18%), Positives = 117/306 (38%), Gaps = 34/306 (11%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLY---SALPFINSSSFCHNCFRKTMHSSSSI--- 67
E G+GL + + + G+++ ++ P ++ L + C C + + ++
Sbjct: 122 EKSGKGLYAKRDISKGEVLWKEEPFIMVPPIELLRLVKHGKACTYCGKPMVSMTTDRLLK 181
Query: 68 ---CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD--------RQ 116
C C+ +CS C + + + R +NS A L L+
Sbjct: 182 GLDCNVCNELWCSIGCKRKDLLHPQAKHGVYGI-RTKGINSK--AWLALEDFCIENDWSS 238
Query: 117 LQARFLIAAYNLAVVNPS---QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
+ A +I A L + + QF + TV D +S + + +S+ + +L
Sbjct: 239 VYAAAIIYARELLDKSKNVYDQFHSMAKISQTVRDQASSLSEFEASIHEGKLAECHQLFN 298
Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAE 231
K ++ LME Y+ G ++ ++L + NH C N +D +
Sbjct: 299 KCFPKKPISYETLME-YV----GTYNINNVDNCLFLVYAHLNHSCDRNVS----IDLQPK 349
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
+ V + D+ G ++ +Y RQ+ L ++ F C+C RC +E+ +
Sbjct: 350 RADGLQVTALRDIAAGEQLTTTYVSPTMSLQDRQRELRLNWGFICECSRCVLESKQREKR 409
Query: 292 NDDENN 297
N+
Sbjct: 410 RSSRNS 415
>gi|154290093|ref|XP_001545647.1| hypothetical protein BC1G_15826 [Botryotinia fuckeliana B05.10]
Length = 545
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 108/285 (37%), Gaps = 36/285 (12%)
Query: 19 GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
G+V+T+ + A + VL D P L ++S+ C C + + C C +CS
Sbjct: 272 GVVATRDILAQETVLVD-PTLA----AVVDSADRCPACCGPFLDKIENSC--CKTLYCSS 324
Query: 79 KCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQARFLIAAYNLAVVNPSQ 135
CS A S H+ + L LL S +N + +L R L + V +P +
Sbjct: 325 SCSQNALDSYHTIVCGKDLDFLLGTESESLSNSRESSMGSKLFLRVLALSLKEDVASPLK 384
Query: 136 FQILLAFQGTVTDNDTSAA------HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
++ N H ++ P EL + A + + L
Sbjct: 385 TSLISRLTPAYNPNSPQLVVLNFKDHIIT-----PIRILRELGIDVFANSAYDTWVLHTI 439
Query: 190 YIEGQDGRRS-----VRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHD 243
Y Q+ + + G+ S FNH C PN R D + N+ + + D
Sbjct: 440 YCRLQNNKHGQTFDDICGTGVNPLYSMFNHSCDPNIDWRHD------DENSTVTMFAERD 493
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC---KVEA 285
+ G E+ +SY R+++L+ + C C +C K+EA
Sbjct: 494 IKNGEEMFISYIGKGKGLEERRRKLMPWFGMDCACHKCDEEKLEA 538
>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
Length = 441
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 41/89 (46%), Gaps = 15/89 (16%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--------GREICLSYF 255
GIYL AS +H C PN T IIVR + D+P ++I +SY
Sbjct: 177 GIYLGASVIDHSCKPNV-------IVVFEGTTIIVRTLTDLPSLDWSQASIDKDIRISYV 229
Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ R++ L Y F CDC+RCK E
Sbjct: 230 DLLNSNKDRREELHSSYYFWCDCERCKKE 258
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 57/289 (19%), Positives = 101/289 (34%), Gaps = 52/289 (17%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
G GL +T ++K G+ V P+ + L C CF ++ +
Sbjct: 20 GTGLFATSTIKMGKNVF-SIPVTFSTVLSTERLGDACSGCFSNLPFGANVMASVDMKLRA 78
Query: 68 CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
C C V +CS C + H P +A+ R++ L + A++
Sbjct: 79 CSGCKVVKYCSKGCQAENWALIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSKDADVQ 138
Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
D + +A N Q++ ++ + S +E
Sbjct: 139 SDMTMFLGLRSHLKEIASSNQEQYERIMLCAKAEKEYSGSG-------------LDLETI 185
Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
A LA+ +N+F + G ++ + Y+ NH C PNA +
Sbjct: 186 AEYLARIEVNSFTFTTSF--GDPLGLCIQPFACYM-----NHSCDPNA-------VVGFD 231
Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
I V+ + + ++ +SY Y + RQK+L Y F C C +C
Sbjct: 232 EGLITVKALRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280
>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
Length = 636
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%), Gaps = 6/80 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y + FNH C PN C + + ++ V + DVP G ++ +SY + +
Sbjct: 229 GVGLYPAGAMFNHSCRPN-CSWR-----TNRSGELCVVAVEDVPAGSQLFISYVDILQPW 282
Query: 262 STRQKRLLYDYDFACDCDRC 281
RQ L + F C C RC
Sbjct: 283 PVRQDLLRCHFFFECACPRC 302
>gi|451998059|gb|EMD90524.1| hypothetical protein COCHEDRAFT_1103990 [Cochliobolus
heterostrophus C5]
Length = 338
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 6/86 (6%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
+ ++ A+ FNH C PNAC F + + + + +++DV G+EI LSY + +D
Sbjct: 133 KGAAVFPHAARFNHSCNPNAC-FSWNPSINKET----IHVMNDVAAGKEITLSYCDMMHD 187
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R L + Y F CDC C + N
Sbjct: 188 KPSRAYELKH-YGFVCDCPACAGDEN 212
>gi|440801440|gb|ELR22460.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 545
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 17/111 (15%)
Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
GL PY + Y + S NH C PN C+ Y++ + +V + D+
Sbjct: 399 GLFPPY----------KGYAFFPLLSTLNHSCEPN-CQVAYLE-----DGQALVFALRDI 442
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
G E+ +SY + + RQ++L Y F C C RC +A + + E
Sbjct: 443 AAGEELSISYIYRHLPLAERQQQL-RSYGFVCACPRCAADAQAQEAPDKAE 492
>gi|395331093|gb|EJF63475.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 445
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 5/85 (5%)
Query: 202 AYGIY-LKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
A+G++ L + NH C+PNA C+ Y+ A +E + V + D+ +G E+ + Y
Sbjct: 191 AHGVFPLASRLLNHSCVPNAACK--YMLAPSEP-VRMEVVALRDINEGDEVTIPYLDPAL 247
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
+ TRQ L +Y F C C C E
Sbjct: 248 PFQTRQDALHVNYGFECGCRLCTFE 272
>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 668
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 52/95 (54%), Gaps = 11/95 (11%)
Query: 1 MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--R 58
+ + S+ VT+ E RGR L++T+++KAG +++ ++P + N + NC
Sbjct: 214 LKSCSDAVTLQFDEKRGRHLIATKNIKAGSVLIVETPFAFST-----NKEALGRNCLHCH 268
Query: 59 KTMHSSSSI---CPSC-SVAFCSPKCSTAAASSSH 89
T+ SS+S+ C C +V+FCS KC + A H
Sbjct: 269 ITLMSSNSVKIPCYYCQTVSFCSEKCRSKAWQIYH 303
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 46/110 (41%), Gaps = 12/110 (10%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
NA+ ++E + + R G IY S NH C PN R Y ++VR
Sbjct: 452 NAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI------VVVR 505
Query: 240 MIHDVPQGREICLSYFPVNYDYS--TRQKRLLYDYDFACDCDRCKVEANW 287
+ V +G EI Y P + + +R + L Y F C CD C NW
Sbjct: 506 TLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACI--QNW 553
>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
Length = 824
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
+G Y +FNH C+PN YVD +N + R + QG E+ L+Y+ +
Sbjct: 533 GHGFYPLGCYFNHSCVPNVTW--YVD----DNGMLAFRATRTISQGEELFLTYYDMFSTL 586
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWS 288
R+ LL Y F C C +C+ ++ S
Sbjct: 587 DDRRPHLLDQYRFFCQCPQCEFQSKLS 613
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 59/122 (48%), Gaps = 9/122 (7%)
Query: 8 VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSA-LPFINSSSFCHNCFRKTMHS--- 63
V++ EG+GR +++ +++ G +VLR SP Y+A + I+ +C +CF K ++
Sbjct: 242 VSIEWQEGKGRCVIANNNIEKGSLVLRVSP---YTASIEDIHLHKYCGSCFNKIINKEDR 298
Query: 64 SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
C C +CST + + C + + N PD +N D +L +R L+
Sbjct: 299 QKYKCKECKKFLFCEECSTDPVAIQYHHEEC-PIYQYFNRYHPD-SNETRDIRLMSRVLL 356
Query: 124 AA 125
A
Sbjct: 357 NA 358
>gi|189209509|ref|XP_001941087.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977180|gb|EDU43806.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 659
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N FG + G++ R+ + G+++ A++ NH CL N + D +I+R
Sbjct: 493 NGFGPGSQF--GEESARNA-STGLWIHAAYINHSCLANTAKEFAGDL-------LIIRAT 542
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
D+ G EI Y + DY TRQ L + F C C C+ E D++ +EV
Sbjct: 543 RDIKAGEEIFHPY-DASLDYETRQGFLDRTWGFRCFCRLCETE---------DKDGKEVR 592
Query: 302 D 302
D
Sbjct: 593 D 593
>gi|221484463|gb|EEE22759.1| hypothetical protein TGGT1_035070 [Toxoplasma gondii GT1]
Length = 859
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
Y IY SF +H CLPNA F + + ++R D+ G E+ L+Y +
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRTRADIAAGDEVTLTYLSEDDLM 218
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
R+K L DF C C+RC ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 111/283 (39%), Gaps = 51/283 (18%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSC-SV 73
+GR L +T+S G ++ +P+LL LP + +S C +C R +C C +
Sbjct: 15 KGRALHATKSFPPGAVIFSFTPLLL---LPTVACLTSVCSHCLRP---GEPRLCSRCHAA 68
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
A+C C AA + HS C+ALT+ + + LP R LI A +
Sbjct: 69 AYCDATCQAAAWKAVHS-RECKALTKTIR-DEERRRMLPTP----TRALIQALLWGKIRD 122
Query: 134 SQFQILLAFQGTVTDNDTSAAHY------------LSSLCPPPASTTIELTAALLAKDRL 181
L +G V + + S L + + A +L K +
Sbjct: 123 G----LADLEGHVLEKRAEGDEWRDIEMMAVAGCAFSGL--GRGDQDVRMAAEMLCKIQN 176
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVR 239
N+F Q + G+YL+ + NH C+PNA ++ N +I
Sbjct: 177 NSF---------QRFDSELGVAGLYLEPTLAMANHSCIPNAS-VQFIG----RNALLIAE 222
Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
+ + G EI ++Y D +++ L Y F C C RC+
Sbjct: 223 --NPIRAGDEIEIAY-TFYTDPLPKRREALAHYKFTCQCLRCR 262
>gi|237837951|ref|XP_002368273.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
gi|211965937|gb|EEB01133.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
Length = 859
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
Y IY SF +H CLPNA F + + ++R D+ G E+ L+Y +
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRTRADIAAGDEVTLTYLSEDDLM 218
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
R+K L DF C C+RC ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248
>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
Length = 432
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 99/281 (35%), Gaps = 43/281 (15%)
Query: 16 RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
+GRGL +++ G+++ S S L S C CFR+ + C C AF
Sbjct: 16 KGRGLRVSRAFSVGELLF--SCPAYSSVLSVTERGSCCDLCFRRK--EGLARCGKCKKAF 71
Query: 76 -CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
C+ C A + C A+ P + +L AR L A +
Sbjct: 72 YCNTSCQKADWAMHRLE--CSAMVAFGENWCPSETS-----RLVARIL-AKKKMQKGRCD 123
Query: 135 QFQILL--AFQGTVTDNDTSA-----------AHYLSSLCPPPASTTIELTAALLAKDRL 181
+ILL Q + D D + S P+ + L ++
Sbjct: 124 SEKILLIGEMQSHLEDEDNEKKERTELDIAGLGRFFSKHLEVPSHKDL---LTLFSQVAC 180
Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
N F + +D S +Y + NH CLP+ N T VR +
Sbjct: 181 NGFTI-------EDDELSHMGTAVYPDVALINHSCLPSV-------IVTFNGTLAQVRAV 226
Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
D+ G E+ +SY + Y R RL Y F C+C C+
Sbjct: 227 QDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTCNCLECQ 267
>gi|255080018|ref|XP_002503589.1| predicted protein [Micromonas sp. RCC299]
gi|226518856|gb|ACO64847.1| predicted protein [Micromonas sp. RCC299]
Length = 898
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 57/130 (43%), Gaps = 27/130 (20%)
Query: 212 FNHDCLPNACRFDY----VDAAAENNTD-------------------IIVRMIHDVPQGR 248
NH CLPN C+ + VD A E + + + + + D+ G
Sbjct: 517 VNHSCLPN-CQVEITRGTVDDAEEGDDEKTGEKTKKPERGADCGSLRVSLLALRDISPGE 575
Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQ 308
E+ + Y PV D ++R+ L + F C C RC++E+ + + + + D+ Q+E
Sbjct: 576 ELTVGYVPVTQDIASRRADLAARHGFVCSCARCELESGADGANPSVADMKRLADQAQEE- 634
Query: 309 MVASDDDAEV 318
A +DAE
Sbjct: 635 --ARYEDAEA 642
>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
magnipapillata]
Length = 1044
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 127/295 (43%), Gaps = 50/295 (16%)
Query: 3 AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTM 61
+ SE + + +GR ++++ +KAG++++ + P +++L ++ C++C T
Sbjct: 225 STSEALKIVSTNSKGRHAIASRDIKAGEVIIIEKP---FASLCLPECYNTHCYHCL--TR 279
Query: 62 HSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
+ C CS V +CS C + H + C+ L L+N D L
Sbjct: 280 FKINYPCRLCSTVNYCSISCEKESWEKFHC-FECEYLGVLIN----DDVGL--------- 325
Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDT---SAAHYLSSLCPPPASTTIELTAALLA 177
A ++ +LL+F+ + +D S+ Y +S+ + I + A LL
Sbjct: 326 ---AHLAFKIITNVGISMLLSFKENNSFDDLKPYSSTDY-NSIFSLIENLKI-VCAHLLK 380
Query: 178 KDRL---NAFGLMEPYIEGQDGRRS-VRAYG--IYLKASFFNHDCLPNACRFDYVDAAAE 231
++ NA + E ++ + + S ++ G +Y S NH C P+ R Y D
Sbjct: 381 HIQMLPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTC-- 438
Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYST-----RQKRLLYDYDFACDCDRC 281
++R I + +G EI +Y + Y+ RQ L+ Y FAC C+ C
Sbjct: 439 -----VLRAIKHIKEGSEIVDNY---GFLYAVESKVIRQSHLMEQYYFACQCEAC 485
>gi|389623721|ref|XP_003709514.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
gi|351649043|gb|EHA56902.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
Length = 440
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 7/79 (8%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
G++ S NH C PN+ +FD + + HD+ G EI +SY Y
Sbjct: 252 GLFPTISRINHACGPNSQVKFDPATLSQKAFAS------HDIEAGEEITISYAEFGMTYQ 305
Query: 263 TRQKRLLYDYDFACDCDRC 281
+RQ LL+ + F C C C
Sbjct: 306 SRQSTLLHRWGFKCTCALC 324
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 102/284 (35%), Gaps = 43/284 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + + G+++ S + L S C CF + S C C A
Sbjct: 19 GKGRGLRALRRYAVGELLF--SCPAYTAVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLP---LDRQ------LQARFLI 123
F C+ +C C A+ N N + L L +Q Q+ L+
Sbjct: 75 FYCNVECQKE--DWPMHKLECAAMCAFGQNWNPSETVRLTARILAKQKIHPERTQSEKLL 132
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALLAKDR 180
A S L + + ND +A H+ S P A+ + L A+
Sbjct: 133 AVKEFE----SHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVV-----LFAQVN 183
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F + +D S I+ + NH C PN Y AE VR
Sbjct: 184 CNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRA 229
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ ++ G E+ SY + Y R RL Y F CDC C ++
Sbjct: 230 VKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTMK 273
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 43/112 (38%), Gaps = 18/112 (16%)
Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNT 234
AK LN+F P + G YL A+ NH C DY +
Sbjct: 234 AKLELNSFNFHSPLAD---------RLGFYLHPYAALINHSC-------DYNSVVGSDGD 277
Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
+ V+ + + +G EI +SY Y RQ LL Y F C C +C N
Sbjct: 278 ALYVKALRPIRKGEEILVSYIDATNPYKIRQTELLERYYFKCRCPKCGKGTN 329
>gi|146087252|ref|XP_001465771.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134069871|emb|CAM68198.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 857
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 80/209 (38%), Gaps = 44/209 (21%)
Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
L A + N+FG+ D + +G+Y +AS+FNH C+PN CR +
Sbjct: 657 LCAAAQCNSFGVY-------DTQDHCIGFGMYPEASYFNHSCVPNLCRV--MHHGGRIAA 707
Query: 235 DIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
+R I PQ + + Y V + + R++ LL Y F C C+RC +A
Sbjct: 708 FYALRAI--APQ-EPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKAEGPQIAIA 764
Query: 294 DENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP---- 349
N E + + + S P A+ +C + G L PLPP
Sbjct: 765 AAGNRESVSGPRSGCVEVS------------PAAFENPLLLCAECSIHGYLRPLPPPFFS 812
Query: 350 ---------SDATPSTVM------ECNVC 363
+TPS M +C VC
Sbjct: 813 SSPVKSDSAPKSTPSWTMQEVEMRQCTVC 841
>gi|301607796|ref|XP_002933488.1| PREDICTED: SET and MYND domain-containing protein 5-like [Xenopus
(Silurana) tropicalis]
Length = 141
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 9/77 (11%)
Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLL 269
NH C+PNA +A+ +N + + + D+ G EIC+SY + + +RQK L
Sbjct: 38 NHSCVPNA------EASFPDNFILHLTSLEDIQPGEEICISYLDCCQRDRSWHSRQKILR 91
Query: 270 YDYDFACDCDRCKVEAN 286
+Y F C C +C V+A+
Sbjct: 92 ENYLFMCSCPKCLVQAD 108
>gi|157134709|ref|XP_001656403.1| hypothetical protein AaeL_AAEL000436 [Aedes aegypti]
gi|108884280|gb|EAT48505.1| AAEL000436-PA [Aedes aegypti]
Length = 392
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
+Y S NH C PN CRF + +N + + D+ G EIC+SY +
Sbjct: 290 ALYSYQSKVNHSCSPNVECRFPH------SNNVLALTATRDIKVGEEICISYLDECALER 343
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L +Y F C C++C + N
Sbjct: 344 SRHSRQKMLSENYLFQCQCEKCDSQVN 370
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 48/120 (40%), Gaps = 10/120 (8%)
Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
P P +++L LL K +L G D G+Y S NH C PNA
Sbjct: 121 PTP---SLQLMLRLLIKRKLQNAGFSCNAHTICDSELRPMGTGLYPVISIINHSCFPNA- 176
Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+VR + + +G E+ +SY + ++R+K L Y F C C RC
Sbjct: 177 ------VLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKKSLKEQYFFDCKCLRC 230
>gi|259483433|tpe|CBF78818.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 341
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 51/123 (41%), Gaps = 22/123 (17%)
Query: 174 ALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
AL+ K+ N+FGL P R A +Y +AS NH C PN
Sbjct: 209 ALICKEEANSFGLYPRETGVFPPPNPPVSRGEQFAAAVYPRASIANHSCCPNII------ 262
Query: 228 AAAENNTDIIVRMI----HDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRC 281
+ D + RM+ D+ G E C+SYF + S R++ L + F C C RC
Sbjct: 263 ----HKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRHLQGLFRFKCGCPRC 318
Query: 282 KVE 284
E
Sbjct: 319 LEE 321
>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
Length = 436
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 102/284 (35%), Gaps = 43/284 (15%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + + G+++ S + L S C CF + S C C A
Sbjct: 19 GKGRGLRALRRYAVGELLF--SCPAYTAVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLP---LDRQ------LQARFLI 123
F C+ +C C A+ N N + L L +Q Q+ L+
Sbjct: 75 FYCNVECQKE--DWPMHKLECAAMCAFGQNWNPSETVRLTARILAKQKIHPERTQSEKLL 132
Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALLAKDR 180
A S L + + ND +A H+ S P A+ + L A+
Sbjct: 133 AVKEFE----SHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVV-----LFAQVN 183
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F + +D S I+ + NH C PN Y AE VR
Sbjct: 184 CNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRA 229
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
+ ++ G E+ SY + Y R RL Y F CDC C ++
Sbjct: 230 VKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTMK 273
>gi|396485474|ref|XP_003842180.1| hypothetical protein LEMA_P079400.1 [Leptosphaeria maculans JN3]
gi|312218756|emb|CBX98701.1| hypothetical protein LEMA_P079400.1 [Leptosphaeria maculans JN3]
Length = 332
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 6/90 (6%)
Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
R ++ A+ FNH C PNAC F + +A T + +++DV G EI LSY + +D
Sbjct: 131 RGAAVFPHAARFNHSCNPNAC-FTW-NAGIGKET---IHIMNDVKAGEEITLSYCDMLHD 185
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
R L + Y F CDC C + + D
Sbjct: 186 KKLRSWELKH-YGFRCDCRACAEDEDVEDT 214
>gi|398015614|ref|XP_003860996.1| hypothetical protein, conserved [Leishmania donovani]
gi|322499220|emb|CBZ34291.1| hypothetical protein, conserved [Leishmania donovani]
Length = 857
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 70/182 (38%), Gaps = 37/182 (20%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYD 260
+G+Y +AS+FNH C+PN CR + +R I PQ + + Y V +
Sbjct: 677 GFGMYPEASYFNHSCVPNLCRV--MHHGGRIAAFYALRAI--APQ-EPLTICYTDVEQLN 731
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
+ R++ LL Y F C C+RC +A N E + + + S
Sbjct: 732 SAERRRNLLGTYRFFCMCERCSGKAEGPQIAIAAAGNRESVSGPRSGCVEVS-------- 783
Query: 321 DTNFPHAYFFMRYMCDRDNCWGTLAPLPP-------------SDATPSTVM------ECN 361
P A+ +C + G L PLPP +TPS M +C
Sbjct: 784 ----PAAFENPLLLCAECSIHGYLRPLPPPFFSSSPVKSDSAPKSTPSWTMQEVEMRQCT 839
Query: 362 VC 363
VC
Sbjct: 840 VC 841
>gi|294655829|ref|XP_458021.2| DEHA2C07854p [Debaryomyces hansenii CBS767]
gi|199430638|emb|CAG86084.2| DEHA2C07854p [Debaryomyces hansenii CBS767]
Length = 350
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 10/118 (8%)
Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
T EL +L + NAFG+ I + + + IY ASFFNH C PN +
Sbjct: 234 TSELVRTILGTNLTNAFGIWSQPINDAE-EKDYFGFAIYPSASFFNHSCDPNLVK----- 287
Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
E +T + R + D+ E+ +SY N + RQ++ L ++ F C C +C E
Sbjct: 288 -TREGDT-LYFRALKDIDPNTELFISYGNYSNENVQIRQEQ-LKEWFFNCLCTKCITE 342
>gi|157869726|ref|XP_001683414.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126479|emb|CAJ04438.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 856
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYD 260
+GIY +AS+FNH C+PN CR + + + + + + Y V +
Sbjct: 676 GFGIYPEASYFNHSCVPNLCRVMH-----HGSRIAAFYALRAIAAQEPLTICYTDVEQLN 730
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+ R++ LL Y F C C+RC +A
Sbjct: 731 SAERRRNLLSTYRFFCMCERCSGKAE 756
>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
Length = 391
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
+Y + S NH C PNA ++N + +R D+ G EIC+SY +
Sbjct: 290 ALYARQSKINHSCAPNA-----ETVFPKSNHMLALRATRDIQPGEEICISYLDECNLQRS 344
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+RQK L Y F C C++C+ + N
Sbjct: 345 RHSRQKTLKDYYLFICQCEKCESQIN 370
>gi|67900474|ref|XP_680493.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
gi|40742081|gb|EAA61271.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
Length = 678
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 51/123 (41%), Gaps = 22/123 (17%)
Query: 174 ALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
AL+ K+ N+FGL P R A +Y +AS NH C PN
Sbjct: 546 ALICKEEANSFGLYPRETGVFPPPNPPVSRGEQFAAAVYPRASIANHSCCPNII------ 599
Query: 228 AAAENNTDIIVRMI----HDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRC 281
+ D + RM+ D+ G E C+SYF + S R++ L + F C C RC
Sbjct: 600 ----HKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRHLQGLFRFKCGCPRC 655
Query: 282 KVE 284
E
Sbjct: 656 LEE 658
>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
norvegicus]
Length = 239
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 7/95 (7%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
G+Y S NH C PN C + N +++R + ++ G E+ + Y +
Sbjct: 5 GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57
Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
R+K+L Y F CDC RC + +D DE
Sbjct: 58 EERRKQLRDQYCFECDCIRCPTQDKDADMLTGDEQ 92
>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
Length = 423
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 101/285 (35%), Gaps = 54/285 (18%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL +++ G+++ P Y L C CF T + C C A
Sbjct: 17 GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72
Query: 75 F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
F C+ C + CQA+ P +R +P
Sbjct: 73 FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETKAQKER----------------SP 114
Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
S+ +LL + + D D + AH Y L P + L ++
Sbjct: 115 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 170
Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
N F + +D S I+ + NH C PN Y AE VR
Sbjct: 171 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 216
Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
+ D+ G+EI SY + Y + R +RL Y F+CDC C ++
Sbjct: 217 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 261
>gi|393215651|gb|EJD01142.1| MAS20-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 555
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 72/343 (20%), Positives = 131/343 (38%), Gaps = 86/343 (25%)
Query: 14 EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS--- 70
+ R LV+++S KAG ++ +++PI+ + S C +C R+ + S ++I P
Sbjct: 190 QARSHLLVASKSFKAGDVIYKETPIVASLDPDLVASGRHCGHCLRE-VESDAAIKPESDR 248
Query: 71 CSVAFCSPKCSTAAASSSH-------SPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
+CS KC T + + S SP + +L A P + A F
Sbjct: 249 LGSIYCSEKCQTQSGTQSQDMLFSDKSPLPAEVAA---DLPQQSVAERPKTQDALAEFCR 305
Query: 122 -------LIAAYNLAVVNPSQF---------------QILLAFQGTVTDNDTSAAHYLSS 159
L+ A +A+ ++ + ++ Q TV D+ Y+
Sbjct: 306 QKGTSGLLLVARLVAIQIVAELAKALPQAAHMKEQLPEFSISEQYTVYDH-IERLRYIDV 364
Query: 160 LCPPPASTTI-ELTAALL----------------AKDRLNAFGLMEPYIEGQDGR----- 197
P T+ EL A L K NA G+ + EG+D +
Sbjct: 365 TLPDDEYKTLRELLLATLPFLEDVHSDERHAMFRGKVAYNAIGIY--FGEGRDDKPTTSE 422
Query: 198 --RSVRAYGI--------YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
R+ A G+ YL +S+ H C P+ +E +++ + D+ +G
Sbjct: 423 LERTRIAAGVSKQIGSGLYLVSSYIGHSCAPSV-----RPVFSEGTSELHLLAERDIEEG 477
Query: 248 REICLSYFPVNYD--------YSTRQKRLLYDYDFACDCDRCK 282
E+ ++Y V+ ++ R+ L + FAC C +C+
Sbjct: 478 EELTMAYVDVSQRSEETPVEAFTRRRSELTQGWKFACACVKCE 520
>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
Length = 415
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 8/86 (9%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
G+++ S NH C+PNA + ENN + V + ++ G EIC+SY
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERS 360
Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
+R K L +Y F C C +C EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
+G+GL +TQ ++ G+ + + P++ L++AL + C K ++ +
Sbjct: 31 AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90
Query: 68 --------------------CPSCSVAFCSPKCSTAAASSSH 89
CP C V +CS +C AAA H
Sbjct: 91 PGQVLPHPELCTVRKDLHQNCPHCQVVYCSAECRLAAAEQYH 132
>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 514
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 68/296 (22%), Positives = 114/296 (38%), Gaps = 42/296 (14%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS- 63
+E + + + GR +V+ + L+ G+ ++ + P ++ P + C +C+ S
Sbjct: 17 TEKYKILKNDKVGRFMVANKDLQPGEEIVTEMPFVVG---PKAFTYPLCLSCYAPWPPSL 73
Query: 64 -SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL---LNLNSP-DAANLPLDRQLQ 118
+C C CS +C H Y CQ + N+ + + AN QL+
Sbjct: 74 TEKPLCSRCGWPVCSTECEN---QPQHKDYECQIFVQANEKFNVQAALEEANENGVPQLE 130
Query: 119 ----ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
R L+ + NP ++ + + AH+ S +E
Sbjct: 131 CITPLRLLLESEK----NPERWNNEVKNMEAHNKIRSQKAHWKSD-----QVNVVEYLRN 181
Query: 175 LLAKDRLN------AFGLMEPYIEGQDGRRSVRAYG---IYLKASFFNHDCLPNACRFDY 225
L DR + A G++E I + R S R YG +Y + NH C+ N C
Sbjct: 182 QLKLDRFSEELIQTACGILE--INTFEVRTS-RGYGARALYPTVALMNHSCVSNTCH--- 235
Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + + I +R VP E+ SY Y R++ LL FAC C RC
Sbjct: 236 --SISPTDYRIRLRTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC 289
>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 478
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 59/283 (20%), Positives = 105/283 (37%), Gaps = 31/283 (10%)
Query: 7 TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSF--CHNCFRKTMHS 63
T++ GRG+V+T+ G ++ D P+++ A+ + S+ C C+ M
Sbjct: 28 TISTGSATVAGRGVVATRDYSVGDVIFVDVPLIVSPRAMGMDGTGSWPVCAVCYAVMMTP 87
Query: 64 SSSICPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
+ CP C + C +C+ YV Q + + + + +
Sbjct: 88 TG--CPGGCRLPVCGRQCADRPEHRDECRYVRQLRPKTKSDGQTWSVGI-------YNAI 138
Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL----TAALLAK 178
A L++ + L Q TD T L + ++EL ++
Sbjct: 139 TAVRGLSIRDGKYRYFLDVLQKKSTDKLTFEVEELKR------NISMELDKEDEQIMVTT 192
Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
++ E + + S++ G+Y A+F NH C+PN +I
Sbjct: 193 CKIMDANCFETIVNRSNKLTSLK--GLYPVAAFMNHCCVPNTMH------NFNEKLQMIA 244
Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
+ + +G+EI SY + S RQ LL F C C RC
Sbjct: 245 KASLPIYKGQEITTSYTYSIWPTSLRQNHLLTSKQFICTCSRC 287
>gi|221505568|gb|EEE31213.1| hypothetical protein TGVEG_082100 [Toxoplasma gondii VEG]
Length = 858
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
Y IY SF +H CLPNA F + + ++R D+ G E+ L+Y +
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRSRADIAAGDEVTLTYLSEDDLM 218
Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
R+K L DF C C+RC ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248
>gi|451993775|gb|EMD86247.1| hypothetical protein COCHEDRAFT_1198196 [Cochliobolus
heterostrophus C5]
Length = 739
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 36/79 (45%), Gaps = 7/79 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI++KAS+ NH CL N R D +IVR D+ G EI Y DY+
Sbjct: 502 GIWIKASYINHSCLGNVRRSFIGDM-------MIVRACKDLEAGTEIVFPYTAPRGDYTL 554
Query: 264 RQKRLLYDYDFACDCDRCK 282
K+ + F C C C+
Sbjct: 555 GIKQKFKSWGFICSCALCE 573
>gi|294887209|ref|XP_002772010.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239875918|gb|EER03826.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 74
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
NH C PNA +A+ + + V+ + V G E+ SY N RQ +L Y
Sbjct: 2 NHSCSPNA------EASFRGSRCLRVKSLKPVHSGEEVFQSYIDENLPLVERQSKLRQAY 55
Query: 273 DFACDCDRCKVEA 285
FAC C RC+ EA
Sbjct: 56 GFACRCGRCRTEA 68
>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
Length = 448
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 42/278 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS--SSSICPS-CSV 73
GRG+ +T+ + AG+ + R+ +L+ + C C+R S S+++CP+ C +
Sbjct: 53 GRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSAALCPAGCGL 112
Query: 74 AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
P CS S+ H C+ + L S + +A +++ ++
Sbjct: 113 ----PVCSECRDSTRHD-LECKLFRKWKPLESQRI-------EPRALRILSVVRCFFLDE 160
Query: 134 SQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
+ ++L A Q + AA P ++ + NAF
Sbjct: 161 ASRKLLYAMQANMDRYYMQEVQRAADCFEHF--PREQDMLDYFYRTICAFNTNAF----- 213
Query: 190 YIEGQDGRRSVRAYGIYLKASF-----FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
+ R +V + + ++A F NH C PNA EN I+V +
Sbjct: 214 -----ESRSNVDGHEVLVRALFPLAGLLNHQCTPNAAHH------FENGETIVVCATERI 262
Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
P G EI +SY + + R+ L F C C RC+
Sbjct: 263 PAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQ 300
>gi|449686608|ref|XP_002163562.2| PREDICTED: SET and MYND domain-containing protein 5-like [Hydra
magnipapillata]
Length = 395
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 86/395 (21%), Positives = 143/395 (36%), Gaps = 107/395 (27%)
Query: 5 SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL----------YSALPF-INSSSFC 53
S+ V V E +G+GL + ++K G +L + P++L Y A + + S
Sbjct: 9 SDNVYVKETLSKGKGLFAASAIKKGDTILTEKPLVLCQFSWNRQYNYVACDYCMRSLETA 68
Query: 54 HNCFRKT---------MHSSSSI----------CPSCSVAFCSPKCSTAAASSSH----- 89
N R+ H S+ CP+C + FCS +C + A H
Sbjct: 69 QNMARRLAADYTLELPYHEQCSLSVQRVNSIYKCPNCCIPFCSKECYSEAYEKYHKSLCL 128
Query: 90 -------SP-YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQ---- 137
SP Y + + L+ P+ A++ L ++ A +IA + S+FQ
Sbjct: 129 HPDEMSESPVYRIEEAWKQLHY-PPETASVMLIVRILA--MIAQSQCPEMVMSEFQQFFS 185
Query: 138 --------ILLAFQGTVTDNDTSAAHYLSSLCPPPAST----TIELTAALLAKDRLNAFG 185
I G H L P TI +L A +N G
Sbjct: 186 KTKNEEHKITHKLLGNKFKEQIILMHSLLQEIVPTGELSKLLTISGVQSLFALIGMNGQG 245
Query: 186 L----MEPYIEGQDG-------RRSVRAY---------------------GIYLKASFFN 213
+ + Y+ D R + A+ G++ S N
Sbjct: 246 IGTSSLSEYVHNIDAKVMSDNEREQIDAFIDQLYLHMEKESGSFLNCEGSGLFKMQSRCN 305
Query: 214 HDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLL 269
H C PNA F Y NN+ +++ D+ + EIC+ Y + +R+K L
Sbjct: 306 HSCYPNAEATFPY------NNSTLVLVATEDITKDEEICVCYLDECQRSRSRHSRRKLLR 359
Query: 270 YDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
+Y F C C C+ E D++++ +++ M ED
Sbjct: 360 ENYLFECTCSLCESEV---DHESETSSDDSEMSED 391
>gi|350403683|ref|XP_003486875.1| PREDICTED: hypothetical protein LOC100747756 [Bombus impatiens]
Length = 814
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 120/301 (39%), Gaps = 63/301 (20%)
Query: 10 VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFR---KTMHSS 64
VA E GR L + ++L AG+++LR+ PI A+ I SS C C R K
Sbjct: 17 VAYSEKLGRYLQAAKNLTAGEVILREEPI----AVGPITSSKDPACFACLRLLPKIKKEL 72
Query: 65 SSICPSCSVAFCSPKCSTA--AASSSHSPYVCQALTRLLNLNSPDAANL----------- 111
+C C++A P CSTA S H+P C+ R +L + N+
Sbjct: 73 QYVCSKCNIA---PLCSTACEERSKHHTPDECEIFKRNKDLLINNTENIIGVLLPLRLWL 129
Query: 112 ------PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--LCP- 162
L +Q+++ + A+ N S ++ D + + + + S L P
Sbjct: 130 LRQRDAELWKQVES---MEAHTDKRRNTSVWK----------DRELNVVNVIKSLHLVPD 176
Query: 163 --PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
P S ++L +L +N F L P G + G+Y++AS HDC N
Sbjct: 177 GDPSVSELLQLLCGILD---VNCFELRSP-----GGLDGLLLRGLYVEASLMAHDCRGNT 228
Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
A++N + V + +G EI +Y R++ L F C+C
Sbjct: 229 ------HLTADDNFQLTVYASLPIKEGDEIYFNYTSSLLGTLGRREHLRGGKYFECECPL 282
Query: 281 C 281
C
Sbjct: 283 C 283
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 98/274 (35%), Gaps = 29/274 (10%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + Q + G ++ P Y L + C CF T S C C A
Sbjct: 16 GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71
Query: 75 F-CSPKCSTAAASSSH---SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
F C+ +C SP V + A + +++ + LAV
Sbjct: 72 FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131
Query: 131 VN-PSQFQILLAFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLAKDRLNAFGLM 187
S L + + +D +A H+ S L P + + L A+ N F +
Sbjct: 132 KEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDNDSLV----VLFAQVNCNGFTI- 186
Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
+D S I+ + NH C PN Y AE VR + ++ G
Sbjct: 187 ------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRAVQEIKPG 233
Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
E+ SY + Y R RL Y F C+C C
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|302817792|ref|XP_002990571.1| hypothetical protein SELMODRAFT_428981 [Selaginella moellendorffii]
gi|300141739|gb|EFJ08448.1| hypothetical protein SELMODRAFT_428981 [Selaginella moellendorffii]
Length = 574
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 8/124 (6%)
Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
+G + G + RR + G+++ +F NH C P+A + V +A+ +VR D
Sbjct: 419 YGYKFERVIGLERRREFLS-GLWILTAFANHSCCPSATQ-KAVGSAS------LVRAARD 470
Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
+ G E+ L+Y + R ++ + F C C+RC E+ + D N+ M++
Sbjct: 471 LKAGDEVTLAYLDPFMPWDVRSRQTKSRWGFECACERCHFESKLHSENRDLVNSGIKMEK 530
Query: 304 DQDE 307
+E
Sbjct: 531 MLEE 534
>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 570
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 17/101 (16%)
Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
NH C PNA N+ I +R + + + +I +SY D +RQ +L Y
Sbjct: 221 NHSCKPNAI-------IVSNSRSISLRALRPIKKNEQIFISYIDPTEDLPSRQSKLKERY 273
Query: 273 DFACDCDRCKVEANWSDNDNDDEN----NEEVMDEDQDEQM 309
F C CD C+ N+N E EE+ +E D++M
Sbjct: 274 FFTCKCDSCE------KNENPYEAFLTYQEEIGEETADKRM 308
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 98/284 (34%), Gaps = 49/284 (17%)
Query: 15 GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
G+GRGL + Q + G ++ P Y L + C CF T S C C A
Sbjct: 16 GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71
Query: 75 F-CSPKCST-------------AAASSSHSPYVCQALT-RLLNLNSPDAANLPLDRQLQA 119
F C+ +C + +P LT R+L P ++ L
Sbjct: 72 FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131
Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLA 177
+ S L + + +D +A H+ S L P + + L A
Sbjct: 132 KEF----------ESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDNDSLV----VLFA 177
Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
+ N F + +D S I+ + NH C PN Y AE
Sbjct: 178 QVNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------ 223
Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
VR + ++ G E+ SY + Y R RL Y F C+C C
Sbjct: 224 VRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|401406610|ref|XP_003882754.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325117170|emb|CBZ52722.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 535
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 2/101 (1%)
Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
+R G+Y + NH C PN C ++T + V + V G E+ +SY
Sbjct: 437 IRGGGVYTLHACVNHSCEPN-CAVSSWGPEGGDST-LTVTTVKAVEAGEELTISYVDEAL 494
Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
R++ L + FAC C RC E+ +D E
Sbjct: 495 PVRRRRQLLETTFGFACTCPRCMRESASDPTTGEDAREPET 535
>gi|409040050|gb|EKM49538.1| hypothetical protein PHACADRAFT_214100 [Phanerochaete carnosa
HHB-10118-sp]
Length = 461
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
A+G++ L + FNH C+PN C YV + E IV + D+ G E+ + Y
Sbjct: 201 AHGVFPLASRLFNHSCVPN-CASKYVITSTEMMGMEIV-ALRDIEFGDELTIPYLDPALP 258
Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
+ RQ L Y F C+C C +
Sbjct: 259 FDIRQNTLQESYGFTCNCSLCNFQ 282
>gi|320032992|gb|EFW14942.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 358
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ +A+ NH C NA + N + + + D+ +G EI +SY ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
RQ L + F C C+ C + A +N D EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224
>gi|195353810|ref|XP_002043396.1| GM16483 [Drosophila sechellia]
gi|194127519|gb|EDW49562.1| GM16483 [Drosophila sechellia]
Length = 486
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 107/289 (37%), Gaps = 45/289 (15%)
Query: 17 GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-----CHNCFRKTMHSSSSICPSC 71
GR L TQ + AGQIV + P+++ ++ C C+ + C C
Sbjct: 33 GRYLKVTQDIAAGQIVFIEEPLVVGPKWYLSDADKEASIVPCVGCYAPSRLGKHQ-CRRC 91
Query: 72 SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
CS C H P C L+ L SP A+ R L F A +
Sbjct: 92 RWPVCSAGCE-------HEPMECSVLS--LGSGSPTRAD---TRSLNDYFRGDALLVLKC 139
Query: 132 ------NPSQFQILLAFQGTVTDNDTSAAH---------YLSS--LCPPPASTTIELTAA 174
+P+++ LL Q + + H YL LC + LT
Sbjct: 140 LLLQRQSPTKWSALLEMQSHEEERKGTELHEEAEKRVVSYLQKRFLCRLKQTNPSLLTDC 199
Query: 175 LLAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
+ L+ G++E + V G++ +A H C PN C F + +N
Sbjct: 200 --GPEMLHRLCGIIETNFMVIELPSGVELSGLFRQACMMEHACQPN-CDFQF-----DNK 251
Query: 234 TD-IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
T + VR D+ +G + ++Y + + RQ L F+C C+RC
Sbjct: 252 TQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLRLTKHFSCHCNRC 300
>gi|46117750|ref|XP_384827.1| hypothetical protein FG04651.1 [Gibberella zeae PH-1]
Length = 812
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDI-IVRMIHDVPQGREICLSYFPVNYDYS 262
G++L A+ NH C+ N CR ++ DI IVR D+P G E+ Y P D
Sbjct: 567 GVWLLAARINHSCVGN-CRRSFIG-------DIQIVRAARDIPAGTELTFPYCPTG-DSE 617
Query: 263 TRQ--KRLLYDYDFACDCDRCK 282
T Q + L + F CDC+ CK
Sbjct: 618 TYQDVQNKLAKWGFTCDCELCK 639
>gi|303322581|ref|XP_003071282.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
gi|240110984|gb|EER29137.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
Length = 367
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ +A+ NH C NA + N + + + D+ +G EI +SY ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
RQ L + F C C+ C + A +N D EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224
>gi|119190077|ref|XP_001245645.1| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
gi|392868546|gb|EAS34345.2| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
Length = 367
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
GI+ +A+ NH C NA + N + + + D+ +G EI +SY ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
RQ L + F C C+ C + A +N D EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224
>gi|403366787|gb|EJY83200.1| TPR repeat-containing protein [Oxytricha trifallax]
Length = 760
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 7/86 (8%)
Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
+++ +SFFNHDC N R+ D + V D+ +G EI Y P+ Y
Sbjct: 556 ALWVISSFFNHDCYGNCSRYSIGDV-------LFVVANRDIQEGEEITQQYMPLMCTYEE 608
Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
R K ++F C C C+ + S+
Sbjct: 609 RVKTTELAWEFRCQCSSCQTYKSLSE 634
>gi|403158231|ref|XP_003307547.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163733|gb|EFP74541.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 552
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 8/82 (9%)
Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS-- 262
IYL AS FNH C PN F + + +R + G+E+ +SY D +
Sbjct: 382 IYLNASLFNHACRPNLFGFLNYETQL-----LYLRAFEPISMGQELTISYLSQEMDATDL 436
Query: 263 -TRQKRLLYDYDFACDCDRCKV 283
+R+++L Y F C C C++
Sbjct: 437 ESRRQKLQGSYGFLCTCSHCQM 458
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.133 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,981,338,084
Number of Sequences: 23463169
Number of extensions: 249785594
Number of successful extensions: 1247467
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 1665
Number of HSP's that attempted gapping in prelim test: 1243080
Number of HSP's gapped (non-prelim): 3942
length of query: 381
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 237
effective length of database: 8,980,499,031
effective search space: 2128378270347
effective search space used: 2128378270347
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)