BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011673
(480 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|296085170|emb|CBI28665.3| unnamed protein product [Vitis vinifera]
Length = 565
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/491 (69%), Positives = 394/491 (80%), Gaps = 16/491 (3%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
M + EAKL+ FLQWLQ+N+VELRGC+IKY D +KGFGIF +N+ SDG+ LVVPLDLAIT
Sbjct: 1 MASTEEAKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYANDASDGIPLVVPLDLAIT 60
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
PMRVLQDP +GPECRAMFE+GEVDDR LMILFLTVERLRKNSSWKPYLDMLPTTFG PLW
Sbjct: 61 PMRVLQDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLW 120
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------AN 171
F DDE +ELKGT+++RATELQK+ L +LYDDKVKDLVKKLL+LDGDS+ AN
Sbjct: 121 FIDDEFIELKGTSVHRATELQKKQLQSLYDDKVKDLVKKLLILDGDSKGEVHFEDFLWAN 180
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKY--DSINNSAELSNDHNSRGELINGLNDIKNEA 229
SIFWTRALNIPLP SYVFPQ QE+ N + I +S + D S G L++G+++ +
Sbjct: 181 SIFWTRALNIPLPRSYVFPQIQEEQNSCIPNIIKDSGAFT-DQISSGNLVSGMDEKSTDV 239
Query: 230 QRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMY 289
SQVN TS +S Q E LW+EGLVPGIDFCNHDLKAAATWEVD TGL TGVP SMY
Sbjct: 240 HGFESQVNRGTS--SSMQEEILWVEGLVPGIDFCNHDLKAAATWEVDNTGLKTGVPLSMY 297
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDS 349
LLSVE+S H +KEISISYGNKGNEELLYLYGFVIDNNPDDYLM+HYP E ++P S+S
Sbjct: 298 LLSVEQSPCHMQKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPMELFKNVPFSES 357
Query: 350 KALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYL 409
K LLE QKA++RCLL K+LL+ GFF A K+ N + ++ ++SWSGQR+ PSYL
Sbjct: 358 KGQLLEAQKAEMRCLLHKTLLDRGFFPASTLKNEQNGKSTD-HQVCNYSWSGQRKTPSYL 416
Query: 410 NKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
NKLVFPE FLTALRTI+M+EDE+S+VSSLLEEL SGG RQP D+E RAAVWE CGDSGA
Sbjct: 417 NKLVFPEAFLTALRTISMEEDELSRVSSLLEELAESGG-RQPLDSETRAAVWEACGDSGA 475
Query: 470 LQLLVDLLQAK 480
LQ+LVDLL K
Sbjct: 476 LQVLVDLLNVK 486
>gi|225430662|ref|XP_002268533.1| PREDICTED: uncharacterized protein LOC100267311 [Vitis vinifera]
Length = 561
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/491 (69%), Positives = 394/491 (80%), Gaps = 16/491 (3%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
M + EAKL+ FLQWLQ+N+VELRGC+IKY D +KGFGIF +N+ SDG+ LVVPLDLAIT
Sbjct: 1 MASTEEAKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYANDASDGIPLVVPLDLAIT 60
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
PMRVLQDP +GPECRAMFE+GEVDDR LMILFLTVERLRKNSSWKPYLDMLPTTFG PLW
Sbjct: 61 PMRVLQDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLW 120
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------AN 171
F DDE +ELKGT+++RATELQK+ L +LYDDKVKDLVKKLL+LDGDS+ AN
Sbjct: 121 FIDDEFIELKGTSVHRATELQKKQLQSLYDDKVKDLVKKLLILDGDSKGEVHFEDFLWAN 180
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKY--DSINNSAELSNDHNSRGELINGLNDIKNEA 229
SIFWTRALNIPLP SYVFPQ QE+ N + I +S + D S G L++G+++ +
Sbjct: 181 SIFWTRALNIPLPRSYVFPQIQEEQNSCIPNIIKDSGAFT-DQISSGNLVSGMDEKSTDV 239
Query: 230 QRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMY 289
SQVN TS +S Q E LW+EGLVPGIDFCNHDLKAAATWEVD TGL TGVP SMY
Sbjct: 240 HGFESQVNRGTS--SSMQEEILWVEGLVPGIDFCNHDLKAAATWEVDNTGLKTGVPLSMY 297
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDS 349
LLSVE+S H +KEISISYGNKGNEELLYLYGFVIDNNPDDYLM+HYP E ++P S+S
Sbjct: 298 LLSVEQSPCHMQKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPMELFKNVPFSES 357
Query: 350 KALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYL 409
K LLE QKA++RCLL K+LL+ GFF A K+ N + ++ ++SWSGQR+ PSYL
Sbjct: 358 KGQLLEAQKAEMRCLLHKTLLDRGFFPASTLKNEQNGKSTD-HQVCNYSWSGQRKTPSYL 416
Query: 410 NKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
NKLVFPE FLTALRTI+M+EDE+S+VSSLLEEL SGG RQP D+E RAAVWE CGDSGA
Sbjct: 417 NKLVFPEAFLTALRTISMEEDELSRVSSLLEELAESGG-RQPLDSETRAAVWEACGDSGA 475
Query: 470 LQLLVDLLQAK 480
LQ+LVDLL K
Sbjct: 476 LQVLVDLLNVK 486
>gi|224130176|ref|XP_002320771.1| predicted protein [Populus trichocarpa]
gi|222861544|gb|EEE99086.1| predicted protein [Populus trichocarpa]
Length = 551
Score = 665 bits (1717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/496 (67%), Positives = 379/496 (76%), Gaps = 51/496 (10%)
Query: 16 LQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECR 75
+QVNKVELRGC IKY ++KGFG+FSSN+ SDGVLLVVPLDLAITPMRVLQDPLIGPECR
Sbjct: 1 IQVNKVELRGCNIKYCGQNKGFGVFSSNDVSDGVLLVVPLDLAITPMRVLQDPLIGPECR 60
Query: 76 AMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLY 135
+MFE+GEVDDRFLMILFL +ERLR NSSWKPYLDMLP TFGNPLWFTDDELLELKGTTLY
Sbjct: 61 SMFEEGEVDDRFLMILFLMLERLRNNSSWKPYLDMLPKTFGNPLWFTDDELLELKGTTLY 120
Query: 136 RATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------ANSIFWTRALNIPLPHS 186
RATELQ++ LL+LY+DKVK LV+KLL+LDGD E ANS+FWTRALNIPLP S
Sbjct: 121 RATELQRKRLLSLYEDKVKGLVQKLLILDGDLESEVCFEDFLWANSVFWTRALNIPLPRS 180
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
YVFPQ QED + S+N + +S +++ LI+G S+V G
Sbjct: 181 YVFPQVQEDQDSQSSLNIDSGVS---HTKALLISG------------SKVPGVDGQFD-- 223
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI 306
ET+W+EGLVPGIDFCNHDLKA ATWEVDGTG+ TGVP SMYLLS E++ F EKEI+I
Sbjct: 224 --ETVWVEGLVPGIDFCNHDLKAVATWEVDGTGMTTGVPHSMYLLSAEKTPFQMEKEITI 281
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYLM----------------------IHYPAEAIHSI 344
SYGNKGNEELLYLYGFVIDNNPD+YLM +HYP EAI ++
Sbjct: 282 SYGNKGNEELLYLYGFVIDNNPDEYLMVMPLFGFCNSDVVLLGQYFLLDVHYPVEAIQNV 341
Query: 345 PLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRR 404
P SDSK LLE QKA++RCLLPK LL HGFF AG + +N K + D+I SFSWSGQRR
Sbjct: 342 PFSDSKMQLLEAQKAEMRCLLPKRLLAHGFFPAGTTSNDDN-GKGKADKICSFSWSGQRR 400
Query: 405 MPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETC 464
MPSY NKLVFPE FLT LRTIAMQEDE+ K SS LEELVGS G RQP+D EVR AVWE C
Sbjct: 401 MPSYANKLVFPEEFLTTLRTIAMQEDELLKASSFLEELVGSEGVRQPTDTEVRTAVWEAC 460
Query: 465 GDSGALQLLVDLLQAK 480
GDSGALQLL DLLQ K
Sbjct: 461 GDSGALQLLFDLLQTK 476
>gi|297848312|ref|XP_002892037.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297337879|gb|EFH68296.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 548
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/486 (66%), Positives = 371/486 (76%), Gaps = 32/486 (6%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVL 65
EAKLE FL WLQVN EL GC IKYSD KGFGIF+S E SD VLLVVPLDLAITPMRVL
Sbjct: 9 EAKLERFLDWLQVNGAELGGCNIKYSDSRKGFGIFASTETSDDVLLVVPLDLAITPMRVL 68
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDE 125
QDPL+GPEC+ MF+DG+VDDRFL+ILFLT ERLR NSSWKPYLDMLPT FGNPLWF+DD+
Sbjct: 69 QDPLLGPECQKMFQDGQVDDRFLIILFLTFERLRINSSWKPYLDMLPTRFGNPLWFSDDD 128
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------ANSIFWT 176
+LEL GT LY ATELQK+NLL+LY DKV+ LVK LL+LDGDS+ ANS+FW+
Sbjct: 129 ILELNGTNLYHATELQKKNLLSLYHDKVEVLVKNLLILDGDSQSKVSFEHFLWANSVFWS 188
Query: 177 RALNIPLPHSYVFPQNQEDLNKYDSINNSAELS--NDHNSRGELINGLNDIKNEAQRVNS 234
RALNIPLPHS+VFPQ+Q+D+ + S ++S E + N + +G L N +++
Sbjct: 189 RALNIPLPHSFVFPQSQDDIGECPSTSHSPETAPVNSNEEKGNLACNYNFMRS------- 241
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
G+T+W+EGLVPGIDFCNHDLK ATWEVDGTG ++ VPFSMYLLS
Sbjct: 242 -------------GDTIWVEGLVPGIDFCNHDLKPMATWEVDGTGSVSRVPFSMYLLSAV 288
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLL 354
KEISISYGNKGNEELLYLYGFVIDNNPDDYLM+HYP EAI IP SDSK LL
Sbjct: 289 AQRLIPNKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPVEAIPIIPFSDSKGQLL 348
Query: 355 EEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVF 414
E Q AQLRCLLPKS+L HGFF +D K E +R +FSWSGQR+MPSY+NKLVF
Sbjct: 349 EAQNAQLRCLLPKSVLNHGFFPQTTSTIRESDEK-ETERSCNFSWSGQRKMPSYMNKLVF 407
Query: 415 PENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGALQLLV 474
E+F+T LRTIAMQE+EI+KVS++LEELV S QPS+ EVR AVWE CGDSGALQLLV
Sbjct: 408 REDFMTGLRTIAMQEEEINKVSAMLEELVESRPGEQPSETEVRMAVWEACGDSGALQLLV 467
Query: 475 DLLQAK 480
DLL AK
Sbjct: 468 DLLNAK 473
>gi|79316289|ref|NP_001030933.1| SET domain-containing protein [Arabidopsis thaliana]
gi|63003834|gb|AAY25446.1| At1g01920 [Arabidopsis thaliana]
gi|332189233|gb|AEE27354.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 547
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 324/486 (66%), Positives = 373/486 (76%), Gaps = 31/486 (6%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIF--SSNEFSDGVLLVVPLDLAITPMR 63
EAKLE FL WLQVN ELRGC IKYSD KGFGIF +S + SD VLLVVPLDLAITPMR
Sbjct: 7 EAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMR 66
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
VLQDPL+GPEC+ MFE G+VDDRFLMILFLT+ERLR NSSWKPYLDMLPT FGNPLWF+D
Sbjct: 67 VLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSD 126
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------ANSIF 174
D++LELKGT LY ATELQK+ LL+LY DKV+ LV KLL+LDGDSE ANS+F
Sbjct: 127 DDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSVF 186
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNS 234
W+RALNIPLPHS+VFPQ+Q+D + S + S E + +N NE + + +
Sbjct: 187 WSRALNIPLPHSFVFPQSQDDTGECTSTSESPETAP--------VNS-----NEEKEIQA 233
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
Q + + G+T+W+EGLVPGIDFCNHDLK ATWEVDG G ++ VPFSMYLLSV
Sbjct: 234 QPAPSVGS-----GDTIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVA 288
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLL 354
+ +KEISISYGNKGNEELLYLYGFVIDNNPDDYLM+HYP EAI SIP SDSK LL
Sbjct: 289 QRPI-PKKEISISYGNKGNEELLYLYGFVIDNNPDDYLMVHYPVEAIPSIPFSDSKGQLL 347
Query: 355 EEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVF 414
E Q AQLRCLLPKS+L HGFF +D K E R +FSWSG+R+MP+Y+NKLVF
Sbjct: 348 EAQNAQLRCLLPKSVLNHGFFPRTTSVIRESDEK-ETVRSCNFSWSGKRKMPTYMNKLVF 406
Query: 415 PENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGALQLLV 474
PE+F+T LRTIAMQE+EI KVS++LEELV S QPS+ EVR AVWE CGDSGALQLLV
Sbjct: 407 PEDFMTGLRTIAMQEEEIYKVSAMLEELVESRQGEQPSETEVRMAVWEACGDSGALQLLV 466
Query: 475 DLLQAK 480
DLL +K
Sbjct: 467 DLLNSK 472
>gi|42561607|ref|NP_171694.3| SET domain-containing protein [Arabidopsis thaliana]
gi|332189232|gb|AEE27353.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 572
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 324/510 (63%), Positives = 371/510 (72%), Gaps = 54/510 (10%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIF--SSNEFSDGVLLVVPLDLAITPMR 63
EAKLE FL WLQVN ELRGC IKYSD KGFGIF +S + SD VLLVVPLDLAITPMR
Sbjct: 7 EAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMR 66
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
VLQDPL+GPEC+ MFE G+VDDRFLMILFLT+ERLR NSSWKPYLDMLPT FGNPLWF+D
Sbjct: 67 VLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSD 126
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------ANSIF 174
D++LELKGT LY ATELQK+ LL+LY DKV+ LV KLL+LDGDSE ANS+F
Sbjct: 127 DDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSVF 186
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELS--NDHNSRGELINGLNDIKNEAQRV 232
W+RALNIPLPHS+VFPQ+Q+D + S + S E + N + +G+
Sbjct: 187 WSRALNIPLPHSFVFPQSQDDTGECTSTSESPETAPVNSNEEKGK--------------- 231
Query: 233 NSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS 292
+ A + G+T+W+EGLVPGIDFCNHDLK ATWEVDG G ++ VPFSMYLLS
Sbjct: 232 --SLTSAQPAPSVGSGDTIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLS 289
Query: 293 VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI------------------ 334
V + +KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI
Sbjct: 290 VAQRPI-PKKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIKEMLVNFVLTSVVTFNNG 348
Query: 335 ----HYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLE 390
HYP EAI SIP SDSK LLE Q AQLRCLLPKS+L HGFF +D K E
Sbjct: 349 FIQVHYPVEAIPSIPFSDSKGQLLEAQNAQLRCLLPKSVLNHGFFPRTTSVIRESDEK-E 407
Query: 391 VDRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQ 450
R +FSWSG+R+MP+Y+NKLVFPE+F+T LRTIAMQE+EI KVS++LEELV S Q
Sbjct: 408 TVRSCNFSWSGKRKMPTYMNKLVFPEDFMTGLRTIAMQEEEIYKVSAMLEELVESRQGEQ 467
Query: 451 PSDAEVRAAVWETCGDSGALQLLVDLLQAK 480
PS+ EVR AVWE CGDSGALQLLVDLL +K
Sbjct: 468 PSETEVRMAVWEACGDSGALQLLVDLLNSK 497
>gi|449525577|ref|XP_004169793.1| PREDICTED: uncharacterized protein LOC101224730 [Cucumis sativus]
Length = 553
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 316/491 (64%), Positives = 373/491 (75%), Gaps = 35/491 (7%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
M S +A LE FLQWLQVN +LRGC IKYSD SKG G+FS+N SDGVLLVVPLDLAIT
Sbjct: 1 MANSDQANLELFLQWLQVNGADLRGCTIKYSDLSKGCGLFSANYASDGVLLVVPLDLAIT 60
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
PMRVLQDPL GPECRAM+E+GEVDDRFLMILFL VERLR+NSSW PYLD+LPT FGNPLW
Sbjct: 61 PMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVERLRENSSWNPYLDVLPTRFGNPLW 120
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------AN 171
FTDDELLELKGTTLYRATELQK +L +LY++KVK LV +LL L+G + R AN
Sbjct: 121 FTDDELLELKGTTLYRATELQKNSLQSLYENKVKKLVSRLLTLEGFTGREVSFEDFLWAN 180
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQR 231
SIFW RALNIP+PH YVFP+ QE + I +AE+S S+ +G
Sbjct: 181 SIFWARALNIPMPHDYVFPKIQEAVGSDSLIEETAEVSTSAVSKVHAASG---------- 230
Query: 232 VNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLL 291
T S++ ET+W+EGLVPG+DFCNHDLKA ATWEVDGTG TGVPFSMYLL
Sbjct: 231 ---------ETFGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGTGSTTGVPFSMYLL 281
Query: 292 S-VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
S + RSS +E+SISYGNKGNEELLYLYGFV++NNPDDYLM+HYP EAI + SDSK
Sbjct: 282 SAISRSS--GLEEVSISYGNKGNEELLYLYGFVMENNPDDYLMVHYPLEAIQNASSSDSK 339
Query: 351 ALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEV-DRISSFSWSGQRRMPSYL 409
LL QKA++RCLLP+ LL+HGF PK N ++ +R ++SWSGQR++PSYL
Sbjct: 340 LQLLGVQKAEMRCLLPRRLLDHGFHP---PKTSNVKENVDCSNRACNYSWSGQRKLPSYL 396
Query: 410 NKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
+KL+FPE FL+ALRTI+M EDE+ +VSSLL E+VG +R+P+D +V+AAVWE CGDSGA
Sbjct: 397 DKLIFPEKFLSALRTISMDEDELMQVSSLLAEIVGPEEDREPTDTDVQAAVWEACGDSGA 456
Query: 470 LQLLVDLLQAK 480
LQLLVDLLQ K
Sbjct: 457 LQLLVDLLQKK 467
>gi|449451195|ref|XP_004143347.1| PREDICTED: uncharacterized protein LOC101219815 [Cucumis sativus]
Length = 553
Score = 619 bits (1597), Expect = e-175, Method: Compositional matrix adjust.
Identities = 315/492 (64%), Positives = 373/492 (75%), Gaps = 37/492 (7%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
M S +A LE FLQWLQVN +LRGC IKYSD SKG G+FS+N SD VLLVVPLDLAIT
Sbjct: 1 MANSDQANLELFLQWLQVNGADLRGCTIKYSDLSKGCGLFSANYASDAVLLVVPLDLAIT 60
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
PMRVLQDPL GPECRAM+E+GEVDDRFLMILFL VERLR+NSSW PYLD+LPT FGNPLW
Sbjct: 61 PMRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVERLRENSSWNPYLDVLPTRFGNPLW 120
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------AN 171
FTD+ELLELKGTTLYRATELQK +L +LY++KVK LV +LL L+G + R AN
Sbjct: 121 FTDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVSRLLTLEGFTGREVSFEDFLWAN 180
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQR 231
SIFW RALNIP+PH YVFP+ QE + I +AE+S S+ +G
Sbjct: 181 SIFWARALNIPMPHDYVFPKIQEAVGSDSLIEETAEVSTSAVSKVHAASG---------- 230
Query: 232 VNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLL 291
T S++ ET+W+EGLVPG+DFCNHDLKA ATWEVDGTG TGVPFSMYLL
Sbjct: 231 ---------ETFGSSKQETVWVEGLVPGVDFCNHDLKATATWEVDGTGSTTGVPFSMYLL 281
Query: 292 S-VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
S + RSS +E+SISYGNKGNEELLYLYGFV++NNPDDYLM+HYP EAI + SDSK
Sbjct: 282 SAISRSS--GLEEVSISYGNKGNEELLYLYGFVMENNPDDYLMVHYPLEAIQNASSSDSK 339
Query: 351 ALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNN--DNKLEVDRISSFSWSGQRRMPSY 408
LL QKA++RCLLP+ LL+HGF HP +N +N +R ++SWSGQR++PSY
Sbjct: 340 LQLLGVQKAEMRCLLPRRLLDHGF----HPPKTSNVKENVDCSNRACNYSWSGQRKLPSY 395
Query: 409 LNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSG 468
L+KL+FPE FL+ALRTI+M EDE+ +VSSLL E+VG +R+P+D +V+AAVWE CGDSG
Sbjct: 396 LDKLIFPEKFLSALRTISMDEDELMQVSSLLAEIVGPEEDREPTDTDVQAAVWEACGDSG 455
Query: 469 ALQLLVDLLQAK 480
ALQLLVDLLQ K
Sbjct: 456 ALQLLVDLLQKK 467
>gi|357507175|ref|XP_003623876.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355498891|gb|AES80094.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 571
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 309/508 (60%), Positives = 373/508 (73%), Gaps = 57/508 (11%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSD------------------ 47
EAKLE FLQWLQ N VELRGC IKY D KGFGIF + SD
Sbjct: 5 EAKLETFLQWLQANGVELRGCNIKYCDSRKGFGIFCDKDVSDGNHLGPNSTDDAYIVRLE 64
Query: 48 ------------GVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTV 95
G+LLVVPL+L+ITPMRVLQDP +GPECRA+FE+G+VDDR LM+L LTV
Sbjct: 65 IRTRDFVVVCFSGILLVVPLELSITPMRVLQDPFLGPECRALFEEGDVDDRLLMMLLLTV 124
Query: 96 ERLRKNSSWKP---YLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDK 152
ERLRK+S WKP YLDMLPTTFGN LWF+++EL EL+GTTLYRATELQK++LL LY+ K
Sbjct: 125 ERLRKDSLWKPYVWYLDMLPTTFGNTLWFSEEELQELRGTTLYRATELQKKSLLNLYETK 184
Query: 153 VKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDH 212
VKD+VKKLL LD DSE+ANS+FW+RALN PLP SYVFP+ Q+ ++ S D
Sbjct: 185 VKDIVKKLLTLDSDSEKANSVFWSRALNTPLPRSYVFPEMQD-------VHQSCTPKADE 237
Query: 213 NSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAAT 272
N G + +D+ E +T ++ QG+T+W+EGLVPGIDFCNHDLK AT
Sbjct: 238 N--GSQVTKSDDLTKE------------TTHSTVQGDTVWVEGLVPGIDFCNHDLKPIAT 283
Query: 273 WEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYL 332
WEVDGTGL TGVP SMYLLS +S ++EISISYGNKGNEELLYLYGFVID N DDYL
Sbjct: 284 WEVDGTGLTTGVPVSMYLLSAAQSPLQIDREISISYGNKGNEELLYLYGFVIDGNTDDYL 343
Query: 333 MIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVD 392
M+HYP+EAI++I S+SK+ LLE QKA++RCLLPK+LL++GFF +G G N+ +
Sbjct: 344 MVHYPSEAINTISFSESKSQLLEVQKAEMRCLLPKTLLDNGFFPSGTENSGENNKS---N 400
Query: 393 RISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPS 452
++ +SWSGQR+ PSY++KLVFPE F+ LRTIAMQEDE+ KVSS+LEELVG GERQ S
Sbjct: 401 KVCKYSWSGQRKTPSYVDKLVFPEKFMATLRTIAMQEDELFKVSSMLEELVGPEGERQLS 460
Query: 453 DAEVRAAVWETCGDSGALQLLVDLLQAK 480
D +V++A+WE CGDSGALQLLVDLL K
Sbjct: 461 DIDVQSAIWEVCGDSGALQLLVDLLHVK 488
>gi|357125312|ref|XP_003564338.1| PREDICTED: uncharacterized protein LOC100842937 [Brachypodium
distachyon]
Length = 558
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 275/489 (56%), Positives = 353/489 (72%), Gaps = 31/489 (6%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNE---FSDGVLLVVPLDLAITPM 62
+ KL+ FLQWLQ N +LRGC I+ D +KGFG++S+ +DGV++VVPLDLA+TPM
Sbjct: 10 DGKLQSFLQWLQANGTDLRGCTIRACDRNKGFGVYSTGAEVCATDGVMMVVPLDLAVTPM 69
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
RVLQDPL+GP CRA+FE+G VDDR L++LFL ERLR S WKPYLDMLP+TFG+ +WF+
Sbjct: 70 RVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMAERLRPTSLWKPYLDMLPSTFGSSVWFS 129
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLD--GDSER--------ANS 172
DDEL EL+GTTL+RAT +Q+++L TL+DDKVK LV++LL +D G S ANS
Sbjct: 130 DDELAELEGTTLHRATVMQRKSLQTLFDDKVKGLVEELLRVDESGSSSEVQFEDFLWANS 189
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSR-GELINGLNDIKNEAQR 231
IFWTRALNIPLPHSYVFP + N+ +R GE++ + +
Sbjct: 190 IFWTRALNIPLPHSYVFPGS----------------VNEQQTRTGEVLGDSSITIQQETD 233
Query: 232 VNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLL 291
+ ++ N S E++W+EGLVPGIDFCNH++KA ATWEVD G TGVP SMYL+
Sbjct: 234 ITAKSNSGDENSESRNMESIWVEGLVPGIDFCNHNVKALATWEVDSVGDTTGVPASMYLM 293
Query: 292 SVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
SS + EI I+YGNKGNEELLYLYGFV+DNNPDDYLM+HYP EA+ + +D K
Sbjct: 294 LAGNSSVEAGTEILINYGNKGNEELLYLYGFVVDNNPDDYLMVHYPVEALRQVKSADIKM 353
Query: 352 LLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNK 411
L+E QKA+LRCLLP+SLL++GFF +D+K + SS+SWSGQR++PSYLNK
Sbjct: 354 KLIEMQKAELRCLLPRSLLDNGFFGNSTNSVEADDDKRNANHFSSYSWSGQRKVPSYLNK 413
Query: 412 LVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGALQ 471
++FP+ FL++LRTI+MQE E+ +V+SLLEE VGS + + SDAEVR+A WE CGD GAL
Sbjct: 414 IIFPQEFLSSLRTISMQEYELEQVASLLEE-VGSSEDSELSDAEVRSATWEVCGDHGALG 472
Query: 472 LLVDLLQAK 480
LLVDLL+ K
Sbjct: 473 LLVDLLKVK 481
>gi|115466126|ref|NP_001056662.1| Os06g0127200 [Oryza sativa Japonica Group]
gi|55295876|dbj|BAD67744.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|55296208|dbj|BAD67926.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113594702|dbj|BAF18576.1| Os06g0127200 [Oryza sativa Japonica Group]
gi|218197481|gb|EEC79908.1| hypothetical protein OsI_21454 [Oryza sativa Indica Group]
gi|222634879|gb|EEE65011.1| hypothetical protein OsJ_19962 [Oryza sativa Japonica Group]
Length = 557
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 267/493 (54%), Positives = 345/493 (69%), Gaps = 37/493 (7%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNE---FSDGVLLVVPLDLAI 59
+ +AKL+ FLQWLQ N +LRGC I+ +G+G+FS+ +D V++VVPLDLAI
Sbjct: 10 VPGDAKLDSFLQWLQANGADLRGCTIRRCGR-EGYGVFSTAAEAGATDEVVMVVPLDLAI 68
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
TPMRVLQDPL+GP CRA+FE+G VDDR L++LFL VERLR +S WKPYLDMLP+TFG+ +
Sbjct: 69 TPMRVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMVERLRPSSLWKPYLDMLPSTFGSSI 128
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------- 169
WFT+DEL EL+GTTL+RAT +Q+++L TL+D+KVK LV +LL +D
Sbjct: 129 WFTEDELAELEGTTLHRATVMQRKSLQTLFDNKVKGLVGELLNVDESGSSIEVRFEDFLW 188
Query: 170 ANSIFWTRALNIPLPHSYVFPQ--NQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKN 227
ANSIFWTRALNIPLP YVFP+ +++ N D +S+ LS + G I N N
Sbjct: 189 ANSIFWTRALNIPLPRFYVFPESLDEKRANIGDDCGDSS-LSAPQGT-GTAITAKNISGN 246
Query: 228 EAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFS 287
+ + S+ E++W+EGLVPGIDFCNH++KA ATWEVD G +TG P S
Sbjct: 247 DNPK-------------SSNTESIWVEGLVPGIDFCNHNVKALATWEVDSMGHVTGCPSS 293
Query: 288 MYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLS 347
MYL+ ++S +E EI I+YGNKGNEELLYLYGFVIDNNPDDYLMIHYP EA+ + +
Sbjct: 294 MYLVLADKSFVKAETEICINYGNKGNEELLYLYGFVIDNNPDDYLMIHYPVEALRQVQSA 353
Query: 348 DSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPS 407
D K LLE Q A+LRCLLP+SLLE+GFF G N +NK SS+SWSGQR++PS
Sbjct: 354 DIKMKLLEIQNAELRCLLPRSLLENGFF--GSCSGENKENKNNTSPFSSYSWSGQRKVPS 411
Query: 408 YLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDS 467
Y+ K+VF + F++ LRTIA+QE E+ +SLL E+ G + D E+R+A+WE GD+
Sbjct: 412 YIEKIVFSQEFISTLRTIALQEHELEHTASLLGEI----GSNEDRDDELRSAIWEVSGDN 467
Query: 468 GALQLLVDLLQAK 480
GAL LLVDLL+ K
Sbjct: 468 GALSLLVDLLRVK 480
>gi|212721730|ref|NP_001132096.1| uncharacterized protein LOC100193512 [Zea mays]
gi|194693412|gb|ACF80790.1| unknown [Zea mays]
gi|413942692|gb|AFW75341.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
gi|413942693|gb|AFW75342.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
Length = 549
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 257/478 (53%), Positives = 326/478 (68%), Gaps = 40/478 (8%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNE----FSDGVLLVVPLDLAITP 61
+ KLE FLQWLQ N +LR C I+ KG G+FS+ +DGV +VVPLDLAITP
Sbjct: 9 DVKLESFLQWLQSNGADLRSCTIRACG-GKGLGVFSTAAPEPGSNDGVAMVVPLDLAITP 67
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
MRVLQDPL+GP CRA+ EDG VDDR L++LFL ER R S WKPYLDMLP+TFG+ LWF
Sbjct: 68 MRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPYLDMLPSTFGSSLWF 127
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER----------AN 171
T++EL EL+GTTL+RAT +Q+++L + +D+KVK LV++LL +D + AN
Sbjct: 128 TEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESASSVEVLFEDFLWAN 187
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQR 231
SIFWTRALNIPLPHSYVF + D +R + ND E
Sbjct: 188 SIFWTRALNIPLPHSYVFLGS----------------CGDQQARAD-----NDAHQEID- 225
Query: 232 VNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLL 291
+ ++ A + E++W+EGLVPGIDFCNH++KA ATWEVD G TG+P SMYLL
Sbjct: 226 ITAKDCSADENSKPSNTESIWVEGLVPGIDFCNHNVKALATWEVDSVGNATGIPASMYLL 285
Query: 292 SVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
++S + EI I+YGNKGNEELLYLYGFV+DNNPDDYLM+HYP EA+ I +D K
Sbjct: 286 LADKSPAETGAEICINYGNKGNEELLYLYGFVVDNNPDDYLMVHYPLEALRQIQSADIKM 345
Query: 352 LLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNK 411
LLE QK +LRCLLP+SLL++GFF +D D+K SSFSWSGQR++PSYL K
Sbjct: 346 KLLEMQKGELRCLLPRSLLDNGFFGIRSSED--KDSKKNTGPFSSFSWSGQRKVPSYLPK 403
Query: 412 LVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
+VFP F++ LRTIAMQE E+ +V+SLL E VG +R+ S A +++A+WE CGD GA
Sbjct: 404 IVFPHEFMSTLRTIAMQEHELEQVASLLGE-VGYSEDRETSCAAIQSAIWEVCGDQGA 460
>gi|242094496|ref|XP_002437738.1| hypothetical protein SORBIDRAFT_10g001630 [Sorghum bicolor]
gi|241915961|gb|EER89105.1| hypothetical protein SORBIDRAFT_10g001630 [Sorghum bicolor]
Length = 478
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/477 (55%), Positives = 335/477 (70%), Gaps = 35/477 (7%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNE----FSDGVLLVVPLDLAITPMR 63
KLE FLQW Q N +LRGC I+ KG G+FS+ +DGV +VVPLDLAITPMR
Sbjct: 11 KLESFLQWFQANGADLRGCTIRACG-GKGLGVFSTAAPEPGANDGVAMVVPLDLAITPMR 69
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
VLQDPLIGP C A+FE+G VDDR L++LFL ER R S WKPYLDMLP+TFG+ LWFT+
Sbjct: 70 VLQDPLIGPRCHALFEEGRVDDRLLVMLFLMAERRRPGSLWKPYLDMLPSTFGSSLWFTE 129
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER-----------ANS 172
+EL EL+GTTL+RAT +Q+++L + +D+KVK LV++LL +D +SE ANS
Sbjct: 130 EELAELEGTTLHRATVIQRKSLQSSFDEKVKGLVEELLHVD-ESESSIEVLFEDFLWANS 188
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRV 232
IFWTRALNIPLPHSYVFP + D + I + + I+ V
Sbjct: 189 IFWTRALNIPLPHSYVFPGSCGD----EQIRTGNDACYSSPPAQQEID-----------V 233
Query: 233 NSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS 292
++ + A S+ E++W+EGLVPGIDFCNH++KA ATWEVD G TG+P SMYLL
Sbjct: 234 TAKDHSADDNSKSSNTESIWVEGLVPGIDFCNHNVKALATWEVDSVGNATGIPASMYLLL 293
Query: 293 VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKAL 352
++SS + EI I+YGNKGNEELLYLYGFV+DNNPDDYLM+HYP EA+ I +D+K
Sbjct: 294 ADKSSAEAGAEICINYGNKGNEELLYLYGFVVDNNPDDYLMVHYPLEALRQIQSADTKMQ 353
Query: 353 LLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKL 412
LLE QK +LRCLLP+ LL++GFF +D D+K SSFSWSGQR++PSYL+K
Sbjct: 354 LLEMQKGELRCLLPRRLLDNGFFGIHSSED--KDSKKNTSPFSSFSWSGQRKVPSYLHKN 411
Query: 413 VFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
VFP+ F++ LRTIAMQE E+ +V+SLL E VGS +R+ SDAE+++A+WE CGD GA
Sbjct: 412 VFPQEFMSTLRTIAMQEHELEQVASLLGE-VGSSEDRETSDAEIQSAIWEVCGDQGA 467
>gi|413942690|gb|AFW75339.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
Length = 571
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 256/500 (51%), Positives = 324/500 (64%), Gaps = 62/500 (12%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSS----------------------- 42
+ KLE FLQWLQ N +LR C I+ KG G+FS+
Sbjct: 9 DVKLESFLQWLQSNGADLRSCTIRACG-GKGLGVFSTAAPEPGSNDGNAPLLSPESTRLI 67
Query: 43 ---NEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR 99
GV +VVPLDLAITPMRVLQDPL+GP CRA+ EDG VDDR L++LFL ER R
Sbjct: 68 WTMRRRCAGVAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRR 127
Query: 100 KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
S WKPYLDMLP+TFG+ LWFT++EL EL+GTTL+RAT +Q+++L + +D+KVK LV++
Sbjct: 128 PGSLWKPYLDMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEE 187
Query: 160 LLVLDGDSER----------ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELS 209
LL +D + ANSIFWTRALNIPLPHSYVF +
Sbjct: 188 LLHVDESASSVEVLFEDFLWANSIFWTRALNIPLPHSYVFLGS----------------C 231
Query: 210 NDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKA 269
D +R + ND E + ++ A + E++W+EGLVPGIDFCNH++KA
Sbjct: 232 GDQQARAD-----NDAHQEID-ITAKDCSADENSKPSNTESIWVEGLVPGIDFCNHNVKA 285
Query: 270 AATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPD 329
ATWEVD G TG+P SMYLL ++S + EI I+YGNKGNEELLYLYGFV+DNNPD
Sbjct: 286 LATWEVDSVGNATGIPASMYLLLADKSPAETGAEICINYGNKGNEELLYLYGFVVDNNPD 345
Query: 330 DYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKL 389
DYLM+HYP EA+ I +D K LLE QK +LRCLLP+SLL++GFF +D D+K
Sbjct: 346 DYLMVHYPLEALRQIQSADIKMKLLEMQKGELRCLLPRSLLDNGFFGIRSSED--KDSKK 403
Query: 390 EVDRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGER 449
SSFSWSGQR++PSYL K+VFP F++ LRTIAMQE E+ +V+SLL E VG +R
Sbjct: 404 NTGPFSSFSWSGQRKVPSYLPKIVFPHEFMSTLRTIAMQEHELEQVASLLGE-VGYSEDR 462
Query: 450 QPSDAEVRAAVWETCGDSGA 469
+ S A +++A+WE CGD GA
Sbjct: 463 ETSCAAIQSAIWEVCGDQGA 482
>gi|255541436|ref|XP_002511782.1| conserved hypothetical protein [Ricinus communis]
gi|223548962|gb|EEF50451.1| conserved hypothetical protein [Ricinus communis]
Length = 446
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 235/350 (67%), Positives = 275/350 (78%), Gaps = 26/350 (7%)
Query: 140 LQKQNLLTLYDDKVKDLVKKLLVLDGDSER---------ANSIFWTRALNIPLPHSYVFP 190
LQK+ LL+LYDDKVK L+KKLL LDGDSE ANS+FW+RALNIPLPHSYVFP
Sbjct: 28 LQKKKLLSLYDDKVKGLMKKLLTLDGDSESEVSFEDFLWANSLFWSRALNIPLPHSYVFP 87
Query: 191 QNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGET 250
Q +ED + S +S ELS + NS G+LIN ++ +T TS+QGET
Sbjct: 88 QVEEDQENHCSTIDS-ELSYNDNSAGDLINEKDE---------------RTTCTSSQGET 131
Query: 251 LWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
+W+EGLVPGIDFCNHDLKAAATWEVDGTGL+TGVP SMYLLS E++ +EKEI ISYGN
Sbjct: 132 VWVEGLVPGIDFCNHDLKAAATWEVDGTGLVTGVPSSMYLLSAEQTPIKTEKEIFISYGN 191
Query: 311 KGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLL 370
KGNEELLYLYGFVIDNN DDYLM++YP EAI ++P SDSK LLE QKA++RCLLPK LL
Sbjct: 192 KGNEELLYLYGFVIDNNTDDYLMVNYPVEAIQNVPFSDSKMQLLEAQKAEMRCLLPKGLL 251
Query: 371 EHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQED 430
+HGFF G K+ +N K + D+ + SWSGQR PSY+NKLVFPE+FLT+LRT+AMQED
Sbjct: 252 DHGFFPVGTSKNDSN-FKCKTDQFGNCSWSGQRETPSYVNKLVFPEDFLTSLRTLAMQED 310
Query: 431 EISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGALQLLVDLLQAK 480
E+ KVSSLLEEL+GS GERQP+D+EVRAAVWE CGDSGALQLLVDLLQ K
Sbjct: 311 ELYKVSSLLEELIGSEGERQPTDSEVRAAVWEACGDSGALQLLVDLLQTK 360
>gi|326521260|dbj|BAJ96833.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 205/382 (53%), Positives = 264/382 (69%), Gaps = 31/382 (8%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSN---EFSDGVLLVVPLDLAITPM 62
+AKL FLQWLQ N LRGC I+ + GFG++++ +DGV++ VPLDLA+TPM
Sbjct: 12 DAKLRCFLQWLQANGAYLRGCTIRACGRT-GFGVYATAVDAGGADGVVMAVPLDLAVTPM 70
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
RVLQDPL+GP CRA+FE+G VDDR L++LFL ERLR S WKPYLD+LP+TFG+ +WF
Sbjct: 71 RVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMAERLRPTSLWKPYLDVLPSTFGSSVWFD 130
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER----------ANS 172
D+EL E++GTTL+RAT +Q+++L L+DDKVK LV++LL +DG ANS
Sbjct: 131 DEELAEVEGTTLHRATVMQRKSLQKLFDDKVKGLVEELLHVDGSGSSIEVRFEDFLWANS 190
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRV 232
IFWTRALNIPLPHSYVFP L NS GE+++G + + +
Sbjct: 191 IFWTRALNIPLPHSYVFP---------------GSLDEQQNSTGEVLSGSSLTNQQETDI 235
Query: 233 NSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS 292
++ E++W+EGLVPGIDFCNH++KA ATWEVD G +TG+P SMYL+
Sbjct: 236 TTKNCSGDKIPEYRNTESIWVEGLVPGIDFCNHNVKALATWEVDSEGNLTGIPASMYLML 295
Query: 293 VE--RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
+ +SS + EI I+YGNKGNEELLYLYGFV+DNNPDDYLM+HYP EA+ + +D K
Sbjct: 296 AQAGKSSVETGTEIYINYGNKGNEELLYLYGFVVDNNPDDYLMVHYPVEALRQLQSADIK 355
Query: 351 ALLLEEQKAQLRCLLPKSLLEH 372
L+E QKA+LRCLLP SLLE
Sbjct: 356 MKLIEMQKAELRCLLPTSLLER 377
>gi|8570443|gb|AAF76470.1|AC020622_4 Contains similarity to a hypothetical protein gi|6983878 from Oryza
Sativa BAC gb|AP001168 [Arabidopsis thaliana]
Length = 441
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 228/387 (58%), Positives = 253/387 (65%), Gaps = 80/387 (20%)
Query: 1 MEISTE-AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIF--SSNEFSDGVLLVVPLDL 57
M IS E AKLE FL WLQVN ELRGC IKYSD KGFGIF +S + SD VLLVVPLDL
Sbjct: 1 MAISEEEAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDL 60
Query: 58 AITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN 117
AITPMRVLQDPL+GPEC+ MFE G+VDDRFLMILFLT+ERLR NSSWKPYLDMLPT FGN
Sbjct: 61 AITPMRVLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGN 120
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTR 177
PLWF+DD++LELKGT LY AT DKV + L ANS+FW+R
Sbjct: 121 PLWFSDDDILELKGTNLYHAT------------DKVS--FEHFL-------WANSVFWSR 159
Query: 178 ALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
ALNIPLPHS+VFPQ+Q+D + S A+ A V S
Sbjct: 160 ALNIPLPHSFVFPQSQDDTGECTSTKIQAQ--------------------PAPSVGS--- 196
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS 297
G+T+W+EGLVPGIDFCNHDLK ATWEVDG G ++ VPFSMYLLSV +
Sbjct: 197 ----------GDTIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVAQRP 246
Query: 298 FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI----------------------H 335
+KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI H
Sbjct: 247 I-PKKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIKEMLVNFVLTSVVTFNNGFIQVH 305
Query: 336 YPAEAIHSIPLSDSKALLLEEQKAQLR 362
YP EAI SIP SDSK LLE Q + R
Sbjct: 306 YPVEAIPSIPFSDSKGQLLEAQLVESR 332
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/48 (64%), Positives = 34/48 (70%)
Query: 433 SKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGALQLLVDLLQAK 480
SK L +LV S QPS+ EVR AVWE CGDSGALQLLVDLL +K
Sbjct: 319 SKGQLLEAQLVESRQGEQPSETEVRMAVWEACGDSGALQLLVDLLNSK 366
>gi|302805649|ref|XP_002984575.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
gi|300147557|gb|EFJ14220.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
Length = 530
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 204/502 (40%), Positives = 290/502 (57%), Gaps = 70/502 (13%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG---------------VL 50
E +LE F +W Q + ++ RGC IK +++GFG+++ N+ + G VL
Sbjct: 2 EERLERFSRWSQEHGIQFRGCAIKRVSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVL 61
Query: 51 LVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLD 109
+V PLDLA+TP+ +++DP++G R M + E+DDR L+++FL +ER R +S W PYL+
Sbjct: 62 VVTPLDLALTPVTIVKDPVLGNVYREMLGN-EIDDRLLVMIFLIIERARGRASFWAPYLE 120
Query: 110 MLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD---LVKKLLVLDGD 166
MLP+ FG PLWF D+EL+EL GTTL+ AT+ Q++ L ++Y + LV L D
Sbjct: 121 MLPSGFGTPLWFEDEELMELDGTTLFEATKAQQRCLPSVYIGTLCCQLFLVSLYLFRPDD 180
Query: 167 SER-------ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELI 219
E AN IFWTRALNIP P S+V ++S E++ D +R
Sbjct: 181 RELEFQEFLWANCIFWTRALNIPCPASFV-------------TSSSPEVAKDDGNR---- 223
Query: 220 NGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGT- 278
V + + +S T+WIEGLVPGIDFCNH +A+ WE+DG+
Sbjct: 224 -----------LVIYVLPHPFISCSSKDVSTIWIEGLVPGIDFCNHTRRASGLWEIDGSD 272
Query: 279 GLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPA 338
G +GVP SMYL++ F E+ I+YG+KGNEELL+LYGFV ++N +DY+M+H+P
Sbjct: 273 GSTSGVPHSMYLIA--DVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNSNDYVMVHFPK 330
Query: 339 EAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFS 398
+ D K LL E L+ LLP SLL GF N D+K FS
Sbjct: 331 MFLDEDNTMDFKLQLLRELDLSLKWLLPSSLLASGFLRK------NPDDKATRTH-PGFS 383
Query: 399 WSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRA 458
WSG R+ PSYL+ LVFPE+ + +LR ++M E + V++LLE++ PS +++A
Sbjct: 384 WSGHRKAPSYLDCLVFPEDMVLSLRVLSMPETALHGVANLLEQI-----SHVPSKDDIQA 438
Query: 459 AVWETCGDSGALQLLVDLLQAK 480
AVWE CGD+ AL LLV +L AK
Sbjct: 439 AVWEVCGDAEALTLLVTILSAK 460
>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
Length = 523
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 201/503 (39%), Positives = 288/503 (57%), Gaps = 79/503 (15%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG---------------VL 50
E +LE F +W Q + ++ RGC IK +++GFG+++ N+ + G VL
Sbjct: 2 EERLERFSRWSQEHGIQFRGCAIKRGSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVL 61
Query: 51 LVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLD 109
+V PLDLA+TP+ +++DP++G R M + E+DDR L+++FL +ER R +S W PYL+
Sbjct: 62 VVTPLDLALTPVTIVKDPVLGNVYREMLGN-EIDDRLLVMIFLIIERARGRASFWAPYLE 120
Query: 110 MLPTTFGNPLWFTDDELLELKGTTLYRATELQK-------QNLLTLY----DDKVKDLVK 158
MLP+ FG PLWF D+EL+EL GTTL+ AT+ Q ++LY DD+ + +
Sbjct: 121 MLPSGFGTPLWFEDEELMELDGTTLFEATKAQVFFPSTFVSTCMSLYLFRPDDRELEFQE 180
Query: 159 KLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGEL 218
L AN IFWTRALNIP P S+V ++S E++ D +R
Sbjct: 181 FLW--------ANCIFWTRALNIPCPASFV-------------TSSSPEVAKDDGNR--- 216
Query: 219 INGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGT 278
V + + ++ T+WIEGLVPGIDFCNH +A+ WE+DG+
Sbjct: 217 ------------LVIYVLPHPFISCSAKDVSTIWIEGLVPGIDFCNHTRRASGLWEIDGS 264
Query: 279 -GLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYP 337
G +GVP SMYL++ F E+ I+YG+KGNEELL+LYGFV ++N +DY+M+H+P
Sbjct: 265 DGSTSGVPHSMYLIA--DVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNSNDYVMVHFP 322
Query: 338 AEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSF 397
+ D K LL E L+ LLP SLL GF N D+K F
Sbjct: 323 KMFLDEDNTMDFKLQLLRELDLSLQWLLPSSLLASGFLRK------NPDDKATRTH-PGF 375
Query: 398 SWSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVR 457
SWSG R+ PSYL+ LVFPE+ + +LR ++M E + V++LLE++ PS +++
Sbjct: 376 SWSGHRKAPSYLDCLVFPEDMVLSLRVLSMPETALHGVANLLEQI-----SHVPSKDDIQ 430
Query: 458 AAVWETCGDSGALQLLVDLLQAK 480
AAVWE CGD+ AL LLV +L AK
Sbjct: 431 AAVWEVCGDAEALTLLVTILSAK 453
>gi|194689920|gb|ACF79044.1| unknown [Zea mays]
gi|413942688|gb|AFW75337.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
gi|413942689|gb|AFW75338.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
Length = 392
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 159/300 (53%), Positives = 200/300 (66%), Gaps = 25/300 (8%)
Query: 170 ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEA 229
ANSIFWTRALNIPLPHSYVF + D +R + ND E
Sbjct: 29 ANSIFWTRALNIPLPHSYVFLGS----------------CGDQQARAD-----NDAHQEI 67
Query: 230 QRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMY 289
+ ++ A + E++W+EGLVPGIDFCNH++KA ATWEVD G TG+P SMY
Sbjct: 68 D-ITAKDCSADENSKPSNTESIWVEGLVPGIDFCNHNVKALATWEVDSVGNATGIPASMY 126
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDS 349
LL ++S + EI I+YGNKGNEELLYLYGFV+DNNPDDYLM+HYP EA+ I +D
Sbjct: 127 LLLADKSPAETGAEICINYGNKGNEELLYLYGFVVDNNPDDYLMVHYPLEALRQIQSADI 186
Query: 350 KALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYL 409
K LLE QK +LRCLLP+SLL++GFF +D D+K SSFSWSGQR++PSYL
Sbjct: 187 KMKLLEMQKGELRCLLPRSLLDNGFFGIRSSED--KDSKKNTGPFSSFSWSGQRKVPSYL 244
Query: 410 NKLVFPENFLTALRTIAMQEDEISKVSSLLEELVGSGGERQPSDAEVRAAVWETCGDSGA 469
K+VFP F++ LRTIAMQE E+ +V+SLL E VG +R+ S A +++A+WE CGD GA
Sbjct: 245 PKIVFPHEFMSTLRTIAMQEHELEQVASLLGE-VGYSEDRETSCAAIQSAIWEVCGDQGA 303
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 163/369 (44%), Positives = 222/369 (60%), Gaps = 63/369 (17%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSD--ESKGFGIFSSNEFSDGVLLVVPLDLAITPMR 63
+ +L+ F W+Q+N V+ R C+I+ S E+ GFG+F++ + + GVL+V PL LAITPM
Sbjct: 12 DPRLQRFRDWMQINGVQSRFCEIRPSSNGENAGFGLFATKDNAQGVLMVTPLLLAITPMT 71
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFT 122
VLQDP +G + E+GEVDDR L++LFL +ER R + S W PYL++LP FG PL F+
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFGTPLSFS 131
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
++EL ELKGT L++AT+ Q L L + VLD RANS+FWTRALNIP
Sbjct: 132 EEELSELKGTHLFQATQQQSTTGLIL----------RCPVLD----RANSVFWTRALNIP 177
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATST 242
PHS+ N+ ++ + + E S+ ++ +D+K +
Sbjct: 178 CPHSF---NNRFAVDLDSTTHKKPEESSAADT--------DDVKIPS------------- 213
Query: 243 LTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGT-GLITGVPFSMYLLSVERSSFHSE 301
++W+EGLVPGIDFCNHDLKA A WEVDG G +TGVP SMYL++ +
Sbjct: 214 -------SVWVEGLVPGIDFCNHDLKAVALWEVDGPEGSVTGVPNSMYLVTGLDVVISNG 266
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLM--------------IHYPAEAIHSIPLS 347
EI ISYGNK NEELLYLYGFV+ NPDDYLM I++P +A+ S
Sbjct: 267 SEIFISYGNKSNEELLYLYGFVLVENPDDYLMVRSTIGCCGGIAMQIYFPKQALEYSVCS 326
Query: 348 DSKALLLEE 356
D+K+ LLE+
Sbjct: 327 DTKSQLLEQ 335
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/46 (60%), Positives = 37/46 (80%), Gaps = 1/46 (2%)
Query: 436 SSLLEELVGSG-GERQPSDAEVRAAVWETCGDSGALQLLVDLLQAK 480
S LLE+L+ R P+DA+++AAVWETCG++GALQLLVDLL +K
Sbjct: 330 SQLLEQLISDPEAGRIPTDADIKAAVWETCGNAGALQLLVDLLTSK 375
>gi|238011186|gb|ACR36628.1| unknown [Zea mays]
Length = 200
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/185 (57%), Positives = 135/185 (72%), Gaps = 17/185 (9%)
Query: 48 GVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPY 107
GV +VVPLDLAITPMRVLQDPL+GP CRA+ EDG VDDR L++LFL ER R S WKPY
Sbjct: 7 GVAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPY 66
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDS 167
LDMLP+TFG+ LWFT++EL EL+GTTL+RAT +Q+++L + +D+KVK LV++LL +D +
Sbjct: 67 LDMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESA 126
Query: 168 ER----------ANSIFWTRALNIPLPHSYVF------PQNQEDLNKYDSINNSA-ELSN 210
ANSIFWTRALNIPLPHSYVF Q + D + + I+ +A + S
Sbjct: 127 SSVEVLFEDFLWANSIFWTRALNIPLPHSYVFLGSCGDQQARADNDAHQEIDITAKDCSA 186
Query: 211 DHNSR 215
D NS+
Sbjct: 187 DENSK 191
>gi|413942691|gb|AFW75340.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
Length = 210
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 98/195 (50%), Positives = 126/195 (64%), Gaps = 27/195 (13%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSS----------------------- 42
+ KLE FLQWLQ N +LR C I+ KG G+FS+
Sbjct: 9 DVKLESFLQWLQSNGADLRSCTIRACG-GKGLGVFSTAAPEPGSNDGNAPLLSPESTRLI 67
Query: 43 ---NEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR 99
GV +VVPLDLAITPMRVLQDPL+GP CRA+ EDG VDDR L++LFL ER R
Sbjct: 68 WTMRRRCAGVAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRR 127
Query: 100 KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
S WKPYLDMLP+TFG+ LWFT++EL EL+GTTL+RAT +Q+++L + +D+KVK LV++
Sbjct: 128 PGSLWKPYLDMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEE 187
Query: 160 LLVLDGDSERANSIF 174
LL +D + +F
Sbjct: 188 LLHVDESASSVEVLF 202
>gi|302839507|ref|XP_002951310.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
nagariensis]
gi|300263285|gb|EFJ47486.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
nagariensis]
Length = 730
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/403 (27%), Positives = 164/403 (40%), Gaps = 115/403 (28%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDES--KGFGIFSS--------------------N 43
EA L+ +WL +L KI S G GIFS+ N
Sbjct: 69 EATLKQLHEWLTSQGADLGKVKIMKSQAGPGAGLGIFSAVAAAAPRPGAPWWSWLWPHRN 128
Query: 44 EFSDG-----VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERL 98
S L PLD AIT DP +G + M + +D+R ++++L +ERL
Sbjct: 129 HTSGAQREPTTLASFPLDAAITVRTACNDPQVGAQFSWMLDKHLLDERTAVMVYLLIERL 188
Query: 99 RKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD--DKVKD 155
R + S + P++ LPTTF P+ FT EL EL+GT L+RAT + L + + +
Sbjct: 189 RGDKSRFAPWIRALPTTFDVPIAFTAQELAELRGTALHRATAAVVRRLEETWSRLEGPLN 248
Query: 156 LVKKLLVLDGDSER----ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSND 211
V + L L + A +FW+R ++P+P S +++
Sbjct: 249 AVARELGLPAPGWKDWLWAYCVFWSRGQSLPVPES----------------GSASSALVG 292
Query: 212 HNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA 271
H+S+G + ++ Q+ +EGLVPG+DFCNH L
Sbjct: 293 HSSKG------------SDKIPIQI----------------LEGLVPGLDFCNHRLGPPP 324
Query: 272 T--WEV-----------DGTGLIT----------------------GVPFSMYLLSVERS 296
WEV G G G P S L +
Sbjct: 325 QCWWEVVAPERPKGSPAAGPGSPASASCSASSPSSLQASSARSHSLGAPDSASLGATLHG 384
Query: 297 SFHSE--KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYP 337
H +E+ ISYG K NEELL LYGF +++NP D+LM++ P
Sbjct: 385 GAHVRPGEELYISYGEKSNEELLMLYGFALEDNPHDHLMLYCP 427
>gi|159486014|ref|XP_001701039.1| hypothetical protein CHLREDRAFT_142793 [Chlamydomonas reinhardtii]
gi|158281538|gb|EDP07293.1| predicted protein, partial [Chlamydomonas reinhardtii]
Length = 769
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 160/390 (41%), Gaps = 94/390 (24%)
Query: 15 WLQVNKVELRGCKIKYSDES--KGFGIFSSNEFS------------------------DG 48
WL+ ++ +IK S G G+F++ S
Sbjct: 115 WLKAQGADVSKVQIKPSQAGPGAGLGLFATAAVSAQRHGASWWSWLWPRRGGAAGQREPT 174
Query: 49 VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPY 107
L PL AIT + QDP +G + M + +D+R ++++L VERLR + S++ P+
Sbjct: 175 TLAAFPLAAAITARKACQDPQLGEQFSWMLDRQLLDERTAVMVYLLVERLRGDKSAYAPW 234
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDK-------VKDLVKKL 160
+ LP +F P+ F +L EL+GTTL +A E + + + +DL +
Sbjct: 235 IRALPQSFDAPINFAAADLSELRGTTLAKAVEAVSRRMRDTWSRLESPLAALCRDLGLRR 294
Query: 161 LVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELIN 220
D D A ++W+R ++P+P E + ++ + + SN H
Sbjct: 295 PTWD-DWVWAYCVWWSRGQSLPVP---------ESGSATKALASLSAASNGHGHGNGASG 344
Query: 221 GLNDIKNE-----AQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAAT--W 273
N ++VN +V +EGLVPG+DF NH W
Sbjct: 345 NGNGGGAGLAAGAVRKVNIEV----------------VEGLVPGLDFANHRHSPPPQCWW 388
Query: 274 EV---------DGTGLITGVPFSMYLLS-----------VERSSFHSE------KEISIS 307
EV D TG TG + S + R H+ +E+ IS
Sbjct: 389 EVVAPERPTGKDATG-STGSASASATPSSASTSASSNATMVRLQLHAGTRVRPGEELFIS 447
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMIHYP 337
YG+K NEELL LYGF NP ++LM++ P
Sbjct: 448 YGDKSNEELLMLYGFAAPGNPHEFLMLYCP 477
>gi|303272869|ref|XP_003055796.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463770|gb|EEH61048.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 677
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 139/308 (45%), Gaps = 50/308 (16%)
Query: 49 VLLVVPLDLAITP-MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKP 106
VL VP+ LA++ D G R M E GE+D+R ++L L VER R + S K
Sbjct: 178 VLASVPMRLAVSADAAANHDTPAGAVFREMLESGEIDERLAVMLLLIVERRRGEGSPIKA 237
Query: 107 YLDMLPTT--FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL--- 161
Y+D LP PL++ +E+ L+GT L+RA Q++ L ++ D KKL+
Sbjct: 238 YVDALPKPGEHKTPLFYDANEMKALRGTNLHRAVSAQRRRLASVLKDSAIPAGKKLMRAV 297
Query: 162 ---VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE-LSNDHNSRGE 217
V + E+ W LP + DL + + SA+ ++ D E
Sbjct: 298 ASHVPPEEREKKTGGGW-------LPWWLGGGNGKRDLRRRRATKPSAKPITMD-----E 345
Query: 218 LINGLNDIKNEAQRV----NSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDL-KAAAT 272
+ + A + + + +GA +E +VPGIDF NH + A
Sbjct: 346 FLWAYATFWSRALALPIGPDPEASGA-------------VEAIVPGIDFANHSCARPNAR 392
Query: 273 WEVDGT----GLITGVP-FSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN 327
W V G G P ++ LSV +E+ ISYG+K NEELL+++GF N
Sbjct: 393 WAVANASGREGATAGEPTVTLECLSVPGPG----EEVLISYGDKPNEELLFVHGFAEREN 448
Query: 328 PDDYLMIH 335
P D L++H
Sbjct: 449 PHDALVLH 456
>gi|307102325|gb|EFN50631.1| hypothetical protein CHLNCDRAFT_59475 [Chlorella variabilis]
Length = 562
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 150/381 (39%), Gaps = 109/381 (28%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKG-FGIFSSNEFSDGV------------- 49
+T +L WLQ ++ K SD G +G ++++ V
Sbjct: 49 TTAEQLTAMAAWLQQQGADIAAIDFKQSDTDAGRYGAYATDAIRQRVARSFWGTVAGWAG 108
Query: 50 --------LLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN 101
+ PL +T + + P GP + + E G D R +++L L VER R
Sbjct: 109 AGRNAAVTVASFPLSSTLTAATLARHPQQGPILQQLLELGVADGRDVVMLHLAVERHRLR 168
Query: 102 SSWK------PYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD 155
+L +LPT+FG L+F++ ++ L+GTTLY AT L++ +L ++ K++
Sbjct: 169 ERGPGTGGELAWLALLPTSFGTTLFFSELDMQWLRGTTLYTATRLRRDSLRGAWE-KLEP 227
Query: 156 LVKKLLVLDG--------DSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
++L G D ANS+FW+RA+ FP D +I
Sbjct: 228 SARQLAQASGLQAAPGLDDWLWANSVFWSRAI--------AFPSPTPDGGSSVAIQ---- 275
Query: 208 LSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDL 267
EG+VPG+DFCNH +
Sbjct: 276 ----------------------------------------------EGIVPGLDFCNHAV 289
Query: 268 KAAATWE----------VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELL 317
+ A E G G ++ L+ R + E++I YGNKGNEELL
Sbjct: 290 EVALQVEDALAQHRRPRSKHAGRPQG---AISLVCPRRGAPAPGAELTIDYGNKGNEELL 346
Query: 318 YLYGFVI-DNNPDDYLMIHYP 337
+LYGF + NN + L + P
Sbjct: 347 FLYGFALPGNNESEVLTLMLP 367
>gi|145354720|ref|XP_001421625.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581863|gb|ABO99918.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 375
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 141/328 (42%), Gaps = 69/328 (21%)
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDEL 126
P +G ++ G VD+R ++ F +ER R + S+WK Y+D LP + PL F+D+EL
Sbjct: 18 PTLGSTFESLLRRGVVDERLAVMCFFMIERRRGEESAWKEYIDSLPRAYDAPLSFSDEEL 77
Query: 127 -LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
EL GTT+Y + QK ++ ++++ V+ ++
Sbjct: 78 ERELSGTTVYAPVKAQKAHVKKMFEECVRPAMR--------------------------- 110
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATS-TLT 244
EL+ N+ G ++ L D+ + Q + + +
Sbjct: 111 ---------------------ELTQADNAAGSSLHMLPDVSEKEFAWAFQTFWSRALAIP 149
Query: 245 STQGETLWIEGLVPGIDFCNH--DLKAAATWE-VDGTGLITGVPFSMYLLSVERSSFHSE 301
G ++ ++ +VPG+D NH +A A WE V+ + G ++ R+ E
Sbjct: 150 VGAGGSVTVDSVVPGVDMVNHAPRARANARWEHVEDSSRPDGGYVALVSAPPNRTMKDGE 209
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYP---AEAIHSIPLSDSKALLLEEQK 358
EI I+YG+K NEELL+ YGF + +N + M+ +P +A HS E+
Sbjct: 210 -EIFINYGDKSNEELLFTYGFALKDNAVEERMVFFPPWAGDAEHS-----------EDVT 257
Query: 359 AQLRCLLPKSLLEHGFFAAGHPKDGNND 386
++ L K L +H PK G D
Sbjct: 258 RRIELLRAKGLPQHVVLPTTPPKRGFRD 285
>gi|308812738|ref|XP_003083676.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
gi|116055557|emb|CAL58225.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
Length = 483
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 141/343 (41%), Gaps = 101/343 (29%)
Query: 25 GCKIKYSDESKG-FGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEV 83
G + Y D S+G G F N VL+ P ++ DP +G + + DG +
Sbjct: 70 GVGVFYVDGSEGKRGWFRKNR----VLMTSPDAAVVSARTATMDPKLGEKYADLMRDGTL 125
Query: 84 -DDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDEL-LELKGTTLYRATEL 140
D+R ++FL VER + + S+W Y+D LP ++ PL +D EL ELKGT +Y A
Sbjct: 126 PDERVAAMVFLMVERRKGEASAWGGYIDALPRSYDAPLSLSDVELERELKGTNVYDAAVA 185
Query: 141 QKQNLLTLYDDKVKDLVKKL---LVLDGDSERANSI--------------FWTRALNIPL 183
Q+ + ++D+ V+ ++ L GD++ A S+ FWTRAL IP+
Sbjct: 186 QRAKVREMFDENVRPAMRGLSEVAAASGDAKLATSLNNATIDEFKWAFQTFWTRALAIPV 245
Query: 184 PHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTL 243
N GE++ G
Sbjct: 246 -----------------------------NDTGEVVEG---------------------- 254
Query: 244 TSTQGETLWIEGLVPGIDFCNHD-LKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHS 300
+VPGID NH KA A WE D T GV + L+S + H
Sbjct: 255 ------------IVPGIDMVNHSRTKANARWEHVDDNTRPDGGV---IALVSNGKKLGHG 299
Query: 301 EKEISISYGNKGNEELLYLYGFV------IDNNPDDYLMIHYP 337
+ EI I YG +E L + +GFV + + ++L+++ P
Sbjct: 300 D-EIFIDYGESSSEALFFTHGFVPEDDDTVSDGKGEHLVLYAP 341
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/306 (27%), Positives = 132/306 (43%), Gaps = 50/306 (16%)
Query: 78 FEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYR 136
+D +D+R ++L L +ER R S PY++ +P F PL ++DDE L GT L+
Sbjct: 1 MDDDVIDERMAVMLLLILERRRGARSPAAPYVEAIPARFRTPLHYSDDETRGLLGTNLHA 60
Query: 137 ATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDL 196
AT Q++ L + ++V+ +L ER ++ R
Sbjct: 61 ATSQQRKTLELVLRERVRPAGARLFAAMRAWERERRGWFGRTF----------------- 103
Query: 197 NKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGL 256
++ +++ D E + + A + + + T +E +
Sbjct: 104 --VGGFTSAGKITED-----EFRWAYSAYWSRALSLPIGADPSAPT----------VEAI 146
Query: 257 VPGIDFCNHDLKAA-ATWEVDGTGLITGVP---------FSMYLLSVERSSFHSEKEISI 306
VPGIDF NH A A WEV G+ G P + LL S +E+ I
Sbjct: 147 VPGIDFANHSCGAPNARWEV--RGVRGGAPDPNDPSGSGPRVELLGEFGSLPAPGEEVVI 204
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYLMIHYP-AEAIHSIPLSDSKALLLEEQKAQLRC-- 363
SYG+K NEELL+++GF +NP D L++ P A + ++P + A L E L
Sbjct: 205 SYGDKTNEELLFVHGFADRDNPHDALVLQPPWAVVLPAVPPARGLAGLDETTVDTLTLWG 264
Query: 364 LLPKSL 369
L PK+L
Sbjct: 265 LSPKAL 270
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 18/152 (11%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRV 64
E E F +WL N V K+ DE G+G+ ++ + D L+ +P + +T
Sbjct: 77 EVHFETFFKWLNTNGVTTDAVKMAKFDE--GYGLQATQDIKMDQELMNIPRKVMMTDQNA 134
Query: 65 LQDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPYLDMLPTTFGN 117
+ P IG R DR L + +F+ E+L+ +S WKPYLD+LP+++
Sbjct: 135 VDSPTIGDLVRG--------DRLLKGMPNVSLAIFILSEKLKSDSFWKPYLDVLPSSYSL 186
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLY 149
PL+FT DE+ +G+T+Y Q +N+ Y
Sbjct: 187 PLYFTPDEIQLFQGSTMYGECLKQHKNIARQY 218
>gi|389747418|gb|EIM88597.1| SET domain protein, partial [Stereum hirsutum FP-91666 SS1]
Length = 411
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 141/311 (45%), Gaps = 51/311 (16%)
Query: 34 SKGFGIFSSNEF-SDGVLLVVPLDLAITP---MRVLQDPLIGPECRAMFEDGEVDDRFLM 89
S G+ + ++ +D ++ P LAITP R LQ+ L P+ ++ ++R LM
Sbjct: 5 SSGYSVIAACPLEADSTIVTCPFTLAITPEQSKRALQNFL--PDIPDVW-----NERQLM 57
Query: 90 ILFLTVERLRKNSSWK--PYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
+++ R+ + + K PYL +LP + PL FT+ EL LKG+ LY AT + +
Sbjct: 58 CTYMSFHRILDHPALKHRPYLHVLPHPSKLRTPLHFTEQELEALKGSNLYGATLDRVREW 117
Query: 146 LTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNS 205
T ++ + +D ++ LV +G W AL+ + VF
Sbjct: 118 QTEWN-QCRDAIR--LVNEG---------W--ALDFSWCVTLVF--------------GF 149
Query: 206 AELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH 265
L DH+ R I + A S + +L +T+ L+PG+D NH
Sbjct: 150 RRLYTDHSHREHWKTVATYISSRA--FPSTLLSPHPSLVATKSSH---PILLPGVDSLNH 204
Query: 266 DLKAAATWEVDGTGLITGVP--FSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFV 323
+W V VP ++ L+ + S HS EI +YG KGN ELL YGFV
Sbjct: 205 ARAHPVSWVVHVPSEPDSVPEASTISLVHHQPISAHS-VEIFNNYGPKGNPELLLGYGFV 263
Query: 324 IDNNPDDYLMI 334
+ +NPDD +++
Sbjct: 264 MPDNPDDTIVL 274
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 96/195 (49%), Gaps = 21/195 (10%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI 59
ME S K+ LQWL+ V + SD +G G++++ G + + +P L I
Sbjct: 1 MESSYPIKIASLLQWLRSKSVTTDSLHFQKSDGHEGVGVYAAKSLQKGEITMEIPFHLTI 60
Query: 60 TPMRVLQDPLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSSW-KPYLDMLPTTFGN 117
+ + +Q L R + +D E+D ++ LFL +ER + + S+ +P++ LP+ F
Sbjct: 61 SKVTAMQSDL-----RQILQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQFDL 115
Query: 118 PLWFTDDELLELKGTTLYRATELQKQN-----------LLTLYDDKVKDLVKKLLVLDGD 166
P+++ D + EL+GT + ++ ++ LL Y++++ ++ + D
Sbjct: 116 PIFWNDSDFAELEGTNVALLAKIMRKQIEADFQAIHIPLLRAYEERLNLRTSEISI--SD 173
Query: 167 SERANSIFWTRALNI 181
E A SI WTRA I
Sbjct: 174 YEWALSIIWTRAFGI 188
>gi|68479052|ref|XP_716460.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
gi|46438129|gb|EAK97465.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
Length = 579
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 145/352 (41%), Gaps = 68/352 (19%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVL--LVVPLDLAITPMRV 64
+ L+W + N ++ +++ + SK + I+ N+ D + +P L ITP
Sbjct: 7 INKLLKWAESNGAQISP-DVEFKEISKNYIGAIYKGNKVPDSPFCPISIPSKLIITPQTA 65
Query: 65 LQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP--TTFGNPLWFT 122
+ E ++ +++D ++ L L ERL NS + PYL++LP + +P ++
Sbjct: 66 FK------EFSKSLKNTDINDNSILKLHLCHERLNGNSFFYPYLNLLPSLSEIDSPYTWS 119
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
++ L+GT L + K+NL+TL V++ K + L D + + +N+
Sbjct: 120 ANDKSYLQGTNLGNSL---KENLVTL----VEEWWKAINALHDDLPKPEQHY----INMK 168
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATST 242
+ Y F +DLNKY LND E A+
Sbjct: 169 FYYEYKF-YTDDDLNKY----------------------LNDENIENWTSFPNYLWASLI 205
Query: 243 LTSTQGETLWIEG--------LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
L S I+ L+P +D NH+ K+ W+V
Sbjct: 206 LKSRSFPAYLIDKNNKQDSAMLLPVVDLLNHNSKSKVHWDVSDN-----------YFKFS 254
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
S KEI +YG KGNEELL YGF I+NN D L I P E I +I
Sbjct: 255 SESIVPGKEIFNNYGLKGNEELLLAYGFCIENNSQDSVALKIKMPEEKIKAI 306
>gi|238880307|gb|EEQ43945.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 579
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 145/352 (41%), Gaps = 68/352 (19%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVL--LVVPLDLAITPMRV 64
+ L+W + N ++ +++ + SK + I+ N+ D + +P L ITP
Sbjct: 7 INKLLKWAESNGAQISP-DVEFKEISKNYIGAIYKGNKVPDSPFCPISIPSKLIITPQTA 65
Query: 65 LQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP--TTFGNPLWFT 122
+ E ++ +++D ++ L L ERL NS + PYL++LP + +P ++
Sbjct: 66 FK------EFSKSLKNTDINDNSILKLHLCHERLNGNSFFYPYLNLLPSLSEIDSPYTWS 119
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
++ L+GT L + K+NL+TL V++ K + L D + + +N+
Sbjct: 120 ANDKSYLQGTNLGNSL---KKNLVTL----VEEWWKAINALHDDLPKPEQHY----INMK 168
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATST 242
+ Y F +DLNKY LND E A+
Sbjct: 169 FYYEYKF-YTDDDLNKY----------------------LNDENIENWTSFPNYLWASLI 205
Query: 243 LTSTQGETLWIEG--------LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
L S I+ L+P +D NH+ K+ W+V
Sbjct: 206 LKSRSFPAYLIDKNNKQDSAMLLPVVDLLNHNSKSKVHWDVSDN-----------YFKFS 254
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
S KEI +YG KGNEELL YGF I+NN D L I P E I +I
Sbjct: 255 SESIVPGKEIFNNYGLKGNEELLLAYGFCIENNSQDSVALKIKMPEEKIKAI 306
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 86/170 (50%), Gaps = 6/170 (3%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLI 70
F WL+ V+ +I +S G G+ ++ + + ++VVP I P++
Sbjct: 760 FENWLKAGGVQFPKLQIANFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVDVAKAHPIL 819
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
GP +FE+ ++D ++ LF+ E+ NS W+P+ D LP+ F + ++ ELLEL+
Sbjct: 820 GP----IFEELHLNDDTILFLFVIYEKGNANSFWRPFYDTLPSYFTTSIHYSATELLELE 875
Query: 131 GTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERA-NSIFWTRAL 179
GT L+ T KQ L + D +L K+ + +S+ + + W R+L
Sbjct: 876 GTNLFEETLHTKQQLNSFRDYLFPELSKQYPDIFPESQFSWENFLWARSL 925
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 89/177 (50%), Gaps = 5/177 (2%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMR 63
++A F +WL + V+ +I +S G GI ++ + + V++ VP I
Sbjct: 733 SDAVYRKFEKWLASDGVQCPKLQIANFQDSTGRGIVTTKKVEENEVIIKVPRKFLINVQV 792
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
+ P++G R E ++D ++ LF+ E+ NS W+P+ D LP+ F + +T
Sbjct: 793 AREHPILG---RIFEEFSGLNDDTILFLFVIYEKENPNSFWRPFFDTLPSYFPTSIHYTS 849
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERA-NSIFWTRAL 179
ELLEL+GT L+ T K++L ++ D +L ++ + +S + + W R+L
Sbjct: 850 TELLELEGTNLFAETLQVKEHLQSIRDMLFPELSEQYPTIFPESLFSWENFLWARSL 906
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 127/334 (38%), Gaps = 63/334 (18%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDP 68
+E F WLQ N+ S+ GI + + L +P+ L +T + +
Sbjct: 1 MEEFEGWLQANEARYPRLTFAVSELGGRGGIATEDILPGEELCSIPVRLVLT-TEIARKS 59
Query: 69 LIGPECRAMFE--DGE----VDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
+G A GE R ++ +L +R +++ W PYL LP P
Sbjct: 60 EVGRLVAAHLNAVQGERLRVSAGRAILCAYLIHQRAAQDAFWGPYLRSLPKHDDRP---- 115
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
D+++ L GT L+ A + ++Q + +D L
Sbjct: 116 DEDIQHLAGTNLFYAMQEKQQQIRESFDLLFPALCHA----------------------- 152
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATST 242
H VFP DL +D + + + L+ Q AT++
Sbjct: 153 --HPTVFP---PDLFTWDHFLWTFTACSSRSFPQTLV---------------QQPTATTS 192
Query: 243 LTSTQGETLWI-EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSE 301
+ + L I E L+PG+D NH + TW +D P + L V +
Sbjct: 193 AHADPYDLLEIDECLLPGLDMLNHQYRKKITWALD--------PSTGRLKFVTEDTVEKG 244
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIH 335
E +YG KGNEELL YGF I++N DY+MI
Sbjct: 245 TEAFNNYGPKGNEELLMGYGFCIEDNEQDYVMIR 278
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 137/353 (38%), Gaps = 72/353 (20%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPM 62
S ++ FL+W +E G ++ SD + GFG+ +++ F D LL VP ++
Sbjct: 75 SRSEAVQRFLEWADRMGIEREGITVRCSDGAMGFGLEATHSFKQDAELLRVPRKAMLSWD 134
Query: 63 RVLQDPLIGPECRAMFEDGEVD---DRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
+ + ++ + FE + D + L + ++L +SSW PYLD LP TF PL
Sbjct: 135 QARKSAML----KKCFEQDMIVKTMDNVALALMVCCQKLSPDSSWLPYLDALPQTFSTPL 190
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+F+ EL +L + Y + L +Y + + V L + SER+
Sbjct: 191 YFSALELRKLSPSPAYEES-------LIMYRNVARQFVYFLAAVQR-SERS--------- 233
Query: 180 NIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGE---LINGLNDIKN--------E 228
SA+ +H + G +N + N
Sbjct: 234 ------------------------RSAKKDKNHAAVGMEPLFLNAPFTVSNFTFDLYRWA 269
Query: 229 AQRVNSQVNGATSTLTS-TQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFS 287
V +++N S + G+ + + L+P +D NH+ T G + +
Sbjct: 270 VACVTTRINFIPSQYAKDSNGQPVAVPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKAT 329
Query: 288 MYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVID--NNPDDY-LMIHYP 337
+ + E++I YG + N + GFV D N D Y L I +P
Sbjct: 330 --------KDYKAGDEVTIFYGIRTNRQFFLHNGFVPDGENKNDTYKLKIGFP 374
>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
UAMH 10762]
Length = 633
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 139/344 (40%), Gaps = 65/344 (18%)
Query: 14 QWLQVNKVELR-GCKIKYSDESKGFGIFSSNEF--SDGVLLVVPLDLAITPMRVLQDPLI 70
QW + N L ++ Y+++S GF ++ + E + VP L+++ + L D
Sbjct: 12 QWFKANGGHLHPSIRLDYNEQS-GF-LWRAREPIPPQEIASSVPYSLSLSYINALVD--- 66
Query: 71 GPECRAMFEDGEVDDRFLMI--LFLTVERLRKNSS-WKPYLDMLPTT--FGNPLWF-TDD 124
+ +F+ D I +L + L K S WKPYLD+LP+ F PLWF
Sbjct: 67 --DAYPVFKQRRTDFTIEAIGFFYLMTQYLNKEQSFWKPYLDVLPSPSEFSTPLWFDAPA 124
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK---LLVLDGDSER-ANSIFWTRALN 180
+L L GT + +++ Y +K L + + + D R A + F +R+
Sbjct: 125 DLAWLDGTDVLHTMLARREVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFT 184
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
+ PQN+ KY ++ ++ N +
Sbjct: 185 ----SRVLLPQNR----KYWPVHRTS-------------------------TNGRRQTVL 211
Query: 241 STLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHS 300
++ + E L L PG+D NHD A W D FS+ L+ +
Sbjct: 212 LDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQ------FSIALVQ----PIEA 261
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIH 342
E+ +YG K N+ELL YGF I NNP D L + P EA+
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKAPPEALQ 305
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 135/333 (40%), Gaps = 79/333 (23%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDP 68
+E ++ WL + E G +I D GFG+ ++ +FS+G L++ P +V+
Sbjct: 83 IENYVSWLHEHGAEFEGVEISEFD-GYGFGLKATKDFSEGSLILT------VPGKVMMSE 135
Query: 69 LIGPECRAMFEDGEVDD------RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
P+ + E +D + LFL +E+ NS WKPY+D+LP + L+F
Sbjct: 136 K-DPKASDLSEFINIDPLLQNMPNVTLALFLLLEKNNPNSFWKPYIDVLPEKYSTVLYFN 194
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
+EL EL+ + ++ ++ L LY V R + F+ + I
Sbjct: 195 SEELAELRPSPVFESS-------LKLYRSIV---------------RQYAYFYNKIHTID 232
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATST 242
LP +N +D+ +D N R + R N+ V G T
Sbjct: 233 LP----VLKNLQDIFTFD------------NYRWA-------VSTVMTRQNNIVQGTAFT 269
Query: 243 LTSTQGETLWIEGLVPGIDFCNHDL-KAAATWEVDGTGLITGVPFSMYLLSVERSSFHSE 301
LT+ +P D CNH K + ++ L G +++ + +
Sbjct: 270 LTN---------AFIPLWDMCNHKHGKITTDFNLE---LNRGECYAL-------QDYRRD 310
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N +L GFV +N D L I
Sbjct: 311 EQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSI 343
>gi|241959368|ref|XP_002422403.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
gi|223645748|emb|CAX40410.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
Length = 579
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 147/348 (42%), Gaps = 58/348 (16%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVL--LVVPLDLAITPMR 63
++ ++W + N ++ +++ + S + I+ N+ D + +P L ITP
Sbjct: 6 QINKLVKWAESNGAQI-SPDVEFKEISTNYIGAIYKGNKVPDSPFYPISIPFKLIITPRN 64
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPT--TFGNPLWF 121
++ E ++ ++ ++ L+L ER+ +NS + PYL++LP+ +P +
Sbjct: 65 AIE------EFSKSLQNTDISHNSILKLYLCHERVNENSFFHPYLNLLPSLSEIDSPYTW 118
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+ ++ L+GT L + K+NL L V++ K + L D + F +N+
Sbjct: 119 SANDKSYLQGTNLGSSL---KENLALL----VEEWWKAINALHDDIPKPEQHF----INM 167
Query: 182 PLPHSYVFPQNQEDLNKY---DSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNG 238
+ Y F +DL+KY ++I N N LI N++ +
Sbjct: 168 KYYYEYKF-YTDDDLDKYLNDENIENWTSFPN--YLWASLILKSRSFPAYLIDKNNKQDS 224
Query: 239 ATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSF 298
A L+P +D NH+ K+ W++ G S
Sbjct: 225 AM---------------LLPVVDLLNHNSKSKVHWDIFENHFKFG-----------SESI 258
Query: 299 HSEKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
KEI +YG KGNEELL YGF I+NN D L I P E I +I
Sbjct: 259 EPGKEIFNNYGLKGNEELLLAYGFCIENNLQDSVALKIKMPEEKIKAI 306
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 95/242 (39%), Gaps = 59/242 (24%)
Query: 98 LRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLV 157
L++ S + PY+ +LP TF PL+F DDE+ L GT L L ++ L + K +
Sbjct: 122 LKEQSQFWPYIRLLPKTFDTPLYFNDDEMERLAGTNLGAGDVLLRKQLWMEEWEAGKQFL 181
Query: 158 KKLLVLDGDSERANSIFWT---RALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
+ + +ERA W RA I S FP + SI + LS+D
Sbjct: 182 EGV-----GAERAREYTWDLFLRAATIYTSRS--FPSKLVGITMDSSIEENTMLSDD--- 231
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
NG L+P +D NH WE
Sbjct: 232 -----NGF-------------------------------PVLIPLVDILNHKPNTKIIWE 255
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
T S L++ E S S+ + +YG KGNEELL YGFVI NP D L +
Sbjct: 256 PTQT--------SFSLITPETISEGSQ--VFNNYGPKGNEELLMGYGFVIPENPGDSLAM 305
Query: 335 HY 336
+
Sbjct: 306 KF 307
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 84/163 (51%), Gaps = 17/163 (10%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITP 61
IS K++ F +W+Q K+ + ++K S G + ++ +DG L + +P + + P
Sbjct: 40 ISDARKIDAFEKWIQSQKLAVNKLEVK-SIPGFRMGTTAKDDIADGELYIAIPDHMLMGP 98
Query: 62 MRVLQDP----------LIGPECRAMFEDGEV---DDRFLMILFLTVERLRKNSSWKPYL 108
RV +P ++ + +M E + ++ LM L + +K S WKPY
Sbjct: 99 ERV--EPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNKVLMYFLLQMYNPKKESFWKPYF 156
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
D++PT +P+++++DEL EL G+ + ++K+ L +YD+
Sbjct: 157 DIMPTNLTSPIFWSEDELQELAGSEVSNMARIEKKRLRAMYDE 199
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 5/56 (8%)
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLR 362
SYGNK N E L GFV+++NP+D + I +P S D+K+ L++ + + R
Sbjct: 292 SYGNKSNYEYLLYNGFVMEDNPNDCVYISFP-----SSNARDAKSYLIKHIEEKRR 342
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 144/353 (40%), Gaps = 71/353 (20%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
F WL + ++ + GF I ++N+ +D ++ +P LAITP +D L
Sbjct: 16 FKSWLVEHGADIHP-DAHFEQMVSGFSILAANDIPADAAIVSIPFSLAITP-EGSKDALQ 73
Query: 71 GP-ECRAMFEDGEVD---DRFLMILFLTVERLRKNS----SWKPYLDMLPTT--FGNPLW 120
G + A G +D +R L+ +L + + S PYLD LP+ PL
Sbjct: 74 GTLKVSADAMQGALDSLSERQLICTYLCMHWIADRSLTVLRHGPYLDTLPSPERLVTPLH 133
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
FT +EL +GT LY AT + +Q D + V+ V+ A S W +A
Sbjct: 134 FTSNELEAFRGTNLYGAT-IDRQR------DWEAEWVQCQAVVS-----AASPAWGKAFT 181
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
++Y + S LS+ + S V T
Sbjct: 182 ---------------WDRY--LTTSTYLSS-------------------RAFPSTVLSQT 205
Query: 241 STLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEV-DGTGLITGVPFSMYLLSVERSSFH 299
TL +T L+PGID NH +W V DG + P ++ V +
Sbjct: 206 PTLHTTPSS---YPVLLPGIDALNHARAHPVSWVVSDGINTSSAEP---HISLVIHTPTT 259
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIHSIPLSDSK 350
+ E+ +YG K N EL+ YGF + NNPDD L I P+ A +P +D+K
Sbjct: 260 TGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLKIGGPSGAFAPVP-ADAK 311
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 65/134 (48%), Gaps = 11/134 (8%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESK--GFGIFSSNEFSDGVLLV-VPLDLAITPMR 63
A LE F QWL N + G IK D+SK G GIFS+ + G LV +PL L ++
Sbjct: 556 ASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIFSTRQIHKGECLVKIPLKLILS--- 612
Query: 64 VLQDPLIGPECRAMFEDG---EVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
D P ++ + + D ++++ L E + S W+PY D+LP F P+
Sbjct: 613 --NDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYFDLLPRVFTIPVL 670
Query: 121 FTDDELLELKGTTL 134
+ +L GT++
Sbjct: 671 GSAQDLAAYTGTSI 684
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 129/333 (38%), Gaps = 85/333 (25%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G ++ + E +GFG+ ++ E ++ + L VP L +T + ++ ++
Sbjct: 82 LMKWASANGASVEGFEV-VNFEEEGFGLRATREIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 140 GPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVR 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T Q +N Y ++ R + PH+
Sbjct: 196 YLQSTQAIHDVFSQYKNTARQY-----------------------AYFYRVIQT-HPHAN 231
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
P + +S + + ++ N I E +G+ TL
Sbjct: 232 KLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL---- 268
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH GLIT + Y L +R FH+
Sbjct: 269 -------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALRDFHAG 306
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 307 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|412985257|emb|CCO20282.1| SET domain-containing protein-like (ISS) [Bathycoccus prasinos]
Length = 615
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 120/275 (43%), Gaps = 55/275 (20%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-------------DGVLLVV 53
AKL FL N++ +K +D FG+FS E S + ++L V
Sbjct: 120 AKLHHFLG--DENRILAHAKPLKSADAPVHFGLFSDGEISRKEKRRSVRKESGETIVLNV 177
Query: 54 PLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILF-LTVERLRKNSSWKPYLDMLP 112
P ++ D +G M ++ +++ R +F L RL+ S WK Y+D LP
Sbjct: 178 PDAYIVSASTAASDEDLGETFTKMLDENKINTRQATQMFVLAQRRLKSESGWKAYVDFLP 237
Query: 113 TTFG-NPLWFTDDELLE-LKGTTLYRATELQKQNLLTLYDDKVKDL--------VKKLL- 161
P+++T+ E+ LKGT LY + QK L Y+ VKD +K+++
Sbjct: 238 RRMDLVPMFWTEREIERGLKGTVLYEMVKTQKARLKEEYETVVKDAFDANVLPKLKEIIP 297
Query: 162 ---------VLDGDSER-----------ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDS 201
+ G++ A ++FWTRAL IP+ + V + L D+
Sbjct: 298 SSSSSVFVSLFGGNANDTSPLSFEEFLWAKALFWTRALTIPIENGRVIVEALVPL--VDA 355
Query: 202 INNSAELSNDHNSRGELINGLNDIKNEAQRVNSQV 236
N+S + N+R +L ND+K+ RV S++
Sbjct: 356 CNHS---TKKPNARYQLS---NDLKSVELRVPSKI 384
>gi|452985350|gb|EME85107.1| hypothetical protein MYCFIDRAFT_214947 [Pseudocercospora fijiensis
CIRAD86]
Length = 623
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 136/337 (40%), Gaps = 49/337 (14%)
Query: 3 ISTEAKLEPFLQWLQVNKVELR-GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITP 61
+++ +L+ QW + L G +I+Y+ E +S+ D + VP LA++
Sbjct: 1 MASHDRLDSLRQWFTNHGGYLNPGVRIEYNAERGVHCRAASDLAPDSRICTVPHSLALSS 60
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLP---TTFGN 117
+ L D + A+F + + + +L L S W+PYLD LP + G
Sbjct: 61 LNALVD-----DSFAVFRSRGLSPQAMGYFYLMHHFLHSEESFWRPYLDTLPPPTSEHGT 115
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTR 177
P WF D+L L+ T ++ + +L + G+ ++
Sbjct: 116 PFWFDADDLAWLEDTDVFFTYK------------------ARLAMHQGNYSTGTAMISKA 157
Query: 178 ALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
+++ P+S+ + + S ++ A +D +G + + Q V ++
Sbjct: 158 GIDV-TPYSWQLYKWAVTIFTSRSFSSRALHPHDSKYWAAYKSGPDGRR---QTVLLDLS 213
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS 297
A+S E L L P +D NH A W D GL T
Sbjct: 214 RASS-------EHLDFPVLFPVLDIPNHSHDAKVDWTFD-PGLFT---------MATTDG 256
Query: 298 FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ EI +YG KGN+ELL YGF + +NP+D +++
Sbjct: 257 AETGSEILNNYGPKGNDELLLGYGFCLRDNPNDKVLM 293
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 76/332 (22%), Positives = 129/332 (38%), Gaps = 85/332 (25%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
++W N + G ++ + E +GFG+ ++ E ++ + L VP L +T + ++ ++G
Sbjct: 83 MKWASANGASVEGFEV-VNFEEEGFGLRATREIKAEELFLWVPRKLLMT-VESAKNSVLG 140
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
P + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 141 PLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRY 196
Query: 129 LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYV 188
L+ T Q +N Y ++ R + PH+
Sbjct: 197 LQSTQAIHDVFSQYKNTARQY-----------------------AYFYRVIQTH-PHANK 232
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
P + +S + + ++ N I E +G+ TL
Sbjct: 233 LP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL----- 268
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSEK 302
L+P D CNH GLIT + Y L +R FH+ +
Sbjct: 269 ------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALRDFHAGE 307
Query: 303 EISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 308 QIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|258569485|ref|XP_002543546.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903816|gb|EEP78217.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 480
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 154/379 (40%), Gaps = 69/379 (18%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSD-----ESKGFGIFSSNEFS-DGVLLVVPLDLAITP 61
K + F+ WL+ + KI+ +D +G + E + D L +P +L ++
Sbjct: 21 KSDGFMNWLKSHPGVRVSFKIRIADLRSNAAGRGVETVACEEIAQDEELFAIPENLVLSV 80
Query: 62 MRV-LQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPL 119
L+D L F D E+D +I+ + E L +S W Y +LPT F +
Sbjct: 81 QNSKLKDHL-------NFTDKELDSWLSLIVTMIYEYLHGGASRWSSYFAVLPTDFDTLM 133
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+++ DEL EL+G+++ Q+ A+ + +
Sbjct: 134 FWSQDELRELQGSSVLSKIGRQE---------------------------ADEMIMGKVY 166
Query: 180 NIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNG 238
+ L + +FP +E L+ ++S + + + G LI DI+NE R +G
Sbjct: 167 PLILDYPGLFPTPKE-LSSFNSQQGKEAILHLAHRMGTLIMAYAFDIENEMDREEEDQDG 225
Query: 239 ATSTLTSTQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITG-VPFSMYLLSVER 295
+T + ET +G+VP D N D A ++ DG ++ VP SM
Sbjct: 226 EDGYITDNEQET--AKGMVPLADMLNADAHRNNARLFQEDGYFIMKSIVPISM------- 276
Query: 296 SSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
E+EI YG +LL YG++ +N Y ++ EAI K +E
Sbjct: 277 -----EEEIFNDYGELPRADLLRRYGYITENY-SPYDVVEISLEAI-------CKVAGVE 323
Query: 356 EQKAQLRCLLPKSLLEHGF 374
+ QL L +LE G+
Sbjct: 324 KNCPQLELLETAEILEDGY 342
>gi|260948894|ref|XP_002618744.1| hypothetical protein CLUG_02203 [Clavispora lusitaniae ATCC 42720]
gi|238848616|gb|EEQ38080.1| hypothetical protein CLUG_02203 [Clavispora lusitaniae ATCC 42720]
Length = 599
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/346 (25%), Positives = 149/346 (43%), Gaps = 55/346 (15%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDP 68
LE L W + N E+ + + + GFG ++ S + + VP L I ++
Sbjct: 2 LEDLLAWARSNGSEI-SSSVSFQNGDSGFGAVCTS--SGPISIRVPSALIIRLETAVEH- 57
Query: 69 LIGPECRAMFEDGEVDDRFL---MILFLTVERLRKN---SSWKPYLDMLPTTFGNP-LWF 121
+GP+ F D R + + +FL ER + S +KPY++ LP +P +W
Sbjct: 58 -LGPD----FGDLATKTRSINAVIKMFLARERAPAHLACSFFKPYIESLPANINSPYVWS 112
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+D+ L L GT L + ++NL L +++ + VL E+ + +N+
Sbjct: 113 AEDQKL-LAGTNLGSSL---RENLAEL----IEEWWSVVSVLPDSMEKPTQHY----VNM 160
Query: 182 PLPHSYVFPQNQEDLNKYDS---INN---------SAELSNDHNSRGELINGLNDIKNEA 229
+ + F ED++KY + + N +A + + L+ + KNE
Sbjct: 161 KFYYEHKF-YTTEDMHKYVTDVDVANWTSFPCYLWAAMIFKSRSFPAYLLQEETERKNEG 219
Query: 230 QRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMY 289
+ + + A T + T + LVP +D NH A TW G+G T
Sbjct: 220 KDIEKEGKTADQTKSETTPPSD-SAMLVPVVDLLNHSPTAKVTW--SGSGGFT------- 269
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIH 335
LS E + +EI +YG KGNEELL YGF ++ N D + +
Sbjct: 270 -LSTEG---EAGQEIFNNYGQKGNEELLLAYGFCLETNEADSVALR 311
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 71/140 (50%), Gaps = 5/140 (3%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVV-PLDLAITPMRVLQDPLI 70
F WL+ V+ +I +S G G+ ++ + + ++V P I +P++
Sbjct: 806 FENWLKQGGVQFPKLQIANFTDSTGRGVVTTKKVDEDEVVVSVPRKYLINVDVAKSNPIL 865
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
GP +FE+ ++D ++ LF+ E+ N+ W+P+ D LP+ F + ++ ELLEL+
Sbjct: 866 GP----IFEELHLNDETILFLFVIYEKENPNTFWRPFYDTLPSYFTTSIHYSSTELLELE 921
Query: 131 GTTLYRATELQKQNLLTLYD 150
GT L+ T KQ L D
Sbjct: 922 GTNLFAETLAVKQQLQAFRD 941
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 76/139 (54%), Gaps = 22/139 (15%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFG--IFSSNEFSDGVLLV-VPLDLAIT 60
++ A +E FL+W N +L GC I DE +G+G + ++ + ++ L++ VP L ++
Sbjct: 82 ASAATIENFLKWANENGAQLNGCSI---DEFEGYGLGVKANVDIAESSLVIAVPRKLMMS 138
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPYLDMLPT 113
+ ++ +D D+ L + +FL +E+ + +S WKPY+D+LP
Sbjct: 139 ---------VENAKESVLKDLIEKDKILGSMPNVALSIFLLLEKYKGDSFWKPYIDILPK 189
Query: 114 TFGNPLWFTDDELLELKGT 132
T+ L+F+ DEL EL+G+
Sbjct: 190 TYTTVLYFSIDELEELRGS 208
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/362 (22%), Positives = 140/362 (38%), Gaps = 70/362 (19%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLDLAITP 61
I + F WL+ + VE I+ E +GFG+ + + L+ +P +T
Sbjct: 74 IDRTTAIPKFKSWLKEHGVEYSAIDIQEVSEEEGFGVIALQDIEIKCPLVTIPRKAMMTY 133
Query: 62 MRVLQDPLIGPECRAMFEDGEVDD---RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNP 118
L G + E EV + L+L ER NS ++PY+DM+P F
Sbjct: 134 EDAKSSYLAG-----LIEGNEVLSVMPNVCLALYLHCERFTLNSKYQPYIDMIPQEFNTI 188
Query: 119 LWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL----VLDGDSERANSIF 174
L+F E+ LKGT L++ ++ K +V++ V +G ++ +
Sbjct: 189 LYFKPHEMKYLKGTA-----------ALSVAINQFKSIVRQFALLYQVFNGSHQKED--- 234
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNS 234
L + +++ F D ++ + + + G+++ L++
Sbjct: 235 -VEKLPLQARNAFTF-----DTYRWCASAVTTRQNKIPTHVGDVLGDLDE---------- 278
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
STL L+P D NH + +T L G+ +
Sbjct: 279 -----NSTL-----------ALIPMWDMFNHAIGPLST---AYNALTRGIE------CLA 313
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLL 354
F + +++ I YG + N +LL GFV+ +P D + IH PL KA LL
Sbjct: 314 MQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVS--QKDPLYSLKAKLL 371
Query: 355 EE 356
E+
Sbjct: 372 EK 373
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 6/132 (4%)
Query: 43 NEFSDG--VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR- 99
N DG + +P+ +TP D G V+DR +++ FL +ER R
Sbjct: 52 NGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFLAIERQRG 111
Query: 100 KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
S W PY+ LP+ F NPL ++ E L L GT L AT+ LL L + V +
Sbjct: 112 MTSHWGPYIRELPSIFSNPLNWSRAETLRLAGTRLGGATKFHDCALLQLTEVCVPAFIAI 171
Query: 160 L---LVLDGDSE 168
L L+L +++
Sbjct: 172 LRAQLILSANTK 183
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 73/137 (53%), Gaps = 18/137 (13%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPM 62
++ A +E FL+W N +L GC I E G G+ ++ + ++ L++ VP L ++
Sbjct: 82 ASAATIENFLKWANENGAQLNGCSID-EFEGYGLGVKANVDIAESSLVIAVPRKLMMS-- 138
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPYLDMLPTTF 115
+ ++ +D D+ L + +FL +E+ + +S WKPY+D+LP T+
Sbjct: 139 -------VENAKESVLKDLIEKDKILGSMPNVALSIFLLLEKYKGDSFWKPYIDILPKTY 191
Query: 116 GNPLWFTDDELLELKGT 132
L+F+ DEL EL+G+
Sbjct: 192 TTVLYFSIDELEELRGS 208
>gi|170047729|ref|XP_001851364.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870047|gb|EDS33430.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 254
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 89/186 (47%), Gaps = 16/186 (8%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDG-VLLVVPLDLAITPM 62
+A +E F +W + + + G KI E GF G+ ++ +FS G V + VP L +
Sbjct: 70 QANIESFCRWAKEHGCQFDGIKI---SEFSGFELGLEATRDFSAGEVFVTVPRKLIFSVT 126
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
+ P I + M + ++L VER + NS WKPYLD+LP + L+FT
Sbjct: 127 ADSKIPDIMKDIPVMMVQNLSNLMLALLLI--VERFQPNSVWKPYLDVLPDRYSTVLYFT 184
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFW------T 176
++ ELKGT+ + Q +N+ Y +K ++ L D+ + ++W
Sbjct: 185 PADMAELKGTSAFVPALNQCKNIARQYGF-IKKFIQSKTALLKDNFTYD-LYWFLSEDGV 242
Query: 177 RALNIP 182
R +N+P
Sbjct: 243 RMVNVP 248
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 72/326 (22%), Positives = 125/326 (38%), Gaps = 81/326 (24%)
Query: 25 GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGE-- 82
G + +S S G G+ ++ + G + IT R + P+ PE A+ E
Sbjct: 36 GLHVVHSGSSHGMGLVATQDLPQGSTI-------ITLPRRVPMPMPDPENAAVLAPSEGV 88
Query: 83 ---VDDR-------FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ +R + L L ER +K S W PY+ MLP +F P++F+ ++ +
Sbjct: 89 ICEIANRVPEELWAMRLGLKLLYERAQKGSYWWPYISMLPHSFTLPIFFSGVDIESIDYA 148
Query: 133 TLYRATELQKQNLLTLYDD--KVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFP 190
+ + + + LL + K++ L +++ G S + ++ W A
Sbjct: 149 PVTHQVKKRCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAMA------------ 196
Query: 191 QNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGET 250
+ + A R++ G T+ L S
Sbjct: 197 ---------------------------------AVSSRAFRIH----GVTNKLCSAM--- 216
Query: 251 LWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
++P ID CNH + A E D + V F L V + + I+++YG
Sbjct: 217 -----MLPLIDMCNHSFQPNAHIEEDLSRDAQDVSF---LKVVTKRNLEKGSAITLNYGP 268
Query: 311 KGNEELLYLYGFVIDNNPDDYLMIHY 336
N+ LL YGFVI +NP D + + Y
Sbjct: 269 LSNDLLLLDYGFVIPDNPHDRIELRY 294
>gi|312385483|gb|EFR29969.1| hypothetical protein AND_00715 [Anopheles darlingi]
Length = 1785
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 141/352 (40%), Gaps = 80/352 (22%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAI--------TPMRVLQDPLIGPECRAMFEDGEVDDR 86
G G+FS + G L+ +P + + RV+ DP + E R + +V +
Sbjct: 711 GKGLFSRKSIAAGDRLISLPFEALLGGTTIEQDESFRVMFDPELLEE-RDRTTEEKVSFQ 769
Query: 87 FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLL 146
L+ +L V+ KN + PYL LP F NP + EL L L + Q Q +
Sbjct: 770 TLLAFYLCVQEHNKNPALAPYLKSLPDNFSNPYFCAKQELTHLPEVLLIAMVK-QNQQIK 828
Query: 147 TLYDDKVKDL---VKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSIN 203
+ + + + L ++ ++VL+ + A+S+ TR++ Y+ P+ +N +
Sbjct: 829 SEFKNLINTLHPELRSMIVLE-RFKWAHSVVNTRSV-------YIDPEIVRMINSFLP-- 878
Query: 204 NSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFC 263
G L +GL ++ A S L P +DF
Sbjct: 879 ----------HGGSLFDGL-------------LSDAPSM------------ALAPFLDFF 903
Query: 264 NHDLKAAATWEVDGT------GLITGVPFSM-YLLSVERSSFHSEKEISISYGNKGNEEL 316
NH ++ T L G P + Y L +ER F +I ISYG N L
Sbjct: 904 NHQSGTKTVSKLSLTVSQIRDRLAKGKPLELNYDLFIERE-FDRGAQIYISYGTHNNTSL 962
Query: 317 LYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCL-LPK 367
L YGF + NNP+D++ + L D A + +LRCL LP+
Sbjct: 963 LLEYGFFLTNNPNDFV----------ELTLEDVNAFI--RHDPELRCLRLPR 1002
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 89/187 (47%), Gaps = 14/187 (7%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLI 70
L+WL+ N I+Y + +G G+F+ + G V L +P L + + L
Sbjct: 10 LLEWLKANGGVDNLLDIRYLGKLEGHGVFAKQALTSGQVTLRIPFKLTMNIESAARSDL- 68
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPLWFTDDELLEL 129
R + + ++ D ++ L L ER R +S + P++ LPTTF P+++++ EL EL
Sbjct: 69 ---ARVLEKYPQIPDDEVLALHLMHERSKRSDSFFAPFIASLPTTFDLPVFWSESELNEL 125
Query: 130 KGTTLYRATELQKQNLL----TLYDDKVKDLVKKLLVLDG----DSERANSIFWTRALNI 181
KGT + T+L KQ L ++ V+D + +L D A S+ W+RA +
Sbjct: 126 KGTNVLLLTQLMKQQLQRDFENIHQAVVEDFPEVFALLPTLTLEDYTWAMSVIWSRAFGV 185
Query: 182 PLPHSYV 188
Y+
Sbjct: 186 TREKKYL 192
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 71/143 (49%), Gaps = 5/143 (3%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMR 63
T+ F +WL V +I ++S G G+ ++ + + +V VP I
Sbjct: 1178 TDVVYRKFEKWLTDGGVHFPKLQIANFNDSTGRGVVTTKKVEENECVVSVPRKFLINVDC 1237
Query: 64 VLQDPLIGPECRAMFEDGE-VDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
+ P++ +FE+ ++D ++ LF+ E+ NS W+P+ D LP+ F + +T
Sbjct: 1238 ARKHPVLNS---ILFEEATGLNDDTILFLFVIYEKENPNSFWRPFFDTLPSYFPTSIHYT 1294
Query: 123 DDELLELKGTTLYRATELQKQNL 145
ELLEL+GT L+ T K++L
Sbjct: 1295 TTELLELEGTNLFEETIQIKEHL 1317
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 99/221 (44%), Gaps = 22/221 (9%)
Query: 2 EISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAIT 60
E+S ++ F QWL+ V +K S +G G+ + + S + V+L VP L I
Sbjct: 45 EVSVSPAVDTFWQWLKEEGVVSAKTPVKPSVVPEGLGLVALKDISRNEVVLQVPKRLWIN 104
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
P V IG C + + LFL ER R NS WK Y +LP + ++
Sbjct: 105 PDAVAASE-IGKVCIGL------KPWLAVALFLIRERSRSNSLWKHYFSVLPKETDSTIY 157
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLV---KKL----LVLDGDSERANSI 173
++++EL EL+GT L T KQ + Y ++++ KKL L LD
Sbjct: 158 WSEEELSELQGTQLLNTTRSVKQYVENEYRRLEEEIILPNKKLFPSPLTLD-------DF 210
Query: 174 FWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
FW + S + +N + D IN+SA ++ + ++
Sbjct: 211 FWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDHA 251
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 123/327 (37%), Gaps = 83/327 (25%)
Query: 25 GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGE-- 82
G + +S S G G+ ++ + G + IT R + P+ PE A+ E
Sbjct: 36 GLHVVHSGSSHGMGLVATQDLPQGSTI-------ITLPRRIPMPMPDPENAAVLAPSEGV 88
Query: 83 ---VDDR-------FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT--DDELLELK 130
+ +R + L L ER +K S W PY+ MLP +F P++F+ D E ++
Sbjct: 89 ICEIANRVPEELWAMRLGLKLLYERAQKGSYWWPYISMLPHSFTLPIFFSGVDIESIDYA 148
Query: 131 GTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFP 190
T +VK + LL
Sbjct: 149 PVT-----------------HQVKKRCRFLL----------------------------- 162
Query: 191 QNQEDLNKYDSINNSAE-LSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGE 249
Q +L K +S+ + G L + + + A R++ G T+ L S
Sbjct: 163 QFSSELAKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIH----GVTNKLCSAM-- 216
Query: 250 TLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYG 309
++P ID CNH + A E D + V F L V + + I+++YG
Sbjct: 217 ------MLPLIDMCNHSFQPNAHIEEDLSRDAQDVSF---LKVVTKRNLEKGSAITLNYG 267
Query: 310 NKGNEELLYLYGFVIDNNPDDYLMIHY 336
N+ LL YGFVI +NP D + + Y
Sbjct: 268 PLSNDLLLLDYGFVIPDNPHDRIELRY 294
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 81/160 (50%), Gaps = 14/160 (8%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITP 61
+ EA+++ W Q + ++ G +K ++ S+G G+ + + G +L VP + I
Sbjct: 37 VGAEAQVQTIWSWAQSHGIQ--GEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWIN- 93
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
+ +Q+ +G C + + LFL E +S W+PYLD LP + +PL++
Sbjct: 94 LAAVQNSSLGKACEGL------KPWVAVALFLIHESSNPSSKWRPYLDSLPKSLDSPLFW 147
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL 161
+D+EL EL GT L + L +++ +LV+++L
Sbjct: 148 SDEELAELVGTQLLGSV----TGYLEFLENEYNNLVEEVL 183
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 71/141 (50%), Gaps = 12/141 (8%)
Query: 3 ISTEAKLEP----FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDL 57
+ TE L P F +WLQ V +K S ++G G+ + + S + V+L VP L
Sbjct: 44 VGTEPSLSPAVQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRL 103
Query: 58 AITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN 117
I P V IG C E+ +ILFL ER R++S WK Y +LP +
Sbjct: 104 WINPDAVAASE-IGRVC------SELKPWLSVILFLIRERSREDSVWKHYFGILPQETDS 156
Query: 118 PLWFTDDELLELKGTTLYRAT 138
++++++EL EL+G+ L + T
Sbjct: 157 TIYWSEEELQELQGSQLLKTT 177
>gi|328767499|gb|EGF77548.1| hypothetical protein BATDEDRAFT_27333 [Batrachochytrium
dendrobatidis JAM81]
Length = 483
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 52/88 (59%), Gaps = 10/88 (11%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+VPGIDFCNHD A W+V + V + L++++ + + +++ISYG K N E
Sbjct: 209 IVPGIDFCNHDQTPNAYWKV------SSVDNCVNLIALD--TIDPDVQVTISYGCKPNSE 260
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHS 343
LL+++GFV+D D + I P E I +
Sbjct: 261 LLFIHGFVVDEGYSDCITI--PLECIEA 286
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 130/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 89 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 145
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 146 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 201
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ R + PH+
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYRVIQTH-PHA 237
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ P + +S + + ++ N I E +G+ TL
Sbjct: 238 HKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 275
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH +GLIT + Y L +R F +
Sbjct: 276 --------ALIPLWDMCNHT-----------SGLIT----TGYNLEDDRCECVALQDFRA 312
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 313 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 346
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 95/214 (44%), Gaps = 8/214 (3%)
Query: 2 EISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAIT 60
E+S ++ F QWL+ V +K +G G+ + + S + V+L VP L I
Sbjct: 45 EVSVSPAVDTFWQWLKEEGVVSGKTPVKPGVVPEGLGLVALKDISRNEVVLQVPKRLWIN 104
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
P V IG C + + LFL ER R +S WK Y +LP + ++
Sbjct: 105 PDAVAASE-IGKVCSGL------KPWLAVALFLIRERSRSDSLWKHYFSILPKETDSTIY 157
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++++EL EL+GT L T KQ + + ++++ L S + FW +
Sbjct: 158 WSEEELSELQGTQLLNTTRSVKQYVQNEFRRLEEEIIIPNKKLFPSSITLDDFFWAFGIL 217
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
S + +N + D IN+SA ++ D ++
Sbjct: 218 RSRAFSRLRNENLVVIPLADLINHSARVTTDDHA 251
>gi|332020872|gb|EGI61270.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 247
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 76/149 (51%), Gaps = 24/149 (16%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVLLV-VPLDL------AITPM 62
F+ WL N ++ G I E G+ G+ + +F++ L++ +P L A + +
Sbjct: 86 FINWLTENGAQVDGLSIA---EFPGYDLGLKAETDFAENQLILEIPRALIFSTYTAASEL 142
Query: 63 RVLQ-DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
+LQ DPL+ + + + + L +ER ++NS WKPYLDMLP+++ L+
Sbjct: 143 TILQNDPLV-----------QHMPQVALAIALLIERYKENSKWKPYLDMLPSSYNTVLYM 191
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYD 150
++++ELKG+ A Q +N+ Y
Sbjct: 192 KTNDMIELKGSPTLEAALKQCRNIARQYS 220
>gi|344234084|gb|EGV65954.1| hypothetical protein CANTEDRAFT_101893 [Candida tenuis ATCC 10573]
Length = 607
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 150/369 (40%), Gaps = 80/369 (21%)
Query: 7 AKLEPFLQWLQVNKVELR-GCKIKYSDESKGFGIF--SSNEFSDGV---LLVVPLDLAIT 60
A + LQW + N L + K D + G F S E +D + + +PL LAIT
Sbjct: 2 ASISGLLQWCKENGTLLSPRVEFKNIDATNIGGFFQNSEKEAADDLEDEHIRLPLKLAIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRF------------LMILFLTVERLRK---NSSWK 105
++ F DG D F L+ LFL E+ K S ++
Sbjct: 62 VSDAIKS----------FSDGNEADTFTNISNKTSNINSLLKLFLAREKSEKYLSKSFYQ 111
Query: 106 PYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVL 163
PYL +LP T +P ++ ++ L+GT L A K+NL L V++ + + +L
Sbjct: 112 PYLTLLPSLTDINSPYTWSPEDKESLRGTNLGSAL---KENLAAL----VEEWWQVINLL 164
Query: 164 DGDSERANSIFWTRALNIPLPHSYVFPQ---------NQEDLNKYDSINNSAELSNDHNS 214
++ F +N+ + + F Q ++D N + S N S S
Sbjct: 165 PESIDKPEQHF----VNMKFYYEFKFHQEKDLYELFNTEQDFNNWTSFTNYLWSSLILKS 220
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
R IKN + + +++ A L+P +D NH +KA W
Sbjct: 221 RS---FPAYLIKNVDKEKDIKMDEAM---------------LLPVVDLLNHSMKADVEWS 262
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--L 332
V TG G F + + +S +E+ +YG KGNEELL YGF I+ N D L
Sbjct: 263 VTRTG---GTDF----FNFKSNSALVGRELFNNYGRKGNEELLLAYGFCIEGNEADTTAL 315
Query: 333 MIHYPAEAI 341
I P E +
Sbjct: 316 KIKIPVEML 324
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 116/300 (38%), Gaps = 74/300 (24%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P T + DPL+GP R+ V+D + L
Sbjct: 25 GSGVKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ +P ++ ++FT+DEL G++LY T +Q +V+
Sbjct: 85 RSRTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQ--------RVR 136
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPL--PHSYVFPQNQEDLNKYDSINNSAELSNDH 212
D ++LLV PL H +FP +Q + Y
Sbjct: 137 DDYRQLLV-------------------PLLSQHRDLFPLDQFTIEDYKW----------- 166
Query: 213 NSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH--DLKAA 270
L I + A ++ V+G TS + + P D NH D+K
Sbjct: 167 --------ALCSIWSRA--MDFAVSGTTS-----------VRLVAPLADMLNHSPDVKQC 205
Query: 271 ATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
++ P S L + + ++ I YG+ N LL LYGFV+ +NP+D
Sbjct: 206 HAYD----------PTSGDLSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPND 255
>gi|255723006|ref|XP_002546437.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240130954|gb|EER30516.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 578
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 129/306 (42%), Gaps = 54/306 (17%)
Query: 47 DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKP 106
+G + +P L IT + + G ++ ED + ++ +L ER+ +S + P
Sbjct: 48 NGYPISIPFKLIITTQNAITE--FGKYLKST-EDK--NSNAILKFYLCHERINADSFYHP 102
Query: 107 YLDMLPT--TFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLD 164
YL +LP+ +P ++ + LKGT L + K+NL +L V++ + + +L
Sbjct: 103 YLKLLPSLAAIDSPYTWSAQDKTYLKGTNLGNSL---KENLGSL----VEEWWEVINLLK 155
Query: 165 GDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKY---DSINNSAELSNDHNSRGELING 221
+ + F +N+ + + F +DL+KY + INN N LI
Sbjct: 156 DEVSKPEQHF----INMKFYYDFKF-YTDDDLDKYLNEEDINNWTSFPN--YLWASLI-- 206
Query: 222 LNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEV-DGTGL 280
L A ++ N + L +P +D NH+ +A W+V DG
Sbjct: 207 LKSRSFPAYLIDKSCNKNDAML-------------LPVVDLLNHNPQAKVNWDVSDG--- 250
Query: 281 ITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPA 338
+ S KEI +YG KGNEELL YGF I+ NP D L I P
Sbjct: 251 ---------FFRFKSESIQPGKEIFNNYGLKGNEELLLAYGFCIEGNPRDSVALKIKMPE 301
Query: 339 EAIHSI 344
E + I
Sbjct: 302 EKLKEI 307
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 92/192 (47%), Gaps = 18/192 (9%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
+E +W+ N + G +K + E G G+ ++ EF D V++ +P + I + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVK-TFEGYGRGLCANKEFKKDEVIMSIPYSIQINRINLNH- 58
Query: 68 PLIGPECR-AMFEDGEVDDR----FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
I PE + F +G+ DDR L+ L+L V + PY+++LP T+ PL +T
Sbjct: 59 --IWPEVKLPKFNEGD-DDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPETYDCPLSYT 115
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK----LLVLDGDSER---ANSIFW 175
DEL +KGT LY A E L+ + D L+++ D +R A+ FW
Sbjct: 116 IDELNLMKGTKLYAAVEKINAFLMKVVDYYNNKLIQQFPQYFQSFDDLFKRLQWAHQSFW 175
Query: 176 TRALNIPLPHSY 187
+RA + P +
Sbjct: 176 SRAFLVIYPQPF 187
>gi|312098619|ref|XP_003149111.1| hypothetical protein LOAG_13557 [Loa loa]
gi|307755724|gb|EFO14958.1| hypothetical protein LOAG_13557 [Loa loa]
Length = 288
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 131/321 (40%), Gaps = 60/321 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLI 70
F++W+ N E G I+ G G++++ +F + ++ +P+++ IT V + P
Sbjct: 6 FMEWVIENGGEHFGVDIRDCSNEGGKGLYATTDFRENETVICIPMEIIITAGFVAEMP-- 63
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
C +F+ + ++ F VE+ KNS W PY +LP +F P + +LE +
Sbjct: 64 -GYCD-VFKRYRLKPFEALVYFFLVEK-EKNSKWDPYFKVLPKSFSTPA--SLHPVLEPE 118
Query: 131 G--TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYV 188
L + +QK L T+Y+ K V +G + +PH+
Sbjct: 119 DFPYCLRKQWCIQKNELKTMYE-------KARFVTEGTAGEF------------VPHNRF 159
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
+ Q + D+ G + + + R + N + +T+G
Sbjct: 160 YSQ-------------FVAILADNTIWGHFLWAWHIVNT---RCIYRDNKPHPLIDNTEG 203
Query: 249 ETLWIEGLVPGIDFCNH--DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI 306
++L +VP ID NH D + A W+ ++Y V R H ++I I
Sbjct: 204 DSL---AIVPLIDMLNHSNDSQCCAIWDSK---------LNLYKAIVTR-PIHEGEQIFI 250
Query: 307 SYGNKGNEELLYLYGFVIDNN 327
YG+ N L YGF + +N
Sbjct: 251 CYGSHTNGSLWIEYGFYLKDN 271
>gi|349578505|dbj|GAA23670.1| K7_Yhl039wp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 585
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 153/371 (41%), Gaps = 72/371 (19%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + +FS D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKFSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN--- 237
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 238 ---------------GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLIT 282
G T ++ E L P +D NH WE++ +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDSTNLNEGF----LYPIVDLLNHKNDVPVRWEMNEQNELC 262
Query: 283 GVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEA 340
+ + ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+
Sbjct: 263 FMS--------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKYDFSRLTLKLPSTL 314
Query: 341 IHSIPLSDSKA 351
+ +P+ +K+
Sbjct: 315 VSGLPVDFNKS 325
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 70/139 (50%), Gaps = 12/139 (8%)
Query: 5 TEAKLEP----FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI 59
TE L P F +WLQ V +K S ++G G+ + + S + V+L VP L I
Sbjct: 2 TEPSLSPAVQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWI 61
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P V IG C E+ +ILFL ER R++S WK Y +LP + +
Sbjct: 62 NPDAVAASE-IGRVC------SELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTI 114
Query: 120 WFTDDELLELKGTTLYRAT 138
+++++EL EL+G+ L + T
Sbjct: 115 YWSEEELQELQGSQLLKTT 133
>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
Length = 634
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 135/334 (40%), Gaps = 71/334 (21%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELR-GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAI 59
+E + K++ +QW + +L +I ++D S+GF + + V++ PL L +
Sbjct: 13 LESAAHPKVDALIQWFTEHGGQLSDAVQIAFND-SRGFHMRAVRPLQSPVVVTCPLKLTL 71
Query: 60 TPMRV--LQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLP--TT 114
+ + + Q ++ + R G++ D L L L +R + K S W Y+ LP +
Sbjct: 72 SCLNLDPEQKHVLFVDSRLQPCQGKIPDHILTYLLLIEQRNKGKESPWHAYIACLPGAES 131
Query: 115 FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
PLWF D+++ L GT+L A + ++++L ++ ++K V D S+
Sbjct: 132 MTTPLWFDDEDMAFLAGTSLAPAAK-ERKSLYYQQWEQALGIMKDAGVALADEVDFESLL 190
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNS 234
W A I +++ S + DH + L+ + DI N
Sbjct: 191 W--AATIFTSRAFI----------------STHILPDHETV-PLLFPIVDILN------- 224
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
H + A WE FS+ LL E
Sbjct: 225 ------------------------------HSVSAKVEWEFQPLA-----SFSLKLL--E 247
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNP 328
+F + +E+ +Y K N+ELL YGF +++NP
Sbjct: 248 GDTFTAGQELFNNYAPKQNDELLLGYGFCLEHNP 281
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/355 (21%), Positives = 136/355 (38%), Gaps = 91/355 (25%)
Query: 4 STEAKLEPFLQWLQ------VNKVELRG------------CKIKYSDESKGFGIFSSNEF 45
+ E ++ FL W + +K+ELR C + DE I +
Sbjct: 125 TPETRMTNFLSWARDMAGGVFDKIELRTTGPEGDRGFFATCDLAPGDELASMPI--ATII 182
Query: 46 SDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWK 105
S+ + P+ +A+ +L+ + P + R L+ +L R + +S +
Sbjct: 183 SEQLASRSPVGMAMLSSPMLKRRGVTP----------IPGRTLICAYLIANRGKLDSPFY 232
Query: 106 PYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDG 165
Y+++LP T+ +PLW+ D EL L GT N+ ++ + + L
Sbjct: 233 HYINILPQTYSDPLWWNDAELDHLDGT-----------NIGGYIQERRNQVRNQFL---- 277
Query: 166 DSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDI 225
++F + P +FP+ D+ Y++ + +
Sbjct: 278 ------NVFPVLSREQPA----LFPK---DVFTYEAY----------------LWAFSTC 308
Query: 226 KNEAQRVNSQVNGATSTLTSTQGETL---WIEGLVPGIDFCNHDLKAAATWEVDGTGL-- 280
+ A + VN T + G + +E L+P +D NH A+ TW D T +
Sbjct: 309 SSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASITWFTDETSVRF 368
Query: 281 ITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIH 335
TG + +++ +YG K NEELL YGF + NN D++ I
Sbjct: 369 FTG------------AKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQ 411
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 89/189 (47%), Gaps = 14/189 (7%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITP 61
+S+ A + L+WL+ + I+Y + +G G+F+ + G V L VP L +
Sbjct: 1 MSSPAVVTRLLEWLEAHGAADSLLDIRYLGKLEGHGVFAKRALTSGQVTLQVPFKLTMNT 60
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSW-KPYLDMLPTTFGNPLW 120
L + + ++ D ++ L L ER + S+ P++ +PTTF P++
Sbjct: 61 ESAATSDL----APVLEKYPQIPDDEVLALHLMHERSKGGESFFAPFIASMPTTFDLPVF 116
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDD-------KVKDLVKKLLVLDGDSER-ANS 172
+T+ EL ELKGT + T+L KQ+L +++ D+ L L D A S
Sbjct: 117 WTEAELNELKGTNVLLLTQLMKQHLERDFENIHQAVAADFPDIFASLPTLTIDDYMWAMS 176
Query: 173 IFWTRALNI 181
+ W+RA +
Sbjct: 177 VIWSRAFGV 185
>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 445
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 80/179 (44%), Gaps = 19/179 (10%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F++G +L +P D+ T DPL+GP R+ VDD +
Sbjct: 25 GRGVRTLRRFNEGERILTIPRDVLWTVEHAYADPLLGPVLRSARPPLSVDDTLATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + +L +P ++ + ++FT+DEL GT+LY T + L +D +
Sbjct: 85 RSRESGYDGLRSHLAAVPKSYSSSIFFTEDELEVCAGTSLYAIT----KQLGRCIEDDYR 140
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKY-DSINNSAELSNDH 212
LV K A W+RA++ LP ++ L + D +N+S+E+ H
Sbjct: 141 ALVYKW---------ALCTVWSRAMDFALPDG----KSVRLLAPFADMLNHSSEVRQCH 186
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 33/75 (44%), Gaps = 8/75 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P D NH + D P S L + + + ++ I YG+ N
Sbjct: 169 LAPFADMLNHSSEVRQCHAYD--------PLSGNLSILAGKGYEAGDQVFIHYGSVPNNR 220
Query: 316 LLYLYGFVIDNNPDD 330
LL LYGFVI +NP+D
Sbjct: 221 LLRLYGFVIPSNPND 235
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 130/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 89 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 145
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 146 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 201
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ PH+
Sbjct: 202 RYLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTH-----------------PHA 237
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ P + +S + + ++ N I E +G+ TL
Sbjct: 238 HKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 275
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH +GLIT + Y L +R F +
Sbjct: 276 --------ALIPLWDMCNHT-----------SGLIT----TGYNLEDDRCECVALQDFRA 312
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 313 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 346
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 69/131 (52%), Gaps = 8/131 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
++ F QWL V + +G G+ + + S + V+L +P L I P V+
Sbjct: 50 VQTFWQWLSDQDVVSAKTPARPGLVPQGLGLVAQRDISRNEVVLEIPKKLWINP-DVVAA 108
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
IG C G V + LFL E+L+++S+W+PYLD+LP + + ++++++EL
Sbjct: 109 SEIGNVC------GGVKPWVSVALFLIREKLKEDSTWRPYLDVLPESTNSTIFWSEEELA 162
Query: 128 ELKGTTLYRAT 138
EL+GT L T
Sbjct: 163 ELQGTQLLSTT 173
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 69/136 (50%), Gaps = 8/136 (5%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPM 62
S ++ F +WLQ V +K S ++G G+ + + S + V+L VP L I P
Sbjct: 1 SLSPAVQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPD 60
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
V IG C E+ +ILFL ER R++S WK Y +LP + ++++
Sbjct: 61 AVAASE-IGRVC------SELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWS 113
Query: 123 DDELLELKGTTLYRAT 138
++EL EL+G+ L + T
Sbjct: 114 EEELQELQGSQLLKTT 129
>gi|6321748|ref|NP_011824.1| Efm1p [Saccharomyces cerevisiae S288c]
gi|731615|sp|P38732.1|EFM1_YEAST RecName: Full=N-lysine methyltransferase EFM1; AltName:
Full=Elongation factor methyltransferase 1
gi|2289868|gb|AAB65051.1| hypothetical protein [Saccharomyces cerevisiae]
gi|285809860|tpg|DAA06647.1| TPA: Efm1p [Saccharomyces cerevisiae S288c]
gi|392299055|gb|EIW10150.1| Efm1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 585
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 155/367 (42%), Gaps = 64/367 (17%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + +FS D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKFSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 241 S-TLTSTQGETLWIEG-------------LVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
S + +++ I G L P +D NH WE++ + +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDRTNLNEGFLYPIVDLLNHKNDVPVRWEMNEQNELCFMS- 265
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
+ ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+ + +
Sbjct: 266 -------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKFDFSRLTLKLPSTLVSGL 318
Query: 345 PLSDSKA 351
P+ +K+
Sbjct: 319 PVDFNKS 325
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 94/199 (47%), Gaps = 32/199 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
+E +W+ N + G +K D G G+ ++ EF D +++ +P + I + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFD-GYGRGLCANKEFKKDEIIMSIPYSIQINRINLNH- 58
Query: 68 PLIGPECR-AMFEDGEVDDR----FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
I PE + F +G+ DDR L+ L+L V + PY+++LP T+ PL +T
Sbjct: 59 --IWPEVKLPKFNEGD-DDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPETYDCPLSYT 115
Query: 123 DDELLELKGTTLYRATELQKQNLLTL---YDDKV-----------KDLVKKLLVLDGDSE 168
DEL +KGT LY A E L+ + Y++K+ DL K+L +
Sbjct: 116 IDELNLMKGTKLYAAVEKINAFLMKVVDYYNNKLIQQFPQYFQPFDDLFKRL-------Q 168
Query: 169 RANSIFWTRALNIPLPHSY 187
A+ FW+RA + P +
Sbjct: 169 WAHQSFWSRAFLVIYPQPF 187
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 7/80 (8%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L+P DF NH +A T+ I+ + + ++I +Y + NE
Sbjct: 192 SLIPFCDFSNHCTQAKVTY-------ISNTQTETFSFQTNEALVKPGEQIFNNYRIRSNE 244
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
+LL YGFV +NNP D L++
Sbjct: 245 KLLLGYGFVEENNPCDNLLL 264
>gi|396467116|ref|XP_003837845.1| hypothetical protein LEMA_P121650.1 [Leptosphaeria maculans JN3]
gi|312214409|emb|CBX94401.1| hypothetical protein LEMA_P121650.1 [Leptosphaeria maculans JN3]
Length = 632
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 130/313 (41%), Gaps = 48/313 (15%)
Query: 27 KIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD-----PLIGP--ECRAMFE 79
++ Y+D S GF + + +LL PL L+++ + + D P+ CR
Sbjct: 245 EVVYND-SHGFHMLAKGPLHAALLLSCPLRLSLSILNLESDRNQVLPIDSTLQHCR---- 299
Query: 80 DGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLP--TTFGNPLWFTDDELLELKGTTLYR 136
G++ D L L L +R R +S W Y+ LP + PLWF +++ L GT L
Sbjct: 300 -GKIPDHILAHLLLIEQRNRGAASPWHAYITCLPGPESMTTPLWFHEEDAAFLAGTGLAP 358
Query: 137 ATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDL 196
A + +K++L +D+ V ++F L IPL + + +
Sbjct: 359 AAQERKRDLCREWDNVV------------------AVF--HELAIPLAAFTICALLESNE 398
Query: 197 NKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGL 256
+ + +L+ ++D A + + ++ + Q ET+ + L
Sbjct: 399 SSRPPDRSHGAYCKVSLRASKLLTRISDSLFWAATIFTSRAFISTHILPGQ-ETVPV--L 455
Query: 257 VPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERS-SFHSEKEISISYGNKGNEE 315
P ID NH A W+ P + L + +F + +E+ +Y K N+E
Sbjct: 456 FPVIDILNHSPTARVEWDFQ--------PHQSFALKCQAGETFTAGQELFNNYAPKQNDE 507
Query: 316 LLYLYGFVIDNNP 328
LL YGF +++NP
Sbjct: 508 LLLGYGFCLEDNP 520
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 130/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ PH+
Sbjct: 195 RYLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTH-----------------PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ P + +S + + ++ N I E +G+ TL
Sbjct: 231 HKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH +GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------SGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 95/199 (47%), Gaps = 32/199 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQD 67
+E +W+ N + G +K + E G G+ ++ EF D +++ +P + I + +
Sbjct: 1 MEDIKKWVIQNGGIIDGVDVK-TFEGYGRGLCANKEFKQDEIIMSIPYSIQINRINLNH- 58
Query: 68 PLIGPECR-AMFEDGEVDDR----FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
I PE + F +G+ DDR L+ L+L + + PY+++LP T+ PL +T
Sbjct: 59 --IWPEVKLPKFNEGD-DDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPKTYDCPLSYT 115
Query: 123 DDELLELKGTTLYRATELQKQNLLTL---YDDKV-----------KDLVKKLLVLDGDSE 168
DEL +KGT LY A E L+ + Y++K+ DL K+L +
Sbjct: 116 IDELNIMKGTKLYVAVEKINAFLMKVVDYYNNKLIQQFPQYFQPFDDLFKRL-------Q 168
Query: 169 RANSIFWTRALNIPLPHSY 187
A+ FW+RA + P +
Sbjct: 169 WAHQSFWSRAFLVIYPQPF 187
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 129/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 96 LMKWASENGASVDGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 152
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 153 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 208
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ R + PH+
Sbjct: 209 RYLQSTQAVHDVFSQYKNTARQY-----------------------AYFYRVIQTH-PHA 244
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 245 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 282
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH TGLIT + Y L +R F +
Sbjct: 283 --------ALIPLWDMCNHT-----------TGLIT----TGYNLEDDRCECVALQDFRA 319
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 320 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 353
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 133/358 (37%), Gaps = 73/358 (20%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI-----TPMRVL 65
F++W ++ ++ E G G+ S G ++ VP + R L
Sbjct: 75 FMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNEPRYRQL 134
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDE 125
+ + G M E G + ++ L L +ER R S WKPYLD+LP + PL++T ++
Sbjct: 135 LELMPGA---MMSEQGNI----MLALALIMERFRAKSDWKPYLDLLPDRYTTPLYYTTED 187
Query: 126 LLELKGT-------TLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRA 178
+ EL T L + Q + +KV +L + D +F
Sbjct: 188 MGELAETDAFLPALKLCKHIARQYGFIRRFVQEKVDEL-RDCFTYD--------VFRLLL 238
Query: 179 LNIPLPHSYVFPQNQEDLNKYDSINNSAELSN-DHNSRGELINGLNDIKNEAQRVNSQVN 237
++ +PHS+ NK N AE DH + L D+ N A
Sbjct: 239 FSLLIPHSWAVSTVMTRQNKVPV--NLAEFDGMDHTL---ALIPLWDMANHA-------F 286
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS 297
T+ T ET + + N L+ + T EV I VP
Sbjct: 287 PDTANETRCVAETCY--------NATNEQLECSLTREVSD---IASVP------------ 323
Query: 298 FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
I I YG + + E L GFV NP + + + +IPL +A LLE
Sbjct: 324 ------IFIVYGTRTDAEFLVHNGFVCPRNPHANVQKRF--TLVPAIPLYKERAHLLE 373
>gi|392573178|gb|EIW66319.1| hypothetical protein TREMEDRAFT_35092 [Tremella mesenterica DSM
1558]
Length = 434
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 81/345 (23%), Positives = 132/345 (38%), Gaps = 81/345 (23%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPM---RVLQD 67
L+W++ + + D G I++S S D +L PL L ITP R +
Sbjct: 11 LLEWIKQGRGWIHPSLYLKLDPYTGLSIYTSTPLSPDERILSCPLSLTITPTLATRAICA 70
Query: 68 PLIGPECRAMFEDGEV-DDRFLMILFLTVERLRK---NSSWKPYLDMLPT--TFGNPLWF 121
E + DG ++ L+ ++L + ++ N + Y+D LP T +PL F
Sbjct: 71 EYDIVESELIMRDGTPWSEKILLAVYLVLHWIKPYSPNLHHQIYVDALPPPETLTSPLHF 130
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+D EL L+GT NLL D+ +L+++ ++ ++ + W R +
Sbjct: 131 SDAELDLLEGT-----------NLLGAVRDRKNELMEEAEIVR-EALKEPGTTWERYRDA 178
Query: 182 PL-PHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
S FP + I N E +D GE+ + +
Sbjct: 179 SFYVSSRAFPSS------LLQIQNGTE--SDMILPGEISHPI------------------ 212
Query: 241 STLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGT-----------GLITGVPFSMY 289
LVPG+D NH A TW LI + F
Sbjct: 213 ---------------LVPGLDLLNHARGQAVTWLTHSVPSPDLPIQSINSLIPCISF--- 254
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ + + +I +YG K NEE LY YGFV+ + PDD +++
Sbjct: 255 ---IPHTPMPAHTQIYNNYGPKSNEEFLYAYGFVLPDGPDDTVIL 296
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/333 (21%), Positives = 130/333 (39%), Gaps = 87/333 (26%)
Query: 13 LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 90 MKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 146
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 147 GPLYSQDRILQAMGNIT----LAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEVR 202
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ PH++
Sbjct: 203 YLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTH-----------------PHAH 238
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
P + +S + + ++ N I E +G+ TL
Sbjct: 239 KLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL---- 275
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH +GLIT + Y L +R F +
Sbjct: 276 -------ALIPLWDMCNHT-----------SGLIT----TGYNLEDDRCECVALQDFRAG 313
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 314 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 346
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 114/296 (38%), Gaps = 66/296 (22%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P L T D L+GP + V+D + +
Sbjct: 25 GRGVKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + +++ LP ++ + ++FTDDEL G +LY T+ +Q + +D +
Sbjct: 85 RSRESGYDGLRSHVERLPASYSSSIFFTDDELEVCAGASLYTITKQLQQRI----EDDYR 140
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
DLV ++LV D +FP ++ L+ Y
Sbjct: 141 DLVVRVLVQYPD---------------------LFPLDKFTLHHYKW------------- 166
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
L + + A ++ Q++ +S I L P D NH ++
Sbjct: 167 ------ALCAVWSRA--MDFQLSDGSS-----------IRLLAPFADMLNHSSESKQCHV 207
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
D + S L + + + ++ I YG+ N LL LYGF+I NP+D
Sbjct: 208 YDAS--------SGDLSVLAGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPND 255
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 50/83 (60%), Gaps = 2/83 (2%)
Query: 77 MFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYR 136
MFE+G + F M L + ++ S+ PYL++LP F L+F +DE+ L+ T LY+
Sbjct: 101 MFENGLI--VFYMFLIYELHVEKEKSTHFPYLNLLPREFTTALYFDEDEMAALRSTNLYK 158
Query: 137 ATELQKQNLLTLYDDKVKDLVKK 159
+ + +QNL +Y+ KV+ L+ K
Sbjct: 159 SVQSIRQNLKQIYETKVEYLMNK 181
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 9/81 (11%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH A T+ TG YL + R S S + +YG K N+
Sbjct: 226 LLPTVDILNHKFNAKITY-------FTGSDRRFYLKT--RESLKSGDYVCNNYGAKSNDS 276
Query: 316 LLYLYGFVIDNNPDDYLMIHY 336
L YGFVI NN +D L + +
Sbjct: 277 FLLSYGFVIPNNSEDTLYVQF 297
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 76/161 (47%), Gaps = 12/161 (7%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMR---VLQDPLIGPECRAMFEDGEV--DDRFLM 89
G G+ SS EF +G LL +P L + + V + + G E + E + D+ ++
Sbjct: 27 GNGLCSSKEFHEGDTLLSIPYHLQLNTIELHNVFESMVPGFEVPRLGEGAKNRDDENSVV 86
Query: 90 ILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL- 148
L+L + + + PY++ LPTTF PL ++++EL LKGT L E K L L
Sbjct: 87 YLYLAMNKTNEKCFHFPYINTLPTTFSCPLSYSENELKMLKGTKLLVTVEKTKTFLKKLS 146
Query: 149 --YDDKVKDLVKKLLVLDGDSER---ANSIFWTRALNIPLP 184
Y+ + D +R A+ +FW+RA + P
Sbjct: 147 DYYETLTHQYPTRFQQFDDFYQRLVWAHQVFWSRAFLVIYP 187
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 7/82 (8%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
+ L+P DF NH+ + T+ ++ + L H ++I +Y +
Sbjct: 193 VASLIPFADFSNHNTETKVTY-------VSNRQTQTFSLQTNEKVLHCGEQIFNNYRIRP 245
Query: 313 NEELLYLYGFVIDNNPDDYLMI 334
NE++L YGFVI NP D +++
Sbjct: 246 NEKMLLGYGFVISENPYDEVLL 267
>gi|302696931|ref|XP_003038144.1| hypothetical protein SCHCODRAFT_255149 [Schizophyllum commune H4-8]
gi|300111841|gb|EFJ03242.1| hypothetical protein SCHCODRAFT_255149 [Schizophyllum commune H4-8]
Length = 425
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 131/320 (40%), Gaps = 70/320 (21%)
Query: 33 ESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVD--DRFLM 89
E+KG + + +V P+DL +TP + + I + +R +
Sbjct: 35 ETKGINAVALRDIPASEPVVWCPIDLIVTPQKARESIAIALGLTNAVRQASENWSERQSI 94
Query: 90 ILFLTVERLRKNS---SWKPYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATELQKQN 144
L+L++ L N+ S +PY+D+LP +T + FT+DEL +GT LY AT + ++
Sbjct: 95 GLYLSLHSLLPNNPALSHRPYVDLLPPASTLRAAIHFTEDELAAFRGTNLYGAT-IDRRR 153
Query: 145 LLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINN 204
LL K +DL++ E A+ + W DL
Sbjct: 154 LLEAEWQKSRDLLRTC-----KPELADLLTW-------------------DL-------- 181
Query: 205 SAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCN 264
A + + SR + L + N + +V+SQ L+PG+D N
Sbjct: 182 FAASTTHYTSRAFPSSMLEE--NPSLQVSSQTEPV----------------LIPGVDSLN 223
Query: 265 HDLKAAATWEVDGTGLITGVPFSM----------YLLSVERSSFHSEKEISISYGNKGNE 314
H TW V+ T + P + ++ V + + EI +YG K N
Sbjct: 224 HYRARPVTWLVE-TRPVGQRPVVVNERVPADAPPFVAVVPLTETPAGAEIFNNYGAKPNA 282
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
EL+ YGF +++N DD +++
Sbjct: 283 ELILGYGFTLEDNTDDTIVL 302
>gi|307195796|gb|EFN77610.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 246
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 69/144 (47%), Gaps = 14/144 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVLLV-VPLDLAITPMRVLQDP 68
F+ WL N + G I E G+ G+ + +F++ L++ +P L +
Sbjct: 85 FMNWLTENGARVDGVSIA---EFPGYDLGLKAEADFTENQLILEIPRALIFSTYNA---- 137
Query: 69 LIGPECRAMFEDGEVDD--RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
PE + D V + + + L +ER ++NS WKPYLDMLPT++ L+ ++
Sbjct: 138 --APELVILQNDPLVQHMPQVALAIALLIERHKENSKWKPYLDMLPTSYTTVLYMKATDM 195
Query: 127 LELKGTTLYRATELQKQNLLTLYD 150
+ELKG+ A Q +N+ Y
Sbjct: 196 IELKGSPTLEAALKQCRNIARQYS 219
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/333 (22%), Positives = 130/333 (39%), Gaps = 85/333 (25%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G ++ ++ + +GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 71 LMKWASENGASVEGFEM-FNFKEEGFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 128
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 129 GPLYSQDRILQAMGNIT----LAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEVR 184
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 185 SLQSTQAV-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKDSF 228
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
+ ED S ++ N I E +G+ TL
Sbjct: 229 TY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL---- 257
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH GLIT + Y L +R F +
Sbjct: 258 -------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALRDFRAG 295
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 296 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 328
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 75/333 (22%), Positives = 128/333 (38%), Gaps = 85/333 (25%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G ++ E +GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 82 LMKWASENGASVEGFEVVNFKE-EGFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 140 GPLYSQDRILQAMGNIT----LAFHLLCERANPNSFWQPYIQTLPSEYDTPLYFEEDEVR 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 196 HLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHPH---------ANKLPLKDSF 239
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
+ ED S ++ N I E +G+ TL
Sbjct: 240 TY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL---- 268
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 -------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRAG 306
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 307 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|356514707|ref|XP_003526045.1| PREDICTED: cytochrome P450 704C1-like [Glycine max]
Length = 144
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 441 ELVGSGGERQPSDAEVRAAVWETCGDSGALQLLVDLLQAK 480
+L G ERQ SD +V++AVWE CGDSGALQLL DLL K
Sbjct: 105 KLAGPERERQLSDTDVQSAVWEVCGDSGALQLLFDLLHVK 144
>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 576
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 100/242 (41%), Gaps = 26/242 (10%)
Query: 101 NSSWKPYLDMLPTTFG-NPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
N + P+ +LPTTF PL++TD E EL+ L RA Q++ LY V+ ++
Sbjct: 245 NCTLSPWFRLLPTTFDERPLYWTDAERRELQYDCLLRAISEQERKWKDLYGRFVRQTFER 304
Query: 160 L---LVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLN-KYDSINNSAELS----ND 211
L + G W ALN+ + ++ P ++ + LS N
Sbjct: 305 GGIDLKVRGGLFSEQDFQW--ALNVVVTRAFSGPSETTPFGERWRQACFAGALSLASWNF 362
Query: 212 HNSRGELINGLNDIKNEAQRVNSQVNGAT---STLTSTQGETLWIEGLVPGIDFCNHDLK 268
H L++ + V SQ+ S QG L + P ID NH +
Sbjct: 363 H-----LMDAFAALNVFLVVVLSQLVFDALYPRIYQSMQGAPLKKYVIAPFIDLFNHSSR 417
Query: 269 AAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNP 328
+ + + + LS+ H+ ++ ISYG N+ELL LYGFV ++NP
Sbjct: 418 VQSK-------VAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470
Query: 329 DD 330
D
Sbjct: 471 HD 472
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G + +P D+ T D L+GP R+ VDD M +
Sbjct: 43 GRGVRTLRCFKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFV 102
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + +L LP ++ + ++FTDDEL G++LY T + L +D +
Sbjct: 103 RSRESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALT----KRLGRCIEDDYR 158
Query: 155 DLVKKLLV-------LDG----DSERANSIFWTRALNIPLP 184
LV +LLV LD D + A W+RA++ LP
Sbjct: 159 ALVVRLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLP 199
>gi|303288325|ref|XP_003063451.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455283|gb|EEH52587.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 478
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 47/84 (55%), Gaps = 9/84 (10%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
++ LVP +D CNH + AT E+ D G + Y L ++ + E+ I+YG
Sbjct: 216 VDALVPLVDMCNHRAGSTATLEIVEDDAG------DAFYELRAGVAT-KAGDEVFINYGA 268
Query: 311 KGNEELLYLYGFVIDNNPDDYLMI 334
KGNEELL +GFVI NNP D L +
Sbjct: 269 KGNEELLRCHGFVIPNNPCDVLAV 292
>gi|323333293|gb|EGA74690.1| YHL039W-like protein [Saccharomyces cerevisiae AWRI796]
gi|323337392|gb|EGA78644.1| YHL039W-like protein [Saccharomyces cerevisiae Vin13]
gi|365765301|gb|EHN06812.1| YHL039W-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 438
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 154/367 (41%), Gaps = 64/367 (17%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + + S D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKLSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 241 S-TLTSTQGETLWIEG-------------LVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
S + +++ I G L P +D NH WE++ + +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDRTNLNEGFLYPIVDLLNHKNDVPVRWEMNEQNELCFMS- 265
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
+ ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+ + +
Sbjct: 266 -------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKFDFSRLTLKLPSTLVSGL 318
Query: 345 PLSDSKA 351
P+ +K+
Sbjct: 319 PVDFNKS 325
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQD 67
FLQW + E R ++ D ++ G GI ++ + + L +P I VL
Sbjct: 11 FLQWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPEETTLFTIPRQAIIN---VLTS 67
Query: 68 PLIGPECRAMFEDGEVDDR----------FLMILFLTVERLRKNSS-WKPYLDMLPTTFG 116
L P+ DG +D+ +IL + E L+ +SS WKPY D+LP F
Sbjct: 68 EL--PQKLPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQFD 125
Query: 117 NPLWFTDDELLELKGTTL 134
P++++D ELLEL+GT+L
Sbjct: 126 TPIFWSDGELLELQGTSL 143
>gi|207344851|gb|EDZ71856.1| YHL039Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 468
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 154/367 (41%), Gaps = 64/367 (17%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + + S D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKLSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 241 S-TLTSTQGETLWIEG-------------LVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
S + +++ I G L P +D NH WE++ + +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDRTNLNEGFLYPIVDLLNHKNDVPVRWEMNEQNELCFMS- 265
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
+ ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+ + +
Sbjct: 266 -------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKFDFSRLTLKLPSTLVSGL 318
Query: 345 PLSDSKA 351
P+ +K+
Sbjct: 319 PVDFNKS 325
>gi|348679311|gb|EGZ19127.1| hypothetical protein PHYSODRAFT_493969 [Phytophthora sojae]
Length = 776
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 12/145 (8%)
Query: 2 EISTEA-KLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI 59
EI E+ KL+ FLQW+ + +++Y E+ G+G F++ S G L VP+ + +
Sbjct: 381 EIDEESSKLDRFLQWIARQDFPVNHLELRYVSEAVGYGTFATKPLSAGDAYLKVPVQIVM 440
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRK-----NSSWKPYLDMLP-- 112
L+ P + + + R M+L L + + S WKPYLDMLP
Sbjct: 441 NVHSALKSPWVRQTMHELQKHRASVSREEMLLLLHLLEEKIGPNCLQSRWKPYLDMLPAL 500
Query: 113 -TTFGNPLWFTDDE--LLELKGTTL 134
+T G+PL++ +DE L L+G L
Sbjct: 501 ESTLGSPLFYEEDEEQLKALQGIDL 525
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 72/304 (23%), Positives = 117/304 (38%), Gaps = 76/304 (25%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + FS G +L +P T D L+GP RA+ V+D + + L
Sbjct: 25 GRGVQAQRAFSAGERILTIPAQCLWTVEHAYADRLLGPVLRALQPPLSVEDTLALHILLV 84
Query: 95 VERLRKN------SSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
R R + + + ++D+LP + ++F+D+E+ KG++LY L T
Sbjct: 85 RARRRPDDDGAYEAGRRSHVDVLPDRYTMSIFFSDEEMQVCKGSSLY--------TLTTQ 136
Query: 149 YDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAEL 208
++ D KKLL TR L + H +FP ++ + Y +
Sbjct: 137 LRGRIGDDYKKLL--------------TRVL---MRHRNLFPLSKFGIEHYKWALCTVW- 178
Query: 209 SNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH--D 266
SRG T ++G +L + L P D NH D
Sbjct: 179 -----SRG------------------------MDFTVSEGNSLRL--LAPFADMLNHSSD 207
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDN 326
+K ++ P + L + ++ ++ I YG N LL LYGFV+
Sbjct: 208 VKQCHAYD----------PTTGDLSILASKDYNVGDQVFIYYGPVPNNRLLRLYGFVLPE 257
Query: 327 NPDD 330
NP D
Sbjct: 258 NPHD 261
>gi|322795851|gb|EFZ18530.1| hypothetical protein SINV_15998 [Solenopsis invicta]
Length = 246
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 75/149 (50%), Gaps = 24/149 (16%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVLLV-VPLDL------AITPM 62
F+ WL N + G I E GF G+ + +F++ L++ +P L A + +
Sbjct: 86 FINWLTENGARVDGLSIA---EFPGFDLGLKAETDFTENQLILEIPRALIFSTYTAASEL 142
Query: 63 RVLQ-DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
VLQ DPL+ + + + + L +E+ ++NS+WKPYL+MLP+ + L+
Sbjct: 143 IVLQNDPLV-----------QHMPQVALAVALLIEKYKENSTWKPYLEMLPSNYNTVLYM 191
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYD 150
++++ELKG+ A Q +N+ Y
Sbjct: 192 KTNDMIELKGSPTLEAALKQCRNIARQYS 220
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 130/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASDNGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYAQDRILQAMGNIT----LAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ + A +PL S
Sbjct: 195 RYLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHPQ---------AHKLPLKES 238
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 239 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALRDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 62/127 (48%), Gaps = 5/127 (3%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P L T D L+GP R+ V+D + +
Sbjct: 25 GRGVKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + + ++ LPT++ + ++FT+DEL GT+LY T+ KQ + +D K
Sbjct: 85 RSRESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRI----EDDYK 140
Query: 155 DLVKKLL 161
DL+ ++L
Sbjct: 141 DLIARVL 147
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 127/329 (38%), Gaps = 66/329 (20%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRV-LQDPLI 70
L WL++ +E C I D S G G+ ++ + S G V++ VP + + P + L
Sbjct: 7 LAWLRLAGIECSCCSIDVFDGS-GRGVVATKDISCGEVVVHVPDESVLMPENCSCSEALE 65
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTF-GNPLWFTDDELLEL 129
D E++ L++ +T ++L K+S WK YLD LP + G PL++ ++L L
Sbjct: 66 DAGLTNASGDAEMESIGLILALMTEKKLGKSSKWKGYLDFLPKSIPGMPLFWDSEQLQSL 125
Query: 130 KGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLP-HSYV 188
+GT+L ++K N D R L P +S V
Sbjct: 126 EGTSL-----IEKMNGCKAMPD-------------------------RPLEPPCKFNSVV 155
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
P Q + A L HN+ +R+ S + T G
Sbjct: 156 LPFLQSN----------AHLKLPHNAA------------STRRLYVWATAMVSAYSFTIG 193
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISY 308
E + + +VP D NH A + L + +++ SY
Sbjct: 194 EDRF-QAMVPMWDALNHITGHANV-------RLHHCARKGALRMIATCLITKGEQVINSY 245
Query: 309 GNKGNEELLYLYGFV-IDNNPDDYLMIHY 336
G+ N ELL YGFV D NP D L + +
Sbjct: 246 GDLPNSELLRRYGFVETDPNPHDCLEVAF 274
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 127/347 (36%), Gaps = 91/347 (26%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSD-ESKGFGIFSSNEFS-DGVLLVVPLDLA 58
+ ++ E F QWL N R +K + G G+ ++ + ++LV+P L
Sbjct: 13 VSLAHEESFVRFFQWLHRNGC--RNVPLKPAVFPGTGRGMMATKALKHEELMLVIPQRLL 70
Query: 59 ITPMRVLQDPLIGPEC-RAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFG 116
IT M + D I P RA D + + +FL E+ R+ S W+PY+D+LP +
Sbjct: 71 IT-MDAIMDSYIAPYIERA---DPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEEYS 126
Query: 117 NPLWFTDDELL----ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERAN- 171
P +FT+D+ L+G + E K+ ++ K L L D E A
Sbjct: 127 CPTFFTEDDFRLLPNSLRGKAKAKKYECHKEY------KELAPFFKMLADLFPDQEDAFN 180
Query: 172 --------SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN 223
S TRAL++P
Sbjct: 181 FKDFKWAWSAIKTRALDVP----------------------------------------- 199
Query: 224 DIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITG 283
I E+ R + A T T T + P +D NH +A +
Sbjct: 200 -IGRESCR---HLRDAEDTPTPT---------MFPLVDSINHAAQAKIRHRYNEK----- 241
Query: 284 VPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
S L S + + E+ SYG N+ LL +GFV+ NP+D
Sbjct: 242 ---SRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPED 285
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 129/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 261 LMKWASENGASVEGFEMVDFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 317
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 318 LGPLYSQDRILQAMGNIT----LAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 373
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 374 RYLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHPH---------ANKLPLKDS 417
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 418 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 447
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 448 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALRDFRA 484
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 485 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 518
>gi|365987784|ref|XP_003670723.1| hypothetical protein NDAI_0F01610 [Naumovozyma dairenensis CBS 421]
gi|343769494|emb|CCD25480.1| hypothetical protein NDAI_0F01610 [Naumovozyma dairenensis CBS 421]
Length = 579
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 148/367 (40%), Gaps = 73/367 (19%)
Query: 9 LEPFLQWLQVNKVEL-RGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD 67
LE L W + V + G K ++ DE KGF ++ + ++ + +P L I+
Sbjct: 3 LENLLTWGEKYGVNIPDGLKFQH-DEKKGFYCIATKDVTNPTI-EIPQTLIIS------- 53
Query: 68 PLIGPECRAMFEDGEVDDR----FLMILFLTVERLRK----------------NSSWKPY 107
G R +F D E +D +L +LF ++ K N +KPY
Sbjct: 54 ---GKLSRDVFADIETNDENNNSWLKLLFAKLKFDNKATYLSTEEGNDKNEPLNLKFKPY 110
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDS 167
+D LP +P +T +EL LKGT LY + L T++ + +L+ K+ +
Sbjct: 111 IDALPKIVDSPYLWTPNELKLLKGTNLYNSL---FGKLHTIFKEWY-NLINKM------N 160
Query: 168 ERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKN 227
ER +S F F + +L YD+ +N +N H ++I + +
Sbjct: 161 ERESSNFQVD-----------FTLVEGELKIYDNWSNIDFHNNYHEFYNKIILETVEQRP 209
Query: 228 EAQRVNSQVNGATSTLTSTQGETLWIEG--------LVPGIDFCNHDLKAAATWEVDGTG 279
+ S + S I+ L+P ID NHD A W G
Sbjct: 210 KCWYSFSSFLWSHLIFISRAFPEYVIDNTRSPSNVVLLPIIDLMNHDYNAKVQWSSTNGG 269
Query: 280 LITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLM--IHYP 337
+ + + KE+ +YG KGNEELL YGFV+++N D ++ I P
Sbjct: 270 ---------FTYNYIGGTIAENKELFNNYGAKGNEELLTGYGFVLEDNMYDLVLLKIKPP 320
Query: 338 AEAIHSI 344
+ I I
Sbjct: 321 LKVISDI 327
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 133/343 (38%), Gaps = 89/343 (25%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
++ FL+WL + + G G+ + F G +L VP +L ITP + +
Sbjct: 48 MKDFLEWLTAKGMNFQSQVDVAITNGTGRGLLARRSFMPGETMLAVPPELLITP-DMARR 106
Query: 68 PLIGPECR-------AMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+G R + ED + L+ + LTV ++ + P++ +LP PL+
Sbjct: 107 SEVGRAFREHGLDDCSGGEDSTYECMPLLAMHLTVLYYNESHDFHPWMKILPRKLTTPLF 166
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++D E EL+G+ LY +LDG WT
Sbjct: 167 WSDKEREELQGSNLYN-------------------------MLDG---------WT---- 188
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
N E L++ ++A + HN +L + +K + A
Sbjct: 189 ----------MNVEKLHR-----STARVLGQHNVFPDLPKAIYSLK--------EFKWAY 225
Query: 241 STLTS----TQGETLWIEG----LVPGIDFCNH-DLKAAATWEVDGTGLITGVPFSMYLL 291
+T+ + G++ G + P D NH D+K + T+ S +
Sbjct: 226 ATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSYTFNAA----------SGHFE 275
Query: 292 SVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ F ++I ++Y +K N E L YGFVI++NP DY+ I
Sbjct: 276 LFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGI 318
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 128/333 (38%), Gaps = 85/333 (25%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G ++ S + +GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 82 LMKWASENGASVEGFEM-VSFKEEGFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 140 GPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFGEDEVR 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T Q +N Y ++ + + PH+
Sbjct: 196 YLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHAN 231
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
P + +S + + ++ N I E +G+ TL
Sbjct: 232 KLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL---- 268
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 -------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQAG 306
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 307 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|325186836|emb|CCA21381.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 473
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/263 (23%), Positives = 106/263 (40%), Gaps = 64/263 (24%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+IL + L+ S W +L++LP N L+++ DE+ L GT LY ++ L
Sbjct: 121 IILLFEMYVLQSESKWAHHLEILPKEHRNLLYYSSDEVKALDGTNLYYVAHEMQERLHED 180
Query: 149 YD-------DKVKDLVKKLLVLDGDSERANSIF----WTRALNIPLPHSYVFPQNQEDLN 197
Y+ ++K ++K +L S A ++F + AL+I +
Sbjct: 181 YEFIETRVLPELKHILKHIL---SPSVSATTVFSFANYKWALSIIW-------------S 224
Query: 198 KYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLV 257
++ SI EL V+ T+ T+ ++ +V
Sbjct: 225 RFVSIEIDQEL---------------------------VSTLPFTIDPTKKHC--VKAMV 255
Query: 258 PGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELL 317
P D NHD KA T + D M+ L+ + + ++ I+YG N LL
Sbjct: 256 PVFDMLNHDPKAEMTHKYDAAS-------GMFQLTTHQH-LAAGTQLHINYGPLSNHALL 307
Query: 318 YLYGFVIDNNPDDYLMIHYPAEA 340
LYGF+ +NP D + +H E+
Sbjct: 308 ALYGFMHSHNPHDTVEVHLQMES 330
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 20/138 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQD 67
+LQW + E R ++ D ++ G GI ++ + + +L +P I VL
Sbjct: 11 YLQWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPEETILFTIPRQAIIN---VLTS 67
Query: 68 PLIGPECRAMFEDGEVDDR----------FLMILFLTVERLRKNSS-WKPYLDMLPTTFG 116
L P+ DG +D+ +IL + E L+ ++S WKPY D+LP F
Sbjct: 68 EL--PQKLPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDASRWKPYFDILPQQFD 125
Query: 117 NPLWFTDDELLELKGTTL 134
P++++D ELLEL+GT+L
Sbjct: 126 TPIFWSDGELLELQGTSL 143
>gi|323354740|gb|EGA86574.1| YHL039W-like protein [Saccharomyces cerevisiae VL3]
Length = 533
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 154/367 (41%), Gaps = 64/367 (17%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + + S D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKLSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 241 S-TLTSTQGETLWIEG-------------LVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
S + +++ I G L P +D NH WE++ + +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDRTNLNEGFLYPIVDLLNHKNDVPVRWEMNEQNELCFMS- 265
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
+ ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+ + +
Sbjct: 266 -------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKFDFSRLTLKLPSTLVSGL 318
Query: 345 PLSDSKA 351
P+ +K+
Sbjct: 319 PVDFNKS 325
>gi|151944161|gb|EDN62453.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|190405746|gb|EDV09013.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|256271432|gb|EEU06490.1| YHL039W-like protein [Saccharomyces cerevisiae JAY291]
gi|259146985|emb|CAY80240.1| EC1118_1H21_0111p [Saccharomyces cerevisiae EC1118]
gi|323308842|gb|EGA62078.1| YHL039W-like protein [Saccharomyces cerevisiae FostersO]
Length = 585
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 154/367 (41%), Gaps = 64/367 (17%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + + S D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKLSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALN 180
++ DE++ L GT +Y LT+ D + LVK+ +L +AL+
Sbjct: 120 WSTDEVMNLHGTDVY----------LTM-RDTLNKLVKEWRML------------FQALS 156
Query: 181 IPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGAT 240
I HS Q+++ L+ + +SA + + + +N K E NS V
Sbjct: 157 I--EHS---SQDKQFLSLFQENKDSAVVPLE-----QFCAHINGCKLEDSEWNSFVAYLW 206
Query: 241 S-TLTSTQGETLWIEG-------------LVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
S + +++ I G L P +D NH WE++ + +
Sbjct: 207 SYCIFNSRAFPRVILGRAGTDRTNLNEGFLYPIVDLLNHKNDVPVRWEMNEQNELCFMS- 265
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
+ ++F ++ E+ +YGN NE+ L YGF +N D+ L + P+ + +
Sbjct: 266 -------QTTTFSAQDELFNNYGNISNEKCLLNYGFWDSSNKFDFSRLTLKLPSTLVSGL 318
Query: 345 PLSDSKA 351
P+ +K+
Sbjct: 319 PVDFNKS 325
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 129/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERANPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 195 RYLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKDS 238
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 239 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 68/147 (46%), Gaps = 18/147 (12%)
Query: 16 LQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPEC 74
L + L + +++ +G G+ + + G + VP L +T + + E
Sbjct: 147 LAADGKHLDAIEFVFANAEQGNGVIARRDIPAGQTFINVPEALMMTAEKARKS-----ET 201
Query: 75 RAMFEDGEVD-----------DRFLMILFLTVERLRK-NSSWKPYLDMLPTTFGNPLWFT 122
+ G +D D FL+ +FL VER R NS W PY+D+LP F PL+FT
Sbjct: 202 FQLITSGALDSTELSPAMAKLDNFLLRMFLIVERRRGGNSYWSPYIDLLPQRFRLPLYFT 261
Query: 123 DDELLELKGTTLYRATELQKQNLLTLY 149
+ EL LK + + +Q +N++ Y
Sbjct: 262 EAELELLKPSPALQEAFVQLRNVVRQY 288
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 22/161 (13%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLM----- 89
G + ++++ + G LL VP L + L PE R + +D+ F +
Sbjct: 42 GRSVVAAHDIAAGETLLSVPFSLVVDSADALL-ATSAPEIRRI-----LDEEFPLSPTNE 95
Query: 90 -ILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLY-------RATELQ 141
L L V + NS W+ Y+D+LP+TF L+F+DDEL L+G++L+ RA E Q
Sbjct: 96 NALLLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELSYLEGSSLHYFARQRRRAIESQ 155
Query: 142 KQNLLT-LYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+ T L+ D + + LD + A S+ W+R+ +
Sbjct: 156 YDTIFTPLFVDYPEHFAPEQFSLDA-WKWALSVIWSRSFVV 195
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTLA-- 269
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 270 ---------LIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 73/159 (45%), Gaps = 8/159 (5%)
Query: 34 SKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILF 92
+ G G+ S F +G +L +P + T D +I P ++M VDD + +
Sbjct: 23 ATGRGLGSLRHFKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYIL 82
Query: 93 LTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDK 152
R + + +++ LPT++ + ++FTDDEL G++LY T+ KQ + Y
Sbjct: 83 FVRSRESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142
Query: 153 VKDLVKKLLVLDG-------DSERANSIFWTRALNIPLP 184
V+ L + L + D + A W+RA++ P
Sbjct: 143 VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQP 181
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 130/358 (36%), Gaps = 84/358 (23%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI-----TPMRVL 65
F++W ++ ++ E G G+ S G ++ VP + R L
Sbjct: 75 FMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNEPRYRQL 134
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDE 125
+ + G M E G + ++ L L +ER R S WKPYLD+LP + PL++T ++
Sbjct: 135 LELMPGA---MMSEQGNI----MLALALIMERFRAKSDWKPYLDLLPDRYTTPLYYTTED 187
Query: 126 LLELKGT-------TLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSER-ANSIFWTR 177
+ EL T L + Q + +KV +L D R A S TR
Sbjct: 188 MGELAETDAFLPALKLCKHIARQYGFIRRFVQEKVDELRD---CFTYDVFRWAVSTVMTR 244
Query: 178 ALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
+P+ +L ++D ++++ L + + D NE + V
Sbjct: 245 QNKVPV-----------NLAEFDGMDHTLALIPLWDMAN---HAFPDTANETRCVAETCY 290
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS 297
AT N L+ + T EV I VP
Sbjct: 291 NAT-----------------------NEQLECSLTREVSD---IASVP------------ 312
Query: 298 FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
I I YG + + E L GFV NP + + + +IPL +A LLE
Sbjct: 313 ------IFIVYGTRTDAEFLVHNGFVCPRNPHANVQKRF--TLVPAIPLYKERAHLLE 362
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 79/163 (48%), Gaps = 26/163 (15%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLI--GPECRAMFEDGEVDDRFLM--- 89
G + ++++ + G LL VP L + PL PE R + +D+ F +
Sbjct: 56 GRSVVAAHDIATGETLLSVPFSLVVDSADA---PLATAAPEIRRI-----LDEEFPLSAT 107
Query: 90 ---ILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLY-------RATE 139
L L V + NS W+ Y+D+LP+TF L+F+DDEL L+G++L+ RA E
Sbjct: 108 NENALLLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELSYLEGSSLHHFARQRRRAIE 167
Query: 140 LQKQNLLT-LYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
Q + T L+ D + + LD + A S+ W+R+ +
Sbjct: 168 SQYDTIFTPLFVDYPEHFAPEQFSLDA-WKWALSVIWSRSFVV 209
Score = 42.0 bits (97), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 84/206 (40%), Gaps = 44/206 (21%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNK--- 311
GLVP D N A T +V + +L+ RS ++I ++YG
Sbjct: 215 GLVPWADMFN---MAPETEQVK----VAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQM 267
Query: 312 GNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLE 371
N +LL YGFV++NNP D ++ +P HS S K R LL L+
Sbjct: 268 SNAQLLMDYGFVLENNPHDAVV--FP--MTHSSSASPRK-----------RGLLRAHDLD 312
Query: 372 HGFFAAGHPKDGNNDNKL------------EVDRISSFSWSGQRRMPSYLNKLVFPENFL 419
F G P G L E+D + S G++R+PS +++ N L
Sbjct: 313 RDQFFVGPPALGEFPEHLLAAFRVTVATEQELDALLEQSAQGRQRLPSRVSR----RNEL 368
Query: 420 TALRTIAMQEDEISKVSSLLEELVGS 445
ALR + E + K+ + E V S
Sbjct: 369 AALRLL---ESNVRKLRASYPEHVMS 391
>gi|150864623|ref|XP_001383522.2| hypothetical protein PICST_57570 [Scheffersomyces stipitis CBS
6054]
gi|149385879|gb|ABN65493.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 611
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 115/249 (46%), Gaps = 37/249 (14%)
Query: 88 LMILFLTVERLRK---NSSWKPYLDMLPT--TFGNPLWFTDDELLELKGTTLYRATELQK 142
L+ L+L ER K +S +KPYL +LPT +P +T D+ LKGT L + K
Sbjct: 92 LLKLYLARERSSKYLTSSFYKPYLQLLPTLKDINSPYCWTADDKDLLKGTNLGNSL---K 148
Query: 143 QNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSI 202
+N+ L V++ + + +L E+ ++ F +N+ + + F ++ EDL Y +
Sbjct: 149 ENIGQL----VEEWWQVINLLPESVEKPDAHF----VNMKFYYEFKFYKD-EDLYNYFVV 199
Query: 203 NNSAELSNDHNSRGELINGL-NDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGID 261
N ND ++ N L + +++ + + + + E + L+P +D
Sbjct: 200 QNE----NDADNWTSFPNYLWASLILKSRSFPAYLLKSAVDADFKRDEAM----LLPLVD 251
Query: 262 FCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYG 321
NHD KA +W VD + + S ++ +YG KGNEELL YG
Sbjct: 252 LLNHDSKANVSWSVDDK-----------CFNFKSDSVVPNAQLYNNYGLKGNEELLLAYG 300
Query: 322 FVIDNNPDD 330
F I++N D
Sbjct: 301 FCIEDNASD 309
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 128/333 (38%), Gaps = 87/333 (26%)
Query: 13 LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 83 MKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 140 GPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVR 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T Q +N Y ++ + + PH+
Sbjct: 196 YLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHAN 231
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
P + +S + + ++ N I E +G+ TL
Sbjct: 232 KLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL---- 268
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSE 301
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 -------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRAG 306
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 307 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 126/327 (38%), Gaps = 61/327 (18%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVL 65
E FL WL+ + KIK +D E G GI + ++ + + L +P L ++
Sbjct: 18 EEFLDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINKEEELFAIPQGLVLS----- 72
Query: 66 QDPLIGPECRAMFEDGEVD--DRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFT 122
+ + + E E D +IL + E L+ +S W PY +LPT F +++T
Sbjct: 73 ---FQNSKLKDLMEINERDLGQWLCLILVMIYEYLQGVASPWAPYFKVLPTDFDTLMFWT 129
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
D ELLELKG+ + L G S A +F L +
Sbjct: 130 DAELLELKGSAV--------------------------LGRIGKSA-AEEVFLRDLLPLV 162
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATS 241
+S +FP L Y+S + A L + + G LI D+KN+
Sbjct: 163 SKNSELFPLTS-GLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVE----GEG 217
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFH 299
+ E +G++P D N D A ++ DG YL S
Sbjct: 218 GYVTDDEERQLPKGMIPLADLLNADADRNNACLFQEDG-----------YLAMKSIKSIR 266
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDN 326
+EI YG ELL YG+V DN
Sbjct: 267 KGEEIFNDYGELPRAELLRRYGYVTDN 293
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 147/382 (38%), Gaps = 103/382 (26%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVL 65
AK+ F +W + V+ +I + G+ + + ++G ++L VP L +
Sbjct: 117 AKVAAFNEWARAGGVQTDCVEIT-TFPGYQLGLRVTRDLAEGELVLTVPRQL------IF 169
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDD 124
+ L+ R +F D + L +E++R +S W+P++D LPT + L+FT +
Sbjct: 170 SEELLPEAQRKLFID--FPTHLNVTYMLIIEKVRGAASNWQPFIDTLPTRYNTVLYFTVE 227
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLP 184
++ L+G T A + +
Sbjct: 228 QMQRLRG--------------------------------------------TSACSAAVR 243
Query: 185 HSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLT 244
H V + Y S+ A + D + ++ G+ ++ E A ST+T
Sbjct: 244 HCRVIAR------IYASMYKCAYMQPDDS----VMAGMANLFTEYGLCYELYRWAVSTVT 293
Query: 245 STQG-------------ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGV--PFSMY 289
+ Q + L+P D NH G IT P +
Sbjct: 294 TRQNLVPRQLATDSDGVRNSPMSALIPFWDMANHR-----------CGKITSYYKPSAQQ 342
Query: 290 LLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY---PAEAIHSIPL 346
+ + + +F + ++ I YG++ N + L +GF+ NN DY+ I P +A L
Sbjct: 343 MECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFLDMNNLKDYVHIRLGLSPTDA-----L 397
Query: 347 SDSKALLLE----EQKAQLRCL 364
++ +ALLL E+KA+LR L
Sbjct: 398 AEQRALLLSELNIERKAELRVL 419
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P D+ T D L+GP + VDD +
Sbjct: 25 GRGVKTLRHFKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + + +L LP ++ + ++FT+DEL GT+LY T + L D K
Sbjct: 85 RSRESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAIT----KQLGRCIQDDYK 140
Query: 155 DLVKKLLVLDG-----------DSERANSIFWTRALNIPLP 184
LV +LL+ D + A W+RA++ LP
Sbjct: 141 ALVVRLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLP 181
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 69/143 (48%), Gaps = 14/143 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEF-SDGVLLVVPLDLAITPMRVLQDP 68
F+ WL+ N + G + E G+ G+ + F + ++L +P L +
Sbjct: 84 FINWLKENGANVDGASVA---EFPGYDLGLKAERNFLENELILRIPRGLIFSIHNA---- 136
Query: 69 LIGPECRAMFEDGEVDD--RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
PE + D + + + + L +ER ++NS WKPYLD+LPTT+ L+ T ++
Sbjct: 137 --APELITLQNDPLIQHMPQVALAIALLIERHKENSKWKPYLDILPTTYTTVLYMTAADM 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+ELKG+ A Q +N+ Y
Sbjct: 195 IELKGSPTLEAALKQCRNIARQY 217
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 9/139 (6%)
Query: 2 EISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAIT 60
E T +++ F WL + C +K +G G+ + + S + V+L VP I
Sbjct: 49 ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPL 119
P V IG C + + LFL E+ L +SSWKPY+D+LP + + +
Sbjct: 109 PDTVAASE-IGSVCNGL------KPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTI 161
Query: 120 WFTDDELLELKGTTLYRAT 138
+++++EL EL+G+ L T
Sbjct: 162 YWSEEELSELQGSQLLNTT 180
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 131/327 (40%), Gaps = 61/327 (18%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVL 65
E F+ WL+ + KIK +D E G GI + ++ ++ L +P L ++
Sbjct: 18 EEFMDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLS----- 72
Query: 66 QDPLIGPECRAMFEDGEVD--DRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFT 122
+ + + E E D +IL + E L+ +S W PY +LPT F +++T
Sbjct: 73 ---FQNSKLKDLMEINERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWT 129
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
D ELLELKG+ + L G S A +F L +
Sbjct: 130 DAELLELKGSAV--------------------------LGRIGKSA-AEEVFLRDLLPLV 162
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATS 241
+S +FP L Y+S + A L + + G LI D++N+ +V G
Sbjct: 163 SKNSELFPLTG-GLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDEAE---EVEGEDG 218
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFH 299
+T + L +G++P D N D A ++ DG YL S
Sbjct: 219 YVTDDEERQL-PKGMIPLADLLNADADRNNARLFQEDG-----------YLAMKSIKSIR 266
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDN 326
+EI YG ELL YG+V D+
Sbjct: 267 KGEEIFNDYGELPRAELLRRYGYVTDS 293
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 84 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 140
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 141 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 196
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 197 RYLQSTQAVHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 232
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 233 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 270
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 271 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 307
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 308 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 341
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 130/330 (39%), Gaps = 81/330 (24%)
Query: 2 EISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAIT 60
E T +++ F WL + C +K +G G+ + + S + V+L VP I
Sbjct: 49 ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPL 119
P V IG C + + LFL E+ L +SSWKPY+D+LP + + +
Sbjct: 109 PDTVAASE-IGSVCNGL------KPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTI 161
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+++++EL EL+G+ L T K+ L ++ L +++LV
Sbjct: 162 YWSEEELSELQGSQLLNTTLGVKE----LVANEFAKLEEEVLV----------------- 200
Query: 180 NIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGA 239
PH +FP +D ++ + +++ A
Sbjct: 201 ----PHKQLFP--------FDV------------TQDDFFWAFGMLRSRA---------- 226
Query: 240 TSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA---TWEVDGTGLITGVPFSMYLLSVER- 295
T +G++L L+P D NH A WE+ G GL FS L+ R
Sbjct: 227 ---FTCLEGQSL---VLIPLADLANHSPDITAPKYAWEIRGAGL-----FSRELVFSLRN 275
Query: 296 -SSFHSEKEISISYG-NKGNEELLYLYGFV 323
+ + ++ I Y NK N EL YG
Sbjct: 276 PTPVKAGDQVLIQYDLNKSNAELALDYGLT 305
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RYLQSTQAVHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 97 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 153
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 154 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 209
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 210 RYLQSTQAVHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 245
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 246 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 283
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 284 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 320
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 321 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 354
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 126/334 (37%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSM 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERANPNSFWQPYIQSLPGEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L T Q +N Y ++ + + PH+
Sbjct: 195 RYLHSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 77/374 (20%), Positives = 141/374 (37%), Gaps = 89/374 (23%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVL 65
+K+E F +W + V+ +I + G+ ++ + G +L VP L +
Sbjct: 173 SKIEAFNEWARAGGVKTDCVEIA-TFPGYQLGLRATRDIKAGEQVLSVPRKL------IF 225
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDD 124
+ L+ + R +F + + L + E+LR +S W+P++D LP+ + L+FT +
Sbjct: 226 SEELLPEKQRQLFRNFPTHLKVTYTLIM--EKLRGADSPWQPFIDTLPSRYNTVLYFTVE 283
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLP 184
++ L+GT+ A + +
Sbjct: 284 QMQRLRGTS--------------------------------------------ACSAAVR 299
Query: 185 HSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLND-------IKNEAQRVNSQVN 237
H V + Y S+ A + D + G + N D + V ++ N
Sbjct: 300 HCRVIAR------LYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQN 353
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS 297
+ L I L+P D NH ++ G + + +
Sbjct: 354 LVPRQEIPSDAANLPISALIPYWDMANHRSGKITSFYDQAAGQME---------CTAQEA 404
Query: 298 FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY---PAEAIHSIPLSDSKALLL 354
+ S ++ I YG++ N + L GFV NP DY+ I P +A L++ +A+LL
Sbjct: 405 YKSGEQYFIYYGDRSNADRLVHNGFVDMQNPKDYVQIRLGLSPTDA-----LAEQRAILL 459
Query: 355 E----EQKAQLRCL 364
E+KA+LR L
Sbjct: 460 AELNIERKAELRVL 473
>gi|171678927|ref|XP_001904412.1| hypothetical protein [Podospora anserina S mat+]
gi|170937534|emb|CAP62192.1| unnamed protein product [Podospora anserina S mat+]
Length = 466
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 72/159 (45%), Gaps = 14/159 (8%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P + T DPL+GP R+ V+D +
Sbjct: 47 GRGVRALRRFKKGERILTIPCGVLWTVEHAFADPLLGPALRSARPPLSVEDILATYILFI 106
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LPT++ + ++F+ DEL GT+LY T+ +++ DD +
Sbjct: 107 RSRESGYDGLRSHVAALPTSYSSSIFFSKDELEVCAGTSLYTITKQLDRSI----DDDYR 162
Query: 155 DLV-------KKLLVLDGDS--ERANSIFWTRALNIPLP 184
LV + LL LD + + A W+RA++ LP
Sbjct: 163 ALVVGVLAQHRDLLPLDKFTIEDWALCTVWSRAMDFALP 201
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 72/159 (45%), Gaps = 14/159 (8%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P T + DPL+GP RA V+D + L
Sbjct: 25 GRGVKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ +P ++ ++FTDDEL KG++LY T +Q ++DD +
Sbjct: 85 KSRTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQR---VHDDYRQ 141
Query: 155 DLV------KKLLVLDG----DSERANSIFWTRALNIPL 183
LV + L LD D + A W+RA++ +
Sbjct: 142 LLVALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAV 180
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 131/327 (40%), Gaps = 61/327 (18%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVL 65
E F+ WL+ + KIK +D E G GI + ++ ++ L +P L ++
Sbjct: 18 EEFMDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLS----- 72
Query: 66 QDPLIGPECRAMFEDGEVD--DRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFT 122
+ + + E E D +IL + E L+ +S W PY +LPT F +++T
Sbjct: 73 ---FQNSKLKDLMEINERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWT 129
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
D ELLELKG+ + L G S A +F L +
Sbjct: 130 DAELLELKGSAV--------------------------LGRIGKS-TAEEVFLRDLLPLV 162
Query: 183 LPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATS 241
+S +FP L Y+S + A L + + G LI D++N+ +V G
Sbjct: 163 SKNSELFPLTG-GLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDEAE---EVEGEDG 218
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFH 299
+T + L +G++P D N D A ++ DG YL S
Sbjct: 219 YVTDDEERQL-PKGMIPLADLLNADADRNNARLFQEDG-----------YLSMKSIKSIR 266
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDN 326
+EI YG ELL YG+V D+
Sbjct: 267 KGEEIFNDYGELPRAELLRRYGYVTDS 293
>gi|323304691|gb|EGA58452.1| YHL039W-like protein [Saccharomyces cerevisiae FostersB]
Length = 422
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 141/338 (41%), Gaps = 62/338 (18%)
Query: 33 ESKGFGIFSSNEFS---DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLM 89
E G F + +FS D L+ VP L IT + L + R++ V +L
Sbjct: 15 EDAGVSAFVNEKFSPKPDQALIRVPETLLITSQQALSEFXQAANERSLL--NSVTQLYLS 72
Query: 90 ILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLY 149
L + + S +KPYLD+LP P +++ DE++ L GT +Y LT+
Sbjct: 73 KLKFGTDAVHLKSFYKPYLDVLPLHXPQPYFWSTDEVMNLHGTDVY----------LTM- 121
Query: 150 DDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELS 209
D + LVK+ +L +AL+I HS Q+++ L+ + +SA +
Sbjct: 122 RDTLNKLVKEWRML------------FQALSI--EHS---SQDKQFLSLFQENKDSAVVP 164
Query: 210 NDHNSRGELINGLNDIKNEAQRVNSQVNGATS-TLTSTQGETLWIEG------------- 255
+ + +N K E NS V S + +++ I G
Sbjct: 165 LE-----QFCAHINGCKLEDSEWNSFVAYLWSYCIFNSRAFPRVILGRAGTDRTNLNEGF 219
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +D NH WE++ + + + ++F ++ E+ +YGN NE+
Sbjct: 220 LYPIVDLLNHKNDVPVRWEMNEQNELCFMS--------QTTTFSAQDELFNNYGNISNEK 271
Query: 316 LLYLYGFVIDNNPDDY--LMIHYPAEAIHSIPLSDSKA 351
L YGF +N D+ L + P+ + +P+ +K+
Sbjct: 272 CLLNYGFWDSSNKFDFSRLTLKLPSTLVSGLPVDFNKS 309
>gi|268573124|ref|XP_002641539.1| C. briggsae CBR-SET-27 protein [Caenorhabditis briggsae]
Length = 483
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 116/287 (40%), Gaps = 49/287 (17%)
Query: 85 DRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQ 143
D + LFL V L+ + S W Y+ +LP +F PL++T+++LL+LK + ++ +
Sbjct: 132 DNVGLALFLAVHWLQNEKSKWHSYISILPNSFPTPLFYTEEQLLQLKPSPIF-------E 184
Query: 144 NLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSIN 203
LT Y + LL + N I+ E +
Sbjct: 185 EALTFYRTIARQFCYFLLAV-----AKNKIY-------------------ESAQRRKDAR 220
Query: 204 NSAELSNDHNSRGELINGLNDIKN-EAQRVNSQVNGATSTLTS-TQGETLWIEGLVPGID 261
N+ ++ +N+ + N + V ++VN S S G+ + I L+P +D
Sbjct: 221 NTMDVPIFYNAPFTVYNFTPRLYFWSVGTVTTRVNMVPSENGSGDDGKAIMIPALIPLLD 280
Query: 262 FCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYG 321
NH+ + T VD L+ P + + KE++I YG + E L G
Sbjct: 281 MANHE--SVVTDPVD--DLVCYAPADECAVITSHCDLEAGKEVTIFYGCRSKGEHLIHNG 336
Query: 322 FV-IDNNPDDYLMIHYPAEAIHSIPLSD----SKALLLEEQKAQLRC 363
F+ I++ D+ + IP SD +K L+E+ + C
Sbjct: 337 FIPINHQKQDFFKLKI------GIPKSDKTLEAKKKLIEKYVQNVYC 377
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/336 (21%), Positives = 125/336 (37%), Gaps = 81/336 (24%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITP 61
+S E FL+WL+ + V+ I+ E G+G+ + + ++ + + +P L +T
Sbjct: 86 VSREQCFPVFLKWLEDHGVKSDAVTIE-KFEVGGYGLKAVKDIKAEELFITIPRKLMLT- 143
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMI-------LFLTVERLRKNSSWKPYLDMLPTT 114
++ +GP + DR L + L + E+ NS W PY+++ P T
Sbjct: 144 TETARESSLGPLIK--------KDRILQVMANVSLALHVLCEKYSSNSFWAPYINIFPGT 195
Query: 115 FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
+ PL+F + E+L L+G+ L ++L Y + + E
Sbjct: 196 YTTPLYFEEGEMLHLQGS-------LNFSDVLNQYKSIARQYAYFYKLFQTQPE------ 242
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNS 234
A +PL + F +E + S
Sbjct: 243 ---AAGLPLKECFTF-------------------------------------DEYRWAVS 262
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
V + + ++ G L I L+P D CNH +T G+ S L++
Sbjct: 263 TVMTRQNQVPTSDGRHL-ITALIPMWDMCNHSNGEVSTEFNLGSD-------SAECLAMR 314
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
F ++ ++ I YG + N E L GFV N D
Sbjct: 315 --EFPTDSQVYIFYGMRSNAEFLIHNGFVYPENVHD 348
>gi|119196663|ref|XP_001248935.1| hypothetical protein CIMG_02706 [Coccidioides immitis RS]
gi|392861859|gb|EAS37551.2| SET domain-containing protein [Coccidioides immitis RS]
Length = 469
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 147/373 (39%), Gaps = 70/373 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRV-LQ 66
F+ WL+ + KI+ +D G G+ + E D L +P DL ++ ++
Sbjct: 16 FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVLSVANSKVK 75
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDE 125
D R F D D +I+ + E L+ S W PY +LPT F +++T++E
Sbjct: 76 D-------RINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENE 128
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L EL+G+++ DK+ + A+ + + L + L H
Sbjct: 129 LRELQGSSVL---------------DKI------------GKQEADQVILDKVLPVVLEH 161
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATSTLT 244
+FP L +DS + + + G LI DI+ ++ +
Sbjct: 162 PDLFPPVN-GLASFDSPSGKEVVLQLAHRMGTLIMAYAFDIE-----MDQDEDQDGEDGY 215
Query: 245 STQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGV-PFSMYLLSVERSSFHSE 301
T E +G+VP D N D A ++ DG ++ + P S+ E
Sbjct: 216 VTDDEQEKAKGMVPLADLLNADAHRNNARLFQEDGYFIMRSIAPISI------------E 263
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQL 361
EI YG +LL YG++ +N Y ++ EAI +I +EE QL
Sbjct: 264 MEIFNDYGELPRSDLLRRYGYITENYA-PYDVVEISLEAICNIA-------GVEEGCCQL 315
Query: 362 RCLLPKSLLEHGF 374
L +LE G+
Sbjct: 316 ELLEDAGVLEDGY 328
>gi|320170264|gb|EFW47163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 938
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 81/196 (41%), Gaps = 10/196 (5%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI 59
M+ + + FL W++ N E + D +G + + N D + +P L I
Sbjct: 30 MDSQEQTSMSRFLTWMRGNGAEFGHVDVS-QDWHQGRRLIADNPLKPDDRIAAIPTLLTI 88
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNP 118
+ LQ L E G DD L+ LFL E+ L S+W PY+++LP N
Sbjct: 89 SLDTALQVGLPRAFTTIWHESGSQDD--LLALFLLREKALGARSAWAPYIEILPKKLSNL 146
Query: 119 LWFTDDELLELKGTTLYRATELQKQNL----LTLYDDKVKDLVKKLLVLDGDSERANSIF 174
L+F D EL +L+ L QK L L L + K ++ D A +I
Sbjct: 147 LFFNDGELAQLQNEQLVEQVSQQKSELQGRFLALRQHEADIFGGKAELVLSDFLWARAIV 206
Query: 175 WTRALNIPLPHSYVFP 190
+RA I Y+ P
Sbjct: 207 LSRAFTIH-ARRYLIP 221
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 8/106 (7%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFS--MYLLSVERSSFHSEKEISISYGNKGN 313
L+P D NH + G F M+LL+ +R +E E+ YGN N
Sbjct: 219 LIPFADLLNHRFHPTRGLDESGEFFYRHHDFQNGMFLLTCDRPVNENE-EVEDDYGNLSN 277
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKA 359
+ L LYGFV ++NP + + I+ A+ +H + +ALLL+ + A
Sbjct: 278 AQFLQLYGFVPESNPHECVEINL-ADLLH----GEREALLLKSEYA 318
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 77/165 (46%), Gaps = 11/165 (6%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG--VLLVVPLDLAIT 60
++T + +W N + K+ D+ + I + E + V+L +P +L +
Sbjct: 20 VATSSDGSDVYEWAAANGANV--SKVVLRDDGEAGPILHAKEDIEAGEVILSLPANL-LF 76
Query: 61 PMRVL-QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P RV P++ M E+ + + L+L ER +S WKP+L LP F + L
Sbjct: 77 PTRVSDHSPVVH-----MIENTTIGRITAICLYLISERADSSSHWKPWLQSLPPRFFHAL 131
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLD 164
+++D++L + ++ + +K+N+ Y+ V L+ KL D
Sbjct: 132 SYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFD 176
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 84/180 (46%), Gaps = 16/180 (8%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
+ L+WL+ V+L G + + E G+ + +S S G L VP L + P ++
Sbjct: 1 MSALLRWLEGGGVQLGGVEAVWR-EGMGWALRASKRISPGETFLKVPRHLLLGPHQLRAS 59
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
L + E ++ D M+L L E + +S ++PYLD+LP T P+ ++ +E
Sbjct: 60 SL-----DRLLEGWQLPD--CMLLLLMCESVNSSSFFRPYLDLLPDTVDTPITWSKEEAK 112
Query: 128 ELKGT-TLYRATELQKQNLLTLYD------DKVKDLVKKLLVLDGDSERANSIFWTRALN 180
EL G+ L+RA +L+ + + + DK D LL + A SI +RA
Sbjct: 113 ELVGSPVLHRAVKLRHELARSFQEMKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG 172
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 122/290 (42%), Gaps = 78/290 (26%)
Query: 88 LMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLL 146
++ LFL +E+ + K+S W PY+ LP +F P++FT+ EL L + +A +L+K+ LL
Sbjct: 238 VLSLFLLLEKNKGKDSFWYPYIRSLPNSFTTPVYFTESELNALSPSLQEKARDLKKE-LL 296
Query: 147 TLYDDKVKDLVKKLLVLDG----DSER-ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDS 201
++D + L LD D+ R A S+ TR L QED
Sbjct: 297 HAFNDLEPFVTSCLPELDSTFTFDAFRWAWSVLKTRTL------------YQEDC----- 339
Query: 202 INNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGID 261
S LSN E Q TST LVP +D
Sbjct: 340 --RSPYLSNK----------------EPQ---------TST-------------LVPMLD 359
Query: 262 FCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYG 321
NH A A + + V S Y + V + ++ ISYG + N EL+ +G
Sbjct: 360 LINHSPSAKARFGYN-------VNTSCYEVRV-LEPYRKYDQVFISYGFEENTELMLKFG 411
Query: 322 FVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLE 371
F + NP D++ I+ +E + S+P + EE+K ++ L LL
Sbjct: 412 FFVPENPKDFMKINL-SEMLESLPQIND-----EERKNKVDLLFDSGLLR 455
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 139/324 (42%), Gaps = 58/324 (17%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESK----GFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQ 66
FL WL+ + + + Y + + G GI +S + + D L V+P DL ++
Sbjct: 20 FLLWLKRSSLHFKMHPGIYIADLRSIGAGRGICASRDITEDEELFVIPEDLILSVQNSEA 79
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDE 125
++G +D ++ +I+ + E + + S W PY +LP++F +++TD++
Sbjct: 80 RTVLG------LDDKQLGPWLSLIIAMIYEYYQGEQSKWYPYFGVLPSSFDTLMFWTDEQ 133
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L EL+G+ + ++ K DD + V L+ +ANS+ + P
Sbjct: 134 LSELQGSAV--VGKIGK----AAADDTILQKVVPLI-------QANSLHFP-------PR 173
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDI-KNEAQRVNSQVNGATSTLT 244
S + P N D + SA LS H ++ DI K E ++ +G
Sbjct: 174 SDMPPLNSPD-------SQSALLSLAHRMASLIMAYAFDIEKAEEADEDTAEDGYM---- 222
Query: 245 STQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFHSEK 302
T E +G+VP D N D + A ++ +G+ ++ V + HS +
Sbjct: 223 -TDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKAV-----------RNIHSGE 270
Query: 303 EISISYGNKGNEELLYLYGFVIDN 326
EI YG +LL YG+V DN
Sbjct: 271 EIFNDYGELPRADLLRRYGYVTDN 294
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 69/140 (49%), Gaps = 11/140 (7%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITP 61
+A + FL W + E R K SD + G GI ++ + ++ L +P + I
Sbjct: 5 QAVTDRFLAWFKSVGGEFRDDLSKISDLRSQDAGRGIVATRDIPAETTLFTIPRNAIINV 64
Query: 62 -----MRVLQDPLIGPECRAMFEDGE-VDDRFLMILFLTVERLR-KNSSWKPYLDMLPTT 114
R+L G A E E +D +IL + E L + S WKPY+D+LPT+
Sbjct: 65 ETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLILVMLREYLHGEQSYWKPYIDILPTS 124
Query: 115 FGNPLWFTDDELLELKGTTL 134
F P+++T DEL EL+GT L
Sbjct: 125 FDTPIFWTQDELKELEGTVL 144
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 127/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N G +I +E +GFG+ ++ E ++ + L VP L +T + ++ ++
Sbjct: 82 LIKWATENGASTEGFEIANFEE-EGFGLKATREIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER NS W PY+ LP+ + PL+F +DE+
Sbjct: 140 GSLYSQDRILQAMGNI----TLAFHLLCERANPNSFWLPYIQTLPSEYDTPLYFEEDEVQ 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 196 YLRSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------NASKLPLKDSF 239
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ +D+ + ++ N I E +G+ TL
Sbjct: 240 TY--------------------DDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNH-----------TNGLIT----TGYNLEDDRCECVALQDFKA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 127/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N G +I +E +GFG+ ++ E ++ + L VP L +T + ++ ++
Sbjct: 82 LIKWATENGASTEGFEIANFEE-EGFGLKATREIKAEELFLWVPRRLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER NS W PY+ LP+ + PL+F +DE+
Sbjct: 140 GSLYSQDRILQAMGNI----TLAFHLLCERANPNSFWLPYIQTLPSEYNTPLYFEEDEVR 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 196 YLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------NASKLPLKDSF 239
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ +D+ + ++ N I E +G+ TL
Sbjct: 240 TY--------------------DDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFKA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 85 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 141
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 142 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 197
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 198 RCLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKES 241
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 242 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 271
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 272 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQA 308
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 309 GDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 342
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|341877649|gb|EGT33584.1| CBN-SET-27 protein [Caenorhabditis brenneri]
Length = 501
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 147/368 (39%), Gaps = 59/368 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
++ FL+W N + I + S G + ++ G ++ VP + +T L +
Sbjct: 74 IKAFLKWSDENGIARNNVTIGPTKTS-GLSLQATGPIPKGHIVARVPRNAMMT----LDN 128
Query: 68 PLIGPECRAMFEDGEVD---DRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTD 123
R FE ++ D + LFL ++ + S + PY+ +LP F PL++T+
Sbjct: 129 ARKSNSLRKAFEKDQIVAGMDNVGLALFLATHWMQNEKSRFSPYIAILPNCFPTPLFYTE 188
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPL 183
++LL+LK + ++ + LT Y + L+ + +
Sbjct: 189 EQLLQLKPSPIF-------EEALTFYRTISRQFCYFLMAVSKNK---------------- 225
Query: 184 PHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEA-QRVNSQVNGATS- 241
Y Q ++D N+ E+ +NS + N + + A V ++VN S
Sbjct: 226 --MYEAAQRRKD------ARNTMEVPIFYNSPFTVANFTSRLYFWAVGVVTTRVNMVPSE 277
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGT--GLITGVPFSMYLLSVERSSFH 299
TL + + I L+P +D NH+ +E DG L+ P +
Sbjct: 278 TLIDKDEKPIAIPALIPFLDMANHE-----NFETDGPIEDLVCYSPLEECAVITSHCDMD 332
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDS----KALLLE 355
+ +E++I YG + E L GFV N+ +M + IP +D K L+E
Sbjct: 333 AGREVTIFYGCRSKGEHLIHNGFVPLNHGKQEIM-----KMKIGIPKTDKNLDVKKKLIE 387
Query: 356 EQKAQLRC 363
+ A + C
Sbjct: 388 KYVANVFC 395
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSI 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERANPNSFWLPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 195 QCLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKDS 238
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 239 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFRA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 195 RCLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKES 238
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 239 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 306 GDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 43/82 (52%), Gaps = 6/82 (7%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
E +VPG+ CNH L A + V T L G + L + R E I+ISYG N
Sbjct: 172 EVIVPGVFLCNHSLSAHS---VRYTSLERGT--KAFRLELSRGCVEGEA-ITISYGRLDN 225
Query: 314 EELLYLYGFVIDNNPDDYLMIH 335
+LL YGF ++NNP D + +H
Sbjct: 226 ADLLMFYGFSLENNPYDRVSLH 247
>gi|303322220|ref|XP_003071103.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240110802|gb|EER28958.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 469
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 146/373 (39%), Gaps = 70/373 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRV-LQ 66
F+ WL+ + KI+ +D G G+ + E D L +P DL ++ ++
Sbjct: 16 FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVLSVANSKIK 75
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDE 125
D R F D D +I+ + E L+ S W PY +LPT F +++T++E
Sbjct: 76 D-------RINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENE 128
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L EL+G+++ DK+ + A+ + + L + L H
Sbjct: 129 LRELQGSSVL---------------DKI------------GKQEADQVILDKVLPVVLEH 161
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATSTLT 244
+FP L +DS + + G LI DI+ ++ +
Sbjct: 162 PDLFPPVN-GLASFDSPCGKEVVLQLAHRMGTLIMAYAFDIE-----MDQDEDQDGEDGY 215
Query: 245 STQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGV-PFSMYLLSVERSSFHSE 301
T E +G+VP D N D A ++ DG ++ + P S+ E
Sbjct: 216 VTDDEQEKAKGMVPLADLLNADAHRNNARLFQEDGYFIMRSIAPISI------------E 263
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQL 361
EI YG +LL YG++ +N Y ++ EAI +I +EE QL
Sbjct: 264 MEIFNDYGELPRSDLLRRYGYITENYA-PYDVVEISLEAICNIA-------GVEEGCCQL 315
Query: 362 RCLLPKSLLEHGF 374
L +LE G+
Sbjct: 316 ELLEDAGVLEDGY 328
>gi|254574038|ref|XP_002494128.1| SET-domain lysine-N-methyltransferase [Komagataella pastoris GS115]
gi|238033927|emb|CAY71949.1| SET-domain lysine-N-methyltransferase [Komagataella pastoris GS115]
gi|328354053|emb|CCA40450.1| hypothetical protein PP7435_Chr4-0275 [Komagataella pastoris CBS
7435]
Length = 616
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 149/360 (41%), Gaps = 90/360 (25%)
Query: 6 EAKLEPFLQWLQVNKVELRGC-KIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMR 63
+ ++E ++W + C + +Y+D + GF + + + + +PL LAI P
Sbjct: 2 DQRIEDLIEWTKAGGAHFPDCLEFRYTD-TDGFAAYKIKQADASESTITLPLSLAILPET 60
Query: 64 VLQDPLIGPECRAMFEDGE---VDDRFLMILFLTVER-LRKNSSWKPYLDMLPT--TFGN 117
L E + +ED + + L LFL +R L+ +S K Y+ LP+ G
Sbjct: 61 SLS------EFQLTYEDLQSKNIHPNSLTKLFLCKQRTLKDDSPLKVYIKSLPSGEEVGV 114
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTR 177
P ++T +E LKGT L+ A +L+ +++ + V +L + S R +S F
Sbjct: 115 PYYWTLEEQDLLKGTNLHAAMKLKLASVIDEWFAVVSELPQ--------SHRPSSYF--- 163
Query: 178 ALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
+D+N Y + + A ++D V +Q++
Sbjct: 164 ----------------QDVNLYHAFKSGA---------------VSDSVLHKTTVTTQLD 192
Query: 238 GATSTLTSTQGETLW--------------------IEGLV---PGIDFCNHDLKAAATWE 274
+ ++ T+ LW ++GLV P +D NH+ +W+
Sbjct: 193 KSFTSFTNY----LWAHVILTSRSFPYDLVNKNNPVDGLVMVLPLLDLFNHEPARKVSWD 248
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
VD FS + + S +I+ +YG KGNEELL YGF I+++ D + +
Sbjct: 249 VDREH----KKFS--FTNDDFSDLPIGAQINNNYGPKGNEELLLNYGFCIEDSKTDVVAL 302
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 8/128 (6%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLI 70
F +WL+ V + + +G G+ + + + V+L +P L I P V I
Sbjct: 52 FWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRLWINPETVTASK-I 110
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
GP C G + + LFL E+ + SSW+ YLDMLP + + ++++++EL ELK
Sbjct: 111 GPLC------GGLKPWVSVALFLIREKYEEESSWRVYLDMLPQSTDSTVFWSEEELAELK 164
Query: 131 GTTLYRAT 138
GT L T
Sbjct: 165 GTQLLSTT 172
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 67/133 (50%), Gaps = 14/133 (10%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQ 66
+ E L+WL+ KI+Y + G+ ++ + G ++L VP IT L+
Sbjct: 146 RFETLLKWLEQGGSHFEKLKIRYYT-ADYRGVHAARDIKKGEIILYVPKHQIIT----LE 200
Query: 67 DPLIGPECRAMFEDGEVDDRFL------MILFLTVERLRKNSSWKPYLDMLPTTFGN-PL 119
+ P + M+E G + R + + ++ E+ + S W+ Y+D+LP F N P+
Sbjct: 201 MAMTSPVGKKMYEKG-LRQRLISPKHSFLSTYIMQEKRKPESQWQIYIDILPKNFSNFPI 259
Query: 120 WFTDDELLELKGT 132
+FT++E + LKG+
Sbjct: 260 FFTEEERIWLKGS 272
>gi|58264960|ref|XP_569636.1| cytoplasm protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57225868|gb|AAW42329.1| cytoplasm protein, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 455
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 122/337 (36%), Gaps = 78/337 (23%)
Query: 32 DESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQ----------DPLIGPECRAMFED 80
DES G +F+ N + L+ P LA+T Q LI P+ A E
Sbjct: 31 DESTGLSVFADNTIKPEEKLVSCPFSLAVTSELATQAVCEIAGVEKGQLIWPKGHAK-EG 89
Query: 81 GEVDDRFLMILFLT---VERLRKNSSW------KPYLDMLP--TTFGNPLWFTDDELLEL 129
++R + +L + + R + W K YL LP + PL+FT+ EL L
Sbjct: 90 KPWNERMRIGAYLGFHWIYQERPETEWPRSLVHKTYLASLPPASNLSTPLYFTESELQLL 149
Query: 130 KGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL-NIPLPHSYV 188
G+ NLL +D+ K+ + L + + + W R L S
Sbjct: 150 SGS-----------NLLGAVEDRRKEWSAESAALR-SVLKEDGLTWERYLATATYMSSRA 197
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
FP DL E ++++G +
Sbjct: 198 FPSKLLDLPP---------------------------DGEMTPQPTRIDGVSKP------ 224
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISY 308
L+PG+D NH W + G + V S S ++ +Y
Sbjct: 225 ------ALLPGVDIFNHARGQPILWLSSLVSIPNGSRGVPSISLVSSSVCESGSQLFNNY 278
Query: 309 GNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIP 345
G K NEELL YGFV+D+NPDD + + I S+P
Sbjct: 279 GAKPNEELLLGYGFVLDSNPDDIVNLRL---GIASLP 312
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 99/222 (44%), Gaps = 22/222 (9%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI 59
++ S + + F QW++ + +K + +G G+ ++ S V+L VP I
Sbjct: 62 LDPSLPSNVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWI 121
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P V D IG C + + LFL E L+ +S W+ YLD+LP + +
Sbjct: 122 NPDAV-ADSEIGNVCSGL------KPWISVALFLIRENLKGDSRWRRYLDILPQETDSTV 174
Query: 120 WFTDDELLELKGTTLYRAT----ELQKQNLLTLYDD---KVKDLVKKLLVLDGDSERANS 172
+++++EL E++GT L T E K L + ++ + KDL + LD
Sbjct: 175 FWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRITLD-------D 227
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
FW + S + QN + D +N+SA ++ + ++
Sbjct: 228 FFWAFGILRSRAFSRLRGQNLVLIPFADLVNHSANVTTEEHA 269
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 127/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSI 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T Q +N Y ++ + + PH+
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQY-----------------------AYFYKVIQTH-PHA 230
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
P + +S + + ++ N I E +G+ TL
Sbjct: 231 NKLP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 306 GDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 8/128 (6%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLI 70
F +WL+ V + + +G G+ + + + V+L +P L I P V I
Sbjct: 52 FWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRLWINPETVTASK-I 110
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
GP C G + + LFL E+ + SSW+ YLDMLP + + ++++++EL ELK
Sbjct: 111 GPLC------GGLKPWVSVALFLIREKYEEESSWRVYLDMLPQSTDSTVFWSEEELAELK 164
Query: 131 GTTLYRAT 138
GT L T
Sbjct: 165 GTQLLSTT 172
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 99/222 (44%), Gaps = 22/222 (9%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAI 59
++ S + + F QW++ + +K + +G G+ ++ S V+L VP I
Sbjct: 62 LDPSLPSNVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWI 121
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P V D IG C + + LFL E L+ +S W+ YLD+LP + +
Sbjct: 122 NPDAV-ADSEIGNVCSGL------KPWISVALFLIRENLKGDSRWRRYLDILPQETDSTV 174
Query: 120 WFTDDELLELKGTTLYRAT----ELQKQNLLTLYDD---KVKDLVKKLLVLDGDSERANS 172
+++++EL E++GT L T E K L + ++ + KDL + LD
Sbjct: 175 FWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRITLD-------D 227
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
FW + S + QN + D +N+SA ++ + ++
Sbjct: 228 FFWAFGILRSRAFSRLRGQNLVLIPFADLVNHSANVTTEEHA 269
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRV 64
E + F+ WL+ N V+ +I++ D G+GI ++ +F +G + L +P + +T
Sbjct: 74 EKNFDGFMGWLKSNSVDAEAVEIQHFDVG-GYGIKATRDFKEGELFLAIPRSVMMT---- 128
Query: 65 LQDPLIGPECRAMFEDGEVDD---RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
D A+ D + L+ L + E S W PYL +LP ++ +PL+F
Sbjct: 129 -TDTAKNSALGALIADNRILQTMPNILLALHVLCELCSPASFWLPYLKILPHSYSSPLYF 187
Query: 122 TDDELLELKGT 132
++L LK +
Sbjct: 188 NPEDLQLLKAS 198
>gi|301097023|ref|XP_002897607.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106825|gb|EEY64877.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 673
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 69/136 (50%), Gaps = 9/136 (6%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQ 66
KLE F QW+ + + +++Y E+ G+G F+ + G V L VP+ + + ++
Sbjct: 386 KLERFQQWIARHHFPVNSLELRYVSEAVGYGTFAVKRLAIGDVYLKVPVQVVMNVWSAVK 445
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-----KNSSWKPYLDMLP---TTFGNP 118
+ + + + DR M+L L + + ++S WKPYL+MLP + +P
Sbjct: 446 SRWVSQTMQELQKQRVSVDREEMLLLLHLLEEKFGPNHRHSFWKPYLEMLPDLEDSLNSP 505
Query: 119 LWFTDDELLELKGTTL 134
L++ + EL L+GT L
Sbjct: 506 LFYEEVELKTLEGTDL 521
>gi|134109425|ref|XP_776827.1| hypothetical protein CNBC3180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259507|gb|EAL22180.1| hypothetical protein CNBC3180 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 455
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 124/337 (36%), Gaps = 78/337 (23%)
Query: 32 DESKGFGIFSSNEFS-DGVLLVVPLDLAITP----------MRVLQDPLIGPECRAMFED 80
DES G +F+ N + L+ P LA+T V + LI P+ A E
Sbjct: 31 DESTGLSVFADNTIKPEEKLVSCPFSLAVTSELATQAVCEIAGVEKGQLIWPKGHAK-EG 89
Query: 81 GEVDDRFLMILFLT---VERLRKNSSW------KPYLDMLP--TTFGNPLWFTDDELLEL 129
++R + +L + + R + W K YL LP + PL+FT+ EL L
Sbjct: 90 KPWNERMRIGAYLGFHWIYQERPETEWPRSLVHKTYLASLPPASNLSTPLYFTESELQLL 149
Query: 130 KGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL-NIPLPHSYV 188
G+ NLL +D+ K+ + L + + + W R L S
Sbjct: 150 SGS-----------NLLGAVEDRRKEWSAESAALR-SVLKEDGLTWERYLATATYMSSRA 197
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
FP DL E ++++G + +
Sbjct: 198 FPSKLLDLPP---------------------------DGEVTPQPTRIDGVSKPV----- 225
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISY 308
L+PG+D NH W + G + V S + ++ +Y
Sbjct: 226 -------LLPGVDIFNHARGQPILWLSSLVSIPNGSRGVPSISLVSSSVCEAGSQLFNNY 278
Query: 309 GNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIP 345
G K NEELL YGFV+D+NPDD + + I S+P
Sbjct: 279 GAKPNEELLLGYGFVLDSNPDDIVNLRL---GIASLP 312
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 72/160 (45%), Gaps = 16/160 (10%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T DPL+GP R+ V+D +
Sbjct: 25 GRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LPT++ + + F +D+L GT+LY T+ +Q++ +D +
Sbjct: 85 KSRESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSI----EDDHR 140
Query: 155 DLVKKLLV-------LDG----DSERANSIFWTRALNIPL 183
LV +L V LD D + A W+RA++ L
Sbjct: 141 ALVVRLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVL 180
>gi|451992452|gb|EMD84936.1| hypothetical protein COCHEDRAFT_1149681 [Cochliobolus
heterostrophus C5]
Length = 478
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 68/145 (46%), Gaps = 26/145 (17%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQD 67
FL W + N R ++ D ++ G GI + + D L VP ++ ++ Q
Sbjct: 11 FLAWFKENGGVFRNDLLQIRDLRSKNAGRGIIAKQDIPEDTTLFTVPRNIILS----TQT 66
Query: 68 PLIGPECRAMFEDG-----------------EVDDRFLMILFLTVERLRKNSS-WKPYLD 109
+G + +FE +D +IL + E L+ ++S WKPYLD
Sbjct: 67 SDLGAKLPGIFEQHVDGNDDDDGDGQDHEPESLDSWGSLILVMLYEYLQGDASPWKPYLD 126
Query: 110 MLPTTFGNPLWFTDDELLELKGTTL 134
+LP F P+++T DEL EL+GT+L
Sbjct: 127 ILPQAFETPIFWTADELKELEGTSL 151
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 125/334 (37%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N G +I +E +GFG+ ++ E ++ + L VP L +T + + ++
Sbjct: 82 LIKWATENGASTEGFEIANFEE-EGFGLKATREIKAEELFLWVPRKLLMT-VESAKSSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER NS W PY+ LP + PL+F +DE+
Sbjct: 140 GSLYSQDRILQAMGNI----TLAFHLLCERANPNSFWLPYIQTLPNEYDTPLYFEEDEVQ 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 196 YLRSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------NASKLPLKDSF 239
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ +D+ + ++ N I E +G+ TL
Sbjct: 240 TY--------------------DDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFKA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 8/105 (7%)
Query: 35 KGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFL 93
+G G+ + + + V+L +P L I P V IGP C G + + LFL
Sbjct: 75 EGLGLVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPLC------GGLKPWVSVALFL 127
Query: 94 TVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRAT 138
E+ + SSW+ YLDMLP + + ++++++EL ELKGT L T
Sbjct: 128 IREKYEEESSWRLYLDMLPQSTDSTVFWSEEELAELKGTQLLSTT 172
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 113/283 (39%), Gaps = 48/283 (16%)
Query: 93 LTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
L +E++R S+W+P++D LPT + L+FT D++ L+GT+ A + LY
Sbjct: 203 LIIEKVRGAASAWQPFIDTLPTRYSTVLYFTVDQMQRLRGTSACSAAMRHCLVIARLY-- 260
Query: 152 KVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSND 211
+ K + GD N+ + +F + Y ++
Sbjct: 261 --ASMYKCAYIQPGD-------------NVMAAKANLFTEYGLCYELYRWAVSTV----- 300
Query: 212 HNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGET----LWIEGLVPGIDFCNHDL 267
+R L+ E +V Q+ G T ET I L+P D NH
Sbjct: 301 -TTRQNLVPRELSTVGEVDQV-CQLGGFEGTEIKRDAETGARNAPISALIPYWDMTNHR- 357
Query: 268 KAAATWEVDGTGLITGV--PFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVID 325
G IT + + + +F + ++ I YG++ N + L +GF+
Sbjct: 358 ----------CGKITSYYDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADRLVHHGFLDM 407
Query: 326 NNPDDYLMIHYPAEAIHSIPLSDSKALLLE----EQKAQLRCL 364
+N DY+ I PL + ++LLL E+KA+LR L
Sbjct: 408 HNLKDYVQIRLGLSPTD--PLVEQRSLLLAELNIERKAELRVL 448
>gi|336472467|gb|EGO60627.1| hypothetical protein NEUTE1DRAFT_75928 [Neurospora tetrasperma FGSC
2508]
gi|350294307|gb|EGZ75392.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 469
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 132/344 (38%), Gaps = 68/344 (19%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
+E W +N + K+ + E KGFG+ E V L+ VP L + V +
Sbjct: 8 IEALPAWALLNGIAFPHVKVA-NIEGKGFGVVRDGELKPEVPLMTVPNSLVLNVQTVDEY 66
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
+ + G V ++ + ++ W Y+ LP T P +T+DE L
Sbjct: 67 AKEDKNFKQLL--GAVGHHLVLASKTHQAPVGVSNPWTEYIKFLPKTVLVPTLWTEDERL 124
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+GT+L A K +T D V++ L + N I W
Sbjct: 125 LLRGTSLESAVN-AKMTAITAEFDAVREAASSLPIW-------NDILW------------ 164
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
Y+ N+SA L R L++ L ++ + +G
Sbjct: 165 ----------PYEDGNSSASLR-----RWILLDALY----RSRVLELPKSG--------- 196
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
E +VP ID NH +A+A ++ + + LL SS +E++IS
Sbjct: 197 ------ESMVPCIDMINHSTRASAYYDENAKDEVV-------LLPRPDSSISPGEEVTIS 243
Query: 308 YGN-KGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
YG+ K E+L+ YGF+ + L++ P E PL+ +K
Sbjct: 244 YGDAKPAAEMLFSYGFIDPEATVESLVL--PLEPFEDDPLAKAK 285
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 73/334 (21%), Positives = 127/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N G +I +E +GFG+ ++ E ++ + L VP L +T + ++ ++
Sbjct: 82 LIKWATENGASTEGFEIANFEE-EGFGLKATREIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER +S W PY+ LP+ + PL+F +DE+
Sbjct: 140 GSLYSQDRILQAMGNI----TLAFHLLCERANPHSFWLPYIQTLPSEYDTPLYFEEDEVQ 195
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
L+ T ++ + Y + + V+ A +PL S+
Sbjct: 196 HLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------NASKLPLKDSF 239
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ +D+ + ++ N I E +G+ TL
Sbjct: 240 TY--------------------DDYRWAVSSVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFKA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++I I YG + N E + GF DNN D + I
Sbjct: 306 GEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 339
>gi|320034953|gb|EFW16895.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 469
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 147/373 (39%), Gaps = 70/373 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRV-LQ 66
F+ WL+ + KI+ +D G G+ + E D L +P DL ++ ++
Sbjct: 16 FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVLSVANSKIK 75
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDE 125
D R F D D +I+ + E L+ S W PY +LPT F +++T++E
Sbjct: 76 D-------RINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENE 128
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L EL+G+++ DK+ ++LD + L + L H
Sbjct: 129 LRELQGSSVL---------------DKIGKQETDQVILD------------KVLPVVLEH 161
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATSTLT 244
+FP L +DS + + + G LI DI+ ++ +
Sbjct: 162 PDLFPPVN-GLASFDSPSGKEVVLQLAHRMGTLIMAYAFDIE-----MDQDEDQDGEDGY 215
Query: 245 STQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGV-PFSMYLLSVERSSFHSE 301
T E +G+VP D N D A ++ DG ++ + P S+ E
Sbjct: 216 VTDDEQEKAKGMVPLADLLNADAHRNNARLFQEDGYFIMRSIAPISI------------E 263
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQL 361
EI YG +LL YG++ +N Y ++ EAI +I +EE QL
Sbjct: 264 MEIFNDYGELPRSDLLRRYGYITENYA-PYDVVEISLEAICNIA-------GVEEGCCQL 315
Query: 362 RCLLPKSLLEHGF 374
L +LE G+
Sbjct: 316 ELLEDAGVLEDGY 328
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 75/161 (46%), Gaps = 10/161 (6%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKI--KYSDESKGFGIFSSNEFSDGVLLVVPLDLA 58
M S E K E F WL N + Y E G G+ ++ + +G LL
Sbjct: 1 MSNSFEEKGEAFCNWLTSNGATISSSITLKDYRSEGAGRGVTANKDIKEGDLLFS----- 55
Query: 59 ITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNP 118
P +L L + E E+ +IL + E + +S WKPY D+LP F P
Sbjct: 56 -LPRSILLSQLTSSLKDQVSELSELSGWSPLILCMMYEIEKPDSFWKPYFDVLPREFTTP 114
Query: 119 LWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
+++ ++L EL+GT + +++ K+ L+ ++++ ++KK
Sbjct: 115 MFWNQEDLKELEGTDI--ISKIGKKESEELFHNELEPIIKK 153
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 110/298 (36%), Gaps = 68/298 (22%)
Query: 36 GFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F D +L +P + T DPL GP +++ V+D + +
Sbjct: 25 GRGVKAQRPFKEDERILTIPANCLWTVKGAYADPLFGPVLQSVQPPLSVEDTLALYILFV 84
Query: 95 VERLR--KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDK 152
R + + ++ MLP+ + ++FTD+EL G++LY L T +
Sbjct: 85 RSRGEDPAYAERQTHVAMLPSEYTLSMYFTDEELRVCAGSSLY--------TLTTHLRGR 136
Query: 153 VKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDH 212
V D KKLL + + H +FP ++ Y +S
Sbjct: 137 VGDDYKKLLT-----------------GVFMRHRDLFPLDKFSFQHYKWALSSIW----- 174
Query: 213 NSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAAT 272
SRG T ++G ++ + + P D NH A
Sbjct: 175 -SRG------------------------MDFTISEGNSVRL--MAPFADMLNHASDAKQC 207
Query: 273 WEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
D P + L + + ++ I YGN N LL LYGFV+ +NP+D
Sbjct: 208 HAYD--------PSTGSLTVLACRDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPND 257
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 123/328 (37%), Gaps = 93/328 (28%)
Query: 38 GIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVE 96
G+F++ G +V +P +L +T ++ +C E D L +L L +
Sbjct: 11 GLFAARSIRAGEQIVRIPHELVLTAEKL-------DDCVKKLLSTEYDWCPLTLLILAEQ 63
Query: 97 RLRKNSSWKPYLDMLPTTFGN---PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
+ S W PY+ LP+ FG+ +++ +EL L+ T +R T +++ + Y+ V
Sbjct: 64 HKGEASRWAPYVSCLPS-FGDHHSTIFWGKEELKFLECTRAFRGTAERREMISDEYN-SV 121
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN 213
KD++ PH +
Sbjct: 122 KDVISSC-----------------------PHVF-------------------------- 132
Query: 214 SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATW 273
GE I+ A V+ NGA S+ S + P +DFCNHD + AT
Sbjct: 133 --GEDISLFQFAHAYATVVSRAWNGALSSEISMR----------PFVDFCNHDPVSHATV 180
Query: 274 EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLM 333
D T + ER E E+ ISYG + N L YGFV+ NN D
Sbjct: 181 SHDTCKDATII--------AERDYTKGE-EVFISYGKRSNAVLAVDYGFVLPNNLSDQ-- 229
Query: 334 IHYPAEAIHSIPLSDSKALLLEEQKAQL 361
AE IP +D L E+K +L
Sbjct: 230 ----AELWMEIPWNDP----LREKKLEL 249
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 128/334 (38%), Gaps = 87/334 (26%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
L+ T ++ + Y + + V+ A +PL S
Sbjct: 195 RCLQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTHP---------HANKLPLKES 238
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTST 246
+ + ED S ++ N I E +G+ TL
Sbjct: 239 FTY----EDYRWAVS---------------SVMTRQNQIPTE--------DGSRVTL--- 268
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHS 300
L+P D CNH GLIT + Y L +R F +
Sbjct: 269 --------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQA 305
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 306 GDQIYIFYGTRSNAESVIHSGFFFDNNSHDRVKI 339
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 126/332 (37%), Gaps = 85/332 (25%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
++W N + G ++ E +GFG+ ++ + ++ + L VP L +T + ++ ++G
Sbjct: 22 MKWASENGASVEGFEMVNFKE-EGFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSILG 79
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
P + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 80 PLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRC 135
Query: 129 LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYV 188
L+ T ++ + Y + + V+ PH+
Sbjct: 136 LQSTQAI-------HDVFSQYKNTARQYAYFYKVIQTH-----------------PHANK 171
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
P + +S + + ++ N I E +G+ TL
Sbjct: 172 LP-----------LKDSFTYEDYRWAVSSVMTRQNQIPTE--------DGSRVTL----- 207
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSEK 302
L+P D CNH GLIT + Y L +R F +
Sbjct: 208 ------ALIPLWDMCNHT-----------NGLIT----TGYNLEDDRCECVALQDFQAGD 246
Query: 303 EISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+I I YG + N E + GF DNN D + I
Sbjct: 247 QIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKI 278
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 75/157 (47%), Gaps = 8/157 (5%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFG---IFSSNEFSDGVLLVVPLDLAIT 60
+T L+ F QWL NK I D FG + ++N D +L+ +P + +T
Sbjct: 204 TTPEDLKSFQQWLS-NKNTYLNPSIDIVDLGPPFGRSMVANTNIKKDEILVEIPKGIMMT 262
Query: 61 PMRVLQD-PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P ++++ P + + D + + I++ L ++S W Y+ +LP F +
Sbjct: 263 PKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIY---SILHEDSYWYEYVSILPKQFTTTV 319
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDL 156
+FT +E+ +L+ + ++R TE++ + YD + L
Sbjct: 320 YFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRL 356
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 78/174 (44%), Gaps = 8/174 (4%)
Query: 12 FLQWLQ-VNKVELRGCKIKYSD-ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQDP 68
LQWL + + +D E G G+ ++ GV +L +P L I L +
Sbjct: 390 LLQWLHNAGMTSIAENHLSIADFEHTGRGVLANERIEAGVEVLHLPQHLLINIHVALDES 449
Query: 69 LIGPECRAMFE-DGEVDDRFLMILFLTVERLRKNSS--WKPYLDMLPTTFGNPLWFTDDE 125
P R + + E DD L++L++ E+L S+ W P+ + LP T+ +PL F E
Sbjct: 450 --HPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWAPFFETLPATYNSPLLFHVTE 507
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
LLEL+GT L T K L L++ + D+ ++ W RA+
Sbjct: 508 LLELEGTRLIDETFEIKDGLRVLHESLGPLAEAYPALFPTDAFTYENLLWVRAM 561
>gi|312377430|gb|EFR24262.1| hypothetical protein AND_11269 [Anopheles darlingi]
Length = 222
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 38/57 (66%), Gaps = 4/57 (7%)
Query: 77 MFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
M E G + ++ L L +ER RK+S W+PYLD+LP + PL++T ++++EL+ T
Sbjct: 130 MKEQGNI----MLALALIMERFRKDSFWRPYLDVLPERYTTPLFYTPEDMIELRETA 182
>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
Length = 537
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 65/139 (46%), Gaps = 9/139 (6%)
Query: 9 LEPFLQWLQVNKVELR-GCKIKYSDESKG--FGIFSSNEFSDGVLLVVPLDLAITPMRVL 65
++ FLQ N L G + + SD +G I S + D VL+ P IT +
Sbjct: 1 MDSFLQEAFNNGCYLHPGIQFQRSDNVEGTFIAIASKDIDGDQVLISCPESYIITLQKAK 60
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDD 124
+ CR + + ++ F +ERL+ + S W Y++ LP TF PL+FTDD
Sbjct: 61 NEL-----CRLSPKFADEKMHTIVCTFFALERLKGEKSQWAKYIEYLPKTFDTPLYFTDD 115
Query: 125 ELLELKGTTLYRATELQKQ 143
EL L+ T ++ +K+
Sbjct: 116 ELKSLEHTNIFYGCNDRKR 134
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 7/72 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH + W +T ++ L+SV+ S S ++ +YG KGNEE
Sbjct: 187 LIPLVDSLNHKPRCPIIW-----NKVTKESHAVQLVSVKPIS--SGGQVYNNYGPKGNEE 239
Query: 316 LLYLYGFVIDNN 327
LL YGF + NN
Sbjct: 240 LLMGYGFCLPNN 251
>gi|389622275|ref|XP_003708791.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
gi|351648320|gb|EHA56179.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
gi|440464619|gb|ELQ34017.1| hypothetical protein OOU_Y34scaffold00823g1 [Magnaporthe oryzae
Y34]
Length = 419
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 72/161 (44%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P T + D L+GP R++ V+D +
Sbjct: 25 GRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLLGPALRSVRPPLSVEDILATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP+++ + ++F ++EL GT+LY T+ +Q + +D +
Sbjct: 85 RSRESGYDGLRSHVAALPSSYSSSIFFAEEELEVCAGTSLYTVTKQLEQRI----EDDYR 140
Query: 155 DLVKKLLVLDG-----------DSERANSIFWTRALNIPLP 184
LV +LLV D + A W+RA++ LP
Sbjct: 141 ALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLP 181
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 8/131 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
++ F QWL V +K +G G+ + + + G +L VP I P V +
Sbjct: 57 VQTFWQWLCKEGVVTTKTPVKPGIVPEGLGLVAKRDIAKGETVLQVPKRFWINPDAVAES 116
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
IG C + + LFL E+ R +S WK Y+D+LP + + ++++++EL
Sbjct: 117 E-IGNVCSGL------KPWISVALFLLREKWRDDSKWKYYMDVLPKSTDSTIYWSEEELS 169
Query: 128 ELKGTTLYRAT 138
E++GT L T
Sbjct: 170 EIQGTQLLSTT 180
>gi|443895438|dbj|GAC72784.1| N-methyltransferase [Pseudozyma antarctica T-34]
Length = 506
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 133/354 (37%), Gaps = 59/354 (16%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLI 70
L+WL + VEL D S G +++ + D ++ VP L IT + +
Sbjct: 7 LLRWLAASGVELHNSLYLEQDPSTGLSFYTAEPLARDTTVIRVPKCLCITSVTARE---- 62
Query: 71 GPECRAMFEDGEVDDRFL----------MILFLTVERLRKNS---------------SWK 105
A+ ++ D L ++L++ RL
Sbjct: 63 --RIEALLAAFDISDSGLEASSLPAPDWILLYIATCRLATEYLSTASEPSQVLAAALPHL 120
Query: 106 PYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDG 165
PY+D +P PL F+ EL L T L +TE + + + Y+ + + L L+G
Sbjct: 121 PYVDHIPKIIETPLHFSPAELTLLSATPLVGSTERRLRETIVDYE-RARSL------LEG 173
Query: 166 DSERA-----NSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELIN 220
++A ++ T+ +IP + +K + AE + S +
Sbjct: 174 PLQQALHQPFSAFLATKLGSIPKEQLDADALMAKTYHKGLELWRWAESAFTSRSFPARLV 233
Query: 221 GLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGL 280
GL+D +EA +GA+ S L+P D NH TW
Sbjct: 234 GLSDGLSEAS------SGASPLAKSA-------PILIPAYDTFNHARAHPVTWTFSSASQ 280
Query: 281 ITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ M +++ + + ++ +YG K NEELL YGF +D +D L +
Sbjct: 281 VDSR--DMVCMTINYAVDSAPAQVYNNYGGKSNEELLAGYGFTLDTITEDTLAL 332
>gi|383863099|ref|XP_003707020.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 222
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 14/126 (11%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDG-VLLVVPLDLAITPMRVLQDP 68
F+ WL+ N ++ G I E G+ G+ + ++F + ++L +P L +
Sbjct: 61 FMNWLKENSADIEGASIA---EFPGYDLGLKAEHDFQENELILKIPRHLIFNTLNA---- 113
Query: 69 LIGPECRAMFEDGEVDD--RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
PE A+ D + D + + + L +ER ++NS WKPYLD+LPTT+ ++ T ++
Sbjct: 114 --APELTALKNDPLIHDMPQVALAIALLIERYKENSKWKPYLDVLPTTYTTVMYMTAADM 171
Query: 127 LELKGT 132
+ELKG+
Sbjct: 172 IELKGS 177
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 102/236 (43%), Gaps = 34/236 (14%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI 59
+ ++ E F QWL N K E+ G G+ ++ + ++LV+P L I
Sbjct: 22 VSLAHEESFVRFFQWLHRNGCRNVPLKPAVFPET-GRGLMATKALKHEELILVIPKRLLI 80
Query: 60 TPMRVLQDPLIGPEC-RAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGN 117
T + + D + P RA D ++ + +FL E+ R+ S W+PY+D+LP +
Sbjct: 81 T-IDAIMDSYLAPYIERA---DSQLTPSQALAVFLMCEKCRREKSFWRPYIDILPEEYTC 136
Query: 118 PLWFTDDELL----ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERAN-- 171
P +FT+++ L+G + E K+ + ++ K L L D E A
Sbjct: 137 PAFFTEEDFRLLPNSLRGKAKAKKYECHKEFM------ELAPFFKMLADLFPDQEDAFNF 190
Query: 172 -------SIFWTRALNIPLPHSYVFP-QNQEDLNK------YDSINNSAELSNDHN 213
S TRA ++PL + ++ ED + DSIN++A+ H
Sbjct: 191 KDFKWAWSAIKTRAFDVPLGGETCYRLRDSEDTSNPTMFPLVDSINHAAQAKIRHR 246
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 65/131 (49%), Gaps = 13/131 (9%)
Query: 12 FLQWLQVNKVELRGCK---IKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
FL W Q ++ + I + G G + + +G VL +P DL ++ R
Sbjct: 34 FLDWFQAHQGTIDTSSVDVINFLPSEGGRGAVAVKDIPEGHVLFTIPRDLTLS-TRTSSL 92
Query: 68 PL---IGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTD 123
PL +G A +G +IL + E + +SS W Y D+LPT+F P+++T+
Sbjct: 93 PLRFGMGAWKNAKLHEGWAG----LILCMMWEAAQGSSSKWSGYFDILPTSFDTPMFWTE 148
Query: 124 DELLELKGTTL 134
+EL EL+GT++
Sbjct: 149 EELAELRGTSV 159
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 90/407 (22%), Positives = 161/407 (39%), Gaps = 82/407 (20%)
Query: 36 GFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G GI +S + + D L V+P DL ++ ++G +D ++ +I+ +
Sbjct: 48 GRGICASRDIAEDEELFVIPDDLILSVQNSEARSVLG------LDDKQLGPWLSLIITMI 101
Query: 95 VERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
E + + S W PY +LP++F +++TD++L EL+G+ + ++ K DD +
Sbjct: 102 YEYYQGEQSKWYPYFRILPSSFDTLMFWTDEQLSELQGSAV--VGKIGK----AAADDTI 155
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN 213
V L+ +AN PH + N LN DS N A L H
Sbjct: 156 LQKVVPLI-------QAN------------PHHFPPRPNMPPLNSPDSQN--ALLCLAHR 194
Query: 214 SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKA--AA 271
++ DI ++ + T E +G+VP D N D + A
Sbjct: 195 MGSIIMAYAFDI----EKADEADEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNAR 250
Query: 272 TWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY 331
++ +G+ ++ + + HS +EI YG +LL YG+V DN Y
Sbjct: 251 LFQEEGSFVMKAI-----------KNIHSGEEIFNDYGELPRADLLRRYGYVTDNYA-QY 298
Query: 332 LMIHYPAEAIHSIP-LSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLE 390
++ + + I + L DSK +L L +LE G+ P +G
Sbjct: 299 DVVEFSLDGICKVAGLPDSKP---SSTNPRLELLDNLDMLEEGYSIPRVPPNGT------ 349
Query: 391 VDRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQEDEISKVSS 437
K PE+FL LR + + ++++++ +
Sbjct: 350 -------------------LKDAIPEDFLVLLRALTLPVEDLNRLKA 377
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 62/131 (47%), Gaps = 8/131 (6%)
Query: 34 SKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILF 92
S G G+ + S ++ VP L IT VL+ + M + + +L++LF
Sbjct: 74 STGRGMLAIERIHSSECVISVPERLLITASSVLESAIGNYVAERMKGGAKSSNDYLLVLF 133
Query: 93 LTVER-LRKNSSWKPYLDMLPTTFGNPLWFTDDELL----ELKGTTLYRATELQK--QNL 145
L E+ L K S W PY+ LP TF P +FT EL + + + T++++ ++
Sbjct: 134 LMYEKYLEKGSFWAPYIRTLPDTFNTPCYFTRKELFLLPEQCREQAFEQVTQIKQSYKSF 193
Query: 146 LTLYDDKVKDL 156
Y+D ++D
Sbjct: 194 AKAYNDVLQDF 204
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 142/357 (39%), Gaps = 67/357 (18%)
Query: 7 AKLEPFLQWLQ-VNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAI-T 60
A + FL W Q + R I D + G GI + + + D VL +P I T
Sbjct: 13 ATTQSFLTWFQSLPGATFRSDLIAIEDLRSRNAGRGIVARTDIAADTVLFTIPRSSIICT 72
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDR------------FLMILFLTVERLRKNSS-WKPY 107
L++ + G EDG D L+IL L E L+ ++S WKPY
Sbjct: 73 ATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQDSWTLLILILIYEYLQGDASQWKPY 132
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDS 167
LD+LP+ F P++++ EL EL+ + LV K+
Sbjct: 133 LDVLPSAFDTPMFWSPTELAELQASA----------------------LVTKV-----GR 165
Query: 168 ERANSIFWTRALNIPLPHSYV-FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIK 226
E A+ + ++ L + H +V FP ++ L+ + EL+ H ++ D++
Sbjct: 166 EEADRMIRSKILPVIRGHDHVFFPHGRQRLDD----DQLFELA--HRMGSAIMAYAFDLE 219
Query: 227 NEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
+ N + + + +G T+ G+VP D N D + A + G +T
Sbjct: 220 KDDD-ANEEASEQDEWVDDREGRTML--GMVPMADMLNADAEFNA-YINHGADSLTATAL 275
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHS 343
+ + +EI YG N ELL YG+V + Y ++ P + I +
Sbjct: 276 ---------RTIKAGEEILNYYGPLPNGELLRRYGYVTPKHA-RYDVVELPWDLIEA 322
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 71/160 (44%), Gaps = 16/160 (10%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T DPL+GP R+ V+D +
Sbjct: 25 GRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP ++ + + F +D+L GT+LY T+ +Q++ +D +
Sbjct: 85 KSRESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSI----EDDHR 140
Query: 155 DLVKKLLV-------LDG----DSERANSIFWTRALNIPL 183
LV +L V LD D + A W+RA++ L
Sbjct: 141 ALVVRLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVL 180
>gi|321254166|ref|XP_003192985.1| cytoplasm protein [Cryptococcus gattii WM276]
gi|317459454|gb|ADV21198.1| cytoplasm protein, putative [Cryptococcus gattii WM276]
Length = 455
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 131/326 (40%), Gaps = 79/326 (24%)
Query: 32 DESKGFGIFSSNEFS-DGVLLVVPLDLAITP----------MRVLQDPLIGPECRAMFED 80
DES G +F+ + + L+ P LA+T + + LI P + + ++
Sbjct: 31 DESTGLSVFADDTIHPEEKLVSCPFSLAVTSELATKAVCEITGIEEGQLIWP--KDLGKE 88
Query: 81 GEV-DDRFLMILFLT---VERLRKNSSW------KPYLDMLPTT--FGNPLWFTDDELLE 128
GE ++R + +L V + R ++ W K YL LP+ PL+FT+ EL
Sbjct: 89 GEPWNERMRIGAYLGLHWVYQERPDAEWPRSLVHKTYLASLPSASNLPTPLYFTESELQL 148
Query: 129 LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP-LPHSY 187
L G+ L A E ++ + ++ +L DG + W R L S
Sbjct: 149 LSGSNLLGAVEDRRVEW-----SAESEALRIILKEDG-------LTWERYLATSTYMSSR 196
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
FP DL S GE+I +++++G + +
Sbjct: 197 AFPSKLLDLQ----------------SDGEMIPQ-----------STRIDGVSKPV---- 225
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSF-HSEKEISI 306
L+PG+D NH W + VP + +S+ SS + ++
Sbjct: 226 --------LLPGVDIFNHARGQPILW-LSSMVSTPNVPLGVPSISLVSSSICEAGSQLFN 276
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYL 332
+YG K NEELL YGFV++ NPDD +
Sbjct: 277 NYGAKPNEELLLGYGFVLNPNPDDIV 302
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 131/333 (39%), Gaps = 82/333 (24%)
Query: 13 LQWLQVNKVELRG--CKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLI 70
L+W+ N EL C ++++ +G + L VP L IT L L
Sbjct: 30 LEWMIANGFELHVQLCVREFTETGRGLATLQKVTAGE-TFLRVPTCLLITTTTALSSSLH 88
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRK-NSSWKPYLDMLPTTFGNPLWFTDDELLEL 129
G R + ++ ++ LFL E+LR +S W+ ++D LP ++ P+ F +LL
Sbjct: 89 GFLVRHHRQLTAIE---VLTLFLINEKLRGLDSEWRFFIDSLPVSYTTPV-FLGSKLLAR 144
Query: 130 KGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF--------WTRALNI 181
T+ R E Q + + +++ L+K+ L+ D + F WT A+N
Sbjct: 145 LPETMCRKAEAQVSRIRRTF-VRLQILLKRALLDDSALLNLSENFTWHLFVWAWT-AVNT 202
Query: 182 PLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATS 241
+F +++ D + +D +D+ + ++ LN
Sbjct: 203 RC----IFSKHRTDHSFWD---------DDYCALAPFLDCLN------------------ 231
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSE 301
H KA V+G+ Y V +++
Sbjct: 232 -----------------------HHWKADVETTVEGS----------YFEIVTNNNYEPN 258
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
++ ISYG+ N++LL YGFV+ +NP+D + I
Sbjct: 259 DQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI 291
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 70/143 (48%), Gaps = 14/143 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEF-SDGVLLVVPLDLAITPMRVLQDP 68
F+ WL+ N + G + E G+ G+ + F + ++L +P +L +
Sbjct: 85 FINWLKQNGANVYGASVA---EFPGYDLGLKAERNFLENELILRIPRELIFSIHNA---- 137
Query: 69 LIGPECRAMFEDG--EVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
PE A+ D ++ + + + L +E+ ++ S WKPYLD+LPTT+ L+ T ++
Sbjct: 138 --APELVALQNDPLLQLMPQVALAIALLIEKHKEYSKWKPYLDILPTTYTTVLYMTAADM 195
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
ELKG+ A Q +N+ Y
Sbjct: 196 NELKGSPTLEAALKQCRNIARQY 218
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 84/324 (25%), Positives = 129/324 (39%), Gaps = 60/324 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFS-SNEFSDGVLLVVPLDLAITPMRV-LQ 66
F+ WL+ + KIK +D E G GI + SN D L +P +L ++ L+
Sbjct: 20 FMCWLKQSAGVRVSPKIKIADLRSEGAGRGIVALSNINEDEELFAIPQNLVLSFQNSKLK 79
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDE 125
D L + ++ +IL + E L+ +S W Y +LPT F +++TD+E
Sbjct: 80 DLL-------HISEKDLGPWLCLILVMIYEYLQGGASPWSRYFQVLPTEFDTLMFWTDEE 132
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L EL G+ ++ K+ D ++ IF + N PH
Sbjct: 133 LRELSGSA----------------------VLNKIGKSDAEAAILRDIFPIVSTN---PH 167
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATSTLT 244
+FP L YDS + A L + + G LI DI+ +G
Sbjct: 168 --LFPP-ISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDIEKGEDEEGEVQDG----YI 220
Query: 245 STQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFHSEK 302
+ +GE L +G+VP D N D A ++ DG YL + +
Sbjct: 221 TDEGEEL-TKGMVPLADLLNADADRNNARLFQEDG-----------YLAMKSIKPIRNGE 268
Query: 303 EISISYGNKGNEELLYLYGFVIDN 326
EI YG +LL YG+V DN
Sbjct: 269 EIFNDYGELPRADLLRRYGYVTDN 292
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 22/136 (16%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
+ L+P ID NH A W + LL +E S + +EIS +YG +
Sbjct: 223 VSALLPLIDLPNHRPMAKVEWRAGDK--------DIGLLVLEDHS--AGQEISNNYGPRN 272
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ------------ 360
NE+LL YGF I NP DY ++H + + + ++ L L Q A+
Sbjct: 273 NEQLLINYGFCIAGNPTDYRIVHLGVKPDSPLGAAKARQLELFPQVAKNIEDHYYIFNPF 332
Query: 361 LRCLLPKSLLEHGFFA 376
L P++ +EH F+
Sbjct: 333 YPLLAPETTMEHSVFS 348
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 70/143 (48%), Gaps = 14/143 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEF-SDGVLLVVPLDLAITPMRVLQDP 68
F+ WL+ N + G + E G+ G+ + F + ++L +P +L +
Sbjct: 85 FINWLKQNGANVYGASVA---EFPGYDLGLKAERNFLENELILRIPRELIFSIHNA---- 137
Query: 69 LIGPECRAMFEDG--EVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
PE A+ D ++ + + + L +E+ ++ S WKPYLD+LPTT+ L+ T ++
Sbjct: 138 --APELVALQNDPLLQLMPQVALAIALLIEKHKEYSKWKPYLDILPTTYTTVLYMTAADM 195
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
ELKG+ A Q +N+ Y
Sbjct: 196 NELKGSPTLEAALKQCRNIARQY 218
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 11/115 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L PG+D NH TW+ T +G F++ V+ + +++ +YG K NEE
Sbjct: 197 LFPGVDAFNHQQARKVTWQ---TSSASGR-FNL----VQDEPTAAGQQVFNNYGPKSNEE 248
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAI---HSIPLSDSKALLLEEQKAQLRCLLPK 367
L YGF+I NNPDD++++ + H+ P +++L Q +LR L+ +
Sbjct: 249 FLLGYGFIIPNNPDDHMVLKLAPPRLAPGHAEPKRSLESVLQACQLTELRHLVRR 303
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 82/154 (53%), Gaps = 12/154 (7%)
Query: 4 STEAKLE-PFLQWLQVNKVELRGCK--IKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
++EA+ E F+QWL N + C Y DE +G E ++ +L+ VPL IT
Sbjct: 25 ASEAQKEIAFVQWLTENGGKFADCVELRSYDDEVRGVHATRDLE-TEEILVEVPLKCLIT 83
Query: 61 PMRVLQDPLIGPECRAMFE-DGEVD--DRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN 117
+ + + +G RA+ E + E+D ++LF+ ++R ++ + PY D+LP+T N
Sbjct: 84 -VEMGKATDVG---RAVLEAELELDAPKHVFLMLFVLLDRRDSSTFFAPYYDILPSTLSN 139
Query: 118 -PLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
P+++ DEL LKG+ L E +K+ + Y+
Sbjct: 140 MPIFWQPDELEWLKGSYLLTQIEERKRAIKADYE 173
>gi|385303944|gb|EIF47986.1| ribosomal n-lysine [Dekkera bruxellensis AWRI1499]
Length = 462
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
LVP +DF NH TW DG GV +E ++ + E+ +YG KGNEE
Sbjct: 205 LVPLLDFMNHKPLTKVTWSFDGXSF--GVS-----SQIELANGSGKYEVYNNYGPKGNEE 257
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGFVI N D L +
Sbjct: 258 LLMAYGFVIPGNEFDILQL 276
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 78 FEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTT--FGNPLWFTDDELLELKGTTLY 135
FE + + R L+ +L S PYLD+LP +PL + +DELL LKGT L+
Sbjct: 84 FEWSQFESRHLLTAYLAFXLGDPESINDPYLDLLPKAEEIRSPLIYNEDELLLLKGTNLF 143
Query: 136 RATEL 140
+ T++
Sbjct: 144 KGTQV 148
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/307 (21%), Positives = 118/307 (38%), Gaps = 77/307 (25%)
Query: 35 KGFGIFSSNEFSDG-VLLVVPLDL---AITPMRVLQDPLIGPE--CRAMFEDGEVDDRFL 88
+G G+ ++ + + V + VP L +T + PLI E R+M +
Sbjct: 20 QGLGLRATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSM-------PHVV 72
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ L + ERL ++S+W PYL++LP ++ L+F+ D+++ L+G+ E KQ
Sbjct: 73 LALHVLCERLHEDSTWAPYLNILPRSYSTCLYFSPDDMMALQGSP--SMGEALKQ----- 125
Query: 149 YDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAEL 208
+ VK V ++ + E A +PL +S+ F
Sbjct: 126 FRGIVKQYVYFFRLVQINPE---------ASRLPLKNSFTF------------------- 157
Query: 209 SNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLK 268
+D + V ++ N + T ++ L+P D CNH
Sbjct: 158 --------------DDFRWAVSTVMTRQNDVKVSSNET------VKALIPMWDMCNHCNG 197
Query: 269 AAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNP 328
T D T + + F + ++ I YG + N + L+ GFV
Sbjct: 198 PFTTGFDDSTKEVKSLAF---------KPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAE 248
Query: 329 DDYLMIH 335
+D++ I
Sbjct: 249 EDWVNIQ 255
>gi|307190528|gb|EFN74525.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 210
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 60/125 (48%), Gaps = 12/125 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVLLVVPLDLAITPMRVLQDPL 69
F+ WL N + G I E G+ G+ + +F++ L+ L I +
Sbjct: 84 FMNWLTENGAHVDGVSIA---EFPGYDLGLKAETDFAENQLM-----LEIPRAIIFSTYT 135
Query: 70 IGPECRAMFEDGEVDD--RFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
PE + D V + + + L +ER ++NS W+ YLDMLP+ + L+ ++++
Sbjct: 136 AAPELTVLQNDPLVQHMPQVALAIALLIERHKENSKWECYLDMLPSGYNTVLYMKANDMI 195
Query: 128 ELKGT 132
ELKG+
Sbjct: 196 ELKGS 200
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 71/161 (44%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P T + D L+GP R++ V+D +
Sbjct: 25 GRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLLGPALRSIRPPLSVEDILATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP+++ + ++F +EL GT+LY T+ +Q + +D +
Sbjct: 85 RSRESGYDGLRSHVAALPSSYSSSIFFAGEELEVCAGTSLYTITKQLEQRI----EDDYR 140
Query: 155 DLVKKLLVLDG-----------DSERANSIFWTRALNIPLP 184
LV +LLV D + A W+RA++ LP
Sbjct: 141 ALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLP 181
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 124/338 (36%), Gaps = 85/338 (25%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI 59
+E + ++ F +WL V +K +G G+ + + + + +L VP I
Sbjct: 43 LETNPPPPVQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWI 102
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P V IG C G + + LFL E+LR S W+ YLD+LP + +
Sbjct: 103 NPDAVAASE-IGSVC------GGLKPWVSVALFLIREKLRDESPWRSYLDILPEYTNSTI 155
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+++++EL+E++GT L T G E S F
Sbjct: 156 YWSEEELVEIQGTQLSNTTL-------------------------GVKEYVQSEFLKVEE 190
Query: 180 NIPLPHS--YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
+ LPHS + FP +D I S S + G N +
Sbjct: 191 EVILPHSQLFPFPVTLDDFLWAFGILRSRAFSR--------LRGQNLV------------ 230
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA---TWEVDGTGLITGVPFSMYLLSVE 294
L+P D NH WE+ G GL FS L
Sbjct: 231 ------------------LIPLADLINHSPSITTEEYAWEIKGAGL-----FSRDQLFSL 267
Query: 295 RS--SFHSEKEISISYG-NKGNEELLYLYGFVIDNNPD 329
R+ S + +++ I Y +K N EL YGF I++ P+
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGF-IESRPN 304
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 124/338 (36%), Gaps = 85/338 (25%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI 59
+E + ++ F +WL V +K +G G+ + + + + +L VP I
Sbjct: 43 LETNPPPPVQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWI 102
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL 119
P V IG C G + + LFL E+LR S W+ YLD+LP + +
Sbjct: 103 NPDAVAASE-IGSVC------GGLKPWVSVALFLIREKLRDESPWRSYLDILPEYTNSTI 155
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+++++EL+E++GT L T G E S F
Sbjct: 156 YWSEEELVEIQGTQLSNTTL-------------------------GVKEYVQSEFLKVEE 190
Query: 180 NIPLPHS--YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVN 237
+ LPHS + FP +D I S S + G N +
Sbjct: 191 EVILPHSQLFPFPVTLDDFLWAFGILRSRAFSR--------LRGQNLV------------ 230
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA---TWEVDGTGLITGVPFSMYLLSVE 294
L+P D NH WE+ G GL FS L
Sbjct: 231 ------------------LIPLADLINHSPSITTEEYAWEIKGAGL-----FSRDQLFSL 267
Query: 295 RS--SFHSEKEISISYG-NKGNEELLYLYGFVIDNNPD 329
R+ S + +++ I Y +K N EL YGF I++ P+
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGF-IESRPN 304
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 129/335 (38%), Gaps = 82/335 (24%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRV 64
E + W++ N IK+ +E G+G+ ++ + + L+ +P ++ +T V
Sbjct: 82 EDHFSNLISWIKENGGVADNVTIKHFNEM-GYGLEAAKDLEESELICAIPKNVMMTLDNV 140
Query: 65 LQDPLIGPECRAMFEDGEVDDRF---LMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLW 120
PL + ++E+ + + LFL +E ++ +NS W Y+ LP+ + L+
Sbjct: 141 KVSPL-----KYLYENNPILKNMGNVALALFLILEHVKNENSFWHHYISSLPSDYNTVLY 195
Query: 121 FTDDELLELKGT-TLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
F ++ LE+K + T AT+ K
Sbjct: 196 FDLNDFLEMKNSPTFEMATKHCK------------------------------------- 218
Query: 180 NIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS---RGELINGLNDIKNEAQRVNSQV 236
NI ++Y NN + SND S R L + V ++
Sbjct: 219 NIARQYAY--------------FNNLFQNSNDEASLILRNVFTYQL--YRWAVSTVMTRQ 262
Query: 237 NGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAAT-WEVDGTGLITGVPFSMYLLSVER 295
N S+ TS E I GL+P D CNH +T ++VD + + PF
Sbjct: 263 NFIPSSSTSNDVEN-GINGLIPLWDMCNHTNGYLSTQYKVDRSECLACKPFK-------- 313
Query: 296 SSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
+++ I YG + N + L GFV D NP D
Sbjct: 314 ----KGEQVLIFYGERSNSDFLVHNGFVYDENPHD 344
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
FL ER K S WKPYLD+LP+ + P++F++D+L LKG+ +K ++ YD
Sbjct: 233 FLLQERKNKESKWKPYLDILPSDYNQFPIFFSEDDLSWLKGSPFQNQVREKKADIKRDYD 292
Query: 151 D 151
D
Sbjct: 293 D 293
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 152/359 (42%), Gaps = 69/359 (19%)
Query: 36 GFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFL-----M 89
G GI +S + + D L ++P DL ++ + E R+ E +DD+ L +
Sbjct: 48 GRGICASRDIAEDEELFIIPDDLVLS--------VQNSEARSALE---LDDKQLGPWLSL 96
Query: 90 ILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
I+ + E + + S W PY +LP++F +++TD++LLEL+G+ + ++ K
Sbjct: 97 IITMIYEYYQGEQSKWYPYFRILPSSFDTLMFWTDEQLLELQGSAV--VGKIGK----AA 150
Query: 149 YDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAEL 208
DD + V L+ +AN + N+P LN DS N A L
Sbjct: 151 ADDTILQKVVPLI-------QANPRHFPPRPNMP------------PLNSSDSQN--ALL 189
Query: 209 SNDHNSRGELINGLNDI-KNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDL 267
H ++ DI K + ++ +G T E +G+VP D N D
Sbjct: 190 CLAHRMGSIIMAYAFDIEKTDEVDEDTAEDGYM-----TDDEDEPAKGMVPLADIFNADA 244
Query: 268 KA--AATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVID 325
+ A ++ +G+ ++ + + HS +EI YG +LL YG+V D
Sbjct: 245 QRNNARLFQEEGSFVMKAI-----------KNIHSGEEIFNDYGELPRADLLRRYGYVTD 293
Query: 326 NNPDDYLMIHYPAEAIHSIP-LSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDG 383
N Y ++ + ++I + L DS+ +L L +LE G+ P +G
Sbjct: 294 NYA-QYDVVEFSLDSICKVAGLPDSEP---SSTNPRLELLDNLDMLEEGYSIPRIPPNG 348
>gi|448097927|ref|XP_004198797.1| Piso0_002187 [Millerozyma farinosa CBS 7064]
gi|359380219|emb|CCE82460.1| Piso0_002187 [Millerozyma farinosa CBS 7064]
Length = 603
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 120/276 (43%), Gaps = 36/276 (13%)
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTT--FGNPLWFTDDELLELK 130
+C +FE V + L+ L T + + KPYLDMLP P +++ ++ +K
Sbjct: 70 DCGTLFEGTNVPLKLLLCLGRTAQYGSQIFH-KPYLDMLPRVDILNVPYFWSKEQKECIK 128
Query: 131 GTTLYRATELQKQNLLTLYDDKVKDLVKKL-LVLDGDSERANSIFWTRALNIPLPHSYVF 189
GT L + L+ L L ++ K + L+ + S+ AN +N+ + + F
Sbjct: 129 GTNL--GSSLKDHMALIL-----EEWWKAINLLPEAVSKPANHF-----INMKFYYEFKF 176
Query: 190 PQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGE 249
+D KY ++++ ++ N S + +K+ A N +T +
Sbjct: 177 -HTDDDFYKYFVVDSANDVEN-WTSFPNYLWASCILKSRAFPTYLLKNEVQEEMTEDEC- 233
Query: 250 TLWIEGLVPGIDFCNHDLKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
L+P +D NHD+K+ W D ++ + F Y S +++ +
Sbjct: 234 -----MLLPIVDLLNHDVKSEVEWSATKDANSIVNFL-FRSY-------SAQDGQQLFNN 280
Query: 308 YGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAI 341
YG KGNEELL YGF +++N D L I P E I
Sbjct: 281 YGMKGNEELLMGYGFCLEDNAADTCALRIKVPKEII 316
>gi|402584499|gb|EJW78440.1| hypothetical protein WUBG_10651 [Wuchereria bancrofti]
Length = 362
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/240 (22%), Positives = 98/240 (40%), Gaps = 45/240 (18%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ + L ++L S W+PY+ +LP F PL+FT ++L L+ + L+ + L L
Sbjct: 6 LAVMLCCQKLVPESHWQPYIKVLPENFNTPLFFTVEQLQFLRPSPLFEES-------LLL 58
Query: 149 YDDKVKDLVKKLL-VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
Y + + + LL ++ D R + ++++K + I ++
Sbjct: 59 YRNVSRQFIHFLLEIIRSDQFRHRK------------------KKSKEMSKLEPIYVNSP 100
Query: 208 LSNDHNSRGELINGLNDIKNEAQRVNSQVNGATS-TLTSTQGETLWIEGLVPGIDFCNHD 266
L+ + + N + +++++N S L G+ I GL+P +D NH
Sbjct: 101 LTAANFT-------FNLYRWSVACISTRINMIPSEVLRDDIGQPRLIPGLIPFLDMANHS 153
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK---EISISYGNKGNEELLYLYGFV 323
A E V FS+ E + K ++I YG + N + L GFV
Sbjct: 154 YIEGAFHE--------SVHFSVEFDCAEIIAVRDYKPLEPVNIFYGWRSNRDFLLHNGFV 205
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 63/130 (48%), Gaps = 13/130 (10%)
Query: 70 IGPECR-AMFEDGEVDDR----FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDD 124
I PE + F +G+ DDR L+ L+L V + PY+++LP T+ PL +T D
Sbjct: 17 IWPEVKLPKFNEGD-DDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPETYDCPLSYTID 75
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK----LLVLDGDSER---ANSIFWTR 177
EL +KGT LY A E L+ + D L+++ D +R A+ FW+R
Sbjct: 76 ELNLMKGTKLYAAVEKINAFLMKVVDYYNNKLIQQFPQYFQSFDDLFKRLQWAHQSFWSR 135
Query: 178 ALNIPLPHSY 187
A + P +
Sbjct: 136 AFLVIYPQPF 145
>gi|340923530|gb|EGS18433.1| hypothetical protein CTHT_0050300 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 463
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
Query: 88 LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
L+IL L E L+ SS WKPYLD+LP++F P+++ DEL EL+ + L
Sbjct: 88 LLILVLMYEHLQGESSRWKPYLDVLPSSFDTPMFWAPDELAELQASAL 135
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 64/147 (43%), Gaps = 17/147 (11%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITP 61
+S + F WL N E + E G+ + +S + SD ++ P LAITP
Sbjct: 11 VSANGDVVAFKNWLAENGAEFHP-HAAFRTERSGYSVIASQDLRSDTTVVSCPFSLAITP 69
Query: 62 --MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS------WKPYLDMLPT 113
+ L+GP F +R L+ ++ + + S+ W PY+ MLP
Sbjct: 70 EVSKNALTTLLGP----TFTGQSWSERQLICSYICMHWILDPSASSELAHW-PYIRMLPA 124
Query: 114 --TFGNPLWFTDDELLELKGTTLYRAT 138
PL F+D EL LKG+ LY AT
Sbjct: 125 PDKLRTPLHFSDTELEALKGSNLYGAT 151
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 6/80 (7%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS-VERSSFHSEKEISISYGNKGNE 314
L+PGID NH +W V G + +S V S + E+ +YG K N
Sbjct: 218 LLPGIDSLNHSRGQPVSWVVS-----IGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNA 272
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
EL+ YGF + NPDD +++
Sbjct: 273 ELILGYGFSLPENPDDTIVL 292
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 60/132 (45%), Gaps = 5/132 (3%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T DPL+GP R+ V+D +
Sbjct: 33 GRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFV 92
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP ++ + + F +D+L GT+LY T+ +Q++ +D +
Sbjct: 93 KSRESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSI----EDDHR 148
Query: 155 DLVKKLLVLDGD 166
LV +L V D
Sbjct: 149 ALVVRLFVQHPD 160
>gi|342321631|gb|EGU13564.1| Cytoplasm protein, putative [Rhodotorula glutinis ATCC 204091]
Length = 1108
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 131/338 (38%), Gaps = 81/338 (23%)
Query: 33 ESKGFGIFSSNEFSDGVLLVV-PLDLAITP---MRVLQDPLIGPECRAMFEDGEVDDRFL 88
+S G +++S G V P LAITP R + D + P + + D +
Sbjct: 681 DSFGTCVYASEPLEKGCKAVCCPFSLAITPKQARRCVPD-TLLPSSPSSSRPKRLPDHEI 739
Query: 89 MILFLTVERLRKNS---------SWKPYLDMLPTT--FGNPLWFTDDELLELKGTTLYRA 137
M L+L + L K + +PY+D LP + PL+FT E L+GT LY A
Sbjct: 740 MTLYLCLHLLPKPAVERVSGLVLEHQPYVDFLPKSEAMRTPLYFTPAERELLRGTNLYGA 799
Query: 138 TELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL-NIPLPHSYVFPQNQEDL 196
T+ + +D + +++ D E + W R L + S F + D
Sbjct: 800 TQER--------EDDWRAEWREVTSWVTDEEVRKELTWERWLWGCTILSSRAFSSDLIDG 851
Query: 197 NKYDSINNSAELSNDHNSRGEL-INGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEG 255
+K +S + N R E + +D + E +RV+ +V + T+
Sbjct: 852 DKDNSTPVLFPGVDLLNHRPEARVTWFSDTETEIKRVDGRVEKGSLTI------------ 899
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ ++ A A ++ +YG K NEE
Sbjct: 900 ------VLDEEIPAGA-------------------------------QVYNTYGAKANEE 922
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALL 353
LL YGFV+ +N D+L + S+PL+ S +LL
Sbjct: 923 LLLGYGFVLPSNRADFLTLKL------SMPLNASPSLL 954
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQY 217
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 3/105 (2%)
Query: 35 KGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFED-GEVDDRFLMILF 92
+G G+F++ G L ++P I P V +D +G + + GE + + F
Sbjct: 52 EGLGVFANRRLEPGETLFMIPKSCCIYPELVFEDRQLGKSMQKLASAAGEGIEVVALATF 111
Query: 93 LTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYR 136
L E+++ SS+KP++D+LP +PL +TD+E+ L+GT +R
Sbjct: 112 LAREKMKGSESSYKPFIDVLPWDSLHPLLWTDEEVDLLEGTYAHR 156
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQY 217
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 86/238 (36%), Gaps = 63/238 (26%)
Query: 101 NSSWKPYLDMLPTTFGNPLWFTDDELLELKGTT--LYRATELQK--QNLLTLYDDKVKDL 156
+S W PYL LP + PL +T D + L G T LY E + QN + ++
Sbjct: 145 DSHWLPYLASLPKNYALPLMWTRDRIQNLLGGTSLLYMMIERLEWIQNSTKVVENACGHY 204
Query: 157 VKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRG 216
+ + A W+RA FP+ + L+ D H
Sbjct: 205 FPTGALTVQSMQWATCSIWSRA----------FPKAKPSLDLQDG---------SHQDVQ 245
Query: 217 ELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVD 276
+ I GL++I L P +D NH W +
Sbjct: 246 DWI-GLSEI-----------------------------CLFPILDMFNHKRGYRVEWRMT 275
Query: 277 GTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
G+ P + S E+ +YG KGNE LL YGFVI+NNP+DY +
Sbjct: 276 EKGVSFITPDGICKGS----------ELLNNYGPKGNENLLSNYGFVIENNPEDYFKV 323
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQY 217
>gi|405119137|gb|AFR93910.1| cytoplasmic protein [Cryptococcus neoformans var. grubii H99]
Length = 455
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 129/338 (38%), Gaps = 80/338 (23%)
Query: 32 DESKGFGIFSSNEFS-DGVLLVVPLDLAITP----------MRVLQDPLIGPECRAMFED 80
D S G +F+ + D L+ P LA+T V + LI P+ A ++
Sbjct: 31 DGSTGLSVFADKTINPDEKLVSCPFSLAVTSELATQAVCEIAGVEKGRLIWPKGHA--KE 88
Query: 81 GEV-DDRFLMILFLT---VERLRKNSSW------KPYLDMLP--TTFGNPLWFTDDELLE 128
GE ++R + +L + + R + W K YL LP + PL+FT+ EL
Sbjct: 89 GEPWNERMRIGAYLGFHWIYQERPETEWPRSLVHKTYLASLPPASNLSTPLYFTESELQL 148
Query: 129 LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL-NIPLPHSY 187
L G+ L A E +++ ++++ +L DG + W R L S
Sbjct: 149 LSGSNLLGAVEDRRKEW-----SAESEVLRSILNEDG-------LTWERYLATATYMSSR 196
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
FP DL +E +++++G + +
Sbjct: 197 AFPSKLLDLPS---------------------------DSEMTPQSTRIDGVSKPV---- 225
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
L+PG+D NH W G + V S + ++ +
Sbjct: 226 --------LLPGVDIFNHARGQPILWLSSLVSTPNGSQGVPSVSLVSSSICEAGSQLFNN 277
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIP 345
YG K NEELL YGFV+D+NPDD + + I S+P
Sbjct: 278 YGAKPNEELLLGYGFVLDSNPDDIVNLRL---GIASLP 312
>gi|238882888|gb|EEQ46526.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 433
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 95/249 (38%), Gaps = 78/249 (31%)
Query: 88 LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELK-----GTTLYRATELQ 141
L+ L+LT ER R + S WKP+LDMLP+ D EL+ + T L +TE++
Sbjct: 120 LLSLYLTFERKRSHKSFWKPFLDMLPS-------MDDFELMPIDWPQEVCTLLPSSTEVR 172
Query: 142 KQNLLTLYDDK---VKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNK 198
+ + + +D+ + +L+K + DGD F QE L
Sbjct: 173 NKKVRSRFDNDYQVICELIKTKIDKDGDVT-------------------TFLPRQEVLLS 213
Query: 199 YDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVP 258
+ IN+ L L KN A + P
Sbjct: 214 WLCINSRC-----------LYMDLPTSKNSADNFT----------------------MAP 240
Query: 259 GIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLY 318
+DF NH T ++DG G S +++ ++ +SYG N+ LL
Sbjct: 241 YVDFMNHSCDDHCTLKIDGKGFQVRT----------TSQYNTGDQVYLSYGPHSNDFLLC 290
Query: 319 LYGFVIDNN 327
YGFVI +N
Sbjct: 291 EYGFVIPDN 299
>gi|448535330|ref|XP_003870959.1| Rkm1 protein [Candida orthopsilosis Co 90-125]
gi|380355315|emb|CCG24832.1| Rkm1 protein [Candida orthopsilosis]
Length = 565
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 131/338 (38%), Gaps = 76/338 (22%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGF--GIFSSNEFSDGVLLVVPLDLAITPMRVL 65
K+ LQW N ++ I++ + G+ I SN D + + +L IT
Sbjct: 3 KIHSLLQWATDNGADVSK-NIEFKEIKPGYYGAISVSN---DKASIKISRELVIT----- 53
Query: 66 QDPLIGPECRAMFEDG--EVDDRFLMILFLTVERLRKNSSWKPYLDMLPT--TFGNPLWF 121
+ A+F+D E + L+ ++L + + S +KPYLD+LP+ +P +
Sbjct: 54 -----CDKATALFKDAYKEANHSSLLKVYLCYLQ-TQGSFYKPYLDLLPSLQAIDSPYTW 107
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+ + LKGT L + ++N+ +L + W A+NI
Sbjct: 108 SAQDKAYLKGTNLGNSL---RENIASLVEG-----------------------WWSAVNI 141
Query: 182 PLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATS 241
+P P++ IN N + + N N++ A+
Sbjct: 142 -IPKEVPKPESH-------FINLKFYYENKFYTDDDYYNYFNEVDINNWTSFPNYLWASL 193
Query: 242 TLTSTQGETLWIEG--------LVPGIDFCNHDLKAAATWE-VDGTGLITGVPFSMYLLS 292
L S I+ L+P +D NH+ KA A W VDG L
Sbjct: 194 ILKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKAKAEWCCVDGEFLF----------- 242
Query: 293 VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
+ +EI +YG KGNEELL YGF I+NN D
Sbjct: 243 -QSDGVSKGEEIFNNYGQKGNEELLLAYGFAIENNTAD 279
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/302 (23%), Positives = 115/302 (38%), Gaps = 54/302 (17%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P T DPL+GP R+ V+D + L
Sbjct: 25 GRGVKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ +P ++F +DEL +G++L+ L T + +V+
Sbjct: 85 KSRTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLH--------TLTTQLEQRVQ 136
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQ---EDLN---KYDSINNSAEL 208
D ++LLV + H +FP +Q ED++ + S L
Sbjct: 137 DDFRQLLV-----------------QLLSQHRDLFPLDQFTIEDVSYIAAFPRPTRSISL 179
Query: 209 SNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLK 268
N + + L I + A ++ A S TS + + P D NH L
Sbjct: 180 MNLYFPF-QYKWALCTIWSRA------MDFAVSDTTS-------VRLVAPLADMLNHSLD 225
Query: 269 AAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNP 328
D P S L + + +I I YG+ N LL LYGFV+ +NP
Sbjct: 226 VKQCHAYD--------PTSGDLSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNP 277
Query: 329 DD 330
+D
Sbjct: 278 ND 279
>gi|116180202|ref|XP_001219950.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
gi|88185026|gb|EAQ92494.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
Length = 510
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 100/236 (42%), Gaps = 39/236 (16%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-------DGVLLVVPLDLAITPMRVLQD 67
W +N V K+ + E KG+G+ S + L+ VP L + V +
Sbjct: 14 WAHLNDVSFHNIKV-INTEDKGYGVVSDEDLKATSQANDTSALITVPHGLVLNAAAVEEY 72
Query: 68 PLIGPECRAMFED-GEVDDRFLMILFLTVERLRKN----------SSWKPYLDMLPTTFG 116
R + + G R ++LFL V+ + + + W YL LP T
Sbjct: 73 AKEDKGFRLLLDAIGHRSTRADVLLFLLVQMVLGSRPSHANVGLSNPWTEYLKFLPETVL 132
Query: 117 NPLWFTDDELLELKGTTLYRATELQKQNLLTLYD---DKVKDLV--KKLLVLDG------ 165
P +T+DE L L+GT+L A + + L +D +K D++ LL ++G
Sbjct: 133 VPTLWTEDERLLLRGTSLEAAVDAKISALDAEFDLVREKSSDIIAWNDLLWMEGVPVSFT 192
Query: 166 DSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN--DHNSRGELI 219
D R ++++ +R L +P + P D IN+SA S D N++ E++
Sbjct: 193 DWIRLDALYRSRCLELPTSGESMVPC-------IDMINHSATPSAYYDENTKDEVV 241
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNIT----LAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQY 217
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ E ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVDGFEMVNFKEE--GFGLRATREIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|326427099|gb|EGD72669.1| hypothetical protein PTSG_04400 [Salpingoeca sp. ATCC 50818]
Length = 1063
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 12/85 (14%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D +HD +A TW GT ++ + R + + + +YG KGNEE
Sbjct: 308 LLPLLDLFDHDPQADVTWRNTGT----------HVRLITREAVAPGEPVFNNYGGKGNEE 357
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPA 338
L+ YGF + NN D ++M+ PA
Sbjct: 358 LMLAYGFALPNNKHDDMHVMLGLPA 382
>gi|365989204|ref|XP_003671432.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
gi|343770205|emb|CCD26189.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
Length = 589
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 95/228 (41%), Gaps = 31/228 (13%)
Query: 100 KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
K S W Y+++L P ++T D+L +LKGT LY + Q+ T+ + + +L++
Sbjct: 102 KKSQWDKYVEILSLDLNQPYFWTVDQLQQLKGTDLYIKIQ---QDFATIIQEYI-ELLQI 157
Query: 160 LLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELI 219
L V D E+ + + +N LP + N + S S + SR
Sbjct: 158 LKVDILDQEKLQTATISHYINSHLP---TLLDGKLPWNHFVSYLWSHCI---FKSRAFPQ 211
Query: 220 NGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTG 279
LN+ ++ +N + L P +D NH WE +
Sbjct: 212 LLLNNAGSDVGNIN-------------------LAFLFPIVDLLNHKNDVVVKWE--SSN 250
Query: 280 LITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN 327
I + L + + + H +I +YGNK NEELL YGF+ +NN
Sbjct: 251 DINNKNDNKVLTFITQETLHVGDQIFNNYGNKSNEELLLGYGFIQENN 298
>gi|85090666|ref|XP_958526.1| hypothetical protein NCU09827 [Neurospora crassa OR74A]
gi|28919896|gb|EAA29290.1| predicted protein [Neurospora crassa OR74A]
Length = 532
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 34/51 (66%)
Query: 84 DDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
D L+IL L E L+ +S+W PYL +LPT F P+++T+ EL EL+ + L
Sbjct: 124 DSWTLLILILMHEYLQGSSNWSPYLSILPTQFDTPMFWTEAELSELQASAL 174
>gi|68467835|ref|XP_722076.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|68468152|ref|XP_721915.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46443858|gb|EAL03137.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46444024|gb|EAL03302.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
Length = 433
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 97/249 (38%), Gaps = 78/249 (31%)
Query: 88 LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELK-----GTTLYRATELQ 141
L+ L+LT ER R + S WKP+LDMLP+ D EL+ + T L +TE++
Sbjct: 120 LLSLYLTFERKRSHKSFWKPFLDMLPS-------MDDFELMPIDWPQEVCTLLPSSTEVR 172
Query: 142 KQNLLTLYDDK---VKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNK 198
+ + + +D+ + +L+K + DGD + LP QE L
Sbjct: 173 NKKVRSRFDNDYQVICELIKTKIDKDGD------------VTTLLP-------RQEVLLS 213
Query: 199 YDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVP 258
+ IN+ L L KN A + P
Sbjct: 214 WLCINSRC-----------LYMDLPTSKNSADNFT----------------------MAP 240
Query: 259 GIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLY 318
+DF NH T ++DG G S +++ ++ +SYG N+ LL
Sbjct: 241 YVDFMNHSCDDHCTLKIDGKGFQVRT----------TSQYNTGDQVYLSYGPHSNDFLLC 290
Query: 319 LYGFVIDNN 327
YGFVI +N
Sbjct: 291 EYGFVIPDN 299
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERANPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 95/435 (21%), Positives = 175/435 (40%), Gaps = 92/435 (21%)
Query: 12 FLQWLQVNKVELR---GCKIK-YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD 67
FL WL+ + + G I G GI E L V+P DL ++
Sbjct: 20 FLLWLKRSSPHFKMHPGIHIADLRSTGAGRGISEDEE-----LFVIPNDLILSVQNSEAR 74
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDEL 126
++G +D ++ +I+ + E + + S W PY +LP++F +++TD++L
Sbjct: 75 SVLG------LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPSSFDTLMFWTDEQL 128
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
EL+G+ + ++ K + DD + V L+ +ANS ++ N+P
Sbjct: 129 SELQGSAV--VGKIGK----AVADDTILQKVVPLI-------QANSRYFPPRPNMP---- 171
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDI-KNEAQRVNSQVNGATSTLTS 245
LN DS N A L H ++ DI K + ++ +G
Sbjct: 172 --------PLNSPDSQN--ALLCLAHRMGSIIMAYAFDIEKTDEADEDTAEDGYM----- 216
Query: 246 TQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFHSEKE 303
T E +G+VP D N D + A ++ +G+ ++ + + HS +E
Sbjct: 217 TDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKAI-----------KNIHSGEE 265
Query: 304 ISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIP-LSDSKALLLEEQKAQLR 362
I YG +LL YG+V DN Y ++ + +AI + L D + +L
Sbjct: 266 IFNDYGELPRADLLRRYGYVTDNYAQ-YDVVEFSLDAICKVAGLPDGEP---SPTNPRLE 321
Query: 363 CLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVFPENFLTAL 422
L +LE G+ P++G ++ + PE+FL L
Sbjct: 322 LLDNLDMLEEGYNIPRIPRNGTLEDAI-------------------------PEDFLVLL 356
Query: 423 RTIAMQEDEISKVSS 437
R + + ++++++ +
Sbjct: 357 RALTLPLEDLNRLGA 371
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERANPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|353240986|emb|CCA72828.1| hypothetical protein PIIN_06764 [Piriformospora indica DSM 11827]
Length = 424
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 13/87 (14%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFH----SEKEISISYGNK 311
L+P +D NH+ +W +D P + L+ FH S E+ +YG K
Sbjct: 214 LIPLVDAFNHERAKPVSWSIDQP---NSKPTCLSLI------FHTPSPSGSELYNNYGAK 264
Query: 312 GNEELLYLYGFVIDNNPDDYLMIHYPA 338
N+ELL YGF I +NPDD L++ P
Sbjct: 265 PNDELLLGYGFTIPDNPDDTLLLKLPG 291
>gi|389644184|ref|XP_003719724.1| SET domain-containing protein 8 [Magnaporthe oryzae 70-15]
gi|351639493|gb|EHA47357.1| SET domain-containing protein 8 [Magnaporthe oryzae 70-15]
Length = 478
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 90/212 (42%), Gaps = 27/212 (12%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS--DG----VLLVVPLDLA 58
++ ++ L W + N V G I + E KG+G+ + + DG VLL VP +L
Sbjct: 4 SDVPIDTLLIWARFNGVVFDGAAITQT-EGKGYGLVAQRDLQAKDGEDTTVLLSVPRELL 62
Query: 59 ITPMRVLQDPLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSS---------WKPYL 108
+ V Q R +F+ G R +ILFL + + +S W Y+
Sbjct: 63 LNSEYVEQCSKTDQRFRDLFDAAGHQSPRQDVILFLMAQIIHIWASDEGGGVSNPWTQYI 122
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD----------LVK 158
LP T P + +DE L+GT+L A + + L +D+ ++ L +
Sbjct: 123 KYLPRTVPLPTLWNEDERQLLRGTSLEAAVHSKLRALENEFDNLLEKAAEIPSWNEVLCE 182
Query: 159 KLLVLDGDSERANSIFWTRALNIPLPHSYVFP 190
K +V D R ++ + +R + +P + P
Sbjct: 183 KQVVTVSDYARLDAWYRSRCMELPASGPTMVP 214
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 69/142 (48%), Gaps = 12/142 (8%)
Query: 13 LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 1 MKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 57
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 58 GPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVR 113
Query: 128 ELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 114 YLQSTQAIHDVFSQYKNTARQY 135
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 84 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 140
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 141 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 196
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 197 RYLQSTQAIHDVFSQYKNTARQY 219
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 71/161 (44%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P + T DPL+GP R+ V+D +
Sbjct: 112 GRGVKTLRRFKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFI 171
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ PT++ + ++F ++EL GT+LY T+ +++ +D +
Sbjct: 172 RSRESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRSI----EDDYR 227
Query: 155 DLVKKLLV-------LDG----DSERANSIFWTRALNIPLP 184
LV ++L LD D + A W+RA++ LP
Sbjct: 228 TLVVRVLAQSRDLFPLDKFSIEDYKWALCTVWSRAMDFVLP 268
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 19/148 (12%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAIT 60
M S+E ++E F WL+ V + +G G+ ++ + G +L +I
Sbjct: 1 MNSSSETRIEAFNHWLKDRGVIQHPSISIHHFGHQGHGLIATEDIEAGTIL-----FSIP 55
Query: 61 PMRVLQDPL--IGPE---CRAMFEDGEVDDRFLMILFLTV--ERLR-------KNSSWKP 106
V PL IG + D E R + L +T+ ER R + SW+P
Sbjct: 56 RPPVSNSPLLTIGTSDFLSKLGTSDAEKISRNWIPLLMTMMWERARGYDQSVPSHMSWRP 115
Query: 107 YLDMLPTTFGNPLWFTDDELLELKGTTL 134
Y +M+PT F ++++DDEL EL+ +T+
Sbjct: 116 YFEMMPTEFDTLMFWSDDELKELQASTV 143
>gi|440465458|gb|ELQ34778.1| hypothetical protein OOU_Y34scaffold00745g53 [Magnaporthe oryzae
Y34]
gi|440487677|gb|ELQ67452.1| hypothetical protein OOW_P131scaffold00314g25 [Magnaporthe oryzae
P131]
Length = 544
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 49/87 (56%), Gaps = 8/87 (9%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+IL + E LR++ S W+PYLD+LPT F P+++T +E+ EL+ + + + ++
Sbjct: 76 LILVMMYEHLRRDPSPWRPYLDVLPTEFETPMFWTSEEIAELQASPV--VASIGREEADV 133
Query: 148 LYDDKVKDLVKKLL-----VLDGDSER 169
+ K+ ++K+ V D D ER
Sbjct: 134 MIRTKILPVIKENAAVFGGVTDADDER 160
>gi|440472932|gb|ELQ41762.1| SET domain-containing protein 8 [Magnaporthe oryzae Y34]
gi|440478704|gb|ELQ59514.1| SET domain-containing protein 8 [Magnaporthe oryzae P131]
Length = 478
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 90/212 (42%), Gaps = 27/212 (12%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS--DG----VLLVVPLDLA 58
++ ++ L W + N V G I + E KG+G+ + + DG VLL VP +L
Sbjct: 4 SDVPIDTLLIWARFNGVVFDGAAITQT-EGKGYGLVAQRDLQAKDGEDTTVLLSVPRELL 62
Query: 59 ITPMRVLQDPLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSS---------WKPYL 108
+ V Q R +F+ G R +ILFL + + +S W Y+
Sbjct: 63 LNSEYVEQCSKTDQRFRDLFDAAGHQSPRQDVILFLMAQIIHIWASDEGGGVSNPWTQYI 122
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD----------LVK 158
LP T P + +DE L+GT+L A + + L +D+ ++ L +
Sbjct: 123 KYLPRTVPLPTLWNEDERQLLRGTSLEAAVHSKLRALENEFDNLLEKAAEIPSWNEALCE 182
Query: 159 KLLVLDGDSERANSIFWTRALNIPLPHSYVFP 190
K +V D R ++ + +R + +P + P
Sbjct: 183 KQVVTVSDYARLDAWYRSRCMELPASGPTMVP 214
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 107/296 (36%), Gaps = 66/296 (22%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G +L +P T DPL+GP R+ V+D + L
Sbjct: 909 GRGVKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFV 968
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ +P ++F +DEL +G++L+ L T + +V+
Sbjct: 969 KSRTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLH--------TLTTQLEQRVQ 1020
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
D ++LLV + H +FP +Q + Y
Sbjct: 1021 DDFRQLLV-----------------QLLSQHRDLFPLDQFTIEDYKW------------- 1050
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
L I + A ++ A S TS + + P D NH L
Sbjct: 1051 ------ALCTIWSRA------MDFAVSDTTS-------VRLVAPLADMLNHSLDVKQCHA 1091
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
D P S L + + +I I YG+ N LL LYGFV+ +NP+D
Sbjct: 1092 YD--------PTSGDLSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPND 1139
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 93/194 (47%), Gaps = 9/194 (4%)
Query: 4 STEAKLEPFLQWLQVNKVELRG-CKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITP 61
+ E KL FLQW + N + L K+ ++ S +G+ ++ + G VL VP L +
Sbjct: 95 TVEKKLLNFLQWCKANNLNLSSKVKVDFNGTSHRYGMLATEDIKKGEVLFTVPRQLLLNQ 154
Query: 62 -MRVLQDPLIGPECRAMFEDGEVDDR---FLMILFLTVERLRKNSSWKPYLDMLP--TTF 115
L++ L E ++D +++ L E +K+S W YL ++P + F
Sbjct: 155 NTATLKNRLNEFEKWLDTHGKSLNDSSGWLPLLITLMWEFNQKDSFWASYLLLVPEISEF 214
Query: 116 GNPLWFTDDEL-LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
G+PL++ ++E LE +G L + ++N+ T Y + V +++ L G E + F
Sbjct: 215 GHPLFWKEEEYNLEFQGMPLLNDIIVDRENIETEYAEFVLLFLRRNKDLFGSLENYSLEF 274
Query: 175 WTRALNIPLPHSYV 188
+ R + + +S+
Sbjct: 275 FKRMVAFVMAYSFT 288
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 13 LQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIG 71
L+WL++ K KI+ YS++ +G + + ++L +P IT + + ++ +
Sbjct: 140 LEWLKIGKAIFPKIKIECYSEDYRGVNAKQTIN-AKELILFIPKSHMIT-LEMAKETTVA 197
Query: 72 PECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDELLELK 130
+ D + FL E+ R NS WKPY+D+LP+++ + P+++ + +L LK
Sbjct: 198 KKMMQFRLDLLSPKHSFLSTFLLQEKFRPNSFWKPYIDILPSSYPSFPIFYNNSDLEWLK 257
Query: 131 GTTLYRATELQKQNLLTLYDD 151
G+ + + + +L Y+D
Sbjct: 258 GSPFLKQIKDKLADLQKDYND 278
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 39/75 (52%), Gaps = 11/75 (14%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH+ K W G +L + +S S +E+ +YG KGNEE
Sbjct: 216 LLPVVDLLNHNPKTKVQWSGTDGG---------FLFQSDDAS--SGEELFNNYGQKGNEE 264
Query: 316 LLYLYGFVIDNNPDD 330
LL YGF I+NNP D
Sbjct: 265 LLLAYGFAIENNPAD 279
>gi|389629438|ref|XP_003712372.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
gi|351644704|gb|EHA52565.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
Length = 499
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 49/87 (56%), Gaps = 8/87 (9%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+IL + E LR++ S W+PYLD+LPT F P+++T +E+ EL+ + + + ++
Sbjct: 111 LILVMMYEHLRRDPSPWRPYLDVLPTEFETPMFWTSEEIAELQASPV--VASIGREEADV 168
Query: 148 LYDDKVKDLVKKLL-----VLDGDSER 169
+ K+ ++K+ V D D ER
Sbjct: 169 MIRTKILPVIKENAAVFGGVTDADDER 195
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQSLPSEYDTPLYFEEDEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQY 217
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P IDF NH +W + T S+ L++ S+ + +E+ +YG K N E
Sbjct: 178 LLPAIDFLNHARGQRVSWIANQTQEDGKSVSSISLIA--HSAIWTGQEVFNNYGPKPNSE 235
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF I +NPDD +++
Sbjct: 236 LILSYGFSIQDNPDDSIIL 254
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 138/340 (40%), Gaps = 73/340 (21%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQD 67
F+ WL+ N + KI D ++ G GI + + D L VP +T
Sbjct: 14 FVNWLRDNGASI-SAKITLDDLRQQNAGRGIVAVEDLDEDEELFSVPRSTMLTTE----- 67
Query: 68 PLIGPECRAMFEDGEVDDRFL-MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDE 125
A+ + EVDD +L +I+ + +E L S WKPY D+LP +F N ++++D E
Sbjct: 68 --TSRNGEAVLQ--EVDDPWLSLIVVMALEYLDGSQSRWKPYFDVLPVSFDNLMFWSDRE 123
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L L+G+T +V K+ E A++ F R IP+
Sbjct: 124 LRHLEGST----------------------VVGKI-----GKEAADATF--REQLIPVI- 153
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTS 245
E ++K + +N L H ++ D++ + + + +G S
Sbjct: 154 --------ERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSSDQAKN--DGEEWEEDS 203
Query: 246 TQGETLWIEGLVPGIDFCNHDL---KAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK 302
GETL +G+VP D N D A +E D + T P + +
Sbjct: 204 DAGETL-PKGMVPLADMLNADADRNNAKLFYEDDKVVMKTIKP------------VKAGE 250
Query: 303 EISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIH 342
E+ +G+ +LL YG++ DN Y ++ PA+ I
Sbjct: 251 ELYNDFGSLPRADLLRRYGYLTDNYA-QYDVVEIPADLIK 289
>gi|448101795|ref|XP_004199647.1| Piso0_002187 [Millerozyma farinosa CBS 7064]
gi|359381069|emb|CCE81528.1| Piso0_002187 [Millerozyma farinosa CBS 7064]
Length = 603
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 115/273 (42%), Gaps = 30/273 (10%)
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPT--TFGNPLWFTDDELLELK 130
+C +FE V + L+ L T + KPYLDMLP P +++ ++ +K
Sbjct: 70 DCGTLFEGTNVPLKLLLCLGRTAQ-YGSQIFHKPYLDMLPRADVLNVPYFWSKEQKECIK 128
Query: 131 GTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFP 190
GT L + + +L + + L+ +G S+ AN +N+ + + F
Sbjct: 129 GTNLGSSLKDHMAVILEEWWRAIN------LLPEGVSKPANHF-----INMKFYYEFKF- 176
Query: 191 QNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGET 250
+DL KY ++++ ++ N S + +K+ A N + +
Sbjct: 177 HTDDDLYKYFVVDSANDVEN-WTSFPNYLWASCILKSRAFPAYLLKNEVEEEIKEDEC-- 233
Query: 251 LWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
L+P +D NHD+K+ W T V F S + +++ +YG
Sbjct: 234 ----MLLPIVDLLNHDVKSEVEWSATKDENAT-VNFLFRSFSAQDG-----QQLFNNYGR 283
Query: 311 KGNEELLYLYGFVIDNNPDD--YLMIHYPAEAI 341
KGNEELL YGF +++N D L I P E I
Sbjct: 284 KGNEELLMGYGFCLEDNAADTCALRIKVPKEII 316
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 71/161 (44%), Gaps = 11/161 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
+E +W N L G +I + E+ +G+ ++ + G L+ VP L +T +
Sbjct: 86 IEKLTKWATKNGAILNGVEI-HQFENYAYGMKANKNITVGDKLVTVPRALMMTEENIPSS 144
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN--SSWKPYLDMLPTTFGNPLWFTDDE 125
PL + M + + +F+ VE LRK+ S W YL LP T+ P++F +
Sbjct: 145 PLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLPVTYSTPVYFDVAD 202
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGD 166
L LKG+ + A +N+ Y KKL L D
Sbjct: 203 LEALKGSPAFEAALKLNRNIARQY-----AYFKKLFQLSND 238
>gi|323348380|gb|EGA82627.1| YHL039W-like protein [Saccharomyces cerevisiae Lalvin QA23]
Length = 277
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 7/135 (5%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAIT 60
T+ +L+ LQW Q N + KI + E G F + + S D L+ VP L IT
Sbjct: 3 TQTELDNCLQWAQNNGAFIDP-KISFRITEDAGVSAFVNEKLSPKPDQALIRVPETLLIT 61
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLW 120
+ L + R++ V +L L + + S +KPYLD+LP P +
Sbjct: 62 SQQALSEFSQAANERSLL--NSVTQLYLSKLKFGTDAVHLKSFYKPYLDVLPLHLPQPYF 119
Query: 121 FTDDELLELKGTTLY 135
++ DE++ L GT +Y
Sbjct: 120 WSTDEVMNLHGTDVY 134
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 44/77 (57%)
Query: 83 VDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQK 142
VD+R + LF+ E + +S W+PY D LP+ P+++ D+++ L G+ L+ A QK
Sbjct: 42 VDNRLPLALFMLHELRKPDSFWRPYFDALPSRVNLPMFWADEDMQLLAGSPLHAAVLAQK 101
Query: 143 QNLLTLYDDKVKDLVKK 159
+ + + + +V++
Sbjct: 102 KQARDWHTEHIVPIVRR 118
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQSLPSEYDTPLYFEEDEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RYLQST 200
>gi|320588047|gb|EFX00522.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 508
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 106/266 (39%), Gaps = 64/266 (24%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+IL + E LR + S WK YLD+LP TF P+++T++EL EL+ +
Sbjct: 71 LILIMIFEFLRGDKSPWKSYLDVLPATFDTPMFWTENELRELQASP-------------- 116
Query: 148 LYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYV-FPQNQEDLNKYD------ 200
V + + K+ A+ + + L + HS V FP+ ++++ D
Sbjct: 117 -----VVERIGKV--------EADEMITKKVLPVIRAHSGVFFPKGSQEISDEDLFGLAH 163
Query: 201 -----SINNSAELSNDHNSRGELINGLNDIKNEA-----QRVNSQVNG-ATSTLTSTQGE 249
+ S L ND + E +G D A + V NG A + ++ +
Sbjct: 164 RMGSTILAYSFSLENDDSDDEEKGDGCKDGSGCACHGGDKMVLEDENGDADADAIESEDD 223
Query: 250 TLWIE--------GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSE 301
W+E G+VP D N D + A GT ++T +
Sbjct: 224 DGWVEDREDQLPLGMVPMADMLNADAEFNAHIN-HGTDVLTATSV---------RHIKAG 273
Query: 302 KEISISYGNKGNEELLYLYGFVIDNN 327
+EI YG N EL+ YG+ + +
Sbjct: 274 EEILNFYGPLSNGELMRRYGYTTEKH 299
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 13/86 (15%)
Query: 255 GLVPGIDFCNHDLKAA----ATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
++P ID CNH + A E D G GV + R + S + I +SYGN
Sbjct: 247 AMLPVIDICNHSFNPSVSVRAIEEGDNAG---GVEL------IARRALTSGEPIELSYGN 297
Query: 311 KGNEELLYLYGFVIDNNPDDYLMIHY 336
N+ELL YGF++ +NP D + + +
Sbjct: 298 LSNDELLLDYGFIVKDNPFDCVKLRW 323
>gi|189193345|ref|XP_001933011.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187978575|gb|EDU45201.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 642
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 71/158 (44%), Gaps = 26/158 (16%)
Query: 4 STEAKLEPFLQWLQVNKVELR-GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPM 62
+ A E + W N +L +I YSD ++GF + + S V+ PL L ++ +
Sbjct: 10 NAHALAESLVAWFTSNGGQLSPDVEIVYSD-ARGFHMRALRPLSSPVVTKCPLKLTLSCL 68
Query: 63 RV---------LQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLP 112
+ ++ PL +CR G + D L L L +R + +S W Y+ LP
Sbjct: 69 NLDPGEQGVLHIESPLR--QCR-----GRIPDHILTYLLLIEQRDKGQASPWHAYIACLP 121
Query: 113 TT--FGNPLWFTDDELLELKGTTLY-----RATELQKQ 143
+ PLWF + ++ L GT+L R ELQ+Q
Sbjct: 122 NSRDMTTPLWFDEGDMAFLAGTSLVPAAKERKAELQQQ 159
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 9/88 (10%)
Query: 241 STLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHS 300
ST + ET+ I L P +D NH + A W+ G F++ L + SF
Sbjct: 197 STHILPERETVPI--LFPVVDILNHSVSAKVEWDFQ-----PGQSFALKCL--DGDSFQP 247
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNP 328
E+E+ +Y K N+ELL YGF ++NNP
Sbjct: 248 EQELFNNYAPKQNDELLLGYGFCLENNP 275
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RYLQST 200
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 69/151 (45%), Gaps = 9/151 (5%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRV 64
T K E L W+Q NK E K+KY + +D ++ +P + IT
Sbjct: 146 TLKKSENLLSWVQANKGEFSSIKLKYLSTHNRSIVSKRIIQADETVISIPQEQVIT---- 201
Query: 65 LQDPLIGPECRAMFEDGE---VDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGN-PL 119
L C+ + E LFL E+ +K++S +K Y+D LPT + P
Sbjct: 202 LDVASSSDFCKILTEKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYIDSLPTDLSSFPA 261
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYD 150
F+++EL L+GT + + QK+++ T Y+
Sbjct: 262 LFSEEELQYLEGTAALKLVQEQKEDIKTDYE 292
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 66/144 (45%), Gaps = 6/144 (4%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
+E +W N L G +I + E+ +G+ ++ + G L+ VP L +T +
Sbjct: 86 IEKLTKWATKNGAILNGVEI-HQFENYAYGMKANKNITVGDKLVTVPRALMMTEENIPSS 144
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN--SSWKPYLDMLPTTFGNPLWFTDDE 125
PL + M + + +F+ VE LRK+ S W YL LP T+ P++F +
Sbjct: 145 PLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLPVTYSTPVYFDVAD 202
Query: 126 LLELKGTTLYRATELQKQNLLTLY 149
L LKG+ + A +N+ Y
Sbjct: 203 LEALKGSPAFEAALKLNRNIARQY 226
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 72/153 (47%), Gaps = 15/153 (9%)
Query: 8 KLEPFLQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQ 66
K + L+WL+ + KI+ Y+++ +G + S V+L VP IT + + +
Sbjct: 160 KQKRLLEWLKSGQALFPKIKIECYAEDYRGVNARKAIS-SKEVILFVPRSHMIT-LEMAK 217
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDE 125
D + + D + FL E+ ++S WKPYLD+LP ++ N P++F D +
Sbjct: 218 DTPVAKKIIQYRLDLLSPKHSFLSTFLLQEKKIQDSFWKPYLDVLPKSYSNFPIFFNDSD 277
Query: 126 LLELKGTTLYRATELQKQNLLTLYDDKVKDLVK 158
L LKG+ + + DK+ DL K
Sbjct: 278 LEWLKGSPFLKQVK-----------DKITDLKK 299
>gi|398405066|ref|XP_003853999.1| hypothetical protein MYCGRDRAFT_91971 [Zymoseptoria tritici IPO323]
gi|339473882|gb|EGP88975.1| hypothetical protein MYCGRDRAFT_91971 [Zymoseptoria tritici IPO323]
Length = 597
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 69/300 (23%), Positives = 105/300 (35%), Gaps = 75/300 (25%)
Query: 40 FSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR 99
F+S+ D + VP LA++ + L D + ++F + + + +L + +
Sbjct: 41 FASSLGPDSRICTVPHSLALSSLNALVD-----DSFSVFRNRGLAPEAIGYFYLMHQYIN 95
Query: 100 KNSS-WKPYLDMLPTT---FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD 155
K S W+PYL LP P WF D++L L T + T+ +Q L + K D
Sbjct: 96 KAKSFWRPYLQTLPGPEHEHLTPFWFEDEDLQWLADTDVLHTTK-ARQKLQEEHYAKGID 154
Query: 156 LVKKLLVLDGDSERANSIFWTRALNIPLPHSY-VFPQNQEDLNKYDSINNSAELSNDHNS 214
++ + + D E +W +Y V PQ Q D SAE +
Sbjct: 155 MLNRAKI---DVEPYTCKYWA---------AYKVGPQGQRQTALVDMSRVSAEDQD---- 198
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
L P ID NH A W
Sbjct: 199 --------------------------------------FPVLFPVIDIPNHSPTARVDWA 220
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
D V + +E +YG K N+ELL YGF I NN DD +++
Sbjct: 221 FDPGRFSITV----------KDPIPGGEEAFNNYGPKSNDELLLGYGFCIPNNLDDKVLL 270
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 72/329 (21%), Positives = 123/329 (37%), Gaps = 53/329 (16%)
Query: 17 QVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECR 75
V VE+R +G G+ ++ + G L+ +PL+ ++ + IG E
Sbjct: 135 HVGPVEIRAVW------GRGRGVVTTRNVTKGETLVAIPLEKCLSTFSARKS-AIG-EAL 186
Query: 76 AMFEDGEVDDRFLMILFLTVERL--RKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
EV ++ L L E R+ S W P++ +LP PL +T EL +L+G+
Sbjct: 187 KTITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRDVETPLLWTPRELAQLEGSN 246
Query: 134 LYRATELQKQNLLTLYDDKVKDLVKKLLVLDGD----SER---ANSIFWTRALNIPLPHS 186
L + + T D L +K L + +ER A +I W+RA ++P+P
Sbjct: 247 LIGFRDAVLKGWTTQRDALFPKLTQKFPSLFPEEHFRTERWAWAMAIVWSRAADVPVPRP 306
Query: 187 -YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTS 245
+FP + + I ++ N +G + V
Sbjct: 307 EAIFPSGDDKSRELRVIVPLFDMIN---------HGYDHAPVTPGGVKGGGGEGREKGGV 357
Query: 246 TQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEIS 305
++ L+P D + + GVPF V
Sbjct: 358 GVDDS---PALIPSWDPSRRMV-----------AIRAGVPFPGPNYEVR----------- 392
Query: 306 ISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+YG K ++ +L YGFV NNPD+ + +
Sbjct: 393 FNYGAKPSQHVLLQYGFVPMNNPDESVEV 421
>gi|50554725|ref|XP_504771.1| YALI0E34397p [Yarrowia lipolytica]
gi|49650640|emb|CAG80378.1| YALI0E34397p [Yarrowia lipolytica CLIB122]
Length = 492
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 113/302 (37%), Gaps = 87/302 (28%)
Query: 36 GFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTV 95
G I + D L+ +P L I + +++ G D + L F V
Sbjct: 28 GDSIVAKAALKDQELINIPTKLIINHEKAVKE--FGAASSTF--STVADKQSLTKYFFAV 83
Query: 96 ERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
E+ + +S + PY+ LP PL+FT ++ L GT L E K + L++ + K
Sbjct: 84 EKNKGSDSFFHPYISQLPKKVTTPLYFTPEQQESLVGTNL----EFYKNDKTELWEKEFK 139
Query: 155 DL---VKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSND 211
L VK + L+ D A++IF +R+ FP+
Sbjct: 140 KLQQFVKTSVTLE-DYLYASTIFTSRS----------FPER------------------- 169
Query: 212 HNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA 271
L D KNE + L+P +D NH A
Sbjct: 170 ----------LMDPKNED-----------------------LSMLIPVLDLINHKPLTAV 196
Query: 272 TWEVDGTGLITGVPFSMYLLS-VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
W V TG F+ S V + S ++ +YG+KGNEELL YGF + +N D
Sbjct: 197 EWNV------TGDAFAFKACSDVVKGS-----QVFNNYGSKGNEELLGAYGFALKDNEFD 245
Query: 331 YL 332
+
Sbjct: 246 TM 247
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RYLQST 200
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 64/124 (51%), Gaps = 10/124 (8%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
++W N + G ++ E +GFG+ ++ + ++ + L VP L +T + ++ ++G
Sbjct: 15 MKWASENGASVEGFEMVNFKE-EGFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVLG 72
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
P + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 73 PLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRY 128
Query: 129 LKGT 132
L+ T
Sbjct: 129 LQST 132
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 56/99 (56%), Gaps = 3/99 (3%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+++ + E K+S W+PYLD+LP T P+++ DD+L L+GT+ + L K++ T+
Sbjct: 101 LVVAMMYEITNKDSFWRPYLDLLPETLDTPMFWNDDDLELLEGTSTL--SHLGKEDAETI 158
Query: 149 YDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
+ +++ + KL D + N + R ++ + +S+
Sbjct: 159 FTEQIVPFM-KLHPTHFDLKVHNMALYHRVASVIMAYSF 196
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 92/246 (37%), Gaps = 63/246 (25%)
Query: 91 LFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
L L ER + +S W PY+ LP TF P++F +++ L+ L + + LL
Sbjct: 143 LRLLQERAKSDSFWWPYIANLPETFTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFE- 201
Query: 151 DKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN 210
K++ +KL + P S
Sbjct: 202 ---KEIQQKL--------------------------HTVP------------------SV 214
Query: 211 DHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAA 270
DH G+ +N + + ++ A+S GE I L+P ID CNH
Sbjct: 215 DHPFYGQDVN--------SSSLGWAMSAASSRAFRLHGE---IPMLLPLIDMCNHSFNPN 263
Query: 271 ATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
A +G+ + + S+ + V I+++YG N+ L YGFVI NP D
Sbjct: 264 ARIVQEGS--VNSLDMSVKV--VAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYD 319
Query: 331 YLMIHY 336
+ + Y
Sbjct: 320 QVELSY 325
>gi|428177623|gb|EKX46502.1| hypothetical protein GUITHDRAFT_138238 [Guillardia theta CCMP2712]
Length = 486
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAI--- 59
S + + E L+W + + + ++ DE G +F+S + D V+ VVP ++I
Sbjct: 165 SEQERREKLLEWAREHGIGFEKISLQ-EDEFGGTAMFASEDLEEDEVIGVVPFSISIGRE 223
Query: 60 TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFG 116
+ R L+G ++ED + +FL +ER +S ++PYLDMLPT G
Sbjct: 224 SLWRSRHGELLG----QLYEDERTPPDLISCIFLLLERRSSSSFFRPYLDMLPTPSG 276
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 5/132 (3%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T DPL+GP ++ +D L
Sbjct: 25 GRGVKTLRPFKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LPT++ + ++FT++EL GT+LY T+ +Q++ +D +
Sbjct: 85 RSRESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSI----EDDHR 140
Query: 155 DLVKKLLVLDGD 166
LV +L + D
Sbjct: 141 ALVMQLFIQHRD 152
>gi|340503061|gb|EGR29686.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 286
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 64/129 (49%), Gaps = 13/129 (10%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLI 70
L+WL K ++ K+ Y + +G I S + + D +L +P IT Q+ +
Sbjct: 127 LLEWLNDGKCDIWKIKMVYYNNYRG--IHSKQKINKDETILFIPQKYMITLELCKQNTI- 183
Query: 71 GPECRAMFEDGEVD----DRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDE 125
C+ + E + ++ +++ E+ NS WKPYLD+LP F P+ +T++E
Sbjct: 184 ---CKQI-EQRNIKLLSPKHSILSIYILSEKKNPNSFWKPYLDILPCEFTTFPILYTEEE 239
Query: 126 LLELKGTTL 134
+ LKG+ +
Sbjct: 240 IQWLKGSLI 248
>gi|367036851|ref|XP_003648806.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
gi|346996067|gb|AEO62470.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
Length = 479
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 101/239 (42%), Gaps = 46/239 (19%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-------VLLVVPLDLAITPMRVLQD 67
W+ +N+V ++ S E KG+G+ + S G L+ VP DL + V +
Sbjct: 14 WVHLNEVSFNQVEVA-SIEGKGYGVVCKTDLSAGNADSNNPALISVPHDLVLNSTAVSEY 72
Query: 68 PLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSS----------WKPYLDMLPTTFG 116
R + + G R ++LFL V+ + S W YL LP T
Sbjct: 73 AKEDKNFRQLLDAVGRRSGRADVLLFLLVQTAMTSRSGPPLVGVSNPWTEYLRFLPETVL 132
Query: 117 NPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKL--------LVLDG--- 165
P +T+DE L L+GT+L A + ++ D + +V+K L+ DG
Sbjct: 133 VPTLWTEDERLLLQGTSLEAAVNAK----ISALDAEFAQIVEKSSDIACWRELLWDGGTV 188
Query: 166 ---DSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN--DHNSRGELI 219
D R ++++ +R L +P+ + P D +N+SA S D N + +++
Sbjct: 189 SFTDWIRLDALYRSRCLELPVHGESMVP-------CIDMLNHSATPSAYYDENPQDDVV 240
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RYLQST 200
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 68/154 (44%), Gaps = 13/154 (8%)
Query: 12 FLQWLQVNKVELRGCK---IKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
F+QW Q I + G G + + DG L +P L I+
Sbjct: 4 FIQWFQRKGAYFDAESVGIIGFPPSEGGRGAVALRDIPDGHTLFSIPRALTISTRTCSLP 63
Query: 68 PLIGPEC--RAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDD 124
G E +A G V +IL L E +SS W YLD+LP F P+++T+
Sbjct: 64 TKFGLEAWRKAQLHQGWVG----LILCLMWETAAGSSSKWAGYLDILPNRFDTPMFWTEY 119
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVK 158
+LLELKGT++ +L K + Y++K+ VK
Sbjct: 120 DLLELKGTSV--VEKLGKLDAQADYNEKLIPAVK 151
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 121/346 (34%), Gaps = 94/346 (27%)
Query: 8 KLEPFLQWLQVNKVELRGCKIK-YSDESKGF----GIFSSNEFSDGVLLVVPLDLAITPM 62
K FL+W N L IK D GF I SN+F L VPL L +T
Sbjct: 73 KWPAFLKWCSDNGAYLGSVSIKDLPDGEYGFVADEHIEESNQF-----LGVPLKLMMTTA 127
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLM-------ILFLTVERLRKNSS-WKPYLDMLPTT 114
+ L GP R DD +M +FL +E SS W PY+ LP +
Sbjct: 128 AAKKSKL-GPLLR--------DDPIMMSMSNVALAMFLILEFCTGESSFWHPYISTLPAS 178
Query: 115 FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
F L+F+ +EL L G+T+ +++ Y + IF
Sbjct: 179 FNTVLYFSVEELELLHGSTVLDEALKLHRSIARQY------------------SYFHKIF 220
Query: 175 WTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNS 234
T L LP+ F DL ++ V +
Sbjct: 221 RTHPLAKSLPYKDCFTY---DLYRW----------------------------AVSAVMT 249
Query: 235 QVNGATSTLTSTQG-----ETLWIEGLVPGIDFCNH-DLKAAATWEVDGTGLITGVPFSM 288
+ N LT T G T + +VP D CNH D K +++ L ++M
Sbjct: 250 RQNAV--PLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLRC---YAM 304
Query: 289 YLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
F +E++I YG + N E GFV N D + I
Sbjct: 305 -------RDFEKGQEVTIFYGRRTNAEFFIHNGFVFPENRHDSVDI 343
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 5/132 (3%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T DPL+GP ++ +D L
Sbjct: 34 GRGVKTLRPFKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFV 93
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LPT++ + ++FT++EL GT+LY T+ +Q++ +D +
Sbjct: 94 RSRESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSI----EDDHR 149
Query: 155 DLVKKLLVLDGD 166
LV +L + D
Sbjct: 150 ALVMQLFIQHRD 161
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 67/141 (47%), Gaps = 12/141 (8%)
Query: 14 QWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
+W N + G + + + +E GFG+ ++ + ++ + L VP L T + ++ ++G
Sbjct: 83 KWASENGASVEGFEXVNFKEE--GFGLRATRDIKAEELFLWVPRKLLXT-VESAKNSVLG 139
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
P + R + G + + L ER NS W+PY+ LP+ + PL+F +DE+
Sbjct: 140 PLYSQDRILQAXGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRY 195
Query: 129 LKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 196 LQSTQAIHDVFSQYKNTARQY 216
>gi|428162643|gb|EKX31766.1| hypothetical protein GUITHDRAFT_149078 [Guillardia theta CCMP2712]
Length = 581
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 35/156 (22%)
Query: 239 ATSTLTSTQGETLWI---------------EGLVPGIDFCNHDLKAAATWEVDGTGLITG 283
A ++S++GE LW E +VP +D NH +
Sbjct: 174 AAGRMSSSKGEFLWALACVSSRSFDADELGEVMVPILDCFNHKRPRDTAYSYRREEAPAR 233
Query: 284 VPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN--PDDYL------MIH 335
F + L E+E+ I+YG KG+ ELL YGF + +N PD + +
Sbjct: 234 AGFVLTSLR----DLGEEEEVYIAYGAKGSRELLLNYGFCVMDNVEPDGSMNDTVNAFVQ 289
Query: 336 YPAE---AIHSIPLSDSKALLL---EEQK--AQLRC 363
YP + I S L+DSKA L +EQK A LRC
Sbjct: 290 YPPDEWAKIRSCVLADSKAKLKRWGDEQKRFAPLRC 325
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQY 217
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 70/143 (48%), Gaps = 12/143 (8%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGTTLYRATELQKQNLLTLY 149
+L+ T Q +N Y
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQY 217
>gi|367024041|ref|XP_003661305.1| hypothetical protein MYCTH_107134, partial [Myceliophthora
thermophila ATCC 42464]
gi|347008573|gb|AEO56060.1| hypothetical protein MYCTH_107134, partial [Myceliophthora
thermophila ATCC 42464]
Length = 506
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 99/244 (40%), Gaps = 41/244 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-------DGVLLVVPLDLAITP 61
+E W ++N V K+ + E KG+G+ S + S LL VP L +
Sbjct: 8 IEGLPAWARLNDVVFHDVKVTRT-ERKGYGMVSGTDLSVTDDTTDSPTLLAVPHALVLNA 66
Query: 62 MRVLQDPLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSS----------WKPYLDM 110
V + R + + G R +LFL V+ + + S W YL
Sbjct: 67 EAVSEYAKEDKNFRQLLDAVGHGSTRTDALLFLLVQTVLASRSSHPAVGVSNPWTEYLKF 126
Query: 111 LPTTFGNPLWFTDDELLELKGTTLYRATELQKQNL---LTLYDDKVKDLVKKLLVLDG-- 165
LP T P +T++E L L+GT+L A + + L L +K D+V +L G
Sbjct: 127 LPETVPVPTLWTEEERLLLRGTSLETAVDAKISALDAEFGLVREKSSDIVGWNDLLWGAE 186
Query: 166 --------DSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN--DHNSR 215
D R ++++ +R L +P + P D IN+SA S D N R
Sbjct: 187 GASAVSFMDWIRLDALYRSRCLELPRSGESMVPC-------IDMINHSATPSAYYDENPR 239
Query: 216 GELI 219
E++
Sbjct: 240 DEVV 243
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/181 (22%), Positives = 77/181 (42%), Gaps = 11/181 (6%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPE 73
WL+ + EL G ++ + + G G+ + F G +L +P T D ++GP
Sbjct: 4 WLRESGAELDGLELAHF-PAIGRGVRTLRCFKQGERILTIPSGCLWTVEHAYADAVLGPV 62
Query: 74 CRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
R+ V+D + + R + ++ LP ++ + ++F DDEL G++
Sbjct: 63 LRSAQPPLSVEDTLAIYILFVRSRESGYDGLRSHVAALPASYSSSIFFEDDELEVCAGSS 122
Query: 134 LYRATELQKQNLLTLYDDKVKDL---------VKKLLVLDGDSERANSIFWTRALNIPLP 184
LY T +Q + Y V + + K + + + A W+RA++ LP
Sbjct: 123 LYTITRQLEQRIEEDYRGLVVRVFGLHLDLFPLNKFTIENVGYKWALCTVWSRAMDFVLP 182
Query: 185 H 185
+
Sbjct: 183 N 183
>gi|330924024|ref|XP_003300479.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
gi|311325361|gb|EFQ91406.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
Length = 642
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 68/150 (45%), Gaps = 20/150 (13%)
Query: 9 LEPFLQWLQVNKVELRG-CKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD 67
+E + W N +L +I Y+D ++GF + + S V+ PL L ++ + + D
Sbjct: 15 VESLVAWFTSNGGQLSPDVQIVYND-ARGFHMRALQPLSSPVVTTCPLKLTLSHLNL--D 71
Query: 68 PLIGPECRAMFE------DGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTT--FGNP 118
P G E E G + D L L L +R R +S W Y+ LP+ P
Sbjct: 72 P--GEEGVLHIESPLRQCQGRIPDHILTYLLLIEQRDRGQASPWHAYIACLPSPRDMTTP 129
Query: 119 LWFTDDELLELKGTTLY-----RATELQKQ 143
LWF + ++ L GT+L R ELQ+Q
Sbjct: 130 LWFNEGDMAFLAGTSLAPAAKERRAELQQQ 159
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 241 STLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHS 300
ST + ET+ I L P +D NH + A W+ S L ++ SF
Sbjct: 197 STHILPEKETVPI--LFPVVDILNHSVSAKVEWDFQPRQ-------SFALKCLDGHSFQP 247
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNP 328
+E+ +Y K N+ELL YGF +++NP
Sbjct: 248 RQELFNNYAPKQNDELLLGYGFCLEDNP 275
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 55/118 (46%), Gaps = 15/118 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH K W D + S L++ F++ YG KGNEE
Sbjct: 189 LLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNN-------YGPKGNEE 241
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIH-SIPLSDSKALLLEE----QKAQLRCLLPKS 368
LL YGF + +NP D + + AIH +P D KA +LE Q + L LPKS
Sbjct: 242 LLMGYGFCLPDNPFDTVTLKV---AIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKS 296
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 19/128 (14%)
Query: 25 GCKIKYS-------DESKGFGIF---SSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPE 73
GCK+ S D++ FG + + N+ + D +L+ P + AIT + + E
Sbjct: 12 GCKLHKSVEFIQSRDDNACFGSYIAVAQNDIAPDQLLISCPFEYAITYNKAKE------E 65
Query: 74 CRAMFEDGE-VDDRFLMILFLTVERLRK-NSSWKPYLDMLPTTFGNPLWFTDDELLELKG 131
+ + + E + + FL +E L+ S W Y++ LP TF PL+F +++ L
Sbjct: 66 LKKLNPNFESCNPHITLCTFLALESLKGIQSKWYGYIEYLPKTFNTPLYFNENDNAFLIS 125
Query: 132 TTLYRATE 139
T Y A +
Sbjct: 126 TNAYSAAQ 133
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 64/130 (49%), Gaps = 19/130 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLI 70
L+WL E K+ Y + +G + + + G +L +P+D IT L+
Sbjct: 351 LLKWLSDTSSEFNKIKMVYYNNYRG--VHARQKIKKGECILFIPVDNMIT----LELSKE 404
Query: 71 GPECRAMFEDGEV-----DDRFLMILFLTVERLRKNSSWKPYLDMLP---TTFGNPLWFT 122
P C+ + E + FL I ++ +E+ S WKP+LD+LP TTF P+ +T
Sbjct: 405 LPICQ-LIESKNIRLLSPKHTFLSI-YIIIEKKNHKSFWKPFLDILPVEYTTF--PILYT 460
Query: 123 DDELLELKGT 132
D+EL LKG+
Sbjct: 461 DEELFWLKGS 470
>gi|302679960|ref|XP_003029662.1| hypothetical protein SCHCODRAFT_59023 [Schizophyllum commune H4-8]
gi|300103352|gb|EFI94759.1| hypothetical protein SCHCODRAFT_59023 [Schizophyllum commune H4-8]
Length = 522
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 71/151 (47%), Gaps = 12/151 (7%)
Query: 9 LEPFLQWLQVN----KVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMR 63
L F QW N E G ++ +E G G+ + + +D L +P + ++
Sbjct: 3 LGAFEQWFVANGGFVDKEYVGL-TEFPEEEGGRGMVALKDIPADHTLFSIPRSIVLSTRT 61
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFL-MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
L G + +++ ++D + +IL + E + + WK YLD LPT F P+++T
Sbjct: 62 SPLPSLFGADA---WKERQLDKGWGGLILCMMWESAQPDRKWKGYLDSLPTEFSTPMFWT 118
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKV 153
++E+ EL GT + + KQ Y +KV
Sbjct: 119 EEEIAELTGTAVV--DHIGKQEADQEYTNKV 147
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/142 (24%), Positives = 69/142 (48%), Gaps = 12/142 (8%)
Query: 13 LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ ++
Sbjct: 1 MKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSVL 57
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 58 GPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVR 113
Query: 128 ELKGTTLYRATELQKQNLLTLY 149
L+ T Q +N Y
Sbjct: 114 YLQSTQAIHDVFSQYKNTARQY 135
>gi|384501024|gb|EIE91515.1| hypothetical protein RO3G_16226 [Rhizopus delemar RA 99-880]
Length = 354
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 11/77 (14%)
Query: 255 GLVPGIDFCNHDL-KAAATWEV-DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
LVP D NH L ++ WE+ D GL +L + ++E+++ YG+K
Sbjct: 107 ALVPYFDLANHSLNESNIKWELTDDEGL---------MLVTTKDIKSQDEELTLFYGSKS 157
Query: 313 NEELLYLYGFVIDNNPD 329
N+ELL+L+GF I +NP+
Sbjct: 158 NQELLFLHGFCIQDNPE 174
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 76/350 (21%), Positives = 134/350 (38%), Gaps = 93/350 (26%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELR----GCKIK-YSDESKGFGIFSSNEF-SDGVLLVVP 54
M +S E E F L E R G +I ++DE G+G+ ++ + ++ + L +P
Sbjct: 67 MSVSFEGIREDFFSELMAWAAECRASCDGFEISNFADE--GYGLKATKDIKAEELFLWIP 124
Query: 55 LDLAITPMRVLQDPLIGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDML 111
+ +T + ++ ++GP + R + G V + L L ER +S W PY+ L
Sbjct: 125 RKMLMT-VESAKNSVLGPLYSQDRILQAMGNVT----LALHLLCERANPSSPWLPYIKTL 179
Query: 112 PTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERAN 171
P+ + PL+F ++E+ L T Q++L+ Y + + V+
Sbjct: 180 PSEYDTPLYFEEEEVRHLLATQAI-------QDVLSQYKNTARQYAYFYKVIHTHP---- 228
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQ 230
A +PL ++ F +D+ + ++ N I
Sbjct: 229 -----NASKLPLKDAFTF--------------------DDYRWAVSSVMTRQNQIPT--- 260
Query: 231 RVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYL 290
+G+ TL L+P D CNH GLIT + Y
Sbjct: 261 -----ADGSRVTL-----------ALIPLWDMCNHT-----------NGLIT----TGYN 289
Query: 291 LSVERSS------FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
L +R + ++I I YG + N E + GF ++N D + I
Sbjct: 290 LEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 76/328 (23%), Positives = 117/328 (35%), Gaps = 104/328 (31%)
Query: 38 GIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVE 96
G+F++ G +V +P DL +T ++ +C E D L +L L +
Sbjct: 11 GLFAARSIRAGEQIVRIPHDLVLTAEKL-------DDCVKKLLSTEYDWCPLTLLILAEQ 63
Query: 97 RLRKNSSWKPYLDMLPTTFGN---PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
+ S W PY+ LP+ FG+ +++ +EL L+ T +R T +++ + Y V
Sbjct: 64 HKGEASRWAPYVSCLPS-FGDHHSTIFWEKEELKFLECTRAFRGTAERREMISDEYI-SV 121
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN 213
K+++ PH +
Sbjct: 122 KNVISSC-----------------------PHVF-------------------------- 132
Query: 214 SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATW 273
GE I+ A V+ NGA S+ S + P +DFCNHD + AT
Sbjct: 133 --GEDISLFQFAHAYATVVSRAWNGALSSEISMR----------PFVDFCNHDPVSHAT- 179
Query: 274 EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLM 333
V S + ISYG + N L YGFV+ NN D
Sbjct: 180 -------------------VSHDSCKDATVVFISYGKRSNAVLAVDYGFVLPNNLSDQ-- 218
Query: 334 IHYPAEAIHSIPLSDSKALLLEEQKAQL 361
AE IP +D L E+K +L
Sbjct: 219 ----AELWMEIPWNDP----LREKKLEL 238
>gi|312101598|ref|XP_003149686.1| hypothetical protein LOAG_14135 [Loa loa]
Length = 314
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 50/240 (20%), Positives = 99/240 (41%), Gaps = 44/240 (18%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ + L ++L S W+PY+ +LP F PL+FT ++L L+ + L+ + L L
Sbjct: 12 LAMMLCCQKLIPESHWQPYIKVLPECFDTPLFFTVEQLQCLRPSPLFEES-------LLL 64
Query: 149 YDDKVKDLVKKLL-VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
Y + + + LL ++ D R ++++ +++ + I ++
Sbjct: 65 YRNVSRQFIHFLLEIIRSDEFRHRK-----------------KKSKDKISELEPIYVNSP 107
Query: 208 LSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ-GETLWIEGLVPGIDFCNHD 266
L+ + + N + +++++N S + G+ I GL+P +D NH
Sbjct: 108 LTAANFT-------FNLYRWSVACISTRINMIPSEVWKDDIGQPRMIPGLIPFLDMANHS 160
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK---EISISYGNKGNEELLYLYGFV 323
+A E V FS E + K ++I YG + N + L GF+
Sbjct: 161 YTESAFHE--------AVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNRDFLLHNGFI 212
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 117/318 (36%), Gaps = 75/318 (23%)
Query: 36 GFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ ++ + G L++ +P IT VLQ +G R V + FL
Sbjct: 59 GRGLMATRDLKPGELIIALPETCLITTETVLQS-YLGKYIR--LWRPHVSPLLALCTFLI 115
Query: 95 VERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
ER + S WKPYLD++P+T+ P+++ + E++ L L + QK + L+ +
Sbjct: 116 AERFAGDCSQWKPYLDVIPSTYSCPVYW-ELEIIHLLPAPLRKKALEQKTEVQELHTE-- 172
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN 213
S+ + +L PL F N D+ YD++ +
Sbjct: 173 ------------------SLAFFSSLQ-PL-----FCDNVADIYTYDALRWAW------- 201
Query: 214 SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIE----GLVPGIDFCNH--DL 267
VN T + TQ + L + L P +D NH ++
Sbjct: 202 --------------------CTVNTRTVYMKHTQQDRLLAQQDVCALAPYLDLLNHSPEV 241
Query: 268 KAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN 327
+ A + D Y + S + I YG N+ LL YGFV NN
Sbjct: 242 QVEAEFSKDR---------RCYEIRTN-SGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291
Query: 328 PDDYLMIHYPAEAIHSIP 345
P + + A H P
Sbjct: 292 PHRSVYVTKDAILAHLSP 309
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 76/350 (21%), Positives = 134/350 (38%), Gaps = 93/350 (26%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELR----GCKIK-YSDESKGFGIFSSNEF-SDGVLLVVP 54
M +S E E F L E R G +I ++DE G+G+ ++ + ++ + L +P
Sbjct: 67 MSVSFEGIREDFFSELMAWAAECRASCDGFEISNFADE--GYGLKATKDIKAEELFLWIP 124
Query: 55 LDLAITPMRVLQDPLIGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDML 111
+ +T + ++ ++GP + R + G V + L L ER +S W PY+ L
Sbjct: 125 RKMLMT-VESAKNSVLGPLYSQDRILQAMGNVT----LALHLLCERANPSSPWLPYIKTL 179
Query: 112 PTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERAN 171
P+ + PL+F ++E+ L T Q++L+ Y + + V+
Sbjct: 180 PSEYDTPLYFEEEEVRHLLATQAI-------QDVLSQYKNTARQYAYFYKVIHTHP---- 228
Query: 172 SIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN-SRGELINGLNDIKNEAQ 230
A +PL ++ F +D+ + ++ N I
Sbjct: 229 -----NASKLPLKDAFTF--------------------DDYRWAVSSVMTRQNQIPT--- 260
Query: 231 RVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYL 290
+G+ TL L+P D CNH GLIT + Y
Sbjct: 261 -----ADGSRVTL-----------ALIPLWDMCNHT-----------NGLIT----TGYN 289
Query: 291 LSVERSS------FHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
L +R + ++I I YG + N E + GF ++N D + I
Sbjct: 290 LEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 70/156 (44%), Gaps = 30/156 (19%)
Query: 2 EISTEAKLEPFLQWLQ----------VNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVL 50
E + K + FL W + ++ V+LR + G GI ++ + D VL
Sbjct: 3 ETDFDKKSQDFLVWFKNLPGATFHDHISIVDLRA-------RNAGRGIVTTADIEPDTVL 55
Query: 51 LVVPLDLAITPMR---VLQDPLI-------GPECRAMFEDGEVDDRFL-MILFLTVERLR 99
+P I P + P + GP +DG D + +IL + E LR
Sbjct: 56 FTIPRQHIIYPANSELARKLPAVFAGVGGSGPTPDVDDDDGRAQDSWTSLILVMIHEHLR 115
Query: 100 KNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
++S W+PYLD+LP F P++++ EL EL+ + +
Sbjct: 116 GSASPWRPYLDVLPARFETPMFWSAAELAELQASPV 151
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 116/318 (36%), Gaps = 75/318 (23%)
Query: 36 GFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ ++ + G L++ +P IT VLQ +G R V + FL
Sbjct: 59 GRGLMATRDLKPGELIIALPETCLITTETVLQS-YLGKYIR--LWRPHVSPLLALCTFLI 115
Query: 95 VERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
ER + S WKPYLD++P+T+ P+++ + E++ L L + QK + L+ + +
Sbjct: 116 AERFAGERSQWKPYLDVIPSTYSCPVYW-ELEIVHLLPAPLRQKALEQKTEVQELHTESL 174
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN 213
NS+ PL F N D+ YD++ +
Sbjct: 175 AFF--------------NSL-------QPL-----FCDNVADIYTYDALRWAW------- 201
Query: 214 SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIE----GLVPGIDFCNH--DL 267
VN T + TQ + L + L P +D NH ++
Sbjct: 202 --------------------CTVNTRTVYMKHTQQDRLLAQQDVCALAPYLDLLNHSPEV 241
Query: 268 KAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN 327
+ A + D Y + S + I YG N+ LL YGFV NN
Sbjct: 242 QVEAEFSKDR---------RCYEIRTN-SGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291
Query: 328 PDDYLMIHYPAEAIHSIP 345
P + + A H P
Sbjct: 292 PHRSVYVTKDAILAHLSP 309
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 89/252 (35%), Gaps = 75/252 (29%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ L ER NS W+PY+ LP+ + PL+F ++E+ L+ T ++ +
Sbjct: 29 LAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAI-------HDVFSQ 81
Query: 149 YDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAEL 208
Y + + V+ A +PL S+ + ED S
Sbjct: 82 YKNTARQYAYFYKVIQTHP---------HANKLPLKESFTY----EDYRWAVS------- 121
Query: 209 SNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLK 268
++ N I E +G+ TL L+P D CNH
Sbjct: 122 --------SVMTRQNQIPTE--------DGSRVTL-----------ALIPLWDMCNHT-- 152
Query: 269 AAATWEVDGTGLITGVPFSMYLLSVERSS------FHSEKEISISYGNKGNEELLYLYGF 322
GLIT + Y L +R F + +I I YG + N E + GF
Sbjct: 153 ---------NGLIT----TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGF 199
Query: 323 VIDNNPDDYLMI 334
DNN D + I
Sbjct: 200 FFDNNSHDRVKI 211
>gi|393904017|gb|EJD73630.1| SET domain-containing protein 3 [Loa loa]
Length = 444
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 50/240 (20%), Positives = 99/240 (41%), Gaps = 44/240 (18%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ + L ++L S W+PY+ +LP F PL+FT ++L L+ + L+ + L L
Sbjct: 6 LAMMLCCQKLIPESHWQPYIKVLPECFDTPLFFTVEQLQCLRPSPLFEES-------LLL 58
Query: 149 YDDKVKDLVKKLL-VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
Y + + + LL ++ D R ++++ +++ + I ++
Sbjct: 59 YRNVSRQFIHFLLEIIRSDEFRHRK-----------------KKSKDKISELEPIYVNSP 101
Query: 208 LSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ-GETLWIEGLVPGIDFCNHD 266
L+ + + N + +++++N S + G+ I GL+P +D NH
Sbjct: 102 LTAANFT-------FNLYRWSVACISTRINMIPSEVWKDDIGQPRMIPGLIPFLDMANHS 154
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK---EISISYGNKGNEELLYLYGFV 323
+A E V FS E + K ++I YG + N + L GF+
Sbjct: 155 YTESAFHE--------AVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNRDFLLHNGFI 206
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 14/112 (12%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDP-- 68
F++W N G I+ G G+F++ +F + ++ +P+ L IT + + P
Sbjct: 6 FMEWAVGNGAYHSGIDIRDCSNEGGKGLFATTDFRENETIISIPVGLIITAGFIAEMPDY 65
Query: 69 --LIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNP 118
+ C FE ++ F VE+ +NS W PYL++LP +F P
Sbjct: 66 CDVFKRYCLKPFE--------ALVYFFLVEK-EQNSKWTPYLEVLPKSFSTP 108
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 16/165 (9%)
Query: 30 YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLM 89
+SD +G E + +L +P L T R DP++GP + VDD
Sbjct: 21 FSDAGRGIRALRRFEEKEKIL-TIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLAT 79
Query: 90 -ILFLTVERLRKNSSWKP--YLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLL 146
ILF+ R RK+ P ++ LP ++ + ++F D EL G++LY T+ + +
Sbjct: 80 YILFI---RARKSGYDGPQSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIE 136
Query: 147 TLYDDKVKDLVKK--------LLVLDGDSERANSIFWTRALNIPL 183
Y D V L + +D D + A W+RA++ L
Sbjct: 137 VDYKDLVARLFGRHRDVFPSDKFTID-DYKWALCTVWSRAMDFKL 180
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 56/132 (42%), Gaps = 14/132 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITG---------VPFSMYLLSVERSSFHSEKEIS 305
L P +DF NH A V+G L T P +Y +++ + ++I
Sbjct: 234 ALAPFLDFFNHRCGAKT---VNGLSLSTSQIRDCLLKERPLELYYNLHTDTAYRAGEQIF 290
Query: 306 ISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLL 365
ISYG N +LL YGF I +NPDD+ + I++ D + L + + R L
Sbjct: 291 ISYGTHNNTKLLLEYGFSIPSNPDDF--VELTIGTINAFMKHDPELRCLRLPREKYRFLA 348
Query: 366 PKSLLEHGFFAA 377
L E FF
Sbjct: 349 DHRLDEQLFFVG 360
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 33/48 (68%), Gaps = 1/48 (2%)
Query: 88 LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
L+IL L E L+ +S W+PYLD+LP TF P++++ EL EL+ + L
Sbjct: 108 LLILVLIYEHLQGEASRWRPYLDVLPPTFDTPMFWSPTELSELQASAL 155
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 49/103 (47%), Gaps = 12/103 (11%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
+ L+P ID NH A W + LL +E S + +EIS +YG +
Sbjct: 223 VSALLPLIDLPNHRPMAKVEWRAGDE--------DIGLLVLEDHS--AGQEISNNYGPRN 272
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
NE+LL YGF I NP DY ++ + PL ++KA LE
Sbjct: 273 NEQLLINYGFCIAGNPTDYRIVLLGVKP--DSPLGEAKARQLE 313
>gi|402226353|gb|EJU06413.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 421
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 42/97 (43%), Gaps = 11/97 (11%)
Query: 256 LVPGIDFCNHDLKAAATW--------EVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
L+P +D NH TW E TG P Y+ V + + ++ +
Sbjct: 207 LLPVLDCMNHKRAHPVTWLTSTKLLPEALATGANENGP---YINLVHHPTLEAGLQVFNN 263
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSI 344
YG KGN E L YGF + NNP D + + P +H +
Sbjct: 264 YGPKGNAEFLLGYGFTLPNNPSDTIFLSIPQGQLHEV 300
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 39/71 (54%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
L +F+ +L + S W PY+ +LP +F L+F+D+EL EL+ + L T ++K +
Sbjct: 119 LQAVFIMYSKLNEKSFWHPYVTVLPESFSTSLYFSDNELDELQASQLKEFTIIRKDGIER 178
Query: 148 LYDDKVKDLVK 158
Y+ L K
Sbjct: 179 HYESTFSRLSK 189
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSI 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RCLQST 200
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLL 146
+ +FL +E L+ K S W PYL +LP F PL+F+D++L L+G L ATE++ + L+
Sbjct: 87 LCMFLCLEWLKGKESFWWPYLCVLPREFDTPLYFSDEDLQFLQGCNL-EATEVEARKLI 144
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 49/114 (42%), Gaps = 20/114 (17%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGL--ITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L P ID NH TW+ T L I+GV S + E+ +YG K N
Sbjct: 204 LFPLIDSLNHYPATIITWQPSDTSLRIISGVGVS------------AGAEVYNNYGPKAN 251
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPK 367
EELL YGF + NP D ++ S PL+ + +L E L L P+
Sbjct: 252 EELLMGYGFTLLQNPFDSFLLK------SSPPLTPLQHSILGESPTGLYHLTPR 299
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 6 EAKLEPFLQWLQVNKVEL--RGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPM 62
EA+ F WL+ V + + E +G G+ ++ + D ++ +P + +
Sbjct: 7 EARTATFSSWLKEMGVRTNPKMALVDLRQEGRGRGVVATGDIDDDEIIFSIPRNAVLNAQ 66
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
V PL P R +FE ++ ++ L E +NS W PYL +LP + ++++
Sbjct: 67 NVA--PL--PVSRRLFE--KMPSWLVLTSILMTEAQMENSKWAPYLAVLPERLDSLVFWS 120
Query: 123 DDELLELKGTTLYR 136
D EL EL+ + + +
Sbjct: 121 DSELAELQASAVVK 134
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 80/188 (42%), Gaps = 24/188 (12%)
Query: 54 PLDLAITPMRVLQDPLIGP-----ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYL 108
P+++ +RV D +I P E + + G + L + L + L + S W PY+
Sbjct: 8 PIEVGEQVLRVSGDLMITPNKLPTEVKELLPTGVTEWARLALFILVEQHLGQASQWAPYI 67
Query: 109 DMLPT--TFGNPLWFTDDELLELKGTTLYRATELQK--------------QNLLTLYDDK 152
+ LPT + +++ +EL ++ T+L+R T ++ Q ++ ++
Sbjct: 68 NCLPTCGALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPVLQKCPHIFGER 127
Query: 153 VKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDH 212
V K G S R +S TR L +P + N L YD AE+ D
Sbjct: 128 VLHSKFKQAYATGKSLRRSSN--TRILTVPFVDFFNHDSNCRALLSYDEERACAEVIADK 185
Query: 213 N-SRGELI 219
N +RGE +
Sbjct: 186 NYARGEQV 193
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 92/411 (22%), Positives = 151/411 (36%), Gaps = 95/411 (23%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKG-FGIFSSNEFSDGV-LLVVPLDLAITPMRVL 65
K FL+W N L IK D G +G+ + + + + L +P+ L +T
Sbjct: 43 KWPAFLKWCSENGAYLGSVAIK--DRPDGDYGLVAEEKIEESMQFLGIPMKLVMTTASAR 100
Query: 66 QDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSS-WKPYLDMLPTTFGN 117
+ L GP R DD + + +FL +E SS W PY+ +LP +F
Sbjct: 101 KSKL-GPLLR--------DDPIMKSMSNVALAIFLILELSAGESSFWHPYISVLPDSFNT 151
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTR 177
L+F +EL L G+ + +++ Y + IF T
Sbjct: 152 VLYFNIEELELLSGSAVLDEALKLHRSIARQY------------------AYFHKIFRTH 193
Query: 178 ALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGL--NDIKNEAQRVNSQ 235
L LP F DL ++ SA ++ + +GL +D+ +
Sbjct: 194 PLAKSLPFKDCFTY---DLYRW---AVSAVMTRQNAVPWTESDGLGGDDV---------E 238
Query: 236 VNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVER 295
++G + + LVP D CNH DG L + +
Sbjct: 239 IDGTAA-----------VTALVPLWDMCNHS---------DGKVLTDYDSSASMVRCYAM 278
Query: 296 SSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY------PAEAI-------H 342
F +E++I YG + N E GFV ++N D + I P A+ H
Sbjct: 279 RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCEDH 338
Query: 343 SIPLSDSKALLLEEQKAQ------LRCLLPKSLLEHGFFAAGHPKDGNNDN 387
+ LS + AL+ ++ LR L+ K + F+A H ++ N
Sbjct: 339 DLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAFSAEHILTSSDSN 389
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+IL + E L+ +S WK YLD+LP F P+++T DEL EL+GT+L
Sbjct: 109 LILVMLYEYLQGEASRWKTYLDILPQAFETPIFWTPDELKELEGTSL 155
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 85/173 (49%), Gaps = 21/173 (12%)
Query: 1 MEISTEAKLEPFLQWLQV-------NKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLV 52
M +++ + FL W + +E+R + + + G GI + + +D VL
Sbjct: 490 MATDFQSQTKVFLDWFKSLPGSTFSEHIEIRDLR----ERNAGRGIVALQDIPADTVLFT 545
Query: 53 VP----LDLAITPMRV-LQDPLIGPECRAMFEDGEVDDRF-LMILFLTVERLRKN-SSWK 105
VP +++ + +R L D + + ++ D + +I+ L E + + SSWK
Sbjct: 546 VPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLIIVLIYEYFKGDQSSWK 605
Query: 106 PYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVK 158
PYLD+LP +F P++++D E+ EL+ + +++ K N ++ K+ +++
Sbjct: 606 PYLDVLPASFETPMFWSDAEVDELQASATR--SKIGKTNAEEMFHAKILPVIR 656
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 21/142 (14%)
Query: 5 TEAKLEPFLQWLQVNKVE---LRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAIT 60
T LE WL N ++ L G + ++ +G G+ + +F G L VP L T
Sbjct: 64 TRDDLEQLRVWLLKNGLDSKWLEGIEFA-ANLPEGSGVVAKKDFKKGEPFLQVPRKLMFT 122
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPYLDMLPT 113
+ +Q+ +G + D+FL + L L VE+ +S W PY+ LP
Sbjct: 123 -CQAMQNTPLGQLLKV--------DKFLAQSPSLCLALHLLVEKHNHSSFWTPYIKTLPK 173
Query: 114 TFGNPLWFTDDELLELKGTTLY 135
++G L+FT +EL L+G+ +
Sbjct: 174 SYGTCLYFTLEELEGLRGSPTF 195
>gi|294654395|ref|XP_456446.2| DEHA2A02464p [Debaryomyces hansenii CBS767]
gi|199428848|emb|CAG84398.2| DEHA2A02464p [Debaryomyces hansenii CBS767]
Length = 613
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH++ A W V + T F +S +++ +YG KGNEE
Sbjct: 244 LLPIIDLLNHNMNAKVDWSVSSSNSCTYFNF-------RSNSAVPGEQLYNNYGMKGNEE 296
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAI 341
LL YGF +++N D L I P E +
Sbjct: 297 LLLAYGFCLEDNSADSAALKIKVPLEML 324
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 80/173 (46%), Gaps = 16/173 (9%)
Query: 50 LLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKP--Y 107
+L +P D T DPL+GP R++ V+D + ++L RLR++ P +
Sbjct: 40 ILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDT--LAVYLLFVRLREHGYEGPRSH 97
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV-------KDLVKKL 160
+ +P + + ++F +DEL GT+LY T+ ++ + Y V DL+
Sbjct: 98 VAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYRVLVMRVFTQHPDLLPLA 157
Query: 161 LVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKY-DSINNSAELSNDH 212
+ D + A W+RA++ LP+ + L + D IN+S E+ H
Sbjct: 158 KISIQDYKWALCTVWSRAMDFVLPNG----KPLRVLAPFADMINHSPEVKQCH 206
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 32/75 (42%), Gaps = 8/75 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P D NH + D P S L + + +I ISYG+ N
Sbjct: 189 LAPFADMINHSPEVKQCHAYD--------PSSGNLSVLAGKDYEIGDQIYISYGSIPNNR 240
Query: 316 LLYLYGFVIDNNPDD 330
LL LYGFVI NP+D
Sbjct: 241 LLRLYGFVIPENPND 255
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 13/103 (12%)
Query: 245 STQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEI 304
S +GET + LVP +DF NH +T V G +Y ++ RS + +++
Sbjct: 147 SLRGET--VGCLVPVLDFLNH-----STAPVAACGFCKDA--MVYRVTCLRS-YEEGEQV 196
Query: 305 SISYGNKGNEELLYLYGFVIDNNPDDYLMI---HYPAEAIHSI 344
I YGN N LL YGFV+++NP D M+ H P H I
Sbjct: 197 MIHYGNWSNAGLLEHYGFVLEDNPLDSCMLWLRHPPNPPEHLI 239
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
++W N G +I + E +GFG+ ++ E ++ + L VP L +T + ++ ++G
Sbjct: 83 IKWATENGASTEGFEIA-NFEEEGFGLKATREIKAEELFLWVPRKLLMT-VESAKNSVLG 140
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
+ R + G + + L ER NS W PY+ LP+ + PL+F +DE+
Sbjct: 141 SLYSQDRILQAMGNIT----LAFHLLCERANPNSFWLPYIQTLPSEYDTPLYFEEDEVQY 196
Query: 129 LKGT 132
L+ T
Sbjct: 197 LRST 200
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N + G + + + +E GFG+ ++ + ++ + L VP L +T + ++ +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEE--GFGLRATRDIKAEELFLWVPRKLLMT-VESAKNSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W+PY+ LP+ + PL+F ++E+
Sbjct: 139 LGPLYSQDRILQAMGNI----ALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 RCLQST 200
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 68/156 (43%), Gaps = 11/156 (7%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVL 65
E F+QW + + + +D G G + + +G VL +P L ++
Sbjct: 8 EAFIQWFLNHNGTMDREAMGIADFPAAEGGRGAVALKDLPEGHVLFTIPRALTLSTRTSR 67
Query: 66 QDPLIGPE--CRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
L G E R G LM+ + K S W YLD+LP F P+++ +
Sbjct: 68 LPELFGLEEWKRLKLHQGWAG---LMLCMMWEAAQGKESRWAGYLDILPAAFDTPMFWNE 124
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK 159
++L EL GT++ +L K++ YD K+K + K
Sbjct: 125 EDLSELAGTSI--VGKLGKEDAERDYDSKIKPAIAK 158
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+IL + E L+ +NS WKPY D+LP++F P++++D+EL +L+ + +
Sbjct: 102 LILIMIYEYLQGENSKWKPYFDVLPSSFDTPMFWSDNELDQLQASHM 148
>gi|303272707|ref|XP_003055715.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463689|gb|EEH60967.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 12/95 (12%)
Query: 238 GATSTLTSTQGETLWIEGLVPGIDFCNHDLKAA-ATWEVDGTGLITGVPFSMYLLSVERS 296
G T T + + LVP +D NH K +W +D G + L V
Sbjct: 293 GGVRGQTQTDEDESFAGALVPLLDCANHHRKPRECSWTIDEDGCV---------LVVAIR 343
Query: 297 SFHSEKEISISYGNKGNEELLYLYGFVIDNN--PD 329
+F + I I+YG +GN +L+ YGF +++N PD
Sbjct: 344 AFDAGGAIRIAYGARGNHDLMLRYGFAVEDNTEPD 378
>gi|393245275|gb|EJD52786.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 519
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 10/134 (7%)
Query: 29 KYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDP-LIGPE--CRAMFEDGEVD 84
K+ DE G G+ + E G L VP L ++P R Q P LIG + R G
Sbjct: 23 KFPDEEGGRGLVAVKEIQVGETLFAVPRTLLLSP-RTCQLPQLIGAQDWKRLNLHKGWSG 81
Query: 85 DRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQN 144
L++ L E S W Y +PT F +++T +EL +LKG+++ ++ K++
Sbjct: 82 ---LILCMLWEEAQGPASQWAGYFAAMPTEFSTLMFWTPEELEDLKGSSI--TEKIGKED 136
Query: 145 LLTLYDDKVKDLVK 158
+ + Y D+V VK
Sbjct: 137 VESEYHDRVLPAVK 150
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 62/125 (49%), Gaps = 10/125 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W N G +I + E +GFG+ ++ E ++ + L VP L +T + ++ ++
Sbjct: 82 LIKWATENGASTEGFEIA-NFEEEGFGLKATREIKAEELFLWVPRKLLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER NS W PY+ LP+ + PL+F +DE+
Sbjct: 140 GSLYSQDRILQAMGNIT----LAFHLLCERANPNSFWLPYIQTLPSEYDTPLYFEEDEVQ 195
Query: 128 ELKGT 132
L+ T
Sbjct: 196 YLRST 200
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 43/70 (61%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ L+L E+ + +S W Y+ +LP T +PL+++++EL ELKGT L + K+ L
Sbjct: 67 VALYLLHEKAKPHSDWSAYIRVLPRTLDSPLFWSEEELAELKGTQLLSSINGFKEFLKRE 126
Query: 149 YDDKVKDLVK 158
YD + ++++
Sbjct: 127 YDKVMTEVIE 136
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 43/70 (61%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ L+L E+ + +S W Y+ +LP T +PL+++++EL ELKGT L + K+ L
Sbjct: 67 VALYLLHEKAKPHSDWSAYIRVLPRTLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKRE 126
Query: 149 YDDKVKDLVK 158
YD + ++++
Sbjct: 127 YDKVMTEVIE 136
>gi|403215363|emb|CCK69862.1| hypothetical protein KNAG_0D01100 [Kazachstania naganishii CBS
8797]
Length = 599
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 8/76 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVD-GTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L+P +D NHD ++ W D G+ I Y + + +EI +YG KGNE
Sbjct: 234 LLPILDLLNHDYESKVLWSTDRGSAFI-------YQNLEDIETLKQGQEIFNNYGPKGNE 286
Query: 315 ELLYLYGFVIDNNPDD 330
ELL YGFV+++N +D
Sbjct: 287 ELLTSYGFVLEDNVND 302
>gi|207340708|gb|EDZ68974.1| YPL208Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 501
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
bisporus H97]
Length = 491
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 68/154 (44%), Gaps = 13/154 (8%)
Query: 12 FLQWLQVNKVELRGCK---IKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
F+QW Q I + G G + + +G L +P L I+
Sbjct: 4 FIQWFQRKGAYFDAESVGIIGFPPSEGGRGAVALRDIPNGHTLFSIPRTLTISTRTCSLP 63
Query: 68 PLIGPEC--RAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDD 124
G E +A G V +IL L E +SS W YLD+LP F P+++T+
Sbjct: 64 TKFGLEAWRKAQLHQGWVG----LILCLMWETAAGSSSKWAGYLDILPNRFDTPMFWTEY 119
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVK 158
+LLELKGT++ +L K + Y++K+ VK
Sbjct: 120 DLLELKGTSV--VEKLGKVDAQADYNEKLLPAVK 151
>gi|412994115|emb|CCO14626.1| unnamed protein product [Bathycoccus prasinos]
Length = 390
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 5/84 (5%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGNE 314
+PG+D NHD A A E+ S+ + ++ +++E +ISYGN + N+
Sbjct: 216 FIPGVDLLNHD--ANANCEIRLVSNKNNASTSIEVYAIR--DIENDEECTISYGNHRSND 271
Query: 315 ELLYLYGFVIDNNPDDYLMIHYPA 338
ELL YGF + NN +D + + A
Sbjct: 272 ELLRKYGFCVPNNRNDSIDVRLRA 295
>gi|50303389|ref|XP_451636.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640768|emb|CAH02029.1| KLLA0B02354p [Kluyveromyces lactis]
Length = 594
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 37/75 (49%), Gaps = 9/75 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH+ ++ W G S +E S EI +YG KGNEE
Sbjct: 220 LLPVLDLLNHENRSKIQWSCSSEG-------SFIFEKLEPVS--KGTEICNNYGAKGNEE 270
Query: 316 LLYLYGFVIDNNPDD 330
LLY YGFV+D N D
Sbjct: 271 LLYGYGFVVDGNEFD 285
>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
Length = 256
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+PG+D NH +W V P S+ L V ++ + E+ +YG K N E
Sbjct: 49 LLPGVDALNHARGQPVSWAVS---TAPNAPSSISL--VLHNAHPAGAELFNNYGPKPNAE 103
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + +NPDD +++
Sbjct: 104 LILGYGFALPHNPDDTIVL 122
>gi|349581613|dbj|GAA26770.1| K7_Rkm1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 583
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 430
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH +W V G V + V +F+ +EI +YG K N E
Sbjct: 202 LLPVLDSLNHRRAQPVSW-VIGRSSEPEVSTETTISIVSHDAFNEGQEIYNNYGPKPNSE 260
Query: 316 LLYLYGFVIDNNPDDYLMIH 335
L+ YGF + NNPDD +++
Sbjct: 261 LILGYGFSLPNNPDDTIVLQ 280
>gi|323302664|gb|EGA56470.1| Rkm1p [Saccharomyces cerevisiae FostersB]
Length = 583
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|151942593|gb|EDN60939.1| methyltransferase [Saccharomyces cerevisiae YJM789]
Length = 583
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|6325048|ref|NP_015116.1| Rkm1p [Saccharomyces cerevisiae S288c]
gi|74583777|sp|Q08961.1|RKM1_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 1
gi|1370432|emb|CAA97923.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285815336|tpg|DAA11228.1| TPA: Rkm1p [Saccharomyces cerevisiae S288c]
gi|392296227|gb|EIW07330.1| Rkm1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 583
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|365762711|gb|EHN04244.1| Rkm1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 583
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|323335277|gb|EGA76566.1| Rkm1p [Saccharomyces cerevisiae Vin13]
Length = 583
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|190407755|gb|EDV11020.1| methyltransferase [Saccharomyces cerevisiae RM11-1a]
Length = 583
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|320586350|gb|EFW99029.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 537
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSE------KEISISYG 309
L+P +D NHD A ATWE L G SV SF + +++ +YG
Sbjct: 258 LLPVLDLANHDPTARATWEATSAHLADGSASGHEAASVTGVSFRVQQRYAPGQQVFNNYG 317
Query: 310 NKGNEELLYLYGFVI 324
K N ELL YGFV+
Sbjct: 318 MKTNSELLLGYGFVL 332
>gi|259149949|emb|CAY86752.1| Rkm1p [Saccharomyces cerevisiae EC1118]
gi|323346113|gb|EGA80403.1| Rkm1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 583
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 26/149 (17%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
LE F QWL V K +G G+ + + + + V+L +P L I P V
Sbjct: 52 LEIFWQWLSDQGVVSGKSPAKPGVVKEGLGLIAERDIARNEVVLEIPKKLWINPDAVAAS 111
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRK-NSSWKPYLDMLPTTFGNPLWF----- 121
IG C + + LFL E+L+K S+W PYLD+LP T + +++
Sbjct: 112 D-IGNVCSGL------KPWISVALFLIREKLKKEGSTWWPYLDILPDTTNSTIYWWVLLV 164
Query: 122 ------------TDDELLELKGTTLYRAT 138
+++EL EL+GT L R T
Sbjct: 165 AFYVLVLSFQRRSEEELAELQGTQLLRTT 193
>gi|71995786|ref|NP_497604.2| Protein SET-27 [Caenorhabditis elegans]
gi|373220599|emb|CCD73865.1| Protein SET-27 [Caenorhabditis elegans]
Length = 502
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 51/250 (20%), Positives = 94/250 (37%), Gaps = 50/250 (20%)
Query: 85 DRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQ 143
D + LFL + ++ + S WK Y+ +LPTTF PL++++++LL+LK + ++ L +
Sbjct: 149 DNVGLALFLACQWIQNEKSKWKSYISILPTTFPTPLFYSEEQLLQLKPSPIFEEAILFYR 208
Query: 144 N--------LLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP--LPHSYVFPQNQ 193
LL + +K+ + ++ D + IF+ N+ P Y +
Sbjct: 209 TISRQFCYFLLAIAKNKIYEAAQRR--KDARNAMETPIFYNVPFNVANFTPKLYFW---- 262
Query: 194 EDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWI 253
+ G + +N + +E Q G + I
Sbjct: 263 --------------------AVGVVTTRVNMVPSENQ-------------VGEDGNPVII 289
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L+P +D NH+ L+ P + + E++I YG +
Sbjct: 290 PALIPVLDMANHENVLTDVLTEPIEDLVCYSPEEECAVITSHCDVKAGNEVTIFYGCRSK 349
Query: 314 EELLYLYGFV 323
E L GFV
Sbjct: 350 GEHLLHNGFV 359
>gi|323331144|gb|EGA72562.1| Rkm1p [Saccharomyces cerevisiae AWRI796]
gi|323351942|gb|EGA84481.1| Rkm1p [Saccharomyces cerevisiae VL3]
Length = 523
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 171 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 220
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 221 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 251
>gi|302307608|ref|NP_984333.2| ADR237Cp [Ashbya gossypii ATCC 10895]
gi|299789080|gb|AAS52157.2| ADR237Cp [Ashbya gossypii ATCC 10895]
gi|374107548|gb|AEY96456.1| FADR237Cp [Ashbya gossypii FDAG1]
Length = 574
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 96/251 (38%), Gaps = 60/251 (23%)
Query: 104 WKPYLDMLPT--TFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL 161
+KPYL +LPT P ++T+ ELL L+G +Y K+NL L ++
Sbjct: 105 FKPYLSVLPTHKEMHTPYFWTNSELLLLRGMDIYLKA---KKNLRQLVNE---------- 151
Query: 162 VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELI-N 220
W H V + K+ + NS+E + GE I N
Sbjct: 152 -------------W---------HELVTAGELRNDTKFYDLFNSSE----NFDAGEYISN 185
Query: 221 GLNDIKNEAQRVNSQVNGATSTLTSTQGETLWI--------EGLVPGIDFCNHDLKAAAT 272
L D A+S +S TL + L P ID NH T
Sbjct: 186 QLADPTTTDWTDFPAYLWASSIFSSRAFPTLILGTTTDLNEAFLNPIIDLLNHSAGTNVT 245
Query: 273 WEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD-- 330
W + + V FS + + E+ +YG+K N+ELL YGFV+ NN D
Sbjct: 246 WSYNEQ--VAAVTFST------AQTLETGDELYNNYGDKSNDELLLNYGFVLPNNEHDKS 297
Query: 331 YLMIHYPAEAI 341
L P+E+I
Sbjct: 298 TLCFRIPSESI 308
>gi|428165190|gb|EKX34191.1| hypothetical protein GUITHDRAFT_147375 [Guillardia theta CCMP2712]
Length = 681
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 10/89 (11%)
Query: 243 LTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK 302
+++T+G + L P D NHD + A ++ D S + + R+ E
Sbjct: 261 VSTTEGVKCFC--LCPLADMLNHDPSSPALFDFDPAT-------SCFAIRTSRAWSEGE- 310
Query: 303 EISISYGNKGNEELLYLYGFVIDNNPDDY 331
E++ISYG NE+LL YGFV+D+N ++
Sbjct: 311 EVTISYGELSNEDLLQFYGFVLDDNMHEF 339
>gi|365760461|gb|EHN02181.1| YHL039W-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 305
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 133/337 (39%), Gaps = 54/337 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAITPMRV 64
L+ L+W Q N + KI + + G F + FS D L+ VP L IT +
Sbjct: 7 LQNCLEWAQNNGAFIDP-KISFKITQEAGVSAFVNENFSPRPDQALIRVPEVLLITSQQA 65
Query: 65 LQD-PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
L + P E + V +L L + + S +KPYLD+LP P ++
Sbjct: 66 LSEFPQAVSEKNLL---SSVTQLYLSKLKFGPDAVHLKSFYKPYLDVLPLNLPQPYFWCT 122
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPL 183
DE++ L GT +Y LT+ +D + KL +E +F +AL+I
Sbjct: 123 DEVVNLHGTDVY----------LTM-----RDTLNKL------AEEWRQLF--QALSI-- 157
Query: 184 PHSYVFPQNQEDLNKYDSINNSAELS--------NDHNSRGELINGLNDIKNEAQRVNSQ 235
H+ Q+++ L+ + SA + N H N NS+
Sbjct: 158 EHA---AQDKQFLSLFQGSGKSAVVPLEQFCAHINLHKPEASEWNSFAAYLWSHCIFNSR 214
Query: 236 VNGATSTLTSTQGETLWIEG-LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
+ T EG L P +DF NH W ++ + + +
Sbjct: 215 AFPRIILNRADTDSTNLNEGFLYPIVDFLNHKNDVPVKWVMNEDNELCFMS--------Q 266
Query: 295 RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDY 331
++F ++ E+ +YGN NE+ L YGF +N D+
Sbjct: 267 SATFSAQGELFNNYGNISNEKCLLNYGFWDSSNKYDF 303
>gi|401623356|gb|EJS41459.1| YPL208W [Saccharomyces arboricola H-6]
Length = 583
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 44/91 (48%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NHD ++ W P + + + S +E+ +YG KGNEE
Sbjct: 231 LLPIIDLLNHDYRSKVQW----------YPENGWFCYEKMGSVSQSEELCNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L I P I +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKIKLPLGVISTI 311
>gi|406698545|gb|EKD01780.1| nucleus protein [Trichosporon asahii var. asahii CBS 8904]
Length = 533
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 66/157 (42%), Gaps = 13/157 (8%)
Query: 33 ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMIL 91
E G G ++ + G L +P ++ + PE A E G +IL
Sbjct: 31 EGMGNGAVATRDIPSGTALFSIPSSYLLSEHTSTLSTHLKPEDWASLEGGWT----RLIL 86
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
L E R S W+ YLD +P +F P+W+ +L LKGT + + + + YD+
Sbjct: 87 ALMWEDSRAESPWRAYLDAMPGSFSTPMWWPAPDLALLKGTDI--ENRIGRASADRDYDE 144
Query: 152 KVKDLVKK---LLVLDGDSE---RANSIFWTRALNIP 182
+V L+ + V D E R S +R+ +P
Sbjct: 145 RVAPLLAAYPGVFVGDFSKECYHRQGSRVLSRSFTVP 181
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 45.8 bits (107), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 70/162 (43%), Gaps = 21/162 (12%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSD--------GVLLVVPLDLAIT 60
+E F W +N V+ +I+ + E KGFG+ + ++ +D +L +P DL ++
Sbjct: 6 IEAFPAWALLNNVDFVDAEIR-NIEGKGFGLVAKHDITDESRDASGPATILRIPRDLVLS 64
Query: 61 PMRVLQDPLIGPECRAMFE-DGEVDDRFLMILFLTVE-----------RLRKNSSWKPYL 108
V + + + + + G R ++L+L R ++ W Y+
Sbjct: 65 AEAVEEYAKVDQNFKQLLDVAGHQSTRGDIMLYLLTHLVQSKATSPGTRAFASTPWTEYI 124
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
LP P +T+DE LKGT+L A + L + YD
Sbjct: 125 RFLPRPIPVPTMWTNDERELLKGTSLEAAVSAKLSALSSEYD 166
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 46/97 (47%), Gaps = 10/97 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGN 313
+VPG+D NH +A ++ G + LL S + EI+ISYG K
Sbjct: 211 AMVPGLDMANHSQSHSAYYDESSDG-------DVVLLPRPGSKIPAGAEITISYGEAKPA 263
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ ++ L +H EA+ PL +K
Sbjct: 264 AEMLFSYGFIDKDSTVKELTLHL--EALPDDPLGRAK 298
>gi|401886592|gb|EJT50619.1| nucleus protein [Trichosporon asahii var. asahii CBS 2479]
Length = 533
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 66/157 (42%), Gaps = 13/157 (8%)
Query: 33 ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMIL 91
E G G ++ + G L +P ++ + PE A E G +IL
Sbjct: 31 EGMGNGAVATRDIPSGTALFSIPSSYLLSEHTSTLSTHLKPEDWASLEGGWT----RLIL 86
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
L E R S W+ YLD +P +F P+W+ +L LKGT + + + + YD+
Sbjct: 87 ALMWEDSRAESPWRAYLDAMPGSFSTPMWWPAPDLALLKGTDI--ENRIGRASADRDYDE 144
Query: 152 KVKDLVKK---LLVLDGDSE---RANSIFWTRALNIP 182
+V L+ + V D E R S +R+ +P
Sbjct: 145 RVAPLLAAYPGVFVGDFSKECYHRQGSRVLSRSFTVP 181
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 63/126 (50%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W + N G + +++ +E GFG+ ++ E ++ + L VP L +T + + +
Sbjct: 82 LMEWCKENGASTDGFELVEFPEE--GFGLKATREIKAEELFLWVPRKLLMT-VESAKGSV 138
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W PY+ LP + PL+F +DE+
Sbjct: 139 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEV 194
Query: 127 LELKGT 132
L+ T
Sbjct: 195 QYLQST 200
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 63/126 (50%), Gaps = 12/126 (9%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W + N G + +++ +E GFG+ ++ E ++ + L VP L +T + + +
Sbjct: 81 LMEWCKENGASTDGFELVEFPEE--GFGLKATREIKAEELFLWVPRKLLMT-VESAKGSV 137
Query: 70 IGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+GP + R + G + + L ER NS W PY+ LP + PL+F +DE+
Sbjct: 138 LGPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEV 193
Query: 127 LELKGT 132
L+ T
Sbjct: 194 QYLQST 199
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 49/84 (58%), Gaps = 9/84 (10%)
Query: 82 EVDDR------FLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
EVDD+ +I+ + E + S WKPY+D+LP++F P++++D EL EL+ +
Sbjct: 579 EVDDKPQQDPWSTLIIVMMYEYFKGSESKWKPYIDVLPSSFETPMFWSDAELDELQASAT 638
Query: 135 YRATELQKQNLLTLYDDKVKDLVK 158
+++ K + ++ DKV +++
Sbjct: 639 R--SKVGKASAEEMFQDKVLPVIR 660
>gi|409079523|gb|EKM79884.1| hypothetical protein AGABI1DRAFT_119942 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 401
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 90/234 (38%), Gaps = 56/234 (23%)
Query: 106 PYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVL 163
PYLD LP PL FT E+ KG+ LY AT +++ L + + + +L+
Sbjct: 99 PYLDTLPPLNQLRTPLQFTKIEIETFKGSNLYHATLNRERQLKEEWQE-----CQSVLIS 153
Query: 164 DGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN 223
DS W R L +YV
Sbjct: 154 QNDS-WGKGFTWERYLTAA---TYV----------------------------------- 174
Query: 224 DIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEV---DGTGL 280
++ S + +L +T ET ++ L+PG+D NH A +W V D +G
Sbjct: 175 ----SSRAFPSTILSPNPSLIATP-ETKFV--LLPGVDAFNHKRAQAVSWSVTYPDKSGS 227
Query: 281 ITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ + V + + +EI +YG K N L+ YGF + NPDD +++
Sbjct: 228 LASSYKGPTISLVPHTKTSAGEEIFNNYGPKPNGNLILGYGFSLPANPDDTILL 281
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 63/125 (50%), Gaps = 12/125 (9%)
Query: 13 LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
++W + N G + +++ +E GFG+ ++ E ++ + L VP L +T + + ++
Sbjct: 8 MEWCKENGASTDGFELVEFPEE--GFGLKATREIKAEELFLWVPRKLLMT-VESAKGSVL 64
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
GP + R + G + + L ER NS W PY+ LP + PL+F +DE+
Sbjct: 65 GPLYSQDRILQAMGNI----TLAFHLLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEVQ 120
Query: 128 ELKGT 132
L+ T
Sbjct: 121 YLQST 125
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 45.4 bits (106), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 8/81 (9%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGN 313
+VPG+D NH KA A +E D ++ LL S +E++ISYG+ K
Sbjct: 316 AMVPGLDMVNHSSKATAYYEEDDHD-------NVVLLIRPGCPVRSGEEVTISYGDAKPA 368
Query: 314 EELLYLYGFVIDNNPDDYLMI 334
E+L+ YGF+ NN D L +
Sbjct: 369 SEMLFSYGFIDPNNIVDKLTL 389
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/163 (22%), Positives = 72/163 (44%), Gaps = 15/163 (9%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF----SDGVL---LVVPL 55
+S++ ++ F W +N V+ ++ E KGFG+ + + +DG + +P
Sbjct: 107 MSSQLPIDTFPAWAHLNDVQFTHVNLQDVGEGKGFGLVAHADLESAEADGTSKGPVTIPH 166
Query: 56 DLAITPMRVLQDPLIGPECRAMFED-GEVDDRFLMILFLTVERLRKN-------SSWKPY 107
DL ++ V + + + E G R ++L+L + + + + W Y
Sbjct: 167 DLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIMLYLVSQFAQSSRPKGLSPTPWTEY 226
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
+ +LP P +T+ E L L GT+L A E + +L +D
Sbjct: 227 IRLLPRPIPVPTMWTEPERLLLNGTSLEAALEAKLLSLGKEFD 269
>gi|171676308|ref|XP_001903107.1| hypothetical protein [Podospora anserina S mat+]
gi|170936220|emb|CAP60879.1| unnamed protein product [Podospora anserina S mat+]
Length = 495
Score = 45.4 bits (106), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 109/265 (41%), Gaps = 43/265 (16%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVP----LDLAITPMR 63
FL W Q I D + G GI + + + D VL +P L A +P++
Sbjct: 18 FLSWFQSLPGATFHKDIAIEDLRSRNAGRGIVAQADIAADTVLFTIPRNSILCAATSPLK 77
Query: 64 VLQDPLIGPECRAMFEDGEVDDR------FLMILFLTVERLRKNSS-WKPYLDMLPTTFG 116
+ + + E G+ D L+IL L E L+ +SS WKPYLD+LP+TF
Sbjct: 78 DILPEIFDLDNDDEDESGDESDGDNQNSWTLLILILIHEYLQGSSSQWKPYLDVLPSTFN 137
Query: 117 NPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWT 176
P+++T +L L+ + + +++ ++ + K+ +++ IF+
Sbjct: 138 TPMFWTPSQLSFLQASAV--TSKIGQEEADKMIASKILPVIRS----------HPQIFFP 185
Query: 177 RALN-------IPLPH-------SYVFPQNQEDLNKYDSINNSAELSNDHNSRGEL-ING 221
+ I L H SY F Q D+ + + N E D + L +
Sbjct: 186 SSATPLSDDQLIQLAHRMGSTIMSYAFDLEQ-DMEIPEQLENDDEWEEDREGKTMLGMVP 244
Query: 222 LNDIKNEAQRVNSQVNGATSTLTST 246
+ DI N N+ +N A LT+
Sbjct: 245 MADILNADAEFNAHINHAEDALTAV 269
>gi|156848754|ref|XP_001647258.1| hypothetical protein Kpol_1002p47 [Vanderwaltozyma polyspora DSM
70294]
gi|156117943|gb|EDO19400.1| hypothetical protein Kpol_1002p47 [Vanderwaltozyma polyspora DSM
70294]
Length = 579
Score = 45.4 bits (106), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 62/142 (43%), Gaps = 34/142 (23%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGL----ITGVPFSMYLLSVERSSFHSEKEISISYGNK 311
L+P +D NHD + W D + +P + E++ +YG K
Sbjct: 228 LLPVLDLLNHDYDSVVEWSSDNGDFQYKNMNTIP--------------ANTELTNNYGRK 273
Query: 312 GNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDS--KALLLEEQKAQLRCLLPKSL 369
GNEELL YGFVI+NN D + + I L D+ +L EE QL ++
Sbjct: 274 GNEELLSGYGFVIENNIYDSVAL--------KIKLQDTLITKILKEEPMLQL-----PTI 320
Query: 370 LEHGFFAAGHPKDGNNDNKLEV 391
+ +FA + +GN N E+
Sbjct: 321 SNYTWFAFDY-SNGNESNNKEI 341
>gi|308501895|ref|XP_003113132.1| CRE-SET-27 protein [Caenorhabditis remanei]
gi|308265433|gb|EFP09386.1| CRE-SET-27 protein [Caenorhabditis remanei]
Length = 501
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 110/283 (38%), Gaps = 48/283 (16%)
Query: 89 MILFLTVE-RLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+ LFL + L + S W PY+ +LP +F PL++TD++LL+LK + ++ + L
Sbjct: 153 LALFLATQWLLNEKSKWLPYISILPNSFPTPLFYTDEQLLQLKPSPIF-------EEALL 205
Query: 148 LYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
Y + L+ + N I+ E + N+ E
Sbjct: 206 FYRTISRQFCYFLMAV-----AKNKIY-------------------ESAQRRKDARNTME 241
Query: 208 LSNDHNSRGELINGLNDIKNEA-QRVNSQVNGATSTLTSTQGETL-WIEGLVPGIDFCNH 265
+N+ + N + A V ++VN S ++ + E I L+P +D NH
Sbjct: 242 TPLFYNAPFTVANLTPGLYFWAVGVVTTRVNMVPSEHSTDKDEKPNLIAALIPFLDMANH 301
Query: 266 DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFV-I 324
+ E L+ P + + +++I YG + E L GFV I
Sbjct: 302 E---NVVTEDPVEDLVCYSPAEECAVITSHCDLEAGNQVTIFYGCRSRGEHLLHNGFVPI 358
Query: 325 DNNPDDYLMIHYPAEAIHSIPLS----DSKALLLEEQKAQLRC 363
+ D L + IP + DSK L+E+ ++C
Sbjct: 359 HHQRQDVLKLKI------GIPKTDKTLDSKTKLIEKYVQNVQC 395
>gi|255720552|ref|XP_002556556.1| KLTH0H16126p [Lachancea thermotolerans]
gi|238942522|emb|CAR30694.1| KLTH0H16126p [Lachancea thermotolerans CBS 6340]
Length = 571
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +DF NH W+++ GV FS + +EI +YG+K NEE
Sbjct: 224 LYPIVDFLNHHSGQKVQWQLNKDR--NGVSFS------SGNQIEKGQEIFNNYGDKSNEE 275
Query: 316 LLYLYGFVIDNNPDD 330
LL YGF I NN +D
Sbjct: 276 LLLNYGFAIQNNMND 290
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 22/133 (16%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W G G + L +E + + +EIS +YG + NE+
Sbjct: 239 LLPFIDILNHRPLAKVEWRA-GKGNVA-------FLVLEDVA--AGQEISNNYGPRNNEQ 288
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ------------LRC 363
L+ YGF + NNP DY ++ A + ++ S+ L + A+
Sbjct: 289 LMMNYGFCLPNNPCDYRIVSLRAPPGSPLQMARSQQLQMFPGLAKETDDPYYVFNVFYPL 348
Query: 364 LLPKSLLEHGFFA 376
L P + +EH F+
Sbjct: 349 LAPDTPMEHSIFS 361
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 45.4 bits (106), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 59/135 (43%), Gaps = 22/135 (16%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W G G + L +E + + +EIS +YG + NE+
Sbjct: 249 LLPFIDILNHRPLAKVEWRA-GKGNVA-------FLVLEDVA--AGQEISNNYGPRNNEQ 298
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ------------LRC 363
L+ YGF + NNP DY ++ A + ++ S+ L + A+
Sbjct: 299 LMMNYGFCLPNNPCDYRIVSLRAPPGSPLQMARSQQLQMFPGLAKETDDPYYVFNVFYPL 358
Query: 364 LLPKSLLEHGFFAAG 378
L P + +EH F+
Sbjct: 359 LAPDTPMEHSIFSPA 373
>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
Length = 264
Score = 45.4 bits (106), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+PG+D NH +W V P S+ L V ++ + E+ +YG K N E
Sbjct: 188 LLPGVDALNHARGQPVSWAVSTA---PNAPSSISL--VLHNAHPAGAELFNNYGPKPNAE 242
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + +NPDD +++
Sbjct: 243 LILGYGFALPHNPDDTIVL 261
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 45.4 bits (106), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 61/123 (49%), Gaps = 17/123 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
++P +DF NH A W+ + T + L V +F E E+ +YG N++
Sbjct: 230 MLPLVDFSNHKPLAKIEWQAEATEI---------RLKVVEPTFTGE-EVHNNYGPLNNQQ 279
Query: 316 LLYLYGFVIDNNPDDY--LMIHYPAEAIHSIPLSDSKALLLEE-QKAQLRCLLPKSLLEH 372
L+ YGF I +NP D+ L ++ P + PL++++ +E Q+ + L K LL +
Sbjct: 280 LMTTYGFCIVDNPCDFRDLNVNAPPDT----PLANARQFRYQEFQEPHGKSLDNKCLLFN 335
Query: 373 GFF 375
F+
Sbjct: 336 IFY 338
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 10/89 (11%)
Query: 244 TSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSM----YLLSVERSSFH 299
S GE L +VP +DF NH +++ VD + P+ +L +R+ +
Sbjct: 270 ASRAGELL---AMVPYVDFINHSPFSSSY--VDAREVPKAFPWEEKEDEVVLFADRA-YK 323
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDNNP 328
+++ ISYG K N +LL LYGF +D NP
Sbjct: 324 KFEQVFISYGPKSNADLLLLYGFALDRNP 352
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 45.4 bits (106), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 96/215 (44%), Gaps = 16/215 (7%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMR 63
++A L+ F +W+ + E G G+ ++ G V+ VP L +
Sbjct: 54 SDAALQNFRRWISSQGADTGAASPTVVPE--GLGLVAARNLPRGEVVAEVPKKLWMDADA 111
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFT 122
V IG CR+ G++ + L + E R S W PYL +LP + ++++
Sbjct: 112 VAASD-IGRACRS---GGDLRPWVSVSLLILREAARGGDSLWAPYLAILPRQTDSTIFWS 167
Query: 123 DDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIP 182
++ELLE++GT L T K+ + + +D+ ++ L D+ + W +
Sbjct: 168 EEELLEIQGTQLLSTTMGVKEYVQSEFDNVEAKIIGPNKDLFPDTITFDDFLWAFGIL-- 225
Query: 183 LPHSYVFPQNQED----LNKYDSINNSAELSNDHN 213
S VFP+ + D + D IN+SA++++ +
Sbjct: 226 --RSRVFPELRGDKLALIPFADLINHSADITSKQS 258
>gi|169596576|ref|XP_001791712.1| hypothetical protein SNOG_01053 [Phaeosphaeria nodorum SN15]
gi|160701343|gb|EAT92548.2| hypothetical protein SNOG_01053 [Phaeosphaeria nodorum SN15]
Length = 206
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+IL + E L+ +S WKPY D+LP +F P+++T+ EL EL+GT L
Sbjct: 84 LILVILYEYLQGEASRWKPYFDVLPQSFDTPIFWTEAELKELEGTCL 130
>gi|302914506|ref|XP_003051150.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
77-13-4]
gi|256732088|gb|EEU45437.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
77-13-4]
Length = 499
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 9/97 (9%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
+ +VPG+D NH A +E D + + L VE + +E+SISYG+K
Sbjct: 229 DAMVPGLDMANHSHDPTAYYEEDDKDDVV----LLLRLGVEVTG---GEEVSISYGDKSP 281
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ ++ L + P EA+ PL +K
Sbjct: 282 AEMLFSYGFIDRDSAAHDLTL--PLEALPDDPLGKAK 316
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 70/145 (48%), Gaps = 24/145 (16%)
Query: 1 MEISTEAKLEPF----LQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVP 54
+ ++ E K E + ++W N G + + + +E GFG+ ++ E ++ + L VP
Sbjct: 67 LSVTFEGKREDYFPDLIKWAAANGASTDGFELVNFKEE--GFGLRATREIKAEELFLWVP 124
Query: 55 LDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPY 107
L +T + ++ ++G A++ DR L + L ER +S W PY
Sbjct: 125 RKLLMT-VESAKNSVLG----ALYSQ----DRILQAMGNITLAFHLLCERANPSSFWLPY 175
Query: 108 LDMLPTTFGNPLWFTDDELLELKGT 132
+ LP+ + PL+F +DE+ L+ T
Sbjct: 176 IQTLPSEYDTPLYFEEDEVQHLQST 200
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 14/116 (12%)
Query: 38 GIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVE 96
G+F+S G +L V DL ITP ++ PE +V + LFL
Sbjct: 2 GLFASRPIHTGECMLHVSHDLMITPEKL-------PEEVTKLLSKDVSAWAKLALFLLAH 54
Query: 97 RLRK-NSSWKPYLDMLPTTFG---NPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ +K S+W PY+ LP FG + +++T DEL+ LK + +YR T +Q+++++ +
Sbjct: 55 QKKKETSAWAPYISCLP-PFGSMHSTIFWTQDELVYLKVSPVYRET-VQRKDVVRM 108
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 45/112 (40%), Gaps = 9/112 (8%)
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISY 308
ET+ LVP +DF NHD A D V V + + ++ ISY
Sbjct: 146 ETIKSLALVPFVDFFNHDANCRAMLSYDEDRHCAEV--------VSDRDYATGDQVVISY 197
Query: 309 GNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ 360
G N L +GF + NP D + + + + PL DSK LL Q
Sbjct: 198 GQLSNATLALDFGFALPFNPHDQVAGIWLSLS-EKDPLRDSKLKLLHSHNMQ 248
>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
2508]
gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 533
Score = 45.1 bits (105), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 37/58 (63%), Gaps = 2/58 (3%)
Query: 79 EDGEVDDRF-LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
ED + D + L+IL L E L+ +SS W PYL +LP F P+++T+ EL EL+ + L
Sbjct: 120 EDSQSQDSWTLLILILMHEYLQGSSSNWSPYLSILPHQFDTPMFWTEAELAELQASAL 177
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 45.1 bits (105), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 30/128 (23%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F G ++L +P D+ + DP +GP R++ V+D +
Sbjct: 25 GRGVKTLRRFKQGEMILTIPSDVLWSVEHAYSDPNLGPALRSVMPPLSVEDILATYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP + + ++F++ EL GT+LY T+ +Q ++K
Sbjct: 85 RSRESGYDGLRTHVSALPGIYSSSIFFSEGELEVCAGTSLYTVTKQLEQ--------RIK 136
Query: 155 DLVKKLLV 162
D ++L V
Sbjct: 137 DDYRQLAV 144
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 45.1 bits (105), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 14/116 (12%)
Query: 38 GIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVE 96
G+F+S G +L V DL ITP ++ PE +V + LFL
Sbjct: 2 GLFASRPIHTGECMLHVSHDLMITPEKL-------PEEVTKLLSKDVSAWAKLALFLLAH 54
Query: 97 RLRK-NSSWKPYLDMLPTTFG---NPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+ +K S+W PY+ LP FG + +++T DEL+ LK + +YR T +Q+++++ +
Sbjct: 55 QKKKETSAWAPYISCLP-PFGSMHSTIFWTQDELVYLKVSPVYRET-VQRKDVVRM 108
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 6/121 (4%)
Query: 30 YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLM 89
+SD +G + G+L+ +P +T VL +GP ++ +
Sbjct: 55 FSDTDRGLQTLQPIQ-PGGMLVSLPESCLLTTSTVLHS-YLGPFLKSW--KPRPSSLVAL 110
Query: 90 ILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+FL ER R +S W PY+D+LP ++ P +FTD + L RA E Q++ L L
Sbjct: 111 CVFLVCERHRGEASDWFPYIDVLPCSYCCPPYFTDTVMAVLPSGVRRRAEE-QREGLQHL 169
Query: 149 Y 149
Y
Sbjct: 170 Y 170
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 3/82 (3%)
Query: 247 QGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI 306
QGETL LVP D NH + A + G ++ + ++I I
Sbjct: 285 QGETL---ALVPYADLINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYI 341
Query: 307 SYGNKGNEELLYLYGFVIDNNP 328
SYG K N ELL LYGF ++ NP
Sbjct: 342 SYGPKSNAELLLLYGFAVERNP 363
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 66/130 (50%), Gaps = 11/130 (8%)
Query: 26 CKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVD 84
K + ++ES G G+ + + +DG LL +P+ L +T + +G + E++
Sbjct: 116 TKDEITNESSGRGLLARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLP----SEIN 169
Query: 85 DRFLMILFLTVER-LR-KNSSWKPYLDMLPTTFG-NPLW-FTDDELLELKGTTLYRATEL 140
+ M L ER +R + S WKPYLD+LP NP + + D++L L G+ + AT+
Sbjct: 170 EYLAMACHLIYERNVRGEESPWKPYLDVLPDIDEVNPTFTWPDEDLAFLNGSPVIAATKS 229
Query: 141 QKQNLLTLYD 150
+ L YD
Sbjct: 230 LQMKLRREYD 239
>gi|451854686|gb|EMD67978.1| hypothetical protein COCSADRAFT_32937 [Cochliobolus sativus ND90Pr]
Length = 184
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 59/134 (44%), Gaps = 19/134 (14%)
Query: 33 ESKGFGIFSSNEFSDGVLLVVPLDLAITPMRV---------LQDPLIGPECRAMFEDGEV 83
++GF + + + ++ PL+L + + + +Q PL +CR ++
Sbjct: 43 HAQGFHLCARTPLTSPIVASCPLNLTFSILNLDPGEKEVQHIQSPL--QQCR-----DKI 95
Query: 84 DDRFLMILFLTVERLRKN-SSWKPYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATEL 140
D L L L +R + N S W YL LP PLWF D + L GT+L A +
Sbjct: 96 PDHILAYLMLLEQRDKGNDSPWSAYLACLPGPQDMTTPLWFDDVDFAFLAGTSLAPAAKE 155
Query: 141 QKQNLLTLYDDKVK 154
+K L ++ V+
Sbjct: 156 RKAELHQQWEHAVQ 169
>gi|308809523|ref|XP_003082071.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060538|emb|CAL55874.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 413
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 3/90 (3%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P ID NH +++ AT+ V G S + E +S + E+ ISYG++ N
Sbjct: 212 LAPVIDLANHRVESTATYGVSADG--KNFELSWNENAPEGASPVANTEVFISYGDRMNNA 269
Query: 316 LLYL-YGFVIDNNPDDYLMIHYPAEAIHSI 344
+L L YGF+ DNN ++ L + + A +
Sbjct: 270 ILMLHYGFIDDNNRNERLPMEFIAPGARKV 299
>gi|50307933|ref|XP_453965.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49643100|emb|CAG99052.1| KLLA0E00441p [Kluyveromyces lactis]
Length = 558
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 10/75 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH W+ +G+ + FS ++ E+ +YG+K NEE
Sbjct: 211 LLPIVDLLNHKSDTKVHWKSEGSF----ITFS------SEEIIEAKGELYNNYGDKSNEE 260
Query: 316 LLYLYGFVIDNNPDD 330
LL YGF ID+NP D
Sbjct: 261 LLLGYGFAIDSNPHD 275
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 78/174 (44%), Gaps = 29/174 (16%)
Query: 49 VLLVVPLDLAITPMRVLQDPLIG--PECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWK 105
VLL +PL+ T L+D + C A +D ++ L + ER R + ++
Sbjct: 35 VLLEIPLERGFTLAAALEDDAVKRVASCCARHDD-------VVALHVCAERFRGEKATRA 87
Query: 106 PYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVL-- 163
++ LP +F +++++EL EL GTT R T NL + + L KK+ +
Sbjct: 88 AHVATLPRSFDTAFFWSEEELRELTGTTCLRET----MNLREETKNDYETLTKKMEAIGE 143
Query: 164 -------DGDSER---ANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
+ D ER A S W+R ++ +P + + + +D N+SA+
Sbjct: 144 GGWMREHEVDYERYAWARSNLWSRQCDLLMPDG---KRTRAMVPTFDIFNHSAK 194
>gi|50286869|ref|XP_445864.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525170|emb|CAG58783.1| unnamed protein product [Candida glabrata]
Length = 585
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 74/154 (48%), Gaps = 13/154 (8%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSN-----EFSDGVLLVVPLDLAITP 61
KL+ L W + N +LR ++Y+ DE KG+ + + EF D ++ VP L IT
Sbjct: 4 KLDGLLAWARENGAQLREDCLEYNYDEKKGYHVLIKDIDALWEFKDAGVITVPRKLFITR 63
Query: 62 MRVLQDPLIGPE-CRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP---TTFGN 117
Q I + + + + ++T + + ++PY+++LP + +
Sbjct: 64 QLAQQYFKINDQKLSTKYNNNPNGLTEFFLSYITFTKENEYEFFRPYINILPKLNSMRIH 123
Query: 118 PLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
P ++T +EL +KG+ L+ + + LL LYD+
Sbjct: 124 PFFWTKEELQLIKGSDLFLTIKFK---LLALYDE 154
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 4/141 (2%)
Query: 13 LQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIG 71
L+WL+ K + KI+ YS+ +G + ++L +P IT + P+
Sbjct: 140 LEWLKHGKAQFPKIKIECYSESYRGVNA-KQKINAKELILFIPKSHMITLEMAKETPVAK 198
Query: 72 PECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDELLELK 130
+ + FL FL E+ R NS WKPYLD+LP ++ + P++F + +L L+
Sbjct: 199 KMIQFRLDLLSPKHSFLST-FLLQEKSRPNSFWKPYLDILPQSYPSFPIFFNNYDLEWLQ 257
Query: 131 GTTLYRATELQKQNLLTLYDD 151
G+ + + +L Y+D
Sbjct: 258 GSPFLKQINDKLSDLKKDYND 278
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 36 GFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ ++ + G L++ +P IT VLQ +G R V + FL
Sbjct: 59 GRGLMATRDLQPGELIISLPDSCLITTETVLQS-YLGKYIRTW--SPPVSPLLALCTFLI 115
Query: 95 VERL-RKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQN 144
ER+ R+ S WKPYLD+LP+++ P+++ + E++ L L R L++Q
Sbjct: 116 AERVARERSPWKPYLDVLPSSYSCPVYW-ESEIISLLPAPL-RQKALEQQT 164
>gi|345560038|gb|EGX43167.1| hypothetical protein AOL_s00215g623 [Arthrobotrys oligospora ATCC
24927]
Length = 426
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 41/70 (58%), Gaps = 2/70 (2%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
++IL + E + +S W+ Y D +PT F ++++ DEL EL+G+ + ++ K+
Sbjct: 73 ILILIIMYEASKPDSQWRSYFDSMPTEFDTLMYWSQDELKELEGSAVL--NKIGKEEAEA 130
Query: 148 LYDDKVKDLV 157
+Y +++K V
Sbjct: 131 MYLEEIKTFV 140
>gi|412986734|emb|CCO15160.1| predicted protein [Bathycoccus prasinos]
Length = 450
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 87/209 (41%), Gaps = 35/209 (16%)
Query: 150 DDKVKDLVKKLLV------LDGDSERANSIFWTRAL---NIPLPHSYVFPQNQEDLNKYD 200
DD L K++LV L D ++ I + AL I + + F N +Y+
Sbjct: 110 DDWAASLAKQILVKRRRSTLSDDEKKELKIAYANALPKEMIGIANKCCFYDNN---TRYE 166
Query: 201 SINNSAELSNDHNSRGELINGLNDIKNEAQRVN------------SQVNGATSTLTSTQG 248
+ E + DH+++ EL+ + + ++R SQV T + +G
Sbjct: 167 LVCKFFEATGDHDAKEELMKYEKQVSSRSRRHGEEEEEEKYGWALSQVFSRTFRIEDARG 226
Query: 249 ETLWIEGLVPGIDFCNH---DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEIS 305
++P +D NH + + TW V S +++ +R+ E E+
Sbjct: 227 RRAPRRVMIPIVDLLNHSSVEEEVNVTWRVKE-------DLSAFIVEAKRNVGKDE-ELI 278
Query: 306 ISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+SYG + ++ L YGF+ NP + +M+
Sbjct: 279 LSYGERNDQHFLLFYGFLPSMNPCNSVMM 307
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 72/161 (44%), Gaps = 16/161 (9%)
Query: 36 GFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ + F +G +L +P + T D ++GP R+ V+D + +
Sbjct: 25 GRGVRTLKCFKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFV 84
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK 154
R + ++ LP ++ + ++F +D+L GT+LY T+ +Q + +D +
Sbjct: 85 RSRKSGYDGPRNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRI----EDDYR 140
Query: 155 DLVKKLL-------VLDG----DSERANSIFWTRALNIPLP 184
LV ++L LD D + A W+RA++ LP
Sbjct: 141 GLVVRMLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLP 181
>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
Length = 562
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 36/76 (47%), Gaps = 12/76 (15%)
Query: 256 LVPGIDFCNHDLKAAATWEV-DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L P +DF NH W DG + V + S S +E+ +YG+K NE
Sbjct: 218 LYPIVDFLNHKNDTKVKWCFQDGK-----------MCFVSKESLKSGEELFNNYGDKSNE 266
Query: 315 ELLYLYGFVIDNNPDD 330
ELL YGFV DNN D
Sbjct: 267 ELLLNYGFVQDNNQYD 282
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 95/261 (36%), Gaps = 73/261 (27%)
Query: 102 SSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL 161
S WKPYLD LP+++ ++FT DE+ ELKG+ +DD ++
Sbjct: 168 SFWKPYLDALPSSYDTVMYFTPDEITELKGS--------------PAFDDALR------- 206
Query: 162 VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN-------S 214
NI +SY + Q++++ S + ND+ +
Sbjct: 207 ---------------MCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYRWAVSTVMT 251
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH-DLKAAATW 273
R LI +I + VN L+P DFCNH D + + +
Sbjct: 252 RQNLIPSQEEISGNDKDQLPPVN-----------------ALIPLWDFCNHQDGQFSTEF 294
Query: 274 EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLM 333
+++ + F +++ I YG + E GFV NN D L
Sbjct: 295 QLESRRTVCQA----------GRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNAHDALT 344
Query: 334 IHYPAEAIHSIPLSDSKALLL 354
+ S PL+ +A LL
Sbjct: 345 LKVGLS--KSDPLAGQRATLL 363
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 76/151 (50%), Gaps = 11/151 (7%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKI-KYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITP 61
S+E KL L+W++ I + +D + + + + ++G V+L +P T
Sbjct: 7 SSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS--VLARTDIAEGEVVLQIPTTHLFTL 64
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMIL-FLTVERLR-KNSSWKPYLDMLPTTFGN-P 118
R + IG ++ + D+ FL + +L E+ R +S WKP++D LP + + P
Sbjct: 65 ERA-KASDIGRRIQSQLQP---DNDFLYLASWLLEEKHRGADSFWKPFVDSLPEAYPHVP 120
Query: 119 LWFTDDELLELKGTTLYRATELQKQNLLTLY 149
L++++ E +KG+ L R E+Q+Q+ Y
Sbjct: 121 LFYSEQERARMKGSQLERLVEVQRQSFEQEY 151
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 84/186 (45%), Gaps = 12/186 (6%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGI-FSSNEFSDGVLLVVPLDLAITP 61
++T L+ L W + N ++ S G G+ +SN S+ L +P L ++
Sbjct: 1 MATYEGLKVLLDWFKSNGGSVQPHVEFASYPDMGCGMRATSNLRSETELFSIPRSLVLS- 59
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWF 121
V PL P+ + + +IL L E++ S WK YL+ +PT F + +++
Sbjct: 60 --VHTSPL--PKSLPDWSEISTQGWVGLILCLMYEQIDPASHWKRYLNSMPTCFDSLMFW 115
Query: 122 TDDELLELKGTTLY-----RATELQKQNLLTLYDDKVKDLVKKLLVLD-GDSERANSIFW 175
+DDEL EL+G+++ E ++L Y K D+ K L R S+
Sbjct: 116 SDDELRELQGSSVLDKIGREEAEGSYYSILVPYLSKHADIFKPLEAYSLALYHRCGSLIL 175
Query: 176 TRALNI 181
+R+ ++
Sbjct: 176 SRSFHV 181
>gi|221131915|ref|XP_002160713.1| PREDICTED: SET domain-containing protein 4-like [Hydra
magnipapillata]
Length = 429
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 96/197 (48%), Gaps = 13/197 (6%)
Query: 33 ESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDG-EVDDRFLMI 90
+ G G+ ++ S G L++ +PL+L IT +L++ + +F + + ++L I
Sbjct: 53 KKTGRGLKTTKSVSPGDLIIALPLNLLITFDTILENN----DLNFIFRNHPSICQKYLFI 108
Query: 91 LFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL- 148
LFL +E+ + +NS + YL+ LP F P + + DE ++L + T LQ + +L
Sbjct: 109 LFLLIEKKKGENSYFFHYLNTLPENFSTPSYISQDE-MQLCPNFIQEETGLQNRQILNAI 167
Query: 149 -YDDKVKDLVKK-LLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSA 206
+ + L+ L +D + + A ++ TR++ H F +N +N ++
Sbjct: 168 KHISCIHSLIANDLSCIDSEVKWAWNVINTRSVYFNAKHLKCF-KNISSINVDFALAPVL 226
Query: 207 ELSNDHNSRGELINGLN 223
+L N HN ++ G N
Sbjct: 227 DLLN-HNDTANVVAGFN 242
>gi|428176276|gb|EKX45161.1| hypothetical protein GUITHDRAFT_139093 [Guillardia theta CCMP2712]
Length = 281
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 7/75 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +DF NHD + A G+ S V + + ++I+I YGN GN+E
Sbjct: 213 LLPFVDFANHDFEPNAQIRRSGSS-------SPSAELVAQRDLSASEQITICYGNLGNQE 265
Query: 316 LLYLYGFVIDNNPDD 330
LL YGF I N D
Sbjct: 266 LLLNYGFEITGNKFD 280
>gi|406602709|emb|CCH45757.1| hypothetical protein BN7_5343 [Wickerhamomyces ciferrii]
Length = 569
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 104/258 (40%), Gaps = 59/258 (22%)
Query: 101 NSSWKPYLDMLPTTFG--NPLWFTDDELLELKGTTLYRATELQKQNLLTLY-------DD 151
+S +KPYLD+LP +PL+++D E +KGT + A + + L+ + D
Sbjct: 101 SSHFKPYLDILPEGHDTLSPLFWSDVERNTIKGTDAHLAIDYKISQLVDEWYSIVSKIDS 160
Query: 152 KV--KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELS 209
K K + + G ++ + FW + LN+P + S S
Sbjct: 161 KFQPKTFQEDISFYKGFTKGKSEQFW-KYLNLP--------------QSWTSFPAYLWAS 205
Query: 210 NDHNSRGE--LINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDL 267
NSR L+ G NE R +N A LVP D NHD
Sbjct: 206 AIFNSRAFPFLLAG-----NEICR---DLNEAF---------------LVPIFDLLNHDN 242
Query: 268 KAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNN 327
+A W D G F + E+ + EI SYG K N+EL++ YGF I+NN
Sbjct: 243 EANVKW--DSLDSSNGKNF---IFKTEQK-LKNGDEIYNSYGPKTNQELMFGYGFAIENN 296
Query: 328 PDDY--LMIHYPAEAIHS 343
+D L + P I S
Sbjct: 297 KEDRATLALRIPEANIES 314
>gi|170042895|ref|XP_001849144.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167866318|gb|EDS29701.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 474
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 12/138 (8%)
Query: 19 NKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD----PLIGPE 73
N ELR E G G+FS F G ++ +P + I+ + +D +
Sbjct: 35 NGTELRVANF----EETGKGLFSRRGFKAGDCIISLPFESLISLKSIEEDEAFLACFNKD 90
Query: 74 CRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
A F GE+ + L+ L+L RL NS K YL+ +P +F P + + E+ L
Sbjct: 91 ALADFSKGELQFQSLLALYLIYLRLEDNSPRKAYLNSIPESFTTPYFCSKLEMANLPNVV 150
Query: 134 L---YRATELQKQNLLTL 148
L + E+ KQN L
Sbjct: 151 LKQMVQQNEIIKQNFTLL 168
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 57/130 (43%), Gaps = 8/130 (6%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK------EISISY 308
L P +DF NH+ A T ++ + F ++ + K +I ISY
Sbjct: 226 ALAPFLDFFNHNAGAETTSKLSISYESLTKQFKKNQITHLYYELFTSKPVPPFSQIFISY 285
Query: 309 GNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKS 368
G N +LL YGF + NP D+L + + I++ +D + L+ + + R + S
Sbjct: 286 GTHNNTKLLLEYGFSLPANPQDFLEL--TLDDINAFIRADPELRPLKIHREKYRFIADHS 343
Query: 369 LLEHGFFAAG 378
L E FF G
Sbjct: 344 LGEQLFFVPG 353
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMY--LLSVERSSFHSEKEISISYGNKG 312
LVP D NH+ A + + GL FS ++ S+ +++ ISYG KG
Sbjct: 292 ALVPYADLFNHNPFANSYIDARQQGLF----FSKTDEVVVYADRSYKKMEQVYISYGPKG 347
Query: 313 NEELLYLYGFVIDNNP 328
N +LL LYGF +D NP
Sbjct: 348 NSDLLLLYGFSLDRNP 363
>gi|242804795|ref|XP_002484448.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218717793|gb|EED17214.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 409
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP D+ NHD KA EV+ G Y R F +E+ ISYG+ N+
Sbjct: 211 ALVPFADYFNHDDKAPC--EVNFNG-------EYYTFKASRR-FEKGEELFISYGSHSND 260
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
LL YGF++D+N D + +
Sbjct: 261 FLLVEYGFLLDDNKSDAIFL 280
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 61/139 (43%), Gaps = 12/139 (8%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDES---KGFGIFSSNEFSDG-VLLVVPLDLAIT 60
T+A + WL V++ D + +G+G+ ++ + G +L +P L +T
Sbjct: 54 TQADFDALWAWLGSEGVDVSKVSPALVDAAPGGRGWGLVAAEDIGGGDAVLAIPRSLWMT 113
Query: 61 PMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPL 119
L P IG C G+ + L L ER + + S W Y++ LP PL
Sbjct: 114 VDTALASP-IGAHC------GDEAGWIAVALQLLHERSIGEKSRWAAYVNALPAQLDAPL 166
Query: 120 WFTDDELLELKGTTLYRAT 138
+++ +E+ L GT L A
Sbjct: 167 FWSAEEVATLTGTQLLDAA 185
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 24/122 (19%)
Query: 217 ELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDL----KAAAT 272
+L G + E +R S V T + GE ++I P +DF NH + KAA T
Sbjct: 123 QLFGGSIPSEEEYRRALSLVYSRTFDFSELIGEHVFI----PFVDFLNHSINDTGKAACT 178
Query: 273 W----EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNP 328
+ + D L+ G + +E+ ISYG K + +LL YGF+ +NN
Sbjct: 179 YSYNHDKDCFELLAGADYD------------EGEEVFISYGEKTSSQLLASYGFMYENNA 226
Query: 329 DD 330
+D
Sbjct: 227 ED 228
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 45/72 (62%), Gaps = 3/72 (4%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+I+ + E LR ++S WKPY D+LP F ++++DDEL EL+ + + + ++ K +
Sbjct: 43 LIVTMIYEYLRGDASPWKPYFDVLPAHFDTLMFWSDDELAELQASAVTQ--KIGKDSANE 100
Query: 148 LYDDKVKDLVKK 159
++ + + LV++
Sbjct: 101 MFTNTIIPLVRR 112
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 60/128 (46%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
F+ WL+ KIK +D E G GI + ++ D L +P +L ++
Sbjct: 20 FMSWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQNSSLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ F + + D +I+ + E L+ S+W Y +LPT F +++TD+EL
Sbjct: 80 DLLD------FNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL 133
Query: 127 LELKGTTL 134
EL G+ +
Sbjct: 134 RELSGSAV 141
>gi|452000836|gb|EMD93296.1| hypothetical protein COCHEDRAFT_1170833 [Cochliobolus
heterostrophus C5]
Length = 643
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 74/165 (44%), Gaps = 26/165 (15%)
Query: 33 ESKGFGIFSSNEFSDGVLLVVPLDLAITPMRV---------LQDPLIGPECRAMFEDGEV 83
++GF + + + V+ PL+L ++ + + +Q PL +CR ++
Sbjct: 43 HAQGFHLCARTPLTSPVVASCPLNLTLSILNLDSGEKEVQHIQSPL--QQCR-----DKI 95
Query: 84 DDRFLMILFLTVERLRKNSS-WKPYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATEL 140
D L L L +R + N S W YL LP PLWF D + L GT+L A +
Sbjct: 96 PDHILAYLLLLEQRDKGNDSPWSAYLACLPGPRDMTTPLWFDDVDFAFLAGTSLAPAAKE 155
Query: 141 QKQNLLTLYDDKVKDLVKKLLVLDGDS------ERANSIFWTRAL 179
+K L ++ ++ ++K L + D + A +IF +RA
Sbjct: 156 RKAELRQQWEHALQ-VIKHLDLHLADVISLESLQWAATIFTSRAF 199
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 70/176 (39%), Gaps = 29/176 (16%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
I L P ID NH + A W+ + S L ++ S +E+ +Y K
Sbjct: 211 IPMLFPVIDILNHSVTAKVEWDFEPHR-------SFALKCLQADSVKPGEELFNNYAPKQ 263
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYP-----AEAIHSIPLSDSKALLLEEQKAQLRCLLPK 367
N+ELL YGF +++NP + + + H + L D K + E + L K
Sbjct: 264 NDELLLGYGFCLEDNPIEQFALKLAFQPQLQQYAHQLGLLDGKNVPFEMTRDFLATDPNK 323
Query: 368 SLLEHGFFAAGHP--KDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVFPENFLTA 421
+H GHP + GNN + F R +P Y+ F + +TA
Sbjct: 324 E--QHFLRTRGHPFGRYGNN--------VPFF-----RGVPPYIVHFFFIQTIITA 364
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 84/240 (35%), Gaps = 81/240 (33%)
Query: 102 SSWKPYLDMLPTTFGNPL-WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKL 160
S WKPY+D+LP L +++ EL +L+ L ++ + LY + L +
Sbjct: 166 SLWKPYIDILPHALNTGLVYWSSSELAQLQYRPLIEEVKINQYYREALYTRVFESLSSPV 225
Query: 161 LVLDGDSERANSIFW------TRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
V +E+ N FW +RA IP ++ N +
Sbjct: 226 RVW-LQNEKENVFFWALDMVQSRAFGIP------------------------DVGNKTYA 260
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWE 274
+++ LN RVNSQ + ++ + Q E L PG D
Sbjct: 261 LLPMMDMLN------HRVNSQTHFLYDSI-ANQYEMKTYSKLSPGTD------------- 300
Query: 275 VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
I ISYG N+ LL+ YGF+ NNP DY +
Sbjct: 301 -----------------------------IYISYGPLDNDHLLHFYGFLQTNNPSDYFQV 331
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 5/120 (4%)
Query: 33 ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMIL 91
+ G G+ +S G + L +P L I+ + Q + A+ + + +++L
Sbjct: 176 QGAGRGMVASESIGVGEIALEIPESLIISDELLCQSEVF----LALKDFNSITSETMLLL 231
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
+ ER S +KPY D LP F L F D L L+GT L+ KQ+L YD+
Sbjct: 232 WSMRERYNLASKFKPYFDTLPANFNTGLSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDE 291
>gi|46136815|ref|XP_390099.1| hypothetical protein FG09923.1 [Gibberella zeae PH-1]
Length = 484
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 9/97 (9%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
+ +VPG+D NH A ++ D + LL ++ + +E++ISYG+K
Sbjct: 227 DAMVPGLDMANHSHHPTAYYDEDDKD-------DVVLLVRPETTVSAGEEVNISYGDKNP 279
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ +N ++ P + + PL +K
Sbjct: 280 AEMLFSYGFI--DNESTVEGLNLPVKVLPDDPLGKAK 314
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 36/55 (65%), Gaps = 2/55 (3%)
Query: 82 EVDDRFLMILFLTVERLRK--NSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
E+D +L ++ + V K SSW PY ++LP TF + ++++D+ELL LKG+ +
Sbjct: 80 ELDHPWLSLILVMVHEYLKGTKSSWYPYFNLLPETFDSLMFWSDEELLSLKGSAV 134
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQD 67
F+ WL+ KIK +D E G GI + ++ D L +P L ++
Sbjct: 20 FMSWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQNSRLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ F + + D +I+ + E L+ S+W Y +LPT F +++TD+EL
Sbjct: 80 DLLD------FNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL 133
Query: 127 LELKGTTL 134
EL G+ +
Sbjct: 134 RELSGSAV 141
>gi|412992279|emb|CCO19992.1| predicted protein [Bathycoccus prasinos]
Length = 640
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 40/88 (45%), Gaps = 18/88 (20%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK------EISI 306
I+ L P FCNH + V +S LS E F + + EI +
Sbjct: 406 IDVLSPVAFFCNHGIYPHC------------VHYSQLRLSDECLVFPAMRDIEKNEEIML 453
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYLMI 334
SYG K N ELL YGF ID+NP D + I
Sbjct: 454 SYGAKSNGELLLFYGFCIDDNPYDSIDI 481
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 84/387 (21%), Positives = 142/387 (36%), Gaps = 110/387 (28%)
Query: 5 TEAKLEPFLQWLQVNKVELR--------------GCKIKYSDESKGFGIFSSNEFSDG-V 49
T ++LE F++WL N+V + + E G G + + G V
Sbjct: 70 TSSRLEAFVKWLAANEVFVSDKATWGRASHPLVVAEQTLIESEPAGRGFLARRDIQAGEV 129
Query: 50 LLVVPL------DLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-----L 98
L VP D+A+ L P + E ++ F + L ER
Sbjct: 130 LFQVPFHLCFTKDVAVRRFAALNVPELADE----------EEFFALATLLLYERGLDESW 179
Query: 99 RKN-----SSWKPYLDMLPTT---FG-------------NPLW-FTDDELLELKGTTLYR 136
+K+ S W PYLD+LP F + LW + +DE+ L+G+
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 137 ATELQKQNLLTLYDDKVKDLVKK---LLVLDGDS--ER---ANSIFWTRALNIPLPHSYV 188
+ + + Y + + L ++ + L+G ER A + ++RA+++P + +
Sbjct: 240 SARALRSKVEREYAEACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPAENGML 299
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
++ A+L+N I+ A R +S+ +G
Sbjct: 300 ------------ALVPYADLANHSAFCVSFIDARTAAFPYAFRASSK---------QKRG 338
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERS------SFHSE- 301
+ W L P D G + S Y +R F+ +
Sbjct: 339 Q-WWQRFLAPN---------------SDDAGAVANTDSSHYREDAQREVVAYADRFYDKF 382
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNP 328
+++ +SYG K N ELL LYGFV D NP
Sbjct: 383 EQVYVSYGQKSNAELLLLYGFVSDRNP 409
>gi|357479689|ref|XP_003610130.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
truncatula]
gi|355511185|gb|AES92327.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
truncatula]
Length = 689
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 81/367 (22%), Positives = 145/367 (39%), Gaps = 38/367 (10%)
Query: 30 YSD-ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRF 87
Y+D E G G+ + + G + + +P+ L I+ VL+ + R + E +
Sbjct: 240 YTDVEGAGRGMIAGKDLKVGDIAIEIPVSLIISEELVLETDMY----RLLKEVDGISSET 295
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+++L+ E+ +S +K Y D LP F L F + + L GT L+ +Q+
Sbjct: 296 ILLLWSMKEKYNCDSKFKIYFDTLPEKFNTGLSFGIEAITMLDGTLLFEEIMQSRQSRNC 355
Query: 148 LYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSI--NNS 205
+ ++ ++++++ + + N+ + L P H V + S+ N S
Sbjct: 356 IIEEA--NVLQQMTLQHPRAMLLNTTKGKQKLEKPKVHGLVRRATYISRLAWFSLCLNYS 413
Query: 206 AELSNDHNSRGELI----NGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEG-----L 256
+ H EL+ NG DI A S + ++ +G L
Sbjct: 414 MSPVHLHAQYDELVPALCNGFPDIFPPEIYTWENFLWACELWYSNSMKIMYSDGKLRTCL 473
Query: 257 VPGIDFCNHDLKAAATW--EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
+P F NH L T +VD P + L S S +E +SYGN +
Sbjct: 474 IPLAGFLNHSLCPHITHYGKVD--------PSTNSLKFCLSRSCRSGEECCLSYGNFSSS 525
Query: 315 ELLYLYGFVID-NNPDDYLMIHYPAEAIHSI---PLSD-----SKALLLEEQKAQLRCLL 365
+ YGF+ +NP D + + + + SI P+S+ + L + L
Sbjct: 526 HFITFYGFLPQGDNPYDVIPLDIDSSDVDSIEDKPVSNWTTHMVRGTWLSNNHSIFYYGL 585
Query: 366 PKSLLEH 372
P LL+H
Sbjct: 586 PSPLLDH 592
>gi|366986581|ref|XP_003673057.1| hypothetical protein NCAS_0A01060 [Naumovozyma castellii CBS 4309]
gi|342298920|emb|CCC66666.1| hypothetical protein NCAS_0A01060 [Naumovozyma castellii CBS 4309]
Length = 591
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 44/95 (46%), Gaps = 11/95 (11%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH + +W + G + + E+ +YG KGNEE
Sbjct: 234 LLPIIDLMNHHYSSKVSWSSNEEGA---------FIYQNQMVLEKGDELLNNYGAKGNEE 284
Query: 316 LLYLYGFVIDNNPDDYLM--IHYPAEAIHSIPLSD 348
LL YGFV+++N D +M I P I I ++D
Sbjct: 285 LLASYGFVLEDNAFDLVMLRIKLPLPLIEEILIND 319
>gi|256272812|gb|EEU07782.1| Rkm1p [Saccharomyces cerevisiae JAY291]
Length = 583
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD + W P + + + + +E+S +YG KGNEE
Sbjct: 231 LLPIVDLLNHDYCSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 280
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+++N D L + P + + +I
Sbjct: 281 LLSGYGFVLEDNIFDSVALKVKLPLDVVSTI 311
>gi|307103410|gb|EFN51670.1| hypothetical protein CHLNCDRAFT_139898 [Chlorella variabilis]
Length = 543
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 8/81 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
++P ID NH +AA G G SM +++ + + + + ISYG N+
Sbjct: 219 MLPLIDMANHSFQAANAKIAPGPG------GSMCMVATR--ALQAGEPVLISYGALSNDF 270
Query: 316 LLYLYGFVIDNNPDDYLMIHY 336
LL YGF++ NP D + + +
Sbjct: 271 LLMDYGFIVPGNPHDTVQLRF 291
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 68/138 (49%), Gaps = 10/138 (7%)
Query: 4 STEAKLEPFLQWLQVNKVELR-GCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLA 58
S +A E FL+WL +K+ +R K+ D E +G G+ ++ +F D V+ +P
Sbjct: 5 SFQATTEAFLEWL--SKIGVRINPKMTLKDLKSEGRGRGVVAAADFEEDEVVFCIPRTAV 62
Query: 59 ITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNP 118
+ V G A+ + + + + + E + +S W PYL +LP +
Sbjct: 63 LNVNNVFAGQDSGASKEALLQ---MPNWLALTATMMSEGQQSDSRWAPYLAVLPQKLDSL 119
Query: 119 LWFTDDELLELKGTTLYR 136
++++++EL EL+ +++ +
Sbjct: 120 VFWSEEELAELQASSVAK 137
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 62/308 (20%), Positives = 114/308 (37%), Gaps = 70/308 (22%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDP 68
L P W++ + L+ ++ + S G LV A +V++
Sbjct: 52 LPPLSAWVEQRGLPLKKLNVRPEIVEGDLCLVVSKPTKKGQPLVAVPSSAWLTQQVVRSS 111
Query: 69 LIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
IG ++ ED ++ + LFL ER + +++W+ +LD +P PL+++++EL +
Sbjct: 112 SIG----SLVED--LEPWLQIALFLLHERSKPDAAWQGFLDSIPAAPDVPLFWSEEELSQ 165
Query: 129 LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYV 188
L+GT L + + +Q ++ K +L ++L PH
Sbjct: 166 LEGTQLLSSVQGYRQ----FFEAKYAELEEQLFA---------------------PHREA 200
Query: 189 FPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQG 248
FP L+ + L + RV+S ++G
Sbjct: 201 FPPKSHQLDDF----------------------LWAVATVRSRVHSPLDGEDV------- 231
Query: 249 ETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKE-ISIS 307
LVP D H A W++ G G+ L VE ++E E +++
Sbjct: 232 ------ALVPLADLVQHRKLQGARWQLQLAG---GLFSKAQALVVEAQRDYAEGEVVTMD 282
Query: 308 YGNKGNEE 315
+G EE
Sbjct: 283 FGAPLTEE 290
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH A W+ +S L V + S +EI +YG + NE
Sbjct: 268 LLPLVDISNHKPGAKVEWQAR---------YSFVGLQV-LEPYESGQEIFNNYGPRDNET 317
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGF I +NP D+++I
Sbjct: 318 LLVAYGFTIPDNPFDHVVI 336
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 58/142 (40%), Gaps = 41/142 (28%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI-------- 306
LVP DF NH+ PFS ++ ++ SF EI +
Sbjct: 254 SLVPYADFMNHN------------------PFSTSYINSKKISFSKNHEIVMYADKDYNK 295
Query: 307 ------SYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ 360
+YG K N ELL LYGF+++ NP D + + I LSD + ++++
Sbjct: 296 FDQIFTTYGQKTNLELLLLYGFILERNPFDSIELR--------ISLSDKDSFFEKKKQFM 347
Query: 361 LRCLLPKSLLEHGFFAAGHPKD 382
+ C S + F +PK+
Sbjct: 348 IEC-EKTSEITFPIFYYKYPKE 368
>gi|390367697|ref|XP_787519.3| PREDICTED: N-lysine methyltransferase setd6-like
[Strongylocentrotus purpuratus]
Length = 466
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 86/191 (45%), Gaps = 11/191 (5%)
Query: 4 STEAKLEPFLQWL-QVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITP 61
ST+ ++ FL W Q ++ ++ S G+ + ++ S G L VP + + P
Sbjct: 29 STDCQM--FLSWCEQEGIIQNSKVTVRRSGSCAQCGMIALDDISKGETLFTVPRSVLLHP 86
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFL-MILFLTVERLRKNSSWKPYLDMLP--TTFGNP 118
P++ + E + ++ +IL + E ++S W+PYLD+ P + P
Sbjct: 87 ATC--SPVVAQRLEEDEDSLETESGWVPLILAVMYEHTNRSSRWRPYLDLFPDYSELDQP 144
Query: 119 L-WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTR 177
+ W ++ EL+GT + A + +N+ Y D +KK L + E+ N + R
Sbjct: 145 MFWDSNYMQPELRGTGIAEAVQRDLRNIDRDYHDVALPFIKKNADLFSE-EKHNLDLYKR 203
Query: 178 ALNIPLPHSYV 188
++ + +S+
Sbjct: 204 TVSFIMAYSFT 214
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 43/80 (53%), Gaps = 10/80 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFS-MYLLSVERSSFHSEKEISISYGNKGNE 314
+ P ID NH V G ++ F+ Y L+ +++ S E+ ISYG + N+
Sbjct: 336 ICPMIDMANHQ-------SVKFAGQVSFEYFANAYSLATDQA-IPSGDEVYISYGPRSND 387
Query: 315 ELLYLYGFVIDNNPDD-YLM 333
+LL YGFV NNP+D Y+M
Sbjct: 388 QLLQYYGFVERNNPNDVYVM 407
>gi|367005530|ref|XP_003687497.1| hypothetical protein TPHA_0J02430 [Tetrapisispora phaffii CBS 4417]
gi|357525801|emb|CCE65063.1| hypothetical protein TPHA_0J02430 [Tetrapisispora phaffii CBS 4417]
Length = 587
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 44/91 (48%), Gaps = 11/91 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH+ W D G F LSV ++ E++ +YG KGNEE
Sbjct: 232 LLPIIDLLNHNYSTKVEWLSDNDG-----SFCYRNLSVTPAN----TELNNNYGGKGNEE 282
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGFV+ +N D L I+ P I I
Sbjct: 283 LLSGYGFVMKDNLFDSVALKINLPETMILQI 313
>gi|145349216|ref|XP_001419036.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579266|gb|ABO97329.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 476
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 7/87 (8%)
Query: 249 ETLWI-EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
+TL I E +VPG+ CNH L A + V T L G + L + R + + I+IS
Sbjct: 240 QTLQIYEVIVPGVFLCNHALYAHS---VRYTSLERGT--RAFRLELARGARPGDA-ITIS 293
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMI 334
YG N +L+ YGF + +NP D +++
Sbjct: 294 YGRLDNADLMAYYGFTLPSNPYDRVVL 320
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKG 131
+IL L E LR S+W+PYLD+LP TF P+++T EL L+
Sbjct: 107 LILVLLYEHLRGAASAWRPYLDVLPATFETPMFWTGAELGALQA 150
>gi|340923760|gb|EGS18663.1| hypothetical protein CTHT_0052690 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 505
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 140/360 (38%), Gaps = 76/360 (21%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEF---------SDGV--LLVVPLDLAITPMR 63
W +N V K+ + E KG+G+ ++ DG LL VP DL +
Sbjct: 14 WALLNNVSFTNVKVAQT-EDKGYGVVTTRHLVQANADEATPDGATALLTVPHDLVLNQQA 72
Query: 64 VLQDPLIGPECRAMFED-GEVDDRFLMILFLTVER-LRKNSSWKP---------YLDMLP 112
V + R + E G R ++LFL V+ L SS+ P YL LP
Sbjct: 73 VHEYAKEDKNFRQLLEAVGHRSARADVLLFLLVQSALASRSSYTPVGVSNPWAQYLRFLP 132
Query: 113 TTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANS 172
T P + +DE L L+GT+L A + KL LD +
Sbjct: 133 ETVLVPTLWNEDERLLLRGTSLEPAID------------------AKLSALDAEF----G 170
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRV 232
+ ++ ++P + ++ Q+ L + I S+ S + I R+
Sbjct: 171 LVREKSCDLPCWNELLWQQDDGQLAEGAEI---GAFSSPTPSFTDWI-----------RL 216
Query: 233 NSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH-DLKAAATWEVDGTGLITGVPFSMYLL 291
++ L + GE + VP ID NH + +A +E + + LL
Sbjct: 217 DALYRSRCLELPGS-GEAM-----VPCIDMINHSSVSPSAYYEENAQDEVV-------LL 263
Query: 292 SVERSSFHSEKEISISYGN-KGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
S + E++ISYG+ K E+L+ YGF+ + + L++ P PL+ +K
Sbjct: 264 LRPGVSLPAGAEVTISYGDVKPAAEMLFNYGFIDPQSASERLVL--PLRPFSDDPLAKAK 321
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 97 RLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDL 156
R +W PYLD+LPT F P+++ + +L L GT++ A ++ + Y +K
Sbjct: 167 RAHGEQTWGPYLDILPTEFSTPMFWQEKDLYHLSGTSI--ADKIARDEAEADYHNKAAPF 224
Query: 157 VKKL--LVLDGDSE 168
++ L + ++G +E
Sbjct: 225 IRSLPSIFMEGVAE 238
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 55/133 (41%), Gaps = 24/133 (18%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W + L V + +E++ +YG + NE+
Sbjct: 233 LLPFIDVLNHRPLAKVEWRAGERDV----------LFVVLEHVAAGEEVANNYGPRNNEQ 282
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLR------------C 363
L+ YGF + NNP DY + A PL D+K LE ++ +
Sbjct: 283 LMMNYGFCLQNNPCDYRTLSL--RAPPGSPLQDAKQAQLEMFPSREKDDHYYVFNIFYPL 340
Query: 364 LLPKSLLEHGFFA 376
L P + +EH F+
Sbjct: 341 LAPDTSMEHAIFS 353
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 75/168 (44%), Gaps = 12/168 (7%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPM 62
+ P L WL+ + + ++G + S G +L V +L ITP
Sbjct: 29 AVHTAFSPLLAWLESRGETEALTSLTIGNTNQGRALLSIRHIKRGEQVLRVSRELMITPN 88
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILF-LTVERLRKNSSWKPYLDMLPTTFG--NPL 119
R+ P C +V++ + LF L + K S W+PY+ LP G N +
Sbjct: 89 RL-------PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRGLQNTV 141
Query: 120 WFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDS 167
++ D+EL L+ + +Y TE ++ L++ D V+ +V K L G++
Sbjct: 142 FWRDEELELLRQSNVYDQTE-HRKTLISNQFDLVQAVVNKYPELFGET 188
>gi|255719552|ref|XP_002556056.1| KLTH0H04004p [Lachancea thermotolerans]
gi|238942022|emb|CAR30194.1| KLTH0H04004p [Lachancea thermotolerans CBS 6340]
Length = 585
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 46/92 (50%), Gaps = 14/92 (15%)
Query: 256 LVPGIDFCNHDLKAAATW-EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L+P ID NHD ++ W + DG GV L +V R +EI +YG KGNE
Sbjct: 227 LLPIIDLLNHDYRSKVEWNQRDGA---FGV---RKLETVLRG-----EEIFNNYGGKGNE 275
Query: 315 ELLYLYGFVIDNNPDDY--LMIHYPAEAIHSI 344
ELL YGFV++ N D L I P + I
Sbjct: 276 ELLSGYGFVLEENIFDTVALKIQLPLTTVSEI 307
>gi|323306935|gb|EGA60219.1| Rkm1p [Saccharomyces cerevisiae FostersO]
Length = 427
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 12/91 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD ++ W P + + + + +E+S +YG KGNEE
Sbjct: 75 LLPIVDLLNHDYRSKVKW----------YPENGWFCYEKIGTASQSRELSNNYGGKGNEE 124
Query: 316 LLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
LL YGF +++N D L + P + + +I
Sbjct: 125 LLSGYGFXLEDNIFDSVALKVKLPLDVVSTI 155
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 16/128 (12%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
QW + ++L + + G ++ + G LL VP +TP V
Sbjct: 20 QWAIRHGIQLHPAVSWFHATNGMIGCTATADICQGERLLYVPHSACVTPSGV-------- 71
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
G + + ++ L R NS + YL LP+ F +PL ++ DEL+ LKGT
Sbjct: 72 -------QGLYEPQVMLAASLVKHRTDPNSPFHDYLQSLPSEFDHPLEWSADELVCLKGT 124
Query: 133 TLYRATEL 140
T++ +L
Sbjct: 125 TVWEMHQL 132
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 291 LSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
L V RS H+ +E+ +SYG K N ELL YGF + NP D
Sbjct: 389 LPVARS-VHAGEEVFVSYGAKSNAELLLFYGFALPGNPYD 427
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 133/360 (36%), Gaps = 105/360 (29%)
Query: 34 SKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIGP---ECRAMFEDGEVDDRFLM 89
++G+G+ ++ + ++ + L +P + +T + + ++GP + R + + D +
Sbjct: 103 AEGYGLRATRDIKAEELFLWIPRKMLMT-VESAKKSVLGPLYNQDRIL----QAMDNVTL 157
Query: 90 ILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLY 149
L L ER S W PY+ LP + PL++ DE+ L+GT Q++L+ Y
Sbjct: 158 ALHLLCERANPASFWLPYIRTLPQEYDTPLFYEQDEVQLLQGTQAV-------QDVLSQY 210
Query: 150 DDKVKDLV--KKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAE 207
+ + KL+ S + +PL S+ F
Sbjct: 211 RNTARQYAYFYKLIQTHPASSK-----------LPLKDSFTF------------------ 241
Query: 208 LSNDHN-SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHD 266
+D+ + ++ N I E +G TL L+P D CNH
Sbjct: 242 --DDYRWAVSSVMTRQNQIPTE--------DGRQVTL-----------ALIPLWDMCNHR 280
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSS------FHSEKEISISYGNKGNEELLYLY 320
GLIT + Y L +R + ++I I YG + N E +
Sbjct: 281 -----------NGLIT----TGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHN 325
Query: 321 GFVIDNNPDDYLMIH-----------YPAE--AIHSIPLSDSKALLLEEQ--KAQLRCLL 365
GF N D + I AE A IP+S AL EQ AQL L
Sbjct: 326 GFFYQENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCNEQPISAQLLAFL 385
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 62/134 (46%), Gaps = 10/134 (7%)
Query: 5 TEAKL-EPFLQWLQVNKVELRGCKIKY-SDESKGFGIFSSNEFSDGVLLV-VPLDLAITP 61
TEA+L F++WL+ + CK+K + S+G G+ ++ + +G V +P +L IT
Sbjct: 65 TEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTGLVATQDIKEGEDFVEIPSNLFITT 124
Query: 62 MRVLQ---DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNP 118
Q P I R + + L+ +FL E S W PY+ +LP +
Sbjct: 125 AVAFQGLGKPPILENDRLI----QSIPGILLSIFLVKELSNPTSEWGPYIKLLPKQYNTV 180
Query: 119 LWFTDDELLELKGT 132
++ E + +G+
Sbjct: 181 YYWGLKEFTQFRGS 194
>gi|388851551|emb|CCF54741.1| uncharacterized protein [Ustilago hordei]
Length = 487
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 55/128 (42%), Gaps = 14/128 (10%)
Query: 256 LVPGIDFCNHDLKAAATW-EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L+P D NH TW + GTG G M +++ + ++ +YG K NE
Sbjct: 245 LIPAYDTFNHARAHPVTWTHIAGTGGEVG----MVEMTLNYETTEGGVQVWNNYGGKSNE 300
Query: 315 ELLYLYGFVIDNNPDDYLMIHYPAEAI----HSIPLSDSKALLLEEQK-----AQLRCLL 365
E L YGFV+D +D L + E + H + + L E++K + +C
Sbjct: 301 EFLSGYGFVLDTISEDTLALQLGGEGMLGKTHYWRIPPVPSALSEQEKEGSGFKRAKCPC 360
Query: 366 PKSLLEHG 373
P L E G
Sbjct: 361 PSLLEELG 368
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 42/59 (71%), Gaps = 4/59 (6%)
Query: 87 FLMILFLTVERLRKNSSWKPYLDMLPTTFGN-PLWFTDDELLELKGTTLYRATELQKQN 144
FLM+ LT R R ++ ++PY D+LPTT N P+++T++E+ L+G+ Y T+++++N
Sbjct: 181 FLMLFILTDMR-RPDTFFRPYYDLLPTTLSNMPIFWTEEEMRLLQGS--YLVTQVEERN 236
>gi|146419922|ref|XP_001485920.1| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
6260]
Length = 592
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 135/326 (41%), Gaps = 42/326 (12%)
Query: 12 FLQWLQVNKVEL-RGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLI 70
+QW + +EL G + + E+ +++N + + +P++LAIT L+
Sbjct: 5 LVQWAKTQGLELNEGIEFRGIGENNTGAFYTTNN-GEKPYIRLPVELAITVDSALRS--F 61
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPT--TFGNPLWFTDDELL 127
G + A+ + + + L L L ER R KNS+ K YL+ LPT P + +
Sbjct: 62 GQDLEALRDQCDSSNTVLK-LCLARERSRLKNSTIKKYLECLPTLQQMNTPYCWDAETKR 120
Query: 128 ELKGTTLYRATELQKQNLLTLYDD--KVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L+GT L + K+N+ L ++ K+ +L+ L V + N ++ +
Sbjct: 121 YLQGTNLGSSL---KENIGVLVEEWWKIINLLPDL-VQKPEQHFVNMKYYYESKFYTDDD 176
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLND-IKNEAQRVNSQVNGATSTLT 244
+Y + ED + S N S SR + D + + +R ++
Sbjct: 177 AYAYFVTNEDPANWTSFPNFLWASIILKSRSFPAYLIADAVDWDVKRHDTM--------- 227
Query: 245 STQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEI 304
L+P ID NH A W GL S ++ + S ++
Sbjct: 228 -----------LLPVIDLLNHLPSAHVEW-----GLERKESKSYFVF--KSDDVKSGSQL 269
Query: 305 SISYGNKGNEELLYLYGFVIDNNPDD 330
+YG KGNEELL YGF +++N D
Sbjct: 270 FNNYGMKGNEELLLAYGFCLEDNSSD 295
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPL---DLAIT-PM 62
AK+E F W + V G +I IF + G+ PL +L +T P
Sbjct: 118 AKVEAFSAWAKDGGVHSEGLEI---------AIFPGYQL--GLRATRPLAKEELVLTVPR 166
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWF 121
+++ +CR G++ + L +E++R S W+PY+D+LP + L+F
Sbjct: 167 KLIFSEENNSDCRLF---GKMPQATHWVYDLVIEKIRGEFSEWRPYIDILPAKYSTVLYF 223
Query: 122 TDDELLELKGTT 133
T ++ L+GT
Sbjct: 224 TIKQMERLRGTA 235
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 87/244 (35%), Gaps = 75/244 (30%)
Query: 105 KPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLD 164
KPY D+LPT P ++ E+ LKGT +Y L KQNL + +VK+ VL
Sbjct: 101 KPYFDILPTKLDQPYFWKLQEVELLKGTDIYL---LMKQNL--------RKIVKEWHVL- 148
Query: 165 GDSERANSIFWTRALNIPLPHSYVFPQN-----QEDLNKYDSINNSAELSNDHNSRG--E 217
L + P++ Q + +D + E H S
Sbjct: 149 ------------------LDQLKLKPEDGELYEQSEAQDFDILKYICEYREQHKSISWKS 190
Query: 218 LINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEG---------LVPGIDFCNH--D 266
+ L AT TS L +E L P +D NH D
Sbjct: 191 FVGYL---------------WATGIFTSRAFPKLILEEKCSSINEAFLYPLVDLLNHKND 235
Query: 267 LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDN 326
K T+ D ++ + +E+ +YG K NE+LL YGFV D
Sbjct: 236 TKVKWTFTNDNVCFVS------------QEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQ 283
Query: 327 NPDD 330
NP D
Sbjct: 284 NPYD 287
>gi|66806627|ref|XP_637036.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
gi|60465490|gb|EAL63575.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
Length = 532
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 75/185 (40%), Gaps = 47/185 (25%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP D NH + D V + ++ F + ++ ISYG N
Sbjct: 264 ALVPLADLFNHSSDVNTETKFDEKKQCYQV--------ITKTKFEKDSQVFISYGKHSNF 315
Query: 315 ELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA----LLLEEQKAQLRCLLPK-SL 369
L+ YGF+I+NN +D SIPL A +L +E K L+ K S+
Sbjct: 316 TLMNYYGFIIENNSND------------SIPLVQEDAIPDIILEKEMKQDLKSYERKMSI 363
Query: 370 LE-HGFFAAGHPKDGNNDNKLEVDRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAMQ 428
LE +G G N+ + +D+ FSW N+L+ L+ + M
Sbjct: 364 LEQYGLSVYGE----NSKFLVSMDKELPFSW-----------------NYLSILKVLYMT 402
Query: 429 EDEIS 433
++E++
Sbjct: 403 KEELN 407
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 47/103 (45%), Gaps = 17/103 (16%)
Query: 239 ATSTLTSTQGETLWIE------GLVPGIDFCNHD-LKAAATWEVDGTGLITGVPFSMYLL 291
A ST+ + Q E + LVP D CNHD L++ ++V S L+
Sbjct: 42 ACSTVMTRQNELPSLTPGRMQMALVPLWDMCNHDTLRSGTDYDV----------ASQQLV 91
Query: 292 SVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
S + ++++I YGN+ N + + GFV D N D L I
Sbjct: 92 SFATREYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAI 134
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 63/130 (48%), Gaps = 20/130 (15%)
Query: 12 FLQWLQVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPL 69
++W N G + + + +E GFG+ ++ E ++ + L VP L +T + ++ +
Sbjct: 89 LIKWAAENGASTDGFELVNFKEE--GFGLRATREIKAEELFLWVPRKLLMT-VESAKNSV 145
Query: 70 IGPECRAMFEDGEVDDRFL-------MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFT 122
+G A++ DR L + L ER +S W PY+ LP+ + PL+F
Sbjct: 146 LG----ALYSQ----DRILQAMGNITLAFHLLCERANPSSFWLPYIQTLPSEYDTPLYFE 197
Query: 123 DDELLELKGT 132
+DE+ L+ T
Sbjct: 198 EDEVQHLQST 207
>gi|426192525|gb|EKV42461.1| hypothetical protein AGABI2DRAFT_188618 [Agaricus bisporus var.
bisporus H97]
Length = 387
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 3/82 (3%)
Query: 256 LVPGIDFCNHDLKAAATWEV---DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
L+PG+D NH A +W V D +G + V + + +EI +YG K
Sbjct: 186 LLPGVDAFNHKRAQAVSWSVSYPDKSGSFASSYKGPTISLVPHTKTSAGEEIFNNYGPKP 245
Query: 313 NEELLYLYGFVIDNNPDDYLMI 334
N L+ YGF + NPDD +++
Sbjct: 246 NGSLILGYGFSLPANPDDTILL 267
>gi|405953717|gb|EKC21325.1| SET domain-containing protein 6 [Crassostrea gigas]
Length = 384
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 84/187 (44%), Gaps = 18/187 (9%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIG 71
FL+W N K+ + G + N G++ + + + R+ + L+
Sbjct: 33 FLKWFSSNDDNFFSGKVTIGPD----GSCAQN----GMVAIADVQEGESLFRISRKILLH 84
Query: 72 PECR---AMFEDGEVDDRF---LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDE 125
P+ A+FE V+ +++ + E K+S WKPY D+LP T P+++T++E
Sbjct: 85 PKSSSISALFEKDPVNSESGWSELLICMMQEYNTKDSPWKPYFDVLPETVDLPMFWTEEE 144
Query: 126 LLE-LKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSI-FWTRALNIPL 183
+ L GT + A + +LT + V +KK D SE + + + R ++ +
Sbjct: 145 REKLLTGTGVVEAVNRDNKKILTEFQSVVSPYLKKH--KDTISESCDDLELYKRMVSYVM 202
Query: 184 PHSYVFP 190
+S+ P
Sbjct: 203 AYSFTEP 209
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS-VERSSFHSEKEISISYGNKGNE 314
LVP ID NH +W V + LS V+ + + E+ +YG K N
Sbjct: 175 LVPLIDSLNHARAHPVSWSVSPAD------NGAHTLSIVQHAPVAAGAEVLNNYGPKPNA 228
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
EL+ YGF + +NPDD L++
Sbjct: 229 ELVLGYGFALPDNPDDTLVL 248
>gi|85099007|ref|XP_960703.1| hypothetical protein NCU06658 [Neurospora crassa OR74A]
gi|28922220|gb|EAA31467.1| predicted protein [Neurospora crassa OR74A]
gi|28950107|emb|CAD70887.1| conserved hypothetical protein [Neurospora crassa]
Length = 469
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KG 312
E +VP ID NH +A+A ++ + + LL SS +E++ISYG+ K
Sbjct: 197 ESMVPCIDMINHSTRASAYYDENAKDEVV-------LLPRPDSSISPGEEVTISYGDAKP 249
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ + L++ P E PL+ +K
Sbjct: 250 AAEMLFSYGFIDPEATVESLVL--PLEPFEDDPLAKAK 285
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 93/229 (40%), Gaps = 29/229 (12%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
+E W +N + K+ + E KGFG+ E V L+ VP L + V +
Sbjct: 8 IEALPAWALLNGITFPHVKVT-NIEGKGFGVVRDGELQPEVPLMTVPNSLVLNVQAVDEY 66
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
+ + G V ++ + ++ W Y+ LP T P +T+DE L
Sbjct: 67 AKEDKNFKQLL--GAVGHHLVLASKTHQAPVGVSNPWTEYIKFLPKTVLVPTLWTEDERL 124
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL----VL----DGDSERA-------NS 172
L+GT+L A K +T D V++ L VL DG+S + ++
Sbjct: 125 LLRGTSLESAVN-AKMTAITAEFDAVREAASSLPSWNDVLWPYEDGNSSASLRSWILLDA 183
Query: 173 IFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN--DHNSRGELI 219
++ +R L +P + P D IN+S S D N++ E++
Sbjct: 184 LYRSRVLELPKSGESMVPC-------IDMINHSTRASAYYDENAKDEVV 225
>gi|336264087|ref|XP_003346822.1| hypothetical protein SMAC_05080 [Sordaria macrospora k-hell]
gi|380090292|emb|CCC11868.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 528
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 71/161 (44%), Gaps = 32/161 (19%)
Query: 6 EAKLEPFLQWL--QVNKVELRGCK-IKYSDESKGFGIFSSNEF-SDGVLLVVP----LDL 57
E + + FL+W Q + K + D+S G GI ++ +D L +P L
Sbjct: 15 EQQTQKFLEWFTSQPGATFHKDIKLVDLRDKSAGRGIIATAPIPADTTLFTIPRSSILCA 74
Query: 58 AITPM--------RVLQDPLIG-------------PECRAMFED--GEVDDRFLMILFLT 94
A +P+ + +P + PE ED D L+IL L
Sbjct: 75 ATSPLASKLPALFKGPAEPTVAEESNDHTADDDDVPELVEELEDDPNAQDPWTLLILILV 134
Query: 95 VERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
E L+ ++S W PYL +LP F P+++T++EL EL+ + L
Sbjct: 135 HEYLQGSASPWYPYLSVLPEKFDTPMFWTNEELGELQASAL 175
>gi|328866266|gb|EGG14651.1| hypothetical protein DFA_10909 [Dictyostelium fasciculatum]
Length = 581
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 17/125 (13%)
Query: 30 YSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGE-----V 83
+ D G G+ + G LLV VPL L I + +L ++ P FE E +
Sbjct: 39 FGDSGGGRGVIAKRTIESGDLLVEVPLSLLIHSLPILS--VVPP-----FEHIETVLKLL 91
Query: 84 DDRFLMILFLTVERL-RKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYR--ATEL 140
D + + L ERL R S W YLD +P + + +TD E+ EL Y+ AT+L
Sbjct: 92 DSKQTICFQLIYERLIRNRSRWYGYLDCIPKEYNTTVSYTDAEIGEL-SYPYYKNEATKL 150
Query: 141 QKQNL 145
+K+ L
Sbjct: 151 RKEML 155
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 75/185 (40%), Gaps = 48/185 (25%)
Query: 256 LVPGIDFCNHD--LKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
LVP D NH+ +K A++ + +Y + F +++ ISYG N
Sbjct: 357 LVPLADLFNHNPNVKTMASY------CAADRCYRVY----TDTRFEKGEQVFISYGLHNN 406
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLL--EEQKAQLRCLLPKS--L 369
LL+ YGFVIDNN D + I A PL S L +EQ+ R + K L
Sbjct: 407 ATLLHYYGFVIDNNHLDGIEIDSEASL---PPLRPSAFYHLPAKEQREYERLVERKENIL 463
Query: 370 LEHGFFAAGHPKDGNNDNKLEV--DRISSFSWSGQRRMPSYLNKLVFPENFLTALRTIAM 427
+++G + K EV D F+W N+LT LR + M
Sbjct: 464 IQNGL----------SQGKYEVVNDPTMPFTW-----------------NYLTTLRVMMM 496
Query: 428 QEDEI 432
+ EI
Sbjct: 497 TKSEI 501
>gi|322701166|gb|EFY92917.1| SET domain protein, putative [Metarhizium acridum CQMa 102]
Length = 430
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 8/74 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGN 313
+VPG+D NH KA A +E D + LL S +E +ISYG+ K
Sbjct: 151 AMVPGLDMVNHSSKATAYYEEDDNDHVV-------LLIRPGCQVRSGEEATISYGDAKPA 203
Query: 314 EELLYLYGFVIDNN 327
E+L+ YGF+ NN
Sbjct: 204 SEMLFSYGFIDPNN 217
>gi|209730352|gb|ACI66045.1| SET domain-containing protein 3 [Salmo salar]
Length = 280
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 10/119 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
+ W Q N G +I + +GFG+ ++ + ++ + L +P + +T + ++ ++
Sbjct: 82 LMAWAQENGASCEGVEIAHF-AGEGFGLRATKDIKAEELFLWIPRTMLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
GP + R + G V + L L ER +S W PY+ LP + PL++ ++E+
Sbjct: 140 GPLYSQDRILQAMGNV----TLALHLLCERADPSSPWLPYIKTLPGDYDTPLYYEEEEV 194
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 69/144 (47%), Gaps = 26/144 (18%)
Query: 7 AKLEPFLQWLQV-------NKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLA 58
A+ FLQW + + +E+ + G GI + + +D L +P D
Sbjct: 659 ARTAAFLQWFRALPGATFSDAIEI----VDLRSRDAGRGIVALRDIPADTTLFTIPRDAI 714
Query: 59 I-TPMRVLQDPLIGPECRAMFE-DGEVDDR------FLMILFLTVER-LRKNSSWKPYLD 109
I + L++ L PE +FE G+ D++ +IL + E L S WKPY+D
Sbjct: 715 INSDTSSLREKL--PE---LFESQGDEDEQQALDSWSALILIMMYEFFLGHQSKWKPYID 769
Query: 110 MLPTTFGNPLWFTDDELLELKGTT 133
+LP TF P++++++EL L+ +
Sbjct: 770 VLPLTFDTPMFWSEEELSYLQASA 793
>gi|410083884|ref|XP_003959519.1| hypothetical protein KAFR_0K00290 [Kazachstania africana CBS 2517]
gi|372466111|emb|CCF60384.1| hypothetical protein KAFR_0K00290 [Kazachstania africana CBS 2517]
Length = 598
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 52/118 (44%), Gaps = 22/118 (18%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NHD +A A W + + + + E+ +YG KGNEE
Sbjct: 229 LLPIVDLLNHDYQAKAEWSSNED--------RSFCYKNLNENLKAGDEVLNNYGAKGNEE 280
Query: 316 LLYLYGFVIDNN-----------PDDYL--MIHYPAEAIHSIPLSDSKALLLEEQKAQ 360
LL YGFV++NN PDD + ++ Y + I + + D E + AQ
Sbjct: 281 LLSGYGFVLENNICDSLLLKLKLPDDTIRDILDYEPD-IELLTIDDYTTFAFETEMAQ 337
>gi|212532027|ref|XP_002146170.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210071534|gb|EEA25623.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 497
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 90/210 (42%), Gaps = 36/210 (17%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG--------VLLVVPLDLAITPMRVLQ 66
W Q+N ++ + K + G + E+S +L+ VP D+ ++ V +
Sbjct: 15 WAQLNGIKFHDIEFKKLEYGSGIVAKTDKEYSSAQEAAEKPEILMTVPPDMVLSLDLVHE 74
Query: 67 DPLIGPECRAMFE---DGEVDDRFLMILFL---------TVERLRKNSSWKPYLDMLPTT 114
P RA+ E D R +++FL T ++ + W Y+ LP+
Sbjct: 75 FAKSDPYLRAVLEASGDFGWTARGAILIFLLCHITYASNTHAKIGVQNPWSEYIKFLPSE 134
Query: 115 FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKL-------------- 160
P +T+DEL+ L GT+L A + K + L D+++D + +
Sbjct: 135 TLLPTLWTEDELVLLYGTSLKDAVD-HKLSALEAEFDRLRDATRSIAWCEREWWDEETGQ 193
Query: 161 LVLDGDSERANSIFWTRALNIPLPHSYVFP 190
L LD D + ++++ +RAL++P + P
Sbjct: 194 LTLD-DWKIVDAMYRSRALDLPGSGHVMVP 222
>gi|21594028|gb|AAM65946.1| unknown [Arabidopsis thaliana]
Length = 471
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 53/254 (20%), Positives = 94/254 (37%), Gaps = 69/254 (27%)
Query: 91 LFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
L L ER +S W PY+ LP T+ P++F +++ L+ L
Sbjct: 112 LRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLH-------------- 157
Query: 151 DKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN 210
+V + + LL + + R ED+ ++
Sbjct: 158 -QVNKICRFLLEFEQEIRR----------------------TLEDVK-----------AS 183
Query: 211 DHNSRGELING------LNDIKNEAQRV--NSQVNGATSTLTSTQGETLWIEGLVPGIDF 262
DH G+ +N ++ + A R+ N ++ G +S + ++P ID
Sbjct: 184 DHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDD---------VPMMLPLIDM 234
Query: 263 CNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGF 322
CNH K A + G + + V + + ++YG N+ L YGF
Sbjct: 235 CNHSFKPNARIIQEQNGADSNTLVKV----VAETEVKENDPLLLNYGCLSNDFFLLDYGF 290
Query: 323 VIDNNPDDYLMIHY 336
VI++NP D + + Y
Sbjct: 291 VIESNPYDTIELKY 304
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
E L+P D NHD AA+ + T + +L ER + + +++ ISYG K +
Sbjct: 243 EALLPWADLLNHDCAAASFLDWSAT-------EAAVVLRAERR-YRAGEQLLISYGQKTS 294
Query: 314 EELLYLYGFVID--NNPDD 330
ELL YGF D +NP D
Sbjct: 295 GELLLSYGFCPDLGSNPHD 313
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 54/120 (45%), Gaps = 5/120 (4%)
Query: 33 ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMIL 91
+ G G+ +S G + L +P L I+ + Q + A+ + + +++L
Sbjct: 176 QGAGRGMIASESIGVGDIALEIPEFLIISDELLCQSEVF----LALKDFNNITSETMLLL 231
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
+ ER S +KPY D LP F L F D L L+GT L+ +Q+L YD+
Sbjct: 232 WSMRERYNLGSKFKPYFDTLPANFNTGLSFGIDALAALEGTLLFDEIIQARQHLRQQYDE 291
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 25/147 (17%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFG--IFSSNEFS-DGVLLVVPLDLAITPMRVLQDP 68
F +WL+ N V+ K++ +D G G +++ D +L +PL +A V + P
Sbjct: 552 FSKWLRRNGVDDSKVKLR-ADGGHGMGNSLYARQMIKEDELLFRIPLKIAFYSDAVRRHP 610
Query: 69 LIGPECR-AMFEDGEVDDRFLMILFL-------------------TVERLRKNSS-WKPY 107
+G + A G + FL+ L L T +L +S W PY
Sbjct: 611 TLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHMETGCKLSNETSFWLPY 670
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTL 134
+ +LP TF P+++ + E ELKG+ +
Sbjct: 671 IKILPKTFSAPIFWNEVERQELKGSQV 697
>gi|428183877|gb|EKX52734.1| hypothetical protein GUITHDRAFT_101885 [Guillardia theta CCMP2712]
Length = 451
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L+P ID CNHD + ++ T L+ + +L + + + +E+ I+YG N+
Sbjct: 247 ALLPIIDSCNHDGR------INKTDLVFNPLSNELILRNGQGTLKAGEEVRITYGTLDND 300
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
ELL +GFV DN D + I
Sbjct: 301 ELLQRFGFVEDNCLHDKVKI 320
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 26/42 (61%)
Query: 293 VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
V F + + ISYG K N ELL YGF++ +NP+D+ +I
Sbjct: 219 VAEKDFKVGQSVEISYGLKSNHELLLSYGFILPDNPEDFFVI 260
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 129/332 (38%), Gaps = 72/332 (21%)
Query: 40 FSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR 99
S N V+L +P D +T + + P++ P + E E + + L+L E+ +
Sbjct: 121 LSKNVEGGDVILSIPQDNCVTAVDAKEHPIVAP----LIE--EKPELVQLALWLCCEKAK 174
Query: 100 -KNSSWKPYLDMLPTTFGNP---LWFTDDELLE-LKGTTLYRATELQKQNLLTLYDDKVK 154
K S W PYL L GNP L FT++E E LKGT++ + ++ D K
Sbjct: 175 AKGSEWWPYLKTL---NGNPNSVLRFTEEEFKELLKGTSIDKEARQRR--------DSAK 223
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNS 214
+ + L RA P Y L+ Y + SA
Sbjct: 224 EEYEAL----------------RAAIAEDPGKY-------PLDVYAFLTESA-------- 252
Query: 215 RGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGID---------FCNH 265
I+ L+ + AQ +NS A L + + I G P + +
Sbjct: 253 ---FIDALDIVCARAQWLNSANCYAMVPLM----DAIPICGAPPPVSPEDPSFARFYEIR 305
Query: 266 DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVID 325
D+K T G S+ L + R+S S K + I + + N EL +G V D
Sbjct: 306 DIKTGLTAVRCGYADYDVDSASVVLCANTRASAGS-KILQIDHSVRNNSELYLSFGDVDD 364
Query: 326 NNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQ 357
+P DY ++P E + PL +K +LE Q
Sbjct: 365 QHPGDYE--YWPTELSENDPLYAAKKSVLEAQ 394
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 93 LTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
L VE+ R +S WKPY+D+LP+ + L+FT +++ L+GT++
Sbjct: 163 LVVEKSRGADSIWKPYIDVLPSRYNTVLYFTVEQMRRLRGTSV 205
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 3/80 (3%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP D NH +A + + GV + ++ V+R + +++ +SYG + N+
Sbjct: 277 ALVPYADLLNHSPFCSAFIDEEKIPFGNGV--TEAVVYVDRL-YEPYEQVYVSYGPRSNQ 333
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
ELL LYGF ++ NP D + I
Sbjct: 334 ELLLLYGFSLERNPFDCVEI 353
>gi|409042840|gb|EKM52323.1| hypothetical protein PHACADRAFT_126051 [Phanerochaete carnosa
HHB-10118-sp]
Length = 435
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 4/94 (4%)
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFH-S 300
TL ST G L+PG+D NH +W V + S +S+ + +
Sbjct: 208 TLVSTPGS---YPVLLPGVDSLNHARAQPVSWVVSNPQPSAPLSDSEPSISLALHTLTPA 264
Query: 301 EKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
E+ +YG K N EL+ YGF + NNPDD +++
Sbjct: 265 GAELLNNYGPKPNAELILGYGFTLPNNPDDTIVL 298
>gi|118357514|ref|XP_001012006.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila]
gi|89293773|gb|EAR91761.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila
SB210]
Length = 454
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 82/203 (40%), Gaps = 20/203 (9%)
Query: 198 KYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLV 257
++D+I N + D+ + G+N+ E ++ + + T + Q T + ++
Sbjct: 145 EFDAIKNYISSNKDY---SHIFEGINEA--EFLQLVAMIESRTLFFKNEQDSTSEVGAMI 199
Query: 258 PGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELL 317
P D NH +D S + F +E++I I+YGN NE L
Sbjct: 200 PFYDLANHTFMEG----IDHFKYFYFDQISKEYVMRAYKHFVAEEQIFITYGNYNNEHFL 255
Query: 318 YLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAA 377
YGF+ NN + L ++ + I D L E L+ + +L H F A
Sbjct: 256 DYYGFIPFNNQREALQFTIQSDQLPKILGKDFTKKLRE------NSLIGQQILNH--FYA 307
Query: 378 GHPKDGNNDNKLEVDRIS-SFSW 399
G N +E++ S SFSW
Sbjct: 308 GQNLKPN--IVIEIEEASQSFSW 328
>gi|350636529|gb|EHA24889.1| hypothetical protein ASPNIDRAFT_40813 [Aspergillus niger ATCC 1015]
Length = 437
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
G+VP D+ NH AA DG Y R + +E+ +SYGN N+
Sbjct: 236 GMVPFADYFNHVDDAACEVNFDG---------KKYTFRATRR-YEKGEEVYMSYGNHSND 285
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
LL YGF + NP D + +
Sbjct: 286 FLLIEYGFTLSTNPSDCIYL 305
>gi|336384741|gb|EGO25889.1| hypothetical protein SERLADRAFT_437599 [Serpula lacrymans var.
lacrymans S7.9]
Length = 499
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 7/130 (5%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSD-ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQ 66
+E F+ W Q + + + D +G G + ++ +G L +P L ++
Sbjct: 1 MESFITWFQSHGGFIDSSVMDVVDFPGQGRGAIAVSDIPEGHTLFTLPRSLTLSTRTSYL 60
Query: 67 DPLIGPECRAMFEDGEVDDRF--LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDD 124
+G E F+ +D+ + L++ + E S W YL LP++F P++++ +
Sbjct: 61 PSNMGAESWKKFK---LDEGWAGLILCMMWEEAQESKSKWSEYLASLPSSFTTPMFWSSE 117
Query: 125 ELLELKGTTL 134
+L EL+GT +
Sbjct: 118 DLFELRGTAV 127
>gi|118356416|ref|XP_001011464.1| hypothetical protein TTHERM_00782080 [Tetrahymena thermophila]
gi|89293231|gb|EAR91219.1| hypothetical protein TTHERM_00782080 [Tetrahymena thermophila
SB210]
Length = 673
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 44/202 (21%), Positives = 90/202 (44%), Gaps = 14/202 (6%)
Query: 2 EISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF--SDGVLLVVPLDLAI 59
EI + K + +L+WL+ N + + + G+ ++ E + V++ +P L +
Sbjct: 16 EIQKDKKYQVYLEWLKKNGALFENIEFPVAFGNGGYVGVAAKERIPPNKVIVAIPNKLLL 75
Query: 60 TPMRVLQDPL--IGPECRAMFE-DGEVDDRF-LMILFLTVERLRKNSS-WKPYLDMLPTT 114
+ V PL + + +F D D F + L+L E+++ + S W PYL++ PT
Sbjct: 76 STGIVDSSPLKPVLQQNPHLFNIDQNYDADFNKLTLYLMTEKVKADKSFWAPYLNISPTQ 135
Query: 115 FGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
F W TD E+ + +++ + +Q++ + + +K +V ++ D +
Sbjct: 136 FTLLDW-TDKEVENIGDPYMFKIFKEYRQSMEQTWKEFLKVIVNYPNIISTDCCNKKLFY 194
Query: 175 W------TRALNIPLPHSYVFP 190
W TR PH+ + P
Sbjct: 195 WSYQFVTTRCYGWNFPHTLLVP 216
>gi|403217882|emb|CCK72375.1| hypothetical protein KNAG_0J02970 [Kazachstania naganishii CBS
8797]
Length = 581
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 91/231 (39%), Gaps = 43/231 (18%)
Query: 102 SSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL 161
SS KPYLD+LP P ++ +L LKG L A NL+ ++++ K LL
Sbjct: 102 SSLKPYLDILPLHLNQPYFWHRSQLDLLKGADLILAVH---ANLVEIWNE-----WKNLL 153
Query: 162 VLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINN--SAELSNDHNSRGELI 219
G + A + DL+ +D + + S+ + H+
Sbjct: 154 TEFGIEQEAKTC---------------------DLDSFDKLVDYISSNIDRLHSGETAWT 192
Query: 220 NGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTG 279
+ + + + A + + E L L+P +D NH W G
Sbjct: 193 SFIAYVWSSAI---FRSRAFPELMLEENVENLQTAFLLPVVDLLNHRNGTKVKWTF-SDG 248
Query: 280 LITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
+ + E ++++E+ +YG++ NEELL YGFV +NN D
Sbjct: 249 KVNFIS--------EAKEINAKEELFNNYGDRSNEELLLAYGFVEENNVHD 291
>gi|317035930|ref|XP_001397212.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
Length = 434
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
G+VP D+ NH AA DG Y R + +E+ +SYGN N+
Sbjct: 225 GMVPFADYFNHVDDAACEVNFDG---------KKYTFRATRR-YEKGEEVYMSYGNHSND 274
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
LL YGF + NP D + +
Sbjct: 275 FLLIEYGFTLSTNPSDCIYL 294
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 54/120 (45%), Gaps = 5/120 (4%)
Query: 33 ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMIL 91
+ G G+ +S G + L +P L I+ + Q + ++ + + +++L
Sbjct: 145 QGAGRGMIASESIGVGDIALEIPESLIISDELLCQSEVF----LSLKDFNNITSETMLLL 200
Query: 92 FLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
+ ER S +KPY D LP F L F D L L+GT L+ +Q+L YD+
Sbjct: 201 WSMRERYNLGSKFKPYFDTLPANFNTGLSFGIDALAALEGTLLFDEIIQARQHLRQQYDE 260
>gi|363749087|ref|XP_003644761.1| hypothetical protein Ecym_2195 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888394|gb|AET37944.1| Hypothetical protein Ecym_2195 [Eremothecium cymbalariae
DBVPG#7215]
Length = 589
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 38/79 (48%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH + W D GV L VE EI +YG K NEE
Sbjct: 230 LLPILDLLNHSQHSKIEWAGDN-----GVFSFRKLEPVEVGD-----EIFNNYGGKSNEE 279
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGFVI++N DYL +
Sbjct: 280 LLVGYGFVIEDNKCDYLAL 298
>gi|336371990|gb|EGO00330.1| hypothetical protein SERLA73DRAFT_89272 [Serpula lacrymans var.
lacrymans S7.3]
Length = 499
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 7/130 (5%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSD-ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQ 66
+E F+ W Q + + + D +G G + ++ +G L +P L ++
Sbjct: 1 MESFITWFQSHGGFIDSSVMDVVDFPGQGRGAIAVSDIPEGHTLFTLPRSLTLSTRTSYL 60
Query: 67 DPLIGPECRAMFEDGEVDDRF--LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDD 124
+G E F+ +D+ + L++ + E S W YL LP++F P++++ +
Sbjct: 61 PSNMGAESWKKFK---LDEGWAGLILCMMWEEAQESKSKWSEYLASLPSSFTTPMFWSSE 117
Query: 125 ELLELKGTTL 134
+L EL+GT +
Sbjct: 118 DLFELRGTAV 127
>gi|71004942|ref|XP_757137.1| hypothetical protein UM00990.1 [Ustilago maydis 521]
gi|46096767|gb|EAK82000.1| hypothetical protein UM00990.1 [Ustilago maydis 521]
Length = 479
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 86/228 (37%), Gaps = 29/228 (12%)
Query: 107 YLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGD 166
Y+ +P+ PL F+ EL L+ T L +TE + + +T Y+ + L+ +L +
Sbjct: 119 YVSHIPSVIETPLHFSPTELDLLRNTPLVGSTERRLRETITDYERATRVLLAQLHSVHSS 178
Query: 167 SERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIK 226
P + + + + + K D+++ A + + + EL
Sbjct: 179 -----------------PFTSLLTETLQPIPK-DALHADALMQHTWHHGLELWRW----- 215
Query: 227 NEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
E+ + L QGE L+PG D NH A TW
Sbjct: 216 AESAFTSRSFPPRLIGLHHDQGEA---PILIPGYDIFNHARAHAVTWSFSRPTEQAECGS 272
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
L+ E H ++ +YG K NEE L Y F +D +D L +
Sbjct: 273 VEMTLNYEVPGAH---QVYNNYGGKSNEEFLSSYAFTLDETHEDTLAL 317
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
QW ++L + + G ++ + G LL VP +TP V
Sbjct: 20 QWAIRQGIQLHPAVSWFHATNGMIGCTATADICQGERLLFVPHSACVTPSGV-------- 71
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
G + + ++ L R NS + YL LP+ F +PL ++ DEL+ LKGT
Sbjct: 72 -------QGLYEPQVMLAASLVKHRTDPNSPFHDYLQSLPSEFEHPLEWSADELVCLKGT 124
Query: 133 TLYRATEL 140
T++ +L
Sbjct: 125 TVWEMHQL 132
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 70/327 (21%), Positives = 127/327 (38%), Gaps = 60/327 (18%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WLQ + K++ +D G G+ + + +G L +P + +T
Sbjct: 20 FMSWLQASPGVQLNPKLRLADLRATGAGRGVVAQSNIVEGEELFSIPRTMVLT------- 72
Query: 68 PLIGPECRAMFEDG---EVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTD 123
+ E R + + ++ ++L + E L+ + S W PY +LP+ F ++++
Sbjct: 73 -VQNSELRTLLAENLEEQMGPWLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSP 131
Query: 124 DELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPL 183
EL EL+ +T+ ++ + N ++ ++D + +L D
Sbjct: 132 AELQELQASTIVE--KIGRSN----AEESIRDSIAPILAKRPD----------------- 168
Query: 184 PHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DI-KNEAQRVNSQVNGATS 241
+FP L ++ I A L + G LI DI K E +VN S
Sbjct: 169 ----LFPP-PPGLASWEGIAGDAALIQVGHVMGSLIMAYAFDIEKAEDDDDEGEVND-ES 222
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFH 299
+T + E +G+VP D N D A ++ +G ++ +
Sbjct: 223 YMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEGALVMKAI-----------KPIQ 271
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDN 326
EI YG +LL YG+V DN
Sbjct: 272 KGDEIFNDYGEIPRADLLRRYGYVTDN 298
>gi|440639458|gb|ELR09377.1| hypothetical protein GMDG_03941 [Geomyces destructans 20631-21]
Length = 485
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 34/150 (22%), Positives = 63/150 (42%), Gaps = 18/150 (12%)
Query: 15 WLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-------VLLVVPLDLAITPMRVLQD 67
W + N +E ++ + E G+G+ S+ + +L VP DL ++ + +
Sbjct: 14 WAKFNGIEFEDISVEKNKEYGGYGVVSTTKIDSAPEQEGNLTVLNVPKDLILSAETIAEH 73
Query: 68 PLIGPECRAMFED-GEVDDRFLMILFLTVERLRKNSS----------WKPYLDMLPTTFG 116
+ + E G R ++LFL ++ R +S W Y+ MLP
Sbjct: 74 AKVDKHFGQILEAVGGSTLRGDVMLFLLMQVTRASSDPSIKFSVRGPWTEYVKMLPEYIS 133
Query: 117 NPLWFTDDELLELKGTTLYRATELQKQNLL 146
P + DD++ L GT+L +A + L+
Sbjct: 134 LPTAWHDDQINLLNGTSLEKAVAAKVSALV 163
>gi|198470241|ref|XP_001355267.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
gi|198145358|gb|EAL32324.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
Length = 568
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 66/133 (49%), Gaps = 11/133 (8%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPM 62
S +AK+ F +W + V+ +I + G+ ++ + + + +L VP L +
Sbjct: 131 SRQAKVTAFSEWAKAGGVKTDCLEIAIFPGYQ-LGLRATQDIAAEQPVLSVPRTLIFSEE 189
Query: 63 RVLQDPLIGPEC-RAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLW 120
+ PE R +F + + F + L +E++R +S W+PY+D+LP + L+
Sbjct: 190 HL-------PETDRKLFCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLY 242
Query: 121 FTDDELLELKGTT 133
F+ +++ L+GT
Sbjct: 243 FSIEQMQRLRGTA 255
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 62/132 (46%), Gaps = 19/132 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAI----TPMR 63
F+ WL+ KIK +D E G GI + ++ D L +P +L + + ++
Sbjct: 20 FMFWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQNSRLK 79
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFT 122
L D F + + D +I+ + E L+ S+W Y +LPT F +++T
Sbjct: 80 DLLD----------FNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWT 129
Query: 123 DDELLELKGTTL 134
D+EL EL G+ +
Sbjct: 130 DEELRELSGSAV 141
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 69/142 (48%), Gaps = 25/142 (17%)
Query: 10 EPFLQWLQ----------VNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLA 58
E FLQW + + V+LR D + G GI + + ++ L +P +
Sbjct: 11 EKFLQWFKSLPGSTFSDDIKIVDLR-------DRNAGRGIIALRDIPAETTLFTIPRKGS 63
Query: 59 ITPMRVLQDPLIGPECRAMFEDGEVDDRFL-----MILFLTVERLRKNSS-WKPYLDMLP 112
I + + P P+ + + E D L +IL + E LR +SS WK Y D+LP
Sbjct: 64 IN-IETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIMIYEYLRGDSSQWKSYFDVLP 122
Query: 113 TTFGNPLWFTDDELLELKGTTL 134
++F P++++++EL +L+ + +
Sbjct: 123 SSFDTPMFWSENELDQLQASHM 144
>gi|356524495|ref|XP_003530864.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 472
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 9/110 (8%)
Query: 24 RGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQDP-LIGPECRAMFEDG 81
R KI D S G G+ + E G L+V+P L + + QDP L+ R + E+
Sbjct: 47 RAVKITQLDSSNGLGLVAKEEIPRGSDLIVLPHHLPLRFTSLQQDPSLLHHLARQVPEE- 105
Query: 82 EVDDRFLMILFLTV--ERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLEL 129
+ M L L + ER + S W PY+ LP T+ P++F+ +++ L
Sbjct: 106 ----LWAMKLGLKLLQERAKVGSFWWPYISNLPETYTVPIFFSGEDIKNL 151
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 62/124 (50%), Gaps = 11/124 (8%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIG 71
++W N G ++ +E GFG+ ++ E ++ + L VP L +T + ++ ++G
Sbjct: 83 MKWATANGASTEGFELVNFEE--GFGLRATREIKAEELFLWVPRKLLMT-VESAKNSVLG 139
Query: 72 P---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
+ R + G + F ++ ER +S W PY+ LP+ + PL+F +DE+
Sbjct: 140 SLYSQDRILQAMGNITLAFHLL----CERANPSSFWLPYIQTLPSEYDTPLYFEEDEVQY 195
Query: 129 LKGT 132
L+ T
Sbjct: 196 LQST 199
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 62/132 (46%), Gaps = 19/132 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFS-DGVLLVVPLDLAI----TPMR 63
F+ WL+ KIK +D E G GI + ++ D L +P +L + + ++
Sbjct: 20 FMFWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQNSRLK 79
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFT 122
L D F + + D +I+ + E L+ S+W Y +LPT F +++T
Sbjct: 80 DLLD----------FNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWT 129
Query: 123 DDELLELKGTTL 134
D+EL EL G+ +
Sbjct: 130 DEELRELSGSAV 141
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 256 LVPGIDFCNHDLKAAATWEVDG-TGLITGVPFSMYLLSVE---RSSFHSEKEISISYGNK 311
L+PG+D NH +W V G +G T L+V S E+ +YG K
Sbjct: 240 LLPGVDSLNHARAQPVSWVVSGISGGDTNTDTESSDLAVSLLLHSPTPRGAELLNNYGPK 299
Query: 312 GNEELLYLYGFVIDNNPDDYLMI 334
N EL+ YGF + +NPDD +++
Sbjct: 300 PNAELVLGYGFALPSNPDDTIVL 322
>gi|195168946|ref|XP_002025291.1| GL13316 [Drosophila persimilis]
gi|194108747|gb|EDW30790.1| GL13316 [Drosophila persimilis]
Length = 568
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 66/133 (49%), Gaps = 11/133 (8%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPM 62
S +AK+ F +W + V+ +I + G+ ++ + + + +L VP L +
Sbjct: 131 SRQAKVTAFSEWAKAGGVKTDCLEIAIFPGYQ-LGLRATQDIAAEQPVLSVPRTLIFSEE 189
Query: 63 RVLQDPLIGPEC-RAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLW 120
+ PE R +F + + F + L +E++R +S W+PY+D+LP + L+
Sbjct: 190 HL-------PETDRKLFCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLY 242
Query: 121 FTDDELLELKGTT 133
F+ +++ L+GT
Sbjct: 243 FSIEQMQRLRGTA 255
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+IL L E L+ + S WKPY D+LP++F P+++++ EL +L+ + +
Sbjct: 667 LILVLMYEYLQGEKSQWKPYFDVLPSSFDTPMFWSESELDQLQASHM 713
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 4/76 (5%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP DF NH+ + + MY ++ +I +YG K N
Sbjct: 249 ALVPYADFINHNPFSTSYINSKKIAFSENNEIVMY----ADKDYNKFDQIFTTYGQKTNL 304
Query: 315 ELLYLYGFVIDNNPDD 330
ELL LYGF+I+ NP D
Sbjct: 305 ELLVLYGFIIERNPFD 320
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 65/129 (50%), Gaps = 11/129 (8%)
Query: 30 YSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFL 88
Y + +G G+ +S + D VL +P + + V DP + + E +++
Sbjct: 33 YRSDHQGRGVIASEDIEEDEVLFKIPRS---SFLSVENDPDF---IKQVPEAKKLNSWLQ 86
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTL 148
+IL++ + + WKPY D+LPT + + +TDDEL LKG+ + + ++ K
Sbjct: 87 LILYMM--KAGSMTKWKPYFDVLPTQLDSLMMWTDDELEGLKGSMIVK--KIGKAGAEED 142
Query: 149 YDDKVKDLV 157
Y +K+K ++
Sbjct: 143 YQEKLKPII 151
>gi|358374896|dbj|GAA91484.1| ribosomal N-lysine methyltransferase [Aspergillus kawachii IFO
4308]
Length = 445
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
G+VP D+ NH AA DG Y R + +E+ +SYGN N+
Sbjct: 236 GMVPFADYFNHVDDAACDVNFDG---------KKYTFRATRR-YEKGEEVYMSYGNHSND 285
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
LL YGF + NP D + +
Sbjct: 286 FLLVEYGFTLPTNPSDSIYL 305
>gi|170090678|ref|XP_001876561.1| SET domain protein [Laccaria bicolor S238N-H82]
gi|164648054|gb|EDR12297.1| SET domain protein [Laccaria bicolor S238N-H82]
Length = 434
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 133/335 (39%), Gaps = 71/335 (21%)
Query: 26 CKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRV---LQDPLIGPE--CRAMFE 79
++++S+ + G + +S + D ++ P DLAIT L + L E +
Sbjct: 24 TRVRFSEATSGSRVVASEDIPEDTEIVSCPFDLAITRQHAQLALGNFLDTSEIISSTHWS 83
Query: 80 DGEVDDRFLMILFLTVERLRKNS---SWKPYLDMLPTT--FGNPLWFTDDELLELKGTTL 134
+ + +++ ++ + + N + PYL LP++ PL FT +E+ KGT L
Sbjct: 84 ERQWISTYIIFHWIICDESQPNLKILAHYPYLKTLPSSEKLLTPLHFTSEEIQLFKGTNL 143
Query: 135 YRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLP-HSYVFPQNQ 193
Y AT ++++ T + D + V + + S W+ + HS F ++
Sbjct: 144 YGATLDRERDWRTEWADCLDR------VSQANQDWGKSFTWSATFSESTAYHSLSFFYDR 197
Query: 194 EDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWI 253
E +Y + +A + L++ +K+ S LT
Sbjct: 198 E---RYLT---AATYLSSRAFPSSLLSPTPSLKH-------------SPLTEPV------ 232
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI------- 306
L+PGID NH +W V Y +V S E +IS+
Sbjct: 233 --LLPGIDSLNHARGQPVSWVV------------RYPNNVPPDSSLQEPKISLILHTPAT 278
Query: 307 -------SYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+YG K N EL+ YGF + NPDD +++
Sbjct: 279 GGNELFNNYGPKPNSELILGYGFSLPQNPDDTILL 313
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 62/135 (45%), Gaps = 9/135 (6%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
QWL+ + + S G +F+S G +L VP + IT +L P
Sbjct: 38 QWLERKAACSISSSLFIGNSSYGRSLFASKIIQTGDCILKVPYRVQITADNLL------P 91
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTT--FGNPLWFTDDELLELK 130
E R++ + + L + L ++L + S W PY+ LP N +++T+ EL ++
Sbjct: 92 EIRSLIGEEVGNIAKLATVILIEKKLGQGSEWYPYISCLPQQGELHNTVFWTESELEMIR 151
Query: 131 GTTLYRATELQKQNL 145
+++Y+ T QK +
Sbjct: 152 PSSVYQETIDQKSQI 166
>gi|213406233|ref|XP_002173888.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
gi|212001935|gb|EEB07595.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
Length = 431
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 11/71 (15%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KG 312
E +VP +D CNH + A W T S+ L++ + S S++EI + YGN KG
Sbjct: 194 EAIVPVVDICNHSHNSNAKWRFGET--------SVELVATQHIS--SKEEILLCYGNTKG 243
Query: 313 NEELLYLYGFV 323
E L+ YGF+
Sbjct: 244 AAEYLFSYGFL 254
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 35/142 (24%), Positives = 67/142 (47%), Gaps = 9/142 (6%)
Query: 1 MEISTEAKLEPFLQWLQ--VNKVELRGCKIKYSDESKGFGIFSS-NEFSDGVLLVVPLDL 57
M + + E +L W Q + V ++ + + G+F N ++ +++ VP +
Sbjct: 1 MAEGSNSPAEGYLDWAQEELGVVVHHPLRVSSTPGKEERGVFCEENIPAETIVVSVPWEA 60
Query: 58 AITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFG 116
+T D G + E G +D L +L L L++ S K ++D+LP +
Sbjct: 61 LMT-----VDSAKGTPFEGLMEAGAREDDVLCLLLLYHRHILKERSPLKGHMDVLPREYH 115
Query: 117 NPLWFTDDELLELKGTTLYRAT 138
++++DDEL L+GT+L+ T
Sbjct: 116 QTIFYSDDELELLRGTSLHAVT 137
>gi|406868331|gb|EKD21368.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 480
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 73/164 (44%), Gaps = 17/164 (10%)
Query: 5 TEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSS------NEFSDGVLLVVPLDLA 58
T+ + W ++N V+ ++ SKG G+ +S N F LLVVP +L
Sbjct: 4 TQLPISALPAWSKLNAVDFLDISVQDLGSSKGLGLVTSRALSSKNTFDIPTLLVVPNELV 63
Query: 59 ITPMRVLQDPLIGPECRAMF-EDGEVDDRFLMILFLTVE----------RLRKNSSWKPY 107
++ V + + R + G R ++LFL ++ + ++ W Y
Sbjct: 64 LSREAVGEWAKVDGHLRELLGAAGGKSTRGDIMLFLLMQITIAARHHGMNVGASNPWTEY 123
Query: 108 LDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
+ MLP + P ++++E + L GT+L A + +L++ +D
Sbjct: 124 VRMLPESIPVPTMWSEEERVMLTGTSLETAVSAKCASLISEIED 167
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 20/92 (21%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
E ++P +D NH +A + +E I LL + +E E++ISYG+ +
Sbjct: 212 ESMIPCVDMVNHAAEANSYYERTSDNNIA-------LLLRPDTQLEAESEVTISYGSSKS 264
Query: 314 E-ELLYLYGFV------------IDNNPDDYL 332
E E+L+ YGF+ ID +PDD L
Sbjct: 265 EAEMLFSYGFIDEQGTSKGLTLNIDPSPDDPL 296
>gi|341883062|gb|EGT38997.1| CBN-SET-29 protein [Caenorhabditis brenneri]
Length = 414
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 72/355 (20%), Positives = 137/355 (38%), Gaps = 79/355 (22%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
+E F++W + + G +I + G GI+++ F G ++ +P I V+
Sbjct: 1 MEEFIKWCKARGYKFDGLEITCPPGNCGNGIYATTGFRTGKPIITLPEHDMINSALVVDL 60
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
P + + E ++ ++ +F + E + S+W PYL +LP TF P + D +
Sbjct: 61 PFYKKKLAKINE--KMKPMEILTMFFSFEDFEQ-SAWSPYLKVLPKTFDTPAFKGIDYDV 117
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
++ + QK+ + + +K++ L +L + I W + +
Sbjct: 118 NTLPLSIRKYWIDQKKEISEI-SEKLRHLFPEL--------THDKILWAWHV---VNTRC 165
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
+F +N+E +D+++NS
Sbjct: 166 IFVENEE----HDNVDNS-----------------------------------------D 180
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
G+T+ ++P +D NHD + + G L Y++ +R E +I +
Sbjct: 181 GDTI---AVIPYVDMLNHDPE-----KYQGVALHEKRN-GRYVVQAKRQIQEGE-QIFVC 230
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEA------IHSIPLSDSKALLLEE 356
YG N LL YGF + N ++I P E I I L+ ++LEE
Sbjct: 231 YGAHDNARLLVEYGFTLPQNLGAKVLI--PQEVLLTLAKIAGIKLTREHEMVLEE 283
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ G ++ +P L IT VL+ +GP F + +++FL
Sbjct: 5 GRGVMVRRRLLTGDTIIAIPESLLITTSTVLRS-YLGPVIHD-FLPCRLSPTETLVIFLL 62
Query: 95 VERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQ 141
ER + SS WKPY+D+LP+++ + L +T E+ L T RA +L+
Sbjct: 63 CERNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLR 110
>gi|390596436|gb|EIN05838.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 398
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEV---DGTGLITGVPFS----MYLLSVERSSFHSEKEISISY 308
L+PGID NH A +W + + P + + + V S+ E+ +Y
Sbjct: 192 LLPGIDALNHARGAPVSWVIKSRSAQSQVNAAPDTGSSDLSISLVLHSATPKGHELFNNY 251
Query: 309 GNKGNEELLYLYGFVIDNNPDDYLMI 334
G K N EL+ YGF + +NPDD +++
Sbjct: 252 GPKPNSELILGYGFSLPSNPDDTIVL 277
>gi|154294851|ref|XP_001547864.1| hypothetical protein BC1G_13548 [Botryotinia fuckeliana B05.10]
Length = 372
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 102/282 (36%), Gaps = 79/282 (28%)
Query: 84 DDRFLMILFLTVER------------LRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKG 131
D + ++L +T+ R + N+ W Y+ LP P +T+DE L G
Sbjct: 23 DAKLFLLLQMTIARNNTNEQPEIALNIGVNNPWSEYIRFLPEKIPVPTMWTEDEKTLLNG 82
Query: 132 TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQ 191
T+L + LV K++ L D E + + IP HS +
Sbjct: 83 TSLENS------------------LVNKMVALSRDFENLRE----KTIEIPWCHSCWW-- 118
Query: 192 NQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETL 251
K D LS+ RV++ + L GE
Sbjct: 119 ------KEDGPLKPLLLSD------------------WIRVDAWYR--SRCLEIETGE-- 150
Query: 252 WIEGLVPGIDFCNHDLKAAATWE--VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYG 309
E +VP +D NH A WE +G ++ VP + EI+ISYG
Sbjct: 151 --EVMVPVLDMVNHSFTPNAHWEHTSNGNAILVLVPDIL---------LDEGTEITISYG 199
Query: 310 NKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
K + E L+ YGF+ P L++ E I + PL +K
Sbjct: 200 VKSDAENLFNYGFIDAEVPLTSLILE--VEPIATDPLRVAKV 239
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W V F V S +EIS +YG + NE+
Sbjct: 212 LLPCIDLMNHRPLAKVEWRAGKQD----VAF------VVLEDVASGQEISNNYGPRNNEQ 261
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + +NP DY ++
Sbjct: 262 LMMNYGFCLPDNPCDYRIV 280
>gi|145537195|ref|XP_001454314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422069|emb|CAK86917.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF--SDGVLLVVPLDLAITPMRVLQDPLI 70
+QWL+ K E+ K++ +S+G+ +++F +L +P ++ V + LI
Sbjct: 44 IQWLKDGKAEV--SKVQIEVKSEGYRTLRASQFIRQGEWVLFIPRTHYLSLEEVKKSCLI 101
Query: 71 GPECRAMFEDGEVDDRF--LMILFLTVERLRKNSSWKPYLDMLPTTF-GNPLWFTDDELL 127
R M + + + + L E R+NS WKPY+D+LP G P +F ++
Sbjct: 102 N---RKMIQLNYIPNNIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTYFDAEQDA 158
Query: 128 ELKGT-TLYRATELQKQNLLTLYDDKVKDLVKKL 160
LKG+ TL+ T + ++ + D +K+ VK+
Sbjct: 159 LLKGSPTLF--TVMNQRKIFREEYDNLKEAVKEF 190
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 97/217 (44%), Gaps = 15/217 (6%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPM 62
+T + +E F +WL + G K + +G G+ ++ + G V+ VP L +
Sbjct: 50 ATVSAMEDFRRWLASHGAG-DGGKAIPAAVPEGLGLVAARDLPRGEVVAEVPKKLWMDAD 108
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWF 121
V + RA G + + L L E R +S W PYL +LP + +++
Sbjct: 109 AVAASDI----GRACGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTIFW 164
Query: 122 TDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+++ELLE++GT L T K+ + + +D +++ L S + W
Sbjct: 165 SEEELLEIQGTQLLSTTVGVKEYVQSEFDSVQAEIISTNKDLFPGSITFDDFLWA----F 220
Query: 182 PLPHSYVFPQNQED----LNKYDSINNSAELSNDHNS 214
+ S VFP+ + D + D +N+S ++++ +S
Sbjct: 221 GMLRSRVFPELRGDKLALIPFADLVNHSPNITSEGSS 257
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 9/140 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
L FL WL+ + S+ S G +F+S G +L VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP--TTFGNPLWFTDDE 125
+ PE + + + L + L + L ++S W PY+ LP N +++ + E
Sbjct: 90 --LPPEIKPFISEDVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNESE 147
Query: 126 LLELKGTTLYRATELQKQNL 145
L ++ +++Y+ T QK +
Sbjct: 148 LEMIRQSSVYQETIYQKSQI 167
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 14/134 (10%)
Query: 12 FLQWL--QVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDP 68
F+ WL + + +L +I S ES G G+F+S G +L + LDL I P +
Sbjct: 42 FMSWLASRNDDGDLSAVRIGTSQES-GRGLFASRPVRAGERVLEISLDLMIAPSDL---- 96
Query: 69 LIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDML--PTTFGNPLWFTDDE 125
P+ +M V + L + +ER + SS W PY+ L P N + D E
Sbjct: 97 ---PDELSMVLPSTVKPWTKLALIVLMERYKGQSSVWAPYISCLPQPAELDNTFLWEDTE 153
Query: 126 LLELKGTTLYRATE 139
L LK + LY T
Sbjct: 154 LSYLKASPLYGKTR 167
>gi|294655366|ref|XP_002770117.1| DEHA2B12870p [Debaryomyces hansenii CBS767]
gi|199429902|emb|CAR65486.1| DEHA2B12870p [Debaryomyces hansenii CBS767]
Length = 600
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 136/334 (40%), Gaps = 53/334 (15%)
Query: 12 FLQWLQVNKVELRG-CKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITP-MRVLQDP 68
FL W + N L + +YS + KG+ ++FS G L+ VP + I P ++ P
Sbjct: 9 FLSWARENGAFLDSRIEFQYSKQ-KGYSALI-HDFSSGEELIKVPKQMVIGPHLKEQYLP 66
Query: 69 LIG-PECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
I + F + E+ L+I L + ++ +K Y +++P F P ++ +E+
Sbjct: 67 GINYAHSDSYFTNNEM--TILLISKLAFDTSLNDNRFKQYFEIVPKDFNIPYFWNSNEID 124
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSY 187
+ GT L + L+T + + ++ L++ D E+
Sbjct: 125 LVNGTDLELIFKTNFSKLVTEWHELMEQLIQ-----DSSEEKFKQ--------------- 164
Query: 188 VFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ 247
Q +D+ YD +N A +S N GEL LN +NE+ + S LTS
Sbjct: 165 ---QILQDMKFYD--DNIAGIS--LNYHGELYEYLN-CENESWTSFRNYLWSYSILTSRG 216
Query: 248 GETLWIEG-----------LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERS 296
+ ++ LVP ID NH W + V F + E
Sbjct: 217 LPYILLKNEITRDDIQKAILVPIIDLLNHKNDYKVKWNGLKESKGSHVSFQSF---EEGD 273
Query: 297 SFHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
F E+ +YGNK N +LL YGFV +NN D
Sbjct: 274 KF---GELYNNYGNKSNLQLLLGYGFVSENNKYD 304
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 42.0 bits (97), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 9/120 (7%)
Query: 35 KGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFL 93
+G G+ + +F +G V L +P + +T + V+ P++ D + L+L
Sbjct: 107 EGVGLAAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGR------GDVIGLTLWL 160
Query: 94 TVER-LRKNSSWKPYLDMLPTTFGNP-LWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
ER L + S W PYL P+T +P LW +++ LKG+ + + L Y+D
Sbjct: 161 MYERSLGEKSVWYPYLQTFPSTTLSPILWTAEEQQKLLKGSPALEEVQQRSAALEGEYED 220
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 42.0 bits (97), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 49/115 (42%), Gaps = 24/115 (20%)
Query: 256 LVPGIDFCNHDLKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L+P ID NH + W D GL+ S + +E+ +YG + N
Sbjct: 230 LLPLIDATNHKPLSKVEWRAGTDSIGLVV------------MSDVAAGEEVGNNYGPRNN 277
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKS 368
E+L+ YGF I +NP +Y ++ A PL L + KAQ PKS
Sbjct: 278 EQLMMNYGFCIPDNPCEYRVVSLRA------PLDSP----LAQIKAQYEQHFPKS 322
>gi|408397548|gb|EKJ76689.1| hypothetical protein FPSE_03100 [Fusarium pseudograminearum CS3096]
Length = 467
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 9/95 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+VPG+D NH A ++ D + LL + + +E++ISYG+K E
Sbjct: 229 MVPGLDMANHSHHPTAYYDEDDKD-------DVVLLVRPGTKVSAGEEVNISYGDKNPAE 281
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
+L+ YGF+ +N ++ P + + PL +K
Sbjct: 282 MLFSYGFI--DNESTVEGLNLPVKVLPDDPLGKAK 314
>gi|407404792|gb|EKF30119.1| hypothetical protein MOQ_006074 [Trypanosoma cruzi marinkellei]
Length = 337
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 17/122 (13%)
Query: 6 EAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGV-LLVVPLD-------- 56
E + E F+++L NK + G ++ S E + G+ ++ +F +G +L VP+
Sbjct: 28 ERRHETFMRFLTANKAKFDGVDVRPSAE-QSRGLIATRKFREGTTVLSVPMHTFAISAER 86
Query: 57 -LAITPMRVLQDPLIGPECRAMFEDGEVDDRFL-----MILFLTVERLRKNSSWKPYLDM 110
LA +R L P + E R+ + D L + L + ERL NS + PY D
Sbjct: 87 LLAGEHLRSLHPPTLD-EVRSFLTACSIKDPVLCEQVFLALLVAGERLDPNSFFTPYFDS 145
Query: 111 LP 112
LP
Sbjct: 146 LP 147
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 87 FLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL----YRATELQ 141
F ++ L +E+ R + S+WKPY+D+LP + L++T ++ +L+ T + R L
Sbjct: 193 FKLVFALIIEKARGDQSAWKPYIDVLPAKYNTVLYYTPAQMQKLRRTAVCSVAVRQCRLI 252
Query: 142 KQNLLTLY 149
+ T+Y
Sbjct: 253 ARQYATMY 260
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 24/146 (16%)
Query: 235 QVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVE 294
QV GA+ T E + L+P D NH G+G IT F V
Sbjct: 784 QVKGASGEENVTDDEP--VSALIPFWDMANH-----------GSGRITTF-FDSTAGEVS 829
Query: 295 ---RSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
+++ + ++ I YG++ N E L GFV +N +DY+ I PL++ +A
Sbjct: 830 CNAQAACSAGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTD--PLAEKRA 887
Query: 352 LLLE----EQKAQLRCLLPKSLLEHG 373
++L E+KA LR +LP + HG
Sbjct: 888 IILAALGIERKATLR-VLPAPVYIHG 912
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 58/134 (43%), Gaps = 20/134 (14%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAIT------P 61
F+ W K++ D E+ G G + + SD L +P L +T P
Sbjct: 20 FVNWFTAANGTRLSPKVQLKDLRHENAGRGAVAIADIASDEELFAIPRSLVLTTATSSIP 79
Query: 62 MRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLW 120
VL++ + + G +I+ + E LR SS W PY +LPTTF ++
Sbjct: 80 RSVLKE---------LEDKGATGAWPPLIVTIIYEYLRGESSPWHPYFKILPTTFNTLMF 130
Query: 121 FTDDELLELKGTTL 134
+ D EL EL+ + +
Sbjct: 131 WNDAELAELQASAV 144
>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
Length = 475
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 116/321 (36%), Gaps = 71/321 (22%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
+WL+ K E G + G G+ S +G V++ +P +T V++ +GP
Sbjct: 72 KWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSS-VGP 130
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKG 131
+ V + FL ER + S WK YLD+LP ++ P+ + E+++L
Sbjct: 131 YIKKW--KPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLP 187
Query: 132 TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQ 191
L E Q+ + L+ +D L L +S +SIF A
Sbjct: 188 GPLRAKAEEQRARVQDLFASS-RDFFSTLQPLFAES--VDSIFSYHAF------------ 232
Query: 192 NQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETL 251
L + ++N A +K+ Q S++ +T
Sbjct: 233 ----LWAWCTVNTRAVY----------------LKSRRQEC-----------LSSEPDTC 261
Query: 252 WIEGLVPGIDFCNH----DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
L P +D NH +KAA + + T S +E I
Sbjct: 262 ---ALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTA------------SRCRKHQEAFIC 306
Query: 308 YGNKGNEELLYLYGFVIDNNP 328
YG N+ LL YGFV NP
Sbjct: 307 YGPHDNQRLLLEYGFVAFGNP 327
>gi|198432683|ref|XP_002129380.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 239
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 7 AKLEPFLQWLQVNKVEL-RGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLD-------L 57
A ++ FLQWL K L + KI + FG+ +S + S+G LL VP D
Sbjct: 26 ACIDAFLQWLAEEKFALSKKIKISERNSCHRFGLSASEDISNGELLFSVPRDSLLWEKNC 85
Query: 58 AITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP--TTF 115
+I+ D L G G V +IL L E + +S WKPY+D LP +
Sbjct: 86 SISKRLNACDNLNG-------GSGWVK----IILCLMYEHTKPSSRWKPYMDFLPPVSCM 134
Query: 116 GNPL-WFTDDELLELKGTTL 134
PL W D L+G+ L
Sbjct: 135 DQPLHWDPDIREKMLEGSGL 154
>gi|24640264|ref|NP_727144.1| CG32732 [Drosophila melanogaster]
gi|22831862|gb|AAF46222.2| CG32732 [Drosophila melanogaster]
gi|28316927|gb|AAO39485.1| RE55639p [Drosophila melanogaster]
gi|220957744|gb|ACL91415.1| CG32732-PA [synthetic construct]
Length = 537
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 59/129 (45%), Gaps = 11/129 (8%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVL 65
AK+E F W + V G +I + G+ ++ + D ++L VP L ++
Sbjct: 116 AKVEAFSAWAKDGGVHSEGLEIAIFPGYQ-LGLRATRPLAKDELVLSVPRKLILSEENNS 174
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDD 124
L G +A + D L +E++R S W+PY+D+LP + L+FT
Sbjct: 175 DCRLFGKMTQATHLNLAYD--------LVIEKIRGEFSEWRPYIDVLPAKYNTVLYFTTK 226
Query: 125 ELLELKGTT 133
++ L+GT
Sbjct: 227 QMELLRGTA 235
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W V F V S +EIS +YG + NE+
Sbjct: 242 LLPCIDLMNHRPLAKVEWRAGKQD----VAF------VVLEDVGSGQEISNNYGPRNNEQ 291
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + +NP DY ++
Sbjct: 292 LMMNYGFCLPDNPCDYRIV 310
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS-FHSEKEISISYGNKGNE 314
L P D NH +T E +G V FS ++ + + + +E+ I YGN N+
Sbjct: 236 LQPVADLFNHTSYDPSTEEKEGNKKTCSVAFSPTAFTITTTRPYAAGEEVYICYGNHSND 295
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
LL YGF+ D N D + I
Sbjct: 296 FLLIEYGFLFDENVWDEVCI 315
>gi|18395523|ref|NP_564222.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|9743350|gb|AAF97974.1|AC000103_24 F21J9.27 [Arabidopsis thaliana]
gi|332192432|gb|AEE30553.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 476
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 52/254 (20%), Positives = 92/254 (36%), Gaps = 69/254 (27%)
Query: 91 LFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
L L ER +S W PY+ LP T+ P++F +++ L+ L + + LL
Sbjct: 117 LRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEQ 176
Query: 151 DKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN 210
+ + L ED+ ++
Sbjct: 177 EIRRTL-------------------------------------EDVK-----------AS 188
Query: 211 DHNSRGELING------LNDIKNEAQRV--NSQVNGATSTLTSTQGETLWIEGLVPGIDF 262
DH G+ +N ++ + A R+ N ++ G +S + ++P ID
Sbjct: 189 DHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDD---------VPMMLPLIDM 239
Query: 263 CNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGF 322
CNH K A + G + + V + + ++YG N+ L YGF
Sbjct: 240 CNHSFKPNARIIQEQNGADSNT----LVKVVAETEVKENDPLLLNYGCLSNDFFLLDYGF 295
Query: 323 VIDNNPDDYLMIHY 336
VI++NP D + + Y
Sbjct: 296 VIESNPYDTIELKY 309
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 41.6 bits (96), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 16/101 (15%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN 310
I L+P ID NH + W D GL+ S + E+ +YG
Sbjct: 258 ISVLLPLIDATNHKPLSKVEWRAGTDSIGLVV------------MSDLRAGDEVGNNYGP 305
Query: 311 KGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
+ NE+L+ YGF I +NP +Y ++ A PL+ KA
Sbjct: 306 RNNEQLMMNYGFCIPDNPCEYRVVSL--RAPPDSPLAQIKA 344
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 94/254 (37%), Gaps = 69/254 (27%)
Query: 91 LFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
L L ER +S W PY+ LP TF P++F +++ L+ L LY
Sbjct: 112 LRLLQERANVDSFWWPYISNLPETFTVPIFFPGEDIKNLQYAPL-------------LY- 157
Query: 151 DKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSN 210
+V + LL + + R ED+ ++
Sbjct: 158 -QVNKRCRFLLEFEQEIRR----------------------TLEDVK-----------AS 183
Query: 211 DHNSRGELING------LNDIKNEAQRV--NSQVNGATSTLTSTQGETLWIEGLVPGIDF 262
DH G+ +N ++ + A R+ N ++ G +S + ++P ID
Sbjct: 184 DHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDD---------VPMMLPLIDM 234
Query: 263 CNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGF 322
CNH K V G + + V + + ++YG N+ L YGF
Sbjct: 235 CNHSFKP----NVKIIQEQNGAESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGF 290
Query: 323 VIDNNPDDYLMIHY 336
VI++NP D + + Y
Sbjct: 291 VIESNPYDTIELKY 304
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 63/135 (46%), Gaps = 21/135 (15%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
LVP D NH AA +++G + L+ +RS E ++ SYG + + E
Sbjct: 258 LVPWADMLNHKPGCAAFIDLNGDAVN---------LTTDRSYVKGE-QVWASYGQRPSSE 307
Query: 316 LLYLYGFV--IDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEE------QKAQLRC-LLP 366
LL YGF + NPDD + + + PL+D+KA +L + + LR P
Sbjct: 308 LLISYGFAPEVGENPDDEYALTLGVDV--NDPLADAKAQVLRDMGLSPVETFPLRLNGYP 365
Query: 367 KSLLEHGFFAAGHPK 381
+ LL++ F +P+
Sbjct: 366 RQLLQYASFILCNPE 380
>gi|378726996|gb|EHY53455.1| hypothetical protein HMPREF1120_01647 [Exophiala dermatitidis
NIH/UT8656]
Length = 488
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 91/207 (43%), Gaps = 34/207 (16%)
Query: 9 LEPFLQWLQVNKVELRGCKIK---YSDES--KGFGIFSSNEFSDG-VLLVVPLDLAITPM 62
LE W N V L I+ ++D+ KG GI +++ +G L+ VPLDL ++
Sbjct: 8 LENLQAWAHFNNVHLFNAAIEAHVFTDDGIDKGGGIRATSRHEEGEPLVAVPLDLVLSKE 67
Query: 63 RVLQDPLIGPECRAMFEDGEV---DDRFLMILFLTVER-----------LRKNSSWKPYL 108
RV Q + + E + R ++LFL + L+ + + Y+
Sbjct: 68 RVAQCAKSDQRLKELIEAAASLFQNPRTAVLLFLVYQMTINSPDNHDAGLKLKNPFADYV 127
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLL------- 161
LP P ++T +E L GTTL A + QK L D++K+ + +
Sbjct: 128 QCLPKDVPLPTFYTPEERELLTGTTLAEALD-QKLVSLEREFDRLKEATQTIPWCQRVWW 186
Query: 162 -----VLDGDSER-ANSIFWTRALNIP 182
+LD D + A++++ +RA+ +P
Sbjct: 187 DEQTGLLDFDDWKLADALYRSRAMELP 213
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 48/85 (56%), Gaps = 9/85 (10%)
Query: 81 GEVDDR------FLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
G VDD+ ++I+ + E + + S WKPY+D+LP +F P++++ EL EL+ +
Sbjct: 531 GIVDDKPQQDPWSILIIIMMFEYFKGDESKWKPYMDVLPASFETPMFWSGAELDELQASA 590
Query: 134 LYRATELQKQNLLTLYDDKVKDLVK 158
T++ K + ++ KV +++
Sbjct: 591 TR--TKVGKADAEEMFHAKVLPVIR 613
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 61/120 (50%), Gaps = 13/120 (10%)
Query: 13 LQWLQVNKVELRGCKIKYSD-ESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPL- 69
++W+ N +G +K ++ + G G+ ++ F G ++ +P L IT VL L
Sbjct: 35 VRWMSRNG--FKGALLKPANFKETGRGLMATKPFQIGDQVISIPEMLLITTQNVLSSYLG 92
Query: 70 --IGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDEL 126
I + R +V + +L ER R K+S W Y+ +LP ++ NP++FT++E+
Sbjct: 93 DFIKQQTRPKLSPMQV-----ICTYLICERSRQKDSFWYNYIKVLPKSYSNPVYFTNEEI 147
>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
Length = 439
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 116/321 (36%), Gaps = 71/321 (22%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
+WL+ K E G + G G+ S +G V++ +P +T V++ +GP
Sbjct: 36 KWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSS-VGP 94
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKG 131
+ V + FL ER + S WK YLD+LP ++ P+ + E+++L
Sbjct: 95 YIKKW--KPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLP 151
Query: 132 TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQ 191
L E Q+ + L+ +D L L +S +SIF A
Sbjct: 152 GPLRAKAEEQRARVQDLFASS-RDFFSTLQPLFAES--VDSIFSYHAF------------ 196
Query: 192 NQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETL 251
L + ++N A +K+ Q S++ +T
Sbjct: 197 ----LWAWCTVNTRAVY----------------LKSRRQEC-----------LSSEPDTC 225
Query: 252 WIEGLVPGIDFCNH----DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
L P +D NH +KAA + + T S +E I
Sbjct: 226 ---ALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTA------------SRCRKHQEAFIC 270
Query: 308 YGNKGNEELLYLYGFVIDNNP 328
YG N+ LL YGFV NP
Sbjct: 271 YGPHDNQRLLLEYGFVAFGNP 291
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 45/117 (38%), Gaps = 33/117 (28%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEI----------- 304
L+P ID NH+ PFS +S + KEI
Sbjct: 253 LIPYIDLLNHN------------------PFSSSFISYRKIPLSDSKEIVVYSDKNCNKF 294
Query: 305 ---SISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQK 358
ISYG K N ELL LYGF+ + NP D ++I I + K+ L +K
Sbjct: 295 DQLYISYGQKSNLELLNLYGFIAERNPYDSVIIRISMSP-KDIFFKEKKSFLFSNKK 350
>gi|401837357|gb|EJT41296.1| EFM1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 549
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 9/132 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFS---DGVLLVVPLDLAITPMRV 64
L+ L+W Q N + KI + + G F + FS D L+ VP L IT +
Sbjct: 7 LQNCLEWAQNNGAFIDP-KISFKITQEAGVSAFVNENFSPRPDQALVRVPEVLLITSQQA 65
Query: 65 LQD-PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
L + P E + V +L L + + S +KPYLD+LP P ++
Sbjct: 66 LSEFPQAVSEKNLL---SSVTQLYLSKLKFGPDAVHLKSFYKPYLDVLPLNLPQPYFWCT 122
Query: 124 DELLELKGTTLY 135
DE++ L GT +Y
Sbjct: 123 DEVVNLHGTDVY 134
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 47/98 (47%), Gaps = 10/98 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +DF NH W ++ + + + ++F +++E+ +YGN NE+
Sbjct: 236 LYPIVDFLNHKNDVPVKWVMNEDNELCFMS--------QSATFSAQEELFNNYGNISNEK 287
Query: 316 LLYLYGFVIDNNPDDY--LMIHYPAEAIHSIPLSDSKA 351
L YGF +N D+ L + P+ SIP+ +K+
Sbjct: 288 CLLNYGFWDSSNKYDFSRLTLKLPSALPKSIPVDFNKS 325
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 57/122 (46%), Gaps = 3/122 (2%)
Query: 39 IFSSNEFSDGVLLV---VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTV 95
+F+SN L+ PL + T +R L++P+I R D + + + L
Sbjct: 200 LFASNPIEAATLVATVPAPLVMFETYLRTLENPMILAIDRRFKTMSVPDPSYALAMALLY 259
Query: 96 ERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKD 155
E S W+ ++ LP T + ++++ +E L+ L R T++ +++L LY+
Sbjct: 260 ESYEPKSMWREWISSLPQTLDSTVFWSAEEQDALQSLPLKRKTQILERHLQQLYNATTPR 319
Query: 156 LV 157
L+
Sbjct: 320 LL 321
>gi|116200550|ref|XP_001226087.1| hypothetical protein CHGG_10820 [Chaetomium globosum CBS 148.51]
gi|88175534|gb|EAQ83002.1| hypothetical protein CHGG_10820 [Chaetomium globosum CBS 148.51]
Length = 461
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 88 LMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
L+IL + E + ++S WKPYLD+LP F P++++ EL +L+ + L
Sbjct: 113 LLILAMIYEHQQGDASRWKPYLDVLPPAFDTPMFWSPAELSQLQASAL 160
>gi|302845036|ref|XP_002954057.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
nagariensis]
gi|300260556|gb|EFJ44774.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
nagariensis]
Length = 598
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 64/162 (39%), Gaps = 24/162 (14%)
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTS 245
S +P E L+ D + A S +RG L G + + V+S+ + +
Sbjct: 136 SLRYPPALEALHAADWLRADAFQSTSDEARGGL--GEGEFRWALSVVHSR-----TFANA 188
Query: 246 TQGETLWIEGLVPGIDFCNHDLKAAATWEVD----GTGLITGVPFSMYLLSVERSS---- 297
G + + LVP +D NH AA + G G + LL +RSS
Sbjct: 189 APGGGVGVRMLVPLVDMLNHGGDTAAQGSLGLVGPGGGEVATDNVRWDLLPPDRSSAGGW 248
Query: 298 ---------FHSEKEISISYGNKGNEELLYLYGFVIDNNPDD 330
H +E+ +SYG + N++ YGFV NP D
Sbjct: 249 SMAVSATRDIHPGQELLLSYGERPNDDFFLHYGFVPRANPHD 290
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P D NH +A +W +T P + L+ R+S+ +++ +YG K N E
Sbjct: 236 LQPLYDIGNHSPEAEYSWN------LTSEPSACELIC--RNSYEPGQQVFNNYGKKTNSE 287
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGFV +NN DY+ +
Sbjct: 288 LLLGYGFVTENN--DYIHV 304
>gi|15228958|ref|NP_191216.1| SET domain-containing protein [Arabidopsis thaliana]
gi|7594535|emb|CAB88060.1| putative protein [Arabidopsis thaliana]
gi|332646015|gb|AEE79536.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 531
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 65/146 (44%), Gaps = 12/146 (8%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD 67
+L F +W+Q N V+ DE+ G + + + +G ++ I+ L
Sbjct: 5 RLRAFKRWMQANGVDCSEALNLVDDENDGVSVRAFCDLKEGDVVA-----NISKTACLTI 59
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPL-WFTDDE 125
G R M E ++D + + L ER L + S W YL +LP PL W +D
Sbjct: 60 KTSG--AREMIESADLDGSLGLSVALMYERSLGEESPWAGYLQILPIQEDLPLVWSLEDL 117
Query: 126 LLELKGTTLYRATELQKQNLLTLYDD 151
L GT L++ L K++ + +Y+D
Sbjct: 118 DSLLSGTELHK---LVKEDHVLIYED 140
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 73/145 (50%), Gaps = 7/145 (4%)
Query: 10 EPFLQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
+ +QWL+ N + + ++ Y+ E +G + S + G +LV+P IT + + +
Sbjct: 43 QELIQWLEGNGADTKKLALQEYAPEVRG--VHSRKVLAPGERILVIPKKCLIT-VEMGKQ 99
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSW-KPYLDMLPTTFGN-PLWFTDDE 125
IG + A D +++FL + R +S+ + Y LP+T N P++++D+E
Sbjct: 100 TDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLPSTLSNMPIFWSDEE 159
Query: 126 LLELKGTTLYRATELQKQNLLTLYD 150
L LKG+ + + + +K + YD
Sbjct: 160 LGWLKGSYIIQQIQERKAAIRKDYD 184
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 54/252 (21%), Positives = 104/252 (41%), Gaps = 29/252 (11%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
L++ + E L +S W Y+ +P TF P++++++E+ ELKGT + ++ + +
Sbjct: 81 LILCMMWEESLGGSSKWSEYMSSMPDTFTTPMFWSEEEIQELKGTAV--VDKIGRDDAER 138
Query: 148 LYDDKVKDLVKKL--LVLDGDSERANSI--FWTRALNIPLPHSYVFPQNQEDLNKYDSIN 203
Y +K+ +K L L+G + ++ + I +V P +D ++ D
Sbjct: 139 DYYEKLIPAIKSRPDLFLEGSIPKFYALERYHMMGSRILSRSFHVEPWRGDDGHESDESE 198
Query: 204 NSAELSNDHNSRGELINGLNDI------------KNEAQRVNSQVNGATSTLTSTQGETL 251
+ + + G ++ N + E +++S +G T ET
Sbjct: 199 THGDAPHSDDPNGMDVDADNPSGDTTADSIEPPHEEEVSQLDSAFDGNTD--DDEDAETP 256
Query: 252 WIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNK 311
+VP D N A +E + L YL V ++ ++I +YG+
Sbjct: 257 ADVAMVPMADMLN------ARFESENAKLFYE---EHYLKMVATKPINAGEQIWNTYGDP 307
Query: 312 GNEELLYLYGFV 323
N +LL YG V
Sbjct: 308 PNSDLLRRYGHV 319
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 121/323 (37%), Gaps = 52/323 (16%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WLQ + K++ +D G G+ + + S+G L VP + +T
Sbjct: 20 FMSWLQASPGVQLNPKLRLADLRATGAGRGVVAQSNISEGEELFSVPRAMVLTVQNSELR 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDEL 126
L+G + ++ ++L + E L+ + S W PY +LP+ F ++++ EL
Sbjct: 80 TLLGENL-----EEQMGPWLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSPAEL 134
Query: 127 LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHS 186
EL+ +T +V+K+ + NSI A P
Sbjct: 135 QELQAST----------------------IVEKIGRSGAEESIRNSIAPILAKR---PDL 169
Query: 187 YVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLN-DIKNEAQRVNSQVNGATSTLTS 245
+ PQ L ++ A L + G LI DI+ + S +T
Sbjct: 170 FPPPQG---LASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGEANDESYMTD 226
Query: 246 TQGETLWIEGLVPGIDFCNHDLKA--AATWEVDGTGLITGVPFSMYLLSVERSSFHSEKE 303
+ E +G+VP D N D A ++ +G ++ + +E
Sbjct: 227 DEEEEQLPKGMVPLADLLNADADRNNARLYQEEGALVMKAI-----------KPIQQGEE 275
Query: 304 ISISYGNKGNEELLYLYGFVIDN 326
I YG +LL YG+V DN
Sbjct: 276 IFNDYGEIPRADLLRRYGYVTDN 298
>gi|347835472|emb|CCD50044.1| hypothetical protein [Botryotinia fuckeliana]
Length = 562
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 104/282 (36%), Gaps = 79/282 (28%)
Query: 84 DDRFLMILFLTVER------------LRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKG 131
D + ++L +T+ R + N+ W Y+ LP P +T+DE L G
Sbjct: 165 DAKLFLLLQMTIARNNTNEQPEIALNIGVNNPWSEYIRFLPEKIPVPTMWTEDEKTLLNG 224
Query: 132 TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQ 191
T+L + LV K++ L D E + + IP HS + +
Sbjct: 225 TSLENS------------------LVNKMVALSRDFENLRE----KTIEIPWCHSCWWKE 262
Query: 192 NQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETL 251
+ G L L ++ RV++ + L GE
Sbjct: 263 D-----------------------GPLKPLL---LSDWIRVDAWYR--SRCLEIETGE-- 292
Query: 252 WIEGLVPGIDFCNHDLKAAATWE--VDGTGLITGVPFSMYLLSVERSSFHSEKEISISYG 309
E +VP +D NH A WE +G ++ VP + EI+ISYG
Sbjct: 293 --EVMVPVLDMVNHSFTPNAHWEHTSNGNAILVLVPDIL---------LDEGTEITISYG 341
Query: 310 NKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKA 351
K + E L+ YGF+ P L++ E I + PL +K
Sbjct: 342 VKSDAENLFNYGFIDAEVPLTSLILE--VEPIATDPLRVAKV 381
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+ FL ER +S W PY+D+LP T+ PL+F D+ ++EL +L + QK+
Sbjct: 111 LCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDN-VIELLPRSLQKKATQQKEQFQE 169
Query: 148 LY 149
L+
Sbjct: 170 LF 171
>gi|14596097|gb|AAK68776.1| Unknown protein [Arabidopsis thaliana]
gi|18377562|gb|AAL66947.1| unknown protein [Arabidopsis thaliana]
Length = 364
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 77/198 (38%), Gaps = 39/198 (19%)
Query: 168 ERAN--SIFWTRALNIPLPHSYVFPQ--NQEDLNKYDSINNSAELSNDHNSRG------- 216
ERAN S +W N LP +Y P ED+ A L + N R
Sbjct: 10 ERANADSFWWPYISN--LPETYTVPIFFPGEDIKNL----QYAPLLHQVNKRCRFLLEFE 63
Query: 217 -ELINGLNDIKNE-----AQRVNSQVNGATSTLTSTQ------------GETLWIEGLVP 258
E+ L D+K Q VN+ G T + ST+ G + + ++P
Sbjct: 64 QEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAFRLHGNKKLQGGSSDDVPMMLP 123
Query: 259 GIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLY 318
ID CNH K A + G + + V + + ++YG N+ L
Sbjct: 124 LIDMCNHSFKPNARIIQEQNGADSNTLVKV----VAETEVKENDPLLLNYGCLSNDFFLL 179
Query: 319 LYGFVIDNNPDDYLMIHY 336
YGFVI++NP D + + Y
Sbjct: 180 DYGFVIESNPYDTIELKY 197
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 69/145 (47%), Gaps = 8/145 (5%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPM 62
S+ KL L+WL+ ++ ++ + + + S G +L VP +T +
Sbjct: 13 SSNQKLSNLLRWLEEGGARFPKLQLVRREDGER-AVLAQAPISAGETVLQVPRTHMLT-L 70
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRFLMIL-FLTVERLRKNSSWKPYLDMLPTTFGN-PLW 120
+ ++ IG RA+ E + D+ L + FL E+ R+ S WKPY+D LP ++ PL+
Sbjct: 71 ELARESDIG---RAIAEGLDPDNEDLYLASFLLQEKHREGSFWKPYIDSLPESYSQMPLF 127
Query: 121 FTDDELLELKGTTLYRATELQKQNL 145
+ +E LKG Q Q+L
Sbjct: 128 YGSEEHALLKGCFALTLLTHQAQSL 152
>gi|428177025|gb|EKX45907.1| hypothetical protein GUITHDRAFT_138732 [Guillardia theta CCMP2712]
Length = 505
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 64/139 (46%), Gaps = 20/139 (14%)
Query: 200 DSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQ------GETLWI 253
DS++ ++++ +H++ E++ D + A +T+ S ET
Sbjct: 221 DSMSITSQIEREHHTIKEVLGRFQDCAEFGEFSLESYKWAQATIMSRAFDLDEGQETARR 280
Query: 254 EG-----LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISY 308
+G LVP D NH A+ + + D G + L + E ++ + +E+ I+Y
Sbjct: 281 QGEQNLLLVPLCDMVNHSPDASFSIDCDAAGNVN-------LFASE--NYKAGQEVHINY 331
Query: 309 GNKGNEELLYLYGFVIDNN 327
G+ NE+LL +GFV++
Sbjct: 332 GSSSNEQLLLSFGFVLEGG 350
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
+ FL ER +S W PY+D+LP T+ PL+F D+ ++EL +L + QK+
Sbjct: 111 LCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDN-VIELLPRSLQKKATQQKEQFQE 169
Query: 148 LY 149
L+
Sbjct: 170 LF 171
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 59/128 (46%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WL KI+ +D +S G G+ + + +G L +P DL ++
Sbjct: 20 FITWLAQRPGVRISPKIRIADLRSQSAGRGVVAQSAIVEGEELFSIPRDLVLSTENSKLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERL-RKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ + GE+ ++L + E L R+ S+W PY + P F ++++ EL
Sbjct: 80 SLLSQDL------GELGPWLSLMLVMIYEYLLREQSAWAPYYRIFPENFDTLMFWSPAEL 133
Query: 127 LELKGTTL 134
EL+G+ +
Sbjct: 134 QELQGSAI 141
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAI 59
E S E + W Q N G I + ++G+G+ ++ + ++ + L VP + +
Sbjct: 71 FEGSREDSFADLMSWAQENGASCDGFTIT-NFGTEGYGLRTTRDIKAEELFLWVPRKMLM 129
Query: 60 TPMRVLQDPLIGP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFG 116
T + Q+ ++GP + R + G V + L L ER S W PY+ LP +
Sbjct: 130 T-VESAQNSVLGPIYSQDRILQAMGNVT----LALHLLCERGDPASFWSPYIRSLPQEYD 184
Query: 117 NPLWFTDDELLELKGTTLYRATELQKQNLLTLY 149
PL++ +++ L GT + Q +N Y
Sbjct: 185 TPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY 217
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 65/137 (47%), Gaps = 32/137 (23%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAIT------P 61
FL WL+ N KI+ +D + G G+ ++++ SD + +P +T P
Sbjct: 13 FLTWLK-NTGATISPKIQLADLRDRAAGRGVVATSDLTSDEEIFRIPRTSILTTETTDLP 71
Query: 62 MRVLQ---DPLIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGN 117
+LQ DP + +IL + E L NS +KPYLD+LP +F
Sbjct: 72 QEILQQLTDPWLS-----------------LILAMIFEYLLGTNSRFKPYLDILPESFNT 114
Query: 118 PLWFTDDELLELKGTTL 134
+++TD+EL L+G+ +
Sbjct: 115 LMFWTDNELQYLQGSAI 131
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 58/129 (44%), Gaps = 11/129 (8%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVL 65
AK+E F W + V G +I + G+ ++ + D ++L VP L +
Sbjct: 26 AKVEAFSAWAKDGGVHSEGLEIAIFPGYQ-LGLRATRPLAKDELVLSVPRKLIFSEESNS 84
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDD 124
L G +A + D L +E++R S W+PY+D+LP + L+FT
Sbjct: 85 DCRLFGKMTQATHLNLAYD--------LVIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTK 136
Query: 125 ELLELKGTT 133
++ L+GT
Sbjct: 137 QMELLRGTA 145
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 14/116 (12%)
Query: 248 GETLWIEGLVPGIDFCNH-DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISI 306
G+ L LVP D NH +++ ++V G G + L + + E+
Sbjct: 198 GKRLKSSALVPFADCLNHGNVQTKYDFDVGGNG-------TFRLFPSGNNRYPRNSEVLN 250
Query: 307 SYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLR 362
SYG + N+ LL YGF + +N D AE I S+P S ++ L +KA LR
Sbjct: 251 SYGRRANDNLLLDYGFAMLDNEWD------AAEVICSLPPSHDQSPLDRRRKACLR 300
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 93 LTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDDELLELKGTT 133
L +E++R S W+PY+D+LP + L+FT ++ L+GT
Sbjct: 198 LLIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQMERLRGTA 239
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 4/100 (4%)
Query: 83 VDDRFLMILFLTVERLR-KNSSWKPYLDMLP--TTFGNPLWFTDDELLELKGTTLYRATE 139
V ++ F+ E+L+ ++S W PYL LP + F + L++ D++L L+GT LY+ +
Sbjct: 102 VRKSIILAFFMVHEQLKGRDSHWWPYLATLPRASEFTSALFYQDNDLEWLQGTNLYQTHQ 161
Query: 140 LQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRAL 179
+ + YD + +++ L +S R + W L
Sbjct: 162 AYRNAVQEEYDSAIS-ILRDEGFLAVESYRWDIFCWAYTL 200
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 62/129 (48%), Gaps = 13/129 (10%)
Query: 12 FLQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPL 69
F +W N + G ++ D+ GF + + G V L VPL + +T + + L
Sbjct: 8 FFKWCLDNGATINGITLQALPDDEYGFA--AEQDIQVGPVFLGVPLGMMMTTIGARKSKL 65
Query: 70 IGPECRAMFEDGEVD---DRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDE 125
A+ +D + + + +FL +E ++S W PY+ +LP +F L+F+ DE
Sbjct: 66 -----GALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPRSFNTVLYFSVDE 120
Query: 126 LLELKGTTL 134
L L G+++
Sbjct: 121 LQLLTGSSV 129
>gi|327280314|ref|XP_003224897.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 225
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 62/125 (49%), Gaps = 12/125 (9%)
Query: 13 LQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVLQDPLI 70
++W + N G ++ + DE GFG+ ++ E + + L VP + +T + ++ ++
Sbjct: 83 IKWARENGAYTEGFEVANFGDE--GFGLRATREIKGEELFLWVPRKMLMT-VESAKNSVL 139
Query: 71 GP---ECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELL 127
G + R + G + + L ER +S W PY+ LP+ + PL+F ++E+
Sbjct: 140 GSLYSQDRILQAMGNIT----LAFHLLCERCNPSSFWLPYIQTLPSEYNTPLYFEEEEVQ 195
Query: 128 ELKGT 132
L+ T
Sbjct: 196 YLQST 200
>gi|397593323|gb|EJK55933.1| hypothetical protein THAOC_24272 [Thalassiosira oceanica]
Length = 567
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 75/358 (20%), Positives = 137/358 (38%), Gaps = 77/358 (21%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
F W + N ++ G ++ ++ G G++ + + ++ VL+ VP ++ I + ++
Sbjct: 95 FTDWAKENNIKYAGVEVSSGGDNSGLGLYVTQDINANDVLIQVPTNMVIQAESPDYNTVM 154
Query: 71 GPECRAMFEDGEVDDRFLM---ILFLTVERLRK---------NSSWKPYLDMLPTTFGNP 118
R +F+ R L L + + K S +P+L+ LP + P
Sbjct: 155 E---REVFDSNPKAYRNLQWWAALSVQLNYYDKINPVNDKAGGISIQPWLNSLPRVYDTP 211
Query: 119 LW-FTDDELLELKGTTLYRATELQK----QNLLTLYDDKVKDLVKKLLVLD------GDS 167
+ + + L EL+ + A LQK + T+ KD + K+ D
Sbjct: 212 AFNWAESSLEELQYRPMIEAVALQKRAWKKEFETVQKAASKDFLSKVSFEDFVWGCETAR 271
Query: 168 ERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKN 227
RA S ++ + P+P++ V + Y ++ G+ I+
Sbjct: 272 SRAFSGAYSGSAFNPIPYATVTVL----VAVYVAL------------------GIGSIEQ 309
Query: 228 EAQRVNSQVNGATSTLTSTQGETLWIEGLV--------PGIDFCNHDLKAAATWEVDGTG 279
A NGA + + + + L+ P ID NH GTG
Sbjct: 310 AA-------NGAALVICGSILKDFVVPKLLKIQKYVICPLIDMANHV----------GTG 352
Query: 280 LITGVPFSMYLLSVERSSFHSEK---EISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
V F + S+ + K E+ ISYG + N++LL YGFV + N D ++
Sbjct: 353 AAGNVSFEYFADGYSLSALSNAKKGSEMFISYGPRSNDQLLQYYGFVEEQNAHDIYIL 410
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 253 IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
+ +VP D NHD +A + D ++ F + V +++ ++ I+YG
Sbjct: 213 FKAMVPVFDMLNHDPEAEMSHFFD----MSTQRFKL----VSHQHWNAGAQMFINYGPLS 264
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLL 354
N +LL LYGFVI NP D + + P + + + + LLL
Sbjct: 265 NHKLLALYGFVIIGNPFDAVEMWLPMDEASTKFFQEKEQLLL 306
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 134/359 (37%), Gaps = 91/359 (25%)
Query: 14 QWLQ-----VNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
QWL + KVEL+ + G G+ S G LL +P LAIT
Sbjct: 43 QWLSQAGLPIQKVELKNVG------AGGRGLVSKRMLYKGDRLLFLPATLAITTESEWAC 96
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL-WFTDDEL 126
G RA + FL ++ L K+S W PY+ LP G+ L W D
Sbjct: 97 AEAGKVIRA---KDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPGSILLWTALDVE 153
Query: 127 LELKGTTLY-RATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L T++ RA + ++ +D DL K+
Sbjct: 154 AHLSATSIKDRALQCVRE-----VEDTFNDLNKQ-------------------------- 182
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTS 245
VF +N+E+ + N L + + G L + L + + Q++
Sbjct: 183 --VFMKNREEFPP-EVFN----LESFKWAFGILFSRLVRLPSLGQKL------------- 222
Query: 246 TQGETLWIEGLVPGIDFCNHDLKAAATWEVD-GTGLITGVPFSMYLLSVERSSFHSEKEI 304
L+P D NHD + + D G+ IT +++R + S KE+
Sbjct: 223 ---------ALIPFGDMLNHDTEVTTFLDFDSGSKSIT--------CTLDRG-YESNKEV 264
Query: 305 SISYGNKGNEELLYLYGFVID-NNPDDYLMIHY---PAEAIHSIPLSDSKALLLEEQKA 359
ISYG + N ELL YGFV N +D + I PA+ ++ L K L Q++
Sbjct: 265 FISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHGLSPQQS 323
>gi|145352367|ref|XP_001420521.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580755|gb|ABO98814.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 442
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 7/102 (6%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P ID NH +++ A + V G + S+E + + E+ ISYG++ N
Sbjct: 240 LAPTIDLANHRVESTAKYGVSADG--KNFELTWNEDSIEGPTPVANTEVFISYGDRMNNA 297
Query: 316 LLYL-YGFVIDNNPDDYLMIHY---PAEAIHSIPLSDS-KAL 352
LL L YGF+ D N ++ L + + A +H ++D+ KAL
Sbjct: 298 LLMLHYGFIDDENRNERLPMEFIAPGARRVHGSRVADACKAL 339
>gi|67900706|ref|XP_680609.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
gi|40742521|gb|EAA61711.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
gi|259483305|tpe|CBF78584.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 441
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 18/82 (21%)
Query: 255 GLVPGIDFCNH--DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
G+VP D+ NH D T++ D Y+ E+ +EI +SYG
Sbjct: 251 GMVPFADYFNHRDDASCEVTFDRDS-----------YIFRAEKG-----EEIYMSYGPHS 294
Query: 313 NEELLYLYGFVIDNNPDDYLMI 334
N+ LL YGF +D+NP D + +
Sbjct: 295 NDFLLVEYGFYLDDNPSDRVYL 316
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 78/189 (41%), Gaps = 18/189 (9%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYS-DESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVL 65
++L QW NK+ KI+ D + GF +F+ G +L AI +L
Sbjct: 2 SRLNILKQWFGENKIAYDEEKIRIEHDTNNGFRVFAKQTLEVGDILC-----AIPKEAIL 56
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNPLWFTDD 124
G + E+ + + +++ L ER L + S W Y+ LP PL++ D
Sbjct: 57 SIKNCG--VADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSLPLRENIPLFWEKD 114
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKK--------LLVLDGDSERANSIFWT 176
+ L GT + E ++L Y + V LVK+ + D D A S+ +
Sbjct: 115 QQACLDGTAVAHLLEPMPKDLKADYKEHVVPLVKENSKVFNAAKMTFD-DFTAATSLVTS 173
Query: 177 RALNIPLPH 185
RA + + H
Sbjct: 174 RAFRVDVYH 182
>gi|389639446|ref|XP_003717356.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
gi|351643175|gb|EHA51037.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
gi|440465360|gb|ELQ34683.1| hypothetical protein OOU_Y34scaffold00748g2 [Magnaporthe oryzae
Y34]
gi|440490993|gb|ELQ70482.1| hypothetical protein OOW_P131scaffold00027g18 [Magnaporthe oryzae
P131]
Length = 500
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSS--FHSEKEISISYGNKGN 313
L+P D NHD ATW+ + VP LL RS + ++I +YG K N
Sbjct: 241 LMPLFDIANHDPLVQATWDSES------VPGECRLLVNGRSGQGYRPGEQIFNNYGLKTN 294
Query: 314 EELLYLYGFVIDNNPD 329
ELL YGFV+ + D
Sbjct: 295 SELLVAYGFVLSESDD 310
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 10/112 (8%)
Query: 38 GIFSSNEFSDG-VLLVVPLDLAITPMRV------LQDPLIGPECRAMFEDGEVDDRFL-M 89
G+F+S G +L + LDL I P R+ LQ P + E E+D+ L
Sbjct: 438 GLFASRPVRAGERVLEISLDLMIAPTRLPDQLSTLQSSAWAPYISCLPEPAELDNTVLDY 497
Query: 90 ILFLTVERLRKNSSWKPYLDML--PTTFGNPLWFTDDELLELKGTTLYRATE 139
+F++ + ++S+W PY+ L P N + D EL L+ + LY T
Sbjct: 498 RVFVSQKFQLQSSAWAPYISCLPEPAELDNTFLWEDTELSYLRASPLYGKTR 549
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W + YL+ + ++ +EI+ +YG + NE+
Sbjct: 215 LLPCIDLMNHRPLAKVEWRAGKQDV-------AYLVLEDVAA---GQEIANNYGPRNNEQ 264
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + +NP DY ++
Sbjct: 265 LMMNYGFCLPDNPCDYRIV 283
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 3/73 (4%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPT--TFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
I FL + L+ ++ W PY+ LP + PL++ D+L LKGT+L+ A E + + L
Sbjct: 95 FIFFLIGQYLQGEDGFWFPYIRTLPQPLSLTTPLYYEGDDLGWLKGTSLWPAREQRMELL 154
Query: 146 LTLYDDKVKDLVK 158
Y++ V++L K
Sbjct: 155 KEAYENGVRELRK 167
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 40 FSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGP---ECRAMFEDGEVDDRFLMILFLTVE 96
+ ++ ++ + L VP L +T + ++ ++GP + R + G + + L E
Sbjct: 32 YGRSKKAEELFLWVPRKLLMT-VESAKNSVLGPLYSQDRILQAMGNI----ALAFHLLCE 86
Query: 97 RLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
R NS W+PY+ LP+ + PL+F ++E+ L+ T
Sbjct: 87 RASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQST 122
>gi|357137816|ref|XP_003570495.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 485
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 6/110 (5%)
Query: 229 AQRVNSQVNGATSTLTSTQGETLW--IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
Q VNS G + S++ L I L+P +D CNH A DG P
Sbjct: 215 GQDVNSSSLGWAMSAASSRAFRLHGEIPMLLPLVDMCNHSFSPNARIVQDGD---VDSP- 270
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
M + V + ++++YG N+ L YGFV+ +NP D + + Y
Sbjct: 271 DMSVKVVADTQIDQNATVTLNYGCYPNDFYLLDYGFVVTSNPYDQVELSY 320
>gi|255723423|ref|XP_002546645.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
gi|240130776|gb|EER30339.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
Length = 428
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 10/72 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH T ++DG G S++ + ++ +SYG N+
Sbjct: 236 MAPYVDFMNHSCDDHCTLKIDGKGFQVS----------STCSYNIDDQVYLSYGPHSNDF 285
Query: 316 LLYLYGFVIDNN 327
LL YGF+I N
Sbjct: 286 LLCEYGFIIPGN 297
>gi|406602781|emb|CCH45655.1| Ribosomal N-lysine methyltransferase 1 [Wickerhamomyces ciferrii]
Length = 586
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NHD W + TG + + + EI +YG K NEE
Sbjct: 228 LLPIIDLLNHDNSTNVEWSFEETGD------EGFFTLLNKDPHTKGDEIFNNYGAKSNEE 281
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGF +++N D + +
Sbjct: 282 LLLGYGFTLEDNKHDTIAL 300
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 10/72 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH ++DGTG F +Y S++ ++++ +SYG NE
Sbjct: 283 MAPYVDFLNHSCDDQCGLKIDGTG------FQVY----TTCSYNPDEQLFLSYGPHSNEF 332
Query: 316 LLYLYGFVIDNN 327
LL YGF + N
Sbjct: 333 LLCEYGFTLPEN 344
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 65/145 (44%), Gaps = 21/145 (14%)
Query: 255 GLVPGIDFCNHDLKAAATWEVD-GTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
LVP D NH A ++D G+ + L+ +RS + +++ SYG + +
Sbjct: 271 ALVPWADMLNHQPGCEAFIDLDRGSRKVC--------LTTDRS-YEPGEQVWASYGQRPS 321
Query: 314 EELLYLYGF--VIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQ------LRC-L 364
ELL YGF + +NPDD ++ + P + +K L Q Q LR
Sbjct: 322 SELLISYGFAPAVGDNPDDEYALNLQID--EEDPFASAKVNALASQNIQAFETFPLRLNG 379
Query: 365 LPKSLLEHGFFAAGHPKDGNNDNKL 389
P+ LL++ FA P D + ++L
Sbjct: 380 YPRQLLQYASFAMCTPDDPSKVDEL 404
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ L ER NS W+PY+ LP+ + PL+F ++E+ L+ T
Sbjct: 29 LAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQST 72
>gi|328873307|gb|EGG21674.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 514
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 69/167 (41%), Gaps = 40/167 (23%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDE------SKGFGIFSSNEFSDG-VLLVVPL 55
+S + ++ F +WL N GCK + ++G G+ + + +G + VP
Sbjct: 57 MSEQQVVDSFTKWLSDN-----GCKEAFDKVKIVRGLTEGSGLVALGDIGEGDEFIAVPS 111
Query: 56 DLAITPM--------RVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-----S 102
L +T +V+++PL R++ L LTV +++ S
Sbjct: 112 KLFMTQETAIKSIGEKVIREPLF---------------RYIPSLLLTVHLIQEQLIMPKS 156
Query: 103 SWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLY 149
W PY+ MLP T+ L FT D+ L G+ + +N L Y
Sbjct: 157 FWAPYIRMLPRTYRTILQFTMDDFRALLGSAVLEEAISTYRNTLRQY 203
>gi|453082527|gb|EMF10574.1| hypothetical protein SEPMUDRAFT_150634 [Mycosphaerella populorum
SO2202]
Length = 454
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 12/102 (11%)
Query: 243 LTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEK 302
L+ + + L L P +D NH A W D + +
Sbjct: 38 LSRSSADELDFPVLFPVLDILNHQHAAHVDWSFDPGRFTVSI----------SDAVTQGS 87
Query: 303 EISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIH 342
E+ +YG K N+ELL YGF + +NP+D L++ P E +
Sbjct: 88 EVFNNYGPKSNDELLLGYGFCLSSNPNDQVLLVLKTPPEGLQ 129
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 66/130 (50%), Gaps = 11/130 (8%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQ 66
+ FL WL+ + VE+ KI+ D G G+ ++ + + LL + T + ++
Sbjct: 9 QAFLAWLRRSGVEI-SPKIQLEDLRNAQAGRGVVATQDIPEHELL---FRIPRTAILSVE 64
Query: 67 DPLIGPEC-RAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDD 124
+ ++ E A FE + +IL + E + ++S W PY +LPT F +++++D
Sbjct: 65 NSILSTEIPAATFE--MLGPWLSLILVMLYEYINGDASNWAPYFSVLPTEFNTLMFWSED 122
Query: 125 ELLELKGTTL 134
EL EL+ + +
Sbjct: 123 ELAELQASAV 132
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 62/128 (48%), Gaps = 7/128 (5%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQ 66
+ FL WL+ + E+ KIK D + G G+ +S E ++ LL + T + ++
Sbjct: 9 QAFLAWLRKSGAEI-SPKIKLEDLRNKDAGRGVVASQEIAEHELL---FRIPRTSILSVE 64
Query: 67 DPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
+ ++ E A L+++ L S+W PY +LPT F +++T+DEL
Sbjct: 65 NSILSTEIPAATLSLLGPWLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDEL 124
Query: 127 LELKGTTL 134
EL+ + +
Sbjct: 125 AELQASAV 132
>gi|67538920|ref|XP_663234.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|40743533|gb|EAA62723.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|259484901|tpe|CBF81518.1| TPA: SET domain protein (AFU_orthologue; AFUA_4G11040) [Aspergillus
nidulans FGSC A4]
Length = 707
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W + V F + +E S + +EI+ +YG + NE+
Sbjct: 227 LLPFIDILNHRPLAKVEWRAG----LQNVDFVV----LEDVSVN--EEIANNYGPRNNEQ 276
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + NNP DY +
Sbjct: 277 LMMNYGFCLANNPCDYRTV 295
>gi|50287013|ref|XP_445936.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525242|emb|CAG58855.1| unnamed protein product [Candida glabrata]
Length = 599
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 6/72 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH+ + W G F L+ ++ E+ +YG KGNEE
Sbjct: 234 LLPILDLMNHNYNSKVQWFPKEHQ--NGTSFCYQCLADMKAG----DELDNNYGGKGNEE 287
Query: 316 LLYLYGFVIDNN 327
LL YGFVID+N
Sbjct: 288 LLNGYGFVIDDN 299
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 9/140 (6%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
L FL WL+ + S+ S G +F+S G +L VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSPLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP--TTFGNPLWFTDDE 125
+ PE + + + L + L + L ++S W PY+ LP N +++ + E
Sbjct: 90 --LPPEIKPFISEDVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNESE 147
Query: 126 LLELKGTTLYRATELQKQNL 145
L ++ +++Y+ T QK +
Sbjct: 148 LEMIRQSSVYQETIYQKSQI 167
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 109/301 (36%), Gaps = 82/301 (27%)
Query: 38 GIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVE 96
G+ +S G LL VP L I+ PE + + +V D L+ +L E
Sbjct: 102 GLVASQNLRKGEKLLFVPPSLVIS----ADSEWTNPEAGEVMKRYDVPDWPLLATYLISE 157
Query: 97 -RLRKNSSWKPYLDMLPTTFGNPLWFTDDEL-LELKGTTLYRATELQKQNLLTLYDDKVK 154
L+K+S W Y+ LP + L++T EL + L+ + + + N++ Y+D
Sbjct: 158 ASLQKSSRWYNYISALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS 217
Query: 155 DLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNSAELSNDHN- 213
+ K H ++FP+ E+ ND
Sbjct: 218 RIFSK-------------------------HPHLFPK---------------EVFNDETF 237
Query: 214 --SRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAA 271
S G L + L + + R LVP D NH+ +
Sbjct: 238 KWSFGILFSRLVRLPSMDGRF----------------------ALVPWADMLNHNCEVET 275
Query: 272 TWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVI--DNNPD 329
+ D + GV F+ + +++ ISYGNK N ELL YGFV NP
Sbjct: 276 FLDYDKSS--KGVVFTT------DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPS 327
Query: 330 D 330
D
Sbjct: 328 D 328
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 35/63 (55%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLT 147
L +FL +L S W PY+ +LP F ++F+++EL EL+ + L T ++K +
Sbjct: 114 LQAIFLMYCKLNDKSFWYPYVSVLPKEFTTSIYFSEEELDELQSSKLKEFTIIRKDGIER 173
Query: 148 LYD 150
Y+
Sbjct: 174 HYN 176
>gi|443921579|gb|ELU41168.1| SET domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 447
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 4/79 (5%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+PGID NH +W + + T ++ LL ER + E +YG K N E
Sbjct: 240 LLPGIDSLNHKRAIPVSWVANVSEPTTSS--TLDLLIHER--VPAGMECFNNYGPKPNSE 295
Query: 316 LLYLYGFVIDNNPDDYLMI 334
L+ YGF + NPDD +++
Sbjct: 296 LMLGYGFALPLNPDDTILL 314
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLS-VERSSFHSEKEISISYGNKGNE 314
L+PGID NH +W V + S +S V + E+ +YG K N
Sbjct: 208 LLPGIDALNHARGHPVSWVVSAPSQTSSSQRSESSISLVIHTPTPRGSELLNNYGPKPNS 267
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
EL+ YGF + NNPDD +++
Sbjct: 268 ELILGYGFSLPNNPDDTIVL 287
>gi|448515852|ref|XP_003867429.1| Rkm2 protein [Candida orthopsilosis Co 90-125]
gi|380351768|emb|CCG21991.1| Rkm2 protein [Candida orthopsilosis]
Length = 431
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 13/98 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P IDF NH ++DG G F ++ SS+ +++++ SYG N+
Sbjct: 233 MAPYIDFINHSPNDHCNLKIDGKG------FQVF----TTSSYSADEQLYFSYGPHSNDF 282
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALL 353
LL YGF++ N D + I +E I S+ S+ K L
Sbjct: 283 LLTEYGFIVPENKWDDIDI---SEDILSLLKSNQKEFL 317
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 21/26 (80%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPT 113
L+ +++T+ER R S WKP++DMLPT
Sbjct: 119 LLSMYITIERKRDTSYWKPFIDMLPT 144
>gi|116200882|ref|XP_001226253.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
gi|88175700|gb|EAQ83168.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
Length = 400
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 16/103 (15%)
Query: 255 GLVPGIDFCNH-DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
+VP DF NH D T++ DG +++S +R +H ++E+ ISYG N
Sbjct: 203 AIVPIADFFNHADTGCEVTFDKDG-----------FIVSADRD-YHGDQEVYISYGAHTN 250
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEE 356
+ LL YGF+ N D + + E I P + K LL E
Sbjct: 251 DFLLAEYGFLPAANRWDEVCVD---EVILPKPSTAHKELLQGE 290
>gi|254580477|ref|XP_002496224.1| ZYRO0C13420p [Zygosaccharomyces rouxii]
gi|238939115|emb|CAR27291.1| ZYRO0C13420p [Zygosaccharomyces rouxii]
Length = 565
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 12/83 (14%)
Query: 264 NHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFV 323
NHD + W +G + +E+ +YG KGNEELL YGFV
Sbjct: 216 NHDAHSKVQWYSEGEAFCLE----------KNGGVAKGQELFNNYGAKGNEELLAGYGFV 265
Query: 324 IDNNPDDY--LMIHYPAEAIHSI 344
++N DY L I P I SI
Sbjct: 266 QEDNEFDYVALKIKLPLNTIESI 288
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 230 QRVNSQVNGATSTLTSTQGETLW--IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFS 287
Q VNS G + S++ L + L+P ID CNH A + + + + S
Sbjct: 222 QDVNSSSLGWAMSAASSRAFRLHGEVPMLLPLIDMCNHSFNPNARIVQERS--VNSLDMS 279
Query: 288 MYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
+ +L+ ++ + I+++YG N+ L YGFVI NP D + + Y
Sbjct: 280 VKVLAEKK--IKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQVELSY 326
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 6/110 (5%)
Query: 229 AQRVNSQVNGATSTLTSTQGETLW--IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPF 286
Q VNS G + S++ L + L+P ID CNH A + + + +
Sbjct: 82 GQDVNSSSLGWAMSAASSRAFRLHGEVPMLLPLIDMCNHSFNPNARIVQERS--VNSLDM 139
Query: 287 SMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
S+ +L+ ++ + I+++YG N+ L YGFVI NP D + + Y
Sbjct: 140 SVKVLAEKK--IKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQVELSY 187
>gi|241956097|ref|XP_002420769.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
gi|223644111|emb|CAX41854.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
Length = 435
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 10/72 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH T ++DG G S ++ ++ +SYG NE
Sbjct: 238 MAPYVDFMNHSCDDHCTLKIDGKGFQVRTT----------SQYNIGDQVYLSYGPHSNEF 287
Query: 316 LLYLYGFVIDNN 327
LL YGFVI N
Sbjct: 288 LLCEYGFVIPEN 299
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ L ER NS W+PY+ LP+ + PL+F ++E+ L+ T
Sbjct: 29 LAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQST 72
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 61/135 (45%), Gaps = 8/135 (5%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAI-TP 61
A+ FLQW + I+ D G GI + + +D L +P D I +
Sbjct: 8 ARTAAFLQWFKALPGATFSDAIEIVDLRSRDAGRGITALRDIPADTTLFTIPRDAIINSE 67
Query: 62 MRVLQDPLIGPECRAMFEDGE--VDDRFLMILFLTVER-LRKNSSWKPYLDMLPTTFGNP 118
L+ L ED E +D +IL + E L S WKPY+D+LP TF P
Sbjct: 68 TSSLRKKLPDLFESQGDEDEEQALDSWSALILIMMYEFFLGDESKWKPYIDVLPLTFDTP 127
Query: 119 LWFTDDELLELKGTT 133
++++++EL L+ +
Sbjct: 128 MFWSEEELSYLQASA 142
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ L ER NS W+PY+ LP+ + PL+F ++E+ L+ T
Sbjct: 11 LAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQST 54
>gi|354547319|emb|CCE44053.1| hypothetical protein CPAR2_502780 [Candida parapsilosis]
Length = 433
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 11/76 (14%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH ++DG G F ++ S++ ++++ SYG NE
Sbjct: 233 MAPYVDFINHSPDDHCNLKIDGKG------FQVF----TTSAYSRDEQLYFSYGPHSNEF 282
Query: 316 LLYLYGFVI-DNNPDD 330
LL YGF+I DN DD
Sbjct: 283 LLTEYGFIIVDNKWDD 298
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 52/104 (50%), Gaps = 9/104 (8%)
Query: 33 ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLIGP---ECRAMFEDGEVDDRFL 88
E +GFG+ ++ + ++ + L VP L +T + ++ ++G + R + G +
Sbjct: 102 EEEGFGLKATRDIKAEELFLWVPRKLLMT-VESAKNSILGSLYSQDRILQAMGNI----T 156
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ L ER NS W PY+ LP + L+F +DE+ L+ T
Sbjct: 157 LAFHLLCERYNPNSFWLPYIQTLPNEYNTALYFEEDEVQYLQST 200
>gi|218191524|gb|EEC73951.1| hypothetical protein OsI_08832 [Oryza sativa Indica Group]
Length = 486
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 6/109 (5%)
Query: 230 QRVNSQVNGATSTLTSTQGETLW--IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFS 287
Q VNS G + ST+ L I L+P ID CNH A +G P
Sbjct: 217 QDVNSSSLGWAMSAASTRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGN---VDSP-D 272
Query: 288 MYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
M + V + ++++YG N+ L YGFVI +N D + + Y
Sbjct: 273 MSVKVVAETKIDQNAAVTLNYGCYPNDFFLLDYGFVITSNSYDQVELSY 321
>gi|336262426|ref|XP_003345997.1| hypothetical protein SMAC_06551 [Sordaria macrospora k-hell]
gi|380089590|emb|CCC12472.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 482
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 18/120 (15%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KG 312
E +VP ID NH A+A ++ + + LL + S KE++ISYG+ K
Sbjct: 210 ESMVPCIDMINHSTCASAYYDENTKD-------EVILLPRPDRTISSGKEVTISYGDAKP 262
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLLPKSLLEH 372
E+L+ YGF+ + L++ P E PL +K ++ LPK + H
Sbjct: 263 AAEMLFSYGFIDPETTVESLVL--PLEPFGDDPLEKAKLFAFKD--------LPKVHVAH 312
>gi|115448513|ref|NP_001048036.1| Os02g0733800 [Oryza sativa Japonica Group]
gi|46390671|dbj|BAD16153.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113537567|dbj|BAF09950.1| Os02g0733800 [Oryza sativa Japonica Group]
gi|215768359|dbj|BAH00588.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 486
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 6/109 (5%)
Query: 230 QRVNSQVNGATSTLTSTQGETLW--IEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFS 287
Q VNS G + ST+ L I L+P ID CNH A +G P
Sbjct: 217 QDVNSSSLGWAMSAASTRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGN---VDSP-D 272
Query: 288 MYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
M + V + ++++YG N+ L YGFVI +N D + + Y
Sbjct: 273 MSVKVVAETKIDQNAAVTLNYGCYPNDFFLLDYGFVITSNSYDQVELSY 321
>gi|326512906|dbj|BAK03360.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 45/101 (44%), Gaps = 7/101 (6%)
Query: 236 VNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVER 295
++ A+S GE I L+P +D CNH A DG P M + +
Sbjct: 227 MSAASSRAFRLHGE---IPMLLPLVDMCNHSFSPNARIVQDGD---VESP-DMSVKVIAE 279
Query: 296 SSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHY 336
+ ++++YG N+ L YGFV+ +NP D + + Y
Sbjct: 280 TQIDQNAAVTLNYGCYPNDFYLLDYGFVVTSNPYDQVELSY 320
>gi|121719466|ref|XP_001276432.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119404630|gb|EAW15006.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 426
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 46/105 (43%), Gaps = 14/105 (13%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
GLVP D+ NH A DG Y + R F +EI +SYG N+
Sbjct: 225 GLVPFADYFNHADDADTEVVFDG---------RKYTFTATRQ-FEKGEEIFMSYGAHSND 274
Query: 315 ELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKA 359
L YGF +D+N D + + + I S LS+ + LE Q+
Sbjct: 275 FLFVEYGFFLDHNESDVIFL----DDIISKELSEDERKELESQQG 315
>gi|358395796|gb|EHK45183.1| hypothetical protein TRIATDRAFT_39811 [Trichoderma atroviride IMI
206040]
Length = 484
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 10/97 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGN 313
+VPG+D NH K A ++ G + LL S S +EI+ISYG K
Sbjct: 213 AMVPGLDMANHSPKYLARYDETPEG-------DVVLLPSSGSGVSSGEEITISYGEAKSA 265
Query: 314 EELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ + L++H +A+ PL +K
Sbjct: 266 AEMLFSYGFIDQESGVKELVLHL--DALPDDPLGKAK 300
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 88/203 (43%), Gaps = 31/203 (15%)
Query: 9 LEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF--------SDGVLLVVPLDLAIT 60
+E F W +N VE + +++ S E KGFG+ + + S ++ +P DL ++
Sbjct: 8 IEAFPAWALLNDVEFKSAEVR-SIEGKGFGLVAKKDIPGVSDDSSSTEAIIRIPRDLVLS 66
Query: 61 PMRVLQDPLIGPECRAMFE-DGEVDDR--FLMILFLTVERLRKNSS---------WKPYL 108
V + R + E G R L+ L + ++NSS W Y+
Sbjct: 67 AEAVEAYAKVDQHFRQLLEVAGHQSTRGDILLYLLTHLILSKRNSSGSKGCASTPWTEYI 126
Query: 109 DMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVK---------DLVKK 159
LP + P +T +E L+GT+L + + L YD+ + DL+ +
Sbjct: 127 KFLPRSISVPTMWTSEEREFLQGTSLESSVNAKLSVLSREYDELSEKASTLPFWNDLLSE 186
Query: 160 LLVLDGDSERANSIFWTRALNIP 182
+L+ D A++++ +R L +P
Sbjct: 187 SGMLE-DWILADALYRSRCLELP 208
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 14/95 (14%)
Query: 244 TSTQGETLWIEG----LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFH 299
++ +++EG L P +D NH KA+ T + + + +
Sbjct: 175 SAVNTRCIYVEGHGSTLAPFLDLLNHHWKASIE-----TSFVNN-----HFIIRSNVGYE 224
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMI 334
+ E+ I YG+ N L YGFV+D NP+D + +
Sbjct: 225 AGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITV 259
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 134/359 (37%), Gaps = 91/359 (25%)
Query: 14 QWLQ-----VNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQD 67
QWL + KVEL+ + G G+ S G LL +P LAIT
Sbjct: 43 QWLSQAGLPIQKVELKNVG------AGGRGLVSKRMLYKGDRLLFLPATLAITTESEWAC 96
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPL-WFTDDEL 126
G RA + FL ++ L K+S W PY+ LP G+ L W D
Sbjct: 97 AEAGKVIRA---KDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPGSILLWTALDVE 153
Query: 127 LELKGTTLY-RATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPH 185
L T++ RA + ++ +D DL K+
Sbjct: 154 THLSATSIKDRALQCVRE-----VEDTFNDLNKQ-------------------------- 182
Query: 186 SYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTS 245
VF +N+E+ + N L + + G L + L + + Q++
Sbjct: 183 --VFMKNREEFPP-EVFN----LKSFKWAFGILFSRLVRLPSLGQKL------------- 222
Query: 246 TQGETLWIEGLVPGIDFCNHDLKAAATWEVD-GTGLITGVPFSMYLLSVERSSFHSEKEI 304
L+P D NHD + + D G+ IT +++R + S +E+
Sbjct: 223 ---------ALIPFGDMLNHDTEVTTFLDFDSGSKSIT--------CTLDRG-YESNREV 264
Query: 305 SISYGNKGNEELLYLYGFVID-NNPDDYLMIHY---PAEAIHSIPLSDSKALLLEEQKA 359
ISYG + N ELL YGFV N +D + I PA+ ++ L K L Q++
Sbjct: 265 FISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHGLSPQQS 323
>gi|255078590|ref|XP_002502875.1| predicted protein [Micromonas sp. RCC299]
gi|226518141|gb|ACO64133.1| predicted protein [Micromonas sp. RCC299]
Length = 425
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 40/97 (41%), Gaps = 24/97 (24%)
Query: 257 VPGIDFCNHDLKAA-----------------ATWEV------DGTGLITGVPFSMYL-LS 292
VPG+D CNHD A AT E+ + TG G +M+ L
Sbjct: 181 VPGVDMCNHDFAGANAEVAVVTSPEDVQGVRATAEIADVDEREKTGGARGDGDTMFFQLR 240
Query: 293 VERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPD 329
+E++ISYG N+ +GFV + NP+
Sbjct: 241 AGDDGVEQGEEVTISYGPWPNDPFFLYFGFVPEGNPN 277
>gi|26325362|dbj|BAC26435.1| unnamed protein product [Mus musculus]
Length = 162
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 89 MILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGT 132
+ L ER NS W+PY+ LP+ + PL+F ++E+ L+ T
Sbjct: 29 LAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQST 72
>gi|145355885|ref|XP_001422177.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582417|gb|ABP00494.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 117/299 (39%), Gaps = 43/299 (14%)
Query: 36 GFGIFSSNEFSDG-----VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMI 90
G+G+ ++ DG V++ +P D + L D RA G + L +
Sbjct: 80 GYGVRATCVCDDGIARGDVIVAIPRDAMLDARSALGDAAF-ERARAR---GLSSFQLLTV 135
Query: 91 LFLTVERLRKNSS-WKPYLDMLPTTFG--NP-LWFTDDELLELKGTTLYRATELQKQNLL 146
L RL+ +S WKPYLD LP G +P LW +D L + + L+
Sbjct: 136 SLLREWRLKDTTSRWKPYLDTLPEDDGRWHPLLWRDEDVEQHLPANSTHAGARLRGLIRA 195
Query: 147 TLYDDKV-KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNS 205
D ++ + +V +L + D + + W A++I + ++ LN+ D
Sbjct: 196 CEEDTRLFRSIVDELNIDDENWPSMRHVRW--AVSIVISRAFR-------LNELDDEECL 246
Query: 206 AELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH 265
E+ +D L+ LND+ + + +G + + LVP D NH
Sbjct: 247 REVRDD-----ALLETLNDLDADCWEGSGGDSGEDDEFS--------VMALVPWADGLNH 293
Query: 266 DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYG-NKGNEELLYLYGFV 323
A ++T S ++ +++ SYG N +E+L YGFV
Sbjct: 294 SSDAG------DEAILTYDTLSQTATLRAHKAYACGEQVFDSYGSNLSDEDLFVNYGFV 346
>gi|310800174|gb|EFQ35067.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 485
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 10/103 (9%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KG 312
E +VP +D NH KA A +E + + LL S +E++ISYG+ K
Sbjct: 213 EAMVPVLDLANHSSKANAYYEQNSKD-------EVVLLLRPGCRVSSGEEMTISYGDAKS 265
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
E+L+ YGF+ + D + + P + PL +K + E
Sbjct: 266 GAEMLFSYGFIDPASAADRITL--PLTPLEDDPLGKAKLHIFE 306
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+IL + E L+ +SS WK Y D+LP++F P++++++EL +L+ + +
Sbjct: 633 LILIMIYEYLQGDSSQWKSYFDVLPSSFDTPMFWSENELDQLQASHM 679
>gi|19114522|ref|NP_593610.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
pombe 972h-]
gi|74626633|sp|O14135.1|SET8_SCHPO RecName: Full=SET domain-containing protein 8
gi|2462684|emb|CAB16739.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
pombe]
Length = 429
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 11/69 (15%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KGNE 314
L P ID CNH K+ A W + +YL +E++I+YG+ KG+
Sbjct: 189 LAPVIDLCNHSSKSNAKWSFSEDAM------QLYL----DKDIDENEEVTINYGSEKGSA 238
Query: 315 ELLYLYGFV 323
E L+ YGF+
Sbjct: 239 EFLFSYGFL 247
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 11/120 (9%)
Query: 35 KGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQ-----DPLIGPECRAMFEDGEVDDR-- 86
KG GIFS N ++ G VLL VPL+ I V Q D + D +
Sbjct: 27 KGNGIFSLNRYTSGTVLLEVPLENIICRKTVEQFRNSCDKFASIATLEEWNDMSFRTQAM 86
Query: 87 -FLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
FL L+L ++ + + W +L +LP + P + + E+ L+GT+++ ++++ L
Sbjct: 87 LFLCYLWLGIQP--RTNKWDKFLTVLPLSINTPAQWPEKEVYSLQGTSIFNPVCVKRKIL 144
>gi|313230987|emb|CBY18985.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 255 GLVPGIDFCNHDLKAAATWEV-DGTGLITGVPFSMYLLSVERSSFHSEKEISISYG-NKG 312
L+PG+D CNH + + D +T + + S KEI ISYG +KG
Sbjct: 186 SLIPGLDICNHWNSPNCRYSLKDDRFFLTA-----------KDTIDSNKEIFISYGYSKG 234
Query: 313 NEELLYLYGFVI 324
+ ELL YGFV+
Sbjct: 235 DHELLSTYGFVL 246
>gi|313243727|emb|CBY42355.1| unnamed protein product [Oikopleura dioica]
Length = 269
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 13/72 (18%)
Query: 255 GLVPGIDFCNHDLKAAATWEV-DGTGLITGVPFSMYLLSVERSSFHSEKEISISYG-NKG 312
L+PG+D CNH + + D +T + + S KEI ISYG +KG
Sbjct: 186 SLIPGLDVCNHWNSPNCRYSLKDDRFFLTA-----------KDTIDSNKEIFISYGYSKG 234
Query: 313 NEELLYLYGFVI 324
+ ELL YGFV+
Sbjct: 235 DHELLSTYGFVL 246
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 66/138 (47%), Gaps = 10/138 (7%)
Query: 12 FLQWLQVNKVELRGCKIKYSD-ESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPL 69
++WL+ N R + +D G G+ S+ +G +V +P +L IT V+ L
Sbjct: 9 LMRWLRRNG--FRDSHLVLTDFPDTGRGVMSTRNLKEGDCIVSLPENLLITTTTVVNSHL 66
Query: 70 IGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLE 128
G + + + ++ L+L E+ R K+S W PY+ LPT++ P +F+ E+
Sbjct: 67 -GQYIKTW--KPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTLPTSYTTPSYFSTAEVDA 123
Query: 129 LKGTTLYRATELQKQNLL 146
L L R L+ + +L
Sbjct: 124 LPA--LVREATLRHRKVL 139
>gi|149059900|gb|EDM10783.1| hypothetical protein RDA279, isoform CRA_c [Rattus norvegicus]
Length = 314
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 9/163 (5%)
Query: 14 QWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGP 72
+WL+ K E G + G G+ S +G V++ +P +T V++ +GP
Sbjct: 72 KWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSS-VGP 130
Query: 73 ECRAMFEDGEVDDRFLMILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKG 131
+ V + FL ER + S WK YLD+LP ++ P+ + E+++L
Sbjct: 131 YIKKW--KPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLP 187
Query: 132 TTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIF 174
L E Q+ + L+ +D L L +S +SIF
Sbjct: 188 GPLRAKAEEQRARVQDLFASS-RDFFSTLQPLFAES--VDSIF 227
>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
Length = 574
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 34/83 (40%), Gaps = 14/83 (16%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGL--ITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L+P ID NH W + IT P EI +YG KGN
Sbjct: 207 LMPVIDLLNHRFPTKVAWFFNEGNFQFITEEPVP------------KGHEIFNNYGGKGN 254
Query: 314 EELLYLYGFVIDNNPDDYLMIHY 336
EELL YGF I NN D + I +
Sbjct: 255 EELLNGYGFCIPNNHCDEVAIRF 277
>gi|319411930|emb|CBQ73973.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 658
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 103 SWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLVKKL-- 160
+W PY D++PT F P+++ + +L L GT++ A ++ + Y +K ++ L
Sbjct: 168 TWGPYFDIMPTHFSTPMFWAEADLHHLSGTSI--ADKVARDEAEADYHNKAIPFIRSLPG 225
Query: 161 LVLDGDSE 168
+ L+G +E
Sbjct: 226 VFLEGTAE 233
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 3/81 (3%)
Query: 248 GETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS 307
GE L +VP D NH + A + +G ++ + +++ IS
Sbjct: 205 GERL---AMVPYADLINHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYIS 261
Query: 308 YGNKGNEELLYLYGFVIDNNP 328
YG K N ELL LYGF ++ NP
Sbjct: 262 YGQKSNAELLLLYGFALERNP 282
>gi|320580679|gb|EFW94901.1| hypothetical protein HPODL_3273 [Ogataea parapolymorpha DL-1]
Length = 423
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 96/247 (38%), Gaps = 89/247 (36%)
Query: 101 NSSWKPYLDMLPTT---FGNPLWFTDDELLELKGTTLYRATELQKQ-------NLLTLYD 150
++ ++PY+ LP + G PL++TD E LKGT Y + ++ +L++L +
Sbjct: 98 STMYEPYIAFLPDSCLEVGLPLFWTDHEQELLKGTDAYPRLKRTREELFERWSSLMSLLN 157
Query: 151 DKVK-DLVKKLLVLDGDS---------ERANSIFWTRALNIPLPHSYVFPQNQEDLNKYD 200
++ K DLV K L DS A SI+ TRA FP
Sbjct: 158 EQKKLDLVMKEAPLCKDSLSWKSFEAFSWAYSIYCTRA----------FP---------- 197
Query: 201 SINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGI 260
N ++ +++R +L I L P +
Sbjct: 198 ----------------------NFLRKQSER------------------SLNIGFLCPIV 217
Query: 261 DFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLY 320
D NH TW + + + S +R + +EI +YGNK N +LL Y
Sbjct: 218 DLLNHKNGEKVTWTCEDNSFV-------FKASAKR--IRAGEEIYNNYGNKSNTDLLLNY 268
Query: 321 GFVIDNN 327
GF++++N
Sbjct: 269 GFILNDN 275
>gi|367002402|ref|XP_003685935.1| hypothetical protein TPHA_0F00140 [Tetrapisispora phaffii CBS 4417]
gi|357524235|emb|CCE63501.1| hypothetical protein TPHA_0F00140 [Tetrapisispora phaffii CBS 4417]
Length = 576
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 50/124 (40%), Gaps = 19/124 (15%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +D NH W + V F V + + E+ +YGNK NEE
Sbjct: 227 LYPVVDLLNHKNDTRVKWSYNENK----VQF------VSEEKINVDMELFNNYGNKSNEE 276
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCL---------LP 366
LL YGF+ +NN D + E + +S +E+ + C+ +P
Sbjct: 277 LLLNYGFIEENNVHDNTRLTLRLETAALVGAQNSGVKFSKEELVKSDCVQFLIDNNGEVP 336
Query: 367 KSLL 370
KSL+
Sbjct: 337 KSLI 340
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH +E D + P + + +E+ ISYG N+E
Sbjct: 216 LMPLIDAFNHKTMIKTEFEFDRGAFVLRAP----------KDYETGEEVLISYGVLNNDE 265
Query: 316 LLYLYGFV-IDNNPDDY----LMIHYPAEAIHSIPLSDSKALLLEEQK-AQLRCLLPKSL 369
L+ YGFV +DN D Y LM + A P+ +AL ++++ + L+ P+
Sbjct: 266 LITRYGFVDVDNVADIYRFEGLMSYLQA---SYDPM--KRALGADQKRLSTLKRTHPE-- 318
Query: 370 LEHGFFAAGHPKDGNNDNKL 389
L+ + DGN D KL
Sbjct: 319 LDQALWEGNFISDGNADPKL 338
>gi|448092000|ref|XP_004197467.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|448096594|ref|XP_004198498.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|359378889|emb|CCE85148.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|359379920|emb|CCE84117.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
Length = 595
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 57/140 (40%), Gaps = 30/140 (21%)
Query: 256 LVPGIDFCNHDLKAAATWEV---DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKG 312
L+P D NH + W GTG + + +ER E+ + +YGNK
Sbjct: 231 LIPLFDILNHKSNSPIRWTPVMESGTGNV--------IFQLERGVKKGEQLFN-NYGNKS 281
Query: 313 NEELLYLYGFVIDNNPDDYLMIH--------YPAEAIHSIPLSDSK------ALLLEEQK 358
N ELL YGF + NP D I + H + D K LL +
Sbjct: 282 NCELLLSYGFAEEKNPHDSASITLKIDDGNIFKKALAHGVKFFDGKNKIKDINLLTKGVN 341
Query: 359 AQL--RCLLPKSLLEHGFFA 376
Q+ CLLP +L++ FFA
Sbjct: 342 FQISNECLLPSNLID--FFA 359
>gi|358388734|gb|EHK26327.1| hypothetical protein TRIVIDRAFT_35595 [Trichoderma virens Gv29-8]
Length = 482
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 21/168 (12%)
Query: 3 ISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDG--------VLLVVP 54
+ ++ +E F W +N V+ +I+ D KG G+ + ++ ++ ++ +P
Sbjct: 1 MQSQLPIEAFPAWALLNNVDFASAEIRNID-GKGLGLVAKHDITEAGRDASSSPAIVRIP 59
Query: 55 LDLAITPMRVLQDPLIGPECRAMFE-DGEVDDRFLMILFLTVE-----------RLRKNS 102
DL ++ V + + + + E G R ++L+L + R ++
Sbjct: 60 RDLVLSAEAVEEYAKVDQNFKQLLEVAGRQSTRGDILLYLLTQIIQSKGTSPSTRPFAST 119
Query: 103 SWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
W Y+ LP P +T++E LKGT+L A + L YD
Sbjct: 120 PWTEYIKFLPRYIPIPTMWTNEERELLKGTSLEAAVSAKLSALSHEYD 167
>gi|190345582|gb|EDK37493.2| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
6260]
Length = 592
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 7/75 (9%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH A W GL S ++ + S ++ +YG KGNEE
Sbjct: 228 LLPVIDLLNHSPSAHVEW-----GLERKESKSYFVF--KSDDVKSGSQLFNNYGMKGNEE 280
Query: 316 LLYLYGFVIDNNPDD 330
LL YGF +++N D
Sbjct: 281 LLLAYGFCLEDNSSD 295
>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 61/272 (22%), Positives = 96/272 (35%), Gaps = 70/272 (25%)
Query: 91 LFLTVERLRK-NSSWKPYLDMLPTT----FGNPLWFTDDELLELKGTTLYRATELQKQNL 145
L++ +E+ RK NS W ++ LP F +W + EL K +
Sbjct: 153 LYICLEKKRKENSFWCAFISSLPKLEELDFAPIVWEVESELTGSKAADFFE--------- 203
Query: 146 LTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQEDLNKYDSINNS 205
L ++ KK+LV + A S F T A + PL E L + IN+
Sbjct: 204 --LLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEPL-------NKMEFLWAWMCINSR 254
Query: 206 AELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNH 265
+ +S+ E N L P +DF NH
Sbjct: 255 CLYMSFPSSKAEADNF---------------------------------TLAPYVDFLNH 281
Query: 266 DLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVID 325
D ++D G L + + +E+ SYG NE LL Y F ++
Sbjct: 282 DCDEKCAIKIDSRGF----------LVISCVDHAAGQELLFSYGPHSNEFLLCEYAFTME 331
Query: 326 NNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQ 357
N + L + + E I ++D++ L EQ
Sbjct: 332 TNKWNNLDVSHHIEGI----MNDAQKSFLREQ 359
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 71/153 (46%), Gaps = 18/153 (11%)
Query: 1 MEISTEAKLEP-FLQWLQVNKVELRGCKIKYS----DESKGFGIFSSNEFSDG-VLLVVP 54
+E+ T+A L+ FL WL+ + G KI + + G +F+S G +L VP
Sbjct: 34 LELQTQASLDNNFLPWLE----RIAGAKITNTLSIGKSTYGRSLFASKVIYAGDCMLKVP 89
Query: 55 LDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDML--P 112
+ ITP + D R + + + L + + +++ + S W PY+ L P
Sbjct: 90 FNAQITPDELPSD------IRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQP 143
Query: 113 TTFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
+ +++ +DEL ++ + +++ T QK +
Sbjct: 144 AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQI 176
>gi|440804288|gb|ELR25165.1| hypothetical protein ACA1_288860 [Acanthamoeba castellanii str.
Neff]
Length = 445
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 57/153 (37%), Gaps = 37/153 (24%)
Query: 282 TGVPFSMYLLSVERSSFHSEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAI 341
T VPF+ +L H+ + ISYG GNE L++LYGF I +N ++ P +
Sbjct: 194 TLVPFADFL-------NHANVDAFISYGKHGNEPLIHLYGFAIPDNSEEIYNFVDPLQV- 245
Query: 342 HSIPLSDSKALLLEEQKAQLRCLLPKSLLEHGFFAAGHPKDGNNDNKLEVDRISSFSWSG 401
L EE+ A CL SLL G N N+ SW
Sbjct: 246 --------DVCLREEKDA---CLQDASLLT-GLMGCSLYVTHMNVNRFYAWPSGELSW-- 291
Query: 402 QRRMPSYLNKLVFPENFLTALRTIAMQEDEISK 434
NF ALR + EDE +
Sbjct: 292 ---------------NFDVALRVYLLDEDEFVR 309
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 32/76 (42%), Gaps = 8/76 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L P +D NH A T D P S S+H+ ++ I YG N
Sbjct: 241 ALAPFLDLLNHSPGAEVTAGFD--------PVSNCYRIKTLDSYHAYDQVFIHYGPHDNV 292
Query: 315 ELLYLYGFVIDNNPDD 330
LL YGFVI +NP D
Sbjct: 293 NLLLEYGFVIPSNPHD 308
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 10/78 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP D NH+ + + D + GV F+ +R + +++ ISYGNK N
Sbjct: 259 ALVPWADMLNHNCEVETFLDYDKSS--KGVVFT-----TDRP-YQPGEQVFISYGNKSNG 310
Query: 315 ELLYLYGFVI--DNNPDD 330
ELL YGFV NP D
Sbjct: 311 ELLLSYGFVPREGTNPSD 328
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 71/153 (46%), Gaps = 18/153 (11%)
Query: 1 MEISTEAKLEP-FLQWLQVNKVELRGCKIKYS----DESKGFGIFSSNEFSDG-VLLVVP 54
+E+ T+A L+ FL WL+ + G KI + + G +F+S G +L VP
Sbjct: 34 LELQTQASLDNNFLPWLE----RIAGAKITNTLSIGKSTYGRSLFASKVIYAGDCMLKVP 89
Query: 55 LDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDML--P 112
+ ITP + D R + + + L + + +++ + S W PY+ L P
Sbjct: 90 FNAQITPDELPSD------IRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQP 143
Query: 113 TTFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
+ +++ +DEL ++ + +++ T QK +
Sbjct: 144 AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQI 176
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 10/78 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
LVP D NH+ + + D + GV F+ +R + +++ ISYGNK N
Sbjct: 259 ALVPWADMLNHNCEVETFLDYDKSS--KGVVFT-----TDRP-YQPGEQVFISYGNKSNG 310
Query: 315 ELLYLYGFVI--DNNPDD 330
ELL YGFV NP D
Sbjct: 311 ELLLSYGFVPREGTNPSD 328
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 11/107 (10%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
+VP D NH + A + +G ++ + +++ ISYG K N
Sbjct: 369 AMVPYADLINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNA 428
Query: 315 ELLYLYGFVIDNNPDDYL-----------MIHYPAEAIHSIPLSDSK 350
ELL LYGF ++ NP + + I E + PL+D K
Sbjct: 429 ELLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEK 475
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 71/153 (46%), Gaps = 11/153 (7%)
Query: 13 LQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF--SDGVLLVVPLDLAITPMRVLQDPLI 70
+QWL+ K E+ K+ +S+G+ +++F +L VP ++ V + LI
Sbjct: 44 IQWLKDGKAEV--SKVSIEVKSEGYRTLRASQFIRQGEWVLFVPRTHYLSLEEVKKSCLI 101
Query: 71 GPECRAMFEDGEVDDRF--LMILFLTVERLRKNSSWKPYLDMLPTTF-GNPLWFTDDELL 127
R M + + + + L E R+NS WKPY+D+LP G P F ++
Sbjct: 102 N---RKMIQLNYIPNNIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTNFDAEQDA 158
Query: 128 ELKGTTLYRATELQKQNLLTLYDDKVKDLVKKL 160
LKG+ Q++ YD+ +K+ VK+
Sbjct: 159 LLKGSPTLFTVMNQRKTFQEEYDN-LKEAVKEF 190
>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
Length = 424
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 32/72 (44%), Gaps = 10/72 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH ++D TG + S++ E+ +SYG NE
Sbjct: 237 MAPYVDFINHSTNDQCKLKIDRTGFHV----------ITTSNYKENDELYLSYGPHSNEF 286
Query: 316 LLYLYGFVIDNN 327
LL YGF + NN
Sbjct: 287 LLCEYGFHLSNN 298
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 89 MILFLTVERLRKNSS-WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQ 141
+++FL ER + SS WKPY+D+LP+++ + L +T E+ L T RA +L+
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLR 54
>gi|313239201|emb|CBY14158.1| unnamed protein product [Oikopleura dioica]
Length = 393
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 6/105 (5%)
Query: 27 KIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEV-- 83
K+K SD G G+FSS LL+ VP+D +T + R + ++
Sbjct: 22 KLKISDGDCGRGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTCL 81
Query: 84 --DDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
D + LFL +E +NS W +L +P NP + EL
Sbjct: 82 NGTDLLVCALFLELEN-EENSKWSAFLSSIPNQLWNPFMLDEKEL 125
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 34/142 (23%), Positives = 63/142 (44%), Gaps = 30/142 (21%)
Query: 6 EAKLEPFLQWL---------QVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPL 55
+AK F WL +V V+LR +G G+ + ++ D VL +P
Sbjct: 7 QAKTASFFAWLGRIGVHHSPKVALVDLRSA-------GRGRGLVAQSDIGEDEVLFTIPR 59
Query: 56 DLAI---TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP 112
D + T + +P I E+ + + E +++S W PYL +LP
Sbjct: 60 DAVLNTTTALGSADNPAIL----------EMPCWLALTAIILTEGQQEDSKWAPYLALLP 109
Query: 113 TTFGNPLWFTDDELLELKGTTL 134
+ + +++++ ELLEL+ +T+
Sbjct: 110 SRLDSLVFWSESELLELQASTV 131
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 11/82 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +D NH A TW+ T VP + L+ + ++ +YG K N E
Sbjct: 246 LQPLLDIANHSPTARYTWDT------TSVPDTCRLIC--HDPYQPGTQVYNNYGLKTNSE 297
Query: 316 LLYLYGFVIDNNP---DDYLMI 334
LL YGF++ P +DY+ +
Sbjct: 298 LLLAYGFILPETPSLHNDYVHV 319
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 14/103 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSM---YLLSVERSSFHSEKEISISYGNKG 312
L P +D NH +G+ + V ++ Y + + + +++ ISYG +
Sbjct: 12 LTPVVDLINHQ-----------SGIDSDVSYNYFYGYFAVTTQRGWTAGEQVLISYGPRS 60
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLE 355
N+ LL YGFV +NP+D I + + + DS +L E
Sbjct: 61 NDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVRVLRE 103
>gi|255087454|ref|XP_002505650.1| predicted protein [Micromonas sp. RCC299]
gi|226520920|gb|ACO66908.1| predicted protein [Micromonas sp. RCC299]
Length = 774
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 59/143 (41%), Gaps = 17/143 (11%)
Query: 4 STEAKLEPFLQWLQVNKVELRGCKI-----KYSDESKGFGIFSSNEFSDG-VLLVVPLDL 57
S + L FL W + EL + D + +F+ + + G VL+V+P
Sbjct: 84 SDDPTLANFLTWCTMRGFELHPSLVVKNASGAGDAPRHNAVFARGDIAPGDVLVVIPKCW 143
Query: 58 AITPMRVLQDPLIGPECRAMFEDGEVD-DRFLMILFLTVER-LRKNSSWKPYLDMLPTTF 115
+TP G + D D D +IL + ER L S W PY +LPT
Sbjct: 144 CLTPR-------TGSITNVLPRDVLADLDEAALILTVMYERALGSRSPWAPYFALLPTPS 196
Query: 116 GN-P-LWFTDDELLELKGTTLYR 136
N P LW D L+GT ++R
Sbjct: 197 ENLPFLWDERDATRWLEGTEVFR 219
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 73/182 (40%), Gaps = 24/182 (13%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIF---SSNEF--SDGVLLVVPL 55
ME + + E L W + L Y DE GF + S+ E S + P
Sbjct: 1 MESPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKPSATELLGSGFKAVSCPT 60
Query: 56 DLAITPMRVLQDPLIGPECRAMFEDGE---VDDRFLMIL--------FLTVERLR-KNSS 103
+ ++ + L D I P + + E +RF+ L +L + L+ K+S
Sbjct: 61 SITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKSSF 120
Query: 104 WKPYLDML--PTTFGN----PLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKVKDLV 157
W PY+ L P+ P W DD L LKGT Y A + + N+ + Y K L
Sbjct: 121 WAPYISTLADPSQLDKWALPPFWAEDDIEL-LKGTNAYVAIQEIQSNVKSEYKQARKILK 179
Query: 158 KK 159
K+
Sbjct: 180 KE 181
>gi|444314545|ref|XP_004177930.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
gi|387510969|emb|CCH58411.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
Length = 550
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P +D NH + W+ + G + ++ EI +YG KGNEE
Sbjct: 201 LLPVLDLLNHSNYSKVEWDGNKGG-------NFIYKKLDLQEIEIGDEIYNNYGGKGNEE 253
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGFVI++N D +++
Sbjct: 254 LLNGYGFVIEDNLFDSVLL 272
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L P +D NH KA+ T ++ G F + + S + +++ ISYG N
Sbjct: 225 ALAPFLDCLNHHWKASIE-----TAMV-GENFEI----LSHKSHDANEQVFISYGPHSNR 274
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
L YGFV+ +NP+D +++
Sbjct: 275 RLFLDYGFVLPDNPNDVVVV 294
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 60/127 (47%), Gaps = 8/127 (6%)
Query: 12 FLQWLQVNKVELRGCKI---KYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDP 68
F+ W Q+ L K+ ++ + +G I + D L +P +L ++
Sbjct: 9 FVHWFQLQHGNLDTEKVGIVEFPEHGRG-AIALQDIPEDYTLFTIPRELTLSTRTCSLPT 67
Query: 69 LIGPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELL 127
L+G +A E G + +IL + E R +S W YL LP++F P+++ ++L
Sbjct: 68 LMG---QAWKEHGLHEGWAGLILCMIWEESRGSDSKWSGYLATLPSSFDTPMFWGQEDLN 124
Query: 128 ELKGTTL 134
EL+GT +
Sbjct: 125 ELQGTAV 131
>gi|320581155|gb|EFW95376.1| Dihydrolipoamide dehydrogenase [Ogataea parapolymorpha DL-1]
Length = 632
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 10/79 (12%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L+P ID NH K+ TW G F + L + E+ +YG KGNEE
Sbjct: 45 LLPIIDLFNHKPKSKVTWVPSSAG------FELRL----QGGADINGELFNNYGPKGNEE 94
Query: 316 LLYLYGFVIDNNPDDYLMI 334
LL YGF + +N ++ L +
Sbjct: 95 LLMGYGFALVDNENEQLQL 113
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L P +D NH KA+ T ++ G F + + S + +++ ISYG N
Sbjct: 224 ALAPFLDCLNHHWKASIE-----TAMV-GENFEI----LSHKSHDANEQVFISYGPHSNR 273
Query: 315 ELLYLYGFVIDNNPDDYLMI 334
L YGFV+ +NP+D +++
Sbjct: 274 RLFLDYGFVLPDNPNDVVVV 293
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 58/128 (45%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WL KI +D +S G G+ + ++ DG L +P L ++
Sbjct: 20 FITWLAGKPGVRINSKINIADLRSKSAGRGVVARSDIFDGEELFSIPRGLVLSAQNSKLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERL-RKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ + E+ +IL + E L + S+W PY +LP +F ++++ EL
Sbjct: 80 DLLSQDLE------ELGPWLSLILVMMYEYLLGEQSAWAPYFKVLPKSFDTLMFWSPSEL 133
Query: 127 LELKGTTL 134
EL+G+ +
Sbjct: 134 QELQGSAI 141
>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
Length = 538
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 68/130 (52%), Gaps = 18/130 (13%)
Query: 10 EPFLQWLQVNKVELRG-CKIK-YSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQD 67
E + +WL+ N +EL G KI+ D+++G G+ ++ + ++ +L +I+ VL
Sbjct: 10 EAYKKWLEAN-LELAGNVKIEDLRDQNEGRGVVATRDINEDEVL-----FSISKDNVLN- 62
Query: 68 PLIGPECRAMFED--GEVDDRFL----MILFLTVER-LRKNSSWKPYLDMLPTTFGNPLW 120
I A +D G + R +IL L ER L + S WK YLD LPT+F N ++
Sbjct: 63 --IQTSSLANVKDNNGVILKRLNHWEGLILCLAYERSLGELSRWKGYLDTLPTSFNNLIF 120
Query: 121 FTDDELLELK 130
+ + +L LK
Sbjct: 121 WNEQDLNSLK 130
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 14/83 (16%)
Query: 256 LVPGIDFCNHDLKAAATWEV--DGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L+P ID CNH A+A +G L+T + + I + YG N
Sbjct: 239 LLPVIDVCNHSFDASARVRECENGVELVTT------------RDLKAGQPIELCYGELSN 286
Query: 314 EELLYLYGFVIDNNPDDYLMIHY 336
+EL YGF++++N D + + +
Sbjct: 287 DELFLDYGFIVEDNAFDTVKLRW 309
>gi|380477696|emb|CCF44010.1| SET domain-containing protein, partial [Colletotrichum
higginsianum]
Length = 448
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 47/98 (47%), Gaps = 10/98 (10%)
Query: 254 EGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGN-KG 312
E +VP +D NH +A A +E +G + LL S +E++ISYG+ K
Sbjct: 213 EAMVPVLDLANHSSEANAYYEENGKD-------EVVLLLRPGCRVSSGEEMTISYGDAKS 265
Query: 313 NEELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+L+ YGF+ + D + + P + PL +K
Sbjct: 266 GAEMLFSYGFIDPVSAADRMTL--PLMPLEDDPLGKAK 301
>gi|149237793|ref|XP_001524773.1| hypothetical protein LELG_03805 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451370|gb|EDK45626.1| hypothetical protein LELG_03805 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 572
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 37/160 (23%), Positives = 80/160 (50%), Gaps = 15/160 (9%)
Query: 12 FLQWLQVNKVEL--RGCKIKYSDESKGFGIFS-SNEFSDGVLLVVPLDLAITPMRVLQDP 68
+L+WL+ N ++ + Y+ +G G+ + +N D ++ +P + + V+Q+
Sbjct: 18 YLEWLKQNNYDISDKVAIHDYTSVKQGRGVIALANIDKDEIIATIP---KLALLNVIQNS 74
Query: 69 LIG--PECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTT-FGNPLWFTDDE 125
L+ PE + + ++IL ++ R+ S WK YL++LPT+ F ++++ +E
Sbjct: 75 LVAKYPELKYGLLHLNHWEALIIILLYELQN-REQSKWKSYLNVLPTSNFDQLMFWSSNE 133
Query: 126 LLELK-GTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLD 164
L +L+ L R + Q + D++ ++ KL + D
Sbjct: 134 LNQLQPSCILERVGKDQADKMF----DRIMKIIHKLGITD 169
>gi|159474448|ref|XP_001695337.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275820|gb|EDP01595.1| predicted protein [Chlamydomonas reinhardtii]
Length = 360
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 57/116 (49%), Gaps = 17/116 (14%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
LVP +D NHD++ +++ G + V V + ++++I+YG+ N+E
Sbjct: 199 LVPVLDLVNHDVR-----DINQLGNSSTVDL------VAGKDLAAGEQVTITYGSMRNDE 247
Query: 316 LLYLYGFV--IDNNPDDYLMIHYPAEAIHSIPLSDS----KALLLEEQKAQLRCLL 365
LL YGFV + P + + H + + PLSDS +L + A+LR +L
Sbjct: 248 LLMYYGFVDTVTEPPRLFSVDHRDFKLYEANPLSDSPLEGPPEVLRTELARLRGIL 303
>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
Length = 492
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 67/144 (46%), Gaps = 13/144 (9%)
Query: 1 MEISTEAKLEPFLQWLQVNKVELRGCKIKYSDE-------SKGFGIFSSNEFS-DGVLLV 52
M+ ++K E F++WL N V G +I E +G + + +F+ D ++
Sbjct: 1 MDDQFDSKTEAFVKWL--NHV---GVRISAKAELTCLRADGRGRALVAKGDFAEDELIFS 55
Query: 53 VPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLP 112
VP ++ L + L G + + + + + + E LR S W PY ++LP
Sbjct: 56 VPRTSTLSVKAALPEMLSGRQDISPEDIESMPGWAALTAVIISEGLRPESKWAPYFNVLP 115
Query: 113 TTFGNPLWFTDDELLELKGTTLYR 136
T + ++++ +EL EL+ + + +
Sbjct: 116 TKLDSLVFWSPEELAELQASAVLK 139
>gi|410082986|ref|XP_003959071.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
gi|372465661|emb|CCF59936.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
Length = 584
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 54/135 (40%), Gaps = 27/135 (20%)
Query: 256 LVPGIDFCNHDLKAAATW--EVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGN 313
L P +D NH + W E D T + E+ +YG+K N
Sbjct: 230 LYPIVDLLNHKNGTSVKWTFEDDQAHFFTN-----------EKNLKKHTELFNNYGDKSN 278
Query: 314 EELLYLYGFVIDNNPDD--YLMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCL------- 364
EELL YGFV NN D L + + I S+P +L +E CL
Sbjct: 279 EELLLGYGFVQSNNAHDDTKLTLKLDPQLIESMP--SFGIVLNKENTVGTECLQFTLSSR 336
Query: 365 --LPKSLLE-HGFFA 376
LP++LL+ GF +
Sbjct: 337 NPLPRNLLKLFGFLS 351
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 82/185 (44%), Gaps = 15/185 (8%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ ++ + G VL VP L + V L G R G + + L L
Sbjct: 241 GLGLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGR-----GGLRPWVAVALLLL 295
Query: 95 VERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
E R S W PYL +LP + ++++++ELLE++GT L T K+ + + ++
Sbjct: 296 REAARGAGSPWAPYLAILPRQTDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFESVE 355
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQED----LNKYDSINNSAELS 209
+++ + L + N W + S VF + + D + D +N+S +++
Sbjct: 356 AEIISENRELFPGTVTFNDFLWA----FGILRSRVFAELRGDKLALIPFADLVNHSDDIT 411
Query: 210 NDHNS 214
+ +S
Sbjct: 412 SKESS 416
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 7 AKLEPFLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFS-DGVLLVVPLDLAITPMRVL 65
AK+E F W + V G +I + G+ S+ + D ++L VP L +
Sbjct: 116 AKVEAFSAWAKDGGVHSEGLEIAIFPGYQ-LGLRSTRPLAKDELVLSVPRKLIFSEESNS 174
Query: 66 QDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGNPLWFTDD 124
L G +A + D L +E++R S W+ Y+D+LP + L+FT
Sbjct: 175 DCRLFGKMTQATHLNLAYD--------LVIEKIRGEFSEWRTYIDVLPAKYSTVLYFTTK 226
Query: 125 ELLELKGTT 133
++ L+GT
Sbjct: 227 QMELLRGTA 235
>gi|428164251|gb|EKX33284.1| hypothetical protein GUITHDRAFT_166511 [Guillardia theta CCMP2712]
Length = 294
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 89 MILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQ 143
M L L E+ R + S W Y+ +LPT F PL F+ DEL EL+ E++++
Sbjct: 170 MALRLIYEKRRGEESKWYQYIQILPTNFDVPLLFSQDELKELQDPLFIHEVEIEQK 225
>gi|365758118|gb|EHM99977.1| Rkm1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 324
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 302 KEISISYGNKGNEELLYLYGFVIDNNPDD--YLMIHYPAEAIHSI 344
+E+S +YG KGNEELL YGFV+++N D L I P + + +I
Sbjct: 8 QELSNNYGGKGNEELLSGYGFVLEDNIFDSVALKIKLPLDVVSTI 52
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 58/128 (45%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WL KI +D +S G G+ + ++ DG L +P L ++
Sbjct: 20 FITWLAGKPGVKINSKISIADLRSKSAGRGVVARSDIFDGEELFSIPRGLVLSAQNSKLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERL-RKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ + E+ +IL + E L + S+W PY +LP +F ++++ EL
Sbjct: 80 DLLSQDLE------ELGPWLSLILVMMYEYLLGEQSAWAPYFKILPKSFDTLMFWSPSEL 133
Query: 127 LELKGTTL 134
EL+G+ +
Sbjct: 134 RELQGSAI 141
>gi|149235281|ref|XP_001523519.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452928|gb|EDK47184.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 610
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 53/137 (38%), Gaps = 19/137 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISIS-------- 307
L+P +D NH A W TG + F E E +S
Sbjct: 265 LLPIVDLLNHQPNAKVKW----TGFDVKESAKNGAEGEDNIGFKFETEEPVSKGSQVFNN 320
Query: 308 YGNKGNEELLYLYGFVIDNNPDDY--LMIHYPAEAIHSIPLSDSKALLLEEQKAQLRCLL 365
YG KGNEELL YGF I+NN D L I P I + S K +E+ +
Sbjct: 321 YGLKGNEELLSAYGFTIENNELDVVALKIQIPDVVIDMVEKSGIKLPKIEDYTNSISA-- 378
Query: 366 PKSLLEHG---FFAAGH 379
K + + G F +A H
Sbjct: 379 QKDMAKDGVLFFISASH 395
>gi|345565943|gb|EGX48890.1| hypothetical protein AOL_s00079g111 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 16/83 (19%)
Query: 256 LVPGIDFCNHDLK----AAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNK 311
+ P ID+ NHD K ++ +DG + T +S+ +EI ++YG
Sbjct: 208 MCPFIDYYNHDAKGDESCTVSFSIDGLRVTTQKEYSV------------GEEIFVTYGEY 255
Query: 312 GNEELLYLYGFVIDNNPDDYLMI 334
N+ LL YGF + N D + I
Sbjct: 256 NNDHLLVEYGFTLPKNQADNMNI 278
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 58/128 (45%), Gaps = 11/128 (8%)
Query: 12 FLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGV-LLVVPLDLAITPMRVLQD 67
F+ WL KI +D +S G G+ + ++ DG L +P L ++
Sbjct: 20 FITWLAGKPGVKINSKISIADLRSKSAGRGVVARSDIFDGEELFSIPRGLVLSAQNSKLK 79
Query: 68 PLIGPECRAMFEDGEVDDRFLMILFLTVERL-RKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
L+ + E+ +IL + E L + S+W PY +LP +F ++++ EL
Sbjct: 80 DLLSQDLE------ELGPWLSLILVMMYEYLLGEQSAWAPYFKILPKSFDTLMFWSPSEL 133
Query: 127 LELKGTTL 134
EL+G+ +
Sbjct: 134 RELQGSAI 141
>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
Length = 331
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 36/89 (40%), Gaps = 10/89 (11%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
LVP D NH K ++ GT L+ + +EI +G GN
Sbjct: 187 LVPFGDMLNHHDKCQTKQKIIGTDLVF----------ITTKQIQENEEIYNFFGEHGNSF 236
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSI 344
LL YGF DNN D L + Y + I +
Sbjct: 237 LLCWYGFTYDNNIYDKLYLLYEDDQIKEV 265
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 7/143 (4%)
Query: 12 FLQWLQVNKVELRGCKIK-YSDESKGFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPL 69
+ WLQ E + ++ Y+ E +G + NE G +L +P + IT + + +
Sbjct: 120 LIDWLQNQGAETKKLMLQQYAPEVRG--VHCRNELVPGERILFIPKNCLIT-VEMGKQTE 176
Query: 70 IGPECRAMFEDGEVDDRFLMILFLTVERLRKN-SSWKPYLDMLPTTFGN-PLWFTDDELL 127
IG + A + +IL+L + +K+ + +K Y LP+T N P++++D EL
Sbjct: 177 IGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYYSTLPSTLKNMPIFWSDQELS 236
Query: 128 ELKGTTLYRATELQKQNLLTLYD 150
LKG+ + + +K + YD
Sbjct: 237 WLKGSYILHQIQERKAAIRKDYD 259
>gi|313214063|emb|CBY42615.1| unnamed protein product [Oikopleura dioica]
Length = 393
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 45/105 (42%), Gaps = 6/105 (5%)
Query: 27 KIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLIGPECRAMFEDGEV-- 83
K+K SD G G+FSS LL+ VP+D +T + R + ++
Sbjct: 22 KLKISDGDCGRGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTCL 81
Query: 84 --DDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDEL 126
D + LFL +E +NS W +L +P NP + EL
Sbjct: 82 NGTDLLVCALFLELET-GENSKWTAFLSSIPKQLWNPFMLDEKEL 125
>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 49/102 (48%), Gaps = 14/102 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
L P +DF NHD ++D G FS+ + V+ ++ +E+ SYG NE
Sbjct: 272 LAPYVDFLNHDCDEKCAIKIDSRG------FSV-ISCVDHAA---GQELLFSYGPHSNEF 321
Query: 316 LLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSKALLLEEQ 357
LL Y F ++ N + L + + E I ++D++ L EQ
Sbjct: 322 LLCEYAFTMETNKWNNLDVSHHIEGI----MNDAQKSFLREQ 359
>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
NRRL Y-27907]
Length = 436
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 10/72 (13%)
Query: 256 LVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNEE 315
+ P +DF NH T ++DG G F ++ ++ +++ +SYG N+
Sbjct: 241 MAPYVDFLNHSPNDHCTLKIDGRG------FQVF----STCAYSENEQVYLSYGPHSNDF 290
Query: 316 LLYLYGFVIDNN 327
LL YGF I +N
Sbjct: 291 LLCEYGFTISDN 302
>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
7435]
Length = 846
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 69/133 (51%), Gaps = 24/133 (18%)
Query: 10 EPFLQWLQVNKVELRG-CKIK-YSDESKGFGIFSSNEFS-DGVLLVVP----LDLAITPM 62
E + +WL+ N +EL G KI+ D+++G G+ ++ + + D VL + L++ + +
Sbjct: 10 EAYKKWLEAN-LELAGNVKIEDLRDQNEGRGVVATRDINEDEVLFSISKDNVLNIQTSSL 68
Query: 63 RVLQDPLIGPECRAMFEDGEVDDRF----LMILFLTVER-LRKNSSWKPYLDMLPTTFGN 117
++D +G + R +IL L ER L + S WK YLD LPT+F N
Sbjct: 69 ANVKD-----------NNGVILKRLNHWEGLILCLAYERSLGELSRWKGYLDTLPTSFNN 117
Query: 118 PLWFTDDELLELK 130
+++ + +L LK
Sbjct: 118 LIFWNEQDLNSLK 130
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 72/167 (43%), Gaps = 25/167 (14%)
Query: 9 LEPFLQWLQVNKVELRGCKIK--YSDES--KGFGIFSSNEFSD-----GVLLVVPLDLAI 59
+E W ++N + L G + YS+ KG I ++ E D LL VP DLA+
Sbjct: 8 IETLSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLAL 67
Query: 60 TPMRVLQDPLIGPECRAMFE---DGEVDDRFLMILFLTVE-------------RLRKNSS 103
T V I R + + D R +++FL ++ ++ ++
Sbjct: 68 TLEYVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNP 127
Query: 104 WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
W Y+ +P + P +++ +E L+GT+L A + + +L +D
Sbjct: 128 WTEYIRFMPASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFD 174
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 82/185 (44%), Gaps = 15/185 (8%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G+ ++ + G VL VP L + V L G R G + + L L
Sbjct: 87 GLGLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGR-----GGLRPWVAVALLLL 141
Query: 95 VERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDDKV 153
E R S W PYL +LP + ++++++ELLE++GT L T K+ + + ++
Sbjct: 142 REAARGAGSPWAPYLAILPRQTDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFESVE 201
Query: 154 KDLVKKLLVLDGDSERANSIFWTRALNIPLPHSYVFPQNQED----LNKYDSINNSAELS 209
+++ + L + N W + S VF + + D + D +N+S +++
Sbjct: 202 AEIISENRELFPGTVTFNDFLWA----FGILRSRVFAELRGDKLALIPFADLVNHSDDIT 257
Query: 210 NDHNS 214
+ +S
Sbjct: 258 SKESS 262
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 72/167 (43%), Gaps = 25/167 (14%)
Query: 9 LEPFLQWLQVNKVELRGCKIK--YSDES--KGFGIFSSNEFSD-----GVLLVVPLDLAI 59
+E W ++N + L G + YS+ KG I ++ E D LL VP DLA+
Sbjct: 8 IETLSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLAL 67
Query: 60 TPMRVLQDPLIGPECRAMFE---DGEVDDRFLMILFLTVE-------------RLRKNSS 103
T V I R + + D R +++FL ++ ++ ++
Sbjct: 68 TLEYVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNP 127
Query: 104 WKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
W Y+ +P + P +++ +E L+GT+L A + + +L +D
Sbjct: 128 WTEYIRFMPASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFD 174
>gi|353236313|emb|CCA68310.1| related to SET7-Regulatory protein [Piriformospora indica DSM
11827]
Length = 493
Score = 38.5 bits (88), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 7/125 (5%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEF-SDGVLLVVPLDLAITPMRVLQDPLI 70
FL+W + + L E G G + ++ D VL VP + ++ ++
Sbjct: 9 FLKWFRDSGATLHPAVGIKDFEGVGRGAVALHDIQKDTVLFTVPRSILLSTRTAPLRDIL 68
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLR-KNSSWKPYLDMLPTTFGNPLWFTDDELLEL 129
G E + + E +IL + E R K+S W YL LPT F +++TD+EL +L
Sbjct: 69 GDEDWSTLKGWEG-----LILSMMYEDSRVKDSPWSGYLQDLPTKFDTLMFWTDEELEQL 123
Query: 130 KGTTL 134
+ +T+
Sbjct: 124 QASTV 128
>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
Length = 459
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 70/342 (20%), Positives = 124/342 (36%), Gaps = 71/342 (20%)
Query: 8 KLEPFLQWLQVNKVELR-GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQ 66
KL+ FLQW +L + +G+ + E +G L A+ +
Sbjct: 21 KLDSFLQWCAKEDFQLNPKVHVGREGSCAQYGMVAQEELEEGECLFKVDKSAVLSTETTE 80
Query: 67 DPLIGPECRAMFEDGEVDDRFLM--ILFLTVERLRKNSSWKPYLDMLP--TTFGNPLWFT 122
+ E ++ D D + IL L E NS W+PYL ++P + P+++T
Sbjct: 81 IAHLLKEETSLHGDSLHGDSGWVPQILALMYEYTNPNSRWRPYLQLVPDFSQLDQPMFWT 140
Query: 123 DDEL-LELKGTTLYRATELQKQNLLTLYDDKVKDLVKKLLVLDGDSERANSIFWTRALNI 181
+DE+ +L T + A+ DL K L + + AL
Sbjct: 141 EDEIERDLCNTGIPEASS--------------SDLTKMKLE-----------YTSLALPF 175
Query: 182 PLPHSYVFPQNQEDLNKYDSINNSAELSNDHNSRGELINGLNDIKNEAQRVNSQVNGATS 241
H ++F E+++ ++ + S E +NG D G S
Sbjct: 176 IRKHRHIF---SEEVHSFELYKRMVAFIMAY-SFFEPVNGRED------------EGGKS 219
Query: 242 TLTSTQGETLWIEGLVPGIDFCNHDLK--AAATWEVDGTGLITGVPFSMYLLSVERSSFH 299
+L +VP D NH K A W+ D ++T +
Sbjct: 220 SLPL----------MVPMADILNHVAKNNAQLEWDADCLRMVT------------TRTVA 257
Query: 300 SEKEISISYGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEAI 341
+ +E+ ++G N +LL++YGF + Y + P + +
Sbjct: 258 AGEEVFNTFGQLANWQLLHMYGFAEAWPENIYDTVDIPMQVV 299
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 33/153 (21%), Positives = 65/153 (42%), Gaps = 16/153 (10%)
Query: 16 LQVNKVELR-----GCKIKYSDESKGFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLI 70
+++ +V++R K+ ES+G G+ + G +V + P + P
Sbjct: 61 IEMARVDIRPSTDTSASAKFLFESRGLGLVLNAPARRGEAIVT-----LPPRARFRVPAF 115
Query: 71 GPECRAMFE------DGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDD 124
R++ + D +D + L L ER R +S W+ +L MLP + L + D
Sbjct: 116 DSALRSLIDEFNEQHDNAIDPMTALALGLMYERSRADSPWRAWLRMLPDPIESMLEWNDV 175
Query: 125 ELLELKGTTLYRATELQKQNLLTLYDDKVKDLV 157
EL ++ + E + +NL +Y+ + +
Sbjct: 176 ELWPVEQLYVKELREERIRNLEAVYESVITPFI 208
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 64/137 (46%), Gaps = 25/137 (18%)
Query: 10 EPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEFSDGVLL-------VVPLDLAI 59
+ FL WL+ + E+ KIK D + G G+ +S + ++ LL ++ ++ +I
Sbjct: 9 QAFLAWLRQSGAEI-SPKIKLEDLRNKDAGRGVVASQDIAEHELLFRIPRASILSVENSI 67
Query: 60 --TPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGN 117
T + L+GP L+++ L S+W PY +LPT F
Sbjct: 68 LSTEIPAATLSLLGPWLS------------LILVMLYEYHNGSASNWAPYFAVLPTEFNT 115
Query: 118 PLWFTDDELLELKGTTL 134
+++T+DEL EL+ + +
Sbjct: 116 LMFWTEDELAELQASAV 132
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 4/100 (4%)
Query: 36 GFGIFSSNEFSDG-VLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLT 94
G G + + +G L +P L ++ R PL PE +E G+ +IL +
Sbjct: 30 GRGAVALGDIPEGHALFTLPRSLTLS-TRTSSLPLRIPEEWKKYELGK--GWVGLILCMM 86
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
E + +S W YL LP F P++++++EL EL T +
Sbjct: 87 WEEAQPDSLWAEYLASLPDVFSTPMFWSEEELSELNATAI 126
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 38.5 bits (88), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 30/47 (63%)
Query: 88 LMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
L++ + E +S W YL LP++F P++++ D+L ELKGT++
Sbjct: 81 LILCMMWEEAQGASSRWSTYLASLPSSFDTPMFWSPDDLEELKGTSV 127
>gi|384253193|gb|EIE26668.1| hypothetical protein COCSUDRAFT_59188 [Coccomyxa subellipsoidea
C-169]
Length = 610
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 308 YGNKGNEELLYLYGFVIDNNPDDYLMIHYPAEA 340
YG+K NEELL YGFV++ N DY + EA
Sbjct: 100 YGHKSNEELLLGYGFVLEPNTADYFSVSLGLEA 132
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 25/116 (21%), Positives = 54/116 (46%), Gaps = 2/116 (1%)
Query: 36 GFGIFSSNEFSDGVLLVVPLDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTV 95
G G+ + G ++ I + + D ++GP + E + + +
Sbjct: 56 GRGLMVPRKIKRGQTMIKMPQHMILSTKTVLDSVLGPYIESA-EPQLTTIQAITTFLIYQ 114
Query: 96 ERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYDD 151
+ + + S WKPYLD+LP + +P++F +++ L L +L + +KQ + Y++
Sbjct: 115 KHIGETSFWKPYLDILPNEYTHPVYFGEEDFLYLPH-SLRANIKAKKQECIKSYEE 169
>gi|342879010|gb|EGU80287.1| hypothetical protein FOXB_09214 [Fusarium oxysporum Fo5176]
Length = 530
Score = 38.1 bits (87), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 9/96 (9%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
+VP +D NH A ++ D + +P +S+ E++ISYG K
Sbjct: 251 AMVPVLDMANHSHSQTAYYDEDDEDNVVLLPRPGMEISIG-------DEVTISYGEKSPA 303
Query: 315 ELLYLYGFVIDNNPDDYLMIHYPAEAIHSIPLSDSK 350
E+++ YGF+ + + L + P E++ PL +K
Sbjct: 304 EMIFSYGFIDRESTVEGLTL--PLESLADDPLGKAK 337
>gi|194750092|ref|XP_001957464.1| GF10426 [Drosophila ananassae]
gi|190624746|gb|EDV40270.1| GF10426 [Drosophila ananassae]
Length = 449
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 6/78 (7%)
Query: 255 GLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLLSVERSSFHSEKEISISYGNKGNE 314
L P +D NH T E+ G + ++ L ++ H + + ISYG N
Sbjct: 236 ALAPFLDLFNHSDSVQTTAEIHGQNYVV----TLKDLPTPQTKLHDQ--LFISYGALPNF 289
Query: 315 ELLYLYGFVIDNNPDDYL 332
+LL YGF ++NN +DY
Sbjct: 290 KLLTEYGFWLENNANDYF 307
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 11/106 (10%)
Query: 232 VNSQVNGATSTLTSTQGETLWIEGLVPGIDFCNHDLKAAATWEVDGTGLITGVPFSMYLL 291
++ V STL + E L P D NH + A TW+ T P L+
Sbjct: 216 LSQSVKDYVSTLLPSAREIDDFSILQPLFDIANHSMTATYTWDT------TSDPNCCQLI 269
Query: 292 SVERSSFHSEKEISISYGNKGNEELLYLYGFVI---DNNPDDYLMI 334
+ S+ ++ +YG K N ELL YGF++ D +DY+ +
Sbjct: 270 C--QDSYRPGDQVFNNYGFKTNSELLLAYGFILPETDTLHNDYVHV 313
>gi|224101881|ref|XP_002312458.1| SET domain-containing protein [Populus trichocarpa]
gi|222852278|gb|EEE89825.1| SET domain-containing protein [Populus trichocarpa]
Length = 362
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 5/90 (5%)
Query: 91 LFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYRATELQKQNLLTLYD 150
L L ER R NS W PY+ LP T+ P++F +++ L+ L + + LL ++
Sbjct: 5 LKLLQERARVNSFWWPYISNLPRTYSVPIFFQGEDIKNLQYAPLLYQVNRRCRFLLD-FE 63
Query: 151 DKVKDLVKKLLVLDGDSER----ANSIFWT 176
+VK V+ L D +R A+S+ W+
Sbjct: 64 QEVKRAVENLKQNDHPFDRQDVDASSLGWS 93
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 18/153 (11%)
Query: 1 MEISTEAKLEP-FLQWLQVNKVELRGCKIKYS----DESKGFGIFSSNEFSDG-VLLVVP 54
+E T+A L+ F+ WL+ + G KI + + G +F+S G +L VP
Sbjct: 34 LEPQTQASLDKDFIPWLE----RIAGAKITNTLSIGKSTYGRSLFASKVIHAGDCMLKVP 89
Query: 55 LDLAITPMRVLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDML--P 112
++ ITP D L P+ R D + L + + ++ + S W PY+ L P
Sbjct: 90 FNVQITP-----DEL-SPDIRVSLTDEVGNIGKLAAVLIREKKKGQKSRWVPYISRLPQP 143
Query: 113 TTFGNPLWFTDDELLELKGTTLYRATELQKQNL 145
+ +++ +DE ++ + +++ T QK +
Sbjct: 144 AEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQI 176
>gi|255726968|ref|XP_002548410.1| predicted protein [Candida tropicalis MYA-3404]
gi|240134334|gb|EER33889.1| predicted protein [Candida tropicalis MYA-3404]
Length = 498
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 71/147 (48%), Gaps = 11/147 (7%)
Query: 8 KLEPFLQWLQVNKVELRGCKIKYSD---ESKGFGIFSSNEF-SDGVLLVVPLDLAITPMR 63
K E FL WL N V++ KI D ++G GI ++ + ++ L +P + I
Sbjct: 18 KTESFLNWLISNGVKV-SPKISIHDYRLSNQGRGIIANEDIEANEQLFTLPRSVLIN--- 73
Query: 64 VLQDPLIGPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTD 123
V+ + LI +D +I+ L+ E + + WK Y +++P F +++ +
Sbjct: 74 VVNNSLIEKYPNVQDNLKSMDQWLSLIIILSYE-FKFGNKWKNYFNIIPDEFNQLIYWKN 132
Query: 124 DELLELKGTTLYRATELQKQNLLTLYD 150
+EL +L+ + + + K+N L +Y+
Sbjct: 133 EELKDLEPSCI--LERIGKENNLNMYN 157
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 39/75 (52%), Gaps = 16/75 (21%)
Query: 102 SSWKPYLDMLPTTFGNPLWFTDDELLELKGTTLYR------ATELQKQNLLTLY------ 149
S+W PY +LPT F +++T+DEL EL+ + + A E+ + LL +
Sbjct: 100 SNWAPYFAVLPTDFDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLPVIEEFADV 159
Query: 150 ----DDKVKDLVKKL 160
D++ KDL K++
Sbjct: 160 IFSGDERAKDLAKEM 174
>gi|342319096|gb|EGU11047.1| SET-domain protein [Rhodotorula glutinis ATCC 204091]
Length = 356
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 29/40 (72%), Gaps = 1/40 (2%)
Query: 95 VERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELKGTTL 134
+E LR SSW+PY +LPT F + ++++ +E EL+G+T+
Sbjct: 44 IEYLR-TSSWEPYFSLLPTQFESLMFWSPEEFEELEGSTV 82
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 38.1 bits (87), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 31/139 (22%), Positives = 62/139 (44%), Gaps = 8/139 (5%)
Query: 12 FLQWLQVNKVELRGCKIKYSDESKGFGIFSSNEFSDGVLLV-VPLDLAITPMRVLQDPLI 70
F +WL +N + + + ++ G G+ ++++F G +V +P L + P R +
Sbjct: 21 FQKWLVLNNCSISSLVLAHFSDT-GRGLMATSDFQIGDPVVRIPARLLLVPRRTHKLFNN 79
Query: 71 GPECRAMFEDGEVDDRFLMILFLTVERLRKNSSWKPYLDMLPTTFGNPLWFTDDELLELK 130
P A+ + + LF+ +++ W PY+D+LP +F D +LL +
Sbjct: 80 HPAIVALKQHPSIA------LFIAWQKIHPTPEWSPYIDILPRSFDTMPLCIDLKLLAML 133
Query: 131 GTTLYRATELQKQNLLTLY 149
+ + Q+ L T Y
Sbjct: 134 PYDIQEIAKNQQSKLDTDY 152
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.135 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,839,488,094
Number of Sequences: 23463169
Number of extensions: 340441661
Number of successful extensions: 855660
Number of sequences better than 100.0: 753
Number of HSP's better than 100.0 without gapping: 344
Number of HSP's successfully gapped in prelim test: 409
Number of HSP's that attempted gapping in prelim test: 854341
Number of HSP's gapped (non-prelim): 1403
length of query: 480
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 334
effective length of database: 8,933,572,693
effective search space: 2983813279462
effective search space used: 2983813279462
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)