BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007127
(617 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q3ECY6|SDG41_ARATH Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana GN=SDG41 PE=2
SV=1
Length = 558
Score = 360 bits (925), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 233/631 (36%), Positives = 334/631 (52%), Gaps = 100/631 (15%)
Query: 1 MEMRASEEIRQGEDITPPLFPLTFAFHDSLLDGHCSSCFSPLPSC------CSS------ 48
ME+RA+E+I D+ PPL PL + +DS L HCSSCFS LP CS+
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQPLYCSAACSLTD 60
Query: 49 ------------LPLSSAELRAALHLLHSPLPTTSLPPPPRLFGLLTNRDKLMSSSDSDV 96
P+ +++R +LHLL+S TS P RL LLTN LM+ D +
Sbjct: 61 SFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTS-SSPHRLNNLLTNHHLLMA--DPSI 117
Query: 97 ASKIREGAREMARARGNLSDDVAWEEAALCLVMTNAVEVQDDKTGRILGIAVYDKDFSWI 156
+ I A +A + + EEAA+C V+TNAVEV D G LGIA+Y+ FSWI
Sbjct: 118 SVAIHHAANFIATVIRSNRKNTELEEAAICAVLTNAVEVHDSN-GLALGIALYNSSFSWI 176
Query: 157 NHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEAETPGKSDVCISCELKEGSK 216
NHSCSPN+CYRF N + HV T + + VC L G+
Sbjct: 177 NHSCSPNSCYRFV----------NNRTSYHDVHVTNTETSSNLELQEQVC-GTSLNSGNG 225
Query: 217 RHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVD 276
+GP++IVRSIK I GEE+TV+Y DLLQP G+RQS+LWSKY+F+C+C RC+ASPP+YVD
Sbjct: 226 -NGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCGRCAASPPAYVD 284
Query: 277 MALEETFSSNPEFLSLSS-DYNFLKDEANQKLTDWMDEGTSEYLLVG-DPESCCQKLENI 334
LE + E ++ D + KDEA K+ D++ E ++L DP++CC+ +E++
Sbjct: 285 SILEGVLTLESEKTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNIDPKTCCEMIESV 344
Query: 335 LTQGLQGELLESEKVKIQLNLRLHPLHHLSLNAYTTLASAYKIRSIDLLALNSDIDGQQL 394
L G+Q + E + LRLH H+++LNAY TLA+AY+IRS ID +
Sbjct: 345 LHHGIQFK--EDSQPHC---LRLHACHYVALNAYITLATAYRIRS---------IDSETG 390
Query: 395 EAFDMSRTSAAYSLLLASTTDHLFRSESSLIAASANFWASAGESLLTLARSPGWNLFVKP 454
DMSR SAAYSL LA + HLF +E S ++A FW +AGE L LA P
Sbjct: 391 IVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGELLFDLA----------P 440
Query: 455 ELPISTSSPEIHECSKCSLVDRLQVNPFLSQSRNADFQIICNEFLACITNMTRKVWGFLT 514
+L + S +C+KC +++ + + + D + + L+C+ ++++ W FLT
Sbjct: 441 KLLMELSVESDVKCTKCLMLE--------TSNSHRDIKEKSRQILSCVRDISQVTWSFLT 492
Query: 515 HGCGYLQMLKDPIDFSWLRQSSNLCHTPCCSDEESNKETGYQESICRRVMQRCDGEERIT 574
GC YL+ + P+DFS R + EES+K ++ +
Sbjct: 493 RGCPYLEKFRSPVDFSLTRTNGE--------REESSK------------------DQTVN 526
Query: 575 IFQLGVHCIAYGGYLANICYGPNSHWPCKIK 605
+ L HC+ Y L ++CYG SH + +
Sbjct: 527 VLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557
>sp|Q5RGL7|SMY2B_DANRE N-lysine methyltransferase SMYD2-B OS=Danio rerio GN=smyd2b PE=2
SV=2
Length = 434
Score = 49.7 bits (117), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 58/154 (37%), Gaps = 51/154 (33%)
Query: 128 VMTNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIA 187
V N V+D++ LG+A++ D + +NHSCSPN
Sbjct: 180 VHCNGFTVEDEELSN-LGLAIF-PDIALLNHSCSPNVI---------------------- 215
Query: 188 PHVVFDSTEAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPK 247
V + AE VR++K I+ G+E+ +Y DLL P
Sbjct: 216 --VTYRGINAE-------------------------VRAVKDISPGQEIYTSYIDLLYPT 248
Query: 248 GMRQSELWSKYQFVCHCRRCSASPPSYVDMALEE 281
R L Y F C C+ C+ V M++ +
Sbjct: 249 ADRLERLRDMYYFSCDCKECTTKSMDVVKMSVRK 282
>sp|Q54R14|Y3443_DICDI SET domain-containing protein DDB_G0283443 OS=Dictyostelium
discoideum GN=DDB_G0283443 PE=3 SV=1
Length = 393
Score = 49.3 bits (116), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 36/61 (59%)
Query: 222 IIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEE 281
I + S+ PI KG+E++++Y D+ K R L Y F C C+RC+ P S + + +E+
Sbjct: 293 ISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFECKCKRCTLPPLSNLSLEIEK 352
Query: 282 T 282
T
Sbjct: 353 T 353
>sp|Q557F7|Y3589_DICDI SET and MYND domain-containing protein DDB_G0273589
OS=Dictyostelium discoideum GN=DDB_G0273589 PE=3 SV=1
Length = 386
Score = 48.9 bits (115), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%)
Query: 217 RHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVD 276
R G + +S+ PI KG+++T++Y +L QP R+ EL Y F C C RC+ S
Sbjct: 270 RDGSNMTFKSLYPIKKGDQLTISYIELDQPIQDRKDELKYGYYFDCICPRCNGDSNSIDS 329
Query: 277 M 277
M
Sbjct: 330 M 330
>sp|Q9CWR2|SMYD3_MOUSE Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus GN=Smyd3
PE=2 SV=1
Length = 428
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 32/50 (64%)
Query: 218 HGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+GP +++R+++ I GEE+T+ Y D+L R+ +L +Y F C C RC
Sbjct: 217 NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC 266
>sp|Q9H7B4|SMYD3_HUMAN Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens GN=SMYD3
PE=1 SV=4
Length = 428
Score = 47.8 bits (112), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 32/53 (60%)
Query: 218 HGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSAS 270
+GP +++R+++ I GEE+T+ Y D+L R+ +L +Y F C C RC
Sbjct: 217 NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ 269
>sp|Q54D67|Y2454_DICDI SET and MYND domain-containing protein DDB_G0292454
OS=Dictyostelium discoideum GN=DDB_G0292454 PE=3 SV=1
Length = 343
Score = 47.0 bits (110), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 29/41 (70%)
Query: 227 IKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+KPIN G+E+T++YTD + R+S+L+ Y F C C++C
Sbjct: 293 LKPINPGDEITISYTDTTKDLVDRRSQLFENYGFNCECKKC 333
>sp|Q5BJI7|SMY2A_DANRE N-lysine methyltransferase SMYD2-A OS=Danio rerio GN=smyd2a PE=2
SV=1
Length = 435
Score = 46.6 bits (109), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 75/197 (38%), Gaps = 57/197 (28%)
Query: 92 SDSDVASKIREGAREMARARGNLSDDVAWEEAALCLVMTNAVEVQDDKTGRILGIAVYDK 151
+D+D+A+ +R + + D+ A E + V N ++D++ LG A++
Sbjct: 149 NDTDIAALHHFYSRHL-----DFPDNAALTEL-IAQVNCNGFTIEDEELSH-LGSALF-P 200
Query: 152 DFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEAETPGKSDVCISCEL 211
D + +NHSCSPN V + T AE
Sbjct: 201 DVALMNHSCSPNVI------------------------VTYKGTVAE------------- 223
Query: 212 KEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASP 271
VR+++ IN EE+ +Y DLL P R L Y F C C+ C++
Sbjct: 224 ------------VRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTSKS 271
Query: 272 PSYVDMALEETFSSNPE 288
M + + S PE
Sbjct: 272 KDEAKMEIRQKLSIPPE 288
>sp|Q7ZXV5|SMY2A_XENLA N-lysine methyltransferase SMYD2-A OS=Xenopus laevis GN=smyd2-a
PE=2 SV=1
Length = 430
Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 52/141 (36%), Gaps = 51/141 (36%)
Query: 128 VMTNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIA 187
V N ++D++ LG A++ D + +NHSC PN
Sbjct: 177 VNCNGFTIEDEELSH-LGSAIF-PDVALMNHSCCPNII---------------------- 212
Query: 188 PHVVFDSTEAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPK 247
V F T AE +R+++ I+ G+EV +Y DLL P
Sbjct: 213 --VTFKGTVAE-------------------------IRAVQEIHAGDEVFTSYIDLLYPT 245
Query: 248 GMRQSELWSKYQFVCHCRRCS 268
R L Y F C CR CS
Sbjct: 246 EDRNDRLMDSYFFTCDCRECS 266
>sp|Q6GN68|SMY2B_XENLA N-lysine methyltransferase SMYD2-B OS=Xenopus laevis GN=smyd2-b
PE=2 SV=1
Length = 430
Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 55/152 (36%), Gaps = 51/152 (33%)
Query: 117 DVAWEEAALCLVMTNAVEVQDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAP 176
D A E V N ++D++ LG A++ D + +NHSC PN
Sbjct: 166 DNAALEFLFAQVNCNGFTIEDEELSH-LGSAIF-PDVALMNHSCCPNVI----------- 212
Query: 177 SFRNEKKMRIAPHVVFDSTEAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEV 236
V + T AE VR+++ I+ GEEV
Sbjct: 213 -------------VTYKGTVAE-------------------------VRAVQEIHAGEEV 234
Query: 237 TVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
+Y DLL P R L Y F C CR CS
Sbjct: 235 FTSYIDLLYPTEDRNDRLKDSYFFSCDCRECS 266
>sp|E1C5V0|SMYD2_CHICK N-lysine methyltransferase SMYD2 OS=Gallus gallus GN=SMYD2 PE=3
SV=1
Length = 436
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 25/45 (55%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
VR++K I GEEV +Y DLL P R L Y F C CR C+
Sbjct: 227 VRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECT 271
>sp|O94256|SET6_SCHPO SET domain and MYND-type zinc finger protein 6
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=set6 PE=4 SV=1
Length = 483
Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%)
Query: 219 GPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
G + + S + I K E++ ++Y D+ PK +RQ +L KY F C+C RC
Sbjct: 206 GAIVQLVSKRDIKKDEQLFISYIDIRLPKSIRQKQLLKKYFFSCYCPRC 254
>sp|Q6CX91|SET5_KLULA Potential protein lysine methyltransferase SET5 OS=Kluyveromyces
lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC
1267 / NRRL Y-1140 / WM37) GN=SET5 PE=3 SV=1
Length = 492
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 217 RHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+HG ++ R K I KGE++ + Y + L +R+ EL Y F+CHC RC
Sbjct: 362 KHGIKLYAR--KDIKKGEQLRLTYVNPLHGVTLRRRELRVNYGFLCHCPRC 410
>sp|Q7M6Z3|SMYD2_RAT N-lysine methyltransferase SMYD2 OS=Rattus norvegicus GN=Smyd2 PE=2
SV=1
Length = 433
Score = 43.9 bits (102), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEETF 283
VR+++ I+ G+EV +Y DLL P R L Y F C CR C+ + + +
Sbjct: 224 VRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKDKDKAKVEIRKL- 282
Query: 284 SSNP 287
SNP
Sbjct: 283 -SNP 285
>sp|Q8R5A0|SMYD2_MOUSE N-lysine methyltransferase SMYD2 OS=Mus musculus GN=Smyd2 PE=1 SV=1
Length = 433
Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
VR+++ I+ G+EV +Y DLL P R L Y F C CR C+
Sbjct: 224 VRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT 268
>sp|Q0P585|SMYD2_BOVIN N-lysine methyltransferase SMYD2 OS=Bos taurus GN=SMYD2 PE=2 SV=1
Length = 433
Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSAS 270
VR+++ I+ GEEV +Y DLL P R L Y F C C+ C+
Sbjct: 224 VRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTK 270
>sp|C3RZA1|SMYD2_PIG N-lysine methyltransferase SMYD2 OS=Sus scrofa GN=SMYD2 PE=2 SV=1
Length = 433
Score = 43.5 bits (101), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
VR+++ I+ GEEV +Y DLL P R L Y F C C+ C+
Sbjct: 224 VRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>sp|P97443|SMYD1_MOUSE SET and MYND domain-containing protein 1 OS=Mus musculus GN=Smyd1
PE=1 SV=3
Length = 490
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 34/61 (55%)
Query: 221 RIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVDMALE 280
RI +R++ I++GEE+TV+Y D L R+ +L +Y F C C C + +A +
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAAK 292
Query: 281 E 281
E
Sbjct: 293 E 293
>sp|Q9NRG4|SMYD2_HUMAN N-lysine methyltransferase SMYD2 OS=Homo sapiens GN=SMYD2 PE=1 SV=2
Length = 433
Score = 43.1 bits (100), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
VR+++ I GEEV +Y DLL P R L Y F C C+ C+
Sbjct: 224 VRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>sp|Q8NB12|SMYD1_HUMAN SET and MYND domain-containing protein 1 OS=Homo sapiens GN=SMYD1
PE=2 SV=1
Length = 490
Score = 42.7 bits (99), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%)
Query: 221 RIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
RI +R++ I++GEE+TV+Y D L R+ +L +Y F C C C
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280
>sp|Q5UNT8|YL678_MIMIV Putative SET domain-containing protein L678 OS=Acanthamoeba
polyphaga mimivirus GN=MIMI_L678 PE=4 SV=1
Length = 255
Score = 42.0 bits (97), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 226 SIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSAS 270
+++ I GEE+T Y D++ R++ L+++Y F C C RC S
Sbjct: 161 TVRNIKTGEELTDNYVDIMSNTKTRKNRLFNQYGFDCQCERCIGS 205
>sp|Q557F6|Y3591_DICDI SET and MYND domain-containing protein DDB_G0273591
OS=Dictyostelium discoideum GN=DDB_G0273591 PE=3 SV=1
Length = 413
Score = 41.2 bits (95), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 37/74 (50%)
Query: 197 AETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWS 256
A +P S SC S R G + +S+ PI KG+++ ++Y L + R+ L
Sbjct: 267 AVSPSSSYFNHSCIPNCESVRDGSDMTFKSLFPIKKGDQINISYLALDKSTKRRRDYLKF 326
Query: 257 KYQFVCHCRRCSAS 270
Y F C C RC+++
Sbjct: 327 GYYFHCQCPRCNST 340
>sp|Q9ZUM9|ASHR2_ARATH Histone-lysine N-methyltransferase ASHR2 OS=Arabidopsis thaliana
GN=ASHR2 PE=2 SV=3
Length = 398
Score = 40.8 bits (94), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 44/125 (35%), Gaps = 43/125 (34%)
Query: 143 ILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEAETPGK 202
+ +Y K S+ NH C PNAC FD ++ + G
Sbjct: 215 VRAYGIYPKT-SFFNHDCLPNACR-------------------------FDYVDSASDGN 248
Query: 203 SDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVC 262
+D II+R I + +G EV ++Y + RQ L Y F C
Sbjct: 249 TD-----------------IIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKC 291
Query: 263 HCRRC 267
C RC
Sbjct: 292 DCDRC 296
>sp|Q5PP37|ATXR2_ARATH Histone-lysine N-methyltransferase ATXR2 OS=Arabidopsis thaliana
GN=ATXR2 PE=2 SV=1
Length = 473
Score = 40.4 bits (93), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 211 LKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
K R G +I+ +++ I+K EEVT++Y D P RQ+ L + Y F C C +C
Sbjct: 412 FKREEDRDGQAVII-ALRRISKNEEVTISYIDEELPYKERQA-LLADYGFSCKCSKC 466
>sp|Q75BF1|SET5_ASHGO Potential protein lysine methyltransferase SET5 OS=Ashbya gossypii
(strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL
Y-1056) GN=SET5 PE=3 SV=2
Length = 488
Score = 40.0 bits (92), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 208 SCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
SCE + G I V + K I EE+TV+Y + L +R+ EL + F+C C RC
Sbjct: 346 SCEPNIYYELEGHHINVYARKEIKSDEELTVSYVNPLHDVDLRRRELRVNWGFLCLCDRC 405
Query: 268 SAS-PPSYVDMALEETFSSNPE 288
+D A + ++ PE
Sbjct: 406 KREISKKNIDKAGHSSTTAGPE 427
>sp|Q54DL6|Y2140_DICDI SET and MYND domain-containing protein DDB_G0292140
OS=Dictyostelium discoideum GN=DDB_G0292140 PE=3 SV=1
Length = 521
Score = 39.3 bits (90), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 51/134 (38%), Gaps = 48/134 (35%)
Query: 136 QDDKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDST 195
+D +G G VY ++ S+ NHSC PN Y VV ++
Sbjct: 349 RDGSSGESRGCGVYVRN-SFFNHSCDPNVNY----------------------WVVNNTL 385
Query: 196 EAETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELW 255
E E C L +K + +G+E+T++Y D P R+ +L
Sbjct: 386 EVE----------CTL---------------LKNVKEGDELTISYIDTTSPLNKRREKLL 420
Query: 256 SKYQFVCHCRRCSA 269
Y F C C +C A
Sbjct: 421 EGYLFNCLCTKCVA 434
>sp|A7TPV3|SET5_VANPO Potential protein lysine methyltransferase SET5 OS=Vanderwaltozyma
polyspora (strain ATCC 22028 / DSM 70294) GN=SET5 PE=3
SV=1
Length = 499
Score = 38.9 bits (89), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 26/46 (56%)
Query: 222 IIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
I V + K I+ G+E+ Y + L +R+ EL Y F+CHC RC
Sbjct: 367 IRVYARKNISAGQELLTNYINPLHGVKLRRRELRVNYGFLCHCDRC 412
>sp|Q7XJS0|ASHR1_ARATH Histone-lysine N-methyltransferase ASHR1 OS=Arabidopsis thaliana
GN=ASHR1 PE=2 SV=2
Length = 480
Score = 38.5 bits (88), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 26/46 (56%)
Query: 223 IVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
+VR++ I+K E+T++Y + RQ L +Y F C C RCS
Sbjct: 230 VVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCS 275
>sp|Q8EWX1|RPOB_MYCPE DNA-directed RNA polymerase subunit beta OS=Mycoplasma penetrans
(strain HF-2) GN=rpoB PE=3 SV=1
Length = 1375
Score = 38.1 bits (87), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 44/116 (37%), Gaps = 12/116 (10%)
Query: 411 ASTTDHLFRSESSLIAASANFWASAGESLLT--LARSPGWNLFVKPELPISTSSPEIHEC 468
AS TD +F + +I F + E + + RSPG + K ++ +S S I E
Sbjct: 120 ASNTDGIFFASIPMITEKGTFIVNGIEKFVISQIVRSPGAYILTKSQIKLSNSRKRIQEG 179
Query: 469 SKCSLVDRLQVNPFLSQSRNADFQIICNEFLACITNMTRKVWGFLTHGCGYLQMLK 524
C ++ F+ + N DF I M R G TH LK
Sbjct: 180 YICEILPAKGTLFFIFMAENKDF----------IQVMFRDAIGESTHTIPITAFLK 225
>sp|O74467|SET5_SCHPO SET domain-containing protein 5 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=set5 PE=1 SV=1
Length = 319
Score = 38.1 bits (87), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 25/49 (51%)
Query: 221 RIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSA 269
++ V +++ I GEE+ Y DL + RQ L + F C+C CS
Sbjct: 121 QVTVHAVRDIEAGEEILTTYIDLHKSHTERQKILLEHFGFKCYCSVCSV 169
>sp|A5DQN2|SET5_PICGU Potential protein lysine methyltransferase SET5 OS=Meyerozyma
guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 /
JCM 1539 / NBRC 10279 / NRRL Y-324) GN=SET5 PE=3 SV=2
Length = 483
Score = 37.7 bits (86), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 217 RHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
R GP + V + + I GEE+T +Y + RQ EL + F+C C+RC
Sbjct: 366 RTGP-LKVFAARDIKAGEELTTSYVNPSHTLHQRQRELRVNWGFICSCQRC 415
>sp|P38890|SET5_YEAST Putative protein lysine methyltransferase SET5 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=SET5 PE=1
SV=1
Length = 526
Score = 36.2 bits (82), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 60/133 (45%), Gaps = 17/133 (12%)
Query: 228 KPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEETFSSNP 287
KPI KGE++ + Y + L +R+ EL + F+C C RC TF P
Sbjct: 390 KPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQNELS---------TFERVP 440
Query: 288 EFLSLSSDYNFLKDEANQKLTDWMDEGTSEYLLVGDPESCCQK----LENILTQGLQGEL 343
++D N ++ + D ++G+ + G+ +S ++ L+ IL G + EL
Sbjct: 441 NLEKKNADANLGVEKIDS--NDSSEDGSKKS--TGNRKSSMREAQPDLKEILKNGKEFEL 496
Query: 344 LESEKVKIQLNLR 356
E V Q N+R
Sbjct: 497 DIPETVDTQGNVR 509
>sp|A6ZTB4|SET5_YEAS7 Potential protein lysine methyltransferase SET5 OS=Saccharomyces
cerevisiae (strain YJM789) GN=SET5 PE=3 SV=1
Length = 526
Score = 35.8 bits (81), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 60/133 (45%), Gaps = 17/133 (12%)
Query: 228 KPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSASPPSYVDMALEETFSSNP 287
KPI KGE++ + Y + L +R+ EL + F+C C RC TF P
Sbjct: 390 KPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQNELS---------TFERVP 440
Query: 288 EFLSLSSDYNFLKDEANQKLTDWMDEGTSEYLLVGDPESCCQK----LENILTQGLQGEL 343
++D N ++ + D ++G+ + G+ +S ++ L+ IL G + EL
Sbjct: 441 NLEKKNADANLGVEKIDS--NDNSEDGSKKS--TGNRKSSMREAQPDLKEILKNGKEFEL 496
Query: 344 LESEKVKIQLNLR 356
E V Q N+R
Sbjct: 497 DIPETVDTQGNVR 509
>sp|P0CR42|SET5_CRYNJ Potential protein lysine methyltransferase SET5 OS=Cryptococcus
neoformans var. neoformans serotype D (strain JEC21 /
ATCC MYA-565) GN=SET5 PE=3 SV=1
Length = 449
Score = 35.8 bits (81), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 23/112 (20%)
Query: 156 INHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEAETPGKSDVCISCELKEGS 215
+NHSC PN R +L + P ++ + + P + +
Sbjct: 348 MNHSCEPNIQVR-NLPKSYTPPTQDTLPVNLPPPI----------------------QAG 384
Query: 216 KRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
R ++ + + I GEE+T++Y ++ + R+ L Y F C C RC
Sbjct: 385 DRVSNKLTILARHEIQPGEELTISYVNMKMSRDERRQALREGYGFWCACDRC 436
>sp|Q6FTT0|SET5_CANGA Potential protein lysine methyltransferase SET5 OS=Candida glabrata
(strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 /
NRRL Y-65) GN=SET5 PE=3 SV=1
Length = 515
Score = 35.8 bits (81), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCS 268
V + K I K EE+ Y + L +R+ EL + F+C+C RC+
Sbjct: 369 VYARKFIKKDEELVTTYVNPLHGVSLRRRELRVNWGFICNCDRCA 413
>sp|Q54Q80|Y4059_DICDI SET and MYND domain-containing protein DDB_G0284059
OS=Dictyostelium discoideum GN=DDB_G0284059 PE=3 SV=1
Length = 1280
Score = 35.8 bits (81), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 208 SCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYT--DLLQPKGMRQSELWSKYQFVCHCR 265
SC+ + G + ++S+ I KGEE+ Y L P R L++++ FVC C+
Sbjct: 932 SCDNNTHLQYDGCSLTIKSLFNIEKGEEILGCYGPHAFLNPLKDRLINLYNEFFFVCRCK 991
Query: 266 RCS 268
CS
Sbjct: 992 ACS 994
>sp|P0CR43|SET5_CRYNB Potential protein lysine methyltransferase SET5 OS=Cryptococcus
neoformans var. neoformans serotype D (strain B-3501A)
GN=SET5 PE=3 SV=1
Length = 449
Score = 35.8 bits (81), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 23/112 (20%)
Query: 156 INHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEAETPGKSDVCISCELKEGS 215
+NHSC PN R +L + P ++ + + P + +
Sbjct: 348 MNHSCEPNIQVR-NLPKSYTPPTQDTLPVNLPPPI----------------------QAG 384
Query: 216 KRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
R ++ + + I GEE+T++Y ++ + R+ L Y F C C RC
Sbjct: 385 DRVSNKLTILARHEIQPGEELTISYVNMKMSRDERRQALREGYGFWCACDRC 436
>sp|O46040|MSTAA_DROME Protein msta, isoform A OS=Drosophila melanogaster GN=msta PE=2
SV=3
Length = 462
Score = 35.0 bits (79), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 24/50 (48%)
Query: 218 HGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+G +VR+ + I KG E+T YT +L R L F C C RC
Sbjct: 264 NGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRC 313
>sp|Q9FG08|ATXR4_ARATH Histone-lysine N-methyltransferase ATXR4 OS=Arabidopsis thaliana
GN=ATXR4 PE=2 SV=2
Length = 325
Score = 35.0 bits (79), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 218 HGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRCSAS 270
H + +++ + +GEE+ + Y D RQ+ L + F+C+C RC ++
Sbjct: 272 HNADARLNTLRDVEEGEELRICYIDASMGYEARQTILSQGFGFLCNCLRCQST 324
>sp|P83501|MSTAB_DROME Protein msta, isoform B OS=Drosophila melanogaster GN=msta PE=4
SV=2
Length = 448
Score = 35.0 bits (79), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/190 (19%), Positives = 59/190 (31%), Gaps = 56/190 (29%)
Query: 78 LFGLLTNRDKLMSSSDSDVASKIREGAREMARARGNLSDDVAWEEAALCLVMTNAVEVQD 137
L+ + N D+ A RE D + + +C TNA E +
Sbjct: 166 LYAMQANMDRYYMQEVQRAADCFEHFPRE--------QDMLDYFYRTICAFNTNAFESRS 217
Query: 138 DKTGRILGIAVYDKDFSWINHSCSPNACYRFSLSEPNAPSFRNEKKMRIAPHVVFDSTEA 197
+ G + + +NH C+PNA + F
Sbjct: 218 NVDGHEVLVRALFPLAGLLNHQCTPNAAHHFE---------------------------- 249
Query: 198 ETPGKSDVCISCELKEGSKRHGPRIIVRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSK 257
+G I+V + + I G E+T++Y LL R+ L
Sbjct: 250 --------------------NGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLGMT 289
Query: 258 YQFVCHCRRC 267
F+C C RC
Sbjct: 290 KHFICKCVRC 299
>sp|A3M0J3|SET5_PICST Potential protein lysine methyltransferase SET5 OS=Scheffersomyces
stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 /
NRRL Y-11545) GN=SET5 PE=3 SV=2
Length = 478
Score = 35.0 bits (79), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/44 (34%), Positives = 23/44 (52%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
V + + I GEE+T Y + RQ EL + F+C C++C
Sbjct: 370 VYAARDIRAGEELTTTYVNPAHTVQQRQRELRVNWGFICGCQKC 413
>sp|Q6M0E6|G3P_METMP Glyceraldehyde-3-phosphate dehydrogenase OS=Methanococcus
maripaludis (strain S2 / LL) GN=gap PE=3 SV=1
Length = 340
Score = 35.0 bits (79), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 21 PLTFAFHDSLLDGHCSSCFSPLPSCCSSLPLSSAELRAALHLLHSPLPTTSLPPPPRLFG 80
P+T H H S +P + S+ + L +HS + TS +
Sbjct: 188 PVTVPSH------HGPDVVSVIPELDGKIMTSAVIVPTTLMHMHSIMVETSGTNRDEIID 241
Query: 81 LLTNRDKLMSSSDS---DVASKIREGAREMARARGNLSDDVAWEEAA 124
L ++++ S D +KI E AR++ R+R +L++ WEE+
Sbjct: 242 ALAKTPRILTVKASEGFDSTAKIIEYARDLGRSRYDLNEIAVWEESV 288
>sp|A9A6M9|G3P_METM6 Glyceraldehyde-3-phosphate dehydrogenase OS=Methanococcus
maripaludis (strain C6 / ATCC BAA-1332) GN=gap PE=3 SV=1
Length = 340
Score = 33.9 bits (76), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 3/94 (3%)
Query: 34 HCSSCFSPLPSCCSSLPLSSAELRAALHLLHSPLPTTSLPPPPRLFGLLTNRDKLMS--- 90
H S +P + S+ + L +HS + TS + L ++++
Sbjct: 195 HGPDVVSVIPELDGKIMTSAVIVPTTLMHMHSIMVETSGTNRDEIIDALAKTPRILTLKA 254
Query: 91 SSDSDVASKIREGAREMARARGNLSDDVAWEEAA 124
S D +KI E AR++ R+R +L++ WEE+
Sbjct: 255 SEGFDSTAKIIEYARDLGRSRYDLNEIAVWEESV 288
>sp|Q09415|SET14_CAEEL SET domain-containing protein 14 OS=Caenorhabditis elegans
GN=set-14 PE=4 SV=2
Length = 429
Score = 33.9 bits (76), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 239 AYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+Y D L P+ MR+ L KY+F+C C C
Sbjct: 212 SYIDELMPRDMRRDTLKKKYKFLCQCDGC 240
>sp|O00507|USP9Y_HUMAN Probable ubiquitin carboxyl-terminal hydrolase FAF-Y OS=Homo sapiens
GN=USP9Y PE=1 SV=2
Length = 2555
Score = 33.9 bits (76), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 22 LTFAFHDSLLDGHCSSCF-SPLPS--CCSSLPLSSAELRAALHLL 63
+ F H SL DG C S F SP PS C +L LS LRA L+LL
Sbjct: 2121 IVFIAHFSLQDGSCPSPFASPGPSSQACDNLSLSDHLLRATLNLL 2165
>sp|A4S1K1|ARGJ_OSTLU Arginine biosynthesis bifunctional protein ArgJ, chloroplastic
OS=Ostreococcus lucimarinus (strain CCE9901)
GN=OSTLU_33128 PE=3 SV=1
Length = 461
Score = 33.9 bits (76), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 22/128 (17%)
Query: 113 NLSDDVAWE-EAALCLVMTNAVEVQDDKTGRILG----------IAVYDKDFSWINHSCS 161
L+ +AW+ E A CL+ N DD+ R++ A++ D +W +C+
Sbjct: 316 GLAKAIAWDGEGATCLIECNVSGAADDEDARVIARSVVCSSLAKAAIFGHDPNWGRLACA 375
Query: 162 PNACYRFSLSEPNAPSF-RNEKKMRIAPHVVFDSTEAETPGKSD-VCISCELKEGSKRHG 219
+ P F +N+ K+ + PH + D + P D V S LKE + HG
Sbjct: 376 AG------YAAPVKNRFDQNDLKLSLGPHQLMDKGQ---PLDFDAVAASRYLKEVTGVHG 426
Query: 220 PRIIVRSI 227
++ S+
Sbjct: 427 TCVVDISV 434
>sp|Q6GMV2|SMYD5_HUMAN SET and MYND domain-containing protein 5 OS=Homo sapiens GN=SMYD5
PE=1 SV=2
Length = 418
Score = 33.5 bits (75), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSE---LWSKYQFVCHCRRCSA 269
V +++ I GEE+ ++Y D Q + R S L Y FVC C +C A
Sbjct: 335 VTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCLA 383
>sp|Q6BSV3|SET5_DEBHA Potential protein lysine methyltransferase SET5 OS=Debaryomyces
hansenii (strain ATCC 36239 / CBS 767 / JCM 1990 / NBRC
0083 / IGC 2968) GN=SET5 PE=3 SV=2
Length = 493
Score = 33.1 bits (74), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 15/44 (34%), Positives = 23/44 (52%)
Query: 224 VRSIKPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
V + + I GEE+T Y + RQ EL + F+C C++C
Sbjct: 383 VFAARDIRAGEELTTTYVNPAYTVQQRQRELRVNWGFMCGCQKC 426
>sp|Q5A1M3|SET5_CANAL Potential protein lysine methyltransferase SET5 OS=Candida albicans
(strain SC5314 / ATCC MYA-2876) GN=SET5 PE=3 SV=1
Length = 473
Score = 32.3 bits (72), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 14/40 (35%), Positives = 20/40 (50%)
Query: 228 KPINKGEEVTVAYTDLLQPKGMRQSELWSKYQFVCHCRRC 267
+ I GEE+T Y + RQ EL + F+C C +C
Sbjct: 368 RDIKSGEELTTTYVNPSHTVHQRQRELRVNWGFICACAKC 407
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.132 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 230,365,520
Number of Sequences: 539616
Number of extensions: 9635786
Number of successful extensions: 24647
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 24538
Number of HSP's gapped (non-prelim): 108
length of query: 617
length of database: 191,569,459
effective HSP length: 124
effective length of query: 493
effective length of database: 124,657,075
effective search space: 61455937975
effective search space used: 61455937975
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)