BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007176
(614 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6NQ79|ARID4_ARATH AT-rich interactive domain-containing protein 4 OS=Arabidopsis
thaliana GN=ARID4 PE=2 SV=1
Length = 747
Score = 948 bits (2450), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/613 (75%), Positives = 527/613 (85%), Gaps = 4/613 (0%)
Query: 2 QGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRLYCVRNNIVMAS 61
+GV YVIYWK+ FS YAACHF +L SV+QSSCS TWD F +A ASFRLYC +N V+ S
Sbjct: 133 KGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFRLYCTSDNAVLPS 192
Query: 62 NS-QKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPENLPAIKIYDDDVTMRFLVCGVP 120
NS +K + ++GP LLG+PPKID+ E D EENS E+LP+IKIYD+DVT+RFL+CG P
Sbjct: 193 NSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDEDVTVRFLLCGPP 252
Query: 121 CTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGVVTMRCDLSTCSS 180
CT+DT LLG L DGLNALL IE+RGSKLHNR+SAP PPLQAG F+RGVVTMRCD+STCSS
Sbjct: 253 CTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGVVTMRCDVSTCSS 312
Query: 181 AHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPPSEPRKSASIACG 240
AHIS+LVSG+AQTCF+DQLLENHIK+E++E QLVH++ NS + + SEPR+SASIACG
Sbjct: 313 AHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRGFSEPRRSASIACG 372
Query: 241 ASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEKDDAERLLFFCTR 300
ASV EVSM+V TWA QVLRQLAPDVSYRSLV+LG+ASIQGLSVASFEKDDAERLLFFC +
Sbjct: 373 ASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEKDDAERLLFFCGQ 432
Query: 301 QGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESENVCNVRPKLNSAAMRPIPH 360
Q +++L++ P+WLT P P+RKRSEPCRESK E EN K+N AA+RPIPH
Sbjct: 433 QINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESK--EIENGGPTSRKINVAALRPIPH 490
Query: 361 TRHYKMLPFSGFSEIERYDGDQVKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISL 420
TR +KM+PFSG+SEI R+DGD K +LP+ P KH ++G TPVTHRK+ S SYQ +QIISL
Sbjct: 491 TRRHKMIPFSGYSEIGRFDGDHTKGSLPMPP-KHGASGGTPVTHRKAFSGSYQRKQIISL 549
Query: 421 NPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 480
NPLPLKKH CGRA IQVCSEEEFLRDVMQFL++RGHTRLVP GGLAEFPDA+LN+KRLDL
Sbjct: 550 NPLPLKKHDCGRAHIQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDL 609
Query: 481 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE AH
Sbjct: 610 FNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAH 669
Query: 541 DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
DDVDGECCL+C SS AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP CSV
Sbjct: 670 DDVDGECCLICRSSTAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSV 729
Query: 601 TNFKKKSQKTSNG 613
+N++KKSQKTSNG
Sbjct: 730 SNYRKKSQKTSNG 742
>sp|C6KTD2|HKNMT_PLAF7 Putative histone-lysine N-methyltransferase PFF1440w OS=Plasmodium
falciparum (isolate 3D7) GN=PFF1440w PE=3 SV=1
Length = 6753
Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats.
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 6/66 (9%)
Query: 548 CLLCHSSAAGD----WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNF 603
C++C+ D WV C +C W H CD+ + + ++Y CP CS+ F
Sbjct: 1764 CIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDKNESRNI--ETLSNKNIDYKCPTCSIGTF 1821
Query: 604 KKKSQK 609
K ++
Sbjct: 1822 HDKIER 1827
>sp|Q3UXZ9|KDM5A_MOUSE Lysine-specific demethylase 5A OS=Mus musculus GN=Kdm5a PE=1 SV=2
Length = 1690
Score = 44.7 bits (104), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEIVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>sp|P29375|KDM5A_HUMAN Lysine-specific demethylase 5A OS=Homo sapiens GN=KDM5A PE=1 SV=3
Length = 1690
Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 442 EFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 501
+FL + +F L+G T +P ++ K LDL+ L + V S+GGF +
Sbjct: 89 DFLDQLAKFWELQGSTLKIP----------VVERKILDLYALSKIVASKGGFEMVT---- 134
Query: 502 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDV 543
K + +SK+ + G G+ LK HYE L YEL V
Sbjct: 135 KEKKWSKVGSRLGYLPGKGTGSLLKSHYERILYPYELFQSGV 176
>sp|A6PWV5|ARI3C_MOUSE AT-rich interactive domain-containing protein 3C OS=Mus musculus
GN=Arid3c PE=2 SV=2
Length = 409
Score = 42.4 bits (98), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 16/99 (16%)
Query: 441 EEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNG 498
+EFL D+ F+ RG VP I+ + LDL+ L+R V ++GG V N
Sbjct: 114 KEFLDDLFSFMQKRGTPVNRVP----------IMAKQVLDLYALFRLVTAKGGLVEVINR 163
Query: 499 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ R +L +T TL+ Y YL YE
Sbjct: 164 KVWR----EVTRGLSLPTTITSAAFTLRTQYMKYLYPYE 198
>sp|A6NKF2|ARI3C_HUMAN AT-rich interactive domain-containing protein 3C OS=Homo sapiens
GN=ARID3C PE=2 SV=1
Length = 412
Score = 41.6 bits (96), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 16/99 (16%)
Query: 441 EEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNG 498
+EFL D+ F+ RG VP I+ + LDL+ L+R V ++GG V N
Sbjct: 117 KEFLDDLFSFMQKRGTPVNRVP----------IMAKQVLDLYALFRLVTAKGGLVEVINR 166
Query: 499 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ R +L +T TL+ Y YL YE
Sbjct: 167 KVWR----EVTRGLSLPTTITSAAFTLRTQYMKYLYPYE 201
>sp|Q9VMJ7|KDM5_DROME Lysine-specific demethylase lid OS=Drosophila melanogaster GN=lid
PE=1 SV=1
Length = 1838
Score = 40.4 bits (93), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 14/96 (14%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
FL + +F L+G + +P ++ K LDL+ L+R V GG K
Sbjct: 232 FLDQIAKFWELQGSSLKIP----------MVERKALDLYTLHRIVQEEGGMEQTT----K 277
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
+ ++K+ N VG TLK HYE L +E+
Sbjct: 278 DRKWAKVANRMQYPSSKSVGATLKAHYERILHPFEV 313
>sp|Q4H3P5|DRI_CIOIN Protein dead ringer homolog OS=Ciona intestinalis GN=Ci-DRIL1/2
PE=2 SV=1
Length = 571
Score = 40.0 bits (92), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 46/98 (46%), Gaps = 14/98 (14%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
+EFL D+ F+ RG P + I+ + LDL+ LYR VV +GG V N
Sbjct: 253 KEFLDDLFSFMQKRG----TPVNRIP-----IMAKQVLDLYQLYRLVVEKGGLVEVINKK 303
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ +K N L + +T TL+ Y YL +E
Sbjct: 304 IWRE--ITKGLN--LPSSITSAAFTLRTQYMKYLYPFE 337
>sp|Q24573|DRI_DROME Protein dead ringer OS=Drosophila melanogaster GN=retn PE=1 SV=2
Length = 911
Score = 38.5 bits (88), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 53/126 (42%), Gaps = 28/126 (22%)
Query: 413 QAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAI 472
Q +Q+ +N P +K EFL D+ F+ RG P L I
Sbjct: 283 QVRQLYEINDDPKRK--------------EFLDDLFSFMQKRG----TPINRLP-----I 319
Query: 473 LNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYET 531
+ LDL+ LY V++RGG V N W+ ++ + L + +T TL+ Y
Sbjct: 320 MAKSVLDLYELYNLVIARGGLVDVINKKLWQ-EIIKGLH---LPSSITSAAFTLRTQYMK 375
Query: 532 YLLEYE 537
YL YE
Sbjct: 376 YLYPYE 381
>sp|A2BEA6|ARI3A_DANRE AT-rich interactive domain-containing protein 3A OS=Danio rerio
GN=arid3a PE=1 SV=1
Length = 570
Score = 38.1 bits (87), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 16/99 (16%)
Query: 441 EEFLRDVMQFLILRGH-TRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNG 498
+EFL D+ F+ RG +P I+ + LDL+ LY+ V +GG V N
Sbjct: 236 KEFLDDLFSFMQKRGTPVNRIP----------IMAKQVLDLYTLYKLVTEKGGLVEVINK 285
Query: 499 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ +K N L +T TL+ Y YL YE
Sbjct: 286 KIWRE--ITKGLN--LPTSITSAAFTLRTQYMKYLYPYE 320
>sp|Q5F3R2|KDM5B_CHICK Lysine-specific demethylase 5B OS=Gallus gallus GN=KDM5B PE=2 SV=1
Length = 1522
Score = 37.7 bits (86), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 41/96 (42%), Gaps = 14/96 (14%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
FL + +F L+G T +P + K LDLF L R V GGF V K
Sbjct: 81 FLDQIAKFWELQGCTLKIPH----------VERKILDLFQLNRLVAEEGGFDVVC----K 126
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEL 538
+ ++K+ VG+ ++ HYE L Y L
Sbjct: 127 ERKWTKIATRMGFAPGKAVGSHIRAHYERILYPYNL 162
>sp|Q8MQH7|DRI_STRPU Protein dead ringer homolog OS=Strongylocentrotus purpuratus GN=dri
PE=2 SV=1
Length = 490
Score = 37.4 bits (85), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 45/98 (45%), Gaps = 14/98 (14%)
Query: 441 EEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGF-HVGNGI 499
+EFL D+ ++ RG P + I+ + LDL+ LY VV++GG V N
Sbjct: 206 KEFLDDLFSYMQKRG----TPVNRIP-----IMAKQVLDLYELYNLVVAKGGLVEVINKK 256
Query: 500 NWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYE 537
W+ +K N L +T TL+ Y YL YE
Sbjct: 257 QWR--EITKGLN--LPASITSAAFTLRTQYMKYLYPYE 290
>sp|O02326|CFI1_CAEEL AT-rich interactive domain-containing protein cfi-1
OS=Caenorhabditis elegans GN=cfi-1 PE=1 SV=3
Length = 467
Score = 37.0 bits (84), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 472 ILNAKRLDLFNLYREVVSRGGF-HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYE 530
I+ + LDL+ LYR VV GG + N W+ + L + +T TL+ Y+
Sbjct: 207 IMAKQVLDLYELYRLVVQHGGLVEIINKKLWR----EITKGLNLPSSITSAAFTLRTQYQ 262
Query: 531 TYLLEYELAHDDVDGECCL 549
YL +YE + + + L
Sbjct: 263 KYLYDYECEKEKLSNQSDL 281
>sp|Q9M364|ATX3_ARATH Histone-lysine N-methyltransferase ATX3 OS=Arabidopsis thaliana
GN=ATX3 PE=2 SV=2
Length = 1018
Score = 36.2 bits (82), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 4/56 (7%)
Query: 545 GECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSV 600
G C + H S GDWV C C W H CD + + + + Y CP C V
Sbjct: 366 GICKRIWHPSDDGDWVCCDGCDVWVHAECDNITN----ERFKELEHNNYYCPDCKV 417
>sp|Q68CP9|ARID2_HUMAN AT-rich interactive domain-containing protein 2 OS=Homo sapiens
GN=ARID2 PE=1 SV=2
Length = 1835
Score = 35.8 bits (81), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 42/98 (42%), Gaps = 12/98 (12%)
Query: 443 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 502
FL ++ QF H+R P + P + K LDL LY V + GGF + N
Sbjct: 19 FLDELRQF----HHSRGSP---FKKIP--AVGGKELDLHGLYTRVTTLGGFAKVSEKNQW 69
Query: 503 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAH 540
G++ + + LK++Y YL +YE H
Sbjct: 70 GEIVEEFN---FPRSCSNAAFALKQYYLRYLEKYEKVH 104
>sp|Q9SGS2|HMGB9_ARATH High mobility group B protein 9 OS=Arabidopsis thaliana GN=HMGB9
PE=2 SV=1
Length = 338
Score = 34.7 bits (78), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 9/78 (11%)
Query: 466 AEFPDAILNAKRLDLFNLYREVVSRGGFH-VGNGINWK--GQVFSKMRNHTLTNRMTGVG 522
+F ++ K LDL LY EV RGG+ V W+ G VF + T
Sbjct: 57 TKFMIPVIGGKELDLHVLYVEVTRRGGYEKVVVEKKWREVGGVFR------FSATTTSAS 110
Query: 523 NTLKRHYETYLLEYELAH 540
L++HY L YE H
Sbjct: 111 FVLRKHYLNLLFHYEQVH 128
>sp|Q9FYH1|HAC2_ARATH Histone acetyltransferase HAC2 OS=Arabidopsis thaliana GN=HAC2 PE=2
SV=2
Length = 1367
Score = 34.3 bits (77), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 21/42 (50%), Gaps = 7/42 (16%)
Query: 558 DWVNCGICGEWAHFGCDRRQGL-GAFKDYAKTDGLEYVCPQC 598
+WV C C +W H C GL KD KT EY+CP C
Sbjct: 727 EWVQCESCEKWQHQIC----GLYNKLKDEDKT--AEYICPTC 762
>sp|Q8GZ42|ATX5_ARATH Histone-lysine N-methyltransferase ATX5 OS=Arabidopsis thaliana
GN=ATX5 PE=2 SV=1
Length = 1043
Score = 33.9 bits (76), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 19/40 (47%), Gaps = 4/40 (10%)
Query: 559 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQC 598
WV C C W H CD + FKD +TD Y CP C
Sbjct: 431 WVRCDGCKVWIHSACD-QISHKHFKDLGETD---YYCPTC 466
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.134 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 233,773,419
Number of Sequences: 539616
Number of extensions: 10035020
Number of successful extensions: 22339
Number of sequences better than 100.0: 28
Number of HSP's better than 100.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 22311
Number of HSP's gapped (non-prelim): 41
length of query: 614
length of database: 191,569,459
effective HSP length: 124
effective length of query: 490
effective length of database: 124,657,075
effective search space: 61081966750
effective search space used: 61081966750
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)