BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003362
(826 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q2KIA6|PRPF3_BOVIN U4/U6 small nuclear ribonucleoprotein Prp3 OS=Bos taurus GN=PRPF3
PE=2 SV=1
Length = 683
Score = 268 bits (684), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 199/523 (38%), Positives = 281/523 (53%), Gaps = 68/523 (13%)
Query: 314 AAAAAASVQPPTSSVPAFPGLANITNIEAVKRAQELAAKMGFRQDPEFAPIINCFPGQPP 373
AA AS+Q + P G AN+ + + MG AP P
Sbjct: 206 AAELQASIQAQLALKPGLIGNANMVGLANLH-------AMGI------AP--------PK 244
Query: 374 VDAAVPQKPTKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKP 432
V+ KPT L +D GR +D G + T + TLK NI K++ F Q LK
Sbjct: 245 VELKDQTKPTP---LILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQLKE 299
Query: 433 ELEVDPNVNPHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKER 492
+ D N FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++
Sbjct: 300 KPSEDMESNTFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKL 351
Query: 493 QAKQAQLAKAKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDV 552
QA+ +Q A+ K G + L +A + KE + IPEIEWWD+ ++ G D+
Sbjct: 352 QAEISQAAR-KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DL 401
Query: 553 TIEDKLKRE---KITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDR 609
T E+ KRE IT VEHP + PP + P + LTKKEQKKLR Q R +K+
Sbjct: 402 T-EENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKEL 460
Query: 610 QEMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKL 669
QE +R GL+ PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKL
Sbjct: 461 QEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKL 520
Query: 670 TPAERREKKERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGI 729
T AE+R+ K+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +
Sbjct: 521 T-AEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDV 579
Query: 730 NVVVVEGGSKSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVL 778
NVVVVEGG K+ K++ +LML RI W DE T K NKCVL
Sbjct: 580 NVVVVEGGPKAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVL 632
Query: 779 VWQGNVARPSFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 821
VW+G SF +C TE A++ F G HYWDLA++
Sbjct: 633 VWEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q5R5F1|PRPF3_PONAB U4/U6 small nuclear ribonucleoprotein Prp3 OS=Pongo abelii GN=PRPF3
PE=2 SV=1
Length = 683
Score = 267 bits (683), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 259/454 (57%), Gaps = 44/454 (9%)
Query: 383 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 441
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFRQQLKEKPSEDMESN 308
Query: 442 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 501
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 502 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 561
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 562 ---KITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 618
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLM 469
Query: 619 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 678
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 679 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 738
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 739 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 787
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 788 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 821
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|O43395|PRPF3_HUMAN U4/U6 small nuclear ribonucleoprotein Prp3 OS=Homo sapiens GN=PRPF3
PE=1 SV=2
Length = 683
Score = 267 bits (682), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 259/454 (57%), Gaps = 44/454 (9%)
Query: 383 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 441
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 442 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 501
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 502 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 561
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 562 ---KITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 618
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLM 469
Query: 619 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 678
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 679 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 738
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 739 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 787
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 788 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 821
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q922U1|PRPF3_MOUSE U4/U6 small nuclear ribonucleoprotein Prp3 OS=Mus musculus GN=Prpf3
PE=1 SV=1
Length = 683
Score = 267 bits (682), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 184/454 (40%), Positives = 258/454 (56%), Gaps = 44/454 (9%)
Query: 383 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 441
TK L +D GR +D G V T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEVELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 442 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 501
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 502 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 561
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 562 ---KITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 618
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLT 469
Query: 619 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 678
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 679 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 738
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 739 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 787
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 788 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 821
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q5ZJ85|PRPF3_CHICK U4/U6 small nuclear ribonucleoprotein Prp3 OS=Gallus gallus
GN=PRPF3 PE=2 SV=1
Length = 684
Score = 263 bits (671), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 179/451 (39%), Positives = 255/451 (56%), Gaps = 37/451 (8%)
Query: 383 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 441
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 442 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 501
+FD R+ I ++ R KR TF+F E+GK+ K A+ LR K+Q K+ QA+ +Q A+
Sbjct: 309 TYFDLRVSITPAQ--RQKR-TFKFHEKGKFEKIAQRLRTKAQL-----KKLQAEISQAAR 360
Query: 502 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 561
K G + L + + KE + IPEIEWWD+ ++ G D+ T K +
Sbjct: 361 -KTGIHTSTKLALITPK---KELKEGEIPEIEWWDSYIIPNG--LDLKGG-TSSKKDEYF 413
Query: 562 KITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPP 621
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+ PP
Sbjct: 414 GITNLVEHPAQLNPPVDSDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLMPPP 473
Query: 622 KPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKKERK 681
+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K+ K
Sbjct: 474 EPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKAKKIK 532
Query: 682 LFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGSKSI 741
+ S ++VY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG K+
Sbjct: 533 KLKEDVSQGVHIAVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGPKAQ 592
Query: 742 KRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARPSFN 790
K++ +LML RI W DE T K NKC LVW+G SF
Sbjct: 593 KKFKRLMLHRIKW-------DEQTSNTKGEDDDESDEESVKKTNKCSLVWEGTAKDRSFG 645
Query: 791 RFFVHECMTEAAAKKVFADAGVAHYWDLAVN 821
+C TE A++ F G HYWDLA++
Sbjct: 646 EMKFKQCPTENMAREHFKKHGAEHYWDLALS 676
>sp|Q09856|YAF2_SCHPO Uncharacterized protein C29E6.02 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPAC29E6.02 PE=4 SV=1
Length = 542
Score = 191 bits (485), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 153/449 (34%), Positives = 224/449 (49%), Gaps = 57/449 (12%)
Query: 405 NRTKPSNLSTLKVNINKQKKDAFQILKPE--LEVDPNVN----------PHFDPRMGINK 452
NR + ++ ST V++++ Q+LKP E +P ++ P +DPR +
Sbjct: 124 NRKRTASFSTKGVSLSQH-----QLLKPPAITEQNPFLDTAPTPRLEDSPFYDPRT--KE 176
Query: 453 SKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAKAKGGTDINPNL 512
S+ R R E GK+ +EA R +++ E K+ L K G
Sbjct: 177 SRKTRGSR-NLHLNESGKFIEEANQARRQARL------EDLKKRIALHSHKAG------- 222
Query: 513 IEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKREKITIYVEHPRP 572
IE + +K +D IP IEWWD P + Y D D+ + I ++HP P
Sbjct: 223 IEDELDITSKSIGRDTIPNIEWWDLPFI--KDYNDYGDENNWLIDGPQSIINSAIQHPIP 280
Query: 573 IEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGL-IEPPKP-KVKMSNL 630
+ PP P + LTKKEQKK+R Q R K++Q+ RQ L IEPP+P KVK+SNL
Sbjct: 281 VLPPYAKNQPSSHSVFLTKKEQKKIRRQTRAEARKEKQD--RQLLGIEPPEPPKVKLSNL 338
Query: 631 MKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKKERKLFDDPSSVE 690
M VLG +A +DPT++E E+R ER H N RKLTP ER+EK RK D+ S+
Sbjct: 339 MHVLGDDAIKDPTKIEAEVRKQVEERRLRHERENEERKLTPEERKEKAFRKK-DEDSAAG 397
Query: 691 TIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGSKSIKRYGKLMLR 750
V++I L+H R K+D+NA + TG ++ N+V+ E G K+IK+ +LML
Sbjct: 398 LRCLVFRIKYLAHRPHRLKIDLNAKQWGATGVCILNANFNLVIFEAGQKAIKKLKRLMLE 457
Query: 751 RIDWA------------KAVKEEDEDEDETTDKPVNKCVLVWQGNVARPSFNRFFVHECM 798
RIDW K V E + + T N C LVW+G + R +F + C
Sbjct: 458 RIDWTDTSRNSIIAQGNKLVDTEGRELNYTE----NTCNLVWEGEIGRRAFRYWSFRSCP 513
Query: 799 TEAAAKKVFADAGVA-HYWDLAVNFNDEI 826
+E AK + G A H+W LA ++++ +
Sbjct: 514 SENDAKSYLEEQGGAEHFWMLAKSWSENV 542
>sp|Q03338|PRP3_YEAST U4/U6 small nuclear ribonucleoprotein PRP3 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=PRP3 PE=1
SV=1
Length = 469
Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 134/303 (44%), Gaps = 22/303 (7%)
Query: 532 IEWWDAPLL-----LTGSYADISDDVTIEDKL----KREKITIYVEHPRPIEPPAEPAPP 582
EWWD L L Y D++ E+KL EHP I A P P
Sbjct: 166 YEWWDTAYLDEKGELMEKY-DMNGTSPAEEKLAEDIDEVDDDDDDEHP-SIRYVAHPLPE 223
Query: 583 PPQPLK------LTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVL-G 635
K LT+ E+K+LR RR + R+ I+ GL+ P+PKVK+SN+M V
Sbjct: 224 KINEAKVSIKAYLTQHERKRLRRNRRKMAREAREIKIKLGLLPKPEPKVKLSNMMSVFEN 283
Query: 636 SEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKKERKLFDDPSSVETIVSV 695
+ DPT EK ++ R++ H++ N R +RR++ + P+ V
Sbjct: 284 DQNITDPTAWEKVVKDQVDLRKRKHLEENERRHEDAIKRRKEAVNMNVEKPTVYH--CKV 341
Query: 696 YKINDLSHPKTRFKVDVNAHENRLTGCAVIC--EGINVVVVEGGSKSIKRYGKLMLRRID 753
++ +L +PK RFK+ +N+ E L G + +G +++V G KS K Y L+++RI
Sbjct: 342 FQFKNLQNPKIRFKLKMNSKELSLKGLCLRIRDDGPGIIIVVGNEKSCKFYENLVMKRIK 401
Query: 754 WAKAVKEEDEDEDETTDKPVNKCVLVWQGNVARPSFNRFFVHECMTEAAAKKVFADAGVA 813
W + + D D N W+G + F +F+ C + + +
Sbjct: 402 WNEDFELHTNTGDIKMDMHNNSISKTWEGYLQDCKFKGWFMKVCNDQDSLLRTLGQFDSE 461
Query: 814 HYW 816
H++
Sbjct: 462 HFY 464
>sp|F1QBY1|NIPLB_DANRE Nipped-B-like protein B OS=Danio rerio GN=nipblb PE=2 SV=1
Length = 2876
Score = 37.0 bits (84), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 9/110 (8%)
Query: 2 DKDKSSKRSHDHHRDRDGEHRHRHRDRDDKHRDHHHHRSSDHKSSRRDERERSRDREKSI 61
D+DK ++ + RD++ E + R +D+D K + + D + + E++R + REK
Sbjct: 793 DQDKELEKGREKDRDKEME-KAREKDQD-KELEKGREKDQDKELEKGQEKDRDKVREKDR 850
Query: 62 D-----ERDFKREKSHDD-RERDRDFKREKSHEERDSRRDRDRSREVKRE 105
D +RD REK D RE+DRD REK E+ RDRD+ RE R+
Sbjct: 851 DKVRDKDRDKVREKDRDKVREKDRDKLREKDR-EKIRERDRDKGREKDRD 899
Score = 35.8 bits (81), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 94/188 (50%), Gaps = 28/188 (14%)
Query: 2 DKDKSSKRSHDHHRDRD-----GEHRHRHRDR---DDKHRDHHHHRSSDHKSSRRDER-- 51
DK++ + D + RD G ++ R +DR +K RD + D K S+ E+
Sbjct: 707 DKERVKDQEKDQEKGRDKEVEKGRYKERVKDRVKEQEKVRDKEQVKGRDKKRSKDLEKCR 766
Query: 52 ------ERSRDREKSID-ERDFKREKSHD-----DRERDRDFKREKSHE-----ERDSRR 94
E +DREK+ D E + REK D RE+DRD + EK+ E E + R
Sbjct: 767 EKDQDKELEKDREKNQDKELEKGREKDQDKELEKGREKDRDKEMEKAREKDQDKELEKGR 826
Query: 95 DRDRSREVKREEFDDGDDVASEVKKKRKARDESEERGEKKNARVLDEKREKKRKFEDNFV 154
++D+ +E+++ + D D V + + K + +D + R EK +V ++ R+K R+ + +
Sbjct: 827 EKDQDKELEKGQEKDRDKVREKDRDKVRDKDRDKVR-EKDRDKVREKDRDKLREKDREKI 885
Query: 155 KEEEDDGG 162
+E + D G
Sbjct: 886 RERDRDKG 893
>sp|Q54DH8|TAF1_DICDI Transcription initiation factor TFIID subunit 1 OS=Dictyostelium
discoideum GN=taf1 PE=3 SV=1
Length = 2310
Score = 36.2 bits (82), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 9/77 (11%)
Query: 5 KSSKRSHDHHRD---RDGEHR---HR---HRDRDDKHRDHHHHRSSDHKSSRRDERERSR 55
+SS R HR R EHR HR HR D +HR HRSS+H+S R+ R
Sbjct: 1789 RSSDRGDREHRSSEHRSSEHRSSEHRSSEHRSGDKEHRSDREHRSSEHRSGDREHRSDRE 1848
Query: 56 DREKSIDERDFKREKSH 72
RE +R+ + ++ H
Sbjct: 1849 HREHRSGDREHRSDREH 1865
>sp|O23212|U2A2A_ARATH Splicing factor U2af large subunit A OS=Arabidopsis thaliana
GN=U2AF65A PE=2 SV=2
Length = 573
Score = 35.8 bits (81), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 11/53 (20%)
Query: 3 KDKSSKRSHDHHRDRDGEHRHRHRDRD---------DKHRDHH--HHRSSDHK 44
KD+ ++ D R+RD E R RDRD DK RDH HHRSS H+
Sbjct: 52 KDREREKGRDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRERHHRSSRHR 104
>sp|Q6PGZ3|MIPT3_DANRE TRAF3-interacting protein 1 OS=Danio rerio GN=traf3ip1 PE=2 SV=1
Length = 629
Score = 35.8 bits (81), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 15/113 (13%)
Query: 3 KDKSSKRSHDHHRDRD---------GEHRHRHRDRDDKHRDHHHHRSSDHKSSRRD---E 50
+DK ++ +HHRDR+ G + D+ + + R D E
Sbjct: 146 QDKENREGREHHRDREERKGIKESSGSREQKDPDQPKDQESKRDDKDRRRDAERSDKGRE 205
Query: 51 RERSRDREKSIDE-RDFKREKSHD-DRERDRDFKREKSHEERDSRRDRDRSRE 101
RER++DR++ D+ RD +++K+ + +RER++D REK ERD RD+ + RE
Sbjct: 206 RERTKDRDRDKDKSRDREKDKTREKEREREKDRNREKER-ERDKDRDKKKERE 257
>sp|P41512|TOP1_XENLA DNA topoisomerase 1 OS=Xenopus laevis GN=top1 PE=2 SV=1
Length = 829
Score = 34.7 bits (78), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 13/86 (15%)
Query: 14 HRDRDGEHRHRHRDRD-DKHRDHHHHRSSDHKSSRRDERERSRDREKSIDERDFKREKSH 72
HR+RDGE +HRD++ +KHRD H+ D + + E+ R +D EK E+D ++ K
Sbjct: 77 HRERDGE---KHRDKNGEKHRDGEKHKEKDIEKHKEVEKHRVKDGEKH-KEKDVEKHKE- 131
Query: 73 DDRERDRDFKREKSHEERDSRRDRDR 98
D E+ RD E+ RD+DR
Sbjct: 132 KDVEKHRD-------GEKHKHRDKDR 150
>sp|Q8C761|WDR60_MOUSE WD repeat-containing protein 60 OS=Mus musculus GN=Wdr60 PE=2 SV=1
Length = 999
Score = 33.5 bits (75), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 53/92 (57%), Gaps = 11/92 (11%)
Query: 20 EHRHRHRDRDDKHRDHHHHR----SSDHKSSRRDERERSRDREKSIDERDFKREKSHDDR 75
EH+ R DR+ +H++ R S++H R ER+R R +E+ D +D +++K +
Sbjct: 53 EHKSRDPDREARHKEKTAERDLYTSTEHP---RGERDRERHKERRKDAKDREKDKL---K 106
Query: 76 ERDRDFKREKSHEERDSRRDRDRSREVKREEF 107
ER RD + EK+H R R+R++ R ++EE
Sbjct: 107 ERHRDQEAEKAHS-RGKDREREKDRRARKEEI 137
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.310 0.128 0.360
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 319,208,746
Number of Sequences: 539616
Number of extensions: 14683267
Number of successful extensions: 146759
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1179
Number of HSP's successfully gapped in prelim test: 2620
Number of HSP's that attempted gapping in prelim test: 86896
Number of HSP's gapped (non-prelim): 29597
length of query: 826
length of database: 191,569,459
effective HSP length: 126
effective length of query: 700
effective length of database: 123,577,843
effective search space: 86504490100
effective search space used: 86504490100
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 66 (30.0 bits)