BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006987
(622 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O43395|PRPF3_HUMAN U4/U6 small nuclear ribonucleoprotein Prp3 OS=Homo sapiens GN=PRPF3
PE=1 SV=2
Length = 683
Score = 271 bits (692), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 259/454 (57%), Gaps = 44/454 (9%)
Query: 179 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 237
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 238 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 297
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 298 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 357
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 358 K---ITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 414
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLM 469
Query: 415 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 474
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 475 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 534
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 535 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 583
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 584 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 617
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q2KIA6|PRPF3_BOVIN U4/U6 small nuclear ribonucleoprotein Prp3 OS=Bos taurus GN=PRPF3
PE=2 SV=1
Length = 683
Score = 271 bits (692), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 259/454 (57%), Gaps = 44/454 (9%)
Query: 179 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 237
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 238 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 297
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 298 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 357
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 358 K---ITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 414
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLM 469
Query: 415 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 474
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 475 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 534
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 535 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 583
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 584 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 617
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q5R5F1|PRPF3_PONAB U4/U6 small nuclear ribonucleoprotein Prp3 OS=Pongo abelii GN=PRPF3
PE=2 SV=1
Length = 683
Score = 270 bits (691), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 259/454 (57%), Gaps = 44/454 (9%)
Query: 179 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 237
TK L +D GR +D G + T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFRQQLKEKPSEDMESN 308
Query: 238 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 297
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 298 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 357
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 358 K---ITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 414
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL+
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLM 469
Query: 415 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 474
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 475 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 534
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 535 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 583
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 584 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 617
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q922U1|PRPF3_MOUSE U4/U6 small nuclear ribonucleoprotein Prp3 OS=Mus musculus GN=Prpf3
PE=1 SV=1
Length = 683
Score = 270 bits (691), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 184/454 (40%), Positives = 258/454 (56%), Gaps = 44/454 (9%)
Query: 179 TKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QILKPELEVDPNVN 237
TK L +D GR +D G V T + TLK NI K++ F Q LK + D N
Sbjct: 251 TKPTPLILDEQGRTVDATGKEVELT--HRMPTLKANIRAVKREQFKQQLKEKPSEDMESN 308
Query: 238 PHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAK 297
FDPR+ I S+ +R TF+F ++GK+ K A+ LR K+Q ++ QA+ +Q A+
Sbjct: 309 TFFDPRVSIAPSQR---QRRTFKFHDKGKFEKIAQRLRTKAQL-----EKLQAEISQAAR 360
Query: 298 AKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKRE 357
K G + L +A + KE + IPEIEWWD+ ++ G D+T E+ KRE
Sbjct: 361 -KTGIHTSTRLALIAPK---KELKEGDIPEIEWWDSYIIPNGF------DLT-EENPKRE 409
Query: 358 K---ITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLI 414
IT VEHP + PP + P + LTKKEQKKLR Q R +K+ QE +R GL
Sbjct: 410 DYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRRQTRREAQKELQEKVRLGLT 469
Query: 415 EPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKK 474
PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT AE+R+ K
Sbjct: 470 PPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT-AEQRKVK 528
Query: 475 ERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGS 534
+ K + S +SVY++ +LS+P +FK++ NA + LTG V+ + +NVVVVEGG
Sbjct: 529 KIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVNVVVVEGGP 588
Query: 535 KSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLVWQGNVARP 583
K+ K++ +LML RI W DE T K NKCVLVW+G
Sbjct: 589 KAQKKFKRLMLHRIKW-------DEQTSNTKGDDDEESDEEAVKKTNKCVLVWEGTAKDR 641
Query: 584 SFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 617
SF +C TE A++ F G HYWDLA++
Sbjct: 642 SFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 675
>sp|Q5ZJ85|PRPF3_CHICK U4/U6 small nuclear ribonucleoprotein Prp3 OS=Gallus gallus
GN=PRPF3 PE=2 SV=1
Length = 684
Score = 264 bits (674), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 182/462 (39%), Positives = 261/462 (56%), Gaps = 40/462 (8%)
Query: 168 PPVDAAVPQKPTKAPVLRVDALGREIDEHGNVVNRTKPSNLSTLKVNINKQKKDAF-QIL 226
P V+ KPT P++ +D GR +D G + T + TLK NI K++ F Q L
Sbjct: 243 PKVELKDQTKPT--PLI-LDEQGRTVDATGKEIELT--HRMPTLKANIRAVKREQFKQQL 297
Query: 227 KPELEVDPNVNPHFDPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAK 286
K + D N +FD R+ I ++ R KR TF+F E+GK+ K A+ LR K+Q K
Sbjct: 298 KEKPSEDMESNTYFDLRVSITPAQ--RQKR-TFKFHEKGKFEKIAQRLRTKAQL-----K 349
Query: 287 ERQAKQAQLAKAKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISD 346
+ QA+ +Q A+ K G + L + + KE + IPEIEWWD+ ++ G D+
Sbjct: 350 KLQAEISQAAR-KTGIHTSTKLALITPK---KELKEGEIPEIEWWDSYIIPNG--LDLKG 403
Query: 347 DVTIEDKLKREKITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQ 406
T K + IT VEHP + PP + P + LTKKEQKKLR Q R +K+ Q
Sbjct: 404 G-TSSKKDEYFGITNLVEHPAQLNPPVDSDTPVTLGVYLTKKEQKKLRRQTRREAQKELQ 462
Query: 407 EMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLT 466
E +R GL+ PP+PKV++SNLM+VLG+EA QDPT++E +R+ A+R++AH + N ARKLT
Sbjct: 463 EKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQKAHEEANAARKLT 522
Query: 467 PAERREKKERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGIN 526
AE+R+ K+ K + S ++VY++ +LS+P +FK++ NA + LTG V+ + +N
Sbjct: 523 -AEQRKAKKIKKLKEDVSQGVHIAVYRVRNLSNPAKKFKIEANAGQLYLTGVVVLHKDVN 581
Query: 527 VVVVEGGSKSIKRYGKLMLRRIDWAKAVKEEDEDEDETTD-----------KPVNKCVLV 575
VVVVEGG K+ K++ +LML RI W DE T K NKC LV
Sbjct: 582 VVVVEGGPKAQKKFKRLMLHRIKW-------DEQTSNTKGEDDDESDEESVKKTNKCSLV 634
Query: 576 WQGNVARPSFNRFFVHECMTEAAAKKVFADAGVAHYWDLAVN 617
W+G SF +C TE A++ F G HYWDLA++
Sbjct: 635 WEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALS 676
>sp|Q09856|YAF2_SCHPO Uncharacterized protein C29E6.02 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPAC29E6.02 PE=4 SV=1
Length = 542
Score = 191 bits (485), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 150/445 (33%), Positives = 222/445 (49%), Gaps = 49/445 (11%)
Query: 201 NRTKPSNLSTLKVNINKQKKDAFQILKPE--LEVDPNVN----------PHFDPRMGINK 248
NR + ++ ST V++++ Q+LKP E +P ++ P +DPR +
Sbjct: 124 NRKRTASFSTKGVSLSQH-----QLLKPPAITEQNPFLDTAPTPRLEDSPFYDPR--TKE 176
Query: 249 SKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQLAKAKGGTDINPNL 308
S+ R R E GK+ +EA R +++ E K+ L K G
Sbjct: 177 SRKTRGSR-NLHLNESGKFIEEANQARRQARL------EDLKKRIALHSHKAG------- 222
Query: 309 IEVAERVITKEKPKDPIPEIEWWDAPLLLTGSYADISDDVTIEDKLKREKITIYVEHPRP 368
IE + +K +D IP IEWWD P + Y D D+ + I ++HP P
Sbjct: 223 IEDELDITSKSIGRDTIPNIEWWDLPFI--KDYNDYGDENNWLIDGPQSIINSAIQHPIP 280
Query: 369 IEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGL-IEPPKP-KVKMSNL 426
+ PP P + LTKKEQKK+R Q R K++Q+ RQ L IEPP+P KVK+SNL
Sbjct: 281 VLPPYAKNQPSSHSVFLTKKEQKKIRRQTRAEARKEKQD--RQLLGIEPPEPPKVKLSNL 338
Query: 427 MKVLGSEATQDPTRLEKEIRSAAAEREQAHIDRNIARKLTPAERREKKERKLFDDPSSVE 486
M VLG +A +DPT++E E+R ER H N RKLTP ER+EK RK D+ S+
Sbjct: 339 MHVLGDDAIKDPTKIEAEVRKQVEERRLRHERENEERKLTPEERKEKAFRKK-DEDSAAG 397
Query: 487 TIVSVYKINDLSHPKTRFKVDVNAHENRLTGCAVICEGINVVVVEGGSKSIKRYGKLMLR 546
V++I L+H R K+D+NA + TG ++ N+V+ E G K+IK+ +LML
Sbjct: 398 LRCLVFRIKYLAHRPHRLKIDLNAKQWGATGVCILNANFNLVIFEAGQKAIKKLKRLMLE 457
Query: 547 RIDWAKAVKEED--------EDEDETTDKPVNKCVLVWQGNVARPSFNRFFVHECMTEAA 598
RIDW + + E + N C LVW+G + R +F + C +E
Sbjct: 458 RIDWTDTSRNSIIAQGNKLVDTEGRELNYTENTCNLVWEGEIGRRAFRYWSFRSCPSEND 517
Query: 599 AKKVFADAGVA-HYWDLAVNFNDEI 622
AK + G A H+W LA ++++ +
Sbjct: 518 AKSYLEEQGGAEHFWMLAKSWSENV 542
>sp|Q03338|PRP3_YEAST U4/U6 small nuclear ribonucleoprotein PRP3 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=PRP3 PE=1
SV=1
Length = 469
Score = 89.4 bits (220), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 175/404 (43%), Gaps = 34/404 (8%)
Query: 237 NPHF-DPRMGINKSKLLRPKRMTFQFVEEGKWSKEAEILRVKSQFGEAGAKERQAKQAQL 295
NP+ DP + S+ R +F ++G+ SK R + E +R+ KQ +
Sbjct: 67 NPYLSDPHDRGSSSRFNRRYERGLKFYQKGEISKRIAQERTLQKQQEEEELKRKLKQEED 126
Query: 296 AKAK----GGTDINPNLIEVAERVITKEKPK-----DPIPEIEWWDAPLL-----LTGSY 341
K K D+ PNL E+ E + K D EWWD L L Y
Sbjct: 127 EKDKRKLIESGDL-PNL-ELHEDKFLLDLSKFKIYYDNNHGYEWWDTAYLDEKGELMEKY 184
Query: 342 ADISDDVTIEDKL----KREKITIYVEHPRPIEPPAEPAPPPPQPLK------LTKKEQK 391
D++ E+KL EHP I A P P K LT+ E+K
Sbjct: 185 -DMNGTSPAEEKLAEDIDEVDDDDDDEHP-SIRYVAHPLPEKINEAKVSIKAYLTQHERK 242
Query: 392 KLRTQRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVLGSEAT-QDPTRLEKEIRSAAA 450
+LR RR + R+ I+ GL+ P+PKVK+SN+M V ++ DPT EK ++
Sbjct: 243 RLRRNRRKMAREAREIKIKLGLLPKPEPKVKLSNMMSVFENDQNITDPTAWEKVVKDQVD 302
Query: 451 EREQAHIDRNIARKLTPAERREKKERKLFDDPSSVETIVSVYKINDLSHPKTRFKVDVNA 510
R++ H++ N R +RR++ + P+ V++ +L +PK RFK+ +N+
Sbjct: 303 LRKRKHLEENERRHEDAIKRRKEAVNMNVEKPTVYH--CKVFQFKNLQNPKIRFKLKMNS 360
Query: 511 HENRLTG-CAVI-CEGINVVVVEGGSKSIKRYGKLMLRRIDWAKAVKEEDEDEDETTDKP 568
E L G C I +G +++V G KS K Y L+++RI W + + D D
Sbjct: 361 KELSLKGLCLRIRDDGPGIIIVVGNEKSCKFYENLVMKRIKWNEDFELHTNTGDIKMDMH 420
Query: 569 VNKCVLVWQGNVARPSFNRFFVHECMTEAAAKKVFADAGVAHYW 612
N W+G + F +F+ C + + + H++
Sbjct: 421 NNSISKTWEGYLQDCKFKGWFMKVCNDQDSLLRTLGQFDSEHFY 464
>sp|P13201|GB_HCMVT Envelope glycoprotein B OS=Human cytomegalovirus (strain Towne)
GN=gB PE=1 SV=1
Length = 907
Score = 39.3 bits (90), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 68/149 (45%), Gaps = 34/149 (22%)
Query: 1 MAMILPMP-AALQILMVPL----LNQLLWHRGLRLILLWLLVILPFLLSTDGTSSAAGKS 55
+A L P A I++V + + L++ R RL + L + P+L+S DGT+ +G +
Sbjct: 744 VATFLKNPFGAFTIILVAIAVVIIIYLIYTRQRRLCMQPLQNLFPYLVSADGTTVTSGNT 803
Query: 56 GSLSLDALAKAKKALQMQKELSEKLKKIATLSKGSSSDGSGKVQGPAATASDAAAAAAAA 115
SL A ++++ SG+ +GP +SDA+ AA
Sbjct: 804 KDTSLQAPPSYEESVY----------------------NSGR-KGPGPPSSDASTAA--- 837
Query: 116 SVQPPTSSVPAFPGLANITNIEAVKRAQE 144
PP ++ A+ L + ++A +RAQ+
Sbjct: 838 ---PPYTNEQAYQMLLALVRLDAEQRAQQ 863
>sp|P06473|GB_HCMVA Envelope glycoprotein B OS=Human cytomegalovirus (strain AD169)
GN=gB PE=1 SV=1
Length = 906
Score = 39.3 bits (90), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 67/149 (44%), Gaps = 34/149 (22%)
Query: 1 MAMILPMP-AALQILMVPL----LNQLLWHRGLRLILLWLLVILPFLLSTDGTSSAAGKS 55
+A L P A I++V + + L++ R RL L + P+L+S DGT+ +G +
Sbjct: 743 VATFLKNPFGAFTIILVAIAVVIITYLIYTRQRRLCTQPLQNLFPYLVSADGTTVTSGST 802
Query: 56 GSLSLDALAKAKKALQMQKELSEKLKKIATLSKGSSSDGSGKVQGPAATASDAAAAAAAA 115
SL A ++++ SG+ +GP +SDA+ AA
Sbjct: 803 KDTSLQAPPSYEESVY----------------------NSGR-KGPGPPSSDASTAA--- 836
Query: 116 SVQPPTSSVPAFPGLANITNIEAVKRAQE 144
PP ++ A+ L + ++A +RAQ+
Sbjct: 837 ---PPYTNEQAYQMLLALARLDAEQRAQQ 862
>sp|F5HB53|GB_HCMVM Envelope glycoprotein B OS=Human cytomegalovirus (strain Merlin)
GN=gB PE=3 SV=1
Length = 907
Score = 38.9 bits (89), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 67/149 (44%), Gaps = 34/149 (22%)
Query: 1 MAMILPMP-AALQILMVPL----LNQLLWHRGLRLILLWLLVILPFLLSTDGTSSAAGKS 55
+A L P A I++V + + L++ R RL L + P+L+S DGT+ +G +
Sbjct: 744 VATFLKNPFGAFTIILVAIAVVIITYLIYTRQRRLCTQPLQNLFPYLVSADGTTVTSGST 803
Query: 56 GSLSLDALAKAKKALQMQKELSEKLKKIATLSKGSSSDGSGKVQGPAATASDAAAAAAAA 115
SL A ++++ SG+ +GP +SDA+ AA
Sbjct: 804 KDTSLQAPPSYEESVY----------------------NSGR-KGPGPPSSDASTAA--- 837
Query: 116 SVQPPTSSVPAFPGLANITNIEAVKRAQE 144
PP ++ A+ L + ++A +RAQ+
Sbjct: 838 ---PPYTNEQAYQMLLALARLDAEQRAQQ 863
>sp|Q8VEK0|CC50A_MOUSE Cell cycle control protein 50A OS=Mus musculus GN=Tmem30a PE=1 SV=1
Length = 364
Score = 36.2 bits (82), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 533 GSKSIKRYGKLMLRRIDWAKAVKEEDEDEDETTDKPVNKCVLVWQGNVARPSFNRFF 589
G +S++ K ++ ++W KAV E D EDE+ + +N+ +VW A P+F + +
Sbjct: 221 GKESLEEKFKDTIKPVNWHKAVYELDP-EDESNNGFINEDFIVWMRTAALPTFRKLY 276
>sp|Q14LB9|PNP_SPICI Polyribonucleotide nucleotidyltransferase OS=Spiroplasma citri
GN=pnp PE=3 SV=1
Length = 702
Score = 35.0 bits (79), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 17/143 (11%)
Query: 282 EAGAKERQAKQAQLAKAKGGTDINPNLIEVAERVITKEKPKDPIPEIEWWDA-PLLLT-- 338
EAGAKE ++ L G D+ LI +I K P E+E + P ++T
Sbjct: 189 EAGAKEV-SENLMLQAILAGHDVIQQLIAFQHEIIAKVGV--PKMEVELFQVRPEIITYV 245
Query: 339 -GSYA-DISDDVTIEDKLKREKITIYVEHPRPIEPPAEPAPPPPQPLKLTKKEQKKLRTQ 396
+YA D+ I++K KR + +EH IE + P P+ L++KEQK+L +
Sbjct: 246 NNNYAKDLITAARIKEKTKRYE---TIEHL--IE---QAIKNYPMPVSLSEKEQKQLTVE 297
Query: 397 RRLAREK-DRQEMIRQGLIEPPK 418
+ A RQE+ RQ LI+ +
Sbjct: 298 LKTALHNIIRQEVRRQILIDKTR 320
>sp|Q8IDX6|RBP2A_PLAF7 Reticulocyte-binding protein 2 homolog a OS=Plasmodium falciparum
(isolate 3D7) GN=PF13_0198 PE=3 SV=1
Length = 3130
Score = 35.0 bits (79), Expect = 1.8, Method: Composition-based stats.
Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 11/98 (11%)
Query: 384 KLTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEK 443
+L K+E+ K + Q RL REK Q + L + +++ +K Q+ RL+K
Sbjct: 2796 RLQKEEELKRQEQERLEREKQEQLQKEEELKRQEQERLQKEEALK------RQEQERLQK 2849
Query: 444 EIRSAAAEREQAHIDRNIARKLTPAERREKKERKLFDD 481
E +EQ ++R +K+ AER + + KL D
Sbjct: 2850 E--EELKRQEQERLER---KKIELAEREQHIKSKLESD 2882
Score = 34.7 bits (78), Expect = 2.5, Method: Composition-based stats.
Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 384 KLTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEK 443
+L K+E+ K + Q RL REK Q + L + +++ +K Q+ RL+K
Sbjct: 2746 RLQKEEELKRQEQERLEREKQEQLQKEEELKRQEQERLQKEEALK------RQEQERLQK 2799
Query: 444 EIRSAAAEREQAHIDRNIARKLTPAE--RREKKER 476
E +EQ ++R +L E +R+++ER
Sbjct: 2800 E--EELKRQEQERLEREKQEQLQKEEELKRQEQER 2832
>sp|Q8VIM5|MYCD_MOUSE Myocardin OS=Mus musculus GN=Myocd PE=1 SV=2
Length = 935
Score = 34.3 bits (77), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 390 QKKLRT--QRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRS 447
++K R+ Q RL + + ++++ QGLI P K + + K L S T+D R + RS
Sbjct: 13 RRKFRSVLQLRLQQRRTQEQLANQGLIPPLKGPTEFHDPRKQLDSAKTEDSLRRKGRNRS 72
Query: 448 AAAEREQAHI-DRNIARKLTPAERREKKERKLFDD 481
A HI + A + P + + K +L DD
Sbjct: 73 DRASLVTMHILQASTAERSIPTAQMKLKRARLADD 107
>sp|Q8R5I7|MYCD_RAT Myocardin OS=Rattus norvegicus GN=Myocd PE=1 SV=2
Length = 938
Score = 34.3 bits (77), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 390 QKKLRT--QRRLAREKDRQEMIRQGLIEPPKPKVKMSNLMKVLGSEATQDPTRLEKEIRS 447
++K R+ Q RL + + ++++ QGLI P K + + K L S T+D R + RS
Sbjct: 13 RRKFRSVLQLRLQQRRTQEQLANQGLIPPLKSPTEFHDPRKKLDSAKTEDSLRRKVRNRS 72
Query: 448 AAAEREQAHI-DRNIARKLTPAERREKKERKLFDD 481
A HI + A + P + + K +L DD
Sbjct: 73 DRASLVNMHILQASTAERSIPTAQMKLKRARLADD 107
>sp|Q7S1I0|ATG26_NEUCR Sterol 3-beta-glucosyltransferase OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=atg-26 PE=3 SV=1
Length = 1553
Score = 33.5 bits (75), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 368 PIEPPAEPAPPPPQPLKLTKKEQKKLRTQRRLAREKDRQEMIRQGLIEPPKPKVKMSNLM 427
P E PA+P P P +P LTK+ + R++R+ R +R +P P +++ + +
Sbjct: 38 PSEAPAQPGPGPARPSALTKRPSHREVAAYRMSRKLQR---LRADSNQPHSPTMELPDRL 94
Query: 428 KVLGSE 433
K G E
Sbjct: 95 KDNGKE 100
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.131 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 235,217,652
Number of Sequences: 539616
Number of extensions: 10461703
Number of successful extensions: 55948
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 62
Number of HSP's successfully gapped in prelim test: 310
Number of HSP's that attempted gapping in prelim test: 54261
Number of HSP's gapped (non-prelim): 1654
length of query: 622
length of database: 191,569,459
effective HSP length: 124
effective length of query: 498
effective length of database: 124,657,075
effective search space: 62079223350
effective search space used: 62079223350
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 64 (29.3 bits)