BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008447
(565 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6NYV9|LEO1_DANRE RNA polymerase-associated protein LEO1 OS=Danio rerio GN=leo1 PE=1
SV=1
Length = 696
Score = 130 bits (326), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 129/230 (56%), Gaps = 11/230 (4%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 234
+E+EIP + + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 336 IEVEIP-KVSTDLGSDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 394
Query: 235 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 294
N +RWR+ ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 395 ENTIRWRSRRDEEGNEVKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 454
Query: 295 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 353
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 455 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRNEM 513
Query: 354 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE-DALEEDDE 402
+ E + +RA++ +RE R+ + + +R L+ GYLE D +ED+E
Sbjct: 514 IKKEEERLRASI----RRESQQRRMREK-QHQRGLNAGYLEPDRYDEDEE 558
>sp|Q52KV5|LEO1_XENLA RNA polymerase-associated protein LEO1 OS=Xenopus laevis GN=leo1
PE=2 SV=1
Length = 703
Score = 129 bits (324), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 121/221 (54%), Gaps = 10/221 (4%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 234
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 389 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 447
Query: 235 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 294
N +RWRT K+++G +SNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 448 ENTIRWRTRKDEEGNDVRDSNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 507
Query: 295 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 353
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 508 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPESQRSEM 566
Query: 354 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 394
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 567 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSANYLE 602
>sp|Q8WVC0|LEO1_HUMAN RNA polymerase-associated protein LEO1 OS=Homo sapiens GN=LEO1 PE=1
SV=1
Length = 666
Score = 125 bits (315), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 234
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 416
Query: 235 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 294
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 417 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 476
Query: 295 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 353
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 477 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 535
Query: 354 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 394
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 536 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>sp|Q641X2|LEO1_RAT RNA polymerase-associated protein LEO1 OS=Rattus norvegicus GN=Leo1
PE=2 SV=1
Length = 678
Score = 125 bits (315), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 234
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 370 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 428
Query: 235 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 294
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 429 ENTIRWRMRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 488
Query: 295 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 353
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 489 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 547
Query: 354 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 394
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 548 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 583
>sp|Q5XJE5|LEO1_MOUSE RNA polymerase-associated protein LEO1 OS=Mus musculus GN=Leo1 PE=1
SV=2
Length = 667
Score = 125 bits (315), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 120/221 (54%), Gaps = 10/221 (4%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESG-AKKRIRLE- 234
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + R++L+
Sbjct: 359 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKV 417
Query: 235 NNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHG 294
N +RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R G
Sbjct: 418 ENTIRWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQG 477
Query: 295 KGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEK 353
G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 478 TG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTEM 536
Query: 354 ERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 394
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 537 IKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 572
>sp|Q5R4D6|LEO1_PONAB RNA polymerase-associated protein LEO1 OS=Pongo abelii GN=LEO1 PE=2
SV=1
Length = 666
Score = 124 bits (310), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/222 (34%), Positives = 122/222 (54%), Gaps = 12/222 (5%)
Query: 177 LELEIPLRPPPADPTKMNMIKVSNIMGIDPKPFDPKTYVEEDT---FVTDESGAKKRIRL 233
+E+EIP + + +K+ N + ++P+PFDP+ Y +E + E + ++++
Sbjct: 358 IEVEIP-KVNTDLGNDLYFVKLPNFLSVEPRPFDPQYYEDEFEDEEMLDGEGRTRLKLKV 416
Query: 234 ENNIVRWRTVKNKDGMTSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRH 293
EN I RWR ++++G ESNAR V+WSDGS+ L +GNEV D+ Q D HLF+R
Sbjct: 417 ENTI-RWRIRRDEEGNEIKESNARIVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQ 475
Query: 294 GKGILQSQGRILKKMRFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEE 352
G G LQ Q K+ F P S S +HR +T + R K K++ + DPE ++ E
Sbjct: 476 GTG-LQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQRTE 534
Query: 353 KERAESQNIRANVLLNRKREKINRKYTQTVERRRQLSTGYLE 394
+ E + +RA++ +RE R+ + + +R LS YLE
Sbjct: 535 MIKKEEERLRASI----RRESQQRRMREK-QHQRGLSASYLE 571
>sp|Q94546|ATU_DROME Another transcription unit protein OS=Drosophila melanogaster
GN=Atu PE=1 SV=2
Length = 725
Score = 114 bits (284), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 96/175 (54%), Gaps = 12/175 (6%)
Query: 195 MIKVSNIMGIDPKPFDPKTYV----EEDTFVTDESGAKKRIRLE-NNIVRWRTVKNKDGM 249
IK+ N + + PFDP+TY EE+T DE G ++RI+L+ +N +RWR N G
Sbjct: 397 FIKLPNFLSVVTHPFDPETYEDEIDEEETM--DEEG-RQRIKLKVSNTIRWREYMNNKGD 453
Query: 250 TSYESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMR 309
ESNARFVRWSDGS+ L +GNE+ D Q D HLF+R G G LQ Q K+
Sbjct: 454 MVRESNARFVRWSDGSMSLHLGNEIFDAYRQPLLGDHNHLFVRQGTG-LQGQSVFRTKLT 512
Query: 310 FIPSSLSSNSHRLLTALVDSRHKKVYKVKNCITDI--DPEREKEEKERAESQNIR 362
F P S S +H+ +T + R K +K +T + DP ++ + R E +R
Sbjct: 513 FRPHSTESFTHKKMTMSLADRSSKTSGIK-ILTQVGKDPTTDRPTQLREEEAKLR 566
>sp|Q54MB8|LEO1_DICDI RNA polymerase-associated protein LEO1 OS=Dictyostelium discoideum
GN=leo1 PE=3 SV=1
Length = 487
Score = 105 bits (261), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 94/155 (60%), Gaps = 5/155 (3%)
Query: 192 KMNMIKVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKKRIRLENNIVRWRTVKNKDGMTS 251
K+ +K+ NI+GI PKPFDP T+ +E+ DE K + +E+ ++RWR + +G +
Sbjct: 181 KIMKLKLLNILGIQPKPFDPITFEDEEAMNGDE---KSKFNVES-VIRWRWGLDLNGRPA 236
Query: 252 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFI 311
ESN R V WSDGS L IGNEVL+I Q Q++Q +++ G ++ +G+I ++
Sbjct: 237 KESNTRLVTWSDGSSHLYIGNEVLEIKEQPLQNEQFYVYSSQ-DGFIECEGKIDSRLSIR 295
Query: 312 PSSLSSNSHRLLTALVDSRHKKVYKVKNCITDIDP 346
P+++ S H+ L+ V R KV K+K+ T +DP
Sbjct: 296 PTNIKSKVHQRLSENVAKRTVKVSKIKSIHTTLDP 330
>sp|P38439|LEO1_YEAST RNA polymerase-associated protein LEO1 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=LEO1 PE=1 SV=1
Length = 464
Score = 92.8 bits (229), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 103/190 (54%), Gaps = 17/190 (8%)
Query: 197 KVSNIMGIDPKPFDPKTYVEEDTFVTDESGAKK-----RIRLENNIVRWRTVKNKDGMTS 251
++ N + IDP PFDP ++ + S +++ R+ ++ N VRWR ++KD
Sbjct: 172 RIPNFLTIDPIPFDPPSFEAKVNERASNSASREDQLDDRL-IDENTVRWRYSRDKDQHVF 230
Query: 252 YESNARFVRWSDGSLQLQIGNEVLDITVQDAQHDQAHLFLRHGKG-ILQSQ--GRILKKM 308
ESN + V+WSDG+ L++G E DI V D + L + H + ++Q G I K +
Sbjct: 231 KESNTQIVQWSDGTYSLKVGEECTDILVNDTSN--TFLTVSHDQQELIQCYEGGEIKKTL 288
Query: 309 RFIPSSLSSNSHRLLTALVDSRHKKVYKVKNC-ITDIDPEREKEEKERAESQNIRANVLL 367
FIP+S +S H+ L+ V R+++ K I +DPE EK+E ER +SQ +L
Sbjct: 289 MFIPTSTNSKIHQKLSKAVIRRNQRQSKGPGTYIVSMDPEVEKKELERKQSQ-----ILR 343
Query: 368 NRKREKINRK 377
+R+R ++ K
Sbjct: 344 DRRRRQLKEK 353
>sp|Q9P6R2|YOH8_SCHPO Uncharacterized protein C13E7.08c OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPBC13E7.08c PE=3 SV=1
Length = 429
Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 102/223 (45%), Gaps = 37/223 (16%)
Query: 153 EDMHYESEEEHVEPKHKEKPVGPPLELEIP-LRPPPADPTKMNMIKVSNIMGIDPKPFDP 211
+D + + EE E + PV LE+ +P + P + + + N + ++ P+DP
Sbjct: 47 DDGLFSNTEEATEAPEADVPVKKVLEVAVPNFKSPASASNDVFHAHIPNFLSVEQTPYDP 106
Query: 212 KTYVEEDTFVTD----ESGAKKRIRLE-NNIVRWRTVKNKDGMTSYESNARFVRWSDGSL 266
+ Y E ++ +RI+ + +N VRWR + SY+SNA+ V+WSDGS
Sbjct: 107 EQYAAEAEADAALLEHDAHWGQRIKHKVDNTVRWRLGPSG----SYQSNAQIVQWSDGSY 162
Query: 267 QLQIGNEVLDI--------TVQDAQHDQAHLFLRHGKGILQSQGRILKKMRFIPSSLSSN 318
L+IGN++ D T A H+ HL L+ Q F+PS++++
Sbjct: 163 SLRIGNDIYDTQNKLISQPTFVTASHEAQHL--------LRVQTSFKSSFTFLPSAINTA 214
Query: 319 ------SHRLLTALVDSRHKKVYKVKNCITDIDPEREKEEKER 355
S RL T V SR V+ I + DPE K + E+
Sbjct: 215 TRSKLPSMRLTTVQVPSR-----SVQEIIIEKDPELLKRQAEK 252
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.306 0.127 0.348
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 232,626,010
Number of Sequences: 539616
Number of extensions: 11536948
Number of successful extensions: 156919
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1674
Number of HSP's successfully gapped in prelim test: 2924
Number of HSP's that attempted gapping in prelim test: 66108
Number of HSP's gapped (non-prelim): 40614
length of query: 565
length of database: 191,569,459
effective HSP length: 123
effective length of query: 442
effective length of database: 125,196,691
effective search space: 55336937422
effective search space used: 55336937422
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 64 (29.3 bits)