BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000465
(1478 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8NDF8|PAPD5_HUMAN PAP-associated domain-containing protein 5 OS=Homo sapiens GN=PAPD5
PE=1 SV=2
Length = 572
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 120 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 178
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 179 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 214
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 215 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 236
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 237 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 276
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 277 FTGGIGSYSLFLMAVSFLQ 295
>sp|Q68ED3|PAPD5_MOUSE PAP-associated domain-containing protein 5 OS=Mus musculus GN=Papd5
PE=2 SV=2
Length = 633
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 134 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 192
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 193 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 228
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 229 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 250
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 251 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 290
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 291 FTGGIGSYSLFLMAVSFLQ 309
>sp|G5EFL0|GLD4_CAEEL Poly(A) RNA polymerase gld-4 OS=Caenorhabditis elegans GN=gld-4 PE=1
SV=1
Length = 845
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF + G++ + ++ E+FP PL L+LKQFL R+L+Q+++GGLSSY
Sbjct: 191 TRLSIDISFNTVQ--GVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 248
Query: 1456 CLMLLITRFLQ 1466
L+LL+ F Q
Sbjct: 249 GLVLLLVNFFQ 259
>sp|Q9UTN3|CID14_SCHPO Poly(A) RNA polymerase cid14 OS=Schizosaccharomyces pombe (strain 972
/ ATCC 24843) GN=cid14 PE=1 SV=2
Length = 684
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V +DISF P GL+T +V +++PA PL +++K FL R+L++ + GGLSSY
Sbjct: 351 TKVHVDISFNQPG--GLKTCLVVNGFMKKYPALRPLVIIIKHFLNMRALNEVFLGGLSSY 408
Query: 1456 CLMLLITRFLQ 1466
++ L+ FLQ
Sbjct: 409 AIVCLVVSFLQ 419
Score = 41.6 bits (96), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 9/91 (9%)
Query: 1194 LEVQNCP-TRKASLSL-MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQV 1248
L+ Q+CP R+ + + + H++I F + E+ RK V R+ +++
Sbjct: 220 LDSQSCPWHRQYKVEREVSRIFHQDILHFIDYITPTPEEHAVRKTL----VSRINQAVLQ 275
Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
WP +FGS T L LP+SD+DLV+ P
Sbjct: 276 KWPDVSLYVFGSFETKLYLPTSDLDLVIISP 306
>sp|Q5XG87|PAPD7_HUMAN DNA polymerase sigma OS=Homo sapiens GN=PAPD7 PE=1 SV=2
Length = 542
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>sp|Q6PB75|PAPD7_MOUSE DNA polymerase sigma OS=Mus musculus GN=Papd7 PE=2 SV=1
Length = 542
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>sp|P53632|PAP2_YEAST Poly(A) RNA polymerase protein 2 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=PAP2 PE=1 SV=1
Length = 584
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +++ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 201 NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 250
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA + ++ V +PII V PH
Sbjct: 251 RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPH-------------- 287
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ I+AA
Sbjct: 288 -----------SGIHIDV-----------SFERTNGIEAAK------------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 307 -LIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 358
>sp|P48561|TRF5_YEAST Poly(A) RNA polymerase protein 1 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=TRF5 PE=1 SV=2
Length = 642
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>sp|Q9HFW3|TRF5_ASHGO Poly(A) RNA polymerase protein 1 OS=Ashbya gossypii (strain ATCC
10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=TRF5
PE=3 SV=1
Length = 626
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 87/231 (37%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N A+KR+ ++Q WP + + FGS AT L LP SD+D VV ++G +
Sbjct: 217 NDALKRIRDAVQDFWPDANLHCFGSYATDLYLPGSDIDCVVN----------SKSGDKDN 266
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
+N + LA Q + + +PII V
Sbjct: 267 KNALYSLASYLKRNGLATQ-------VSVIAKARVPIIKFV------------------- 300
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
E A+ +H D+ S ++ ++AA +R L T
Sbjct: 301 EPAS--------QIHIDL-----------SFERTNGVEAAKIIR----------GWLHDT 331
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL L++KQFL R L+ + GGL + ++ L FL+
Sbjct: 332 PGLRELV----------LIVKQFLHARRLNDVHIGGLGGFSIICLAYSFLK 372
>sp|B2RX14|TUT4_MOUSE Terminal uridylyltransferase 4 OS=Mus musculus GN=Zcchc11 PE=1 SV=2
Length = 1644
Score = 42.0 bits (97), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 499 TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 552
>sp|Q5TAX3|TUT4_HUMAN Terminal uridylyltransferase 4 OS=Homo sapiens GN=ZCCHC11 PE=1 SV=3
Length = 1644
Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 27/54 (50%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P PL L + + +D GG+ SYC L++ FLQ
Sbjct: 479 TTDLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532
>sp|Q9VD44|GLD2A_DROME Poly(A) RNA polymerase gld-2 homolog A OS=Drosophila melanogaster
GN=Gld2 PE=1 SV=3
Length = 1364
Score = 37.7 bits (86), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 40/69 (57%), Gaps = 2/69 (2%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
V +DI+F + G++ T L+ ++ P+AL +KQ+ +++ + + +SSY L
Sbjct: 1051 VEVDINFNN--SVGIRNTHLLYCYSQLDWRVRPMALTVKQWAQYHNINNAKNMTISSYSL 1108
Query: 1458 MLLITRFLQ 1466
ML++ FLQ
Sbjct: 1109 MLMVIHFLQ 1117
>sp|Q503I9|GLD2_DANRE Poly(A) RNA polymerase GLD2 OS=Danio rerio GN=papd4 PE=2 SV=1
Length = 489
Score = 36.2 bits (82), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V D++F + G++ T L++ PL LV+K++ ++ + G LSSY
Sbjct: 274 SGVEFDLNFNN--TVGIRNTFLLRTYAFVEKRVRPLVLVIKKWANHHCINDASRGTLSSY 331
Query: 1456 CLMLLITRFLQ 1466
L+L++ +LQ
Sbjct: 332 TLVLMVLHYLQ 342
>sp|Q2HJ44|GLD2_BOVIN Poly(A) RNA polymerase GLD2 OS=Bos taurus GN=PAPD4 PE=2 SV=1
Length = 484
Score = 34.7 bits (78), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 1410 TGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
G++ T L++ PL LV+K++ + ++ + G LSSY L+L++ +LQ
Sbjct: 286 VGIRNTFLLRTYAYLENRVRPLVLVIKKWASHHDINDASRGTLSSYSLVLMVLHYLQ 342
>sp|Q6PIY7|GLD2_HUMAN Poly(A) RNA polymerase GLD2 OS=Homo sapiens GN=PAPD4 PE=1 SV=1
Length = 484
Score = 34.3 bits (77), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 1410 TGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
G++ T L++ PL LV+K++ + ++ + G LSSY L+L++ +LQ
Sbjct: 286 VGIRNTFLLRTYAYLENRVRPLVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQ 342
>sp|Q5U315|GLD2_RAT Poly(A) RNA polymerase GLD2 OS=Rattus norvegicus GN=Papd4 PE=2 SV=1
Length = 484
Score = 34.3 bits (77), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 69/151 (45%), Gaps = 13/151 (8%)
Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPL 1191
S+E R+DG + S + + ++ PLSG D ++ L P++ +
Sbjct: 79 SDEKAFRLDGKRQRFHSPHQEPTIINQLV-PLSG----DRRYSMPPLFHTHYVPEIVRCV 133
Query: 1192 QPL-EVQNCPTRKASLSLMHSLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQ 1247
PL E+ R+ +L L ++I C+Q A++ ++ ++ R +Q
Sbjct: 134 PPLREIPLLEPREITLPEAKDKLSQQILELFETCQQQASDLKKKE----LCRAQLQREIQ 189
Query: 1248 VLWPRSRTNIFGSNATGLSLPSSDVDLVVCL 1278
+L+P+SR + GS+ G SSD DL + +
Sbjct: 190 LLFPQSRLFLVGSSLNGFGARSSDGDLCLVV 220
Score = 33.9 bits (76), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 1410 TGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
G++ T L++ PL LV+K++ + ++ + G LSSY L+L++ +LQ
Sbjct: 286 VGIRNTFLLRTYAYLENRVRPLVLVIKKWASHHEINDASRGTLSSYSLVLMVLHYLQ 342
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 552,524,022
Number of Sequences: 539616
Number of extensions: 23877940
Number of successful extensions: 74464
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 117
Number of HSP's that attempted gapping in prelim test: 73641
Number of HSP's gapped (non-prelim): 791
length of query: 1478
length of database: 191,569,459
effective HSP length: 130
effective length of query: 1348
effective length of database: 121,419,379
effective search space: 163673322892
effective search space used: 163673322892
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 68 (30.8 bits)