BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011198
(491 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
Length = 608
Score = 184 bits (466), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 134/427 (31%), Positives = 217/427 (50%), Gaps = 39/427 (9%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGR 479
A +GR
Sbjct: 481 SAETSGR 487
>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
Length = 609
Score = 177 bits (449), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 180/337 (53%), Gaps = 30/337 (8%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
LEGY AG ++P + +K +L Y+ KW A +GR
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGR 488
>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
SV=1
Length = 609
Score = 174 bits (442), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 117/344 (34%), Positives = 177/344 (51%), Gaps = 44/344 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D++WL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P Q N + F+ DL YL P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ---------EC 381
++ + S V LI S PG GS WGH +LR +LQ EC
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392
Query: 382 TFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPT 436
P+V QFSS+GSL + KW+ +E S+ + E +TP PL +++P+
Sbjct: 393 W--------PVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPS 444
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
VE+VR SLEGY AG ++P + +K +L Y+ KW A +GR
Sbjct: 445 VENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGR 488
>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
GN=gkt PE=2 SV=1
Length = 580
Score = 110 bits (276), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 146/313 (46%), Gaps = 30/313 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K P +L+ ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DF 294
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P+ E GF+ DL+ YL K + + + + +FS+ V
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGR 479
Y +WK+S R
Sbjct: 461 DYLQQWKSSDRFR 473
>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
GN=F52C12.1 PE=3 SV=1
Length = 451
Score = 105 bits (263), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 150/312 (48%), Gaps = 31/312 (9%)
Query: 185 DVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
D I DI I ++ ++M+D ++L+ + P L + P LV+ L +N+
Sbjct: 58 DEILADIRPINSLHFSFMLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVT 117
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
++ LPI FGTHH+K +L G +IV TANL+ DW K+Q + +F +K +
Sbjct: 118 VVGAS-LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIAS 175
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
F++DL++YLS + +K +FS + RLI S PGY
Sbjct: 176 GTVPRSDFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGY 224
Query: 361 HTGSSLKKWGHMKLRTVLQE-CTFEKGF---KKSPLVYQFSSLGSLDE---KWMAE--LS 411
HT ++ GH +L +L E F+ + ++ V Q SS+GSL W L
Sbjct: 225 HTDPPTQRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQ 284
Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWA 470
S + S + P + +V+P+VEDVR S +GYA G ++P + + +L+
Sbjct: 285 SLEGANPSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMC 341
Query: 471 KWKASHTGRRYN 482
KW+ S+ RR N
Sbjct: 342 KWR-SNAKRRTN 352
>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
Length = 536
Score = 69.7 bits (169), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 153/391 (39%), Gaps = 76/391 (19%)
Query: 156 SRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC---- 210
S + + S L ++ LP N C+ ++ +I + N+ VD+++LL
Sbjct: 16 SNEIIDSPIFLNKISALPESENVHCLLLKQLIGSPQLKQTWQFNFCVDLNFLLENMHASV 75
Query: 211 --PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-G 267
V +I H +S L + P N L+ +P+ +GTHHSK M+ +
Sbjct: 76 FPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGTHHSKIMVNFFKDDS 134
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQ------------------------------DFPLK 297
+I++HTANL+ DW SQ ++ +K
Sbjct: 135 CQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGNPSKIRKHEGSLDIK 194
Query: 298 DQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
D N + + FEN D + + +F A L + + K +
Sbjct: 195 DDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKNYRHTYELIEKLKMY 254
Query: 345 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQ 395
+FS+ I SVPG G WG KL+ +L+ EK KK + Q
Sbjct: 255 DFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKDEKTKFEESDICISQ 312
Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 453
SS+GS K E + ++ GF + G ++PTV++V+ S+ G+ +G++I
Sbjct: 313 CSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQQSMLGWQSGSSIHF 365
Query: 454 ----PSPQKNVDKDFLKKYWAKWKASHTGRR 480
+ V+ K KW A GR+
Sbjct: 366 NILGKTAASQVETLKKGKNLCKWVAMKAGRQ 396
>sp|Q7TT00|SP20H_MOUSE Transcription factor SPT20 homolog OS=Mus musculus GN=Supt20h PE=1
SV=1
Length = 530
Score = 35.4 bits (80), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 71/186 (38%), Gaps = 38/186 (20%)
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
E +L++YL + P +L I F V IA + Y S++K
Sbjct: 108 EGELLEYLDAEELPPILVDLLEKSQVNI---------FHCGCV--IAEIRDYRQSSNMKS 156
Query: 369 WG----HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
G H+ LR +Q LV S+ S + KW E E +
Sbjct: 157 PGYQSRHILLRPTMQT-----------LVCDVHSITSDNHKWTQE-----DKLLLESQLI 200
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRRYNID 484
L EPL + P+V V C+ N + ++ ++ +K+ W ++ S R+ ++
Sbjct: 201 LATAEPLCLDPSVA-VACT------ANRLLYNRQKMNTRPMKRCWKRYSRSSLNRQQDLS 253
Query: 485 LYLPPP 490
PPP
Sbjct: 254 HGPPPP 259
>sp|Q8IW19|APLF_HUMAN Aprataxin and PNK-like factor OS=Homo sapiens GN=APLF PE=1 SV=1
Length = 511
Score = 34.7 bits (78), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 23 PKLPLSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRK- 80
P++ L+ G VIGR + ++DKR+SR+H L A G + TNP +S ++ +
Sbjct: 14 PRVALAPGETVIGRGPLLGITDKRVSRRHAILEV-AGGQLRIKPIHTNPCFYQSSEKSQL 72
Query: 81 -KLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
L N + GD L+ + F+ +++
Sbjct: 73 LPLKPNLWCYLNPGDSFSLLVDKYIFRILSI 103
>sp|Q9V677|ECM29_DROME Proteasome-associated protein ECM29 homolog OS=Drosophila
melanogaster GN=CG8858 PE=1 SV=1
Length = 1890
Score = 33.1 bits (74), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 57/133 (42%), Gaps = 13/133 (9%)
Query: 31 PNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRKKLSSNE---- 86
PNV +P D + +T S D S LV+ G +++S R+K+ +
Sbjct: 814 PNVPVEIQVPSKDGDDDDEEMTEYTSIDSSTKLVIFGVVFQLLRSSSARQKIREDSARCL 873
Query: 87 -HVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKN 145
+++I DG E + K++TL++ QK D A N +S + D G+
Sbjct: 874 GYLAIGDG---EHFTKRNLDKFLTLTKIQK----DAALNIAISEAIVNTLCGYDVNKGQ- 925
Query: 146 SEEALCNFHVSRD 158
+E N H + D
Sbjct: 926 PDEKFVNKHCNDD 938
>sp|Q9D842|APLF_MOUSE Aprataxin and PNK-like factor OS=Mus musculus GN=Aplf PE=2 SV=2
Length = 499
Score = 32.7 bits (73), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 8/47 (17%)
Query: 8 YLVPLDNNLREDNSLPKLPLSQGPNVIGRTN-IPVSDKRLSRKHITL 53
+L PLD P++P+ G VIGR + ++DKR+SR+H L
Sbjct: 6 FLQPLDGG-------PRVPVGPGQTVIGRGPLLGITDKRVSRRHAIL 45
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.134 0.410
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,377,963
Number of Sequences: 539616
Number of extensions: 8843611
Number of successful extensions: 19939
Number of sequences better than 100.0: 12
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 19905
Number of HSP's gapped (non-prelim): 18
length of query: 491
length of database: 191,569,459
effective HSP length: 122
effective length of query: 369
effective length of database: 125,736,307
effective search space: 46396697283
effective search space used: 46396697283
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.9 bits)