BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013742
(437 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
Length = 608
Score = 273 bits (698), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
Length = 609
Score = 263 bits (672), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 242/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
SV=1
Length = 609
Score = 261 bits (666), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 171/440 (38%), Positives = 238/440 (54%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
D++WL+ P + +L++HG E+ L H + AN L + L I+FGTHH+K
Sbjct: 208 DVNWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTK 266
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLID 115
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P Q N + F+ DL
Sbjct: 267 MMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTS 326
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL P + ++ + S V LI S PG GS WGH +L
Sbjct: 327 YLMAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRL 376
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +LQ + P+V QFSS+GSL + KW+ +E S+ + E +TP
Sbjct: 377 RKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S+E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSSEP------------------------MASFPVPYDLPPELYGSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
GN=gkt PE=2 SV=1
Length = 580
Score = 168 bits (426), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 126/357 (35%), Positives = 182/357 (50%), Gaps = 35/357 (9%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSH 248
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 308
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 309 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 358
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WGH +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 359 WGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKL 417
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSRAM
Sbjct: 418 RQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAM 477
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
PHIK++ R+N Q + WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 478 PHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534
>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
GN=F52C12.1 PE=3 SV=1
Length = 451
Score = 166 bits (419), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 133/441 (30%), Positives = 206/441 (46%), Gaps = 83/441 (18%)
Query: 1 MVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
M+D ++L+ + P L + P LV+ L +N+ ++ LPI FGTHH+K
Sbjct: 75 MLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVTVVGAS-LPIPFGTHHTK 133
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
+L G +IV TANL+ DW K+Q + +F +K + F++DL++YLS
Sbjct: 134 MSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIASGTVPRSDFQDDLLEYLS 192
Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
+ +K +FS + RLI S PGYHT ++ GH +L +
Sbjct: 193 MYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGYHTDPPTQRPGHPRLFRI 241
Query: 179 LQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LSSSMSSGFSEDKTPLGIG 229
L E F+ ++ + V Q SS+GSL W L S + S + P +
Sbjct: 242 LSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQSLEGANPSPKQKPAKM- 300
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
+V+P+VEDVR S +GYA G ++P + + +L+ KW+++ R+ A+PH K
Sbjct: 301 --YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMCKWRSNAKRRTNAVPHCK 358
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCG 344
T+ +Y+ + W LLTSANLSKAAWG + KN QLMIRS+E+GVLI
Sbjct: 359 TYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRSWEMGVLI---------- 408
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
T+ S+ +P++ P YS+
Sbjct: 409 ----------------TDPSRFN-------------------------IPFDYPLVPYSA 427
Query: 405 EDVPWSWDKRYTKKDVYGQVW 425
D P+ DK++ K D+ G +W
Sbjct: 428 TDEPFVTDKKHEKPDILGCIW 448
>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
Length = 536
Score = 127 bits (320), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 133/498 (26%), Positives = 207/498 (41%), Gaps = 99/498 (19%)
Query: 2 VDIDWLLPAC------PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGT 55
VD+++LL V +I H +S L + P N L+ +P+ +GT
Sbjct: 62 VDLNFLLENMHASVFPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGT 120
Query: 56 HHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ--------------------- 93
HHSK M+ + +I++HTANL+ DW SQ ++
Sbjct: 121 HHSKIMVNFFKDDSCQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGN 180
Query: 94 ---------DFPLKDQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPA 131
+KD N + + FEN D + + +F A L
Sbjct: 181 PSKIRKHEGSLDIKDDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKN 240
Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 191
+ + K ++FS+ I SVPG G WG KL+ +L+ EK KK
Sbjct: 241 YRHTYELIEKLKMYDFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKD 298
Query: 192 P---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR 242
+ Q SS+GS K E + ++ GF + G ++PTV++V+
Sbjct: 299 EKTKFEESDICISQCSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQ 351
Query: 243 CSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--N 294
S+ G+ +G++I + V+ K KW A GR R PHIKT+ R+ +
Sbjct: 352 QSMLGWQSGSSIHFNILGKTAASQVETLKKGKNLCKWVAMKAGRQRVAPHIKTYMRFSND 411
Query: 295 GQKLAWFLLTSANLSKAAWGALQKNNSQ------LMIRSYELGVLILPSAKRHGCGFSCT 348
G+ L W L+TSANLSK AWG L+ + ++ L IRSYE GVL+ P C
Sbjct: 412 GELLRWVLVTSANLSKPAWGTLEGHKAKSRSTRGLRIRSYEAGVLLYPKLFEESQRAPC- 470
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
I+ K+ + + ++ ++G V+ + + ++ PP Y +D
Sbjct: 471 --IMTPTYKTNTPNLDEKRR------EFYG-------KRVIGVRMCWDFPPVEYEDKDEI 515
Query: 409 WSWDKRYTKKDVYGQVWP 426
WS T KD G VWP
Sbjct: 516 WSPVINRTDKDWLGYVWP 533
>sp|P38319|TYDP1_YEAST Tyrosyl-DNA phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=TDP1 PE=1 SV=1
Length = 544
Score = 35.8 bits (81), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 5/38 (13%)
Query: 296 QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 333
++L W L TSANLS+ AWG + + R+YE GVL
Sbjct: 454 KELEWCLYTSANLSQTAWGTVSRKP-----RNYEAGVL 486
>sp|Q5E0C4|HUTG_VIBF1 Formimidoylglutamase OS=Vibrio fischeri (strain ATCC 700601 /
ES114) GN=hutG PE=3 SV=1
Length = 350
Score = 33.1 bits (74), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 301 FLLTSANLSKAAW-GALQKNNSQLMIRSYELGVLILPSAKRHGC---GFSCTSNIVPSEI 356
F+ +++++ +W G + + +L +R ++ V + S + G GF+C ++ ++
Sbjct: 5 FMKNNSSVNMTSWMGRVDHEDGELGLRWHQ-KVKVTNSTNQDGIMLLGFACDEGVIRNKG 63
Query: 357 KSGSTETSQIQKTKLVTLTWHGSSDA 382
+ G+ Q+ + L L WH SD
Sbjct: 64 RKGAYAAPQVIRRALANLAWHHQSDV 89
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.133 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 174,700,564
Number of Sequences: 539616
Number of extensions: 7493048
Number of successful extensions: 14813
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 14766
Number of HSP's gapped (non-prelim): 12
length of query: 437
length of database: 191,569,459
effective HSP length: 120
effective length of query: 317
effective length of database: 126,815,539
effective search space: 40200525863
effective search space used: 40200525863
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)