BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014528
(423 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
Length = 608
Score = 259 bits (661), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/399 (41%), Positives = 224/399 (56%), Gaps = 54/399 (13%)
Query: 27 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LK 84
N L + L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P +
Sbjct: 247 NISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIA 306
Query: 85 DQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 142
D + S E F+ DLI YL P + K + S V LI
Sbjct: 307 DGTHKSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIG 356
Query: 143 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AEL 197
S PG GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E
Sbjct: 357 STPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEF 416
Query: 198 SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKY 255
SM + E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y
Sbjct: 417 KESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSY 476
Query: 256 WAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRS 313
+ KW A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRS
Sbjct: 477 FHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRS 536
Query: 314 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 373
YELGVL LPSA F S V + +GS E
Sbjct: 537 YELGVLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------ 566
Query: 374 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 567 MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
Length = 609
Score = 250 bits (639), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 160/400 (40%), Positives = 219/400 (54%), Gaps = 54/400 (13%)
Query: 26 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLK 84
AN L + L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P
Sbjct: 247 ANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRI 306
Query: 85 DQNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 141
DQ + + F+ DL YL+ P + ++ + S V LI
Sbjct: 307 DQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI----------DIIQEHDLSETNVYLI 356
Query: 142 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AE 196
S PG GS WGH +LR +LQ + P+V QFSS+GSL + KW+ +E
Sbjct: 357 GSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKGECWPIVGQFSSIGSLGPDESKWLCSE 416
Query: 197 LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKK 254
S+ + E + P PL +++P+VE+VR SLEGY AG ++P + +K +L
Sbjct: 417 FKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHS 476
Query: 255 YWAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIR 312
Y+ KW A +GRS AMPHIKT+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIR
Sbjct: 477 YFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIR 536
Query: 313 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 372
SYELGVL LPSA F + V + S S E +
Sbjct: 537 SYELGVLFLPSA------FGLDTFKVKQKFFSSSCEPT---------------------- 568
Query: 373 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 569 --ASFPVPYDLPPELYRSKDRPWIWNIPYVKAPDTHGNMW 606
>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
SV=1
Length = 609
Score = 250 bits (638), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 161/400 (40%), Positives = 219/400 (54%), Gaps = 54/400 (13%)
Query: 26 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLK 84
AN L + L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P
Sbjct: 247 ANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRI 306
Query: 85 DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 141
Q N + F+ DL YL P + ++ + S V LI
Sbjct: 307 YQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI----------DIIQEHDLSETNVYLI 356
Query: 142 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AE 196
S PG GS WGH +LR +LQ + P+V QFSS+GSL + KW+ +E
Sbjct: 357 GSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSE 416
Query: 197 LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKK 254
S+ + E +TP PL +++P+VE+VR SLEGY AG ++P + +K +L
Sbjct: 417 FKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHP 476
Query: 255 YWAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIR 312
Y+ KW A +GRS AMPHIKT+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIR
Sbjct: 477 YFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIR 536
Query: 313 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 372
SYELGVL LPSA F + V + S S+E
Sbjct: 537 SYELGVLFLPSA------FGLDTFKVKQKFFSSSSEP----------------------- 567
Query: 373 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 568 -MASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 606
>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
GN=gkt PE=2 SV=1
Length = 580
Score = 162 bits (411), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 119/342 (34%), Positives = 174/342 (50%), Gaps = 34/342 (9%)
Query: 2 GIL---LLLFYQTTWWTLIGCCQRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRI 57
GIL LLL Y L+ + + I K P P F T H+K M L Y G +R+
Sbjct: 206 GILDKPLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSHTKMMFLGYSDGSMRV 263
Query: 58 IVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFS 112
++ TANL DW+N++QGLW+ P+ E GF+ DL+ YL K +
Sbjct: 264 VISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQDLMLYLVEYKISQLQ 323
Query: 113 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTF 170
+ + + +FS+ V + SVPG H S++ WGH +L ++L +
Sbjct: 324 PWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHPWGHARLASLLAKHAA 373
Query: 171 EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVED 226
+ P+V Q SS+GSL A + + +D TP+G + +++P+ +
Sbjct: 374 PID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGN 432
Query: 227 VRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQK 283
V S +G G +P + DK +LK Y +WK+S RSRAMPHIK++ R+N Q
Sbjct: 433 VAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQS 492
Query: 284 LAWFLLTSANLSKAAWGALQKNNSQ---LMIRSYELGVLILP 322
+ WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 493 VYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534
>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
GN=F52C12.1 PE=3 SV=1
Length = 451
Score = 161 bits (407), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 123/392 (31%), Positives = 185/392 (47%), Gaps = 81/392 (20%)
Query: 35 LPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 93
LPI FGTHH+K +L G +IV TANL+ DW K+Q + +F +K +
Sbjct: 123 LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIASGTVPRS 181
Query: 94 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 153
F++DL++YLS + +K +FS + RLI S PGYHT
Sbjct: 182 DFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGYHTDPPT 230
Query: 154 KKWGHMKLRTVLQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LSSSMSSG 204
++ GH +L +L E F+ ++ + V Q SS+GSL W L S +
Sbjct: 231 QRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQSLEGAN 290
Query: 205 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASH 263
S + P + +V+P+VEDVR S +GYA G ++P + + +L+ KW+++
Sbjct: 291 PSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMCKWRSNA 347
Query: 264 TGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVL 319
R+ A+PH KT+ +Y+ + W LLTSANLSKAAWG + KN QLMIRS+E+GVL
Sbjct: 348 KRRTNAVPHCKTYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRSWEMGVL 407
Query: 320 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 379
I T+ S+ +
Sbjct: 408 I--------------------------TDPSRFN-------------------------I 416
Query: 380 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 411
P++ P YS+ D P+ DK++ K D+ G +W
Sbjct: 417 PFDYPLVPYSATDEPFVTDKKHEKPDILGCIW 448
>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
Length = 536
Score = 125 bits (313), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 192/455 (42%), Gaps = 93/455 (20%)
Query: 25 PANWILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ---- 79
P N L+ +P+ +GTHHSK M+ + +I++HTANL+ DW SQ ++
Sbjct: 105 PVNVKLYSVYVPM-WGTHHSKIMVNFFKDDSCQIVIHTANLVEPDWIGMSQAIFKTPLLY 163
Query: 80 --------------------------DFPLKDQNN---LSEECGFEN----------DLI 100
+KD N + + FEN D +
Sbjct: 164 PKANDSLSTSSVPEYGNPSKIRKHEGSLDIKDDRNCDIIDVDSAFENFKHKSDTRSSDDL 223
Query: 101 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 160
+ +F A L + + K ++FS+ I SVPG G WG K
Sbjct: 224 GVIGRQFQQDFLAYLKNYRHTYELIEKLKMYDFSAIRAIFIGSVPGKFEGEEESSWGLGK 283
Query: 161 LRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 211
L+ +L+ EK KK + Q SS+GS K E + ++ GF +
Sbjct: 284 LKKILK--MLEKDSKKDEKTKFEESDICISQCSSMGSFGPK--QEYIAELTDGFGCQR-- 337
Query: 212 LGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTG 265
G ++PTV++V+ S+ G+ +G++I + V+ K KW A G
Sbjct: 338 ---GNWKFLFPTVKEVQQSMLGWQSGSSIHFNILGKTAASQVETLKKGKNLCKWVAMKAG 394
Query: 266 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQ------LMIRSYELG 317
R R PHIKT+ R+ +G+ L W L+TSANLSK AWG L+ + ++ L IRSYE G
Sbjct: 395 RQRVAPHIKTYMRFSNDGELLRWVLVTSANLSKPAWGTLEGHKAKSRSTRGLRIRSYEAG 454
Query: 318 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 377
VL+ P C I+ K+ + + ++ ++G V+ +
Sbjct: 455 VLLYPKLFEESQRAPC---IMTPTYKTNTPNLDEKRR------EFYG-------KRVIGV 498
Query: 378 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 412
+ ++ PP Y +D WS T KD G VWP
Sbjct: 499 RMCWDFPPVEYEDKDEIWSPVINRTDKDWLGYVWP 533
>sp|P38319|TYDP1_YEAST Tyrosyl-DNA phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=TDP1 PE=1 SV=1
Length = 544
Score = 35.4 bits (80), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 5/38 (13%)
Query: 282 QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 319
++L W L TSANLS+ AWG + + R+YE GVL
Sbjct: 454 KELEWCLYTSANLSQTAWGTVSRKP-----RNYEAGVL 486
>sp|Q5E0C4|HUTG_VIBF1 Formimidoylglutamase OS=Vibrio fischeri (strain ATCC 700601 /
ES114) GN=hutG PE=3 SV=1
Length = 350
Score = 33.1 bits (74), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 287 FLLTSANLSKAAW-GALQKNNSQLMIRSYELGVLILPSAKRHGC---GFSCTSNIVPSEI 342
F+ +++++ +W G + + +L +R ++ V + S + G GF+C ++ ++
Sbjct: 5 FMKNNSSVNMTSWMGRVDHEDGELGLRWHQ-KVKVTNSTNQDGIMLLGFACDEGVIRNKG 63
Query: 343 KSGSTETSQIQKTKLVTLTWHGSSDA 368
+ G+ Q+ + L L WH SD
Sbjct: 64 RKGAYAAPQVIRRALANLAWHHQSDV 89
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.133 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,963,706
Number of Sequences: 539616
Number of extensions: 7121872
Number of successful extensions: 13798
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 13754
Number of HSP's gapped (non-prelim): 12
length of query: 423
length of database: 191,569,459
effective HSP length: 120
effective length of query: 303
effective length of database: 126,815,539
effective search space: 38425108317
effective search space used: 38425108317
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)