BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014528
         (423 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
          Length = 608

 Score =  259 bits (661), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 166/399 (41%), Positives = 224/399 (56%), Gaps = 54/399 (13%)

Query: 27  NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LK 84
           N  L +  L I+FGTHH+K MLL+Y  G+R+++HT+NLIH DW+ K+QG+W+   +P + 
Sbjct: 247 NISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIA 306

Query: 85  DQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 142
           D  + S E    F+ DLI YL     P     +              K + S   V LI 
Sbjct: 307 DGTHKSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIG 356

Query: 143 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AEL 197
           S PG   GS    WGH +L+ +L++        +S P+V QFSS+GSL   + KW+ +E 
Sbjct: 357 STPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEF 416

Query: 198 SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKY 255
             SM +   E KTP     PL +++P+VE+VR SLEGY AG ++P S Q    +++L  Y
Sbjct: 417 KESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSY 476

Query: 256 WAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRS 313
           + KW A  +GRS AMPHIKT+ R +    K+AWFL+TSANLSKAAWGAL+KN +QLMIRS
Sbjct: 477 FHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRS 536

Query: 314 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 373
           YELGVL LPSA      F   S  V  +  +GS E                         
Sbjct: 537 YELGVLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------ 566

Query: 374 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
           +   PVPY+LPP+ Y S+D PW W+  Y K  D +G +W
Sbjct: 567 MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605


>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
          Length = 609

 Score =  250 bits (639), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 160/400 (40%), Positives = 219/400 (54%), Gaps = 54/400 (13%)

Query: 26  ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLK 84
           AN  L +  L I+FGTHH+K MLL+Y  G+R+++HT+NLI  DW+ K+QG+W+   +P  
Sbjct: 247 ANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRI 306

Query: 85  DQNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 141
           DQ + +       F+ DL  YL+    P     +             ++ + S   V LI
Sbjct: 307 DQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI----------DIIQEHDLSETNVYLI 356

Query: 142 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AE 196
            S PG   GS    WGH +LR +LQ         +  P+V QFSS+GSL   + KW+ +E
Sbjct: 357 GSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKGECWPIVGQFSSIGSLGPDESKWLCSE 416

Query: 197 LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKK 254
              S+ +   E + P     PL +++P+VE+VR SLEGY AG ++P   +  +K  +L  
Sbjct: 417 FKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHS 476

Query: 255 YWAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIR 312
           Y+ KW A  +GRS AMPHIKT+ R +    KLAWFL+TSANLSKAAWGAL+KN +QLMIR
Sbjct: 477 YFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIR 536

Query: 313 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 372
           SYELGVL LPSA      F   +  V  +  S S E +                      
Sbjct: 537 SYELGVLFLPSA------FGLDTFKVKQKFFSSSCEPT---------------------- 568

Query: 373 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
                PVPY+LPP+ Y S+D PW W+  Y K  D +G +W
Sbjct: 569 --ASFPVPYDLPPELYRSKDRPWIWNIPYVKAPDTHGNMW 606


>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
           SV=1
          Length = 609

 Score =  250 bits (638), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 161/400 (40%), Positives = 219/400 (54%), Gaps = 54/400 (13%)

Query: 26  ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLK 84
           AN  L +  L I+FGTHH+K MLL+Y  G+R+++HT+NLI  DW+ K+QG+W+   +P  
Sbjct: 247 ANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRI 306

Query: 85  DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 141
            Q N +       F+ DL  YL     P     +             ++ + S   V LI
Sbjct: 307 YQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI----------DIIQEHDLSETNVYLI 356

Query: 142 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AE 196
            S PG   GS    WGH +LR +LQ         +  P+V QFSS+GSL   + KW+ +E
Sbjct: 357 GSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSE 416

Query: 197 LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKK 254
              S+ +   E +TP     PL +++P+VE+VR SLEGY AG ++P   +  +K  +L  
Sbjct: 417 FKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHP 476

Query: 255 YWAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIR 312
           Y+ KW A  +GRS AMPHIKT+ R +    KLAWFL+TSANLSKAAWGAL+KN +QLMIR
Sbjct: 477 YFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIR 536

Query: 313 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 372
           SYELGVL LPSA      F   +  V  +  S S+E                        
Sbjct: 537 SYELGVLFLPSA------FGLDTFKVKQKFFSSSSEP----------------------- 567

Query: 373 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 411
            +   PVPY+LPP+ Y S+D PW W+  Y K  D +G +W
Sbjct: 568 -MASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 606


>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
           GN=gkt PE=2 SV=1
          Length = 580

 Score =  162 bits (411), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 119/342 (34%), Positives = 174/342 (50%), Gaps = 34/342 (9%)

Query: 2   GIL---LLLFYQTTWWTLIGCCQRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRI 57
           GIL   LLL Y      L+   +  +    I  K P P  F T H+K M L Y  G +R+
Sbjct: 206 GILDKPLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSHTKMMFLGYSDGSMRV 263

Query: 58  IVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFS 112
           ++ TANL   DW+N++QGLW+       P+       E   GF+ DL+ YL   K  +  
Sbjct: 264 VISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQDLMLYLVEYKISQLQ 323

Query: 113 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTF 170
             +          +  +  +FS+  V  + SVPG H   S++   WGH +L ++L +   
Sbjct: 324 PWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHPWGHARLASLLAKHAA 373

Query: 171 EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVED 226
                + P+V Q SS+GSL     A +     +   +D TP+G    +    +++P+  +
Sbjct: 374 PID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGN 432

Query: 227 VRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQK 283
           V  S +G   G  +P  +   DK  +LK Y  +WK+S   RSRAMPHIK++ R+N   Q 
Sbjct: 433 VAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQS 492

Query: 284 LAWFLLTSANLSKAAWGALQKNNSQ---LMIRSYELGVLILP 322
           + WF+LTSANLSKAAWG   KN++    L I +YE GVL LP
Sbjct: 493 VYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534


>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
           GN=F52C12.1 PE=3 SV=1
          Length = 451

 Score =  161 bits (407), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 185/392 (47%), Gaps = 81/392 (20%)

Query: 35  LPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 93
           LPI FGTHH+K  +L    G   +IV TANL+  DW  K+Q  +  +F +K  +      
Sbjct: 123 LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIASGTVPRS 181

Query: 94  GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 153
            F++DL++YLS  +                     +K +FS  + RLI S PGYHT    
Sbjct: 182 DFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGYHTDPPT 230

Query: 154 KKWGHMKLRTVLQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LSSSMSSG 204
           ++ GH +L  +L E   F+  ++   +   V Q SS+GSL      W     L S   + 
Sbjct: 231 QRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQSLEGAN 290

Query: 205 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASH 263
            S  + P  +    +V+P+VEDVR S +GYA G ++P     +  + +L+    KW+++ 
Sbjct: 291 PSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMCKWRSNA 347

Query: 264 TGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVL 319
             R+ A+PH KT+ +Y+ +   W LLTSANLSKAAWG +     KN  QLMIRS+E+GVL
Sbjct: 348 KRRTNAVPHCKTYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRSWEMGVL 407

Query: 320 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 379
           I                          T+ S+                           +
Sbjct: 408 I--------------------------TDPSRFN-------------------------I 416

Query: 380 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 411
           P++ P   YS+ D P+  DK++ K D+ G +W
Sbjct: 417 PFDYPLVPYSATDEPFVTDKKHEKPDILGCIW 448


>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
          Length = 536

 Score =  125 bits (313), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 124/455 (27%), Positives = 192/455 (42%), Gaps = 93/455 (20%)

Query: 25  PANWILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ---- 79
           P N  L+   +P+ +GTHHSK M+  +     +I++HTANL+  DW   SQ ++      
Sbjct: 105 PVNVKLYSVYVPM-WGTHHSKIMVNFFKDDSCQIVIHTANLVEPDWIGMSQAIFKTPLLY 163

Query: 80  --------------------------DFPLKDQNN---LSEECGFEN----------DLI 100
                                        +KD  N   +  +  FEN          D +
Sbjct: 164 PKANDSLSTSSVPEYGNPSKIRKHEGSLDIKDDRNCDIIDVDSAFENFKHKSDTRSSDDL 223

Query: 101 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 160
             +      +F A L  + +        K ++FS+     I SVPG   G     WG  K
Sbjct: 224 GVIGRQFQQDFLAYLKNYRHTYELIEKLKMYDFSAIRAIFIGSVPGKFEGEEESSWGLGK 283

Query: 161 LRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 211
           L+ +L+    EK  KK            + Q SS+GS   K   E  + ++ GF   +  
Sbjct: 284 LKKILK--MLEKDSKKDEKTKFEESDICISQCSSMGSFGPK--QEYIAELTDGFGCQR-- 337

Query: 212 LGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTG 265
              G    ++PTV++V+ S+ G+ +G++I       +    V+     K   KW A   G
Sbjct: 338 ---GNWKFLFPTVKEVQQSMLGWQSGSSIHFNILGKTAASQVETLKKGKNLCKWVAMKAG 394

Query: 266 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQ------LMIRSYELG 317
           R R  PHIKT+ R+  +G+ L W L+TSANLSK AWG L+ + ++      L IRSYE G
Sbjct: 395 RQRVAPHIKTYMRFSNDGELLRWVLVTSANLSKPAWGTLEGHKAKSRSTRGLRIRSYEAG 454

Query: 318 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 377
           VL+ P          C   I+    K+ +    + ++       ++G         V+ +
Sbjct: 455 VLLYPKLFEESQRAPC---IMTPTYKTNTPNLDEKRR------EFYG-------KRVIGV 498

Query: 378 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 412
            + ++ PP  Y  +D  WS     T KD  G VWP
Sbjct: 499 RMCWDFPPVEYEDKDEIWSPVINRTDKDWLGYVWP 533


>sp|P38319|TYDP1_YEAST Tyrosyl-DNA phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain
           ATCC 204508 / S288c) GN=TDP1 PE=1 SV=1
          Length = 544

 Score = 35.4 bits (80), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 5/38 (13%)

Query: 282 QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 319
           ++L W L TSANLS+ AWG + +       R+YE GVL
Sbjct: 454 KELEWCLYTSANLSQTAWGTVSRKP-----RNYEAGVL 486


>sp|Q5E0C4|HUTG_VIBF1 Formimidoylglutamase OS=Vibrio fischeri (strain ATCC 700601 /
           ES114) GN=hutG PE=3 SV=1
          Length = 350

 Score = 33.1 bits (74), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 287 FLLTSANLSKAAW-GALQKNNSQLMIRSYELGVLILPSAKRHGC---GFSCTSNIVPSEI 342
           F+  +++++  +W G +   + +L +R ++  V +  S  + G    GF+C   ++ ++ 
Sbjct: 5   FMKNNSSVNMTSWMGRVDHEDGELGLRWHQ-KVKVTNSTNQDGIMLLGFACDEGVIRNKG 63

Query: 343 KSGSTETSQIQKTKLVTLTWHGSSDA 368
           + G+    Q+ +  L  L WH  SD 
Sbjct: 64  RKGAYAAPQVIRRALANLAWHHQSDV 89


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.133    0.429 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,963,706
Number of Sequences: 539616
Number of extensions: 7121872
Number of successful extensions: 13798
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 13754
Number of HSP's gapped (non-prelim): 12
length of query: 423
length of database: 191,569,459
effective HSP length: 120
effective length of query: 303
effective length of database: 126,815,539
effective search space: 38425108317
effective search space used: 38425108317
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)