BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013742
         (437 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
          Length = 608

 Score =  273 bits (698), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)

Query: 3   DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
           D+DWL+   P   +   +L++HG+      H+  + KP  N  L +  L I+FGTHH+K 
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266

Query: 61  MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
           MLL+Y  G+R+++HT+NLIH DW+ K+QG+W+   +P + D  + S E    F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326

Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
           L     P     +              K + S   V LI S PG   GS    WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376

Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
            +L++        +S P+V QFSS+GSL   + KW+ +E   SM +   E KTP     P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436

Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
           L +++P+VE+VR SLEGY AG ++P S Q    +++L  Y+ KW A  +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496

Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
           + R   +  K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA      F  
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550

Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
            S  V  +  +GS E                         +   PVPY+LPP+ Y S+D 
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586

Query: 408 PWSWDKRYTKK-DVYGQVW 425
           PW W+  Y K  D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605


>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
          Length = 609

 Score =  263 bits (672), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 172/440 (39%), Positives = 242/440 (55%), Gaps = 58/440 (13%)

Query: 3   DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
           D+DWL+   P   +   +L++HG+  +   +   + KP AN  L +  L I+FGTHH+K 
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267

Query: 61  MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
           MLL+Y  G+R+++HT+NLI  DW+ K+QG+W+   +P  DQ + +       F+ DL  Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327

Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
           L+    P     +             ++ + S   V LI S PG   GS    WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377

Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
            +LQ    +  KG +  P+V QFSS+GSL   + KW+ +E   S+ +   E + P     
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436

Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
           PL +++P+VE+VR SLEGY AG ++P   +  +K  +L  Y+ KW A  +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496

Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
           T+ R +    KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA      F 
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 550

Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
             +  V  +  S S E +                           PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 586

Query: 407 VPWSWDKRYTKK-DVYGQVW 425
            PW W+  Y K  D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606


>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
           SV=1
          Length = 609

 Score =  261 bits (666), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 171/440 (38%), Positives = 238/440 (54%), Gaps = 58/440 (13%)

Query: 3   DIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
           D++WL+   P   +   +L++HG   E+   L H +    AN  L +  L I+FGTHH+K
Sbjct: 208 DVNWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTK 266

Query: 60  AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLID 115
            MLL+Y  G+R+++HT+NLI  DW+ K+QG+W+   +P   Q N +       F+ DL  
Sbjct: 267 MMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTS 326

Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
           YL     P     +             ++ + S   V LI S PG   GS    WGH +L
Sbjct: 327 YLMAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRL 376

Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
           R +LQ         +  P+V QFSS+GSL   + KW+ +E   S+ +   E +TP     
Sbjct: 377 RKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAV 436

Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
           PL +++P+VE+VR SLEGY AG ++P   +  +K  +L  Y+ KW A  +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIK 496

Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
           T+ R +    KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA      F 
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 550

Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
             +  V  +  S S+E                         +   PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSSEP------------------------MASFPVPYDLPPELYGSKD 586

Query: 407 VPWSWDKRYTKK-DVYGQVW 425
            PW W+  Y K  D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606


>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
           GN=gkt PE=2 SV=1
          Length = 580

 Score =  168 bits (426), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 126/357 (35%), Positives = 182/357 (50%), Gaps = 35/357 (9%)

Query: 1   MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
           MVDI WLL       +L K P +L+   ES   L   K  +    I  K P P  F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSH 248

Query: 58  SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
           +K M L Y  G +R+++ TANL   DW+N++QGLW+       P+       E   GF+ 
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 308

Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
           DL+ YL   K  +    +          +  +  +FS+  V  + SVPG H   S++   
Sbjct: 309 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 358

Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
           WGH +L ++L +        + P+V Q SS+GSL     A +     +   +D TP+G  
Sbjct: 359 WGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKL 417

Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
             +    +++P+  +V  S +G   G  +P  +   DK  +LK Y  +WK+S   RSRAM
Sbjct: 418 RQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAM 477

Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
           PHIK++ R+N   Q + WF+LTSANLSKAAWG   KN++    L I +YE GVL LP
Sbjct: 478 PHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534


>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
           GN=F52C12.1 PE=3 SV=1
          Length = 451

 Score =  166 bits (419), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 133/441 (30%), Positives = 206/441 (46%), Gaps = 83/441 (18%)

Query: 1   MVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
           M+D ++L+ + P  L + P  LV+       L    +N+    ++    LPI FGTHH+K
Sbjct: 75  MLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVTVVGAS-LPIPFGTHHTK 133

Query: 60  AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
             +L    G   +IV TANL+  DW  K+Q  +  +F +K  +       F++DL++YLS
Sbjct: 134 MSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIASGTVPRSDFQDDLLEYLS 192

Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
             +                     +K +FS  + RLI S PGYHT    ++ GH +L  +
Sbjct: 193 MYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGYHTDPPTQRPGHPRLFRI 241

Query: 179 LQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LSSSMSSGFSEDKTPLGIG 229
           L E   F+  ++   +   V Q SS+GSL      W     L S   +  S  + P  + 
Sbjct: 242 LSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQSLEGANPSPKQKPAKM- 300

Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
              +V+P+VEDVR S +GYA G ++P     +  + +L+    KW+++   R+ A+PH K
Sbjct: 301 --YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMCKWRSNAKRRTNAVPHCK 358

Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCG 344
           T+ +Y+ +   W LLTSANLSKAAWG +     KN  QLMIRS+E+GVLI          
Sbjct: 359 TYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRSWEMGVLI---------- 408

Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
                           T+ S+                           +P++ P   YS+
Sbjct: 409 ----------------TDPSRFN-------------------------IPFDYPLVPYSA 427

Query: 405 EDVPWSWDKRYTKKDVYGQVW 425
            D P+  DK++ K D+ G +W
Sbjct: 428 TDEPFVTDKKHEKPDILGCIW 448


>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
          Length = 536

 Score =  127 bits (320), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 133/498 (26%), Positives = 207/498 (41%), Gaps = 99/498 (19%)

Query: 2   VDIDWLLPAC------PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGT 55
           VD+++LL          V  +I H      +S   L     + P N  L+   +P+ +GT
Sbjct: 62  VDLNFLLENMHASVFPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGT 120

Query: 56  HHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ--------------------- 93
           HHSK M+  +     +I++HTANL+  DW   SQ ++                       
Sbjct: 121 HHSKIMVNFFKDDSCQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGN 180

Query: 94  ---------DFPLKDQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPA 131
                       +KD  N   +  +  FEN          D +  +      +F A L  
Sbjct: 181 PSKIRKHEGSLDIKDDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKN 240

Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 191
           + +        K ++FS+     I SVPG   G     WG  KL+ +L+    EK  KK 
Sbjct: 241 YRHTYELIEKLKMYDFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKD 298

Query: 192 P---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR 242
                      + Q SS+GS   K   E  + ++ GF   +     G    ++PTV++V+
Sbjct: 299 EKTKFEESDICISQCSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQ 351

Query: 243 CSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--N 294
            S+ G+ +G++I       +    V+     K   KW A   GR R  PHIKT+ R+  +
Sbjct: 352 QSMLGWQSGSSIHFNILGKTAASQVETLKKGKNLCKWVAMKAGRQRVAPHIKTYMRFSND 411

Query: 295 GQKLAWFLLTSANLSKAAWGALQKNNSQ------LMIRSYELGVLILPSAKRHGCGFSCT 348
           G+ L W L+TSANLSK AWG L+ + ++      L IRSYE GVL+ P          C 
Sbjct: 412 GELLRWVLVTSANLSKPAWGTLEGHKAKSRSTRGLRIRSYEAGVLLYPKLFEESQRAPC- 470

Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
             I+    K+ +    + ++       ++G         V+ + + ++ PP  Y  +D  
Sbjct: 471 --IMTPTYKTNTPNLDEKRR------EFYG-------KRVIGVRMCWDFPPVEYEDKDEI 515

Query: 409 WSWDKRYTKKDVYGQVWP 426
           WS     T KD  G VWP
Sbjct: 516 WSPVINRTDKDWLGYVWP 533


>sp|P38319|TYDP1_YEAST Tyrosyl-DNA phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain
           ATCC 204508 / S288c) GN=TDP1 PE=1 SV=1
          Length = 544

 Score = 35.8 bits (81), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 5/38 (13%)

Query: 296 QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 333
           ++L W L TSANLS+ AWG + +       R+YE GVL
Sbjct: 454 KELEWCLYTSANLSQTAWGTVSRKP-----RNYEAGVL 486


>sp|Q5E0C4|HUTG_VIBF1 Formimidoylglutamase OS=Vibrio fischeri (strain ATCC 700601 /
           ES114) GN=hutG PE=3 SV=1
          Length = 350

 Score = 33.1 bits (74), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 301 FLLTSANLSKAAW-GALQKNNSQLMIRSYELGVLILPSAKRHGC---GFSCTSNIVPSEI 356
           F+  +++++  +W G +   + +L +R ++  V +  S  + G    GF+C   ++ ++ 
Sbjct: 5   FMKNNSSVNMTSWMGRVDHEDGELGLRWHQ-KVKVTNSTNQDGIMLLGFACDEGVIRNKG 63

Query: 357 KSGSTETSQIQKTKLVTLTWHGSSDA 382
           + G+    Q+ +  L  L WH  SD 
Sbjct: 64  RKGAYAAPQVIRRALANLAWHHQSDV 89


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.133    0.424 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 174,700,564
Number of Sequences: 539616
Number of extensions: 7493048
Number of successful extensions: 14813
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 14766
Number of HSP's gapped (non-prelim): 12
length of query: 437
length of database: 191,569,459
effective HSP length: 120
effective length of query: 317
effective length of database: 126,815,539
effective search space: 40200525863
effective search space used: 40200525863
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)