BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011198
         (491 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9NUW8|TYDP1_HUMAN Tyrosyl-DNA phosphodiesterase 1 OS=Homo sapiens GN=TDP1 PE=1 SV=2
          Length = 608

 Score =  184 bits (466), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 134/427 (31%), Positives = 217/427 (50%), Gaps = 39/427 (9%)

Query: 78  QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
           +R+K  S E +    S +D ++   +P     K V          NDG       +G  +
Sbjct: 75  KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134

Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
             ++++++++   +G+  +     + +     P  F L RV G+    N+  + I+D++ 
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190

Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
              G ++ +   NY  D+DWL+   P   +   +L++HG+      H+  + KP  N  L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250

Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
            +  L I+FGTHH+K MLL+Y  G+R+++HT+NLIH DW+ K+QG+W+   +P + D  +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310

Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
            S E    F+ DLI YL     P     +              K + S   V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360

Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
              GS    WGH +L+ +L++        +S P+V QFSS+GSL   + KW+ +E   SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420

Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
            +   E KTP     PL +++P+VE+VR SLEGY AG ++P S Q    +++L  Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480

Query: 473 KASHTGR 479
            A  +GR
Sbjct: 481 SAETSGR 487


>sp|Q8BJ37|TYDP1_MOUSE Tyrosyl-DNA phosphodiesterase 1 OS=Mus musculus GN=Tdp1 PE=2 SV=2
          Length = 609

 Score =  177 bits (449), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/337 (34%), Positives = 180/337 (53%), Gaps = 30/337 (8%)

Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
           P  F L RV G+ A  N+  + I+D++    G ++ +   NY  D+DWL+   P   +  
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223

Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
            +L++HG   E+   L H +    AN  L +  L I+FGTHH+K MLL+Y  G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282

Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
           +NLI  DW+ K+QG+W+   +P  DQ + +       F+ DL  YL+    P     +  
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340

Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
                      ++ + S   V LI S PG   GS    WGH +LR +LQ    +  KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391

Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
             P+V QFSS+GSL   + KW+ +E   S+ +   E + P     PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451

Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
           LEGY AG ++P   +  +K  +L  Y+ KW A  +GR
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGR 488


>sp|Q4G056|TYDP1_RAT Tyrosyl-DNA phosphodiesterase 1 OS=Rattus norvegicus GN=Tdp1 PE=2
           SV=1
          Length = 609

 Score =  174 bits (442), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 117/344 (34%), Positives = 177/344 (51%), Gaps = 44/344 (12%)

Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
           P  F L RV G+ A  N+  + I+D++    G ++ +   NY  D++WL+   P   +  
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223

Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
            +L++HG   E+   L H +    AN  L +  L I+FGTHH+K MLL+Y  G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282

Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
           +NLI  DW+ K+QG+W+   +P   Q N +       F+ DL  YL     P     +  
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340

Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ---------EC 381
                      ++ + S   V LI S PG   GS    WGH +LR +LQ         EC
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392

Query: 382 TFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPT 436
                    P+V QFSS+GSL   + KW+ +E   S+ +   E +TP     PL +++P+
Sbjct: 393 W--------PVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPS 444

Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
           VE+VR SLEGY AG ++P   +  +K  +L  Y+ KW A  +GR
Sbjct: 445 VENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGR 488


>sp|Q9VQM4|TYDP1_DROME Probable tyrosyl-DNA phosphodiesterase OS=Drosophila melanogaster
           GN=gkt PE=2 SV=1
          Length = 580

 Score =  110 bits (276), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 95/313 (30%), Positives = 146/313 (46%), Gaps = 30/313 (9%)

Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
           I D   G+I   +  N+MVDI WLL       +L K P +L+   ES   L   K  +  
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233

Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DF 294
             I  K P P  F T H+K M L Y  G +R+++ TANL   DW+N++QGLW+       
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291

Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
           P+       E   GF+ DL+ YL   K  +    +          +  +  +FS+  V  
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341

Query: 354 IASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
           + SVPG H   S++   WGH +L ++L +        + P+V Q SS+GSL     A + 
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400

Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
               +   +D TP+G    +    +++P+  +V  S +G   G  +P  +   DK  +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLK 460

Query: 467 KYWAKWKASHTGR 479
            Y  +WK+S   R
Sbjct: 461 DYLQQWKSSDRFR 473


>sp|Q9TXV7|TYDP1_CAEEL Probable tyrosyl-DNA phosphodiesterase OS=Caenorhabditis elegans
           GN=F52C12.1 PE=3 SV=1
          Length = 451

 Score =  105 bits (263), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 150/312 (48%), Gaps = 31/312 (9%)

Query: 185 DVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
           D I  DI  I ++  ++M+D ++L+ + P  L + P  LV+       L    +N+    
Sbjct: 58  DEILADIRPINSLHFSFMLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVT 117

Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
           ++    LPI FGTHH+K  +L    G   +IV TANL+  DW  K+Q  +  +F +K  +
Sbjct: 118 VVGAS-LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIAS 175

Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
                  F++DL++YLS  +                     +K +FS  + RLI S PGY
Sbjct: 176 GTVPRSDFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGY 224

Query: 361 HTGSSLKKWGHMKLRTVLQE-CTFEKGF---KKSPLVYQFSSLGSLDE---KWMAE--LS 411
           HT    ++ GH +L  +L E   F+  +   ++   V Q SS+GSL      W     L 
Sbjct: 225 HTDPPTQRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQ 284

Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWA 470
           S   +  S  + P  +    +V+P+VEDVR S +GYA G ++P     +  + +L+    
Sbjct: 285 SLEGANPSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMC 341

Query: 471 KWKASHTGRRYN 482
           KW+ S+  RR N
Sbjct: 342 KWR-SNAKRRTN 352


>sp|Q9USG9|TYDP1_SCHPO Probable tyrosyl-DNA phosphodiesterase OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPCP31B10.05 PE=3 SV=1
          Length = 536

 Score = 69.7 bits (169), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 92/391 (23%), Positives = 153/391 (39%), Gaps = 76/391 (19%)

Query: 156 SRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC---- 210
           S + + S   L ++  LP   N  C+ ++ +I    +      N+ VD+++LL       
Sbjct: 16  SNEIIDSPIFLNKISALPESENVHCLLLKQLIGSPQLKQTWQFNFCVDLNFLLENMHASV 75

Query: 211 --PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-G 267
              V  +I H      +S   L     + P N  L+   +P+ +GTHHSK M+  +    
Sbjct: 76  FPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGTHHSKIMVNFFKDDS 134

Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQ------------------------------DFPLK 297
            +I++HTANL+  DW   SQ ++                                   +K
Sbjct: 135 CQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGNPSKIRKHEGSLDIK 194

Query: 298 DQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
           D  N   +  +  FEN          D +  +      +F A L  + +        K +
Sbjct: 195 DDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKNYRHTYELIEKLKMY 254

Query: 345 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQ 395
           +FS+     I SVPG   G     WG  KL+ +L+    EK  KK            + Q
Sbjct: 255 DFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKDEKTKFEESDICISQ 312

Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 453
            SS+GS   K   E  + ++ GF   +     G    ++PTV++V+ S+ G+ +G++I  
Sbjct: 313 CSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQQSMLGWQSGSSIHF 365

Query: 454 ----PSPQKNVDKDFLKKYWAKWKASHTGRR 480
                +    V+     K   KW A   GR+
Sbjct: 366 NILGKTAASQVETLKKGKNLCKWVAMKAGRQ 396


>sp|Q7TT00|SP20H_MOUSE Transcription factor SPT20 homolog OS=Mus musculus GN=Supt20h PE=1
           SV=1
          Length = 530

 Score = 35.4 bits (80), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/186 (23%), Positives = 71/186 (38%), Gaps = 38/186 (20%)

Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
           E +L++YL   + P    +L       I         F    V  IA +  Y   S++K 
Sbjct: 108 EGELLEYLDAEELPPILVDLLEKSQVNI---------FHCGCV--IAEIRDYRQSSNMKS 156

Query: 369 WG----HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
            G    H+ LR  +Q            LV    S+ S + KW  E          E +  
Sbjct: 157 PGYQSRHILLRPTMQT-----------LVCDVHSITSDNHKWTQE-----DKLLLESQLI 200

Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRRYNID 484
           L   EPL + P+V  V C+       N +   ++ ++   +K+ W ++  S   R+ ++ 
Sbjct: 201 LATAEPLCLDPSVA-VACT------ANRLLYNRQKMNTRPMKRCWKRYSRSSLNRQQDLS 253

Query: 485 LYLPPP 490
              PPP
Sbjct: 254 HGPPPP 259


>sp|Q8IW19|APLF_HUMAN Aprataxin and PNK-like factor OS=Homo sapiens GN=APLF PE=1 SV=1
          Length = 511

 Score = 34.7 bits (78), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 4/91 (4%)

Query: 23  PKLPLSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRK- 80
           P++ L+ G  VIGR  +  ++DKR+SR+H  L   A G   +    TNP   +S ++ + 
Sbjct: 14  PRVALAPGETVIGRGPLLGITDKRVSRRHAILEV-AGGQLRIKPIHTNPCFYQSSEKSQL 72

Query: 81  -KLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
             L  N    +  GD   L+   + F+ +++
Sbjct: 73  LPLKPNLWCYLNPGDSFSLLVDKYIFRILSI 103


>sp|Q9V677|ECM29_DROME Proteasome-associated protein ECM29 homolog OS=Drosophila
           melanogaster GN=CG8858 PE=1 SV=1
          Length = 1890

 Score = 33.1 bits (74), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 57/133 (42%), Gaps = 13/133 (9%)

Query: 31  PNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRKKLSSNE---- 86
           PNV     +P  D     + +T   S D S  LV+ G    +++S   R+K+  +     
Sbjct: 814 PNVPVEIQVPSKDGDDDDEEMTEYTSIDSSTKLVIFGVVFQLLRSSSARQKIREDSARCL 873

Query: 87  -HVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKN 145
            +++I DG   E     +  K++TL++ QK    D A N  +S   +      D   G+ 
Sbjct: 874 GYLAIGDG---EHFTKRNLDKFLTLTKIQK----DAALNIAISEAIVNTLCGYDVNKGQ- 925

Query: 146 SEEALCNFHVSRD 158
            +E   N H + D
Sbjct: 926 PDEKFVNKHCNDD 938


>sp|Q9D842|APLF_MOUSE Aprataxin and PNK-like factor OS=Mus musculus GN=Aplf PE=2 SV=2
          Length = 499

 Score = 32.7 bits (73), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 8/47 (17%)

Query: 8  YLVPLDNNLREDNSLPKLPLSQGPNVIGRTN-IPVSDKRLSRKHITL 53
          +L PLD         P++P+  G  VIGR   + ++DKR+SR+H  L
Sbjct: 6  FLQPLDGG-------PRVPVGPGQTVIGRGPLLGITDKRVSRRHAIL 45


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.134    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,377,963
Number of Sequences: 539616
Number of extensions: 8843611
Number of successful extensions: 19939
Number of sequences better than 100.0: 12
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 19905
Number of HSP's gapped (non-prelim): 18
length of query: 491
length of database: 191,569,459
effective HSP length: 122
effective length of query: 369
effective length of database: 125,736,307
effective search space: 46396697283
effective search space used: 46396697283
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.9 bits)