BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026782
(233 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P51173|APEA_DICDI DNA-(apurinic or apyrimidinic site) lyase OS=Dictyostelium
discoideum GN=apeA PE=2 SV=2
Length = 361
Score = 111 bits (277), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 102/195 (52%), Gaps = 23/195 (11%)
Query: 21 FLLSALKYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSG 80
F + K++ +GRVI E++ FY++NTY PN G + + R ++WD Q ++ + +
Sbjct: 184 FGIGIAKHDNEGRVITLEYDQFYIVNTYIPNAGTRGLQRLDYRIKEWDVDFQAYLEKLNA 243
Query: 81 -KPLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKE 139
KP+IWCGDLNV+H EID+ +P K + GFTI ER F L++
Sbjct: 244 TKPIIWCGDLNVAHTEIDLKNP---------------KTNKKSAGFTIEERTSFSNFLEK 288
Query: 140 GRLIDAYRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGI 199
G +D+YR + K+ F WS G+ + R+DYF+VS+ L D I +
Sbjct: 289 G-YVDSYRHFNPGKEGSYTF-WSYLGGGRSKNVGWRLDYFVVSKRLMDSIKISPFHRTSV 346
Query: 200 ELEGFYGSDHCPVSL 214
GSDHCP+ +
Sbjct: 347 -----MGSDHCPIGV 356
>sp|P28352|APEX1_MOUSE DNA-(apurinic or apyrimidinic site) lyase OS=Mus musculus GN=Apex1
PE=1 SV=2
Length = 317
Score = 101 bits (251), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEFE+F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 149 EHDQEGRVIVAEFESFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKDLASRKPLVL 206
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 207 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 251
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 252 FRHLYPNTAYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 304
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 305 GSDHCPITLYLA 316
>sp|P43138|APEX1_RAT DNA-(apurinic or apyrimidinic site) lyase OS=Rattus norvegicus
GN=Apex1 PE=1 SV=2
Length = 317
Score = 100 bits (250), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEFE+F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 149 EHDQEGRVIVAEFESFILVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKDLASRKPLVL 206
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 207 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGEMLQAVPLADS 251
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 252 FRHLYPNTAYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 304
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 305 GSDHCPITLYLA 316
>sp|P45951|ARP_ARATH Apurinic endonuclease-redox protein OS=Arabidopsis thaliana GN=ARP
PE=2 SV=2
Length = 536
Score = 100 bits (250), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 27/191 (14%)
Query: 28 YEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQC-SGKPLIWC 86
++ +GR++ AEF++FYL+NTY PN+G + S+ R +WD+ + + + KP++
Sbjct: 369 HDTEGRIVTAEFDSFYLINTYVPNSGDGLKRLSY-RIEEWDRTLSNHIKELEKSKPVVLT 427
Query: 87 GDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDAY 146
GDLN +HEEID+ +P A K + GFTI ER+ FGA L + +D +
Sbjct: 428 GDLNCAHEEIDIFNP---AGNKRSA------------GFTIEERQSFGANLLDKGFVDTF 472
Query: 147 RFLHKEKDMDCGFS-WSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
R K+ G++ W G+ K R+DYF+VS+ IA + I L
Sbjct: 473 R---KQHPGVVGYTYWGYRHGGRKTNKGWRLDYFLVSQS-----IAANVHDSYI-LPDIN 523
Query: 206 GSDHCPVSLEL 216
GSDHCP+ L L
Sbjct: 524 GSDHCPIGLIL 534
>sp|A2T6Y4|APEX1_PANTR DNA-(apurinic or apyrimidinic site) lyase OS=Pan troglodytes
GN=APEX1 PE=3 SV=1
Length = 318
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEF++F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEFDSFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 253 FRHLYPNTPYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|A1YFZ3|APEX1_PANPA DNA-(apurinic or apyrimidinic site) lyase OS=Pan paniscus GN=APEX1
PE=3 SV=1
Length = 318
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEF++F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEFDSFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 253 FRHLYPNTPYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|P27695|APEX1_HUMAN DNA-(apurinic or apyrimidinic site) lyase OS=Homo sapiens GN=APEX1
PE=1 SV=2
Length = 318
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEF++F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEFDSFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 253 FRHLYPNTPYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|A2T7I6|APEX1_PONPY DNA-(apurinic or apyrimidinic site) lyase OS=Pongo pygmaeus
GN=APEX1 PE=3 SV=1
Length = 318
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 104/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEF++F L+ Y PN G + R++WD+ + F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEFDSFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRRFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 253 FRHLYPNTPYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLTALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|A1YES6|APEX1_GORGO DNA-(apurinic or apyrimidinic site) lyase OS=Gorilla gorilla
gorilla GN=APEX1 PE=3 SV=1
Length = 318
Score = 99.4 bits (246), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 104/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AEF++F L+ Y PN G + R++WD+ + F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEFDSFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRRFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLADS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S L + +++ +
Sbjct: 253 FRHLYPNTPYAYTF-WTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|P23196|APEX1_BOVIN DNA-(apurinic or apyrimidinic site) lyase OS=Bos taurus GN=APEX1
PE=1 SV=2
Length = 318
Score = 98.2 bits (243), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 105/192 (54%), Gaps = 25/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQ-CSGKPLIW 85
+++ +GRVI+AE++ F L+ Y PN G + R++WD+ ++F+ S KPL+
Sbjct: 150 EHDQEGRVIVAEYDAFVLVTAYVPNAG--RGLVRLEYRQRWDEAFRKFLKGLASRKPLVL 207
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+HEEID+ +P+ NK++ GFT ER+ FG +L+ L D+
Sbjct: 208 CGDLNVAHEEIDLRNPKG------------NKKN---AGFTPQERQGFGELLQAVPLTDS 252
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ F W+ + + R+DYF++S+ + + +++ +
Sbjct: 253 FRHLYPNTAYAYTF-WTYMMNARSKNVGWRLDYFLLSQSVLPALCDSKIRSKAL------ 305
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 306 GSDHCPITLYLA 317
>sp|P37454|EXOA_BACSU Exodeoxyribonuclease OS=Bacillus subtilis (strain 168) GN=exoA PE=1
SV=1
Length = 252
Score = 97.4 bits (241), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 100/190 (52%), Gaps = 28/190 (14%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQC-SGKPLIW 85
+++ +GRVI EFE +++ Y PN+ E + R +W++ + ++L+ KP+I
Sbjct: 85 EHDQEGRVITLEFENVFVMTVYTPNSRRGLERIDY--RMQWEEALLSYILELDQKKPVIL 142
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+H+EID+ +P K + GF+ ER+ F L+ G +D+
Sbjct: 143 CGDLNVAHQEIDLKNP---------------KANRNNAGFSDQEREAFTRFLEAG-FVDS 186
Query: 146 YRFLHKEKDMDCGFSWSGNPIG-KYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGF 204
+R H D++ +SW G + R RIDYF+VSE LK++I + +
Sbjct: 187 FR--HVYPDLEGAYSWWSYRAGARDRNIGWRIDYFVVSESLKEQIEDASISADVM----- 239
Query: 205 YGSDHCPVSL 214
GSDHCPV L
Sbjct: 240 -GSDHCPVEL 248
>sp|P27864|RRP1_DROME Recombination repair protein 1 OS=Drosophila melanogaster GN=Rrp1
PE=1 SV=2
Length = 679
Score = 94.0 bits (232), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 95/184 (51%), Gaps = 26/184 (14%)
Query: 32 GRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSG-KPLIWCGDLN 90
GR+I AE+E FYL+N Y PN+G K + + R +W+K Q +V + KP++ CGD+N
Sbjct: 517 GRMITAEYEKFYLINVYVPNSGRK--LVNLEPRMRWEKLFQAYVKKLDALKPVVICGDMN 574
Query: 91 VSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDAYRFLH 150
VSH ID+ +P K + GFT ER + +L G +D +R L+
Sbjct: 575 VSHMPIDLENP---------------KNNTKNAGFTQEERDKMTELLGLG-FVDTFRHLY 618
Query: 151 KEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFYGSDHC 210
++ F W+ + R R+DY +VSE +++ E++ + GSDHC
Sbjct: 619 PDRKGAYTF-WTYMANARARNVGWRLDYCLVSERFVPKVVEHEIRSQCL------GSDHC 671
Query: 211 PVSL 214
P+++
Sbjct: 672 PITI 675
>sp|A0MTA1|APEX1_DANRE DNA-(apurinic or apyrimidinic site) lyase OS=Danio rerio GN=apex1
PE=1 SV=1
Length = 310
Score = 84.7 bits (208), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 95/192 (49%), Gaps = 26/192 (13%)
Query: 27 KYEPDGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSG-KPLIW 85
+++ +GRVI AEF F+L+ Y PN R+ WD + ++ KPL+
Sbjct: 143 EHDKEGRVITAEFPDFFLVTAYVPNA--SRGLVRLDYRKTWDVDFRAYLCGLDARKPLVL 200
Query: 86 CGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDA 145
CGDLNV+H+EID+ +P K + GFT ER+ F +L+ G D+
Sbjct: 201 CGDLNVAHQEIDLKNP---------------KGNRKNAGFTPEEREGFTQLLEAG-FTDS 244
Query: 146 YRFLHKEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFY 205
+R L+ ++ F W+ + + R+DYF++S L + +++ +
Sbjct: 245 FRELYPDQAYAYTF-WTYMMNARSKNVGWRLDYFVLSSALLPGLCDSKIRNTAM------ 297
Query: 206 GSDHCPVSLELS 217
GSDHCP++L L+
Sbjct: 298 GSDHCPITLFLA 309
>sp|P0A2X4|EXOA_STRR6 Exodeoxyribonuclease OS=Streptococcus pneumoniae (strain ATCC
BAA-255 / R6) GN=exoA PE=3 SV=1
Length = 275
Score = 83.6 bits (205), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 88/189 (46%), Gaps = 29/189 (15%)
Query: 31 DGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSG-KPLIWCGDL 89
+GR+I EF+ F++ Y PN G + + R+ WD + E++ + KP++ GD
Sbjct: 111 EGRIITLEFDAFFVTQVYTPNAG--DGLKRLEERQVWDAKYAEYLAELDKEKPVLATGDY 168
Query: 90 NVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDAYRFL 149
NV+H EID+++P N+ PGFT ER F +L G D +R +
Sbjct: 169 NVAHNEIDLANP------------ASNRR---SPGFTDEERAGFTNLLATG-FTDTFRHV 212
Query: 150 HKEKDMDCGFSW--SGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFYGS 207
H D+ ++W + K RIDY++ S + D++ +M G
Sbjct: 213 H--GDVPERYTWWAQRSKTSKINNTGWRIDYWLTSNRIADKVTKSDMIDSGAR------Q 264
Query: 208 DHCPVSLEL 216
DH P+ LE+
Sbjct: 265 DHTPIVLEI 273
>sp|P0A2X3|EXOA_STRPN Exodeoxyribonuclease OS=Streptococcus pneumoniae serotype 4 (strain
ATCC BAA-334 / TIGR4) GN=exoA PE=3 SV=1
Length = 275
Score = 83.6 bits (205), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 88/189 (46%), Gaps = 29/189 (15%)
Query: 31 DGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSG-KPLIWCGDL 89
+GR+I EF+ F++ Y PN G + + R+ WD + E++ + KP++ GD
Sbjct: 111 EGRIITLEFDAFFVTQVYTPNAG--DGLKRLEERQVWDAKYAEYLAELDKEKPVLATGDY 168
Query: 90 NVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDAYRFL 149
NV+H EID+++P N+ PGFT ER F +L G D +R +
Sbjct: 169 NVAHNEIDLANP------------ASNRR---SPGFTDEERAGFTNLLATG-FTDTFRHV 212
Query: 150 HKEKDMDCGFSW--SGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFYGS 207
H D+ ++W + K RIDY++ S + D++ +M G
Sbjct: 213 H--GDVPERYTWWAQRSKTSKINNTGWRIDYWLTSNRIADKVTKSDMIDSGAR------Q 264
Query: 208 DHCPVSLEL 216
DH P+ LE+
Sbjct: 265 DHTPIVLEI 273
>sp|P38207|APN2_YEAST DNA-(apurinic or apyrimidinic site) lyase 2 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=APN2 PE=1
SV=1
Length = 520
Score = 48.9 bits (115), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 85/221 (38%), Gaps = 43/221 (19%)
Query: 24 SALKYEPDGRVILAEFET-FYLLNTYAPNNGWKEEENSFQRRRKWD---KRIQEFVLQCS 79
SAL+ + +GR ++ E +++ Y P N EE R R +R++ L
Sbjct: 156 SALELDSEGRCVMVELACGIVIISVYCPANSNSSEEGEMFRLRFLKVLLRRVRN--LDKI 213
Query: 80 GKPLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGF-------------- 125
GK ++ GD+NV + ID +A L + P + G
Sbjct: 214 GKKIVLMGDVNVCRDLID-------SADTLEQFSIPITDPMGGTKLEAQYRDKAIQFIIN 266
Query: 126 -TIAERKRFGAIL---------KEGRLIDAYRFLHKEKDMDCGFSWSGNPIGKYRGKRMR 175
R+ F IL K G LID R + + W+ + R
Sbjct: 267 PDTPHRRIFNQILADSLLPDASKRGILIDTTRLIQTRNRLKMYTVWNMLKNLRPSNYGSR 326
Query: 176 IDYFIVSEELKDRIIACEMQGHGIELEGFYGSDHCPVSLEL 216
ID+ +VS +L+ I A ++ L GSDHCPV +L
Sbjct: 327 IDFILVSLKLERCIKAADI------LPDILGSDHCPVYSDL 361
>sp|P44318|EX3_HAEIN Exodeoxyribonuclease III OS=Haemophilus influenzae (strain ATCC
51907 / DSM 11121 / KW20 / Rd) GN=xthA PE=3 SV=1
Length = 267
Score = 48.9 bits (115), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 88/195 (45%), Gaps = 29/195 (14%)
Query: 33 RVILAEFET----FYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSGK--PLIWC 86
R+I+A+ ET ++N Y P + E F + K+ +Q+++ + K P++
Sbjct: 90 RIIMADLETEFGLLTVINGYFPQGESRAHETKFPAKEKFYADLQQYLEKEHDKSNPILIM 149
Query: 87 GDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDW---GQPGFTIAERKRFGAILKEGRLI 143
GD+N+S ++D+ G N++ W G+ F ER + + G L
Sbjct: 150 GDMNISPSDLDI------------GIGDENRKRWLRTGKCSFLPEERAWYQRLYDYG-LE 196
Query: 144 DAYRFLH-KEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELE 202
D++R L+ D F + RG +RID+ +VS++L +R C G +++
Sbjct: 197 DSFRKLNPTANDKFSWFDYRSKGFDDNRG--LRIDHILVSQKLAER---CVDVGIALDIR 251
Query: 203 GF-YGSDHCPVSLEL 216
SDH P+ E
Sbjct: 252 AMEKPSDHAPIWAEF 266
>sp|Q68G58|APEX2_MOUSE DNA-(apurinic or apyrimidinic site) lyase 2 OS=Mus musculus
GN=Apex2 PE=1 SV=1
Length = 516
Score = 48.5 bits (114), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 82/197 (41%), Gaps = 37/197 (18%)
Query: 33 RVILAEFETFYLLNTYAPN-NGWKEEENSFQRRRKWDKRIQEFVLQCSGKPLIWCGDLNV 91
R + + +T L+N Y P+ + K E +F+ R +++ L +G +I GDLN
Sbjct: 140 RTLEGKEKTLTLINVYCPHADPGKPERLTFKMRFYRLLQMRAEALLAAGSHVIILGDLNT 199
Query: 92 SHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKE---------GRL 142
+H ID A+ L + +ED G RK +L G
Sbjct: 200 AHRPID-----HCDASSLECF----EEDPG--------RKWMDGLLSNPGDEAGPHIGLF 242
Query: 143 IDAYRFLH--KEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIE 200
+D+YR+LH +++ C WS ++ R+DY L DR + +
Sbjct: 243 MDSYRYLHPKQQRAFTC---WSVVSGARHLNYGSRLDYV-----LGDRALVIDTFQASFL 294
Query: 201 LEGFYGSDHCPVSLELS 217
L GSDHCPV L+
Sbjct: 295 LPEVMGSDHCPVGAVLN 311
>sp|Q5E9N9|APEX2_BOVIN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Bos taurus GN=APEX2
PE=2 SV=1
Length = 514
Score = 47.4 bits (111), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 92/224 (41%), Gaps = 48/224 (21%)
Query: 23 LSALKYEPDGRVILAEFE---------TFYLLNTYAPN-NGWKEEENSFQRRRKWDKRIQ 72
L AL E GR +L + + T L+N Y P+ + K E +F+ R +I+
Sbjct: 124 LRALDSE--GRALLTQHKICTWEGKEKTLTLINVYCPHADPGKPERLTFKMRFYRLLQIR 181
Query: 73 EFVLQCSGKPLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKR 132
L +G +I GDLN +H ID + A + + +ED G RK
Sbjct: 182 AEALLAAGSHVIILGDLNTAHRPID-----HWDAVNMECF----EEDPG--------RKW 224
Query: 133 FGAILKE---------GRLIDAYR-FLHKEKDMDCGFS-WSGNPIGKYRGKRMRIDYFIV 181
+L G ID+YR F K+K F+ WS ++ R+DY
Sbjct: 225 MDGLLSNLGCESGSHMGPFIDSYRCFQPKQKG---AFTCWSTVSGARHLNYGSRLDYV-- 279
Query: 182 SEELKDRIIACEMQGHGIELEGFYGSDHCPVSLELSEASSDSEK 225
L DR + + L GSDHCPV LS +S +++
Sbjct: 280 ---LGDRTLVIDTFQSSFLLPEVMGSDHCPVGAVLSVSSVPAKQ 320
>sp|Q9UBZ4|APEX2_HUMAN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Homo sapiens
GN=APEX2 PE=1 SV=1
Length = 518
Score = 46.6 bits (109), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 86/216 (39%), Gaps = 46/216 (21%)
Query: 31 DGRVILAEFE---------TFYLLNTYAPN-NGWKEEENSFQRRRKWDKRIQEFVLQCSG 80
+GR +L + + T L+N Y P+ + + E F+ R +I+ L +G
Sbjct: 130 EGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAG 189
Query: 81 KPLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRFGAILKE- 139
+I GDLN +H ID + A L + +ED G RK ++L
Sbjct: 190 SHVIILGDLNTAHRPID-----HWDAVNLECF----EEDPG--------RKWMDSLLSNL 232
Query: 140 --------GRLIDAYRFLH--KEKDMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRI 189
G ID+YR +E C WS ++ R+DY L DR
Sbjct: 233 GCQSASHVGPFIDSYRCFQPKQEGAFTC---WSAVTGARHLNYGSRLDYV-----LGDRT 284
Query: 190 IACEMQGHGIELEGFYGSDHCPVSLELSEASSDSEK 225
+ + L GSDHCPV LS +S +++
Sbjct: 285 LVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQ 320
>sp|P0A1A9|EX3_SALTY Exodeoxyribonuclease III OS=Salmonella typhimurium (strain LT2 /
SGSC1412 / ATCC 700720) GN=xthA PE=3 SV=1
Length = 268
Score = 44.3 bits (103), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 87/196 (44%), Gaps = 31/196 (15%)
Query: 29 EPDGRVILAEFET----FYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFV---LQCSGK 81
E R+I+AE + ++N Y P ++ F + ++ + +Q ++ L+C
Sbjct: 86 EAQRRIIMAEIPSPLGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD-N 144
Query: 82 PLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDW---GQPGFTIAERKRFGAILK 138
P++ GD+N+S ++D+ G N++ W G+ F ER+ +LK
Sbjct: 145 PVLIMGDMNISPTDLDI------------GIGEENRKRWLRTGKCSFLPEEREWMSRLLK 192
Query: 139 EGRLIDAYRFLHKEKDMDCGFSW-SGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGH 197
G L+D +R + + MD FSW G + +RID + S L +R C G
Sbjct: 193 WG-LVDTFRQANPQT-MD-KFSWFDYRSKGFVDNRGLRIDLLLASAPLAER---CAETGI 246
Query: 198 GIELEGF-YGSDHCPV 212
++ SDH PV
Sbjct: 247 DYDIRSMEKPSDHAPV 262
>sp|P0A1B0|EX3_SALTI Exodeoxyribonuclease III OS=Salmonella typhi GN=xthA PE=3 SV=1
Length = 268
Score = 44.3 bits (103), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 87/196 (44%), Gaps = 31/196 (15%)
Query: 29 EPDGRVILAEFET----FYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFV---LQCSGK 81
E R+I+AE + ++N Y P ++ F + ++ + +Q ++ L+C
Sbjct: 86 EAQRRIIMAEIPSPLGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD-N 144
Query: 82 PLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDW---GQPGFTIAERKRFGAILK 138
P++ GD+N+S ++D+ G N++ W G+ F ER+ +LK
Sbjct: 145 PVLIMGDMNISPTDLDI------------GIGEENRKRWLRTGKCSFLPEEREWMSRLLK 192
Query: 139 EGRLIDAYRFLHKEKDMDCGFSW-SGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGH 197
G L+D +R + + MD FSW G + +RID + S L +R C G
Sbjct: 193 WG-LVDTFRQANPQT-MD-KFSWFDYRSKGFVDNRGLRIDLLLASAPLAER---CAETGI 246
Query: 198 GIELEGF-YGSDHCPV 212
++ SDH PV
Sbjct: 247 DYDIRSMEKPSDHAPV 262
>sp|P87175|APN2_SCHPO DNA-(apurinic or apyrimidinic site) lyase 2 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=apn2 PE=1 SV=1
Length = 523
Score = 40.8 bits (94), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 83/197 (42%), Gaps = 35/197 (17%)
Query: 31 DGRVILAEFETFYLLNTYAPNNGWKEEENSFQRRRKWDKRIQEFV---LQCSGKPLIWCG 87
+GR IL +F+ F L+ Y P N EN + RR + K ++E + ++ + +I G
Sbjct: 134 EGRCILLDFQMFILIGVYCPVNS---GENRLEYRRAFYKALRERIERLIKEGNRKIILVG 190
Query: 88 DLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDWGQPGFTIAERKRF----GAILKEGRLI 143
D+N+ ID + + L +P +I E +++ + G L+
Sbjct: 191 DVNILCNPIDTADQKDIIRESL---IP-----------SIMESRQWIRDLLLPSRLGLLL 236
Query: 144 DAYRFLHKEKD--MDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIEL 201
D R H + C W+ + RIDY + + +L + ++ +
Sbjct: 237 DIGRIQHPTRKGMFTC---WNTRLNTRPTNYGTRIDYTLATPDLLPWVQDADIMAEVM-- 291
Query: 202 EGFYGSDHCPVSLELSE 218
GSDHCPV L+L E
Sbjct: 292 ----GSDHCPVYLDLKE 304
>sp|P09030|EX3_ECOLI Exodeoxyribonuclease III OS=Escherichia coli (strain K12) GN=xthA
PE=1 SV=4
Length = 268
Score = 38.1 bits (87), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 82/196 (41%), Gaps = 31/196 (15%)
Query: 29 EPDGRVILAEFETFY----LLNTYAPNNGWKEEENSFQRRRKWDKRIQEFVLQCSGK--- 81
E R+I+AE + ++N Y P ++ F + ++ + +Q + L+ K
Sbjct: 86 EAQRRIIMAEIPSLLGNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNY-LETELKRDN 144
Query: 82 PLIWCGDLNVSHEEIDVSHPEFFAAAKLNGYVPPNKEDW---GQPGFTIAERKRFGAILK 138
P++ GD+N+S ++D+ G N++ W G+ F ER+ ++
Sbjct: 145 PVLIMGDMNISPTDLDI------------GIGEENRKRWLRTGKCSFLPEEREWMDRLMS 192
Query: 139 EGRLIDAYRFLHKEKDMDCGFSWSGNPIGKYRGKR-MRIDYFIVSEELKDRIIACEMQGH 197
G L+D +R H FSW + R +RID + S+ L + C G
Sbjct: 193 WG-LVDTFR--HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAE---CCVETGI 246
Query: 198 GIELEGF-YGSDHCPV 212
E+ SDH PV
Sbjct: 247 DYEIRSMEKPSDHAPV 262
>sp|Q8BTF7|DNLI4_MOUSE DNA ligase 4 OS=Mus musculus GN=Lig4 PE=2 SV=2
Length = 911
Score = 35.0 bits (79), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 154 DMDCGFSWSGNPIGKYRGKRMRIDYFIVSEELKDRIIACEMQGHGIELEGFYGS 207
D++C +SW +P+ +R + +D + V +L RI A + +EL F+G+
Sbjct: 798 DLECRYSWDHSPLSMFRHYTIYLDLYAVINDLSSRIEATRLGITALELR-FHGA 850
>sp|Q5SSE9|ABCAD_MOUSE ATP-binding cassette sub-family A member 13 OS=Mus musculus GN=Abca13
PE=2 SV=1
Length = 5034
Score = 32.0 bits (71), Expect = 4.0, Method: Composition-based stats.
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 112 YVPPNKEDWGQPGFTIAERKRFGAILKEGRLIDAYR 147
++P N DW Q G T+ GA+L E R++++ R
Sbjct: 4393 HLPANAVDWRQYGITLYSHPYGGALLNEDRILESIR 4428
>sp|Q9UTK6|YKR3_SCHPO G-patch domain-containing protein C1486.03 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=SPAC1486.03c PE=3
SV=1
Length = 797
Score = 31.6 bits (70), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 10/59 (16%)
Query: 136 ILKEGRLIDAYRFLHKEKDM--------DCGFSWSGNPIGKYRGKRMRIDYFIVSEELK 186
IL+EGRL ++R KE+DM + GF SG IG R +R I + SE++K
Sbjct: 20 ILEEGRLRPSFRGQQKERDMLGIFGEEDEDGFHNSG--IGSARLRRKNISFVEKSEQVK 76
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.140 0.443
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,879,721
Number of Sequences: 539616
Number of extensions: 4146617
Number of successful extensions: 7368
Number of sequences better than 100.0: 29
Number of HSP's better than 100.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 7307
Number of HSP's gapped (non-prelim): 30
length of query: 233
length of database: 191,569,459
effective HSP length: 114
effective length of query: 119
effective length of database: 130,053,235
effective search space: 15476334965
effective search space used: 15476334965
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)