BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006881
(627 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9UBZ4|APEX2_HUMAN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Homo sapiens
GN=APEX2 PE=1 SV=1
Length = 518
Score = 191 bits (485), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/390 (32%), Positives = 189/390 (48%), Gaps = 77/390 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFG----------SLRKLLDSFDADIICFQETKLRRQELKSDLV 50
+++V++N+NG+R+ + ++ ++LD DADI+C QETK+ R L L
Sbjct: 2 LRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPLA 61
Query: 51 MADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK 110
+ +GY S+FS +R R+GYSGVATFC+ + A PVAAEEG +GL T
Sbjct: 62 IVEGYNSYFSFSRN----RSGYSGVATFCK--------DNATPVAAEEGLSGLFATQNGD 109
Query: 111 I--MEGLEDFSKDELLKIDSEGRCVITDH---------GHFILFNVYGPRADSEDTVRIQ 159
+ +++F+++EL +DSEGR ++T H L NVY P AD R+
Sbjct: 110 VGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLV 169
Query: 160 FKLQFF---HKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAG--PDFAKNEFRIWFRSML 214
FK++F+ R E LL G + ++GDLN A ID DA F ++ R W S+L
Sbjct: 170 FKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLL 229
Query: 215 VESG-------GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQK 267
G G F D +R P++ A+TCW + TGA NYG+R+D++L
Sbjct: 230 SNLGCQSASHVGPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVL--------- 280
Query: 268 HDLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIP 327
+ ++ID + + + GSDH PV L V +P
Sbjct: 281 -------------GDRTLVIDTFQ---------ASFLLPEVMGSDHCPVGAVL-SVSSVP 317
Query: 328 QHSTPSLASRYLPIIRGVQQTLVSVLMKRE 357
P L +R+LP G Q ++ L+ E
Sbjct: 318 AKQCPPLCTRFLPEFAGTQLKILRFLVPLE 347
Score = 79.3 bits (194), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 37/49 (75%)
Query: 572 PLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
PLC GH+EPCV R VKKPGP GRRF++CAR GP ++P + C +F W+
Sbjct: 467 PLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWS 515
>sp|Q5E9N9|APEX2_BOVIN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Bos taurus GN=APEX2
PE=2 SV=1
Length = 514
Score = 187 bits (476), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 189/387 (48%), Gaps = 77/387 (19%)
Query: 1 MKIVTYNVNGLRQRVSQF----------GSLRKLLDSFDADIICFQETKLRRQELKSDLV 50
+++V++N+NG+R + ++ ++LD DADI+C QETK+ R L L
Sbjct: 2 LRLVSWNINGIRSPLQGVRCEEPSSCSAMAMGRILDKLDADIVCLQETKVTRDVLTEPLA 61
Query: 51 MADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK 110
+ +GY S+FS +R R+GYSGVATFC+ + A PVAAEEG +GLL T
Sbjct: 62 IIEGYNSYFSFSRN----RSGYSGVATFCK--------DSATPVAAEEGLSGLLSTQNGD 109
Query: 111 I--MEGLEDFSKDELLKIDSEGRCVITDH---------GHFILFNVYGPRADSEDTVRIQ 159
+ ++DF+++EL +DSEGR ++T H L NVY P AD R+
Sbjct: 110 VGCYGNMDDFTQEELRALDSEGRALLTQHKICTWEGKEKTLTLINVYCPHADPGKPERLT 169
Query: 160 FKLQFF---HKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDA--GPDFAKNEFRIWFRSML 214
FK++F+ R E LL G + ++GDLN A ID DA F ++ R W +L
Sbjct: 170 FKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPIDHWDAVNMECFEEDPGRKWMDGLL 229
Query: 215 ----VESG---GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQK 267
ESG G F D +R P+++ A+TCW + +GA NYG+R+D++L
Sbjct: 230 SNLGCESGSHMGPFIDSYRCFQPKQKGAFTCWSTVSGARHLNYGSRLDYVL--------- 280
Query: 268 HDLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIP 327
+ ++ID + + + GSDH PV L V +P
Sbjct: 281 -------------GDRTLVIDTFQ---------SSFLLPEVMGSDHCPVGAVL-SVSSVP 317
Query: 328 QHSTPSLASRYLPIIRGVQQTLVSVLM 354
P L + +LP G Q ++ L+
Sbjct: 318 AKQCPPLCTCFLPEFAGTQLKILRFLV 344
Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 39/52 (75%)
Query: 569 TSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
+ +PLC GH+EPCV R VKKPGP GR F++CAR +GP ++P + C +F W+
Sbjct: 460 SPMPLCGGHREPCVMRTVKKPGPNLGRHFYMCARPQGPPTDPSSRCNFFLWS 511
>sp|Q68G58|APEX2_MOUSE DNA-(apurinic or apyrimidinic site) lyase 2 OS=Mus musculus
GN=Apex2 PE=1 SV=1
Length = 516
Score = 185 bits (470), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 124/389 (31%), Positives = 188/389 (48%), Gaps = 76/389 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFG---------SLRKLLDSFDADIICFQETKLRRQELKSDLVM 51
+++V++N+NG+R + +LR++LD DADI+C QETK+ R L L +
Sbjct: 2 LRVVSWNINGIRSPLQGLACQEPSSCPTALRRVLDELDADIVCLQETKVTRDVLTEPLAI 61
Query: 52 ADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKI 111
+GY S + + R+GYSGVATFC+ + A PVAAEEG +G+ T I
Sbjct: 62 VEGYNS----YFSFSRSRSGYSGVATFCK--------DSATPVAAEEGLSGVFATLNGDI 109
Query: 112 --MEGLEDFSKDELLKIDSEGRCVITDH---------GHFILFNVYGPRADSEDTVRIQF 160
+++F+++EL +DSEGR ++T H L NVY P AD R+ F
Sbjct: 110 GCYGNMDEFTQEELRVLDSEGRALLTQHKIRTLEGKEKTLTLINVYCPHADPGKPERLTF 169
Query: 161 KLQFF---HKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAGPD--FAKNEFRIWFRSMLV 215
K++F+ R E LL G + ++GDLN A ID CDA F ++ R W +L
Sbjct: 170 KMRFYRLLQMRAEALLAAGSHVIILGDLNTAHRPIDHCDASSLECFEEDPGRKWMDGLLS 229
Query: 216 ESG-------GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKH 268
G G F D +R HP+++ A+TCW +GA NYG+R+D++L
Sbjct: 230 NPGDEAGPHIGLFMDSYRYLHPKQQRAFTCWSVVSGARHLNYGSRLDYVL---------- 279
Query: 269 DLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIPQ 328
+ ++ID + + + GSDH PV L V +P
Sbjct: 280 ------------GDRALVIDTFQ---------ASFLLPEVMGSDHCPVGAVL-NVSCVPA 317
Query: 329 HSTPSLASRYLPIIRGVQQTLVSVLMKRE 357
P+L +R+LP G Q ++ L+ E
Sbjct: 318 KQCPALCTRFLPEFAGTQLKILRFLVPLE 346
Score = 78.2 bits (191), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 73/174 (41%), Gaps = 33/174 (18%)
Query: 447 FHVDRARKKAKKSQLGQLSLKSFFHKRSNVSHDDNNSITDTSLNVNNSVTDTSLSQEEVP 506
H R RK + Q +L S+F S++S + L V T + +
Sbjct: 373 MHSTRLRKSQGGPKRKQKNLMSYFQPSSSLSQTSGVELPTLPL-VGPLTTPKTAEEVATA 431
Query: 507 ESHHHSNKIPVTDYSCSVHELHGVNSSVCSHDQDEKKGKRFLDKERNNVALLEWRRIQQL 566
NK+P +DEK ER W+ +
Sbjct: 432 TVLEEKNKVP--------------------ESKDEKG-------ERTAF----WKSMLS- 459
Query: 567 METSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
+ +PLC GH+EPCV R VKK GP FGR+F++CAR GP S+P + C +F W+
Sbjct: 460 GPSPMPLCGGHREPCVMRTVKKTGPNFGRQFYMCARPRGPPSDPSSRCNFFLWS 513
>sp|P87175|APN2_SCHPO DNA-(apurinic or apyrimidinic site) lyase 2 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=apn2 PE=1 SV=1
Length = 523
Score = 145 bits (367), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 110/349 (31%), Positives = 164/349 (46%), Gaps = 72/349 (20%)
Query: 1 MKIVTYNVNGLRQRVSQF-----GSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGY 55
M+I+++NVNG++ + F S +++ AD+IC QE K+++ + +G+
Sbjct: 1 MRILSWNVNGIQNPFNYFPWNKKNSYKEIFQELQADVICVQELKMQKDSFPQQYAVVEGF 60
Query: 56 ESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIM--- 112
+S+F T K R GYSGV + + +VA+PV AEEG TG+L G K
Sbjct: 61 DSYF----TFPKIRKGYSGVGFYVK-------KDVAIPVKAEEGITGILPVRGQKYSYSE 109
Query: 113 ----EGLEDFSKDELLK----IDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQF 164
E + F KD K IDSEGRC++ D FIL VY P E+ R++++ F
Sbjct: 110 APEHEKIGFFPKDIDRKTANWIDSEGRCILLDFQMFILIGVYCPVNSGEN--RLEYRRAF 167
Query: 165 F---HKRWEFLLCQG-RRIFVVGDLNIAPAAIDRCDAGPDFAKN------EFRIWFRSML 214
+ +R E L+ +G R+I +VGD+NI ID D ++ E R W R +L
Sbjct: 168 YKALRERIERLIKEGNRKIILVGDVNILCNPIDTADQKDIIRESLIPSIMESRQWIRDLL 227
Query: 215 VESGGSFF-DVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSH 273
+ S D+ R +HP R+ +TCW + NYGTRID+ L L
Sbjct: 228 LPSRLGLLLDIGRIQHPTRKGMFTCWNTRLNTRPTNYGTRIDYTLATPDLLP-------- 279
Query: 274 NFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGE 322
V + DI+ + + GSDH PVY+ L E
Sbjct: 280 -----WVQDADIMAE-------------------VMGSDHCPVYLDLKE 304
Score = 55.8 bits (133), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 560 WRRIQQLMETSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEG 605
W++I E + PLC+GHKEPC V+KPG +GR+F++CAR G
Sbjct: 446 WKQI--FSERAPPLCEGHKEPCKYLTVRKPGINYGRKFWICARPVG 489
>sp|P38207|APN2_YEAST DNA-(apurinic or apyrimidinic site) lyase 2 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=APN2 PE=1
SV=1
Length = 520
Score = 100 bits (248), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 116/434 (26%), Positives = 166/434 (38%), Gaps = 106/434 (24%)
Query: 1 MKIVTYNVNGLR-----QRVSQFG-SLRKLLDSFDADIICFQETKLRRQELKSDLVMADG 54
++ +T+NVNG+R Q SQ SLR + D F ADII FQE K + + S DG
Sbjct: 17 IRFLTFNVNGIRTFFHYQPFSQMNQSLRSVFDFFRADIITFQELKTEKLSI-SKWGRVDG 75
Query: 55 YESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVA-AEEGFTGLLETSGSK--- 110
+ SF S +T R GYSGV + R+ AL V AEEG TG L K
Sbjct: 76 FYSFISIPQT----RKGYSGVGCWIRIPEKNHPLYHALQVVKAEEGITGYLTIKNGKHSA 131
Query: 111 ------IMEGL-------EDFSKDELLKIDSEGRCVITDHG-HFILFNVYGPRADSEDTV 156
+ +G+ D + L++DSEGRCV+ + ++ +VY P +
Sbjct: 132 ISYRNDVNQGIGGYDSLDPDLDEKSALELDSEGRCVMVELACGIVIISVYCPANSNSSEE 191
Query: 157 RIQFKLQFFH---KRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAGPDFA----------- 202
F+L+F +R L G++I ++GD+N+ ID D F+
Sbjct: 192 GEMFRLRFLKVLLRRVRNLDKIGKKIVLMGDVNVCRDLIDSADTLEQFSIPITDPMGGTK 251
Query: 203 ----------------KNEFRIWFRSMLVES-------GGSFFDVFRSKHPERR-EAYTC 238
R F +L +S G D R R + YT
Sbjct: 252 LEAQYRDKAIQFIINPDTPHRRIFNQILADSLLPDASKRGILIDTTRLIQTRNRLKMYTV 311
Query: 239 WPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNECDILIDYKRWKPGNAP 298
W NYG+RID IL S C + DIL D
Sbjct: 312 WNMLKNLRPSNYGSRIDFILV------------SLKLERC-IKAADILPD---------- 348
Query: 299 RWKGGMSTRLEGSDHAPVYMCLGEV-----PEIPQHSTPSLASRYLPIIRGVQQTLVSVL 353
+ GSDH PVY L + P Q P +RY +R ++ +
Sbjct: 349 ---------ILGSDHCPVYSDLDILDDRIEPGTTQVPIPKFEARYKYNLR--NHNVLEMF 397
Query: 354 MKREVAKQGKSCKF 367
K++ K+ K+
Sbjct: 398 AKKDTNKESNKQKY 411
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 572 PLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
PLC+ +E + P GR+F++C R+ G ++N E++CG+F+W
Sbjct: 474 PLCRHGEESMLKTSKTSANP--GRKFWICKRSRGDSNNTESSCGFFQWV 520
>sp|P45951|ARP_ARATH Apurinic endonuclease-redox protein OS=Arabidopsis thaliana GN=ARP
PE=2 SV=2
Length = 536
Score = 90.9 bits (224), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 137/301 (45%), Gaps = 67/301 (22%)
Query: 1 MKIVTYNVNGLRQ--RVSQFGSLRKLLDSFDADIICFQETKLRR---QELKSDLVMADGY 55
+K++T+NVNGLR + F +L +L + DI+C QETKL+ +E+K L+ DGY
Sbjct: 276 VKVMTWNVNGLRGLLKFESFSAL-QLAQRENFDILCLQETKLQVKDVEEIKKTLI--DGY 332
Query: 56 E-SFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEG 114
+ SF+SC+ + + GYSG A R+K P++ G TGL SG
Sbjct: 333 DHSFWSCSVS----KLGYSGTAIISRIK----------PLSVRYG-TGL---SGH----- 369
Query: 115 LEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFLLC 174
D+EGR V + F L N Y P +S D ++ +L + + W+ L
Sbjct: 370 ------------DTEGRIVTAEFDSFYLINTYVP--NSGDGLK---RLSYRIEEWDRTLS 412
Query: 175 -------QGRRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIW----FRSMLVESGGSFFD 223
+ + + + GDLN A ID + + F I F + L++ G F D
Sbjct: 413 NHIKELEKSKPVVLTGDLNCAHEEIDIFNPAGNKRSAGFTIEERQSFGANLLDKG--FVD 470
Query: 224 VFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNEC 283
FR +HP YT W G + N G R+D+ L + HD +++ +N
Sbjct: 471 TFRKQHPGVV-GYTYWGYRHGGRKTNKGWRLDYFLVSQSIAANVHD----SYILPDINGS 525
Query: 284 D 284
D
Sbjct: 526 D 526
>sp|P27864|RRP1_DROME Recombination repair protein 1 OS=Drosophila melanogaster GN=Rrp1
PE=1 SV=2
Length = 679
Score = 90.5 bits (223), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 129/295 (43%), Gaps = 54/295 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV GLR + + G +L+D + DI C QETK +L ++ GY ++
Sbjct: 427 LKICSWNVAGLRAWLKKDG--LQLIDLEEPDIFCLQETKCANDQLPEEVTRLPGYHPYWL 484
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
C GY+GVA + ++ +P+ E G G E+F
Sbjct: 485 CMPG------GYAGVAIYSKI----------MPIHVEYGI-------------GNEEF-- 513
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRA-----DSEDTVRIQFKLQFFHKRWEFLLCQ 175
D GR + ++ F L NVY P + + E +R + Q + K+ + L
Sbjct: 514 ------DDVGRMITAEYEKFYLINVYVPNSGRKLVNLEPRMRWEKLFQAYVKKLDAL--- 564
Query: 176 GRRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFRSMLVESGG-SFFDVFRSKHPERRE 234
+ + + GD+N++ ID + + F R + E G F D FR +P+R+
Sbjct: 565 -KPVVICGDMNVSHMPIDLENPKNNTKNAGFTQEERDKMTELLGLGFVDTFRHLYPDRKG 623
Query: 235 AYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQ--KHDLQSHNFVTCHVNECDILI 287
AYT W A N G R+D+ L + + + +H+++S + H C I I
Sbjct: 624 AYTFWTYMANARARNVGWRLDYCLVSERFVPKVVEHEIRSQCLGSDH---CPITI 675
>sp|P37454|EXOA_BACSU Exodeoxyribonuclease OS=Bacillus subtilis (strain 168) GN=exoA PE=1
SV=1
Length = 252
Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 123/278 (44%), Gaps = 56/278 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK++++NVNGLR + + L L + DADIIC QETK+ Q+ + DL D Y +++
Sbjct: 1 MKLISWNVNGLRAVMRKMDFLSYLKEE-DADIICLQETKI--QDGQVDLQPED-YHVYWN 56
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ GYSG A F + + P +V + EE
Sbjct: 57 YAV-----KKGYSGTAVFSK-QEPL---QVIYGIGVEEH--------------------- 86
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFLL--CQGRR 178
D EGR + + + + VY P + RI +++Q+ ++L Q +
Sbjct: 87 ------DQEGRVITLEFENVFVMTVYTPNS-RRGLERIDYRMQWEEALLSYILELDQKKP 139
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID R +AG F+ E + R +E+G F D FR +P+
Sbjct: 140 VILCGDLNVAHQEIDLKNPKANRNNAG--FSDQEREAFTR--FLEAG--FVDSFRHVYPD 193
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHD 269
AY+ W GA N G RID+ + + Q D
Sbjct: 194 LEGAYSWWSYRAGARDRNIGWRIDYFVVSESLKEQIED 231
>sp|A0MTA1|APEX1_DANRE DNA-(apurinic or apyrimidinic site) lyase OS=Danio rerio GN=apex1
PE=1 SV=1
Length = 310
Score = 79.7 bits (195), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 116/278 (41%), Gaps = 62/278 (22%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MKI ++NV+GLR V + G + D DI+C QETK + L +D+ G +
Sbjct: 55 MKITSWNVDGLRAWVKKNG--LDWVRKEDPDILCLQETKCAEKALPADIT---GMPEYPH 109
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ + GYSGVA C+ + V + EE
Sbjct: 110 KYWAGSEDKEGYSGVAMLCKTE----PLNVTYGIGKEEH--------------------- 144
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEF----LLC-- 174
D EGR + + F L Y P A S VR+ ++ K W+ LC
Sbjct: 145 ------DKEGRVITAEFPDFFLVTAYVPNA-SRGLVRLDYR-----KTWDVDFRAYLCGL 192
Query: 175 QGRRIFVV-GDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFR 226
R+ V+ GDLN+A ID R +AG F E R F + L+E+G F D FR
Sbjct: 193 DARKPLVLCGDLNVAHQEIDLKNPKGNRKNAG--FTPEE-REGF-TQLLEAG--FTDSFR 246
Query: 227 SKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCL 264
+P++ AYT W A N G R+D+ + + L
Sbjct: 247 ELYPDQAYAYTFWTYMMNARSKNVGWRLDYFVLSSALL 284
>sp|P09030|EX3_ECOLI Exodeoxyribonuclease III OS=Escherichia coli (strain K12) GN=xthA
PE=1 SV=4
Length = 268
Score = 72.8 bits (177), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 114/275 (41%), Gaps = 50/275 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + P+A GF G E + +I
Sbjct: 58 -------GQKGHYGVALLTK----------ETPIAVRRGFPGDDEEAQRRI--------- 91
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHKRWEFLLCQGRR- 178
I +E ++ G+ + N Y P+ +S D ++ K QF+ +L + +R
Sbjct: 92 -----IMAEIPSLL---GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRD 143
Query: 179 --IFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFRSMLVES--------GGSFFDVFRSK 228
+ ++GD+NI+P +D G + K R S L E D FR
Sbjct: 144 NPVLIMGDMNISPTDLD-IGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFRHA 202
Query: 229 HPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 263
+P+ + ++ + + N G RID +L + P
Sbjct: 203 NPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPL 237
>sp|P43138|APEX1_RAT DNA-(apurinic or apyrimidinic site) lyase OS=Rattus norvegicus
GN=Apex1 PE=1 SV=2
Length = 317
Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 112/269 (41%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 61 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLTHQY- 117
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 118 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 150
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + FIL Y P A VR++++ ++ +FL L +
Sbjct: 151 ------DQEGRVIVAEFESFILVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKDLASRKP 203
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F ML D FR +P
Sbjct: 204 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGEML--QAVPLADSFRHLYPN 258
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 259 TAYAYTFWTYMMNARSKNVGWRLDYFLLS 287
>sp|P51173|APEA_DICDI DNA-(apurinic or apyrimidinic site) lyase OS=Dictyostelium
discoideum GN=apeA PE=2 SV=2
Length = 361
Score = 70.9 bits (172), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 111/270 (41%), Gaps = 55/270 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MKI+++NV G + +S+ + ++ + D++C QETK+ +K D M GYE F
Sbjct: 105 MKIISWNVAGFKSVLSK--GFTEYVEKENPDVLCLQETKINPSNIKKDQ-MPKGYEYHFI 161
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ + G+ G + K P A G
Sbjct: 162 -----EADQKGHHGTGVLTKKK----------PNAITFGIG------------------- 187
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWE-----FL--L 173
+ K D+EGR + ++ F + N Y P A + R+ +++ K W+ +L L
Sbjct: 188 --IAKHDNEGRVITLEYDQFYIVNTYIPNAGTRGLQRLDYRI----KEWDVDFQAYLEKL 241
Query: 174 CQGRRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFR---SMLVESGGSFFDVFRSKHP 230
+ I GDLN+A ID + + F I R S +E G + D +R +P
Sbjct: 242 NATKPIIWCGDLNVAHTEIDLKNPKTNKKSAGFTIEERTSFSNFLEKG--YVDSYRHFNP 299
Query: 231 ERREAYTCWPSNTGAEQFNYGTRIDHILCA 260
+ +YT W G N G R+D+ + +
Sbjct: 300 GKEGSYTFWSYLGGGRSKNVGWRLDYFVVS 329
>sp|P23196|APEX1_BOVIN DNA-(apurinic or apyrimidinic site) lyase OS=Bos taurus GN=APEX1
PE=1 SV=2
Length = 318
Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 113/273 (41%), Gaps = 51/273 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L +L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPVELQELSGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ ++ F+L Y P A VR++++ ++ +FL L +
Sbjct: 152 ------DQEGRVIVAEYDAFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLTDSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCAGPCL 264
AYT W A N G R+D+ L + L
Sbjct: 260 TAYAYTFWTYMMNARSKNVGWRLDYFLLSQSVL 292
>sp|P0A1A9|EX3_SALTY Exodeoxyribonuclease III OS=Salmonella typhimurium (strain LT2 /
SGSC1412 / ATCC 700720) GN=xthA PE=3 SV=1
Length = 268
Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 118/282 (41%), Gaps = 64/282 (22%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDEMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + A P++ GF E + +I+
Sbjct: 58 -------GQKGHYGVALLTK----------ATPISVRRGFPDDGEEAQRRII-------- 92
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHKRWEFLLCQGR-- 177
+ +I S G+ + N Y P+ +S D ++ K QF+ +L + +
Sbjct: 93 --MAEIPSP-------LGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD 143
Query: 178 -RIFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGSF 221
+ ++GD+NI+P +D +C P E R W S L++ G
Sbjct: 144 NPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLP-----EEREWM-SRLLKWG--L 195
Query: 222 FDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 263
D FR +P+ + ++ + + N G RID +L + P
Sbjct: 196 VDTFRQANPQTMDKFSWFDYRSKGFVDNRGLRIDLLLASAPL 237
>sp|P0A1B0|EX3_SALTI Exodeoxyribonuclease III OS=Salmonella typhi GN=xthA PE=3 SV=1
Length = 268
Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 118/282 (41%), Gaps = 64/282 (22%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDEMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + A P++ GF E + +I+
Sbjct: 58 -------GQKGHYGVALLTK----------ATPISVRRGFPDDGEEAQRRII-------- 92
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHKRWEFLLCQGR-- 177
+ +I S G+ + N Y P+ +S D ++ K QF+ +L + +
Sbjct: 93 --MAEIPSP-------LGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD 143
Query: 178 -RIFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGSF 221
+ ++GD+NI+P +D +C P E R W S L++ G
Sbjct: 144 NPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLP-----EEREWM-SRLLKWG--L 195
Query: 222 FDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 263
D FR +P+ + ++ + + N G RID +L + P
Sbjct: 196 VDTFRQANPQTMDKFSWFDYRSKGFVDNRGLRIDLLLASAPL 237
>sp|P27695|APEX1_HUMAN DNA-(apurinic or apyrimidinic site) lyase OS=Homo sapiens GN=APEX1
PE=1 SV=2
Length = 318
Score = 70.5 bits (171), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P K+ G+ D
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL------------------------KVSYGIGDEEH 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D+ EGR ++ + F+L Y P A VR++++ ++ +FL L +
Sbjct: 152 DQ------EGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 260 TPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|P28352|APEX1_MOUSE DNA-(apurinic or apyrimidinic site) lyase OS=Mus musculus GN=Apex1
PE=1 SV=2
Length = 317
Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 61 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLTHQY- 117
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 118 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 150
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + F+L Y P A VR++++ ++ +FL L +
Sbjct: 151 ------DQEGRVIVAEFESFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKDLASRKP 203
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 204 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 258
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 259 TAYAYTFWTYMMNARSKNVGWRLDYFLLS 287
>sp|A2T7I6|APEX1_PONPY DNA-(apurinic or apyrimidinic site) lyase OS=Pongo pygmaeus
GN=APEX1 PE=3 SV=1
Length = 318
Score = 69.7 bits (169), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 129/330 (39%), Gaps = 83/330 (25%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + F+L Y P A VR++++ ++ FL L +
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRRFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNECDILIDYKR 291
AYT W A N G R+D+ L SH+ +T L D K
Sbjct: 260 TPYAYTFWTYMMNARSKNVGWRLDYFLL------------SHSLLTA-------LCDSK- 299
Query: 292 WKPGNAPRWKGGMSTRLEGSDHAPVYMCLG 321
+ ++ GSDH P+ + L
Sbjct: 300 ------------IRSKALGSDHCPITLYLA 317
>sp|A2T6Y4|APEX1_PANTR DNA-(apurinic or apyrimidinic site) lyase OS=Pan troglodytes
GN=APEX1 PE=3 SV=1
Length = 318
Score = 69.7 bits (169), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + F+L Y P A VR++++ ++ +FL L +
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 260 TPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|A1YFZ3|APEX1_PANPA DNA-(apurinic or apyrimidinic site) lyase OS=Pan paniscus GN=APEX1
PE=3 SV=1
Length = 318
Score = 69.7 bits (169), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + F+L Y P A VR++++ ++ +FL L +
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRKFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 260 TPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|A1YES6|APEX1_GORGO DNA-(apurinic or apyrimidinic site) lyase OS=Gorilla gorilla
gorilla GN=APEX1 PE=3 SV=1
Length = 318
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 110/269 (40%), Gaps = 51/269 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSYQYW 119
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
++ + GYSGV R + P +V+ + EE
Sbjct: 120 ---SAPXXKEGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFL--LCQGRR 178
D EGR ++ + F+L Y P A VR++++ ++ FL L +
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRRFLKGLASRKP 204
Query: 179 IFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHPE 231
+ + GDLN+A ID + +AG F E R F +L D FR +P
Sbjct: 205 LVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHLYPN 259
Query: 232 RREAYTCWPSNTGAEQFNYGTRIDHILCA 260
AYT W A N G R+D+ L +
Sbjct: 260 TPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|P44318|EX3_HAEIN Exodeoxyribonuclease III OS=Haemophilus influenzae (strain ATCC
51907 / DSM 11121 / KW20 / Rd) GN=xthA PE=3 SV=1
Length = 267
Score = 60.8 bits (146), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 113/280 (40%), Gaps = 66/280 (23%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK +++N+NGLR R Q L +++ + D+I QE K+ + ++ GY F
Sbjct: 1 MKFISFNINGLRARPHQ---LEAIIEKYQPDVIGLQEIKVADEAFPYEITENLGYHVFHH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK-IMEGLEDFS 119
G+ G+ GVA + + P GF E + + IM LE
Sbjct: 58 -------GQKGHYGVALLTKQE----------PKVIRRGFPTDNEDAQKRIIMADLE--- 97
Query: 120 KDELLKIDSEGRCVITDHGHFILFNVYGPRADSE-DTVRIQFKLQFFHKRWEFLLCQGRR 178
T+ G + N Y P+ +S + K +F+ ++L + +
Sbjct: 98 ---------------TEFGLLTVINGYFPQGESRAHETKFPAKEKFYADLQQYLEKEHDK 142
Query: 179 ---IFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGS 220
I ++GD+NI+P+ +D +C P E R W++ L + G
Sbjct: 143 SNPILIMGDMNISPSDLDIGIGDENRKRWLRTGKCSFLP-----EERAWYQR-LYDYGLE 196
Query: 221 FFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCA 260
D FR +P + ++ + + N G RIDHIL +
Sbjct: 197 --DSFRKLNPTANDKFSWFDYRSKGFDDNRGLRIDHILVS 234
>sp|P0A2X4|EXOA_STRR6 Exodeoxyribonuclease OS=Streptococcus pneumoniae (strain ATCC
BAA-255 / R6) GN=exoA PE=3 SV=1
Length = 275
Score = 60.8 bits (146), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 116/282 (41%), Gaps = 59/282 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKL-------LDSFDADIICFQETKL-------RRQELK 46
MK++++N++ L ++ + KL L + +ADII QETKL + E+
Sbjct: 1 MKLISWNIDSLNAALTSDSARAKLSQEVLQTLVAENADIIAIQETKLSAKGPTKKHVEIL 60
Query: 47 SDLVMADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLET 106
+L GYE+ + ++ + R GY+G T K + T ++ P E
Sbjct: 61 EELF--PGYENTWRSSQ--EPARKGYAG--TMFLYKKELTPT-ISFP-----------EI 102
Query: 107 SGSKIMEGLEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFH 166
M D EGR + + F + VY P A + R++ + +
Sbjct: 103 GAPSTM--------------DLEGRIITLEFDAFFVTQVYTPNA-GDGLKRLEERQVWDA 147
Query: 167 KRWEFL--LCQGRRIFVVGDLNIAPAAIDRCDAG-----PDFAKNEFRIWFRSMLVESGG 219
K E+L L + + + GD N+A ID + P F E R F ++L
Sbjct: 148 KYAEYLAELDKEKPVLATGDYNVAHNEIDLANPASNRRSPGFTDEE-RAGFTNLL---AT 203
Query: 220 SFFDVFRSKHPERREAYTCWPSNTGAEQF-NYGTRIDHILCA 260
F D FR H + E YT W + + N G RID+ L +
Sbjct: 204 GFTDTFRHVHGDVPERYTWWAQRSKTSKINNTGWRIDYWLTS 245
>sp|P0A2X3|EXOA_STRPN Exodeoxyribonuclease OS=Streptococcus pneumoniae serotype 4 (strain
ATCC BAA-334 / TIGR4) GN=exoA PE=3 SV=1
Length = 275
Score = 60.8 bits (146), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 116/282 (41%), Gaps = 59/282 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKL-------LDSFDADIICFQETKL-------RRQELK 46
MK++++N++ L ++ + KL L + +ADII QETKL + E+
Sbjct: 1 MKLISWNIDSLNAALTSDSARAKLSQEVLQTLVAENADIIAIQETKLSAKGPTKKHVEIL 60
Query: 47 SDLVMADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLET 106
+L GYE+ + ++ + R GY+G T K + T ++ P E
Sbjct: 61 EELF--PGYENTWRSSQ--EPARKGYAG--TMFLYKKELTPT-ISFP-----------EI 102
Query: 107 SGSKIMEGLEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFH 166
M D EGR + + F + VY P A + R++ + +
Sbjct: 103 GAPSTM--------------DLEGRIITLEFDAFFVTQVYTPNA-GDGLKRLEERQVWDA 147
Query: 167 KRWEFL--LCQGRRIFVVGDLNIAPAAIDRCDAG-----PDFAKNEFRIWFRSMLVESGG 219
K E+L L + + + GD N+A ID + P F E R F ++L
Sbjct: 148 KYAEYLAELDKEKPVLATGDYNVAHNEIDLANPASNRRSPGFTDEE-RAGFTNLL---AT 203
Query: 220 SFFDVFRSKHPERREAYTCWPSNTGAEQF-NYGTRIDHILCA 260
F D FR H + E YT W + + N G RID+ L +
Sbjct: 204 GFTDTFRHVHGDVPERYTWWAQRSKTSKINNTGWRIDYWLTS 245
>sp|Q8K203|NEIL3_MOUSE Endonuclease 8-like 3 OS=Mus musculus GN=Neil3 PE=1 SV=1
Length = 606
Score = 50.1 bits (118), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 29/49 (59%), Gaps = 6/49 (12%)
Query: 572 PLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
PLCK H CV RVV+K G GR+F+ C+ G A CG+F+WA
Sbjct: 506 PLCKMHHRRCVLRVVRKDGENKGRQFYACSLPRG------AQCGFFEWA 548
Score = 38.1 bits (87), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 62/162 (38%), Gaps = 26/162 (16%)
Query: 462 GQLSLKSFFHKRSNVSHDDNNSITDTSLNVNNSVTDTSLSQEEVPESHHHSNKIPVTDYS 521
Q L S HK+ +H + + N+ ++++ L H S+ P+
Sbjct: 456 AQSKLFSSAHKKFKPAHTSATELK----SYNSGLSNSELQTNRTRGHHSKSDGSPL---- 507
Query: 522 CSVHELHGVNSSVCSHDQDEKKGKRFLD---KERNNVALLEWRRIQQLMETSIPLCKGHK 578
C +H V V E KG++F EW + S P C+ H
Sbjct: 508 CKMHHRRCVLRVV--RKDGENKGRQFYACSLPRGAQCGFFEW------ADLSFPFCR-HG 558
Query: 579 EPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
+ + + V K GP G+ FFVC + + C +F+WA
Sbjct: 559 KRSIMKTVLKIGPNNGKNFFVCPLEK------KKQCNFFQWA 594
>sp|Q3MHN7|NEIL3_BOVIN Endonuclease 8-like 3 OS=Bos taurus GN=NEIL3 PE=2 SV=1
Length = 606
Score = 49.7 bits (117), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 566 LMETSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
L+ P C H PC RVV+K G GR F+ C A EA CG+F+WA
Sbjct: 500 LLNAGSPRCSKHGRPCALRVVRKSGENKGRHFYACPLAR------EAQCGFFEWA 548
Score = 38.5 bits (88), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 36/84 (42%), Gaps = 18/84 (21%)
Query: 541 EKKGKRF----LDKERNNVALLEWRRIQQLMETSIPLCKGHKEPCVARVVKKPGPTFGRR 596
E KG+ F L +E EW + S P C H + + R V K GP G+
Sbjct: 525 ENKGRHFYACPLAREAQ-CGFFEW------ADLSFPFC-NHGKRSIMRTVLKIGPNNGKN 576
Query: 597 FFVCARAEGPASNPEANCGYFKWA 620
FFVC + E C +F+WA
Sbjct: 577 FFVCPLGK------EKQCNFFQWA 594
>sp|Q8TAT5|NEIL3_HUMAN Endonuclease 8-like 3 OS=Homo sapiens GN=NEIL3 PE=1 SV=3
Length = 605
Score = 43.5 bits (101), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 6/49 (12%)
Query: 572 PLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
P C H C+ RVV K G GR+F+ C EA CG+F+WA
Sbjct: 505 PRCSKHNRLCILRVVGKDGENKGRQFYACPLPR------EAQCGFFEWA 547
Score = 37.0 bits (84), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 18/84 (21%)
Query: 541 EKKGKRF----LDKERNNVALLEWRRIQQLMETSIPLCKGHKEPCVARVVKKPGPTFGRR 596
E KG++F L +E EW + S P C H + + V K GP G+
Sbjct: 524 ENKGRQFYACPLPREAQ-CGFFEW------ADLSFPFC-NHGKRSTMKTVLKIGPNNGKN 575
Query: 597 FFVCARAEGPASNPEANCGYFKWA 620
FFVC + E C +F+WA
Sbjct: 576 FFVCPLGK------EKQCNFFQWA 593
>sp|Q6ZU11|YD002_HUMAN Uncharacterized protein FLJ44066 OS=Homo sapiens PE=2 SV=1
Length = 926
Score = 40.0 bits (92), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 568 ETSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKW 619
E ++P C H +P +VKK GP GR F+ C +GP ++ C +FKW
Sbjct: 165 ENNVPSCH-HSQPAKLVMVKKEGPNKGRLFYTC---DGPKAD---RCKFFKW 209
>sp|O70157|TOP3A_MOUSE DNA topoisomerase 3-alpha OS=Mus musculus GN=Top3a PE=1 SV=1
Length = 1003
Score = 39.3 bits (90), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 6/42 (14%)
Query: 579 EPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
+P V R V+K GP GR+F CA+ E CG+F+W
Sbjct: 903 QPAVTRTVQKDGPNKGRQFHTCAKPR------EQQCGFFQWV 938
>sp|Q13472|TOP3A_HUMAN DNA topoisomerase 3-alpha OS=Homo sapiens GN=TOP3A PE=1 SV=1
Length = 1001
Score = 38.9 bits (89), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 6/42 (14%)
Query: 579 EPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
+P V R V+K GP GR+F CA+ E CG+F+W
Sbjct: 901 QPSVTRTVQKDGPNKGRQFHTCAKPR------EQQCGFFQWV 936
>sp|Q9NG98|TOP3A_DROME DNA topoisomerase 3-alpha OS=Drosophila melanogaster GN=Top3alpha
PE=2 SV=2
Length = 1250
Score = 37.4 bits (85), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 8/53 (15%)
Query: 568 ETSIPLCKGHKEPCVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
E+ LC G ++P V+K GP GR ++ C + + C +F+WA
Sbjct: 1029 ESETVLCTGCQQPARQNTVRKNGPNLGRLYYKCPKPD--------ECNFFQWA 1073
>sp|O61660|TOP3_CAEEL DNA topoisomerase 3 OS=Caenorhabditis elegans GN=top-3 PE=2 SV=1
Length = 759
Score = 35.8 bits (81), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 581 CVARVVKKPGPTFGRRFFVCARAEGPASNPEANCGYFKWA 620
V +VV+K GP G++F+ C+ P ++ E C +FKWA
Sbjct: 724 AVTKVVQKEGPNKGKKFYTCSL---PYTSSE-KCNFFKWA 759
>sp|B0TF76|QUEA_HELMI S-adenosylmethionine:tRNA ribosyltransferase-isomerase
OS=Heliobacterium modesticaldum (strain ATCC 51547 /
Ice1) GN=queA PE=3 SV=1
Length = 344
Score = 34.3 bits (77), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 47/102 (46%), Gaps = 5/102 (4%)
Query: 122 ELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHKRWEFLLCQGRR--- 178
+L+ I G ++ + I ++G + DS+ TV I RWE L+ GRR
Sbjct: 45 DLVHILHPGDLLVVNRTRVIPARLFGKKRDSDVTVEIVLLTPMGDDRWEVLVRPGRRLKP 104
Query: 179 -IFV-VGDLNIAPAAIDRCDAGPDFAKNEFRIWFRSMLVESG 218
+FV +G+ +A ++ D G + + F +++ E G
Sbjct: 105 GVFVDLGEGRLAAEIVETTDFGGRVVRFHYSGDFDTLIDEIG 146
>sp|A8K979|ERI2_HUMAN ERI1 exoribonuclease 2 OS=Homo sapiens GN=ERI2 PE=2 SV=2
Length = 691
Score = 34.3 bits (77), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 30/88 (34%)
Query: 560 WRRIQQLMETSI-------------PLCKGHKEPCVAR----VVKKPGPTFGRRFFVCAR 602
WRR+ ++ +++ PLCK C R VV GP G+ F+ C
Sbjct: 570 WRRLPSILTSTVNLQEPWKSGKMTPPLCK-----CGRRSKRLVVSNNGPNHGKVFYCC-- 622
Query: 603 AEGPASNPEAN---CGYFKWAFSKSKQK 627
P + N CGYFKW + K++
Sbjct: 623 ---PIGKYQENRKCCGYFKWEQTLQKER 647
>sp|O35602|RX_MOUSE Retinal homeobox protein Rx OS=Mus musculus GN=Rax PE=2 SV=2
Length = 342
Score = 34.3 bits (77), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 9/102 (8%)
Query: 479 DDNNSITDTSLNVNNSVTDTSLSQEEVPESHHHSNKIPVTDYSCSVHELHGVNSSVCSHD 538
+ + L+V + D+ LS+EE P+ H N+ T Y +HEL D
Sbjct: 105 EQGEARPSPGLSVGPAAGDSKLSEEEPPKKKHRRNRTTFTTY--QLHELERAFEKSHYPD 162
Query: 539 ---QDEKKGKRFLDKERNNVAL----LEWRRIQQLMETSIPL 573
++E GK L + R V +WRR ++L +S+ L
Sbjct: 163 VYSREELAGKVNLPEVRVQVWFQNRRAKWRRQEKLEVSSMKL 204
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.133 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 238,969,620
Number of Sequences: 539616
Number of extensions: 10202945
Number of successful extensions: 27689
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 92
Number of HSP's that attempted gapping in prelim test: 27182
Number of HSP's gapped (non-prelim): 450
length of query: 627
length of database: 191,569,459
effective HSP length: 124
effective length of query: 503
effective length of database: 124,657,075
effective search space: 62702508725
effective search space used: 62702508725
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)