BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011576
(482 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9UBZ4|APEX2_HUMAN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Homo sapiens
GN=APEX2 PE=1 SV=1
Length = 518
Score = 199 bits (507), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 192/390 (49%), Gaps = 74/390 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFG----------SLRKLLDSFDADIICFQETKLRRQELKSDLV 50
+++V++N+NG+R+ + ++ ++LD DADI+C QETK+ R L L
Sbjct: 2 LRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPLA 61
Query: 51 MADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK 110
+ +GY S+FS +R R+GYSGVATFC+ + A PVAAEEG +GL T
Sbjct: 62 IVEGYNSYFSFSRN----RSGYSGVATFCK--------DNATPVAAEEGLSGLFATQNGD 109
Query: 111 I--MEGLEDFSKDELLKIDSEGRCVITDHG---------HFILFNVYGPRADSEDTVRIQ 159
+ +++F+++EL +DSEGR ++T H L NVY P AD R+
Sbjct: 110 VGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLV 169
Query: 160 FKLQFFHVLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAG--PDFAKNEFRIWFRSML 217
FK++F+ +LQ R E LL G + ++GDLN A ID DA F ++ R W S+L
Sbjct: 170 FKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLL 229
Query: 218 VESG-------GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQK 270
G G F D +R P++ A+TCW + TGA NYG+R+D++L
Sbjct: 230 SNLGCQSASHVGPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVL--------- 280
Query: 271 HDLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIP 330
+ ++ID + + + GSDH PV L V +P
Sbjct: 281 -------------GDRTLVIDTFQ---------ASFLLPEVMGSDHCPVGAVLS-VSSVP 317
Query: 331 QHSTPSLASRYLPIIRGVQQTLVSVLMKRE 360
P L +R+LP G Q ++ L+ E
Sbjct: 318 AKQCPPLCTRFLPEFAGTQLKILRFLVPLE 347
>sp|Q5E9N9|APEX2_BOVIN DNA-(apurinic or apyrimidinic site) lyase 2 OS=Bos taurus GN=APEX2
PE=2 SV=1
Length = 514
Score = 196 bits (498), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 127/387 (32%), Positives = 192/387 (49%), Gaps = 74/387 (19%)
Query: 1 MKIVTYNVNGLRQRVSQF----------GSLRKLLDSFDADIICFQETKLRRQELKSDLV 50
+++V++N+NG+R + ++ ++LD DADI+C QETK+ R L L
Sbjct: 2 LRLVSWNINGIRSPLQGVRCEEPSSCSAMAMGRILDKLDADIVCLQETKVTRDVLTEPLA 61
Query: 51 MADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK 110
+ +GY S+FS +R R+GYSGVATFC+ + A PVAAEEG +GLL T
Sbjct: 62 IIEGYNSYFSFSRN----RSGYSGVATFCK--------DSATPVAAEEGLSGLLSTQNGD 109
Query: 111 I--MEGLEDFSKDELLKIDSEGRCVITDHG---------HFILFNVYGPRADSEDTVRIQ 159
+ ++DF+++EL +DSEGR ++T H L NVY P AD R+
Sbjct: 110 VGCYGNMDDFTQEELRALDSEGRALLTQHKICTWEGKEKTLTLINVYCPHADPGKPERLT 169
Query: 160 FKLQFFHVLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDA--GPDFAKNEFRIWFRSML 217
FK++F+ +LQ R E LL G + ++GDLN A ID DA F ++ R W +L
Sbjct: 170 FKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPIDHWDAVNMECFEEDPGRKWMDGLL 229
Query: 218 ----VESG---GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQK 270
ESG G F D +R P+++ A+TCW + +GA NYG+R+D++L
Sbjct: 230 SNLGCESGSHMGPFIDSYRCFQPKQKGAFTCWSTVSGARHLNYGSRLDYVL--------- 280
Query: 271 HDLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIP 330
+ ++ID + + + GSDH PV L V +P
Sbjct: 281 -------------GDRTLVIDTFQ---------SSFLLPEVMGSDHCPVGAVL-SVSSVP 317
Query: 331 QHSTPSLASRYLPIIRGVQQTLVSVLM 357
P L + +LP G Q ++ L+
Sbjct: 318 AKQCPPLCTCFLPEFAGTQLKILRFLV 344
>sp|Q68G58|APEX2_MOUSE DNA-(apurinic or apyrimidinic site) lyase 2 OS=Mus musculus
GN=Apex2 PE=1 SV=1
Length = 516
Score = 193 bits (491), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/389 (32%), Positives = 191/389 (49%), Gaps = 73/389 (18%)
Query: 1 MKIVTYNVNGLRQRVSQFG---------SLRKLLDSFDADIICFQETKLRRQELKSDLVM 51
+++V++N+NG+R + +LR++LD DADI+C QETK+ R L L +
Sbjct: 2 LRVVSWNINGIRSPLQGLACQEPSSCPTALRRVLDELDADIVCLQETKVTRDVLTEPLAI 61
Query: 52 ADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKI 111
+GY S + + R+GYSGVATFC+ + A PVAAEEG +G+ T I
Sbjct: 62 VEGYNS----YFSFSRSRSGYSGVATFCK--------DSATPVAAEEGLSGVFATLNGDI 109
Query: 112 --MEGLEDFSKDELLKIDSEGRCVITDHG---------HFILFNVYGPRADSEDTVRIQF 160
+++F+++EL +DSEGR ++T H L NVY P AD R+ F
Sbjct: 110 GCYGNMDEFTQEELRVLDSEGRALLTQHKIRTLEGKEKTLTLINVYCPHADPGKPERLTF 169
Query: 161 KLQFFHVLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAGPD--FAKNEFRIWFRSMLV 218
K++F+ +LQ R E LL G + ++GDLN A ID CDA F ++ R W +L
Sbjct: 170 KMRFYRLLQMRAEALLAAGSHVIILGDLNTAHRPIDHCDASSLECFEEDPGRKWMDGLLS 229
Query: 219 ESG-------GSFFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKH 271
G G F D +R HP+++ A+TCW +GA NYG+R+D++L
Sbjct: 230 NPGDEAGPHIGLFMDSYRYLHPKQQRAFTCWSVVSGARHLNYGSRLDYVL---------- 279
Query: 272 DLQSHNFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGEVPEIPQ 331
+ ++ID + + + GSDH PV L V +P
Sbjct: 280 ------------GDRALVIDTFQ---------ASFLLPEVMGSDHCPVGAVL-NVSCVPA 317
Query: 332 HSTPSLASRYLPIIRGVQQTLVSVLMKRE 360
P+L +R+LP G Q ++ L+ E
Sbjct: 318 KQCPALCTRFLPEFAGTQLKILRFLVPLE 346
>sp|P87175|APN2_SCHPO DNA-(apurinic or apyrimidinic site) lyase 2 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=apn2 PE=1 SV=1
Length = 523
Score = 152 bits (383), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 166/349 (47%), Gaps = 69/349 (19%)
Query: 1 MKIVTYNVNGLRQRVSQF-----GSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGY 55
M+I+++NVNG++ + F S +++ AD+IC QE K+++ + +G+
Sbjct: 1 MRILSWNVNGIQNPFNYFPWNKKNSYKEIFQELQADVICVQELKMQKDSFPQQYAVVEGF 60
Query: 56 ESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKI---- 111
+S+F T K R GYSGV + + +VA+PV AEEG TG+L G K
Sbjct: 61 DSYF----TFPKIRKGYSGVGFYVK-------KDVAIPVKAEEGITGILPVRGQKYSYSE 109
Query: 112 ---MEGLEDFSKDELLK----IDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQF 164
E + F KD K IDSEGRC++ D FIL VY P E+ R++++ F
Sbjct: 110 APEHEKIGFFPKDIDRKTANWIDSEGRCILLDFQMFILIGVYCPVNSGEN--RLEYRRAF 167
Query: 165 FHVLQKRWEFLLCQG-RRIFVVGDLNIAPAAIDRCDAGPDFAKN------EFRIWFRSML 217
+ L++R E L+ +G R+I +VGD+NI ID D ++ E R W R +L
Sbjct: 168 YKALRERIERLIKEGNRKIILVGDVNILCNPIDTADQKDIIRESLIPSIMESRQWIRDLL 227
Query: 218 VESGGSFF-DVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSH 276
+ S D+ R +HP R+ +TCW + NYGTRID+ L L
Sbjct: 228 LPSRLGLLLDIGRIQHPTRKGMFTCWNTRLNTRPTNYGTRIDYTLATPDLLP-------- 279
Query: 277 NFVTCHVNECDILIDYKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLGE 325
V + DI+ + + GSDH PVY+ L E
Sbjct: 280 -----WVQDADIMAE-------------------VMGSDHCPVYLDLKE 304
>sp|P38207|APN2_YEAST DNA-(apurinic or apyrimidinic site) lyase 2 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=APN2 PE=1
SV=1
Length = 520
Score = 108 bits (269), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 118/434 (27%), Positives = 168/434 (38%), Gaps = 103/434 (23%)
Query: 1 MKIVTYNVNGLR-----QRVSQFG-SLRKLLDSFDADIICFQETKLRRQELKSDLVMADG 54
++ +T+NVNG+R Q SQ SLR + D F ADII FQE K + + S DG
Sbjct: 17 IRFLTFNVNGIRTFFHYQPFSQMNQSLRSVFDFFRADIITFQELKTEKLSI-SKWGRVDG 75
Query: 55 YESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVA-AEEGFTGLLETSGSK--- 110
+ SF S +T R GYSGV + R+ AL V AEEG TG L K
Sbjct: 76 FYSFISIPQT----RKGYSGVGCWIRIPEKNHPLYHALQVVKAEEGITGYLTIKNGKHSA 131
Query: 111 ------IMEGL-------EDFSKDELLKIDSEGRCVITDHG-HFILFNVYGPRADSEDTV 156
+ +G+ D + L++DSEGRCV+ + ++ +VY P +
Sbjct: 132 ISYRNDVNQGIGGYDSLDPDLDEKSALELDSEGRCVMVELACGIVIISVYCPANSNSSEE 191
Query: 157 RIQFKLQFFHVLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAGPDFA----------- 205
F+L+F VL +R L G++I ++GD+N+ ID D F+
Sbjct: 192 GEMFRLRFLKVLLRRVRNLDKIGKKIVLMGDVNVCRDLIDSADTLEQFSIPITDPMGGTK 251
Query: 206 ----------------KNEFRIWFRSMLVES-------GGSFFDVFRSKHPERR-EAYTC 241
R F +L +S G D R R + YT
Sbjct: 252 LEAQYRDKAIQFIINPDTPHRRIFNQILADSLLPDASKRGILIDTTRLIQTRNRLKMYTV 311
Query: 242 WPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNECDILIDYKRWKPGNAP 301
W NYG+RID IL S C + DIL D
Sbjct: 312 WNMLKNLRPSNYGSRIDFILV------------SLKLERC-IKAADILPD---------- 348
Query: 302 RWKGGMSTRLEGSDHAPVYMCLGEV-----PEIPQHSTPSLASRYLPIIRGVQQTLVSVL 356
+ GSDH PVY L + P Q P +RY +R ++ +
Sbjct: 349 ---------ILGSDHCPVYSDLDILDDRIEPGTTQVPIPKFEARYKYNLRN--HNVLEMF 397
Query: 357 MKREVAKQGKSCKF 370
K++ K+ K+
Sbjct: 398 AKKDTNKESNKQKY 411
>sp|P45951|ARP_ARATH Apurinic endonuclease-redox protein OS=Arabidopsis thaliana GN=ARP
PE=2 SV=2
Length = 536
Score = 89.7 bits (221), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 135/297 (45%), Gaps = 56/297 (18%)
Query: 1 MKIVTYNVNGLRQ--RVSQFGSLRKLLDSFDADIICFQETKLRR---QELKSDLVMADGY 55
+K++T+NVNGLR + F +L +L + DI+C QETKL+ +E+K L+ DGY
Sbjct: 276 VKVMTWNVNGLRGLLKFESFSAL-QLAQRENFDILCLQETKLQVKDVEEIKKTLI--DGY 332
Query: 56 E-SFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEG 114
+ SF+SC+ + + GYSG A R+K P++ G TGL SG
Sbjct: 333 DHSFWSCSVS----KLGYSGTAIISRIK----------PLSVRYG-TGL---SGH----- 369
Query: 115 LEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEF 174
D+EGR V + F L N Y P + + R+ ++++ +
Sbjct: 370 ------------DTEGRIVTAEFDSFYLINTYVPNS-GDGLKRLSYRIEEWDRTLSNHIK 416
Query: 175 LLCQGRRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIW----FRSMLVESGGSFFDVFRS 230
L + + + + GDLN A ID + + F I F + L++ G F D FR
Sbjct: 417 ELEKSKPVVLTGDLNCAHEEIDIFNPAGNKRSAGFTIEERQSFGANLLDKG--FVDTFRK 474
Query: 231 KHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNECD 287
+HP YT W G + N G R+D+ L + HD +++ +N D
Sbjct: 475 QHPGVV-GYTYWGYRHGGRKTNKGWRLDYFLVSQSIAANVHD----SYILPDINGSD 526
>sp|P27864|RRP1_DROME Recombination repair protein 1 OS=Drosophila melanogaster GN=Rrp1
PE=1 SV=2
Length = 679
Score = 87.4 bits (215), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 129/296 (43%), Gaps = 53/296 (17%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV GLR + + G +L+D + DI C QETK +L ++ GY ++
Sbjct: 427 LKICSWNVAGLRAWLKKDG--LQLIDLEEPDIFCLQETKCANDQLPEEVTRLPGYHPYWL 484
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
C GY+GVA + ++ +P+ E G G E+F
Sbjct: 485 CMPG------GYAGVAIYSKI----------MPIHVEYGI-------------GNEEF-- 513
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKL---QFFHVLQKRWEFLLC 177
D GR + ++ F L NVY P + V ++ ++ + F K+ + L
Sbjct: 514 ------DDVGRMITAEYEKFYLINVYVPNS-GRKLVNLEPRMRWEKLFQAYVKKLDAL-- 564
Query: 178 QGRRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFRSMLVESGG-SFFDVFRSKHPERR 236
+ + + GD+N++ ID + + F R + E G F D FR +P+R+
Sbjct: 565 --KPVVICGDMNVSHMPIDLENPKNNTKNAGFTQEERDKMTELLGLGFVDTFRHLYPDRK 622
Query: 237 EAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQ--KHDLQSHNFVTCHVNECDILI 290
AYT W A N G R+D+ L + + + +H+++S + H C I I
Sbjct: 623 GAYTFWTYMANARARNVGWRLDYCLVSERFVPKVVEHEIRSQCLGSDH---CPITI 675
>sp|P37454|EXOA_BACSU Exodeoxyribonuclease OS=Bacillus subtilis (strain 168) GN=exoA PE=1
SV=1
Length = 252
Score = 84.7 bits (208), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 123/279 (44%), Gaps = 55/279 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK++++NVNGLR + + L L + DADIIC QETK+ Q+ + DL D Y +++
Sbjct: 1 MKLISWNVNGLRAVMRKMDFLSYLKEE-DADIICLQETKI--QDGQVDLQPED-YHVYWN 56
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ GYSG A F + + P +V + EE
Sbjct: 57 YAV-----KKGYSGTAVFSK-QEPL---QVIYGIGVEEH--------------------- 86
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFLLCQGR 180
D EGR + + + + VY P + RI +++Q+ L + L Q +
Sbjct: 87 ------DQEGRVITLEFENVFVMTVYTPNS-RRGLERIDYRMQWEEALLS-YILELDQKK 138
Query: 181 RIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSKHP 233
+ + GDLN+A ID R +AG F+ E + R +E+G F D FR +P
Sbjct: 139 PVILCGDLNVAHQEIDLKNPKANRNNAG--FSDQEREAFTR--FLEAG--FVDSFRHVYP 192
Query: 234 ERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHD 272
+ AY+ W GA N G RID+ + + Q D
Sbjct: 193 DLEGAYSWWSYRAGARDRNIGWRIDYFVVSESLKEQIED 231
>sp|A0MTA1|APEX1_DANRE DNA-(apurinic or apyrimidinic site) lyase OS=Danio rerio GN=apex1
PE=1 SV=1
Length = 310
Score = 78.6 bits (192), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 116/281 (41%), Gaps = 65/281 (23%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MKI ++NV+GLR V + G + D DI+C QETK + L +D+ G +
Sbjct: 55 MKITSWNVDGLRAWVKKNG--LDWVRKEDPDILCLQETKCAEKALPADIT---GMPEYPH 109
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ + GYSGVA C+ + V + EE
Sbjct: 110 KYWAGSEDKEGYSGVAMLCKTE----PLNVTYGIGKEEH--------------------- 144
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEF----LL 176
D EGR + + F L Y P A S VR+ ++ K W+ L
Sbjct: 145 ------DKEGRVITAEFPDFFLVTAYVPNA-SRGLVRLDYR--------KTWDVDFRAYL 189
Query: 177 C--QGRRIFVV-GDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFD 226
C R+ V+ GDLN+A ID R +AG F E R F + L+E+G F D
Sbjct: 190 CGLDARKPLVLCGDLNVAHQEIDLKNPKGNRKNAG--FTPEE-REGF-TQLLEAG--FTD 243
Query: 227 VFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCL 267
FR +P++ AYT W A N G R+D+ + + L
Sbjct: 244 SFRELYPDQAYAYTFWTYMMNARSKNVGWRLDYFVLSSALL 284
>sp|P09030|EX3_ECOLI Exodeoxyribonuclease III OS=Escherichia coli (strain K12) GN=xthA
PE=1 SV=4
Length = 268
Score = 77.4 bits (189), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 114/275 (41%), Gaps = 47/275 (17%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + P+A GF G E + +I
Sbjct: 58 -------GQKGHYGVALLTK----------ETPIAVRRGFPGDDEEAQRRI--------- 91
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHVLQKRWEFLLCQG 179
I +E ++ G+ + N Y P+ +S D ++ K QF+ LQ E L +
Sbjct: 92 -----IMAEIPSLL---GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRD 143
Query: 180 RRIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFRSMLVES--------GGSFFDVFRSK 231
+ ++GD+NI+P +D G + K R S L E D FR
Sbjct: 144 NPVLIMGDMNISPTDLD-IGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFRHA 202
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 266
+P+ + ++ + + N G RID +L + P
Sbjct: 203 NPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPL 237
>sp|P0A1A9|EX3_SALTY Exodeoxyribonuclease III OS=Salmonella typhimurium (strain LT2 /
SGSC1412 / ATCC 700720) GN=xthA PE=3 SV=1
Length = 268
Score = 75.9 bits (185), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 118/282 (41%), Gaps = 61/282 (21%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDEMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + A P++ GF E + +I+
Sbjct: 58 -------GQKGHYGVALLTK----------ATPISVRRGFPDDGEEAQRRII-------- 92
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHVLQKRWEFLLCQG 179
+ +I S G+ + N Y P+ +S D ++ K QF+ LQ E L
Sbjct: 93 --MAEIPSP-------LGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD 143
Query: 180 RRIFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGSF 224
+ ++GD+NI+P +D +C P E R W S L++ G
Sbjct: 144 NPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLP-----EEREWM-SRLLKWG--L 195
Query: 225 FDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 266
D FR +P+ + ++ + + N G RID +L + P
Sbjct: 196 VDTFRQANPQTMDKFSWFDYRSKGFVDNRGLRIDLLLASAPL 237
>sp|P0A1B0|EX3_SALTI Exodeoxyribonuclease III OS=Salmonella typhi GN=xthA PE=3 SV=1
Length = 268
Score = 75.9 bits (185), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 118/282 (41%), Gaps = 61/282 (21%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK V++N+NGLR R Q L +++ D+I QETK+ + + V GY F+
Sbjct: 1 MKFVSFNINGLRARPHQ---LEAIVEKHQPDVIGLQETKVHDEMFPLEEVAKLGYNVFYH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
G+ G+ GVA + A P++ GF E + +I+
Sbjct: 58 -------GQKGHYGVALLTK----------ATPISVRRGFPDDGEEAQRRII-------- 92
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSED-TVRIQFKLQFFHVLQKRWEFLLCQG 179
+ +I S G+ + N Y P+ +S D ++ K QF+ LQ E L
Sbjct: 93 --MAEIPSP-------LGNITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLETELKCD 143
Query: 180 RRIFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGSF 224
+ ++GD+NI+P +D +C P E R W S L++ G
Sbjct: 144 NPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLP-----EEREWM-SRLLKWG--L 195
Query: 225 FDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPC 266
D FR +P+ + ++ + + N G RID +L + P
Sbjct: 196 VDTFRQANPQTMDKFSWFDYRSKGFVDNRGLRIDLLLASAPL 237
>sp|P51173|APEA_DICDI DNA-(apurinic or apyrimidinic site) lyase OS=Dictyostelium
discoideum GN=apeA PE=2 SV=2
Length = 361
Score = 74.3 bits (181), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 44/266 (16%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MKI+++NV G + +S+ + ++ + D++C QETK+ +K D M GYE F
Sbjct: 105 MKIISWNVAGFKSVLSK--GFTEYVEKENPDVLCLQETKINPSNIKKDQ-MPKGYEYHFI 161
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ + G+ G + K P A G
Sbjct: 162 -----EADQKGHHGTGVLTKKK----------PNAITFGIG------------------- 187
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFLLCQGR 180
+ K D+EGR + ++ F + N Y P A + R+ ++++ + V + + L +
Sbjct: 188 --IAKHDNEGRVITLEYDQFYIVNTYIPNAGTRGLQRLDYRIKEWDVDFQAYLEKLNATK 245
Query: 181 RIFVVGDLNIAPAAIDRCDAGPDFAKNEFRIWFR---SMLVESGGSFFDVFRSKHPERRE 237
I GDLN+A ID + + F I R S +E G + D +R +P +
Sbjct: 246 PIIWCGDLNVAHTEIDLKNPKTNKKSAGFTIEERTSFSNFLEKG--YVDSYRHFNPGKEG 303
Query: 238 AYTCWPSNTGAEQFNYGTRIDHILCA 263
+YT W G N G R+D+ + +
Sbjct: 304 SYTFWSYLGGGRSKNVGWRLDYFVVS 329
>sp|P43138|APEX1_RAT DNA-(apurinic or apyrimidinic site) lyase OS=Rattus norvegicus
GN=Apex1 PE=1 SV=2
Length = 317
Score = 71.6 bits (174), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 113/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 61 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLTHQY- 117
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 118 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 150
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + FIL Y P A VR++++ ++ +K FL L
Sbjct: 151 ------DQEGRVIVAEFESFILVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKDLAS 200
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F ML D FR
Sbjct: 201 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGEML--QAVPLADSFRHL 255
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 256 YPNTAYAYTFWTYMMNARSKNVGWRLDYFLLS 287
>sp|P23196|APEX1_BOVIN DNA-(apurinic or apyrimidinic site) lyase OS=Bos taurus GN=APEX1
PE=1 SV=2
Length = 318
Score = 70.1 bits (170), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 114/276 (41%), Gaps = 54/276 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L +L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPVELQELSGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ ++ F+L Y P A VR++++ ++ +K FL L
Sbjct: 152 ------DQEGRVIVAEYDAFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLTDSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCL 267
+P AYT W A N G R+D+ L + L
Sbjct: 257 YPNTAYAYTFWTYMMNARSKNVGWRLDYFLLSQSVL 292
>sp|P27695|APEX1_HUMAN DNA-(apurinic or apyrimidinic site) lyase OS=Homo sapiens GN=APEX1
PE=1 SV=2
Length = 318
Score = 70.1 bits (170), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 113/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P K+ G+ D
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL------------------------KVSYGIGDEEH 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D+ EGR ++ + F+L Y P A VR++++ ++ +K FL L
Sbjct: 152 DQ------EGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 257 YPNTPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|P28352|APEX1_MOUSE DNA-(apurinic or apyrimidinic site) lyase OS=Mus musculus GN=Apex1
PE=1 SV=2
Length = 317
Score = 69.3 bits (168), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 113/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 61 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLTHQY- 117
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 118 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 150
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + F+L Y P A VR++++ ++ +K FL L
Sbjct: 151 ------DQEGRVIVAEFESFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKDLAS 200
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 201 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 255
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 256 YPNTAYAYTFWTYMMNARSKNVGWRLDYFLLS 287
>sp|A2T6Y4|APEX1_PANTR DNA-(apurinic or apyrimidinic site) lyase OS=Pan troglodytes
GN=APEX1 PE=3 SV=1
Length = 318
Score = 69.3 bits (168), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 113/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + F+L Y P A VR++++ ++ +K FL L
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 257 YPNTPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|A1YFZ3|APEX1_PANPA DNA-(apurinic or apyrimidinic site) lyase OS=Pan paniscus GN=APEX1
PE=3 SV=1
Length = 318
Score = 69.3 bits (168), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 113/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + F+L Y P A VR++++ ++ +K FL L
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRK---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 257 YPNTPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|A2T7I6|APEX1_PONPY DNA-(apurinic or apyrimidinic site) lyase OS=Pongo pygmaeus
GN=APEX1 PE=3 SV=1
Length = 318
Score = 68.9 bits (167), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 131/333 (39%), Gaps = 86/333 (25%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQY- 118
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
+ SDK GYSGV R + P +V+ + EE
Sbjct: 119 WSAPSDK--EGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + F+L Y P A VR++++ ++ ++ FL L
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRR---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCAGPCLHQKHDLQSHNFVTCHVNECDILID 291
+P AYT W A N G R+D+ L SH+ +T L D
Sbjct: 257 YPNTPYAYTFWTYMMNARSKNVGWRLDYFLL------------SHSLLTA-------LCD 297
Query: 292 YKRWKPGNAPRWKGGMSTRLEGSDHAPVYMCLG 324
K + ++ GSDH P+ + L
Sbjct: 298 SK-------------IRSKALGSDHCPITLYLA 317
>sp|A1YES6|APEX1_GORGO DNA-(apurinic or apyrimidinic site) lyase OS=Gorilla gorilla
gorilla GN=APEX1 PE=3 SV=1
Length = 318
Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 112/272 (41%), Gaps = 54/272 (19%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
+KI ++NV+GLR + + G + DI+C QETK +L ++L G +
Sbjct: 62 LKICSWNVDGLRAWIKKKG--LDWVKEEAPDILCLQETKCSENKLPAELQELPGLSYQYW 119
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSKIMEGLEDFSK 120
++ + GYSGV R + P +V+ + EE
Sbjct: 120 ---SAPXXKEGYSGVGLLSR-QCPL---KVSYGIGEEEH--------------------- 151
Query: 121 DELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFHVLQKRWEFL--LCQ 178
D EGR ++ + F+L Y P A VR++++ ++ ++ FL L
Sbjct: 152 ------DQEGRVIVAEFDSFVLVTAYVPNA-GRGLVRLEYRQRWDEAFRR---FLKGLAS 201
Query: 179 GRRIFVVGDLNIAPAAID-------RCDAGPDFAKNEFRIWFRSMLVESGGSFFDVFRSK 231
+ + + GDLN+A ID + +AG F E R F +L D FR
Sbjct: 202 RKPLVLCGDLNVAHEEIDLRNPKGNKKNAG--FTPQE-RQGFGELL--QAVPLADSFRHL 256
Query: 232 HPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
+P AYT W A N G R+D+ L +
Sbjct: 257 YPNTPYAYTFWTYMMNARSKNVGWRLDYFLLS 288
>sp|P44318|EX3_HAEIN Exodeoxyribonuclease III OS=Haemophilus influenzae (strain ATCC
51907 / DSM 11121 / KW20 / Rd) GN=xthA PE=3 SV=1
Length = 267
Score = 65.5 bits (158), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 113/280 (40%), Gaps = 63/280 (22%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKLLDSFDADIICFQETKLRRQELKSDLVMADGYESFFS 60
MK +++N+NGLR R Q L +++ + D+I QE K+ + ++ GY F
Sbjct: 1 MKFISFNINGLRARPHQ---LEAIIEKYQPDVIGLQEIKVADEAFPYEITENLGYHVFHH 57
Query: 61 CTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLETSGSK-IMEGLEDFS 119
G+ G+ GVA + + P GF E + + IM LE
Sbjct: 58 -------GQKGHYGVALLTKQE----------PKVIRRGFPTDNEDAQKRIIMADLE--- 97
Query: 120 KDELLKIDSEGRCVITDHGHFILFNVYGPRADSE-DTVRIQFKLQFFHVLQKRWEFLLCQ 178
T+ G + N Y P+ +S + K +F+ LQ+ E +
Sbjct: 98 ---------------TEFGLLTVINGYFPQGESRAHETKFPAKEKFYADLQQYLEKEHDK 142
Query: 179 GRRIFVVGDLNIAPAAID---------------RCDAGPDFAKNEFRIWFRSMLVESGGS 223
I ++GD+NI+P+ +D +C P E R W++ L + G
Sbjct: 143 SNPILIMGDMNISPSDLDIGIGDENRKRWLRTGKCSFLP-----EERAWYQR-LYDYG-- 194
Query: 224 FFDVFRSKHPERREAYTCWPSNTGAEQFNYGTRIDHILCA 263
D FR +P + ++ + + N G RIDHIL +
Sbjct: 195 LEDSFRKLNPTANDKFSWFDYRSKGFDDNRGLRIDHILVS 234
>sp|P0A2X4|EXOA_STRR6 Exodeoxyribonuclease OS=Streptococcus pneumoniae (strain ATCC
BAA-255 / R6) GN=exoA PE=3 SV=1
Length = 275
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 113/283 (39%), Gaps = 58/283 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKL-------LDSFDADIICFQETKL-------RRQELK 46
MK++++N++ L ++ + KL L + +ADII QETKL + E+
Sbjct: 1 MKLISWNIDSLNAALTSDSARAKLSQEVLQTLVAENADIIAIQETKLSAKGPTKKHVEIL 60
Query: 47 SDLVMADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLET 106
+L GYE+ + ++ + R GY+G T K + T ++ P E
Sbjct: 61 EELF--PGYENTWRSSQ--EPARKGYAG--TMFLYKKELTPT-ISFP-----------EI 102
Query: 107 SGSKIMEGLEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFH 166
M D EGR + + F + VY P A D ++ + Q +
Sbjct: 103 GAPSTM--------------DLEGRIITLEFDAFFVTQVYTPNAG--DGLKRLEERQVWD 146
Query: 167 VLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAG-----PDFAKNEFRIWFRSMLVESG 221
+ L + + + GD N+A ID + P F E R F ++L
Sbjct: 147 AKYAEYLAELDKEKPVLATGDYNVAHNEIDLANPASNRRSPGFTDEE-RAGFTNLL---A 202
Query: 222 GSFFDVFRSKHPERREAYTCWPSNTGAEQF-NYGTRIDHILCA 263
F D FR H + E YT W + + N G RID+ L +
Sbjct: 203 TGFTDTFRHVHGDVPERYTWWAQRSKTSKINNTGWRIDYWLTS 245
>sp|P0A2X3|EXOA_STRPN Exodeoxyribonuclease OS=Streptococcus pneumoniae serotype 4 (strain
ATCC BAA-334 / TIGR4) GN=exoA PE=3 SV=1
Length = 275
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 113/283 (39%), Gaps = 58/283 (20%)
Query: 1 MKIVTYNVNGLRQRVSQFGSLRKL-------LDSFDADIICFQETKL-------RRQELK 46
MK++++N++ L ++ + KL L + +ADII QETKL + E+
Sbjct: 1 MKLISWNIDSLNAALTSDSARAKLSQEVLQTLVAENADIIAIQETKLSAKGPTKKHVEIL 60
Query: 47 SDLVMADGYESFFSCTRTSDKGRTGYSGVATFCRVKSPFSSTEVALPVAAEEGFTGLLET 106
+L GYE+ + ++ + R GY+G T K + T ++ P E
Sbjct: 61 EELF--PGYENTWRSSQ--EPARKGYAG--TMFLYKKELTPT-ISFP-----------EI 102
Query: 107 SGSKIMEGLEDFSKDELLKIDSEGRCVITDHGHFILFNVYGPRADSEDTVRIQFKLQFFH 166
M D EGR + + F + VY P A D ++ + Q +
Sbjct: 103 GAPSTM--------------DLEGRIITLEFDAFFVTQVYTPNAG--DGLKRLEERQVWD 146
Query: 167 VLQKRWEFLLCQGRRIFVVGDLNIAPAAIDRCDAG-----PDFAKNEFRIWFRSMLVESG 221
+ L + + + GD N+A ID + P F E R F ++L
Sbjct: 147 AKYAEYLAELDKEKPVLATGDYNVAHNEIDLANPASNRRSPGFTDEE-RAGFTNLL---A 202
Query: 222 GSFFDVFRSKHPERREAYTCWPSNTGAEQF-NYGTRIDHILCA 263
F D FR H + E YT W + + N G RID+ L +
Sbjct: 203 TGFTDTFRHVHGDVPERYTWWAQRSKTSKINNTGWRIDYWLTS 245
>sp|Q5A761|CCR4_CANAL Glucose-repressible alcohol dehydrogenase transcriptional effector
OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=CCR4 PE=3 SV=1
Length = 787
Score = 33.5 bits (75), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 21 LRKLLDSFDADIICFQETKLRR-QELKSDLVMADGYES-FFSCTRTSDKGRTG---YSGV 75
L K + +++ DI+C QE + + QE ++ A+GY+ FFS TR+ T G
Sbjct: 491 LEKEVLNYNTDIVCMQEVETKTFQEFWLPVMTANGYKGYFFSKTRSKTMSETDSKKVDGC 550
Query: 76 ATFCR 80
ATF +
Sbjct: 551 ATFFK 555
>sp|A7KAI1|ATG2_PICAN Autophagy-related protein 2 OS=Pichia angusta GN=ATG2 PE=3 SV=1
Length = 1765
Score = 32.3 bits (72), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 44/102 (43%), Gaps = 8/102 (7%)
Query: 384 TEDCSENVDRSLNNYCDSGILQGVYCSSSNQESEGEFTKTIENCRDSANVASHSTITQGS 443
+ D +E ++N I G+ Q K+I NC S NV+ H+ ++ GS
Sbjct: 1144 SNDSTELDTEAVNELLYESIHVGLLAGQDPQSFYDNINKSISNCGGSENVSEHNNLSPGS 1203
Query: 444 SNHISPFHVDRARKKAKKS--QLGQLS------LKSFFHKRL 477
S+ S + R A +LG++S +S +HK L
Sbjct: 1204 SSPASVNTANSTRSAASSHNIELGKVSSRRVRLKRSIYHKVL 1245
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.135 0.412
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 180,530,150
Number of Sequences: 539616
Number of extensions: 7678372
Number of successful extensions: 16350
Number of sequences better than 100.0: 34
Number of HSP's better than 100.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 16250
Number of HSP's gapped (non-prelim): 56
length of query: 482
length of database: 191,569,459
effective HSP length: 121
effective length of query: 361
effective length of database: 126,275,923
effective search space: 45585608203
effective search space used: 45585608203
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)