BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020295
(328 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225459615|ref|XP_002285873.1| PREDICTED: N-glycosylase/DNA lyase [Vitis vinifera]
Length = 399
Score = 452 bits (1164), Expect = e-125, Method: Compositional matrix adjust.
Identities = 217/291 (74%), Positives = 242/291 (83%), Gaps = 17/291 (5%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
KW PLN+ +SELSL LTFPTGQTFRWK+T PLQYTG IG HLISLKHLQNGDV Y IH S
Sbjct: 65 KWVPLNIAKSELSLALTFPTGQTFRWKQTTPLQYTGVIGSHLISLKHLQNGDVAYLIHQS 124
Query: 97 PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
PSE A+SALLDFLN+GISL E+WE F ASD RFAELA+YL GARVLRQDP+ECL+QFLC
Sbjct: 125 PSEENARSALLDFLNVGISLSEMWEVFKASDSRFAELAQYLGGARVLRQDPLECLIQFLC 184
Query: 157 SSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQ 216
SSNNNI RITKMVDF++SLG++LG+V GF+F+EFPSL+RLS+VSE + R AGFGYR
Sbjct: 185 SSNNNIGRITKMVDFVSSLGNYLGSVGGFDFYEFPSLDRLSMVSEEDFREAGFGYR---- 240
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
AKYI GTV LQSK GG EWL SLR++DLQE +DAL TLPGVGPKV
Sbjct: 241 -------------AKYIIGTVKALQSKSGGGIEWLASLREMDLQEVVDALSTLPGVGPKV 287
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
AACIALFSLDQHHAIPVDTHVW+IATRYL+PELAG RLTPKLCSRVA+AF
Sbjct: 288 AACIALFSLDQHHAIPVDTHVWQIATRYLIPELAGARLTPKLCSRVADAFV 338
>gi|255558622|ref|XP_002520336.1| 8-oxoguanine DNA glycosylase, putative [Ricinus communis]
gi|223540555|gb|EEF42122.1| 8-oxoguanine DNA glycosylase, putative [Ricinus communis]
Length = 412
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 220/293 (75%), Positives = 243/293 (82%), Gaps = 18/293 (6%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL-QNGDVCYHIH 94
+ W LNLTQSELSLPLTFPTGQTFRWK+TG QYTG +GPHLISLKH QNG+V YHIH
Sbjct: 47 TNWNALNLTQSELSLPLTFPTGQTFRWKQTGHHQYTGTLGPHLISLKHHHQNGNVYYHIH 106
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
T+PSE AAKSALLDFLN+ +SL +W FSASD RFAELA +L GARVLRQDP+ECL+QF
Sbjct: 107 TTPSETAAKSALLDFLNINVSLANIWADFSASDSRFAELAMHLKGARVLRQDPLECLIQF 166
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
LCSSNNNIARITKMVD+++SLG +LGNVEGFEF+EFP LERL LV+E +LR AGFGYR
Sbjct: 167 LCSSNNNIARITKMVDYVSSLGQYLGNVEGFEFYEFPDLERLGLVTEEQLRAAGFGYR-- 224
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
AKYITGT+D LQ K GG EWL +LRKLDLQ IDALCTLPGVGP
Sbjct: 225 ---------------AKYITGTIDALQLKPGGGVEWLTALRKLDLQVVIDALCTLPGVGP 269
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
KVAACIALFSLDQHHAIPVDTHVW+IATR+LLPELAG RLTPKLCSRVA+AF
Sbjct: 270 KVAACIALFSLDQHHAIPVDTHVWQIATRHLLPELAGARLTPKLCSRVADAFV 322
>gi|224084932|ref|XP_002307451.1| predicted protein [Populus trichocarpa]
gi|222856900|gb|EEE94447.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 214/294 (72%), Positives = 242/294 (82%), Gaps = 17/294 (5%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
KP+ W P+NLTQSELSLPLTFPTGQTFRWK+TGPLQYTG +G HLISLKH QNGDV Y I
Sbjct: 43 KPTNWAPVNLTQSELSLPLTFPTGQTFRWKQTGPLQYTGSLGRHLISLKHHQNGDVYYQI 102
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
H SPS+ AAKSALL FLN ISL E+W GF+A+D RFAELA++ GARVLRQDP+ECL+Q
Sbjct: 103 HHSPSQSAAKSALLSFLNTNISLTEMWHGFAAADSRFAELAQHFKGARVLRQDPLECLIQ 162
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
FLCSSNNNI+RITKMVDF++SLG HLGNVEGFEFH FPSLERL+LV+E +LR AGFGYR
Sbjct: 163 FLCSSNNNISRITKMVDFVSSLGDHLGNVEGFEFHAFPSLERLALVTEQQLREAGFGYR- 221
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
AKY+TGTV+ LQSK GG +WL SLRKL LQ ID+LCTLPG+G
Sbjct: 222 ----------------AKYVTGTVNALQSKPEGGVKWLESLRKLQLQMVIDSLCTLPGIG 265
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
KVA+CIALFSLDQHHAIPVDTHVW+IAT +L+PELAG LTPKLC RVA+AF
Sbjct: 266 LKVASCIALFSLDQHHAIPVDTHVWRIATTHLVPELAGASLTPKLCGRVADAFV 319
>gi|112982633|dbj|BAF03495.1| 8-oxoguanine DNA glycosylase [Populus nigra]
Length = 378
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 213/294 (72%), Positives = 243/294 (82%), Gaps = 17/294 (5%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+P+ W P+NLTQSELSLPLTFPTGQTFRWK+TGPLQYTG +G HLISLKH +NGDV Y I
Sbjct: 43 QPTNWAPVNLTQSELSLPLTFPTGQTFRWKQTGPLQYTGSLGRHLISLKHHKNGDVYYQI 102
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
H SPS+PAAKSALL FLN ISL E+W GF+A+D RFAELA++ GARVLRQDP+ECL+Q
Sbjct: 103 HHSPSQPAAKSALLSFLNTNISLTEMWHGFAAADSRFAELAQHFKGARVLRQDPLECLIQ 162
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
FLCSSNNNI+RITKMVDF++SLG HLGNVEGFEFH FPSLERL+LV+E +LR AGFGYR
Sbjct: 163 FLCSSNNNISRITKMVDFVSSLGDHLGNVEGFEFHAFPSLERLALVTEQQLREAGFGYR- 221
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
AKY+TGTV+ LQSK GG +WL SLRKL LQ ID+LCTLPG+G
Sbjct: 222 ----------------AKYVTGTVNALQSKPEGGVKWLESLRKLQLQMVIDSLCTLPGIG 265
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
KVA+CIALFSLDQHHAIPVDTHVW+IAT +L+PELAG LTPKLC RVA+AF
Sbjct: 266 LKVASCIALFSLDQHHAIPVDTHVWRIATTHLVPELAGASLTPKLCGRVADAFV 319
>gi|356518553|ref|XP_003527943.1| PREDICTED: N-glycosylase/DNA lyase-like [Glycine max]
Length = 395
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 202/299 (67%), Positives = 238/299 (79%), Gaps = 17/299 (5%)
Query: 28 QTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG 87
+L T K W PL+LT+ ELSLPLTFPTGQTF WK T P QYTG +GPHLISLKHLQNG
Sbjct: 30 HSLNTAKARAWAPLSLTRHELSLPLTFPTGQTFLWKNTAPSQYTGVVGPHLISLKHLQNG 89
Query: 88 DVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
DV Y +H+ A++ALLDFLN +SL +LW+ FSASD RFAELA +L+GARVLRQDP
Sbjct: 90 DVSYCLHSPSHSDTAETALLDFLNATVSLADLWKTFSASDARFAELAHHLSGARVLRQDP 149
Query: 148 VECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNA 207
ECL+QFLCSSNNNI RITKMV++++SLG+H+G+V F+FH FPSLE+LS VSE +LR+A
Sbjct: 150 FECLIQFLCSSNNNIGRITKMVNYVSSLGTHVGDVGEFQFHAFPSLEQLSSVSEQQLRDA 209
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
GFGYR AKYI GT++ L+SK GG EWL SLRK++L++ I ALC
Sbjct: 210 GFGYR-----------------AKYIIGTINALKSKPGGGEEWLRSLRKMELRDVISALC 252
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
TLPGVGPKVAACIALFSLDQHHA+PVDTHVW+IAT+YLLPELAG +LTPKLC RVAEAF
Sbjct: 253 TLPGVGPKVAACIALFSLDQHHAVPVDTHVWRIATKYLLPELAGSQLTPKLCDRVAEAF 311
>gi|5263323|gb|AAD41425.1|AC007727_14 Similar to gb|U96710 8-oxoguanine DNA-glycosylase from Homo sapiens
[Arabidopsis thaliana]
Length = 391
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 215/329 (65%), Positives = 244/329 (74%), Gaps = 30/329 (9%)
Query: 11 LKRPRLTPQP------------PPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQ 58
+KRPR T QP P TP Q L KW PL LT +EL+LPLTFPTGQ
Sbjct: 1 MKRPRPTSQPSISSTVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQ 60
Query: 59 TFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTSPSEPAAKSALLDFLNMGISLG 117
TFRWKKTG +QY+G IGPHL+SL+ D V Y +H S S +A+ ALLDFLN ISL
Sbjct: 61 TFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVHCSTSPKSAELALLDFLNAEISLA 120
Query: 118 ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGS 177
ELW FS D RF ELA++L GARVLRQDP+ECL+QFLCSSNNNIARITKMVDF++SLG
Sbjct: 121 ELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSLGL 180
Query: 178 HLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
HLG+++GFEFH+FPSL+RLS VSE E R AGFGYR AKYITGTV
Sbjct: 181 HLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYR-----------------AKYITGTV 223
Query: 238 DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ LQ+K GG EWLLSLRK++LQEA+ ALCTLPGVGPKVAACIALFSLDQH AIPVDTHV
Sbjct: 224 NALQAKPGGGNEWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHV 283
Query: 298 WKIATRYLLPELAGVRLTPKLCSRVAEAF 326
W+IAT YLLP+LAG +LTPKL RVAEAF
Sbjct: 284 WQIATNYLLPDLAGAKLTPKLHGRVAEAF 312
>gi|15219068|ref|NP_173590.1| 8-oxoguanine-DNA glycosylase 1 [Arabidopsis thaliana]
gi|11967657|emb|CAC19363.1| 8-oxoguanine DNA glycosylase [Arabidopsis thaliana]
gi|16074117|emb|CAC83625.1| 8-oxoguanine-DNA glycosylase [Arabidopsis thaliana]
gi|51971190|dbj|BAD44287.1| 8-oxoguanine DNA glycosylase-like protein [Arabidopsis thaliana]
gi|94442429|gb|ABF19002.1| At1g21710 [Arabidopsis thaliana]
gi|332192024|gb|AEE30145.1| 8-oxoguanine-DNA glycosylase 1 [Arabidopsis thaliana]
Length = 365
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 215/329 (65%), Positives = 244/329 (74%), Gaps = 30/329 (9%)
Query: 11 LKRPRLTPQP------------PPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQ 58
+KRPR T QP P TP Q L KW PL LT +EL+LPLTFPTGQ
Sbjct: 1 MKRPRPTSQPSISSTVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQ 60
Query: 59 TFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTSPSEPAAKSALLDFLNMGISLG 117
TFRWKKTG +QY+G IGPHL+SL+ D V Y +H S S +A+ ALLDFLN ISL
Sbjct: 61 TFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVHCSTSPKSAELALLDFLNAEISLA 120
Query: 118 ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGS 177
ELW FS D RF ELA++L GARVLRQDP+ECL+QFLCSSNNNIARITKMVDF++SLG
Sbjct: 121 ELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSLGL 180
Query: 178 HLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
HLG+++GFEFH+FPSL+RLS VSE E R AGFGYR AKYITGTV
Sbjct: 181 HLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYR-----------------AKYITGTV 223
Query: 238 DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ LQ+K GG EWLLSLRK++LQEA+ ALCTLPGVGPKVAACIALFSLDQH AIPVDTHV
Sbjct: 224 NALQAKPGGGNEWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHV 283
Query: 298 WKIATRYLLPELAGVRLTPKLCSRVAEAF 326
W+IAT YLLP+LAG +LTPKL RVAEAF
Sbjct: 284 WQIATNYLLPDLAGAKLTPKLHGRVAEAF 312
>gi|357463557|ref|XP_003602060.1| N-glycosylase/DNA lyase [Medicago truncatula]
gi|357520327|ref|XP_003630452.1| N-glycosylase/DNA lyase [Medicago truncatula]
gi|355491108|gb|AES72311.1| N-glycosylase/DNA lyase [Medicago truncatula]
gi|355524474|gb|AET04928.1| N-glycosylase/DNA lyase [Medicago truncatula]
Length = 420
Score = 412 bits (1060), Expect = e-113, Method: Compositional matrix adjust.
Identities = 198/295 (67%), Positives = 235/295 (79%), Gaps = 18/295 (6%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+K W PLN+T+ ELSLPLTFPTGQTFRWK T P QYTG +G HLISLKHL NGDVCY
Sbjct: 35 SKSRAWIPLNITRQELSLPLTFPTGQTFRWKNTAPSQYTGVVGSHLISLKHLHNGDVCYT 94
Query: 93 IHT-SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
+H+ SPS K+ALLDFLN +SL + W+ FS SD RFAELA++L+GARVLRQDP ECL
Sbjct: 95 LHSQSPSNDDCKTALLDFLNADVSLADTWKVFSDSDERFAELAQHLSGARVLRQDPFECL 154
Query: 152 LQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGY 211
+QF+CSSNN+I+RITKMVD+++SLG++LG VEGF+FH FP+L +LSLVSE +LR+AGFGY
Sbjct: 155 IQFMCSSNNHISRITKMVDYVSSLGTYLGCVEGFDFHAFPTLNQLSLVSEQQLRDAGFGY 214
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
R AKYITGTV+VLQSK GG EWL SLRKL+L++ I L LPG
Sbjct: 215 R-----------------AKYITGTVNVLQSKPEGGEEWLYSLRKLELEDVISELIKLPG 257
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
VGPKVAACIAL+SLDQHHAIPVD H+W+IA +Y+LPELAG +LT KL SRVAEAF
Sbjct: 258 VGPKVAACIALYSLDQHHAIPVDVHIWRIAQKYILPELAGSKLTQKLHSRVAEAF 312
>gi|297845148|ref|XP_002890455.1| 8-oxoguanine-DNA glycosylase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297336297|gb|EFH66714.1| 8-oxoguanine-DNA glycosylase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 366
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 215/330 (65%), Positives = 242/330 (73%), Gaps = 31/330 (9%)
Query: 11 LKRPRLTP--------QPPPTPPNPQTLT-----TNKPSKWTPLNLTQSELSLPLTFPTG 57
+KRPR T +PP +PP L T KW L LT +EL+LPLTFPTG
Sbjct: 1 MKRPRATSPPSISTTVKPPLSPPVTPILKQKLHRTGTTPKWFSLKLTHTELTLPLTFPTG 60
Query: 58 QTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTSPSEPAAKSALLDFLNMGISL 116
QTFRWK+TG +QY+G IGPHL+SL+ D V Y +H S S A+ ALLDFLN ISL
Sbjct: 61 QTFRWKQTGAIQYSGAIGPHLVSLRQRPGDDTVSYCVHCSTSPKLAELALLDFLNAEISL 120
Query: 117 GELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLG 176
ELW FS D RF ELAK+L GARVLRQDP+ECL+QFLCSSNNNI RITKMVDF++SLG
Sbjct: 121 AELWSDFSKKDPRFGELAKHLRGARVLRQDPLECLIQFLCSSNNNIGRITKMVDFVSSLG 180
Query: 177 SHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT 236
HLG+++GFEFH+FPSL+RLS VSE E R AGFGYR AKYITGT
Sbjct: 181 LHLGDIDGFEFHQFPSLDRLSRVSEAEFRKAGFGYR-----------------AKYITGT 223
Query: 237 VDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
V+ LQSK GG EWLLSLRKLDLQEA+ ALCTLPGVGPKVAACIALFSLDQH AIPVDTH
Sbjct: 224 VNALQSKPGGGDEWLLSLRKLDLQEAVSALCTLPGVGPKVAACIALFSLDQHSAIPVDTH 283
Query: 297 VWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
VW+IAT YLLP+LAG +LTPKL RVAEAF
Sbjct: 284 VWQIATNYLLPDLAGAKLTPKLHCRVAEAF 313
>gi|449464184|ref|XP_004149809.1| PREDICTED: N-glycosylase/DNA lyase-like [Cucumis sativus]
Length = 393
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/294 (69%), Positives = 234/294 (79%), Gaps = 19/294 (6%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV--CYHI 93
S W LNLT+S+LSLPLTFPTGQTFRWK+T P ++TG +G HLISL HL NGDV C H
Sbjct: 62 SNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGSHLISLNHLPNGDVSYCLHF 121
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
++ S AA+ ALLDFLN ISL +WE FSA+D RF LA++ GARVLRQDP+ECL+Q
Sbjct: 122 SSTSSSAAARLALLDFLNASISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQ 181
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
FLCSSNNNI RITKMVD+++SLG++LGNV GF+F+EFPSLERLSLVSE ELR AGFGYR
Sbjct: 182 FLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYR- 240
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
AKYI G V+ L++K GGAEWLLSLR DL+E I+AL TLPGVG
Sbjct: 241 ----------------AKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIEALSTLPGVG 284
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
PKVAAC+ALFSLDQHHAIPVDTHVW+IATRYL+PELAG RLTPKLC+RVAEAF
Sbjct: 285 PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV 338
>gi|449529070|ref|XP_004171524.1| PREDICTED: N-glycosylase/DNA lyase-like, partial [Cucumis sativus]
Length = 359
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/312 (66%), Positives = 241/312 (77%), Gaps = 19/312 (6%)
Query: 17 TPQPPPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGP 76
+ P T P ++ + S W LNLT+S+LSLPLTFPTGQTFRWK+T P ++TG +G
Sbjct: 9 SKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPFEFTGVVGS 68
Query: 77 HLISLKHLQNGDV--CYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELA 134
HLISL HL NGDV C H ++ S AA+ ALLDFLN ISL +WE FSA+D RF LA
Sbjct: 69 HLISLNHLPNGDVSYCLHFSSTSSSAAARLALLDFLNASISLSSIWEVFSAADPRFDALA 128
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLE 194
++ GARVLRQDP+ECL+QFLCSSNNNI RITKMVD+++SLG++LGNV GF+F+EFPSLE
Sbjct: 129 RHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLE 188
Query: 195 RLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
RLSLVSE ELR AGFGYR AKYI G V+ L++K GGAEWLLSL
Sbjct: 189 RLSLVSEAELREAGFGYR-----------------AKYIIGAVNALKAKPVGGAEWLLSL 231
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRL 314
R DL+E I+AL TLPGVGPKVAAC+ALFSLDQHHAIPVDTHVW+IATRYL+PELAG RL
Sbjct: 232 RDSDLEEVIEALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARL 291
Query: 315 TPKLCSRVAEAF 326
TPKLC+RVAEAF
Sbjct: 292 TPKLCNRVAEAF 303
>gi|226498646|ref|NP_001141776.1| uncharacterized protein LOC100273912 [Zea mays]
gi|194705894|gb|ACF87031.1| unknown [Zea mays]
gi|413937227|gb|AFW71778.1| hypothetical protein ZEAMMB73_088129 [Zea mays]
Length = 419
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 190/358 (53%), Positives = 233/358 (65%), Gaps = 51/358 (14%)
Query: 4 LKNSPSPLKRPRLTPQPPPTP--------------PNPQTL--------TTNKPSKWTPL 41
+K +P P+L+ PPPTP P P +W PL
Sbjct: 29 VKATPLKPSSPQLSSTPPPTPSVEVPKTEPPDHHDPTPSARRRLPLAAGAVEAQEEWHPL 88
Query: 42 NLTQSELSLPLTFPTGQTFRWKKTG--PLQYTGPIGPHLISLKHL---QNGDVCYHIHT- 95
L+ ++LSLPLT PTGQTF W++T PL++TG +G HL+SL HL +G + + +H
Sbjct: 89 PLSTADLSLPLTLPTGQTFLWRRTSLSPLRFTGAVGSHLVSLSHLPDSDDGRLAFLLHND 148
Query: 96 ----SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG--ARVLRQDPVE 149
+ S PAA++ALLD+LN +SL +LW F+A+D RFA+++ L G ARVLRQDPVE
Sbjct: 149 GGCPASSLPAARAALLDYLNAAVSLADLWRQFAAADERFADVSARLGGGGARVLRQDPVE 208
Query: 150 CLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
C+ QFLCSSNNNIARI KMV LA G LG V GF FH FP++ERL+ VSE ELR AGF
Sbjct: 209 CVFQFLCSSNNNIARIEKMVWTLAGYGERLGEVGGFVFHRFPTIERLARVSEQELREAGF 268
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AKYI GT LQ+K SGG +WL SLR+ +L + I+ALCTL
Sbjct: 269 GYR-----------------AKYIVGTAKELQAKPSGGQKWLASLREKELSDVIEALCTL 311
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
PGVGPKVAACIALFSLDQ+HAIPVDTHVWK+AT+YLLPELAG LTPKL VA+AF
Sbjct: 312 PGVGPKVAACIALFSLDQNHAIPVDTHVWKVATQYLLPELAGKSLTPKLSVVVADAFV 369
>gi|242065370|ref|XP_002453974.1| hypothetical protein SORBIDRAFT_04g022540 [Sorghum bicolor]
gi|241933805|gb|EES06950.1| hypothetical protein SORBIDRAFT_04g022540 [Sorghum bicolor]
Length = 413
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 179/302 (59%), Positives = 216/302 (71%), Gaps = 29/302 (9%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTG--PLQYTGPIGPHLISLKHL---QNGDVCY 91
+W PL L+ ++LSLPLT PTGQTF W++T PL++TG +GPHL+SL HL +G + +
Sbjct: 78 EWHPLPLSTADLSLPLTLPTGQTFLWRRTSLSPLRFTGAVGPHLVSLSHLPDSDDGRLAF 137
Query: 92 HIHT-----SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG--ARVLR 144
+H + S PAA++AL D+LN +SL +LW F+A+D RFAE++ L G ARVLR
Sbjct: 138 LLHNDGSCPASSVPAARAALCDYLNAAVSLADLWRQFAAADERFAEVSARLGGGGARVLR 197
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVEL 204
QDPVEC+ QFLCSSNNNIARI KMV LA G LG V GF FH FP++ERL+ VSE EL
Sbjct: 198 QDPVECVFQFLCSSNNNIARIEKMVWTLAGYGERLGEVGGFVFHRFPTIERLARVSEQEL 257
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R AGFGYR AKYI GT LQ+K GG +WL SLR+ +L I+
Sbjct: 258 REAGFGYR-----------------AKYIVGTAKELQAKPGGGEKWLASLREKELSYVIE 300
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAE 324
ALCTLPGVGPKVAACIALFSLDQ+HAIPVDTHVWK+AT+YLLPELAG LTPKL VA+
Sbjct: 301 ALCTLPGVGPKVAACIALFSLDQNHAIPVDTHVWKVATQYLLPELAGKSLTPKLSVIVAD 360
Query: 325 AF 326
AF
Sbjct: 361 AF 362
>gi|326532252|dbj|BAK05055.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 401
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 185/340 (54%), Positives = 224/340 (65%), Gaps = 43/340 (12%)
Query: 14 PRLTPQPPPTP-----PNPQTLTTNKPS----------KWTPLNLTQSELSLPLTFPTGQ 58
P L P PP P P+T + P+ +W PL L+ +ELSLPLT PTGQ
Sbjct: 29 PALLPSTPPPAAPVEIPKPETTHQHPPAARRRLPLADEEWHPLPLSLAELSLPLTLPTGQ 88
Query: 59 TFRWKKTG--PLQYTGPIGPHLISLKHLQNGD-VCYHIHTS-PSEP-----AAKSALLDF 109
TF W++T PL++T +GPHL+SL HL + D + + +H P+ P AA++AL D+
Sbjct: 89 TFLWRRTSLSPLRFTAAVGPHLVSLSHLPDDDRLAFLLHNDDPASPTSSVAAARAALCDY 148
Query: 110 LNMGISLGELWEGFSASDCRFAELAKYLAG--ARVLRQDPVECLLQFLCSSNNNIARITK 167
LN + L +LW F+A+D RFAE+A L G ARVLRQDPVECL QFLCSSNNNIARI K
Sbjct: 149 LNSAVPLADLWVQFTAADARFAEVAARLGGGGARVLRQDPVECLFQFLCSSNNNIARIEK 208
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
MV LA G LG V GF FH FP++ERL+ VSE ELR AGFGYR
Sbjct: 209 MVWTLAGYGERLGEVGGFVFHRFPTVERLAQVSEQELREAGFGYR--------------- 253
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ GT L +K GGA+WL LR +L E I+ALCTLPGVGPKVAACIALFSLD+
Sbjct: 254 --AKYVAGTAKELMAKPGGGAKWLSLLRNRELPEVIEALCTLPGVGPKVAACIALFSLDK 311
Query: 288 HHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
+ AIPVDTHVWK+AT+Y+LPELAG LTPKL VA+AF
Sbjct: 312 NQAIPVDTHVWKVATKYMLPELAGKSLTPKLSIVVADAFV 351
>gi|357149486|ref|XP_003575128.1| PREDICTED: N-glycosylase/DNA lyase-like [Brachypodium distachyon]
Length = 407
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 182/334 (54%), Positives = 223/334 (66%), Gaps = 31/334 (9%)
Query: 8 PSPL-KRPRL-TPQ--PPPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWK 63
P+P + P+L TPQ PPP +P +W PL L+ +L LPLT PTGQTF W+
Sbjct: 41 PTPFSETPKLETPQQHPPPARRRLPLDGDAQPEEWHPLPLSAGDLFLPLTLPTGQTFLWR 100
Query: 64 KTG--PLQYTGPIGPHLISLKHLQNGDVCYHIH------TSPSEPAAKSALLDFLNMGIS 115
+T PL++TG +G HL+SL HL +G + + +H +S S A++A+ D+LN +
Sbjct: 101 RTSLSPLRFTGAVGSHLLSLSHLPDGGLAFLLHNDDADTSSCSVATARAAICDYLNATVP 160
Query: 116 LGELWEGFSASDCRFAELAKYLAG--ARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
L +LW F+A+D RFAE+A L G ARVLRQDPVECL QFLCSSNNNIARI KMV LA
Sbjct: 161 LADLWRQFTAADARFAEVAARLGGGGARVLRQDPVECLFQFLCSSNNNIARIEKMVWTLA 220
Query: 174 SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYI 233
G LG V GF FH FP++E+L+ VSE ELR+AGFGYR AKYI
Sbjct: 221 GYGERLGEVGGFVFHRFPTIEQLAQVSEQELRDAGFGYR-----------------AKYI 263
Query: 234 TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
GT L +K GG +WL LR +L E I+ LCTLPGVGPKVAACIALFSLD++ AIPV
Sbjct: 264 VGTATELMAKPGGGEKWLALLRDRELPEVIEGLCTLPGVGPKVAACIALFSLDKNQAIPV 323
Query: 294 DTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
DTHVWK+AT+YLLPELAG LTPKL VA+AF
Sbjct: 324 DTHVWKVATQYLLPELAGKSLTPKLSIVVADAFV 357
>gi|302804230|ref|XP_002983867.1| hypothetical protein SELMODRAFT_119537 [Selaginella moellendorffii]
gi|300148219|gb|EFJ14879.1| hypothetical protein SELMODRAFT_119537 [Selaginella moellendorffii]
Length = 351
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/299 (56%), Positives = 209/299 (69%), Gaps = 23/299 (7%)
Query: 32 TNKP---SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD 88
T KP +W L++ EL+L LT TGQTF WK+T P +TG +GPHL+SL+ D
Sbjct: 22 TKKPRAREEWRSLSVPPVELNLALTLLTGQTFLWKQTSPGVFTGALGPHLVSLRQTPQ-D 80
Query: 89 VCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
Y +HT P AK AL +F + SL LW FSA+D RFA +A Y+ GARVLRQDPV
Sbjct: 81 TLYRVHT--EFPGAKEALREFFTLDTSLATLWSSFSAADERFAAVAPYIQGARVLRQDPV 138
Query: 149 ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAG 208
EC+ QF+CSSNN+I RITKMVDFLA+ GS LG V+G F +FPSLE+LS ++E +LR+AG
Sbjct: 139 ECVFQFICSSNNHIQRITKMVDFLATQGSPLGCVDGQSFFQFPSLEQLSFLTEKQLRDAG 198
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
FGYR AKYI G V+ L+SK SGG EWL SLR+ L++A ALCT
Sbjct: 199 FGYR-----------------AKYIVGAVETLRSKESGGDEWLKSLREGSLEQATAALCT 241
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
LPG+GPKVAAC++LFSLD+H AIPVDTHVW+IA +YL PELAG RLT K+ S VA+AF
Sbjct: 242 LPGIGPKVAACVSLFSLDKHDAIPVDTHVWQIAVQYLKPELAGQRLTSKMHSAVAQAFV 300
>gi|302754758|ref|XP_002960803.1| hypothetical protein SELMODRAFT_75917 [Selaginella moellendorffii]
gi|300171742|gb|EFJ38342.1| hypothetical protein SELMODRAFT_75917 [Selaginella moellendorffii]
Length = 351
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/299 (56%), Positives = 209/299 (69%), Gaps = 23/299 (7%)
Query: 32 TNKP---SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD 88
T KP +W L++ EL+L LT TGQTF WK+T P +TG +GPHL+SL+ D
Sbjct: 22 TKKPRAREEWRSLSVPPVELNLALTLLTGQTFLWKQTSPGVFTGALGPHLVSLRQTPQ-D 80
Query: 89 VCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
Y +HT P AK AL +F + SL LW FSA+D RFA +A Y+ GARVLRQDPV
Sbjct: 81 TLYRLHT--EFPGAKEALREFFTLDTSLAALWSSFSAADERFAAVAPYIQGARVLRQDPV 138
Query: 149 ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAG 208
EC+ QF+CSSNN+I RITKMVDFLA+ GS LG V+G F +FPSLE+LS ++E +LR+AG
Sbjct: 139 ECVFQFICSSNNHIQRITKMVDFLATQGSPLGCVDGQSFFQFPSLEQLSFLTEKQLRDAG 198
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
FGYR AKYI G V+ L+SK SGG EWL SLR+ L++A ALCT
Sbjct: 199 FGYR-----------------AKYIVGAVETLRSKESGGDEWLKSLREGSLEQATAALCT 241
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
LPG+GPKVAAC++LFSLD+H AIPVDTHVW+IA +YL PELAG RLT K+ S VA+AF
Sbjct: 242 LPGIGPKVAACVSLFSLDKHDAIPVDTHVWQIAVQYLKPELAGQRLTSKMHSAVAQAFV 300
>gi|168045681|ref|XP_001775305.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673386|gb|EDQ59910.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 303
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 160/297 (53%), Positives = 209/297 (70%), Gaps = 24/297 (8%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
KW L + ++EL L T PTGQ+FRW++TG QYTG +G HLISL+ + + DV + H S
Sbjct: 1 KWCSLGVGRAELCLDFTLPTGQSFRWRRTGLSQYTGVLGAHLISLRQIAD-DVEFFHHAS 59
Query: 97 PSEPAAKSA-----LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
+ A + + ++LN+ SL L+ FS +D RFA +A ++AGAR+LRQ P+EC+
Sbjct: 60 SCKANASGSDIEKDIREYLNLDTSLVVLYTEFSVADARFAAVAPFIAGARLLRQSPLECV 119
Query: 152 LQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGY 211
QF+CSSNN+I RIT MVD+L+S GS LG+V G FH+FPSLE L+ +SE +LR GFGY
Sbjct: 120 FQFICSSNNHIQRITTMVDYLSSHGSFLGSVNGIRFHQFPSLEELAPISESQLRENGFGY 179
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
R AKYI G V++L+ K GG +WL+SLR+L L++A+ +LCTLPG
Sbjct: 180 R-----------------AKYIVGAVEMLRQK-EGGEQWLMSLRRLSLEDAVTSLCTLPG 221
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
+GPKVAACIALFSLDQHHAIPVDTHVW+IA +YLLPEL G +LT KL VAEAF +
Sbjct: 222 IGPKVAACIALFSLDQHHAIPVDTHVWQIAVKYLLPELEGRKLTVKLHREVAEAFVQ 278
>gi|46389889|dbj|BAD15490.1| putative 8-oxoguanine DNA glycosylase isoform 1a; 8-hydroxyguanine
DNA glycosylase [Oryza sativa Japonica Group]
Length = 399
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 171/304 (56%), Positives = 214/304 (70%), Gaps = 29/304 (9%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTG--PLQYTGPIGPHLISLKHL--QNGDVCY 91
+W PL L+ ++LSLPLT PTGQTF W++T PL++TG +GPHL+SL HL +G + +
Sbjct: 65 GEWHPLPLSAADLSLPLTLPTGQTFLWRRTSLSPLRFTGAVGPHLVSLSHLPSSDGRLAF 124
Query: 92 HIHTSPSEPAAKSA------LLDFLNMGISLGELWEGFSASDCRFAELAKYLAG--ARVL 143
+H + ++ L D+LN + L +LW F+A+D RFAE++ L G ARVL
Sbjct: 125 LLHNNGGSSSSSVPAAARAALSDYLNAAVPLADLWRRFAAADARFAEVSARLGGGGARVL 184
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVE 203
RQDPVEC+ QFLCSSNNNIARI KMV LA G LG V G++FH+FP++ERL+ VSE E
Sbjct: 185 RQDPVECVFQFLCSSNNNIARIEKMVWALAGYGERLGEVGGYQFHQFPTIERLARVSEQE 244
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI 263
LR+AGFGYR AKYI GT +LQ+K GG +WL SLR +L E I
Sbjct: 245 LRDAGFGYR-----------------AKYIVGTAKILQAKPGGGEKWLASLRTRELPEVI 287
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVA 323
+ALCTLPGVGPKVAAC+ALFSLDQ+HAIPVDTHVWK+AT+YL+PELAG LTPKL VA
Sbjct: 288 EALCTLPGVGPKVAACVALFSLDQNHAIPVDTHVWKVATQYLMPELAGKSLTPKLSVAVA 347
Query: 324 EAFC 327
+AF
Sbjct: 348 DAFV 351
>gi|125582486|gb|EAZ23417.1| hypothetical protein OsJ_07109 [Oryza sativa Japonica Group]
Length = 760
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 171/306 (55%), Positives = 215/306 (70%), Gaps = 29/306 (9%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTG--PLQYTGPIGPHLISLKHL--QNGDV 89
+ +W PL L+ ++LSLPLT PTGQTF W++T PL++TG +GPHL+SL HL +G +
Sbjct: 424 EDGEWHPLPLSAADLSLPLTLPTGQTFLWRRTSLSPLRFTGAVGPHLVSLSHLPSSDGRL 483
Query: 90 CYHIHTSPSEPAAKSA------LLDFLNMGISLGELWEGFSASDCRFAELAKYLAG--AR 141
+ +H + ++ L D+LN + L +LW F+A+D RFAE++ L G AR
Sbjct: 484 AFLLHNNGGSSSSSVPAAARAALSDYLNAAVPLADLWRRFAAADARFAEVSARLGGGGAR 543
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSE 201
VLRQDPVEC+ QFLCSSNNNIARI KMV LA G LG V G++FH+FP++ERL+ VSE
Sbjct: 544 VLRQDPVECVFQFLCSSNNNIARIEKMVWALAGYGERLGEVGGYQFHQFPTIERLARVSE 603
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
ELR+AGFGYR AKYI GT +LQ+K GG +WL SLR +L E
Sbjct: 604 QELRDAGFGYR-----------------AKYIVGTAKILQAKPGGGEKWLASLRTRELPE 646
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSR 321
I+ALCTLPGVGPKVAAC+ALFSLDQ+HAIPVDTHVWK+AT+YL+PELAG LTPKL
Sbjct: 647 VIEALCTLPGVGPKVAACVALFSLDQNHAIPVDTHVWKVATQYLMPELAGKSLTPKLSVA 706
Query: 322 VAEAFC 327
VA+AF
Sbjct: 707 VADAFV 712
>gi|384247806|gb|EIE21292.1| DNA glycosylase, partial [Coccomyxa subellipsoidea C-169]
Length = 316
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/299 (46%), Positives = 182/299 (60%), Gaps = 29/299 (9%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
+ W L+ SEL L T PTGQ+FRW+KTGP +YTG I ++ ++ ++ DV + +
Sbjct: 10 ASWQSLDCPSSELRLEFTLPTGQSFRWRKTGPQEYTGVIQQRVVQMRQEED-DVLFRVLA 68
Query: 96 S-----PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVEC 150
P E AA AL D+ N+G + +L ++ D RF + Y GARVLRQDPVEC
Sbjct: 69 RGDGALPGEDAA--ALHDYFNLGSRMADLSAHWAERDPRFRNIHPYFPGARVLRQDPVEC 126
Query: 151 LLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
L QF+CSSNN+I+RI MV+ L + G+ L ++ F+ FP+LE L ++ LR GF
Sbjct: 127 LFQFVCSSNNHISRIHGMVERLCRTYGTPLPDLS---FYAFPTLENLEEATDEALRAEGF 183
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I G+V++L K GGA WL LR++ QEA DALCTL
Sbjct: 184 GYR-----------------AKFIVGSVEMLLEKEGGGAAWLQQLRQVPYQEASDALCTL 226
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
PGVGPKVAAC+ LFSLD+H AIPVDTHVW++ATRY P L G LT L V A E
Sbjct: 227 PGVGPKVAACVCLFSLDKHEAIPVDTHVWQLATRYYTPSLKGKTLTKPLMVAVEAALQE 285
>gi|302141786|emb|CBI18989.3| unnamed protein product [Vitis vinifera]
Length = 204
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 112/159 (70%), Positives = 128/159 (80%), Gaps = 17/159 (10%)
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
MVDF++SLG++LG+V GF+F+EFPSL+RLS+VSE + R AGFGYR
Sbjct: 1 MVDFVSSLGNYLGSVGGFDFYEFPSLDRLSMVSEEDFREAGFGYR--------------- 45
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI GTV LQSK GG EWL SLR++DLQE +DAL TLPGVGPKVAACIALFSLDQ
Sbjct: 46 --AKYIIGTVKALQSKSGGGIEWLASLREMDLQEVVDALSTLPGVGPKVAACIALFSLDQ 103
Query: 288 HHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
HHAIPVDTHVW+IATRYL+PELAG RLTPKLCSRVA+AF
Sbjct: 104 HHAIPVDTHVWQIATRYLIPELAGARLTPKLCSRVADAF 142
>gi|255088900|ref|XP_002506372.1| predicted protein [Micromonas sp. RCC299]
gi|226521644|gb|ACO67630.1| predicted protein [Micromonas sp. RCC299]
Length = 500
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 185/354 (52%), Gaps = 68/354 (19%)
Query: 25 PNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL 84
P + + W L ++EL L T PTGQ+FRW+ T Y G IG ++S++
Sbjct: 69 PPAAAVVDERGGGWVSLGTPRAELVLAHTLPTGQSFRWRLTTRGDYVGVIGKRVVSMRQC 128
Query: 85 QNGDVCYHIHTSPSEPAA---KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
++ DV Y +H P+ A +A+ D+ N+ +SLG L G++ +D RFA+L +L G R
Sbjct: 129 ED-DVLYRVHCRPAGEDATHDAAAVADYFNLSVSLGALATGWAKADARFAKLQPHLPGCR 187
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLG-------------------- 180
+LRQDP ECL F+CSSNN+I+RI MV+ L A+ G+ L
Sbjct: 188 MLRQDPAECLFSFICSSNNHISRIHGMVERLCATYGTKLAVDAALSEARKATANPTGTPK 247
Query: 181 ----------------------NVEG---FEFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
+ EG +F+ FP++ +L+ +E +LR AGFGYR
Sbjct: 248 KKKKRKVVADNETKAEDNDDRDDDEGEPLGDFYSFPTVSQLTNATEADLRAAGFGYR--- 304
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR-KLDLQEAIDALCTLPGVGP 274
AK+I G V L K +G +L +LR + +EA AL LPGVGP
Sbjct: 305 --------------AKFIAGAVSALNEKPNGADAYLRTLRHECSYKEAQAALAELPGVGP 350
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
KVAAC LFSLD+H AIPVDTHVW +AT + ELAG LTP++ V EA E
Sbjct: 351 KVAACACLFSLDKHEAIPVDTHVWHLATEHYCAELAGKSLTPRVMDAVEEAVGE 404
>gi|307105926|gb|EFN54173.1| hypothetical protein CHLNCDRAFT_16754, partial [Chlorella
variabilis]
Length = 291
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/293 (44%), Positives = 174/293 (59%), Gaps = 41/293 (13%)
Query: 57 GQTFRWKKTG--PLQYTGPIGPHLISLKHLQNGDVCYHI---HTSPSEPAAKSALLDFLN 111
GQ+FRW++TG PL++TG IG + L+ L + DV Y + + A +AL D+ N
Sbjct: 1 GQSFRWRQTGSEPLEFTGVIGQRAVRLRQLPD-DVQYQVVARGAGAAAAADAAALHDYFN 59
Query: 112 MGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD- 170
+G L +L + A+ R+A + L GAR+LRQDP+EC ++F+CS NN+I RI MV+
Sbjct: 60 LGTRLADLAPSWRAACPRYAHVHHLLPGARMLRQDPLECSMEFICSQNNHIQRIHGMVER 119
Query: 171 --------FLASLGSHLGNV-------EGFEFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
LA+ G G+V H FP+LE+LS +E ELR G+GYR
Sbjct: 120 LCRQHGTPLLAAAGD--GSVPPAGATPAKLALHAFPTLEQLSAATEEELRADGYGYR--- 174
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
AKYITG+V L +K GGA WLL LR + +EA++ LCTLPG+GPK
Sbjct: 175 --------------AKYITGSVAQLLAKPEGGAAWLLGLRSVPYEEAVEELCTLPGIGPK 220
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
VAACIALFSLD+H AIPVDTHVW +A +Y LP L G LT K+ V AF +
Sbjct: 221 VAACIALFSLDKHEAIPVDTHVWALACKYYLPHLRGKSLTKKVHGEVQAAFVK 273
>gi|452821221|gb|EME28254.1| trifunctional protein carbamoyl-phosphate synthase
(glutamine-dependent) / aspartate carbamoyltransferase /
dihydroorotase (CAD protein) [Galdieria sulphuraria]
Length = 2626
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/309 (39%), Positives = 171/309 (55%), Gaps = 29/309 (9%)
Query: 28 QTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ-----YTGPIGPHLISLK 82
Q L S W PLN ELSL T P GQ+F WK+ G + + G + ++ L
Sbjct: 14 QRLCGKASSLWYPLNNPPKELSLSRTLPCGQSFLWKRYGKEKDNQEYWLGVVDDTIVQLY 73
Query: 83 HLQN-GDVCYH----IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYL 137
G V + + S S + L++F N+ IS L E F D RF ++ +Y
Sbjct: 74 QPNTLGPVYFQPVHCLSESISIEHLRKRLVEFFNLHISYSSLLEQFCKVDPRFEKVVEYT 133
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLS 197
GARVL DP ECL +LCSSNNNI RI+KM+ F+A G+++G VE +F+ FP+L ++
Sbjct: 134 QGARVLSLDPFECLCGYLCSSNNNIQRISKMMQFIAQHGNYIGQVEQHKFYCFPTLNQMR 193
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
V+E +LR+ FGYR AKY+ T+D L GG EWL L++L
Sbjct: 194 SVTEQDLRSHRFGYR-----------------AKYLVHTIDRL--VQLGGIEWLSELKQL 234
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
D ++ + L LPG+G KVA+CIAL SL +H IPVDTH+W++ +Y LP+L G LT +
Sbjct: 235 DREQVVSQLMQLPGIGHKVASCIALVSLGKHDEIPVDTHIWQVVLKYYLPQLHGKTLTQR 294
Query: 318 LCSRVAEAF 326
+ + E F
Sbjct: 295 VHREIGEWF 303
>gi|303288331|ref|XP_003063454.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455286|gb|EEH52590.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 438
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 126/297 (42%), Positives = 172/297 (57%), Gaps = 26/297 (8%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH--- 94
W L EL L T PTGQ+FRW+K Y G IG ++S++ + DV Y +H
Sbjct: 32 WRSLGTPAKELRLENTLPTGQSFRWRKNADGDYVGVIGRRVVSMRQDVD-DVLYRVHCRG 90
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
SE + + + ++ N+ SL L E ++A+D RFA+LA +L G R+LRQDP ECL F
Sbjct: 91 DDESEASDAAVIAEYFNISTSLASLSEAWAAADARFAKLAPHLPGCRMLRQDPAECLFSF 150
Query: 155 LCSSNNNIARITKMVDFL-ASLGSHLGNVEG--FEFHEFPSLERLSLVSEVELRNAGFGY 211
+CSSNN+I+RI MV+ L A+ G+ L E +F+ FP++ +L SE LR GFGY
Sbjct: 151 ICSSNNHISRIHGMVERLCAAYGTKLAPDERPLGDFYAFPTVAQLRAASEETLRGMGFGY 210
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD--LQEAIDALCTL 269
R AK+I G V L +K G +L+ LR + ++A AL L
Sbjct: 211 R-----------------AKFIAGAVAALVAKPEGPDAYLMKLRAAETPYRDARAALAEL 253
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
PGVGPKVAAC+ LFSLD+H AIPVDTHVW++A + +P LA LTP++ V A
Sbjct: 254 PGVGPKVAACVCLFSLDKHDAIPVDTHVWQLAVEHYVPALASKSLTPRVMDEVEAAI 310
>gi|323452587|gb|EGB08460.1| hypothetical protein AURANDRAFT_3135, partial [Aureococcus
anophagefferens]
Length = 318
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 128/297 (43%), Positives = 164/297 (55%), Gaps = 29/297 (9%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK-HLQNGDVCYHIHTS 96
W L L SEL L T PTGQ F W + G +Y G + ++L+ H +G + S
Sbjct: 2 WRSLGLAASELKLAHTLPTGQAFGWVRVGD-EYHGCLRDAGVALREHPVSGAAEWRCDAS 60
Query: 97 PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
E AA++ + D+ +S EL+ G+S +D RFA++A + GARVLRQDP+ECL+ F+C
Sbjct: 61 -REAAAEAYVRDYFRADVSFAELYGGWSRADARFAKIATVVDGARVLRQDPLECLISFIC 119
Query: 157 SSNNNIARITKMVDFLASLGSHLGNV---EGFEFHE---FPSLERLSLVSEVELRNAGFG 210
SSNNNI RIT M+ L L + V G E E FP+LERL V E LR+ GFG
Sbjct: 120 SSNNNIPRITLMLGRLRELCGDVAAVPGGAGMEASEPRTFPTLERLCGVEERTLRDLGFG 179
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR A Y+ T ++ K GG WL LR D + AL
Sbjct: 180 YR-----------------APYVAKTARHVRDK--GGDAWLAGLRNADRADVKVALLECA 220
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGV-RLTPKLCSRVAEAF 326
GVGPKVA C+ALFSLDQ A+PVDTHVW+IA R P L V +TPK+ RV + F
Sbjct: 221 GVGPKVADCVALFSLDQADAVPVDTHVWRIARRDFDPTLDDVASITPKVYDRVGDLF 277
>gi|334335529|ref|XP_001375382.2| PREDICTED: n-glycosylase/DNA lyase-like [Monodelphis domestica]
Length = 340
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 110/274 (40%), Positives = 156/274 (56%), Gaps = 21/274 (7%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ PS W + +SEL L L +GQ+FRW++ P +TG + L +L + + Y
Sbjct: 7 TSIPSLWDSIPCPRSELDLDLVLSSGQSFRWREHSPRHWTGVLDGRLWTLTQA-DERLYY 65
Query: 92 HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
++ + P A L F + + L +L++ +S++D F E+A G R+LRQDPVECL
Sbjct: 66 TVYGEEAGPGAPQTLCRFFQLHVCLADLYQHWSSADPHFREVASRFPGVRLLRQDPVECL 125
Query: 152 LQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGF 209
F+CSSNN+I+RIT MV L LG LG ++G +H FPSL+ L+ E +LR GF
Sbjct: 126 FSFICSSNNHISRITSMVQRLCQHLGPPLGQLDGVSYHGFPSLQALAKAEVEPQLRELGF 185
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR A++++ + V+ K GGA WL LR +EA LC+L
Sbjct: 186 GYR-----------------ARFVSESARVV-LKERGGASWLQQLRAASYEEAQRDLCSL 227
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
PGVG KVA C+ L +LD+ A+PVD HVW IA R
Sbjct: 228 PGVGVKVADCVCLMALDKPQAVPVDVHVWHIARR 261
>gi|218190956|gb|EEC73383.1| hypothetical protein OsI_07627 [Oryza sativa Indica Group]
Length = 191
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 98/160 (61%), Positives = 116/160 (72%), Gaps = 17/160 (10%)
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
MV LA G LG V G++FH+FP++ERL+ VSE ELR+AGFGYR
Sbjct: 1 MVWALAGYGERLGEVGGYQFHQFPTIERLARVSEQELRDAGFGYR--------------- 45
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI GT +LQ+K GG +WL SLR +L E I+ALCTLPGVGPKVAAC+ALFSLDQ
Sbjct: 46 --AKYIVGTAKILQAKPGGGEKWLASLRTRELPEVIEALCTLPGVGPKVAACVALFSLDQ 103
Query: 288 HHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFC 327
+HAIPVDTHVWK+AT+YL+PELAG LTPKL VA+AF
Sbjct: 104 NHAIPVDTHVWKVATQYLMPELAGKSLTPKLSVAVADAFV 143
>gi|443704748|gb|ELU01650.1| hypothetical protein CAPTEDRAFT_113225 [Capitella teleta]
Length = 300
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 117/299 (39%), Positives = 166/299 (55%), Gaps = 30/299 (10%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W + +++EL L +T GQ+FRWK+T P ++TG + + +LK Q+ D+ ++ T
Sbjct: 3 WRKIACSKTELDLGVTLICGQSFRWKETAPGEWTGVLKNLVWNLK--QSDDLLFY-KTLE 59
Query: 98 SEPAAKSALL------DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
PAA S L D+ + ++L +L+E +S+ D F A G R+LRQDPVE L
Sbjct: 60 ESPAASSDLTQEDILKDYFQLHVNLSKLYEQWSSDDPNFKSKASSFRGVRILRQDPVENL 119
Query: 152 LQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGF 209
F+CSSNNNI+RI+ MV+ + G+ LG EG + FPS+E L+ S E +LR GF
Sbjct: 120 FSFICSSNNNISRISGMVERMCEEFGTSLGEYEGLHHYSFPSIESLAKGSVEQKLRKLGF 179
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AKYI + + HS WL +LRKL EA L L
Sbjct: 180 GYR-----------------AKYINQSARFIMENHS--EHWLHNLRKLPYNEAKSELMKL 220
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
GVG KVA C+ L SLD++ +PVDTHVW+IA +YL + +TP++ + V F E
Sbjct: 221 CGVGAKVADCVCLMSLDKNDVVPVDTHVWQIAQKYLPSLRSSKTITPQVYNAVGNHFRE 279
>gi|386780658|ref|NP_001247508.1| N-glycosylase/DNA lyase [Macaca mulatta]
gi|380790213|gb|AFE66982.1| N-glycosylase/DNA lyase isoform 1a [Macaca mulatta]
Length = 345
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/279 (36%), Positives = 152/279 (54%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+P + A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQPGRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLPWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHIWQIAQR 277
>gi|297670746|ref|XP_002813520.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Pongo abelii]
Length = 345
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/279 (36%), Positives = 152/279 (54%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 -----HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
TS P A+ +L + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQTSRPTPDELEAVRKYLQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVETHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + +H GG WL L++ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEH-GGLAWLQQLQEASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 GLCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|395733480|ref|XP_003776243.1| PREDICTED: N-glycosylase/DNA lyase isoform 4 [Pongo abelii]
Length = 356
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/278 (37%), Positives = 152/278 (54%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY- 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 SAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 92 ----HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
TS P A+ +L + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGGKSQTSRPTPDELEAVRKYLQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVETHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + +H GG WL L++ +EA
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEH-GGLAWLQQLQEASYEEAHKG 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 240 LCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|395733478|ref|XP_003776242.1| PREDICTED: N-glycosylase/DNA lyase isoform 3 [Pongo abelii]
Length = 322
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/279 (36%), Positives = 152/279 (54%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 -----HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
TS P A+ +L + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQTSRPTPDELEAVRKYLQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVETHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + +H GG WL L++ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEH-GGLAWLQQLQEASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 GLCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|402859468|ref|XP_003894181.1| PREDICTED: N-glycosylase/DNA lyase [Papio anubis]
Length = 345
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 152/279 (54%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+P + A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQPGRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPS++ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSVQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLPWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHIWQIAQR 277
>gi|390475300|ref|XP_002807648.2| PREDICTED: N-glycosylase/DNA lyase [Callithrix jacchus]
Length = 345
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPAMWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDQSRAGRPTPDELEAVCKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPQLIQLDDVTYHGFPSLQALAGPDVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|395733476|ref|XP_003776241.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Pongo abelii]
Length = 424
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 103/279 (36%), Positives = 152/279 (54%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 -----HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
TS P A+ +L + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQTSRPTPDELEAVRKYLQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVETHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + +H GG WL L++ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEH-GGLAWLQQLQEASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 GLCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|332231627|ref|XP_003264995.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Nomascus leucogenys]
Length = 345
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSRASRPTPDELDAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEARL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|340378892|ref|XP_003387961.1| PREDICTED: n-glycosylase/DNA lyase-like [Amphimedon queenslandica]
Length = 308
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 169/308 (54%), Gaps = 32/308 (10%)
Query: 30 LTTNKPSK--WTPLNLTQSELSLPLTFPTGQTFRWKKT--GPLQYTGPIGPHLISLKHLQ 85
+ N+ K W+ + L + ELSL PTGQTFRW KT P +++G I + L+ +
Sbjct: 1 MAANEEDKMSWSTITLEKKELSLADVLPTGQTFRWYKTRDDPEEWSGVIKGRVFILQQTE 60
Query: 86 NGDVCYHIH----TSPSEPAAKSALLDFLNMG-ISLGELWEGFSASDCRFAELAKYLAGA 140
N ++ YH+H TS S L ++ + ISL +L+ +S D E+ G
Sbjct: 61 N-ELKYHVHSVKTTSSSNEEDLLTLKEYFQLDTISLSKLYSDWSEVDETMREVCHECPGV 119
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLV 199
R+L QDP+E L+ F+CSSNNNI+RI+ M++ L+ G+ LG G EF+ FPS+E LS
Sbjct: 120 RILGQDPLETLISFICSSNNNISRISLMINRLSRHFGASLGTHRGTEFYSFPSVESLSRP 179
Query: 200 S-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
E +LR GFGYR AKY++G V L S+ S WL SL+ L
Sbjct: 180 GLEDKLRELGFGYR-----------------AKYVSGAVSYLSSQPSN---WLESLQDLS 219
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
EA +AL +PGVGPKV+ C+ L L ++ A+PVD H+W+++ R + AG L+ L
Sbjct: 220 YLEAWEALQIIPGVGPKVSDCVCLMGLRKYEAVPVDVHMWRVSERLYGFKGAGKNLSIAL 279
Query: 319 CSRVAEAF 326
+ + F
Sbjct: 280 YKEIGDMF 287
>gi|363738680|ref|XP_001234323.2| PREDICTED: N-glycosylase/DNA lyase [Gallus gallus]
Length = 329
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 160/297 (53%), Gaps = 23/297 (7%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W L +EL L L +GQ FRW+++ P +TG +G + +L+ + + Y ++
Sbjct: 11 PTLWRWLRCPPAELRLDLVLASGQAFRWRESSPGAWTGVLGDRVWTLRQ-ERDRLWYTVY 69
Query: 95 TSPS-EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
+ P L D+ + + L L+ + A+D F + A G RVLRQDPVECLL
Sbjct: 70 GRETPGPETDRILRDYFQLDVGLAALYRTWGAADPLFCQTATAFPGVRVLRQDPVECLLS 129
Query: 154 FLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS-LVSEVELRNAGFGY 211
F+C+SNN++ARIT M++ L + G HL +++ FH FPSL L+ +E +LR GFGY
Sbjct: 130 FICTSNNHVARITTMIERLCQAFGQHLCSLDEQPFHAFPSLAALAGSEAEAKLRALGFGY 189
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
R A++++GT + GA+ L LR + EA LC LPG
Sbjct: 190 R-----------------ARFVSGTARAI--AEGMGAKGLCQLRAVPYAEARRVLCALPG 230
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
VG KVA C+ L +LD+ A+PVDTHVW IA + L LT ++ + + F E
Sbjct: 231 VGAKVADCVCLMALDKAEAVPVDTHVWHIARQRYGAALGARSLTARVHQEIGDFFRE 287
>gi|344250084|gb|EGW06188.1| N-glycosylase/DNA lyase [Cricetulus griseus]
Length = 335
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 106/278 (38%), Positives = 151/278 (54%), Gaps = 27/278 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 7 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLYCT 66
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ + AL + + +SL +L+ +++ D F ++A+ G R+LRQD
Sbjct: 67 VYRGDKSQVGRPTLEELEALHKYFQLDVSLAQLYSHWASVDSHFQKVAQKFQGVRLLRQD 126
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSLE L+ E L
Sbjct: 127 PTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVIYHSFPSLEALAGPEVEAHL 186
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAI 263
R G GYR A+Y+ + +L+ K GG WL LRK +EA
Sbjct: 187 RKLGLGYR-----------------ARYVCASAKAILEEK--GGLAWLQQLRKAPYEEAH 227
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA
Sbjct: 228 KALCTLPGVGTKVADCICLMALDKPQAVPVDVHVWQIA 265
>gi|426339383|ref|XP_004033630.1| PREDICTED: N-glycosylase/DNA lyase isoform 3 [Gorilla gorilla
gorilla]
Length = 345
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPSHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L++ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQASRPTPDELEAVRKYFQLDVTLAQLYDHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|156372480|ref|XP_001629065.1| predicted protein [Nematostella vectensis]
gi|156216057|gb|EDO37002.1| predicted protein [Nematostella vectensis]
Length = 321
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 172/321 (53%), Gaps = 53/321 (16%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH-IH-- 94
W + S++ L + GQ+FRW +T ++ + + +LK Q+ + Y+ +H
Sbjct: 6 WRRIPCKASQVRLDVVLSAGQSFRWHETSEREWRSVLRGKVWTLK--QDAEFLYYTVHDP 63
Query: 95 ----TSPSEPAA----------------------KSALLDFLNMGISLGELWEGFSASDC 128
TS +P + ++ L D+ + I L +L++ +SA+D
Sbjct: 64 VPAVTSSIKPCSSEQSSLQGERTSSDDNQNWNDYEAILTDYFQLNIDLQDLYDKWSAADI 123
Query: 129 RFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEF 187
F++++K G R LRQ+PVE L F+CSSNNNI+RIT M++ + S G + +V+G ++
Sbjct: 124 NFSKVSKSFTGIRTLRQEPVENLFSFICSSNNNISRITSMIEKMCSKYGEKVASVDGIDY 183
Query: 188 HEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
FP++ L+ E +LR+ GFGYR AK+I + ++ G
Sbjct: 184 FSFPTVSSLADPKVEQDLRSMGFGYR-----------------AKFINKSAQLI--IEYG 224
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
G +WLLSLRK Q+A ALC LPG+G KVA C+ L SLD+H AIPVDTHVW+I + +
Sbjct: 225 GQDWLLSLRKQSYQDAHSALCKLPGIGSKVADCVCLMSLDKHSAIPVDTHVWQITAKCYM 284
Query: 307 PELAGVR-LTPKLCSRVAEAF 326
P L + LT KL ++ + +
Sbjct: 285 PNLLKNKTLTDKLYLQIGDHY 305
>gi|311976782|gb|ADQ20164.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976838|gb|ADQ20213.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976878|gb|ADQ20248.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976886|gb|ADQ20255.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 410
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 149/278 (53%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 240 LCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|8670530|ref|NP_058212.1| N-glycosylase/DNA lyase isoform 1b [Homo sapiens]
gi|5305499|gb|AAD41681.1|AF088282_2 8-hydroxyguanine-specific DNA glycosylase type 1b [Homo sapiens]
gi|119584384|gb|EAW63980.1| 8-oxoguanine DNA glycosylase, isoform CRA_c [Homo sapiens]
gi|168277444|dbj|BAG10700.1| N-glycosylase/DNA lyase [synthetic construct]
Length = 324
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|8670532|ref|NP_058213.1| N-glycosylase/DNA lyase isoform 1c [Homo sapiens]
gi|5305500|gb|AAD41682.1|AF088282_3 8-hydroxyguanine-specific DNA glycosylase type 1c [Homo sapiens]
gi|119584385|gb|EAW63981.1| 8-oxoguanine DNA glycosylase, isoform CRA_d [Homo sapiens]
gi|311976734|gb|ADQ20122.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976742|gb|ADQ20129.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976750|gb|ADQ20136.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976758|gb|ADQ20143.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976766|gb|ADQ20150.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976774|gb|ADQ20157.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976790|gb|ADQ20171.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976798|gb|ADQ20178.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976806|gb|ADQ20185.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976814|gb|ADQ20192.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976822|gb|ADQ20199.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976830|gb|ADQ20206.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976846|gb|ADQ20220.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976854|gb|ADQ20227.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976862|gb|ADQ20234.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976870|gb|ADQ20241.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976894|gb|ADQ20262.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976902|gb|ADQ20269.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976910|gb|ADQ20276.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976918|gb|ADQ20283.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976926|gb|ADQ20290.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976934|gb|ADQ20297.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976942|gb|ADQ20304.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976950|gb|ADQ20311.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976958|gb|ADQ20318.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976966|gb|ADQ20325.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976974|gb|ADQ20332.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976982|gb|ADQ20339.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976990|gb|ADQ20346.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976998|gb|ADQ20353.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977006|gb|ADQ20360.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977014|gb|ADQ20367.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977022|gb|ADQ20374.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977030|gb|ADQ20381.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977038|gb|ADQ20388.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977046|gb|ADQ20395.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 410
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 149/278 (53%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 240 LCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|2078294|gb|AAB81132.1| 8-oxoguanine DNA-glycosylase [Homo sapiens]
Length = 351
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|4505495|ref|NP_002533.1| N-glycosylase/DNA lyase isoform 1a [Homo sapiens]
gi|12643548|sp|O15527.2|OGG1_HUMAN RecName: Full=N-glycosylase/DNA lyase; Includes: RecName:
Full=8-oxoguanine DNA glycosylase; Includes: RecName:
Full=DNA-(apurinic or apyrimidinic site) lyase; Short=AP
lyase
gi|18158657|pdb|1KO9|A Chain A, Native Structure Of The Human 8-Oxoguanine Dna Glycosylase
Hogg1
gi|5305498|gb|AAD41680.1|AF088282_1 8-hydroxyguanine-specific DNA glycosylase type 1a [Homo sapiens]
gi|21668118|gb|AAM74236.1|AF521807_1 8-oxoguanine DNA glycosylase [Homo sapiens]
gi|1890182|emb|CAA72414.1| DNA glycosylase/AP lyase [Homo sapiens]
gi|1903206|emb|CAA72536.1| 8-oxoguanine DNA glycosylase homolog 1 [Homo sapiens]
gi|1906757|dbj|BAA19103.1| hOGG1 [Homo sapiens]
gi|2197083|gb|AAB61340.1| 8-oxoguanine DNA glycosylase 1 [Homo sapiens]
gi|2351704|gb|AAB68614.1| 8-hydroxyguanine glycosylase/lyase [Homo sapiens]
gi|119584387|gb|EAW63983.1| 8-oxoguanine DNA glycosylase, isoform CRA_f [Homo sapiens]
gi|311976735|gb|ADQ20123.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976743|gb|ADQ20130.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976751|gb|ADQ20137.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976759|gb|ADQ20144.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976767|gb|ADQ20151.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976775|gb|ADQ20158.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976791|gb|ADQ20172.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976799|gb|ADQ20179.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976807|gb|ADQ20186.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976815|gb|ADQ20193.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976823|gb|ADQ20200.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976831|gb|ADQ20207.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976847|gb|ADQ20221.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976855|gb|ADQ20228.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976863|gb|ADQ20235.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976871|gb|ADQ20242.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976895|gb|ADQ20263.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976903|gb|ADQ20270.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976911|gb|ADQ20277.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976919|gb|ADQ20284.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976927|gb|ADQ20291.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976935|gb|ADQ20298.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976943|gb|ADQ20305.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976951|gb|ADQ20312.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976959|gb|ADQ20319.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976967|gb|ADQ20326.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976975|gb|ADQ20333.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976983|gb|ADQ20340.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976991|gb|ADQ20347.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976999|gb|ADQ20354.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977007|gb|ADQ20361.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977015|gb|ADQ20368.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977023|gb|ADQ20375.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977031|gb|ADQ20382.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977039|gb|ADQ20389.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977047|gb|ADQ20396.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 345
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|332816035|ref|XP_003309654.1| PREDICTED: N-glycosylase/DNA lyase isoform 4 [Pan troglodytes]
gi|397486361|ref|XP_003814298.1| PREDICTED: N-glycosylase/DNA lyase isoform 4 [Pan paniscus]
gi|410215468|gb|JAA04953.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410215470|gb|JAA04954.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410266318|gb|JAA21125.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410291776|gb|JAA24488.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410291778|gb|JAA24489.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410349817|gb|JAA41512.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
Length = 345
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|332816039|ref|XP_003309656.1| PREDICTED: N-glycosylase/DNA lyase isoform 6 [Pan troglodytes]
gi|410215474|gb|JAA04956.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410291782|gb|JAA24491.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
gi|410349821|gb|JAA41514.1| 8-oxoguanine DNA glycosylase [Pan troglodytes]
Length = 410
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 149/278 (53%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 240 LCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|332816031|ref|XP_003309652.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Pan troglodytes]
gi|397486357|ref|XP_003814296.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Pan paniscus]
Length = 356
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 149/278 (53%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 240 LCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|12653743|gb|AAH00657.1| 8-oxoguanine DNA glycosylase [Homo sapiens]
gi|123999040|gb|ABM87106.1| 8-oxoguanine DNA glycosylase [synthetic construct]
gi|157929116|gb|ABW03843.1| 8-oxoguanine DNA glycosylase [synthetic construct]
gi|158260739|dbj|BAF82547.1| unnamed protein product [Homo sapiens]
gi|311976783|gb|ADQ20165.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976839|gb|ADQ20214.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976879|gb|ADQ20249.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976887|gb|ADQ20256.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 345
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|403270336|ref|XP_003927142.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Saimiri boliviensis
boliviensis]
Length = 345
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSRAGRPTPDELEAVCKYFQLDVTLTQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ ++ FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYYGFPSLQALAGPDVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|8670542|ref|NP_058438.1| N-glycosylase/DNA lyase isoform 2e [Homo sapiens]
gi|332816033|ref|XP_003309653.1| PREDICTED: N-glycosylase/DNA lyase isoform 3 [Pan troglodytes]
gi|397486359|ref|XP_003814297.1| PREDICTED: N-glycosylase/DNA lyase isoform 3 [Pan paniscus]
gi|119584389|gb|EAW63985.1| 8-oxoguanine DNA glycosylase, isoform CRA_h [Homo sapiens]
gi|311976733|gb|ADQ20121.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976741|gb|ADQ20128.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976749|gb|ADQ20135.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976757|gb|ADQ20142.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976765|gb|ADQ20149.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976773|gb|ADQ20156.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976781|gb|ADQ20163.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976789|gb|ADQ20170.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976797|gb|ADQ20177.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976805|gb|ADQ20184.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976813|gb|ADQ20191.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976821|gb|ADQ20198.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976829|gb|ADQ20205.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976837|gb|ADQ20212.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976845|gb|ADQ20219.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976853|gb|ADQ20226.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976861|gb|ADQ20233.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976869|gb|ADQ20240.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976877|gb|ADQ20247.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976885|gb|ADQ20254.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976893|gb|ADQ20261.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976901|gb|ADQ20268.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976909|gb|ADQ20275.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976917|gb|ADQ20282.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976925|gb|ADQ20289.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976933|gb|ADQ20296.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976941|gb|ADQ20303.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976949|gb|ADQ20310.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976957|gb|ADQ20317.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976965|gb|ADQ20324.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976973|gb|ADQ20331.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976981|gb|ADQ20338.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976989|gb|ADQ20345.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976997|gb|ADQ20352.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977005|gb|ADQ20359.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977013|gb|ADQ20366.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977021|gb|ADQ20373.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977029|gb|ADQ20380.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977037|gb|ADQ20387.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977045|gb|ADQ20394.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 322
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|354469095|ref|XP_003496966.1| PREDICTED: N-glycosylase/DNA lyase-like [Cricetulus griseus]
Length = 449
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 106/278 (38%), Positives = 151/278 (54%), Gaps = 27/278 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 121 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLYCT 180
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ + AL + + +SL +L+ +++ D F ++A+ G R+LRQD
Sbjct: 181 VYRGDKSQVGRPTLEELEALHKYFQLDVSLAQLYSHWASVDSHFQKVAQKFQGVRLLRQD 240
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSLE L+ E L
Sbjct: 241 PTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVIYHSFPSLEALAGPEVEAHL 300
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAI 263
R G GYR A+Y+ + +L+ K GG WL LRK +EA
Sbjct: 301 RKLGLGYR-----------------ARYVCASAKAILEEK--GGLAWLQQLRKAPYEEAH 341
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA
Sbjct: 342 KALCTLPGVGTKVADCICLMALDKPQAVPVDVHVWQIA 379
>gi|8670540|ref|NP_058437.1| N-glycosylase/DNA lyase isoform 2d [Homo sapiens]
gi|119584391|gb|EAW63987.1| 8-oxoguanine DNA glycosylase, isoform CRA_j [Homo sapiens]
gi|311976731|gb|ADQ20119.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976739|gb|ADQ20126.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976747|gb|ADQ20133.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976755|gb|ADQ20140.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976763|gb|ADQ20147.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976771|gb|ADQ20154.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976779|gb|ADQ20161.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976787|gb|ADQ20168.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976795|gb|ADQ20175.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976803|gb|ADQ20182.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976811|gb|ADQ20189.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976819|gb|ADQ20196.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976827|gb|ADQ20203.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976835|gb|ADQ20210.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976843|gb|ADQ20217.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976851|gb|ADQ20224.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976859|gb|ADQ20231.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976867|gb|ADQ20238.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976875|gb|ADQ20245.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976883|gb|ADQ20252.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976891|gb|ADQ20259.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976899|gb|ADQ20266.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976907|gb|ADQ20273.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976915|gb|ADQ20280.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976923|gb|ADQ20287.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976931|gb|ADQ20294.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976939|gb|ADQ20301.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976947|gb|ADQ20308.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976955|gb|ADQ20315.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976963|gb|ADQ20322.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976971|gb|ADQ20329.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976979|gb|ADQ20336.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976987|gb|ADQ20343.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976995|gb|ADQ20350.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977003|gb|ADQ20357.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977011|gb|ADQ20364.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977019|gb|ADQ20371.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977027|gb|ADQ20378.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977035|gb|ADQ20385.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977043|gb|ADQ20392.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 356
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 149/278 (53%), Gaps = 25/278 (8%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 240 LCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|28948369|pdb|1HU0|A Chain A, Crystal Structure Of An Hogg1-Dna Borohydride Trapped
Intermediate Complex
gi|28948498|pdb|1LWV|A Chain A, Borohydride-Trapped Hogg1 Intermediate Structure Co-
Crystallized With 8-Aminoguanine
gi|28948501|pdb|1LWW|A Chain A, Borohydride-Trapped Hogg1 Intermediate Structure Co-
Crystallized With 8-Bromoguanine
gi|28948504|pdb|1LWY|A Chain A, Hogg1 Borohydride-Trapped Intermediate Without
8-Oxoguanine
Length = 324
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 14 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 73
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 74 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 133
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 134 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 193
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 194 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 235
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 236 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 274
>gi|119390537|pdb|2NOZ|A Chain A, Structure Of Q315f Human 8-Oxoguanine Glycosylase Distal
Crosslink To 8-Oxoguanine Dna
Length = 325
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|14278394|pdb|1FN7|A Chain A, Coupling Of Damage Recognition And Catalysis By A Human
Base-Excision Dna Repair Protein
Length = 317
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|73985020|ref|XP_541781.2| PREDICTED: N-glycosylase/DNA lyase [Canis lupus familiaris]
Length = 334
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 102/276 (36%), Positives = 149/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P +TG + + +L + C
Sbjct: 10 PALWASIPCPRSELRLDLVLASGQSFRWREQDPAHWTGVLANRVWTLTQTEEKLYCTVYR 69
Query: 95 TSPS-----EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
P + + + +SL +L+ +S++D F E+++ G R+L+QDP+E
Sbjct: 70 DDNGWVGRPTPEELKTVHQYFQLDVSLAQLYHHWSSADPHFQEVSQKFQGVRLLQQDPIE 129
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LR
Sbjct: 130 CLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRKL 189
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A+Y++ + + + GG WL LRK +EA ALC
Sbjct: 190 GLGYR-----------------ARYVSASARAILEEQ-GGLPWLQQLRKAPYEEAHKALC 231
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 232 TLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 267
>gi|403270338|ref|XP_003927143.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Saimiri boliviensis
boliviensis]
Length = 322
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSRAGRPTPDELEAVCKYFQLDVTLTQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ ++ FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYYGFPSLQALAGPDVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLREASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 277
>gi|426339379|ref|XP_004033628.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Gorilla gorilla
gorilla]
Length = 424
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 150/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPSHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L++ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQASRPTPDELEAVRKYFQLDVTLAQLYDHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|8670534|ref|NP_058214.1| N-glycosylase/DNA lyase isoform 2a [Homo sapiens]
gi|2351706|gb|AAB68615.1| 8-hydroxyguanine glycosylase [Homo sapiens]
gi|2558577|emb|CAA73726.1| 8-hydroxyguanine-DNA glycosylase [Homo sapiens]
gi|119584382|gb|EAW63978.1| 8-oxoguanine DNA glycosylase, isoform CRA_a [Homo sapiens]
gi|311976729|gb|ADQ20117.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976737|gb|ADQ20124.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976745|gb|ADQ20131.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976753|gb|ADQ20138.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976761|gb|ADQ20145.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976769|gb|ADQ20152.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976777|gb|ADQ20159.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976785|gb|ADQ20166.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976793|gb|ADQ20173.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976801|gb|ADQ20180.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976809|gb|ADQ20187.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976817|gb|ADQ20194.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976825|gb|ADQ20201.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976833|gb|ADQ20208.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976841|gb|ADQ20215.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976849|gb|ADQ20222.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976857|gb|ADQ20229.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976865|gb|ADQ20236.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976873|gb|ADQ20243.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976881|gb|ADQ20250.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976889|gb|ADQ20257.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976897|gb|ADQ20264.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976905|gb|ADQ20271.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976913|gb|ADQ20278.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976921|gb|ADQ20285.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976929|gb|ADQ20292.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976937|gb|ADQ20299.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976945|gb|ADQ20306.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976953|gb|ADQ20313.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976961|gb|ADQ20320.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976969|gb|ADQ20327.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976977|gb|ADQ20334.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976985|gb|ADQ20341.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976993|gb|ADQ20348.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977001|gb|ADQ20355.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977009|gb|ADQ20362.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977017|gb|ADQ20369.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977025|gb|ADQ20376.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977033|gb|ADQ20383.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977041|gb|ADQ20390.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 424
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|332816029|ref|XP_003309651.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Pan troglodytes]
gi|397486355|ref|XP_003814295.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Pan paniscus]
Length = 424
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>gi|310942670|pdb|3IH7|A Chain A, Crystal Structure Of Catalytically Active Human
8-Oxoguanine Glycosylase Distally Crosslinked To
Guanine-Containing Dna
Length = 316
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 151/282 (53%), Gaps = 31/282 (10%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN----- 86
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L +
Sbjct: 8 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 67
Query: 87 ---GDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVL 143
GD+ +P E A+ + + ++L +L+ + + D F E+A+ G R+L
Sbjct: 68 VYRGDLSQASRPTPDE---LEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLL 124
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-E 201
RQDP+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E
Sbjct: 125 RQDPIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVE 184
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
LR G GYR A+Y++ + + + GG WL LR+ +E
Sbjct: 185 AHLRKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEE 226
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
A ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 227 AHKALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 268
>gi|345323696|ref|XP_003430738.1| PREDICTED: LOW QUALITY PROTEIN: N-glycosylase/DNA lyase-like
[Ornithorhynchus anatinus]
Length = 387
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 160/282 (56%), Gaps = 30/282 (10%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SELSL L +GQ+FRW++ P +TG + + +L + + Y
Sbjct: 38 ASAPALWASIPCPRSELSLDLVLASGQSFRWREQSPGYWTGVLAGRVWTLTQAEE-RLYY 96
Query: 92 HIHTSPS-------EPAAKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVL 143
++ + + E A + +L F + + L EL+ + + D F ++A+ G R+L
Sbjct: 97 TVYKAEAGSPDLRKETAEEQEILHQFFQLDVQLAELYRHWGSVDPHFRQVAQKFLGVRLL 156
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-E 201
RQDPVECL+ F+CSSNN+I+RIT+MV+ L + G L +++ +H FPSL+ L+ E
Sbjct: 157 RQDPVECLISFICSSNNHISRITRMVEQLCQAFGPRLCHLDSVPYHSFPSLQTLAGSDVE 216
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
+LR GFGYR A++++ + + S H G +WL LR E
Sbjct: 217 QQLRALGFGYR-----------------ARFVSESAQTILSAH--GPDWLQRLRTASYTE 257
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
A ALC+LPGVG KVA C+ L +LD+ A+PVDTHVW+IA R
Sbjct: 258 ARQALCSLPGVGAKVADCVCLMALDKPGAVPVDTHVWQIAHR 299
>gi|7546476|pdb|1EBM|A Chain A, Crystal Structure Of The Human 8-Oxoguanine Glycosylase
(Hogg1) Bound To A Substrate Oligonucleotide
Length = 317
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|119390519|pdb|2NOL|A Chain A, Structure Of Catalytically Inactive Human 8-Oxoguanine
Glycosylase Distal Crosslink To Oxog Dna
Length = 325
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|119390513|pdb|2NOH|A Chain A, Structure Of Catalytically Inactive Q315a Human 8-
Oxoguanine Glycosylase Complexed To 8-oxoguanine Dna
Length = 325
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|296421389|ref|XP_002840247.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636462|emb|CAZ84438.1| unnamed protein product [Tuber melanosporum]
Length = 413
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 148/253 (58%), Gaps = 25/253 (9%)
Query: 57 GQTFRWKKTGPLQYTGPIGPHLISLK----HLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
GQ+FRWK + P ++T + +++L+ HL + T+P + D+ N+
Sbjct: 27 GQSFRWKASNPGEWTCALNGRILTLRQDDSHLHYRAIFPATTTAPQNDDTVELIRDYFNL 86
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
++L +L+E +SA+D F + A AG R+LRQDP E L+ F+CSSNNNI+RI++MVD L
Sbjct: 87 DVNLTKLYERWSAADAHFLKKAVRFAGIRMLRQDPWENLVSFICSSNNNISRISQMVDKL 146
Query: 173 -ASLGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
A+ G LG V+G +H+FPS L +E LR GFGYR A
Sbjct: 147 CATFGPKLGQVDGHTYHDFPSPGALMGDGTEQTLRELGFGYR-----------------A 189
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
KYI+ T ++ + G WL LRK+ ++A +AL L GVGPKVA C+ L SLD+ A
Sbjct: 190 KYISTTARIIAQERLAG--WLDGLRKVGYRDAHEALLGLSGVGPKVADCVCLMSLDKAEA 247
Query: 291 IPVDTHVWKIATR 303
+PVDTHVW+IA R
Sbjct: 248 VPVDTHVWQIAQR 260
>gi|28948715|pdb|1N39|A Chain A, Structural And Biochemical Exploration Of A Critical Amino
Acid In Human 8-Oxoguanine Glycosylase
gi|46014904|pdb|1M3Q|A Chain A, Crystal Structure Of Hogg1 D268e Mutant With Base-Excised
Dna And 8-Aminoguanine
gi|48425055|pdb|1M3H|A Chain A, Crystal Structure Of Hogg1 D268e Mutant With Product
Oligonucleotide
Length = 317
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PV+ H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVEVHMWHIAQR 269
>gi|28948721|pdb|1N3C|A Chain A, Structural And Biochemical Exploration Of A Critical Amino
Acid In Human 8-oxoguanine Glycosylase
Length = 317
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PV+ H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVNVHMWHIAQR 269
>gi|301779335|ref|XP_002925079.1| PREDICTED: n-glycosylase/DNA lyase-like [Ailuropoda melanoleuca]
Length = 334
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 149/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P +TG + + +L + C
Sbjct: 10 PALWASIPCPRSELRLDLVLASGQSFRWREQNPAHWTGVLANQVWTLTQTEEQLYCTVYR 69
Query: 95 -----TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
S P + + + ++L +L+ +S+ D F E+A+ G R+L+QDP+E
Sbjct: 70 GDKGWVSRPTPEELKTVHQYFQLDVNLAQLYHHWSSVDPHFQEVAQKFQGVRLLQQDPIE 129
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LR
Sbjct: 130 CLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRKL 189
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A+Y++ + + + GG WL L+K +EA ALC
Sbjct: 190 GLGYR-----------------ARYVSASARAILEEQ-GGLSWLQQLQKAPYEEAHKALC 231
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 232 TLPGVGTKVADCICLMALDKPQAVPVDIHMWQIAQR 267
>gi|431899909|gb|ELK07856.1| N-glycosylase/DNA lyase [Pteropus alecto]
Length = 335
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 149/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W+ + SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 10 PALWSSIPCPLSELRLDLVLASGQSFRWREQSPAHWSGVLANQVWTLTQTEEQLYCTVYR 69
Query: 95 TSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ P A+ + + +SL +L+ +S+ D F +A+ G R+L+QDP+E
Sbjct: 70 GEKGQVGRPTPEELKAVHQYFQLDVSLAKLYHHWSSMDPHFQGVAQKFQGVRLLQQDPIE 129
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LR
Sbjct: 130 CLFSFICSSNNNIARITGMVERLCQAFGPQLIQLDEITYHGFPSLQDLAGPQVETQLRKL 189
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A+Y++ + + + GG WL LRK +EA ALC
Sbjct: 190 GLGYR-----------------ARYVSASARAILEEQ-GGLPWLQQLRKASYEEAHKALC 231
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 232 TLPGVGTKVADCICLMALDKPQAVPVDVHMWQIAQR 267
>gi|2196876|emb|CAA73883.1| 8-oxoguanine DNA glycosylase [Mus musculus]
Length = 345
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 155/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R +EA ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|28948718|pdb|1N3A|A Chain A, Structural And Biochemical Exploration Of A Critical Amino
Acid In Human 8-Oxoguanine Glycosylase
Length = 317
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PV H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVQVHMWHIAQR 269
>gi|410951696|ref|XP_003982529.1| PREDICTED: N-glycosylase/DNA lyase [Felis catus]
Length = 463
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 105/279 (37%), Positives = 150/279 (53%), Gaps = 31/279 (11%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN-------- 86
P+ W + +SEL L L +GQ+FRW++ P +TG +G + +L +
Sbjct: 139 PALWASIPCPRSELRLDLVLASGQSFRWREQNPAHWTGVLGNQVWTLTQTEEQIYCTVYR 198
Query: 87 GDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
GD + +P E + + + +SL +L+ +S+ D F E+A G R+L+QD
Sbjct: 199 GDKGWVGRPTPEE---LKTVHQYFQLDVSLAQLYHHWSSVDPHFQEVAHKFQGVRLLQQD 255
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
PVECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +L
Sbjct: 256 PVECLFSFICSSNNNIARITGMVERLCQAFGPQLIQLDDVIYHGFPSLQALAGPEVEAQL 315
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y+ + + + GG WL L K +EA
Sbjct: 316 RKLGLGYR-----------------ARYVNASARAILEEQ-GGLPWLQQLHKAPYEEAHK 357
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 358 ALCTLPGVGTKVADCICLMALDKPQAVPVDIHMWQIAQR 396
>gi|281354157|gb|EFB29741.1| hypothetical protein PANDA_014516 [Ailuropoda melanoleuca]
Length = 321
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 149/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P +TG + + +L + C
Sbjct: 11 PALWASIPCPRSELRLDLVLASGQSFRWREQNPAHWTGVLANQVWTLTQTEEQLYCTVYR 70
Query: 95 -----TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
S P + + + ++L +L+ +S+ D F E+A+ G R+L+QDP+E
Sbjct: 71 GDKGWVSRPTPEELKTVHQYFQLDVNLAQLYHHWSSVDPHFQEVAQKFQGVRLLQQDPIE 130
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LR
Sbjct: 131 CLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRKL 190
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A+Y++ + + + GG WL L+K +EA ALC
Sbjct: 191 GLGYR-----------------ARYVSASARAILEEQ-GGLSWLQQLQKAPYEEAHKALC 232
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD H+W+IA R
Sbjct: 233 TLPGVGTKVADCICLMALDKPQAVPVDIHMWQIAQR 268
>gi|187960092|ref|NP_035087.3| N-glycosylase/DNA lyase [Mus musculus]
gi|12643533|sp|O08760.2|OGG1_MOUSE RecName: Full=N-glycosylase/DNA lyase; Includes: RecName:
Full=8-oxoguanine DNA glycosylase; Includes: RecName:
Full=DNA-(apurinic or apyrimidinic site) lyase; Short=AP
lyase
gi|2257848|gb|AAB63151.1| mOGG1 [Mus musculus]
gi|2738598|gb|AAB94512.1| 8-oxoguanine DNA glycosylase [Mus musculus]
gi|74147175|dbj|BAE27493.1| unnamed protein product [Mus musculus]
gi|148667034|gb|EDK99450.1| 8-oxoguanine DNA-glycosylase 1, isoform CRA_a [Mus musculus]
gi|187952987|gb|AAI38737.1| 8-oxoguanine DNA-glycosylase 1 [Mus musculus]
gi|187954039|gb|AAI38734.1| 8-oxoguanine DNA-glycosylase 1 [Mus musculus]
Length = 345
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 155/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R +EA ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|2078296|gb|AAB81133.1| 8-oxoguanine DNA-glycosylase [Mus musculus]
Length = 345
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 155/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R +EA ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|2598924|gb|AAB84013.1| 8-oxoguanine DNA glycosylase [Homo sapiens]
Length = 341
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 7 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 66
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 67 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDYHFQEVAQKFQGVRLLRQD 126
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 127 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 186
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL L + +EA
Sbjct: 187 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLPQSSYEEAHK 228
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 229 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 267
>gi|2104622|emb|CAA72117.1| 8-oxoguanine DNA glycosylase-like protein [Mus musculus]
gi|6688667|emb|CAB65240.1| 8-oxoguanine DNA glycosylase 1 [Mus musculus]
Length = 345
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 155/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R +EA ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|224004634|ref|XP_002295968.1| glycosylase [Thalassiosira pseudonana CCMP1335]
gi|209586000|gb|ACI64685.1| glycosylase [Thalassiosira pseudonana CCMP1335]
Length = 284
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 115/301 (38%), Positives = 159/301 (52%), Gaps = 31/301 (10%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRW-KKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
+ W L + EL T GQ F W ++ GP+G ++S+K + +H
Sbjct: 3 NDWVDLKVAPEELRPSSTLTNGQCFNWMSHHDAKEWIGPLGNRVLSIKETPTTTLFRIVH 62
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
P+E + L + + +L L++ +S +D R +++A + G R+LRQ+PVEC+ F
Sbjct: 63 -GPTE-GSTEYLQQYFRLETALSPLYKEWSKADDRLSKIATAIPGVRILRQEPVECMFSF 120
Query: 155 LCSSNNNIARITKMV-DFLASLGSHLGNV-----EGFEFHEFPSLERLSLVSEVELRNAG 208
+CSSNNNI RITKM+ F G+ + +V EG + FP++E LS +E ELR G
Sbjct: 121 ICSSNNNIPRITKMLAAFREKYGTFMIDVPSQNGEGIALYTFPTIESLSEATEDELRGMG 180
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL-DLQEAIDALC 267
GYRS KYIT T D+L SGG E+L+ LR L D Q D L
Sbjct: 181 LGYRS-----------------KYITETRDLLVK--SGGEEYLIGLRSLTDAQVVQDELT 221
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG--VRLTPKLCSRVAEA 325
G+G KVA C+ALFSLDQ AIPVD HV IA+R P + G LTP + RV +
Sbjct: 222 KFSGIGRKVADCVALFSLDQDDAIPVDVHVQHIASRDYDPTVLGGAKSLTPTIYKRVGDL 281
Query: 326 F 326
F
Sbjct: 282 F 282
>gi|118138210|pdb|2I5W|A Chain A, Structure Of Hogg1 Crosslinked To Dna Sampling A Normal G
Adjacent To An Oxog
Length = 315
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|300508445|pdb|3KTU|A Chain A, Structure Of Human 8-Oxoguanine Glycosylase 1 Bound To
Fluorninated Oxog-Containing Dna
Length = 317
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|62738729|pdb|1YQK|A Chain A, Human 8-Oxoguanine Glycosylase Crosslinked With Guanine
Containing Dna
Length = 319
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|119390510|pdb|2NOF|A Chain A, Structure Of Q315f Human 8-oxoguanine Glycosylase Proximal
Crosslink To 8-oxoguanine Dna
Length = 325
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|348502689|ref|XP_003438900.1| PREDICTED: N-glycosylase/DNA lyase-like [Oreochromis niloticus]
Length = 515
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 170/338 (50%), Gaps = 69/338 (20%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS- 96
W ++ ++SEL L LT GQ+FRW++T +TG +G + +L + + YH++ +
Sbjct: 14 WRSMSCSKSELRLDLTLACGQSFRWRETSEGHWTGVMGGRVWTLTQT-DETLWYHVYNNQ 72
Query: 97 ----------------------------------PSEPAAKSA----------LLDFLNM 112
EP A S+ L D+ +
Sbjct: 73 DKQKEGSDRKRRVGVSVLDKNTAEKRFRCAPEKEEDEPLAVSSVQNRELDEEMLRDYFQL 132
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
+ +G+L + + A+DC F +A+ G R+LRQDP ECL F+C+SNN+I+RI MV+ L
Sbjct: 133 DVRMGDLHKEWGAADCHFKHIAEVFKGVRMLRQDPTECLFSFICTSNNHISRIQGMVERL 192
Query: 173 A-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+LGS L ++ +H+FP+L L+ S E LR+ GFGYR A
Sbjct: 193 CQALGSPLCQLDQTSYHDFPTLSGLADSSIEARLRDLGFGYR-----------------A 235
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
++I + + H G +WL LR + EA DAL TLPGVG KVA C+ L SLD+ A
Sbjct: 236 RFIQQSAKQILDNH--GPQWLEGLRSIPYLEARDALRTLPGVGTKVADCVCLMSLDKPEA 293
Query: 291 IPVDTHVWKIATR-YLLPELAGVR-LTPKLCSRVAEAF 326
+PVDTHVW+IA R Y G + +T KL + + F
Sbjct: 294 VPVDTHVWQIAKRDYKYAANNGQKSITEKLNRDIGDFF 331
>gi|119390507|pdb|2NOE|A Chain A, Structure Of Catalytically Inactive G42a Human
8-Oxoguanine Glycosylase Complexed To 8-Oxoguanine Dna
Length = 325
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 99/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+ Q+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSAQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|4210714|emb|CAA10351.1| 8-hydroxy-guanine glycosylase [Homo sapiens]
Length = 346
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 101/280 (36%), Positives = 149/280 (53%), Gaps = 26/280 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRW-KKTGPLQYTGPIGPHLISLKHLQNGDVC 90
+ P+ W + +SEL L L P+GQ+FRW ++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWWREQSPAHWSGVLADQVWTLTQTEEQLHC 76
Query: 91 YHIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQ 145
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQ
Sbjct: 77 TVYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQ 136
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVE 203
DP+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E
Sbjct: 137 DPIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAH 196
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI 263
LR G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 LRKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAH 238
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 KALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 278
>gi|350591420|ref|XP_001928227.3| PREDICTED: N-glycosylase/DNA lyase-like [Sus scrofa]
Length = 335
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 104/278 (37%), Positives = 152/278 (54%), Gaps = 29/278 (10%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C ++
Sbjct: 10 PALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADRVWTLTQTEEQLYCT-VY 68
Query: 95 TSPSEPAAKSALLD------FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
+ + L + + + ISL +L+ +S D F E+A+ G R+L+QDP+
Sbjct: 69 RGDTGRVGRPTLEELKVVRQYFQLDISLAQLYRHWSTVDPHFQEVAQKFQGVRLLQQDPI 128
Query: 149 ECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LR 205
ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ SEVE LR
Sbjct: 129 ECLFSFICSSNNNIARITGMVERLCKAFGPRLIQLDDVTYHGFPSLQALA-GSEVEAQLR 187
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG W+ LR +EA A
Sbjct: 188 KLGLGYR-----------------ARYVSASARAILEER-GGLPWMQQLRLAPYEEAHKA 229
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 230 LCTLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 267
>gi|444722826|gb|ELW63501.1| Peregrin [Tupaia chinensis]
Length = 1505
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 106/284 (37%), Positives = 157/284 (55%), Gaps = 28/284 (9%)
Query: 28 QTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG 87
+TL++N P+ W + +SEL L L +GQ+FRW++ P ++G + + +L +
Sbjct: 1174 RTLSSN-PALWASIPCPRSELRLDLVLASGQSFRWREQSPGHWSGVLADQVWTLTQTEEQ 1232
Query: 88 DVCYHIHTSPS---EPAAKS--ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV 142
C P A+ + + +SL +L+ ++++D F E+A+ G R+
Sbjct: 1233 LYCTVYRGDEGWVGRPTIDELEAVRKYFQLDVSLVQLYHHWASTDSHFQEVAQKFQGVRL 1292
Query: 143 LRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSE 201
LR DPVECL F+CSSNNNIARIT MV+ L + G L ++ ++ FPSL+ L+ E
Sbjct: 1293 LRLDPVECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYYGFPSLQALA-GPE 1351
Query: 202 VE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
VE LR G GYR A+Y++G+ + K GG WL L++
Sbjct: 1352 VEGHLRKLGLGYR-----------------ARYVSGSARAILEKQ-GGLAWLQQLQEAPY 1393
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
EA+ ALCTLPGVG KVA CI L +L++ A+PVD HVW+IA R
Sbjct: 1394 DEALKALCTLPGVGTKVADCICLMALNKPQAVPVDVHVWQIAQR 1437
>gi|327266336|ref|XP_003217962.1| PREDICTED: n-glycosylase/DNA lyase-like [Anolis carolinensis]
Length = 340
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 156/284 (54%), Gaps = 26/284 (9%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ + W ++EL L L GQTFRW +T P +TG + + +L + + Y +
Sbjct: 11 RAAPWRCFPCPRAELRLELVLCGGQTFRWSETSPGYWTGVLAGRVWTLTQSEE-QLWYTL 69
Query: 94 HTSPSEPAAKSA-----LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
H E ++ L D+ + I+L L++G+S D F E+A G RVLRQDPV
Sbjct: 70 HDEEKEEGSREEDGLQILQDYFQLHINLSALYQGWSYVDSHFREVAIKFPGVRVLRQDPV 129
Query: 149 ECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRN 206
ECL F+C+SNN+++RIT M+ L + G L ++ +H FPSL+ ++ +E LR+
Sbjct: 130 ECLFSFICTSNNHLSRITNMIQHLCQAFGRRLCQLDTKTYHAFPSLQAMAGADTEARLRD 189
Query: 207 AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDAL 266
GFGYR A++++ + + K GGA L LR + ++A AL
Sbjct: 190 LGFGYR-----------------ARFVSESARAVL-KTLGGAAGLQQLRTVPYEQARHAL 231
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
CTLPGVG KVA C+ L SLD+ A+PVDTH+W++A RY EL
Sbjct: 232 CTLPGVGVKVADCVCLMSLDKAEAVPVDTHIWQVAKRYYGQELG 275
>gi|213513948|ref|NP_001134019.1| N-glycosylase/DNA lyase [Salmo salar]
gi|209156182|gb|ACI34323.1| N-glycosylase/DNA lyase [Salmo salar]
Length = 401
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 168/336 (50%), Gaps = 66/336 (19%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L ++SEL L LT GQ+FRW++TG +TG +G + +L + + YH + SP
Sbjct: 14 WRSLACSRSELRLDLTLGCGQSFRWRETGDGHWTGVMGGRVWTLTQTDD-TLWYHTYNSP 72
Query: 98 S-----------------------------------------EPAAKSALL--DFLNMGI 114
+ +P K L D+ + +
Sbjct: 73 NTIGGDGRKRSAGSLLQGSGKRSKGVIEVKEEEEGEPVAVTPDPDRKEEELLNDYFQLKV 132
Query: 115 SLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA- 173
LG+L+ + A+D F +AK G R+LRQDP ECL F+C+SNN+I+RI MV+ L
Sbjct: 133 KLGDLYRDWGAADPHFNSIAKIFTGVRMLRQDPTECLFSFICTSNNHISRIQGMVERLCQ 192
Query: 174 SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
SLG+ L ++ +H+FPSL L+ S E LR+ GFGYR A++
Sbjct: 193 SLGTPLCQLDQTSYHDFPSLHALADNSVEARLRDLGFGYR-----------------ARF 235
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
+ + + H GG WL LR +A DAL TLPGVGPKVA C+ L SL++ +P
Sbjct: 236 LQQSARQILDSH-GGPHWLQGLRSAPYLQARDALRTLPGVGPKVADCVCLMSLEKACVVP 294
Query: 293 VDTHVWKIATR-YLLPELAGVR-LTPKLCSRVAEAF 326
VDTHVW+IA R Y G + LT K+ ++ + F
Sbjct: 295 VDTHVWQIAKRDYSCAAGNGQKSLTDKVHRQIGDFF 330
>gi|62738732|pdb|1YQL|A Chain A, Catalytically Inactive Hogg1 Crosslinked With 7-Deaza-8-
Azaguanine Containing Dna
gi|62738735|pdb|1YQM|A Chain A, Catalytically Inactive Human 8-Oxoguanine Glycosylase
Crosslinked To 7-Deazaguanine Containing Dna
gi|62738738|pdb|1YQR|A Chain A, Catalytically Inactive Human 8-Oxoguanine Glycosylase
Crosslinked To Oxog Containing Dna
Length = 319
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 9 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 68
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 69 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 128
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 129 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 188
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 189 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 230
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 231 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 269
>gi|326928008|ref|XP_003210177.1| PREDICTED: LOW QUALITY PROTEIN: n-glycosylase/DNA lyase-like
[Meleagris gallopavo]
Length = 328
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/271 (38%), Positives = 148/271 (54%), Gaps = 25/271 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W L +EL L L +GQ FRW+++ P +TG +G + +L+ Q D ++
Sbjct: 11 PALWRWLRCPPAELRLDLVLASGQAFRWRESSPGAWTGVLGDRVWTLR--QERDRLWYTV 68
Query: 95 TSPSEPAAKS--ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
P K+ L D+ + + L L+ + A+D F + A G RVLRQDPVECLL
Sbjct: 69 YGKERPGPKTDQILRDYFQLDVGLAALYRTWGAADPLFCQTATAFPGVRVLRQDPVECLL 128
Query: 153 QFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRNAGFG 210
F+C+SNN++ARIT M++ L + G L +++ FH FPS L+ +E +LR GFG
Sbjct: 129 SFICTSNNHVARITTMIERLCQAFGQQLCSLDEQPFHAFPSPAALAGSEAEAKLRALGFG 188
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR A++++GT + GAE L LR + EA LC LP
Sbjct: 189 YR-----------------ARFVSGTARAITEGM--GAEGLCQLRAVPYAEARRVLCALP 229
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
GVG KVA C+ L +LD+ A+PVDTHVW IA
Sbjct: 230 GVGAKVADCVCLMALDKAEAVPVDTHVWNIA 260
>gi|440896053|gb|ELR48090.1| N-glycosylase/DNA lyase [Bos grunniens mutus]
Length = 345
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 148/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 20 PALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEHLYCTVYR 79
Query: 95 TSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ A+ + + +SL L+ +S+ D F E+A+ G R+L+ DP+E
Sbjct: 80 GDKGRVGRPTLEELKAVQQYFQLDVSLAPLYHHWSSVDPHFQEVAQKFKGVRLLQLDPIE 139
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LRN
Sbjct: 140 CLFSFICSSNNNIARITGMVERLCQTFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRNL 199
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A++++ + + + GG WL LRK +EA ALC
Sbjct: 200 GLGYR-----------------ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALC 241
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 242 TLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 277
>gi|161760649|ref|NP_001073754.2| N-glycosylase/DNA lyase [Bos taurus]
gi|296475044|tpg|DAA17159.1| TPA: 8-oxoguanine DNA-glycosylase 1 [Bos taurus]
Length = 347
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 148/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 22 PALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEHLYCTVYR 81
Query: 95 TSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ A+ + + +SL L+ +S+ D F E+A+ G R+L+ DP+E
Sbjct: 82 GDKGRVGRPTLEELKAVQQYFQLDVSLAPLYHHWSSVDPHFQEVAQKFKGVRLLQLDPIE 141
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E +LRN
Sbjct: 142 CLFSFICSSNNNIARITGMVERLCQTFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRNL 201
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A++++ + + + GG WL LRK +EA ALC
Sbjct: 202 GLGYR-----------------ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALC 243
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 244 TLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 279
>gi|426249194|ref|XP_004018335.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Ovis aries]
Length = 352
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 148/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 20 PALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEHLYCTVYR 79
Query: 95 TSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ A+ + + +SL L+ +S+ D F E+A+ G R+L+ DP+E
Sbjct: 80 GDKGRVGRPTLEELKAVQQYFQLDVSLAPLYHHWSSVDPHFKEVAQKFKGVRLLQLDPIE 139
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT +V+ L + G L ++ +H FPSL+ LS E +LRN
Sbjct: 140 CLFSFICSSNNNIARITGIVERLCQTFGPRLIQLDDVTYHGFPSLQALSGPEVEAQLRNL 199
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A++++ + + + GG WL LRK +EA ALC
Sbjct: 200 GLGYR-----------------ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALC 241
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 242 TLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 277
>gi|196002011|ref|XP_002110873.1| hypothetical protein TRIADDRAFT_5178 [Trichoplax adhaerens]
gi|190586824|gb|EDV26877.1| hypothetical protein TRIADDRAFT_5178, partial [Trichoplax
adhaerens]
Length = 308
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 164/310 (52%), Gaps = 41/310 (13%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKT----------GPLQ--YTGPIGPHLISLKHLQ 85
W + L + EL+L T +GQ F W+K G Q + G I + +LK +
Sbjct: 3 WRVIKLPRGELNLSKTLQSGQQFTWRKIARENETATVDGDTQASWRGVIAEMVWTLKQDR 62
Query: 86 NGD-VCYHIH-----TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG 139
N + + Y IH T S + D+ + ++L +L+ +S +D FA++A + G
Sbjct: 63 NDENLLYMIHGPWKDTKSQYQTFDSIIKDYFQLDVNLRQLYAEWSKADSNFAKVATSMTG 122
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLGNVEGFEFHEFPSLERLSL 198
R+LRQDPVE L F+CSSNNNI+RIT MV + G L NV+G ++++FP + L+
Sbjct: 123 IRILRQDPVENLFSFICSSNNNISRITGMVGNLCKRYGRKLLNVDGIDYYQFPEIAALAQ 182
Query: 199 -VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
+E +R+ GFGYR AKYI + +L K G WL SLR+
Sbjct: 183 HDAEKVMRDMGFGYR-----------------AKYINESAKILNKK---GVAWLYSLRQT 222
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTP 316
+E L L GVG KVA CI L SLD+ +PVDTHV++I++RY +P L + LT
Sbjct: 223 PYKECQQQLRQLYGVGAKVADCICLMSLDKPSVVPVDTHVFQISSRYYIPGLRKQKSLTG 282
Query: 317 KLCSRVAEAF 326
K + ++E F
Sbjct: 283 KAYTEISEYF 292
>gi|298713786|emb|CBJ27158.1| endonuclease III/similar to 8-oxoguanine DNA glycosylase isoform 1b
[Ectocarpus siliculosus]
Length = 425
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 123/352 (34%), Positives = 170/352 (48%), Gaps = 55/352 (15%)
Query: 11 LKRPRLT---PQPPPTPPNPQTLTTNKP--SKWTPLNLTQSELSLPLTFPTGQTFRWKKT 65
+KR R + Q P P T +P W L + EL T GQ F W++
Sbjct: 6 VKRRRASTSVKQEPAQDAAPLTANIKQPLDGLWHDLCVPPEELRPDSTLTVGQCFNWRQA 65
Query: 66 GPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA------LLDFLNMGISLGEL 119
G + G +G +I+++ + + ++ AK A L ++ + L L
Sbjct: 66 GADCWVGVLGREVIAIRQTPCTTLFRSLSAVTTDGDAKVAAALATTLREYFFLRTPLAPL 125
Query: 120 WEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV--------DF 171
++ +S D R A +A ++G RV+RQDPVEC+ F+CSSNNNI RIT M+ D
Sbjct: 126 YQRWSEGDPRMAAVACSISGVRVVRQDPVECIFSFICSSNNNIPRITGMLGKLRRTYGDL 185
Query: 172 LASLG-------SHLGNVEG--------FEFHEFPSLERLSL-VSEVELRNAGFGYRSAP 215
L ++G LGN+ E H FP+++ L+ +E +LR GFGYR
Sbjct: 186 LLNVGKGGLAATGALGNITECEEWAKLPLELHSFPTVDALATRATEADLRAMGFGYR--- 242
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
AKYI + ++ S +GGA W L +R D E L TL GVGPK
Sbjct: 243 --------------AKYIVDSAKLMHS--NGGAGWALDMRNKDRDEVRSQLVTLCGVGPK 286
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
VA CIALFSLDQ IPVD HVW+IA R P L + LTP + +RV F
Sbjct: 287 VADCIALFSLDQASTIPVDVHVWRIACRDYDPTLVECKSLTPTVYARVGNLF 338
>gi|2351708|gb|AAB68616.1| 8-hydroxyguanine glycosylase [Mus musculus]
Length = 345
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/289 (35%), Positives = 154/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R + A ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEXAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|426249192|ref|XP_004018334.1| PREDICTED: N-glycosylase/DNA lyase isoform 1 [Ovis aries]
Length = 345
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 148/276 (53%), Gaps = 25/276 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 20 PALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEHLYCTVYR 79
Query: 95 TSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ A+ + + +SL L+ +S+ D F E+A+ G R+L+ DP+E
Sbjct: 80 GDKGRVGRPTLEELKAVQQYFQLDVSLAPLYHHWSSVDPHFKEVAQKFKGVRLLQLDPIE 139
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT +V+ L + G L ++ +H FPSL+ LS E +LRN
Sbjct: 140 CLFSFICSSNNNIARITGIVERLCQTFGPRLIQLDDVTYHGFPSLQALSGPEVEAQLRNL 199
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A++++ + + + GG WL LRK +EA ALC
Sbjct: 200 GLGYR-----------------ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALC 241
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
TLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 242 TLPGVGTKVADCICLMALDKPQAVPVDVHVWQIAQR 277
>gi|449017433|dbj|BAM80835.1| probable 8-oxoguanine-DNA-glycosylase [Cyanidioschyzon merolae
strain 10D]
Length = 402
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 159/304 (52%), Gaps = 30/304 (9%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S W + + EL LPL GQTFRWK+ + G +L + Y
Sbjct: 60 SPWYAIAVPVEELDLPLCLEAGQTFRWKQRENGLWCGVHAELAWNLYRPDATTLRYCYAP 119
Query: 96 SPSEPA-AKSALLDFLNMGI---SLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
+ +P A+ L DFL + SL L+E +S+SD F +A + G RVLRQDPVECL
Sbjct: 120 ATVDPVRARLVLQDFLRLSDDAPSLVSLYEDWSSSDPHFRSVAPFFQGTRVLRQDPVECL 179
Query: 152 LQFLCSSNNNIARITKMVDFLA-----SLGSHLGNVEGFEFHEFPSLERLSLVSEV-ELR 205
FLCSSNN+I RI+ MV+FLA ++GS + E++ FPS+ L+ + V ELR
Sbjct: 180 FSFLCSSNNHIRRISGMVEFLARRYGRAIGSVPAGLADDEYYAFPSVAELAAQASVAELR 239
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR-KLDLQEAID 264
GFGYR A+++ T L+S+ GG +L LR + + E I
Sbjct: 240 ANGFGYR-----------------AEFVVNTAKELESR--GGNSYLRGLRSRENKDEVIR 280
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAE 324
L TLPGVG KVA+CIAL SLD H +PVDTHVW++ R+ P+ LT K +
Sbjct: 281 ELVTLPGVGRKVASCIALLSLDCAHEVPVDTHVWQLTLRHYAPDWRTKSLTEKRHEEIGA 340
Query: 325 AFCE 328
F E
Sbjct: 341 LFRE 344
>gi|13540701|ref|NP_110497.1| N-glycosylase/DNA lyase [Rattus norvegicus]
gi|10720147|sp|O70249.1|OGG1_RAT RecName: Full=N-glycosylase/DNA lyase; Includes: RecName:
Full=8-oxoguanine DNA glycosylase; Includes: RecName:
Full=DNA-(apurinic or apyrimidinic site) lyase; Short=AP
lyase
gi|3004865|gb|AAC77525.1| 8-oxoguanine-DNA-glycosylase [Rattus norvegicus]
gi|149036902|gb|EDL91520.1| 8-oxoguanine DNA-glycosylase 1, isoform CRA_a [Rattus norvegicus]
Length = 345
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 146/279 (52%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L ++ C
Sbjct: 17 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEDQLYCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
+ + L + + +SL +L+ +++ D F +A+ G R+LRQD
Sbjct: 77 VYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDSHFQSVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L L+ E L
Sbjct: 137 PTECLFSFICSSNNNIARITGMVERLCQAFGPRLVQLDDVTYHGFPNLHALAGPEVETHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y+ + + + GG WL LR +EA
Sbjct: 197 RKLGLGYR-----------------ARYVCASAKAILEEQ-GGPAWLQQLRVASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIAHR 277
>gi|344276462|ref|XP_003410027.1| PREDICTED: N-glycosylase/DNA lyase [Loxodonta africana]
Length = 378
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 98/274 (35%), Positives = 148/274 (54%), Gaps = 25/274 (9%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 53 PALWASIPCLRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLYCTVYR 112
Query: 95 TSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ + + A+ + + +S+ +L+ +S+ D F E+A+ G R+LRQDP+E
Sbjct: 113 GNKGQVGKPTQEELQAVHQYFQLDVSVAQLYYHWSSVDPHFQEVAQKFQGVRLLRQDPIE 172
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 173 CLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPKVEAHLRKL 232
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A+Y++ + + + GG WL LR+ ++A ALC
Sbjct: 233 GLGYR-----------------ARYVSASARAILEEQ-GGPSWLQQLREAPYEDAHKALC 274
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
TLPGVG KVA CI L +LD+ A+PVD H+W IA
Sbjct: 275 TLPGVGTKVADCICLMALDKPQAVPVDIHMWHIA 308
>gi|119390516|pdb|2NOI|A Chain A, Structure Of G42a Human 8-Oxoguanine Glycosylase
Crosslinked To Undamaged G-Containing Dna
Length = 325
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 147/279 (52%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+ Q+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSAQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD H+W IA R
Sbjct: 237 ALCILPGVGTQVADCICLMALDKPQAVPVDVHMWHIAQR 275
>gi|395824506|ref|XP_003785504.1| PREDICTED: N-glycosylase/DNA lyase [Otolemur garnettii]
Length = 335
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 148/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + SEL L L +GQ+FRW++ P ++G + + +L + C
Sbjct: 7 ASTPALWASIPCPLSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLYCT 66
Query: 92 HIHTSPS---EPAAKS--ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
P + A+ ++ + +SL +L+ + + D F E+A+ G R+LRQD
Sbjct: 67 VYRGDKGWVGRPTLEELEAVHEYFQLDVSLAQLYRHWGSVDSHFKEVARKYQGVRLLRQD 126
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNI RIT MV+ L + G L ++ ++ FPSL+ L+ E L
Sbjct: 127 PIECLFSFICSSNNNITRITGMVERLCQAFGPRLIQLDDVTYYGFPSLQALAGPEVEAHL 186
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y+ + + + GG WL LR++ +EA
Sbjct: 187 RKLGLGYR-----------------ARYVNASAQAILEEQ-GGQAWLQQLREIPYEEAHK 228
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC+LPGVG KVA C+ L +LD+ A+PVD HVW+IA R
Sbjct: 229 ALCSLPGVGTKVADCVCLMALDKPQAVPVDVHVWQIAQR 267
>gi|119390503|pdb|2NOB|A Chain A, Structure Of Catalytically Inactive H270a Human 8-
Oxoguanine Glycosylase Crosslinked To 8-Oxoguanine Dna
Length = 325
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 147/279 (52%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 15 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 74
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 75 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 134
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSS NNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 135 PIECLFSFICSSCNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 194
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 195 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 236
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG +VA CI L +LD+ A+PVD +W IA R
Sbjct: 237 ALCILPGVGTQVADCICLMALDKPQAVPVDVAMWHIAQR 275
>gi|432865720|ref|XP_004070580.1| PREDICTED: N-glycosylase/DNA lyase-like [Oryzias latipes]
Length = 395
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 163/324 (50%), Gaps = 55/324 (16%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L SEL L LT GQ+FRWK+T +TG +G + +L + + YH++T+
Sbjct: 15 WRSLPCLTSELRLDLTLACGQSFRWKETAEGHWTGVMGGRVWTLTQTDD-RLWYHVYTNQ 73
Query: 98 SEPAAKS-------------------------------ALLDFLNMGISLGELWEGFSAS 126
+ + L D+ + + + +L + A+
Sbjct: 74 KKRGGRENRAEEKPEGVPNVEEEAPAVTDEQQSTMEAEMLRDYFQLHVKMEDLCRHWGAA 133
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F A G R+LRQDP ECL F+C+SNN+I+RI MV+ L +LG+ L ++
Sbjct: 134 DRHFKNTADVFTGVRMLRQDPTECLFSFICTSNNHISRIQGMVERLCQALGAPLCQLDQT 193
Query: 186 EFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
+H+FP+L L+ S E LR+ GFGYR A+Y+ + ++ H
Sbjct: 194 PYHDFPTLSALADSSVEARLRDLGFGYR-----------------ARYLQQSAKMILDSH 236
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR- 303
G++W+ LR++ EA +AL TLPGVG KVA C+ L SLD+ A+PVDTHVW+IA R
Sbjct: 237 --GSQWIEGLRRVPYLEAREALRTLPGVGAKVADCVCLMSLDKPEAVPVDTHVWQIAKRD 294
Query: 304 YLLPELAGVR-LTPKLCSRVAEAF 326
Y G + LT KL + + F
Sbjct: 295 YKYAADNGQKSLTDKLYKDIGDFF 318
>gi|343785574|gb|AEM59540.1| 8-oxoguanine DNA glycosylase [Tigriopus japonicus]
Length = 343
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 110/297 (37%), Positives = 156/297 (52%), Gaps = 27/297 (9%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWK--KTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
W + +SEL L + GQ+FRWK P Q+ G +G + L Q+
Sbjct: 10 WKLIPCLRSELRLDIVLKCGQSFRWKSFNDQPDQWIGVLGSRVWLLSQDQDHLKYKTFPK 69
Query: 96 SPSEPAAKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
+ + A A L D+ + + L L++ ++ D F +++ +G R+LRQ PVE L F
Sbjct: 70 AEEDSGALDAFLADYFQLKVKLQPLYKVWAKEDPVFEKISSKFSGVRMLRQHPVENLFSF 129
Query: 155 LCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS---LVSEVELRNAGFG 210
+CSSNNNI RI+ MV+ L G+ + EG +H FPS+E+L+ + E +LR GFG
Sbjct: 130 ICSSNNNIQRISSMVENLCIHFGTEIWKHEGVSYHTFPSVEQLAKNPMNVEKKLRGLGFG 189
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR+A YI + Q +GG E+LL LR L QEA D L L
Sbjct: 190 YRAA-----------------YIAKSAK--QIAENGGEEYLLKLRTLPYQEARDELLKLT 230
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
G+GPKVA C+ L SLDQ AIPVDTH+++IA LP L G + +T K + + F
Sbjct: 231 GIGPKVADCVLLMSLDQTAAIPVDTHMFQIAANKYLPHLKGYKSVTDKAYKEIGDHF 287
>gi|320089681|pdb|2XHI|A Chain A, Separation-Of-Function Mutants Unravel The Dual Reaction
Mode Of Human 8-Oxoguanine Dna Glycosylase
Length = 360
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 147/279 (52%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 32 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 91
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 92 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 151
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 152 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 211
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 212 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 253
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG VA I L +LD+ A+PV+ H+W IA R
Sbjct: 254 ALCILPGVGTCVADKICLMALDKPQAVPVNVHMWHIAQR 292
>gi|2197093|gb|AAB61289.1| 8-oxoguanine DNA glycosylase 1 [Mus musculus]
Length = 345
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 98/266 (36%), Positives = 142/266 (53%), Gaps = 25/266 (9%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+SEL L L +GQ+FRWK+ P ++G + + +L ++ C S+ + +
Sbjct: 30 RSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTLTQTEDQLYCTVYRGDDSQVSRPT 89
Query: 105 -----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
L + + +SL +L+ +++ D F +A+ G R+LRQDP ECL F+CSSN
Sbjct: 90 LEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQKFQGVRLLRQDPTECLFSFICSSN 149
Query: 160 NNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQS 217
NNIARIT MV+ L + G L ++ +H FP+L L+ +E LR G GYR
Sbjct: 150 NNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHALAGPEAETHLRKLGLGYR----- 204
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A+Y+ + + + GG WL LR +EA LCTLPGVG KVA
Sbjct: 205 ------------ARYVRASAKAILEEQ-GGPAWLQQLRVAPYEEAHKGLCTLPGVGAKVA 251
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
CI L +LD+ A+PVD HVW+IA R
Sbjct: 252 DCICLMALDKPQAVPVDVHVWQIAHR 277
>gi|321474741|gb|EFX85705.1| hypothetical protein DAPPUDRAFT_313422 [Daphnia pulex]
Length = 347
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 161/304 (52%), Gaps = 45/304 (14%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH---------IHTS 96
+ELSL ++ GQ+FRWKK+ ++ G +G +L + + C + I +
Sbjct: 8 AELSLDISLNCGQSFRWKKSLTGEWIGVLGGYLWLMTQ---DETCINYRVVQPSVKIEPA 64
Query: 97 PSEPAAKSALL-----------DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQ 145
S A K+ LL D+ + I L L+ +S +D F ++AK AG R+LRQ
Sbjct: 65 SSPSAVKTELLKECIDYEHILKDYFQLNIDLESLYRQWSLADPNFNQVAKNFAGVRMLRQ 124
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVE 203
DPVE L F+CSSNNNI RIT MV+ L + G+ L E ++ FP+LE L+ E
Sbjct: 125 DPVENLFAFICSSNNNIQRITGMVEKLCENYGNQLLTHEDVAYYSFPALESLAADKVESR 184
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI 263
LR GFGYR AK+I + + +GG +WL++LR L EA
Sbjct: 185 LRTLGFGYR-----------------AKFIQQSAAKIV--ENGGRDWLMNLRNLSYPEAK 225
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRV 322
AL TLPG+G KVA CI L SLD AIPVDTHV++IA +P L+ + +T K+ +
Sbjct: 226 TALMTLPGIGAKVADCICLMSLDHSVAIPVDTHVFQIAKASYVPHLSQTKSVTDKVYKEI 285
Query: 323 AEAF 326
+ F
Sbjct: 286 SAHF 289
>gi|181339906|ref|NP_001116780.1| N-glycosylase/DNA lyase [Danio rerio]
Length = 391
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 158/311 (50%), Gaps = 65/311 (20%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH--- 94
W L+ +SEL L LT GQTFRW++T +TG + + +L ++ + Y+++
Sbjct: 14 WRSLSCPRSELRLDLTLGCGQTFRWRETADSHWTGVMRGKVWTLTQTEDT-LWYYVYSHN 72
Query: 95 --------------------------------------TSPSEPAAKSA--LLDFLNMGI 114
T SEP K L D+ + +
Sbjct: 73 NNPGPENRQKGKVEEEEQLLGKMSKRTTCIKQEDEGLCTVLSEPDKKEEELLRDYFQLDV 132
Query: 115 SLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA- 173
LG+L++ +S D F A G R+LRQDPVECL F+CSSNN+I+RI MV+ L
Sbjct: 133 KLGDLYKNWSTVDPHFKHTANIFTGVRLLRQDPVECLFSFICSSNNHISRIQGMVERLCQ 192
Query: 174 SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+LG+ L ++ +H+FPSL+ L+ E+ LR+ GFGYR A++
Sbjct: 193 TLGTLLCKLDDVAYHDFPSLQDLTDPCVEMRLRDLGFGYR-----------------ARF 235
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
+ + ++ + H +WL SLR +A DAL TLPGVG KVA C+ L SLD+ A+P
Sbjct: 236 LQQSSQMIMNSHH--PDWLQSLRSTPYLQARDALRTLPGVGLKVADCVCLMSLDKFEALP 293
Query: 293 VDTHVWKIATR 303
VDTHVW+IA R
Sbjct: 294 VDTHVWQIAKR 304
>gi|112418846|gb|AAI22184.1| LOC793885 protein [Danio rerio]
Length = 388
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/312 (35%), Positives = 157/312 (50%), Gaps = 67/312 (21%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI---- 93
W L+ +SEL L LT GQTFRW++T +TG + + +L Q GD ++
Sbjct: 14 WRSLSCPRSELRLDLTLGCGQTFRWRETADSHWTGVMRGKVWTLT--QTGDTLWYYVYSH 71
Query: 94 HTSP--------------------------------------SEPAAKSA--LLDFLNMG 113
H +P SEP K L D+ +
Sbjct: 72 HNNPGPENRQKGKVEEEEQLLGKMSKRMTGIKQEDEGLCTVVSEPDKKEEELLRDYFQLD 131
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ LG+L++ +S D F A G R+LRQDPVECL F+CSSNN+I+RI MV+ L
Sbjct: 132 VKLGDLYKNWSTVDPHFKHTANIFTGVRLLRQDPVECLFSFICSSNNHISRIQGMVERLC 191
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
+LG+ L ++ +H+FPSL+ L+ E+ LR+ GFGYR A+
Sbjct: 192 QTLGTLLCKLDDVAYHDFPSLQDLTDPCVEMRLRDLGFGYR-----------------AR 234
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
++ + ++ + H +WL SLR +A DAL TLPGVG KVA C+ L SLD+ A+
Sbjct: 235 FLQQSSQMIMNSHH--PDWLQSLRSTPYLQARDALRTLPGVGLKVADCVCLMSLDKFEAL 292
Query: 292 PVDTHVWKIATR 303
PV THVW+IA R
Sbjct: 293 PVGTHVWQIAKR 304
>gi|302833822|ref|XP_002948474.1| hypothetical protein VOLCADRAFT_103937 [Volvox carteri f.
nagariensis]
gi|300266161|gb|EFJ50349.1| hypothetical protein VOLCADRAFT_103937 [Volvox carteri f.
nagariensis]
Length = 1704
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 141/234 (60%), Gaps = 15/234 (6%)
Query: 98 SEPAAKSALL-DFLNMGIS-LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
++ A +A+L D+ N+ + L L G++AS RF E++ + GAR+LRQDP+ECL QF+
Sbjct: 39 ADAAEDAAVLRDYFNLDTTRLCHLAIGWAASCSRFREVSPHFPGARMLRQDPLECLFQFI 98
Query: 156 CSSNNNIARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
CSSNN+I+RI MV+ L S+ G+ L ++ S + L A G+
Sbjct: 99 CSSNNHISRIHGMVERLCSMYGTPLEPTTAPATPGAAAVAAASGSGALRLLLANGGH--G 156
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
P S +A++I GT L SK GG WLL LR + L EA++AL LPG+GP
Sbjct: 157 P----------SCYRARFIVGTTRALLSKPGGGRAWLLGLRDVALGEAVEALTDLPGIGP 206
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
KVAACI LFSLD+H AIPVDTHVW+IA RY P+LAG LT K+ V + F E
Sbjct: 207 KVAACICLFSLDKHEAIPVDTHVWQIAVRYYCPQLAGKSLTKKVHGEVQQVFVE 260
>gi|66911165|gb|AAH97662.1| LOC733253 protein [Xenopus laevis]
Length = 414
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 136/228 (59%), Gaps = 23/228 (10%)
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+ L D+ + +SL L++ + SD F +A+ G RVLRQDP ECL F+C+SNNNI
Sbjct: 158 QEVLEDYFQLNVSLRTLYQQWERSDPNFQRVAQDFPGIRVLRQDPTECLFSFICTSNNNI 217
Query: 163 ARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRNAGFGYRSAPQSSLL 220
+RIT M++ +SLG L ++ +H FP+LE+L+ +E +LR+ GFGYR
Sbjct: 218 SRITGMIERVCSSLGQRLCQLDSEVYHTFPTLEKLAANGTEAKLRDLGFGYR-------- 269
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
AK+++ + + SKH G +WL SLR + +EA ALC+LPGVG KVA C+
Sbjct: 270 ---------AKFVSESARTILSKH--GPDWLESLRLVPYEEAKTALCSLPGVGAKVADCV 318
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR--LTPKLCSRVAEAF 326
L +LD+ A+PVDTHV +IA R +P+L G LT ++ + F
Sbjct: 319 CLMALDKSEAVPVDTHVLQIAKRDYVPQLGGCNKSLTDRVYRETGDFF 366
>gi|410926081|ref|XP_003976507.1| PREDICTED: uncharacterized protein LOC101072813 [Takifugu rubripes]
Length = 1048
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 156/313 (49%), Gaps = 67/313 (21%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS- 96
W L +SEL L LT GQ+FRW+KT +TG + + +L + + YH++ +
Sbjct: 14 WRSLPCAKSELRLDLTLECGQSFRWRKTAEGHWTGVMKGRVWTLTQTDD-TLWYHVYRTQ 72
Query: 97 -------------------PSEPAA-------------------------KSALLDFLNM 112
+EP + L D+ +
Sbjct: 73 ERQRKGNDRKRKTIVSVQMENEPKKKFKREVKEEEEEPLVVTLVQEAGEEQQMLRDYFQL 132
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
++LGEL+ + A+D F +A G R+LRQDPVECL F+C+SNN+I+RI MV+ L
Sbjct: 133 DVNLGELYNKWGAADPHFRSVANVFTGVRMLRQDPVECLFSFICTSNNHISRIQGMVERL 192
Query: 173 A-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+LG+ L ++ ++ FP+L LS S E +LR+ GFGYR A
Sbjct: 193 CQTLGTPLCELDQDSYYSFPTLAALSGKSVEAQLRDLGFGYR-----------------A 235
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
+++ + + H G +WL LR + +A DAL TLPGVG KVA C+ L SLD+ A
Sbjct: 236 RFLQQSAKQILDTH--GMQWLDGLRTVPYVQARDALRTLPGVGTKVADCVCLMSLDKAEA 293
Query: 291 IPVDTHVWKIATR 303
+PVDTHVW+IA R
Sbjct: 294 VPVDTHVWQIAKR 306
>gi|390337049|ref|XP_791749.3| PREDICTED: N-glycosylase/DNA lyase-like isoform 2
[Strongylocentrotus purpuratus]
Length = 393
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 165/351 (47%), Gaps = 78/351 (22%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGP------------------------------ 67
W + SEL L +T GQ+FRWK+ P
Sbjct: 5 WRTIPCHISELRLDITLACGQSFRWKEIEPGTWRSVLAGSVWTLKQTDTHISYQIHNPST 64
Query: 68 ---LQYTGPIGPHLISLKHLQN-----------------------GDVCYHIHTSPSEPA 101
++ G G L H + GD C + + +
Sbjct: 65 HQEVKACGKSGDELRVQSHKKKKRGAAPNQDCESKKRQKMTLNGGGDECGVVKGAEEDDK 124
Query: 102 AKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
+ SA+L D+L + ++LGEL+ + +D F+++++ G R+LRQDPVE L+ F+CSSNN
Sbjct: 125 SSSAILRDYLQLEVNLGELYSQWQKADKNFSKVSESFPGVRILRQDPVENLVSFICSSNN 184
Query: 161 NIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSS 218
NIARIT MV+ L G + ++G +H FPSL LS E LR GFGYR
Sbjct: 185 NIARITGMVEKLCQQYGDEVAVLDGVSWHSFPSLVALSAKGVEESLRKMGFGYR------ 238
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
AK+I+ + ++ ++ GG WL LR QEA L LPGVG KVA
Sbjct: 239 -----------AKFISQSARMV-TEDFGGEAWLRGLRDQPYQEAHPELMKLPGVGAKVAD 286
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
C+ L SLD+ AIPVDTHVW+IA+R + L + LT + ++ + F E
Sbjct: 287 CVCLMSLDKRGAIPVDTHVWQIASRDYIHTLQKTKTLTDRTYRQIGDFFRE 337
>gi|47223186|emb|CAG11321.1| unnamed protein product [Tetraodon nigroviridis]
Length = 385
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 156/312 (50%), Gaps = 66/312 (21%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS- 96
W L T+SEL L LT GQ+FRW++T +TG + + +L ++ + +H++ +
Sbjct: 14 WRSLPCTKSELRLDLTLECGQSFRWRETAEGHWTGVMKGRVWTLTQTEDT-LWFHVYENQ 72
Query: 97 ------------------PSEP-------------------------AAKSALLDFLNMG 113
EP K L D+ +
Sbjct: 73 ERQSKTQDRKRRASVSVQEDEPRKKFQSQVKKEDGEPLDVPLLQDYREEKEMLRDYFQLD 132
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
++LG+L++ + +D F +A G R+LRQDP ECL F+C+SNN+I+RI MV+ L
Sbjct: 133 VNLGDLYKKWGDADPHFRSIANVFTGVRMLRQDPTECLFSFICTSNNHISRIQGMVERLC 192
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
+LG+ L ++ ++ +PSL LS S E LR+ GFGYR A+
Sbjct: 193 QALGTPLCKLDQTTYYSYPSLSALSGSSVEARLRDLGFGYR-----------------AR 235
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
++ + +Q H G +WL LR + +A +AL TLPGVG KVA C+ L SLD+ A+
Sbjct: 236 FLQQSAKQIQDAH--GMQWLDGLRTVPYVQAREALRTLPGVGTKVADCVCLMSLDKAEAV 293
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 294 PVDTHVWQIAKR 305
>gi|291230163|ref|XP_002735040.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 373
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 168/340 (49%), Gaps = 66/340 (19%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN--------- 86
S+W + ++L L +T GQ+FRWK+ P ++G + + +LK Q+
Sbjct: 4 SRWRSIPCKVTDLRLDITLLCGQSFRWKEKQPGLWSGVLKGKVWTLKQTQDELLYQVCER 63
Query: 87 --------------GDVC--------YHIHT--SPSEPAAKSA-----------LLDFLN 111
++C Y ++T + E K+ L D+ +
Sbjct: 64 EKTDLHSRKRSSTDAEICKSKRAKLQYGLYTNIATDEKVTKNEISDNSLIHDNILKDYFH 123
Query: 112 MGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDF 171
+ + L +L++ +SA D F +A G R+L+QDPVE L F+CSSNNNI RI+ MV+
Sbjct: 124 LNVDLQDLYQKWSAVDDNFKAVAANFTGIRLLKQDPVENLFSFICSSNNNITRISGMVER 183
Query: 172 LA-SLGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQ 229
+ G + VEG ++ FPS+ L S +E +LR GFGYR
Sbjct: 184 MCQKYGEKICEVEGLPYYSFPSVSALASKQTEQQLRELGFGYR----------------- 226
Query: 230 AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
AKYI+ + ++ WL SLR + +EA L L GVG KVA C+ L SL++++
Sbjct: 227 AKYISQAAMHIMENYN--ENWLASLRDVSYKEAHVQLMKLAGVGAKVADCVCLMSLNKYN 284
Query: 290 AIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
AIPVDTHVW+IATR + L + LT K+ +V + F E
Sbjct: 285 AIPVDTHVWQIATRDYMKSLRKTKSLTDKVYKQVGDFFRE 324
>gi|390337051|ref|XP_003724477.1| PREDICTED: N-glycosylase/DNA lyase-like isoform 1
[Strongylocentrotus purpuratus]
Length = 393
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 164/351 (46%), Gaps = 78/351 (22%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGP------------------------------ 67
W + SEL L +T GQ+FRWK+ P
Sbjct: 5 WRTIPCHISELRLDITLACGQSFRWKEIEPGTWRSVLAGSIWTLRQTDTHISYQIHNPST 64
Query: 68 ---LQYTGPIGPHLISLKHLQN-----------------------GDVCYHIHTSPSEPA 101
++ G G L H + G+ C + + +
Sbjct: 65 HPEVKTCGKSGDELRVQSHKKKQRGAAPNQDCKSKKRQKMTLSGGGNECGVVKGAEDDEK 124
Query: 102 AKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
SA+L D+L + ++LGEL+ + +D F+++++ G R+LRQDPVE L+ F+CSSNN
Sbjct: 125 FSSAILRDYLQLDVNLGELYTQWQKADKNFSKVSESFPGVRILRQDPVENLVSFICSSNN 184
Query: 161 NIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSS 218
NIARIT MV+ L G + ++G +H FPSL LS E LR GFGYR
Sbjct: 185 NIARITGMVEKLCQQYGDEVAVLDGVSWHSFPSLVALSAKGVEESLRKMGFGYR------ 238
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
AK+I+ + ++ ++ GG WL LR QEA L LPGVG KVA
Sbjct: 239 -----------AKFISQSARMV-TEDFGGEAWLRGLRDQPYQEAHPELMKLPGVGAKVAD 286
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
C+ L SLD+ AIPVDTHVW+IA+R + L + LT + ++ + F E
Sbjct: 287 CVCLMSLDKRGAIPVDTHVWQIASRDYIHTLQKTKTLTDRTYRQIGDFFRE 337
>gi|301624863|ref|XP_002941717.1| PREDICTED: n-glycosylase/DNA lyase [Xenopus (Silurana) tropicalis]
Length = 390
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 127/207 (61%), Gaps = 21/207 (10%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L D+ + ISL L++ + +SD F +A+ G R+LRQDP ECL F+C+SNNNI+RI
Sbjct: 137 LEDYFQLNISLRTLYQHWESSDPNFQRVAQDFPGIRILRQDPTECLFSFICTSNNNISRI 196
Query: 166 TKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRNAGFGYRSAPQSSLLFSV 223
T M++ SLG L ++ +H FP+L+ L+ +E +LR+ GFGYR
Sbjct: 197 TGMIERVCCSLGQRLCQLDSDVYHTFPTLQELAAEGTEAKLRDLGFGYR----------- 245
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
A++++ + + SKH +WL SLR + +EA ALC+LPGVG KVA C+ L
Sbjct: 246 ------ARFVSESARTILSKHC--PDWLESLRLVPYEEAKTALCSLPGVGAKVADCVCLM 297
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPELA 310
+LD+ A+PVDTHVW++A R LP+L
Sbjct: 298 ALDKPEAVPVDTHVWQVAKRDYLPQLG 324
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 28 QTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG 87
+T ++ P+ W + SEL L GQTFRWK+ P +TG + + +L +
Sbjct: 4 RTSVSSSPACWRSIPCQHSELRLDYVLACGQTFRWKEFSPGYWTGVLKGRVWTLTQT-DE 62
Query: 88 DVCYHIHTSPSEPAAKS 104
+ Y ++T+ PA S
Sbjct: 63 HIWYTVYTNDQRPAQDS 79
>gi|213403202|ref|XP_002172373.1| N-glycosylase/DNA lyase [Schizosaccharomyces japonicus yFS275]
gi|212000420|gb|EEB06080.1| N-glycosylase/DNA lyase [Schizosaccharomyces japonicus yFS275]
Length = 386
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/276 (38%), Positives = 151/276 (54%), Gaps = 31/276 (11%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ--YTGPIGPHLISLKHLQNGDV---CYH 92
W + L EL++ +T GQ FRW + Y + ++ LK DV C+
Sbjct: 2 WKKIPLVPEELNISVTLTNGQAFRWVCVDEKEKIYAASLWSCVVLLKQKHETDVVEYCFK 61
Query: 93 I--HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVEC 150
+ P+E ++ L +LN+ + L++ + D FA+ K L G R+L+QDP E
Sbjct: 62 YPENEVPNESKSRKLLFRYLNVDVDTRNLFQLWEEKDPTFAQHCKNLYGVRILKQDPWEN 121
Query: 151 LLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEV--ELRNA 207
L+ F+CSSNNNI RI++M+ L +S G+ LG G +F+ FPSL +L+ +SE+ ELRN
Sbjct: 122 LISFICSSNNNIPRISQMLQKLCSSYGTWLGYWFGHDFYSFPSLSKLASISELERELRNL 181
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
GFGYR AK+I +V L SK+ G E +L L + D LC
Sbjct: 182 GFGYR-----------------AKFIHKSVLYL-SKNKGFLE---NLSSLSYEACRDELC 220
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LPGVGPKVA C+ L SL++ +IPVDTHV ++A R
Sbjct: 221 LLPGVGPKVADCVCLMSLNKFESIPVDTHVLQLAVR 256
>gi|294462981|gb|ADE77029.1| unknown [Picea sitchensis]
Length = 255
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/151 (51%), Positives = 103/151 (68%), Gaps = 4/151 (2%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL---QNGDVCYHIH 94
W L + EL L +TFPTGQTFRWKKTGP +YTG +G +L +L+ + + +V Y IH
Sbjct: 104 WRSLGVGIKELKLDITFPTGQTFRWKKTGPSEYTGAVGTYLFTLRQVTIDDDDEVEYFIH 163
Query: 95 T-SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
S A+ AL D+LN+ ISL ++W+GF+ +D RFA LA Y+ GAR+LRQDPVECL Q
Sbjct: 164 NPSGFSQEAELALRDYLNLSISLTDMWKGFAEADSRFAALAPYMGGARLLRQDPVECLFQ 223
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEG 184
F+CSSNNNI RIT+MV + + G+ G
Sbjct: 224 FICSSNNNIQRITQMVGVFVRIWTLFGDCRG 254
>gi|403341550|gb|EJY70081.1| Endonuclease III/similar to 8-oxoguanine DNA glycosylase isoform 1b
[Oxytricha trifallax]
Length = 422
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 161/353 (45%), Gaps = 83/353 (23%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY------ 91
W L + E +L T GQ F WKK G + G + + +K + + Y
Sbjct: 6 WKQLVFHRQEFNLENTLVNGQCFNWKKLGEDHFEGVFSEYYVQVKRTHDDYIEYCTIPDH 65
Query: 92 --------------------------HIHTSPSEPAAKS-----ALLD----FLNMGISL 116
+ + P E S LD ++N I +
Sbjct: 66 TIKTQQIQPQDAEKKTKKISKIEEEKKVRSQPEEEEKTSHKTQDKFLDQFKAYINYDIKV 125
Query: 117 GELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-- 174
+L++ ++ D RF+++A+ + G R LRQDP EC + F+CS NNI RIT+M+D L
Sbjct: 126 LDLYDHWAKRDKRFSQIAEPIHGVRCLRQDPWECTVSFICSQCNNIKRITQMLDTLRQKV 185
Query: 175 --------------LGSHLGNVEGFE-----FHEFPSLERLSLVSEVELRNAGFGYRSAP 215
G+ + ++ + ++FP++E++S VSE ELR+ FGYR
Sbjct: 186 SKNIFVLMNKLILQYGTKICEIDQDDGSKKSIYKFPTIEQMSQVSEKELRDLKFGYR--- 242
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
AKY+ ++ K GG +WL SLR +E + L TL G+G K
Sbjct: 243 --------------AKYLVANAKMMNEK--GGKQWLESLRGKSNEEVREQLITLNGIGNK 286
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATR--YLLPELAGVRLTPKLCSRVAEAF 326
VA CIALFSLD ++IPVDTHV++IA + Y+ L KL ++ EAF
Sbjct: 287 VADCIALFSLDCANSIPVDTHVFQIAQKLGYVQGMKKDANLNQKLYMQIVEAF 339
>gi|328769544|gb|EGF79588.1| hypothetical protein BATDEDRAFT_89698 [Batrachochytrium
dendrobatidis JAM81]
Length = 306
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/302 (38%), Positives = 161/302 (53%), Gaps = 32/302 (10%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L + SEL + + GQ FRW K + G I +L+SL + DV +H ++
Sbjct: 2 WHSLGVPPSELRITRSLLCGQAFRWVKLSDNIWAGVILQNLVSLLQ-TDTDVLFHFYSDT 60
Query: 98 -SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAE--LAKYLAGARVLRQDPVECLLQF 154
S A+ L D+ +G SL L++ + D FA+ A+ L G RVLRQDP E + F
Sbjct: 61 ISLTEARLILYDYFQLGSSLELLYKTWLL-DSNFAKKVTAQNLIGLRVLRQDPSENVFSF 119
Query: 155 LCSSNNNIARITKMVDFL-ASLGSHLGNV-----EGFEFHEFPSLERLSLVS-EVELRNA 207
+CSSNNNI RI+ M+ L A G+ + ++ E F+ FP ++ L+ E LR
Sbjct: 120 ICSSNNNIPRISSMIRSLCAEYGTRIDSLRNDKGEHIVFYTFPEIKALAGDDVEQRLRQL 179
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
GFGYR AK+I G+ ++ + GG++WL SLR + Q A L
Sbjct: 180 GFGYR-----------------AKFIVGSAKLILER--GGSKWLHSLRDISYQAAHTELL 220
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
+LPGVGPKV+ CI L SLD+ AIPVDTHVW+IA R Y + + +T L + EAF
Sbjct: 221 SLPGVGPKVSDCICLMSLDKIGAIPVDTHVWQIAQRDYGMVGSSAKTITKSLYLHIGEAF 280
Query: 327 CE 328
E
Sbjct: 281 RE 282
>gi|50546697|ref|XP_500818.1| YALI0B12870p [Yarrowia lipolytica]
gi|49646684|emb|CAG83069.1| YALI0B12870p [Yarrowia lipolytica CLIB122]
Length = 403
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 103/298 (34%), Positives = 149/298 (50%), Gaps = 26/298 (8%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTG-PLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
W L L +++ +L + GQ+FRW K P IG + Q D S
Sbjct: 13 WNRLGLAKTDANLEILLKCGQSFRWTKIEHPNNNYWIIGMEGRGIVLNQKDDDTMWAEVS 72
Query: 97 PSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
KS L D+ N+ +L+E +S+ D F + G RVLRQDP E L
Sbjct: 73 DKGKPVKSRDTAAILNDYFNISTDTIKLYEDWSSRDDHFKNKSIKYLGIRVLRQDPWENL 132
Query: 152 LQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFG 210
F+CSSNNN+ RI+++V + + G H+ ++ + H FPS ++L+ + LR G G
Sbjct: 133 CSFICSSNNNVKRISQLVQKMTITFGDHVATLDDLKIHSFPSPDKLADTEPI-LRELGLG 191
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR AKYI+ T ++L +K GG ++L LR EA ++
Sbjct: 192 YR-----------------AKYISKTAEMLLTK-PGGEQFLHELRDASFDEAKSSIMEFL 233
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
GVGPKVA C+ LFSLD+H +PVDTHVW+IA + + +TPK ++V E F E
Sbjct: 234 GVGPKVADCVCLFSLDKHDTVPVDTHVWQIAQKDYGVARSAKTMTPKAYAQVQEFFRE 291
>gi|448105755|ref|XP_004200572.1| Piso0_003164 [Millerozyma farinosa CBS 7064]
gi|448108864|ref|XP_004201203.1| Piso0_003164 [Millerozyma farinosa CBS 7064]
gi|359381994|emb|CCE80831.1| Piso0_003164 [Millerozyma farinosa CBS 7064]
gi|359382759|emb|CCE80066.1| Piso0_003164 [Millerozyma farinosa CBS 7064]
Length = 365
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 148/290 (51%), Gaps = 48/290 (16%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT-- 95
W L +++SE+ L GQTFRWK + +T I ++ L+ QN D + H
Sbjct: 7 WNKLCISRSEVELSKVLRCGQTFRWKNIDDV-WTFTIKDRIVLLR--QNDDAVEYSHVPH 63
Query: 96 --------SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK-----YLAGARV 142
S+ KS L D+ N+ I+LG+L++ +S D F + K G R+
Sbjct: 64 HKKSDDGDIRSQGETKSFLQDYFNLDITLGDLYDHWSKKDGNFKRVLKSSPFQEFKGIRI 123
Query: 143 LRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERL--SLV 199
LRQDP E L+ F+CSSNNN+ RI+KM + L G ++ G + + FPS E+L S
Sbjct: 124 LRQDPWETLVSFICSSNNNVKRISKMCEALCVEFGDYVNEYAGVKHYSFPSPEQLLSSPS 183
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG------AEWLLS 253
E LR GFGYR AKYI T S+ +G E L S
Sbjct: 184 LESRLRELGFGYR-----------------AKYIYATA----SRMTGDEYPDITVEKLNS 222
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+R+ D + A + L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 223 MREEDYETAHNFLLQLMGVGPKVADCICLMALDKHDVVPVDTHVYQIALR 272
>gi|430813566|emb|CCJ29082.1| unnamed protein product [Pneumocystis jirovecii]
Length = 337
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 139/264 (52%), Gaps = 30/264 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS------PSEPA 101
+ L T GQ FRWK G +++ +I LK Q+ Y+ S +E
Sbjct: 18 VRLETTLQAGQAFRWKYLGD-EWSCAFNNMIILLK--QDASYVYYRVISRELAFMKNEKE 74
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+ L ++ N+ + L +L++ ++ D F + G R+LRQDP E L+ F+CSSNNN
Sbjct: 75 HIAFLENYFNLSVRLCDLYKEWAHKDKNFKLQMRRFKGVRILRQDPWENLICFICSSNNN 134
Query: 162 IARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSL 219
I RITKMVD L S GS +G V ++ FP+ +LS S E ELR GFGYR
Sbjct: 135 IRRITKMVDILCSRYGSLVGTVRNIDYFNFPTPAKLSESSVESELRQLGFGYR------- 187
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
AK+I T +++++ EW SLR EA LC L GVG KVA C
Sbjct: 188 ----------AKFIQKTAWIIENERP--KEWFESLRNKPYNEAKKNLCELIGVGDKVADC 235
Query: 280 IALFSLDQHHAIPVDTHVWKIATR 303
+ L SLD+ AIP+DTHVW+IA R
Sbjct: 236 VCLMSLDKPSAIPIDTHVWQIARR 259
>gi|412985316|emb|CCO20341.1| predicted protein [Bathycoccus prasinos]
Length = 475
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 158/357 (44%), Gaps = 98/357 (27%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ-----------YTGPIGPHLISLK---- 82
W L T+ +LSL T PTGQ+FRW+K Y G IG ++ ++
Sbjct: 18 WKSLRTTRDDLSLDRTLPTGQSFRWQKVEEYDENDENEKKYSTYVGVIGRRVVQIRERME 77
Query: 83 -------------------HLQNGDVCYHIHTSPSEPAAKSALLD---FLNMG--ISLGE 118
+ ++C + E + D + N+ I L +
Sbjct: 78 FEEEEEKSYETTTIEFRVLNTNESNICKNEDKENGEQEEEQVTKDVRAYFNLDDPIPLKD 137
Query: 119 LWEGFSASD-CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLG 176
L+E FS +D RF L+ Y+ GAR LRQ P ECL+ F+CSSNNNIARI KMV+ L G
Sbjct: 138 LFEQFSKADPYRFGRLSAYVRGARTLRQPPAECLISFVCSSNNNIARIKKMVESLCEKYG 197
Query: 177 SHL-----------------------GNVEGFE-------FHEFPSLERLS-LVSEVELR 205
L V E +H FP++E+L+ E LR
Sbjct: 198 EELVCNVSSRSRSRSSGSSGRDDDEENEVNDKEEINESKIYHAFPTVEQLAEKCDEQTLR 257
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE---- 261
+ GFGYR AK+I L + GG +LL LR E
Sbjct: 258 DLGFGYR-----------------AKFIPAMAKELVKR--GGEAYLLKLRDAPQTEEEKY 298
Query: 262 -AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLP--ELAGVRLT 315
AI L +LPGVGPKVA C ALFSLD+ +PVDTHVW++A + P ELA + T
Sbjct: 299 RAITELTSLPGVGPKVAGCAALFSLDKKSVVPVDTHVWQLAREHFDPNGELASMETT 355
>gi|345569089|gb|EGX51958.1| hypothetical protein AOL_s00043g692 [Arthrobotrys oligospora ATCC
24927]
Length = 420
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/333 (34%), Positives = 166/333 (49%), Gaps = 58/333 (17%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + W L ++ ++L L GQ+FRWK +GP +++ +ISL+
Sbjct: 1 MSKGHATAWAQLPVSLNQLCLATVLRCGQSFRWKASGPDEWSCGFSDRIISLRQSPT--- 57
Query: 90 CYHIH---TSPSEP--AAKSALL----DFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
H+H T PS P +AK + D+ N+ I L L+E +S +D F + A G
Sbjct: 58 --HLHYRATFPSPPPKSAKDDTVEIVKDYFNLSIDLAALYEKWSLADPNFKKKAGEFKGV 115
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNV----------------- 182
R+LRQDP ECL+ F+CSSNNNI+RI +MVD L A+ G+ LG +
Sbjct: 116 RMLRQDPWECLIGFICSSNNNISRIGQMVDKLCATYGTPLGTITHTSTTPLSPSSPSSEN 175
Query: 183 ----EGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
++ FP++E L+ E L+ GFGYR AKYI T
Sbjct: 176 KTPPTEVAYYSFPTIEALTTPDVEETLKTLGFGYR-----------------AKYIYKTA 218
Query: 238 DVLQSKHSGGAEWLLSLR-KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
++ G +L +LR + D +EA L GVGPKVA C+ L SLD+ A+PVDTH
Sbjct: 219 CMVHEDRPKG--FLNTLRDEKDYKEAHAELIQFMGVGPKVADCVCLMSLDKMGAVPVDTH 276
Query: 297 VWKIATR-YLLPELAGVRLTPKLCSRVAEAFCE 328
VW+IATR Y + LTP + V + F E
Sbjct: 277 VWQIATRDYKFGKGKHRSLTPAVYEAVGDLFRE 309
>gi|255731942|ref|XP_002550895.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240131904|gb|EER31463.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 345
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/278 (36%), Positives = 148/278 (53%), Gaps = 29/278 (10%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT-- 95
W + +++SE+SL GQTFRWK + ++ I +I LK ++ HI
Sbjct: 7 WKKIAISESEVSLHKVLRCGQTFRWKSINNV-WSFAISDRIILLKQDESNIYYSHIMQKG 65
Query: 96 -SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK-----YLAGARVLRQDPVE 149
S ++ S + D+ +GI+LGEL+ + + K G R+LRQDP E
Sbjct: 66 RSGNDSETLSFINDYFTLGINLGELYTHWKLKHQVYQSKTKKSPFDLFTGIRILRQDPWE 125
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL+ F+CSSNNN+ RI+KM D L G ++ EG +++ FP+ E LS S E +LR
Sbjct: 126 CLVSFICSSNNNVKRISKMCDNLCIHFGEYINEYEGHKYYSFPTPESLSQSSVESKLREL 185
Query: 208 GFGYRSA--PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
GFGYR+ Q++ F+ +KY T+D L S +R +A +
Sbjct: 186 GFGYRAKYIHQTACKFT-----DDSKYPDITLDQLNS-----------MRNEKYIKAHEF 229
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L L GVGPKVA CI L SLD+H +P+DTHV++IA R
Sbjct: 230 LLQLTGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVR 267
>gi|449474119|ref|XP_002186975.2| PREDICTED: N-glycosylase/DNA lyase [Taeniopygia guttata]
Length = 357
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 102/286 (35%), Positives = 147/286 (51%), Gaps = 41/286 (14%)
Query: 61 RWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI------------HTSPSEPAAKSA--- 105
RW ++ P +TG +G + +L+ Q+GD ++ P++ A S+
Sbjct: 60 RWWESSPGAWTGVLGGRVWTLR--QDGDRLWYTVYGEEEEHQEERDERPAKAAKLSSAET 117
Query: 106 ---LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
L D+ + + L L+ + A+D F ++A G RVLRQDPVECL F+C+SNN+I
Sbjct: 118 ERILRDYFQLDVGLSPLYHAWGAADPLFRKVASDFPGVRVLRQDPVECLFSFICTSNNHI 177
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRNAGFGYRSAPQSSLL 220
+RIT M++ L + G L ++ FH FPSL L+ +E LR GFGYR
Sbjct: 178 SRITAMIERLCQAFGRRLCCLDSRPFHAFPSLSALTGADAEARLRALGFGYR-------- 229
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
AK+++G+ + G+E L LR EA LC LPGVG KVA C+
Sbjct: 230 ---------AKFVSGSARAIAEGL--GSEGLCQLRTAPYAEARRVLCALPGVGAKVADCV 278
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
L SLD+ A+PVDTHVW IA + L G LTP+ + + F
Sbjct: 279 CLLSLDKAEAVPVDTHVWHIARQRYGVALGGKSLTPRAYQEIGDFF 324
>gi|405972859|gb|EKC37606.1| N-glycosylase/DNA lyase [Crassostrea gigas]
Length = 364
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 128/231 (55%), Gaps = 22/231 (9%)
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSS 158
E + + + D+ + I +G L++ +S D F ++ G R+LRQDPVE L F+CSS
Sbjct: 92 ENSYEDLIKDYFQLNIKVGNLYQKWSDVDSNFQSISSKFGGIRILRQDPVENLFSFICSS 151
Query: 159 NNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQ 216
NN+I+RI+ MV+ L + G +G G + FP++ LS E ELRN GFGYR
Sbjct: 152 NNHISRISSMVEKLCENYGREVGKFNGKTYFSFPTIFELSEDGVESELRNLGFGYR---- 207
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
AKYI + + K GG WL +LR++ EA L L GVG KV
Sbjct: 208 -------------AKYINKSAQQILEK--GGETWLRALREIPYAEAKKELLALNGVGAKV 252
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
A C+ L SLD+ A+PVDTHVW+IA R +P L+ + LT KL + + + F
Sbjct: 253 ADCVCLMSLDKTDALPVDTHVWQIAARGYMPSLSKCKSLTDKLYNEIGDHF 303
>gi|91086671|ref|XP_968299.1| PREDICTED: similar to N-glycosylase/DNA lyase [Tribolium castaneum]
gi|270009750|gb|EFA06198.1| hypothetical protein TcasGA2_TC009047 [Tribolium castaneum]
Length = 308
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 103/299 (34%), Positives = 155/299 (51%), Gaps = 29/299 (9%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKT---GPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
W L + +L L T GQ+FRWK T G ++ G H+ LK + + Y ++
Sbjct: 5 WFKLICQKEQLQLLGTLNGGQSFRWKFTEAEGDKKWIGVFSDHVWVLKQTDDC-ILYQVY 63
Query: 95 TSP-SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
S SE L ++ + + L E + +S+ D F + A G R+L+QD VE +
Sbjct: 64 GSENSEDFYNKLLSNYFQLNLDLKEKFAEWSSKDPIFEKAASQFYGIRILKQDLVENIFS 123
Query: 154 FLCSSNNNIARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFG 210
F+CSSNNNI+RIT MV+ LA G + +E ++ FP ++ L+ +VE L+ GFG
Sbjct: 124 FICSSNNNISRITGMVEKLAKFYGEKICELEDQTYYSFPKIDSLA-DDKVESVLKKEGFG 182
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR AKYI + ++ GG +WL L+KL +E+ L TL
Sbjct: 183 YR-----------------AKYINQSAKIIM--QEGGEKWLDDLKKLPYEESKSKLMTLT 223
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
G+GPKVA CI L SL IP+DTHV++IA ++ +P L + +T K+ + + F E
Sbjct: 224 GIGPKVADCICLMSLGHLGTIPIDTHVYQIARKFYMPHLPKRKTVTAKIYKDIGDHFRE 282
>gi|346469589|gb|AEO34639.1| hypothetical protein [Amblyomma maculatum]
Length = 351
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 155/320 (48%), Gaps = 52/320 (16%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE- 99
+ T +L+L +T GQ FRWKK ++ + L+ +G++ Y + +P+
Sbjct: 7 IKCTTRQLNLDVTLTCGQAFRWKKNDQSEWMSTFAGRVWCLRQDDDGNLHYRVLNNPNNG 66
Query: 100 ---PAAKSA-------------------------LLDFLNMGISLGELWEGFSASDCRFA 131
P S+ L D+ +G+ L L+ G+ +D F
Sbjct: 67 LVCPNFDSSQTESRKRKRRRLEVVPHPTEQNEELLRDYFQLGVDLEGLYTGWCQADPTFR 126
Query: 132 ELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL-GSHLGNVEGFEFHEF 190
A +L G RVLRQ+P+E L+ F+CSSNNNI RI+ MV+ L ++ G+ L + F+ F
Sbjct: 127 ATASFLPGIRVLRQEPLEALMAFICSSNNNITRISSMVEKLCTMYGTKLFEGKEGSFYSF 186
Query: 191 PSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
P+ ++ AGFGYR AKY+ GT +L S+ G
Sbjct: 187 PTASQMDEERVEVELREAGFGYR-----------------AKYVHGTAKILTSR---GPL 226
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
WL SLR EA L LPGVG KVA C+ L +LD+ A+PVD HVW++AT++ LP L
Sbjct: 227 WLQSLRGAPYIEAHRQLMELPGVGAKVADCVCLMALDKSEAVPVDVHVWRMATQHYLPHL 286
Query: 310 AGVR-LTPKLCSRVAEAFCE 328
++ LT + + F E
Sbjct: 287 RSLKNLTALAYKEIGDHFRE 306
>gi|241735371|ref|XP_002404906.1| 8-oxoguanine DNA glycosylase, putative [Ixodes scapularis]
gi|215505591|gb|EEC15085.1| 8-oxoguanine DNA glycosylase, putative [Ixodes scapularis]
Length = 329
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 153/309 (49%), Gaps = 56/309 (18%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI---- 93
W + T+ +L+L +T GQ+FRWKK ++ + LK G++ Y +
Sbjct: 3 WASIQCTRRQLNLDITLRCGQSFRWKKNDEGEWLSTFANKVWVLKQDLEGNILYKVLDGN 62
Query: 94 -----------------------HTSPSEPAAKSAL-------LDFLNMGISLGELWEGF 123
SE A+ L D+ M + L +L++ +
Sbjct: 63 GTATSSCNDGLSLAKGANRKRKAKNGLSESQAECQLDDHGHLLRDYFQMNVDLEKLYKQW 122
Query: 124 SASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL-GSHLGNV 182
SA+D F ++A L G R+LRQDP E L+ F+CSSNNNI RI+ MV+ L + G+HL +
Sbjct: 123 SAADEIFRDVAVALPGVRILRQDPFEALMAFICSSNNNITRISSMVNKLCEMYGTHLLSS 182
Query: 183 EGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQ 241
F+ FP+ E+L S E EL+ AGFGYR+ + ++ R D L
Sbjct: 183 STGSFYAFPTAEQLDRDSVEPELKAAGFGYRARYVRAAARALSRQ-----------DPL- 230
Query: 242 SKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
WL SLR++ EA L LPGVGPKVA C+ L +LD+ +PVD HVWK+A
Sbjct: 231 --------WLASLREVSYPEAHSQLLQLPGVGPKVADCVCLMALDKADVVPVDVHVWKLA 282
Query: 302 TRYLLPELA 310
R+ LP L+
Sbjct: 283 VRHYLPHLS 291
>gi|339245891|ref|XP_003374579.1| N-glycosylase/DNA lyase [Trichinella spiralis]
gi|316972176|gb|EFV55864.1| N-glycosylase/DNA lyase [Trichinella spiralis]
Length = 382
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/292 (33%), Positives = 150/292 (51%), Gaps = 30/292 (10%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT-- 95
W + + E++L ++ +GQ+FRW +TG + ++ ++ GD C +
Sbjct: 5 WVKIKCSIQEVNL-ISLNSGQSFRWTVDKDGIWTGVVHHQVLRVR---RGDGCIWLQRIG 60
Query: 96 --SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
S + K L D+ +G + ++ +SA D RFAE + G RVLRQDP ECL+
Sbjct: 61 RFSRCQIGKKDCLRDYFQLGTPIADMHSHWSALDPRFAEAHRRHPGVRVLRQDPFECLIA 120
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLS-LVSEVELRNAGFGYR 212
F+CSSNNNI RIT M++ L V +++FP+ E LS + +E LR GFGYR
Sbjct: 121 FICSSNNNIPRITSMINRLCERFGERIVVGRHSYYDFPTAEALSAMHAEGSLRRLGFGYR 180
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
A+++ ++Q + G +WL SLR + A L LPGV
Sbjct: 181 -----------------ARFVVEASRIVQDR---GRDWLQSLRHCSYEVASGQLQRLPGV 220
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVA 323
G KVA C+ L +LD+ A+PVDTHVW++ + L L + LTP L ++
Sbjct: 221 GQKVADCVCLMALDKTDAVPVDTHVWRLTRDHYLTNLDDRQHLTPALYKQIG 272
>gi|391342466|ref|XP_003745541.1| PREDICTED: N-glycosylase/DNA lyase-like [Metaseiulus occidentalis]
Length = 341
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 159/307 (51%), Gaps = 43/307 (14%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWK-KTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPS- 98
+++ ++EL+L + GQ FRWK ++ + + L+ +G + Y +
Sbjct: 8 IHVLKNELNLDVVLRCGQAFRWKLDEDKREWISTLKGRVFGLQQTDDG-IQYRVWKGDDA 66
Query: 99 -------------EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQ 145
+A++ L ++ +G+ L L E + +D F + Y G R LRQ
Sbjct: 67 ENSENNGTDGIYKHESAEAILTNYFQLGVRLQNLVEEWK-NDPLFENVHIY--GVRTLRQ 123
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS---EV 202
P ECL+ F+ SS NNI RI+ MVD L S G L + G E+H+FP++E LS + E
Sbjct: 124 QPFECLMSFIISSCNNIKRISSMVDHLCSYGPILATINGTEYHDFPTVEALSSIGPELET 183
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEA 262
LR+A FGYR A YI VD+++ K G +WL SLR+ +EA
Sbjct: 184 NLRSASFGYR-----------------ATYIAKVVDIVKKK--GSDDWLNSLREAPYEEA 224
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSR 321
D L PGVG KVA C+ L +LD+ A+P+DTHV++IA + +P+L V+ L+P +
Sbjct: 225 RDRLMEFPGVGRKVADCVCLMALDKPQAVPIDTHVFQIA-KANMPQLKDVKNLSPAVYPV 283
Query: 322 VAEAFCE 328
+ + F E
Sbjct: 284 IVDWFRE 290
>gi|345497745|ref|XP_001602207.2| PREDICTED: N-glycosylase/DNA lyase-like [Nasonia vitripennis]
Length = 367
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/287 (36%), Positives = 151/287 (52%), Gaps = 30/287 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKK-TGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAK 103
+ EL L +T GQ+FRW + + Y G L +L N + Y H S
Sbjct: 47 KDELDLGITLKGGQSFRWVECSDGNSYRGVFNSALWTLSQSDN-KLFYTRHGSKKGFQCD 105
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
AL ++ + +SL E + ++ +D F + R+L QD VE L F+CSSNNNI
Sbjct: 106 KALSNYFRLDVSLKENLKSWAGTDSHFKTTYDKIGAVRILNQDVVENLFSFICSSNNNIT 165
Query: 164 RITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLL 220
RI+ MVD L L G ++ VEG ++++FP +E+LS SE+E LR GFGYR
Sbjct: 166 RISGMVDKLCRLFGEYICTVEGQDYYDFPKIEKLS-SSELENILRKEGFGYR-------- 216
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD---LQEAIDALCTLPGVGPKVA 277
A YI + L S G +WLL L+K + + A ++L +LPG+GPKVA
Sbjct: 217 ---------AGYIVKSAKKLLSL---GEDWLLGLKKENGATYEHARESLMSLPGIGPKVA 264
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVA 323
CI L SL +IPVDTH++++A P L+ + +TPK+ V+
Sbjct: 265 DCICLMSLGHLESIPVDTHIFQVACSNYTPHLSKQKTVTPKIHQEVS 311
>gi|320580173|gb|EFW94396.1| mitochondrial N-glycosylase/DNA lyase [Ogataea parapolymorpha DL-1]
Length = 295
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 103/283 (36%), Positives = 148/283 (52%), Gaps = 29/283 (10%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
ELS+ GQ FRWK + ++ I ++ LK D H + P + +
Sbjct: 6 ELSVARVLRCGQAFRWKYVDDI-WSCTIQNRVLLLKQ---DDTFLHYASIPHLKDTEEVV 61
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
LD+ N+ I L EL+ +++ D F + + AG R+LRQDP E L+ F+CS+NNN+ RI+
Sbjct: 62 LDYFNLSIRLEELYLDWASKDKHFTKNSANFAGIRMLRQDPWENLVSFICSTNNNVKRIS 121
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVR 224
KM + L G L +G + ++FP ERL+ + E ELR GFGYR
Sbjct: 122 KMCENLCIHYGEFLVEYQGIKHYKFPEPERLAQPNVEAELRALGFGYR------------ 169
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AK+I T +L K L S+R + D L T GVGPKVA C+ L S
Sbjct: 170 -----AKFIHQTAKMLTEKDEFAK--LYSMRTEPHKTCHDYLQTFMGVGPKVADCVCLMS 222
Query: 285 LDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
LD+H +PVDTHV+KIAT+ Y +P G L +L +++ + F
Sbjct: 223 LDKHDVVPVDTHVFKIATKTYRMP---GKLLNKELYAQIQDKF 262
>gi|320165611|gb|EFW42510.1| DNA N-glycosylase [Capsaspora owczarzaki ATCC 30864]
Length = 441
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 150/332 (45%), Gaps = 77/332 (23%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH------------- 83
+W L +EL + +T GQ+FRWKKTG +++ +G LISL+
Sbjct: 5 RWRSLAAPPAELRIDVTLACGQSFRWKKTGDNEWSSVLGGRLISLQQTADDVLFADRTVS 64
Query: 84 ---------------------------------------LQNGDVCYHIHTSPSEPAAKS 104
LQ+ +++ +P
Sbjct: 65 EPASAAATSMSMDSHDNGATDAAGAAAAGASSATKASVLLQSSIAPTMTNSTDVKPMLDD 124
Query: 105 ALLDFLN----MGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
L D L + + L L+ ++ASD RF +++ L G R+LRQDP E L F+CSSNN
Sbjct: 125 TLGDMLRRYFWLDVKLEPLYASWAASDPRFKYISQRLPGVRILRQDPTENLFSFICSSNN 184
Query: 161 NIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSS 218
NI+RIT M+D + G G+++G F +FP + L+ E LR GFGYR
Sbjct: 185 NISRITLMIDRMCQQYGVKRGDIDGQSFFDFPEVSALAQDGVEERLRELGFGYR------ 238
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
AK+I Q GG+EWLL R L A L L GVGPKVA
Sbjct: 239 -----------AKFIEQAAK--QVLKLGGSEWLLQQRALPYVTARTNLVQLQGVGPKVAD 285
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
CI L S+D+H AIPVD HV ++A R +LA
Sbjct: 286 CICLMSMDKHDAIPVDVHVRRLAERDYKLKLA 317
>gi|449665557|ref|XP_002157849.2| PREDICTED: N-glycosylase/DNA lyase-like [Hydra magnipapillata]
Length = 369
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 146/299 (48%), Gaps = 34/299 (11%)
Query: 45 QSELSLPLTFPTGQTFRWKKT--GPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP----- 97
+ E++L L GQ+FRWK P + + L L+ +G + Y S
Sbjct: 39 EKEINLKLILSGGQSFRWKTNSEAPNIWFNVLNKKLWYLEQKDDG-IYYKTVKSTCLSDN 97
Query: 98 -----SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK-YLAGARVLRQDPVECL 151
+ K L D+ + I + +L+E + D L+ G RVL+QDPVE L
Sbjct: 98 NKCINDDITDKKFLYDYFQLDIDISKLYEEWRRVDNNIKVLSDCNFYGIRVLKQDPVENL 157
Query: 152 LQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFG 210
F+CS NNNI RI+++V+ L + E FP+++ L++ E LR GFG
Sbjct: 158 FSFICSQNNNIVRISQLVEKLCVFFGEKIVCDNKEVFCFPTVQALAVTGVEDALRKLGFG 217
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR AK+I + +H G WL LR +EAI LC+LP
Sbjct: 218 YR-----------------AKFIHHAAKSIIEEH-GNENWLFDLRNKSYEEAIQNLCSLP 259
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
GVG KVA C+ L SL++ +AIPVDTH+W+IA++Y +P L + LT K+ + F E
Sbjct: 260 GVGAKVADCVCLMSLNKKNAIPVDTHMWQIASQYYMPHLKSNKTLTSKVYKDIGNFFRE 318
>gi|150863900|ref|XP_001382534.2| 8-oxoguanine DNA glycosylase [Scheffersomyces stipitis CBS 6054]
gi|149385156|gb|ABN64505.2| 8-oxoguanine DNA glycosylase [Scheffersomyces stipitis CBS 6054]
Length = 336
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 150/285 (52%), Gaps = 37/285 (12%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK---------HLQNGD 88
W + + ++ELSL GQTFRWK + ++ I ++ LK HL
Sbjct: 7 WRQIPIREAELSLTKVLRCGQTFRWKNINNV-WSFAIHDRVVLLKQDEEYLHYSHLMKES 65
Query: 89 VCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELA----KYLAGARVLR 144
+ T+ SE + D+ N+ ++L +L++ +S+ F + + G R+LR
Sbjct: 66 LTTQKSTAESEKQTLEFVKDYFNLSVNLVDLYDHWSSQHEPFKKSKLTPFEQFKGIRILR 125
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVE 203
QDP E ++ F+CSSNNN+ RI+KM D L S G ++ +G ++H FPS E+L+ S++E
Sbjct: 126 QDPWETVVSFICSSNNNVKRISKMCDSLCSEFGDYINEHDGVQYHSFPSAEKLASSSKIE 185
Query: 204 --LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT-VDVLQSKH--SGGAEWLLSLRKLD 258
LR GFGYR AKYI T V + +K E L SLR +
Sbjct: 186 TRLRELGFGYR-----------------AKYIYQTAVKFVDNKGFPDITIEKLNSLRNEE 228
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ + + L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 229 YELSHNFLLQLTGVGPKVADCICLMALDKHDCVPVDTHVFQIAVR 273
>gi|145355019|ref|XP_001421769.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582007|gb|ABP00063.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/224 (41%), Positives = 133/224 (59%), Gaps = 25/224 (11%)
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+ + ++ N +SL L+E F+A D RF LA ++ GAR+LRQDP ECL F+CSSNN+I
Sbjct: 10 RRVMREYFNADVSLERLYEEFAAKDGRFRALAAHVDGARMLRQDPSECLFSFICSSNNHI 69
Query: 163 ARITKMVDFLASLG-SHLGNVEGFEFHEFPSLER-LSLVSEVELRNAGFGYRSAPQSSLL 220
+RI MV+ + + G E +F+ FPS+ + L +E ELR GFGYR
Sbjct: 70 SRIHGMVNKMCNGGEETAAAAEAEKFYAFPSVAQILDGATEEELRALGFGYR-------- 121
Query: 221 FSVRRSFKQAKYITGTVDVL----QSKHSGGAEWLLSLR-KLDLQEAIDALCTLPGVGPK 275
AK+I G+ L + S E+L +LR + +A AL TLPG+GPK
Sbjct: 122 ---------AKFIVGSAAALGEAGDAVGSSPDEFLRALRDETPYVDAHAALMTLPGIGPK 172
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKL 318
V+AC+ LFSL++H AIPVDTHVW+ A + +PEL + +TP++
Sbjct: 173 VSACVCLFSLNKHEAIPVDTHVWQFAVEHYMPELRDAKSVTPRV 216
>gi|344299660|gb|EGW30013.1| hypothetical protein SPAPADRAFT_63635 [Spathaspora passalidarum
NRRL Y-27907]
Length = 344
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/278 (35%), Positives = 147/278 (52%), Gaps = 32/278 (11%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W + + Q+EL+L GQTFRWK +++ +I LK Q+ ++ H
Sbjct: 7 WNKIPIIQTELNLSKVLRCGQTFRWKNINH-EWSFTTQDRIILLK--QDDQFIHYSHIMK 63
Query: 98 SEPAAKSA-----LLDFLNMGISLGELWEGFSASDCRFAE-LAKYLA--GARVLRQDPVE 149
+ S+ L D+ ++ + L +L+ + S ++ + ++ + A G R++RQDP E
Sbjct: 64 HDTNGTSSQTLDFLHDYFSLDLKLADLYTSWKTSHKKYEKGISPFDAFQGIRIIRQDPWE 123
Query: 150 CLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL+ F+CSSNNN+ RI+KM D L S G+++ ++++ FP+ E L+ E +LR+
Sbjct: 124 CLISFICSSNNNVKRISKMCDSLCSEFGTYINEYNDYKYYSFPTAETLAQPDVEAKLRDL 183
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH--SGGAEWLLSLRKLDLQEAIDA 265
GFGYR AKYI T S E L SLR D EA +
Sbjct: 184 GFGYR-----------------AKYIHQTAMKFTSPDFPEITIENLNSLRSKDYTEAHEF 226
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L L GVGPKVA CI L SLD+H +P+DTHV++IA R
Sbjct: 227 LLQLTGVGPKVADCICLMSLDKHDIVPIDTHVYQIAVR 264
>gi|4587154|dbj|BAA76637.1| OGG1 protein type 2d [Homo sapiens]
Length = 275
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 20/206 (9%)
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP+ECL F+CSSN
Sbjct: 9 PDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSN 68
Query: 160 NNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQS 217
NNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR G GYR
Sbjct: 69 NNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLRKLGLGYR----- 123
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A+Y++ + + + GG WL LR+ +EA ALC LPGVG KVA
Sbjct: 124 ------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKALCILPGVGTKVA 170
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
CI L +LD+ A+PVD H+W IA R
Sbjct: 171 DCICLMALDKPQAVPVDVHMWHIAQR 196
>gi|149235536|ref|XP_001523646.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452625|gb|EDK46881.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 438
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 152/295 (51%), Gaps = 45/295 (15%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH-------------- 83
W L ++++EL+L + GQTFRWK + +T ++ LK
Sbjct: 45 WHKLPISETELNLSIVLRCGQTFRWKNINNV-WTFATKDRILLLKQEQRQQRYHHQHHHQ 103
Query: 84 ------LQNGDVCY-HIHTSPSEPAAKSALL---DFLNMGISLGELWEGFSASDCRFAE- 132
+N + Y HI S + L D+ + L +L+ + S +++
Sbjct: 104 QQQQQQRENDFIEYSHIMKSDKKGTYNDTLSWISDYFTLDTKLHDLYASWKVSGKKYSTT 163
Query: 133 LAKYLA--GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHE 189
++K+ + G R LRQDP ECL+ F+CSSNNN+ RI+KM D L G + EG++ +
Sbjct: 164 ISKFDSHTGIRTLRQDPWECLISFICSSNNNVKRISKMCDNLCEHFGDLVNEYEGYKHYS 223
Query: 190 FPSLERLSLV-SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA 248
FP+ E+LS +E +LR GFGYR+ ++ + F +Y T+
Sbjct: 224 FPTPEQLSASNTESKLRELGFGYRAK----YIYQTAKKFTSPEYPDITI----------- 268
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
E L+S+R +D ++A + L L GVGPKVA CI L SLD+H +P+DTHV++IA R
Sbjct: 269 EKLMSMRDMDYEQAHEFLLQLSGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVR 323
>gi|67541024|ref|XP_664286.1| hypothetical protein AN6682.2 [Aspergillus nidulans FGSC A4]
gi|40738435|gb|EAA57625.1| hypothetical protein AN6682.2 [Aspergillus nidulans FGSC A4]
gi|259480266|tpe|CBF71239.1| TPA: mitochondrial glycosylase/lyase (Eurofung) [Aspergillus
nidulans FGSC A4]
Length = 414
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 156/314 (49%), Gaps = 61/314 (19%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W L + SEL + T GQTFRW K ++ + LISLK Q+
Sbjct: 1 MTVGAFSEWHKLPIGLSELCINTTLRCGQTFRWHKDPDNDEWRCVLYGRLISLK--QDPS 58
Query: 89 VCYH---IHTSPS----------EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK 135
Y+ +++ PS E + + + N+ +L L+ +S+SD F + A
Sbjct: 59 HLYYRTYVNSKPSGSCNGSDSGSEDTTLAIIKHYFNLNSNLTTLYAQWSSSDPNFRKKAS 118
Query: 136 YLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLE 194
G R+LRQD E L+ F+CSSNNNIARI++MV+ L A+ G H+ +V+G +H+FP E
Sbjct: 119 QFTGIRILRQDAWEALISFICSSNNNIARISQMVEKLCANYGLHIADVDGRSYHDFPPPE 178
Query: 195 RLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
RL+ VE LR+ GFGYR AKYI T ++ + G WL
Sbjct: 179 RLAEDEGVEARLRSLGFGYR-----------------AKYIYQTAVIIAKQKENG--WLN 219
Query: 253 SLR-----------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
SLR + +EA + L L GVGPKVA C+AL L
Sbjct: 220 SLRNPEAPAFGLEVVAGQEGEMPPEGRSGYREAHEKLLELQGVGPKVADCVALMGLGWGE 279
Query: 290 AIPVDTHVWKIATR 303
++PVDTHVW+IA R
Sbjct: 280 SVPVDTHVWQIAQR 293
>gi|4587156|dbj|BAA76638.1| OGG1 protein type 2e [Homo sapiens]
Length = 241
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 20/206 (9%)
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP+ECL F+CSSN
Sbjct: 9 PDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSN 68
Query: 160 NNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQS 217
NNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR G GYR
Sbjct: 69 NNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLRKLGLGYR----- 123
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A+Y++ + + + GG WL LR+ +EA ALC LPGVG KVA
Sbjct: 124 ------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKALCILPGVGTKVA 170
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
CI L +LD+ A+PVD H+W IA R
Sbjct: 171 DCICLMALDKPQAVPVDVHMWHIAQR 196
>gi|84998168|ref|XP_953805.1| 7,8 dihydro-8-oxoguanine DNA glycosylase [Theileria annulata]
gi|65304802|emb|CAI73127.1| 7,8 dihydro-8-oxoguanine DNA glycosylase, putative [Theileria
annulata]
Length = 299
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/296 (33%), Positives = 153/296 (51%), Gaps = 33/296 (11%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
+KW L + + L L TGQ+F WK G + G +G + +K Q+ D ++
Sbjct: 9 TKWFDLRVPLTVLRPELLLTTGQSFTWKCVGNKHWVGVLGSSVYEIK--QSDDTT--LYR 64
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+ ++ L D+ ++ E F+ + ++ K +G R+L+QDP ECL+ F+
Sbjct: 65 TLFGKCSRERLWDYFDLD---NEYSVDFTKAPKPVKQIIKRRSGVRILQQDPFECLISFI 121
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLG----NVEGFEFHEFPSLERLSLVSEVELRNAGFG 210
CSSNNNI+RIT+M++ + + G+ L N E F+F+ FPS+++L + +L+ G G
Sbjct: 122 CSSNNNISRITRMLNEIKRNFGTFLAKSEVNNETFDFYAFPSVDQLRKATPEQLKKLGLG 181
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YRS +I TV++L S+ G WL SLR D AL +LP
Sbjct: 182 YRS-----------------DFIFKTVEILNSR---GLNWLYSLRNEDSDTCKSALTSLP 221
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
GVG KVA C++LFSL + +PVD H+ KIA + + G L+ R+ AF
Sbjct: 222 GVGRKVADCVSLFSLGKRDVVPVDVHIQKIANTFFGVK-CGKSLSDSDYERIGTAF 276
>gi|383863957|ref|XP_003707446.1| PREDICTED: N-glycosylase/DNA lyase-like [Megachile rotundata]
Length = 335
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 150/286 (52%), Gaps = 30/286 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP--SEPAAK 103
SEL+L +T GQ+FRW Y G + +LK QN ++ P
Sbjct: 20 SELNLGITLKGGQSFRWFSYND-GYRGIFDGCVWTLK--QNDTHLFYDVQGPLLDSLNYD 76
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
L ++ + +SL +L E ++A D F G R+L QD VE + F+CSSNNNI
Sbjct: 77 EILSEYFRLDVSLHDLSEKWAAVDYHFERALNETNGVRILNQDVVENVFSFICSSNNNIQ 136
Query: 164 RITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLS-LVSEVELRNAGFGYRSAPQSSLLF 221
RI+ MV+ L S+ GS + V+G +++ FPS+E LS +E++L+ FGYR
Sbjct: 137 RISSMVEKLCSMFGSLICTVDGKDYYAFPSVEALSGEDTELKLKKEKFGYR--------- 187
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD---LQEAIDALCTLPGVGPKVAA 278
A YI+ T L KH GG +WLL L K + A + L TLPG+GPKVA
Sbjct: 188 --------AGYISNTAKRLL-KH-GGQQWLLDLHKRNNTSYSNAREQLITLPGIGPKVAD 237
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVA 323
CI L SL AIPVDTH+++IA LP L + +TPK+ ++
Sbjct: 238 CICLMSLGHLEAIPVDTHIFQIARANYLPRLVQQKTVTPKIHEEIS 283
>gi|71033695|ref|XP_766489.1| 8-oxoguanine DNA-glycosylase [Theileria parva strain Muguga]
gi|68353446|gb|EAN34206.1| 8-oxoguanine DNA-glycosylase, putative [Theileria parva]
Length = 279
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 98/296 (33%), Positives = 151/296 (51%), Gaps = 33/296 (11%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
SKW L + S L L TGQ+F WK G + G +G + +K Q+ D ++
Sbjct: 9 SKWLDLRVPLSVLRPELLLTTGQSFTWKCVGDKHWVGVLGSSVYEIK--QSTDT--TLYR 64
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+ ++ L D+ ++ E F + ++ K +G R+L+Q+P ECL+ F+
Sbjct: 65 TLFGTCSREKLWDYFDLD---NEYSVDFKKAPKSVKQILKRRSGVRILQQEPFECLISFI 121
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLG----NVEGFEFHEFPSLERLSLVSEVELRNAGFG 210
CSSNNNI RIT+M++ + + G+ L N E F+F+ FPS+++L + +L+ G G
Sbjct: 122 CSSNNNITRITRMLNEIKRNFGTFLAKSEVNNEIFDFYAFPSVDQLGRATPEQLKKLGLG 181
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YRS ++I TV++L S+ G WL SL+ D AL +LP
Sbjct: 182 YRS-----------------EFIFRTVEILNSR---GLNWLYSLKNEDSDTCKSALTSLP 221
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
GVG KVA C++LFSL + +PVD H+ KIA + G L+ R+ AF
Sbjct: 222 GVGRKVADCVSLFSLGKRDVVPVDVHIQKIANT-MFGVKCGKSLSDSDYERIGSAF 276
>gi|193636542|ref|XP_001952017.1| PREDICTED: n-glycosylase/DNA lyase-like [Acyrthosiphon pisum]
Length = 312
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 152/288 (52%), Gaps = 32/288 (11%)
Query: 44 TQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS--PSEPA 101
++ +++L GQ+FRWK+T P +Y+G LK QN D HI P+
Sbjct: 7 SRDQVNLKCVLLGGQSFRWKETDPDEYSGVFAGGFWVLK--QNDD---HIAYKRVPATDL 61
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+ L ++L M L ++ +S D F + A+ G R+L+Q+PVE +L F+CSSNNN
Sbjct: 62 DEKMLSNYLRMDEPLDAYYKEWSICDPIFKKSAERFKGIRMLKQEPVENILSFICSSNNN 121
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSL 219
I RI+ MV+ + +L + ++G ++ FPS+E L + EVE L+ A FGYR
Sbjct: 122 ITRISSMVEKMCTL--YGKKIDGTNYYAFPSIETL-IGQEVEDDLKAAKFGYR------- 171
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
AK+I T + + G W+ LR ++ A L LPGVGPKVA C
Sbjct: 172 ----------AKFIRQTAEKMF--ELGSNIWIDKLRNMNYDSAKKELMLLPGVGPKVADC 219
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
I L SL+ A+PVDTHV++IA LP L + +T + + + + F
Sbjct: 220 ICLMSLEHLEAVPVDTHVFQIARDNYLPHLKKYKTVTQPVYNEIGDCF 267
>gi|367042856|ref|XP_003651808.1| hypothetical protein THITE_2112505 [Thielavia terrestris NRRL 8126]
gi|346999070|gb|AEO65472.1| hypothetical protein THITE_2112505 [Thielavia terrestris NRRL 8126]
Length = 441
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 151/322 (46%), Gaps = 77/322 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S W L L+ +EL + T GQ+FRW+K ++ + +ISL+ Q+ D ++ T
Sbjct: 7 SDWRKLPLSLAELCIDTTLRCGQSFRWRKIKD-EWHCVLRGRIISLR--QDSDHLHYRAT 63
Query: 96 SPS---------------------------EPAAKSALLDFLNMGISLGELWEGFSASDC 128
P E +S L + + SL L++ ++ASD
Sbjct: 64 WPEPTTGPAPGPMTPPPSVPPARGKDVGAEEDDTESLLRSYFALSHSLSSLYKHWAASDA 123
Query: 129 RFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLGNVEGFEF 187
FA A G R+L QDP E L+ F+CSSNNNI+RI++MV A G ++G++EG F
Sbjct: 124 NFARRAPAFTGIRILNQDPWETLVAFICSSNNNISRISQMVLKLCAHYGPYIGSIEGEPF 183
Query: 188 HEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
H+FP E LS SEVE LR GFGYR AKYI T ++ S+
Sbjct: 184 HDFPGPEVLS-GSEVEAHLRQLGFGYR-----------------AKYIAETARIVSSEKP 225
Query: 246 GGAEWLLSLRK----------LDLQEAID--------------ALCTLPGVGPKVAACIA 281
G WLL LR L E D AL L GVGPKVA C+
Sbjct: 226 AG--WLLQLRNPACAALGAPLSSLSEGDDETAAAPPTYRTAHEALLQLAGVGPKVADCVC 283
Query: 282 LFSLDQHHAIPVDTHVWKIATR 303
L L A+PVDTHVW+IA R
Sbjct: 284 LMGLGWGEAVPVDTHVWQIAQR 305
>gi|119584383|gb|EAW63979.1| 8-oxoguanine DNA glycosylase, isoform CRA_b [Homo sapiens]
Length = 354
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 129/250 (51%), Gaps = 25/250 (10%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E LR
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLR 197
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
G GYR A+Y++ + + + GG WL LR+ +EA A
Sbjct: 198 KLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKA 239
Query: 266 LCTLPGVGPK 275
LC LPGVG K
Sbjct: 240 LCILPGVGTK 249
>gi|403345092|gb|EJY71905.1| Endonuclease III/similar to 8-oxoguanine DNA glycosylase isoform 1b
[Oxytricha trifallax]
Length = 303
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 131/241 (54%), Gaps = 42/241 (17%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
++N I + +L++ ++ D RF+++A+ + G R LRQDP EC + F+CS NNI RIT+M
Sbjct: 47 YINYDIKVLDLYDHWAKRDKRFSQIAEPIHGVRCLRQDPWECTVSFICSQCNNIKRITQM 106
Query: 169 VDFLAS----------------LGSHLGNVEGFE-----FHEFPSLERLSLVSEVELRNA 207
+D L G+ + ++ + ++FP++E++S VSE ELR+
Sbjct: 107 LDTLRQKVSKNIFVLMNKLILQYGTKICEIDQDDGSKKSIYKFPTIEQMSQVSEKELRDL 166
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
FGYR AKY+ ++ K GG +WL SLR +E + L
Sbjct: 167 KFGYR-----------------AKYLVANAKMMNEK--GGKQWLESLRGKSNEEVREQLI 207
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR--YLLPELAGVRLTPKLCSRVAEA 325
TL G+G KVA CIALFSLD ++IPVDTHV++IA + Y+ L KL ++ EA
Sbjct: 208 TLNGIGNKVADCIALFSLDCANSIPVDTHVFQIAQKLGYVQGMKKDANLNQKLYMQIVEA 267
Query: 326 F 326
F
Sbjct: 268 F 268
>gi|8670536|ref|NP_058434.1| N-glycosylase/DNA lyase isoform 2b [Homo sapiens]
gi|119584392|gb|EAW63988.1| 8-oxoguanine DNA glycosylase, isoform CRA_k [Homo sapiens]
gi|311976730|gb|ADQ20118.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976738|gb|ADQ20125.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976746|gb|ADQ20132.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976754|gb|ADQ20139.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976762|gb|ADQ20146.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976770|gb|ADQ20153.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976778|gb|ADQ20160.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976786|gb|ADQ20167.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976794|gb|ADQ20174.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976802|gb|ADQ20181.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976810|gb|ADQ20188.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976818|gb|ADQ20195.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976826|gb|ADQ20202.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976834|gb|ADQ20209.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976842|gb|ADQ20216.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976850|gb|ADQ20223.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976858|gb|ADQ20230.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976866|gb|ADQ20237.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976874|gb|ADQ20244.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976882|gb|ADQ20251.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976890|gb|ADQ20258.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976898|gb|ADQ20265.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976906|gb|ADQ20272.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976914|gb|ADQ20279.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976922|gb|ADQ20286.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976930|gb|ADQ20293.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976938|gb|ADQ20300.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976946|gb|ADQ20307.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976954|gb|ADQ20314.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976962|gb|ADQ20321.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976970|gb|ADQ20328.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976978|gb|ADQ20335.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976986|gb|ADQ20342.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976994|gb|ADQ20349.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977002|gb|ADQ20356.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977010|gb|ADQ20363.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977018|gb|ADQ20370.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977026|gb|ADQ20377.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977034|gb|ADQ20384.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977042|gb|ADQ20391.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 357
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/251 (33%), Positives = 129/251 (51%), Gaps = 25/251 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPK 275
ALC LPGVG K
Sbjct: 239 ALCILPGVGTK 249
>gi|426339381|ref|XP_004033629.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Gorilla gorilla
gorilla]
Length = 357
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/251 (33%), Positives = 130/251 (51%), Gaps = 25/251 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPSHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L++ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGGKSQASRPTPDELEAVRKYFQLDVTLAQLYDHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPK 275
ALC LPGVG K
Sbjct: 239 ALCILPGVGTK 249
>gi|332816037|ref|XP_003309655.1| PREDICTED: N-glycosylase/DNA lyase isoform 5 [Pan troglodytes]
Length = 357
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/251 (33%), Positives = 129/251 (51%), Gaps = 25/251 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPK 275
ALC LPGVG K
Sbjct: 239 ALCILPGVGTK 249
>gi|440798926|gb|ELR19987.1| base excision DNA repair protein, HhHGPD subfamily protein
[Acanthamoeba castellanii str. Neff]
Length = 470
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 153/337 (45%), Gaps = 61/337 (18%)
Query: 32 TNKPSKWTPLNLTQ-SELSLPLTFPTGQTFRW---------------KKTGPLQYTGPIG 75
+ + + W L L EL L L+ P+GQ+FRW K+T +
Sbjct: 22 SQESTSWHRLQLDSPDELRLHLSLPSGQSFRWSTCLVDGREVWAGVIKQTAYFLHQPLDD 81
Query: 76 PHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASD-------C 128
P L+ ++L + D + A + L D+ +G+ + + W ++A
Sbjct: 82 PALVLFRYLNSSDA--DRESGVEVEAKRQELEDYFQLGMRMEDAWNRWTAQREKDPPIVA 139
Query: 129 RFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLG------- 180
F + G R LR DPVECL F+CS NNNIARIT+MV+ L + G +G
Sbjct: 140 HFRDACLRFRGLRTLRTDPVECLFSFICSQNNNIARITQMVNALCTRYGDKIGEAVVQRT 199
Query: 181 ---NVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYI-TGT 236
E F+ FP+LE L+ V E LR+ FGYR AKYI T
Sbjct: 200 SDTKAETRAFYAFPTLETLAGVPERALRDMSFGYR-----------------AKYIPTAA 242
Query: 237 VDVLQSKH---SGGAEWLLSLRK--LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
VL G +WL LR+ + + A L +L GVG KVA C+ L SLD+ AI
Sbjct: 243 QQVLDKDDPELGAGRDWLYGLRERAVSRERAQKELTSLLGVGKKVADCVCLVSLDKLDAI 302
Query: 292 PVDTHVWKIATRYL--LPELAGVRLTPKLCSRVAEAF 326
PVDTHV +IA R++ L A LT L ++ F
Sbjct: 303 PVDTHVRQIAQRFMPSLKRKANSSLTDALYGEISSFF 339
>gi|327297190|ref|XP_003233289.1| N-glycosylase/DNA lyase [Trichophyton rubrum CBS 118892]
gi|326464595|gb|EGD90048.1| N-glycosylase/DNA lyase [Trichophyton rubrum CBS 118892]
Length = 426
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 158/320 (49%), Gaps = 68/320 (21%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++T + S+W L LT EL + T GQ+FRW+K ++T + ++SLK Q+ D
Sbjct: 1 MSTGRFSEWRKLPLTLKELCIDTTLRCGQSFRWRKLDD-EWTCSLYGRVLSLK--QDSDS 57
Query: 90 CYH-------IHTS----------------PSEPAAKSALLDFLNMGISLGELWEGFSAS 126
++ + +S P + ++ + + N+ +L +L+E ++AS
Sbjct: 58 LWYRSFKPSSVESSTLPTPPVSNATTQRGTPDDDDTEALIHHYFNLEYNLSDLYEQWAAS 117
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F + A AG R++RQD E L+ F+CSSNNNIARI++MV+ L + G +G +
Sbjct: 118 DPIFKKKAVQFAGIRIMRQDAWETLVSFICSSNNNIARISQMVEKLCINYGPFIGQLGDQ 177
Query: 186 EFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++++FP L+ E LR GFGYR AKYI T ++ ++
Sbjct: 178 KYYDFPEPSALTGTGVESHLRELGFGYR-----------------AKYIYQTAKIVANQR 220
Query: 245 SGGAEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALF 283
G WL SLR K +EA + L TL GVGPKVA C+ L
Sbjct: 221 EPG--WLNSLRNPEKPAFNEKPATPGSGAKGDKSGYREAHEQLLTLQGVGPKVADCVCLM 278
Query: 284 SLDQHHAIPVDTHVWKIATR 303
L ++PVDTHVW+IA R
Sbjct: 279 GLGWGESVPVDTHVWQIAQR 298
>gi|406602046|emb|CCH46366.1| N-glycosylase/DNA lyase [Wickerhamomyces ciferrii]
Length = 345
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 146/284 (51%), Gaps = 28/284 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
++L+ +T + + K ++ IG +++ LK Q GD H + P
Sbjct: 2 TQLNWSITKVSAKELSLAKNINNVWSCSIGKNVLFLK--QTGDDL-HYASYPESTKTFEL 58
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+ D+ N+ I + +L++ +S SD F + A G R+LRQDP E L+ F+CSSNNNI RI
Sbjct: 59 IKDYFNLDIKVLDLYDKWSISDKNFNKNAIGFEGVRMLRQDPWENLISFICSSNNNIKRI 118
Query: 166 TKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV-SEVELRNAGFGYRSAPQSSLLFSV 223
++M D L G + N EG +++ FPS LS +E +LRN FGYR
Sbjct: 119 SQMCDNLCLHFGDFIINHEGIDYYSFPSPSILSQEGTEEKLRNLSFGYR----------- 167
Query: 224 RRSFKQAKYITGT-VDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
AKYI T + V +SK L LR L +EA L GVGPKVA C+ L
Sbjct: 168 ------AKYIYKTALMVHESKQD-----LFKLRSLPYEEAHSELIKFTGVGPKVADCVCL 216
Query: 283 FSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
SLD+H AIPVDTHV++IA R + G +T + V + F
Sbjct: 217 MSLDKHDAIPVDTHVYQIAKRDYKLKSKGDTVTKQTYEIVRKFF 260
>gi|428673210|gb|EKX74123.1| 8-oxoguanine DNA-glycosylase, putative [Babesia equi]
Length = 293
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 144/295 (48%), Gaps = 33/295 (11%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
+W LN++ L L TGQ+F W G + G +G + +K ++ + IH
Sbjct: 5 EWRDLNVSSRLLRPSLLLTTGQSFSWHSVGDNHWVGVLGNSVYEIKENKDTTLYRCIHGE 64
Query: 97 PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
+E + L D+ ++ S ++ + G R+L+Q+P+ECL+ F+C
Sbjct: 65 ANEDS----LCDYFDLKHEYAVDMNKISKD---VQKIFQERQGVRILQQEPLECLISFIC 117
Query: 157 SSNNNIARITKMV-DFLASLGSHLGNV----EGFEFHEFPSLERLSLVSEVELRNAGFGY 211
SSNNNI+RIT+MV D G+ L + + F+ FPS+ +L V LR GFGY
Sbjct: 118 SSNNNISRITRMVGDLKREYGTFLASKHYKDKKMSFYSFPSIAQLKSVDTETLRKMGFGY 177
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
R A +I TVD+L+S+ G +WL LR D + + L +LPG
Sbjct: 178 R-----------------AGFIVKTVDILKSR---GLDWLYDLRLRDCVTSRNELMSLPG 217
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
VG KVA C+ LF L + +PVD H+ I+ R L +G LT C + + F
Sbjct: 218 VGRKVADCVLLFGLGKREVVPVDVHIRNISQR-LFGIKSGRTLTDAQCEVITDTF 271
>gi|307170596|gb|EFN62783.1| N-glycosylase/DNA lyase [Camponotus floridanus]
Length = 332
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/291 (35%), Positives = 151/291 (51%), Gaps = 28/291 (9%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS-PSEPAAK 103
+SEL+L LT GQ+FRW Y G + +L + Y + S
Sbjct: 15 RSELNLALTLHGGQSFRWTDCDD-GYKGIFDGCIWTLSQ-NETHLLYTVQGQLKSSVHYD 72
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
L ++L + ISL E ++ ++ D F + R+L+QD VE L F+CSSNNNI+
Sbjct: 73 DILSEYLRLSISLKEHYKKWAEVDTHFQKNLDESNAVRILKQDVVETLFSFICSSNNNIS 132
Query: 164 RITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLF 221
RI+ MV+ L L G + ++E E+++FP++E L S E L+ FGYR+A
Sbjct: 133 RISNMVEKLCLLFGQKICSIEDREYYDFPTIEALKEKSVESILKREKFGYRAA------- 185
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK---LDLQEAIDALCTLPGVGPKVAA 278
YI T + L + GG +WLL+L+K + Q A + L TLPG+G KVA
Sbjct: 186 ----------YIAKTAERLSA--LGGKKWLLNLQKENNVSYQTAREQLMTLPGIGLKVAD 233
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
CI L SL AIPVDTH+++IA LP L + +TPK+ + V+ E
Sbjct: 234 CICLMSLGHLDAIPVDTHIFQIAQAKYLPHLRKQKTVTPKIHAEVSNYLGE 284
>gi|350399207|ref|XP_003485455.1| PREDICTED: N-glycosylase/DNA lyase-like [Bombus impatiens]
Length = 331
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 101/290 (34%), Positives = 149/290 (51%), Gaps = 28/290 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
SEL L +T GQ+FRW Y G G ++ +L N + Y++ +
Sbjct: 18 SELDLGITLKGGQSFRWFNYNN-GYRGVFGGYVWTLTQHDNY-LFYNVQGPLVDSKNYDT 75
Query: 106 LL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L D+ + L +L E ++A+D F E G R+L QD VE + F+CSSNNNI R
Sbjct: 76 ILSDYFRLNECLKDLCEKWTAADPHFKESLNKTNGVRILNQDVVENVFSFICSSNNNIQR 135
Query: 165 ITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFS 222
I+ MV+ L G + ++EG ++++FPS+E L S E +LR FGYR
Sbjct: 136 ISGMVEKLCMFFGEKICSIEGKDYYKFPSIEALASKDVESKLREEKFGYR---------- 185
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD---LQEAIDALCTLPGVGPKVAAC 279
A+YI T + GG +WL +L K + +A + L LPG+GPKVA C
Sbjct: 186 -------AEYIVNT--AMHLIDLGGKQWLSNLHKKNNTSYAKAREQLIILPGIGPKVADC 236
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
+ L SL AIPVDTH+++IA LP L + +TPK+ + ++ E
Sbjct: 237 VCLMSLGHLEAIPVDTHIFQIARARYLPHLKQYKTVTPKIHAEISNYLRE 286
>gi|328875371|gb|EGG23736.1| 8-oxoguanine DNA-glycosylase [Dictyostelium fasciculatum]
Length = 477
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 152/333 (45%), Gaps = 75/333 (22%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPL-------------QYTGPIGPHLISLKHLQ- 85
+ L + L+L T +GQ+F W++T P ++ G IG H + L+
Sbjct: 91 SIELINNTLNLRKTLLSGQSFVWRRTKPSASAADQDINTDNERWLGVIGKHAVQLQRRDT 150
Query: 86 ---------NGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCR------- 129
N + ++ +TS +S + D+ N+ L L+E +S + +
Sbjct: 151 YLDYQFIDSNNRINFN-NTSIGPDERRSIINDYFNLKYHLPLLFETWSNGETKETNESKE 209
Query: 130 -----------------------FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
F ++ G R+LRQ P++CL F+CS NNNI RIT
Sbjct: 210 DKENEQDEEEEDKDASLHSLNKEFIRVSPSFIGLRLLRQYPLDCLFSFICSQNNNITRIT 269
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
KMV+ L + G H+ +G FP+LE+L + E L + GFGYRS
Sbjct: 270 KMVNSLCETYGDHITTFQGHRLCSFPTLEQLLTIKESSLNDLGFGYRS------------ 317
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
K+I Q K GG +WL SLR + + + L +L GVG KVA C+ LFSL
Sbjct: 318 -----KFIVKAAQ--QVKEKGGLQWLQSLRTSNHEHSHKELISLMGVGQKVADCVCLFSL 370
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
D+ +P+DTH+W I+ ++ P L LT K+
Sbjct: 371 DKFDIVPIDTHIWTISKKH-FPSLKNKSLTQKV 402
>gi|378731410|gb|EHY57869.1| N-glycosylase/DNA lyase [Exophiala dermatitidis NIH/UT8656]
Length = 422
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/321 (31%), Positives = 148/321 (46%), Gaps = 75/321 (23%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
K + W + ++ +EL + T GQ+FRW++ + + ++ L + H+
Sbjct: 3 KITDWRKIPVSLNELCIATTLRCGQSFRWRQCDDQTWACTLRGRIVQLSQDTS-----HL 57
Query: 94 HTSPSEPAA----------------------------KSALLDFLNMGISLGELWEGFSA 125
H PAA KS ++ +LN+ +L L+ +SA
Sbjct: 58 HYRSIWPAAIETPPTPPSSIPPSAAHDDPETAELDDTKSLIMHYLNLEPNLETLYAQWSA 117
Query: 126 SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEG 184
SD F + A G R+LRQD E L+ F+CSSNNNI RI++MVD L + G ++G ++G
Sbjct: 118 SDANFKKKAPKFTGVRILRQDAWEALIGFICSSNNNIIRISQMVDKLCTHYGKYIGTLDG 177
Query: 185 FEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK 243
+H+FP L+ + E LR GFGYR AKYI+ T ++ ++
Sbjct: 178 RAYHDFPDPSALAGKNVETHLRELGFGYR-----------------AKYISRTAQIVANE 220
Query: 244 HSGGAEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIAL 282
G WL LR + +EA + L L GVGPKVA C+ L
Sbjct: 221 REQG--WLDGLRNPESPAYGRKASPGGEVKPEGRDGYREAHEKLLELQGVGPKVADCVCL 278
Query: 283 FSLDQHHAIPVDTHVWKIATR 303
L A+PVDTHVW+IA R
Sbjct: 279 MGLGWGEAVPVDTHVWQIAQR 299
>gi|195059553|ref|XP_001995660.1| GH17644 [Drosophila grimshawi]
gi|193896446|gb|EDV95312.1| GH17644 [Drosophila grimshawi]
Length = 328
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 159/306 (51%), Gaps = 32/306 (10%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKK--TGPLQYTGPIGPHLISLKHLQNGDV 89
TN + + L S L T GQ+FRW+K P G + + + ++ +
Sbjct: 2 TNTVKQSGIIRLAASVCDLERTLLGGQSFRWRKLCDAPTVKYGGVALNTYWVLTPESEYI 61
Query: 90 CYHIHTSPSEPAAK-SALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
Y H + ++ A SALL D+L L E + +D +F + R+L Q+P
Sbjct: 62 GYEAHATAAKQAPNFSALLSDYLRAEFDLLEQQTAWMHADEQFGKFVG--KPVRMLSQEP 119
Query: 148 VECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-----E 201
+E ++ F+CS NNNI RI+ M+ +L A+ G +G+ G + + FP+LE L+ +
Sbjct: 120 LENIICFMCSQNNNIKRISSMIQWLCAAYGHKIGHFHGQDEYTFPTLEALTGERTCSQLD 179
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
ELR A FGYR AK+I +++ +Q + GG W LR+L +
Sbjct: 180 SELRAAKFGYR-----------------AKFIARSLEQIQER--GGNVWFEQLRQLPYVD 220
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCS 320
A +AL LPG+G KVA CI L SL A+P+DTH++K+A R+ LP LAG + +T K+ +
Sbjct: 221 AREALVQLPGIGYKVADCICLMSLGHLEAVPIDTHIYKLAQRHYLPHLAGQKSVTTKIYA 280
Query: 321 RVAEAF 326
VA+ F
Sbjct: 281 EVAQHF 286
>gi|303390272|ref|XP_003073367.1| 8-oxoguanine DNA glycosylase [Encephalitozoon intestinalis ATCC
50506]
gi|303302513|gb|ADM12007.1| 8-oxoguanine DNA glycosylase [Encephalitozoon intestinalis ATCC
50506]
Length = 302
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 145/294 (49%), Gaps = 42/294 (14%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTS 96
W L T+ + L LT +GQ F +K+TG +YTG +G L+S LQ+G V Y + +
Sbjct: 6 WISLR-TEEAIDLELTLFSGQVFSFKRTGEKEYTGVLGECLVSF--LQDGSRVLYKVLSG 62
Query: 97 PSEPA-AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
P + + F + I L L E + L G R LR D + + F+
Sbjct: 63 KRSPKEVEEDIFYFFTLDIKLKPLLEKWKVDS------DGLLNGLRALRYDLIPTIFSFI 116
Query: 156 CSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
CSSNNNI+RIT+MV+FL S G + +G +FH FPSLE L + E EL+ GFGYRS+
Sbjct: 117 CSSNNNISRITRMVEFLYSKGEFIMKYKGIDFHRFPSLEELVSI-EGELKAKGFGYRSS- 174
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI-DALCTLPGVGP 274
YI LQ +S KL + I +AL ++ GVG
Sbjct: 175 ----------------YICNAATYLQENYS----------KLQQRSRIREALVSIKGVGE 208
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
K+A CI L + + A+P+DTH++K + + + + G LT + V E + E
Sbjct: 209 KIADCILLIGVGELSAVPIDTHIFKHSKK--MFRIHGKTLTRRAYGAVQELYKE 260
>gi|68481742|ref|XP_715239.1| hypothetical protein CaO19.7190 [Candida albicans SC5314]
gi|77023104|ref|XP_888996.1| hypothetical protein CaO19_7190 [Candida albicans SC5314]
gi|46436852|gb|EAK96208.1| hypothetical protein CaO19.7190 [Candida albicans SC5314]
gi|76573809|dbj|BAE44893.1| hypothetical protein [Candida albicans]
gi|238883521|gb|EEQ47159.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 353
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 142/283 (50%), Gaps = 40/283 (14%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS- 96
W L +T+ E+SL GQTFRWK + ++ I ++ LK HIH S
Sbjct: 7 WKSLPITEVEVSLQKVLRCGQTFRWKNINNV-WSFTIQDRIVLLKQDST-----HIHYSQ 60
Query: 97 ---PSEPAAKSALLDFLN----MGISLGELWEGFSASD--CRFAELAKY--LAGARVLRQ 145
LDF+N + L +L+ + R +++ + AG R+LRQ
Sbjct: 61 VMKKDTNGNHKQTLDFINDYFVLDTKLTDLYTHWKLQHEPFRSKKISPFDSFAGIRILRQ 120
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVE 203
DP ECL+ F+CSSNNN+ RI+KM + L + G ++ + EG E++ FP+ E LS + E +
Sbjct: 121 DPWECLISFICSSNNNVKRISKMCENLCINFGDYINDFEGHEYYTFPTPEALSKPNVEAK 180
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG---GAEWLLSLRKLDLQ 260
LR+ GFGYR AKYI T + E L LR+
Sbjct: 181 LRDLGFGYR-----------------AKYIYQTACKFTDNNKYPDITIENLNLLREAKYT 223
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
A + L L GVGPKVA CI L SLD+H +P+DTHV++IA R
Sbjct: 224 TAHEFLLQLTGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVR 266
>gi|170043233|ref|XP_001849300.1| N-glycosylase/DNA lyase [Culex quinquefasciatus]
gi|167866625|gb|EDS30008.1| N-glycosylase/DNA lyase [Culex quinquefasciatus]
Length = 365
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 156/327 (47%), Gaps = 56/327 (17%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPL--QYTGPIGPHLISLKHLQN-------GD 88
W + S+L L T GQ+FRWK G ++ G + LK ++ G+
Sbjct: 6 WQKIPCLSSQLQLKTTLTGGQSFRWKPHGTCSDEFVGVFANIVWILKQSKSELLYRIVGE 65
Query: 89 VCYHIHTSPSE-----------PAAKSALLDFLN-----------MGISLGELWEGFSAS 126
+ Y + + + P LL LN + + L E + ++
Sbjct: 66 LPYPVKETSAAIRFKVSEPERAPGVGGDLLYSLNYYEQLLRIYFRLDVDLEECYRQWTKC 125
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGF 185
F A R L QDPVE L+ F+CS NNNI+RI+ +V+ + S G + + +G
Sbjct: 126 HVHFESSADQFYAVRQLDQDPVENLVSFICSQNNNISRISGLVEKICSHYGEKICDYDGT 185
Query: 186 EFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++ FP + RL+ E LR+ FGYR AKYI + + + SK
Sbjct: 186 TYYNFPDVARLAPPAVEAHLRDLSFGYR-----------------AKYIQKSAEEIVSK- 227
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
GG EW +L++LD +EA L TL G+GPKVA CI L SL+ AIPVDTHV++IA Y
Sbjct: 228 -GGLEWFRTLQRLDYKEAHRELLTLTGIGPKVADCICLMSLNHLQAIPVDTHVFQIAKHY 286
Query: 305 LLPELAGVR---LTPKLCSRVAEAFCE 328
LP+L G + +T +L + + F E
Sbjct: 287 -LPQLGGKQTKSITGRLYGEIGDKFRE 312
>gi|440633241|gb|ELR03160.1| hypothetical protein GMDG_05986 [Geomyces destructans 20631-21]
Length = 434
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 151/316 (47%), Gaps = 61/316 (19%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH------ 83
+ + + +W L ++ ++LSL T GQ+FRWKK ++T + +ISLK
Sbjct: 1 MNSIRTGEWQKLAVSLADLSLDTTLRCGQSFRWKKLNEDEWTCSLRGRVISLKQDPKQDP 60
Query: 84 -----------------LQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSAS 126
L +G + + ++ L + N+ +L EL+E +S +
Sbjct: 61 NHVLYKVTWPSNSLLAKLSSGGKGVKVEDDSIKDDTEALLRHYFNLEPNLTELYERWSKA 120
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F + A G R+L+QD E L+ F+CSSNNNIARI++MV+ L S G +G+V
Sbjct: 121 DLNFKKRAPKFTGVRILKQDAWEALVGFICSSNNNIARISQMVNSLCLSYGPLIGHVHDQ 180
Query: 186 EFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
FH+FP+ E L+ E +LR GFGYR AKY+ T V+ SK
Sbjct: 181 AFHDFPTPESLAGPEVESQLRKLGFGYR-----------------AKYLARTA-VIVSKE 222
Query: 245 SGGAEWLLSLRKLD-----------------LQEAIDALCTLPGVGPKVAACIALFSLDQ 287
WL SL + ++A + L L GVGPKVA C+ L L
Sbjct: 223 K-PENWLKSLSNTEPFDETEWKTLEPGGRPGYRKAHEELLALQGVGPKVADCVCLMGLGW 281
Query: 288 HHAIPVDTHVWKIATR 303
++PVDTHVW+IA R
Sbjct: 282 GESVPVDTHVWQIAQR 297
>gi|212539734|ref|XP_002150022.1| DNA N-glycosylase, putative [Talaromyces marneffei ATCC 18224]
gi|210067321|gb|EEA21413.1| DNA N-glycosylase, putative [Talaromyces marneffei ATCC 18224]
Length = 403
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 150/307 (48%), Gaps = 61/307 (19%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
++W L ++ SEL + T GQ+FRW++ ++ + L+SL+ Q+ Y+
Sbjct: 6 TEWRKLPISLSELCINTTLRCGQSFRWQQRPDSDEWRCVLRGRLVSLR--QDQSYLYYRS 63
Query: 95 TSPSEPAA---------------KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG 139
+P +P + + + +LN+ +L +L+ +S SD F + A G
Sbjct: 64 CTPHKPTSSVPTMNGTVNTSDETREIVTHYLNLTSNLTDLYNQWSDSDPNFKKKAPNFTG 123
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSL 198
R+LRQD E L+ F+CSSNNNIARI++MV+ L A+ G + ++ +H+FP+ E L+
Sbjct: 124 IRILRQDAWEALVSFICSSNNNIARISQMVEKLCANYGDLVATIDDRSYHDFPAPEALTG 183
Query: 199 VS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR-- 255
E LR GFGYR AKYI T ++ K G WL SLR
Sbjct: 184 KDVEARLRELGFGYR-----------------AKYIYQTAVMVSEKEKG---WLDSLRNP 223
Query: 256 -------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
+ ++A + L L GVGPKVA C+ L L A+PVDTH
Sbjct: 224 ESPAFGVAPSQGGEMRPEGREGYRDAHEKLLELQGVGPKVADCVCLMGLGWGEAVPVDTH 283
Query: 297 VWKIATR 303
VW+IA R
Sbjct: 284 VWQIAQR 290
>gi|195425861|ref|XP_002061181.1| GK10343 [Drosophila willistoni]
gi|194157266|gb|EDW72167.1| GK10343 [Drosophila willistoni]
Length = 325
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 154/297 (51%), Gaps = 32/297 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQ--NGDVCYHIHTS-- 96
+ L S L T GQ+FRW+K + T G L + L N + +
Sbjct: 8 IRLAASACDLERTLLGGQSFRWRKLSNERQTKYCGVALDTFWVLAPTNEHIEFEAFADSV 67
Query: 97 PSEPAAKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
P + A+L ++L L ++ +D +FA + R+L Q+P+E ++ F+
Sbjct: 68 PEQTYVYEAMLSEYLRADFDLEREQCSWTKNDEKFATFSG--KPVRILSQEPLENIICFI 125
Query: 156 CSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGFG 210
CS NNNI RI+ M+ F +S G+ +G+ G + + FP+L++ + +++LR A FG
Sbjct: 126 CSQNNNIKRISAMIQWFCSSYGTKIGHFHGQDEYTFPALQKFKDIDCKQLDLDLRAAKFG 185
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YRS K+I T++ +Q + GG EW LR+L EA +AL LP
Sbjct: 186 YRS-----------------KFIAQTLEQIQKR--GGDEWFEKLRQLPYNEAREALVELP 226
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
G+G KVA CI L SL A+PVDTH++K+A ++ LP L G + +T K+ + VA+ F
Sbjct: 227 GIGYKVADCICLMSLGHLIAVPVDTHIYKLAQQHYLPHLTGQKSVTNKIYNEVAQHF 283
>gi|340714445|ref|XP_003395739.1| PREDICTED: LOW QUALITY PROTEIN: n-glycosylase/DNA lyase-like
[Bombus terrestris]
Length = 331
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 147/290 (50%), Gaps = 28/290 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
SEL L +T GQ+FRW Y G G ++ +L + + Y++ + A
Sbjct: 18 SELDLGITLKGGQSFRWSNYNN-GYRGVFGGYVWTLTQ-HDSHLFYNVQGPLVDSKNYDA 75
Query: 106 LL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L D+ + L +L + ++A+D F E G R+L QD VE + F+CSSNNNI R
Sbjct: 76 ILSDYFRLNERLQDLCKKWTAADPHFKESLNKTNGVRILNQDVVENVFSFICSSNNNIQR 135
Query: 165 ITKMVDFLASL-GSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFS 222
I+ MV+ L G + ++EG ++++FPS+E L S E +LR FGYR
Sbjct: 136 ISGMVEKLCMFFGEKICSIEGKDYYKFPSIEALASKDVESKLRKEKFGYR---------- 185
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD---LQEAIDALCTLPGVGPKVAAC 279
A YI + GG +WL +L K + +A + L LPG+GPKVA C
Sbjct: 186 -------AGYIANAA--MHLIDLGGKQWLSNLHKKNNTSYAKAREQLIILPGIGPKVADC 236
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
+ L SL AIPVDTH+++IA LP L + +TPK+ ++ E
Sbjct: 237 VCLMSLGHLEAIPVDTHIFQIARARYLPHLKQYKTVTPKIHEEISNYLRE 286
>gi|326483751|gb|EGE07761.1| N-glycosylase/DNA lyase [Trichophyton equinum CBS 127.97]
Length = 426
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 156/320 (48%), Gaps = 68/320 (21%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + S+W L LT +EL + T GQ+FRW+K ++T + ++SLK Q+ D
Sbjct: 1 MSIGRFSEWRKLPLTLTELCIDTTLRCGQSFRWRKLDD-EWTCSLYGRVLSLK--QDSDS 57
Query: 90 CYHIHTSPS-----------------------EPAAKSALLDFLNMGISLGELWEGFSAS 126
++ PS + ++ + + N+ +L +L+E ++AS
Sbjct: 58 LWYRSFKPSSVESTTLPTPPISNATTQRGTPDDDDTEALIHHYFNLEYNLSDLYEQWAAS 117
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F + A AG R++RQD E L+ F+CSSNNNIARI++MV+ L + G +G +
Sbjct: 118 DPNFKKKAVQFAGIRIMRQDAWETLVSFICSSNNNIARISQMVEKLCINYGPFIGQLGDQ 177
Query: 186 EFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++++FP L+ E LR GFGYR AKYI T +++ ++
Sbjct: 178 KYYDFPEPSALTGNGVESHLRELGFGYR-----------------AKYIYQTANIVANQR 220
Query: 245 SGGAEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALF 283
G WL SLR K +EA + L L GVGPKVA C+ L
Sbjct: 221 EPG--WLNSLRNPEKPAFNEKPATPGSGAKGDKSGYREAHEQLLALQGVGPKVADCVCLM 278
Query: 284 SLDQHHAIPVDTHVWKIATR 303
L ++PVDTHVW+IA R
Sbjct: 279 GLGWGESVPVDTHVWQIAQR 298
>gi|322799498|gb|EFZ20806.1| hypothetical protein SINV_05704 [Solenopsis invicta]
Length = 346
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 97/291 (33%), Positives = 152/291 (52%), Gaps = 31/291 (10%)
Query: 46 SELSLPLTFPTGQTFRWK--KTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAK 103
+EL+L LT GQ+FRW TG Y G + +L + Y + +
Sbjct: 32 TELNLALTLHGGQSFRWTVCDTG---YRGIFNGCIWTLSQ-NKTHLLYTVQGQLKDSVNY 87
Query: 104 SALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+L ++ + +SL + ++ ++ +D F + R+L+QD +E L F+CSSNNNI
Sbjct: 88 DNILSEYFRLSVSLQDYYKEWAKADAHFQKCLDENNAVRILKQDVIENLFSFICSSNNNI 147
Query: 163 ARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLL 220
+RI+ MV+ + SL G + ++E E+++FP++E L E L+ FGYR+A
Sbjct: 148 SRISNMVEKMCSLFGRKICSIEDKEYYDFPTIEVLKKKHVESILKREKFGYRAA------ 201
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK--LDLQEAIDALCTLPGVGPKVAA 278
YI T + L + GG +WLL+L++ + A + L TL G+GPKVA
Sbjct: 202 -----------YIVNTAERLSA--LGGKDWLLNLQRENVSYHNAREQLMTLSGIGPKVAD 248
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
CI L SL AIPVDTH+++IA LP L + +TPK+ + V+ E
Sbjct: 249 CICLMSLGHLDAIPVDTHIFQIAQANYLPHLKKQKTVTPKIHTEVSNYLRE 299
>gi|406867696|gb|EKD20734.1| N-glycosylase/DNA lyase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 406
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/296 (34%), Positives = 157/296 (53%), Gaps = 37/296 (12%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HLQNG---- 87
S+W L ++ +EL + T GQ+FRWKK ++T + ++SLK HL
Sbjct: 6 SEWRKLPVSLAELCIETTLRCGQSFRWKKLRDDEWTCALHGRILSLKQDSTHLHYRTIWP 65
Query: 88 DVCYHIHTSPS-----EPAAKSA-----LLDFLNMGISLGELWEGFSASDCRFAELAKYL 137
+ + +SPS +P+ + A L +LN+ L L+E +S++D F + A
Sbjct: 66 SISRSLKSSPSTNVTSDPSGEDADTEDLLKHYLNLSPDLTSLYEQWSSADSNFKKRAPKF 125
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERL 196
G R+L+QDP E L+ F+CSSNNNI RI++M++ L G +G+V+ H+FP+ L
Sbjct: 126 TGVRILKQDPWEALVGFICSSNNNIVRISQMINNLCLHYGPLVGHVDDIPIHDFPTPSAL 185
Query: 197 SLVS-EVELRNAGFGYRSA--PQSSLLFSVRR------SFKQAKYITGTVDVLQSKHSGG 247
S E LR GFGYR+A +++L+ + + S + G++D + GG
Sbjct: 186 SGPGVESRLRGLGFGYRAAYIAKTALMVAKEKPEGWLESLCNRQPYDGSLDQ-KPLPQGG 244
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
E ++A +AL L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 245 RE--------GYRKAHEALLELQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 292
>gi|308812728|ref|XP_003083671.1| A Chain A, Hogg1 Borohydride-Trapped Intermediate Without (ISS)
[Ostreococcus tauri]
gi|116055552|emb|CAL58220.1| A Chain A, Hogg1 Borohydride-Trapped Intermediate Without (ISS),
partial [Ostreococcus tauri]
Length = 317
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 95/259 (36%), Positives = 135/259 (52%), Gaps = 55/259 (21%)
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ N + L L++ F++ D RF +LA ++ GAR+LRQ+P ECL F+CSSNN+I+RI
Sbjct: 8 EYFNADVRLVTLYDAFASRDGRFRDLATHVDGARMLRQNPSECLFSFICSSNNHISRIHG 67
Query: 168 MVDFLAS-LGSHLGNVEGF----------------------------EFHEFPSLERL-S 197
MV+ + G L E F FPS R+ S
Sbjct: 68 MVEKMCERYGEALPVTEAVVALDDDMERASPAAEASGKEAAAAAKRDVFFSFPSASRIAS 127
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVL--QSKHSGG----AEWL 251
+E ELR GFGYR AK+I GT L ++K GG +L
Sbjct: 128 EATEEELRAMGFGYR-----------------AKFIVGTAKALMDRAKALGGDATPESYL 170
Query: 252 LSLR-KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
+LR + +A AL LPG+GPKV++C+ LFSLD+H AIPVDTHVW++A + PEL+
Sbjct: 171 RTLRDETSYVDAHVALQELPGIGPKVSSCVCLFSLDKHAAIPVDTHVWRLACEHYAPELS 230
Query: 311 GVR-LTPKLCSRVAEAFCE 328
+ +TPK+ + + F E
Sbjct: 231 EAKTVTPKIMRAIEQRFEE 249
>gi|295663749|ref|XP_002792427.1| N-glycosylase/DNA lyase [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279097|gb|EEH34663.1| N-glycosylase/DNA lyase [Paracoccidioides sp. 'lutzii' Pb01]
Length = 412
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 151/317 (47%), Gaps = 73/317 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+W L ++ +EL + T GQ+FRW+K+G +++ + ++SL+ H++
Sbjct: 7 SEWRELPISLAELCINTTLRCGQSFRWQKSGENEWSCALYGRIVSLRQDST-----HLYY 61
Query: 96 SPSEPAAKSALL--------------------------DFLNMGISLGELWEGFSASDCR 129
P S+LL ++ N+ ++L L+E +S +D
Sbjct: 62 RSYFPPTASSLLTPPSSVPPLTTEELPKKLDDTDALISNYFNLELNLKSLYEQWSTADEN 121
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A G R+LRQ+ E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 122 FKKKAPQFTGIRILRQNAWEALISFICSSNNNIARISQMVEKLCLNYGQFIGFIDKKPYH 181
Query: 189 EFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP+ + L+ E LR GFGYR AKYI T ++ + G
Sbjct: 182 DFPAPQALAGRDVEARLRELGFGYR-----------------AKYIYQTALIVANHREEG 224
Query: 248 AEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
WL SLR + ++A D L L GVGPKVA C+ L L
Sbjct: 225 --WLNSLRNPECPSFGAKPLSAGEMMEGGREGYRKAHDCLLELQGVGPKVADCVCLMGLG 282
Query: 287 QHHAIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 283 WGEAVPVDTHVWQIAQR 299
>gi|302414572|ref|XP_003005118.1| N-glycosylase/DNA lyase [Verticillium albo-atrum VaMs.102]
gi|261356187|gb|EEY18615.1| N-glycosylase/DNA lyase [Verticillium albo-atrum VaMs.102]
Length = 412
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 105/304 (34%), Positives = 147/304 (48%), Gaps = 56/304 (18%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HL------- 84
++W L L+ +EL + T GQ+FRW+K ++ + +ISLK HL
Sbjct: 7 TEWRKLPLSLTELCIDTTLRCGQSFRWRKIND-EWCCTLHGRIISLKQDPTHLHYRSIWP 65
Query: 85 ---QNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
Q + T SE + L +L++ + L L+E +S +D F + A G R
Sbjct: 66 AAAQRPQTAPTVKTESSEDDTEELLKHYLSLKLDLTSLYEQWSEADANFRKRAPKFGGVR 125
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS 200
+L QD E L+ F+CSSNNNI+RI++MV L G +G++ FH+FP+ + L+
Sbjct: 126 MLSQDAWEALISFICSSNNNISRISQMVHKLCLHYGPLIGHINEEPFHDFPTPDALTGKQ 185
Query: 201 -EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL----- 254
E LR GFGYR AKYI T V+ S+ G WL SL
Sbjct: 186 VESHLRELGFGYR-----------------AKYIAETARVVSSEKPKG--WLASLANPAN 226
Query: 255 -----------RKL----DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
K+ +EA +AL L GVGPKVA C+ L L A+PVDTHVW+
Sbjct: 227 PNIRAASEAEDEKIVSPPTYKEAHEALLALAGVGPKVADCVCLMGLGWGEAVPVDTHVWQ 286
Query: 300 IATR 303
IA R
Sbjct: 287 IAQR 290
>gi|242020300|ref|XP_002430593.1| 8-oxoguanine DNA glycosylase, putative [Pediculus humanus corporis]
gi|212515765|gb|EEB17855.1| 8-oxoguanine DNA glycosylase, putative [Pediculus humanus corporis]
Length = 323
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 145/301 (48%), Gaps = 36/301 (11%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPI---------GPHLISLKHLQNGDVCYHI 93
+++ L L T GQ+FRWK ++ G IS K +
Sbjct: 8 ISKEYLQLKPTLCGGQSFRWKNIKNDEWIGVFQNSVWILNQSDSYISFKCISKSKNVEKN 67
Query: 94 HTSPSEP-----AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
+ S + +K L +L + + L ++ +S SD F + A L G R+L Q+
Sbjct: 68 NLSEIDGLSDFINSKDVLHSYLRLNVDLNNYYKIWSESDPHFKKAAPQLHGVRILNQEVT 127
Query: 149 ECLLQFLCSSNNNIARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVS-EVELRN 206
E L F+CSSNNNI RI+ MV+ L G ++ ++G +F+ FP LE L+ S E LR
Sbjct: 128 ENLFSFICSSNNNIKRISSMVEKLCEFYGDYITELDGKKFYGFPKLENLADSSVESNLRK 187
Query: 207 AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDAL 266
AGFGYR AK+I + ++ K GG WL +L+ L + + L
Sbjct: 188 AGFGYR-----------------AKFIYQSACLI--KELGGTNWLNNLKTLPYELSKMEL 228
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEA 325
LPGVG KVA CI L SL AIPVDTH++K+A+ LP L + L K+ + +
Sbjct: 229 MKLPGVGAKVADCICLMSLGHMEAIPVDTHIFKVASEIYLPHLKKYKNLNNKIYNEIGNH 288
Query: 326 F 326
F
Sbjct: 289 F 289
>gi|315044447|ref|XP_003171599.1| N-glycosylase/DNA lyase [Arthroderma gypseum CBS 118893]
gi|311343942|gb|EFR03145.1| N-glycosylase/DNA lyase [Arthroderma gypseum CBS 118893]
Length = 426
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 156/320 (48%), Gaps = 68/320 (21%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + S+W L LT +EL + T GQ+FRW+K ++T + ++SLK Q D
Sbjct: 1 MSIGRFSEWRKLPLTLTELCIDTTLRCGQSFRWRKLDD-EWTCSLYGRVLSLK--QESDS 57
Query: 90 CYH-----------------IHT------SPSEPAAKSALLDFLNMGISLGELWEGFSAS 126
++ I T +P + ++ + + N+ +L +L+E ++AS
Sbjct: 58 LWYRSYKPPPVESSTLPTPPISTAATQRGTPDDDDTEALIHHYFNLEYNLSDLYEQWAAS 117
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGF 185
D F + A AG R++RQD E L+ F+CSSNNNIARI++MV+ L + G +G +
Sbjct: 118 DPNFKKKAVKFAGIRIMRQDAWETLVSFICSSNNNIARISQMVEKLCVNYGPFIGQLGDQ 177
Query: 186 EFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++++FP L+ E LR GFGYR AKYI T ++ ++
Sbjct: 178 KYYDFPDPSALTGTGVESHLRELGFGYR-----------------AKYIYQTAKIVANER 220
Query: 245 SGGAEWLLSLRKLD---------------------LQEAIDALCTLPGVGPKVAACIALF 283
G WL SLR + +EA + L L GVGPKVA C+ L
Sbjct: 221 EPG--WLNSLRNPEKPAFNEEPATPGGVPRGEESGYREAHEQLLALQGVGPKVADCVCLM 278
Query: 284 SLDQHHAIPVDTHVWKIATR 303
L ++PVDTHVW+IA R
Sbjct: 279 GLGWGESVPVDTHVWQIAQR 298
>gi|119189251|ref|XP_001245232.1| hypothetical protein CIMG_04673 [Coccidioides immitis RS]
gi|392868135|gb|EAS33878.2| 8-oxoguanine DNA-glycosylase (ogg) [Coccidioides immitis RS]
Length = 403
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 148/305 (48%), Gaps = 61/305 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+W L ++ +EL + T GQTFRW+K+ +++ + ++SL+ Q+ D H+H
Sbjct: 7 SEWRKLPVSLTELCINTTLRCGQTFRWRKSAEDEWSCVLHGRIVSLR--QDAD---HLHY 61
Query: 96 SPSEPAAKSALL--------------DFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
P A+ L + N+ +L L+E ++ +D F + A G R
Sbjct: 62 RSLFPRARGDLATAQKTNDDTEALIRHYFNLEPNLSGLYEQWAEADKNFKKKALQFTGIR 121
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERL-SLV 199
+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G+++ +H FP L
Sbjct: 122 ILRQDAWEALVSFICSSNNNIARISQMVEKLCTNYGPLIGHIDKQPYHGFPPPSALIGQG 181
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---- 255
E LR GFGYR AKYI T ++ ++ G WL SLR
Sbjct: 182 VEARLRELGFGYR-----------------AKYIYKTAVMVANEREKG--WLNSLRNPER 222
Query: 256 -----------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
K +EA + L L GVGPKVA C+ L L A+PVDTHVW
Sbjct: 223 PAFGEKPNLACKMEAQGKTGYREAHENLLELQGVGPKVADCVCLMGLGWGEAVPVDTHVW 282
Query: 299 KIATR 303
+IA R
Sbjct: 283 QIAQR 287
>gi|226287344|gb|EEH42857.1| N-glycosylase/DNA lyase [Paracoccidioides brasiliensis Pb18]
Length = 412
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 151/317 (47%), Gaps = 73/317 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+W L ++ +EL + T GQ+FRW+K+G +++ + ++SL+ H++
Sbjct: 7 SEWRKLPISLAELCINTTLRCGQSFRWQKSGDNEWSCALYGRIVSLRQDST-----HLYY 61
Query: 96 SPSEPAAKSALL--------------------------DFLNMGISLGELWEGFSASDCR 129
P S+LL ++ N+ ++L L+E +S +D
Sbjct: 62 RSYFPPTASSLLTPPSSVPPLTTEELPKKLDDTDALISNYFNLELNLKSLYEQWSTADEN 121
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A G R+LRQ+ E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 122 FKKKAPQFTGIRILRQNAWEALISFICSSNNNIARISQMVEKLCLNYGQFIGFIDKKPYH 181
Query: 189 EFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP+ + L+ E LR GFGYR AKYI T ++ + G
Sbjct: 182 DFPAPQALAGRDVEARLRELGFGYR-----------------AKYIYQTALIVANHREEG 224
Query: 248 AEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
WL SLR + ++A D L L GVGPKVA C+ L L
Sbjct: 225 --WLNSLRNPECPSFGAKPLSAGEMMEGGREGYRKAHDCLLELQGVGPKVADCVCLMGLG 282
Query: 287 QHHAIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 283 WGEAVPVDTHVWQIAQR 299
>gi|320031289|gb|EFW13262.1| DNA N-glycosylase [Coccidioides posadasii str. Silveira]
Length = 403
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 148/305 (48%), Gaps = 61/305 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+W L ++ +EL + T GQTFRW+K+ +++ + ++SL+ Q+ D H+H
Sbjct: 7 SEWRKLPVSLTELCINTTLRCGQTFRWRKSAEDEWSCVLHGRIVSLR--QDAD---HLHY 61
Query: 96 SPSEPAAKSALL--------------DFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
P A+ L + N+ +L L+E ++ +D F + A G R
Sbjct: 62 RSLFPRARGNLATGQKTNDDTEALIRHYFNLEPNLSGLYEQWAEADKNFKKKALQFTGIR 121
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERL-SLV 199
+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G+++ +H FP L
Sbjct: 122 ILRQDAWEALVSFICSSNNNIARISQMVEKLCTNYGPLIGHIDKQPYHGFPPPSALIGQG 181
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---- 255
E LR GFGYR AKYI T ++ ++ G WL SLR
Sbjct: 182 VEARLRELGFGYR-----------------AKYIYKTAVMVANEREKG--WLNSLRNPER 222
Query: 256 -----------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
K +EA + L L GVGPKVA C+ L L A+PVDTHVW
Sbjct: 223 PAFGEKPNLACKMEAQGKTGYREAHENLLELQGVGPKVADCVCLMGLGWGEAVPVDTHVW 282
Query: 299 KIATR 303
+IA R
Sbjct: 283 QIAQR 287
>gi|241957693|ref|XP_002421566.1| mitochondrial N-glycosylase/DNA lyase [includes: 8-oxoguanine DNA
glycosylase; DNA-(apurinic or apyrimidinic site) lyase],
putative [Candida dubliniensis CD36]
gi|223644910|emb|CAX40909.1| mitochondrial N-glycosylase/DNA lyase [includes: 8-oxoguanine DNA
glycosylase; DNA-(apurinic or apyrimidinic site) lyase],
putative [Candida dubliniensis CD36]
Length = 346
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 99/280 (35%), Positives = 142/280 (50%), Gaps = 34/280 (12%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L +T+ E++L GQTFRWK + ++ I ++ LK Q+ Y+ H
Sbjct: 7 WKSLPITEVEVNLQKVLRCGQTFRWKNINNV-WSFTIKDRIVLLK--QDSTHIYYSHVMK 63
Query: 98 SEPAAK-SALLDFLN----MGISLGELWEGFSASD--CRFAELAKY--LAGARVLRQDPV 148
+ LDF+N + L +L+ + R +++ + AG R+LRQ P
Sbjct: 64 KDTNGNHKQTLDFVNDYFVLDTKLTDLYAHWKLQHEPFRSKKVSPFDSFAGIRILRQGPW 123
Query: 149 ECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRN 206
ECL+ F+CSSNNN+ RI+KM + L + G ++ EG E++ FP+ E LS E +LR+
Sbjct: 124 ECLISFICSSNNNVKRISKMCENLCINFGEYINEFEGHEYYTFPTPEALSQPDVEPKLRD 183
Query: 207 AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG---GAEWLLSLRKLDLQEAI 263
GFGYR AKYI T + E L SLR+ A
Sbjct: 184 LGFGYR-----------------AKYIYQTACKFTDNNKYPDITIENLNSLREAKYTTAH 226
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ L L GVGPKVA CI L SLD+H +P+DTHV++IA R
Sbjct: 227 EFLLQLTGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVR 266
>gi|396472859|ref|XP_003839216.1| hypothetical protein LEMA_P028890.1 [Leptosphaeria maculans JN3]
gi|312215785|emb|CBX95737.1| hypothetical protein LEMA_P028890.1 [Leptosphaeria maculans JN3]
Length = 406
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 151/312 (48%), Gaps = 63/312 (20%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ ++W L + +EL L T GQ+FRW+K+ +++ + ++SL+ Q+ + ++
Sbjct: 4 RAAEWHKLPTSLTELCLNTTLRCGQSFRWRKSAEDEWSMALHGRILSLR--QDAECLHYR 61
Query: 94 HTSPSEPAA---------KSALLD----------FLNMGISLGELWEGFSASDCRFAELA 134
PS +A S LD + N+ LG+L+E ++A+D F + A
Sbjct: 62 AIFPSSTSALPTPPPSNAPSVALDEDDTPALVKHYFNLAPDLGQLYEQWAAADANFKKRA 121
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
G R+LRQD E L+ F+CSSNNNI+RI MV L G + ++ +H+FP+
Sbjct: 122 PKFTGIRILRQDAWEALIGFICSSNNNISRIGGMVQNLCLHYGPLIDRIDDVPYHDFPTP 181
Query: 194 ERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E LS S E L+ GFGYR AKYI T ++ + G +WL
Sbjct: 182 EALSGPSVEAHLQKLGFGYR-----------------AKYIAKTARIVAEEK--GLKWLE 222
Query: 253 SLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
L ++ ++A + L L GVGPKVA C+ LF L ++
Sbjct: 223 DLSNPECPQFGAKEMPAGEMMDDGRVGYRKAHEELLALHGVGPKVADCVCLFGLGWSESV 282
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 283 PVDTHVWQIAQR 294
>gi|380096158|emb|CCC06205.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 457
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 149/331 (45%), Gaps = 77/331 (23%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HLQ 85
+ K S+W L ++ +EL + T GQ+FRW+K Q+ + LISL+ HL
Sbjct: 1 MAAQKVSEWRKLPVSLTELCIETTLRCGQSFRWRKINE-QWHCVLKGRLISLRQEPTHLH 59
Query: 86 -------NGDVCYHIH-TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYL 137
G H S ++ S L + + SL L+ +S SD FA A
Sbjct: 60 YKVTWPTTGATTPTKHENSEADDDIPSLLHSYFALSSSLTTLYAQWSLSDANFARRAPAF 119
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL 196
G R+L QD E L+ F+CSSNNNI+RI++MV L + G ++ VEG FH+FP E L
Sbjct: 120 TGIRILNQDAWETLISFICSSNNNISRISQMVLKLCTHYGPYVATVEGEAFHDFPGPEAL 179
Query: 197 SLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
+ E LR GFGYR AKYI T + + G +WLL LR
Sbjct: 180 AGEGVEAHLRELGFGYR-----------------AKYIAETAGCVAQVY--GEKWLLRLR 220
Query: 256 ------------------------------------KLDLQE-------AIDALCTLPGV 272
K DL E A +AL TLPGV
Sbjct: 221 NPGVPALGVAAGVEVKKEEEGAEDEKKEERVDDGDSKADLDEDPPTYKKAHEALLTLPGV 280
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GPKV+ C+ L L ++P+DTHVW+IA R
Sbjct: 281 GPKVSDCVCLMGLGWGESVPIDTHVWQIAQR 311
>gi|330931858|ref|XP_003303564.1| hypothetical protein PTT_15820 [Pyrenophora teres f. teres 0-1]
gi|311320369|gb|EFQ88343.1| hypothetical protein PTT_15820 [Pyrenophora teres f. teres 0-1]
Length = 419
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 151/312 (48%), Gaps = 63/312 (20%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ ++W L + +EL + T GQ+FRW+K+ +++ + +ISL+ Q+ + ++
Sbjct: 4 RAAEWHKLPTSLTELCIDTTLRCGQSFRWRKSADDEWSMALHGRIISLR--QDSEHLHYR 61
Query: 94 HTSPS------------------EPAAKSALLD-FLNMGISLGELWEGFSASDCRFAELA 134
T PS E AL++ + N+ +LG+L+E ++ASD F + A
Sbjct: 62 ATYPSITTTPLTPPPSKAPSVAPEEDDTPALINHYFNLAPNLGQLYEQWAASDTNFKKRA 121
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
G R+LRQD E L+ F+CSSNNNI+RI+ MV L G +G++ +H+FP+
Sbjct: 122 PKFTGIRILRQDAWEALIGFICSSNNNISRISGMVHNLCLHYGPLIGHIHDVPYHDFPTP 181
Query: 194 ERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
LS E L GFGYR AKYI T ++ G +WL
Sbjct: 182 AALSGPTVEANLMKLGFGYR-----------------AKYIAKTARMVSE--DKGMKWLE 222
Query: 253 SLR-----KLDLQE----------------AIDALCTLPGVGPKVAACIALFSLDQHHAI 291
L + +QE A + L L GVGPKVA C+ LF L ++
Sbjct: 223 DLSNPEYPQFGVQEKPAGDMPEGGREGYRKAHEELLALHGVGPKVADCVCLFGLGWSESV 282
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 283 PVDTHVWQIAQR 294
>gi|380025853|ref|XP_003696678.1| PREDICTED: N-glycosylase/DNA lyase-like [Apis florea]
Length = 328
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/289 (35%), Positives = 148/289 (51%), Gaps = 36/289 (12%)
Query: 46 SELSLPLTFPTGQTFRW--KKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP--SEPA 101
SEL L T +GQ+FRW G Y G + +L +QN +I P
Sbjct: 18 SELDLGATLKSGQSFRWFCHNNG---YRGVFDGCVWTL--MQNDTHLSYIVQGPLIDSKN 72
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
L ++ + L +L + +SA D F + ++G R+L Q+ +E + F+CSSNNN
Sbjct: 73 YDEILSEYFRLNECLTDLCKKWSAIDQHFKKSLNKISGVRILNQNVIENVFSFICSSNNN 132
Query: 162 IARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSS 218
I RI++MV+ L +L G + +VEG ++ FPS+E L+ ++E L+ FGYR
Sbjct: 133 IQRISRMVEKLCTLFGEKICSVEGKNYYNFPSIEALA-SKDIENILKMEKFGYR------ 185
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA---LCTLPGVGPK 275
A+YI L GG EWLL+L K + I A L TLPG+GPK
Sbjct: 186 -----------ARYIANAAKCLIE--LGGKEWLLNLHKKNNVSYIQARKQLITLPGIGPK 232
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVA 323
VA CI L SL +IPVDTH+++IA LP L + +TPK+ V+
Sbjct: 233 VADCICLMSLGHLESIPVDTHIFQIARTNYLPHLKQYKTVTPKIHEEVS 281
>gi|449299099|gb|EMC95113.1| hypothetical protein BAUCODRAFT_25224 [Baudoinia compniacensis UAMH
10762]
Length = 375
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 149/312 (47%), Gaps = 67/312 (21%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
+W L ++ +EL + T GQ+FRW+K ++ + ++ L Q+ D Y+
Sbjct: 8 EWQKLPVSLTELCINTTLRCGQSFRWRKNDAGVWSMALHNRIVCLH--QDTDHVYYRSIP 65
Query: 97 PSEPAAKSA---------------------LLDFLNMGISLGELWEGFSASDCRFAELAK 135
P A L +LN+ +L +L+ +SASD FA+ A
Sbjct: 66 PMRLQAPPTPPSSNPPSVAPESDADNTLELLKHYLNLTPNLTQLYAQWSASDANFAKKAP 125
Query: 136 YLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLE 194
G R+L+QD E L+ F+CSSNNNI RI++M+ L + G LG+ +G +H+FP+ +
Sbjct: 126 KFTGVRILKQDAWEALVGFICSSNNNIVRISQMISKLCVNYGPLLGHFDGEAYHDFPTPQ 185
Query: 195 RLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
L++ E +LR+ GFGYR AKY+ T ++ S A+WL S
Sbjct: 186 ALAVDGVEAKLRSLGFGYR-----------------AKYLYQTACIVSSH---PADWLDS 225
Query: 254 LR----------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
LR + A +AL TL GVGPKVA C+ L L A+
Sbjct: 226 LRNPESPAFGTKPSPSTDDWAPEGRAGYHAAHEALLTLQGVGPKVADCVCLMGLGWGEAV 285
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 286 PVDTHVWQIAQR 297
>gi|407929465|gb|EKG22294.1| hypothetical protein MPH_00361 [Macrophomina phaseolina MS6]
Length = 416
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 160/338 (47%), Gaps = 69/338 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH--I 93
++W + ++ +EL + T GQ+FRW+K+ +++ + ++SL+ Q+ + ++ I
Sbjct: 7 TEWRRMPMSLTELCINTTLRCGQSFRWRKSADDEWSCALHGRILSLR--QDPEYLHYRVI 64
Query: 94 HTS---------------------PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAE 132
+ S P + + + N+ +LG+L+E ++A+D F +
Sbjct: 65 YPSTSRILPTPPPSTAPSTTESPAPDDDDTVELIEHYFNLKPNLGQLYEDWAAADANFKK 124
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFP 191
A G R+L+QD E L+ F+CSSNNNIARI++MV+ L + G +G+V FH+FP
Sbjct: 125 KAPKFTGVRILKQDAWEALVGFICSSNNNIARISQMVEKLCINYGPLIGHVGSQAFHDFP 184
Query: 192 SLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW 250
S L+ E +LR GFGYR AKY+ T ++ K G W
Sbjct: 185 SASALTDPKVEQQLRQLGFGYR-----------------AKYLHQTAVMIAEKEEG---W 224
Query: 251 LLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
L SLR + + A + L L GVGPKVA C+ L L
Sbjct: 225 LDSLRNPESPVLGVDARPAGEFVESGREGYRSAHEQLLALKGVGPKVADCVCLMGLGWGE 284
Query: 290 AIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
A+PVDTHVW+IA R Y + LT VA F
Sbjct: 285 AVPVDTHVWQIAQRDYRFGKGKHSSLTKATYDAVANKF 322
>gi|353235459|emb|CCA67472.1| related to 8-oxoguanine DNA-glycosylase [Piriformospora indica DSM
11827]
Length = 404
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/340 (30%), Positives = 157/340 (46%), Gaps = 67/340 (19%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRW---------KKTGPLQYTGPIGPHLISLKH 83
N P + L L ++LSL GQ+FRW K+ + P + ++L+
Sbjct: 3 NIPKGFKSLPLPIAQLSLAAVLKCGQSFRWSSIPLAAGVKQEEEPENAYPSHEYRLALRD 62
Query: 84 L-----QNGDVCYHIHTSPSEPAAKSA-----------LLDFLNMGISLGELWEGFSASD 127
Q+ D ++ P P ++ L D+ + + L +L+ ++ D
Sbjct: 63 RVVCLRQDKDTLFYRAAFPDVPLSEDQSAKRDKETLEWLRDYFQLDVDLVKLYSDWAERD 122
Query: 128 CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL------------ 175
F E A +G R+LRQDP E + F+CSSNN+I RI KMV L +
Sbjct: 123 VVFKETALRFSGIRILRQDPFENTISFICSSNNHINRIGKMVQSLCTEYGPLACSLPPPE 182
Query: 176 GSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
S + + ++ FP+ ++L+ S LR GFGYR A+Y+
Sbjct: 183 DSEETSRDAIAYYGFPTPDKLTDESVSTRLRELGFGYR-----------------AEYVQ 225
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
T +L +H +L+ LRKL ++EA L L GVGPKVA CI L SLD+ + +PVD
Sbjct: 226 RTAQMLCDEHEDPESYLVGLRKLPVEEARAELLKLCGVGPKVADCILLMSLDKRNVVPVD 285
Query: 295 THVWKIATRYLLPELAGVRLTPK-------LCSRVAEAFC 327
THV +IA +Y G+R TP+ + ++ EA C
Sbjct: 286 THVHQIALKYY-----GLRGTPQGKGGKVPMTPKIYEAVC 320
>gi|195565805|ref|XP_002106489.1| GD16912 [Drosophila simulans]
gi|194203865|gb|EDX17441.1| GD16912 [Drosophila simulans]
Length = 327
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 147/297 (49%), Gaps = 36/297 (12%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKT---GPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
L+ E L T GQ+FRW+ +Y G + L+ + + Y + + S
Sbjct: 13 LSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQ-EESSIIYEAYGTSSP 71
Query: 100 PAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQFL 155
A K S + D+L M L + + D F K+L+ R+L Q+P E + FL
Sbjct: 72 LATKDYSSLISDYLRMDFDLKLNQKDWLNKDDNFV---KFLSKPVRLLAQEPFENIFSFL 128
Query: 156 CSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGFG 210
CS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A FG
Sbjct: 129 CSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKFG 188
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR AK+I T+ +Q K GG W +SL+ + ++A + L LP
Sbjct: 189 YR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLLP 229
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
G+G KVA CI L SL ++PVD H+++IA Y LP L G + +T K+ V+ F
Sbjct: 230 GIGYKVADCICLMSLGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSNHF 286
>gi|242803785|ref|XP_002484244.1| DNA N-glycosylase, putative [Talaromyces stipitatus ATCC 10500]
gi|218717589|gb|EED17010.1| DNA N-glycosylase, putative [Talaromyces stipitatus ATCC 10500]
Length = 410
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 150/314 (47%), Gaps = 68/314 (21%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGDVCYH-- 92
++W L + SEL + T GQ+FRW+K ++ + LISL+ Q+ Y+
Sbjct: 6 TEWRKLPINLSELCINTTLRCGQSFRWQKIPDSDEWRCVLRGRLISLR--QDPSYLYYRS 63
Query: 93 ----------IHTSPSEPA----------AKSALLDFLNMGISLGELWEGFSASDCRFAE 132
++PS P A+ + +LN+ +L +L+ +S SD F +
Sbjct: 64 YVPRIPLAATFKSTPSVPTTNGTASTSDEAREIITHYLNLTSNLTDLYTQWSDSDPNFKK 123
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFP 191
A G R+LRQD E L+ F+CSSNNNIARI++MV+ L + G+ + ++ +H+FP
Sbjct: 124 KAPNFTGIRILRQDAWEALVSFICSSNNNIARISQMVEKLCTNYGNLVATIDDQSYHDFP 183
Query: 192 SLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW 250
+ E L+ E LR GFGYR AKYI T ++ K G W
Sbjct: 184 APEALTGKDVEARLRELGFGYR-----------------AKYIYQTAVMVSEKEKG---W 223
Query: 251 LLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
L SLR + ++A + L L GVGPKVA C+ L L
Sbjct: 224 LDSLRNPESPAFGMAPRPGGEMRPGGREGYRDAHEKLLELQGVGPKVADCVCLMGLGWGE 283
Query: 290 AIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 284 AVPVDTHVWQIAQR 297
>gi|195480038|ref|XP_002101114.1| GE15798 [Drosophila yakuba]
gi|194188638|gb|EDX02222.1| GE15798 [Drosophila yakuba]
Length = 331
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 147/300 (49%), Gaps = 34/300 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD--VCYHIH--TS 96
+NL+ E L T GQ+FRW+ T G + LQ + + Y + +S
Sbjct: 11 INLSLKECDLERTLLGGQSFRWRSICDGNKTKYCGVAFNTYWVLQQEESLITYEAYGPSS 70
Query: 97 PSEPAAKSALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQF 154
PS S+L+ D+L + L + + D F K+L+ RVL Q+P E + F
Sbjct: 71 PSLAKDYSSLISDYLRLDFDLKLNQKDWLNKDENFV---KFLSKPVRVLSQEPFENIFSF 127
Query: 155 LCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGF 209
LCS NNNI RI+ MV F A+ G+ +G+ G + + FP + R + +LR A F
Sbjct: 128 LCSQNNNIKRISSMVQWFCATFGTKIGHFNGSDEYTFPMINRFHDIPCDDLNAQLRAAKF 187
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I T+ +Q K GG W + L+ + +A L L
Sbjct: 188 GYR-----------------AKFIAQTLQEIQKK--GGQNWFIRLKSMPFAKARKELTLL 228
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
PG+G KVA CI L S+ ++PVD H+++IA +Y LP L G + +T K+ V+ F E
Sbjct: 229 PGIGYKVADCICLMSMGHLESVPVDIHIYRIAQKYYLPHLTGQKNVTKKIYEEVSNHFQE 288
>gi|357631580|gb|EHJ79049.1| putative N-glycosylase/DNA lyase [Danaus plexippus]
Length = 325
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 144/303 (47%), Gaps = 35/303 (11%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPL-QYTGPIGPHLISLKHLQNGDVC-YHIHT 95
W +N Q EL L T GQ+FRW ++ G L L+ Q D Y +
Sbjct: 3 WNKINCCQRELQLLGTLNGGQSFRWNYNKDTNEWKGVFSRTLWKLR--QRDDFLEYQVLG 60
Query: 96 S--PSEPAAKSALLDFLNM-------GISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S S +DF +M +L + ++ +S D F G R+L Q+
Sbjct: 61 SLLIKSKENNSVKVDFADMLTKYFRLDFNLKDHYKVWSDKDELFKSACTKFYGIRMLNQE 120
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
PVE L F+CS NN+I+RI+ +V+ L G + EG ++ FP +E+L + E +L
Sbjct: 121 PVENLFSFICSQNNHISRISSLVEKLCIYYGDEICQFEGVTYYAFPDVEKLMDIKVESKL 180
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R GFGYR AK+I + Q GG EW L+ + ++A
Sbjct: 181 RELGFGYR-----------------AKFIQKS--AAQIVEWGGDEWFKRLKDMKYKDARQ 221
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVA 323
L L G+GPKVA CI L SL+ A+PVDTHV++IA LP L G + +T K+ + +
Sbjct: 222 ELIKLCGIGPKVADCICLMSLNHLEALPVDTHVYQIAATNYLPHLKGKKSVTEKIYTEIG 281
Query: 324 EAF 326
+ F
Sbjct: 282 DHF 284
>gi|240279649|gb|EER43154.1| 8-oxoguanine DNA glycosylase [Ajellomyces capsulatus H143]
gi|325092782|gb|EGC46092.1| 8-oxoguanine DNA glycosylase [Ajellomyces capsulatus H88]
Length = 422
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 102/317 (32%), Positives = 153/317 (48%), Gaps = 73/317 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH- 94
S+W L + +EL + T GQ+FRW+K+G +++ + ++SL+ H+H
Sbjct: 7 SEWRKLPIGLTELCINTTLRCGQSFRWQKSGDNEWSCALYGRIVSLRQDPT-----HLHY 61
Query: 95 -------------TSPSEPAAK-----------SALLD-FLNMGISLGELWEGFSASDCR 129
S PA++ AL++ + N+ ++L +L+E +S +D
Sbjct: 62 RSYFPPIPPALPTPPSSVPASRREESCEKIDDTEALINNYFNLDLNLTDLYEQWSTADKN 121
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A AG R+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 122 FKKKAPKFAGIRILRQDSWEALISFICSSNNNIARISQMVEKLCLNYGPLIGYIDKKPYH 181
Query: 189 EFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP+ + L E LR GFGYR AKYI T ++ + G
Sbjct: 182 DFPTPQALVGRDVEARLRELGFGYR-----------------AKYIYQTALIVVNDREEG 224
Query: 248 AEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
WL SLR + ++A + L L GVGPKVA C L L
Sbjct: 225 --WLNSLRNPECPSFGQSPAHAGEMAEGGREGYRKAHECLLELQGVGPKVADCACLMGLG 282
Query: 287 QHHAIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 283 WGEAVPVDTHVWQIAQR 299
>gi|260828601|ref|XP_002609251.1| hypothetical protein BRAFLDRAFT_86839 [Branchiostoma floridae]
gi|229294607|gb|EEN65261.1| hypothetical protein BRAFLDRAFT_86839 [Branchiostoma floridae]
Length = 401
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 121/226 (53%), Gaps = 22/226 (9%)
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
S L D+ + + L +L+E + D F +++ G R+LRQDPVE L F+CSSNN+I+
Sbjct: 149 SILRDYFQLDVKLTDLYEQWCKDDPHFKQVSPNFTGIRMLRQDPVENLFSFICSSNNHIS 208
Query: 164 RITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLF 221
RI+ MV+ L + + L V G ++ FP++ L+ E LR GFGYR
Sbjct: 209 RISGMVERLCEAYSARLCEVCGVTYYAFPTVSALAGRGVEERLRKLGFGYR--------- 259
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
A+YI+ T + + GG WL +L+ L +EA L L GVG KVA C+
Sbjct: 260 --------ARYISETAQYIMEQ--GGESWLYNLKTLPYEEAKAELIKLSGVGAKVADCVC 309
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
L S+D+ AIPVDTHVW+I R +L + LT K + + F
Sbjct: 310 LMSMDKTGAIPVDTHVWQIVNRDYKHKLGTTKTLTDKTYKEIGDFF 355
>gi|326428505|gb|EGD74075.1| OGG1 protein type 2c [Salpingoeca sp. ATCC 50818]
Length = 496
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 87/227 (38%), Positives = 119/227 (52%), Gaps = 22/227 (9%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T S W L + ELS+ L GQ F WK+ + G + ++L+ NG Y
Sbjct: 2 TTSSSTWQRLCGCRDELSIGLALRCGQAFGWKQLEATAFVGTVNDTAVALRTAPNGATEY 61
Query: 92 HIHTSPSEPAA--KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ +PS A + L D+ + L L+E ++A+D R A LAG RV+RQ P E
Sbjct: 62 RVLHAPSRSFASTRDLLSDYFQLSTKLKPLYEQWAAADPRMQTAAAALAGLRVIRQPPFE 121
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAG 208
CL+ F+CSSNNNIARIT M+ L G+ L + +G H FP+L++L V+E ELR+ G
Sbjct: 122 CLISFICSSNNNIARITSMLSSLRLRYGTKLTHHDGDPIHAFPTLDQLQPVTEQELRDLG 181
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
FGYR AK+I TVD L K +GG +L SLR
Sbjct: 182 FGYR-----------------AKFIVKTVDAL--KAAGGEAYLQSLR 209
>gi|225562834|gb|EEH11113.1| 8-oxoguanine DNA glycosylase [Ajellomyces capsulatus G186AR]
Length = 422
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 154/317 (48%), Gaps = 73/317 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH- 94
S+W L ++ +EL + T GQ+FRW+++G +++ + ++SL+ H+H
Sbjct: 7 SEWRKLPISLTELCINTTLRCGQSFRWQQSGDNEWSCALYGRIVSLRQDPT-----HLHY 61
Query: 95 -------------TSPSEPAAK-----------SALLD-FLNMGISLGELWEGFSASDCR 129
S PA++ AL++ + N+ ++L +L+E +S +D
Sbjct: 62 RSYFPPIPPALPTPPSSVPASRREESCEKIDDTEALINNYFNLDLNLTDLYEQWSTADKN 121
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A AG R+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 122 FKKKAPKFAGIRILRQDSWEALISFICSSNNNIARISQMVEKLCLNYGPLIGYIDKKPYH 181
Query: 189 EFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP+ + L E LR GFGYR AKYI T ++ + G
Sbjct: 182 DFPTPQALVGRDVEARLRELGFGYR-----------------AKYIYQTALIVVNDREEG 224
Query: 248 AEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
WL SLR + ++A + L L GVGPKVA C L L
Sbjct: 225 --WLNSLRNPECPSFGQSPAHAGEMSEGGREGYRKAHECLLELQGVGPKVADCACLMGLG 282
Query: 287 QHHAIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 283 WGEAVPVDTHVWQIAQR 299
>gi|195398793|ref|XP_002058005.1| GJ15730 [Drosophila virilis]
gi|194150429|gb|EDW66113.1| GJ15730 [Drosophila virilis]
Length = 323
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 150/290 (51%), Gaps = 35/290 (12%)
Query: 49 SLPLTFPTGQTFRWKKTGP---LQYTG-PIGPHLISLKHLQNGDVCYHIHTS--PSEPAA 102
L T GQ+FRW+K ++Y G + + + + ++ + Y H + SE
Sbjct: 19 DLERTLLGGQSFRWRKLCAEPKVKYCGVALNTYWVLMP--EDEHIAYEAHATGTQSEHNF 76
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
S L +L L G+ +D F + R+L Q+P+E ++ F+CS NNNI
Sbjct: 77 GSLLSKYLRADFDLESEQAGWMQADAHFGKFVS--KPVRMLSQEPLENIVCFMCSQNNNI 134
Query: 163 ARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGFGYRSAPQS 217
RI+ M+ F A+ G +G+ G + + FP+L+ LS S + ELR A FGYR
Sbjct: 135 KRISAMIQWFCAAYGHKIGHFHGRDEYTFPTLKALSERSCSELDAELRAAKFGYR----- 189
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
AK+I ++ +++ GG +W LR+L EA +AL LPG+G KVA
Sbjct: 190 ------------AKFIARSLSQIET--LGGNDWFEQLRQLPYAEAREALVQLPGIGYKVA 235
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
CI L SL A+P+DTH++K+A R+ LP LA + +T K+ + VA+ F
Sbjct: 236 DCICLMSLGHLEAVPIDTHIFKLAQRHYLPHLASQKSVTSKIYAEVAQHF 285
>gi|313227944|emb|CBY23093.1| unnamed protein product [Oikopleura dioica]
Length = 374
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 138/279 (49%), Gaps = 37/279 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L + G TFRW+K G + + G IG + + L+ G ++ S+ S L+
Sbjct: 15 LDIDKVLQMGMTFRWRKHGQI-WHGVIGRNAVQLEQKNEGVAYSCVNGDSSDSGLNSELI 73
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
F ++ E D G R L+ DP+E L F+CSSNN+I+RI+
Sbjct: 74 RFFDL-----ERKYEIPHDDENLTNAVGKYEGIRQLQLDPIETLYSFICSSNNHISRISS 128
Query: 168 MVDFLASLGSHLGNVEG--------FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
M+ FL S G L ++ F + FPS+E L+ ++E ELR A FGYR
Sbjct: 129 MMTFLGSKGDLLAQIKSDTDVEPNLFRYQAFPSIETLAQLTEQELRAANFGYR------- 181
Query: 220 LFSVRRSFKQAKYITGT-VDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
AK+I T +L++ GG W+ +L+ D A++ L TL G+G KVA
Sbjct: 182 ----------AKFIVKTSASILEN---GGNSWVETLKSEDYVVAVEKLMTLAGIGRKVAD 228
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTP 316
CIAL+ L++ +IP+D H+ ++A R Y L E A LTP
Sbjct: 229 CIALYGLNKRESIPLDVHLIRLANRDYNLKE-AEKGLTP 266
>gi|46409138|gb|AAS93726.1| RE57519p [Drosophila melanogaster]
Length = 359
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 149/298 (50%), Gaps = 34/298 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD--VCYHIHTSPS 98
+ L+ E L T GQ+FRW+ T G + LQ + + Y + + S
Sbjct: 27 IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 86
Query: 99 EPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQF 154
A K S + D+L + L + + + D F K+L+ R+L Q+P E + F
Sbjct: 87 PLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFV---KFLSKPVRLLSQEPFENIFSF 143
Query: 155 LCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGF 209
LCS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A F
Sbjct: 144 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 203
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I T+ +Q K GG W +SL+ + ++A + L L
Sbjct: 204 GYR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLL 244
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
PG+G KVA CI L S+ ++PVD H+++IA Y LP L G + +T K+ V++ F
Sbjct: 245 PGIGYKVADCICLMSMGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHF 302
>gi|189204131|ref|XP_001938401.1| N-glycosylase/DNA lyase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985500|gb|EDU50988.1| N-glycosylase/DNA lyase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 416
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 151/312 (48%), Gaps = 63/312 (20%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ ++W L + +EL + T GQ+FRW+K+ +++ + +I+L+ Q+ + ++
Sbjct: 4 RAAEWHKLPTSLTELCIDTTLRCGQSFRWRKSADGEWSMALHGRIITLR--QDAEHLHYR 61
Query: 94 HTSPS------------------EPAAKSALLD-FLNMGISLGELWEGFSASDCRFAELA 134
T PS E AL++ + N+ +LG+L+E ++ASD F + A
Sbjct: 62 ATYPSITTTPLTPPPSKAPSVAPEEDDTPALINHYFNLTPNLGQLYEQWAASDANFKKRA 121
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
G R+LRQD E L+ F+CSSNNNI+RI+ MV L G +G++ +H+FP+
Sbjct: 122 PKFTGIRILRQDAWEALIGFICSSNNNISRISGMVHNLCLHYGPLIGHIHDVPYHDFPTP 181
Query: 194 ERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
LS E L GFGYR AKYI T ++ G +WL
Sbjct: 182 AALSGPAVEANLMKLGFGYR-----------------AKYIAKTARMVSE--DKGMKWLE 222
Query: 253 SLR-----KLDLQE----------------AIDALCTLPGVGPKVAACIALFSLDQHHAI 291
L + +QE A + L L GVGPKVA C+ LF L ++
Sbjct: 223 DLSNPEYPQFGVQEKPAGDMPEGGREGYRKAHEELLALHGVGPKVADCVCLFGLGWSESV 282
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 283 PVDTHVWQIAQR 294
>gi|5679152|gb|AAD46882.1|AF160942_1 BcDNA.LD19945 [Drosophila melanogaster]
Length = 327
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 149/298 (50%), Gaps = 34/298 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD--VCYHIHTSPS 98
+ L+ E L T GQ+FRW+ T G + LQ + + Y + + S
Sbjct: 11 IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 70
Query: 99 EPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQF 154
A K S + D+L + L + + + D F K+L+ R+L Q+P E + F
Sbjct: 71 PLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFV---KFLSKPVRLLSQEPFENIFSF 127
Query: 155 LCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGF 209
LCS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A F
Sbjct: 128 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 187
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I T+ +Q K GG W +SL+ + ++A + L L
Sbjct: 188 GYR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLL 228
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
PG+G KVA CI L S+ ++PVD H+++IA Y LP L G + +T K+ V++ F
Sbjct: 229 PGIGYKVADCICLMSMGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHF 286
>gi|358378061|gb|EHK15744.1| hypothetical protein TRIVIDRAFT_228771 [Trichoderma virens Gv29-8]
Length = 396
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 145/297 (48%), Gaps = 53/297 (17%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L ++ +EL + T GQ+FRW+K ++T + ++SLK Q+ ++ T P
Sbjct: 9 WRKLPVSLTELCIDTTLRCGQSFRWRKLND-EWTCTLHGRILSLK--QDATHLHYKVTWP 65
Query: 98 SE---------PAAKSALLD----FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLR 144
+ P+ K D + ++ I L L++ +S D F E A G R+L
Sbjct: 66 EQSPISSSKDLPSEKDDTEDLLRHYFSLNIDLASLYQQWSKDDPNFREKAPQFTGVRILN 125
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EV 202
QD E L+ F+CSSNNNI+RI++MV L G +G++EG H+FP+ E L+ + E
Sbjct: 126 QDAWEALICFICSSNNNISRISQMVHKLCKHYGPLIGHIEGEAMHDFPTPESLTKKAVEA 185
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL--- 259
LR GFGYR AKYI T ++ + WL SLR D
Sbjct: 186 HLRELGFGYR-----------------AKYIAETARIIAKEKPSA--WLDSLRNPDFPAF 226
Query: 260 -------------QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++A AL +L GVGPKVA C+ L L ++PVDTHV +IA R
Sbjct: 227 NAVAVVDGPQSTYKDAQAALLSLTGVGPKVADCVCLMGLGWGESVPVDTHVLQIAQR 283
>gi|347842432|emb|CCD57004.1| hypothetical protein [Botryotinia fuckeliana]
Length = 424
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 107/312 (34%), Positives = 151/312 (48%), Gaps = 62/312 (19%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQY-TGPIGPHLISLK----HLQNG- 87
K S+W L ++ +EL + T GQ+FRWKK Y + + +ISLK HL
Sbjct: 5 KISEWRKLPVSLAELCIDTTLRCGQSFRWKKAVDEDYWSCTLHGRIISLKQDSTHLHYRT 64
Query: 88 ---DVCYHIHTSP--------------SEPAAKSALLDFLNMGISLGELWEGFSASDCRF 130
+V +H P + +S L +LN+ +L E++E +S +D F
Sbjct: 65 IFPEVKDSLHHQPVVKKEELEDENEEDDKDDTESLLRHYLNLSPNLTEMYEQWSLADPNF 124
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHE 189
+ A G R+L+QD E L+ F+CSSNNNI RI++MV+ L G +G+++ FH+
Sbjct: 125 KKRAPKFTGVRILKQDAWEALVGFICSSNNNIIRISQMVNNLCLHYGPLIGHIDDQPFHD 184
Query: 190 FPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA 248
FP E L+ E LR GFGYR AKYI T ++ SK
Sbjct: 185 FPQPEALTGSGVESHLRALGFGYR-----------------AKYIAQTASIVASK---PK 224
Query: 249 EWLLSLRKLD-----------------LQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
+WL +LR + ++A + L L GVGPKVA C+ L L A+
Sbjct: 225 DWLENLRNPETFDHPFEGEVPAGGRPGYRKAHEELLELQGVGPKVADCVCLMGLGWGEAV 284
Query: 292 PVDTHVWKIATR 303
PVDTHVWKIA R
Sbjct: 285 PVDTHVWKIAQR 296
>gi|195355152|ref|XP_002044057.1| GM21766 [Drosophila sechellia]
gi|194129310|gb|EDW51353.1| GM21766 [Drosophila sechellia]
Length = 327
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 147/297 (49%), Gaps = 36/297 (12%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKT---GPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
L+ E L T GQ+FRW+ +Y G + L+ + + Y + + S
Sbjct: 13 LSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQ-EEYSITYEAYGTSSP 71
Query: 100 PAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQFL 155
A K S + D+L M L + + D F K+L+ R+L Q+P E + FL
Sbjct: 72 LATKEYSSLISDYLRMDFDLKLNQKDWLNKDDNFL---KFLSKPVRLLSQEPFENIFSFL 128
Query: 156 CSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAGFG 210
CS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A FG
Sbjct: 129 CSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKFG 188
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR AK+I T+ +Q K GG W +SL+ + ++A + L LP
Sbjct: 189 YR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLLP 229
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
G+G KVA CI L SL ++PVD H+++IA Y LP L G + +T K+ V+ F
Sbjct: 230 GIGYKVADCICLMSLGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSNHF 286
>gi|452979967|gb|EME79729.1| hypothetical protein MYCFIDRAFT_37962 [Pseudocercospora fijiensis
CIRAD86]
Length = 400
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 151/310 (48%), Gaps = 64/310 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
++W L ++ SEL + T GQ+FRW+K ++ + ++SL Q+ + Y+
Sbjct: 7 AEWQKLPVSLSELCINTTLRCGQSFRWRKNDSGVWSMALHNRILSL--YQDPEHLYYRSL 64
Query: 96 SPSEPAA-------------------KSALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
SP++ A + + +LN+ +L L+ +S++D FA+ A
Sbjct: 65 SPADLQAPMTPPSSNPPSLPDIKDDTRDLVRHYLNLEPTLTTLYAQWSSADANFAKKAPK 124
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLER 195
G R+LRQD E L+ F+CSSNNNI RI++MV L + G LG ++ +H+FP +
Sbjct: 125 FTGVRILRQDAWEALIGFICSSNNNIIRISQMVHKLCVNYGPLLGYLDEEAYHDFPEPKD 184
Query: 196 LSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ E +LR+ GFGYR AKYI T ++ K G WL +L
Sbjct: 185 LAQDGVEAKLRSLGFGYR-----------------AKYIYQTACMVAEKPKG---WLDAL 224
Query: 255 R---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
R + + A + L TL GVGPKVA C+AL L A+PV
Sbjct: 225 RNPESPALGVKPEFGGEMLPEGREGYRRAHEELLTLQGVGPKVADCVALMGLGWGEAVPV 284
Query: 294 DTHVWKIATR 303
DTHVW+IA R
Sbjct: 285 DTHVWQIAVR 294
>gi|24640654|ref|NP_572499.2| Ogg1 [Drosophila melanogaster]
gi|30923457|sp|Q9V3I8.2|OGG1_DROME RecName: Full=N-glycosylase/DNA lyase; AltName: Full=dOgg1;
Includes: RecName: Full=8-oxoguanine DNA glycosylase;
Includes: RecName: Full=DNA-(apurinic or apyrimidinic
site) lyase; Short=AP lyase
gi|22831969|gb|AAF46404.2| Ogg1 [Drosophila melanogaster]
gi|237513014|gb|ACQ99832.1| FI01819p [Drosophila melanogaster]
Length = 343
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 149/298 (50%), Gaps = 34/298 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD--VCYHIHTSPS 98
+ L+ E L T GQ+FRW+ T G + LQ + + Y + + S
Sbjct: 27 IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 86
Query: 99 EPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQF 154
A K S + D+L + L + + + D F K+L+ R+L Q+P E + F
Sbjct: 87 PLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFV---KFLSKPVRLLSQEPFENIFSF 143
Query: 155 LCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLV----SEVELRNAGF 209
LCS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A F
Sbjct: 144 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 203
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I T+ +Q K GG W +SL+ + ++A + L L
Sbjct: 204 GYR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLL 244
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
PG+G KVA CI L S+ ++PVD H+++IA Y LP L G + +T K+ V++ F
Sbjct: 245 PGIGYKVADCICLMSMGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHF 302
>gi|303323231|ref|XP_003071607.1| HhH-GPD family base excision DNA repair protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240111309|gb|EER29462.1| HhH-GPD family base excision DNA repair protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 403
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 147/305 (48%), Gaps = 61/305 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+W L ++ +EL + T GQTFRW+K+ +++ + ++SL+ Q+ D H+H
Sbjct: 7 SEWRKLPVSLTELCINTTLRCGQTFRWRKSAEDEWSCVLHGRIVSLR--QDAD---HLHY 61
Query: 96 SPSEPAAKSALL--------------DFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
P A+ L + N+ +L L+E ++ +D F + A G R
Sbjct: 62 RSLFPRARGNLATGQKTNDDTEALIRHYFNLEPNLSGLYEQWAEADKNFKKKALQFTGIR 121
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERL-SLV 199
+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G+++ +H FP L
Sbjct: 122 ILRQDAWEALVSFICSSNNNIARISQMVEKLCTNYGPLIGHIDKQPYHGFPPPSALIGQG 181
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---- 255
E LR GFGYR AK I T ++ ++ G WL SLR
Sbjct: 182 VEARLRELGFGYR-----------------AKCIYKTAVMVANEREKG--WLNSLRNPER 222
Query: 256 -----------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
K +EA + L L GVGPKVA C+ L L A+PVDTHVW
Sbjct: 223 PAFGEKPNLACKMEAQGKTGYREAHENLLELQGVGPKVADCVCLMGLGWGEAVPVDTHVW 282
Query: 299 KIATR 303
+IA R
Sbjct: 283 QIAQR 287
>gi|328788317|ref|XP_003251105.1| PREDICTED: n-glycosylase/DNA lyase-like [Apis mellifera]
Length = 338
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/295 (35%), Positives = 146/295 (49%), Gaps = 34/295 (11%)
Query: 44 TQSELSLPLTFPTGQTFRW--KKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP--SE 99
+ SEL L +T GQ+FRW G Y G + +L QN +I P
Sbjct: 18 SSSELDLGVTLKGGQSFRWFCHNNG---YRGIFDGCVWTLT--QNDTHLSYIVQGPLIDS 72
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
L ++ + L +L + ++ D F + + G R+L Q+ VE + F+CSSN
Sbjct: 73 KNYDGILSEYFRLNECLTDLCKRWTTIDQHFKKSLNKINGVRILNQNVVENVFSFICSSN 132
Query: 160 NNIARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQS 217
NNI RI+KMV+ L +L G + ++EG ++ FPS+E L+ + E L+ FGYR
Sbjct: 133 NNIQRISKMVEKLCTLFGEKICSIEGKNYYNFPSIEALANKNVENILKMEKFGYR----- 187
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA---LCTLPGVGP 274
AKYI L GG EWLL+L K + I A L TLPG+GP
Sbjct: 188 ------------AKYIANAAKCLIE--LGGKEWLLNLHKKNNVSYIQARKQLITLPGIGP 233
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
KVA CI L SL +IPVDTH+++IA LP L + +TPK+ V+ E
Sbjct: 234 KVADCICLMSLGHLESIPVDTHIFQIAKTNYLPHLKQYKTVTPKIHEEVSNYLRE 288
>gi|403221331|dbj|BAM39464.1| 7,8 dihydro-8-oxoguanine DNA glycosylase [Theileria orientalis
strain Shintoku]
Length = 279
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 145/299 (48%), Gaps = 33/299 (11%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ S W L + L L TGQ+F WK G + G + + +K
Sbjct: 6 DSDSTWFDLCVPHDVLRPELLMTTGQSFGWKSVGDKHWAGVLDSSVYEIKQGDETTFYRT 65
Query: 93 IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
I+ S S K + + ++ E +S + + G R+L+QDP+E L+
Sbjct: 66 IYGSCS----KDRIWQYFDLN---HEYTIDYSKAPKSVHNIFNNNQGVRILQQDPLESLI 118
Query: 153 QFLCSSNNNIARITKMV-DFLASLGSHLG----NVEGFEFHEFPSLERLSLVSEVELRNA 207
FLCS+NNNI RITKMV D + G+ L N + +F+ FPSL++L V+ L+
Sbjct: 119 SFLCSTNNNIGRITKMVNDLRTNFGTFLLKADLNGKSVDFYTFPSLDQLCKVTPEFLKEL 178
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
GFGYRS +I T+ +L+S+ G +WL LR D DAL
Sbjct: 179 GFGYRS-----------------DFIFKTIHILKSR---GVKWLYDLRFEDYDFCRDALT 218
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
LPGVG KVA C+ LF L + ++PVD H+ KIA + + +G LT +++++AF
Sbjct: 219 KLPGVGRKVADCVLLFGLGKRESVPVDVHIKKIAESHFGVK-SGKSLTDSDYNKISQAF 276
>gi|194890941|ref|XP_001977410.1| GG18268 [Drosophila erecta]
gi|190649059|gb|EDV46337.1| GG18268 [Drosophila erecta]
Length = 327
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 148/299 (49%), Gaps = 36/299 (12%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGP---LQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
++L+ E L T GQ+FRW+ +Y+G + L+ + + Y +
Sbjct: 11 IDLSLEECDLERTLLGGQSFRWRSISDGSRTKYSGVVFNTYWVLQQ-EESFITYEAYGPS 69
Query: 98 SEPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQ 153
S K S + D+L M L + + D F ++L+ RVL Q+P E +
Sbjct: 70 SALDTKDYSSLISDYLRMDFDLKLNQKDWLNKDENFV---RFLSKPVRVLSQEPFENIFS 126
Query: 154 FLCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVS----EVELRNAG 208
FLCS NNNI RI+ M+ F A+ G+ +G+ G + + FP++ R + +LR A
Sbjct: 127 FLCSQNNNIKRISSMIQWFCATFGTKIGHFNGSDEYTFPTINRFHDIPCDELNAQLRAAK 186
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
FGYR AK+I T+ ++ K GG W + LR + ++A + L
Sbjct: 187 FGYR-----------------AKFIAQTLQEIRKK--GGQNWFIRLRSMPFEKAREELTL 227
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
LPG+G KVA CI L S+ ++PVD H+++IA +Y LP L+G + +T K+ V+ F
Sbjct: 228 LPGIGYKVADCICLMSMGHLESVPVDIHIYRIAQKYYLPHLSGQKNVTKKIYDEVSNHF 286
>gi|171684583|ref|XP_001907233.1| hypothetical protein [Podospora anserina S mat+]
gi|170942252|emb|CAP67904.1| unnamed protein product [Podospora anserina S mat+]
Length = 439
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 152/313 (48%), Gaps = 52/313 (16%)
Query: 15 RLTPQPPPTPPNPQTLTTNKPSKWTPLN-LTQSELSLPLTFPTGQTFRWKKTGPLQYTGP 73
++TP+P + L + PS WT ++ L+ + P ++FRW+K ++
Sbjct: 3 KITPEPS------RALYRHNPSLWTYVSPLSSIHFITTILIPPLESFRWRKINN-EWHCV 55
Query: 74 IGPHLISLK----HLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCR 129
+ LI+L HL I+ P E L +L++G+ L +L+ +S +D
Sbjct: 56 LSNRLITLTQSPTHLSYKSTLPAINPPPHE-TTLPLLQSYLSLGVPLTQLYTQWSITDPN 114
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
FA G R+L QD E L+ F+CSSNNNI+RIT MV+ L G ++ + G FH
Sbjct: 115 FARRCSSFTGIRILNQDAWETLISFICSSNNNISRITSMVNNLCLHYGPYITTISGEPFH 174
Query: 189 EFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
+FPS E LS EVE LR+ GFGYR AKYI T ++ +
Sbjct: 175 DFPSPEALS-GPEVESHLRSLGFGYR-----------------AKYIADTAQIISQQRPK 216
Query: 247 GAEWLLSLR----------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
G WL +R K ++A +AL L GVGPKV+ C+ L L +
Sbjct: 217 G--WLDQIRNPAVPPADNPKFQPGEKPTYRKAHEALLELTGVGPKVSDCVCLMGLGWWES 274
Query: 291 IPVDTHVWKIATR 303
+P+DTHVW+IA R
Sbjct: 275 VPIDTHVWQIAQR 287
>gi|448536092|ref|XP_003871069.1| Ogg1 protein [Candida orthopsilosis Co 90-125]
gi|380355425|emb|CCG24944.1| Ogg1 protein [Candida orthopsilosis]
Length = 400
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 141/299 (47%), Gaps = 54/299 (18%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HL-------- 84
KW L L ++E+SL GQTFRWK + ++ I ++ L+ H+
Sbjct: 6 KWNKLPLKEAEVSLAKVLRCGQTFRWKSINNV-WSFAISDRIVLLRQDPDHIYYSQILAK 64
Query: 85 ---QNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEG-------FSASDCRFAELA 134
+ GD+ H + + + D+ + + L +L+ + + +
Sbjct: 65 QKSKAGDIPLHPVDENVSKSTLNFIQDYFALDVKLTDLYHQWKLNHKPYITTKVKQQSPF 124
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
+G R LRQDP ECL+ F+CSSNNN+ RI+KM D L + G+H+ + G F+ FP
Sbjct: 125 DLFSGIRTLRQDPWECLISFICSSNNNVKRISKMCDNLCINFGNHINDYGGISFYSFPGP 184
Query: 194 ERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW- 250
+LS +E LR GFGYR AKYI T K + +
Sbjct: 185 RQLSADPSIETKLRELGFGYR-----------------AKYIYQTA----CKFTDDVNYP 223
Query: 251 ------LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L LR +EA + L L GVGPKVA CI L SLD+ + +P+DTHV++IA R
Sbjct: 224 DINLDRLNGLRHASYEEAHEFLLQLTGVGPKVADCICLMSLDKSNIVPIDTHVYQIAVR 282
>gi|346979375|gb|EGY22827.1| N-glycosylase/DNA lyase [Verticillium dahliae VdLs.17]
Length = 414
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 143/305 (46%), Gaps = 57/305 (18%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HLQNGDVCY 91
++W L L+ +EL + T GQ+FRW+K ++ + +ISLK HL +
Sbjct: 7 TEWRKLPLSLTELCIDTTLRCGQSFRWRKIND-EWCCTLHGRIISLKQDPTHLHYRSIWP 65
Query: 92 H----------IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGAR 141
+ T E + L +L++ + L L+E +S +D F + A G R
Sbjct: 66 AAAQRPTSAPIVKTETPEDDTEELLKHYLSLKLDLTSLYEQWSEADANFRKRAPKFGGVR 125
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS 200
+L QD E L+ F+CSSNNNI+RI++MV L G +G++ FH+FP+ E L+
Sbjct: 126 MLSQDAWEALISFICSSNNNISRISQMVHKLCLHYGPLIGHINEQPFHDFPTPEALTGKQ 185
Query: 201 -EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK--- 256
E LR GFGYR AKYI T V+ ++ G WL L
Sbjct: 186 VESHLRELGFGYR-----------------AKYIAETARVVSTEKPKG--WLAGLANPAH 226
Query: 257 ------------------LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
+EA +AL L GVGPKVA C+ L L A+PVDTHVW
Sbjct: 227 PNFRAASEAEDEKMLPPPTTYKEAHEALLALAGVGPKVADCVCLMGLGWGEAVPVDTHVW 286
Query: 299 KIATR 303
+IA R
Sbjct: 287 QIAQR 291
>gi|324516785|gb|ADY46633.1| N-glycosylase/DNA lyase [Ascaris suum]
Length = 372
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 95/293 (32%), Positives = 137/293 (46%), Gaps = 41/293 (13%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKK----------TGPLQYTGPIGPHLISLKHLQNGDVC 90
L +SEL+L + GQ+FRW+K +G + G L + ++
Sbjct: 4 LRCLKSELNLDVVLLNGQSFRWRKESTADRKCSLSGRQYFVGVAKHRFWRLWREDDANIG 63
Query: 91 YHI--HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKY----LAGARVLR 144
Y + + AL ++ +GISL L+ + +D FA+ K L G R+L
Sbjct: 64 YDVLARFKKCNDDDEIALRNYFQLGISLASLYAQWKKNDENFAKALKISGPLLEGIRILA 123
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFE----FHEFPSLERLSLVS 200
QDP+E L F+CSSNNNI RI+KM++ L L + F ++F L R++
Sbjct: 124 QDPIETLFSFICSSNNNIKRISKMIERLCELYGECITLSSFPSQEIVYDFADLARMATDD 183
Query: 201 EVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR +GFGYR+A + R+ K I GG WL L
Sbjct: 184 AMESKLRESGFGYRAA-------YLHRAAKNLHEI------------GGELWLNELANET 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
A L LPGVGPKVA CI L SL + +PVDTH+++I +P+L G
Sbjct: 225 YDIAKQKLQQLPGVGPKVADCICLMSLGKSDVVPVDTHIFQITAARYMPKLNG 277
>gi|118384668|ref|XP_001025476.1| HhH-GPD superfamily base excision DNA repair protein [Tetrahymena
thermophila]
gi|89307243|gb|EAS05231.1| HhH-GPD superfamily base excision DNA repair protein [Tetrahymena
thermophila SB210]
Length = 327
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 120/229 (52%), Gaps = 34/229 (14%)
Query: 85 QNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLR 144
+N +V Y P + + D+L + + L +L +S D +A++ + L G R+LR
Sbjct: 15 ENNEVMYK--EIPENSKIEEIINDYLQLNVDLVKLTVEWSEKDNYYAKVKEQLKGVRILR 72
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGN-------VEGFE--FHEFPSLER 195
Q P EC++ F+CS NNNI RITK+ L SL + G +G E ++ FPS+
Sbjct: 73 QFPFECMISFICSQNNNIPRITKI---LKSLRQNFGEEIYREKLEDGSEEIYYSFPSVAS 129
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L +E LR+ G GYR A YI V +L+ K GG E+L LR
Sbjct: 130 LRKATEQNLRDLGLGYR-----------------ANYIVSAVQLLEEK--GGEEYLHKLR 170
Query: 256 KL-DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+L D E +AL G+G KVA CI+LFSLD IPVDTHV++I R
Sbjct: 171 QLKDSHEKREALLEFKGIGNKVADCISLFSLDAKDLIPVDTHVFQIYNR 219
>gi|302915707|ref|XP_003051664.1| hypothetical protein NECHADRAFT_5326 [Nectria haematococca mpVI
77-13-4]
gi|256732603|gb|EEU45951.1| hypothetical protein NECHADRAFT_5326 [Nectria haematococca mpVI
77-13-4]
Length = 362
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 98/294 (33%), Positives = 141/294 (47%), Gaps = 53/294 (18%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W L ++ +EL + T GQ+FRW+K ++T + ++ LK H+H
Sbjct: 1 WRKLPVSLAELCINTTLRCGQSFRWRKIDH-EWTCTLHGRILHLKQDST-----HLHYRV 54
Query: 98 SEPAAKSA----------LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
+ PA KS L + ++ + LG L+E +S +D F + A G R+L QD
Sbjct: 55 TWPAVKSQASPKDDTEALLRHYFSLKLDLGTLYEQWSEADPNFRKRAPQFKGVRILSQDA 114
Query: 148 VECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELR 205
E L+ F+CSSNNNI+RI++MV L G +G+V FH+FP+ L+ E LR
Sbjct: 115 WEALICFICSSNNNISRISQMVHKLCQHYGPLIGHVGDEAFHDFPTPHDLTGDRVEAHLR 174
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK--------- 256
GFGYR AKYI T ++ + WL +LR
Sbjct: 175 ELGFGYR-----------------AKYIAETARMVAKEKPDN--WLETLRNPAHPGFNTP 215
Query: 257 -------LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+EA + L +L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 216 RVPDEQHATYKEAQEQLLSLKGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 269
>gi|294659829|ref|XP_462255.2| DEHA2G16324p [Debaryomyces hansenii CBS767]
gi|199434263|emb|CAG90751.2| DEHA2G16324p [Debaryomyces hansenii CBS767]
Length = 376
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 140/289 (48%), Gaps = 44/289 (15%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY------ 91
W + Q E+ L GQTFRWK + ++ +I LK Q+ + +
Sbjct: 7 WKKFPVKQVEVKLSKVLRCGQTFRWKNVNDI-WSYTTENKIILLK--QDEEYIHYSWIAA 63
Query: 92 -HIHTSPS-----EPAAKSALLDFLNMGISLGELWEGFSASDCRF------AELAKYLAG 139
H+ T + + D+ N+ I L L+E + D F + +K+ G
Sbjct: 64 EHMQTQSKINCHFDKETLDFIKDYFNLPIKLETLYEEWIEKDGLFKISQQKSAFSKF-TG 122
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSL 198
R+LRQ+P E L+ F+CSSNNN+ RI+KM D + + G ++ G +++ FP+ LS
Sbjct: 123 IRILRQEPWETLISFICSSNNNVKRISKMCDNICTEFGKYINEYNGIKYYSFPTAVDLSS 182
Query: 199 VSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH--SGGAEWLLSL 254
+VE LR+ GFGYR AKYI T S + L L
Sbjct: 183 SPKVESRLRDLGFGYR-----------------AKYIYQTALKFTSDEFPDITLKNLYQL 225
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R+ + + A D L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 226 RQENYETAHDFLLQLTGVGPKVADCICLMALDKHDIVPVDTHVYQIAIR 274
>gi|190348010|gb|EDK40389.2| hypothetical protein PGUG_04487 [Meyerozyma guilliermondii ATCC
6260]
Length = 400
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 144/293 (49%), Gaps = 54/293 (18%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH--- 94
W + + ELSL GQTFRWK + ++ + +I LK + Y IH
Sbjct: 4 WNKFPIKEVELSLAKVLRCGQTFRWKNINNV-WSFSLDDKVIFLKQDE-----YSIHYSW 57
Query: 95 ------TSPSEPAAKSALL----DFLNMGISLGELWEGFS-----ASDCRFAELAKYLAG 139
+S S +L D+ N+ ++L L++ +S + G
Sbjct: 58 IMKEGVSSKDRFDTSSDVLGFVRDYFNLNVNLESLYQDWSHRSSLVKAATHGSAFQRFTG 117
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSL 198
R+LRQDP E L+ F+CSSNNN+ RI+KM D L + G+++ G +++ FP+ + LS
Sbjct: 118 IRILRQDPWETLVSFICSSNNNVKRISKMCDSLCTEFGTYIATHGGVDYYSFPTAQALSS 177
Query: 199 VSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
VE LR GFGYR AKYI T +L + LSL K
Sbjct: 178 SPSVETKLRELGFGYR-----------------AKYIYQTALMLTDPDNPD----LSLAK 216
Query: 257 L-DLQ-EAIDA----LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L DL+ E+ +A L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 217 LYDLRLESYEACSEFLIQLSGVGPKVADCICLMALDKHDVVPVDTHVYQIAIR 269
>gi|115400043|ref|XP_001215610.1| hypothetical protein ATEG_06432 [Aspergillus terreus NIH2624]
gi|114191276|gb|EAU32976.1| hypothetical protein ATEG_06432 [Aspergillus terreus NIH2624]
Length = 437
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 113/341 (33%), Positives = 155/341 (45%), Gaps = 86/341 (25%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLK----HL 84
+T S+W L ++ SEL + T GQ+FRW+K ++ + HL+SLK HL
Sbjct: 1 MTVGAFSEWRKLPVSLSELCINTTLRCGQSFRWQKVPDTDEWRCVLHGHLLSLKQDPLHL 60
Query: 85 -----QNGDVCYHIHTSPS------EPAAK------SALLDFLNMGISLGELWEGFSASD 127
+N I T PS EP + S L + N+ +L L+E +SA D
Sbjct: 61 YYRTYRNTAPSRPIPTPPSSDKPATEPTTQEDTTTLSLLAHYFNLSSNLTSLYEQWSAHD 120
Query: 128 CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFE 186
F + A G R+LRQD E L+ F+CSSNNNIARI++MV+ L G H+ ++G
Sbjct: 121 ANFRKKAPQFTGVRILRQDAWEALVSFICSSNNNIARISQMVEKLCLHYGPHIATLDGRA 180
Query: 187 FHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
+H+FP+ E L+ E LR+ GFGYR AKYI T ++ +
Sbjct: 181 YHDFPAPEALTGDDVEPRLRSLGFGYR-----------------AKYIYQTAVIVAQQRE 223
Query: 246 GGAEWLLSLR--------------------------KLDLQEAIDALCTLPGVGPKVAAC 279
G WL SLR + +EA + L L GVGPKVA C
Sbjct: 224 KG--WLDSLRNPEAPAFGVGGSGEGTEESVEMRPEGRQGYREAHEKLLELQGVGPKVADC 281
Query: 280 IALFSLDQHHAIPVDTH-----------------VWKIATR 303
+ L L A+PVDTH VW+IA R
Sbjct: 282 VCLMGLGWGEAVPVDTHGEFVDDAGYVRDVLTWTVWQIAQR 322
>gi|307203199|gb|EFN82354.1| N-glycosylase/DNA lyase [Harpegnathos saltator]
Length = 330
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/270 (35%), Positives = 137/270 (50%), Gaps = 27/270 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+EL+L LT GQ+FRW Y G + +L + Y + +
Sbjct: 16 TELNLALTLHGGQSFRWTDCDA-GYRGVFDGCVWTLSQ-NKTHLLYAVQGRLKDSTDYDN 73
Query: 106 LL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L ++L++ ISL E ++ ++A D F + R+L+QD VE L F+CSSNNN+ R
Sbjct: 74 VLSEYLSLSISLQEQYKRWTAVDMYFQKCLNGNNAVRILQQDVVETLFSFICSSNNNVMR 133
Query: 165 ITKMVDFLASL-GSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFS 222
I+ MV+ L L G + +VE E+++FP++E L +E L+ FGYR
Sbjct: 134 ISSMVEKLCLLFGQKICSVENKEYYDFPAIESLKEKNTENILKREKFGYR---------- 183
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK---LDLQEAIDALCTLPGVGPKVAAC 279
A YI L GG WLLSL++ + A + L TLPG+GPKVA C
Sbjct: 184 -------AGYIANAAAHLS--RLGGKNWLLSLQRKNNISYCTAREQLMTLPGIGPKVADC 234
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
I L SL AIPVDTH+++IA LP L
Sbjct: 235 ICLMSLGHLDAIPVDTHIFQIARTNYLPHL 264
>gi|146415576|ref|XP_001483758.1| hypothetical protein PGUG_04487 [Meyerozyma guilliermondii ATCC
6260]
Length = 400
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 145/293 (49%), Gaps = 54/293 (18%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH--- 94
W + + ELSL GQTFRWK + ++ + +I LK + Y IH
Sbjct: 4 WNKFPIKEVELSLAKVLRCGQTFRWKNINNV-WSFSLDDKVIFLKQDE-----YSIHYSW 57
Query: 95 ------TSPSEPAAKSALL----DFLNMGISLGELWEGFS-----ASDCRFAELAKYLAG 139
+S S +L D+ N+ ++L L++ +S + G
Sbjct: 58 IMKEGVSSKDRFDTSSDVLGFVRDYFNLNVNLELLYQDWSHRSSLVKAATHGSAFQRFTG 117
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSL 198
R+LRQDP E L+ F+CSSNNN+ RI+KM D L + G+++ G +++ FP+ + LS
Sbjct: 118 IRILRQDPWETLVSFICSSNNNVKRISKMCDSLCTEFGTYIATHGGVDYYSFPTAQALSS 177
Query: 199 VSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+ VE LR GFGYR AKYI T +L + LSL K
Sbjct: 178 LPSVETKLRELGFGYR-----------------AKYIYQTALMLTDPDNPD----LSLAK 216
Query: 257 L-DLQ-EAIDA----LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L DL+ E+ +A L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 217 LYDLRLESYEACSEFLIQLSGVGPKVADCICLMALDKHDVVPVDTHVYQIAIR 269
>gi|358369528|dbj|GAA86142.1| 8-oxoguanine DNA glycosylase [Aspergillus kawachii IFO 4308]
Length = 418
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 100/306 (32%), Positives = 144/306 (47%), Gaps = 52/306 (16%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W L ++ +EL + T GQ+FRW ++ + L+SLK
Sbjct: 1 MTVGAFSEWRKLPVSLNELCINTTLRCGQSFRWHNVPDTDEWRCVLHGRLLSLKQ-DPTH 59
Query: 89 VCYHIHTSPSEPAAK--------SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
+ Y + + P K S L + N+ +L L+ +SA D F + A G
Sbjct: 60 LYYRTYRTTQSPKPKNNDDDTTLSLLTHYFNLSANLTSLYTEWSAQDPNFRKKAPQFTGI 119
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV 199
R+LRQD E L+ F+CSSNNNIARI++MV+ L + G + ++ G +H FP+ + L+
Sbjct: 120 RILRQDAWEALISFICSSNNNIARISQMVEKLCLNYGPFIASINGRAYHGFPAPDALTAT 179
Query: 200 S-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL---- 254
E +LR GFGYR AKYI T ++ + G WL SL
Sbjct: 180 DVEGKLRGLGFGYR-----------------AKYIYQTAVMVAKEREAG--WLDSLCNPE 220
Query: 255 -----------------RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ +EA + L L GVGPKVA C+ L L A+PVDTHV
Sbjct: 221 CPAWRVEAAPAGEMKPEGREGYREAHEKLLELQGVGPKVADCVCLMGLGWGEAVPVDTHV 280
Query: 298 WKIATR 303
W+IA R
Sbjct: 281 WQIAQR 286
>gi|388580364|gb|EIM20679.1| DNA glycosylase [Wallemia sebi CBS 633.66]
Length = 344
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 116/211 (54%), Gaps = 25/211 (11%)
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAEL--AKYLAGARVLRQDPVECLLQFLCSS 158
+ KS + D+ + + +L++ +SA D F + G +LRQDP ECL+ F+CSS
Sbjct: 39 SDKSFIHDYFQLDTRIIDLYDDWSARDRHFKSIKDTNKFDGILILRQDPWECLISFICSS 98
Query: 159 NNNIARITKMVDFLASLGSH-LGNV-EGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAP 215
NNNIARI++M+ L++ S LG++ E + + FP +L+ E LR GFGYR
Sbjct: 99 NNNIARISQMITKLSTTFSEPLGDLDEIYSRYPFPPPSKLAGEDVEDTLRTLGFGYR--- 155
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE---WLLSLRKLDLQEAIDALCTLPGV 272
AKY+ +L +H +L SLRK + I L + GV
Sbjct: 156 --------------AKYVANVAKMLVEEHGSDENIFTYLHSLRKESYENVIPQLTRMMGV 201
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GPKVA C+AL SLDQH +IPVDTHVW+IA R
Sbjct: 202 GPKVADCVALMSLDQHSSIPVDTHVWQIAVR 232
>gi|452841880|gb|EME43816.1| hypothetical protein DOTSEDRAFT_53107 [Dothistroma septosporum
NZE10]
Length = 414
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 103/336 (30%), Positives = 152/336 (45%), Gaps = 65/336 (19%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
++W L ++ SEL + T GQ+FRW+K ++ + ++SL Q+ + Y+
Sbjct: 7 AEWQKLPISLSELCINTTLRCGQSFRWRKNDAGVWSMALHNRILSL--YQDPEHLYYRTI 64
Query: 96 SPSEPAAKSALL-----------------------DFLNMGISLGELWEGFSASDCRFAE 132
PS+ ++ + N+ L+ +S SD FA+
Sbjct: 65 WPSQQTLQAPPTPPSSKPPSIPEDDKDDDTLDLLHHYFNLKADSKSLYNQWSTSDPNFAK 124
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFP 191
A G R++ QD E L+ F+CSSNNNI+RI++MV L + G + + E +H+FP
Sbjct: 125 KAPKFTGIRIMNQDAWEALIGFICSSNNNISRISQMVHKLCTNYGPLINHFEDEAYHDFP 184
Query: 192 SLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG---- 246
+ + L+ E +LR GFGYR AKYI T ++ SK SG
Sbjct: 185 TPQSLAQDGVETKLRALGFGYR-----------------AKYIHQTARIVASKESGWLDS 227
Query: 247 ---------------GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
EW +S R+ + A +AL L GVGPKVA C+ L L AI
Sbjct: 228 LRDPECPTLGTSPSPAGEWSISGRE-GYRHAHEALLDLQGVGPKVADCVCLMGLGWSEAI 286
Query: 292 PVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
PVDTHVW+IA R Y + LT + E F
Sbjct: 287 PVDTHVWQIAQRDYRFGKGKHASLTKGTYDAIGEKF 322
>gi|451849635|gb|EMD62938.1| hypothetical protein COCSADRAFT_37822 [Cochliobolus sativus ND90Pr]
Length = 407
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 149/312 (47%), Gaps = 63/312 (20%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ ++W L + +EL + T GQ+FRW+K+ +++ + ++SL+ Q+ ++
Sbjct: 4 RAAEWHKLPTSLTELCIDTTLRCGQSFRWRKSAEGEWSAALHGRILSLR--QDPQYLHYR 61
Query: 94 HTSPS------------------EPAAKSALLD-FLNMGISLGELWEGFSASDCRFAELA 134
T PS E AL++ + N+ +LG L++ ++ASD F + A
Sbjct: 62 VTYPSITTTLPTPPLSNAPSVAPEEDDTLALVNHYFNLSPNLGHLYDQWAASDANFRKRA 121
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
G R+LRQD E L+ F+CSSNNNI+RI+ MV L G +G+++ +H+FP+
Sbjct: 122 PKFTGIRILRQDAWEALIGFICSSNNNISRISGMVHNLCLHYGPLIGHIDEVPYHDFPTP 181
Query: 194 ERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
LS E L GFGYR AKYI T ++ G +WL
Sbjct: 182 VALSGPDVEAHLMKLGFGYR-----------------AKYIAKTARIVSE--DKGLKWLE 222
Query: 253 SLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
L + ++A + L L GVGPKVA C+ LF L ++
Sbjct: 223 DLSNPEYPQFGVEEKPAGDMLEGGREGYRKAHEELLALHGVGPKVADCVCLFGLGWSESV 282
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 283 PVDTHVWQIAQR 294
>gi|452001452|gb|EMD93911.1| hypothetical protein COCHEDRAFT_1093403 [Cochliobolus
heterostrophus C5]
Length = 408
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 146/312 (46%), Gaps = 63/312 (20%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ ++W L + +EL + T GQ+FRW+++ ++ + ++SL+ Q+ ++
Sbjct: 4 RAAEWHKLPTSLTELCIDTTLRCGQSFRWRRSAEGVWSAALHGRILSLR--QDSQYLHYR 61
Query: 94 HT-------------------SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELA 134
T +P E + + + N+ +LG L+E ++ASD F + A
Sbjct: 62 ATYPPITTALPTPPPSNAPSVAPEEDDTLTLVNHYFNLSPNLGHLYEQWAASDSNFRKRA 121
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSL 193
G R+LRQD E L+ F+CSSNNNI+RI+ MV L G +G+++ +H+FP+
Sbjct: 122 PKFTGIRILRQDAWEALIGFICSSNNNISRISGMVHNLCLHYGPLIGHIDEVPYHDFPTP 181
Query: 194 ERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
LS E L GFGYR AKYI T ++ G +WL
Sbjct: 182 AALSGPDVEAHLMKLGFGYR-----------------AKYIAKTARIVSE--DKGLKWLE 222
Query: 253 SLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
L + ++A + L L GVGPKVA C+ LF L ++
Sbjct: 223 DLSNPEYPQFGVEEKPAGEMLEGGREGYRKAHEELLALHGVGPKVADCVCLFGLGWSESV 282
Query: 292 PVDTHVWKIATR 303
PVDTHVW+IA R
Sbjct: 283 PVDTHVWQIAQR 294
>gi|317029340|ref|XP_001391372.2| DNA N-glycosylase [Aspergillus niger CBS 513.88]
Length = 416
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 146/307 (47%), Gaps = 54/307 (17%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W L ++ +EL + T GQ+FRW ++ + L+SLK
Sbjct: 1 MTVGAFSEWRKLPVSLNELCINTTLRCGQSFRWHNVPDTDEWRCVLHGRLLSLKQ-DPTH 59
Query: 89 VCYHIHTSPSEPAAK--------SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
+ Y + + P K S L + N+ +L L+ +SA D F + A G
Sbjct: 60 LYYRTYRTAQSPKPKINDDDTTLSLLTHYFNLSANLTSLYTEWSAQDPNFRKKAPQFTGI 119
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV 199
R+LRQD E L+ F+CSSNNNIARI++MV+ L + G + ++ G +H FP+ + L+
Sbjct: 120 RILRQDAWEALISFICSSNNNIARISQMVEKLCLNYGPFIASINGRAYHGFPAPDALTAH 179
Query: 200 S-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT-VDVLQSKHSGGAEWLLSL--- 254
E +LR GFGYR AKYI T V V + + SG WL SL
Sbjct: 180 DVEGKLRGLGFGYR-----------------AKYIYQTAVMVAKERESG---WLDSLCNP 219
Query: 255 ------------------RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
+ +EA + L L GVGPKVA C+ L L A+PVDTH
Sbjct: 220 ECPAWGVEAAPAGEMKPEGREGYREAHEKLLELQGVGPKVADCVCLMGLGWGEAVPVDTH 279
Query: 297 VWKIATR 303
VW+IA R
Sbjct: 280 VWQIAQR 286
>gi|453082789|gb|EMF10836.1| DNA glycosylase [Mycosphaerella populorum SO2202]
Length = 716
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 151/328 (46%), Gaps = 81/328 (24%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
++W L ++ SEL + T GQ+FRW+K ++ + ++SL Q+ D Y+
Sbjct: 294 AEWQRLPVSLSELCINTTLRCGQSFRWRKNDLGVWSIALHNRILSLH--QDKDYLYYRSI 351
Query: 96 SPS----------------------EPAAKSALLD----FLNMGISLGELWEGFSASDCR 129
PS E LD +LN+ + EL++ +SA+D
Sbjct: 352 PPSLTKIKLEAPPTPPSSKPPSLPEEGDMDDDTLDLVRHYLNLSPNSIELYKQWSANDAN 411
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFH 188
FA A G R+L+QD E L+ F+CSSNNNI+RI++MV L + G +LG ++ +H
Sbjct: 412 FARRAPAFTGIRILQQDAWEALVGFICSSNNNISRISQMVHKLCTNYGPYLGTLDEEAYH 471
Query: 189 EFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FPS L++ E LRN GFGYR AKYI T ++ K
Sbjct: 472 DFPSPNALAVPGVEQTLRNLGFGYR-----------------AKYIATTARLVSEKED-- 512
Query: 248 AEWLLSLRK--------------------------LD------LQEAIDALCTLPGVGPK 275
WL SLR +D + A ++L TL GVGPK
Sbjct: 513 PNWLDSLRNPASPSYLNSNIKSPLLSGEEEDNGYDVDPGGRPGYRAAHESLLTLQGVGPK 572
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATR 303
VA C+ L L A+PVDTHVW+IA R
Sbjct: 573 VADCVCLMGLGWGEAVPVDTHVWQIAQR 600
>gi|261196610|ref|XP_002624708.1| 8-oxoguanine DNA glycosylase [Ajellomyces dermatitidis SLH14081]
gi|239595953|gb|EEQ78534.1| 8-oxoguanine DNA glycosylase [Ajellomyces dermatitidis SLH14081]
gi|239609529|gb|EEQ86516.1| 8-oxoguanine DNA glycosylase [Ajellomyces dermatitidis ER-3]
gi|327350237|gb|EGE79094.1| 8-oxoguanine DNA glycosylase [Ajellomyces dermatitidis ATCC 18188]
Length = 423
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 152/317 (47%), Gaps = 74/317 (23%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH- 94
S+W L ++ +EL + T GQ+FRW+K+G +++ + +++L+ H+H
Sbjct: 19 SEWRKLPISLTELCINTTLRCGQSFRWQKSGDNEWSCALYGRIVTLRQDPT-----HLHY 73
Query: 95 -------------TSPSEPAAK-----------SALLD-FLNMGISLGELWEGFSASDCR 129
+ S PA++ AL++ + N+ ++L L+E +S +D
Sbjct: 74 RSYFPPAPPALPTPASSVPASRREASCEKVDDTEALINNYFNLDLNLTGLYEQWSTADQN 133
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A G R+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 134 FKKKAPQFTGIRILRQDAWEALISFICSSNNNIARISQMVEKLCLNYGPLIGYIDKKPYH 193
Query: 189 EFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP L E LR GFGYR AKYI T ++ ++ G
Sbjct: 194 DFPPPNALVGRDVESRLRELGFGYR-----------------AKYIYQTALIVANREEG- 235
Query: 248 AEWLLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
WL SLR + ++A + L L GVGPKVA C L L
Sbjct: 236 --WLNSLRNPESPSFGANPAPAGEMTEGGRDGYRKAHELLLELQGVGPKVADCACLMGLG 293
Query: 287 QHHAIPVDTHVWKIATR 303
A+PVDTHVW+IA R
Sbjct: 294 WGEAVPVDTHVWQIAQR 310
>gi|389748669|gb|EIM89846.1| DNA glycosylase [Stereum hirsutum FP-91666 SS1]
Length = 450
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 152/345 (44%), Gaps = 83/345 (24%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL-----------QNGD 88
P+ L Q LSL GQ+FRW PL + P PH + ++ Q+ D
Sbjct: 13 PIPLVQ--LSLAAVLKCGQSFRWS-IYPLHASAPSDPHFPTHEYRFCLRDRVICLRQSRD 69
Query: 89 VCYHIHTSPS-----------EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYL 137
++ P E + + D+ + + L +L++ +S D FA L +
Sbjct: 70 TLFYRTVLPESSRSSSERSQYEAETLAWINDYFQLDVDLVKLYDEWSKRDAVFARLRERF 129
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-------------LGSHLGN--- 181
G R+LRQDP E L+ F+CSSNNNIARITKMV+ L S L S +
Sbjct: 130 EGIRMLRQDPWENLISFICSSNNNIARITKMVNALCSNYSPPLLSMELPPLESDISQEPL 189
Query: 182 ----------VEGFEFHEFPSLERLSLVSEV--ELRNAGFGYRSAPQSSLLFSVRRSFKQ 229
E FH FP RL+ EV ELR GFGYR
Sbjct: 190 TPASNSPEPEQEQHTFHPFPPPSRLAQ-PEVASELRKLGFGYR----------------- 231
Query: 230 AKYITGTVDVLQSKHSGGA---------EWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A +I T +L H +WL++LR L +A L L GVG KVA CI
Sbjct: 232 ADFIQKTAKMLVDAHGASVADEVTEQPEKWLMTLRDLPTAQARVELLKLMGVGRKVADCI 291
Query: 281 ALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVR--LTPKLCSRV 322
L SLD+ +PVDTHV +IA + Y + +G + +TPKL V
Sbjct: 292 LLMSLDKREVVPVDTHVHQIAIKHYGMRGSSGAKQNMTPKLYDEV 336
>gi|350635493|gb|EHA23854.1| Hypothetical protein ASPNIDRAFT_48500 [Aspergillus niger ATCC 1015]
Length = 416
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 145/307 (47%), Gaps = 54/307 (17%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W L ++ +EL + T GQ+FRW ++ + L+SLK
Sbjct: 1 MTVGAFSEWRKLPVSLNELCINTTLRCGQSFRWHNVPDTDEWRCVLHGRLLSLKQ-DPTH 59
Query: 89 VCYHIHTSPSEPAAK--------SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
+ Y + + P K S L + N+ +L L+ +SA D F + A G
Sbjct: 60 LYYRTYRTTQSPKPKINDDDTTLSLLTHYFNLSANLTSLYTEWSAQDPNFRKKAPQFTGI 119
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLV 199
R+LRQD E L+ F+CSSNNNIARI++MV+ L + G + ++ G +H FP+ L+
Sbjct: 120 RILRQDAWEALISFICSSNNNIARISQMVEKLCLNYGPFIASINGRAYHGFPAPNALTAH 179
Query: 200 S-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT-VDVLQSKHSGGAEWLLSL--- 254
E +LR GFGYR AKYI T V V + + SG WL SL
Sbjct: 180 DVEGKLRGLGFGYR-----------------AKYIYQTAVMVAKERESG---WLDSLCNP 219
Query: 255 ------------------RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
+ +EA + L L GVGPKVA C+ L L A+PVDTH
Sbjct: 220 ECPAWGVEAAPAGEMKPEGREGYREAHEKLLELQGVGPKVADCVCLMGLGWGEAVPVDTH 279
Query: 297 VWKIATR 303
VW+IA R
Sbjct: 280 VWQIAQR 286
>gi|66801203|ref|XP_629527.1| 8-oxoguanine DNA-glycosylase [Dictyostelium discoideum AX4]
gi|60462915|gb|EAL61112.1| 8-oxoguanine DNA-glycosylase [Dictyostelium discoideum AX4]
Length = 439
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 154/318 (48%), Gaps = 55/318 (17%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTG----PLQYTGPIGPHLISLKHLQNGD---VCYHI 93
L L +++L L T + Q+F WK+ +Y G IG ++I L++ + + + Y
Sbjct: 44 LKLFKNKLDLKKTLFSAQSFIWKEIVINEIVNEYVGVIGDNIIVLRYKEGSEQDIIEYKF 103
Query: 94 HTSPSEPAAKSALL----------DFLNMGISLGELWEGFSASDC--------------- 128
S S L+ ++ + + + +L++ + +
Sbjct: 104 IDSEKRINHDSELMKMQDRISTLQNYFYLDLDINQLFKKWRNDEIGNDDNGDKKLHQLNH 163
Query: 129 RFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD-FLASLGSHLGNVEGFEF 187
+F + G R+ RQ+PV+CL F+CS NNN++RIT +V+ + GS + + ++
Sbjct: 164 QFRKSCDKFIGLRLTRQNPVDCLFSFICSQNNNVSRITSLVNKLIVGYGSKITDYNHTDY 223
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
++FP+L +LS + EL + GFGYRS KYI + Q GG
Sbjct: 224 YKFPTLSQLSNATVEELNDLGFGYRS-----------------KYIVESCK--QVIEKGG 264
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLP 307
+WL L+ E L TL GVG KVA C++L S+D +A+ +DTH+++I+ +Y LP
Sbjct: 265 EQWLNQLKLKPHLEVQKELTTLMGVGKKVADCVSLMSMDNPNAVAIDTHIYQISKKY-LP 323
Query: 308 ELAG--VRLTPKLCSRVA 323
LA LTPKL ++
Sbjct: 324 SLANSSSSLTPKLYQDIS 341
>gi|19173404|ref|NP_597207.1| 8-OXOGUANINE DNA GLYCOSYLASE [Encephalitozoon cuniculi GB-M1]
gi|19170993|emb|CAD26383.1| 8-OXOGUANINE DNA GLYCOSYLASE [Encephalitozoon cuniculi GB-M1]
Length = 290
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 139/284 (48%), Gaps = 55/284 (19%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTS 96
WTPL +++ + L T +GQ F +K+T +YTG +G L+S LQ+G+ V Y I +
Sbjct: 6 WTPLRASEA-IDLEKTLFSGQVFSFKRTDKGEYTGTLGGCLVSF--LQDGEKVLYRILSG 62
Query: 97 PSEPAAKSALLD-FLNMGISLG---ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
P + F + I+L ++W GF LAG R LR V +
Sbjct: 63 DKAPEDVEVDISYFFTLEINLRPLLQMW-GFDTDGS--------LAGLRALRYALVPTIF 113
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CSSNNNI+RITKMV FL S G + +G +FH FPSLE+L + E ELR GFGYR
Sbjct: 114 SFICSSNNNISRITKMVGFLYSKGEFIMKYKGLDFHHFPSLEKLVGI-EGELRANGFGYR 172
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI---DALCTL 269
S +YI + L + LQ+A D + ++
Sbjct: 173 S-----------------RYICSAAEYLMENYP------------RLQQASGVRDMMVSI 203
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 313
GVG K+A CI L L +PVDTH+++ + + L GVR
Sbjct: 204 KGVGDKIADCILLIGLGNLSVVPVDTHIFRHSRK-----LFGVR 242
>gi|449328819|gb|AGE95095.1| 8-oxoguanine DNA glycosylase [Encephalitozoon cuniculi]
Length = 290
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 133/270 (49%), Gaps = 50/270 (18%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTS 96
WTPL +++ + L T +GQ F +K+T +YTG +G L+S LQ+G+ V Y I +
Sbjct: 6 WTPLRASEA-IDLEKTLFSGQVFSFKRTDKGEYTGTLGGCLVSF--LQDGEKVLYRILSG 62
Query: 97 PSEPAAKSALLD-FLNMGISLG---ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
P + F + I+L ++W GF LAG R LR V +
Sbjct: 63 DKAPEDVEVDISYFFTLEINLRPLLQMW-GFDTDGS--------LAGLRALRYALVPTIF 113
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CSSNNNI+RITKMV FL S G + +G +FH FPSLE+L + E ELR GFGYR
Sbjct: 114 SFICSSNNNISRITKMVGFLYSKGEFIMKYKGLDFHHFPSLEKLVGI-EGELRANGFGYR 172
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI---DALCTL 269
S +YI + L + LQ+A D + ++
Sbjct: 173 S-----------------RYICSAAEYLMENYP------------RLQQASGVRDMMVSI 203
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
GVG K+A CI L L +PVDTH+++
Sbjct: 204 KGVGDKIADCILLIGLGNLSVVPVDTHIFR 233
>gi|402225313|gb|EJU05374.1| DNA glycosylase, partial [Dacryopinax sp. DJM-731 SS1]
Length = 311
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 147/314 (46%), Gaps = 42/314 (13%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPL--------QYTGPIGPHLISLKHLQN 86
P+ +T L + S L+L +GQ+F W PL +Y I LI L+ N
Sbjct: 4 PNGFTSLQIPISLLNLHTVLTSGQSFLWH-IYPLSPPHPQSHEYRLCITDRLICLRQ-DN 61
Query: 87 GDVCYHIHTSPSEPAAKSALLDFLN----MGISLGELWEGFSASDCRFAELA--KYLAGA 140
+ Y S + +LN + ++L L+ FS D FA LA L G
Sbjct: 62 THIYYRTVFPTSSSSPTETTESWLNSYFQLSVNLPSLYTSFSL-DPHFARLALRPQLQGI 120
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGS-------HLGNVEGFEFHEFPSL 193
R+LRQDP ECLL FLCS NN+IARIT MV L S H ++ FP+
Sbjct: 121 RILRQDPWECLLSFLCSQNNHIARITNMVHSLPEHFSLILVHEPHPSQDRTVQYRPFPAP 180
Query: 194 ERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E L+ E +LR GFGYR AKY++ T L S G LL
Sbjct: 181 EVLAKEGVESQLRALGFGYR-----------------AKYVSHTASALCSASPPGRAKLL 223
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGV 312
LR L L +A + L +LPGVGPKVA C L SLDQ +PVDTHV ++A RY +L
Sbjct: 224 ELRTLPLPQAREFLLSLPGVGPKVADCTLLMSLDQPGVVPVDTHVLRVAKRYYGLKLGKG 283
Query: 313 RLTPKLCSRVAEAF 326
K RVA+
Sbjct: 284 MGGGKGYERVAQGL 297
>gi|195133718|ref|XP_002011286.1| GI16083 [Drosophila mojavensis]
gi|193907261|gb|EDW06128.1| GI16083 [Drosophila mojavensis]
Length = 324
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 99/293 (33%), Positives = 148/293 (50%), Gaps = 39/293 (13%)
Query: 49 SLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL--QNGDVCYHIHTSPSEPAAK--- 103
L T GQ+FRW+K G L + L + + Y H + + AAK
Sbjct: 19 DLERTLLGGQSFRWRKLCSEPEVKYCGVALNTYWELTPEKDHIAYDAHVADN-TAAKINF 77
Query: 104 SALL-DFLNMGISLGELWEGFSASDCRFAELAKYLAG--ARVLRQDPVECLLQFLCSSNN 160
+ALL D+L L + + +D F K G R+L Q+P+E ++ F+CS NN
Sbjct: 78 TALLSDYLRADFDLELQQKNWMQADENF----KKFVGQPVRILAQEPLENIICFMCSQNN 133
Query: 161 NIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLE-----RLSLVSEVELRNAGFGYRSA 214
NI RI+ MV +L S G+ +G+ G + FP+L+ R + ELR A FGYR
Sbjct: 134 NIKRISSMVQWLCSTYGNKIGHFNGQDEFSFPTLKDLQGGRTCSQLDAELRAAKFGYR-- 191
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
AK+I ++ ++ GG +W LR++ EA +AL LPG+G
Sbjct: 192 ---------------AKFIARSLTEIEK--LGGLDWFEQLRRMPYAEAREALVQLPGIGY 234
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
KVA CI L SL+ ++PVDTH++K+A R+ LP LA + +T K+ +A+ F
Sbjct: 235 KVADCICLMSLNHLQSVPVDTHIFKLAQRHYLPHLASQKSVTSKIYEEIAQHF 287
>gi|157134929|ref|XP_001663362.1| 8-oxoguanine DNA glycosylase [Aedes aegypti]
gi|108870365|gb|EAT34590.1| AAEL013179-PA [Aedes aegypti]
Length = 372
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/345 (30%), Positives = 147/345 (42%), Gaps = 73/345 (21%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ--YTGPIGPHLISLKHLQNGDVCYHI 93
S+W + L L T GQ+FRW K Q + G + LK + D+ Y I
Sbjct: 2 SQWMKIPCLHKHLQLKTTLTGGQSFRWNKHDSFQDEFIGVFANIVWILKQTET-DLLYKI 60
Query: 94 -------------HTS-----------------------PSEPAA-----------KSAL 106
H S P A+ +S L
Sbjct: 61 VGELPYPNPNNQDHVSDVKKQRISAKKENLAQIRLKVPEPDRYASCGKLLYPQSYYESLL 120
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ + + L ++ ++ F A R L QDPVE L F+CS NNNI+RI+
Sbjct: 121 RVYFRLDVDLERCYDQWNKCHVHFENSADQFYAVRQLDQDPVENLFSFICSQNNNISRIS 180
Query: 167 KMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+V+ + + G + G F+ FP +L S E LR FGYR
Sbjct: 181 GLVEKICTHYGDKICEYNGTTFYNFPDAGKLASQQVEEHLRQLSFGYR------------ 228
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AKYI + D + +K G EW L++LD +EA L TL G+GPKVA CI L S
Sbjct: 229 -----AKYIQKSADEILAK--GDLEWFRKLQQLDYKEAHRELLTLTGIGPKVADCICLMS 281
Query: 285 LDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
L+ AIPVDTHV++IA Y LP LA + ++ K+ + + F E
Sbjct: 282 LNHLQAIPVDTHVFQIARNY-LPHLAKCKTISGKMYGEIGDKFRE 325
>gi|260948792|ref|XP_002618693.1| hypothetical protein CLUG_02152 [Clavispora lusitaniae ATCC 42720]
gi|238848565|gb|EEQ38029.1| hypothetical protein CLUG_02152 [Clavispora lusitaniae ATCC 42720]
Length = 494
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 151/344 (43%), Gaps = 72/344 (20%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK----HLQ-------- 85
W + +T +ELSL GQTFRWK + ++ ++ LK HL
Sbjct: 6 WRKIPITPAELSLAAVLRCGQTFRWKNVNSV-WSFSTNDRVVLLKQDDAHLHYASIMGNA 64
Query: 86 ----NGDV--CYHIHTSP------------------------SEPAAKSALLDFLNMGIS 115
GDV SP S+ + + + D+ N+ +S
Sbjct: 65 STRGGGDVSGARSARNSPTDNKNSIDGAQNDNSPPVENIKNESQDSTLAFIHDYFNLHVS 124
Query: 116 LGELWEGFSASDCRFAELAKYLA---GARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
L EL+ ++A + + A G R+LRQDP E ++ F+CSSNNN+ RI+KM D L
Sbjct: 125 LSELYAHWTAVETQARGKCTSFAQFPGIRILRQDPWETVVSFICSSNNNVKRISKMCDAL 184
Query: 173 -ASLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQ 229
A G L +G ++ FP + LS EVE LR GFGYR
Sbjct: 185 CAEYGRFLARHDGIDYFSFPGPQVLS-SPEVEGRLRELGFGYR----------------- 226
Query: 230 AKYITGTVDVLQSKH--SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI T + E L SLR EA + L L GVGPKVA CI L +LD+
Sbjct: 227 AKYIASTAKMFADSKWPHISLERLESLRTKPFAEAHEFLLQLTGVGPKVADCICLMALDK 286
Query: 288 HHAIPVDTHVWKIATR---YLLPELAGVRLTPKLCSRVAEAFCE 328
H +PVDTHV +IA R Y P + + +A+ F E
Sbjct: 287 HDVVPVDTHVLQIAVRDYKYRGPRTMNKKTYEAVRGHLADLFGE 330
>gi|409046031|gb|EKM55511.1| hypothetical protein PHACADRAFT_96257 [Phanerochaete carnosa
HHB-10118-sp]
Length = 356
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 143/317 (45%), Gaps = 48/317 (15%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH-----LQNGDVC---- 90
PL+L Q LSL GQ+FRW Q +G + H LQ+ VC
Sbjct: 15 PLSLAQ--LSLVAVLQCGQSFRWSVFPLTQGVPEVGTDTATPTHEYRFCLQDRVVCLRQT 72
Query: 91 ----YHIHTSP----------SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
Y+ P E S + D+ + L EL+E +S D F L
Sbjct: 73 SDALYYRSVCPPSSELLDDETREARTLSWIRDYFQLDNDLLELYEEWSGRDPVFRNLRLR 132
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA--------SLGSHLGNVEGF--- 185
+G R+LRQDP ECLL F+CSSNNNI RITKMV L SL G E F
Sbjct: 133 FSGIRMLRQDPFECLLSFICSSNNNIKRITKMVRSLCTEYSPALLSLPPPDGAAEEFGIE 192
Query: 186 EFHEFPSLERLSLVSEV--ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK 243
+H FP LS EV LR GFGYR+ ++++ K G ++ ++
Sbjct: 193 AYHPFPPPSALS-APEVAANLRKLGFGYRAN-------FIQKTAKMLVDAHGLSSIVGAR 244
Query: 244 HSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
EWL LR + +A L L GVG KVA CI L S+D+ +PVDTHV IA +
Sbjct: 245 PEPAEEWLFKLRAMTTADARAELLELMGVGRKVADCILLMSMDKPEVVPVDTHVHDIAKK 304
Query: 304 Y--LLPELAGVRLTPKL 318
Y A +TP+L
Sbjct: 305 YYGFSGSKAKTNMTPQL 321
>gi|254569604|ref|XP_002491912.1| Mitochondrial glycosylase/lyase [Komagataella pastoris GS115]
gi|238031709|emb|CAY69632.1| Mitochondrial glycosylase/lyase [Komagataella pastoris GS115]
gi|328351588|emb|CCA37987.1| N-glycosylase/DNA lyase [Komagataella pastoris CBS 7435]
Length = 320
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 144/287 (50%), Gaps = 41/287 (14%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W +NL +E+ L GQ FRW K + ++ + ++ L+ ++ + Y TS
Sbjct: 5 WHSINLPPNEIYLDKLLRCGQAFRWVKNNNI-WSSTLNNRVVFLRQTED-QLEYASLTSS 62
Query: 98 SEPAAKSALL-----------------DFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
+ ++ L ++LN+ IS+ +L++ +++ D FA+++ AG
Sbjct: 63 QKNSSNIKKLQPKLEEDTQDDVLGLISNYLNLQISIVQLYKDWASKDAHFAKVSAAFAGI 122
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPS-LERLSL 198
R+L+QDP E L+ F+CSSNNN+ RI+KM L G + G +++ FP+ ++ S
Sbjct: 123 RMLQQDPWETLISFICSSNNNVKRISKMCHALCLEYGDFIVEYAGTKYYSFPTPVQLASR 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE LR GFGYR A+Y+ T +L + + L S+R
Sbjct: 183 ASEASLRELGFGYR-----------------ARYVYETAQMLADDKALFMQ-LHSMRSSS 224
Query: 259 L--QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L GVGPKVA C+AL SL++H +P+DTHV + A R
Sbjct: 225 FTDEQVHEFLLQFKGVGPKVADCVALMSLNRHSLVPIDTHVLQFARR 271
>gi|336364947|gb|EGN93300.1| hypothetical protein SERLA73DRAFT_189864 [Serpula lacrymans var.
lacrymans S7.3]
gi|336377515|gb|EGO18677.1| hypothetical protein SERLADRAFT_481001 [Serpula lacrymans var.
lacrymans S7.9]
Length = 439
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 146/312 (46%), Gaps = 45/312 (14%)
Query: 47 ELSLPLTFPTGQTFRWK-----------KTGPLQYTGPIGPHLISLKHLQN---GDVCYH 92
+LSL GQ+FRW T +Y + ++ L+ N +
Sbjct: 21 QLSLAAVLKCGQSFRWSAFPLHVTTVDASTPTHEYRLCLRDRVVCLRQTPNFLFYRSVFP 80
Query: 93 IHTSPSEPAAKSA-----LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
SP++ A K L D+ + + L EL++ +SA D F L +G R+LRQDP
Sbjct: 81 KQLSPTQQAVKETETLAWLRDYFQLEVDLVELYDQWSARDAVFDNLRSRFSGIRMLRQDP 140
Query: 148 VECLLQFLCSSNNNIARITKMVDFLASLGS---------HLGNVEG--FEFHEFPSLERL 196
E L+ F+CSSNNNI+RITKMV L S H E +H FPS L
Sbjct: 141 WENLISFICSSNNNISRITKMVQSLCKQYSTPLLSLPPPHETTEEQQCQSYHPFPSPSAL 200
Query: 197 SLVSEV-ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
+ V LR+ GFGYR++ ++R+ K G + S EWL++LR
Sbjct: 201 AAPEVVGTLRSLGFGYRAS-------FIQRTAKMLVDTHGHTSL--SSLEASEEWLMTLR 251
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLP----ELA 310
L EA + L GVG KVA C+ L S+D+ IPVDTHV +IA + Y P A
Sbjct: 252 DLTTDEAREELLKFMGVGRKVADCVLLMSMDKKDVIPVDTHVHQIAIKHYGFPSSSSAKA 311
Query: 311 GVRLTPKLCSRV 322
+TPKL ++
Sbjct: 312 KASMTPKLYEKI 323
>gi|429849699|gb|ELA25054.1| n-glycosylase dna lyase [Colletotrichum gloeosporioides Nara gc5]
Length = 407
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 152/294 (51%), Gaps = 27/294 (9%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + S+W L L+ +EL + T GQ+FRW+K ++ + L+SLK Q+
Sbjct: 1 MSLARVSEWRKLPLSLAELCIDTTLRCGQSFRWRKIND-EWCCTLYGRLVSLK--QDPTH 57
Query: 90 CYHIHTSPSEPA----------------AKSALLDFLNMGISLGELWEGFSASDCRFAEL 133
++ T P +P + L +L++ + L L+E +S +D F +
Sbjct: 58 LHYKVTWPKKPVYPLTPPVADDEADGDDTEELLRHYLSLKLDLKSLYEQWSEADPNFRKR 117
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPS 192
A G R+L QD E L+ F+CSSNNNI+RI++MV L + G +G+V FH+FP+
Sbjct: 118 APEFGGVRMLSQDAWEALICFICSSNNNISRISQMVHKLCTHYGPLIGHVGDEAFHDFPT 177
Query: 193 LERLSLVS-EVELRNAGFGYRSA--PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
E L+ S E LR GFGYR+ Q++ + R + +T + K GA+
Sbjct: 178 PEALTGTSVEAHLRELGFGYRAKYIAQTASIVVNDRPKGWFESLTNPENPCYRKTPEGAK 237
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + +EA + L L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 238 ----LAQCTYKEAHEQLLQLSGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 287
>gi|344234851|gb|EGV66719.1| hypothetical protein CANTEDRAFT_91890 [Candida tenuis ATCC 10573]
Length = 363
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 139/291 (47%), Gaps = 49/291 (16%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS- 96
W L L ELSL GQTFRWK + ++ ++ L+ + H+H S
Sbjct: 7 WKSLPLKPVELSLSRVLRCGQTFRWKNIDHV-WSFTTSDRIVLLRQDEE-----HLHYSW 60
Query: 97 -------------PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK-----YLA 138
E ++D+ ++ SL +L+ +S + ++ + K
Sbjct: 61 IMEEDNKTMKPLKLRESETLEFIMDYFSLSTSLEKLYSDWSIVNQKYNKSVKNSPFVKFP 120
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFE-FHEFPSLERLS 197
G R+LRQDP E + F+CSSNNN+ RI+KM D L S NV G E F+ FP L+
Sbjct: 121 GIRILRQDPWETTISFICSSNNNVKRISKMCDSLCSEFGKFINVYGGESFYSFPDPSSLA 180
Query: 198 L-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT----VDVLQSKHSGGAEWLL 252
+E +LR GFGYR A+YI T VD H + L
Sbjct: 181 KPGTEQKLRELGFGYR-----------------ARYIYNTACKFVDDDGFPHIT-TKTLH 222
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++RK +EA + L L GVGPKVA CI L +LD+H +PVDTHV++IA R
Sbjct: 223 AMRKSPYEEAHEFLLLLDGVGPKVADCICLMALDKHEIVPVDTHVYQIAIR 273
>gi|296811256|ref|XP_002845966.1| 8-oxoguanine DNA glycosylase [Arthroderma otae CBS 113480]
gi|238843354|gb|EEQ33016.1| 8-oxoguanine DNA glycosylase [Arthroderma otae CBS 113480]
Length = 404
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 120/231 (51%), Gaps = 42/231 (18%)
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+P + ++ + + N+ +L +L+E ++ASD F + A AG R++RQD E L+ F+
Sbjct: 59 TPDDDDTEALIYHYFNLEYNLSDLYEQWAASDANFKKKAVQFAGIRIMRQDAWETLVSFI 118
Query: 156 CSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRS 213
CSSNNNIARI++MV+ L + G +G + ++++FP+ L+ + E LR GFGYR
Sbjct: 119 CSSNNNIARISQMVEKLCVNYGPFIGQLGDQKYYDFPAPSALTGIGVESHLRELGFGYR- 177
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR------------------ 255
AKYI T ++ + G WL SLR
Sbjct: 178 ----------------AKYIYQTAMIVVNDRDPG--WLNSLRNPEKPAFNEEPTAPGAVA 219
Query: 256 ---KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
K +EA + L L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 220 RVEKSGYREAHEQLLALQGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 270
>gi|198429493|ref|XP_002131434.1| PREDICTED: similar to LOC733253 protein [Ciona intestinalis]
Length = 350
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 111/200 (55%), Gaps = 21/200 (10%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L D+L + I L +L+ + D +F ++ G RVLRQ+P+E L FLCS NNNI RI
Sbjct: 112 LCDYLRLNIDLKKLYTEWKNRDEKFTHISSSTPGIRVLRQNPLETLFCFLCSVNNNITRI 171
Query: 166 TKMVDFLASL-GSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSV 223
M++ + L G+ L +G FH+FP+L L S ++ +LR GFGYR
Sbjct: 172 VSMIERMCKLYGNKLLEYKGTTFHDFPTLGMLASKDTDEDLRKLGFGYR----------- 220
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
A +I + SK GG WL SLRK ++A AL L GVG KVA C+ L
Sbjct: 221 ------APFIGKCAKEVMSK--GGEVWLESLRKQTYKKAFTALTDLTGVGMKVADCVCLM 272
Query: 284 SLDQHHAIPVDTHVWKIATR 303
++D+ A+P+DTHV ++A R
Sbjct: 273 AMDKLEAVPLDTHVRQVALR 292
>gi|425773484|gb|EKV11836.1| DNA N-glycosylase, putative [Penicillium digitatum Pd1]
gi|425775780|gb|EKV14032.1| DNA N-glycosylase, putative [Penicillium digitatum PHI26]
Length = 417
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 153/335 (45%), Gaps = 63/335 (18%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ-YTGPIGPHLISLKHLQNGD 88
++T S+W L L+ SEL + T GQ+FRW+ Q + + +L+SLK Q+ +
Sbjct: 1 MSTGAFSEWRKLPLSLSELCINTTLKCGQSFRWQNFSESQEWRCVLYGNLLSLK--QDSN 58
Query: 89 VCYHIHTSPSEPAAKSALLD----------FLNMGISLGELWEGFSASDCRFAELAKYLA 138
Y+ P+ + D + N+ +L EL+ +S+ D F + A
Sbjct: 59 FLYYRSVQPTPHTSTPTTSDDDHLVRIIKHYFNLTPNLTELYSLWSSQDPNFKKKAAQFT 118
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGN----VEGFEFHEFPSLE 194
G R+LRQD E L+ F+CSSNNNIARI++MV+ L H GN + +H+FP
Sbjct: 119 GIRILRQDAWEALVSFICSSNNNIARISQMVE---KLCIHYGNPVTTIGARAYHDFPPPG 175
Query: 195 RLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
L+ E LR GFGYR AKYI T ++ ++ G WL S
Sbjct: 176 ALTGNDVESNLRKLGFGYR-----------------AKYIHQTAVMVSNRDKG---WLES 215
Query: 254 LR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
L + +EA + L L GVGPKV+ C+ L L ++P
Sbjct: 216 LSNPECPAFGVVPKPSGEMKPEGREGYREAHEKLLELQGVGPKVSDCVCLMGLGWGESVP 275
Query: 293 VDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
VDTHVW+IA R Y + + L VA F
Sbjct: 276 VDTHVWQIAQRDYRFGKSSNKNLNKTTYDAVANHF 310
>gi|45190732|ref|NP_984986.1| AER127Cp [Ashbya gossypii ATCC 10895]
gi|44983711|gb|AAS52810.1| AER127Cp [Ashbya gossypii ATCC 10895]
Length = 391
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 93/283 (32%), Positives = 133/283 (46%), Gaps = 35/283 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQY-------TGPIGPHLISLKHLQNGDV 89
K+ L + E+ L GQ FRW +Y +G +I LK +
Sbjct: 2 KFNRLIFPKGEIVLDHVLQCGQAFRWIWNHEKRYYSASMLLNDKLGYRIIVLKQPDQCSI 61
Query: 90 CYHIHTSPSEP---AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
+ + + + AA+ L+ +L M ++L L + +D RF + K G R+LRQ+
Sbjct: 62 EFSVAGNKDDDCSGAARQWLMRYLRMEVNLEALLAEWQKADTRF--IGKTHRGVRILRQE 119
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L F+CSSNNNI RITKM L S GS LG ++G ++ FP+ ++L SE L
Sbjct: 120 PWETLCSFICSSNNNIGRITKMCHALCSQYGSFLGELDGTPYYSFPTSKQLMEGASEDAL 179
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK---HSGGAEWLLS-LRKLDLQ 260
R+ GFGYR AKYI + + S H E L S L + +
Sbjct: 180 RDLGFGYR-----------------AKYIMAAAEWMDSSKPAHMSDTEHLESWLDMISRE 222
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
E +PGVGPKVA C+ L + +PVD H+ +IA R
Sbjct: 223 EIRQRFMEVPGVGPKVADCVCLMGMQMDDHVPVDVHINRIAQR 265
>gi|157118037|ref|XP_001658977.1| 8-oxoguanine DNA glycosylase [Aedes aegypti]
gi|108875881|gb|EAT40106.1| AAEL008148-PA [Aedes aegypti]
Length = 370
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 147/344 (42%), Gaps = 71/344 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ--YTGPIGPHLISLKHLQN------- 86
S+W + L L T GQ+FRW K Q + G + LK +
Sbjct: 2 SQWMKIPCLHKHLQLKTTLTGGQSFRWNKHDSFQDEFIGVFANIVWILKQTETALLYKIV 61
Query: 87 GDVCY-------HI------HTSPSEPA--------------------------AKSALL 107
G++ Y HI SP E +S L
Sbjct: 62 GELPYPNPNNQDHISNAKKQRISPKEENLAQIRLKVPEPDRYASGGKLLYPQSYYESLLR 121
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + + L ++ ++ F A R L QDPVE L F+CS NNNI+RI+
Sbjct: 122 VYFRLDVDLERCYDQWNKCHVHFENSADQFYAVRQLDQDPVENLFSFICSQNNNISRISG 181
Query: 168 MVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+V+ + + G + G F+ FP +L S E LR FGYR
Sbjct: 182 LVEKICTHYGDKICEYNGTTFYNFPDAGKLASQQVEERLRQLSFGYR------------- 228
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI + + + +K G EW L++LD +EA L TL G+GPKVA CI L SL
Sbjct: 229 ----AKYIQKSAEEILAK--GDLEWFRKLQQLDYKEAHRELLTLTGIGPKVADCICLMSL 282
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAFCE 328
+ AIPVDTHV++IA Y LP LA + ++ K+ + + F E
Sbjct: 283 NHLQAIPVDTHVFQIARNY-LPHLAKCKTISGKMYGEIGDKFRE 325
>gi|374108209|gb|AEY97116.1| FAER127Cp [Ashbya gossypii FDAG1]
Length = 391
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 93/283 (32%), Positives = 133/283 (46%), Gaps = 35/283 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQY-------TGPIGPHLISLKHLQNGDV 89
K+ L + E+ L GQ FRW +Y +G +I LK +
Sbjct: 2 KFNRLIFPKGEIVLDHVLQCGQAFRWIWNHEKRYYSASMLLNDKLGYRIIVLKQPDQCSI 61
Query: 90 CYHIHTSPSEP---AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
+ + + + AA+ L+ +L M ++L L + +D RF + K G R+LRQ+
Sbjct: 62 EFSVAGNKDDDCSGAARQWLMRYLRMEVNLEALLAEWQKADTRF--IGKTHRGVRILRQE 119
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L F+CSSNNNI RITKM L S GS LG ++G ++ FP+ ++L SE L
Sbjct: 120 PWETLCSFICSSNNNIGRITKMCHALCSQYGSFLGELDGTPYYSFPTSKQLMEGASEDAL 179
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK---HSGGAEWLLS-LRKLDLQ 260
R+ GFGYR AKYI + + S H E L S L + +
Sbjct: 180 RDLGFGYR-----------------AKYIMAAAEWMDSSKPAHMSDTEHLESWLDMISRE 222
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
E +PGVGPKVA C+ L + +PVD H+ +IA R
Sbjct: 223 EIRQRFMEVPGVGPKVADCVCLMGMQMDDHVPVDVHINRIAQR 265
>gi|156337931|ref|XP_001619921.1| hypothetical protein NEMVEDRAFT_v1g149802 [Nematostella vectensis]
gi|156203995|gb|EDO27821.1| predicted protein [Nematostella vectensis]
Length = 231
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 123/238 (51%), Gaps = 23/238 (9%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W + S++ L + GQ+FRW +T ++ + + +LK Q+ + Y+ P
Sbjct: 6 WRRIPCKASQVRLDVVLSAGQSFRWHETSEREWRSVLRGKVWTLK--QDAEFLYYTVHDP 63
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
S + + I L +L++ +SA+D F+++AK G R LRQ+PVE L F+CS
Sbjct: 64 VPAVTSSIKPYYFQLNIDLQDLYKKWSAADINFSKVAKSFTGIRTLRQEPVENLFSFICS 123
Query: 158 SNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAP 215
SNNNI RIT M++ + S G + +V+G ++ FP++ L+ E +LR+ GFGYR
Sbjct: 124 SNNNIWRITSMIEKMCSKYGEKVASVDGIDYFSFPTVSSLADPKVEQDLRSMGFGYR--- 180
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
AK+I + ++ GG +WLLSLRK Q+A +C VG
Sbjct: 181 --------------AKFINKSAQLI--TEYGGQDWLLSLRKQSYQDAHSGVCCWGCVG 222
>gi|255931765|ref|XP_002557439.1| Pc12g05950 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211582058|emb|CAP80222.1| Pc12g05950 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 393
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 130/267 (48%), Gaps = 39/267 (14%)
Query: 77 HLISLKHLQNGDVCYHIHTSPSEPAAKSALLD----------FLNMGISLGELWEGFSAS 126
HL+SL+ Q+ D Y+ P + D + N+ +L EL+ +S+
Sbjct: 41 HLLSLR--QDSDFLYYRSVQPPSHTLTPTISDDDHLVRIIKHYFNLTPNLTELYSQWSSQ 98
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F + A G R+LRQD E L+ F+CSSNNNIARI++MV+ L G + +EG
Sbjct: 99 DPNFKKKAAQFTGIRILRQDAWEALVSFICSSNNNIARISQMVEKLCIHYGKPVATIEGR 158
Query: 186 EFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
+H+FP L+ E LR GFGYR AKYI T ++ ++
Sbjct: 159 AYHDFPPPGALTGNDVESNLRKLGFGYR-----------------AKYIYQTAVMVSNRD 201
Query: 245 SGGAEWLLSLR----KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
G WL SL + +EA + L L GVGPKV+ C++L L ++PVDTHVW+I
Sbjct: 202 EG---WLNSLSNPEWREGYREAHEKLLELQGVGPKVSDCVSLMGLGWGESVPVDTHVWQI 258
Query: 301 ATR-YLLPELAGVRLTPKLCSRVAEAF 326
A R Y + + L VA F
Sbjct: 259 AQRDYRFGKSSNKNLNKTTYDAVANHF 285
>gi|225677914|gb|EEH16198.1| N-glycosylase/DNA lyase [Paracoccidioides brasiliensis Pb03]
Length = 392
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 138/300 (46%), Gaps = 73/300 (24%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL----- 107
P FRW+K+G +++ + ++SL+ H++ P S+LL
Sbjct: 4 NIPLSGRFRWQKSGDNEWSCALYGRIVSLRQDST-----HLYYRSYFPPTASSLLTPPSS 58
Query: 108 ---------------------DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
++ N+ ++L L+E +S +D F + A G R+LRQ+
Sbjct: 59 VPPLTTEELPKKLDDTDALISNYFNLELNLKSLYEQWSTADENFKKKAPQFTGIRILRQN 118
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H+FP+ + L+ E L
Sbjct: 119 AWEALISFICSSNNNIARISQMVEKLCLNYGQFIGFIDKKPYHDFPAPQALAGRDVEARL 178
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR--------- 255
R GFGYR AKYI T ++ + G WL SLR
Sbjct: 179 RELGFGYR-----------------AKYIYQTALIVANHREEG--WLNSLRNPECPSFGA 219
Query: 256 ------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ ++A D L L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 220 KPLSAGEMMEGGREGYRKAHDCLLELQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 279
>gi|50303645|ref|XP_451764.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640896|emb|CAH02157.1| KLLA0B05159p [Kluyveromyces lactis]
Length = 397
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 138/270 (51%), Gaps = 31/270 (11%)
Query: 47 ELSLPLTFPTGQTFRW---KKTGPLQYTGPIGP--HLISLKHLQNGDVCYHIHTSPSEPA 101
EL L + GQ+FRW + TG T + L+ L+ L + + Y + +
Sbjct: 11 ELVLRNSLHCGQSFRWVIQEHTGFFYSTLKLDDSYDLVMLRQLDDNCIEYDCYKRANLEK 70
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+S L D+ + +SL L +S +D + + ++ G RVL Q+P E L F+CSSNNN
Sbjct: 71 LESHLKDYFELDVSLNNLMTQWSDADPKGFQNKEH-RGVRVLNQNPWETLCSFICSSNNN 129
Query: 162 IARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSL 219
I+RITKM LA+ G + +G + + FPS +++ SE +LR+ GFGYR
Sbjct: 130 ISRITKMCHSLATEFGDEIAEFDGAKQYSFPSSDQIVERASEEKLRDLGFGYR------- 182
Query: 220 LFSVRRSFKQAKYITGTVDV--LQSKHSGGAEWLLSLR---KLDLQEAIDALCTLPGVGP 274
AKYI GT + ++ +E++LS R K+ Q+ + L PGVGP
Sbjct: 183 ----------AKYIIGTAQLMSMEKGDMSDSEYMLSWRIKGKMGYQDVKEKLMAYPGVGP 232
Query: 275 KVAACIALF-SLDQHHAIPVDTHVWKIATR 303
KVA C+ L L +PVD H+ +IA+R
Sbjct: 233 KVADCVLLMGGLGFDEVVPVDVHIARIASR 262
>gi|299753214|ref|XP_001833132.2| N-glycosylase/DNA lyase [Coprinopsis cinerea okayama7#130]
gi|298410196|gb|EAU88821.2| N-glycosylase/DNA lyase [Coprinopsis cinerea okayama7#130]
Length = 409
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 151/317 (47%), Gaps = 45/317 (14%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWK-----KTGPLQYTGPIGPHLISLKH-----LQNGDV 89
PL++TQ LSL GQ+FRW ++ + P + + LK Q+
Sbjct: 10 PLSVTQ--LSLAAVLKCGQSFRWSIIPLPESTSDESGIPCHEYRLCLKDRVVCLRQSPST 67
Query: 90 CYHIHT--SPSEPAAKSA---------LLDFLNMGISLGELWEGFSASDCRFAELAKYLA 138
Y+ P P +K L D+ + + L L+ +++ D FA++ +
Sbjct: 68 LYYRSAFPDPQPPLSKRTIHEAETLAWLRDYFQLDVDLESLYAEWASRDKVFAKVKDRFS 127
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA--------SLGSHLGNVEGFEFHEF 190
G R+LRQDP E L+ F+CSSNNNI+RITKMV L SL E +H F
Sbjct: 128 GIRILRQDPWENLVSFICSSNNNISRITKMVHNLCQHYSPPLLSLPDPCNPSENLSYHPF 187
Query: 191 PSLERL--SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA 248
P L S VS LR+ GFGYR+ ++R+ K G+ +L H +
Sbjct: 188 PPPSALADSTVSAT-LRSLGFGYRAD-------YIQRTAKMLVDAHGS-SLLSDTHGEAS 238
Query: 249 E-WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY--L 305
E WL LR L+ ++A + L GVG KVA CI L SLD+ IPVDTHV +IA ++ +
Sbjct: 239 EVWLEGLRSLETEKAREELLKFVGVGRKVADCILLMSLDKKEVIPVDTHVHQIAVKHYGM 298
Query: 306 LPELAGVRLTPKLCSRV 322
+TPKL V
Sbjct: 299 KGNSGKTAMTPKLYDEV 315
>gi|302656963|ref|XP_003020216.1| hypothetical protein TRV_05721 [Trichophyton verrucosum HKI 0517]
gi|291184025|gb|EFE39598.1| hypothetical protein TRV_05721 [Trichophyton verrucosum HKI 0517]
Length = 398
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 120/233 (51%), Gaps = 42/233 (18%)
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
T+P + ++ + + N+ +L +L+E ++ SD F + A AG R++RQD E L+
Sbjct: 57 RTTPDDDDTEALIHHYFNLEYNLSDLYEQWATSDPNFKKKAVQFAGIRIMRQDAWETLVS 116
Query: 154 FLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGY 211
F+CSSNNNIARI++MV+ L + G +G + ++++FP L+ E LR GFGY
Sbjct: 117 FICSSNNNIARISQMVEKLCINYGPFIGQLGDQKYYDFPEPSALTGTGVESHLRELGFGY 176
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD------------- 258
R AKYI T +++ ++ G WL SLR +
Sbjct: 177 R-----------------AKYIYQTANIVANQREPG--WLNSLRNPEKPAFNEKPATPGS 217
Query: 259 --------LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+EA + L L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 218 GAKGDESGYREAHEQLLALQGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 270
>gi|323352965|gb|EGA85265.1| Ogg1p [Saccharomyces cerevisiae VL3]
Length = 304
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 143/285 (50%), Gaps = 37/285 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVILRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLD 258
R GFGYR AKYI T D ++ + +L S+ +
Sbjct: 182 RELGFGYR-----------------AKYIIETARKLVNDKAEANITSDTTYLQSICKDAQ 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 225 YEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>gi|302511199|ref|XP_003017551.1| hypothetical protein ARB_04433 [Arthroderma benhamiae CBS 112371]
gi|291181122|gb|EFE36906.1| hypothetical protein ARB_04433 [Arthroderma benhamiae CBS 112371]
Length = 398
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 120/231 (51%), Gaps = 42/231 (18%)
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+P + ++ + + N+ +L +L+E ++ASD F + A AG R++RQD E L+ F+
Sbjct: 59 TPDDDDTEALIHHYFNLEYNLSDLYEQWAASDPNFKKKAVQFAGIRIMRQDAWETLVSFI 118
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRS 213
CSSNNNIARI++MV+ L + G +G + ++++FP L+ E LR GFGYR
Sbjct: 119 CSSNNNIARISQMVEKLCINYGPFIGQLGDQKYYDFPEPSALTGTGVESHLRELGFGYR- 177
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD--------------- 258
AKYI T +++ ++ G WL SLR +
Sbjct: 178 ----------------AKYIYQTANIVANQREPG--WLNSLRNPEKPAFNEKPATPGSGA 219
Query: 259 ------LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+EA + L L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 220 KGDESGYREAHEQLLALQGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 270
>gi|312376358|gb|EFR23467.1| hypothetical protein AND_12830 [Anopheles darlingi]
Length = 383
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 140/332 (42%), Gaps = 75/332 (22%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKK---TGPLQYTGPIG------PHLIS 80
+TT SKW L S+L L T GQ+FRWK TG Q G +++
Sbjct: 1 MTTASGSKWRSLLCESSQLQLKATLLGGQSFRWKDRKSTGKDQSDDANGEFIGVFANIVW 60
Query: 81 LKHLQNGDVCYHIH----------TSPSEPAAKSA------------------------- 105
L ++ Y I +P PA A
Sbjct: 61 LLRQTERELQYRIVGEQPYPNASLNAPQTPADSKAPEGNNLAQVRMKLPEPKEFAKHSEL 120
Query: 106 ----------LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
L + + + L + + + FA A R L QDPVE L F+
Sbjct: 121 LYPAAHYEQLLRIYFRLDVDLEQHYRKWIECHDHFANSAHRFYAVRQLDQDPVENLFSFI 180
Query: 156 CSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRS 213
CS NN+I RI+ +V+ L + G + + EG + FPS+E L+ E LR GFGYR
Sbjct: 181 CSQNNHITRISDLVEKLCTNYGDKICDFEGTTYWNFPSVEALADGAVEGRLRELGFGYR- 239
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
AKYI G+ + S GG EW L +LD + A L LPG+G
Sbjct: 240 ----------------AKYIQGSAAKILS--LGGLEWFRRLTELDYKAARTELIGLPGIG 281
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
PKVA C+ L SL+ AIPVDTHV+++A YL
Sbjct: 282 PKVADCVCLMSLNHLQAIPVDTHVFQLAKHYL 313
>gi|310800350|gb|EFQ35243.1| 8-oxoguanine DNA-glycosylase [Glomerella graminicola M1.001]
Length = 403
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 152/294 (51%), Gaps = 27/294 (9%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + ++W L L+ +EL + T GQ+FRW+K ++ + +++LK Q+
Sbjct: 1 MSIARVTEWRKLPLSLAELCIDTTLRCGQSFRWRKIND-EWCCTLYGRIVALK--QDPTH 57
Query: 90 CYHIHTSPSEPA----------------AKSALLDFLNMGISLGELWEGFSASDCRFAEL 133
++ T P +P + L +L++ + L L+E +S +D F +
Sbjct: 58 LHYKVTWPVKPTYPLTPPVADGEVKDDDTEDLLRHYLSLKLDLKSLYEQWSEADPNFRKR 117
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPS 192
A G R+L QD E L+ F+CSSNNNI RI++MV L + G +G+V FH+FP+
Sbjct: 118 APEFGGVRMLSQDAWEALICFICSSNNNIPRISQMVHKLCTHYGPLIGHVGDEPFHDFPT 177
Query: 193 LERLSLVS-EVELRNAGFGYRSA--PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
E L+ S E LR GFGYR+ Q++ + R + +T + + GG+E
Sbjct: 178 PEALTGKSVEAHLRELGFGYRAKYIAQTASIIVKDRPKGWFESLTNPENPCYRRPQGGSE 237
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + +EA + L L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 238 ----LPQYTYKEAHEQLLQLAGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 287
>gi|392595759|gb|EIW85082.1| hypothetical protein CONPUDRAFT_48745 [Coniophora puteana
RWD-64-598 SS2]
Length = 372
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 147/343 (42%), Gaps = 67/343 (19%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWK---------KTGPLQYTGPIGPHLIS 80
+ ++ P+ + L L ELSL GQ+FRW ++ +Y + ++
Sbjct: 1 MASSVPAGFHALPLPIVELSLAAVLSCGQSFRWTSHPLPTTSYQSSTHEYRLCLSDRVVC 60
Query: 81 LKH----------LQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRF 130
L+ L +G V H E + D+ + I L L+ +S D F
Sbjct: 61 LRQSPEFLFYRTALPSGSVNLH----EKEAETLVWIRDYFQLDIKLETLYNHWSERDIIF 116
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS------------LGSH 178
+EL AG R+LRQDP E L+ F+CSSNNNI+RITKMV L S
Sbjct: 117 SELRSRFAGIRMLRQDPWENLVSFICSSNNNISRITKMVHSLCKKYSSPLLSLPSPDPSS 176
Query: 179 LGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
+H FP L+ + LR+ GFGYR A +I T
Sbjct: 177 ESASTADTYHPFPPPSALAAPQASATLRSMGFGYR-----------------ADFIQKTA 219
Query: 238 DVLQSKHSGGAE---------WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+L H E WLLSLR + EA L L GVG KVA C+ L SLD+
Sbjct: 220 KLLVDAHGIACENHDMEPSHRWLLSLRSISTAEARVELLKLMGVGRKVADCVLLMSLDKR 279
Query: 289 HAIPVDTHVWKIATRYLLPELAG-----VRLTPKLCSRVAEAF 326
+PVDTHV +IA ++ ++G LTPKL V F
Sbjct: 280 EVVPVDTHVHQIAIKHYGLRVSGKAGAKATLTPKLYEDVNTKF 322
>gi|134075844|emb|CAL00223.1| unnamed protein product [Aspergillus niger]
Length = 405
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 113/205 (55%), Gaps = 25/205 (12%)
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
S L + N+ +L L+ +SA D F + A G R+LRQD E L+ F+CSSNNNIA
Sbjct: 91 SLLTHYFNLSANLTSLYTEWSAQDPNFRKKAPQFTGIRILRQDAWEALISFICSSNNNIA 150
Query: 164 RITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLF 221
RI++MV+ L + G + ++ G +H FP+ + L+ E +LR GFGYR
Sbjct: 151 RISQMVEKLCLNYGPFIASINGRAYHGFPAPDALTAHDVEGKLRGLGFGYR--------- 201
Query: 222 SVRRSFKQAKYITGT-VDVLQSKHSGGAEWLLSLRKLD--LQEAIDALCTLPGVGPKVAA 278
AKYI T V V + + SG WL SL + +EA + L L GVGPKVA
Sbjct: 202 --------AKYIYQTAVMVAKERESG---WLDSLCNPEEGYREAHEKLLELQGVGPKVAD 250
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
C+ L L A+PVDTHVW+IA R
Sbjct: 251 CVCLMGLGWGEAVPVDTHVWQIAQR 275
>gi|238482355|ref|XP_002372416.1| DNA N-glycosylase, putative [Aspergillus flavus NRRL3357]
gi|83765204|dbj|BAE55347.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220700466|gb|EED56804.1| DNA N-glycosylase, putative [Aspergillus flavus NRRL3357]
gi|391864579|gb|EIT73874.1| 8-oxoguanine DNA glycosylase [Aspergillus oryzae 3.042]
Length = 434
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 158/353 (44%), Gaps = 76/353 (21%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W L ++ SEL + T GQ+FRW ++ + L+SLK D
Sbjct: 1 MTVGAFSEWRRLPVSLSELCINTTLRCGQSFRWHNVPDSDEWRCVLYGRLLSLKQ-DPTD 59
Query: 89 VCYHIH-----TSP---------------------SEPAAKSA-----LLDFLNMGISLG 117
+ Y + ++P +P + L + N+ +L
Sbjct: 60 LYYRTYLPAKLSNPIPLPTPPSSHPPSRADSNKPLDQPQSDKDDILPILTHYFNLDSNLT 119
Query: 118 ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLG 176
L+ +S++D F + A G R+LRQD E L+ F+CSSNNNIARI++MV+ L + G
Sbjct: 120 SLYSYWSSNDPNFKKKAPQFTGIRILRQDAWEALVSFICSSNNNIARISQMVEKLCTNYG 179
Query: 177 SHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ +V+G +H+FP E L+ E LR+ GFGYR AKYI
Sbjct: 180 PFIASVDGRAYHDFPPPEALTGDDVESRLRSLGFGYR-----------------AKYIYQ 222
Query: 236 TVDVLQSKHSGGAEWLLSLR---------------------KLDLQEAIDALCTLPGVGP 274
T ++ ++ G WL SLR + +EA + L L GVGP
Sbjct: 223 TAVIVSTQREKG--WLDSLRNPESPALGVEPAPGGEMRPEGREGYREAHEKLLELQGVGP 280
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
KV+ C+ L L ++PVDTHVW+IA R Y + A LT V F
Sbjct: 281 KVSDCVCLMGLGWGESVPVDTHVWQIAQRDYKFGKGAHKSLTKATYDAVGNHF 333
>gi|323303643|gb|EGA57431.1| Ogg1p [Saccharomyces cerevisiae FostersB]
gi|349580228|dbj|GAA25388.1| K7_Ogg1p [Saccharomyces cerevisiae Kyokai no. 7]
gi|392297523|gb|EIW08623.1| Ogg1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 376
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 143/285 (50%), Gaps = 37/285 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVILRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLD 258
R GFGYR AKYI T D ++ + +L S+ +
Sbjct: 182 RELGFGYR-----------------AKYIIETARKLVKDKAEANITSDTTYLQSICKDAQ 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 225 YEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>gi|6323580|ref|NP_013651.1| 8-oxoguanine glycosylase OGG1 [Saccharomyces cerevisiae S288c]
gi|1709456|sp|P53397.1|OGG1_YEAST RecName: Full=N-glycosylase/DNA lyase; Includes: RecName:
Full=8-oxoguanine DNA glycosylase; Includes: RecName:
Full=DNA-(apurinic or apyrimidinic site) lyase; Short=AP
lyase
gi|577136|emb|CAA86715.1| unknown [Saccharomyces cerevisiae]
gi|1373127|gb|AAC49312.1| 8-oxoguanine DNA glycosylase [Saccharomyces cerevisiae]
gi|1911630|gb|AAB50772.1| 8-oxoguanine DNA glycosylase, 8-oxo-dG:dC DNA glycosylase/lyase,
Ogg1=43 kDa base-excision DNA-repair protein
[Saccharomyces cerevisiae=yeast, YPH 252, Peptide, 376
aa]
gi|190408183|gb|EDV11448.1| 43 kDa 8-oxo-guanine DNA glycosylase [Saccharomyces cerevisiae
RM11-1a]
gi|256269753|gb|EEU05020.1| Ogg1p [Saccharomyces cerevisiae JAY291]
gi|259148516|emb|CAY81761.1| Ogg1p [Saccharomyces cerevisiae EC1118]
gi|285813942|tpg|DAA09837.1| TPA: 8-oxoguanine glycosylase OGG1 [Saccharomyces cerevisiae S288c]
gi|323332260|gb|EGA73670.1| Ogg1p [Saccharomyces cerevisiae AWRI796]
gi|323336180|gb|EGA77451.1| Ogg1p [Saccharomyces cerevisiae Vin13]
gi|323347075|gb|EGA81350.1| Ogg1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 376
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 143/285 (50%), Gaps = 37/285 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVILRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLD 258
R GFGYR AKYI T D ++ + +L S+ +
Sbjct: 182 RELGFGYR-----------------AKYIIETARKLVNDKAEANITSDTTYLQSICKDAQ 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 225 YEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>gi|354548496|emb|CCE45232.1| hypothetical protein CPAR2_702450 [Candida parapsilosis]
Length = 333
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 116/216 (53%), Gaps = 39/216 (18%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKY--------LAGARVLRQDPVECLLQFLCS 157
+ D+ + + L +L++ + + + AK +G R LRQDP ECL+ F+CS
Sbjct: 22 IQDYFALDVKLIDLYQQWKQNHKPYITTAKVKQQSPFDLFSGIRTLRQDPWECLISFICS 81
Query: 158 SNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSA 214
SNNN+ RI+KM D L G+++ + G F+ FPS ++LS + VE LR+ GFGYR
Sbjct: 82 SNNNVKRISKMCDNLCLHFGNYINDYGGISFYSFPSPQQLSTDASVETKLRDLGFGYR-- 139
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW-------LLSLRKLDLQEAIDALC 267
AKYI T K + A++ L +LR +EA + L
Sbjct: 140 ---------------AKYIYQTA----CKFTDNAKYPDITLTKLNNLRHASYEEAHEFLL 180
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L GVGPKVA CI L SLD+ + +P+DTHV++IA R
Sbjct: 181 QLTGVGPKVADCICLMSLDKSNIVPIDTHVYQIAVR 216
>gi|399217150|emb|CCF73837.1| unnamed protein product [Babesia microti strain RI]
Length = 297
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 137/285 (48%), Gaps = 42/285 (14%)
Query: 38 WTPLNLTQSELSLPLTFPTG-------QTFRWKKTGPLQYTGPIGPHLISLKH------- 83
W L + L L + PTG Q F WK+T + G I ++ K
Sbjct: 2 WKDLKVPPETLRLSMVLPTGTTAIKIGQLFTWKETKSGTWMGVIRSLIVECKETPTTTKY 61
Query: 84 -LQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEG--FSASDCRFAELAKYLAGA 140
+ N D ++ SE S + N+ + + E + D R +L + G
Sbjct: 62 RIINLD--NPLNNKLSEEFIHSLVTHLFNLQVDTMHVIERCVYPIHD-RLVKLHEKYKGL 118
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS 200
RVL+QDPVEC+ F+CS+NNNI RITKM+ SL G G +++FP L +L+ +
Sbjct: 119 RVLQQDPVECIFSFICSANNNIKRITKMIQ---SLKEEAGIPLGNGWYQFPELYKLANLK 175
Query: 201 EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQ 260
LR GFG+R++ R AK I GG EWL L+KL L+
Sbjct: 176 TETLRKLGFGFRAS----------RIPHAAKQILD---------RGGVEWLHQLKKLPLE 216
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+ + L L GVGPK+A CI L+SL + +P+D H+ +++++Y+
Sbjct: 217 DCRNELIQLSGVGPKIADCICLYSLGFINCVPLDVHILRVSSKYV 261
>gi|4587158|dbj|BAA76639.1| OGG1 protein type 1b [Homo sapiens]
Length = 190
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 92/161 (57%), Gaps = 20/161 (12%)
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EV 202
QDP+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E
Sbjct: 1 QDPIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEA 60
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEA 262
LR G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 61 HLRKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEA 102
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 103 HKALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 143
>gi|151946104|gb|EDN64335.1| 43 kDa 8-oxo-guanine DNA glycosylase [Saccharomyces cerevisiae
YJM789]
Length = 376
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/285 (31%), Positives = 143/285 (50%), Gaps = 37/285 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ ++ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVIMRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLD 258
R GFGYR AKYI T D ++ + +L S+ +
Sbjct: 182 RELGFGYR-----------------AKYIIETARKLVKDKAEANITSDTTYLQSICKDAQ 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 225 YEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>gi|389642207|ref|XP_003718736.1| N-glycosylase/DNA lyase [Magnaporthe oryzae 70-15]
gi|351641289|gb|EHA49152.1| N-glycosylase/DNA lyase [Magnaporthe oryzae 70-15]
Length = 440
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 141/323 (43%), Gaps = 82/323 (25%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+ P++L +EL + T GQ+FRW+K ++ + ++SL+ + H+H
Sbjct: 8 SRMLPVSL--AELCIDTTLRCGQSFRWRKVDD-EWHCSLHGRVVSLRQDET-----HLHY 59
Query: 96 SPSEPAAKSA-------------------------------------LLDFLNMGISLGE 118
+ PA + LL++ N+ SL
Sbjct: 60 RATWPAGRQVSIKSSVKVKREEEDESIKSEDAAAVKVDEEEDDTEALLLNYFNLHHSLSG 119
Query: 119 LWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL--G 176
++ +S D F A G R+L QD E L+ F+CSSNNNIARI++M L G
Sbjct: 120 MYTHWSDRDANFRSKAPKFTGVRILNQDAWETLVAFICSSNNNIARISQMCHKLCDHYGG 179
Query: 177 SHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ V G + +FPS E L+ E LR GFGYR A+YI
Sbjct: 180 EPIATVAGHVYRDFPSPEALAGDGVEAHLRELGFGYR-----------------ARYIAE 222
Query: 236 TVDVLQSKHSGGAEWLLSLR---------------KLDLQEAIDALCTLPGVGPKVAACI 280
T V+ + G WLLSLR + D + A AL L GVGPKV+ C+
Sbjct: 223 TARVVARQRPAG--WLLSLRNPASRSPTSTPPQQQQPDYRTAFAALQELSGVGPKVSDCV 280
Query: 281 ALFSLDQHHAIPVDTHVWKIATR 303
L L A+P+DTH+W IATR
Sbjct: 281 CLMGLGWGEAVPLDTHMWTIATR 303
>gi|398399438|ref|XP_003853104.1| hypothetical protein MYCGRDRAFT_71455 [Zymoseptoria tritici IPO323]
gi|339472986|gb|EGP88080.1| hypothetical protein MYCGRDRAFT_71455 [Zymoseptoria tritici IPO323]
Length = 417
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 107/338 (31%), Positives = 151/338 (44%), Gaps = 69/338 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH--- 92
++W L ++ SEL + T GQ+FRW+K ++ + ++SL Q+ Y+
Sbjct: 7 AEWQKLPMSLSELCINTTLRCGQSFRWRKNDLDVWSIALHNRILSLH--QDPQYLYYRSI 64
Query: 93 --------------------IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAE 132
+S S S + +LN+ +L L+ ++ASD FA
Sbjct: 65 PPTTTTLTTPPTPPSSKPASPPSSSSSDDTLSLIHHYLNLEPNLTTLYAQWAASDANFAR 124
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFP 191
A G R+LRQD E L+ F+CSSNNNIARI +MV L G LG ++ +H+FP
Sbjct: 125 KAPQFTGIRILRQDAWEALIGFICSSNNNIARIGQMVHKLCIRYGPLLGYLDEEPYHDFP 184
Query: 192 SLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW 250
+ L+ E ELR GFGYR AKYI T ++ K G W
Sbjct: 185 EPKDLAQDGVEAELRRLGFGYR-----------------AKYIYKTACMVSEKPDG---W 224
Query: 251 LLSLR---------------------KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
L +LR + + A L TL GVGPKVA C++L L
Sbjct: 225 LDTLRNPETPILGVEPSDAGKWAAEGREGYRTAHAELLTLQGVGPKVADCVSLMGLGWGE 284
Query: 290 AIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAEAF 326
A+PVDTHVW+IA + Y + LT + V F
Sbjct: 285 AVPVDTHVWQIAVKDYKFGKGKHASLTKQTYDAVGNKF 322
>gi|395333689|gb|EJF66066.1| DNA glycosylase [Dichomitus squalens LYAD-421 SS1]
Length = 452
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 136/311 (43%), Gaps = 58/311 (18%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWK-------KTGPLQYTGPIGPHLISLKHL--- 84
P+ + L L ++LSL GQ+FRW L P + + L+
Sbjct: 6 PAGFRALPLPLAQLSLAAVLKCGQSFRWSIFPLDVPSNSALGTNSPTHEYRLCLRDRVVC 65
Query: 85 --QNGDVCYHIHTSP---------SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAEL 133
Q+ D ++ P E + + D+ + + L +L+ +S +D F +
Sbjct: 66 LRQSPDTLFYRSVFPPLVSEDEETREAGTLAWIRDYFQLNVDLVDLYRQWSLADPIFKRI 125
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA--------SLGSHLG-NVEG 184
G R+LRQDP E L+ F+CSSNNNI+RITKMV L SL G N+
Sbjct: 126 QDRFEGIRMLRQDPFENLISFICSSNNNISRITKMVTSLCRTYSPALLSLPPPAGTNLPE 185
Query: 185 FEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK 243
+H FPS L+ +LR GFGYR A +I T +L
Sbjct: 186 EAYHPFPSPSALAAADVTAKLRALGFGYR-----------------ADFIQKTAAMLVET 228
Query: 244 H----------SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
H EWL++LR L EA L GVG KVA CI L SLD+ IPV
Sbjct: 229 HGLTRNPETSMEASEEWLMTLRHLGTPEARAELLNFMGVGRKVADCILLMSLDKREVIPV 288
Query: 294 DTHVWKIATRY 304
DTHV +IA ++
Sbjct: 289 DTHVQQIAIKH 299
>gi|403413783|emb|CCM00483.1| predicted protein [Fibroporia radiculosa]
Length = 409
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 114/250 (45%), Gaps = 49/250 (19%)
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ + + L EL++ + A D F +G R+LRQDP E L+ F+CSSNNNI RITK
Sbjct: 64 DYFQLKVDLSELYKQWGARDAVFHSRKDRFSGIRMLRQDPFENLISFICSSNNNIGRITK 123
Query: 168 MV------------------DFLASLGSHLGNVEGFEFHEFPSLERLSLVSEV-ELRNAG 208
M+ DF ++ G G E +H FP L+ LR G
Sbjct: 124 MIQALCRHYSTTLVTLPPPADFSSTSGQPSG--ESHAYHPFPPPSVLAAPGVAGTLRTLG 181
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG--------GAEWLLSLRKLDLQ 260
FGYR A +I T +L H G WLL+LR+ Q
Sbjct: 182 FGYR-----------------ADFIQKTAKMLVDTHGGTDRDALEPAERWLLTLREKSTQ 224
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL---LPELAGVRLTPK 317
EA + L GVG KVA CI L SLD++ IPVDTHV +IA ++ A +TPK
Sbjct: 225 EAREELLKFMGVGRKVADCILLMSLDKNEVIPVDTHVHQIAMKHYGVNGSSKAKTAMTPK 284
Query: 318 LCSRVAEAFC 327
L V+
Sbjct: 285 LYDEVSNKLA 294
>gi|294950227|ref|XP_002786524.1| 8-oxoguanine DNA glycosylase, putative [Perkinsus marinus ATCC
50983]
gi|239900816|gb|EER18320.1| 8-oxoguanine DNA glycosylase, putative [Perkinsus marinus ATCC
50983]
Length = 338
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 144/295 (48%), Gaps = 46/295 (15%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWK-----KTGPLQYTGPIGPHLISLKHLQN 86
++K S+W L + + +L+L LT +GQ F WK G + Y G IG + L+ N
Sbjct: 2 SSKFSEWISL-VAERQLNLRLTLFSGQVFVWKPMLDGGDGEV-YYGVIGSTAVRLR--AN 57
Query: 87 GDVCY-HIHTSPSEPAAKSA--LLDFLNMGISLGELWEGFSASDCRFAELA--KYLAGAR 141
D+ P+ K+A L F + + L+ + D F + K L G R
Sbjct: 58 SDLAMVEFSCCPAGHVQKAAAQLQQFFQLETDIDALYRDWETRDEIFRTVVRNKNLRGLR 117
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNV-----EGFEFHEFPSLER 195
V++Q+P ECL+ F+ S NNN+ RI+ +++ L G+HL V E E +FPSL++
Sbjct: 118 VIKQEPFECLVSFITSQNNNVKRISLLLNTLRQQYGTHLATVTASGDEVLELFQFPSLQQ 177
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L +E +LR GFGYR A+Y+ +D L + + G L+
Sbjct: 178 LDTATEEDLRKMGFGYR-----------------ARYLRALIDSLHDEGTLG-----KLQ 215
Query: 256 KLDL----QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
LD +E + L + GVG KVA C+ALFS+ IP D HV ++A R+ +
Sbjct: 216 ALDAFTREEECREFLTSFVGVGRKVADCVALFSMRGRQIIPCDVHVIRLAYRHYM 270
>gi|164427521|ref|XP_964045.2| hypothetical protein NCU03040 [Neurospora crassa OR74A]
gi|157071778|gb|EAA34809.2| hypothetical protein NCU03040 [Neurospora crassa OR74A]
Length = 402
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 113/236 (47%), Gaps = 57/236 (24%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L ++ + SL L+ +S SD FA A G R+L QD E L+ F+CSSNNNI+RI
Sbjct: 40 LHNYFALSHSLTTLYAQWSLSDANFARRAPAFTGIRILNQDAWETLISFICSSNNNISRI 99
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV L + G ++ VEG FH+FP E L+ E LR GFGYR
Sbjct: 100 SQMVLKLCTHYGPYVATVEGEAFHDFPGPEALAGEGVEAHLRELGFGYR----------- 148
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------------- 255
AKYI T + + G WLL LR
Sbjct: 149 ------AKYIAETAGCVAQVY--GERWLLRLRNPGVPALRAAEEDILVKKEEGAEEKESV 200
Query: 256 --KLDL------QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
K DL Q+A +AL TLPGVGPKV+ C+ L L ++P+DTHVW+IA R
Sbjct: 201 ESKADLDNPPTYQKAHEALLTLPGVGPKVSDCVCLMGLGWGESVPIDTHVWQIAQR 256
>gi|326475624|gb|EGD99633.1| N-glycosylase/DNA lyase [Trichophyton tonsurans CBS 112818]
Length = 360
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 114/218 (52%), Gaps = 42/218 (19%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ N+ +L +L+E ++ASD F + A AG R++RQD E L+ F+CSSNNNIARI++M
Sbjct: 34 YFNLEYNLSDLYEQWAASDPNFKKKAVQFAGIRIMRQDAWETLVSFICSSNNNIARISQM 93
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRS 226
V+ L + G +G + ++++FP L+ E LR GFGYR
Sbjct: 94 VEKLCINYGPFIGQLGDQKYYDFPEPSALTGNGVESHLRELGFGYR-------------- 139
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEAIDA 265
AKYI T +++ ++ G WL SLR K +EA +
Sbjct: 140 ---AKYIYQTANIVANQREPG--WLNSLRNPEKPAFNEKPATPGSGAKGDKSGYREAHEQ 194
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 195 LLALQGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 232
>gi|449547334|gb|EMD38302.1| hypothetical protein CERSUDRAFT_113469 [Ceriporiopsis subvermispora
B]
Length = 466
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 156/340 (45%), Gaps = 80/340 (23%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH------LQNGDVCYHIH 94
L L ++LSL GQ+FRW T PL Y+G PH S L++ VC
Sbjct: 23 LPLPVAQLSLAAVLQCGQSFRWD-TYPL-YSGQTSPHDTSTPTHEYRLCLRDRVVCLRQS 80
Query: 95 ------------TSPSEPAAKSA------LLDFLNMGISLGELWEGFSASDCRFAELAKY 136
TSP A + + D+ + I L +L++ + D F + +
Sbjct: 81 PDTLWYRSVFPVTSPINVAEEREVETLAWIKDYFQLHIDLDDLYDQWGKRDPVFQTVKQR 140
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHL----------GNVEG-- 184
AG R+LRQDP E L+ F+CSSNNNIARITKMV +L H ++G
Sbjct: 141 FAGIRMLRQDPFENLMSFICSSNNNIARITKMVK---ALCQHYSPPLVTIPPPAMIDGSC 197
Query: 185 -------FEFHEFPSLERLSL--VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
++ FPS L+ VS+ LR+ GFGYR A+++
Sbjct: 198 PTAVPLPCVYYPFPSPSSLASPEVSQ-HLRSLGFGYR-----------------AEFVQR 239
Query: 236 TVDVLQSKHSGGA----------EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
T +L HS A ++L+SLR+ EA + L GVG KVA C+ L SL
Sbjct: 240 TAQMLVEAHSHAAVSEDVQEPAEKFLISLRQRSTLEAREELLNFVGVGRKVADCVLLMSL 299
Query: 286 DQHHAIPVDTHVWKIATR-YLLPELAG-VRLTPKLCSRVA 323
D+ IPVDTHV +IA + Y L AG +TPKL ++
Sbjct: 300 DKREVIPVDTHVQQIAAKHYGLRAPAGKTNMTPKLYDEIS 339
>gi|392566933|gb|EIW60108.1| DNA glycosylase [Trametes versicolor FP-101664 SS1]
Length = 436
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 150/328 (45%), Gaps = 42/328 (12%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH-----L 84
+ T P+ + L L ++LSL GQ+FRW L P P L H L
Sbjct: 1 MNTVIPNGFRALPLPLAQLSLAAVLKCGQSFRWCIFS-LHELSPPSPELNIPTHEYRLCL 59
Query: 85 QNGDVCYHI------------HTSPSEPAAKSA-----LLDFLNMGISLGELWEGFSASD 127
++ VC T+ +P A+ A + D+ + + L EL+ +SA+D
Sbjct: 60 RDRVVCLRQSSTSLFYRSVFPQTTNGDPEAREAETLAWIRDYFQLDVDLKELYAQWSAAD 119
Query: 128 CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD--------FLASLGSHL 179
F L G R+LRQDP E L+ F+CSSNNNI+RITKMV L SL
Sbjct: 120 PVFHRLQDRFEGIRMLRQDPFENLMSFICSSNNNISRITKMVKGLCQHYSPALVSLPPPS 179
Query: 180 GNVEGFE-FHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
G+ E +H FP L+ +LR GFGYR+ ++R+ G
Sbjct: 180 GSTAAEEPYHPFPPPSALAAPEVSTQLRALGFGYRAD-------FIQRTAAMLVETHGVT 232
Query: 238 DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+++ EWL +LR L EA L GVG KVA CI L SLD+ IPVDTHV
Sbjct: 233 QSPKTRMEPSEEWLTTLRTLSTTEARAELLKFVGVGRKVADCILLMSLDKREVIPVDTHV 292
Query: 298 WKIATR-YLLPELA-GVRLTPKLCSRVA 323
+IA + Y L A + PKL V+
Sbjct: 293 HQIAVKNYGLKGAAKAASMNPKLYEEVS 320
>gi|350295507|gb|EGZ76484.1| DNA glycosylase [Neurospora tetrasperma FGSC 2509]
Length = 403
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 113/236 (47%), Gaps = 57/236 (24%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L ++ + SL L+ +S SD FA A G R+L QD E L+ F+CSSNNNI+RI
Sbjct: 41 LHNYFALSHSLTTLYAQWSLSDANFARRAPAFTGIRILNQDAWETLISFICSSNNNISRI 100
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV L + G ++ VEG FH+FP E L+ E LR GFGYR
Sbjct: 101 SQMVLKLCTHYGPYVATVEGEAFHDFPGPEVLAGEGVEAHLRELGFGYR----------- 149
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------------- 255
AKYI T + + G WLL LR
Sbjct: 150 ------AKYIAETAGCVAQVY--GERWLLRLRNPGVPALGAAEEDISVKKEEGAEEKESV 201
Query: 256 --KLDL------QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
K DL Q+A +AL TLPGVGPKV+ C+ L L ++P+DTHVW+IA R
Sbjct: 202 ESKADLDNPTTYQKAHEALLTLPGVGPKVSDCVCLMGLGWGESVPIDTHVWQIAQR 257
>gi|336465213|gb|EGO53453.1| hypothetical protein NEUTE1DRAFT_126754 [Neurospora tetrasperma
FGSC 2508]
Length = 398
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 113/236 (47%), Gaps = 57/236 (24%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L ++ + SL L+ +S SD FA A G R+L QD E L+ F+CSSNNNI+RI
Sbjct: 36 LHNYFVLSHSLTTLYTQWSLSDANFARRAPAFTGIRILNQDAWETLISFICSSNNNISRI 95
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV L + G ++ VEG FH+FP E L+ E LR GFGYR
Sbjct: 96 SQMVMKLCTHYGPYVATVEGEAFHDFPGPEALAGDGVEAHLRELGFGYR----------- 144
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------------- 255
AKYI T + + G WLL LR
Sbjct: 145 ------AKYIAETAGCVAQVY--GERWLLRLRNPGVPALGAAEEDISVKKEEGAEEKEPV 196
Query: 256 --KLDL------QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
K DL Q+A +AL TLPGVGPKV+ C+ L L ++P+DTHVW+IA R
Sbjct: 197 ESKADLDNPPTYQKAHEALLTLPGVGPKVSDCVCLMGLGWGESVPIDTHVWQIAQR 252
>gi|403368333|gb|EJY83996.1| N-glycosylase/DNA lyase [Oxytricha trifallax]
Length = 287
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 128/292 (43%), Gaps = 66/292 (22%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY------ 91
W L + E +L T GQ F WKK G + G + + +K + + Y
Sbjct: 6 WKQLVFHRQEFNLENTLVNGQCFNWKKLGEDHFEGVFSEYYVQVKRTHDDYIEYCTIPDH 65
Query: 92 HIHTSPSEPA-------------------------------AKSALLD----FLNMGISL 116
I TS +P + LD ++N I +
Sbjct: 66 KIQTSQIQPQDSEKKTKKINKIEEEKKVRQQPEEEEKTSHQTQDKFLDQFKAYINYDIKV 125
Query: 117 GELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASL 175
+L++ ++ D RF+++A+ + G R LRQDP EC + F+CS NNI RIT+M+D L
Sbjct: 126 LDLYDHWAKRDKRFSQIAEPIHGVRCLRQDPWECTVSFICSQCNNIKRITQMLDTLRQKY 185
Query: 176 GSHLGNVEGFE-----FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
G+ + ++ + ++FP++E++S VSE ELR+ FGYR A
Sbjct: 186 GTKICEIDQDDGSKKSIYKFPTIEQMSQVSEKELRDLKFGYR-----------------A 228
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
KY+ + K GG +WL SLR +E + L TL G+G K L
Sbjct: 229 KYLVANAKTMNEK--GGKQWLESLRGKSNEEVREQLITLNGIGNKTPMSFKL 278
>gi|169602911|ref|XP_001794877.1| hypothetical protein SNOG_04460 [Phaeosphaeria nodorum SN15]
gi|160706285|gb|EAT88220.2| hypothetical protein SNOG_04460 [Phaeosphaeria nodorum SN15]
Length = 321
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 111/218 (50%), Gaps = 42/218 (19%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ N+ +LG+L+E ++ASD F + A G R+LRQD E L+ F+CSSNNNI+RI+ M
Sbjct: 9 YFNLAPNLGQLYEQWAASDANFRKKAPKFTGVRILRQDAWEALIGFICSSNNNISRISGM 68
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRS 226
V L G + +++ +H+FP+ + LS E LR GFGYR
Sbjct: 69 VHNLCLHYGPLIAHIDDVPYHDFPTPDALSGPKVEAHLRELGFGYR-------------- 114
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRK-------------LDLQE--------AIDA 265
AKYI T ++ + G +WL L DL E A +
Sbjct: 115 ---AKYIAKTAQIVSQEK--GMKWLEDLSNPECPQFGMTEKPAGDLLEGGREGYRTAHEE 169
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L L GVGPKVA C+ LF L A+PVDTHVW+IA R
Sbjct: 170 LLALSGVGPKVADCVCLFGLGWSEAVPVDTHVWQIAQR 207
>gi|317139211|ref|XP_001817349.2| DNA N-glycosylase [Aspergillus oryzae RIB40]
Length = 428
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/245 (35%), Positives = 121/245 (49%), Gaps = 43/245 (17%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + N+ +L L+ +S++D F + A G R+LRQD E L+ F+CSSNNNIARI
Sbjct: 102 LTHYFNLDSNLTSLYSYWSSNDPNFKKKAPQFTGIRILRQDAWEALVSFICSSNNNIARI 161
Query: 166 TKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV+ L + G + +V+G +H+FP E L+ E LR+ GFGYR
Sbjct: 162 SQMVEKLCTNYGPFIASVDGRAYHDFPPPEALTGDDVESRLRSLGFGYR----------- 210
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEA 262
AKYI T ++ ++ G WL SLR + +EA
Sbjct: 211 ------AKYIYQTAVIVSTQREKG--WLDSLRNPESPALGVEPAPGGEMRPEGREGYREA 262
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSR 321
+ L L GVGPKV+ C+ L L ++PVDTHVW+IA R Y + A LT
Sbjct: 263 HEKLLELQGVGPKVSDCVCLMGLGWGESVPVDTHVWQIAQRDYKFGKGAHKSLTKATYDA 322
Query: 322 VAEAF 326
V F
Sbjct: 323 VGNHF 327
>gi|302691168|ref|XP_003035263.1| hypothetical protein SCHCODRAFT_66004 [Schizophyllum commune H4-8]
gi|300108959|gb|EFJ00361.1| hypothetical protein SCHCODRAFT_66004, partial [Schizophyllum
commune H4-8]
Length = 332
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 152/335 (45%), Gaps = 63/335 (18%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTG------------PLQYTGPIGPH 77
+ + P+ + L L +LSL GQ+FRW P +Y +
Sbjct: 1 MAASIPAGFQALPLPIIQLSLAAVLKCGQSFRWSILELPSDSAASACHLPHEYRLCLKDR 60
Query: 78 LISLKHLQNGDVCYHI----HTSPSEPAAKSA-------LLDFLNMGISLGELWEGFSAS 126
++ L+ Q+ D ++ +P++ A+K L D+ + + L +L+ ++
Sbjct: 61 VVCLR--QSTDHLFYCSVFPQATPTDTASKLKDAETLLWLRDYFQLDVDLVKLYREWAER 118
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA--------SLGSH 178
D F ++ AG R+LRQDP E L+ F+CSSNNNI+RITKMV L SL +
Sbjct: 119 DPVFLKIQDRFAGIRMLRQDPWENLVSFICSSNNNISRITKMVQNLCREYSPPLLSLPNA 178
Query: 179 LGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
LG +E +H FP L+ LR GFGYR A +I T
Sbjct: 179 LGELE--TYHPFPPPSALAEPEVATRLRALGFGYR-----------------ADFIHKTA 219
Query: 238 DVLQSKH----SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
+L H + WL LR EA + L GVG KVA C+ L SLD+ IPV
Sbjct: 220 KMLVDAHGEDPAAADRWLGGLRTRSTNEAREELLKFMGVGRKVADCVLLMSLDKREVIPV 279
Query: 294 DTHVWKIATRYLLPELAG------VRLTPKLCSRV 322
DTHV +IA ++ ++ G V +TPK+ V
Sbjct: 280 DTHVHQIAAKHYGLKVGGKSTNGKVPMTPKIYDEV 314
>gi|258576331|ref|XP_002542347.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902613|gb|EEP77014.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 366
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 121/242 (50%), Gaps = 43/242 (17%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ N+ +L +L+E ++ D F + A + G R+LRQD E L+ F+CSSNNNIARI++M
Sbjct: 46 YFNLEPNLSDLYEQWAKGDANFKKKALHFTGIRILRQDAWEALVSFICSSNNNIARISQM 105
Query: 169 VDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRS 226
V+ L + GS +G+++ +++FP L+ E LR GFGYR
Sbjct: 106 VEKLCVNYGSLIGHIDEQPYYDFPPPSALTGREVETRLRQLGFGYR-------------- 151
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEAIDA 265
AKYI T ++ ++ G WL SLR +L ++A
Sbjct: 152 ---AKYIYQTAKMVANEREQG--WLDSLRNPERPAYGGVPAHAGKMEGQGRLGYRDAHRN 206
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVAE 324
L L GVGPKVA C+ L L A+PVDTHVW+IA R Y + LT VA+
Sbjct: 207 LLELQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQRDYKFGKGKQKSLTKATYDGVAD 266
Query: 325 AF 326
F
Sbjct: 267 HF 268
>gi|401838182|gb|EJT41926.1| OGG1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 376
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 143/287 (49%), Gaps = 41/287 (14%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T +G ++ L+ QNGD
Sbjct: 4 KFGVLTVNKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKVGKRENYSVVILR--QNGDK 61
Query: 90 CY----HIHTSPSEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLR 144
I + A K+ L ++ + +SL L++ + +D FA+L+ G R+L
Sbjct: 62 NLIEFAAIGDYGGQDALKTHLTEYFRLDVSLKHLFDNVWIPNDKTFAKLSP--QGIRILA 119
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEV 202
Q+P E L+ F+CSSNNNI+RIT+M + L+S G+ + ++G +H FP+ E L S +E
Sbjct: 120 QEPWETLISFICSSNNNISRITRMCNSLSSNFGNLITTIDGVTYHSFPTSEELASRGTET 179
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT-----VDVLQSKHSGGAEWLLSL-RK 256
LR GFGYR AKYI T D S +G E+L + +
Sbjct: 180 SLRELGFGYR-----------------AKYIIETAKKLKKDKADSNIAGDTEYLQRICKD 222
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 223 AQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>gi|156083188|ref|XP_001609078.1| 8-oxoguanine DNA glycosylase [Babesia bovis T2Bo]
gi|154796328|gb|EDO05510.1| 8-oxoguanine DNA glycosylase, putative [Babesia bovis]
Length = 266
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 142/298 (47%), Gaps = 42/298 (14%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
+WT L TQ + L TGQ+F W G + G I +I L+ + +V Y
Sbjct: 4 EWTLL-CTQRCMDPSLVLTTGQSFGWSPVGDDTWVGVIDSFVIELRKAGD-NVEYRNIFG 61
Query: 97 PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
+ A D L+ ++ S + + G R+L+QDP+E L+ F+C
Sbjct: 62 DCDVKRLRAYFD-LDYSYTIDR-----SIAPDSVLRIVDRRVGVRILQQDPLETLVSFIC 115
Query: 157 SSNNNIARITKMV-DFLASLGSHLG----NVEGFEFHEFPSLERLSLVSEVELRNAGFGY 211
S+NNNI RIT+M + +S G+ LG N F+ FP+L +LS + + G G+
Sbjct: 116 SANNNIKRITRMCYEIRSSYGTLLGSKDYNGTTLRFYSFPTLSQLS---AADFTSLGLGF 172
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
R AKY+ ++ L+ G EWLL+LR + EA++ L LPG
Sbjct: 173 R-----------------AKYVARSISFLREN---GLEWLLNLRDIPYPEALEELVKLPG 212
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLC-SRVAEAFCE 328
VG KVA CI L+SL + +PVD HV +IA + G+R + R+ EAF E
Sbjct: 213 VGRKVADCILLYSLGKRECVPVDVHVNRIAKLHF-----GIRASGSGSYDRIHEAFLE 265
>gi|336274242|ref|XP_003351875.1| hypothetical protein SMAC_00422 [Sordaria macrospora k-hell]
Length = 424
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 113/245 (46%), Gaps = 64/245 (26%)
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
S L + + SL L+ +S SD FA A G R+L QD E L+ F+CSSNNNI+
Sbjct: 53 SLLHSYFALSSSLTTLYAQWSLSDANFARRAPAFTGIRILNQDAWETLISFICSSNNNIS 112
Query: 164 RITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLF 221
RI++MV L + G ++ VEG FH+FP E L+ E LR GFGYR
Sbjct: 113 RISQMVLKLCTHYGPYVATVEGEAFHDFPGPEALAGEGVEAHLRELGFGYR--------- 163
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR-------------------------- 255
AKYI T + + G +WLL LR
Sbjct: 164 --------AKYIAETAGCVAQVY--GEKWLLRLRNPGVPALGVAAGVEVKKEEEGAEDEK 213
Query: 256 ----------KLDLQE-------AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
K DL E A +AL TLPGVGPKV+ C+ L L ++P+DTHVW
Sbjct: 214 KEERVDDGDSKADLDEDPPTYKKAHEALLTLPGVGPKVSDCVCLMGLGWGESVPIDTHVW 273
Query: 299 KIATR 303
+IA R
Sbjct: 274 QIAQR 278
>gi|46125345|ref|XP_387226.1| hypothetical protein FG07050.1 [Gibberella zeae PH-1]
Length = 333
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 111/227 (48%), Gaps = 37/227 (16%)
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
T+P +S L + N+ L L++ +S D F + A G R+L QD E L+ F
Sbjct: 7 TTPEPKDTESLLRHYFNLDTGLESLYKQWSEVDSNFRKRAPQFQGVRILNQDAWETLIGF 66
Query: 155 LCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYR 212
+CSSNNNI+RI++MV L G + + FH+FPS + L+ S E LR GFGYR
Sbjct: 67 ICSSNNNISRISQMVHKLCRHYGPFIAYIGDEAFHDFPSPQALTGDSVEAHLRTLGFGYR 126
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD-------------- 258
AKYI T ++ ++ WL SLR +
Sbjct: 127 -----------------AKYIAETARIVANEKP--ETWLESLRNPEHPGFNTTPVPREQH 167
Query: 259 --LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+EA L TL GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 168 ATYKEAHQQLLTLKGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 214
>gi|167534411|ref|XP_001748881.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772561|gb|EDQ86211.1| predicted protein [Monosiga brevicollis MX1]
Length = 430
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 96/267 (35%), Positives = 121/267 (45%), Gaps = 61/267 (22%)
Query: 67 PLQYTGPIGPHLISLKHLQNGDV-CYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSA 125
PL+ T P LQN V + P E + AL D+ + +SL L+ ++A
Sbjct: 100 PLEGTAP---------ELQNEPVNAAQLSPIPEEASIYHALRDYFQLDVSLSRLYATWAA 150
Query: 126 SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGN---- 181
+D R +A +L G RVLRQ P ECL+ F+CSSNNNI RIT M+D L H G
Sbjct: 151 ADPRMKTIADHLPGLRVLRQPPFECLISFICSSNNNIGRITLMLD---RLKQHYGQPAGQ 207
Query: 182 -VEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVL 240
G + FP+L LS E LR G GYR AK+IT T L
Sbjct: 208 LATGQILYSFPTLTSLSQAGEAHLRELGLGYR-----------------AKFITETCQAL 250
Query: 241 QSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
Q GG +L LR T P + + AIPVDTHVW+I
Sbjct: 251 Q--RLGGEPYLERLR------------TQP-----------YAQVQEIGAIPVDTHVWQI 285
Query: 301 ATRYLLPELAGVR-LTPKLCSRVAEAF 326
A R + ELA R LTP + RV + F
Sbjct: 286 AVRDMDKELAHARSLTPTIYKRVGDLF 312
>gi|367020818|ref|XP_003659694.1| hypothetical protein MYCTH_2297044 [Myceliophthora thermophila ATCC
42464]
gi|347006961|gb|AEO54449.1| hypothetical protein MYCTH_2297044 [Myceliophthora thermophila ATCC
42464]
Length = 395
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 109/228 (47%), Gaps = 63/228 (27%)
Query: 120 WEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSH 178
++ ++ASD FA A G R+L QD E L+ F+CSSNNNI+RI++MV L G +
Sbjct: 31 YQQWAASDANFARRAPAFTGIRILNQDAWEALVAFICSSNNNISRISQMVQKLCIHYGPY 90
Query: 179 LGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
+G VEG FH+FP+ E LS E LR GFGYR AKYI T
Sbjct: 91 IGTVEGEPFHDFPAPEALSGSQVEAHLRQLGFGYR-----------------AKYIAETA 133
Query: 238 DVLQSKHSGGAEWLLSLRK----------------------LDL---------------- 259
++ S+ +WLL LR DL
Sbjct: 134 RIIASEKP--RDWLLQLRNPACPALGTAINTTTAAVGKTKDSDLGGSAPAPQGPSSSPSQ 191
Query: 260 ----QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ A +AL TLPGVGPKVA C+ L L A+P+DTHVW+IA R
Sbjct: 192 LPTYRAAHEALLTLPGVGPKVADCVCLMGLGWGEAVPIDTHVWQIAQR 239
>gi|393216974|gb|EJD02464.1| DNA glycosylase [Fomitiporia mediterranea MF3/22]
Length = 498
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/257 (35%), Positives = 124/257 (48%), Gaps = 53/257 (20%)
Query: 99 EPAAKSALL----DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
+P + S L D+ + + L +L++ +SASD F L +G R+LRQDP ECL+ F
Sbjct: 118 DPGSDSETLAFIQDYFQLDMDLLKLYDEWSASDPIFRNLKGRFSGIRMLRQDPWECLISF 177
Query: 155 LCSSNNNIARITKMVDFLAS-----------------------LGSHLGNVEGFEFHEFP 191
+CSSNNNI+RI+KMV L + + + NV +H FP
Sbjct: 178 ICSSNNNISRISKMVQNLCTHFSPPLVTLDDPTVPLNPTTSTPIPTSNANV---TYHPFP 234
Query: 192 SLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH---SGG 247
L+ +LR GFGYR A +I T +L H G
Sbjct: 235 PPSSLAHPTVSTKLRTLGFGYR-----------------ADFIQKTASMLLKTHGSDQGV 277
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLP 307
+L LRK+ EA + L L GVG KVA CI L SLD+ IPVDTHV +IA ++
Sbjct: 278 FNFLEGLRKVGTNEAREELMKLMGVGRKVADCILLMSLDKREVIPVDTHVHQIALKHYGM 337
Query: 308 ELAGVR--LTPKLCSRV 322
+GV+ +TPKL V
Sbjct: 338 RSSGVKTAMTPKLYDEV 354
>gi|401827330|ref|XP_003887757.1| 8-oxoguanine DNA glycosylase [Encephalitozoon hellem ATCC 50504]
gi|392998764|gb|AFM98776.1| 8-oxoguanine DNA glycosylase [Encephalitozoon hellem ATCC 50504]
Length = 303
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 134/273 (49%), Gaps = 50/273 (18%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYH 92
K W PL T+ + L + +GQ F +++T +YTG +G L+S LQ+GD V Y
Sbjct: 2 KDGAWVPL-YTKEVIDLEKSLFSGQVFSFRQTDKKEYTGVLGTCLVSF--LQDGDRVLYK 58
Query: 93 I-HTSPSEPAAKSALLDFLNMGISLGEL---WEGFSASDCRFAELAKYLAGARVLRQDPV 148
+ ++ ++ + +F + + L L W+ + + L G R LR V
Sbjct: 59 VLDGDKTQEEVEAEISNFFTLDVELCPLLLRWK---------LDPSNLLVGLRALRYSLV 109
Query: 149 ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAG 208
+ F+CSSNNNI+RIT+MV FL S G + +G +F+ FPS+E+L + E ELR+ G
Sbjct: 110 PTIFSFICSSNNNISRITRMVGFLYSKGEFITRYKGVDFYYFPSVEKLVDI-EDELRSNG 168
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQ---EAIDA 265
FGYRS+ + A+Y+ G +LD Q E +
Sbjct: 169 FGYRSSYICN----------AARYLLGN-------------------RLDYQRTSEIREM 199
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
+ ++ G+G K+A CI L +P+DTH++
Sbjct: 200 MVSIKGIGYKIADCILLIGAGDLSVVPIDTHIF 232
>gi|121711507|ref|XP_001273369.1| 8-oxoguanine DNA glycosylase [Aspergillus clavatus NRRL 1]
gi|119401520|gb|EAW11943.1| 8-oxoguanine DNA glycosylase [Aspergillus clavatus NRRL 1]
Length = 446
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 150/336 (44%), Gaps = 83/336 (24%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKT-GPLQYTGPIGPHLISLKHLQNGD 88
+T S+W +++ SEL + T GQ+FRW++ ++ + L+SL+ Q+ +
Sbjct: 1 MTVGVFSEWRKFSISLSELCINTTLRCGQSFRWQRVPDTDEWRCVLHGRLLSLR--QDPE 58
Query: 89 VCYHI-------------------------------HT-------SPSEPAAKSALLDFL 110
Y+ HT + + + L +L
Sbjct: 59 YLYYRTHRSPSLLSKSPSTSSPSHVDSSSIPGSTNQHTVQERSSGTDQDDESLKILKHYL 118
Query: 111 NMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD 170
N+ +L +L+ +S+ D F + A G R+LRQD E L+ F+CSSNNNI+RI++MV+
Sbjct: 119 NLSSNLTDLYTQWSSQDPNFKKKAPQFTGIRILRQDAWEALVSFICSSNNNISRISQMVE 178
Query: 171 FL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFK 228
L + G + V +H+FP+ E L+ E LR+ GFGYR
Sbjct: 179 KLCVNYGPLVATVGERAYHDFPTPEALTGDDVESRLRSLGFGYR---------------- 222
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEAIDALC 267
AKYI T ++ + G WL SLR + + A + L
Sbjct: 223 -AKYIYQTALIVAKERGQG--WLDSLRNPECPAFGIEPVPGGEMQPEGRQGYRYAHEQLL 279
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 280 ELQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 315
>gi|158301545|ref|XP_321213.4| AGAP001854-PA [Anopheles gambiae str. PEST]
gi|157012524|gb|EAA01084.4| AGAP001854-PA [Anopheles gambiae str. PEST]
Length = 380
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/223 (37%), Positives = 113/223 (50%), Gaps = 23/223 (10%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ + + L + ++ + FA A R L QDPVE L F+CS NNNIARI+ M
Sbjct: 133 YFRLDVDLEQQYQQWIRCHEHFANSATKFYAVRQLDQDPVENLFCFICSQNNNIARISDM 192
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRS 226
V+ L + G + EG ++ FPS+ L+ E +LR GFGYR
Sbjct: 193 VEKLCRNYGEKICEHEGTCYYNFPSVSSLAGPTVEGQLRELGFGYR-------------- 238
Query: 227 FKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI + + +LQ G EW L +LD + A L +LPG+GPKVA CI L SL
Sbjct: 239 ---AKYIQRSAEKILQL---GDLEWFRQLCQLDYKAAHKELVSLPGIGPKVADCICLMSL 292
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
AIPVDTHV+++A YL +T + VA+ F E
Sbjct: 293 GHLQAIPVDTHVFQLAKHYLPALNISKNVTDRQYVTVADKFRE 335
>gi|401624467|gb|EJS42524.1| ogg1p [Saccharomyces arboricola H-6]
Length = 376
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 142/280 (50%), Gaps = 27/280 (9%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ ++ ++
Sbjct: 4 KFGKLAVNKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGKQQNYSVVILRQNEDKEL 63
Query: 90 CYH--IHTSPSEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ A K+ L+ + + +SL L++ + +D +FA+L+ G R+L Q+
Sbjct: 64 LEFAAVGDYGGLDALKTHLMKYFRLDVSLKHLFDNVWIPNDKKFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + + G +H FP+ E L S +E L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLISTINGVTYHSFPTSEELASRATEARL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL-RKLDLQEAI 263
R GFGYR+ ++ + R+ K + D E+L +L + ++
Sbjct: 182 RELGFGYRA---KYIIETARKLAKDKAVLNIACD---------TEYLQNLCKDTQYEDVR 229
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ + + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 230 EHIMSYNGVGPKVADCVCLMGLQMDGIVPVDVHVSRIAKR 269
>gi|400602735|gb|EJP70337.1| N-glycosylase/DNA lyase [Beauveria bassiana ARSEF 2860]
Length = 408
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/220 (38%), Positives = 114/220 (51%), Gaps = 39/220 (17%)
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K L + ++ ++L +L+E +S +D FA+ A AG R+L QD E L+ F+CSSNNNI
Sbjct: 98 KDLLQHYFSLHLNLADLYEQWSKADPNFAKKAPEFAGVRILSQDAWEALVGFICSSNNNI 157
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSL 219
ARI++MV L G +G V H+FP+ E LS S+VE LR GFGYR
Sbjct: 158 ARISQMVHKLCIHYGPLIGYVGDEPIHDFPTPEALS-GSKVESHLRELGFGYR------- 209
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD----------------LQEAI 263
AKYI T V+ + G WL +LR + + A
Sbjct: 210 ----------AKYIAQTARVIAKEKPAG--WLETLRNPESAIKIFDKAAPGDIPTYRAAH 257
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ L L GVGPKVA C+ L L ++PVDTHVW+IA R
Sbjct: 258 EELLALTGVGPKVADCVCLMGLGWGESVPVDTHVWQIAQR 297
>gi|154280002|ref|XP_001540814.1| hypothetical protein HCAG_04654 [Ajellomyces capsulatus NAm1]
gi|150412757|gb|EDN08144.1| hypothetical protein HCAG_04654 [Ajellomyces capsulatus NAm1]
Length = 391
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/296 (31%), Positives = 145/296 (48%), Gaps = 62/296 (20%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH- 94
S+W L ++ +EL + T GQ+FRW+++G +++ + ++SL+ H+H
Sbjct: 7 SEWRKLPISLTELCINTTLRCGQSFRWQQSGDNEWSCALYGRIVSLRQDPT-----HLHY 61
Query: 95 -------------TSPSEPAAK-----------SALLD-FLNMGISLGELWEGFSASDCR 129
S PA++ AL++ + N+ ++L +L+E +S +D
Sbjct: 62 RSYFPPIPPALPTPPSSVPASRREESCEKIDDTEALINNYFNLDLNLTDLYEQWSTADKN 121
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFH 188
F + A AG R+LRQD E L+ F+CSSNNNIARI++MV+ L + G +G ++ +H
Sbjct: 122 FKKKAPKFAGIRILRQDSWEALISFICSSNNNIARISQMVEKLCLNYGPLIGYIDKKPYH 181
Query: 189 EFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+FP+ + L E LR GFGYR AKYI T ++ + G
Sbjct: 182 DFPTPQALVGRDVEARLRELGFGYR-----------------AKYIYQTALIVVNDREEG 224
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
WL SLR + C G P A +A + A+PVDTHVW+IA R
Sbjct: 225 --WLNSLRNPE--------CPSFGQSPAHAGEMAEGGWGE--AVPVDTHVWQIAQR 268
>gi|50293685|ref|XP_449254.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528567|emb|CAG62228.1| unnamed protein product [Candida glabrata]
Length = 366
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 140/275 (50%), Gaps = 25/275 (9%)
Query: 41 LNLTQSELSLPLTFPTGQTFRW---KKTG----PLQYTGPIGPHLISLKH-LQNGDVCYH 92
LN++ EL L TGQ+FRW + TG L+ + +G +I+L+ + G V Y
Sbjct: 7 LNVSAGELCLKNVLQTGQSFRWLLDEATGIYSTTLKISDTLGYSIITLRQDVDFGQVFYE 66
Query: 93 IHTSPSEPAA-KSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVEC 150
+ + + ++ L ++ + + L L + + A+D +F L G R+L Q+P E
Sbjct: 67 VLNAKVDTETIENHLKNYFRLDVDLQNLHKNHWLANDEKFKMLDH--KGIRILGQEPWET 124
Query: 151 LLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSL-VSEVELRNAG 208
L+ F+CS+NNNI RI+KM L+ + G ++ +G +++ FPS E ++ +E++LR G
Sbjct: 125 LVSFICSTNNNIGRISKMCHALSENFGEYIDEYKGTKYYTFPSSEDIATKATEIQLRGLG 184
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
FGYR + + + F + K G D Q + + K EA + L
Sbjct: 185 FGYR----AKYIIETAKKFVEDKKKYGLSDDAQFLNE-------VVAKKGYLEAREFLMG 233
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 234 YCGVGPKVADCVCLMGLHMDEVVPVDVHVGRIAKR 268
>gi|342873172|gb|EGU75392.1| hypothetical protein FOXB_14097 [Fusarium oxysporum Fo5176]
Length = 375
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 112/226 (49%), Gaps = 37/226 (16%)
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+P+ ++ L + N+ SL L++ +S +D F + A G R+L QD E L+ F+
Sbjct: 50 TPAPNNTEALLRHYFNLDTSLEPLYKQWSDADANFKKRAPQFKGVRILSQDAWETLICFI 109
Query: 156 CSSNNNIARITKMVDFLASLGSHLGNVEGFE-FHEFPSLERLSLVS-EVELRNAGFGYRS 213
CSSNNNI RI++MV L L G E FH+FPS + L+ E LR GFGYR
Sbjct: 110 CSSNNNITRISQMVYKLCQNYGPLIAYMGDEPFHDFPSPQDLTGDDVESHLRELGFGYR- 168
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD--------------- 258
AKYI T ++ ++ WL SLR D
Sbjct: 169 ----------------AKYIAETARMVANEKP--ETWLESLRNPDQPGFNTTPVPKEKHA 210
Query: 259 -LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+EA+ L TL GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 211 SYKEALAQLLTLKGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 256
>gi|358391564|gb|EHK40968.1| hypothetical protein TRIATDRAFT_207755, partial [Trichoderma
atroviride IMI 206040]
Length = 317
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 110/217 (50%), Gaps = 39/217 (17%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + ++ I L L++ +S D F + A G R+L QD E L+ F+CSSNNNI+RI
Sbjct: 6 LRHYFSLDIDLASLYQQWSKDDPNFRKKAPKFTGVRILNQDAWEALICFICSSNNNISRI 65
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV L G +G+V+G H+FP+ E L+ E LR GFGYR
Sbjct: 66 SQMVHKLCKHYGPFIGHVDGEAMHDFPTPESLNQKKVEAHLRELGFGYR----------- 114
Query: 224 RRSFKQAKYITGTVDVL-QSKHSGGAEWLLSLRKLDL----------------QEAIDAL 266
AKYI T ++ Q K S WL SLR D ++A AL
Sbjct: 115 ------AKYIAETARIIAQDKPSA---WLDSLRNPDFPAFNAVAVSDGPSSTYKDAQSAL 165
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+L GVGPKVA C+ L L ++PVDTHV +IA R
Sbjct: 166 LSLTGVGPKVADCVCLMGLGWGESVPVDTHVLQIAQR 202
>gi|392576331|gb|EIW69462.1| hypothetical protein TREMEDRAFT_68703 [Tremella mesenterica DSM
1558]
Length = 413
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 139/297 (46%), Gaps = 41/297 (13%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN-------G 87
PS W + T S LSL T P GQ+F W K L GP P + ++N
Sbjct: 11 PSGWGSIPTTLSNLSLSNTLPVGQSFLWHKRT-LPAAGPDEPLEEYSRAIRNPCRVIFLR 69
Query: 88 DVCYHIH--------TSPSEPAAKSALLDFLNMGI--SLGELWEGFSASDCRF---AELA 134
+ +H ++ + + L D+ + L L++ + D E
Sbjct: 70 QSIHRLHYVSLLPDGSTSKDEETRQWLEDYFQLVNYPDLSLLYKEWRDRDTELFGKIEFD 129
Query: 135 KYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFE------F 187
K G RVLRQDP ECL+ F+ S+NN+I RI+ ++ L+ + L +E FE +
Sbjct: 130 KKAIGVRVLRQDPWECLISFITSTNNHITRISSLMHKLSINYSKPLFTIEQFEGSSSITY 189
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
H FP E E LR GFGYR++ S + S+R+ F G+++
Sbjct: 190 HPFPIPEDFPDSFETNLRQLGFGYRASFIESSIKSLRQQFGSN---PGSIE--------- 237
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+ L+ LRK DL + L +L GVG KVA C+ L SLDQ IPVDTHV IA R+
Sbjct: 238 -KGLVDLRKKDLDVIGECLISLKGVGRKVADCVMLMSLDQPSLIPVDTHVASIAARH 293
>gi|269925931|ref|YP_003322554.1| 8-oxoguanine DNA glycosylase [Thermobaculum terrenum ATCC BAA-798]
gi|269789591|gb|ACZ41732.1| 8-oxoguanine DNA glycosylase domain protein [Thermobaculum terrenum
ATCC BAA-798]
Length = 306
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 135/283 (47%), Gaps = 30/283 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L+L F +GQ FRWK+ + G + ++ ++ + V + + P P +
Sbjct: 21 LNLRWCFLSGQAFRWKEDPNGWWIGIVRDSVLRIR-MDGSTVTWAAY--PKLPWS-DFWE 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+L + L L+ + D A G RVL QDP+E + FLC++ N+I RIT+
Sbjct: 77 SYLRLDFDLAALYRDYEGFDQFVAYSFSRWRGLRVLSQDPLETITCFLCTTANSIPRITR 136
Query: 168 MVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVE---LRNAGFGYRSAPQSSLLFSV 223
+ +++L G H+ ++G +++ P E +LV E+ + G G+R+
Sbjct: 137 AISCMSTLYGQHIATIDGIDYYTLPKPE--ALVEEIAPVLDKRCGLGFRA---------- 184
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
V +Q S +W SL + +A L T+ G+G KVA C+ALF
Sbjct: 185 ----------NNLVRAMQDLRSKPTDWPQSLINMPYPQARAELMTIRGIGRKVADCVALF 234
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
+L + A+PVDTHVW+IA L E+ LT K+ +AE F
Sbjct: 235 ALRKDEAVPVDTHVWQIALELYLREIKTKSLTTKVYETIAEHF 277
>gi|407472811|ref|YP_006787211.1| 8-oxoguanine DNA glycosylase [Clostridium acidurici 9a]
gi|407049319|gb|AFS77364.1| 8-oxoguanine DNA glycosylase [Clostridium acidurici 9a]
Length = 294
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 122/256 (47%), Gaps = 27/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW G +TG ++++K ++ + + + E D+L++G
Sbjct: 26 FECGQCFRWNDEGDGSFTGVAFGKVVNIKKEEDNIIISNTNIKEFE----DIWYDYLDLG 81
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ GE+ E S +D E K+ G R+LRQD E L+ F+ S+NN I I K +D L+
Sbjct: 82 RNYGEVKEKLS-NDPILKEAIKFGEGIRILRQDEWEILISFIISANNRIPMIKKAIDILS 140
Query: 174 S-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G +G G +++ FP+ E L S ++ N G+R AKY
Sbjct: 141 EKYGEFIGEFNGKKYYAFPTPENLKDASIEDIENCKTGFR-----------------AKY 183
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I+ ++ + K + L L +EA + L T GVGPKV+ CI LFS+ + A P
Sbjct: 184 ISAAINRVLDKEID----IYKLENLSTEEAREELMTFSGVGPKVSDCIMLFSMGKADAFP 239
Query: 293 VDTHVWKIATRYLLPE 308
+D V +I + L E
Sbjct: 240 IDVWVKRIMEHFYLEE 255
>gi|119482896|ref|XP_001261476.1| 8-oxoguanine DNA glycosylase [Neosartorya fischeri NRRL 181]
gi|119409631|gb|EAW19579.1| 8-oxoguanine DNA glycosylase [Neosartorya fischeri NRRL 181]
Length = 430
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 116/245 (47%), Gaps = 43/245 (17%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L +LN+ +L +L+ +S+ D F + A G R+LRQD E L+ F+CSSNNNIARI
Sbjct: 97 LKHYLNLSSNLTDLYTQWSSRDPNFEKKAPQFTGIRILRQDAWEALVSFICSSNNNIARI 156
Query: 166 TKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV+ L + G + V +H+FP E L+ E LR+ GFGYR
Sbjct: 157 SQMVEKLCVNYGPLVATVGDRAYHDFPPPEALTADDVEGRLRSLGFGYR----------- 205
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEA 262
AKYI T ++ + G WL SLR + + A
Sbjct: 206 ------AKYIHQTALIVAEEREQG--WLDSLRNPESPVLGVEPVPGDEMRPEGRQGYRHA 257
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSR 321
+ L L GVGPKVA C+ L L A+PVDTHVW+IA R Y A LT
Sbjct: 258 HEQLLGLQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQRDYKFGRGAHKSLTKATYDA 317
Query: 322 VAEAF 326
V F
Sbjct: 318 VGNHF 322
>gi|120537565|gb|AAI29497.1| Zgc:158858 [Danio rerio]
Length = 268
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 28/212 (13%)
Query: 69 QYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA--LLDFLNMGISLGELWEGFSAS 126
Q G + + +K G +C T SEP K L D+ + + LG+L++ +S
Sbjct: 41 QLLGKMSKKMTGIKQEDEG-LC----TVVSEPDKKEEELLRDYFQLDVKLGDLYKNWSTV 95
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGF 185
D F A G R+LRQDPVECL F+CSSNN I+RI MV+ L +LG+ L ++
Sbjct: 96 DPHFKHTANIFTGVRLLRQDPVECLFSFICSSNNYISRIQGMVERLCQTLGTLLCKLDDV 155
Query: 186 EFHEFPSLERLSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
+H+FPSL+ L+ E+ LR+ GFGYR A+++ + ++ + H
Sbjct: 156 AYHDFPSLQDLTDPCVEMRLRDLGFGYR-----------------ARFLQQSSQMIMNSH 198
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+WL SLR +A DAL TLPGVG KV
Sbjct: 199 H--PDWLQSLRSTPYLQARDALRTLPGVGLKV 228
>gi|366986937|ref|XP_003673235.1| hypothetical protein NCAS_0A02860 [Naumovozyma castellii CBS 4309]
gi|342299098|emb|CCC66844.1| hypothetical protein NCAS_0A02860 [Naumovozyma castellii CBS 4309]
Length = 367
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 136/278 (48%), Gaps = 36/278 (12%)
Query: 43 LTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGP----HLISLKHLQNGDVCYHI-H 94
+ +ELSL TGQ FRW + T T IG I+L+ + + +
Sbjct: 9 VANNELSLENVLQTGQAFRWVFDESTKRYSSTMRIGDACHYSFITLRQPNSNTIEFSTTD 68
Query: 95 TSPSEPAAKSALLDFLNMGISLGELW-EGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
T S + L+ + + +SL L+ + + +D F +L+ G R+L Q+P E L+
Sbjct: 69 TLQSVSEIRDHLIRYFRLDVSLNSLFNDKWIPNDKNFKKLSP--QGIRILAQEPWETLVS 126
Query: 154 FLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGY 211
F+CSSNNNI+RITKM L + G+ + +G +F+ FP+ + L S +E ELR GFGY
Sbjct: 127 FICSSNNNISRITKMCHSLCTNYGNEVCTYDGNKFYSFPTSDELASKATESELRELGFGY 186
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTV-DVLQSKHSG----GAEWLLSLR-KLDLQEAIDA 265
R AKYI T V++ + G + + LR K +E +
Sbjct: 187 R-----------------AKYIIETAKKVVKDRRDGDYKTDSAYFEHLRSKFTYEEMREH 229
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + GVGPKVA C+ L +PVD H+ +IATR
Sbjct: 230 LMSYVGVGPKVADCVCLMGFRMDEIVPVDVHIGRIATR 267
>gi|68067024|ref|XP_675483.1| N-glycosylase/DNA lyase [Plasmodium berghei strain ANKA]
gi|56494695|emb|CAH95996.1| N-glycosylase/DNA lyase, putative [Plasmodium berghei]
Length = 492
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 158/386 (40%), Gaps = 84/386 (21%)
Query: 1 MQSLKNSPSPLKR------PRLTPQPPPTPPNPQ-------TLTTNKPSKWTPLNLTQSE 47
M +LKN K P+L NP TL N + WT LN T ++
Sbjct: 18 MNNLKNKIEIKKEIQYDAPPKLINNYNCKNVNPNKISKSFTTLIKNYKNNWTILNATPND 77
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG-------DVCYHIHTSPS-- 98
L L GQ F +K+ Y G + + K + D + ++PS
Sbjct: 78 LQLKYCLLIGQEFHFKQVEKCSYIGMVNKKIYLFKETCDKILYQCLFDTNMELSSNPSCN 137
Query: 99 EPAAKSALLD---FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+ K+ +D F N+ L + + ++ D R E++ + G R+L+ D VE FL
Sbjct: 138 DSEKKNDDIDIYEFFNLSFPLEDQIKEWTKKDKRIEEISNKIKGLRILKNDSVESFFSFL 197
Query: 156 CSSNNNIARITKMVDFL-ASLGSHLGNV-----EGFEFHEFPSLERLSLVSEV------- 202
CS+NNNI RIT M+D L G ++ V + F+E ++ + VS++
Sbjct: 198 CSTNNNIPRITLMIDCLRRRYGEYIATVIFTNDDIIIFNEKNGEDKKNDVSKIKTDIKNE 257
Query: 203 ------------------------------------ELRNAGFGYRSAPQSSLLFSVRRS 226
E +N F + P + +++ S
Sbjct: 258 DVLDNCLGSLNIQKISKERSSENKMFYENIKTEVKEEGKNKIFHFYKFPSIETISNLKES 317
Query: 227 F-------KQAKYITGTVDVLQSKHSGGAEWLLSLRK-LDLQEAIDALCTLPGVGPKVAA 278
++ Y+ + +L K GG EW+ L+K +E ID L PG+G KVA
Sbjct: 318 DLRNLGFGYRSGYVIESAKML--KKLGGEEWIEDLKKEKKTKECIDKLIKFPGIGLKVAN 375
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRY 304
CI LF L+++ IP+DTH++ I +Y
Sbjct: 376 CICLFGLNRYDCIPIDTHIYDIIYKY 401
>gi|300708052|ref|XP_002996214.1| hypothetical protein NCER_100720 [Nosema ceranae BRL01]
gi|239605495|gb|EEQ82543.1| hypothetical protein NCER_100720 [Nosema ceranae BRL01]
Length = 311
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 146/317 (46%), Gaps = 44/317 (13%)
Query: 23 TPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLK 82
N Q +T N KW L+ T+ + L T +GQ F + KT ++TG I +IS K
Sbjct: 4 NKENMQNITLNDNEKWNFLDTTEI-IDLKETLFSGQIFNFHKTDLEEFTGTIYFFVISFK 62
Query: 83 HLQNGDVCYHI-HTS-PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGA 140
+ N V Y I HT SE + F N+ +S ++ + + G
Sbjct: 63 QV-NQKVLYKILHTKINSEELILFFIKKFFNLNLSYKDILKDIDVT------------GL 109
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS 200
R++ + + F+CS+NNN+ RITKMV+++ S G + + +F FP + RL L
Sbjct: 110 RLITNSLIPTIFSFICSANNNVKRITKMVNYMYSKGEFACSYKKIDFFYFPDISRL-LDC 168
Query: 201 EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH-----SGGAEWLLSLR 255
E + + GFGYR A+YI T D+L++ + +L +
Sbjct: 169 EHDFKENGFGYR-----------------ARYICKTADLLKNDKLFAEINKYDVDILQFK 211
Query: 256 KL----DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
K+ ++ + L LPG+G KV CI L SL+ +P+DTH+ K A R++
Sbjct: 212 KVKNNLNINLIKNLLIKLPGIGFKVRDCICLMSLECFEIVPIDTHILKYA-RHIFNLENN 270
Query: 312 VRLTPKLCSRVAEAFCE 328
V LT +++ F E
Sbjct: 271 VLLTKISYNKIQSKFIE 287
>gi|396081879|gb|AFN83493.1| 8-oxoguanine DNA glycosylase [Encephalitozoon romaleae SJ-2008]
Length = 296
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/271 (31%), Positives = 132/271 (48%), Gaps = 44/271 (16%)
Query: 34 KPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYH 92
K W PL T+ + L T +GQ F +++T +YTG +G L+S LQ+G+ V Y
Sbjct: 2 KDEGWEPL-CTEETVDLEKTLHSGQVFSFRQTDEKEYTGVLGTCLVSF--LQDGNRVLYK 58
Query: 93 I-HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
+ + + + + F + + L L + + L G R LR + + +
Sbjct: 59 VLNGDKTSKEIELEITSFFTLDVKLCPLLRRWRL------DPNNLLVGLRALRYNLIPTI 112
Query: 152 LQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGY 211
F+CSSNNNIARIT+MV FL S G + + +F+ FP LE+L + E EL++ GFGY
Sbjct: 113 FSFICSSNNNIARITRMVGFLYSKGEFIMKYKNADFYHFPDLEKLVDIEE-ELKSNGFGY 171
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI---DALCT 268
RS+ YI A++LL R LD Q+A + + +
Sbjct: 172 RSS-----------------YI-----------CNAAKYLLKNR-LDHQQASRIREMMVS 202
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
+ G+G K+A CI L +PVDTH++K
Sbjct: 203 IKGIGYKIADCILLIGTGNLDVVPVDTHIFK 233
>gi|340520777|gb|EGR51013.1| predicted protein [Trichoderma reesei QM6a]
Length = 259
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 109/217 (50%), Gaps = 39/217 (17%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + N+ I L L+ +S D F + A G R+L QD E L+ F+CSSNNNI+RI
Sbjct: 6 LRHYFNLNIDLASLYRQWSKDDPNFRKKAPQFTGVRILNQDAWEALICFICSSNNNISRI 65
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++M L G +G+V+G H+FP+ E L+ + E LR GFGYR
Sbjct: 66 SQMAHKLCKYYGPLIGHVDGEAMHDFPTPESLTDKTVEARLRELGFGYR----------- 114
Query: 224 RRSFKQAKYITGTVDVL-QSKHSGGAEWLLSLRKLDL----------------QEAIDAL 266
AKYI T ++ Q K S WL SLR D ++A AL
Sbjct: 115 ------AKYIAETARIIAQEKPSA---WLDSLRNPDFPAFNAVAVSDGPQSTYKDAQTAL 165
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+L GVGPKVA C+ L L ++PVDTHV +IA R
Sbjct: 166 LSLTGVGPKVADCVCLMGLGWGESVPVDTHVLQIAQR 202
>gi|242219429|ref|XP_002475494.1| predicted protein [Postia placenta Mad-698-R]
gi|220725309|gb|EED79302.1| predicted protein [Postia placenta Mad-698-R]
Length = 356
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 156/341 (45%), Gaps = 73/341 (21%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPL----------------QYTGPIGPHLISLKH 83
PL++TQ LSL GQ+FRW PL +Y + ++ L+
Sbjct: 5 PLSVTQ--LSLSAVLQCGQSFRWS-IFPLVIPSISPGTTLHSPTHEYRLCLRDRIVCLR- 60
Query: 84 LQNGDVCYHIHTSPS-------------EPAAKSALLDFLNMGISLGELWEGFSASDCRF 130
Q D+ ++ P+ E + + D+ + + L L+ + D F
Sbjct: 61 -QCPDILFYRSVFPNTSGELDLEVLAQREKETLAWIHDYFQLNVDLVSLYNQWGKRDPVF 119
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL------------------ 172
+ + +G R+LRQDP E ++ F+CSSNNNI RITKMV L
Sbjct: 120 QKCKERFSGIRMLRQDPFENIISFICSSNNNIVRITKMVKALCQRYSPALATIPPPDMAQ 179
Query: 173 -ASLGSHLGNVEGFE-FHEFP---SLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
S + L N+ E +H FP +L R + + LR+ GFGYR+ ++++
Sbjct: 180 NTSDEATLRNLPLTETYHPFPPPSALARPEVTAT--LRSLGFGYRAD-------YIQKTC 230
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K G+ ++ ++ WL +LR L EA D L GVG KVA C+ L SLD+
Sbjct: 231 KMLVDARGSGNIQGNQVEPAESWLQTLRNLSTLEARDELLKFVGVGRKVADCVLLMSLDK 290
Query: 288 HHAIPVDTHVWKIATRYLLPELAG-----VRLTPKLCSRVA 323
H IPVDTHV +IA ++ ++G + +TPKL V+
Sbjct: 291 HEVIPVDTHVHQIAVKHY--GMSGSSKSKMAMTPKLYDEVS 329
>gi|70986983|ref|XP_748977.1| DNA N-glycosylase [Aspergillus fumigatus Af293]
gi|66846607|gb|EAL86939.1| DNA N-glycosylase, putative [Aspergillus fumigatus Af293]
gi|159123254|gb|EDP48374.1| DNA N-glycosylase, putative [Aspergillus fumigatus A1163]
Length = 349
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/245 (35%), Positives = 115/245 (46%), Gaps = 43/245 (17%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L +LN+ +L +L+ +S+ D F + A G R+LRQD E L+ F+CSSNNNI RI
Sbjct: 16 LKHYLNLSSNLTDLYTQWSSQDPNFKKKAPQFTGIRILRQDAWEALVSFICSSNNNITRI 75
Query: 166 TKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
++MV+ L + G + V +H+FP E L+ E LR+ GFGYR
Sbjct: 76 SQMVEKLCVNYGPLVATVGDRAYHDFPPPEALTADDVEGRLRSLGFGYR----------- 124
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR---------------------KLDLQEA 262
AKYI T ++ + G WL SLR + + A
Sbjct: 125 ------AKYIHQTALIVAKEREQG--WLDSLRNPESPVLGVQPVPGDEMRPEGRQGYRHA 176
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSR 321
+ L L GVGPKVA C+ L L A+PVDTHVW+IA R Y A LT
Sbjct: 177 HEQLLGLQGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQRDYKFGRGAHKSLTKATYDA 236
Query: 322 VAEAF 326
V F
Sbjct: 237 VGNHF 241
>gi|159484390|ref|XP_001700241.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272557|gb|EDO98356.1| predicted protein [Chlamydomonas reinhardtii]
Length = 155
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 87/147 (59%), Gaps = 8/147 (5%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W L SEL L PTGQ+FRW+ T P +YTG IGP L++L+ L DV + +
Sbjct: 4 PAGWRSLGTPPSELQLDFCLPTGQSFRWRATAPSEYTGVIGPRLVALRQLP-ADVVFRVL 62
Query: 95 TSPSEPAAKSALLDFLNM-------GISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S P A + + G+SL +L +SA+ RFA ++ Y GAR+LRQDP
Sbjct: 63 ARGSGPEAAEEGDEAADQDQEQQQQGVSLAQLTRDWSAACGRFAAVSPYFPGARMLRQDP 122
Query: 148 VECLLQFLCSSNNNIARITKMVDFLAS 174
ECL QF+CSSNN+I+RI MV+ L S
Sbjct: 123 HECLFQFICSSNNHISRIHGMVERLCS 149
>gi|440493808|gb|ELQ76234.1| 8-oxoguanine DNA glycosylase [Trachipleistophora hominis]
Length = 308
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 132/289 (45%), Gaps = 55/289 (19%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHT 95
+W L T ++L T +GQ F ++KTG + TG + L + K Q+GD + Y I
Sbjct: 2 EWQRLE-TDQYVNLDKTLYSGQIFSFQKTGENEDTGMVEGFLTTFK--QDGDHIYYKIFN 58
Query: 96 SPSEPAAKSALLDFLNMGISLGEL---W-------EGFSASDCR----------FAELAK 135
++ F + ++ ++ W E + R F E
Sbjct: 59 YKESVDYQTIFSRFFTLDLNYKKITKEWNDKLLKCENEVYDENRLNNGLCTLRPFKE--- 115
Query: 136 YLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLER 195
G R+LR D E + F+CS+NNNI RITKMV L SLG ++ + F+EFP ++
Sbjct: 116 --TGLRLLRCDLKETIFSFICSANNNIKRITKMVLVLFSLGKYITTINDKNFYEFPDPDQ 173
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS-KHSGGAEWLLSL 254
L E+ LR +GFGYR A YI T ++ K+S L
Sbjct: 174 LC-DKELFLRESGFGYR-----------------ASYIVKTAQQMKMLKYS-------QL 208
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LD Q A + L G+ KVA C+ L L H +P+DTH++K+A+R
Sbjct: 209 YNLDYQHAFEMLNEYNGISYKVADCVCLLGLHFHSVVPIDTHIFKVASR 257
>gi|8670538|ref|NP_058436.1| N-glycosylase/DNA lyase isoform 2c [Homo sapiens]
gi|119584386|gb|EAW63982.1| 8-oxoguanine DNA glycosylase, isoform CRA_e [Homo sapiens]
gi|311976732|gb|ADQ20120.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976740|gb|ADQ20127.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976748|gb|ADQ20134.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976756|gb|ADQ20141.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976764|gb|ADQ20148.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976772|gb|ADQ20155.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976780|gb|ADQ20162.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976788|gb|ADQ20169.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976796|gb|ADQ20176.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976804|gb|ADQ20183.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976812|gb|ADQ20190.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976820|gb|ADQ20197.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976828|gb|ADQ20204.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976836|gb|ADQ20211.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976844|gb|ADQ20218.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976852|gb|ADQ20225.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976860|gb|ADQ20232.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976868|gb|ADQ20239.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976876|gb|ADQ20246.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976884|gb|ADQ20253.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976892|gb|ADQ20260.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976900|gb|ADQ20267.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976908|gb|ADQ20274.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976916|gb|ADQ20281.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976924|gb|ADQ20288.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976932|gb|ADQ20295.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976940|gb|ADQ20302.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976948|gb|ADQ20309.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976956|gb|ADQ20316.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976964|gb|ADQ20323.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976972|gb|ADQ20330.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976980|gb|ADQ20337.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976988|gb|ADQ20344.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311976996|gb|ADQ20351.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977004|gb|ADQ20358.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977012|gb|ADQ20365.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977020|gb|ADQ20372.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977028|gb|ADQ20379.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977036|gb|ADQ20386.1| N-glycosylase/DNA lyase [Homo sapiens]
gi|311977044|gb|ADQ20393.1| N-glycosylase/DNA lyase [Homo sapiens]
Length = 195
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 6/171 (3%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALA 188
>gi|332816041|ref|XP_003309657.1| PREDICTED: N-glycosylase/DNA lyase isoform 7 [Pan troglodytes]
Length = 195
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 6/171 (3%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 STPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALA 188
>gi|367012784|ref|XP_003680892.1| hypothetical protein TDEL_0D00970 [Torulaspora delbrueckii]
gi|359748552|emb|CCE91681.1| hypothetical protein TDEL_0D00970 [Torulaspora delbrueckii]
Length = 387
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 135/285 (47%), Gaps = 36/285 (12%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRW------KKTGPLQYTGPIGPHLISLKHLQNGDV 89
SK+ +N+ EL L GQ FRW G G + + + + +V
Sbjct: 2 SKFAQINIANGELYLENVLQAGQAFRWVLNEVKNHYSSTMKIGSKGRYSVVILRQPSPEV 61
Query: 90 CYH--IHTSPSEPAAKSALLDFLNMGISLGELW-EGFSASDCRFAELAKYLAGARVLRQD 146
+ + + K L+ + + +SL +L + + +D RF + G R+L Q+
Sbjct: 62 LEYASLDNTCDLKVLKEHLVKYFRLEVSLHDLHSKQWLPNDSRFEDFKP--KGVRMLGQE 119
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLS-LVSEVEL 204
P E L+ F+CSSNNNI+RITKM L++ G+ LG + ++ FP+ + ++ SE +L
Sbjct: 120 PWETLVSFICSSNNNISRITKMCHGLSTNYGNKLGTFDSLDYFSFPTSDEIANKASEEQL 179
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-DVLQSKHSGG----AEWLLSLRK-LD 258
R GFGYR AKYI T ++Q K G ++L LR +
Sbjct: 180 RELGFGYR-----------------AKYIMETAKQMVQDKKEKGFTDDTQFLEYLRSHMT 222
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L T GVGPKVA CI L L +PVD HV +IA R
Sbjct: 223 YEQMREHLMTYTGVGPKVADCICLMGLRMDEVVPVDVHVGRIAKR 267
>gi|156841170|ref|XP_001643960.1| hypothetical protein Kpol_1001p14 [Vanderwaltozyma polyspora DSM
70294]
gi|156114591|gb|EDO16102.1| hypothetical protein Kpol_1001p14 [Vanderwaltozyma polyspora DSM
70294]
Length = 373
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 142/288 (49%), Gaps = 39/288 (13%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPL-QYTG--PIGPH----LISLK-HLQNG 87
SK+ +++ EL L GQ+FRW L QY+ +G L+ L+ + NG
Sbjct: 4 SKFGRFLVSKEELCLTNVLQVGQSFRWIFDEKLNQYSSSLKVGNDEKYALVVLRQNSSNG 63
Query: 88 DVCYHIHTSPSE---PAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVL 143
+ + + S L ++ + +S+ +L+ + D RF + +G R+L
Sbjct: 64 EDFVEFGSVGGDCELGTLDSHLRNYFRLDVSVNDLYSTQWLPRDDRFKSHSP--SGNRIL 121
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSE 201
Q+P E L+ F+CSSNNNI+RITKM L S G+ +G + +++ FPS + + SE
Sbjct: 122 AQEPWETLVSFICSSNNNISRITKMCHELCSKFGNKIGTLGDVDYYSFPSSDDIVERSSE 181
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVL-QSKHSGG----AEWLLSL-R 255
ELR GFGYR AKYI T ++ + K + G ++L+ L
Sbjct: 182 EELRKLGFGYR-----------------AKYIIDTAKMMVEDKTANGYTSDTQYLMELGS 224
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
KL ++ + L GVGPKVA C+ L H +P+DTHV +IA R
Sbjct: 225 KLTYEQLREHLMRYSGVGPKVADCVCLMGFKMDHVVPIDTHVSRIAKR 272
>gi|255714254|ref|XP_002553409.1| KLTH0D16126p [Lachancea thermotolerans]
gi|238934789|emb|CAR22971.1| KLTH0D16126p [Lachancea thermotolerans CBS 6340]
Length = 386
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 134/278 (48%), Gaps = 30/278 (10%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWK-KTGPLQYTGPIGP----HLISLKHLQNGDVCY 91
K+ L + EL L GQ FRW G QY+ + ++ L+ L++ + Y
Sbjct: 3 KFQKLLFKRGELYLDKVLQCGQAFRWIFHEGLGQYSTTMRIDDRFRIVVLRQLEDNYIEY 62
Query: 92 HIHTSPSEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVEC 150
+ + S L + + + L EL+E + D RF + K G R+L QDP E
Sbjct: 63 ASLGAEECSSLGSFLKRYFRLEVPLSELYENQWLPRDSRFEK--KRPHGIRILSQDPWET 120
Query: 151 LLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFP-SLERLSLVSEVELRNAG 208
LL ++CSSNNNI+RITKM L G+ +G + +++ FP S E + SE +LR G
Sbjct: 121 LLSYICSSNNNISRITKMCHALCIEFGNPVGQYDKVDYYSFPTSKELVERASEEKLRALG 180
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH--SGGAEWLLSLRK-LDLQEAIDA 265
FGYR AK++ T D + + ++L S + L+ ++ +
Sbjct: 181 FGYR-----------------AKFLMKTADKMLKERLDMSDTQYLESWKDHLEYEQVRER 223
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ GVGPKVA C+ L L+ +PVD H+ +IA R
Sbjct: 224 VMGFDGVGPKVADCVCLSGLEMDEVVPVDVHIARIAQR 261
>gi|360042712|emb|CCD78122.1| putative 8-oxoguanine DNA glycosylase [Schistosoma mansoni]
Length = 329
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 149/309 (48%), Gaps = 38/309 (12%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPL-QYTGPIGPHLISLKHLQNGD-VCYHI 93
+ W + + + +L T +GQ FRW L ++ G I HL L+ +++ + V Y++
Sbjct: 5 NNWHSIPVDPQDFNLFSTLNSGQAFRWVFNSTLNEWHGVINGHLWRLRQMEDSNPVEYYL 64
Query: 94 HTSPSEPAAKS--ALLDFLNMGISLGELWEGFSASD----CRFAELAKY--LAGARVLRQ 145
+ L D+ + ++L +L + +S D RF++ + G R+LRQ
Sbjct: 65 EKNSKIKLNDVPFDLRDYFRLDMNLSDLVKEWSIKDKWFENRFSKNCQMDTARGLRLLRQ 124
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHE-----FPSLERLSLVS 200
DP E L F+ S+NNN+ RITK+ L L S GN E + FPSLE L+
Sbjct: 125 DPEETLFSFITSANNNLTRITKL---LCKLCSEYGNPLYLENGDLGHWTFPSLEILAQPG 181
Query: 201 -EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
+ L+ GFGYRS K+IT + L +GG LL LR +
Sbjct: 182 MQDNLKKIGFGYRS-----------------KFITLAANWLLK--NGGRSRLLELRSVSH 222
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLC 319
EA L +PG+G KVA CI L SLD+ +P+D H+ ++A +PE + +T K
Sbjct: 223 VEAQAFLLQIPGIGKKVADCICLMSLDKLDVVPIDVHMLRVAREKGIPEASCKTMTTKSY 282
Query: 320 SRVAEAFCE 328
++++ +
Sbjct: 283 DIISKSLSD 291
>gi|441665416|ref|XP_004091811.1| PREDICTED: N-glycosylase/DNA lyase isoform 2 [Nomascus leucogenys]
Length = 193
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 94/171 (54%), Gaps = 6/171 (3%)
Query: 33 NKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH 92
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 18 SAPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCTV 77
Query: 93 IHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
S P A+ + + ++L +L+ + + D F E+A+ G R+LRQDP
Sbjct: 78 YRGDKSRASRPTPDELDAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDP 137
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+
Sbjct: 138 IECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALA 188
>gi|291523766|emb|CBK89353.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Eubacterium rectale DSM 17629]
gi|291528769|emb|CBK94355.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Eubacterium rectale M104/1]
Length = 267
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 123/266 (46%), Gaps = 38/266 (14%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
L L +GQ FRWK TG +YT + +K Q+G+ E +
Sbjct: 7 DNLDLKQIADSGQCFRWKNTGENEYTVVAFDRALRIK--QDGN---EFELDCDEADWNNI 61
Query: 106 LLDFLNMGISLGELWEGFS-ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L+M + + + D E Y +G R+LRQD E ++ F+ S NNNI R
Sbjct: 62 WKSYLDMDTDYAGIAKLIADGDDAHLKEAYAYGSGVRILRQDLWEMIVTFMISQNNNIKR 121
Query: 165 ITKMVDFLASLGSHL--GNVEGFEFHEFPSLERLSLVSEV-ELRNAGFGYRSAPQSSLLF 221
IT VD L H G+ EG E + FP + L + EV + R+ GFGYR AP
Sbjct: 122 ITNSVDLLCRRCGHKIDGSAEGEELYTFP--KPLEVPDEVFDDRSMGFGYR-AP------ 172
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGA---EWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L+ + GA +WL +LRK+ EA+++L T G+G KVA
Sbjct: 173 -----------------YLKEIYEYGANNPDWLDNLRKMSYDEAMESLLTRKGIGKKVAN 215
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRY 304
CI LF L A P+DTHV ++ +Y
Sbjct: 216 CICLFGLHHVDAFPIDTHVKQLLDKY 241
>gi|4587150|dbj|BAA76635.1| OGG1 protein type 2b [Homo sapiens]
Length = 255
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 93/169 (55%), Gaps = 20/169 (11%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ + ++L +L+ + + D F E+A+ G R+LRQDP+ECL F+CSSNNNIARIT M
Sbjct: 4 YFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSNNNIARITGM 63
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRS 226
V+ L + G L ++ +H FPSL+ L+ E LR G GYR
Sbjct: 64 VERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLRKLGLGYR-------------- 109
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
A+Y++ + + + GG WL LR+ +EA ALC LPGVG K
Sbjct: 110 ---ARYVSASARAILEE-QGGLAWLQQLRESSYEEAHKALCILPGVGTK 154
>gi|254579138|ref|XP_002495555.1| ZYRO0B14146p [Zygosaccharomyces rouxii]
gi|238938445|emb|CAR26622.1| ZYRO0B14146p [Zygosaccharomyces rouxii]
Length = 373
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 134/278 (48%), Gaps = 24/278 (8%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW-------KKTGPLQYTGPIGPHLISLKHLQNGDV 89
K+ + L + EL + GQ FRW + ++ ++ L+ ++ +
Sbjct: 4 KYGQIVLPKGELCISNVLQAGQAFRWVLNESEERYATSMKVGNDNSYSVVVLRQPEDHIL 63
Query: 90 CY-HIHTSPSEPAAKSALLDFLNMGISLGEL-WEGFSASDCRFAELAKYLAGARVLRQDP 147
+ ++ S K L+ + + I L EL + + D RF E+ + G R+L Q+P
Sbjct: 64 EFASLNESCELKTLKDHLIRYFRLDIPLQELHYSDWQKRDARFEEITPH--GMRMLGQEP 121
Query: 148 VECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFP-SLERLSLVSEVELR 205
E L+ F+CSSNNNI+RITKM L G+ +G ++ +F+ FP S E + SE +LR
Sbjct: 122 WETLVSFICSSNNNISRITKMCSNLCIHYGNKIGTMDSLDFYSFPTSDELVEKASETQLR 181
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
GFGYR+ F + + K + D +G E L S KL ++ +
Sbjct: 182 ELGFGYRAK------FIIETA---KKMVQDKTDAGFKSDTGFLEDLNS--KLTYEQMREH 230
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + G+GPKVA C+ L L +PVD HV +IA R
Sbjct: 231 LMSYQGIGPKVADCVCLMGLRMDQVVPVDVHVGRIAKR 268
>gi|238922868|ref|YP_002936381.1| 8-oxoguanine DNA glycosylase domain protein [Eubacterium rectale
ATCC 33656]
gi|238874540|gb|ACR74247.1| 8-oxoguanine DNA glycosylase domain protein [Eubacterium rectale
ATCC 33656]
Length = 267
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 123/266 (46%), Gaps = 38/266 (14%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
L L +GQ FRWK TG +YT + +K Q+G+ E +
Sbjct: 7 DNLDLKQIADSGQCFRWKNTGENEYTVVAFDRALRIK--QDGN---EFELDCDEADWNNI 61
Query: 106 LLDFLNMGISLGELWEGFS-ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L+M + + + D E Y +G R+LRQD E ++ F+ S NNNI R
Sbjct: 62 WKSYLDMDTDYAGIAKLIADGDDAHLKEAYAYGSGVRILRQDLWEMIVTFMISQNNNIKR 121
Query: 165 ITKMVDFLASLGSHL--GNVEGFEFHEFPSLERLSLVSEV-ELRNAGFGYRSAPQSSLLF 221
IT VD L H G+ EG E + FP + L + EV + R+ GFGYR AP
Sbjct: 122 ITNSVDLLCRRCGHKIDGSAEGGELYTFP--KPLEVPDEVFDDRSMGFGYR-AP------ 172
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGA---EWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L+ + GA +WL +LRK+ EA+++L T G+G KVA
Sbjct: 173 -----------------YLKEIYGYGANNPDWLDNLRKMSYDEAMESLLTRKGIGKKVAN 215
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRY 304
CI LF L A P+DTHV ++ +Y
Sbjct: 216 CICLFGLHHVDAFPIDTHVKQLLDKY 241
>gi|452993771|emb|CCQ94581.1| 8-oxoguanine DNA glycosylase [Clostridium ultunense Esp]
Length = 295
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 129/275 (46%), Gaps = 31/275 (11%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW YT +I++K D+ I ++ + ++ D+ ++G
Sbjct: 26 FECGQAFRWHVEEDDSYTAIHKSKVINVKR-DGKDI---IFSNTNMEDFENIWYDYFDLG 81
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
GE+ + S D E K+ G R+L Q+P E + F+ S+NN I RI + ++ ++
Sbjct: 82 RDYGEIKKELS-KDPILNEAIKFGEGIRILNQEPYETTISFIISANNQIPRIKRSIELIS 140
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELR-NAGFGYRSAPQSSLLFSVRRSFKQAK 231
G +G+ G E+ FPS E L+ + E EL N GYR AK
Sbjct: 141 QKYGESIGSYNGKEYFSFPSSEILASIDEKELEDNCKVGYR-----------------AK 183
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
YI T ++++K L L +L + A + L LPGVGPKV+ CI LFSL+++ A
Sbjct: 184 YIVNTSKMIKNKEID----LDKLFQLPTETAKEILMRLPGVGPKVSNCILLFSLNKNEAF 239
Query: 292 PVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
PVD V +I E T K+ + E F
Sbjct: 240 PVDVWVKRIMESLYFKENTS---TKKISAYAEENF 271
>gi|164687038|ref|ZP_02211066.1| hypothetical protein CLOBAR_00664 [Clostridium bartlettii DSM
16795]
gi|164603923|gb|EDQ97388.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium bartlettii DSM
16795]
Length = 291
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 123/256 (48%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG +I++K Q+G+ Y +T+ + + D+ ++G
Sbjct: 24 FECGQCFRWIKQEDDSYTGVAMGKVINVK--QDGNKIYLDNTTKED--FDNIWYDYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+L+QD E L+ F+ S+NN I I + ++ L+
Sbjct: 80 RDYEEMVNTLKVMDEYLEKATEFGEGIRILKQDGWEMLISFIISANNRIPMIQRAINNLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ G+++G +G E++ FP+ E+LS S E+R G+R KY
Sbjct: 140 KNYGTYIGEYKGQEYYAFPTSEQLSKASVEEIRACSTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + ++ +L R+L + + L GVGPKVA CIALF + ++ P
Sbjct: 183 IKSTSQTVNDENID----VLEYRELSTDQCLKELLKFNGVGPKVADCIALFGMQKYDTFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYVDE 254
>gi|403216150|emb|CCK70648.1| hypothetical protein KNAG_0E03950 [Kazachstania naganishii CBS
8797]
Length = 374
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 130/276 (47%), Gaps = 38/276 (13%)
Query: 47 ELSLPLTFPTGQTFRW---KKTGPLQYTGPI-----GPHLISLKHLQNGDVCY-HIHTSP 97
EL L GQ FRW ++TG T I ++ L+ + V Y + H
Sbjct: 18 ELLLKNVLQVGQAFRWILEEETGHYYTTMMIPTISVSSGIVILRQTDDSVVEYAYSHEKW 77
Query: 98 SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
+ + L + + I L E+ E + +D FA ++ G R+L Q+P E L+ F+C
Sbjct: 78 TSEQVERHLFSYFRLDIRLLEVHEKEWKLTDPNFAHVST--QGVRILAQEPWETLVSFIC 135
Query: 157 SSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER-LSLVSEVELRNAGFGYRSA 214
SSNNNI+RITKM LA + G+ +G G + FP+ E+ L +E +LR GFGYR
Sbjct: 136 SSNNNISRITKMCHGLAENFGTQVGKYCGTVLYSFPTSEQILENGTEDQLRKLGFGYR-- 193
Query: 215 PQSSLLFSVRRSFKQAKYITGTVD-VLQSKHSGGAEW------LLSLRKLDLQEAIDALC 267
AKYI T V+ K G ++ ++ R + + + L
Sbjct: 194 ---------------AKYIMETAKAVVAEKKKHGLDYDYQLLEQMTTRGQSYEMSREYLM 238
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPKVA C+ L L H +PVD H+ +IA R
Sbjct: 239 QFTGVGPKVADCVCLMGLRMDHVVPVDVHIGRIAKR 274
>gi|355708625|gb|AES03328.1| 8-oxoguanine DNA glycosylase [Mustela putorius furo]
Length = 190
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 97/181 (53%), Gaps = 7/181 (3%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
P+ W + +SEL L L +GQ+FRW++ P +TG + + +L + C
Sbjct: 10 PALWASIPCPRSELRLDLVLASGQSFRWREQNPAHWTGVLANQVWTLTQTEEQLYCTVYR 69
Query: 95 TSPS-----EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
P + + + +SL +L+ +S+ D F E+A+ G R+L+QDP+E
Sbjct: 70 GDKGWVGRPTPEELKTVYQYFQLDVSLAQLYHHWSSVDPHFQEVAQKFQGVRLLQQDPIE 129
Query: 150 CLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNA 207
CL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ EV+LR
Sbjct: 130 CLFSFICSSNNNIARITGMVERLCEAFGPRLIQLDDVTYHGFPSLQALAGPEVEVQLRTL 189
Query: 208 G 208
G
Sbjct: 190 G 190
>gi|146296528|ref|YP_001180299.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor saccharolyticus DSM 8903]
gi|145410104|gb|ABP67108.1| 8-oxoguanine DNA glycosylase domain protein [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 286
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 123/270 (45%), Gaps = 27/270 (10%)
Query: 37 KWTP--LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH 94
KW + ++ + TF +GQ FRWKK Y G + +I +K Q+ D +
Sbjct: 5 KWYTDFIRISGVHIDFDATFFSGQCFRWKKMDDF-YIGVVDEKIIFIKP-QDEDTFDIYN 62
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
SP E K + + ++ E+ E S D E + G R+L Q+P EC++ F
Sbjct: 63 CSPDE--FKKSFYWYFDLDKDYDEILEKLSGHDSILKEAVEKYRGMRLLNQEPFECMISF 120
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
+ S NNNI RI +++ L +GF FP+LE L S +L+ G GYR
Sbjct: 121 IISQNNNIKRIQLLIERLCQAFGKKITYKGFVSWSFPTLESLWSSSIDDLKLLGLGYR-- 178
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
A+YI V+ K G L L++Q+A L T+ G+G
Sbjct: 179 ---------------AEYIKDAVE----KVKNGLINFDELTDLEVQKAKQVLKTIKGIGD 219
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRY 304
KVA CI L+SL +++ P+D V ++ Y
Sbjct: 220 KVADCILLYSLQKYNVFPIDVWVKRVLKEY 249
>gi|256084717|ref|XP_002578573.1| 8-oxoguanine DNA glycosylase [Schistosoma mansoni]
Length = 329
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 148/309 (47%), Gaps = 38/309 (12%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPL-QYTGPIGPHLISLKHLQNGD-VCYHI 93
+ W + + + +L T +GQ FRW L ++ G I HL L+ +++ + V Y++
Sbjct: 5 NNWHSIPVDPQDFNLFSTLNSGQAFRWVFNSTLNEWHGVINGHLWRLRQMEDSNPVEYYL 64
Query: 94 HTSPSEPAAKS--ALLDFLNMGISLGELWEGFSASD----CRFAELAKY--LAGARVLRQ 145
+ L D+ + ++L +L + +S D RF++ + G R+LRQ
Sbjct: 65 EKNSKIKLNDVPFDLRDYFRLDMNLSDLVKEWSIKDKWFENRFSKNCQMDTARGLRLLRQ 124
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHE-----FPSLERLSLVS 200
DP E L F+ S+NNN+ RITK+ L L S GN E + FPSLE L+
Sbjct: 125 DPEETLFSFITSANNNLTRITKL---LCKLCSEYGNPLYLENGDLGHWTFPSLEILAQPG 181
Query: 201 -EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
+ L+ GFGYRS K+IT + L +GG LL LR +
Sbjct: 182 MQDNLKKIGFGYRS-----------------KFITLAANWLLK--NGGRSRLLELRSVSH 222
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLC 319
EA L +PG+G KVA CI L SLD+ +P+D H+ ++A +PE + + K
Sbjct: 223 VEAQAFLLQIPGIGKKVADCICLMSLDKLDVVPIDVHMHRVAREKGIPEASCKTMATKSY 282
Query: 320 SRVAEAFCE 328
++++ +
Sbjct: 283 DIISKSLSD 291
>gi|297599406|ref|NP_001047107.2| Os02g0552600 [Oryza sativa Japonica Group]
gi|255670991|dbj|BAF09021.2| Os02g0552600 [Oryza sativa Japonica Group]
Length = 74
Score = 114 bits (286), Expect = 5e-23, Method: Composition-based stats.
Identities = 52/72 (72%), Positives = 61/72 (84%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+AKYI GT +LQ+K GG +WL SLR +L E I+ALCTLPGVGPKVAAC+ALFSLDQ+
Sbjct: 3 RAKYIVGTAKILQAKPGGGEKWLASLRTRELPEVIEALCTLPGVGPKVAACVALFSLDQN 62
Query: 289 HAIPVDTHVWKI 300
HAIPVDTHVWK+
Sbjct: 63 HAIPVDTHVWKV 74
>gi|346324366|gb|EGX93963.1| N-glycosylase/DNA lyase [Cordyceps militaris CM01]
Length = 388
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 134/313 (42%), Gaps = 74/313 (23%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
+ + + W L +T +EL + T GQ+FRW+K Y+ +G + L +G +
Sbjct: 1 MAMKRAADWQKLPVTLAELCIDTTLRCGQSFRWRKINDEWYSRRLGRFEVVLTCALHGRI 60
Query: 90 ------CYHIHTSPSEPAAKSALL----------------DFLNMGISLGELWEGFSASD 127
H+H + P+ +S L + ++ ++L EL++ +S +D
Sbjct: 61 LSLKQDSQHLHYQTTWPSFRSELAARSVSIAEDDTEDLLRHYFSLRLNLAELYQQWSRAD 120
Query: 128 CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEF 187
FA+ A AG R+L QD E L+ LC G LG+V
Sbjct: 121 PNFAKKAPEFAGVRILSQDAWEALVHKLC----------------IHYGPLLGHVGDDPI 164
Query: 188 HEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
H+FP+ E LS + E LR GFGYR AKYI T V+ +
Sbjct: 165 HDFPTPESLSGPNVESHLRELGFGYR-----------------AKYIAQTARVIAKEKPA 207
Query: 247 GAEWLLSLRKLD----------------LQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
G WL +LR + + A + L L GVGPKVA C+ L L +
Sbjct: 208 G--WLETLRNPESVMKTSVKAAAGKSPTYKAAHEELLALTGVGPKVADCVCLMGLGWGES 265
Query: 291 IPVDTHVWKIATR 303
+PVDTHVW+IA R
Sbjct: 266 VPVDTHVWQIAQR 278
>gi|328853791|gb|EGG02927.1| hypothetical protein MELLADRAFT_27095 [Melampsora larici-populina
98AG31]
Length = 322
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 142/290 (48%), Gaps = 42/290 (14%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWK--------KTGPLQYTGPIGPHLISLKHLQNGD---V 89
LN+ EL+L LT +GQ+F WK +T ++++ I L+ + +G
Sbjct: 1 LNIHPIELNLSLTLRSGQSFNWKSSIFQTIDQTQHIEWSIVNQKRTIILRQVDSGIHYLS 60
Query: 90 CYH------IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDC--RFAELAKYLAGAR 141
YH ++ + L D+ + +SL +L++ +S D +F L G R
Sbjct: 61 LYHSNQIQDYQNDLNQNTTLTFLNDYFVLNVSLQKLYQEWSRRDSNFKFKTLNGSYNGLR 120
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEG--FEFHEFPSLERLSL- 198
V +QDP E L+ F+CSSNNNI RIT M++ L+ + +++ H FP+ +LS
Sbjct: 121 VCKQDPWETLISFICSSNNNIPRITSMINNLSESFDEIISIQSKPNSIHTFPTPHQLSRS 180
Query: 199 VSEVELRNAGFGYRS-----APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
E +L+ GFGYRS + QS + SV ++ + +Y L S +G + ++
Sbjct: 181 CDETKLKQLGFGYRSSYIMTSTQSLIELSVSQNCQPIEY-------LNSLSNGTLSYSVA 233
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ L + GVGPKV+ CI LF L +PVD H+++IA R
Sbjct: 234 HQTL--------ITHFSGVGPKVSDCICLFGLGFKETVPVDVHLYQIAKR 275
>gi|429966416|gb|ELA48413.1| 8-oxoguanine DNA-glycosylase (ogg) [Vavraia culicis 'floridensis']
Length = 305
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 128/288 (44%), Gaps = 53/288 (18%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH-IHT 95
+W L T ++L T +GQ F +++TG + TG + L++ K Q DV Y+ I
Sbjct: 2 EWQRLK-TDQYINLEKTLYSGQIFSFQRTGDNESTGIVEGFLVTFK--QEKDVIYYKIFN 58
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFS--ASDCR------------------FAELAK 135
++ F + ++ ++++ ++ S CR F E
Sbjct: 59 YKENVNYETVFWKFFTLDLNYKKIFKEWNDKISKCRTETNSENEQNYDLYGLSPFKE--- 115
Query: 136 YLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLER 195
G R+LR D + + F+CSSNNNI RITKMV L SLG ++ + G F+ FP+ E
Sbjct: 116 --NGLRLLRCDLKDTIFSFICSSNNNIKRITKMVLVLFSLGEYITTINGKNFYHFPNPEE 173
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
LS E LR+ GFGYR A YI T ++ + L
Sbjct: 174 LS-DKEQFLRDNGFGYR-----------------ASYIAKTARLMTTMKYD------QLY 209
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ A + L G+ KVA C+ L L H +P+D H+++ A R
Sbjct: 210 NRNYHWAFERLNRYSGISYKVADCVCLLGLHFHDVVPIDIHIFRTAKR 257
>gi|444320759|ref|XP_004181036.1| hypothetical protein TBLA_0E04650 [Tetrapisispora blattae CBS 6284]
gi|387514079|emb|CCH61517.1| hypothetical protein TBLA_0E04650 [Tetrapisispora blattae CBS 6284]
Length = 375
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 135/279 (48%), Gaps = 31/279 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGP-----HLISLKHLQNGDVCYH 92
N+ + L L TGQ FRW ++G T + P +I LK + + YH
Sbjct: 8 FNICRRSLQLENVLQTGQAFRWVLDSESGWYTNTLKLYPLTPEFQIIRLKQ-NDSKLEYH 66
Query: 93 IHTSPSEPAAK--SALLDFLNMGISLGELWE-GFSASDCRFAELAKYLAGARVLRQDPVE 149
+ +S + + + + + M + + +L E + +D F K L G RVL+Q+P E
Sbjct: 67 LESSIKLESREIHNWIKQYFRMEVDVEKLCEEQWLPNDSTFK--GKNLRGVRVLQQEPWE 124
Query: 150 CLLQFLCSSNNNIARITKMVDFLASLGSHLGNV----EGFEFHEFPSLERLSLVSEVE-L 204
L+ F+CSSNNNI RI++M L SH GN+ E ++ FP+ E L + VE L
Sbjct: 125 TLISFICSSNNNIPRISQMCQ---KLSSHYGNLLNKHEFSNYYSFPTSEELCDRASVESL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R+ GFGYR + + + Q K I G ++ + +++ +L K D + +
Sbjct: 182 RSLGFGYR----ARYIIETAKLLVQQKKIVGIIN-----DTEYFQYISNLYKNDYLQIRE 232
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L GVGPKVA C+ L +P+D H+ ++A R
Sbjct: 233 HLMGYSGVGPKVADCVCLMGFHMDDIVPIDVHIGRVAKR 271
>gi|82793294|ref|XP_727983.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23484099|gb|EAA19548.1| OGG1 protein type 2e-related [Plasmodium yoelii yoelii]
Length = 571
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 139/349 (39%), Gaps = 76/349 (21%)
Query: 29 TLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL---- 84
L N + W LN+T ++L L GQ F +K+ Y G + + K
Sbjct: 128 NLIKNYKNNWRILNVTPNDLQLKYCLLIGQEFHFKQVENDSYIGMVNKKIYLFKETCDKI 187
Query: 85 -----------QNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAEL 133
N H + S E + +F N+ L + + + D R E+
Sbjct: 188 LYQCILDTNRKANNANMEHCNDSKKENND-IDIYEFFNLSFPLEDQIKEWIKKDKRMEEV 246
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD--------FLASL---------- 175
+ + G R+L+ D VE FLCS+NNNI RIT M+D F+A++
Sbjct: 247 SNKIKGLRILKNDSVESFFSFLCSTNNNIPRITLMIDCLRRRYGKFIATVIFTNDDIIIF 306
Query: 176 -------------------------------GSHLGNVEGFEFHEFPSL-ERLSLVSEVE 203
S++ N+ F E E + + E
Sbjct: 307 NEKNEQDNKNDVSKIKTEIKHEDAVLDNCLKNSNIQNISKDRFSENKMFYENIKTEVKEE 366
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSF-------KQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+N F + P + ++ S ++ Y+ + +L K GG EW+ +L+K
Sbjct: 367 GKNKIFHFYKFPSIETISNLNESDLRNLGFGYRSGYVIESAKML--KRLGGEEWIENLKK 424
Query: 257 -LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+E ID L PG+G KVA CI LF L+++ IP+DTH++ I +Y
Sbjct: 425 EKKTKECIDKLIKFPGIGLKVANCICLFGLNRYDCIPIDTHIYDIIYKY 473
>gi|365763693|gb|EHN05219.1| Ogg1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 335
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 114/215 (53%), Gaps = 28/215 (13%)
Query: 98 SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+P E L+ F+C
Sbjct: 33 NQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQEPWETLISFIC 90
Query: 157 SSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSA 214
SSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L GFGYR
Sbjct: 91 SSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKLXELGFGYR-- 148
Query: 215 PQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLDLQEAIDALCT 268
AKYI T D ++ + +L S+ + ++ + L +
Sbjct: 149 ---------------AKYIIETARKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMS 193
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 194 YNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 228
>gi|323307819|gb|EGA61081.1| Ogg1p [Saccharomyces cerevisiae FostersO]
Length = 222
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 109/206 (52%), Gaps = 28/206 (13%)
Query: 107 LDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+ + + +SL L++ + SD FA+L+ G R+L Q+P E L+ F+CSSNNNI+RI
Sbjct: 1 MKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQEPWETLISFICSSNNNISRI 58
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSV 223
T+M + L S G+ + ++G +H FP+ E L S +E +LR GFGYR
Sbjct: 59 TRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKLRELGFGYR----------- 107
Query: 224 RRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLDLQEAIDALCTLPGVGPKVA 277
AKYI T D ++ + +L S+ + ++ + L + GVGPKVA
Sbjct: 108 ------AKYIIETARKLVXDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVA 161
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
C+ L L +PVD HV +IA R
Sbjct: 162 DCVCLMGLHMDGIVPVDVHVSRIAKR 187
>gi|393234640|gb|EJD42201.1| DNA glycosylase [Auricularia delicata TFB-10046 SS5]
Length = 353
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/250 (32%), Positives = 123/250 (49%), Gaps = 27/250 (10%)
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAK-YLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ D+ + + L +L+ ++A D F L K G R+LRQDP E L+ F+CS NN+I+R
Sbjct: 1 MRDYFQLDVDLSDLYATWAARDTVFDTLVKDRFKGLRMLRQDPWENLISFICSQNNHISR 60
Query: 165 ITKMVDFLAS-LGSHLGNV----------EGFEFHEFPSLERLSL-VSEVELRNAGFGYR 212
I+KMV L + G+H+ ++ ++H FP L+ +LR GFGYR
Sbjct: 61 ISKMVQALCTHFGTHVASLPIPDSPESESTMQDYHSFPPPSALAAPTVAAKLRALGFGYR 120
Query: 213 SA----PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA-EWLLSLRKLDLQEAIDALC 267
+ +L S S+ + + +S+ A L +LRK+ A L
Sbjct: 121 AEYIQRTAQMVLESACDSYAPCEVDAWPSETARSEADTKALASLEALRKMGTDAARAELL 180
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGV---------RLTPKL 318
L GVG KVA CI L SLD+ +PVDTHV++IA ++ + G +TPK+
Sbjct: 181 KLMGVGRKVADCILLMSLDKREVVPVDTHVYQIALKHYGFRVPGSPTKGGNGKPAMTPKI 240
Query: 319 CSRVAEAFCE 328
VA F +
Sbjct: 241 YDAVANKFVD 250
>gi|358055651|dbj|GAA97996.1| hypothetical protein E5Q_04676 [Mixia osmundae IAM 14324]
Length = 450
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 137/297 (46%), Gaps = 53/297 (17%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQ---YTGPIGPHLISLKHLQNGDVCYHIHTSP 97
L + +EL L GQ+FRW K Q ++ I L+ Q+G Y+ P
Sbjct: 13 LKILPAELCLAAVLRNGQSFRWHKRITCQGELWSIAWSDRTIELR--QDGTAIYYRALYP 70
Query: 98 -----------SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAK--YLAGARVLR 144
++ + L +L++ ISL +L+ +S+ D F + G RVLR
Sbjct: 71 LSAQKEHELDEAQATTLAILRRYLSLSISLSDLYAHWSSRDPNFVKQTDNGRFGGIRVLR 130
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFE----FHEFPSLERLSLV 199
Q E ++ F+CSSNNNIARI M+ L SLGS + + + E + FPS E +
Sbjct: 131 QPIWETIVAFICSSNNNIARIGLMISRLCESLGSKMPSSDSAEEATVMYSFPSPEAIGQA 190
Query: 200 -SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS------------KHSG 246
+E LR GFGYR A+Y+ T ++Q +H G
Sbjct: 191 GNETLLRQLGFGYR-----------------AEYVVKTAQMVQELAKAHNASLAPGEHLG 233
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+L S KL +A + L L GVGPKVA CIALF L +PVD HVW+IA R
Sbjct: 234 VEAYLDSWSKLSYTDAREQLLQLCGVGPKVADCIALFGLGFAQTVPVDRHVWQIAIR 290
>gi|329766589|ref|ZP_08258132.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia SFB1]
gi|329136844|gb|EGG41137.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia SFB1]
Length = 280
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 124/272 (45%), Gaps = 37/272 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++L T +GQ F WKK Y NG I+ S +
Sbjct: 7 INLENTINSGQVFLWKKHKEFWYG-------------INGQDVLKINDSGKITTYSNKKY 53
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
DF G ++ ++ + S + KYL G R+LRQDP +C + F+ SSN+NI +I
Sbjct: 54 DFFRTGDNIEKIIKFISKDKTTKIAVKKYL-GLRLLRQDPFQCFISFIVSSNSNIQKIKS 112
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G EF+ FP ++++ S E++N G GYR
Sbjct: 113 SLEKISIQFGEKIEFDGKEFYLFPEAKKIADASIREIQNCGVGYR--------------- 157
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A++I +++S+ ++K + Q+A + T PG+G KVA CI LFSLD+
Sbjct: 158 --ARFIINAAKMIESEQID----FEYMKKSNYQDAKKMILTTPGIGNKVADCILLFSLDK 211
Query: 288 HHAIPVDTHVWKIATRYLLP--ELAGVRLTPK 317
A P+D + +I +Y L EL +T K
Sbjct: 212 LEAFPLDRWMIRILEKYYLEKFELKTKSITEK 243
>gi|169335491|ref|ZP_02862684.1| hypothetical protein ANASTE_01905 [Anaerofustis stercorihominis DSM
17244]
gi|169258229|gb|EDS72195.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerofustis stercorihominis
DSM 17244]
Length = 289
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 131/271 (48%), Gaps = 30/271 (11%)
Query: 43 LTQSE-LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPA 101
LT+++ +L TF GQ FRW + ++ G + +I +K NG ++ +E
Sbjct: 12 LTKTDHFNLKKTFECGQCFRWDEIEKDEFIGVVNGKVIHIKE-NNGTFTFY---DTTEEY 67
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+ L+D+ + + E+ + S +D + +Y G R+L+QD E ++ F+ S+NNN
Sbjct: 68 FNNVLVDYFDFNLDYNEIDKKIS-TDEHIKKCIEYGNGIRILKQDLFETIISFIISANNN 126
Query: 162 IARITKMV-DFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RI K++ D G + +G ++ FPS E L +++ E G+R
Sbjct: 127 IPRIKKIISDLCERYGKEII-YKGKSYYSFPSYEDLKDITQEEFHELKMGFRD------- 178
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
KY+ VD + +SG + L ++ ++ EA L + GVG KV+ CI
Sbjct: 179 ----------KYL---VDAISKINSGEID-LDTITNMETAEAKKELMKIKGVGEKVSDCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKI-ATRYLLPELA 310
LFSL + P+D V +I TRY L +L
Sbjct: 225 LLFSLKRLELCPMDVWVKRIFETRYGLTDLT 255
>gi|312134912|ref|YP_004002250.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor owensensis OL]
gi|311774963|gb|ADQ04450.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor owensensis OL]
Length = 284
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 127/279 (45%), Gaps = 32/279 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E+ TF +GQ FRWKK +Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEIDFDATFFSGQCFRWKKVDG-RYIGVVNRKIV-LVYPQDSNTFDIYNCSPEE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
NI RI +++ F S G + +GF FP +E L +S EL+ G GYR
Sbjct: 126 NIKRIQLLIERFCQSYGEKV-EYKGFYSWTFPEIENLKKISTEELKRLGLGYR------- 177
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
A+YI + +K G SL L EA L T+ G+G KVA C
Sbjct: 178 ----------AEYIKDAI----AKLDEGKIDFESLETLSSDEARKILKTVKGIGDKVANC 223
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
I L+SL ++ PVD VW + L E G+ T +L
Sbjct: 224 ILLYSLQKYDVFPVD--VW---VKRALREFYGIENTKQL 257
>gi|255099354|ref|ZP_05328331.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-63q42]
Length = 291
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K + D Y +T+ E S ++ ++G
Sbjct: 24 FECGQCFRWHKQEDGSYTGVAKGRILNVK--KENDKIYLNNTNLKE--FNSIWYNYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYTEIKNKLKNMDEYLNKATEFGWGIRILRQDGWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G ++G EG E++ FP+ E L+ S+ E+R G+R KY
Sbjct: 140 RKFGKYIGEYEGNEYYAFPTPEELNKASQEEIRACQTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +E+ L ++ L GVGPKV CIALF + ++ + P
Sbjct: 183 IKSTTQAVIENNDEVSEYT----NLSTEDCRKELLKFNGVGPKVCDCIALFGMQKYDSFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIDE 254
>gi|255654273|ref|ZP_05399682.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-23m63]
gi|296452563|ref|ZP_06894258.1| 8-oxoguanine DNA glycosylase [Clostridium difficile NAP08]
gi|296881025|ref|ZP_06904970.1| 8-oxoguanine DNA glycosylase [Clostridium difficile NAP07]
gi|296258586|gb|EFH05486.1| 8-oxoguanine DNA glycosylase [Clostridium difficile NAP08]
gi|296427984|gb|EFH13886.1| 8-oxoguanine DNA glycosylase [Clostridium difficile NAP07]
Length = 291
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K + D Y +T+ E S ++ ++G
Sbjct: 24 FECGQCFRWHKQEDGSYTGVAKGRILNVK--KENDKIYLNNTNLKE--FNSIWYNYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYTEIKNKLKNMDEYLNKATEFGWGIRILRQDGWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G ++G EG E++ FP+ E L+ S+ E+R G+R KY
Sbjct: 140 RKFGKYIGEYEGKEYYAFPTPEELNKASQEEIRACQTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +E+ L ++ L GVGPKV CIALF + ++ + P
Sbjct: 183 IKSTTQAVIENNDKVSEYT----NLSTEDCRKELLKFNGVGPKVCDCIALFGMQKYDSFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIDE 254
>gi|366998842|ref|XP_003684157.1| hypothetical protein TPHA_0B00510 [Tetrapisispora phaffii CBS 4417]
gi|357522453|emb|CCE61723.1| hypothetical protein TPHA_0B00510 [Tetrapisispora phaffii CBS 4417]
Length = 379
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 141/296 (47%), Gaps = 59/296 (19%)
Query: 43 LTQSELSLPLTFPTGQTFRW-------KKTGPLQYTGPIGPHLISLKH----------LQ 85
+ + EL L GQ+FRW + ++ +G H++ LK L+
Sbjct: 11 VRKKELLLDNVLQVGQSFRWVYNESSQEYVTTMKISGQY--HVVLLKQVELNQDDESDLK 68
Query: 86 NGDVCYH-IHTSPSEPAAKSALLDFLNMGISLGELWEGFSA-SDCRFAELAKYLAGARVL 143
N V Y + + + +S L+D+ M +S+ L+ + +D RF A Y G R+L
Sbjct: 69 NTIVQYATLGDTCDSKSLESHLMDYFRMDVSVENLYNKYWLPNDVRFK--AHYPTGNRML 126
Query: 144 RQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLG-----NVEGFEFHEFPSLERL- 196
Q+P E L+ F+CSSNNNI+RITKM + + G+ L + G + FPS + +
Sbjct: 127 AQEPWETLVSFICSSNNNISRITKMCHELCRNFGTELKPEIDRDDTGSSHYTFPSSDDIV 186
Query: 197 SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVL--QSK---HSGGAEWL 251
+ +E +LR GFGYR AKYI T +L Q K + +E+L
Sbjct: 187 NKATEDKLRELGFGYR-----------------AKYIMETAKLLVEQKKANDFTDDSEYL 229
Query: 252 LSLRK----LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LS++ ++L+E L GVGPKVA C+ L +PVD H+ +IA R
Sbjct: 230 LSVKDSSSYIELRE---HLMGYTGVGPKVADCVCLMGFRHDDVVPVDVHISRIAKR 282
>gi|126697764|ref|YP_001086661.1| DNA glycosylase [Clostridium difficile 630]
gi|255305187|ref|ZP_05349359.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile ATCC 43255]
gi|423089089|ref|ZP_17077453.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 70-100-2010]
gi|115249201|emb|CAJ67013.1| putative DNA glycosylase [Clostridium difficile 630]
gi|357558670|gb|EHJ40155.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 70-100-2010]
Length = 291
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K + D Y +T+ E S ++ ++G
Sbjct: 24 FECGQCFRWHKQEDGSYTGVAKGRILNVK--KENDKIYLNNTNLKE--FNSIWYNYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYTEIKNKLKNMDEYLNKATEFGWGIRILRQDGWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G ++G EG E++ FP+ E L+ S+ E+R G+R KY
Sbjct: 140 RKFGKYIGEYEGNEYYAFPTPEELNKASQEEIRACQTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +E+ L ++ L GVGPKV CIALF + ++ + P
Sbjct: 183 IKSTTQAVIENNDEVSEYT----NLSTEDCRKELLKFNGVGPKVCDCIALFGMQKYDSFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIDE 254
>gi|158321471|ref|YP_001513978.1| 8-oxoguanine DNA glycosylase [Alkaliphilus oremlandii OhILAs]
gi|158141670|gb|ABW19982.1| 8-oxoguanine DNA glycosylase domain protein [Alkaliphilus
oremlandii OhILAs]
Length = 291
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/256 (30%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K ++ + H S E K + + ++
Sbjct: 24 FECGQCFRWDKEQDGSYTGVAFGKVLNVKKEEDNVIFKH---SNKEEFHK-IWIPYFDLD 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G++ D A G R+L+QD E L+ F+ SSNNNI RI K V+ ++
Sbjct: 80 KDYGKIKAALIKEDEIMKSAADLGNGIRILQQDSWETLISFIISSNNNIPRIKKAVNLIS 139
Query: 174 S-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G +LG +G + + FP E + +S EL + G GYR AKY
Sbjct: 140 ERFGLYLGEYDGKKQYSFPEPEVVCALSNEELTSCGVGYR-----------------AKY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + K+ L L+KLD + +AL GVGPKVA CI F++ + A P
Sbjct: 183 IIDTAKAVVEKNI----LLDELKKLDSSDCFEALLQFNGVGPKVANCILFFAMGKVDAFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + +
Sbjct: 239 VDVWVKRVMEHFYFKK 254
>gi|254973851|ref|ZP_05270323.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-66c26]
gi|255091236|ref|ZP_05320714.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile CIP 107932]
gi|255312895|ref|ZP_05354478.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-76w55]
gi|255515654|ref|ZP_05383330.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-97b34]
gi|255648748|ref|ZP_05395650.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile QCD-37x79]
gi|260681970|ref|YP_003213255.1| DNA glycosylase [Clostridium difficile CD196]
gi|260685568|ref|YP_003216701.1| DNA glycosylase [Clostridium difficile R20291]
gi|384359522|ref|YP_006197374.1| DNA glycosylase [Clostridium difficile BI1]
gi|260208133|emb|CBA60418.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile CD196]
gi|260211584|emb|CBE01792.1| putative DNA glycosylase (DNA repair protein) [Clostridium
difficile R20291]
Length = 291
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K + D Y +T+ E S ++ ++G
Sbjct: 24 FECGQCFRWHKQEDGSYTGVARGRILNVK--KENDKIYLNNTNLKE--FNSIWYNYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYTEIKNKLKNMDEYLNKATEFGWGIRILRQDGWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G ++G EG E++ FP+ E L+ S+ E+R G+R KY
Sbjct: 140 RKFGKYIGEYEGNEYYAFPTPEELNKASQEEIRACQTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +E+ L ++ L GVGPKV CIALF + ++ + P
Sbjct: 183 IKSTTQAVIENNDEVSEYT----NLSTEDCRKELLKFNGVGPKVCDCIALFGMQKYDSFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIDE 254
>gi|302872099|ref|YP_003840735.1| 8-oxoguanine DNA glycosylase [Caldicellulosiruptor obsidiansis
OB47]
gi|302574958|gb|ADL42749.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor obsidiansis OB47]
Length = 284
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 122/278 (43%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E+ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEIDFDATFFSGQCFRWKKVDG-GYIGVVNRKIV-LVYPQDSNTFDIYNCSPEE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L GF FP +E L +S EL+ G GYR
Sbjct: 126 NIKRIQMLVERLCQAYGEKIEYRGFSSWTFPEIENLKKISTEELKRLGLGYR-------- 177
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A+YI + +K G SL L EA L T+ G+G KVA CI
Sbjct: 178 ---------AEYIKDAI----AKLDEGKIDFESLESLSSDEARKILKTVKGIGDKVANCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ T +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENTKQL 257
>gi|344995770|ref|YP_004798113.1| 8-oxoguanine DNA glycosylase [Caldicellulosiruptor lactoaceticus
6A]
gi|343963989|gb|AEM73136.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor lactoaceticus 6A]
Length = 284
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 124/278 (44%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E++ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEINFDATFFSGQCFRWKKVDG-GYIGVVNRKIV-LVYPQDNNTFDIYNCSPDE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L +GF F FP +E L VS EL+ G GYR
Sbjct: 126 NIKRIQILVERLCQAYGEKIEYKGFSFWTFPEIEDLKKVSIEELKCLGLGYR-------- 177
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A+YI + +K G SL L EA L T+ G+G KVA CI
Sbjct: 178 ---------AEYIKDAI----AKLEEGKIDFESLASLKSDEARKILKTVKGIGDKVADCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENIKQL 257
>gi|71660156|ref|XP_821796.1| 8-oxoguanine DNA glycosylase [Trypanosoma cruzi strain CL Brener]
gi|70887184|gb|EAN99945.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma cruzi]
Length = 486
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 147/354 (41%), Gaps = 113/354 (31%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPI--GPHLIS------------------------L 81
++LP+T GQ FRW++T + G + G + +S L
Sbjct: 51 VNLPMTLCGGQCFRWRRTPRGTWVGVVERGAYELSDAAHPPEFQAVYSRGEEKRGMRPSL 110
Query: 82 KHLQN---GDVCYH--IHTSPSEPAAKSA----LLDFLNMGISLGELWEGFSASD-CRFA 131
H + GDV + +H P + S L +L + + L +LW ++ + R
Sbjct: 111 SHPSDDLSGDVLWFRCLHREPKNASDLSTEACFLRHYLALDVDLQKLWRRWTRDNPMRNH 170
Query: 132 ELAKYL---AGA------RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHL-- 179
L +YL AG R LRQ+ E LL FLCS NNN+ RIT +V+ LA S G HL
Sbjct: 171 PLVRYLTSNAGKGPSVKIRHLRQNIHETLLAFLCSQNNNVQRITGLVEKLATSYGDHLCD 230
Query: 180 ------------------------------GNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
G+ + H PS++ L+ SE ELR GF
Sbjct: 231 YNLETGDVRNVGYLNHSSTRSTKNAKRADTGDGDWIPLHTIPSMDELARRSEDELRALGF 290
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS-------KHSG---------GAEWLLS 253
GYRS KYI ++QS K G +W
Sbjct: 291 GYRS-----------------KYIVQCASIIQSSGATRRKKEEGVNCFCSSMQSYKWYDD 333
Query: 254 LRK--LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
L L L + + L +LPGVG KVA CI LF++ H +PVDTH+ ++AT YL
Sbjct: 334 LLDPCLSLSDRREKLLSLPGVGRKVADCILLFAVGHHEIVPVDTHMAQVATEYL 387
>gi|342320515|gb|EGU12455.1| glycosylase/lyase [Rhodotorula glutinis ATCC 204091]
Length = 491
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 146/343 (42%), Gaps = 71/343 (20%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYT----GPIGPHLISLKHL-------QNGDV 89
L LT+ +LSL +GQ+FRW + P+ T +L H Q+
Sbjct: 11 LALTRQQLSLKTVCLSGQSFRWHRCTPITATIDAPQSADSEEWALAHAGRTLVLRQDDSG 70
Query: 90 CYHIHTSPSEPAAKSALLD------------FLNMGISLGELWEGFSASDCRF-----AE 132
Y+ P P + + D + + ++L +L+ ++ D +F ++
Sbjct: 71 IYYRALYPFSPPHTAYIADLATDTSTPLLRAYFQLDVNLDDLYRQWARDDPKFRRKIESD 130
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGF------ 185
L K L G RVL+QD E L+ F+CS+NNNIARIT MV+ L A+LGS L + F
Sbjct: 131 LEKRLEGIRVLKQDEWETLVSFICSANNNIARITLMVNRLCAALGSPLPHPSHFTPSCVH 190
Query: 186 -----------------EFHEFPSLERLSLVSEVE--LRNAGFGYR-----SAPQSSLLF 221
FP L+ ++ LR GFGYR S Q+ L
Sbjct: 191 STSSSIPASPSPAPNDSSLFSFPPPHALADEKRIDPLLRQLGFGYRAPFIPSTAQTLLSS 250
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
S Y++ ++ G + EA + L GVG KVA C+
Sbjct: 251 SSSLGLSPESYLSSLNRAQFTREGKG-----------IAEAREKLVEFKGVGRKVADCVL 299
Query: 282 LFSLDQHHAIPVDTHVWKIATR-YLLPELAGVRLTPKLCSRVA 323
LF + + +PVDTHV++IA R Y P LTP L RV+
Sbjct: 300 LFGMGWNELVPVDTHVFQIAIRDYSFPSPRSASLTPALHDRVS 342
>gi|423080568|ref|ZP_17069188.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 002-P50-2011]
gi|423086019|ref|ZP_17074452.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 050-P50-2011]
gi|357548043|gb|EHJ29916.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 050-P50-2011]
gi|357552941|gb|EHJ34704.1| 8-oxoguanine DNA-glycosylase [Clostridium difficile 002-P50-2011]
Length = 291
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 116/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K + D Y +T+ E S ++ ++G
Sbjct: 24 FECGQCFRWHKQEDGSYTGVAKGRILNVK--KENDKIYLNNTNLKE--FNSIWYNYFDLG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ D + ++ G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYTEIKNKLKNMDEYLNKATEFGWGIRILRQDGWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G +G EG E++ FP+ E L+ S+ E+R G+R KY
Sbjct: 140 RKFGKCIGEYEGNEYYAFPTPEELNKASQEEIRACQTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +E+ L ++ L GVGPKV CIALF + ++ + P
Sbjct: 183 IKSTTQAVIENNDKVSEYT----NLSTEDCRKELLKFNGVGPKVCDCIALFGMQKYDSFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIDE 254
>gi|312622687|ref|YP_004024300.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor kronotskyensis 2002]
gi|312203154|gb|ADQ46481.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor kronotskyensis 2002]
Length = 284
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 124/278 (44%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E++ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEINFDATFFSGQCFRWKKV-EYGYIGVVNRKIV-LVYPQDNNTFDIYNCSPDE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L GF FP +E L VS +L+ G GYRS
Sbjct: 126 NIKRIQILVERLCQAYGEKIEYRGFSSWTFPEIEDLKKVSIEDLKRLGLGYRS------- 178
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
+YI V +K G SL L+ EA L T+ G+G KVA CI
Sbjct: 179 ----------EYIKDAV----AKLDEGKIDFESLGSLNSDEARKILKTVKGIGDKVANCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ T +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENTKQL 257
>gi|269860503|ref|XP_002649972.1| 8-oxoguanine DNA glycosylase DNA-(apurinic or apyrimidinic site)
lyase [Enterocytozoon bieneusi H348]
gi|269862387|ref|XP_002650817.1| 8-oxoguanine DNA glycosylase; DNA-lyase [Enterocytozoon bieneusi
H348]
gi|220065570|gb|EED43240.1| 8-oxoguanine DNA glycosylase; DNA-lyase [Enterocytozoon bieneusi
H348]
gi|220066591|gb|EED44067.1| 8-oxoguanine DNA glycosylase DNA-(apurinic or apyrimidinic site)
lyase [Enterocytozoon bieneusi H348]
Length = 278
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 122/270 (45%), Gaps = 42/270 (15%)
Query: 35 PSKWTPLNL-TQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+W ++L + ++ L T +GQ F + KT +Y G +G L+ LK +N V I
Sbjct: 6 DDQWIEISLGSGMKIDLEETLYSGQVFHFNKTNDNEYCGFVGEDLLILKQ-RNSTVLCKI 64
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
+E A + +F N+ I + ++L + G R L D +
Sbjct: 65 VNEYTEQAVR----NFFNLDILVD-------------SQLCS-VDGLRFLTNDLYSTIFS 106
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
F+CS NNN+ RI MV L G ++ N + H+FP L+ LV ++ R GFGYR+
Sbjct: 107 FICSQNNNVKRIEGMVHCLYEKGPYI-NYKNISVHKFPQLD--ELVDDMHWRAHGFGYRA 163
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
S + AKYI +H L+K+ A + LC + G+G
Sbjct: 164 DYIS----------QAAKYIREHQSSFTKEH---------LQKIGYGNAKNKLCEIKGIG 204
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATR 303
KVA CI L SL+ +P+D H++ + R
Sbjct: 205 RKVADCICLISLEYFDVVPIDRHIFNYSQR 234
>gi|393911137|gb|EJD76182.1| hypothetical protein LOAG_16817 [Loa loa]
Length = 312
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 135/301 (44%), Gaps = 43/301 (14%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI--HTSPS 98
L ++ EL+L +GQ+FRWKK I P + D+ + + H
Sbjct: 4 LKCSKEELNLGAVLLSGQSFRWKKL-VTNVDENIAP--------SSDDIFFGVAKHRVWK 54
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAEL----AKYLAGARVLRQDPVECLLQF 154
L + +GI L L++ +S +D FA L L G RVL QDP+E + F
Sbjct: 55 VWRENDEQLGY-ELGIELIPLYKLWSENDKHFARLLANHRTKLEGIRVLGQDPLETVFAF 113
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFE----FHEFPSLERLSLVSEVE--LRNAG 208
+CS+NN+I RIT MV+ L L NV F++F L+R++ +E LR G
Sbjct: 114 ICSANNHIRRITNMVETLCELYGESTNVSCLNGIKTFYDFADLKRMTDDPSLETVLRTRG 173
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
FGYR A I LQ G ++L +L K + A++ L
Sbjct: 174 FGYR-----------------ALNIALASKALQD---NGEQFLENLSKGTYENAVEELQQ 213
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKI-ATRYLLPELAGVRLTPKLCSRVAEAFC 327
+ G+G KVA CI L L H +P+DTH +I A YL L L K +VA +
Sbjct: 214 MRGIGAKVADCICLMGLRMHFVVPIDTHTLQITAENYLGTLLQRKSLQEKDRRQVAAIWQ 273
Query: 328 E 328
E
Sbjct: 274 E 274
>gi|222529057|ref|YP_002572939.1| 8-oxoguanine DNA glycosylase [Caldicellulosiruptor bescii DSM 6725]
gi|222455904|gb|ACM60166.1| 8-oxoguanine DNA glycosylase domain protein [Caldicellulosiruptor
bescii DSM 6725]
Length = 284
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 122/278 (43%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L + E++ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRINGVEINFDATFFSGQCFRWKKV-EYGYIGVVNRKIV-LVYPQDNNTFDIYNCSPDE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L GF FP +E L VS +L+ G GYR
Sbjct: 126 NIKRIQILVERLCQAYGEKIEYRGFSSWTFPEIEDLKKVSIEDLKRLGLGYR-------- 177
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A+YI + +K G SL L EA L T+ G+G KVA CI
Sbjct: 178 ---------AEYIKDAI----AKLDEGKIDFESLESLSSDEARKILKTVKGIGDKVANCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ T +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENTKQL 257
>gi|402073698|gb|EJT69250.1| N-glycosylase/DNA lyase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 538
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 136/354 (38%), Gaps = 107/354 (30%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGP-------------------LQYTG------ 72
W L ++ +EL + T GQ+FRW K L Y
Sbjct: 59 WRKLPVSLAELCIDTTLRCGQSFRWLKIQDEWHCSLHGRVVSLKQDETHLHYRATWPTSR 118
Query: 73 -------PIGPHLISLKHLQNGDV-------CYHIHTSPSEPAAKSALLDFLNMGISLGE 118
P+ P I + Q+G+ + + + LL + NM SL
Sbjct: 119 QLKLSPAPLSPEAIKDEQEQDGEAESTPTTPPPEATAAADDDGTERLLLSYFNMSHSLAG 178
Query: 119 LWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL--G 176
L+ +S D F+ A AG R+L QD E L+ F+CS+NNNIARI++M L G
Sbjct: 179 LYAEWSGKDANFSRKAPKFAGVRILNQDAWETLVAFICSANNNIARISQMTHKLCDHYGG 238
Query: 177 SHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ V G F +FP+ L+ E LR GFGYR A+YI
Sbjct: 239 EPIAMVAGHVFRDFPAPSALAADGVEQNLRLLGFGYR-----------------ARYIAE 281
Query: 236 TVDVLQSKHSGGAEWLLSLR--------------------KLDLQEAID----------- 264
T ++ + G WL +LR +L E D
Sbjct: 282 TARMVTGQLPHG--WLRALRNPALLGWGSDGAEGQEQQKQRLVKTEQPDDDEVAAAAAAA 339
Query: 265 ---------------ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
AL +L GVGPKV+ C+ L L ++P+DTH+W IATR
Sbjct: 340 AVTPAEAPTYRAAHVALLSLSGVGPKVSDCVCLMGLGWGESVPIDTHMWTIATR 393
>gi|313887714|ref|ZP_07821396.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptoniphilus harei
ACS-146-V-Sch2b]
gi|312846323|gb|EFR33702.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptoniphilus harei
ACS-146-V-Sch2b]
Length = 291
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 133/278 (47%), Gaps = 37/278 (13%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW + +T H I + GD I E + + D+ ++
Sbjct: 24 FTCGQAFRWYEEEDGSFT--FVTHGIVANAKKVGDEI--ILKGVDEKSFEEIFYDYFDLS 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E A D + Y G R+L QD E ++ F+ S+NN I RI K ++ ++
Sbjct: 80 RDYKLVMEEL-AQDEVMKDATDYGKGIRILNQDKFETIISFIISANNQIPRIQKAIEKIS 138
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAK 231
+ G +LG + +++ FPS ++L+L +LR A G+R +R + +K
Sbjct: 139 EMYGDYLGEDKNRKYYAFPSAQQLALAKPEDLREFARVGFRD----------KRIVEASK 188
Query: 232 YI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
I +G VD+ + + ++DL++A L TLPGVGPKVA CI LF+ D+ +
Sbjct: 189 LIASGQVDIER------------ISEMDLEDARKELQTLPGVGPKVADCILLFAFDRKES 236
Query: 291 IPVDTHVWKIATRYLLPELAGVRLTP--KLCSRVAEAF 326
PVD VW + ++ EL +TP K+ +R E F
Sbjct: 237 FPVD--VW---IKRVMEELYLKEVTPKSKIATRGREVF 269
>gi|71402505|ref|XP_804159.1| 8-oxoguanine DNA glycosylase [Trypanosoma cruzi strain CL Brener]
gi|70866980|gb|EAN82308.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma cruzi]
Length = 448
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 144/338 (42%), Gaps = 81/338 (23%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPI--GPHLIS------------------------L 81
++LP+T GQ FRW++T + G + G + +S L
Sbjct: 13 VNLPMTLCGGQCFRWRRTPRGTWVGVVERGAYELSDAAHPPEFQAVHPRGEEKRGMRSSL 72
Query: 82 KHLQN---GDVCYH--IHTSPSEPAAKSA----LLDFLNMGISLGELWEGFSASD-CRFA 131
H + GDV + +H P S L +L + + L +LW ++ + R
Sbjct: 73 SHPSDDLSGDVFWFRCLHREPKNALDLSTEACFLRHYLALDVDLQKLWRRWTRDNPMRDH 132
Query: 132 ELAKYL---AGA------RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHL-- 179
L +YL AG R LRQ+ E LL FLCS NNN+ RIT +V+ LA S G HL
Sbjct: 133 PLVRYLTSNAGKGPSVNIRHLRQNIHETLLAFLCSQNNNVQRITGLVEKLATSYGDHLCD 192
Query: 180 ------------------------------GNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
G+ + H PS++ L+ SE ELR GF
Sbjct: 193 YNLETGDVRNVGYLNHTSTRPTKNAKRDDTGDGDWIPLHTMPSMDELARRSEDELRTLGF 252
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK--LDLQEAIDALC 267
GYRS ++ S + V+ S +W L L L + + L
Sbjct: 253 GYRSKYIVQCASIIQSSGATRRKKEEGVNCFCSSMQS-YKWYDDLLDPCLSLYDRREKLL 311
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+LPGVG KVA CI LF++ H +PVDTH+ ++AT YL
Sbjct: 312 SLPGVGRKVADCILLFAVGHHEIVPVDTHMAQVATEYL 349
>gi|402591993|gb|EJW85922.1| helix-hairpin-helix domain-containing protein family protein
[Wuchereria bancrofti]
Length = 333
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 47/290 (16%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKK----------TGPLQYTGPIGPHLISLKHLQNGDVC 90
L +++EL+L GQ+FRW+K + G + + + +
Sbjct: 4 LKCSKNELNLGAVLLGGQSFRWRKLVTNDENISPASDDIFLGVAKHRVWKIWRENDEQLG 63
Query: 91 YHIHTSPSEPAAKS--ALLDFLNMGISLGELWEGFSASDCRFAELAK----YLAGARVLR 144
Y + S+ L D+ + I L L++ ++ +D FA L + L G RVL
Sbjct: 64 YEVLAKFSKARGNDLDVLKDYFQLDIELMPLYKLWAENDKYFAHLLENHRMKLEGIRVLG 123
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNV---EGFE-FHEFPSLERLSLVS 200
QDP+E + F+CS+NN+I RIT MV+ L L N+ G + F++F L+R++
Sbjct: 124 QDPLETVFAFICSANNHIRRITNMVETLCELYGESTNIPCSNGMKTFYDFADLKRMTDDP 183
Query: 201 EVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK--HSGGAEWLLSLRK 256
+E LR GFGYR+ + L SK G ++L +L K
Sbjct: 184 ALEKVLRIRGFGYRALNIA----------------------LASKTLEDNGEQFLENLSK 221
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI-ATRYL 305
++A++ L + G+G KVA CI L L H +P+DTH +I A YL
Sbjct: 222 GSYEDAMEELQQMRGIGAKVADCICLMGLRMHSVVPIDTHTLQITAENYL 271
>gi|347542114|ref|YP_004856750.1| 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-rat-Yit]
gi|346985149|dbj|BAK80824.1| 8-oxoguanine-DNA-glycosylase [Candidatus Arthromitus sp.
SFB-rat-Yit]
Length = 291
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 50 LPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDF 109
L F GQ FRW++T Y +I L+ + N ++ H S L DF
Sbjct: 23 LKHIFENGQCFRWERTDTGTYIVVAKERVIELE-MDNFNLIIH----------NSNLEDF 71
Query: 110 LNMGISLGELWEGFSA------SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
N+ I + +S D E + G R+L QDP E +L F+ SSNN I
Sbjct: 72 ENIWIDYFDFERNYSKLKKNLKQDKYLYEAINFGYGLRILNQDPFEMILSFIISSNNRIP 131
Query: 164 RITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
I K + ++ + +G +++FPSL++LS V++ ELR G G+R
Sbjct: 132 MIKKAISNISERYGDSISYKGKIYYKFPSLDKLSTVTQEELRECGVGFRD---------- 181
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAA 278
KY+ T+ +L ++ E++++L D LC GVG KVA
Sbjct: 182 -------KYLYNTIKLLSEEND--IEYIMNLG--------DDLCHKELQKFNGVGVKVAD 224
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR-YLLPE--LAGVR 313
CI LFS+ ++ A PVD V + + Y+ P+ L G+R
Sbjct: 225 CIMLFSMKKYSAFPVDVWVKRAMMKFYVAPDTSLKGIR 262
>gi|312127881|ref|YP_003992755.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor hydrothermalis 108]
gi|311777900|gb|ADQ07386.1| 8-oxoguanine DNA glycosylase domain protein [Caldicellulosiruptor
hydrothermalis 108]
Length = 284
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 122/278 (43%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E++ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEINFDATFFSGQCFRWKK-AECGYIGVVNRKIV-LVYPQDNNTFDIYNCSPDE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L GF FP +E L +S EL+ G GYR
Sbjct: 126 NIKRIQILVERLCQAYGEKIEYRGFSSWTFPEIEDLKKISTEELKRLGLGYR-------- 177
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A+YI + +K G SL L EA L T+ G+G KVA CI
Sbjct: 178 ---------AEYIKDAI----AKLDEGKIDFESLESLSSDEARKILKTVKGIGDKVANCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENIKQL 257
>gi|407850887|gb|EKG05061.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma cruzi]
Length = 449
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/375 (29%), Positives = 154/375 (41%), Gaps = 119/375 (31%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPI--GPHLIS------------------------L 81
++LP+T GQ FRW++T + G + G + +S L
Sbjct: 14 VNLPMTLCGGQCFRWRRTPRGTWVGVVERGAYELSDAAHPPEFQAVHSRGEEKRGMRSSL 73
Query: 82 KHLQN---GDVCYH--IHTSPSEPAAKSA----LLDFLNMGISLGELWEGFSASD-CRFA 131
H + GDV + +H P S L +L + + L +LW ++ + R
Sbjct: 74 SHPSDDLTGDVLWFRCLHREPKNALDLSTEACFLRHYLALDVDLQKLWRRWTRDNPMRDH 133
Query: 132 ELAKYL---AGA------RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHL-- 179
L +YL AG R LRQ+ E LL FLCS NNN+ RIT +V+ LA S G HL
Sbjct: 134 PLVRYLTSNAGKGPFVKIRHLRQNIHETLLAFLCSQNNNVQRITGLVEKLATSYGGHLCD 193
Query: 180 ------------------------------GNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
G+ + H PS++ L+ SE ELR GF
Sbjct: 194 YNLETGDVRNVGYLNHSSTRSTKNAKRADTGDGDWIPLHTIPSMDELARRSEDELRALGF 253
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS-------KHSG---------GAEWLLS 253
GYRS KYI ++QS K G +W
Sbjct: 254 GYRS-----------------KYIVQCASIIQSSGATRRKKEEGVNCFCSSMQSYKWYDD 296
Query: 254 LRK--LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
L L L + + L +LPGVG KVA CI LF++ H +PVDTH+ ++AT Y LAG
Sbjct: 297 LLDPCLSLSDRREKLLSLPGVGRKVADCILLFAVGHHEIVPVDTHMAQVATEY----LAG 352
Query: 312 VRLTPK--LCSRVAE 324
K LC+ + E
Sbjct: 353 TATCSKKVLCNGMGE 367
>gi|395518379|ref|XP_003763339.1| PREDICTED: N-glycosylase/DNA lyase-like, partial [Sarcophilus
harrisii]
Length = 150
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 84/144 (58%), Gaps = 1/144 (0%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P W + +SEL L L +GQ+F W++ P +TG + L +L +G + Y
Sbjct: 7 TSAPGLWASIPCPRSELDLDLVLSSGQSFSWREQSPGHWTGVLDGRLWTLTQ-ADGRLHY 65
Query: 92 HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
++ P+ P A L F + + L +L+ ++++D F ++A G R+LRQ+PVECL
Sbjct: 66 TVYGEPAGPGAPRTLRRFFQLHVRLADLYRRWASADPHFRDVAPRFPGVRLLRQEPVECL 125
Query: 152 LQFLCSSNNNIARITKMVDFLASL 175
F+CSSNN+++RIT MV L +L
Sbjct: 126 FSFICSSNNHLSRITGMVQRLRAL 149
>gi|443924912|gb|ELU43858.1| ubiquitin fusion degradation protein I [Rhizoctonia solani AG-1 IA]
Length = 859
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 139/337 (41%), Gaps = 89/337 (26%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD------------ 88
+ L ++L+L GQ+FRW P+ P LKH + D
Sbjct: 445 IKLPLAQLNLKAVLKCGQSFRWTMV-------PLDPTNQDLKHTRIDDEANPHELPTEEW 497
Query: 89 --------VC------------YHIHTSP---SEPAAKSALL---DFLNMGISLGELWEG 122
VC Y +P S+ A S LL D+ + I L L+
Sbjct: 498 RLTLNDRVVCLRQTENELFYRAYFPTNAPPDLSDDAQDSTLLWLRDYFQLDIDLEALYAD 557
Query: 123 FSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA--------S 174
+S D F ++A G R+LRQDP E ++ S NN+I+RIT MV L S
Sbjct: 558 WSKRDAVFQKVAPRFLGIRILRQDPWENVV----SQNNHISRITSMVHSLCIHFSPPVCS 613
Query: 175 LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYI 233
G+ +H FP + L+ S E +LR GFGYR AKYI
Sbjct: 614 SSDLPGSPLNTAWHSFPPPKALADPSVETKLRELGFGYR-----------------AKYI 656
Query: 234 TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
T +L KH + L LR+L EA + L L GVGPKVA +PV
Sbjct: 657 QKTATMLCEKHEDPMKALFELRRLSTSEARERLLELHGVGPKVA-----------EVVPV 705
Query: 294 DTHVWKIATRYLLPELAGVR---LTPKLCSRVAEAFC 327
DTHV +IAT+ + G + + PKL S +A F
Sbjct: 706 DTHVQQIATKMYGFKFQGKQPKAMNPKLYSEIASKFA 742
>gi|15895964|ref|NP_349313.1| 8-oxoguanine DNA glycosylase [Clostridium acetobutylicum ATCC 824]
gi|337737917|ref|YP_004637364.1| 8-oxoguanine DNA glycosylase [Clostridium acetobutylicum DSM 1731]
gi|384459427|ref|YP_005671847.1| 8-oxoguanine DNA glycosylase [Clostridium acetobutylicum EA 2018]
gi|15025740|gb|AAK80653.1|AE007768_7 8-oxoguanine-DNA-glycosylase [Clostridium acetobutylicum ATCC 824]
gi|325510116|gb|ADZ21752.1| 8-oxoguanine-DNA-glycosylase [Clostridium acetobutylicum EA 2018]
gi|336291376|gb|AEI32510.1| 8-oxoguanine-DNA-glycosylase [Clostridium acetobutylicum DSM 1731]
Length = 292
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 124/264 (46%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G ++ ++ + V Y+I+ E K+
Sbjct: 21 FELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNIN----EEEFKNVWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + S D + + G R+LRQDP E LL F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEIKKELS-RDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G ++ FP++++L +E + G+R
Sbjct: 136 CINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ TVD + + G L ++ L+ E + L GVGPKVA CI LFS+ +
Sbjct: 181 --AKYLKDTVDRIYN----GELNLEYIKSLNDNECHEELKKFMGVGPKVADCIMLFSMQK 234
Query: 288 HHAIPVDTHVWK-IATRYLLPELA 310
+ A PVDT V K + + Y+ P+++
Sbjct: 235 YSAFPVDTWVKKAMMSLYVAPDVS 258
>gi|182413797|ref|YP_001818863.1| helix-hairpin-helix DNA-binding motif-containing protein [Opitutus
terrae PB90-1]
gi|177841011|gb|ACB75263.1| helix-hairpin-helix motif [Opitutus terrae PB90-1]
Length = 334
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 133/318 (41%), Gaps = 58/318 (18%)
Query: 23 TPPNPQTLTTNKP----SKWT--PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGP 76
TPP ++L P S+W P T S L GQ FRW++T + + G G
Sbjct: 2 TPPASRSLIPEAPASGWSEWQRLPSAETFSPRVLAEILDGGQAFRWQRTSDVTWLGQFGD 61
Query: 77 HLISLKHLQNGDVCYHIHTSPSEPAAKS--ALLDFLNMGISLGELWEGFS-ASDCRFAEL 133
++ + G + + +SPS A AL+ +L++ L + SD A
Sbjct: 62 GVVRIALTAEGQLAW---SSPSSLTAHVGPALVAYLDLARDSRTLADTLPWRSDAHLARC 118
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA------------------SL 175
+ G R+LRQ E LL FLCS+ I +I +MV LA +
Sbjct: 119 LETFPGLRILRQPFGETLLGFLCSATKQIVQIKQMVALLADRHGAPVLPTTADRSSLANR 178
Query: 176 GSHLGNVEGFE------FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQ 229
+ LG+ FH P+ L+ VSE ELR G+R
Sbjct: 179 AAALGDPSTLNPQPSTPFHRLPTWPELAAVSEAELRACLLGFR----------------- 221
Query: 230 AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
A+YI T L + H G WL L A D LC+LPGVG KVA C+ LF +
Sbjct: 222 ARYIHQTAQFL-AAHPG---WLAETEALPYAAAKDRLCSLPGVGEKVADCVLLFGAGRLE 277
Query: 290 AIPVDTHVWK-IATRYLL 306
A PVD + K +A RY L
Sbjct: 278 AFPVDVWIIKTMAARYGL 295
>gi|124486288|ref|YP_001030904.1| hypothetical protein Mlab_1472 [Methanocorpusculum labreanum Z]
gi|124363829|gb|ABN07637.1| 8-oxoguanine DNA glycosylase domain protein [Methanocorpusculum
labreanum Z]
Length = 299
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 121/266 (45%), Gaps = 32/266 (12%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
+L + L LT GQ FRW++ Y P+G + ++ Q D+ ++ P+E
Sbjct: 23 FSLKDTPFQLDLTVSCGQAFRWRQYNGFWYA-PVGDKIWKIR--QERDLL--LYDGPTE- 76
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
L+ + + I L ++ A + + L G R++RQDP ECL+ ++C++
Sbjct: 77 ---KELIRYFGLDIPLDDILTDIDRDPLIHAAIRRCL-GLRIIRQDPWECLISYICATCA 132
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI I ++ L+ + ++ FH FP +RL+ +R GYR A
Sbjct: 133 NIPGIMMRIENLSERYGNKIEMDEMTFHTFPDAKRLAEEEMCSIRTCKVGYRDA------ 186
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
YI G ++ S A W + + EA + L L GVGPKVA C+
Sbjct: 187 -----------YICGAAEMAASD----ANWAEKIAGMPYPEAHEKLMRLNGVGPKVADCV 231
Query: 281 ALFSLDQHHAIPVDTHVWKI-ATRYL 305
LF +++ A PVD + +I T+YL
Sbjct: 232 LLFGFEKYEAFPVDVWIERILRTKYL 257
>gi|312793252|ref|YP_004026175.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor kristjanssonii 177R1B]
gi|312180392|gb|ADQ40562.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Caldicellulosiruptor kristjanssonii 177R1B]
Length = 284
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 123/278 (44%), Gaps = 30/278 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L ++ E++ TF +GQ FRWKK Y G + ++ L + Q+ + + SP E
Sbjct: 10 LRISGVEINFDATFFSGQCFRWKKVDG-GYIGVVNRKIV-LVYPQDNNTFDIYNCSPDE- 66
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
K + ++ + + S D + + G R+L Q+P EC++ F+ S NN
Sbjct: 67 -FKKFFYWYFDLDKDYDLILKELSEHDEILKKAVEKYRGMRLLNQEPFECMISFIISQNN 125
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
NI RI +V+ L +GF FP +E L VS EL+ G GYR
Sbjct: 126 NIKRIQILVERLCQAYGEKIEYKGFSSWTFPEIEDLKKVSIEELKCLGLGYR-------- 177
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A+YI + +K G SL L EA L T+ G+G KVA CI
Sbjct: 178 ---------AEYIKDAI----AKLEEGKIDFESLASLKSDEARKILKTVKGIGDKVADCI 224
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
L+SL ++ PVD VW + L E G+ +L
Sbjct: 225 LLYSLQKYDVFPVD--VW---VKRALREYYGIENIKQL 257
>gi|330794206|ref|XP_003285171.1| hypothetical protein DICPUDRAFT_148999 [Dictyostelium purpureum]
gi|325084892|gb|EGC38310.1| hypothetical protein DICPUDRAFT_148999 [Dictyostelium purpureum]
Length = 352
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 120/274 (43%), Gaps = 58/274 (21%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGP--------------LQYTGPIGPHLISLK 82
+W L + L L T +GQ+F W + L Y G I +++ L+
Sbjct: 10 EWECYQLNNNVLDLKKTLFSGQSFIWSEVKEKDIIEFKNESTSENLVYIGVINKYIVLLR 69
Query: 83 HLQ---NGDVCYHIHTS----------PSEPAAKSALLDFLNMGISLGEL---WEGFSAS 126
+ + N + + S SE L D+ N+ + EL W S
Sbjct: 70 YKEFSNNNIIEFKFKNSNNLVNKEIVNQSEQDRLEILKDYFNLKYDINELFSQWRADSVK 129
Query: 127 DCR--------FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGS 177
D + F E ++ G R+++Q+P++CL F+CS NNNI RI+KMV + + G+
Sbjct: 130 DLKQLHSLNSQFREASESFKGLRLIKQNPLDCLFSFICSQNNNIGRISKMVKSLITTYGT 189
Query: 178 HLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
+ + G +++FP+LE+LS+ E +L + GFGYRS +YI +
Sbjct: 190 KISDFNGQSYYQFPTLEQLSVAKEKDLNDMGFGYRS-----------------RYIVESC 232
Query: 238 DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
Q +GG EWL L E+ L +L G
Sbjct: 233 K--QVIANGGEEWLNGLVSKPHTESHKELISLMG 264
>gi|358338118|dbj|GAA56435.1| N-glycosylase/DNA lyase [Clonorchis sinensis]
Length = 382
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 124/269 (46%), Gaps = 48/269 (17%)
Query: 58 QTFRWKKTGPL-QYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA----------- 105
Q RW L ++ G + L LK + G Y+ H + AAK++
Sbjct: 7 QLSRWICNEKLDEWYGVVNTRLWRLKQPKAGGPVYYYHY---QTAAKNSDVINGIDQPED 63
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELA-----------KYLAGARVLRQDPVECLLQF 154
L+D+ + I L L + +SD FA + G R+LRQDPVE L F
Sbjct: 64 LVDYFRLSIDLPSLLVRWQSSDALFASYSTKSESVNSLSWDEAYGIRLLRQDPVETLFAF 123
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVE-LRNAGFGYRS 213
+ S+NNN+ RI++++ L + + G + FPSLE L+ E L+ GFGYRS
Sbjct: 124 ITSANNNVPRISRLLRDLCEMLGKPVSCHGLKHWLFPSLETLAKPGLEECLKKLGFGYRS 183
Query: 214 APQSSLLFSVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
K+I VLQ GG +L SLR + +A + L +PGV
Sbjct: 184 -----------------KFIPAAARYVLQH---GGVAYLHSLRSSSMADAREFLLRIPGV 223
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIA 301
G KVA CI L SLD+ +PVD H+ + A
Sbjct: 224 GNKVADCICLCSLDKVDVVPVDIHILRAA 252
>gi|340053284|emb|CCC47572.1| putative 8-oxoguanine DNA glycosylase [Trypanosoma vivax Y486]
Length = 478
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 138/350 (39%), Gaps = 90/350 (25%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN------------------- 86
S + LP+T GQ FRW++T Y G + L + +
Sbjct: 11 SAVHLPMTLCGGQCFRWRRTPRGTYVGVVRRIAYELANFSSCHTLPFQKDCHDSTPGFVV 70
Query: 87 ----------GDV--------CYHIHTSPSEPAAKSA--LLDFLNMGISLGELWEGFSAS 126
GDV C + S A L +L + + L ++WE ++A
Sbjct: 71 PAGVGSVKDEGDVPSGCLWFRCLNCELRDSSEVESQAVFLRHYLAVDVDLEKMWEQWTAE 130
Query: 127 DCRFAE-LAKYL---------AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SL 175
+ L +YL R LRQ+ E L FLCS NNN+ RIT +++ L+
Sbjct: 131 NPMCGHPLVRYLTLHSSLKLPVKIRHLRQELHETLFAFLCSQNNNVQRITSLIERLSLKY 190
Query: 176 GSHL-------GNVEGFE--------------------------FHEFPSLERLSLVSEV 202
G HL G+V F+ H P++E+LS +E
Sbjct: 191 GDHLCDYNLETGDVRCFDHVCKLHRKRAARCGQEAAEERNQWIALHVLPTVEQLSAATED 250
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAK-----YITGTVDVLQSKHSGGAEWLLSLR-- 255
ELR+ GFGYRS +R+S + + G +S W L
Sbjct: 251 ELRSLGFGYRSNYIIGCTNVIRQSGVVGREVLSEHCRGATKFSESTVVPSYRWYEELLSP 310
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
L+E L TLPGVG KVA C+ LF L H +PVDTH+ ++A YL
Sbjct: 311 NCTLEERRAKLLTLPGVGRKVADCVLLFGLGHHELVPVDTHMAQVAAEYL 360
>gi|260100098|pdb|3I0W|A Chain A, Crystal Structure Of Clostridium Acetobutylicum
8-Oxoguanine GlycosylaseLYASE IN COMPLEX WITH DSDNA
CONTAINING CYTOSINE Opposite To 8-Oxog
Length = 290
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 124/264 (46%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G ++ ++ + V Y+I+ E K+
Sbjct: 21 FELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNIN----EEEFKNVWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + S D + + G R+LRQDP E LL F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEIKKELS-RDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G ++ FP++++L +E + G+R
Sbjct: 136 CINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ TVD + + G L ++ L+ E + L GVGP+VA CI LFS+ +
Sbjct: 181 --AKYLKDTVDRIYN----GELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQK 234
Query: 288 HHAIPVDTHVWK-IATRYLLPELA 310
+ A PVDT V K + + Y+ P+++
Sbjct: 235 YSAFPVDTWVKKAMMSLYVAPDVS 258
>gi|260100101|pdb|3I0X|A Chain A, Crystal Structure Of Clostridium Acetobutylicum
8-Oxoguanine GlycosylaseLYASE IN COMPLEX WITH DSDNA
CONTAINING ADENINE Opposite To 8-Oxog
Length = 291
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 124/264 (46%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G ++ ++ + V Y+I+ E K+
Sbjct: 21 FELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNIN----EEEFKNVWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + S D + + G R+LRQDP E LL F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEIKKELS-RDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G ++ FP++++L +E + G+R
Sbjct: 136 CINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ TVD + + G L ++ L+ E + L GVGP+VA CI LFS+ +
Sbjct: 181 --AKYLKDTVDRIYN----GELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQK 234
Query: 288 HHAIPVDTHVWK-IATRYLLPELA 310
+ A PVDT V K + + Y+ P+++
Sbjct: 235 YSAFPVDTWVKKAMMSLYVAPDVS 258
>gi|226438257|pdb|3F10|A Chain A, Crystal Structure Of Clostridium Acetobutylicum
8-Oxoguanine Dna Glycosylase In Complex With
8-Oxoguanosine
Length = 292
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 124/264 (46%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G ++ ++ + V Y+I+ E K+
Sbjct: 21 FELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNIN----EEEFKNVWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + S D + + G R+LRQDP E LL F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEIKKELS-RDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G ++ FP++++L +E + G+R
Sbjct: 136 CINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ TVD + + G L ++ L+ E + L GVGP+VA CI LFS+ +
Sbjct: 181 --AKYLKDTVDRIYN----GELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQK 234
Query: 288 HHAIPVDTHVWK-IATRYLLPELA 310
+ A PVDT V K + + Y+ P+++
Sbjct: 235 YSAFPVDTWVKKAMMSLYVAPDVS 258
>gi|226438256|pdb|3F0Z|A Chain A, Crystal Structure Of Clostridium Acetobutylicum
8-Oxoguanine GlycosylaseLYASE IN ITS APO-Form
Length = 292
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 122/264 (46%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G ++ ++ + V Y+I+ E K+
Sbjct: 21 FELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNIN----EEEFKNVWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + S D + + G R+LRQDP E LL F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEIKKELS-RDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPXIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G ++ FP++++L +E + G+R
Sbjct: 136 CINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKY+ TVD + + G L ++ L+ E + L GVGPKVA CI LFS +
Sbjct: 181 --AKYLKDTVDRIYN----GELNLEYIKSLNDNECHEELKKFXGVGPKVADCIXLFSXQK 234
Query: 288 HHAIPVDTHVWK-IATRYLLPELA 310
+ A PVDT V K + Y+ P+++
Sbjct: 235 YSAFPVDTWVKKAXXSLYVAPDVS 258
>gi|302380379|ref|ZP_07268849.1| 8-oxoguanine DNA-glycosylase (ogg) [Finegoldia magna
ACS-171-V-Col3]
gi|302311869|gb|EFK93880.1| 8-oxoguanine DNA-glycosylase (ogg) [Finegoldia magna
ACS-171-V-Col3]
Length = 294
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 28/277 (10%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ QS + F GQ FRW K + YT G ++ + GD Y + + +
Sbjct: 16 IKQSSFNPTHIFECGQAFRWSKEDDMSYTTIDGDMYCNVSLV--GDYVYIKNCTEEDYYK 73
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K ++ ++ + ++ E S ++ +Y G R+L QD ++ F+ S+NN I
Sbjct: 74 KWE--NYFDLKTNYTKIKEKLSFNET-LKNALEYGDGIRILNQDKFSTIISFIISANNQI 130
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RI K V+ + + G LG+ G + + FP E L+ V E+R
Sbjct: 131 PRIMKSVNIICENYGKFLGDFNGRKLYSFPKPEDLAKVPVQEMREV-------------- 176
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEW-LLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R F+ + VDV SK E+ +L + KL +E L LPGVGPKVA CI
Sbjct: 177 -CRVGFRDKR----IVDV--SKMVANNEFDILEIDKLSNEELRKELIKLPGVGPKVADCI 229
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
LFS ++H+ PVD + ++ + E L K
Sbjct: 230 MLFSYNRHNTFPVDVWIKRVMEYLFIKEETNKNLIAK 266
>gi|407425226|gb|EKF39343.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma cruzi
marinkellei]
Length = 478
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 145/342 (42%), Gaps = 89/342 (26%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPI--GPHLIS------------------------L 81
++L +T GQ FRW++T + G + G + +S L
Sbjct: 43 VNLSMTLCGGQCFRWRRTPRGTWVGVVERGAYELSDAAHPPEFQAVHPRGEEKRGRRSSL 102
Query: 82 KHLQN---GDVCYH--IHTSPSEPAAKSA----LLDFLNMGISLGELWEGFSASD-CRFA 131
H + GDV + +H P + S L +L + + L +LW ++ + R
Sbjct: 103 SHSSDDPSGDVLWFRCLHHEPKDALEVSTEACFLRHYLALDVDLQKLWRRWTCDNPMRHH 162
Query: 132 ELAKYL---AGA------RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHL-- 179
L +YL AG R LRQ E LL FLCS NNN+ RIT +V+ LA S G HL
Sbjct: 163 PLVQYLTSNAGKGPSVNIRHLRQHIHETLLAFLCSQNNNVQRITGLVEKLATSYGDHLCD 222
Query: 180 ------------------------------GNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
G + H PS++ L+ SE E R GF
Sbjct: 223 YNLATGDVRNVSSLNHVSTRLTKNAKRADTGGGDWISLHTIPSMDELARRSEDEFRALGF 282
Query: 210 GYRSAP----QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR--KLDLQEAI 263
GYRS S + S K+ + +T +QS +W L +L L +
Sbjct: 283 GYRSKYIVQCASIIQASGATRRKKEEGVTCHGSSMQS-----YKWYDELLDPRLSLPDRR 337
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
L +LPGVG KVA CI LF++ H +PVDTH+ ++AT YL
Sbjct: 338 KKLLSLPGVGRKVADCILLFAVGHHEIVPVDTHMAQVATEYL 379
>gi|170573147|ref|XP_001892364.1| Helix-hairpin-helix motif family protein [Brugia malayi]
gi|158602160|gb|EDP38811.1| Helix-hairpin-helix motif family protein [Brugia malayi]
Length = 333
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 134/290 (46%), Gaps = 47/290 (16%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKK-------TGPLQ---YTGPIGPHLISLKHLQNGDVC 90
L ++ EL+L GQ+FRWKK P + G + + + +
Sbjct: 4 LKCSKEELNLGAVLLGGQSFRWKKLVTNDENIAPASDDIFFGVAKHRVWKIWRENDEQLG 63
Query: 91 YHIHTSPSEPAAKS--ALLDFLNMGISLGELWEGFSASDCRFAELAK----YLAGARVLR 144
Y + S+ + L D+ + I L L++ ++ +D FA L + L G RVL
Sbjct: 64 YEVLAKFSKARSNDLDVLKDYFQLDIELMPLYKLWAENDKYFAHLLENHRTKLEGIRVLG 123
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNV---EGFE-FHEFPSLERLSLVS 200
QDP+E + F+CS+NN+I RIT MV+ L L + G + F++F +R++
Sbjct: 124 QDPLETVFAFICSANNHIRRITNMVETLCELYGESAIIPCSNGMKTFYDFADPKRMTDDP 183
Query: 201 EVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK--HSGGAEWLLSLRK 256
E+E LR GFGYR+ + L SK G ++L +L K
Sbjct: 184 ELEKVLRIRGFGYRALNIA----------------------LASKTLEDNGEQFLENLSK 221
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI-ATRYL 305
++A++ L + G+G KVA CI L L H +P+DTH +I A YL
Sbjct: 222 GTYEDAMEELQQMRGIGAKVADCICLMGLRMHSVVPIDTHTLQITAENYL 271
>gi|331270382|ref|YP_004396874.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum
BKT015925]
gi|329126932|gb|AEB76877.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum
BKT015925]
Length = 292
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 121/265 (45%), Gaps = 29/265 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
LP F GQ FRW + Y G +I ++ + Y+ +E K
Sbjct: 21 FELPHIFECGQCFRWNRQENTNYIGVAFGKVIEVEKKGKDVIIYN----ATEEDFKKIWC 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + E S D + ++ +G R+L+Q+P E +L F+ S+NN I I +
Sbjct: 77 DYFDLYRDYSIIKENLS-EDELLKKSVEFGSGIRILKQEPFEIVLSFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G +++ FPS+E L + EL G G+R
Sbjct: 136 AIERISKRWGKKVQYKGKDYYTFPSVEMLKECTIEELEECGMGFR--------------- 180
Query: 228 KQAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
AKYI T+ D++ ++ L ++ L+ E L + GVGPKVA CI LFS+
Sbjct: 181 --AKYIKNTIKDIISNEFD-----LEYIKSLEDDECHKELQKITGVGPKVADCIMLFSMQ 233
Query: 287 QHHAIPVDTHVWKIATR-YLLPELA 310
++ A PVD V + YL P+++
Sbjct: 234 KYSAFPVDVWVKRAMQHFYLAPDVS 258
>gi|405118528|gb|AFR93302.1| purine-specific oxidized base lesion DNA N-glycosylase
[Cryptococcus neoformans var. grubii H99]
Length = 410
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 126/303 (41%), Gaps = 51/303 (16%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY--- 91
P+ W + + LSL T P GQ F W + PL T P H VC
Sbjct: 7 PAGWASVRMDPRNLSLANTLPVGQAFLWHRL-PLPATDPPFEEYSRAVHSPPRVVCLRQS 65
Query: 92 --HIHTSPSEPAA------------KSALLDFLNMGI--SLGELWEGFSASDCRF---AE 132
HI+ + P + L D+ + L L+ + D
Sbjct: 66 PTHIYYTAVYPPGSAPEPDRSNHFTRQWLEDYFQLVRYPDLETLYLDWRRRDPELFGKVH 125
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGS------HLGNVEGF 185
+ G RVLRQDP ECLL F+ S+NN+I RIT ++ F S H N
Sbjct: 126 VNDRATGVRVLRQDPWECLLAFITSTNNHIPRITSLLHKFSQSFTEPVLTLKHPSNGISI 185
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK-- 243
+H FP+ ++ E LR+ GFGYR A +I ++ +L++K
Sbjct: 186 PYHLFPAPHQIPTTLEKPLRDMGFGYR-----------------APFIEASLQMLRNKFG 228
Query: 244 -HSGGAE-WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
G E L+ R D+ + L L GVG KVA C+ L LD+ IP+DTHV IA
Sbjct: 229 DKEGDIEAGLVGWRNQDVDIVRENLIALKGVGRKVADCVMLMCLDKPSLIPIDTHVAHIA 288
Query: 302 TRY 304
R+
Sbjct: 289 ARH 291
>gi|402468083|gb|EJW03284.1| hypothetical protein EDEG_02350 [Edhazardia aedis USNM 41457]
Length = 526
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 117/262 (44%), Gaps = 54/262 (20%)
Query: 70 YTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGF---SAS 126
+ G I L KH G C I S P +++ ++ F + + ++++ F ++S
Sbjct: 276 FIGHIEGKLFVFKH---GISC--IFFISSHPNSQNTIMRFFTLDLDYSDIFKSFKPINSS 330
Query: 127 DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFE 186
+ F G R+LR D EC+L F+CSS NNI RI+KMV +L + ++E E
Sbjct: 331 NLVFN-------GLRMLRIDKYECILSFICSSMNNIKRISKMVLYLKHKTKNFTDIENIE 383
Query: 187 FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
+SE +L+ GFGYRS + + K ++I Q+ +
Sbjct: 384 ------------ISEEDLKEKGFGYRSK----FIIDAIKKLKNLRHIN------QNDKNT 421
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
++LL+++ GVG KV CI L +P+D H+++I +Y
Sbjct: 422 VQKFLLNIK---------------GVGKKVCDCILLMGFSFFDTVPIDVHIYRICIKYFS 466
Query: 307 PELAGVRLTPKLCSRVAEAFCE 328
E V+L K+ + + F E
Sbjct: 467 FE--NVKLNKKIYNEINTVFNE 486
>gi|303233727|ref|ZP_07320381.1| 8-oxoguanine DNA-glycosylase (ogg) [Finegoldia magna BVS033A4]
gi|302495161|gb|EFL54913.1| 8-oxoguanine DNA-glycosylase (ogg) [Finegoldia magna BVS033A4]
Length = 294
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 28/277 (10%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ QS + F GQ FRW K + YT G ++ + GD Y + + +
Sbjct: 16 IKQSSFNPTHIFECGQAFRWYKEDDMSYTTIDGDMYCNVSLV--GDYVYIKNCTEEDYYK 73
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K ++ ++ + ++ E S ++ +Y G R+L QD ++ F+ S+NN I
Sbjct: 74 KWE--NYFDLKTNYTKIKEKLSFNET-LKNALEYGDGIRILNQDKFSTIISFIISANNQI 130
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RI K V+ + + G LG+ G + + FP E L+ V E+R
Sbjct: 131 PRIMKSVNIICENYGKFLGDFNGRKLYSFPKPEDLAKVPVEEMREV-------------- 176
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEW-LLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R F+ + VDV SK E+ +L + KL +E L LPGVGPKVA CI
Sbjct: 177 -CRVGFRDKR----IVDV--SKMVANNEFDILEIDKLSNEELRKELIKLPGVGPKVADCI 229
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
LFS ++H+ PVD + ++ + E L K
Sbjct: 230 MLFSYNRHNTFPVDVWIKRVMEYLFIKEETNKNLIAK 266
>gi|169824585|ref|YP_001692196.1| putative 8-oxoguanine DNA glycosylase [Finegoldia magna ATCC 29328]
gi|167831390|dbj|BAG08306.1| putative 8-oxoguanine DNA glycosylase [Finegoldia magna ATCC 29328]
Length = 294
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 28/277 (10%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ QS + F GQ FRW K + YT G ++ + GD Y + + +
Sbjct: 16 IKQSSFNPTHIFECGQAFRWYKEDDMSYTTIDGDMYCNVSLV--GDYVYIKNCTEEDYYK 73
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K ++ ++ + ++ E S ++ +Y G R+L QD ++ F+ S+NN I
Sbjct: 74 KWE--NYFDLKTNYTKIKEKLSFNET-LKNALEYGDGIRILNQDKFSTIISFIISANNQI 130
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RI K V+ + + G LG+ G + + FP E L+ V E+R
Sbjct: 131 PRIMKSVNIICENYGKFLGDFNGRKLYSFPKPEDLAKVPVQEMREV-------------- 176
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEW-LLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R F+ + VDV SK E+ +L + KL +E L LPGVGPKVA CI
Sbjct: 177 -CRVGFRDKR----IVDV--SKMVANNEFDILEIDKLSNEELRKELIKLPGVGPKVADCI 229
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
LFS ++H+ PVD + ++ + E L K
Sbjct: 230 MLFSYNRHNTFPVDVWIKRVMEYLFIKEETNKNLIAK 266
>gi|417926059|ref|ZP_12569470.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein [Finegoldia
magna SY403409CC001050417]
gi|341590280|gb|EGS33524.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein [Finegoldia
magna SY403409CC001050417]
Length = 294
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 28/277 (10%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ QS + F GQ FRW K + YT G ++ + GD Y + + +
Sbjct: 16 IKQSSFNPTHIFECGQAFRWYKEDDMSYTTIDGDMYCNVSLV--GDYVYIKNCTEEDYYK 73
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K ++ ++ + ++ E S ++ +Y G R+L QD ++ F+ S+NN I
Sbjct: 74 KWE--NYFDLKTNYTKIKEKLSFNET-LKNALEYGDGIRILNQDKFSTIISFIISANNQI 130
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RI K V+ + + G LG+ G + + FP E L+ V E+R
Sbjct: 131 PRIMKSVNIICENYGKFLGDFNGRKLYSFPKPEDLAKVPVEEMREV-------------- 176
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEW-LLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R F+ + VDV SK E+ +L + KL +E L LPGVGPKVA CI
Sbjct: 177 -CRVGFRDKR----IVDV--SKMVANNEFDILEIDKLSNEELRKELIKLPGVGPKVADCI 229
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
LFS ++H+ PVD + ++ + E L K
Sbjct: 230 MLFSYNRHNTFPVDVWIKRVMEYLFIKEDTNKNLIAK 266
>gi|321251796|ref|XP_003192181.1| purine-specific oxidized base lesion DNA N-glycosylase
[Cryptococcus gattii WM276]
gi|317458649|gb|ADV20394.1| Purine-specific oxidized base lesion DNA N-glycosylase, putative
[Cryptococcus gattii WM276]
Length = 410
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 124/299 (41%), Gaps = 43/299 (14%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY--- 91
P+ W + + LSL T P GQ F W + PL T P H VC
Sbjct: 7 PAGWASVRMDPRNLSLANTLPVGQAFLWHRL-PLPATAPPFEEYSRAIHSPPRVVCLRQS 65
Query: 92 --HIHTSPSEPAA------------KSALLDFLNMGI--SLGELWEGFSASDCRF---AE 132
HI+ + P + L D+ + L ++ + D
Sbjct: 66 PTHIYYTAVYPPGSVPEPDSSFHLTRQWLEDYFQLVKYPDLETMYLDWRRRDPELFGKVH 125
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGS------HLGNVEGF 185
+ G RVLRQDP ECLL F+ S+NN+I RIT ++ F S H N
Sbjct: 126 VNDRATGVRVLRQDPWECLLAFITSTNNHIPRITSLLHKFSQSFTKPVLTLKHPSNGNLI 185
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
+H FP+ ++ E LR+ GFGYR+ + L +R F + G ++
Sbjct: 186 PYHLFPAPHQIPTRLEKSLRDLGFGYRAPFIEASLLLLRNKFGDKE---GNIEA------ 236
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L+ R D+ + L L GVG KVA C+ L LD+ IP+DTHV IA R+
Sbjct: 237 ----GLMGWRDEDVDIVRENLIALKGVGRKVADCVMLMCLDKPSLIPIDTHVANIAARH 291
>gi|304317561|ref|YP_003852706.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302779063|gb|ADL69622.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 303
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 121/256 (47%), Gaps = 28/256 (10%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L T GQ FRW + YTG +I++K +GD+ +T+ ++
Sbjct: 24 DFNLKETLECGQCFRWNEEDDGSYTGVAFDRVINVK--LDGDILTIDNTTLAD--FNDIW 79
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D+ ++G G++ E S + A + KY G R+LRQD E L+ F+ S NN I +I
Sbjct: 80 YDYFDLGRDYGKIKEALSQDEILRAAI-KYGEGIRILRQDTWETLISFIISQNNRIPQIK 138
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K+++ L+ H + ++ FP ++ + + E L N+ G+RS
Sbjct: 139 KVIENLSRAFGHPIVYKNKTYYTFPKVQDIIMADEESLNNSKCGFRS------------- 185
Query: 227 FKQAKYIT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
KYI + V + + L L+ D E + L ++ GVGPKVA C+ L+S+
Sbjct: 186 ----KYIIDAALKVFNDEIN-----LFELQLYDTHEVRNILMSIRGVGPKVADCVILYSI 236
Query: 286 DQHHAIPVDTHVWKIA 301
++ A P D + ++
Sbjct: 237 GRYEAFPTDVWIKRVV 252
>gi|300808473|gb|ADK35367.1| 8-oxo guanine DNA glycosylase-like protein [Camelus dromedarius]
Length = 188
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 94/176 (53%), Gaps = 7/176 (3%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI-----HTSPSE 99
+SEL L L +GQ+F W + P + +G + + +L + C +
Sbjct: 4 RSELRLDLILASGQSFPWMEQSPARGSGVLADQVWTLTQTEEQLYCTVYRGDKGQVGKAT 63
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
P A+ + + +SL +L+ +S+ D F E+A+ G +L+QDP+ECL F+CSS+
Sbjct: 64 PEELKAMRQYFQLIVSLAQLYHHWSSMDPHFQEVAQKFQGVGLLQQDPIECLFSFICSSH 123
Query: 160 NNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS-LVSEVELRNAGFGYRS 213
NNIARI MV+ L + G L ++ ++ FP L+ L+ L E +LR G GYR+
Sbjct: 124 NNIARIMGMVEQLCQAFGPRLIQLDDVTYYGFPRLQVLAGLEVEAQLRKLGLGYRA 179
>gi|253680924|ref|ZP_04861727.1| 8-oxoguanine-DNA-glycosylase [Clostridium botulinum D str. 1873]
gi|253562773|gb|EES92219.1| 8-oxoguanine-DNA-glycosylase [Clostridium botulinum D str. 1873]
Length = 292
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
LP F GQ FRW + Y G ++ ++ +N + Y+ +E +
Sbjct: 21 FELPHIFDCGQCFRWNRQENGNYIGVALGKVLEIEKKENDVIIYN----ATEEEFEKIWC 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + E F+ D + ++ G R+L+Q+P E ++ F+ S+NN I I +
Sbjct: 77 DYFDLYRDYSTIKEIFN-KDELLKKSVEFGKGIRILKQEPFEIVVSFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G +++ FPS E L ++ EL G G+R
Sbjct: 136 AIEKISKRWGKKVQYKGKDYYTFPSAEILKDCTQEELEECGVGFR--------------- 180
Query: 228 KQAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
AKYI T+ D++ + + L ++ LD E L + GVGPKVA CI LFS+
Sbjct: 181 --AKYIKNTIEDIIYNGLN-----LDYIKSLDDDECHKELQKISGVGPKVADCIMLFSMQ 233
Query: 287 QHHAIPVDTHVWKIATR-YLLPELA 310
++ A PVD V + YL P+++
Sbjct: 234 KYSAFPVDVWVKRAMQHFYLAPDVS 258
>gi|88602311|ref|YP_502489.1| 8-oxoguanine DNA glycosylase [Methanospirillum hungatei JF-1]
gi|88187773|gb|ABD40770.1| 8-oxoguanine DNA glycosylase-like protein [Methanospirillum
hungatei JF-1]
Length = 299
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 132/271 (48%), Gaps = 45/271 (16%)
Query: 41 LNLTQSEL--SLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPS 98
+LT+S+L L L+ GQ F WKKT + G +++++ Q G + +
Sbjct: 4 FSLTESQLPFDLDLSLSCGQIFGWKKTRD-TWKGVHNGSIVTIR--QKGILIEY------ 54
Query: 99 EPAAKSALLDFLNMGISLGELWE-------GFSASDCRFAELAKYLA--GARVLRQDPVE 149
+ ++ ++ FL + + ++ E + S RF E +Y G R+LRQ P E
Sbjct: 55 DGLSQQDVIRFLGLSDPISDIIESIREHIIAYKGSPDRFFE-TRYSQSRGLRILRQHPWE 113
Query: 150 CLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGF 209
CL+ F+CS+N+N++ I K ++ + LG + + FE + FP LS E ELR
Sbjct: 114 CLVSFICSANSNVSTIGKRINLI--LGRYGTSCTLFE-NTFPDPAVLSQCQEPELRECLT 170
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AP K A+YI D +RK++ +A D L L
Sbjct: 171 GYR-APY---------LIKTAQYIHEHPDFFH-----------QVRKMEYHQAKDTLMNL 209
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
PGVGPKVA C+ LF+ + +A+PVD + KI
Sbjct: 210 PGVGPKVADCVLLFAFEHLNAVPVDIRIRKI 240
>gi|340345542|ref|ZP_08668674.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum koreensis MY1]
gi|339520683|gb|EGP94406.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum koreensis MY1]
Length = 280
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/268 (26%), Positives = 125/268 (46%), Gaps = 37/268 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ S +++ T +GQ F WKK Y NG I+ S + +
Sbjct: 2 IDYSTINIENTINSGQVFLWKKQDDFWYG-------------VNGQDILKINNSGNVISY 48
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+ DF ++ ++ + S + KYL G R++RQDP +C + F+ SSN+NI
Sbjct: 49 SNKKYDFFRSNDNIEKIIKSISKDKTTKIAVKKYL-GLRLIRQDPFQCFISFIVSSNSNI 107
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
+I ++ ++ G+ + + EF+ FP +++ S E+++ G GYR
Sbjct: 108 QKIKSSLEKISIKFGTKI-KFDNQEFYGFPEPIKIANASIQEIQSCGVGYR--------- 157
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
AK+I +++S L+KL+ +A + + T+ G+G KVA C+
Sbjct: 158 --------AKFIIDAAKMVESNEID----FEHLKKLNYHDAKETILTVFGIGNKVADCVL 205
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLPEL 309
LFSL++ A P+DT + +I +Y L +
Sbjct: 206 LFSLEKTEAFPLDTWMIRILKKYYLEKF 233
>gi|210620562|ref|ZP_03292110.1| hypothetical protein CLOHIR_00053 [Clostridium hiranonis DSM 13275]
gi|210155276|gb|EEA86282.1| hypothetical protein CLOHIR_00053 [Clostridium hiranonis DSM 13275]
Length = 291
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 116/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW+ YTG +I++K N DV I + ++ ++ D+ +M
Sbjct: 24 FECGQCFRWRAEEDGSYTGVAMGKVINVKREGN-DV---IIDNTTKEDYENIWRDYFDMD 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G++ D + + G R+LRQD E L+ F+ SSNN I I + ++ L+
Sbjct: 80 TDYGKIKSTLKEFDEYLDKAVDFGWGIRILRQDSWEMLISFIISSNNRIPMIQRAIENLS 139
Query: 174 S-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G +G G +++ FP+ E L+ ++ E+R G+R KY
Sbjct: 140 ERYGKLIGEYRGKKYYAFPTPEELNKATQEEIRECKTGFRD-----------------KY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T +S G + + + RKL ++ L GVGPKV CIALF + + P
Sbjct: 183 IKSTC---KSVVELGLD-VKNFRKLSTEDCCKELKKFNGVGPKVCDCIALFGMQKIDTFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + + E
Sbjct: 239 VDVWVKRVMQEFYIEE 254
>gi|268608661|ref|ZP_06142388.1| 8-oxoguanine DNA glycosylase-like protein [Ruminococcus
flavefaciens FD-1]
Length = 269
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 77/265 (29%), Positives = 117/265 (44%), Gaps = 38/265 (14%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTG---PLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
+++S+L L T GQ FRWKK Y G + + + G+ +H E
Sbjct: 12 VSESDLDLDETLDCGQAFRWKKIDSDYDCTYEGYFINDYLRVSQTKKGEFIFH---DTDE 68
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
+ LD+ + EL FS D ++ K+ G R+L+Q+ ECL+ F+ S N
Sbjct: 69 NTFVNKWLDYFDFSTDYSELKRQFS-EDETLSKACKFAGGIRLLKQNSWECLISFIISQN 127
Query: 160 NNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
NNI RI ++D L GN G EFP+ +L+ P S
Sbjct: 128 NNIPRIKGIIDRLC------GNYGG----EFPAPHQLA--------------EETPDS-- 161
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
S RS +AKYI + + G L ++ K+ + EA L + GVGPKVA C
Sbjct: 162 -LSYLRSGFRAKYIADAA----HRTAEGTTDLAAIAKMPIDEARTELKKILGVGPKVAEC 216
Query: 280 IALFSLDQHHAIPVDTHVWKIATRY 304
+ LF + + A P+D + ++ Y
Sbjct: 217 VLLFGMYRTEAFPIDVWIKRVLAEY 241
>gi|332798680|ref|YP_004460179.1| DNA-(apurinic or apyrimidinic site) lyase [Tepidanaerobacter
acetatoxydans Re1]
gi|332696415|gb|AEE90872.1| DNA-(apurinic or apyrimidinic site) lyase [Tepidanaerobacter
acetatoxydans Re1]
Length = 285
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 115/256 (44%), Gaps = 32/256 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L + GQ FRW ++ +Y G + ++ + N ++ + E A
Sbjct: 11 FNLEIIADGGQAFRWNRSEDGKYIGVVEDYIFEVVQ-ANDELIIDSNLEGKEEAFAKEYF 69
Query: 108 DFLNMGISLGELWEGFSASDCRFAELA---KYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
D L +E F E+A Y +G R+L QDP E + F+ S+NN+I
Sbjct: 70 D-------LPRDYEQIEKELMCFKEVAAAVNYCSGYRILFQDPWETTISFIISANNHIRN 122
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I ++ + + G ++ FPS +RL+ +SE EL+ GYR
Sbjct: 123 IKSTIENMCKIYGEPLEYRGKTYYSFPSPDRLASLSEDELKKTKCGYR------------ 170
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AKYI T ++ + G L LR+L EA + L TLPGVG KVA CI L+S
Sbjct: 171 -----AKYIIETARMI----ADGKVDLYGLRELPTDEAREVLLTLPGVGRKVADCIMLYS 221
Query: 285 LDQHHAIPVDTHVWKI 300
+ + A P+D + ++
Sbjct: 222 MRKFDAFPIDVWIKRV 237
>gi|307354718|ref|YP_003895769.1| DNA-(apurinic or apyrimidinic site) lyase [Methanoplanus
petrolearius DSM 11571]
gi|307157951|gb|ADN37331.1| DNA-(apurinic or apyrimidinic site) lyase [Methanoplanus
petrolearius DSM 11571]
Length = 280
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 78/254 (30%), Positives = 112/254 (44%), Gaps = 33/254 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L +T GQ FRW+K G L +TG +IS++ Q+G I S L
Sbjct: 13 FNLDITLSCGQVFRWEKRGDL-WTGVSAGRVISIR--QSGR---KIEYS----GCDEEYL 62
Query: 108 DFL-NMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
++L ++ L + FS +D E + G RV+RQDP ECLL +LC+ N I I
Sbjct: 63 EYLFHLDFDLESVIGSFS-TDEYIGEAVERYRGLRVMRQDPWECLLSYLCAQNTGIPNIK 121
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+M+ +A + G E + FP E LS + +L+ GYRS
Sbjct: 122 RMLGNMAVMNGKKIVQGGQEAYAFPGAEDLSRRCDADLKGCSTGYRSG------------ 169
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
YI T ++ W +R LD A + PGVG KVA CI LF
Sbjct: 170 -----YICRTSSMIADDPG----WADRIRSLDYTGARKEIMKYPGVGRKVADCILLFGFQ 220
Query: 287 QHHAIPVDTHVWKI 300
+ + PVD + +I
Sbjct: 221 FYESFPVDVWIRRI 234
>gi|438001671|ref|YP_007271414.1| 8-oxoguanine-DNA-glycosylase [Tepidanaerobacter acetatoxydans Re1]
gi|432178465|emb|CCP25438.1| 8-oxoguanine-DNA-glycosylase [Tepidanaerobacter acetatoxydans Re1]
Length = 285
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 115/256 (44%), Gaps = 32/256 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L + GQ FRW ++ +Y G + ++ + N ++ + E A
Sbjct: 11 FNLEIIADGGQAFRWNRSEDGKYIGVVEDYIFEVVQ-ANDELIIDSNLEGKEEAFAKEYF 69
Query: 108 DFLNMGISLGELWEGFSASDCRFAELA---KYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
D L +E F E+A Y +G R+L QDP E + F+ S+NN+I
Sbjct: 70 D-------LPRDYEQIEKELMCFKEVAAAVNYCSGYRILFQDPWETTISFIISANNHIRN 122
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I ++ + + G ++ FPS +RL+ +SE EL+ GYR
Sbjct: 123 IKSTIENMCKIYGEPLEYRGKTYYSFPSPDRLASLSEDELKKDKCGYR------------ 170
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AKYI T ++ + G L LR+L EA + L TLPGVG KVA CI L+S
Sbjct: 171 -----AKYIIETARMI----ADGKVDLYGLRELPTDEAREVLLTLPGVGRKVADCIMLYS 221
Query: 285 LDQHHAIPVDTHVWKI 300
+ + A P+D + ++
Sbjct: 222 MRKFDAFPIDVWIKRV 237
>gi|403387381|ref|ZP_10929438.1| 8-oxoguanine DNA glycosylase [Clostridium sp. JC122]
Length = 291
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 120/273 (43%), Gaps = 45/273 (16%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N + IH S L
Sbjct: 21 FELKDIFDCGQCFRWNKTKDNTYIGVAYNKVIEIEKEGNNVI---IHNSN--------LK 69
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAK---------YLAGARVLRQDPVECLLQFLCSS 158
DF + +L +S EL K + G R+L+Q+P E L+ F+ S+
Sbjct: 70 DFNEIWCEYFDLKRNYSKIK---EELKKDPILDTAVAFGEGIRILKQEPFEILISFIVSA 126
Query: 159 NNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSS 218
NN I I + ++ ++ + +G +++ FPS++RL S+ E+ G G+R
Sbjct: 127 NNRIPMIKRAINNISKKYGNELEYKGEKYYSFPSVDRLIEASQEEIEEMGVGFR------ 180
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
AKYI +VD K L ++ +D E + L GVGPKVA
Sbjct: 181 -----------AKYIVDSVD----KVYNNVYNLEEIKSMDDDECHEGLKQFNGVGPKVAD 225
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR-YLLPELA 310
CI LFS+ ++ + PVD V + Y+ P+++
Sbjct: 226 CIMLFSMQKYSSFPVDVWVKRAMQHFYVAPDVS 258
>gi|150388666|ref|YP_001318715.1| 8-oxoguanine DNA glycosylase [Alkaliphilus metalliredigens QYMF]
gi|149948528|gb|ABR47056.1| 8-oxoguanine DNA glycosylase domain protein [Alkaliphilus
metalliredigens QYMF]
Length = 296
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 114/249 (45%), Gaps = 26/249 (10%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
TF GQ FRW++ YTG + ++++K + N + Y + E + +++ ++
Sbjct: 25 TFECGQCFRWEREEDGSYTGVVYDKVLNVKKIGNDVILYPTNQLDFE----NIWIEYFDL 80
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
+ E A D + + G R+LRQDP E ++ F+ S+NNNI RI + +D +
Sbjct: 81 HTDYQMIQEHLQAIDPVMKKAIGFGRGIRILRQDPWETIISFIISANNNIPRIKRAIDLM 140
Query: 173 A-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
+ G + + G + FP LS + EL GYR A
Sbjct: 141 SRGYGQPVEDFRGGANYTFPDAATLSKRTVEELLACNTGYR-----------------AP 183
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
YI T Q + E+ +L+KLD + L T G+GPKVA C+ FS+ + A
Sbjct: 184 YILKTA---QQVSTANIEFQ-NLKKLDRESCQRQLMTFNGIGPKVANCVLFFSMGKFDAF 239
Query: 292 PVDTHVWKI 300
PVD V ++
Sbjct: 240 PVDVWVKRV 248
>gi|374298152|ref|YP_005048343.1| 3-methyladenine DNA glycosylase [Clostridium clariflavum DSM 19732]
gi|359827646|gb|AEV70419.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium clariflavum DSM 19732]
Length = 308
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 120/261 (45%), Gaps = 30/261 (11%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
TF GQ FRW + YTG +I++ + +G + + + + K+ D+ ++
Sbjct: 42 TFECGQCFRWIRQEDGSYTGVAKGKVINV-SIDDGVL---VLKNTNLEDFKNIWFDYFDL 97
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G G++ E D + + G R+L+QD E L+ F+ S+NN I RI K V+ +
Sbjct: 98 GRDYGQVKEAVMKDDI-MQKAIDFGWGIRLLKQDIWETLISFIISANNRIPRIMKTVEII 156
Query: 173 ASLGSHLGNVEGFEFHEFPSLERL--SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
A L ++G +++ FP + +L S V E+E+ GF + SS + SV
Sbjct: 157 ARLYGDELMMDGNKYYSFPDVNKLAQSNVEELEVCKGGFRCKYILNSSQMVSV------- 209
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
G VD+ Q L ++D A + L PGVGPKVA C+ L+S ++
Sbjct: 210 ----GQVDLKQ------------LSQMDTDSAREELMKFPGVGPKVADCVLLYSGTKYDV 253
Query: 291 IPVDTHVWKIATRYLLPELAG 311
P D V ++ AG
Sbjct: 254 FPTDVWVRRVMGELYFKREAG 274
>gi|326389681|ref|ZP_08211247.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter ethanolicus JW 200]
gi|345017034|ref|YP_004819387.1| 8-oxoguanine DNA glycosylase [Thermoanaerobacter wiegelii Rt8.B1]
gi|392940354|ref|ZP_10305998.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Thermoanaerobacter siderophilus SR4]
gi|325994396|gb|EGD52822.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter ethanolicus JW 200]
gi|344032377|gb|AEM78103.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter wiegelii Rt8.B1]
gi|392292104|gb|EIW00548.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Thermoanaerobacter siderophilus SR4]
Length = 297
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 125/273 (45%), Gaps = 27/273 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L TF GQ FRW + YTG +I++K GDV +T ++
Sbjct: 18 EDFNLKETFECGQCFRWNEEEDGSYTGVAYDRVINVKL--EGDVLIIDNTDLNDFY--DI 73
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G ++ E S D E +Y G R+LRQD E L+ F+ S NN I +I
Sbjct: 74 WFDYFDLGRDYKQIKENLS-KDPILKEAIQYGQGIRILRQDTWETLISFIISQNNRIPQI 132
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ LAS + +G ++ FP E L + F + Q+ F
Sbjct: 133 KKVIENLASSFGNPIEYKGRVYYTFPKAEELVM----------FDVETIAQTKCGF---- 178
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
+AKYI +D SG + LL L + E D L ++ GVGPKVA C+ L+S+
Sbjct: 179 ---RAKYI---LDAASKVFSGEID-LLKLFEYSTNEIRDILMSINGVGPKVADCVILYSI 231
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
++ P D + +I YL + G L +L
Sbjct: 232 GRYDTFPTDVWIKRIV-EYLYLKREGTPLEIQL 263
>gi|384488198|gb|EIE80378.1| hypothetical protein RO3G_05083 [Rhizopus delemar RA 99-880]
Length = 144
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 76/134 (56%), Gaps = 21/134 (15%)
Query: 173 ASLGSHLGNVEGFEFHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQA 230
G + +E ++++FP+L+RL + +VE LR GFGYR A
Sbjct: 7 VEFGEKIARLEEEDYYDFPTLDRL-VGDDVEKRLRELGFGYR-----------------A 48
Query: 231 KYITGTVDVLQSKHSG-GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
KYI T L + H G +WL SLRK+D +EA + L GVGPKVA C+ L SLD
Sbjct: 49 KYIANTAKKLNADHPDLGEKWLHSLRKVDYKEAKEELMKFQGVGPKVADCVCLMSLDHPE 108
Query: 290 AIPVDTHVWKIATR 303
+IPVDTHVW+IA+R
Sbjct: 109 SIPVDTHVWQIASR 122
>gi|20807070|ref|NP_622241.1| 3-methyladenine DNA glycosylase [Thermoanaerobacter tengcongensis
MB4]
gi|20515559|gb|AAM23845.1| 3-Methyladenine DNA glycosylase [Thermoanaerobacter tengcongensis
MB4]
Length = 297
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 117/256 (45%), Gaps = 26/256 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
++ +L TF +GQ FRW + YTG ++++K GD +T+ ++
Sbjct: 18 ADFNLKETFESGQCFRWNEEEDGSYTGVAYDRVVNVK--LEGDTLIIDNTNLTDFY--DI 73
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G G++ E S D E K+ G R+LRQD E L+ F+ S NN I +I
Sbjct: 74 WFDYFDLGRDYGQIKESLS-KDPVLKEAIKFGQGIRILRQDTWETLVSFIVSQNNRIPQI 132
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ LA+ + +G ++ FP E L + + G+R
Sbjct: 133 KKVIENLATSFGNPIEYKGKIYYTFPKPEELVMYDVETIAKTRCGFR------------- 179
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI SK G LL L + E D L T+ GVGPKVA C+ L+S+
Sbjct: 180 ----AKYIFDAA----SKVFSGEINLLKLHEYSTSEIRDILMTINGVGPKVADCVILYSI 231
Query: 286 DQHHAIPVDTHVWKIA 301
++ P D + +I
Sbjct: 232 GRYDTFPTDVWIKRIV 247
>gi|58263208|ref|XP_569014.1| purine-specific oxidized base lesion DNA N-glycosylase
[Cryptococcus neoformans var. neoformans JEC21]
gi|134108192|ref|XP_777047.1| hypothetical protein CNBB2790 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259732|gb|EAL22400.1| hypothetical protein CNBB2790 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223664|gb|AAW41707.1| purine-specific oxidized base lesion DNA N-glycosylase, putative
[Cryptococcus neoformans var. neoformans JEC21]
Length = 410
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 133/305 (43%), Gaps = 55/305 (18%)
Query: 35 PSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGP----------IGPHLISLKHL 84
P+ W + + LSL T P GQ F W + PL + P P ++ L+
Sbjct: 7 PAGWASVRMDPRNLSLANTLPVGQAFLWHRL-PLPASDPPFEEYSRAVHSPPRVVCLR-- 63
Query: 85 QNGDVCYHIHTSPS----EP-----AAKSALLDFLNMGI--SLGELWEGFSASDCRF--- 130
Q+ Y+ PS EP + + L D+ + L L+ + D
Sbjct: 64 QSPTHIYYTAVYPSGSAPEPERSNLSTRQWLEDYFQLVRYPDLEALYLDWRRRDPELFGK 123
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGS------HLGNVE 183
+ G RVLRQDP ECLL F+ S+NN+I RIT ++ F S H N
Sbjct: 124 VHVNDRATGIRVLRQDPWECLLAFITSTNNHIPRITSLLHKFSQSFTKPVLTLKHPSNGI 183
Query: 184 GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK 243
+H FP+ ++ E LR+ GFGYR A +I ++ +L++K
Sbjct: 184 LIPYHLFPAPHQIPTRLEKPLRDMGFGYR-----------------APFIEASLQLLRNK 226
Query: 244 ---HSGGAE-WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
G E L++ R D+ + L L GVG KVA C+ L LD+ IP+DTH+
Sbjct: 227 FGDKEGDIEAGLVAWRNEDVDIVRENLIALKGVGRKVADCVMLMCLDKPSLIPIDTHIAH 286
Query: 300 IATRY 304
IA R+
Sbjct: 287 IAARH 291
>gi|399924709|ref|ZP_10782067.1| 8-oxoguanine DNA glycosylase [Peptoniphilus rhinitidis 1-13]
Length = 291
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 131/278 (47%), Gaps = 37/278 (13%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW + +T + ++K + NG++ + + + ++ ++
Sbjct: 24 FTCGQAFRWFEESDRSFTFVTNGKVANVKKI-NGEI---VLRGVDKASFYQIFYNYFDLD 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E+ + S + +Y G R+L Q+ E ++ F+ S+NN I RI K ++ ++
Sbjct: 80 RNYKEIIDELSRDEI-MKNATEYGKGIRILNQEKFETIISFIISANNQIPRIKKSIEKIS 138
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAK 231
+ G + E ++ FPS +RL +LR A G+R +R + AK
Sbjct: 139 QMYGDFISEDENRKYFSFPSADRLKEAKAEDLREFARVGFRD----------KRIVETAK 188
Query: 232 YI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
I +G V++ + + K++L++A AL LPGVGPKVA CI LF+ D+ +
Sbjct: 189 LIDSGEVNISE------------IEKMELEDARAALQILPGVGPKVADCILLFAFDRKES 236
Query: 291 IPVDTHVWKIATRYLLPELAGVRLTP--KLCSRVAEAF 326
PVD + ++ + L E +TP K+ R E F
Sbjct: 237 FPVDVWIKRVMEKLYLKE-----VTPKSKIAMRGREVF 269
>gi|397906423|ref|ZP_10507225.1| 8-oxoguanine-DNA-glycosylase [Caloramator australicus RC3]
gi|397160532|emb|CCJ34562.1| 8-oxoguanine-DNA-glycosylase [Caloramator australicus RC3]
Length = 285
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 119/263 (45%), Gaps = 29/263 (11%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRW K Y G + ++ + ++GDV +I E + + ++
Sbjct: 23 IFECGQCFRWNKYEN-GYIGVVSGR-VAFVYKKDGDV--YIEGGKKED--EDFWRKYFDL 76
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G++ E + E ++ G R+L+Q+P E L+ F+ S+NN I I K VD +
Sbjct: 77 DRDYGKIKERLREDEV-LKEAVEHGYGIRLLKQEPFETLISFIISANNRIPMIKKAVDNI 135
Query: 173 ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ + EF+ FPS+E L S EL G G+R+ Y
Sbjct: 136 SKRFGKKIIFKDMEFYSFPSIEELKEASIDELEECGTGFRAV-----------------Y 178
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I V + ++ L + LD A + L GVGPKVA C+ LFS+ ++ A P
Sbjct: 179 IYEAVKRIYEENIS----LDEIMTLDTDAAQEKLLQFKGVGPKVADCVLLFSMQKYDAFP 234
Query: 293 VDTHVWKIATR-YLLPELAGVRL 314
VD V +I R YL P+++ R+
Sbjct: 235 VDVWVKRIMQRFYLAPDVSLKRI 257
>gi|187776780|ref|ZP_02993253.1| hypothetical protein CLOSPO_00296 [Clostridium sporogenes ATCC
15579]
gi|187775439|gb|EDU39241.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium sporogenes ATCC
15579]
Length = 305
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 126/270 (46%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYREEKGSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ S D A+ ++ G R+L+QDP E ++ F+ S+NN I I +
Sbjct: 77 EYFDLYRDYGEIKHILSKDDL-LAKSVEFGHGIRLLKQDPFEMIVSFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ + +G ++ FP++E+L +E EL+ G G+R
Sbjct: 136 AIKNISEKWGYPIEYKGSIYYSFPTVEQLKDATEEELKACGVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + ++ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYENSVEQSEQYKKEYDMLWIKNQQDDICHKMLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|398022382|ref|XP_003864353.1| 8-oxoguanine DNA glycosylase, putative [Leishmania donovani]
gi|322502588|emb|CBZ37671.1| 8-oxoguanine DNA glycosylase, putative [Leishmania donovani]
Length = 565
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 122/297 (41%), Gaps = 97/297 (32%)
Query: 106 LLDFLNMGISLGELWEGFSAS-DCRFAELAKYLAGARV---------------------- 142
L +L++ + L +LW+ ++ S + R L +YL G R+
Sbjct: 157 LSRYLSLDVDLDQLWQEWTDSPETRKHPLVEYLVGNRLHREFLRSGHGQRVCEHDEAQAN 216
Query: 143 -------LRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNV------------ 182
+RQD CL FLCS NNN+ RIT M+ L+ + G HL +V
Sbjct: 217 LYIPIRHVRQDLHSCLFSFLCSQNNNVTRITGMIYALSRTYGDHLCDVQLATGEVQEPRK 276
Query: 183 -----------------------------------------EGFEFHEFPSLERLSLVSE 201
E + FPSLE+L+ +E
Sbjct: 277 SAADTRSKVPQTQFQSDAAATRRDAIASPSLRSHSSSTNSAEWLSVYSFPSLEQLAAATE 336
Query: 202 VELRNAGFGYRSAPQSSLLFSVR-----RSFKQAKYITGTVDV---LQSKH-----SGGA 248
LRN GFGYRS + +R ++ + K + G + L +H +G
Sbjct: 337 DTLRNLGFGYRSKYIVEAVNFIRTQLPPQALQDKKAVKGKMHSPPHLIQQHGACYRNGFY 396
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+LS Q D L LPGVG KVA C+ALF+L++ H +PVDTH+ ++A YL
Sbjct: 397 SAVLSHHNYHHQHQRDMLLLLPGVGRKVADCVALFALNRTHIVPVDTHMAQVAVEYL 453
>gi|325263223|ref|ZP_08129958.1| 8-oxoguanine-DNA-glycosylase [Clostridium sp. D5]
gi|324031616|gb|EGB92896.1| 8-oxoguanine-DNA-glycosylase [Clostridium sp. D5]
Length = 269
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 108/243 (44%), Gaps = 27/243 (11%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR G Y+ G + ++ QNGD C SE + + ++
Sbjct: 17 SGQCFRMSLKGDDTYSVIAGERYLEIR--QNGDKCKFF---CSENEYEDFWKHYFDLDTD 71
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLAS 174
E + S D A + G R+L QD E ++ FL S NNI RI + + +
Sbjct: 72 YEEYRKCISQRDKYLMSAADFGCGIRILNQDLWEMIVSFLISQQNNIVRIRRCIQNICQK 131
Query: 175 LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
G + +G ++ FPS E L+ + E EL+ GYRS KY+
Sbjct: 132 YGERKEDADGNIYYTFPSAEVLACLEEDELKACNLGYRS-----------------KYVV 174
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
T + + G L ++RK+ ++A + L L GVG KVA CI LF+L A PVD
Sbjct: 175 RTARSV----ANGEISLDAIRKMPYKKAKEELLKLYGVGEKVADCICLFALHHLAAFPVD 230
Query: 295 THV 297
TH+
Sbjct: 231 THI 233
>gi|402835889|ref|ZP_10884445.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Mogibacterium sp. CM50]
gi|402273122|gb|EJU22331.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Mogibacterium sp. CM50]
Length = 282
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 119/271 (43%), Gaps = 38/271 (14%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHL--ISLKHLQNGDVCYHIHTSPSEPAAKS 104
++ L F GQ FRW + G Y G G + + LKH + T SE S
Sbjct: 14 DIELAHIFECGQCFRWNECGDGSYIGVAGGNAAHVVLKHKDDASTLVVECTGGSEEYWCS 73
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ ++ G++ ASD + E G R+L QD E L+ F+ S NNNI R
Sbjct: 74 ----YFDLDTGYGDMKRALLASDSKLDEAIDTCYGIRILNQDYWEVLISFIVSQNNNIPR 129
Query: 165 ITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
I K ++ LA S G+ +G G + FP+ E+L+ + EL GYR ++ L +
Sbjct: 130 IKKCIESLARSYGTEIGEFMGEVRYAFPTPEQLAEATLEELAALKLGYR----NTYLAAA 185
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
+K+ TG+ + KH AL + GVGPKVA CI LF
Sbjct: 186 PEHYKEQGVPTGSA---EEKHK-------------------ALLSYLGVGPKVANCILLF 223
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPELAGVRL 314
L A P+D VW + ++ EL G +
Sbjct: 224 GLRDFTAFPID--VW---MKKIMAELYGFDI 249
>gi|322708990|gb|EFZ00567.1| 8-oxoguanine DNA glycosylase [Metarhizium anisopliae ARSEF 23]
Length = 263
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 37/191 (19%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ + LG L++ +S SD F + A G R+L QD E L+ F+CSSNNNI+RI++M
Sbjct: 65 YFSLNLDLGALYDQWSKSDPNFQKKAPEFKGVRILSQDAWEALICFICSSNNNISRISQM 124
Query: 169 VDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRS 226
V L + G +G ++G H+FP+ E L+ + E LR GFGYR
Sbjct: 125 VHKLCNHYGPLIGYIDGEPMHDFPTPESLTGKNVESHLRELGFGYR-------------- 170
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLR----------------KLDLQEAIDALCTLP 270
AKYI T V+ + + WL SLR + A +AL +L
Sbjct: 171 ---AKYIANTARVVALEKP--SSWLESLRNPRHSGPGLSSSAGSSNPTYKGAHEALLSLT 225
Query: 271 GVGPKVAACIA 281
GVGPKVA CIA
Sbjct: 226 GVGPKVADCIA 236
>gi|223936761|ref|ZP_03628671.1| 8-oxoguanine DNA glycosylase domain protein [bacterium Ellin514]
gi|223894612|gb|EEF61063.1| 8-oxoguanine DNA glycosylase domain protein [bacterium Ellin514]
Length = 293
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/266 (30%), Positives = 117/266 (43%), Gaps = 31/266 (11%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAK-S 104
++ LP T +GQ FRW++ + G IG H + L+ N I +EP S
Sbjct: 13 TDYDLPGTLSSGQAFRWREEHN-SWIGVIGNHWVRLRSSSNS-----IIAEVAEPVTDWS 66
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
L+DFL + L + F D + G R+LRQ+P ECL F+ SS I +
Sbjct: 67 WLVDFLQTHLELKSVLATFP-KDEPLGNAIRACHGLRLLRQNPWECLASFILSSTKQIVQ 125
Query: 165 ITKMVDFLA-SLGSHLGNVEGFE-FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
I ++V+ L G + G + FPS RL+ +E ELR+ G+R
Sbjct: 126 IQQIVELLCIRFGEPVPVPPGHSPAYAFPSAMRLAAATEAELRDCKMGFR---------- 175
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
A Y+ T ++ S G L L +D+ +A L LPGVG K+A C+ L
Sbjct: 176 -------APYLRETARMIHS----GEVILERLYGMDVDDARAELLKLPGVGRKIADCVLL 224
Query: 283 FSLDQHHAIPVDTHVWKIATRYLLPE 308
F+ A PVD V K P+
Sbjct: 225 FAYGFQAAFPVDVWVMKALQHLYFPK 250
>gi|424833384|ref|ZP_18258109.1| putative 8-oxoguanine DNA glycosylase [Clostridium sporogenes PA
3679]
gi|365979372|gb|EHN15434.1| putative 8-oxoguanine DNA glycosylase [Clostridium sporogenes PA
3679]
Length = 305
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 128/273 (46%), Gaps = 35/273 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G +I ++ + N DV H + +E KS
Sbjct: 21 FELAHIFECGQCFRWYREEKDSYIGIAYGKVIEVEKV-NNDVILH---NATEEDFKSIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ G++ S D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGQIKHILSKDDL-LAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGN-VEGFE--FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+ S+ G +E E ++ FP++E+L +E EL+ G G+R
Sbjct: 136 AI---KSISEKWGCPIEYKESIYYSFPTVEQLKDATEEELKACGVGFR------------ 180
Query: 225 RSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAA 278
AKYI TV+ + ++ ++ L ++ D +C G+G KVA
Sbjct: 181 -----AKYIKDTVNKIYENSIEQSEQYKKEYDMLWIKNQQDDICHKMLQNYSGIGAKVAD 235
Query: 279 CIALFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
C+ LFS++++ A PVD V + + YL P+++
Sbjct: 236 CVMLFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|317056672|ref|YP_004105139.1| DNA-(apurinic or apyrimidinic site) lyase [Ruminococcus albus 7]
gi|315448941|gb|ADU22505.1| DNA-(apurinic or apyrimidinic site) lyase [Ruminococcus albus 7]
Length = 273
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 121/269 (44%), Gaps = 38/269 (14%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
L Q + L T GQ FRW+KT Y+G + + +NG + +H + +E
Sbjct: 12 LCQEDFDLDQTLDCGQAFRWEKTDENTYSGAFLNKKLVI-SCENGKDVFKLHDT-TEVDF 69
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
S D+ ++ EL FS D ++ KY +G R+LRQD E L F+ S NNNI
Sbjct: 70 LSLWADYFDLSTDYSELKRRFS-EDETLSKACKYASGIRILRQDSWEALSSFIISQNNNI 128
Query: 163 ARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
RI ++ L H G +P+ + L+ +E L G+R
Sbjct: 129 PRIKGII---GRLCEHCGG--------YPAWQELTEETEESLAYLRAGFR---------- 167
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
AKY+ VD +Q SG + L +R +D++ A L T+ GVGPKVA C L
Sbjct: 168 -------AKYL---VDAVQKIQSGTID-LEKIRSMDIECARKLLQTIKGVGPKVAECALL 216
Query: 283 FSLDQHHAIPVDTHVWKIATRYL---LPE 308
F + A P+D V ++ + LPE
Sbjct: 217 FGFYRTEAFPIDVWVKRVMANWYPNGLPE 245
>gi|332982797|ref|YP_004464238.1| 8-oxoguanine DNA glycosylase domain-containing protein [Mahella
australiensis 50-1 BON]
gi|332700475|gb|AEE97416.1| 8-oxoguanine DNA glycosylase domain-containing protein [Mahella
australiensis 50-1 BON]
Length = 287
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 117/258 (45%), Gaps = 26/258 (10%)
Query: 50 LPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDF 109
L F +GQ FRW + Y G H++ ++ Q +V H T + +
Sbjct: 21 LEHIFESGQCFRWNASSEGSYIGVAMGHVLEIRQ-QGPNVVLHPCTMEE---FERVWRRY 76
Query: 110 LNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV 169
++ G + A D + ++ G R+L QDP ECL+ F+ S+NN I RI ++
Sbjct: 77 FDLDADYGAIKHSL-AGDPTLDKAMEFGYGMRLLHQDPWECLISFIISANNRIPRIKGII 135
Query: 170 DFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQ 229
+ L+ + +G +++FPS L+ + E+ GYR
Sbjct: 136 EELSMRYGNQLEYKGRIYYDFPSPNELARHTPDEICECRCGYR----------------- 178
Query: 230 AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
A YI T ++ +GG L ++ ++ EA AL LPGVGPKVA C+ LF + +
Sbjct: 179 APYIVETARII----AGGEIDLKAIESMEYHEAHRALMKLPGVGPKVADCVLLFGMGKGE 234
Query: 290 AIPVDTHVWKIATRYLLP 307
A PVD + ++ + LP
Sbjct: 235 AFPVDVWIKRVVEQLYLP 252
>gi|118576830|ref|YP_876573.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Cenarchaeum symbiosum A]
gi|118195351|gb|ABK78269.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Cenarchaeum symbiosum A]
Length = 225
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 84/174 (48%), Gaps = 21/174 (12%)
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEF 190
A+ K G R+LRQDP +C + F+ SSN++I I + + S G EF F
Sbjct: 2 AKAVKKYPGLRLLRQDPFQCCISFMASSNSSIPCIRDRLRRICSTFGKKTKFRGEEFRVF 61
Query: 191 PSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW 250
P L+ S EL + G GYR G + ++ +GG
Sbjct: 62 PRPRDLASASRAELLSCGLGYR---------------------VGFIKDASAEAAGGGLD 100
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L SLR+ Q+A++AL +PG+G K+A C+ LFSLD+ A P+D +I RY
Sbjct: 101 LASLRRSGYQKAMEALIAVPGIGGKIADCVMLFSLDKLEAFPIDRWTMRILERY 154
>gi|429244121|ref|ZP_19207602.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum CFSAN001628]
gi|428758846|gb|EKX81238.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum CFSAN001628]
Length = 271
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 125/270 (46%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|255994545|ref|ZP_05427680.1| 8-oxoguanine DNA glycosylase [Eubacterium saphenum ATCC 49989]
gi|255993258|gb|EEU03347.1| 8-oxoguanine DNA glycosylase [Eubacterium saphenum ATCC 49989]
Length = 294
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 116/257 (45%), Gaps = 17/257 (6%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ E+ L F GQ FR+ Y+G I++ + G++ + E +
Sbjct: 10 RDEVVLRHIFDCGQCFRFVCEEDGSYSGVFRRSFINVSLQEQGNIM-KVKRILGESIDEE 68
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
L F ++G S + E F A D + G R+LRQD E L+ F+ S NNNI+R
Sbjct: 69 ELYRFFDLGSSYERIKEKFKAKDEVMKKAILKGEGIRILRQDFFETLITFIISQNNNISR 128
Query: 165 ITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
I K ++ + A+ GS + G + FP+ E L+ E +L+ GYR+ ++ SV
Sbjct: 129 IRKNIESICAAYGSEVEAGSG--IYAFPTAEELAGAKEKDLKALKLGYRAG---YIVKSV 183
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
K+ I ++ L ++ E L + L PGVG KVA CI LF
Sbjct: 184 EHYIKKKDRIQRCIEKLAKEYEKQEEENL----------FNELMKFPGVGAKVADCIMLF 233
Query: 284 SLDQHHAIPVDTHVWKI 300
+ + P+DT + KI
Sbjct: 234 ATPCKNRFPIDTWIKKI 250
>gi|395645026|ref|ZP_10432886.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanofollis liminatans DSM 4140]
gi|395441766|gb|EJG06523.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanofollis liminatans DSM 4140]
Length = 282
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 115/264 (43%), Gaps = 38/264 (14%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L T GQ FRW+K Y G G H I ++ Q+G+ + + EP +
Sbjct: 13 FNLDATLGCGQAFRWEKVDGTWY-GVAGDHAIGIR--QDGERL--LFSGADEPFIRRYFA 67
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
L++ L + D R G R+LRQ+P E L+ ++C++N NI I K
Sbjct: 68 LDLDLPAILSSV-----DRDPRIHAAIGRCPGLRILRQEPFETLISYICATNTNIPTIKK 122
Query: 168 MVDFLAS-LGSHL-GNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+ +A G L G V F P L+ SE E+R GYR
Sbjct: 123 RIALIAERYGRRLPGGVSAF-----PDAAALAPCSEGEMRGCVLGYR------------- 164
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
A Y T L ++ G W + L EA L PGVGPK A CI LF+
Sbjct: 165 ----APYACATA-ALCAEDPG---WAERVAALPYGEARRELLRFPGVGPKAADCILLFAF 216
Query: 286 DQHHAIPVDTHVWKIATRYLLPEL 309
+++ A PVD H+ ++ R+ LP+L
Sbjct: 217 EKYEAFPVDVHIRRMMRRHYLPDL 240
>gi|226950728|ref|YP_002805819.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum A2
str. Kyoto]
gi|226841466|gb|ACO84132.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum A2
str. Kyoto]
Length = 305
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEDSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ + +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNICERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEDCEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|433655784|ref|YP_007299492.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Thermoanaerobacterium thermosaccharolyticum M0795]
gi|433293973|gb|AGB19795.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Thermoanaerobacterium thermosaccharolyticum M0795]
Length = 303
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 120/256 (46%), Gaps = 28/256 (10%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L T GQ FRW + YTG +I++K +GD+ +T+ ++
Sbjct: 24 DFNLKETLECGQCFRWNEEDDGSYTGVAFDRVINVK--LDGDILTIDNTTLAD--FNDIW 79
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D+ ++G G++ E S + A + K G R+LRQD E L+ F+ S NN I +I
Sbjct: 80 YDYFDLGRDYGKIKETLSQDEILRAAI-KCGGGIRILRQDTWETLISFIISQNNRIPQIK 138
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K+++ L+ H + ++ FP ++ + + E L N+ G+RS
Sbjct: 139 KVIENLSRAFGHPIVYKNKTYYTFPKVQDIIMADEESLNNSKCGFRS------------- 185
Query: 227 FKQAKYIT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
KYI + V + + L L+ D E + L ++ GVGPKVA C+ L+S+
Sbjct: 186 ----KYIIDAALKVFNDEIN-----LFELQLYDTHEVRNILMSIRGVGPKVADCVILYSI 236
Query: 286 DQHHAIPVDTHVWKIA 301
++ A P D + ++
Sbjct: 237 GRYEAFPTDVWIKRVV 252
>gi|146099107|ref|XP_001468558.1| putative 8-oxoguanine DNA glycosylase [Leishmania infantum JPCM5]
gi|134072926|emb|CAM71644.1| putative 8-oxoguanine DNA glycosylase [Leishmania infantum JPCM5]
Length = 565
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 122/297 (41%), Gaps = 97/297 (32%)
Query: 106 LLDFLNMGISLGELWEGFSAS-DCRFAELAKYLAGARV---------------------- 142
L +L++ + L +LW+ ++ S + R L +YL G R+
Sbjct: 157 LSRYLSLDVDLDQLWQEWTDSPETRKHPLVEYLVGNRLHREFLRSGHGQRVCEHDEAQAN 216
Query: 143 -------LRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNV------------ 182
+RQD CL FLCS NNN+ RIT M+ L+ + G HL +V
Sbjct: 217 LYIPIRHVRQDLHSCLFSFLCSQNNNVTRITGMIYALSRTYGDHLCDVQLATGEVQEPRK 276
Query: 183 -----------------------------------------EGFEFHEFPSLERLSLVSE 201
E + FPSLE+L+ +E
Sbjct: 277 SAADTRSKVPQTQFQSDAAATRRDAIASPSLRSHSSSTNSPEWLSVYSFPSLEQLAAATE 336
Query: 202 VELRNAGFGYRSAPQSSLLFSVR-----RSFKQAKYITGTVDV---LQSKH-----SGGA 248
LR+ GFGYRS + +R ++ + K + G + L +H +G
Sbjct: 337 DTLRSLGFGYRSKYIVEAVSFIRTQLPPQALQDKKAVKGKMHSPPHLIQQHGACYRNGFY 396
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+LS Q D L LPGVG KVA C+ALF+L++ H +PVDTH+ ++A YL
Sbjct: 397 SAVLSHHNYHHQHQRDMLLLLPGVGRKVADCVALFALNRTHIVPVDTHMAQVAVEYL 453
>gi|302875793|ref|YP_003844426.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
cellulovorans 743B]
gi|307689228|ref|ZP_07631674.1| putative 8-oxoguanine DNA glycosylase [Clostridium cellulovorans
743B]
gi|302578650|gb|ADL52662.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
cellulovorans 743B]
Length = 291
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 121/264 (45%), Gaps = 28/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW +T Y G +I ++ Q D+ H + SE L
Sbjct: 21 FQLDHVFKCGQCFRWDETESGTYIGVAFGKVIEVEK-QEADLIIH---NTSEEDFNRIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + ++ + + D + + G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 NYFDLSRNYSDI-KAVLSEDPLLNKAVVFGEGIRLLKQDPFEMMISFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G +++ FPS+E L+ SE E+ G G+R
Sbjct: 136 EINLISKKWGQKLTYKGKDYYAFPSIEALNSASEEEIEKCGVGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI TV + ++ + ++S E L G+GPKVA CI LFS+ +
Sbjct: 181 --AKYIKDTVARINDNNN--IDQMISGTDDQCHE---MLKEYMGIGPKVADCIMLFSMGK 233
Query: 288 HHAIPVDTHVWKIATR-YLLPELA 310
+ A PVD V + YL P+++
Sbjct: 234 YSAFPVDVWVKRAMQHFYLAPDVS 257
>gi|325846861|ref|ZP_08169718.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerococcus hydrogenalis
ACS-025-V-Sch4]
gi|325481103|gb|EGC84147.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerococcus hydrogenalis
ACS-025-V-Sch4]
Length = 301
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 119/257 (46%), Gaps = 28/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ F + + +T +I+LK + GD I + SE D+ ++G
Sbjct: 34 FTCGQCFNFFEEEDGSFTAVFLGKIINLK--KEGDKI--IIDNVSEEEFYDIFYDYFDLG 89
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ G++ E S D + +Y G R+L Q+ E L+ F+ S+NN I RI K V ++
Sbjct: 90 VDYGKIKEKISV-DETLKKATEYGTGIRILNQEFFETLISFIISANNQIPRIKKAVRIIS 148
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ G +LG G +++ FPS E+LS +LR ++ + F +R + AK
Sbjct: 149 EMYGDYLGEYRGRKYYSFPSAEKLSKARPEDLR---------EKARVGFRDKRIVQTAKI 199
Query: 233 IT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I G D E + + DL++ L LPGVGPKVA CI LF+ +
Sbjct: 200 INDGFFDF---------EKDIKMPTEDLRKK---LQELPGVGPKVADCILLFAFHKRETF 247
Query: 292 PVDTHVWKIATRYLLPE 308
PVD + ++ + E
Sbjct: 248 PVDVWIKRVMEFLFIKE 264
>gi|407465545|ref|YP_006776427.1| HhH-GPD family protein [Candidatus Nitrosopumilus sp. AR2]
gi|407048733|gb|AFS83485.1| HhH-GPD family protein [Candidatus Nitrosopumilus sp. AR2]
Length = 286
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 35/257 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+++ + +GQ F WKK Y G G + LK +GD+ + ++
Sbjct: 13 INVDSSINSGQVFLWKKNNANWY-GVNGQDI--LKITNSGDI----------KSYQNDKT 59
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
DF ++ ++ + S D + +Y G RV QDP +CL+ F+ SSN+NI +I
Sbjct: 60 DFFRKKDNIEKIIKSISKDDVTKKAIKQY-PGLRVFEQDPFQCLISFIVSSNSNIQKIKN 118
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ + EF FP E+L+ S E+++ G GYR
Sbjct: 119 SLEKMSKRFGIKTEFDDQEFFLFPKPEKLAKASINEIKSCGVGYR--------------- 163
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A +I + K L+K QEA +C +PGVG KVA CI LFSL++
Sbjct: 164 --ANFIKEAANRTALKKID----FEYLKKCSYQEAKKEICQIPGVGNKVADCIMLFSLNK 217
Query: 288 HHAIPVDTHVWKIATRY 304
+ P+D + +I +Y
Sbjct: 218 LESFPLDRWMIRILEKY 234
>gi|291546572|emb|CBL19680.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Ruminococcus sp. SR1/5]
Length = 268
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 110/256 (42%), Gaps = 31/256 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG--DVCYHIHTSPSEPAAKSA 105
SL +GQ FR +K G ++ G + + + ++G D C S E
Sbjct: 9 FSLDKICDSGQCFRMRKLGDGHFSMVAGDQYLEM-YQKDGVVDFC----CSQEEFICYWV 63
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + ++ + D K G R+LRQD E ++ FL S NNI RI
Sbjct: 64 L--YFDLDTDYEAYMNAANPRDKYLGAAIKAGDGIRILRQDLWEMIVTFLISQQNNIKRI 121
Query: 166 TKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
K ++ + G + G E++ FP++E LS +E ELR G GYR
Sbjct: 122 RKCIETICRKYGERKNSSAGVEYYAFPTVEALSQATEEELRGCGLGYR------------ 169
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AKYI T + S G L + + +A L L GVG KVA CI LF+
Sbjct: 170 -----AKYIAVTARTIAS----GQISLEKIYDMRYHQAKKELMKLCGVGEKVAECICLFA 220
Query: 285 LDQHHAIPVDTHVWKI 300
L A P+DTH+ ++
Sbjct: 221 LHHMDAFPIDTHIRQV 236
>gi|153939309|ref|YP_001392648.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum F str.
Langeland]
gi|384463616|ref|YP_005676211.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum F str.
230613]
gi|152935205|gb|ABS40703.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum F str.
Langeland]
gi|295320633|gb|ADG01011.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum F str.
230613]
Length = 305
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 125/270 (46%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEDDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|170757841|ref|YP_001782926.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum B1 str. Okra]
gi|169123053|gb|ACA46889.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum B1
str. Okra]
Length = 305
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 125/270 (46%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|148381246|ref|YP_001255787.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum A str. ATCC
3502]
gi|153930996|ref|YP_001385621.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum A str. ATCC
19397]
gi|153936532|ref|YP_001389027.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum A str. Hall]
gi|148290730|emb|CAL84861.1| putative DNA repair protein [Clostridium botulinum A str. ATCC
3502]
gi|152927040|gb|ABS32540.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum A str.
ATCC 19397]
gi|152932446|gb|ABS37945.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum A str.
Hall]
Length = 305
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 125/270 (46%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|27764447|emb|CAD60546.1| putative 8-oxoguanine DNA glycosylase [Trypanosoma brucei]
Length = 482
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 112/250 (44%), Gaps = 50/250 (20%)
Query: 106 LLDFLNMGISLGELWEGFSASD-CRFAELAKYL---------AGARVLRQDPVECLLQFL 155
L +L + + L ++W ++A + + L K L R LRQD E LL FL
Sbjct: 121 LRHYLALDVDLQKMWLRWTADNPMKSHPLVKCLTPCGSTDLPVSIRHLRQDLHETLLGFL 180
Query: 156 CSSNNNIARITKMV--------DFLASL---------GSHLGNVEG-----------FEF 187
CS NNN+ RIT ++ D+L +H GNV+G
Sbjct: 181 CSQNNNVPRITSLMERLSISYGDYLCDYNVVTGDVRHAAHCGNVKGNPSEQNDGGGWIAL 240
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
H PS+++L+ +E LR GFGYRS + + +S ++ K G
Sbjct: 241 HALPSMDQLAAATEESLRALGFGYRSRYLTECAAIISKSGATRVQKEVKDKKMKKKEVGK 300
Query: 248 ----------AEWL--LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+W L+ +L LQE + L LPGVG KVA CI LF+L H +PVDT
Sbjct: 301 PAGPLAAVQPYKWYEELASNRLTLQERREKLLALPGVGRKVADCILLFALGHHELVPVDT 360
Query: 296 HVWKIATRYL 305
H+ ++A YL
Sbjct: 361 HMAQVAAEYL 370
>gi|182419453|ref|ZP_02950705.1| 8-oxoguanine DNA glycosylase [Clostridium butyricum 5521]
gi|237666456|ref|ZP_04526441.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium butyricum
E4 str. BoNT E BL5262]
gi|182376784|gb|EDT74356.1| 8-oxoguanine DNA glycosylase [Clostridium butyricum 5521]
gi|237657655|gb|EEP55210.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium butyricum
E4 str. BoNT E BL5262]
Length = 309
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 119/259 (45%), Gaps = 28/259 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L TF GQ FR+ K Y +I L+ N D+ I+ S E ++ +
Sbjct: 21 FKLKQTFECGQCFRFHKISDTNYITVAFERVIELEEDGN-DI--KIYNSNREDV-ENIWI 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D + +Y G R+L QDP E L+ F+ S+ N+I I K
Sbjct: 77 DYFDLNRDYSKIKEELSKDDL-LRKSVEYGPGIRILNQDPFEILISFIISARNSIPSIMK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ +++ +G ++ FP++E++ + E++ G +RS + +V S+
Sbjct: 136 TINKISAKWGRTIEYKGNTYNAFPTIEQIKDATLEEIQETGASFRSKYIIDTISNVYSSY 195
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQ--------EAIDALCTLPGVGPKVAAC 279
K +K +E + +K DL+ E AL GVG KVA C
Sbjct: 196 K-------------AKKDKDSEKIEEFKKYDLEYIKSLNDDECHSALQEFKGVGAKVADC 242
Query: 280 IALFSLDQHHAIPVDTHVW 298
I LFS++++ A PVD VW
Sbjct: 243 IMLFSMEKYSAFPVD--VW 259
>gi|72387948|ref|XP_844398.1| 8-oxoguanine DNA glycosylase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62358606|gb|AAX79066.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma brucei]
gi|70800931|gb|AAZ10839.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma brucei brucei
strain 927/4 GUTat10.1]
Length = 500
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 112/250 (44%), Gaps = 50/250 (20%)
Query: 106 LLDFLNMGISLGELWEGFSASD-CRFAELAKYL---------AGARVLRQDPVECLLQFL 155
L +L + + L ++W ++A + + L K L R LRQD E LL FL
Sbjct: 139 LRHYLALDVDLQKMWLRWTADNPMKSHPLVKCLTPCGSTDLPVSIRHLRQDLHETLLGFL 198
Query: 156 CSSNNNIARITKMV--------DFLASL---------GSHLGNVEG-----------FEF 187
CS NNN+ RIT ++ D+L +H GNV+G
Sbjct: 199 CSQNNNVPRITSLMERLSISYGDYLCDYNVVTGDVRHAAHCGNVKGNPSEQNDGGGWIAL 258
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
H PS+++L+ +E LR GFGYRS + + +S ++ K G
Sbjct: 259 HALPSMDQLAAATEESLRALGFGYRSRYLTECAAIISKSGATRVQKEVKDKKMKKKEVGK 318
Query: 248 ----------AEWL--LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+W L+ +L LQE + L LPGVG KVA CI LF+L H +PVDT
Sbjct: 319 PAGPLAAVQPYKWYEELASNRLTLQERREKLLALPGVGRKVADCILLFALGHHELVPVDT 378
Query: 296 HVWKIATRYL 305
H+ ++A YL
Sbjct: 379 HMAQVAAEYL 388
>gi|168182072|ref|ZP_02616736.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum Bf]
gi|182674929|gb|EDT86890.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum Bf]
Length = 305
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 119/270 (44%), Gaps = 41/270 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H T
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILHNATEE---------- 69
Query: 108 DFLNMGISLGELWEGFS------ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
DF N+ +L+ +S + D A+ ++ G R+L+QDP E ++ F+ S+NN
Sbjct: 70 DFKNIWAEYFDLYRDYSEIKHILSKDEILAKAVEFGHGIRLLKQDPFEIIVSFIISANNR 129
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I I K + ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 130 IPMIKKAIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------- 180
Query: 222 SVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPK 275
AKYI TV+ + Q+ ++ L ++ D +C G+G K
Sbjct: 181 --------AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKMLQNYSGIGAK 232
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYL 305
VA C+ LFS++++ A PVD V K A +Y
Sbjct: 233 VADCVMLFSMEKYSAFPVDVWV-KRAMQYF 261
>gi|410082762|ref|XP_003958959.1| hypothetical protein KAFR_0I00430 [Kazachstania africana CBS 2517]
gi|372465549|emb|CCF59824.1| hypothetical protein KAFR_0I00430 [Kazachstania africana CBS 2517]
Length = 370
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 129/291 (44%), Gaps = 50/291 (17%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQY------TGPIGPHLISLKHLQN---- 86
++ L + + L+L GQ+FRW Y G + I L Q+
Sbjct: 3 QFGKLRVKREHLNLQNVLHAGQSFRWIFHADKDYYATSMKVGSYSEYCIVLLKQQDENTI 62
Query: 87 -----GDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGA 140
G+VC L D+ + + L E+ + SD +F L G
Sbjct: 63 EYASVGNVC-------EMNVLGKHLEDYFRLEVDLAEVHANEWIPSDSKFKGLPS--QGI 113
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SL 198
R+L Q+P E L+ F+CSSNNNI+RITKM L S G+ L + +F+ FP+ E L
Sbjct: 114 RILGQEPWETLISFICSSNNNISRITKMCHNLCSYYGNKLDEFDSLQFYSFPTSEDLVKR 173
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVD--VLQSKHSG---GAEWLLS 253
+E ELR GFGYR AKYI T VL K G E+ +
Sbjct: 174 ATEAELRELGFGYR-----------------AKYILETAKKIVLAKKAEGFSTDQEYFKN 216
Query: 254 LRKL-DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + + E + L + GVGPKVA C+ L L H +PVD HV +IA R
Sbjct: 217 LMETKSIGEVKETLMSFSGVGPKVADCVCLMGLRMDHVVPVDVHVGRIAKR 267
>gi|302390179|ref|YP_003826000.1| 8-oxoguanine DNA glycosylase [Thermosediminibacter oceani DSM
16646]
gi|302200807|gb|ADL08377.1| 8-oxoguanine DNA glycosylase domain protein [Thermosediminibacter
oceani DSM 16646]
Length = 292
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 116/254 (45%), Gaps = 26/254 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L +GQ FRW + Y G +G +I K Q D I T+ + +
Sbjct: 11 FNLEAIAESGQAFRWNRLEDGGYLGVVGNLVI--KAYQEEDTL-RIFTNGGNDSI-GFIR 66
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ E+ + S D K+ +G R+L Q P E L+ F+ S+NN+I I +
Sbjct: 67 DYFDLERDYREIEDKLSGFD-ELVPAVKFCSGNRILHQQPWETLISFILSANNSIPNIKR 125
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ + S EG ++ FP+ E L+ +SE ++R+ G+R
Sbjct: 126 TIERMCSCYGTPVEFEGEIYYTFPTPEVLASLSEAQIRDTRCGFR--------------- 170
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
KY+ ++ +GG L L KL EA D L +PGVG K+A C+ LFSL +
Sbjct: 171 --GKYVIAAARMV----AGGDIVLDELEKLPTGEARDYLMKIPGVGRKIADCVLLFSLRK 224
Query: 288 HHAIPVDTHVWKIA 301
A PVD + ++
Sbjct: 225 FDAFPVDVWIKRVV 238
>gi|299143297|ref|ZP_07036377.1| 8-oxoguanine DNA glycosylase [Peptoniphilus sp. oral taxon 386 str.
F0131]
gi|298517782|gb|EFI41521.1| 8-oxoguanine DNA glycosylase [Peptoniphilus sp. oral taxon 386 str.
F0131]
Length = 291
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 121/259 (46%), Gaps = 30/259 (11%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRWK+ L +T ++++ + G+ +TS E +D+ ++
Sbjct: 23 IFKCGQAFRWKEEQDLSFTNIAFGKILNISKI--GEDVILKNTSIEE--FNELWVDYFDL 78
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
L E S + A L +Y G R+L QDP E ++ F+ S+NN I RI K ++ +
Sbjct: 79 NRDYFSLREELSFDEILRAAL-EYGNGIRILNQDPFETIITFIISANNQIPRIKKSIEKI 137
Query: 173 ASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQA 230
+ + G +G G E++ FPS ++L+L +LR A G+R R K +
Sbjct: 138 SQMYGEKIGEYLGEEYYNFPSADKLALADPKDLREFAKVGFRD----------ERIVKAS 187
Query: 231 KYITG-TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
K I +D+ G L ++ A + L LPGVGPKVA C+ LF+ +
Sbjct: 188 KMIRDREIDI-------GL-----LYDCPIEMAREELMKLPGVGPKVADCVLLFAFKRQE 235
Query: 290 AIPVDTHVWKIATRYLLPE 308
+ PVD + ++ L +
Sbjct: 236 SFPVDVWIKRVMEELYLKK 254
>gi|289423363|ref|ZP_06425171.1| 8-oxoguanine DNA glycosylase domain protein [Peptostreptococcus
anaerobius 653-L]
gi|289156294|gb|EFD04951.1| 8-oxoguanine DNA glycosylase domain protein [Peptostreptococcus
anaerobius 653-L]
Length = 296
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 121/266 (45%), Gaps = 28/266 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
QS F GQ FRW K YTG ++++ ++GD+ I + + ++
Sbjct: 20 QSNFEPSHIFDCGQCFRWMKEDDGSYTGVAMGKILNISK-RDGDI---ILLNTNLGDFEN 75
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
D+ ++G G++ + D + ++ G R+L+QD E L+ F+ SSNN I
Sbjct: 76 IWYDYFDLGRDYGQMIDKLKVHDDNLLKATEFGHGIRLLQQDGWEMLISFIISSNNRIPM 135
Query: 165 ITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
I + ++ ++ + G LG +G ++ FPS +LS S +LRN G+R
Sbjct: 136 IQRAINNISENYGDCLGEYKGKTYYSFPSPAQLSKASVEDLRNLKLGFRD---------- 185
Query: 224 RRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
KY+ T VL+S ++ K++ L GVG KVA CIAL
Sbjct: 186 -------KYVYETTKAVLESGLD-----MVDFIKMEADICHKELMKFKGVGAKVADCIAL 233
Query: 283 FSLDQHHAIPVDTHVWKIATRYLLPE 308
F + ++ + PVD V ++ + E
Sbjct: 234 FGMRKYDSFPVDVWVKRVMQEFYGAE 259
>gi|261327566|emb|CBH10542.1| 8-oxoguanine DNA glycosylase, putative [Trypanosoma brucei
gambiense DAL972]
Length = 500
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 111/250 (44%), Gaps = 50/250 (20%)
Query: 106 LLDFLNMGISLGELWEGFSASD-CRFAELAKYL---------AGARVLRQDPVECLLQFL 155
L +L + + L ++W ++A + + L K L R LRQD E LL FL
Sbjct: 139 LRHYLALDVDLQKMWLRWTADNPMKSHPLVKCLTPCGSTDLPVSIRHLRQDLHETLLGFL 198
Query: 156 CSSNNNIARITKMV--------DFLASL---------GSHLGNVEG-----------FEF 187
CS NNN+ RIT ++ D+L +H GNV+G
Sbjct: 199 CSQNNNVPRITSLMERLSISYGDYLCDYNVVTGDVRHAAHCGNVKGKPSEQNDGGGWIAL 258
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
H PS+++L+ +E LR GFGYRS + + +S ++ K G
Sbjct: 259 HALPSMDQLAAATEESLRALGFGYRSRYLTECAAIISKSGATRVQKEVKDKKMKKKEVGK 318
Query: 248 ----------AEWL--LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+W L+ +L LQE + L LPGVG KVA CI LF+L H +PVDT
Sbjct: 319 PAGPLAAVQPYKWYEELASNRLTLQERREKLLALPGVGRKVADCILLFALGHHELVPVDT 378
Query: 296 HVWKIATRYL 305
H ++A YL
Sbjct: 379 HTAQVAAEYL 388
>gi|219851216|ref|YP_002465648.1| 8-oxoguanine DNA glycosylase [Methanosphaerula palustris E1-9c]
gi|219545475|gb|ACL15925.1| 8-oxoguanine DNA glycosylase domain protein [Methanosphaerula
palustris E1-9c]
Length = 285
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 125/284 (44%), Gaps = 35/284 (12%)
Query: 41 LNLTQSE-LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
LNL+ + SL LT GQ FRW++ + G +G +I ++ ++ + + T SE
Sbjct: 5 LNLSSDQSFSLSLTLGCGQAFRWEQDEAGWWEGVVGDEVIRVRQ-EDRSLTF---TGTSE 60
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
L+++ + + L + E A + + AG R+LRQ ECLL +LC++N
Sbjct: 61 ----ERLIEYFALDMDLAHVLETIDRDPFIHAAIEE-CAGLRILRQPSWECLLSYLCATN 115
Query: 160 NNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSS 218
NI + K V LA SLG + + F F P + L + GYR
Sbjct: 116 TNIPMVKKRVRLLAESLGERIPGTDQFAF---PVPSVFNETCAEPLDHCRLGYRKG---- 168
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
Y+ T L ++ GG W +R +EA L LPG+GPK A
Sbjct: 169 -------------YLATTACQLAAE--GG--WEGRVRAQPFEEARQVLTRLPGIGPKAAD 211
Query: 279 CIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
C+ LF ++ A PVD + +I ++ A TPK R+
Sbjct: 212 CVLLFGFSRYEAFPVDVWIRRIMQQFYPETAAEGSFTPKEYERI 255
>gi|402837376|ref|ZP_10885901.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Eubacteriaceae bacterium OBRC8]
gi|402275493|gb|EJU24646.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Eubacteriaceae bacterium OBRC8]
Length = 294
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 121/256 (47%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW YTG + +I++ L++ DV + + + ++ +++ ++G
Sbjct: 24 FECGQCFRWDLEEDGSYTGVVYDKVINV-SLEDEDV---VLKNTNVDDVENIWINYFDIG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G + + S D E K+ G R+L+Q E ++ F+ S+ N I I + V+ L+
Sbjct: 80 KDYGHIKKELSDMDEYLKESTKFGYGIRILKQPFYEMVISFIISARNAIPMIKRSVNKLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
LG H+ G +++ FP+++ L+ NA +A + + +A Y
Sbjct: 140 EDLGEHIDTYNGKKYYSFPTVKSLA--------NADIEKITASKIAF---------RAPY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T +++ + ++ KL L E + L GVG KVA CIALF L ++ A P
Sbjct: 183 IQKTAKMIEDEDIQEKDF----EKLTLDETAEKLTKFAGVGAKVADCIALFGLSKYDAFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + L +
Sbjct: 239 VDVWVKRVMEEFYLKD 254
>gi|435852146|ref|YP_007313732.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanomethylovorans hollandica DSM 15978]
gi|433662776|gb|AGB50202.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanomethylovorans hollandica DSM 15978]
Length = 279
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 122/264 (46%), Gaps = 35/264 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPI-GPHLISLKHLQNGDVCYHIHTSPSEPA 101
L + +L T GQ FRW K G + +TG + G + + + +Q G +H P
Sbjct: 4 LQCDDFNLDYTLDCGQVFRWDKVGDV-WTGVVRGDVVRTWQDIQTG----AVHIDSILP- 57
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
K L++ L + E + D AE G R++RQDP ECL+ +L ++ +
Sbjct: 58 -KDFFLNYFRFDDDLNSIMEAVN-KDKYMAEAISGYRGMRLIRQDPWECLISYLLATAWS 115
Query: 162 IARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I I + L+ + G + + ++ FP + L++ + +L + G+R+
Sbjct: 116 IPNIKNGIFKLSRIFGEEIAD----GYYSFPKPDSLAIACDSDLCDCRLGFRTN------ 165
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
+R++ Q + G +D L L +LD + A L TL GVG KVA CI
Sbjct: 166 -RIRKAAVQV--VNGDID------------LAELFELDYEAAKKKLMTLDGVGEKVADCI 210
Query: 281 ALFSLDQHHAIPVDTHVWKIATRY 304
LF+ ++ A PVDTHV K+ Y
Sbjct: 211 LLFAFEKMEAFPVDTHVEKVIRTY 234
>gi|237796743|ref|YP_002864295.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum Ba4
str. 657]
gi|229262070|gb|ACQ53103.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum Ba4
str. 657]
Length = 305
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 119/270 (44%), Gaps = 41/270 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H T
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILHNATEE---------- 69
Query: 108 DFLNMGISLGELWEGFS------ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
DF N+ +L+ +S + D A+ ++ G R+L+QDP E ++ F+ S+NN
Sbjct: 70 DFKNIWAEYFDLYRDYSEIKHILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNR 129
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I I K + ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 130 IPMIKKAIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------- 180
Query: 222 SVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPK 275
AKYI TV+ + Q+ ++ L ++ D +C G+G K
Sbjct: 181 --------AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKMLQNYSGIGAK 232
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYL 305
VA C+ LFS++++ A PVD V K A +Y
Sbjct: 233 VADCVMLFSMEKYSAFPVDVWV-KRAMQYF 261
>gi|157875890|ref|XP_001686315.1| putative 8-oxoguanine DNA glycosylase [Leishmania major strain
Friedlin]
gi|68129389|emb|CAJ07930.1| putative 8-oxoguanine DNA glycosylase [Leishmania major strain
Friedlin]
Length = 565
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 120/297 (40%), Gaps = 97/297 (32%)
Query: 106 LLDFLNMGISLGELWEGFSAS-DCRFAELAKYLAGARV---------------------- 142
L +L++ + L +LW+ ++ S + R L +YL G+R
Sbjct: 157 LSRYLSLDVDLDQLWQEWTDSPETRKHPLVEYLVGSRRQRQLLRSGHGQRVCEHDEAQAN 216
Query: 143 -------LRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNV------------ 182
+RQD CL FLCS NNN+ RIT M+ L+ + G HL +V
Sbjct: 217 LYIPIRHVRQDLHSCLFSFLCSQNNNVTRITGMIYALSRAYGDHLCDVQLATGEVRAPRK 276
Query: 183 -----------------------------------------EGFEFHEFPSLERLSLVSE 201
E + FPSLE+L+ V+E
Sbjct: 277 SAADTRSKAPQTHSRSDPAATRRDTMASPSLKPHSASANAPEWLSVYSFPSLEQLATVTE 336
Query: 202 VELRNAGFGYRSAPQSSLLFSVRR-----SFKQAKYITGTVDV---LQSKH-----SGGA 248
LR+ GFGYRS + +R + K + G + L +H +G
Sbjct: 337 DTLRSLGFGYRSKYIVEAVSFIRTQLPPPELQDEKAVKGKMHFPPHLIRQHGACYRNGFY 396
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+L Q D L LPGVG KVA C+ALF+L++ H +PVDTH+ ++A YL
Sbjct: 397 SAVLGHHSYHHQHQRDMLLLLPGVGRKVADCVALFALNRTHIVPVDTHMAQVAVEYL 453
>gi|326202347|ref|ZP_08192216.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
papyrosolvens DSM 2782]
gi|325987465|gb|EGD48292.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
papyrosolvens DSM 2782]
Length = 295
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 125/275 (45%), Gaps = 27/275 (9%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L F GQ FRW + Y G G L+++ + N +C T+ E K+
Sbjct: 22 DFNLTHIFECGQCFRWIRQEDGSYKGIAGGRLVNVSY-DNEILCI---TNSGEQDFKNIW 77
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
++ ++G ++ + E K G R+L+QD E L+ F+ S+NN I RI
Sbjct: 78 YEYFDLGTDYSKIKASLEQDEI-MKEAIKTGWGIRLLKQDFWEMLISFIISANNMIPRIM 136
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K VD L++L + + E + FP +E L+ +S E++ G+R
Sbjct: 137 KTVDTLSALRGKCIDSQQ-EAYSFPDVETLAKMSLEEIQQCKAGFR-------------- 181
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
KYI T ++ + G +LR +D A L LPGVGPKVA CI LFS
Sbjct: 182 ---CKYIHKTAALM----AQGIITEDNLRGMDTALARKELMKLPGVGPKVADCILLFSGM 234
Query: 287 QHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSR 321
++ P D V ++ L + +G++ + S+
Sbjct: 235 KYDVFPTDVWVKRVMEELYLKKESGLKEIQQFASK 269
>gi|297588569|ref|ZP_06947212.1| 3-methyladenine DNA glycosylase [Finegoldia magna ATCC 53516]
gi|297573942|gb|EFH92663.1| 3-methyladenine DNA glycosylase [Finegoldia magna ATCC 53516]
Length = 297
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 118/277 (42%), Gaps = 28/277 (10%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ QS + F GQ FRW K + YT G ++ + GD Y + + +
Sbjct: 19 IKQSSFNPTHIFECGQAFRWYKEDDMSYTTVDGDMYCNVSLV--GDYVYIKNCTEDDYYK 76
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K ++ ++ ++ E S ++ +Y G R+L QD ++ F+ S+NN I
Sbjct: 77 KWE--NYFDLKTDYSKIKEELSFNET-LKNALEYGDGIRILNQDKFSTIISFIISANNQI 133
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RI K V+ + + G LG G + + FP E L V E+R
Sbjct: 134 PRIMKSVNLICENYGKLLGEFNGRKLYSFPKPEDLEKVPVEEMREV-------------- 179
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEW-LLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R F+ + VDV +K E+ + + L +E L LPGVGPKVA CI
Sbjct: 180 -CRVGFRDKR----IVDV--AKMVADNEFDIFEIDNLSNEELRKELIKLPGVGPKVADCI 232
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
LFS ++H+ PVD + ++ + E L K
Sbjct: 233 MLFSYNRHNTFPVDVWIKRVMEYLFIKEETNKNLIAK 269
>gi|363888869|ref|ZP_09316246.1| hypothetical protein HMPREF9628_00890 [Eubacteriaceae bacterium
CM5]
gi|361967246|gb|EHL20079.1| hypothetical protein HMPREF9628_00890 [Eubacteriaceae bacterium
CM5]
Length = 294
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 121/256 (47%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW YTG + +I++ L++ DV I + + ++ +++ ++
Sbjct: 24 FECGQCFRWDLEEDGSYTGVVYDKVINV-SLEDEDV---ILKNTNVDDVENIWINYFDLD 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G + + S D E K+ G R+L+Q E ++ F+ S+ N I I + V+ L+
Sbjct: 80 KDYGHIKKELSDIDDYLKESTKFGYGIRILKQPFYEMVISFIISARNAIPMIKRSVNKLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
LG H+ G +++ FP++E L+ NA +A + + +A Y
Sbjct: 140 EDLGEHIDTYNGKKYYSFPTVESLA--------NANIETITASKVAF---------RAPY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T ++ +++ ++ KL L E + L GVG KVA CIALF L ++ A P
Sbjct: 183 IQKTAKMIVNENIQEKDF----EKLTLDETAEKLTKFAGVGAKVADCIALFGLSKYDAFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + L +
Sbjct: 239 VDVWVKRVMEEFYLKD 254
>gi|342732014|ref|YP_004770853.1| 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|384455432|ref|YP_005668026.1| 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-mouse-Yit]
gi|417961791|ref|ZP_12604140.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-2]
gi|417967883|ref|ZP_12608940.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-co]
gi|418016590|ref|ZP_12656155.1| putative 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418372264|ref|ZP_12964356.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-mouse-SU]
gi|342329469|dbj|BAK56111.1| 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|345506925|gb|EGX29219.1| putative 8-oxoguanine DNA glycosylase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983774|dbj|BAK79450.1| 8-oxoguanine-DNA-glycosylase [Candidatus Arthromitus sp.
SFB-mouse-Yit]
gi|380332276|gb|EIA23132.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-2]
gi|380340577|gb|EIA29153.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-co]
gi|380341933|gb|EIA30378.1| 8-oxoguanine DNA glycosylase domain-containing protein [Candidatus
Arthromitus sp. SFB-mouse-SU]
Length = 291
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/289 (29%), Positives = 130/289 (44%), Gaps = 66/289 (22%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ L F GQ FRW++T Y +I L+ + +GD+ IH S +
Sbjct: 19 NNFVLKHIFENGQCFRWERTNIGTYIIVAKGRVIELE-MDSGDLL--IHNSNVD------ 69
Query: 106 LLDFLNMGISLGELWEGFSASDCRF-AELA--KYLA-------GARVLRQDPVECLLQFL 155
DF N+ I+ + F C+ EL KYL G R+L QDP E +L F+
Sbjct: 70 --DFENIWIN----YFDFERDYCKLKGELKHDKYLDIAINFGHGLRILNQDPFEMILSFI 123
Query: 156 CSSNNNIARITKMVDFLASLGSHLGN---VEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
SSNN I I K + S+ G+ +G +++FPSLE L+ +S+ + R G+R
Sbjct: 124 ISSNNRIPMIKKAI---LSISEKYGDPISYKGRIYYKFPSLENLANLSQEKFRECSVGFR 180
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC----- 267
KY+ TV +L + E++++L D +C
Sbjct: 181 D-----------------KYLYNTVKLLNEDND--IEYIMNLN--------DDMCHKELQ 213
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPE--LAGVR 313
GVG KV+ CI LFS+ ++ A PVD V + + Y+ P+ L G+R
Sbjct: 214 KFSGVGSKVSDCIMLFSMKKYSAFPVDVWVKRAMMKFYVAPDTSLKGIR 262
>gi|167747489|ref|ZP_02419616.1| hypothetical protein ANACAC_02210 [Anaerostipes caccae DSM 14662]
gi|317473546|ref|ZP_07932837.1| 8-oxoguanine DNA-glycosylase [Anaerostipes sp. 3_2_56FAA]
gi|167652851|gb|EDR96980.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerostipes caccae DSM 14662]
gi|316898983|gb|EFV21006.1| 8-oxoguanine DNA-glycosylase [Anaerostipes sp. 3_2_56FAA]
Length = 272
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 110/245 (44%), Gaps = 29/245 (11%)
Query: 55 PTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVC-YHIHTSPSEPAAKSALLDFLNMG 113
+GQ FR ++ Y+ G + + QNG C +H + E K + ++
Sbjct: 16 DSGQCFRMRQKRDNLYSVIAGNEYLEV--TQNGGQCTFHCDENKYETFWKK----YFDLN 69
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E D + +++ G R+LRQD E ++ FL S NNI RI + +D +
Sbjct: 70 ADYKKYIESIDEEDSYLKKASEFGFGIRILRQDLWEMIVSFLISQQNNIVRIRRCIDNIC 129
Query: 174 S-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
G + G ++ FP E L+ + E L++ GYRS KY
Sbjct: 130 ERYGEKRSDGCGNFYYTFPKAEILAELHEDALKDCNLGYRS-----------------KY 172
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
+ T L +GG L +++++ +EA L L G+G KVA CI LF+L A P
Sbjct: 173 VVRTAKSL----TGGEVSLEAIKQMPYKEAKAELMKLYGIGEKVADCICLFALHHFEAFP 228
Query: 293 VDTHV 297
VDTH+
Sbjct: 229 VDTHI 233
>gi|402309111|ref|ZP_10828107.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Eubacterium sp. AS15]
gi|400373230|gb|EJP26164.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Eubacterium sp. AS15]
Length = 293
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 118/257 (45%), Gaps = 26/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW+ YT +I++ +++ ++ I + S + + + ++
Sbjct: 24 FECGQCFRWEAEDDGSYTAIAFDKVINV-SIEDDNI---ILKNSSVDDVEKLWISYFDLN 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E S D AK+ G R+LRQD E ++ F+ S+ N+I I K V+ L+
Sbjct: 80 TDYSAIKEKLSNIDEYLKASAKFGYGIRILRQDFHEMIISFIISARNSIPMIKKSVETLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
SLG H+ G +++ FP+++ L +R+A +R A Y
Sbjct: 140 QSLGQHIDTFNGKKYYSFPTMQALRDADIQLIRDAKTAFR-----------------APY 182
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +++ KL L E + L L GVG KVA CIALF L ++ A P
Sbjct: 183 IQKTASYIVENNIKSSDF----DKLSLDECSEKLQELQGVGAKVADCIALFGLAKYDAFP 238
Query: 293 VDTHVWKIATRYLLPEL 309
VD V ++ + L ++
Sbjct: 239 VDVWVKRVIGEFYLQDV 255
>gi|170759581|ref|YP_001788619.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum A3 str. Loch
Maree]
gi|169406570|gb|ACA54981.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum A3
str. Loch Maree]
Length = 305
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEEDFKNIWG 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QDP E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDPFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKNATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKICQNSIEEREQYEKEYDMLWIKNQQDDICHKMLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS+ ++ A PVD V + + YL P+++
Sbjct: 239 LFSMKKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|119936181|gb|ABM06086.1| 8-oxoguanine DNA glycosylase [Bos taurus]
Length = 188
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 76/138 (55%), Gaps = 20/138 (14%)
Query: 168 MVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRR 225
MV+ L + G L ++ +H FPSL+ L+ E +LRN G GYR
Sbjct: 1 MVERLCQTFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRNLGLGYR------------- 47
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
A++++ + + + GG WL LRK +EA ALCTLPGVG KVA CI L +L
Sbjct: 48 ----ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALCTLPGVGTKVADCICLMAL 102
Query: 286 DQHHAIPVDTHVWKIATR 303
D+ A+PVD HVW+IA R
Sbjct: 103 DKPQAVPVDVHVWQIAQR 120
>gi|119936073|gb|ABM06068.1| 8-oxoguanine DNA glycosylase isoform [Bos taurus]
Length = 180
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 76/138 (55%), Gaps = 20/138 (14%)
Query: 168 MVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRR 225
MV+ L + G L ++ +H FPSL+ L+ E +LRN G GYR
Sbjct: 1 MVERLCQTFGPRLIQLDDVTYHGFPSLQALAGPEVEAQLRNLGLGYR------------- 47
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
A++++ + + + GG WL LRK +EA ALCTLPGVG KVA CI L +L
Sbjct: 48 ----ARFVSASARAILEER-GGLPWLQQLRKAPYEEAHKALCTLPGVGTKVADCICLMAL 102
Query: 286 DQHHAIPVDTHVWKIATR 303
D+ A+PVD HVW+IA R
Sbjct: 103 DKPQAVPVDVHVWQIAQR 120
>gi|429728866|ref|ZP_19263567.1| 8-oxoguanine DNA-glycosylase [Peptostreptococcus anaerobius VPI
4330]
gi|429147382|gb|EKX90410.1| 8-oxoguanine DNA-glycosylase [Peptostreptococcus anaerobius VPI
4330]
Length = 296
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 119/257 (46%), Gaps = 28/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++ ++GD+ I + + ++ D+ ++G
Sbjct: 29 FDCGQCFRWIKEDDGSYTGVAMGKILNVSK-RDGDI---ILLNTNLDDFENIWYDYFDLG 84
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G++ + D + ++ G R+L+QD E L+ F+ SSNN I I + ++ ++
Sbjct: 85 RDYGQMIDKLKVHDDNLLKATEFGHGIRLLQQDGWEMLISFIISSNNRIPMIQRAINNIS 144
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ G LG +G ++ FPS +LS S +LRN G+R KY
Sbjct: 145 ENYGDCLGQYKGKIYYSFPSPAQLSKASVEDLRNLKLGFRD-----------------KY 187
Query: 233 ITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
+ T VL+S ++ K++ L GVG KVA CIALF + ++ +
Sbjct: 188 VYETTKAVLESGLD-----MVDFIKMEADICHKELMKFKGVGAKVADCIALFGMRKYDSF 242
Query: 292 PVDTHVWKIATRYLLPE 308
PVD V ++ + E
Sbjct: 243 PVDVWVKRVMQEFYGAE 259
>gi|363892653|ref|ZP_09319815.1| 8-oxoguanine DNA-glycosylase (ogg) [Eubacteriaceae bacterium CM2]
gi|361963418|gb|EHL16493.1| 8-oxoguanine DNA-glycosylase (ogg) [Eubacteriaceae bacterium CM2]
Length = 294
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/256 (27%), Positives = 118/256 (46%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW YTG + +I++ L++ DV + + + ++ +++ ++G
Sbjct: 24 FECGQCFRWDLEEDGSYTGVVYDKVINV-SLEDEDV---VLKNTNVDDVENIWINYFDIG 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G + + S D E K+ G R+L+Q E ++ F+ S+ N I I + V+ L+
Sbjct: 80 KDYGHIKKELSDMDEYLKESTKFGYGIRILKQPFYEMVISFIISARNAIPMIKRSVNKLS 139
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
LG H+ G +++ FP+++ L+ NA + S + F K AK
Sbjct: 140 EDLGEHIDTYNGKKYYSFPTVKSLA--------NADI--ETITASKVAFRAPYIQKTAKM 189
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I + +Q K KL L E + L GVG KVA CIALF L ++ A P
Sbjct: 190 IVD--EDIQEK---------DFEKLTLDETAEKLTKFAGVGAKVADCIALFGLSKYDAFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + L +
Sbjct: 239 VDVWVKRVMEEFYLKD 254
>gi|161529048|ref|YP_001582874.1| HhH-GPD family protein [Nitrosopumilus maritimus SCM1]
gi|160340349|gb|ABX13436.1| HhH-GPD family protein [Nitrosopumilus maritimus SCM1]
Length = 287
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 116/257 (45%), Gaps = 35/257 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+ + + +GQ F W+K Y G G ++ + +NG + + ++
Sbjct: 14 IDVENSINSGQVFLWRKNKEFWY-GVNGQDILEVN--KNGKI----------KSLQNYKT 60
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
DF + E+ + S + KY G R+++QDP +CL+ F+ SSN+NI +I
Sbjct: 61 DFFRNNDNFDEIIKSISKDKIVKNAVKKY-PGLRIIKQDPFQCLISFIVSSNSNIQKIKT 119
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ + EF FP+ + LS S E++N G GYR
Sbjct: 120 NLENISQKFGERVEYKDQEFFLFPNAKTLSKASITEIKNCGVGYR--------------- 164
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AK+I + S+ + L+ D +A + +PG+G KVA CI LFSLD+
Sbjct: 165 --AKFIKEASKIFASEKI----MIDDLKSSDYFDAKKKIRIIPGIGNKVADCILLFSLDK 218
Query: 288 HHAIPVDTHVWKIATRY 304
+ P+D + +I +Y
Sbjct: 219 LESFPLDRWMIRILEKY 235
>gi|255525712|ref|ZP_05392644.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium
carboxidivorans P7]
gi|296184816|ref|ZP_06853227.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium carboxidivorans P7]
gi|255510614|gb|EET86922.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium
carboxidivorans P7]
gi|296050598|gb|EFG90021.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium carboxidivorans P7]
Length = 305
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 124/264 (46%), Gaps = 17/264 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+ F GQ FRW + Y G +I ++ +N DV I+ + E K +
Sbjct: 21 FEITHIFDCGQCFRWHRQKNGNYIGVAYGKVIEIEKKEN-DVI--IYNTCKEDFDK-IWM 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ E+ + D + + G R+L+Q+P E ++ F+ S+NN I I +
Sbjct: 77 DYFDLDRDYSEI-KNLLGKDPILSTSVNFGYGIRLLKQEPFEIIVSFITSANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++S + ++ FP++++LS VS EL + G G+R+ + ++
Sbjct: 136 AIENISSKWGEKLEYKNNTYYTFPTIDKLSSVSLEELESCGVGFRAKYIKDTVSNI---- 191
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
Y G ++ + +W+ S + ++ L G+GPKVA CI LFS+++
Sbjct: 192 ----YTKGNIENKEYDEQYDIKWIKS---QEAEKCHKELQKFMGIGPKVADCIMLFSMEK 244
Query: 288 HHAIPVDTHVWKIATR-YLLPELA 310
+ A PVD V + YL P+++
Sbjct: 245 YSAFPVDVWVKRAMQHFYLAPDVS 268
>gi|424813461|ref|ZP_18238659.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Candidatus Nanosalina sp. J07AB43]
gi|339758750|gb|EGQ44003.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Candidatus Nanosalina sp. J07AB43]
Length = 289
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 120/270 (44%), Gaps = 33/270 (12%)
Query: 46 SELSLPLTFPTGQTFRWKK-TGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ L LT GQTF W + G L + + NG+ P K+
Sbjct: 9 EDFDLELTLTCGQTFCWHRLNGDLFEDSSENSWFYTFR---NGEPVMVRQDQPDRLTVKT 65
Query: 105 ALLDFLNMGISLG---ELWEGFSA--SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
L D + +LG +L + FS D E + L G R++R + CL+ +LCS
Sbjct: 66 DL-DPSQIKQALGLDKDLQKVFSRFPDDDELEEAKESLWGLRIVRDEFFPCLISYLCSPQ 124
Query: 160 NNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
I RI KM + ++ +E + FP+++ LS SE +LR+ G GYR
Sbjct: 125 MRIPRIKKMHNSISREFGEPIEMEQKSVYRFPTVKELSRASEEDLRSLGVGYR------- 177
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
A+YI+ TV++++S L+ + +EA L L GVG KVA C
Sbjct: 178 ----------AEYISETVEIIRSDFDSD-----KLQDFEYEEARKYLKQLHGVGDKVADC 222
Query: 280 IALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
+ LFS H A PVDT K A + L PEL
Sbjct: 223 VLLFSCGFHQAYPVDTWAEK-ALKALYPEL 251
>gi|52549862|gb|AAU83711.1| 8-oxoguanine DNA glycosylase [uncultured archaeon GZfos33E1]
Length = 288
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 118/259 (45%), Gaps = 33/259 (12%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+S +L T GQ FRW K Y G + ++ ++ Q GD H P + +
Sbjct: 4 ESHFNLDYTLSCGQVFRWGKANGWWY-GIVDGSILKVR--QEGDEL-HFSAYPKD-VEEE 58
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ + + L + + D + + G R++RQ +C++ ++C++N +IA
Sbjct: 59 FIKSYFRLNDDLSNILRIIN-KDVEINKAIRQFNGLRIVRQSVWDCMISYICATNASIAV 117
Query: 165 ITKMVDFLASLGSHLGN---VEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I M L +L G+ V G F FP +++L+ S +++ GYR+ S +
Sbjct: 118 IESM---LRNLSEKFGDEIVVNGKAFFSFPKVKKLAKASVNKIKLCKVGYRAKFLSEI-- 172
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
AK + ++L+ LR D E D L +LPGVGPKVA C++
Sbjct: 173 --------AKQVKNNPNLLEE-----------LRNSDYLELWDELRSLPGVGPKVADCVS 213
Query: 282 LFSLDQHHAIPVDTHVWKI 300
LF+ D+ A P+D + ++
Sbjct: 214 LFAFDKLEAFPIDVWIRRV 232
>gi|363895654|ref|ZP_09322644.1| hypothetical protein HMPREF9629_00926 [Eubacteriaceae bacterium
ACC19a]
gi|361956621|gb|EHL09934.1| hypothetical protein HMPREF9629_00926 [Eubacteriaceae bacterium
ACC19a]
Length = 310
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 121/256 (47%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW YTG + +I++ L++ DV I + + ++ +++ ++
Sbjct: 40 FECGQCFRWNLEEDGSYTGVVYDKVINV-SLEDEDV---ILKNTNVDDVENIWINYFDLD 95
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
G + + S D E K+ G R+L+Q E ++ F+ S+ N I I + V+ L+
Sbjct: 96 KDYGHIKKELSDIDDYLKESTKFGYGIRILKQPFYEMVISFIISARNAIPMIKRSVNKLS 155
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
LG H+ G +++ FP+++ L+ NA +A + + +A Y
Sbjct: 156 EDLGEHIDTYNGKKYYSFPTVKSLA--------NANIETITASKVAF---------RAPY 198
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T ++ +++ ++ KL L E + L GVG KVA CIALF L ++ A P
Sbjct: 199 IQKTAKMIVNENIQEKDF----EKLTLDETAEKLTKFAGVGAKVADCIALFGLSKYDAFP 254
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + L +
Sbjct: 255 VDVWVKRVMEEFYLKD 270
>gi|333896525|ref|YP_004470399.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacterium
xylanolyticum LX-11]
gi|333111790|gb|AEF16727.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacterium
xylanolyticum LX-11]
Length = 304
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 119/257 (46%), Gaps = 28/257 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L T GQ FRW + YTG +I++K +GD+ +T+ ++
Sbjct: 24 KDFNLKETLECGQCFRWNEEDDGSYTGVAFDRVINVK--LDGDILTIDNTTLAD--FNDI 79
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G G++ E S + A + KY G R+LRQD E L+ F+ S NN I +I
Sbjct: 80 WYDYFDLGRDYGKIKETLSQDEILKAAI-KYGEGIRILRQDTWETLISFIISQNNRIPQI 138
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ L+ L H + ++ FP ++ + L + G+RS
Sbjct: 139 KKVIENLSRLLGHPIVYKDKTYYTFPKVQDFIMADIEVLEKSKCGFRS------------ 186
Query: 226 SFKQAKYIT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
KYI + V + + L L+ D + + L ++ GVGPKVA C+ L+S
Sbjct: 187 -----KYIIDAALKVFNDEVN-----LFELQLYDTYDVRNILMSIRGVGPKVADCVMLYS 236
Query: 285 LDQHHAIPVDTHVWKIA 301
+ ++ A P D + ++
Sbjct: 237 IGRYEAFPTDVWIKRVV 253
>gi|387819567|ref|YP_005679914.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum H04402 065]
gi|322807611|emb|CBZ05186.1| 8-oxoguanine-DNA-glycosylase [Clostridium botulinum H04402 065]
Length = 305
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEGSYIGVAYGKVIEVEK-ANNDVILH---NATEDDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QD E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDSFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
AKYI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --AKYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|212696739|ref|ZP_03304867.1| hypothetical protein ANHYDRO_01281 [Anaerococcus hydrogenalis DSM
7454]
gi|212676238|gb|EEB35845.1| hypothetical protein ANHYDRO_01281 [Anaerococcus hydrogenalis DSM
7454]
Length = 301
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 120/257 (46%), Gaps = 28/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ F + + +T +I+LK + GD I + SE D+ ++G
Sbjct: 34 FTCGQCFNFFEEEDGSFTAVFLGKIINLK--KEGDKI--IIDNVSEEEFYDIFYDYFDLG 89
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ G++ E S D + +Y +G R+L Q+ E L+ F+ S+NN I RI K V ++
Sbjct: 90 VDYGKIKEKISV-DETLKKATEYGSGIRILNQEFFETLISFIISANNQIPRIKKAVRIIS 148
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ G ++G G +++ FP+ E+LS +LR ++ + F +R + AK
Sbjct: 149 EMYGDYIGEYRGRKYYSFPNPEQLSKARPEDLR---------EKARVGFRDKRIVETAKI 199
Query: 233 IT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I G D E + + DL++ L LPGVGPKVA CI LF+ +
Sbjct: 200 INDGFFDF---------EKDIKMPTEDLRKK---LQELPGVGPKVADCILLFAFHKRETF 247
Query: 292 PVDTHVWKIATRYLLPE 308
PVD + ++ + E
Sbjct: 248 PVDVWIKRVMEFLFIKE 264
>gi|330836821|ref|YP_004411462.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta coccoides
DSM 17374]
gi|329748724|gb|AEC02080.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta coccoides
DSM 17374]
Length = 267
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 121/268 (45%), Gaps = 33/268 (12%)
Query: 42 NLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPA 101
N S LSL TF GQ FRW + ++ G G + L+ G+ +H
Sbjct: 3 NTVSSYLSLSGTFSCGQCFRWHQDEAGRWCGVAGGNAYYLEQRMVGNPSSILH------- 55
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
A S L + + + + ++ D A + G R+LRQD E LL F+ S NNN
Sbjct: 56 ADSFLRSYFALDMDYDSILRDIASRDRHLAHAVENSPGIRILRQDAFETLLSFIISQNNN 115
Query: 162 IARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RI+ ++ L + G + + E+ FP + L+ VSE E R+ G G+R AP L+
Sbjct: 116 IPRISGIIGRLCKAWGRRIHDGTDIEY-AFPLPQALAEVSEQEFRDIGAGFR-APY--LV 171
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
+VRR + G +D L + L + EA L + GVGPKVA C+
Sbjct: 172 DAVRR------VLDGRLD------------LQIVPSLPISEARSVLQEVRGVGPKVAECV 213
Query: 281 ALFSLDQHHAIPVDTHVW-KIATRYLLP 307
L+ A P+D VW K A L P
Sbjct: 214 LLYGCGHLEAFPLD--VWMKRAMVELFP 239
>gi|420157408|ref|ZP_14664243.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Clostridium sp. MSTE9]
gi|394756226|gb|EJF39340.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Clostridium sp. MSTE9]
Length = 278
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 88/271 (32%), Positives = 129/271 (47%), Gaps = 32/271 (11%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+L L T GQ FRW+ +Y G +++ QN V +H T+ ++
Sbjct: 17 DLDLLQTLDCGQCFRWQVLPGGEYRGIACGRTLTITE-QNETVLFHGVTAAE---FETTW 72
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ + ++ + E + + A++ G R+L+QDP E L F+ S NNNI RI
Sbjct: 73 IPYFDLDFDYAAVRESLAQMHPALRQAAEFAPGIRLLQQDPWEALCSFILSQNNNIPRIK 132
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+VD L L LG E F++FPS ERL++ + L AP VR
Sbjct: 133 GLVDRLCQL---LGE-EHDGFYDFPSPERLAVQTVDSL---------AP-------VRSG 172
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
F+ AKY+ +D Q K +GG L +L +L + EA +L + GVG KVA C L+ L
Sbjct: 173 FR-AKYL---IDAAQ-KVAGGEVDLEALSRLPIDEARQSLMKIYGVGEKVAECALLYGLH 227
Query: 287 QHHAIPVDTHVW-KIATRYLLPELAGVRLTP 316
+ A P+D VW K A LLP +L P
Sbjct: 228 RLEAFPMD--VWMKRAMAVLLPGYTPQQLGP 256
>gi|307243711|ref|ZP_07525851.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptostreptococcus stomatis DSM
17678]
gi|306492920|gb|EFM64933.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptostreptococcus stomatis DSM
17678]
Length = 291
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 114/258 (44%), Gaps = 28/258 (10%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRW + YTG +I++ + GD HI + E ++ ++ ++
Sbjct: 23 VFDCGQCFRWIRQEDGSYTGVAMNRVININ--KEGDSI-HIDNTNLEDF-ENIWYEYFDL 78
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G L + + D + ++ G R+L+QD E L+ F+ SSNN I I + ++ +
Sbjct: 79 ARDYGALKKDLAKHDENLKKAVEFGPGIRILKQDGWEMLISFIISSNNRIPMIQRAINNI 138
Query: 173 AS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
+ G +G G +++ FPS E LS S +LR+ G+R K
Sbjct: 139 SERYGHKIGTYRGKDYYAFPSPEELSRASIEDLRDCKTGFRD-----------------K 181
Query: 232 YITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
YI T VL K L + +D L GVG KVA CIALF + ++ +
Sbjct: 182 YIYHTTRAVLDEKID-----LKAFVDMDQDTCHKELMKFKGVGAKVADCIALFGMRKYQS 236
Query: 291 IPVDTHVWKIATRYLLPE 308
PVD V ++ + E
Sbjct: 237 FPVDVWVKRVMQEFYGAE 254
>gi|168186042|ref|ZP_02620677.1| 8-oxoguanine-DNA-glycosylase [Clostridium botulinum C str. Eklund]
gi|169296132|gb|EDS78265.1| 8-oxoguanine-DNA-glycosylase [Clostridium botulinum C str. Eklund]
Length = 292
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
LP F GQ FRW + Y G +I ++ +N + Y + +E +
Sbjct: 21 FELPHIFDCGQCFRWNRQDNSNYIGVAFNKVIEVEKRENDVIIY----NTNEEEFEKIWS 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + + F D + ++ G R+L+Q+P E ++ F+ S+NN I I +
Sbjct: 77 DYFDLYRDYSTIKDIFK-KDPLLKKSVEFGHGIRLLKQEPFELVISFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ + +G ++ FP++++L + +L + G G+R
Sbjct: 136 AILNISKMWGDKLEYKGKIYYAFPNVQQLKDCTIEQLSDCGVGFR--------------- 180
Query: 228 KQAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
AKYI T+ D++ G L ++ L+ E L + GVGPKVA CI LFS++
Sbjct: 181 --AKYIYKTIQDII-----NGTINLEHIKSLNDDECHKELQKISGVGPKVADCIMLFSME 233
Query: 287 QHHAIPVDTHVWKIATR-YLLPELA 310
++ A PVD V + YL P+++
Sbjct: 234 KYTAFPVDVWVKRAMQHFYLAPDVS 258
>gi|365986406|ref|XP_003670035.1| hypothetical protein NDAI_0D04790 [Naumovozyma dairenensis CBS 421]
gi|343768804|emb|CCD24792.1| hypothetical protein NDAI_0D04790 [Naumovozyma dairenensis CBS 421]
Length = 376
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 124/269 (46%), Gaps = 42/269 (15%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW-------KKTGPLQYTGPIGPHLISLKHLQNGDV 89
K+ + + + ELSL GQ FRW T ++ T ++ L + +
Sbjct: 3 KFHQIKVNKDELSLTNVLQAGQAFRWVFNEKENHYTTTMKVTESNKYFIVVLSQPADDLI 62
Query: 90 CYHIHTSPS---EPAA-KSALLDFLNMGISLGELW-EGFSASDCRFAELAKYLAGARVLR 144
+ TSP +P +S L + + +SL L + D RF ++ + G R+L
Sbjct: 63 EF---TSPDLSLDPEHLRSHLERYFRLDVSLNNLLLNEWIPKDHRFKTISAH--GVRILA 117
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLS-LVSEV 202
Q+P E L+ F+CSSNNNIARITKM L + G +G +G +F FPS + ++ +E
Sbjct: 118 QEPWETLISFICSSNNNIARITKMCHSLCTNYGEEVGVFDGQKFFSFPSSDVIAERATET 177
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-DVLQSKHSGGAEWLLSL-RKLD-- 258
LR+ GFGYR AKYI T V + + G E +L RKL
Sbjct: 178 ALRDLGFGYR-----------------AKYIIETAKKVSKVRADGNYENDTALFRKLQSS 220
Query: 259 --LQEAIDALCTLPGVGPKVAACIALFSL 285
E + L GVGPKVA C+ L L
Sbjct: 221 FSYIEMREHLMGYTGVGPKVADCVCLMGL 249
>gi|443897285|dbj|GAC74626.1| 8-oxoguanine DNA glycosylase [Pseudozyma antarctica T-34]
Length = 536
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 113/262 (43%), Gaps = 68/262 (25%)
Query: 35 PSKWTPLNLTQSELSLPLTFPT--GQTFRWKKTG----------------PLQYTGPIGP 76
P+ + L S++ LPLT GQ FRW+ ++++ +
Sbjct: 27 PAGYAALRAAPSQILLPLTVSNKCGQAFRWRANKVWVPTASAPSSGGWDEQIEWSLCLAD 86
Query: 77 HLISLKHLQNGDVCYHIHTSPSEPA--------------AKSALLDFLNMGISLGELWEG 122
++ L+ ++ YH PS + + L D+LN+ + L L+
Sbjct: 87 RVVLLRQDEHRGFLYHKTLLPSTSSRPVDGANDAQTIRETERWLKDYLNLDVPLEALYAE 146
Query: 123 FSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD-----FLASLGS 177
+ A D FA A +G R+LRQDP ECL F+CSSNNNIARI +MV F +L
Sbjct: 147 WEAKDAVFARFATRFSGLRMLRQDPWECLCAFVCSSNNNIARIGQMVQNLCTHFSPALLE 206
Query: 178 HL---------GNVEGFE----FHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSV 223
H+ G VE E +H FPS E L+ E +LR GFGYR
Sbjct: 207 HVYAAPPPTVAGEVEQGEVKIVYHPFPSPEALAKPGVEEKLRELGFGYR----------- 255
Query: 224 RRSFKQAKYITGTVDVLQSKHS 245
AKY+ T +L +H+
Sbjct: 256 ------AKYLAKTAQMLCEQHA 271
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
+L LR L A L PGVGPKVA CI L SLDQ +IPVD HV++ A ++
Sbjct: 358 YLQHLRTLSYHSARQELIQFPGVGPKVADCILLMSLDQASSIPVDRHVFQFAEKWY---- 413
Query: 310 AGVRLTPKLCSRVAEAF 326
RL K +A+ F
Sbjct: 414 ---RLRTKRYEEIADYF 427
>gi|168179050|ref|ZP_02613714.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum NCTC
2916]
gi|182670030|gb|EDT82006.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum NCTC
2916]
Length = 305
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT Y G +I ++ N DV H + +E K+
Sbjct: 21 FELAHIFECGQCFRWYKTEEDSYIGVAYGKVIEVEK-ANNDVILH---NATEDDFKNIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D A+ ++ G R+L+QD E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYSEI-KNILSKDEILAKSVEFGHGIRLLKQDSFEIIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP++E+L +E EL+ G+R
Sbjct: 136 AIKNISERWGDPIEYKGNIYYSFPTVEQLKDATEDELKACSVGFR--------------- 180
Query: 228 KQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIA 281
A+YI TV+ + Q+ ++ L ++ D +C G+G KVA C+
Sbjct: 181 --ARYIKDTVNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVM 238
Query: 282 LFSLDQHHAIPVDTHVWK-IATRYLLPELA 310
LFS++++ A PVD V + + YL P+++
Sbjct: 239 LFSMEKYSAFPVDVWVKRAMQYFYLAPDVS 268
>gi|289577790|ref|YP_003476417.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacter
italicus Ab9]
gi|297544078|ref|YP_003676380.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter mathranii subsp. mathranii str. A3]
gi|289527503|gb|ADD01855.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoanaerobacter
italicus Ab9]
gi|296841853|gb|ADH60369.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
Length = 297
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 27/273 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L TF GQ FRW + YTG ++++K + D+ +T ++
Sbjct: 18 EDFNLKETFECGQCFRWNEEEDGSYTGVAYDRVVNVKLEE--DMLIIDNTDLNDFY--DI 73
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G ++ E S D E +Y G R+LRQD E L+ F+ S NN I +I
Sbjct: 74 WFDYFDLGRDYKKIKENLS-RDPILKEAIQYGQGIRILRQDTWETLVSFIISQNNRIPQI 132
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ LAS +G ++ FP E L + + G+R
Sbjct: 133 KKVIENLASSFGEPIEYKGKIYYTFPKAEELVMFDVDTIAKTKCGFR------------- 179
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI +D SG + LL L + E D L + GVGPKVA C+ L+S+
Sbjct: 180 ----AKYI---LDAASKVFSGEID-LLKLFEYSTNEIRDILMNINGVGPKVADCVILYSI 231
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
++ P D + +I YL + G L +L
Sbjct: 232 GRYDTFPTDVWIKRIV-EYLYLKREGTPLEIQL 263
>gi|256750867|ref|ZP_05491751.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter
ethanolicus CCSD1]
gi|256750202|gb|EEU63222.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter
ethanolicus CCSD1]
Length = 302
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 27/273 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L TF GQ FRW + YTG +I++K + D+ +T ++
Sbjct: 23 EDFNLKETFECGQCFRWNEEEDGSYTGIAYDRVINVKLEE--DMLIIDNTDLNDFY--DI 78
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G ++ E S D E +Y G R+LRQD E L+ F+ S NN I +I
Sbjct: 79 WFDYFDLGRDYKQIKENLS-RDPILKEAIQYGQGIRILRQDTWETLVSFIISQNNRIPQI 137
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ LAS +G ++ FP E L + + G+R
Sbjct: 138 KKVIENLASSFGEPIEYKGKIYYTFPKAEELVMFDVETIAKTKCGFR------------- 184
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI +D SG + LL L + + D L + GVGPKVA C+ L+S+
Sbjct: 185 ----AKYI---LDAASKVFSGEID-LLKLFEYSTNDIKDILMNINGVGPKVADCVILYSI 236
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
++ P D + +I YL + G L +L
Sbjct: 237 GRYDTFPTDVWIKRIV-EYLYLKREGTPLEIQL 268
>gi|167038123|ref|YP_001665701.1| 8-oxoguanine DNA glycosylase [Thermoanaerobacter pseudethanolicus
ATCC 33223]
gi|167039165|ref|YP_001662150.1| 8-oxoguanine DNA glycosylase [Thermoanaerobacter sp. X514]
gi|300913243|ref|ZP_07130560.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter sp.
X561]
gi|307723743|ref|YP_003903494.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter sp. X513]
gi|320116532|ref|YP_004186691.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter brockii subsp. finnii Ako-1]
gi|166853405|gb|ABY91814.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter sp.
X514]
gi|166856957|gb|ABY95365.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter
pseudethanolicus ATCC 33223]
gi|300889928|gb|EFK85073.1| 8-oxoguanine DNA glycosylase domain protein [Thermoanaerobacter sp.
X561]
gi|307580804|gb|ADN54203.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter sp. X513]
gi|319929623|gb|ADV80308.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacter brockii subsp. finnii Ako-1]
Length = 297
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 121/273 (44%), Gaps = 27/273 (9%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L TF GQ FRW + YTG +I++K + D+ +T ++
Sbjct: 18 EDFNLKETFECGQCFRWNEEEDGSYTGVAYDRVINVKLEE--DMLIIDNTDLNDFY--DI 73
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
D+ ++G ++ E S D E +Y G R+LRQD E L+ F+ S NN I +I
Sbjct: 74 WFDYFDLGRDYKQIKENLS-RDPILKEAIQYGQGIRILRQDTWETLVSFIISQNNRIPQI 132
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+++ LAS +G ++ FP E L + + G+R
Sbjct: 133 KKVIENLASSFGEPIEYKGKIYYTFPKAEELVMFDVETIAKTKCGFR------------- 179
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
AKYI +D SG + LL L + + D L + GVGPKVA C+ L+S+
Sbjct: 180 ----AKYI---LDAASKVFSGEID-LLKLFEYSTNDIRDILMNINGVGPKVADCVILYSI 231
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
++ P D + +I YL + G L +L
Sbjct: 232 GRYDTFPTDVWIKRIV-EYLYLKREGTPLEIQL 263
>gi|408404048|ref|YP_006862031.1| 8-oxoguanine DNA glycosylase [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408364644|gb|AFU58374.1| 8-oxoguanine DNA glycosylase [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 302
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 122/277 (44%), Gaps = 31/277 (11%)
Query: 52 LTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLN 111
++ +GQ F W+K G Y G G ++ + +G + + P +
Sbjct: 26 VSINSGQVFLWEKRGNSWY-GVHGDRIVRFAQMIDGHAEF-----AAFPEDRLCEQKMFR 79
Query: 112 MGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDF 171
MG + ++ S D L + G R++RQ+P +CL F+C+SN NI I +M+
Sbjct: 80 MGDDVRAIFSEIS-RDPLVRRLVRTYPGLRLMRQEPHQCLFSFVCASNTNIPMIRRMLYT 138
Query: 172 LASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
L +V+G EF FPS ++ S ELR G GYR+ +
Sbjct: 139 LTRKFGRPVHVDGIEFFTFPSASDINRASIDELRACGLGYRAKAIKA---------AAGA 189
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
TG +D K +G +EA L + GVG K+A C+ LFSL++ A
Sbjct: 190 IATGQLDFDVLKSTG------------YEEAKKELLQVYGVGNKIADCVLLFSLEKLDAF 237
Query: 292 PVDTHVWKIAT---RYLLPELAGVRLTPKLCSRVAEA 325
P+D + + R+L + G ++TP +++E+
Sbjct: 238 PIDVWIARALAGHYRWLHKKKFGDKITPHQYEQLSES 274
>gi|306819770|ref|ZP_07453425.1| 8-oxoguanine DNA glycosylase [Eubacterium yurii subsp. margaretiae
ATCC 43715]
gi|304552139|gb|EFM40075.1| 8-oxoguanine DNA glycosylase [Eubacterium yurii subsp. margaretiae
ATCC 43715]
Length = 298
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 116/256 (45%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW+ YT +I++ +++ ++ I + S + + + ++
Sbjct: 29 FECGQCFRWEAEDDGSYTAIAFDKVINVS-IEDDNI---ILKNSSVDDVEKLWISYFDLN 84
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E S D A++ G R+L QD E ++ F+ S+ N+I I K V+ L+
Sbjct: 85 TDYSAIKEKLSGIDEYLKASAEFGYGIRILHQDFHEMIISFIISARNSIPMIKKSVETLS 144
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
SLG H+ G +F+ FP+++ L +R+A +R A Y
Sbjct: 145 QSLGQHIDTFNGKKFYSFPTMQALKDADIQLIRDAKTAFR-----------------APY 187
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I T + + +++ KL L E + L L GVG KVA CIALF L ++ A P
Sbjct: 188 IQKTASYIVENNIKSSDF----DKLSLDECSEKLQELQGVGAKVADCIALFGLAKYDAFP 243
Query: 293 VDTHVWKIATRYLLPE 308
VD V ++ + L +
Sbjct: 244 VDVWVKRVIGEFYLQD 259
>gi|256544392|ref|ZP_05471767.1| 3-methyladenine DNA glycosylase [Anaerococcus vaginalis ATCC 51170]
gi|256399924|gb|EEU13526.1| 3-methyladenine DNA glycosylase [Anaerococcus vaginalis ATCC 51170]
Length = 301
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 120/257 (46%), Gaps = 28/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ F + + +T +I+LK + GD I + SE D+ ++G
Sbjct: 34 FTCGQCFNFFEEEDGSFTAVFLGKIINLK--KEGDKI--IIENVSEEEFYDIFYDYFDLG 89
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
++ ++ E S D E +Y +G R+L Q+ E L+ F+ S+NN I RI K V ++
Sbjct: 90 VNYEDIKEKISL-DKTLKEATEYGSGIRILNQEFFETLISFIISANNQIPRIKKAVRIIS 148
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ G ++G G +++ FP+ E+LS +LR ++ + F +R + AK
Sbjct: 149 EMYGDYIGEYRGRKYYSFPNPEQLSKARPEDLR---------EKARVGFRDKRIVQTAKI 199
Query: 233 IT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I G D E + + DL++ L LPGVGPKVA CI LF+ +
Sbjct: 200 INDGFFDF---------EKDIKMPTEDLRKK---LQELPGVGPKVADCILLFAFHKRETF 247
Query: 292 PVDTHVWKIATRYLLPE 308
PVD + ++ + E
Sbjct: 248 PVDVWIKRVMEFLFIKE 264
>gi|390935699|ref|YP_006393204.1| 8-oxoguanine DNA glycosylase [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
gi|389571200|gb|AFK87605.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Thermoanaerobacterium saccharolyticum JW/SL-YS485]
Length = 304
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 118/256 (46%), Gaps = 28/256 (10%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L T GQ FRW + YTG +I++K +GD+ +T+ ++
Sbjct: 25 DFNLKETLECGQCFRWNEEDDGSYTGVAFDRVINVK--LDGDILTIDNTTLAD--FNDIW 80
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D+ ++G G++ E S + A + KY G R+LRQD E L+ F+ S NN I +I
Sbjct: 81 YDYFDLGRDYGKIKETLSQDEILKAAI-KYGEGIRILRQDTWETLISFIISQNNRIPQIK 139
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K+++ L+ L H ++ FP ++ + L + G+RS
Sbjct: 140 KVIENLSRLLGHPIVYRDKTYYTFPKVQDFIMADLELLSKSKCGFRS------------- 186
Query: 227 FKQAKYIT-GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
KYI + V + + L L+ D + + L ++ GVGPKVA C+ L+S+
Sbjct: 187 ----KYIIDAALKVFNDEVN-----LFELQLYDTYDVRNILMSIRGVGPKVADCVMLYSI 237
Query: 286 DQHHAIPVDTHVWKIA 301
++ A P D + ++
Sbjct: 238 GRYEAFPTDVWIKRVV 253
>gi|116192499|ref|XP_001222062.1| hypothetical protein CHGG_05967 [Chaetomium globosum CBS 148.51]
gi|88181880|gb|EAQ89348.1| hypothetical protein CHGG_05967 [Chaetomium globosum CBS 148.51]
Length = 436
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 81/160 (50%), Gaps = 21/160 (13%)
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSS 158
E S L + + +S+ L++ ++ASD FA A G R+L Q E L+ F+CSS
Sbjct: 76 EDDTASLLHSYFALSLSVASLYKQWAASDANFARRAPAFTGIRILNQPAWEALVAFICSS 135
Query: 159 NNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQ 216
NNNI+RI++MV L G ++ +EG FH+FP E LS E LR GFGYR
Sbjct: 136 NNNISRISQMVQKLCIYYGPYVATIEGEPFHDFPGPEALSGDQVEAHLRQLGFGYR---- 191
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
A+YI T ++ S+ G WLL LR
Sbjct: 192 -------------ARYIVETARLVSSEKPAG--WLLQLRN 216
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 32/44 (72%)
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ A +AL TLPGVGPKV+ C+ L L A+PVDTHVW+IA R
Sbjct: 278 RAAHEALLTLPGVGPKVSDCVCLMGLGWWEAVPVDTHVWQIAQR 321
>gi|417965054|ref|ZP_12606667.1| 8-oxoguanine DNA glycosylase domain-containing protein, partial
[Candidatus Arthromitus sp. SFB-4]
gi|380339104|gb|EIA27901.1| 8-oxoguanine DNA glycosylase domain-containing protein, partial
[Candidatus Arthromitus sp. SFB-4]
Length = 248
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 120/271 (44%), Gaps = 65/271 (23%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ L F GQ FRW++T Y +I L+ + +GD+ H S
Sbjct: 19 NNFVLKHIFENGQCFRWERTNIGTYIIVAKGRVIELE-MDSGDLLIH----------NSN 67
Query: 106 LLDFLNMGISLGELWEGFSASDCRF-AELA--KYLA-------GARVLRQDPVECLLQFL 155
+ DF N+ I+ + F C+ EL KYL G R+L QDP E +L F+
Sbjct: 68 VDDFENIWIN----YFDFERDYCKLKGELKHDKYLDIAINFGHGLRILNQDPFEMILSFI 123
Query: 156 CSSNNNIARITKMVDFLASLGSHLGN---VEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
SSNN I I K + S+ G+ +G +++FPSLE L+ +S+ + R G+R
Sbjct: 124 ISSNNRIPMIKKAI---LSISEKYGDPISYKGRIYYKFPSLENLANLSQEKFRECSVGFR 180
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC----- 267
KY+ TV +L + E++++L D +C
Sbjct: 181 D-----------------KYLYNTVKLLNEDND--IEYIMNLN--------DDMCHKELQ 213
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
GVG KV+ CI LFS+ ++ A PVD VW
Sbjct: 214 KFSGVGSKVSDCIMLFSMKKYSAFPVD--VW 242
>gi|253580499|ref|ZP_04857764.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251848229|gb|EES76194.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length = 268
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 113/272 (41%), Gaps = 55/272 (20%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L + L +GQ FR + G +Y G + L + G V + P E
Sbjct: 2 LTIEMDNFDLGQICRSGQCFRMDQIGDDRYRVIAGDKYLELTQ-ERGIVNFF---CPEE- 56
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELA-------KYL-------AGARVLRQD 146
DF+ W + DC ++E KYL +G R+L+QD
Sbjct: 57 -------DFIF-------FWIRYFDLDCDYSEYINMINPRDKYLTAAGEMGSGIRILQQD 102
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELR 205
E ++ FL S NNI RI K ++ ++ G + G E++ FP+ E L+L +E +LR
Sbjct: 103 LWEMIISFLISQQNNITRIKKCIENISREFGVRKTSSTGAEYYAFPTAEALALATEEQLR 162
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
GYR AKY+ T K G L SL + + A
Sbjct: 163 ECNLGYR-----------------AKYVLDTA----RKVCFGDISLNSLHDMTYKAARKE 201
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
L L GVG KVA CI LF L Q A PVDTH+
Sbjct: 202 LLGLYGVGEKVADCICLFGLHQLDAFPVDTHI 233
>gi|350565486|ref|ZP_08934248.1| DNA-(apurinic or apyrimidinic site) lyase [Peptoniphilus indolicus
ATCC 29427]
gi|348663730|gb|EGY80281.1| DNA-(apurinic or apyrimidinic site) lyase [Peptoniphilus indolicus
ATCC 29427]
Length = 291
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 121/267 (45%), Gaps = 40/267 (14%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW+ YT ++++K + D ++T+ E + D+ ++
Sbjct: 28 FECGQAFRWEAEEDGSYTTVAFGRVLNVKVV---DGVVELNTTIEE--FEKIWRDYFDLD 82
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E S + + +Y G R+L QDP E ++ F+ S N+NI RI K V+F+A
Sbjct: 83 RDYLSMIEELSFEET-LKDAVEYGRGIRILNQDPFETIISFIISQNSNITRIKKAVNFIA 141
Query: 174 S-LGSHLGNVEGFEFH-EFPSLERLSLVSEVELR-NAGFGYRSAPQSSLLFSVRRSFKQA 230
G + E H FP+ E L+ S +LR A GYR
Sbjct: 142 DKYGEKID-----ETHSRFPTPEELAKASVEDLREKARVGYRD----------------- 179
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
+YI + ++ S G + LR D++ A L LPGVGPKV CI LF+ + +
Sbjct: 180 RYIVESAQMI----SNGMLDMELLRTADIETARKELMKLPGVGPKVCDCILLFAFKRSES 235
Query: 291 IPVDTHVWKIATRYLLPELAGVRLTPK 317
PVD VW + ++ EL + TPK
Sbjct: 236 FPVD--VW---IKRVMEELYLGKATPK 257
>gi|268324685|emb|CBH38273.1| putative 8-oxoguanine DNA glycosylase [uncultured archaeon]
Length = 288
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 116/256 (45%), Gaps = 27/256 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+S +L T GQ FRW K Y G + ++ ++ Q GD H P E
Sbjct: 4 ESHFNLDYTLSCGQVFRWGKANGWWY-GIVDGSILKVR--QEGDEL-HFSAYP-EDVEDE 58
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ + + L + + D + + G R++RQ +CL+ ++C++N +IA
Sbjct: 59 FIKSYFRLDDDLTNILRIIN-KDATINKAIQQFNGLRIVRQSVWDCLISYVCATNASIAI 117
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I M+ L+ + V G F FP +++L+ S +++ GYR+ S +
Sbjct: 118 IESMLRNLSEMFGDEIVVNGKAFFSFPKVKKLAKASVNKIKLCNVGYRARFLSEI----- 172
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AK + ++L+ LR D + D L +LPGVGPKVA C++LF+
Sbjct: 173 -----AKQVENNPNLLEE-----------LRNSDYLKLRDELRSLPGVGPKVADCVSLFA 216
Query: 285 LDQHHAIPVDTHVWKI 300
D+ A P+D + ++
Sbjct: 217 FDKLEAFPIDVWIRRV 232
>gi|359416793|ref|ZP_09209067.1| 8-oxoguanine DNA glycosylase [Candidatus Haloredivivus sp. G17]
gi|358032881|gb|EHK01512.1| 8-oxoguanine DNA glycosylase [Candidatus Haloredivivus sp. G17]
Length = 284
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 119/263 (45%), Gaps = 38/263 (14%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH------LQNGDVCYHIHTSPSEP 100
E +L LT GQTF W + Y G G + + Q+GD + S P
Sbjct: 10 EFNLELTLTCGQTFCWHRIEGDLY-GDGGNKFYTFRKDEPIIAHQDGDT---VTVETSLP 65
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
+S + + L + L +++ F D + K L G R+++ + CL+ +LCS
Sbjct: 66 --RSEVEEALGLDRDLEKVFSTF-PDDEKLQYAMKELKGLRIVQDEFFPCLISYLCSPQM 122
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RI +M + LA G E +FP+LE+L SE +LR G GYR
Sbjct: 123 RIPRIKQMHNKLAENYGEAVEFRGEELLKFPTLEQLLQASEEDLRELGLGYR-------- 174
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
AKY++ +V++LQ + ++ D + A + L L GVG KVA C+
Sbjct: 175 ---------AKYVSNSVEILQKEGLAVSD--------DYEVAREDLKRLYGVGDKVADCV 217
Query: 281 ALFSLDQHHAIPVDTHVWKIATR 303
LFSL + A P+DT K+ +
Sbjct: 218 LLFSLGFYEAYPLDTWALKVLEK 240
>gi|227485323|ref|ZP_03915639.1| 8-oxoguanine DNA glycosylase [Anaerococcus lactolyticus ATCC 51172]
gi|227236614|gb|EEI86629.1| 8-oxoguanine DNA glycosylase [Anaerococcus lactolyticus ATCC 51172]
Length = 300
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 117/263 (44%), Gaps = 30/263 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L L + + F GQ F ++K +T +I+L Y + + S
Sbjct: 20 LILREDSFKVDHIFECGQCFNFRKNDDGSFTAVFLGKIINLLECDG----YTLVRNVSLD 75
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
D+ ++G G + + S S+ E + Y G R+L Q+ E + F+ S+NN
Sbjct: 76 DFYDIFYDYFDLGTDYGAIKQALSESEI-LKEASDYGYGIRILNQELFETTISFIISANN 134
Query: 161 NIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSS 218
I RI K V ++ G++LG G +++ FPS E L+ V +ELR A G+R
Sbjct: 135 QIPRIKKAVRIISERYGTYLGEYMGEKYYSFPSPEVLANVDPLELREYARVGFRDV---- 190
Query: 219 LLFSVRRSFKQAK-YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
R + AK ++ G ++ K +L DL + L LPG+GPKVA
Sbjct: 191 ------RIVETAKAFVDGFLNFEDEK---------TLTDKDLH---NKLNNLPGIGPKVA 232
Query: 278 ACIALFSLDQHHAIPVDTHVWKI 300
CI LF+ + PVD + ++
Sbjct: 233 DCIMLFAYHRRETFPVDVWIKRV 255
>gi|366163868|ref|ZP_09463623.1| 8-oxoguanine DNA glycosylase-like protein [Acetivibrio
cellulolyticus CD2]
Length = 294
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 119/270 (44%), Gaps = 30/270 (11%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
TF GQ FRW + YTG ++++ + +G + I + K+ D+ ++
Sbjct: 28 TFECGQCFRWIRQDDGSYTGVARGKVLNV-SVDDGVL---ILKNTDLDDFKNIWFDYFDL 83
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G G++ E D E A + G R+L+QD E L+ F+ S+NN I RI K V +
Sbjct: 84 GRDYGKIKEAI-MKDNTMKEAAAFGWGIRLLKQDIWEALISFIISANNRIPRIMKTVGAI 142
Query: 173 ASLGSHLGNVEGFEFHEFPSLERL--SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+ + ++G ++ FP +++L S + ++E+ GF + SSL+ S
Sbjct: 143 SKVYGVELEMDGKSYYSFPEVDKLTQSSIEDLEVCKGGFRCKYILNSSLMIS-------- 194
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
G + L ++ ++D A + L PGVGPKVA C+ L+S +
Sbjct: 195 ---EGDIS------------LGNISQMDTDMAREELMRFPGVGPKVADCVLLYSGTKFDV 239
Query: 291 IPVDTHVWKIATRYLLPELAGVRLTPKLCS 320
P D V ++ AG + K
Sbjct: 240 FPTDVWVKRVMEELYFKRDAGFKEIQKFAG 269
>gi|386876019|ref|ZP_10118160.1| 8-oxoguanine DNA-glycosylase (ogg) [Candidatus Nitrosopumilus
salaria BD31]
gi|386806162|gb|EIJ65640.1| 8-oxoguanine DNA-glycosylase (ogg) [Candidatus Nitrosopumilus
salaria BD31]
Length = 280
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 118/257 (45%), Gaps = 35/257 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+++ + +GQ F WK+ Y G G ++ + L G + + +++
Sbjct: 7 INVENSINSGQVFLWKENDQYWY-GVNGQDILRVDKL--GKI----------KSYQNSKT 53
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
DF ++ ++ S + KY+ G R+L QDP +CL+ F+ SSN+NI +I
Sbjct: 54 DFFRKKDNMEKIISSISRDSTTKEAVKKYI-GLRLLDQDPFQCLISFIVSSNSNIQKIKT 112
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ E EF FP E+L+ S E+ + G GYR
Sbjct: 113 SLEKISKKFGTKVQFENQEFFLFPKPEKLAKASINEITDCGVGYR--------------- 157
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AK+I ++ K L+K + +A + +C +PGVG KVA CI LFSL++
Sbjct: 158 --AKFIKEAANMSFLKEID----FEYLKKSNYHDAKENICLVPGVGNKVADCILLFSLNK 211
Query: 288 HHAIPVDTHVWKIATRY 304
A P+D + +I +Y
Sbjct: 212 LEAFPLDRWMIRILEKY 228
>gi|336430585|ref|ZP_08610529.1| hypothetical protein HMPREF0994_06535 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336017494|gb|EGN47254.1| hypothetical protein HMPREF0994_06535 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 317
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 110/283 (38%), Gaps = 46/283 (16%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG---------------DVCY 91
+ + + +GQ FR G Y G HL+ ++ G V
Sbjct: 8 DFDMRMLMDSGQVFRICCCGENTYAVASGRHLVRIRQTACGPSKEGTLSAFGMSGEGVSV 67
Query: 92 HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECL 151
SP E S ++ ++ G + + D Y G R+LRQD E +
Sbjct: 68 EFSCSPQE--FDSYWHNYFDLSTDYGAMKKAVDPKDSFLTSAISYGGGIRILRQDLWETI 125
Query: 152 LQFLCSSNNNIARITKMVDFLASLGSHLGNVEGF----------EFHEFPSLERLSLVSE 201
L FL S NNNI RI VD L EGF +F+ FP E+++
Sbjct: 126 LCFLISQNNNITRIRNSVDALCRRYGERLEPEGFLTGEESLSEKDFYSFPEPEKIAAGGL 185
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
L+ G GYR KYI + + G E+ SL+ D +E
Sbjct: 186 EGLQGLGLGYRD-----------------KYILAMAK--RCCNGSGKEFFHSLQNADYEE 226
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
AI L G+G KVA CI LF+L A PVDTHV +I Y
Sbjct: 227 AIAILTGEFGIGRKVADCICLFALHHIGAFPVDTHVKQILAAY 269
>gi|376263106|ref|YP_005149826.1| 3-methyladenine DNA glycosylase [Clostridium sp. BNL1100]
gi|373947100|gb|AEY68021.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium sp. BNL1100]
Length = 295
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 126/278 (45%), Gaps = 33/278 (11%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L F GQ FRW + Y G G L+++ + N +C T+ E
Sbjct: 22 DFNLTHIFDCGQCFRWIRQEDGSYRGIAGGRLVNVSY-NNEILCI---TNSREQDFIDIW 77
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
++ ++G + ++ + D E K G R+L+QD E L+ F+ S+NN I RI
Sbjct: 78 YEYFDLGTNYSKI-KTILEQDEIMKEAIKTGWGIRLLKQDFWEILISFIISANNMIPRIM 136
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K VD L+ L + + + FP +E LS +S +++ G+R
Sbjct: 137 KTVDTLSVLKGKCIDPQ-LNAYSFPDVETLSNMSLEDIQQCKAGFR-------------- 181
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLS---LRKLDLQEAIDALCTLPGVGPKVAACIALF 283
KYI T ++ A+ +L+ LR +D A L LPGVGPKVA CI LF
Sbjct: 182 ---CKYIHKTSALM-------AQGILTEENLRGMDTAMARKELMKLPGVGPKVADCILLF 231
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSR 321
S ++ P+D V ++ L + +G++ + S+
Sbjct: 232 SGLKYDVFPIDVWVKRVMEELYLKKESGLKEIQQFASK 269
>gi|70917589|ref|XP_732905.1| N-glycosylase/DNA lyase [Plasmodium chabaudi chabaudi]
gi|56504211|emb|CAH82174.1| N-glycosylase/DNA lyase, putative [Plasmodium chabaudi chabaudi]
Length = 168
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 68/121 (56%), Gaps = 20/121 (16%)
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
F F++FPS+E +S + E +LR+ GFGYRS Y+ + +L K
Sbjct: 24 FHFYKFPSIETISNLKESDLRDLGFGYRSG-----------------YVIESAKML--KD 64
Query: 245 SGGAEWLLSLRKLDLQEA-IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GG EW+ L+K +A ID L PGVG KVA CI LF L+++ IP+DTH++ I +
Sbjct: 65 LGGEEWIEDLKKEKKTKACIDKLIKFPGVGLKVANCICLFGLNRYDCIPIDTHIYDIIYK 124
Query: 304 Y 304
Y
Sbjct: 125 Y 125
>gi|207342534|gb|EDZ70271.1| YML060Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 254
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 82/158 (51%), Gaps = 25/158 (15%)
Query: 154 FLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVELRNAGFGY 211
F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +LR GFGY
Sbjct: 7 FICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKLRELGFGY 66
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLDLQEAIDA 265
R AKYI T D ++ + +L S+ + ++ +
Sbjct: 67 R-----------------AKYIIETARKLVNDKAEANITSDTTYLQSICKDAQYEDVREH 109
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 110 LMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 147
>gi|374308069|ref|YP_005054500.1| 3-methyladenine DNA glycosylase [Filifactor alocis ATCC 35896]
gi|291165562|gb|EFE27611.1| 3-Methyladenine DNA glycosylase [Filifactor alocis ATCC 35896]
Length = 296
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 76/270 (28%), Positives = 121/270 (44%), Gaps = 32/270 (11%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
S+ SL + +GQ FR+K+ + H + + GD +T+ E +
Sbjct: 19 SDFSLRQIYESGQAFRFKEVREDTFLVVHQQHWAIVS--EQGDTITLQNTTLQE--CEDV 74
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+ ++ L + F D + + G R+L QD E ++ F+ S+NN+I RI
Sbjct: 75 WNRYFDLERDYHTLKQSFLGLDDYLDKAISFGEGLRILNQDIFEMIITFIISANNHIPRI 134
Query: 166 ----TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
TK+ +F G+ +G +G + FP E++ +S+ + GYR
Sbjct: 135 KNSITKICEFA---GTPIGEYDGEVIYAFPKAEQIQKLSQEQWDGIKLGYRE-------- 183
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
KYI TVD++ ++ A+ L LD EA L L GVG KVA CI
Sbjct: 184 ---------KYIKKTVDMIVNQEIDIAQ----LPSLDTLEAQKELMKLSGVGEKVADCIL 230
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
LFS+ ++ A P+DT + K+ RY E G
Sbjct: 231 LFSMAKYDAFPMDTWMKKVMQRYYPDESMG 260
>gi|440468064|gb|ELQ37247.1| N-glycosylase/DNA lyase [Magnaporthe oryzae Y34]
gi|440489024|gb|ELQ68705.1| N-glycosylase/DNA lyase [Magnaporthe oryzae P131]
Length = 480
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 110/260 (42%), Gaps = 67/260 (25%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHT 95
S+ P++L +EL + T GQ+FRW+K ++ + ++SL+ + H+H
Sbjct: 8 SRMLPVSL--AELCIDTTLRCGQSFRWRKVDD-EWHCSLHGRVVSLRQDET-----HLHY 59
Query: 96 SPSEPAAKSA-------------------------------------LLDFLNMGISLGE 118
+ PA + LL++ N+ SL
Sbjct: 60 RATWPAGRQVSIKSSVKVKREEEDESIKSEDAAAVKVDEEEDDTEALLLNYFNLHHSLSG 119
Query: 119 LWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL--G 176
++ +S D F A G R+L QD E L+ F+CSSNNNIARI++M L G
Sbjct: 120 MYTHWSDRDANFRSKAPKFTGVRILNQDAWETLVAFICSSNNNIARISQMCHKLCDHYGG 179
Query: 177 SHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ V G + +FPS E L+ E LR GFGYR A+YI
Sbjct: 180 EPIATVAGHVYRDFPSPEALAGDGVEAHLRELGFGYR-----------------ARYIAE 222
Query: 236 TVDVLQSKHSGGAEWLLSLR 255
T V+ + G WLLSLR
Sbjct: 223 TARVVARQRPAG--WLLSLR 240
>gi|153953098|ref|YP_001393863.1| 3-methyladenine DNA glycosylase [Clostridium kluyveri DSM 555]
gi|219853749|ref|YP_002470871.1| hypothetical protein CKR_0406 [Clostridium kluyveri NBRC 12016]
gi|146345979|gb|EDK32515.1| Predicted 3-methyladenine DNA glycosylase/8-oxoguanine DNA
glycosylase [Clostridium kluyveri DSM 555]
gi|219567473|dbj|BAH05457.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length = 302
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 117/270 (43%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
LP F GQ FRW + Y G +I ++ ++ Y + E K
Sbjct: 21 FELPHIFDCGQCFRWNREKNGNYIGTAFGKVIEVEKKEDDVFIY----NTCEKDFKEIWC 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ GE+ + D + + G R+L+Q+P E ++ F+ S+NN I I K
Sbjct: 77 EYFDLYRDYGEI-KHILGKDPILKKAVGFGGGIRLLKQEPFELIVSFIISANNRIPMIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ + ++ FP L+RL + E+ G G+R
Sbjct: 136 AIENISKRWGETLEFKDRVYYAFPKLDRLKEATLEEMEACGTGFR--------------- 180
Query: 228 KQAKYITGTVDVL-----QSKHSGGAEWLLSLRKLDLQEAI-DALCTLPGVGPKVAACIA 281
AKYI T+ + +S S E+ + ++ EA L G+GPKVA CI
Sbjct: 181 --AKYIVDTISKIYNNGTKSGESYHEEYDIDWIRMQEDEACHKELQKFMGIGPKVADCIM 238
Query: 282 LFSLDQHHAIPVDTHVWKIATR-YLLPELA 310
LFS+ ++ A PVD V + YL P+++
Sbjct: 239 LFSMQKYSAFPVDVWVKRAMNHFYLAPDVS 268
>gi|118444968|ref|YP_878936.1| 8-oxoguanine DNA glycosylase [Clostridium novyi NT]
gi|118135424|gb|ABK62468.1| 8-oxoguanine-DNA-glycosylase, putative [Clostridium novyi NT]
Length = 292
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 122/268 (45%), Gaps = 35/268 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
LP F GQ FRW + Y G +I ++ +N + Y + +E K
Sbjct: 21 FELPHIFDCGQCFRWNRQENGNYIGVAFNKVIEVEKKENDVIIY----NTNEKEFKEIWC 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + + D + ++ G R+L+QDP E ++ F+ S+NN I I +
Sbjct: 77 EYFDLYRDYSTI-KDILKKDPLLKKSVEFGHGIRLLKQDPFELVISFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGN---VEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+ ++ GN +G ++ FP++++L + +L G G+R
Sbjct: 136 AI---LNISKKWGNELEYKGKTYYSFPNVQQLKDSTIEQLSECGVGFR------------ 180
Query: 225 RSFKQAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
AKYI T+ D+++ L ++ L+ E L + GVGPKVA CI LF
Sbjct: 181 -----AKYIYKTIQDIIEETID-----LDYIKSLNDDECHKELQKISGVGPKVADCIMLF 230
Query: 284 SLDQHHAIPVDTHVWKIATR-YLLPELA 310
S++++ A PVD V + YL P+++
Sbjct: 231 SMEKYTAFPVDVWVKRAMQHFYLAPDVS 258
>gi|401428575|ref|XP_003878770.1| putative 8-oxoguanine DNA glycosylase [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495019|emb|CBZ30322.1| putative 8-oxoguanine DNA glycosylase [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 565
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 120/297 (40%), Gaps = 97/297 (32%)
Query: 106 LLDFLNMGISLGELWEGFSAS-DCRFAELAKYLAGARV---------------------- 142
L +L++ + L +LW+ ++ S + R L +YL G R+
Sbjct: 157 LSRYLSLDVDLDKLWQEWTDSPETRRHPLVEYLVGNRLERQFLGSGHDQRVCEHDEAQAN 216
Query: 143 -------LRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNV------------ 182
+RQD CL FLCS NNN+ RIT M+ L+ + G HL +V
Sbjct: 217 LYIPIRHVRQDLHSCLFSFLCSQNNNVTRITGMIYALSRAYGDHLCDVQLATGEVRAPRT 276
Query: 183 -----------------------------------------EGFEFHEFPSLERLSLVSE 201
E + FPSLE+L+ +E
Sbjct: 277 SAADWRSKATQRKFQSAAAAIRRDTIAYPSLKPHSSSTNAPEWLSVYSFPSLEQLAAATE 336
Query: 202 VELRNAGFGYRSAPQSSLLFSVRR-----SFKQAKYITGTVDV---LQSKH-----SGGA 248
LR+ GFGYRS + ++ + K + G + L +H +G
Sbjct: 337 DTLRSLGFGYRSKYMVEAVSFIQTQLPPPELQDKKAVEGKMHSPPHLIRQHGACYRNGFY 396
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+LS Q D L LPGVG KVA C+ALF+L++ H +PVDTH+ ++A YL
Sbjct: 397 SAVLSHHSHHHQHQRDMLLLLPGVGRKVADCVALFALNRTHIVPVDTHMAQVAVEYL 453
>gi|154336427|ref|XP_001564449.1| putative 8-oxoguanine DNA glycosylase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061484|emb|CAM38513.1| putative 8-oxoguanine DNA glycosylase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 569
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 121/328 (36%), Gaps = 131/328 (39%)
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSAS-DCRFAELAKYLAGARV--------------- 142
E + + L +L++ + L +LW+ ++AS R L +YL G R+
Sbjct: 154 EESDEEILSHYLSLDVDLDQLWQDWTASPQTRKHPLVQYLVGNRLQLYSLLSDRNQRVCE 213
Query: 143 --------------LRQDPVECLLQFLCSSNNNIARITKMV------------------- 169
+RQD CL FLCS NNN+ RIT M+
Sbjct: 214 HDKAQQSLYISIRHVRQDLHSCLFSFLCSQNNNVTRITGMIYALCRAYGDQLCDVQLATA 273
Query: 170 ------------------------------DFLAS--LGSHLGNVEG---FEFHEFPSLE 194
D +AS L H +++ + FPSLE
Sbjct: 274 EVRTPCKSASDREHKLTKTELRRQTAAHKEDVIASPSLKPHRSSIDAPGWLSMYSFPSLE 333
Query: 195 RLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSK----------- 243
+LS + +LR GFGYR KYI T+ +Q++
Sbjct: 334 QLSAATTEKLRMLGFGYR-----------------GKYIVDTIRFIQTQSPQQELQDNVM 376
Query: 244 -------------------HSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
+G +L Q D L LPGVG KVA C+ALF+
Sbjct: 377 LKGQMHPPPHLMRQHGACYRNGFYSTVLGHHDHHHQHQRDMLLLLPGVGRKVADCVALFA 436
Query: 285 LDQHHAIPVDTHVWKIATRYLLPELAGV 312
L++ H +PVDTH+ ++A YL+ A V
Sbjct: 437 LNRTHIVPVDTHMAQVAAEYLVSPRAAV 464
>gi|397780937|ref|YP_006545410.1| N-glycosylase/DNA lyase [Methanoculleus bourgensis MS2]
gi|396939439|emb|CCJ36694.1| N-glycosylase/DNA lyase [Methanoculleus bourgensis MS2]
Length = 288
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 115/276 (41%), Gaps = 33/276 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L T GQ FRW+K + G + I ++ + H+ S + +
Sbjct: 12 FDLDRTLSCGQAFRWEKVEGW-WQGIVDGRAIRIRQDER-----HLTFSGVD---TGFVR 62
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ + L + D A + G R+ RQ P ECL+ ++C++N NI + +
Sbjct: 63 DYFRLDQDLPAILSSID-RDPVIAAAIRECRGLRLARQQPWECLVSYICATNTNIPAVKR 121
Query: 168 MVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
V +A G + G + FP E L+ VS +LR+ GYR+
Sbjct: 122 RVALMAERFGRPVDGPFGTAY-TFPEPEVLASVSRADLRDCRLGYRT------------- 167
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
Y++ T ++H +W + L ++A AL GVGPK A C+ LF+
Sbjct: 168 ----DYVSSTA-AFATEHP---DWAERIAALTFEDARQALMEFRGVGPKAADCVLLFAFG 219
Query: 287 QHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
A PVD + +I LP+L G TP+ R+
Sbjct: 220 FFEAFPVDVWIHRIMAETYLPDLTGKSCTPRDYERI 255
>gi|154482964|ref|ZP_02025412.1| hypothetical protein EUBVEN_00662 [Eubacterium ventriosum ATCC
27560]
gi|149736248|gb|EDM52134.1| 8-oxoguanine DNA-glycosylase (ogg) [Eubacterium ventriosum ATCC
27560]
Length = 306
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 113/256 (44%), Gaps = 26/256 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
++ L T GQ FR+ K Y L+ +K Q+G+ +T P ++
Sbjct: 28 TDFDLEQTLECGQCFRFYKQDEEDYVVVAFNKLLHIK--QSGNQLIFFNTEK--PDVENI 83
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+ + ++ E+ D + E K G R+L Q+ E L+ F+ S N I I
Sbjct: 84 WIPYFDLERDYNEIKSFLLKKDSKLEEAIKEKWGIRILNQEFYETLISFIISQNKQIPHI 143
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
++V ++ G +LG V G ++ FP ++ L ++E R G+R AP L+
Sbjct: 144 KQLVRRISEEYGEYLGEVNGERYYSFPDVKTLGTITEEAFREMKTGFR-AP---YLYDAA 199
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
+ K TG + + +L+ L+ E D L ++ GVG KVA C+ LFS
Sbjct: 200 Q-----KLATGVIS------------METLKGLNENETRDKLISIKGVGEKVANCVMLFS 242
Query: 285 LDQHHAIPVDTHVWKI 300
L A PVD + +I
Sbjct: 243 LGFREAFPVDVWIKRI 258
>gi|291541735|emb|CBL14845.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Ruminococcus bromii L2-63]
Length = 277
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 122/267 (45%), Gaps = 33/267 (12%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+L L T GQ+FRWK + G +++ L D+ +I + ++ K+
Sbjct: 17 DLDLAQTLDCGQSFRWKPQDDGSFHGVAYGKSVTVS-LDKTDM--YIENATADDF-KNIW 72
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ + + G++ E S E AKY G R+LRQ+P E L F+ S NNNI RI
Sbjct: 73 YSYFDFSLDYGKIREEISTIHPVLNEAAKYAPGIRILRQEPFEALCTFIISQNNNIKRIK 132
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+V+ L + G+ L + E FP+ E L+ +S +L G+R+
Sbjct: 133 GIVERLCENFGTPLDDGE----FAFPTAETLAKLSPDDLAPLRAGFRN------------ 176
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
+YI +D Q K + G L S LD ++A L + GVG KVA C LF +
Sbjct: 177 -----RYI---IDAAQ-KVANGEVNLDSCFTLDYEDARAELMKITGVGKKVADCTLLFGM 227
Query: 286 DQHHAIPVDTHVW-KIATRYLLPELAG 311
+ A P+D VW K A L P + G
Sbjct: 228 HRIEAFPID--VWMKRAMEKLFPNMNG 252
>gi|310657597|ref|YP_003935318.1| putative DNA glycosylase (DNA repair protein) [[Clostridium]
sticklandii]
gi|308824375|emb|CBH20413.1| putative DNA glycosylase (DNA repair protein) [[Clostridium]
sticklandii]
Length = 293
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/263 (27%), Positives = 118/263 (44%), Gaps = 40/263 (15%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW K YTG ++++K GD +T+ L D+ +
Sbjct: 24 FECGQCFRWDKEEDGSYTGVANGKVLNVK--LEGDTLILDNTN---------LDDYHRIW 72
Query: 114 ISLGELWEGFSASDCRFAELAKYL-------AGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ ++ +SA A++ ++L G R+L QD E ++ F+ S+ N I I
Sbjct: 73 HNYFDMGRDYSAIKAELAQMDEHLYNATIFGQGMRILNQDTFEIVISFIISARNAIPMIK 132
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+ V FL+ +LG +G G +++ FP+ + LS E L ++ +R
Sbjct: 133 RSVAFLSKALGEEIGEYRGKKYYAFPTPKALSSCDEQVLIDSKVAFRKG----------- 181
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
+ A YI +HS + + LR L A L T+PGVGPKV+ CI LF L
Sbjct: 182 YIRDASYI---------QHSLQMD-MYKLRNLPTDMARKELMTIPGVGPKVSDCILLFGL 231
Query: 286 DQHHAIPVDTHVWKIATRYLLPE 308
++ PVD V ++ + + +
Sbjct: 232 SKYDVFPVDVWVQRVMHEFYIEK 254
>gi|325679631|ref|ZP_08159206.1| 8-oxoguanine DNA-glycosylase (ogg) [Ruminococcus albus 8]
gi|324108661|gb|EGC02902.1| 8-oxoguanine DNA-glycosylase (ogg) [Ruminococcus albus 8]
Length = 271
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 117/270 (43%), Gaps = 40/270 (14%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGP-IGPHLISLKHLQNGDVCYHIHTSPSEPA 101
L Q++ L T GQ FRW+K Y G + L+ + G + +H + +E
Sbjct: 12 LKQADFDLDETLDCGQAFRWEKISDNTYKGAFLNKGLVISSEEEKG--IFRLHDT-NESD 68
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
++ ++ GEL FS D A+ Y G R+L+QD E L F+ S NNN
Sbjct: 69 LLEIWAEYFDLYTDYGELKRRFS-EDSTLAKACGYAGGIRILKQDRWEALSSFIISQNNN 127
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I RI ++ L H + +PS + + + SL +
Sbjct: 128 IPRIKGII---GRLCEH--------YDGYPSWQEM---------------KDETAESLAY 161
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
+R F+ AKY+ V +K S G L + K+D+ +A L T+ GVGPKVA C
Sbjct: 162 -LRSGFR-AKYLVDAV----AKISSGEIDLDEVAKMDIADARKTLQTIKGVGPKVAECTL 215
Query: 282 LFSLDQHHAIPVDTHVWKIATRYL---LPE 308
LF + A P+D V ++ + LPE
Sbjct: 216 LFGFYRTEAFPMDVWVKRVMANWYPNGLPE 245
>gi|256004083|ref|ZP_05429068.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
DSM 2360]
gi|385779996|ref|YP_005689161.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
DSM 1313]
gi|419721341|ref|ZP_14248505.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
thermocellum AD2]
gi|419726881|ref|ZP_14253901.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
thermocellum YS]
gi|255992006|gb|EEU02103.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
DSM 2360]
gi|316941676|gb|ADU75710.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
DSM 1313]
gi|380769846|gb|EIC03746.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
thermocellum YS]
gi|380782511|gb|EIC12145.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
thermocellum AD2]
Length = 295
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 110/248 (44%), Gaps = 26/248 (10%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRW + YTG +++K+ D + + E KS D+ ++
Sbjct: 28 VFDCGQCFRWIRQPDGSYTGVACGRALNVKYR---DGVLELSNTGIEDF-KSIWFDYFDL 83
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G + E + E K+ +G R+L+Q+ E L+ F+ S+NN I RI K VD +
Sbjct: 84 GRDYSHIKEKVMKDEI-MREAVKFGSGIRLLKQNIWETLISFIISANNRIPRIMKTVDEI 142
Query: 173 ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ L ++G +++ FPS ++LS + EL G G+R KY
Sbjct: 143 SRLYGCEIEMDGEKYYAFPSAKQLSHATLEELEQTGAGFR-----------------CKY 185
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I ++ G L + +D EA D L GVGPKVA C L+S ++ P
Sbjct: 186 IMNAAKMVNE----GKINLEDVCSMDTVEARDFLMRFEGVGPKVADCTLLYSGTKYDVFP 241
Query: 293 VDTHVWKI 300
D V ++
Sbjct: 242 TDVWVKRV 249
>gi|429190408|ref|YP_007176086.1| 3-methyladenine DNA glycosylase [Natronobacterium gregoryi SP2]
gi|448326318|ref|ZP_21515685.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronobacterium gregoryi SP2]
gi|429134626|gb|AFZ71637.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Natronobacterium gregoryi SP2]
gi|445612975|gb|ELY66692.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronobacterium gregoryi SP2]
Length = 299
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 115/269 (42%), Gaps = 34/269 (12%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTG--PIGPHLISLKHLQNG---DVCY--- 91
P+ L L T +GQT+ W++ Y G P ++ +G DV
Sbjct: 7 PVAALSGGLDLYRTLESGQTYLWRREDGEMYAGDPPSDTWYYTVVDASSGPKPDVIRVRT 66
Query: 92 ---HIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV 148
H+ + AA A+ L + L + + D A+ + G R++ P
Sbjct: 67 RDGHLEWESTTDAAP-AVCRLLRLDDDLASIVAA-APDDPLLAQAYEAHCGMRLVGDPPF 124
Query: 149 ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAG 208
CL+ F+CS+ + RI MV LA +G +H FP+ +L+ +E ELR G
Sbjct: 125 GCLISFICSAQMRVQRIHSMVSALAREYGDSLEFDGRTYHAFPTPTQLATATEDELRELG 184
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
GYR A Y+ T +++ + AE R L+ ++A D LC
Sbjct: 185 LGYR-----------------APYVVRTAEMVADGEAYPAE----ARDLEYEQARDYLCQ 223
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHV 297
GVG KVA C+ LFSL A+P+DT +
Sbjct: 224 FVGVGDKVADCVLLFSLGFDEAVPLDTWI 252
>gi|125974558|ref|YP_001038468.1| 8-oxoguanine DNA glycosylase-like protein [Clostridium thermocellum
ATCC 27405]
gi|281419082|ref|ZP_06250099.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
JW20]
gi|125714783|gb|ABN53275.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
ATCC 27405]
gi|281407231|gb|EFB37492.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium thermocellum
JW20]
Length = 295
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 110/248 (44%), Gaps = 26/248 (10%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRW + YTG +++K+ D + + E KS D+ ++
Sbjct: 28 VFDCGQCFRWIRQPDGSYTGVACGRALNVKYR---DGVLELSNTGIEDF-KSIWFDYFDL 83
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G + E + E K+ +G R+L+Q+ E L+ F+ S+NN I RI K VD +
Sbjct: 84 GRDYSHIKEKVMKDEI-MREAVKFGSGIRLLKQNIWETLISFIISANNRIPRIMKTVDEI 142
Query: 173 ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+ L ++G +++ FPS ++LS + EL G G+R KY
Sbjct: 143 SRLYGCEIEMDGEKYYAFPSAKQLSHATLEELEQTGAGFR-----------------CKY 185
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I ++ G L + +D EA D L GVGPKVA C L+S ++ P
Sbjct: 186 IMNAAKMVNE----GKINLEDVCSMDTVEARDFLMRFQGVGPKVADCTLLYSGTKYDVFP 241
Query: 293 VDTHVWKI 300
D V ++
Sbjct: 242 TDVWVKRV 249
>gi|291556559|emb|CBL33676.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Eubacterium siraeum V10Sc8a]
Length = 264
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 119/268 (44%), Gaps = 36/268 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
T + T GQ FRWK+ ++ G G + NGD Y +H E +
Sbjct: 5 FTCERFDIRRTLFCGQAFRWKELDG-RFCGIAGGRYAEIS--DNGDSTYTVH--DIEKSD 59
Query: 103 KSALLDFLNMGISLGELWEGFSASD-CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
S + ++ L + FS + R A K G RVLRQ+P E L+ F+ S NNN
Sbjct: 60 ISYWRSYFDLDTDYDALIKQFSEDEHMRLA--CKENPGIRVLRQEPFETLISFIISQNNN 117
Query: 162 IARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RIT ++D L S G + FP+LE+L V+ +L AP
Sbjct: 118 IKRITGIIDRLCESFGEKTDRG-----YMFPTLEKLVGVTAEDL---------AP----- 158
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
+R F+ A+YI V+ L S +L ++ +D A + L + GVG KVA C+
Sbjct: 159 --LRAGFR-ARYIVDAVEKLHSTEV----FLDGIKAMDTTAAREELKKIKGVGDKVADCV 211
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPE 308
LF + A P D + +I + L P+
Sbjct: 212 LLFGYHKTDAFPRDVWIKRIEQK-LYPD 238
>gi|254479426|ref|ZP_05092756.1| 8-oxoguanine DNA glycosylase, N-terminal domain family
[Carboxydibrachium pacificum DSM 12653]
gi|214034631|gb|EEB75375.1| 8-oxoguanine DNA glycosylase, N-terminal domain family
[Carboxydibrachium pacificum DSM 12653]
Length = 266
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 108/242 (44%), Gaps = 26/242 (10%)
Query: 60 FRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGEL 119
FRW + YTG ++++K GD +T+ ++ D+ ++G G++
Sbjct: 1 FRWNEEDDGSYTGVAYDRVVNVK--LEGDTLIIDNTNLTD--FYDIWFDYFDLGRDYGQI 56
Query: 120 WEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHL 179
E S D E K+ G R+LRQD E L+ F+ S NN I +I K+++ LA+ +
Sbjct: 57 KESLS-KDPVLKEAIKFGQGIRILRQDTWETLVSFIVSQNNRIPQIKKVIENLAASFGNP 115
Query: 180 GNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDV 239
+G ++ FP E L + + G+R AKYI
Sbjct: 116 IEYKGKIYYTFPKPEELVMYDVETIAKTRCGFR-----------------AKYIFDAA-- 156
Query: 240 LQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
SK G LL L + E D L T+ GVGPKVA C+ L+S+ ++ P D + +
Sbjct: 157 --SKVFSGEINLLKLHEYSTSEIRDILMTINGVGPKVADCVILYSIGRYDTFPTDVWIKR 214
Query: 300 IA 301
I
Sbjct: 215 IV 216
>gi|448336786|ref|ZP_21525877.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
pallidum DSM 3751]
gi|445627877|gb|ELY81192.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
pallidum DSM 3751]
Length = 293
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 124/282 (43%), Gaps = 45/282 (15%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGP----------HLISLKHLQNGDV 89
P++ L L T +GQ++ W++ YT P +I ++ ++G +
Sbjct: 7 PIDDLGGGLDLYRTLESGQSYLWRRGDGEMYTDTPAPGAWYATVVDGEVIRVR-TRDGRL 65
Query: 90 CYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQD 146
+ T+ +EP + L L + E +A+ D E + G R++R
Sbjct: 66 EWE-STADAEPTVRRLL--------RLEDDLEAIAAAGPDDPLLREAYEAHPGMRLVRDP 116
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN 206
P CL+ F+CS+ + RI MV LA +G +H FP+ ++L+ +E ELR+
Sbjct: 117 PFGCLISFICSAQMRVGRIHTMVTALAREYGSPITFDGATYHAFPTPDQLATATEAELRD 176
Query: 207 AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDAL 266
G GYR A Y+ T +++ +GG R L+ + A + L
Sbjct: 177 LGLGYR-----------------APYVVRTAEMV----AGGKAHPADARDLEYEAAREFL 215
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
GVG KVA C+ LFSL A+P+DT + K A PE
Sbjct: 216 TQFVGVGDKVADCVLLFSLGFDEAVPLDTWI-KSAVEEYYPE 256
>gi|399888785|ref|ZP_10774662.1| 8-oxoguanine DNA glycosylase [Clostridium arbusti SL206]
Length = 292
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 115/264 (43%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT + G +I ++ N V Y+ +E + L
Sbjct: 21 FELEHVFECGQCFRWNKTEQGTFIGVAFGKVIEVEKKDNNLVIYN----TTEEEFNNIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ N+ E+ E S D + + G R+L+Q+P E L+ F+ S+NN I I +
Sbjct: 77 DYFNLRRDYKEIKEELS-EDPLLKKSVDFGNGIRILKQEPFEILISFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ + ++ FP +E L + + G G+R
Sbjct: 136 EINTISKKWGKEIKYKDDIYYTFPKVEDLQDKTIEDFEGCGVGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI T + + + E++ S D + L GVGPKVA CI LFS+++
Sbjct: 181 --AKYIKDTCEKVYA-SDNILEYIYSC---DDDKCHIELQKFMGVGPKVADCIMLFSMEK 234
Query: 288 HHAIPVDTHVWKIATR-YLLPELA 310
+ A PVD V + Y+ P+++
Sbjct: 235 YSAFPVDVWVKRAMMHFYVAPDVS 258
>gi|291531898|emb|CBK97483.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Eubacterium siraeum 70/3]
Length = 264
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 117/267 (43%), Gaps = 34/267 (12%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
T + T GQ FRWK+ ++ G G + NGD Y +H E +
Sbjct: 5 FTCERFDIRRTLFCGQAFRWKELDG-RFCGIAGGRYAEIS--DNGDSTYTVH--DIEKSD 59
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
S + ++ L + FS D K G RVLRQ+P E L+ F+ S NNNI
Sbjct: 60 ISYWQSYFDLDTDYDALIKQFS-EDEHMKLACKENPGIRVLRQEPFETLISFIISQNNNI 118
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RIT ++D L S G + FP+LE+L V+ +L AP
Sbjct: 119 KRITGIIDRLCESFGEKTDRG-----YMFPTLEKLVGVTAEDL---------AP------ 158
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
+R F+ A+YI V+ L S G L ++ +D A + L + GVG KVA C+
Sbjct: 159 -LRAGFR-ARYIVDAVEKLSS----GEVSLDGIKAMDTAAAREELKRIKGVGDKVADCVL 212
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLPE 308
LF + A P D + +I + L P+
Sbjct: 213 LFGYHKTDAFPRDVWIKRIEQK-LYPD 238
>gi|325663087|ref|ZP_08151537.1| hypothetical protein HMPREF0490_02277 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325470541|gb|EGC73771.1| hypothetical protein HMPREF0490_02277 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 275
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 114/251 (45%), Gaps = 27/251 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L +GQ FR ++ Y + SL+ LQ G+ C T P E +
Sbjct: 9 FNLQQICDSGQCFRMERVSENCYR--VIAFGRSLEILQEGEQCTFFCT-PHE--FEEIWN 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ EG + +D A++ +G R+LRQD E + FL S N+I RI K
Sbjct: 64 DYFDLETDYQSYIEGINPNDSYLLAAAEWGSGIRILRQDLWEMIASFLISQQNHITRIRK 123
Query: 168 MVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L + G + G F+ FP E+L+ + + +L+ GYRS + VR +
Sbjct: 124 CIQNLCETYGEERTSDSGNTFYTFPEPEKLAELGDDDLKACNLGYRSK------YVVRTA 177
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
++G VD L ++R L +A L L GVG KVA CI LF+L
Sbjct: 178 ---KSIVSGEVD------------LDAIRHLPYAKAKKELLKLFGVGEKVADCICLFALH 222
Query: 287 QHHAIPVDTHV 297
A P+DTH+
Sbjct: 223 HLQAFPIDTHI 233
>gi|154319440|ref|XP_001559037.1| hypothetical protein BC1G_02201 [Botryotinia fuckeliana B05.10]
Length = 221
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 82/155 (52%), Gaps = 22/155 (14%)
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
S L +LN+ +L E++E +S +D F + A G R+L+QD E L+ F+CSSNNNI
Sbjct: 79 SLLRHYLNLSPNLTEMYEQWSLADPNFKKRAPKFTGVRILKQDAWEALVGFICSSNNNII 138
Query: 164 RITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLF 221
RI++MV+ L G +G+++ FH+FP E L+ E LR GFGYR
Sbjct: 139 RISQMVNNLCLHYGPLIGHIDDQPFHDFPQPEALTGSGVESHLRALGFGYR--------- 189
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
AKYI T + SK +WL +LR
Sbjct: 190 --------AKYIAQTASIAASK---PKDWLENLRN 213
>gi|297171515|gb|ADI22514.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[uncultured verrucomicrobium HF0500_08N17]
Length = 244
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/253 (29%), Positives = 115/253 (45%), Gaps = 30/253 (11%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ L T +GQ FRW++ G + G IG + L+ + G I ++P A
Sbjct: 12 DYDLDATLSSGQAFRWQQLGQ-AWEGVIGGRWVRLRLTKGG-----IAAEAAKPVRDWAW 65
Query: 107 LD-FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L+ +L + LG+ F + +A + G R+LRQD ECL F+ S+ I +I
Sbjct: 66 LEHYLQLHFDLGQALAMFPDDEPMQNAVAAF-PGLRLLRQDYWECLASFILSATKQIVQI 124
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+MV L+ G + +V FP++E ++ E ELR G+R AP +LL + R
Sbjct: 125 RQMVTLLSKRYGEPIASVSDDPAFAFPTIEIIAACGEAELRECKLGFR-AP--NLLGAAR 181
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
D+L K +W L + EA + L L GVGPK+A C+ LF+
Sbjct: 182 -------------DILDEK----IDWQ-RLPTMTSSEAREELMKLRGVGPKIADCVLLFA 223
Query: 285 LDQHHAIPVDTHV 297
PVD +
Sbjct: 224 GGHQEVFPVDVWI 236
>gi|91773114|ref|YP_565806.1| 8-oxoguanine DNA glycosylase-like protein [Methanococcoides
burtonii DSM 6242]
gi|91712129|gb|ABE52056.1| 8-oxoguanine DNA glycosylase [Methanococcoides burtonii DSM 6242]
Length = 279
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 118/264 (44%), Gaps = 35/264 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL-QNGDVCYHIHTSPSEPA 101
+ +L+L T GQ FRW + G +TG I + + + GD+ + +S E
Sbjct: 4 IKTEDLNLEYTLDCGQVFRWDRDGDW-WTGVINGVVARISQSPETGDLL--VDSSLDEDF 60
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+ + L +++ + + ++KY G R++RQDP ECL+ ++ ++ +N
Sbjct: 61 FHR----YFRLDDDLPAIFKQINKDEHMDVAISKY-RGLRLIRQDPWECLISYMLATASN 115
Query: 162 IARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RI K + L++L G L + FP +E L+ S +L G+R+A
Sbjct: 116 IPRIKKNIYMLSALFGEELEKGH----YSFPKVEALAASSCDDLCECKMGFRTA------ 165
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R K A + VL L LD EA + L L G+G KVA CI
Sbjct: 166 ----RIIKAANAVINGDIVLDE-----------LFSLDYGEAKNELMKLEGIGEKVADCI 210
Query: 281 ALFSLDQHHAIPVDTHVWKIATRY 304
LF+ + PVDTHV KI Y
Sbjct: 211 LLFAFAKMEGFPVDTHVEKIVKAY 234
>gi|294495228|ref|YP_003541721.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalophilus mahii
DSM 5219]
gi|292666227|gb|ADE36076.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalophilus mahii
DSM 5219]
Length = 282
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 116/274 (42%), Gaps = 37/274 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L T GQ FRW K + G + + + Q + Y P E K
Sbjct: 9 FDLDHTLDCGQVFRWSKEDDW-WHGVVQGDYVHAFYEQEAETLYIDSRLPVEFFVK---- 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + L ++ +AKY G R++ QDP ECL+ ++ ++ +NI RI K
Sbjct: 64 -YFRLDDDLPYIFSSIDHDSYVHEAIAKY-KGLRLVMQDPWECLVSYMIATASNIPRIMK 121
Query: 168 MVDFLASLGSHLGN--VEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
++ L+ L LG V+G + FP + L+ +L + G+R+ RR
Sbjct: 122 SIEKLSRL---LGEEIVDGI--YAFPEISTLAACCGEDLCDCSLGFRA----------RR 166
Query: 226 SFKQAKYI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
K A+ I +G +D L L +D +A L + G+G KVA CI LFS
Sbjct: 167 LVKAARMIESGELD------------LWGLYDMDYFQAKKQLMNIEGIGDKVADCILLFS 214
Query: 285 LDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
+ A PVDTHV K+ Y G PK+
Sbjct: 215 YGKMEAFPVDTHVDKVIRNYYSDSFEGPYTKPKM 248
>gi|238916461|ref|YP_002929978.1| N-glycosylase/DNA lyase [Eubacterium eligens ATCC 27750]
gi|238871821|gb|ACR71531.1| N-glycosylase/DNA lyase [Eubacterium eligens ATCC 27750]
Length = 287
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 113/255 (44%), Gaps = 26/255 (10%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ L T GQ F + K Y +++ + Q D T E +
Sbjct: 11 DFDLAQTLECGQCFHFVKLDEEDYVLTAKGYVLHVS--QEADTVTFYDTDKDEYV--NVW 66
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D+ +M + + D + + + + G R+L QD E L+ F+ S N I I
Sbjct: 67 KDYFDMDRDYSAIKKKLLEKDDKLKDAIESMWGVRILNQDFFETLISFIISQNKQIPHIK 126
Query: 167 KMV-DFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K+V D A G++ G G + + FP+LE+L+ SE + + G+R AP ++ ++RR
Sbjct: 127 KIVADISAKFGTYKGTYGGADMYTFPTLEQLANASEEDFKELKTGFR-AP--YIMDAIRR 183
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
+ + G D+ + L+ +D I L T+ GVG KVA C++LF L
Sbjct: 184 N------MAGQFDINE------------LKSMDYDSCIKELMTIKGVGEKVANCVSLFGL 225
Query: 286 DQHHAIPVDTHVWKI 300
+ A PVD + +I
Sbjct: 226 GKKEAFPVDVWIKRI 240
>gi|374317597|ref|YP_005064025.1| 3-methyladenine DNA glycosylase [Sphaerochaeta pleomorpha str.
Grapes]
gi|359353241|gb|AEV31015.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Sphaerochaeta pleomorpha str. Grapes]
Length = 253
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 115/263 (43%), Gaps = 41/263 (15%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ S ++L T GQ+F W + G ++ + L+ L+ D C
Sbjct: 1 MENSVINLQTTLFCGQSFAWLQHGE-TFSAVLKGRLVQLRQDTCIDQC----------KE 49
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+ L + +M D +E Y G +L QDP E L+ F+ S NN+I
Sbjct: 50 DAFLYHYFDMDFDYASANRHLITLDHPMSEAIAYAKGLHILNQDPWEVLIGFILSQNNSI 109
Query: 163 ARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
RIT + + L+ + G+ +G FP+ ++L+ V E ELR+ G G+RS
Sbjct: 110 KRITMLYEKLSINFGTEVGKGR----FSFPTPDQLAGVGESELRSLGVGFRSP------- 158
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
YI VD ++ + L ++ L +A+ L T+ GVGPKV +CI
Sbjct: 159 ----------YI---VDAIEKSY-----LLDDIKSLPFDDALSVLMTIKGVGPKVGSCIL 200
Query: 282 LFSLDQHHAIPVDTHVWKIATRY 304
L+ + A P+DT + K R+
Sbjct: 201 LYGFHRMEAFPMDTWMKKAMARW 223
>gi|440781406|ref|ZP_20959748.1| 8-oxoguanine DNA glycosylase [Clostridium pasteurianum DSM 525]
gi|440221011|gb|ELP60217.1| 8-oxoguanine DNA glycosylase [Clostridium pasteurianum DSM 525]
Length = 292
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 114/264 (43%), Gaps = 27/264 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW KT +TG +I ++ N D+ H + +E +
Sbjct: 21 FELDHVFECGQCFRWNKTKEGTFTGVAFDRVIEVERKGN-DLIIH---NTTEKEFNDIWI 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ E+ E S D + + +G R+L+Q+P E L+ F+ S+NN I I +
Sbjct: 77 KYFDLERDYNEIKEELS-KDPLLKKSVNFGSGIRILKQEPFEILISFIISANNRIPMIKR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP +E L + + G G+R
Sbjct: 136 AIKAISKKWGKKIQYKGETYYTFPKVEDLQDKTIEDFEECGTGFR--------------- 180
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI T + + ++ L + D + L GVGPKVA CI LFS+ +
Sbjct: 181 --AKYIKDTCERVYESNN----ILEKIYSCDDNQCHIELQKFMGVGPKVADCIMLFSMQK 234
Query: 288 HHAIPVDTHVWKIATR-YLLPELA 310
+ A PVD V + Y+ P+++
Sbjct: 235 YSAFPVDVWVKRAMMHFYVAPDVS 258
>gi|167750146|ref|ZP_02422273.1| hypothetical protein EUBSIR_01115 [Eubacterium siraeum DSM 15702]
gi|167656889|gb|EDS01019.1| 8-oxoguanine DNA-glycosylase (ogg) [Eubacterium siraeum DSM 15702]
Length = 264
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 118/268 (44%), Gaps = 36/268 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
T + T GQ FRWK+ ++ G G + NGD Y +H E +
Sbjct: 5 FTCERFDIRRTLFCGQAFRWKELDG-RFCGIAGGRYAEIS--DNGDSTYTVH--GIEKSD 59
Query: 103 KSALLDFLNMGISLGELWEGFSASD-CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
S + ++ L + FS + R A K G RVLRQ+P E L+ F+ S NNN
Sbjct: 60 ISYWQSYFDLDTDYDALIKQFSEDEHMRLA--CKENPGIRVLRQEPFETLISFIISQNNN 117
Query: 162 IARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
I RIT ++D L S G + FP+LE+L V+ +L AP
Sbjct: 118 IKRITGIIDRLCESFGEKTDRG-----YMFPTLEKLVGVTAEDL---------AP----- 158
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
+R F+ A+YI V+ L S L ++ +D A + L + GVG KVA C+
Sbjct: 159 --LRAGFR-ARYIVDAVEKLHSTEVS----LDGIKAMDTTAAREELKKIKGVGDKVADCV 211
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPE 308
LF + A P D + +I + L P+
Sbjct: 212 LLFGYHKTDAFPRDVWIKRIEQK-LYPD 238
>gi|359410287|ref|ZP_09202752.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
sp. DL-VIII]
gi|357169171|gb|EHI97345.1| 8-oxoguanine DNA glycosylase domain-containing protein [Clostridium
sp. DL-VIII]
Length = 309
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 111/251 (44%), Gaps = 13/251 (5%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FR+++ + LI +K N + Y+ +E L
Sbjct: 21 FKLKHIFECGQIFRFEEIEENDFIVIAFGKLIEVKEEGNNVIIYN----TNEEEFNRIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ ++ E S D + ++ G RVL QDP E LL F+ S+ NNI I K
Sbjct: 77 SYFDLNRDYSDIKEELS-KDVLLKQSIEFGYGVRVLNQDPFEMLLSFIISARNNIPSIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ +++ N + ++ FPS+ + + E++ G +RS + +V S
Sbjct: 136 TVNKISAKWGKEINYKNKVYYAFPSIYEVKDATLEEIQETGASFRSKYLIDTIKNVYNSV 195
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K L ++S L +++L E DAL GVG KVA CI LFS+++
Sbjct: 196 KDENL------KLNEENSYIKYNLDYIKRLSDDECHDALQEFKGVGSKVADCIMLFSMEK 249
Query: 288 HHAIPVDTHVW 298
A PVD VW
Sbjct: 250 TSAFPVD--VW 258
>gi|448300618|ref|ZP_21490617.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum tibetense GA33]
gi|445585437|gb|ELY39732.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum tibetense GA33]
Length = 293
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 115/263 (43%), Gaps = 44/263 (16%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGP----------IGPHLISLKHLQNGDVCYHIHTSP 97
L L T +GQ++ W+++ Y G +G ++ ++ + DV T+
Sbjct: 15 LDLYRTLESGQSYCWRRSDGEMYGGASPDGAWYHTVVGGDVVRIRQVD--DVLEWESTTD 72
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQDPVECLLQF 154
+E + + L + E A+ D E G R++ P CL+ F
Sbjct: 73 AEATVRRL--------VRLDDDLEAIVAAAPDDPLLREAYDAHRGMRLVEDPPFGCLISF 124
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
+CS+ ++RI MV LA +G +H FP+ ++L+ +E ELR G GYR
Sbjct: 125 ICSAQMRVSRIHTMVSTLAREYGDAIEFDGETYHAFPTPDQLATATEAELRELGLGYR-- 182
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
A Y+ T +++ + + A + R L+ + A D L GVG
Sbjct: 183 ---------------APYVVRTAEMVANGEADPA----AARGLEYEAARDFLTQFVGVGD 223
Query: 275 KVAACIALFSLDQHHAIPVDTHV 297
KVA C+ LFSL+ A+P+DT +
Sbjct: 224 KVADCVLLFSLEFDEAVPLDTWI 246
>gi|269867341|ref|XP_002652562.1| 8-oxoguanine DNA glycosylase / DNA-lyase [Enterocytozoon bieneusi
H348]
gi|220062219|gb|EED41494.1| 8-oxoguanine DNA glycosylase / DNA-lyase [Enterocytozoon bieneusi
H348]
Length = 213
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 79/167 (47%), Gaps = 22/167 (13%)
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERL 196
+ G R L D + F+CS NNN+ RI MV L G ++ N + H+FP L+
Sbjct: 25 VDGLRFLTNDLYSTIFSFICSQNNNVKRIEGMVHCLYEKGPYI-NYKNISVHKFPQLD-- 81
Query: 197 SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
LV ++ R GFGYR+ S + AKYI +H L+K
Sbjct: 82 ELVDDMHWRAHGFGYRADYIS----------QAAKYIREHQSSFTKEH---------LQK 122
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ A + LC + G+G KVA CI L SL+ +P+D H++ + R
Sbjct: 123 IGYGNAKNKLCEIKGIGRKVADCICLISLEYFDVVPIDRHIFNYSQR 169
>gi|448345941|ref|ZP_21534830.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
altunense JCM 12890]
gi|445633874|gb|ELY87061.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
altunense JCM 12890]
Length = 293
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 127/283 (44%), Gaps = 47/283 (16%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHL----------ISLKHLQNGDV 89
P++ L L T +GQ++ W++ YT P + I ++ ++G +
Sbjct: 7 PIDDLGGGLDLYRTLESGQSYLWRREDGEMYTDTPAPGVWYATVVDGEAIRVR-TRDGRL 65
Query: 90 CYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQD 146
+ T+ +EP + L L + E +A+ D E + G R++R
Sbjct: 66 EWE-STADAEPTVRRLL--------RLEDDLEAIAAAGPDDPLLREAYEAHPGMRLVRDP 116
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELR 205
P CL+ F+CS+ + RI MV LA GS + +G +H FP+ ++L+ +E ELR
Sbjct: 117 PFGCLISFICSAQMRVGRIHTMVTALAREYGSPIA-FDGATYHAFPTPDQLATATEAELR 175
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
+ G GYR A Y+ T +++ +GG R L+ + A +
Sbjct: 176 DLGLGYR-----------------APYVVRTAEMV----AGGNAHPADARDLEYEAAREF 214
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
L GVG KVA C+ LFSL A+P+DT + K A PE
Sbjct: 215 LTQFVGVGDKVADCVLLFSLGFDEAVPLDTWI-KSAIEEYYPE 256
>gi|401888039|gb|EJT52007.1| hypothetical protein A1Q1_06720 [Trichosporon asahii var. asahii
CBS 2479]
Length = 671
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 120/268 (44%), Gaps = 43/268 (16%)
Query: 82 KHLQNGDVCYH-IHTSPS--EPAAKSALL---DFLNMG-----ISLGELWEGFSASDCRF 130
+ L+ GD+ + +PS PAA L D+ + L + WE +
Sbjct: 352 QFLKQGDLLLSAVSATPSTASPAADDTRLWLEDYFQLSQYPDLTKLYQAWEERDPALFGK 411
Query: 131 AELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSH----LGNVEG-- 184
EL G RVLRQDP + F+ S+NN+I RI+ ++ L S L N +G
Sbjct: 412 TELNAKAVGVRVLRQDPWD----FITSTNNHITRISSLLHKLCIAFSSPVLTLDNPDGQG 467
Query: 185 -FEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS 242
+H FP E + L+ E LR+ GFGYR+A +S L S+R + VD
Sbjct: 468 QTTYHLFPRAEHIPLIGLEQVLRDLGFGYRAAFIASTLNSLR--------VFDNVD---- 515
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
E L R L+E L L GVG KVA C+ L +DQ +P+DTH+ KIA
Sbjct: 516 ------EELARWRAAPLEETRGRLLELKGVGRKVADCVQLMCMDQLSLVPIDTHLHKIAM 569
Query: 303 RY--LLPELAGVRLTPKLCSRVAEAFCE 328
R+ L R++P + V E
Sbjct: 570 RHPSFPSSLKAKRMSPAIYGDVQSFLAE 597
>gi|91202726|emb|CAJ72365.1| similar to 8-oxoguanine glycosylase (Hogg1) [Candidatus Kuenenia
stuttgartiensis]
Length = 269
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 109/261 (41%), Gaps = 35/261 (13%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
L + +L T GQ FR KK Y + ++ L Y +
Sbjct: 4 LAVKDFNLSHTLLCGQLFRVKKIDDWFYVAA-KNRIFKIRQLSEHVEYYGV--------G 54
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
K L +F + G++ + + + K+ G R++RQDP ECL+ F+CSS NI
Sbjct: 55 KKFLTNFFALDEPYGDILSQINKDSHMNSAIGKFY-GLRIVRQDPWECLISFMCSSAANI 113
Query: 163 ARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
+I ++ L+ + FE+H FP +L ++ + GF
Sbjct: 114 PKIQLNLENLSEYFGEKVRLHDFEWHTFPRPGKLDDYQQILMAKTGF------------- 160
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
+A +I D + +L +L KL EA AL +PGVG K+A C+ L
Sbjct: 161 ------RAVHIKKANDTIND------AFLTALNKLPYPEAKKALLQIPGVGNKIADCVLL 208
Query: 283 FSLDQHHAIPVDTHVWKIATR 303
FSL A P+DT + KI R
Sbjct: 209 FSLGFSEAFPIDTWIKKILQR 229
>gi|71005570|ref|XP_757451.1| hypothetical protein UM01304.1 [Ustilago maydis 521]
gi|46096934|gb|EAK82167.1| hypothetical protein UM01304.1 [Ustilago maydis 521]
Length = 569
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 115/290 (39%), Gaps = 85/290 (29%)
Query: 24 PPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPT--GQTFRWKKTGPLQ-YTGPIGPH--- 77
P N P ++ L S++ LPLT GQ FRW+ + +G IGP
Sbjct: 18 PQNEDFFGPRPPPGYSALRAATSQILLPLTVSNKCGQAFRWRCNKVWEPRSGSIGPEYDE 77
Query: 78 -----------LISLKHLQNGDVCYHIHTSPSEPAAKSA--------------LLDFLNM 112
++ L+ ++ YH P+ + + L D+L++
Sbjct: 78 QIEWSLCLADRVVLLRQDEHRGFLYHKTLLPAASSCSTNIERDLEISRETTRWLTDYLSL 137
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
+ L L+ ++ D FA A +G R+LRQDP ECL F+CSSNNNIARI +MV L
Sbjct: 138 DVPLESLYAEWAEKDSVFARFATRFSGLRMLRQDPWECLCAFVCSSNNNIARIGQMVQNL 197
Query: 173 ---------------------------------ASLGSHL--GNVEGFEFHEFPSLERLS 197
AS +H+ G V+ FH FP E L+
Sbjct: 198 CTHFSPVLLEYSYPPPPASVRALESIKTEADGDASKQNHVEQGQVK-IAFHPFPPAEALA 256
Query: 198 LVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
E +LR GFGYR AKY+ T +L + H G
Sbjct: 257 KPGVEEKLRELGFGYR-----------------AKYLARTAQMLYATHGG 289
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 7/91 (7%)
Query: 236 TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
T V Q + +L LR L +EA D L PG+GPKVA CI L SLDQ +IPVD
Sbjct: 369 TEKVEQHQFDSVRSYLQHLRTLSYREARDQLIQFPGIGPKVADCILLMSLDQASSIPVDR 428
Query: 296 HVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
HV++ A ++ RL K +A+ F
Sbjct: 429 HVFQFAEKWY-------RLRTKKYEEIADYF 452
>gi|397773290|ref|YP_006540836.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
sp. J7-2]
gi|397682383|gb|AFO56760.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
sp. J7-2]
Length = 293
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 122/276 (44%), Gaps = 33/276 (11%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP-- 97
P++ L L T +GQ++ W++ YT P + +G+V + T
Sbjct: 7 PIDDLGGGLDLYRTLESGQSYLWRREDGEMYTDTPAPGAW-YATVVDGEVI-RVRTRDGR 64
Query: 98 ----SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
S A+S + L + L E D E + G R++R P CL+
Sbjct: 65 LEWESTADAESTVRRLLRLEDDL-EAIAAAGPDDPLLREAYEAHPGMRLVRDPPFGCLIS 123
Query: 154 FLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CS+ + RI MV LA GS + +G +H FP+ ++L+ +E ELR+ G GYR
Sbjct: 124 FICSAQMRVGRIHTMVTALAREYGSPIA-FDGATYHAFPTPDQLATATEAELRDLGLGYR 182
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
A Y+ T +++ ++ A+ R L+ + A + L GV
Sbjct: 183 -----------------APYVVRTAEMVADGNAHPAD----ARDLEYEAAREFLTQFVGV 221
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
G KVA C+ LFSL A+P+DT + K A PE
Sbjct: 222 GDKVADCVLLFSLGFDEAVPLDTWI-KSAIEEYYPE 256
>gi|389583009|dbj|GAB65745.1| hypothetical protein PCYB_072470, partial [Plasmodium cynomolgi
strain B]
Length = 739
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 66/121 (54%), Gaps = 20/121 (16%)
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
F F+EFPS+E LS + E +LRN GFGYRS+ Y+ + +L +
Sbjct: 518 FPFYEFPSVEILSKLKEQDLRNLGFGYRSS-----------------YVIESAKMLVKR- 559
Query: 245 SGGAEWLLSLRK-LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G EW+ L+K + ID L PG+G KVA CI LF L++ IP+DTH++ I +
Sbjct: 560 -GSEEWIQDLKKEKKTKNCIDQLILFPGIGLKVANCICLFGLNKFDCIPIDTHIYDIIYK 618
Query: 304 Y 304
Y
Sbjct: 619 Y 619
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 71/184 (38%), Gaps = 41/184 (22%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNG-- 87
+ N +W +N+T++EL L F GQ F + + Y G + + K + G
Sbjct: 154 VIKNFQRRWKRINVTKNELQLKYCFLIGQEFCFSEVSRDTYIGLVNKKIYLFKESEEGVF 213
Query: 88 ----------------------------------DVCYHIHTSPSEPAAKSA-----LLD 108
D H+S E A + + D
Sbjct: 214 YQCVYDRSGGSSGEWSGGRSDGRSGGRSDGAEGEDEPQEAHSSNPEKANTNDHYDQDVND 273
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
F N+ L + E + D R E+ + G R+L+ D VE FLCS+NNNI RIT M
Sbjct: 274 FFNLQFPLSKNIEMWKKKDKRMKEITDKIRGLRILKADSVESFFSFLCSTNNNIPRITLM 333
Query: 169 VDFL 172
+D L
Sbjct: 334 IDCL 337
>gi|386811416|ref|ZP_10098642.1| 8-oxoguanine glycosylase [planctomycete KSU-1]
gi|386406140|dbj|GAB61523.1| 8-oxoguanine glycosylase [planctomycete KSU-1]
Length = 249
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 81/165 (49%), Gaps = 25/165 (15%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++RQDP ECL+ FLCSS NI +IT ++ L+ + FE + FP+ L+
Sbjct: 68 GLRIIRQDPWECLISFLCSSAANIPKITLNLEMLSRAFGKKIELNEFEQYSFPNPGELNN 127
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
++ L GF +AKYI ++ ++L SLRKL
Sbjct: 128 YEQILLAKTGF-------------------RAKYIKIANSLVSE------DFLKSLRKLS 162
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
EA L +PG+G K+A CI LFSL A P+DT + KI +
Sbjct: 163 YGEAKQTLRNIPGIGDKIADCILLFSLGFTEAFPIDTWMKKILQK 207
>gi|388851915|emb|CCF54509.1| related to 8-oxoguanine DNA-glycosylase [Ustilago hordei]
Length = 583
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 127/302 (42%), Gaps = 69/302 (22%)
Query: 35 PSKWTPLNLTQSELSLPLTFPT--GQTFRWKKT----------------GPLQYTGPIGP 76
P + L S++ LPLT GQ FRW+ G ++++ +
Sbjct: 27 PPGYATLRAAPSQILLPLTVSNKCGQAFRWRGNKVWVLRNEGSGTQIYEGQIEWSLCLSD 86
Query: 77 HLISLKHLQNGDVCYHIHTSPSEPAAKSA------------LLDFLNMGISLGELWEGFS 124
++ L+ ++ YH PS S L D+LN+ + L L+ +
Sbjct: 87 RVVLLQQDEHQGFLYHKTLLPSTRGRDSMEVAAINKETERWLNDYLNLDVPLEALYAEWE 146
Query: 125 ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD-----FLASLGSHL 179
A D FA A G R+LRQDP ECL F+CSSNNNIARI +MV F + L H+
Sbjct: 147 AKDPVFARFATRFTGLRMLRQDPWECLCAFVCSSNNNIARIGQMVQNLCTHFSSPLLEHV 206
Query: 180 -------------------GNVEGFE----FHEFPSLERLSLVS-EVELRNAGFGYRSA- 214
+VE E +H FP E L+ E +LR GFGYR+
Sbjct: 207 YLPPPESVKAEGGDADAVGKDVEQGEVRIAYHPFPPPEALAKPGVEEKLRELGFGYRAKY 266
Query: 215 -PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD------LQEAIDALC 267
+++ + + K++ G +++H + WL ++ D Q++I A
Sbjct: 267 LARTAQMLCAKHGQKESNVGFGATK--KAEHLNESHWLYGIKAEDDDTFTLAQQSITASL 324
Query: 268 TL 269
TL
Sbjct: 325 TL 326
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
+L LR + A L PGVGPKVA CI L SLDQ +IPVD HV++ A ++
Sbjct: 384 YLQHLRTTSYRHARSELMQFPGVGPKVADCILLMSLDQASSIPVDRHVFQFAEKWY---- 439
Query: 310 AGVRLTPKLCSRVAEAF 326
+L K +A+ F
Sbjct: 440 ---KLRTKKYEEIADHF 453
>gi|76156071|gb|AAX27308.2| SJCHGC04233 protein [Schistosoma japonicum]
Length = 213
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 94/185 (50%), Gaps = 28/185 (15%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHL----GNVEGFEFHEFPSL 193
G R+LRQDP E L F+ S+NNNI RITK++ L + G+ L G+V + F PSL
Sbjct: 2 GLRLLRQDPEETLFAFITSANNNITRITKLLRKLCIAYGNPLYLENGDVTNWTF---PSL 58
Query: 194 ERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E L+ + L+ GFGYRS K+I + L +GG LL
Sbjct: 59 EILAQPGMQDNLKKIGFGYRS-----------------KFIELAANWLLK--NGGRSCLL 99
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGV 312
LR EA L +PG+G KVA CI L +LD+ + +PVD H+ + A + E +
Sbjct: 100 ELRSKSHCEAQAFLLQIPGIGKKVADCICLCALDKVNVVPVDVHMRRAAHERGILESSNK 159
Query: 313 RLTPK 317
LT K
Sbjct: 160 NLTSK 164
>gi|432331903|ref|YP_007250046.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanoregula formicicum SMSP]
gi|432138612|gb|AGB03539.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanoregula formicicum SMSP]
Length = 282
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 107/262 (40%), Gaps = 30/262 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
SL T GQ FRW+++ + G + H+I ++ Q G +P E +
Sbjct: 12 FSLDQTLGCGQVFRWERSPDGTWLGIVDNHVIRIR--QEGTR-LRFRGAPEEFIRR---- 64
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ I L L +D G R++RQ P EC + ++CS+N+NI I +
Sbjct: 65 -YFSLDIDLLPLLSSID-NDPVIHRAIGQCNGLRLVRQLPWECTVSYICSTNSNIPTIRR 122
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ LA G F FP +S L GYR
Sbjct: 123 RIGLLAERFGTRIQSGGQTFFTFPEPSCISCKGHEGLTICKLGYRQP------------- 169
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
Y+ GT + S W +R+L ++A L L GVGPK A CI LF+ +
Sbjct: 170 ----YVYGTSCTVTDVRS----WEKKIRRLPYEDARKELMKLQGVGPKAADCILLFAFQK 221
Query: 288 HHAIPVDTHVWKIATRYLLPEL 309
+ A PVD + +I LP+L
Sbjct: 222 YDAFPVDVWIRRILKEQYLPDL 243
>gi|293115699|ref|ZP_05792672.2| 8-oxoguanine DNA glycosylase [Butyrivibrio crossotus DSM 2876]
gi|292808696|gb|EFF67901.1| 8-oxoguanine DNA glycosylase [Butyrivibrio crossotus DSM 2876]
Length = 330
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 110/257 (42%), Gaps = 31/257 (12%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
Q + L T GQ F + K Y ++ + +N + Y+ + K
Sbjct: 59 QKDFILSQTLECGQCFHFVKLDEEDYLISAKSRILHIAQKENRTIFYNTDEQTFDKVWKH 118
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ ++ G++ D R A + ++G R+L Q+ E L+ F+ S N I R
Sbjct: 119 ----YFDLERDYGKIKSILIEKDDRLATAIEAMSGVRILNQEFFETLISFIISQNQQIVR 174
Query: 165 ITKMVDFLASLGSHLGNV-EGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
I ++V A++ S G EG ++ FP + E +++ G+R
Sbjct: 175 IKQIV---AAISSRYGRKGEGIDY--FPGATEILAAGEQGMKDCKAGFR----------- 218
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
AKYI + K+ G L D EA++ L ++ G+G KVA CIALF
Sbjct: 219 ------AKYIIDACE----KYVAGELDEKKLLAADYNEAVEILKSVKGIGSKVANCIALF 268
Query: 284 SLDQHHAIPVDTHVWKI 300
SLD+ +A PVD + +I
Sbjct: 269 SLDKRNAFPVDVWIKRI 285
>gi|153814191|ref|ZP_01966859.1| hypothetical protein RUMTOR_00400 [Ruminococcus torques ATCC 27756]
gi|317499986|ref|ZP_07958222.1| 8-oxoguanine DNA glycosylase [Lachnospiraceae bacterium 8_1_57FAA]
gi|336438213|ref|ZP_08617853.1| hypothetical protein HMPREF0990_00247 [Lachnospiraceae bacterium
1_1_57FAA]
gi|145848587|gb|EDK25505.1| 8-oxoguanine DNA-glycosylase (ogg) [Ruminococcus torques ATCC
27756]
gi|316898703|gb|EFV20738.1| 8-oxoguanine DNA glycosylase [Lachnospiraceae bacterium 8_1_57FAA]
gi|336014019|gb|EGN43882.1| hypothetical protein HMPREF0990_00247 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 274
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 84/168 (50%), Gaps = 22/168 (13%)
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPS 192
A + +G R+L+QD E ++ F+ S +NI RI M+ ++ G EG ++ FP
Sbjct: 93 AGFGSGIRILQQDLWEMIVTFILSQQSNIPRIKSMIQRISERYGDKKETGEGHVYYAFPR 152
Query: 193 LERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E+L SE ELR GYRS KY+ T +++ + G L
Sbjct: 153 AEQLVQASEEELRALKLGYRS-----------------KYLCNTAEMV----AEGTVNLE 191
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
++++ EA L L G+G KVA CI LF+L + A PVDTH+ K+
Sbjct: 192 KIKEMSYDEAKKELLKLSGIGSKVADCICLFALHKLDAFPVDTHIQKV 239
>gi|325971570|ref|YP_004247761.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta globus
str. Buddy]
gi|324026808|gb|ADY13567.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta globus
str. Buddy]
Length = 253
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 117/260 (45%), Gaps = 41/260 (15%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ + L T +GQ+F W + + Y + L+S + +H S S P
Sbjct: 3 ASIHLEATLFSGQSFAWTQQDGI-YQAVLNQELVSFT-----EESFHTVVS-SNPQ---- 51
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + +M + S+ D +E + G +L QDP E L+ FL S NNNI RI
Sbjct: 52 LRHYFDMDWEYEKALSHLSSLDGHLSEAIQSYRGIHILNQDPWEVLMSFLLSQNNNIKRI 111
Query: 166 TKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
M L+ + G+H +EG ++ FP + L V++ ELR G G+R
Sbjct: 112 RSMYQKLSEVYGAH---IEG-PWYAFPRPQHLQGVTQRELRALGMGFR------------ 155
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A Y+ +D +Q+ L + L Q+A + L T+ GVGPKVA CI F+
Sbjct: 156 -----APYL---LDAIQNH-----AILAGIPLLSDQKASEELQTIKGVGPKVALCILAFA 202
Query: 285 LDQHHAIPVDTHVWKIATRY 304
+ A P+DT + K+ Y
Sbjct: 203 YHRQQAFPLDTWMLKVMKTY 222
>gi|331087788|ref|ZP_08336714.1| hypothetical protein HMPREF1025_00297 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330409769|gb|EGG89205.1| hypothetical protein HMPREF1025_00297 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 274
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 84/168 (50%), Gaps = 22/168 (13%)
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPS 192
A + +G R+L+QD E ++ F+ S +NI RI M+ ++ G EG ++ FP
Sbjct: 93 AGFGSGIRILQQDLWEMIVTFILSQQSNIPRIKSMIQRISEWYGDKKETGEGHVYYAFPR 152
Query: 193 LERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E+L SE ELR GYRS KY+ T +++ + G L
Sbjct: 153 AEQLVQASEEELRALKLGYRS-----------------KYLCNTAEMV----AEGTVNLE 191
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
++++ EA L L G+G KVA CI LF+L + A PVDTH+ K+
Sbjct: 192 KIKEMSYDEAKKELLKLSGIGSKVADCICLFALHKLDAFPVDTHIQKV 239
>gi|336254135|ref|YP_004597242.1| DNA-(apurinic or apyrimidinic site) lyase [Halopiger xanaduensis
SH-6]
gi|335338124|gb|AEH37363.1| DNA-(apurinic or apyrimidinic site) lyase [Halopiger xanaduensis
SH-6]
Length = 293
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 82/160 (51%), Gaps = 23/160 (14%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
G R++ CL+ F+CS+ ++RI MV LA G+ L +V+G ++ FP+ E+L+
Sbjct: 109 GMRLVTDPAFGCLISFICSAQMRVSRIHGMVSTLAREYGTPL-DVDGRTYYAFPTPEQLA 167
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
+E ELR G GYR A Y+ T +++ + AE R L
Sbjct: 168 TATEAELRELGLGYR-----------------APYVVRTAEMVADGEAHPAE----ARNL 206
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ + A D LC GVG KVA C+ LFSL A+P+DT +
Sbjct: 207 EYEAARDYLCQFVGVGDKVADCVLLFSLGFDEAVPLDTWI 246
>gi|448377622|ref|ZP_21560318.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halovivax
asiaticus JCM 14624]
gi|445655566|gb|ELZ08411.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halovivax
asiaticus JCM 14624]
Length = 336
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 118/264 (44%), Gaps = 39/264 (14%)
Query: 52 LTFPTGQTFRWKKTGPLQYTGPIGPH----------LISLKHLQNGDVCYHIHTSPSEPA 101
LT +GQT+RW ++ Y G P ++ ++ +G T+ +EP
Sbjct: 58 LTIESGQTYRWTRSDGELYAGSRAPEAWYELAIDGEVVRVRSTDDG--LEWESTTDAEPI 115
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+ L ++ + + + D + G R++ P + L+ F+CS+
Sbjct: 116 LRRRLRLDDDLPAIVAD-----APDDTLVEDAYATHRGLRLVTDPPFDTLIAFICSAQMR 170
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
+ RI +MV LA+ +G +H+FP+ ++L+ +E ELR+ G GYR
Sbjct: 171 VERINEMVTTLATEYGTQIEFDGRTYHDFPTPDQLAAATESELRDLGLGYR--------- 221
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
A Y+ T ++ A+ R L +EA + L GVG KVA CI
Sbjct: 222 --------APYVVETARMVADGEGHPAD----ARDLPYEEAREYLTRFVGVGEKVADCIC 269
Query: 282 LFSLDQHHAIPVDTHVWK-IATRY 304
LF+L+ A+P+DT + K IAT Y
Sbjct: 270 LFALEFDEAVPLDTWIRKAIATYY 293
>gi|126178603|ref|YP_001046568.1| 8-oxoguanine DNA glycosylase [Methanoculleus marisnigri JR1]
gi|125861397|gb|ABN56586.1| 8-oxoguanine DNA glycosylase domain protein [Methanoculleus
marisnigri JR1]
Length = 285
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 112/281 (39%), Gaps = 43/281 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKH------LQNGDVCYHIHTSPSEPA 101
L LT GQ FRW+KT + G G + ++ + GD + +
Sbjct: 12 FDLDLTLACGQAFRWEKTDGW-WQGVAGGRAVRIRQDANRLTFEGGDAGF-VRDYFQLDQ 69
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
+A+L ++ ++G + +CR G R++RQ P ECL+ ++C++N N
Sbjct: 70 DLNAILSSIDRDPAIGA-----AVRECR---------GLRLVRQPPWECLISYICATNTN 115
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I + + +A + + FP E L VS +L + GYR+
Sbjct: 116 IPAVKRRAALMAERYGRPVDGPFGRTYAFPEPEALGAVSRADLWDCKLGYRTD------- 168
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
+ A Y T D W + L +EA + L GVGPK A C+
Sbjct: 169 ---YVHEAAGYATEYPD-----------WAERIAALPFEEARETLMRFRGVGPKAADCVL 214
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
LF+ A PVD + +I L +L G TP R+
Sbjct: 215 LFAFGFFEAFPVDVWIRRIVRETYLSDLTGKNCTPPEYERI 255
>gi|410728239|ref|ZP_11366420.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium sp. Maddingley MBC34-26]
gi|410597178|gb|EKQ51811.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium sp. Maddingley MBC34-26]
Length = 310
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 117/252 (46%), Gaps = 13/252 (5%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FR+ + + LI +K + + Y+ +E + L
Sbjct: 21 FKLKHIFECGQIFRFWEVEENDFIVIAFGKLIEVKEEGSNIIIYN----TTEEEFYTIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ ++ E S D + ++ G RVL QDP E L+ F+ S+ NNI I K
Sbjct: 77 RYFDLDRDYSDIKEELS-KDPLLKKSIEFGYGVRVLNQDPFEMLISFIISARNNIPSIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ +++ N +G ++ FPS+E + S E++ G +RS +L +++ +
Sbjct: 136 TVNKISTKWGKEINYKGNTYYAFPSIEEIKDASLEEIQETGASFRSKY---ILDTIKNVY 192
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLS-LRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
+ I D + S ++ L+ +R L E AL + GVG KVA CI LFS++
Sbjct: 193 NAS--INEVSDGNVDEESSYLKYDLNYIRSLSDDECHSALQSFKGVGSKVADCIMLFSME 250
Query: 287 QHHAIPVDTHVW 298
+ A PVD VW
Sbjct: 251 KTSAFPVD--VW 260
>gi|221054608|ref|XP_002258443.1| n-glycosylase/dna lyase [Plasmodium knowlesi strain H]
gi|193808512|emb|CAQ39215.1| n-glycosylase/dna lyase, putative [Plasmodium knowlesi strain H]
Length = 614
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 20/121 (16%)
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
F F+EFPS++ +S + E +LRN GFGYRS+ Y+ + +L +
Sbjct: 394 FPFYEFPSIDIISKLREQDLRNLGFGYRSS-----------------YVIESAKMLVKR- 435
Query: 245 SGGAEWLLSLRK-LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G EW+ L+K + ID L PG+G KVA CI LF L++ IP+DTH++ I +
Sbjct: 436 -GSEEWIEDLKKEKSTKNCIDQLIQFPGIGLKVANCICLFGLNKFDCIPIDTHIYDIIYK 494
Query: 304 Y 304
Y
Sbjct: 495 Y 495
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 29/174 (16%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
KW +N+ ++EL L F GQ F + + Y G + + K G + Y +
Sbjct: 100 KWKRINVKKNELQLKYCFLIGQEFCFSEVCMDMYIGLVNKKIYLFKETDEG-IFYQCVYN 158
Query: 97 PSEPAAKSA--------------------LLDFLNMGISLGELWEGFSASDCRFAELAKY 136
P A + + DF N+ L + E + D R E+
Sbjct: 159 PRSEAGDGSQERQCDNPQNGNTNDHDEEDVKDFFNLQFPLSKNIEMWKRKDKRMKEITDK 218
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVD--------FLASLGSHLGNV 182
++G R+L+ D VE FLCS+NNNI RIT M+D FLA++ H +V
Sbjct: 219 ISGLRILKADSVESFFSFLCSTNNNIPRITLMIDCLRRRYGKFLATVIFHGEDV 272
>gi|448340928|ref|ZP_21529896.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
gari JCM 14663]
gi|445629403|gb|ELY82690.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
gari JCM 14663]
Length = 293
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 121/276 (43%), Gaps = 33/276 (11%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP-- 97
P++ L L T +GQ++ W++ YT P + +G+V + T
Sbjct: 7 PIDDLGGGLDLYRTLESGQSYLWQRGDGEMYTDTPAPDAW-YATVVDGEVI-RVRTRDGR 64
Query: 98 ----SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
S A S + L + L E D E + G R++R P CL+
Sbjct: 65 LEWESTADAGSTVRRLLRLEDDL-EAIAAAGPDDPLLREAYEAHPGMRLVRDPPFGCLIS 123
Query: 154 FLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CS+ + RI MV LA GS + +G +H FP+ ++L+ +E ELR+ G GYR
Sbjct: 124 FICSAQMRVGRIHTMVTALAREYGSPIA-FDGATYHAFPTPDQLATATEAELRDLGLGYR 182
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
A Y+ T +++ ++ A+ R L+ + A + L GV
Sbjct: 183 -----------------APYVVRTAEMVADGNAHPAD----ARDLEYEAAREFLTQFVGV 221
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
G KVA C+ LFSL A+P+DT + K A PE
Sbjct: 222 GDKVADCVLLFSLGFDEAVPLDTWI-KSAIEEYYPE 256
>gi|73669505|ref|YP_305520.1| 8-oxoguanine DNA glycosylase [Methanosarcina barkeri str. Fusaro]
gi|72396667|gb|AAZ70940.1| 8-oxoguanine DNA glycosylase [Methanosarcina barkeri str. Fusaro]
Length = 283
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 114/262 (43%), Gaps = 35/262 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L T GQ FRW++ G +TG +G +I L Q G + + P L
Sbjct: 12 FDLNYTLDCGQVFRWERNGDW-WTGVVGDQVIRLSQDQ-GQLIVDSNLQPE------FLT 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + +L ++E + + KY G R++RQDP ECL+ ++ S+ ++I I K
Sbjct: 64 RYFRLDDNLPSIYESINRDLLIDRAIRKY-RGLRLIRQDPWECLISYMLSTASSIPTIQK 122
Query: 168 MVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L+ S G + + FP E L+ V EL G+R
Sbjct: 123 RIYLLSRSFGQEIEP----GYFSFPDPETLANVDPAELDKCKLGFR-------------- 164
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
T + + + G L L +L+ + A + L L G+G KVA C+ LF+ D
Sbjct: 165 -------TQNIIAAAEEVASGELDLDVLFRLEYRYARERLMRLRGIGEKVADCVLLFAFD 217
Query: 287 QHHAIPVDTHVWKIATRYLLPE 308
+ A PVDTH+ +I Y + +
Sbjct: 218 KMEAFPVDTHIRQIIQHYHIDD 239
>gi|28212004|ref|NP_782948.1| 8-oxoguanine DNA glycosylase [Clostridium tetani E88]
gi|28204447|gb|AAO36885.1| 8-oxoguanine DNA glycosylase [Clostridium tetani E88]
Length = 311
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 122/264 (46%), Gaps = 17/264 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G +I ++ Q+ + Y+ + E K+
Sbjct: 30 FELNHIFDCGQCFRWNRQENGNYIGVAYGKVIEVEKKQDEIILYNTNEKEFEEIWKN--- 86
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ ++ + + D E Y G R+L+Q+P E ++ F+ S+NN I I K
Sbjct: 87 -YFDLNREYSKI-KTILSEDKLLKEAIDYGYGIRILQQEPFEIIISFIISANNRIPMIKK 144
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ +G E++ FPS+E+L S E+ G G+RS ++ +V + +
Sbjct: 145 AIEKISRKWGKKLEYKGKEYYGFPSIEKLYNASIEEVEECGTGFRS---KYIVDTVSKIY 201
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
+ T D L+ ++ +L+ E AL G+GPKVA CI LFS+ +
Sbjct: 202 ENI-----TTDSLEYNEQFDINYIKTLQD---DECHSALQNFKGIGPKVADCIMLFSMGK 253
Query: 288 HHAIPVDTHVWKIATR-YLLPELA 310
A PVD V + YL P+++
Sbjct: 254 DSAFPVDVWVKRAMQHFYLAPDVS 277
>gi|304439103|ref|ZP_07399022.1| possible DNA-(apurinic or apyrimidinic site) lyase [Peptoniphilus
duerdenii ATCC BAA-1640]
gi|304372462|gb|EFM26049.1| possible DNA-(apurinic or apyrimidinic site) lyase [Peptoniphilus
duerdenii ATCC BAA-1640]
Length = 291
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 112/251 (44%), Gaps = 33/251 (13%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH-TSPSEPAAKSALLDFLNM 112
F GQ FRW + + + + ++GD IH E + L+D+ ++
Sbjct: 24 FKCGQAFRWDELEDGSFINVAFNRVFRV--YRDGD---DIHLEGVDEDFYNAHLVDYFDL 78
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DF 171
S ++ S D E Y G R+L Q+P E ++ F+ S+NNNI RI + D
Sbjct: 79 NRSYLDIKNDLS-KDETLKEAMLYGEGIRILNQEPFETIITFIISANNNIKRIKNSIRDI 137
Query: 172 LASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQA 230
G + N G E++ FP++E LS +LR A G+R R K +
Sbjct: 138 SRMYGEKIENPYG-EYYAFPTVEALSKADPKDLREFAKVGFRD----------ERIVKAS 186
Query: 231 KYI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
+ I G V++ L +DL+ A + L LPGVGPKV CI LF+ +
Sbjct: 187 QMILNGEVEID------------PLFTMDLELAREELMKLPGVGPKVCDCILLFAFKRSE 234
Query: 290 AIPVDTHVWKI 300
+ PVD + ++
Sbjct: 235 SFPVDVWIKRV 245
>gi|167042244|gb|ABZ06975.1| putative helix-hairpin-helix motif protein [uncultured marine
crenarchaeote HF4000_ANIW93J19]
Length = 282
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 85/171 (49%), Gaps = 21/171 (12%)
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSL 193
K+ G RV RQDP +C + F+ SSN+NI I + L E EF FP
Sbjct: 81 TKHYPGLRVTRQDPFQCCISFIVSSNSNIPNIRMRLQKLCRKFGTKVRFEKREFFLFPRP 140
Query: 194 ERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
+RL+ + +L+ GYRS KY+ +D ++ SG ++
Sbjct: 141 KRLAKATLQDLQECKLGYRS-----------------KYV---LDASRAIASGEIDFD-E 179
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L+K D +E + L LPG+G KVA C+ LFSL++ A P+DT + KI +Y
Sbjct: 180 LKKADYREGKELLLKLPGIGDKVADCVMLFSLEKLEAFPLDTWIVKILQKY 230
>gi|167042967|gb|ABZ07681.1| putative HhH-GPD superfamily base excision DNA repair protein
[uncultured marine crenarchaeote HF4000_ANIW137N18]
Length = 282
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 81/171 (47%), Gaps = 21/171 (12%)
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSL 193
K+ G RV RQDP +C + F+ SSN+NI I + L E EF FP
Sbjct: 81 TKHYPGLRVTRQDPFQCCISFIVSSNSNIPNIRMRLQKLCRKFGTKVRFEQREFFLFPRP 140
Query: 194 ERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
+ L+ + +L+ GYRS KY+ T + S E
Sbjct: 141 KILAKATLQDLQECKLGYRS-----------------KYVLDTSRAVASGEIDFDE---- 179
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L+K+D QE + L LPG+G KVA C+ LFSL++ A P+DT + KI +Y
Sbjct: 180 LKKVDYQECKELLLKLPGIGDKVADCVMLFSLEKLEAFPLDTWIVKILQKY 230
>gi|365128463|ref|ZP_09340633.1| 8-oxoguanine DNA-glycosylase (ogg) [Subdoligranulum sp.
4_3_54A2FAA]
gi|363622853|gb|EHL73998.1| 8-oxoguanine DNA-glycosylase (ogg) [Subdoligranulum sp.
4_3_54A2FAA]
Length = 261
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 108/274 (39%), Gaps = 65/274 (23%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
+ L+ +L L + +GQ FRW+ G Y P ++++
Sbjct: 1 MQLSIPDLDLRHIYESGQCFRWEPLGQDTYRIPAFGRTLTVRQ----------------- 43
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLA--------------GARVLRQD 146
+ DF LW + D +A + +A G R+LRQD
Sbjct: 44 --TDGVFDFSCGAGEFDALWRSYFDLDTDYAAIKARVAPRDAYLQAAVAYGWGMRILRQD 101
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN 206
E ++ F+ S NNNI RI K +L ++ + FP+ L+ LR
Sbjct: 102 LWEVIVSFIVSQNNNIPRIRK----------NLRDLCAMQGGAFPTPAALAAAQPETLRA 151
Query: 207 AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDAL 266
G GYR+ L + F QA G +D +LR + EA AL
Sbjct: 152 LGLGYRA----EYLCAAGAHFTQA----GALD--------------ALRAMSYPEAHTAL 189
Query: 267 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ GVGPKVA CI LF L A PVDTHV +I
Sbjct: 190 RAVKGVGPKVADCICLFGLHHVDAFPVDTHVKQI 223
>gi|254443556|ref|ZP_05057032.1| 8-oxoguanine DNA glycosylase, N-terminal domain family
[Verrucomicrobiae bacterium DG1235]
gi|198257864|gb|EDY82172.1| 8-oxoguanine DNA glycosylase, N-terminal domain family
[Verrucomicrobiae bacterium DG1235]
Length = 293
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 117/275 (42%), Gaps = 29/275 (10%)
Query: 36 SKWTPLNLTQ--SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHI 93
+ WTPL+L+ +E +L T GQ FRW I ++ + +
Sbjct: 5 TAWTPLSLSHPFTEQTLAETLNGGQAFRWNYHETESCWQGIWDQYVARIRFSDNNTLLAS 64
Query: 94 HTSPSEPAAKSALLDFLNMGISLGELWEGFS-ASDCRFAELAKYLAGARVLRQDPVECLL 152
+ S P +AL ++L + + + SD K G R+L+Q E +L
Sbjct: 65 YPK-SSPYTPNALANYLRLDFAWTTAVDSLPWRSDAHLDTALKAFPGLRILKQPFPEAVL 123
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
FLCS+ I +I M + +A LGN H P+ ++L+ SE +LR+ G G+R
Sbjct: 124 CFLCSATKQIPQIKVMCERMAET---LGNELLPGIHALPTWDQLAAASETDLRSLGLGFR 180
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
AK I T D+L ++ E L + + L +LPG+
Sbjct: 181 -----------------AKNIKKTADILAAQ----PELLSQIESAPYETGKTQLVSLPGI 219
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKI-ATRYLL 306
G K+A C LF + A PVDT + K+ RY L
Sbjct: 220 GEKIADCALLFGAAKLEAFPVDTWIIKVLEKRYQL 254
>gi|331092205|ref|ZP_08341035.1| hypothetical protein HMPREF9477_01678 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401977|gb|EGG81551.1| hypothetical protein HMPREF9477_01678 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 270
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 83/176 (47%), Gaps = 22/176 (12%)
Query: 126 SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLGNVEG 184
+D +E ++ G R+L QD E ++ FL S NNIARI K + + G N G
Sbjct: 82 NDRYLSEAGRFGYGIRILHQDLWEMIVSFLISQQNNIARIRKCIQNICEEYGERQLNFRG 141
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++ FP+ E L + + EL+ GYRS KY+ T +
Sbjct: 142 EVYYTFPTAETLEKLGDDELKACNLGYRS-----------------KYVVRTAKSV---- 180
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
G L ++K+ ++A L L GVG KVA CI LF+L A PVDTH+ ++
Sbjct: 181 VAGEISLEQVKKMSYRKAKAELLNLFGVGEKVADCICLFALHHLQAFPVDTHIRQV 236
>gi|406699257|gb|EKD02464.1| hypothetical protein A1Q2_03224 [Trichosporon asahii var. asahii
CBS 8904]
Length = 671
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 120/271 (44%), Gaps = 43/271 (15%)
Query: 79 ISLKHLQNGDVCYH-IHTSPS--EPAAKSALL---DFLNMG-----ISLGELWEGFSASD 127
I + L+ GD+ + +PS PAA L D+ + L + WE +
Sbjct: 349 IYEQFLKQGDLLLSAVSATPSTASPAADDTRLWLEDYFQLSQYPDLTKLYQAWEERDPAL 408
Query: 128 CRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSH----LGNVE 183
EL G RVLRQDP + F+ S+NN+I RI+ ++ L S L N +
Sbjct: 409 FGKTELNAKAVGVRVLRQDPWD----FITSTNNHITRISSLLHKLCIAFSSPVLTLDNPD 464
Query: 184 G---FEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDV 239
G +H FP E + L E LR+ GFGYR+A +S L S+R + VD
Sbjct: 465 GQGQTTYHLFPRAEHIPLTGLEQVLRDLGFGYRAAFIASTLNSLR--------VFDNVD- 515
Query: 240 LQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
E L R L+E L L GVG KVA C+ L +DQ +P+DTH+ +
Sbjct: 516 ---------EELARWRAAPLEETRGRLLELKGVGRKVADCVQLMCMDQLSLVPIDTHLHR 566
Query: 300 IATRY--LLPELAGVRLTPKLCSRVAEAFCE 328
IA R+ L R++P + V E
Sbjct: 567 IAMRHPSFPSSLKAKRMSPAIYGDVQSFLAE 597
>gi|336476758|ref|YP_004615899.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosalsum zhilinae
DSM 4017]
gi|335930139|gb|AEH60680.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosalsum zhilinae
DSM 4017]
Length = 287
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 116/255 (45%), Gaps = 32/255 (12%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L T GQ FRW K + +TG +I ++G V H P+E
Sbjct: 12 DFNLDHTLGCGQVFRWDKKDDV-WTGVAEGEIIHAIQQKDGSVLIDSHL-PAE-----FF 64
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+++ + L + + + C + + G R++RQDP ECL+ ++ ++ ++I I
Sbjct: 65 INYFRIDDDLCHILQKINKDPCINKAINDFY-GLRLIRQDPWECLISYMLATASSIPTIK 123
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K + ++L S GN + FP ++ +L+ G+R+ + V R
Sbjct: 124 KRI---SNLCSMFGNEITDGYFSFPEPSDIANACISDLQECKLGFRTQRIKNAAEIVNR- 179
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
G +D L SL+K++ +A L ++ GVG KVA C+ LFS+D
Sbjct: 180 --------GNID------------LESLQKMEYYDARKKLMSIEGVGEKVADCVLLFSMD 219
Query: 287 QHHAIPVDTHVWKIA 301
+ + PVDTH+ KI
Sbjct: 220 KLESFPVDTHIKKIV 234
>gi|429764603|ref|ZP_19296914.1| 8-oxoguanine DNA-glycosylase [Clostridium celatum DSM 1785]
gi|429187672|gb|EKY28580.1| 8-oxoguanine DNA-glycosylase [Clostridium celatum DSM 1785]
Length = 308
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 117/277 (42%), Gaps = 37/277 (13%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ ++ GQ FRW++ Y +I + N + Y+ + KS
Sbjct: 20 DFNIKQILECGQCFRWERITDTNYIIVAFRRVIEVVQEGNKVIIYNTNMKDFNEIWKS-- 77
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ ++ G + E S D + ++ G R+L QDP E L+ F+ S+ N+I I
Sbjct: 78 --YFDLDRDYGVIKEELS-KDELLRKSVEFGYGIRILNQDPFEILISFIISARNSIPSIM 134
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K + ++ +G ++ FP+ E+L + E++ G +RS
Sbjct: 135 KTIKKISEKWGDKLEYKGNIYYAFPTPEQLKKATLDEIKETGASFRS------------- 181
Query: 227 FKQAKYITGTVD----VLQSKHSGGAEWLLSLRKLDLQ--------EAIDALCTLPGVGP 274
KYI T+ +++K G E LR+ DL E AL GVG
Sbjct: 182 ----KYIVDTIGNVNAAIEAKQKG--ELTEELRQFDLDYIKSLPSAECHKALQNFMGVGA 235
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELA 310
KVA CI LFS+ +H A PVD + + YL P+++
Sbjct: 236 KVADCIMLFSMGKHSAFPVDVWIKRAMIHFYLAPDVS 272
>gi|38492207|gb|AAR22409.1| 8-oxo-guanine DNA glycosylase [Plasmodium falciparum]
Length = 757
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 73/138 (52%), Gaps = 23/138 (16%)
Query: 171 FLASLGSHLG---NVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
F SL + L N + F F+EFP ++ LS + E +LR+ GFGYRS
Sbjct: 531 FYESLKTTLKDERNQKTFHFYEFPKIQVLSKLKEDDLRSLGFGYRSN------------- 577
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLD-LQEAIDALCTLPGVGPKVAACIALFSLD 286
Y+ + +L + G EW+ +L+ + + ID L PG+G KVA CI LF L+
Sbjct: 578 ----YVIESAKMLVN--MGEEEWIENLKNEEKTKTCIDKLIQFPGIGLKVANCICLFGLN 631
Query: 287 QHHAIPVDTHVWKIATRY 304
++ IP+DTH++ I +Y
Sbjct: 632 KYDCIPIDTHIYDIIYKY 649
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 68/179 (37%), Gaps = 32/179 (17%)
Query: 26 NPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQ 85
N + N +W L+ + ++L L GQ F + + Y G + I L
Sbjct: 148 NSPIIIKNYEKRWIKLDASTNDLKLKYCLLIGQEFCFNEVEKNMYIGLLNIKKIYLFKET 207
Query: 86 NGDVCYH--------------------IHTSPSEPAAKSAL------------LDFLNMG 113
D+ Y T S K+ L +F N+
Sbjct: 208 EKDIFYQCLYDDDNDCDNYNDSNNNNVTCTGYSNKDDKNILKKRKSNNHHNEVYNFFNLH 267
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
L E E + D R E+ + G R+L+ DPVE FLCS+NNNI RIT M+D L
Sbjct: 268 FPLNENIEIWKKKDQRMKEITNKITGLRILKTDPVESFFSFLCSTNNNIPRITLMIDSL 326
>gi|171909544|ref|ZP_02925014.1| 8-oxoguanine DNA glycosylase domain protein [Verrucomicrobium
spinosum DSM 4136]
Length = 300
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 109/254 (42%), Gaps = 32/254 (12%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ L T +GQ F W + + G IG L+++ + G + H+ + ++
Sbjct: 20 ASFDLAATLNSGQVFHWHEQDG-GFLGLIGGELVTVSQPEPGML--HVSAGHGDLVSRYL 76
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
LD L ++ D +Y G R+LRQ ECL F+ SS +A I
Sbjct: 77 ALDH-----DLMKMQRTLPKRDAHLKRALQYSPGLRILRQPRWECLATFITSSLKQVAHI 131
Query: 166 TKM-VDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
++ + G + + G H +P+ E L+ E LR G GYR
Sbjct: 132 RQISLTLRERFGQPVATLNGKVLHTYPTPEALARAGEAALRACGLGYR------------ 179
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
AK++ T S+ + G L + +D + A + LCTLPGVGPK+A C LF+
Sbjct: 180 -----AKFLHQTA----SRIAEGNFDLETPVAVDDEAACELLCTLPGVGPKIAQCTLLFA 230
Query: 285 LDQHHAIPVDTHVW 298
++ P+D VW
Sbjct: 231 YERLGVFPID--VW 242
>gi|4587152|dbj|BAA76636.1| OGG1 protein type 2c [Homo sapiens]
Length = 100
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 59/90 (65%), Gaps = 1/90 (1%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ + ++L +L+ + + D F E+A+ G R+LRQDP+ECL F+CSSNNNIARIT M
Sbjct: 4 YFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSNNNIARITGM 63
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
V+ L + G L ++ +H FPSL+ L+
Sbjct: 64 VERLCQAFGPRLIQLDDVTYHGFPSLQALA 93
>gi|361132024|gb|EHL03639.1| putative N-glycosylase/DNA lyase [Glarea lozoyensis 74030]
Length = 319
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 98/221 (44%), Gaps = 49/221 (22%)
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
E + L +LN+ +L L+E +S+ D F + A G R+L+QD E L+
Sbjct: 14 DEDDTERLLKHYLNLEPNLTALYEYWSSVDPNFKKRAPKFTGVRILKQDAWEALMV---- 69
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQ 216
NN+ G +G+V+ +H+FP+ E L+ E LR GFGYR
Sbjct: 70 --NNLC---------LHYGPLIGHVDAQPYHDFPAPETLTGPGVEAHLRELGFGYR---- 114
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD--------------LQEA 262
A YI T ++ + G WL +L +D ++A
Sbjct: 115 -------------ASYIAKTAKMVAKEKPEG--WLQNLSNVDHSPDQKMPPGGRDGYRKA 159
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ L L GVGPKVA C+ L L ++PVDTHVW+ A R
Sbjct: 160 HEELLQLQGVGPKVADCVCLMGLGWGESVPVDTHVWQFAQR 200
>gi|218133268|ref|ZP_03462072.1| hypothetical protein BACPEC_01133 [[Bacteroides] pectinophilus ATCC
43243]
gi|217992141|gb|EEC58145.1| 8-oxoguanine DNA-glycosylase (ogg) [[Bacteroides] pectinophilus
ATCC 43243]
Length = 293
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 118/283 (41%), Gaps = 34/283 (12%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
S+ + T GQ F + + Y L+ G V Y E +S
Sbjct: 16 SDFDIAQTLECGQCFHFNRIDDHDYALAANGMLLHAAQTDTGIVLYDTDKGQYENIWRS- 74
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+ +M + E D R + + G R+L Q+ E L+ F+ S N I I
Sbjct: 75 ---YFDMDRDYSSIKEYLVRYDSRLEDAINAMGGVRILNQEFFETLMSFIISQNKQIPHI 131
Query: 166 TKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVE-LRNAGFGYRSAPQSSLLFSV 223
++V ++ G++LG++ G E++ FP + L + VE R G+R+ + SV
Sbjct: 132 KQIVAQISHDYGTYLGDIAGQEYYAFPDCDTLYSNAGVEAFRACKTGFRAPYLWDAVCSV 191
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
+ G ++ + LR + E ++ L + GVG KVA+C+ALF
Sbjct: 192 KE---------GRINERR------------LRTDGIAECLEELMQIKGVGTKVASCVALF 230
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
SL Q A PVD VW R + R TPK +V E+F
Sbjct: 231 SLGQTAAFPVD--VW--IKRMMEQMYFDGRETPK---KVIESF 266
>gi|410670363|ref|YP_006922734.1| 8-oxoguanine DNA glycosylase-like protein [Methanolobus
psychrophilus R15]
gi|409169491|gb|AFV23366.1| 8-oxoguanine DNA glycosylase-like protein [Methanolobus
psychrophilus R15]
Length = 279
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 115/271 (42%), Gaps = 55/271 (20%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ +L T GQ FRW K + G + ++ L+ +P +K
Sbjct: 7 EDFNLDYTLDCGQAFRWNKIDSW-WVGVVNGSIVRLQQ---------------DPESKEV 50
Query: 106 LLD----------FLNMGISLGELWEGFSASDCRFAELA-KYLAGARVLRQDPVECLLQF 154
L+D + L ++ + RF + A + G R++RQDP ECL+ +
Sbjct: 51 LVDSSLSEAYMQNYFRFDDDLDDILRQVNKD--RFMDEAIRKYRGLRLIRQDPWECLISY 108
Query: 155 LCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
+ ++ +I I + + L+S G L + + FP E L+ + + ELR+ G+RS
Sbjct: 109 MLATAWSIPNIKRGISLLSSTYGEELED----GLYSFPEPEALAGLCDGELRDCKLGFRS 164
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
R K A+++ VL + G +EA L L G+G
Sbjct: 165 ----------NRIVKAARHVMDGNLVLDELFTIG-----------YKEAKQRLMFLEGIG 203
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
KVA CI LF+ D+ A PVDTHV K+ Y
Sbjct: 204 EKVADCILLFAFDKMEAFPVDTHVEKVVRHY 234
>gi|124506889|ref|XP_001352042.1| N-glycosylase/DNA lyase, putative [Plasmodium falciparum 3D7]
gi|23505071|emb|CAD51853.1| N-glycosylase/DNA lyase, putative [Plasmodium falciparum 3D7]
Length = 657
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 73/138 (52%), Gaps = 23/138 (16%)
Query: 171 FLASLGSHLG---NVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
F SL + L N + F F+EFP ++ LS + E +LR+ GFGYRS
Sbjct: 431 FYESLKTTLKDERNQKTFHFYEFPKIQVLSKLKEDDLRSLGFGYRSN------------- 477
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLD-LQEAIDALCTLPGVGPKVAACIALFSLD 286
Y+ + +L + G EW+ +L+ + + ID L PG+G KVA CI LF L+
Sbjct: 478 ----YVIESAKMLVN--MGEEEWIENLKNEEKTKTCIDKLIQFPGIGLKVANCICLFGLN 531
Query: 287 QHHAIPVDTHVWKIATRY 304
++ IP+DTH++ I +Y
Sbjct: 532 KYDCIPIDTHIYDIIYKY 549
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 68/179 (37%), Gaps = 32/179 (17%)
Query: 26 NPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQ 85
N + N +W L+ + ++L L GQ F + + Y G + I L
Sbjct: 48 NSPIIIKNYEKRWIKLDASTNDLKLKYCLLIGQEFCFNEVEKNMYIGLLNIKKIYLFKET 107
Query: 86 NGDVCYH--------------------IHTSPSEPAAKSAL------------LDFLNMG 113
D+ Y T S K+ L +F N+
Sbjct: 108 EKDIFYQCLYDDDNDCDNYNDSNNNNVTCTGYSNKDDKNILKKRKSNNHHNEVYNFFNLH 167
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
L E E + D R E+ + G R+L+ DPVE FLCS+NNNI RIT M+D L
Sbjct: 168 FPLNENIEIWKKKDQRMKEITNKITGLRILKTDPVESFFSFLCSTNNNIPRITLMIDSL 226
>gi|312136299|ref|YP_004003636.1| DNA-(apurinic or apyrimidinic site) lyase [Methanothermus fervidus
DSM 2088]
gi|311224018|gb|ADP76874.1| DNA-(apurinic or apyrimidinic site) lyase [Methanothermus fervidus
DSM 2088]
Length = 293
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 124/277 (44%), Gaps = 44/277 (15%)
Query: 48 LSLPLTFPTGQTFR--WKKTGPLQYTGPI----GPHLISLKHLQNG-DVCYHIHTSPSEP 100
L LT +GQT + WK Y I P LI ++ N +V Y T S
Sbjct: 18 FDLKLTMFSGQTSQPPWKHKNDY-YQELIFIEDKPCLIKVRQKNNHLEVKYETDTKVSPQ 76
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQFLCSSN 159
K + ++ +L + + F+ +D F+ + + G R+ L + EC++ + S+N
Sbjct: 77 DIKKTIKYIFDLEFNLDKFYFNFTQNDSIFSRIVEKYNGLRLFLAHNKYECIISSITSAN 136
Query: 160 NNIARITKMVDFLASLGSHLGNVEGF----EFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
N+I++ T+ + + LG F E + FP E + L ++ +L+ G GYR
Sbjct: 137 NSISKWTRSI---LRIKEKLGRKYVFPSSEEMYYFPDPETV-LENKDKLKYCGVGYR--- 189
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
KYI T +L S + + K++ EA + L LPGVGPK
Sbjct: 190 --------------YKYIVETTKILYENLS-----IDEVNKMEYDEAFETLLELPGVGPK 230
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGV 312
VA CI L+ ++ A PVD +W TR + +L GV
Sbjct: 231 VADCILLYGFRRYEAFPVD--IW--ITR-AIKDLFGV 262
>gi|156097342|ref|XP_001614704.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803578|gb|EDL44977.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 668
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 20/121 (16%)
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
F F+EFPS+E LS + E +LR+ GFGYRS+ Y+ + +L +
Sbjct: 424 FPFYEFPSVENLSKLKEEDLRSLGFGYRSS-----------------YVIESAKMLVKR- 465
Query: 245 SGGAEWLLSLRK-LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G +W+ L+K + ID L PG+G KVA CI LF L++ IP+DTH++ I +
Sbjct: 466 -GSEQWIEDLKKEKKTKNCIDQLVLFPGIGLKVANCICLFGLNKFDCIPIDTHIYDIIYK 524
Query: 304 Y 304
Y
Sbjct: 525 Y 525
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 25/171 (14%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
+W +N++++EL L F GQ F + + Y G + + K + G ++
Sbjct: 133 RWKRINVSKNELQLKYCFLIGQEFCFSEVSRDTYIGLVNNKIYLFKETEEGAFYQCVYDG 192
Query: 97 P-----------------SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG 139
++ + DF N+ L + E + D R E+ + G
Sbjct: 193 GGGSDGSEGSRGGHFDGHTDDDYYQDVSDFFNLEFPLSKHVEMWRKKDKRMNEITDKIRG 252
Query: 140 ARVLRQDPVECLLQFLCSSNNNIARITKMVD--------FLASLGSHLGNV 182
R+LR + VE FLCS+NNNI RIT M+D FLA++ H +V
Sbjct: 253 LRILRANSVESFFSFLCSTNNNIPRITLMIDCLRRRYGRFLATVMFHGQDV 303
>gi|225016230|ref|ZP_03705422.1| hypothetical protein CLOSTMETH_00133 [Clostridium methylpentosum
DSM 5476]
gi|224950988|gb|EEG32197.1| hypothetical protein CLOSTMETH_00133 [Clostridium methylpentosum
DSM 5476]
Length = 281
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 115/268 (42%), Gaps = 33/268 (12%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+ SL T GQ FR+ + G G ++++ Q+G+ TS +
Sbjct: 15 DDFSLADTLDCGQCFRFVPQEDGSFCGFAGGRYLAMR--QDGNQLTFYDTSAEDFERHWK 72
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L + + + +GF D + Y G R+LRQDP E L F+ S NNNI RI
Sbjct: 73 L--YFDFDTDYQAIKQGF-LEDEVLKKSCDYAGGIRILRQDPWETLCSFIISQNNNIPRI 129
Query: 166 TKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
++D L L V G + FP+ E L+ S +L S+ R
Sbjct: 130 KGIIDRLCKLCGE--QVPG--GYAFPTPEALAAKSLDDL-----------------SIMR 168
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
+ +A+Y+ K S G L SL +++ EA L ++ GVGPKVA C+ LF
Sbjct: 169 AGFRARYLLDAA----HKVSTGKIDLPSLYTMEIDEARKTLTSICGVGPKVAECVLLFGF 224
Query: 286 DQHHAIPVDTHVWKIATRYL---LPELA 310
+ A PVD + + T + PE A
Sbjct: 225 HRLEAFPVDVWIKRAITYFYQDGFPEFA 252
>gi|326792401|ref|YP_004310222.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium lentocellum
DSM 5427]
gi|326543165|gb|ADZ85024.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium lentocellum
DSM 5427]
Length = 298
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 112/256 (43%), Gaps = 28/256 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
S ++ +GQ FR++K Y LI + + +T+ SE +
Sbjct: 11 SHFNIGQILESGQVFRFEKISDHSYLLNAKQKLIKITQQPDSSSVAIYNTTASE--VEEI 68
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
+++ ++ + + +A D A+ ++ G R+LRQDP E L+ F+ S N +I I
Sbjct: 69 WMNYFDINTNYENITATLAAKDEWMAKAVEFGKGIRILRQDPWEMLISFIVSQNKSIPHI 128
Query: 166 TK-MVDFLASLGSHLGNVEGF--EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
+ + G + V+G +H FP+ + LS +E +LR G+R
Sbjct: 129 KACLKNITNKFGFPINEVDGMGEHYHSFPTPKELSRATEEDLRECKVGFR---------- 178
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
A YI +D Q +G L L + ++EA L T+ GVGPKVA CI L
Sbjct: 179 -------APYI---MDACQKVLNGDV-ILNDLYIMPIEEAKAKLMTIKGVGPKVADCILL 227
Query: 283 FSLDQHHAIPVDTHVW 298
F+ + P D VW
Sbjct: 228 FAYSRMELFPTD--VW 241
>gi|331086694|ref|ZP_08335771.1| hypothetical protein HMPREF0987_02074 [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330409860|gb|EGG89295.1| hypothetical protein HMPREF0987_02074 [Lachnospiraceae bacterium
9_1_43BFAA]
Length = 275
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 112/251 (44%), Gaps = 27/251 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L +GQ FR ++ Y + SL+ LQ G+ C T P E +
Sbjct: 9 FNLQQICDSGQCFRMERVSENCYR--VIAFGRSLEILQEGEQCTFFCT-PHE--FEEIWN 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ E + +D A++ +G R+LRQD E + FL S N+I RI K
Sbjct: 64 DYFDLETDYQSYIEEINPNDSYLLAAAEWGSGIRILRQDLWEMIASFLISQQNHITRIRK 123
Query: 168 MVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L + G G F+ FP E+L+ + + +L+ GYRS + VR +
Sbjct: 124 CIQNLCETYGEERTGDSGNTFYTFPEPEKLAELGDDDLKACNLGYRSK------YVVRTA 177
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
++G VD L +++ L +A L L GVG KVA CI LF+L
Sbjct: 178 ---KSIVSGEVD------------LDAIQHLPYAKAKKELLKLFGVGEKVADCICLFALH 222
Query: 287 QHHAIPVDTHV 297
A P+DTH+
Sbjct: 223 HLQAFPIDTHI 233
>gi|448585348|ref|ZP_21647741.1| DNA N-glycosylase [Haloferax gibbonsii ATCC 33959]
gi|445726048|gb|ELZ77665.1| DNA N-glycosylase [Haloferax gibbonsii ATCC 33959]
Length = 304
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVRRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDARDFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|389847226|ref|YP_006349465.1| 8-oxoguanine DNA glycosylase [Haloferax mediterranei ATCC 33500]
gi|448614828|ref|ZP_21663856.1| 8-oxoguanine DNA glycosylase [Haloferax mediterranei ATCC 33500]
gi|388244532|gb|AFK19478.1| 8-oxoguanine DNA glycosylase [Haloferax mediterranei ATCC 33500]
gi|445752915|gb|EMA04334.1| 8-oxoguanine DNA glycosylase [Haloferax mediterranei ATCC 33500]
Length = 304
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G ++ +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLISFICSAQMRVSRIFGMQSRLRETYGEAVEFDGETYYAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + +E L D
Sbjct: 179 TTEAELRDLSLGYR-----------------APYVQRTAEMVATGEATPSEAL----GRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D L GVG KVA CI LFSLD A+P+DT +
Sbjct: 218 YEDARDYLTNFVGVGDKVADCILLFSLDYLEAVPLDTWI 256
>gi|291522888|emb|CBK81181.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Coprococcus catus GD/7]
Length = 271
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 110/254 (43%), Gaps = 27/254 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
S+ +GQ FR ++ +Y G + L+ +N + +E +
Sbjct: 9 FSIAQICESGQCFRMEEIQDGRYQVIAGDRYLELEQQEN-----RVRFDCTEVEYRDFWR 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ G + + +D A+ G R+L QD E ++ FL S NNI RI +
Sbjct: 64 HYFDIETDYGAVMAQINPNDTYLTAAAQLAWGVRILNQDLWEMIVTFLISQQNNIVRIRR 123
Query: 168 MV-DFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ + + G+ G ++ FP+ + L+ + E L GYRS V R+
Sbjct: 124 CIKNICETYGTAKTASNGVCYYAFPTAQALAGLEEDALMACNLGYRSK-------YVVRA 176
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
K +G VD+ W+ SL + Q+A L L GVG KVA CI LF+L
Sbjct: 177 AKAVS--SGEVDL---------AWISSL---NYQKARAELLKLFGVGEKVADCICLFALH 222
Query: 287 QHHAIPVDTHVWKI 300
A PVDTH+ ++
Sbjct: 223 HLDAFPVDTHIRQV 236
>gi|257066873|ref|YP_003153129.1| 8-oxoguanine DNA glycosylase [Anaerococcus prevotii DSM 20548]
gi|256798753|gb|ACV29408.1| 8-oxoguanine DNA glycosylase domain protein [Anaerococcus prevotii
DSM 20548]
Length = 300
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 117/281 (41%), Gaps = 41/281 (14%)
Query: 28 QTLTTNKPSKWTPLNLTQSELSLPL---------TFPTGQTFRWKKTGPLQYTGPIGPHL 78
+ TT+ K +N+ +++ L L F GQ F + K YT +
Sbjct: 2 ERYTTDSKRK---INIEETDKGLILREASFEPSHIFECGQCFNFHKEEDGSYTAVFLGKI 58
Query: 79 ISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLA 138
I++ L G + + S D+ ++G+ + + A D + Y
Sbjct: 59 INVLKLDEG---VSLIRNISLEDFNEIFYDYFDLGLDYDAIKKEV-AIDPVMEKATAYGY 114
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLS 197
G R+L Q+ E + F+ S+NN I RI K V ++ G ++G +G +++ FP E L
Sbjct: 115 GIRILNQEVFETTISFIISANNQIPRIKKAVRIISERYGDYIGEYKGRKYYSFPRPEVLM 174
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
V +LR R F+ + + + + + + G + KL
Sbjct: 175 KVKPEDLREYA---------------RVGFRDKRIVEASRMIYEGQLDGAS-------KL 212
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
D ++ L LPGVGPKVA CI LF+ + PVD VW
Sbjct: 213 DTEDLRKKLMELPGVGPKVADCILLFAYHRRETFPVD--VW 251
>gi|345005832|ref|YP_004808685.1| DNA-(apurinic or apyrimidinic site) lyase [halophilic archaeon
DL31]
gi|344321458|gb|AEN06312.1| DNA-(apurinic or apyrimidinic site) lyase [halophilic archaeon
DL31]
Length = 335
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 98/218 (44%), Gaps = 24/218 (11%)
Query: 95 TSPSEPAAKSALLD-FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQ 153
T+PS+PA ALL L + L ++E + + E + G R+ R P CL+
Sbjct: 91 TTPSDPADGEALLTHLLRLDDDLETIYE-VTMDEPLVREAVERYPGLRLTRDPPFGCLVS 149
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRS 213
F+CS+ +ARI M L V+G H +P+ E L+ SE +LR GYR
Sbjct: 150 FICSAQMRVARIFGMQQSLREAYGEPVTVDGRTVHAYPTPEALAERSEDDLRGLSLGYR- 208
Query: 214 APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
A Y+ T +++ + + R L +EA + L GVG
Sbjct: 209 ----------------APYVQRTAEMVAEGEANPED----ARGLPYEEAREYLTKFVGVG 248
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
KVA C+ LFSL AIP+DT + + + Y PE G
Sbjct: 249 DKVADCVLLFSLGYLQAIPLDTWIQTVISDY-FPECEG 285
>gi|448562510|ref|ZP_21635468.1| DNA N-glycosylase [Haloferax prahovense DSM 18310]
gi|445718828|gb|ELZ70512.1| DNA N-glycosylase [Haloferax prahovense DSM 18310]
Length = 304
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRESYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVRRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDARDFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|300813997|ref|ZP_07094290.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptoniphilus sp. oral taxon
836 str. F0141]
gi|300511891|gb|EFK39098.1| 8-oxoguanine DNA-glycosylase (ogg) [Peptoniphilus sp. oral taxon
836 str. F0141]
Length = 293
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 122/257 (47%), Gaps = 28/257 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FR+ + +T ++++K + D Y I ++ E K+ + ++
Sbjct: 24 FTCGQCFRFYEEDDESFTIVAYGKVLNIK--KEDDDIYIIGSN--EEDFKNIWYKYFDLE 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
++ + S D + Y G R+L Q+ E ++ F+ S+NN I+RI K ++ ++
Sbjct: 80 KDYKKIEKALSI-DPVMKKAISYGKGIRILNQEKFETIISFIISANNGISRIRKSIEKIS 138
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+L G+++ + ++ FPS L + +E+R ++ + F +R + +K
Sbjct: 139 NLYGNYITEDKNRSYYSFPSPSVLKDIDPLEIR---------EKTKVGFRDKRIVESSKM 189
Query: 233 I-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I +D+ + + KL LQ + L LPGVGPKVA CI LFS +++ +
Sbjct: 190 IYNKDIDIEE------------ISKLSLQSQREELMKLPGVGPKVADCILLFSFERNESF 237
Query: 292 PVDTHVWKIATRYLLPE 308
PVD + ++ + E
Sbjct: 238 PVDVWIKRVMEELYIKE 254
>gi|317133542|ref|YP_004092856.1| 8-oxoguanine DNA glycosylase [Ethanoligenens harbinense YUAN-3]
gi|315471521|gb|ADU28125.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Ethanoligenens harbinense YUAN-3]
Length = 290
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 114/253 (45%), Gaps = 31/253 (12%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
F GQ FRW++ ++G + L+ + G+ + SP E ++ + ++
Sbjct: 26 VFENGQCFRWERRRDGTWSGIAHGRRLHLR--KEGNRLLFLGASPEE--VQTIWHHYFDL 81
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
G + F A D A + G R+LRQDP E L F+ S NNNI RI ++D L
Sbjct: 82 GRDYAAIRTAF-ARDKALARAVAFAPGLRLLRQDPWEALCSFILSQNNNIPRIKGIIDRL 140
Query: 173 -ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAK 231
A+ G+ L + GF FP + L+ + +L G+R+ +L + R K
Sbjct: 141 CAAFGAPLED-GGF---AFPLPQALAELDAADLAPVRCGFRA---GYVLDAAR------K 187
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
+G +D L +L L L +A L T+ GVGPKVA C LF +
Sbjct: 188 VASGEID------------LDALAALPLDKARAMLQTIRGVGPKVADCALLFGCGRLDCF 235
Query: 292 PVDTHVWKIATRY 304
PVD V +I T Y
Sbjct: 236 PVDVWVRRILTAY 248
>gi|391232015|ref|ZP_10268221.1| 8-oxoguanine DNA glycosylase family protein [Opitutaceae bacterium
TAV1]
gi|391221676|gb|EIQ00097.1| 8-oxoguanine DNA glycosylase family protein [Opitutaceae bacterium
TAV1]
Length = 322
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 107/257 (41%), Gaps = 34/257 (13%)
Query: 57 GQTFRWKKTGPLQYTGPIGPHLISLKHLQ-----NGDVCYHIHTSPSEPAAKSALLDFLN 111
GQ FRW + + G I + S H + +G + + I SP+ + +A + L
Sbjct: 29 GQAFRWHASA--EADGTIWTGIWSASHFRLCLDSSGQLLWSIPQSPASASEAAASENTLR 86
Query: 112 M---GISLGELWEGFS-ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
G L + SD A + G +LRQ E LL FLCS+ I +I +
Sbjct: 87 RYLCGHDTSALVDALPWRSDAHLARCIEAFPGLTLLRQPFGETLLAFLCSATKQIVQIRQ 146
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
M+ LA G H + G PS RL+ + E LR G+R
Sbjct: 147 MLALLA--GRHGEPLRGNAVRTLPSWTRLATIPESGLRACQLGFR--------------- 189
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A+YI+ T L ++ WL L EA L LPGVG KVA C+ LF +
Sbjct: 190 --ARYISETARYLAARPG----WLEETEALPYPEAKARLMALPGVGEKVADCVLLFGAGR 243
Query: 288 HHAIPVDTHVWKIATRY 304
A PVDT + K R+
Sbjct: 244 LEAFPVDTWIIKAMARH 260
>gi|448329715|ref|ZP_21519012.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
versiforme JCM 10478]
gi|445613639|gb|ELY67335.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
versiforme JCM 10478]
Length = 293
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/263 (27%), Positives = 111/263 (42%), Gaps = 28/263 (10%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
PL+ L L T +GQ++ W++ Y G P + +GDV + T
Sbjct: 7 PLDDLAGGLDLYRTLESGQSYLWRRDDGEMYGGTPAPGAW-YSTVVDGDVI-RVRTRDGR 64
Query: 100 PAAKSAL-----LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
+S + L E D E + G R+++ CL+ F
Sbjct: 65 LEWESTTDAEPTVRRLLRLDDDLEAIVAAGPDDQLLREAYEAHRGMRLVQDPAFGCLISF 124
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
+CS+ + RI MV LAS V+G +H FP+ ++L+ +E ELR G GYR
Sbjct: 125 ICSAQMRVGRIHTMVKTLASEYGDEIVVDGETYHAFPTPDQLATATEAELRELGLGYR-- 182
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
A Y+ T +++ S G R+L+ + A + L GVG
Sbjct: 183 ---------------APYVVRTAEMVAS----GEAHPEDARELEYEAAREYLTKFVGVGD 223
Query: 275 KVAACIALFSLDQHHAIPVDTHV 297
KVA C+ LFSL+ A+P+DT +
Sbjct: 224 KVADCVLLFSLEFDEAVPLDTWI 246
>gi|433590518|ref|YP_007280014.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Natrinema pellirubrum DSM 15624]
gi|448332079|ref|ZP_21521327.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
pellirubrum DSM 15624]
gi|433305298|gb|AGB31110.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Natrinema pellirubrum DSM 15624]
gi|445627924|gb|ELY81238.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrinema
pellirubrum DSM 15624]
Length = 293
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 116/269 (43%), Gaps = 42/269 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY---------HIHTSPS 98
L L T +GQ++ W++ Y G P + +GDV T+ +
Sbjct: 15 LDLYRTLESGQSYLWRREDGEMYHGDPAPGAW-YSTVVDGDVIRVRSRDDRLEWASTTDA 73
Query: 99 EPAAKSALL--DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
EP + L D L ++ G D E + G R+++ P CL+ F+C
Sbjct: 74 EPTVRRLLCLDDDLEAIVAAG-------PDDPLLREAYEAHRGLRLVQDPPFGCLIAFIC 126
Query: 157 SSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
S+ ++RI MV LA GS + +G +H FP+ E+L+ +E ELR+ G GYR
Sbjct: 127 SAQMRVSRIHGMVRALAREYGSPIA-FDGETYHAFPTPEQLAAATEAELRDLGLGYR--- 182
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
A Y+ T +++ + A + R L+ + A L GVG K
Sbjct: 183 --------------APYVVRTAEMVADGEAHPA----AARDLEYEAARAYLTQFVGVGDK 224
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRY 304
VA C+ LFSL A+P+DT + Y
Sbjct: 225 VADCVLLFSLGFDEAVPLDTWIRSAIEEY 253
>gi|20092367|ref|NP_618442.1| 8-oxoguanine DNA glycosylase [Methanosarcina acetivorans C2A]
gi|19917617|gb|AAM06922.1| 8-oxoguanine DNA glycosylase [Methanosarcina acetivorans C2A]
Length = 282
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 120/262 (45%), Gaps = 33/262 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L T GQ FRW+K+G +TG +G H+I L ++ P E +
Sbjct: 9 FNLDYTLDCGQVFRWEKSGDW-WTGVVGDHVIRLSQEEDSGELLIDSRLPPEFFSH---- 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + L ++E + + KY +G R++RQDP ECL+ ++ ++ ++I I K
Sbjct: 64 -YFRLDDDLPSIYESINKDLLINRAINKY-SGLRLIRQDPWECLISYMLATASSIPTIQK 121
Query: 168 MVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L+ G L EG+ FP + L+ E +L SLL +
Sbjct: 122 RIYLLSRFFGQELE--EGY--FSFPDPDTLA---EADL-------------SLLDKCKLG 161
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
F+ + DV + G L L +L+ + A + L L G+G KVA C+ LF+ +
Sbjct: 162 FRTERIKEAARDV-----ASGELDLNVLYRLEYRYARERLMRLRGIGEKVADCVLLFAFE 216
Query: 287 QHHAIPVDTHVWKIATRYLLPE 308
+ A PVDTHV +I Y + +
Sbjct: 217 KMEAFPVDTHVRQIIQHYHIDD 238
>gi|389843409|ref|YP_006345489.1| 3-methyladenine DNA glycosylase [Mesotoga prima MesG1.Ag.4.2]
gi|387858155|gb|AFK06246.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Mesotoga prima MesG1.Ag.4.2]
Length = 297
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 116/262 (44%), Gaps = 32/262 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+ L T GQTFRW K G + G + ++ +K N +S + ++
Sbjct: 12 IDLDSTLDCGQTFRWVKDGEW-WKGVVRDTVLFMKESLNEITVKSSSSSLLGMDISTGII 70
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLA--------GARVLRQDPVECLLQFLCSSN 159
+L + L E+ RF A+ ++ G R+LRQDP E +++++ S+
Sbjct: 71 HYLGLEDDLDEIHSTLRRRLSRFNRSARDISERALSEASGLRILRQDPFEMIVEYILSTR 130
Query: 160 NNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSS 218
N+I I M D L A+ + + G F++FPSL + L+SE L N G+R P
Sbjct: 131 NSIPMIRWMSDSLSAAFPENRVDFCGETFYKFPSLGQAKLISEELLTNLKIGFR-VPWLM 189
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
LFS Q Y + L L ++E ++ L G+G KV +
Sbjct: 190 KLFS---RIDQESYFS------------------QLIPLTVEEKLEELIRHDGIGYKVGS 228
Query: 279 CIALFSLDQHHAIPVDTHVWKI 300
C+ LF+ + +A PVD + ++
Sbjct: 229 CVVLFAYGELNAFPVDIWISRV 250
>gi|417002656|ref|ZP_11941976.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerococcus prevotii
ACS-065-V-Col13]
gi|325479082|gb|EGC82182.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerococcus prevotii
ACS-065-V-Col13]
Length = 302
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 118/271 (43%), Gaps = 40/271 (14%)
Query: 40 PLNLTQSELSLPL---------TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVC 90
+NL +++ L L F GQ F + K YT +I++ L +G
Sbjct: 11 KINLEKTDEGLILREASFNPTHIFECGQCFNFHKEEDGSYTAVFLDKIINVLELGDGS-- 68
Query: 91 YHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVEC 150
+ + SE D+ ++ ++ + A D + Y G R+L Q+ E
Sbjct: 69 -SLIRNISEEDFYEYFYDYFDLATDYEKIKKEI-AIDETLKKATDYGYGIRILNQELFET 126
Query: 151 LLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AG 208
+ F+ S+NN I RI K V ++ G +G +G +++ FP +E L VS ELR A
Sbjct: 127 TISFIISANNQIPRIKKAVRIISERYGKFIGEYKGRKYYSFPKVEDLMNVSPEELREYAR 186
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G+R +R + ++ I G +D + + L DL++ L
Sbjct: 187 VGFRD----------KRIVEASRMIYEGALDFDRDQE---------LESADLKKK---LM 224
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
LPG+GPKVA CI LF+ + PVD VW
Sbjct: 225 ELPGIGPKVADCILLFAYHRRETFPVD--VW 253
>gi|297181966|gb|ADI18142.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[uncultured Verrucomicrobiales bacterium HF0200_39L05]
Length = 285
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 118/264 (44%), Gaps = 30/264 (11%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ L T +GQ FRW++ G + G IG + L+ L GD+ ++P A
Sbjct: 12 DYDLDATLSSGQAFRWQQLGQ-AWEGVIGGRWVRLR-LSKGDIT----AEAAKPVRDWAW 65
Query: 107 LD-FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L+ +L + +L + F + +A + G R+LRQD ECL F+ S+ I +I
Sbjct: 66 LEHYLQLHFNLDQALATFPDDEPMQNAVASF-PGLRLLRQDYWECLASFILSATKQIVQI 124
Query: 166 TKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+MV L+ G + + + FP+++ ++ E ELR G+R AP +LL + R
Sbjct: 125 RQMVMLLSKRYGEPIASNDDDPAFAFPTIKVIAACGEKELRECKLGFR-AP--NLLGAAR 181
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
D+L K L L L EA L L GVGPK+A C+ LF+
Sbjct: 182 -------------DILDKKID-----LQRLPTLTSSEARKELMKLHGVGPKIADCVLLFA 223
Query: 285 LDQHHAIPVDTHVWKIATRYLLPE 308
PVD + + + P+
Sbjct: 224 GGHQEVFPVDVWIERALQQLYFPK 247
>gi|373853949|ref|ZP_09596747.1| DNA-(apurinic or apyrimidinic site) lyase [Opitutaceae bacterium
TAV5]
gi|372471816|gb|EHP31828.1| DNA-(apurinic or apyrimidinic site) lyase [Opitutaceae bacterium
TAV5]
Length = 322
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 79/179 (44%), Gaps = 23/179 (12%)
Query: 126 SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGF 185
SD A + G +LRQ E LL FLCS+ I +I +M+ LA G H + G
Sbjct: 105 SDAHLARCIEAFPGLTLLRQPFGETLLAFLCSATKQIVQIRQMLALLA--GRHGEPLRGN 162
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
PS RL+ + E LR G+R A+YI+ T L ++
Sbjct: 163 AVRTLPSWTRLATIPESGLRACQLGFR-----------------ARYISETARYLAARPG 205
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
WL L EA L LPGVG KVA C+ LF + A PVDT + K TR+
Sbjct: 206 ----WLEETEALPYPEAKARLMALPGVGEKVADCVLLFGAGRLEAFPVDTWIIKAMTRH 260
>gi|448591531|ref|ZP_21651019.1| 8-oxoguanine DNA glycosylase [Haloferax elongans ATCC BAA-1513]
gi|445733505|gb|ELZ85074.1| 8-oxoguanine DNA glycosylase [Haloferax elongans ATCC BAA-1513]
Length = 304
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M + L +G ++ +P ERL+
Sbjct: 119 GMRLVRDPPFGCLISFICSAQMRVSRIFGMQERLRETYGEAVEFDGDTYYAYPPAERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE ELR+ GYR A Y+ T +++ + + A+ R D
Sbjct: 179 TSEDELRDLSLGYR-----------------APYVQRTAEMVATGEADPAD----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A + L + GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YESAREFLTSFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|193215895|ref|YP_001997094.1| 8-oxoguanine DNA glycosylase [Chloroherpeton thalassium ATCC 35110]
gi|193089372|gb|ACF14647.1| 8-oxoguanine DNA glycosylase domain protein [Chloroherpeton
thalassium ATCC 35110]
Length = 323
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/294 (27%), Positives = 138/294 (46%), Gaps = 31/294 (10%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQ---YTGPIGPHLISLKHLQNGDVCYHIHTSPSEP-- 100
+ L+L T GQ FRW+K L Y G I H I++ ++ + TS ++P
Sbjct: 10 TSLNLEATLFCGQAFRWRKANTLHNKIYNGIIYGHAITITQRNAHEL---LITSKNDPYI 66
Query: 101 ---AAKSALLDFLNMGISLGELWEGFSASDCR-FAELAKYLAGA-RVLRQDPVECLLQFL 155
A + A++ +L + S L++ + F A+ L G ++LRQ+P E L+ F+
Sbjct: 67 GNKALRDAVITYLGLSDSCKGLFDNTPLLEKYPFMNQARALYGGLKLLRQEPFEALISFM 126
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLGNVEGFE---FHEFPSLERLSLVSEVELRNAGFGY 211
C+ I I + ++ LA G + + F+ + FPS + L+ +++EL
Sbjct: 127 CAQGMGIQIIRRQIEQLARQFGEKINDSPPFDSEHCYSFPSPQALA-NADIEL------L 179
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
+ ++L+ + + G + VL+ H+ E L K ++A AL PG
Sbjct: 180 KKCTNNNLVRAKNIKHISEAVVNGEL-VLEHIHAIHNENLGLCSKCSYKKAKAALLRFPG 238
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAE 324
+G K+A CI LF L+ AIP+D HV R L E G++ +T L S+ E
Sbjct: 239 IGDKIADCICLFGLEHGEAIPIDRHV-----RMYLYEWFGLKTVTAALNSKNYE 287
>gi|416352812|ref|ZP_11681401.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum C str.
Stockholm]
gi|338195702|gb|EGO87951.1| putative 8-oxoguanine DNA glycosylase [Clostridium botulinum C str.
Stockholm]
Length = 232
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 105/220 (47%), Gaps = 25/220 (11%)
Query: 93 IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
I + +E + D+ ++ + E F+ + + ++ G R+L+Q+P E ++
Sbjct: 2 IIYNATEEEFEKIWCDYFDLYRDYSTIKEIFNKDEL-LKKSVEFGKGIRILKQEPFEIVV 60
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+ S+NN I I + ++ ++ +G +++ FPS E L ++ +L G G+R
Sbjct: 61 SFIISANNRIPMIKRAIEKISKRWGKKVQYKGKDYYTFPSAEILKDCTQEDLEECGVGFR 120
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
AKYI T+ D++ + + L ++ LD E L + G
Sbjct: 121 -----------------AKYIKNTIEDIIYNGLN-----LDYIKSLDDDECHKELQKISG 158
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPELA 310
VGPKVA CI LFS+ ++ A PVD V + YL P+++
Sbjct: 159 VGPKVADCIMLFSMQKYSAFPVDVWVKRAMQHFYLAPDVS 198
>gi|403170103|ref|XP_003329493.2| hypothetical protein PGTG_11243 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168566|gb|EFP85074.2| hypothetical protein PGTG_11243 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 533
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/263 (29%), Positives = 113/263 (42%), Gaps = 54/263 (20%)
Query: 109 FLNMGISLGELWEGFSASDCRFAE--LAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ + +SL L+ +S D F+ + G RV+RQD E ++ F+CS+NNNI RI+
Sbjct: 156 YFVLDVSLVNLYHDWSERDPVFSNKVASGEWDGLRVVRQDSWETMISFICSANNNIPRIS 215
Query: 167 KMVDFL-ASLG----------------------SHLGNVEGFEFHEFPSLERLSLVSEVE 203
M++ L A+ G +H + EF FPS RLS ++
Sbjct: 216 LMLNRLCATFGDPMPCPPLGITLPSMLERQNVVAHDPSSPRLEFFSFPSPRRLSQPDVID 275
Query: 204 -LRNAGFGYRSA---PQSSLLFSVRRSFKQAKYITGTVD----VLQSKHSGGAEW----- 250
L+ GFGYR++ S L + ++ G +D KH+ E
Sbjct: 276 KLKLLGFGYRASYVYKTSIKLCEIASEAQKENRWPGFLDPDCQESTDKHNKQEEGEEEEE 335
Query: 251 ---LLSLRKLDLQEAIDALCT-------------LPGVGPKVAACIALFSLDQHHAIPVD 294
LL + Q+ +D L + PGVGPKV CI LF L H +PVD
Sbjct: 336 KPALLKKSSFEPQDFLDFLASQSYEQAHSKLVNQFPGVGPKVGDCICLFGLGFVHVVPVD 395
Query: 295 THVWKIATRYLLPELAGVRLTPK 317
H++KIA R +L T K
Sbjct: 396 IHIYKIALRDYQLDLNSSTTTKK 418
>gi|448604557|ref|ZP_21657724.1| DNA N-glycosylase [Haloferax sulfurifontis ATCC BAA-897]
gi|445743966|gb|ELZ95446.1| DNA N-glycosylase [Haloferax sulfurifontis ATCC BAA-897]
Length = 304
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + E R D
Sbjct: 179 ATEAELRDLSLGYR-----------------APYVRRTAEMVATGEATPDE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDARDFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|343427188|emb|CBQ70716.1| related to 8-oxoguanine DNA-glycosylase [Sporisorium reilianum
SRZ2]
Length = 573
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 111/276 (40%), Gaps = 83/276 (30%)
Query: 35 PSKWTPLNLTQSELSLPLTFPT--GQTFRWK-----------------KTGPLQYTGPIG 75
P + L S++ LPLT GQ FRW+ ++++ +
Sbjct: 29 PPGYFALRAATSQILLPLTVSNKCGQAFRWRCNKVWEPRTQGSSTDEAYEEQIEWSLCLS 88
Query: 76 PHLISLKHLQNGDVCYHIHTSPSEPAAKSA--------------LLDFLNMGISLGELWE 121
++ L+ ++ YH PS A+ S L+D+L++ + L L+
Sbjct: 89 DRVVLLRQDEHRGFLYHKTLLPSTSASPSDSKQEMETSRGTVRFLIDYLSLDVPLESLYT 148
Query: 122 GFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS------- 174
++ D FA A +G R+LRQDP ECL F+CSSNNNIARI +MV L +
Sbjct: 149 EWAEKDPVFARFATRFSGLRMLRQDPWECLCAFVCSSNNNIARIGQMVQNLCTHFSPVLL 208
Query: 175 ---------------------LGSHLGNVEGFE----FHEFPSLERLSLVS-EVELRNAG 208
GS +VE E +H FP E L+ E +LR G
Sbjct: 209 EHSYAAPPPSFQSADTLKSEGEGSTPRDVEQGEVRIAYHPFPPPEALAKPGVEEKLRELG 268
Query: 209 FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
FGYR AKY+ T +L KH
Sbjct: 269 FGYR-----------------AKYLARTAQMLSEKH 287
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 7/79 (8%)
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
+L LR + ++A L PGVGPKVA CI L SLDQ +IPVD HV++ A ++
Sbjct: 389 YLQHLRTISYRDARQELMQFPGVGPKVADCILLMSLDQASSIPVDRHVFQFAEKWY---- 444
Query: 310 AGVRLTPKLCSRVAEAFCE 328
RL K +A+ F E
Sbjct: 445 ---RLRTKKYEEIADYFRE 460
>gi|448348499|ref|ZP_21537348.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba taiwanensis
DSM 12281]
gi|445642866|gb|ELY95928.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba taiwanensis
DSM 12281]
Length = 293
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ P L+ F+CS+ ++RI MV LA + + +G +H FP+ ++L+
Sbjct: 109 GMRLVTDPPFGTLISFICSAQMRVSRIHSMVSTLAREYGDVVSFDGETYHAFPTPDQLAT 168
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ G GYR A Y+ T +++ + A+ R+L+
Sbjct: 169 ATESELRDLGLGYR-----------------APYVVRTAEMVADGKAHPAD----ARELE 207
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ A + LC GVG KVA C+ LF+L+ A+P+DT
Sbjct: 208 YEAAREFLCQFVGVGDKVADCVLLFALEFDEAVPLDT 244
>gi|448318487|ref|ZP_21508008.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronococcus jeotgali DSM 18795]
gi|445599031|gb|ELY53076.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronococcus jeotgali DSM 18795]
Length = 293
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 104/251 (41%), Gaps = 24/251 (9%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L L T +GQT+ W++T Y G P + V E +
Sbjct: 15 LDLYRTLESGQTYLWRRTDGAMYGGAPAPDEWYYTVVDGAVVRVRSRNGVLEWESSIEAD 74
Query: 108 DFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ + L + E A+ D E G R++ P L+ F+CS+ + R
Sbjct: 75 PLVRRLLRLEDDLEAIVAAAPDDPLLDEAYAAHRGMRLVDDPPFGTLISFICSAQMRVDR 134
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I MV LA +G +H FP+ ERL+ +E ELR G GYR
Sbjct: 135 IHGMVSALARQYGEPVAFDGATYHAFPTPERLAAATEGELRELGLGYR------------ 182
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A Y+ T +++ + AE R+L+ ++A + L GVG KVA C+ LFS
Sbjct: 183 -----APYVVRTAEMVADGEAHPAE----ARELEYEDAREFLKRFVGVGDKVADCVLLFS 233
Query: 285 LDQHHAIPVDT 295
L A+P+DT
Sbjct: 234 LGFDQAVPLDT 244
>gi|167769679|ref|ZP_02441732.1| hypothetical protein ANACOL_01013 [Anaerotruncus colihominis DSM
17241]
gi|167668040|gb|EDS12170.1| 8-oxoguanine DNA-glycosylase (ogg) [Anaerotruncus colihominis DSM
17241]
Length = 300
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 111/258 (43%), Gaps = 31/258 (12%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ SLP T GQ FRW++ + G + + NG + ++ A
Sbjct: 24 DFSLPQTLLCGQAFRWRQLDQNTFIGAAFEKVCQIAQ-TNGKIVVSCQSAGDFDAVWRGY 82
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D + G L F A D AE G RVLRQ P E L F+ S NNNI RIT
Sbjct: 83 FD---LDRDYGALKNRFRA-DPALAEAVAKAPGIRVLRQQPWEALCTFIISQNNNIPRIT 138
Query: 167 KMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+V + A+ G LG +H FP+ E ++ ++ L G+R+ L+ + RR
Sbjct: 139 GIVGRMCAAFGRPLGG----GWHAFPTPEAIAALAPDALAPLRAGFRA---RYLVDAARR 191
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
++G VD L +L L L EA L + GVG KVAAC L+
Sbjct: 192 ------VVSGEVD------------LEALGTLPLDEAQAMLTRITGVGVKVAACTLLYGC 233
Query: 286 DQHHAIPVDTHVWKIATR 303
+ +PVD + ++ R
Sbjct: 234 GRVECVPVDVWMRRVLDR 251
>gi|295109019|emb|CBL22972.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Ruminococcus obeum A2-162]
Length = 269
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 107/253 (42%), Gaps = 25/253 (9%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
SL +GQ FR +T Y G LK Q+G + + H S E +
Sbjct: 9 FSLKEICQSGQCFRMYETEENTYELIAGDQY--LKMTQDGGIV-NFHCSDMEFIC--YWV 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ + D + A+ +G R+L+QD E ++ FL S NNI+RI
Sbjct: 64 PYFDIDADYSSYIAKINPRDTYLSAAAECGSGIRILQQDLWEMIITFLISQQNNISRIRG 123
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ L E++ FP+ ERL+ SE +LR G GYR
Sbjct: 124 CVERLCEAYGEKKISGEKEYYAFPNPERLAAASEDDLRGLGMGYR--------------- 168
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A+YI V+ +S +G + A L L GVG KVA CI LF+L
Sbjct: 169 --ARYI---VETTRSILNGEISLDRIYQYRYYSRAKQELLKLSGVGEKVADCICLFALHH 223
Query: 288 HHAIPVDTHVWKI 300
A P+DTH+ ++
Sbjct: 224 MDAFPIDTHIRQV 236
>gi|448624531|ref|ZP_21670479.1| DNA N-glycosylase [Haloferax denitrificans ATCC 35960]
gi|445749736|gb|EMA01178.1| DNA N-glycosylase [Haloferax denitrificans ATCC 35960]
Length = 304
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + E R D
Sbjct: 179 ATEEELRDLSLGYR-----------------APYVRRTAEMVATGEATPDE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDARDFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|153810074|ref|ZP_01962742.1| hypothetical protein RUMOBE_00455 [Ruminococcus obeum ATCC 29174]
gi|149834252|gb|EDM89332.1| 8-oxoguanine DNA-glycosylase (ogg) [Ruminococcus obeum ATCC 29174]
Length = 252
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 89/193 (46%), Gaps = 22/193 (11%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ G E + D + + G R+LRQD E ++ FL S NNI+RI
Sbjct: 44 YFDIDSDYGRYIEKVNPRDTYLSAAVQCGNGIRILRQDLWEMIITFLISQQNNISRIRSC 103
Query: 169 VDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK 228
++ L E++ FP+ ++LS +E ELR G GYR
Sbjct: 104 IERLCVRYGEKLKAGDIEYYSFPTPQQLSGATEEELRRLGMGYR---------------- 147
Query: 229 QAKYITGTV-DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A+YI T +L+ + S E L ++ + A L L GVG KVA CI LF+L
Sbjct: 148 -ARYIVETTRSILEGEVS--LEKLYQMKYY--RRARKELMKLSGVGEKVADCICLFALHH 202
Query: 288 HHAIPVDTHVWKI 300
A P+DTH+ ++
Sbjct: 203 MDAFPIDTHIRQV 215
>gi|150015216|ref|YP_001307470.1| 8-oxoguanine DNA glycosylase [Clostridium beijerinckii NCIMB 8052]
gi|149901681|gb|ABR32514.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 309
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 117/258 (45%), Gaps = 26/258 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FR+++ + LI +K N DV I+ S E
Sbjct: 21 FKLKHIFECGQIFRFEEVAENDFIVIAFGRLIEVKEDGN-DVI--IYNSTKE-------- 69
Query: 108 DFLNMGISLGELWEGFS------ASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
DF N+ + +L +S + D + ++ G RVL QDP E LL F+ S+ NN
Sbjct: 70 DFKNIWLKYFDLDRDYSVIKDELSKDVLLKQSIEFGYGVRVLNQDPFEMLLSFIISARNN 129
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
I I K V+ +++ + ++ FP++ + + E++ G +RS +F
Sbjct: 130 IPSIKKTVNKISNKWGKEIIYKDKTYYAFPNIGEIKDATLEEIQETGASFRS----KYIF 185
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS-LRKLDLQEAIDALCTLPGVGPKVAACI 280
++ +K T D+ + + ++ L ++ LD E +AL GVG KVA CI
Sbjct: 186 DTIKNVYNSK--TAKEDLKANDANECIKYDLDYIKNLDDDECHNALQEFKGVGSKVADCI 243
Query: 281 ALFSLDQHHAIPVDTHVW 298
LFS+++ A PVD VW
Sbjct: 244 MLFSMEKTSAFPVD--VW 259
>gi|292655829|ref|YP_003535726.1| DNA N-glycosylase [Haloferax volcanii DS2]
gi|448289816|ref|ZP_21480979.1| DNA N-glycosylase [Haloferax volcanii DS2]
gi|448595744|ref|ZP_21653191.1| DNA N-glycosylase [Haloferax alexandrinus JCM 10717]
gi|291372196|gb|ADE04423.1| DNA N-glycosylase [Haloferax volcanii DS2]
gi|445581333|gb|ELY35694.1| DNA N-glycosylase [Haloferax volcanii DS2]
gi|445742198|gb|ELZ93693.1| DNA N-glycosylase [Haloferax alexandrinus JCM 10717]
Length = 304
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVRRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A + L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDAREFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|220930689|ref|YP_002507598.1| 8-oxoguanine DNA glycosylase [Clostridium cellulolyticum H10]
gi|220001017|gb|ACL77618.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium
cellulolyticum H10]
Length = 295
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 116/265 (43%), Gaps = 33/265 (12%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSAL 106
+ +L F GQ FRW + Y G ++++ + N C T+ SE
Sbjct: 22 DFNLTHIFDCGQCFRWIRQEDGSYRGIARGRMVNVSY-DNEVFCI---TNSSEQDFIDIW 77
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
++ ++G ++ + D E K G R+L+QD E L+ F+ S+NN I RI
Sbjct: 78 YEYFDLGTDYSKI-KSVLEQDEIMREAIKTGWGIRLLKQDFWEMLISFIISANNMIPRIM 136
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
K VD L+ L + G + FP ++ L+ S +++ G+R
Sbjct: 137 KTVDALSVLRGKCID-SGRNAYSFPEIKALAETSLEDIKQCKAGFR-------------- 181
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLS---LRKLDLQEAIDALCTLPGVGPKVAACIALF 283
KYI T ++ A+ +++ LR +D A L LPGVGPKVA CI LF
Sbjct: 182 ---CKYIHKTSALM-------AQGIVTEEILRSMDTAMARKELMILPGVGPKVADCILLF 231
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPE 308
S ++ P+D V ++ L +
Sbjct: 232 SGLKYDVFPIDVWVKRVMEELYLKK 256
>gi|448570846|ref|ZP_21639357.1| DNA N-glycosylase [Haloferax lucentense DSM 14919]
gi|445722764|gb|ELZ74415.1| DNA N-glycosylase [Haloferax lucentense DSM 14919]
Length = 304
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVRRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A + L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDAREFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|448540710|ref|ZP_21623631.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-646]
gi|448549012|ref|ZP_21627788.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-645]
gi|448555813|ref|ZP_21631742.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-644]
gi|445708863|gb|ELZ60698.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-646]
gi|445713701|gb|ELZ65476.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-645]
gi|445717336|gb|ELZ69054.1| DNA N-glycosylase [Haloferax sp. ATCC BAA-644]
Length = 304
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L +G + +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLVSFICSAQMRVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVRRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A + L T GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YEDAREFLTTFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|448303105|ref|ZP_21493055.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum sulfidifaciens JCM 14089]
gi|445594112|gb|ELY48279.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum sulfidifaciens JCM 14089]
Length = 299
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 107/265 (40%), Gaps = 30/265 (11%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGP----HLISLKHLQNGDVCYHIHT 95
PL+ L L T +GQ++ W+++ Y G P H + + D +
Sbjct: 7 PLDDLSGGLDLYRTLESGQSYCWRRSDGEMYRGSPEPDAWYHTVVSGPETDSDAVIRVRV 66
Query: 96 SPS-----EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVEC 150
A +L+ L E + D E + G R++ C
Sbjct: 67 RADTLEWESTVAAESLVRRLLRLDDDLESIVAAAPDDPLLDEAYEVHRGMRLVEDPSFPC 126
Query: 151 LLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFG 210
L+ F+CS+ ++RI MV LA G +H FP+ +L+ +E ELR+ G G
Sbjct: 127 LISFICSAQMRVSRIHTMVSTLAREYGDPIEFNGDTYHAFPTPVQLAAATEAELRDLGLG 186
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR A Y+ T ++ + E R L+ ++A + L T
Sbjct: 187 YR-----------------APYVVRTAKMVADGEAHPEE----ARDLEYEQAREYLTTFV 225
Query: 271 GVGPKVAACIALFSLDQHHAIPVDT 295
GVG KVA C+ LFSL A+P+DT
Sbjct: 226 GVGDKVADCVLLFSLGFDEAVPLDT 250
>gi|110801758|ref|YP_699554.1| 8-oxoguanine DNA glycosylase [Clostridium perfringens SM101]
gi|110682259|gb|ABG85629.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens
SM101]
Length = 312
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 108/255 (42%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L N + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEGNDVIIYNTNKEDFE----NIWI 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP+ E LS S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYKGNTYYSFPTPESLSKASIDDIRETGASFRSKYIWDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|448720894|ref|ZP_21703488.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halobiforma
nitratireducens JCM 10879]
gi|445780507|gb|EMA31389.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halobiforma
nitratireducens JCM 10879]
Length = 302
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ CL+ F+CS+ ++RI MV +A +G +H FP+ +L+
Sbjct: 118 GLRLVEDPAFGCLISFICSAQMRVSRIHSMVSTVAREYGTPVEFDGRTYHAFPTPAQLAA 177
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR G GYR A Y+ T +++ + AE R L
Sbjct: 178 ATEDELRELGLGYR-----------------APYVVRTAEMVADGDAHPAE----ARDLA 216
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+EA D LC GVG KVA C+ LFSL A+P+DT +
Sbjct: 217 YEEARDYLCQFVGVGDKVADCVLLFSLGFDQAVPLDTWI 255
>gi|329121378|ref|ZP_08250004.1| 3-methyladenine DNA glycosylase [Dialister micraerophilus DSM
19965]
gi|327469787|gb|EGF15253.1| 3-methyladenine DNA glycosylase [Dialister micraerophilus DSM
19965]
Length = 270
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/268 (26%), Positives = 114/268 (42%), Gaps = 29/268 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ L +GQ FR K+ Y +I ++ + D Y I S E A
Sbjct: 6 NDDFDLTKIINSGQCFRGKEIEENIYRFITNDEIIYIR--KKSDDAYEI--SCDERAWNK 61
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
++ +M + E+ + +D + A+ G R+L QDP E L+ F+ S +I
Sbjct: 62 VWKNYFDMERNYKEIRKQILENDTFLIKSARTGRGIRILNQDPWEMLITFITSQRKSIPA 121
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I ++ L L + + + FP++E LS ++ +L + G GYR
Sbjct: 122 IRSAIEKLCILAGNKKTTDYETIYTFPTIETLSELALKDLESCGLGYR------------ 169
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A YI T +++ G L L + ++ I+ L PGVG KVA C+ALF+
Sbjct: 170 -----APYIYKTTQIIRK----GEINLEELYAYNDEKLIETLKIFPGVGIKVANCVALFA 220
Query: 285 LDQHHAIPVDTHVWKIATRYLLPELAGV 312
+ + PVD VW R + E G+
Sbjct: 221 YGRTNRAPVD--VW--IERIIKNEYEGI 244
>gi|448576229|ref|ZP_21642272.1| 8-oxoguanine DNA glycosylase [Haloferax larsenii JCM 13917]
gi|445729909|gb|ELZ81503.1| 8-oxoguanine DNA glycosylase [Haloferax larsenii JCM 13917]
Length = 304
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 79/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M + L +G ++ +P+ ERL+
Sbjct: 119 GMRLVRDPPFGCLISFICSAQMRVSRIFGMQERLRETYGEAVEFDGDTYYAYPTAERLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE ELR+ GYR A Y+ T +++ + + A+ R +
Sbjct: 179 TSEDELRDLSLGYR-----------------APYVQRTAEMVATGEADPAD----ARGRN 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A + L + GVG KVA C+ LFSLD A+P+DT +
Sbjct: 218 YESAREFLTSFVGVGDKVADCVLLFSLDYLEAVPLDTWI 256
>gi|433639112|ref|YP_007284872.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Halovivax ruber XH-70]
gi|433290916|gb|AGB16739.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Halovivax ruber XH-70]
Length = 297
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 120/264 (45%), Gaps = 39/264 (14%)
Query: 52 LTFPTGQTFRWKKTGPLQYTGPIGPHLI-------SLKHLQNGDVCYHIH-TSPSEPAAK 103
LT +GQT+RW ++ Y G P + ++ D T+ +EP +
Sbjct: 19 LTIESGQTYRWTRSDDKLYAGNRAPEAWYELAIDGEVVRARSTDAGLEWESTTDAEPILR 78
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLA--GARVLRQDPVECLLQFLCSSNNN 161
L ++ + + A D E A Y A G R++ P + L+ F+CS+
Sbjct: 79 RRLRLDDDLPAIVAD------APDDTLVEDA-YAAHQGLRLVTDPPFDTLIAFICSAQMR 131
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
+ RI +MV L++ +G +H+FP+ ++L+ +E ELR+ G GYR
Sbjct: 132 VERINEMVTTLSTEYGTPIEFDGRTYHDFPTPDQLAAATESELRDLGLGYR--------- 182
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
A Y+ T ++ + G E R L +EA + L GVG KVA CI
Sbjct: 183 --------APYVVETARMV-ADGEGHPE---DARDLPYEEAREYLTRFVGVGEKVADCIC 230
Query: 282 LFSLDQHHAIPVDTHVWK-IATRY 304
LF+L+ A+P+DT + K IAT Y
Sbjct: 231 LFALEFDEAVPLDTWIRKAIATYY 254
>gi|260587448|ref|ZP_05853361.1| 8-oxoguanine DNA glycosylase [Blautia hansenii DSM 20583]
gi|331083728|ref|ZP_08332839.1| hypothetical protein HMPREF0992_01763 [Lachnospiraceae bacterium
6_1_63FAA]
gi|260542315|gb|EEX22884.1| 8-oxoguanine DNA glycosylase [Blautia hansenii DSM 20583]
gi|330403939|gb|EGG83491.1| hypothetical protein HMPREF0992_01763 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 272
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 103/243 (42%), Gaps = 27/243 (11%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR KK +Y +G + + + ++G+ ++ + ++ ++ +
Sbjct: 17 SGQCFRMKKLAENKYRIIVGARYLDV-YQKDGESIFYCSLEEFTEVWE----EYFDLKRA 71
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-S 174
E + D AK G R+L+QD E ++ FL S NNI RI + ++ +
Sbjct: 72 YKLYIEKINPKDTYLVNAAKLGGGIRILKQDLWEMIVSFLISQQNNIVRIRRCIENICRE 131
Query: 175 LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
G G ++ FP E + + E +L+ GYRS +V
Sbjct: 132 YGEEKITASGEHYYAFPKAEAFACLEEDDLKACNLGYRSKYVVRAAKAVV---------A 182
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
G +D L ++ K+ +A + L + GVG KVA CI LF L A PVD
Sbjct: 183 GEID------------LQAIEKMKYAKAKEELLKIFGVGVKVADCICLFGLHHLEAFPVD 230
Query: 295 THV 297
TH+
Sbjct: 231 THI 233
>gi|295094027|emb|CBK83118.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Coprococcus sp. ART55/1]
Length = 291
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 86/191 (45%), Gaps = 24/191 (12%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+L+M G + E +D G R+L QD E L+ F+ S N NI +I +
Sbjct: 75 YLDMDNDYGLIKESVIKADSALQTAVNEKDGIRILNQDFFETLISFIISQNKNIPQIKQC 134
Query: 169 VDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V ++ G + G F+ FP ++RL V+E ELR G+R
Sbjct: 135 VKNISHRFGDEVIGYNGEAFYVFPDVDRLHEVTEDELRECKVGFR--------------- 179
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A YI + + S G L LD+++A + L T+ GVG KVA C+ LF L +
Sbjct: 180 --APYIMNATEAVYS----GNVTKEKLDALDIEQARELLMTIKGVGEKVANCVLLFGLGR 233
Query: 288 HHAIPVDTHVW 298
A PVD VW
Sbjct: 234 REAFPVD--VW 242
>gi|21673824|ref|NP_661889.1| 8-oxoguanine DNA glycosylase [Chlorobium tepidum TLS]
gi|21646956|gb|AAM72231.1| 8-oxoguanine DNA glycosylase, putative [Chlorobium tepidum TLS]
Length = 321
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 133/304 (43%), Gaps = 36/304 (11%)
Query: 27 PQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ--YTGPIGPHLISLKHL 84
P + T+K + LN T+S L+ T +GQ+FRW L+ Y+ I +I + +
Sbjct: 2 PDIMITHKKIYSSLLN-TKSRLNFKDTVQSGQSFRWNLNETLKSYYSSVIYNSIIFICEI 60
Query: 85 QNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAE-------LAKYL 137
+ + + + E + K + +FL SL E +S RF + L +
Sbjct: 61 NSEMI--EVLCTDQELSGK-PINEFLKHLFSLDFQEETVFSS--RFQQEFPEVWNLVQSY 115
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVE----GFEFHEFPSL 193
RV+RQDP E ++ F+C+ + I + V +A +E FH FP
Sbjct: 116 RSVRVMRQDPFEIMVTFMCAQGIGMHLIRRQVSMIAERYGQKIMLELPEGNLTFHSFPPP 175
Query: 194 ERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
+ L+ EL A + +++ + ++ RSF+ K VD SG
Sbjct: 176 QALASADPNEL--AVCTNNNRIRAANIIAMARSFESGKLALACVD------SG------- 220
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 313
K DL+ + LC G+G K+A CIALF L + A P+DTHV + + E A
Sbjct: 221 --KCDLETLRETLCVHSGIGLKIADCIALFGLGRFDAFPIDTHVKQYLWEWFGIEEARHS 278
Query: 314 LTPK 317
LT K
Sbjct: 279 LTEK 282
>gi|417963159|ref|ZP_12605183.1| Putative 8-oxoguanine DNA glycosylase, partial [Candidatus
Arthromitus sp. SFB-3]
gi|380334101|gb|EIA24571.1| Putative 8-oxoguanine DNA glycosylase, partial [Candidatus
Arthromitus sp. SFB-3]
Length = 224
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 90/186 (48%), Gaps = 41/186 (22%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGN---VEGFEFHEFPSLER 195
G R+L QDP E +L F+ SSNN I I K + S+ G+ +G +++FPSLE
Sbjct: 40 GLRILNQDPFEMILSFIISSNNRIPMIKKAI---LSISEKYGDPISYKGRIYYKFPSLEN 96
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L+ +S+ + R G+R KY+ TV +L + E++++L
Sbjct: 97 LANLSQEKFRECSVGFRD-----------------KYLYNTVKLLNEDND--IEYIMNLN 137
Query: 256 KLDLQEAIDALC-----TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YLLPE- 308
D +C GVG KV+ CI LFS+ ++ A PVD V + + Y+ P+
Sbjct: 138 --------DDMCHKELQKFSGVGSKVSDCIMLFSMKKYSAFPVDVWVKRAMMKFYVAPDT 189
Query: 309 -LAGVR 313
L G+R
Sbjct: 190 SLKGIR 195
>gi|408381073|ref|ZP_11178623.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium
formicicum DSM 3637]
gi|407816338|gb|EKF86900.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium
formicicum DSM 3637]
Length = 327
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 128/286 (44%), Gaps = 27/286 (9%)
Query: 48 LSLPLTFPTGQTFR--WKKTGP-----LQYTGPIGPHLISLKHLQ-----NGDVCYHIHT 95
L+L LT +GQT + WK++ +Q G P L+++KH N ++
Sbjct: 16 LNLSLTINSGQTSQPPWKESNSYFQELIQVEG--APCLVNIKHDDADTDSNLEIIAESTE 73
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLR-QDPVECLLQF 154
SE K+++ + ++ L + ++ F + D + ++ G R+ +P EC++
Sbjct: 74 KVSEKGIKNSVREIFSLDHDLNQFYQ-FLSEDPKLTPTIEFCQGLRLFNAHNPFECIISS 132
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
+ S+N +I R T+ V+ + + + +F+ FP L V E +L
Sbjct: 133 ISSANCSILRWTRSVNDIKRKWGDQYHFDSGDFYSFPVPNVLGNVPEHDLEEMQRCEDEL 192
Query: 215 PQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
P+ + + ++ +AKYI ++QS+ + + ++ EA D + LPG
Sbjct: 193 PEDFIFENNLQACGVGYRAKYIINAARMIQSEIG-----IQKVARMRYDEAFDTILQLPG 247
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
VGPKVA CI L+ A PVD + +I P G L P+
Sbjct: 248 VGPKVADCILLYGFKMGEAFPVDVWIKRIVEHLYFP---GEELKPQ 290
>gi|160880161|ref|YP_001559129.1| 8-oxoguanine DNA glycosylase [Clostridium phytofermentans ISDg]
gi|160428827|gb|ABX42390.1| 8-oxoguanine DNA glycosylase domain protein [Clostridium
phytofermentans ISDg]
Length = 272
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 103/247 (41%), Gaps = 29/247 (11%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR K+ Y H + + N + S +E S ++L + +
Sbjct: 17 SGQCFRMKEISETSYEVVAYGHYLRISQQDN-----ELTFSCTEGEFHSIWNEYLGLNVD 71
Query: 116 LGELWEGFSASDCRFAELAKYLA-GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS 174
+ G D R+ + A G R+L+Q+ E ++ FL S NNI RI K + L
Sbjct: 72 YDTI-TGLVNEDDRYMKSAMSFGWGIRILKQELWETIVSFLISQQNNIPRIKKSIQMLCE 130
Query: 175 -LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYI 233
G N G ++ FP E + + EL+ GYR+ KYI
Sbjct: 131 RYGDEKLNENGDVYYTFPKPEAFLNLKDSELKECNLGYRT-----------------KYI 173
Query: 234 TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
T + G+ L L L ++A L L GVG KV+ CI L++L A P+
Sbjct: 174 LRTAKAVVD----GSFDLEGLPNLSYEDAKKELMKLYGVGIKVSECICLYALHHFDAFPI 229
Query: 294 DTHVWKI 300
DTH+ K+
Sbjct: 230 DTHIQKV 236
>gi|331004378|ref|ZP_08327851.1| hypothetical protein HMPREF0491_02713 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330411108|gb|EGG90527.1| hypothetical protein HMPREF0491_02713 [Lachnospiraceae oral taxon
107 str. F0167]
Length = 282
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 116/262 (44%), Gaps = 41/262 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L + +GQ FR+ +++ G L+ +K +NG + S SE L
Sbjct: 9 LDMKKIADSGQIFRFNVYDD-EFSLVAGDKLLFIKEDKNGYIL-----SCSEKEFNEFWL 62
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + G+ + +D + AKY G R+L QD E L+ F+ S +I I
Sbjct: 63 DYFDLSLDYGDFEKNIPETDVFLKDAAKYSYGIRILNQDKWEMLISFIISQRKSIPAIKS 122
Query: 168 MVDFLASLG---------SHLGNVEG-FEFHEFPSLERLSLVSEVELRNA-GFGYRSAPQ 216
++ LA + + N++ EF+ FP+ + L+ +++E NA GYRS
Sbjct: 123 SIEKLAKVYGKKIDMKIPEFIKNIDADTEFYTFPTPKELA-NADIEALNACSLGYRSP-- 179
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
YI + + K L ++ LD +E + AL +L GVG KV
Sbjct: 180 ---------------YIEASAKAVYRKDID----LNAIDSLDDEELLKALMSLKGVGIKV 220
Query: 277 AACIALFSLDQHHAIPVDTHVW 298
A C+ALF + A P+D VW
Sbjct: 221 ANCVALFGYHRIAAFPID--VW 240
>gi|448411913|ref|ZP_21576269.1| 8-oxoguanine DNA glycosylase [Halosimplex carlsbadense 2-9-1]
gi|445669847|gb|ELZ22455.1| 8-oxoguanine DNA glycosylase [Halosimplex carlsbadense 2-9-1]
Length = 298
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ P L+ F+CS+ +ARI M L +G FH FP+ E+L+
Sbjct: 117 GMRLVDDPPFPSLVSFICSAQMRVARIHGMQRELERAFGSTVEFDGRTFHAFPTPEQLAE 176
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE +LR+ G GYR A Y+ T ++ + E + LD
Sbjct: 177 ASEADLRDLGLGYR-----------------APYVRDTARMVADGEANPEEAV----GLD 215
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ALF+LD A+P+DT +
Sbjct: 216 YEDARESLTRFVGVGDKVADCVALFALDYLEAVPLDTWI 254
>gi|419720395|ref|ZP_14247631.1| 8-oxoguanine DNA glycosylase, N-terminal domain / base excision DNA
repair protein, HhH-GPD family multi-domain protein
[Lachnoanaerobaculum saburreum F0468]
gi|383303495|gb|EIC94944.1| 8-oxoguanine DNA glycosylase, N-terminal domain / base excision DNA
repair protein, HhH-GPD family multi-domain protein
[Lachnoanaerobaculum saburreum F0468]
Length = 284
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 114/261 (43%), Gaps = 39/261 (14%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L + +GQ FR+ +Y+ G L+ +K +NG + S SE ++ +
Sbjct: 9 LDMKKIADSGQIFRFNVYND-EYSLVAGDKLLFIKEDKNGYIL-----SCSEFEFENFWM 62
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + + + A D A+Y G R+L QD E L+ F+ S +I I
Sbjct: 63 DYFDLRLDYSDFEKNIPADDLFLINAAEYSYGIRILNQDKWEMLISFIISQRKSIPAIKS 122
Query: 168 MVDFLAS---------LGSHLGNVE-GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
++ LA + + N++ EF FP+ + L+ S EL GYRS
Sbjct: 123 SIEKLAKRYGKKIDMKVPDFIKNIDKNSEFFAFPTPKVLANASTDELNACSLGYRSPYIE 182
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+ +V R G +D L +L +LD E + AL +L GVG KVA
Sbjct: 183 ATAKTVYR---------GDID------------LEALSRLDDNELLAALMSLKGVGIKVA 221
Query: 278 ACIALFSLDQHHAIPVDTHVW 298
C+ALF + A P+D VW
Sbjct: 222 NCVALFGYHRIAAFPID--VW 240
>gi|188590592|ref|YP_001919800.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum E3 str. Alaska
E43]
gi|188500873|gb|ACD54009.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum E3 str. Alaska
E43]
Length = 308
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 114/251 (45%), Gaps = 12/251 (4%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L F GQ FR+++ + LI L +Q D + +T+ E K L
Sbjct: 21 FNLKHIFECGQIFRFEEIEDDNFIVIAFGKLIEL--VQKKDDVFIYNTTEKE--FKEIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + E S D + ++ +G RVL QDP E L+ F+ S+ N+I I K
Sbjct: 77 NYFDLDRDYSIIKEELS-KDELLKQSIEFGSGVRVLNQDPFEMLISFIISARNSIPSIRK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ ++ + +G ++ FP++E++ E++ G +RS + +V S
Sbjct: 136 TVNKISEQWGNKIEYKGKVYYAFPTIEQIKDAKLDEIKETGASFRSKYLIDTIENVHNSI 195
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++ L +K+ L ++ L E AL GVG KVA CI LFS+ +
Sbjct: 196 GNKSNLSENEKELLNKYD-----LKYIKGLSDDECHAALQEFKGVGAKVADCIMLFSMGK 250
Query: 288 HHAIPVDTHVW 298
A PVD VW
Sbjct: 251 TSAFPVD--VW 259
>gi|448369494|ref|ZP_21556046.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba aegyptia DSM
13077]
gi|445650669|gb|ELZ03585.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba aegyptia DSM
13077]
Length = 293
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 79/157 (50%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ P L+ F+CS+ ++RI MV LA + +G +H FP+ +L+
Sbjct: 109 GMRLVTDPPFGTLISFICSAQMRVSRIHSMVSTLAREYGDAVSFDGETYHAFPAPAQLAT 168
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ G GYR A Y+ T +++ + A+ R+L+
Sbjct: 169 ATESELRDLGLGYR-----------------APYVVRTAEMVADGEAHPAD----ARELE 207
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ A + LC GVG KVA C+ LF+L+ A+P+DT
Sbjct: 208 YEAAREFLCQFVGVGDKVADCVLLFALEFDEAVPLDT 244
>gi|448383053|ref|ZP_21562482.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Haloterrigena thermotolerans DSM 11522]
gi|445660233|gb|ELZ13030.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Haloterrigena thermotolerans DSM 11522]
Length = 293
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 74/269 (27%), Positives = 116/269 (43%), Gaps = 42/269 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY---------HIHTSPS 98
L L T +GQ++ W++ Y G P + +GDV T+ +
Sbjct: 15 LDLYRTLESGQSYLWRREDGEMYHGDPAPGAW-YSTVVDGDVIRVRSRDDRLEWESTTDA 73
Query: 99 EPAAKSALL--DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
EP + L D L ++ G D E + G R+++ P CL+ F+C
Sbjct: 74 EPTVRRLLCLDDDLEAIVAAG-------PDDPLLREAYEAHRGLRLVQDPPFGCLIAFIC 126
Query: 157 SSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAP 215
S+ ++RI MV LA GS + +G +H FP+ +L+ +E ELR+ G GYR
Sbjct: 127 SAQMRVSRIHGMVRALAREYGSPIA-FDGETYHAFPTPAQLAAATEDELRDLGLGYR--- 182
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
A Y+ T +++ + A + R L+ + A + L GVG K
Sbjct: 183 --------------APYVVRTAEMVADGEAHPA----AARDLEYEAAREYLTQFVGVGEK 224
Query: 276 VAACIALFSLDQHHAIPVDTHVWKIATRY 304
VA C+ LFSL A+P+DT + Y
Sbjct: 225 VADCVLLFSLGFDEAVPLDTWIRSAIEEY 253
>gi|282883388|ref|ZP_06291981.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Peptoniphilus lacrimalis 315-B]
gi|281296799|gb|EFA89302.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Peptoniphilus lacrimalis 315-B]
Length = 293
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 121/256 (47%), Gaps = 26/256 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FR+ + +T ++++K + D Y I ++ E K+ + ++
Sbjct: 24 FTCGQCFRFYEEDDESFTIVAYGKVLNIK--KEDDNIYIIGSN--EEDFKNIWYKYFDLE 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
++ + S D + Y G R+L Q+ E ++ F+ S+NN I+RI K ++ ++
Sbjct: 80 KDYKKIEKALSI-DPVMKKAISYGKGIRILNQEKFETIISFIISANNGISRIRKSIEKIS 138
Query: 174 SL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
+L G+++ + ++ FPS L + +E+R ++ + F +R + +K
Sbjct: 139 NLYGNYITEDKNRIYYSFPSPSVLKDIDPLEIR---------EKTKVGFRDKRIVESSKM 189
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I D+ + + KL LQ + L LPGVGPKVA CI LFS +++ + P
Sbjct: 190 IYNK-DIN----------IEEISKLSLQAQREELMKLPGVGPKVADCILLFSFERNESFP 238
Query: 293 VDTHVWKIATRYLLPE 308
VD + ++ + E
Sbjct: 239 VDVWIKRVMEELYIKE 254
>gi|387203940|gb|AFJ69006.1| N-glycosylase/DNA lyase, partial [Nannochloropsis gaditana CCMP526]
Length = 160
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/76 (55%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 252 LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAG 311
+SLR+ +EA ALC PGVG KVA C+ALFSLDQH AIPVD HVW+IA R P L
Sbjct: 1 MSLREKSREEAQQALCAFPGVGRKVADCVALFSLDQHGAIPVDVHVWRIACRDYSPVLKE 60
Query: 312 VR-LTPKLCSRVAEAF 326
+ LTP + V + F
Sbjct: 61 HKSLTPAVYEAVGDIF 76
>gi|313892362|ref|ZP_07825954.1| 8-oxoguanine DNA-glycosylase (ogg) [Dialister microaerophilus UPII
345-E]
gi|313119221|gb|EFR42421.1| 8-oxoguanine DNA-glycosylase (ogg) [Dialister microaerophilus UPII
345-E]
Length = 270
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 113/268 (42%), Gaps = 29/268 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ L +GQ FR K+ Y +I ++ + D Y I S E
Sbjct: 6 NDDFDLTKIINSGQCFRGKEIEENIYRFITNDEIIYIR--KKSDDAYEI--SCDERTWNK 61
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
++ +M + E+ + +D + A+ G R+L QDP E L+ F+ S +I
Sbjct: 62 VWKNYFDMERNYKEIRKQIPENDTFLIKSARTGRGIRILNQDPWEMLITFITSQRKSIPA 121
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I ++ L L + + + FP++E LS ++ +L + G GYR
Sbjct: 122 IRSAIEKLCILAGNKKTTDYETIYTFPTIETLSELALKDLESCGLGYR------------ 169
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A YI T +++ G L L + ++ I+ L PGVG KVA C+ALF+
Sbjct: 170 -----APYIYKTTQIIRK----GEINLEELYTYNDEKLIETLKIFPGVGIKVANCVALFA 220
Query: 285 LDQHHAIPVDTHVWKIATRYLLPELAGV 312
+ + PVD VW R + E G+
Sbjct: 221 YGRTNRAPVD--VW--IERIIKNEYEGI 244
>gi|383624779|ref|ZP_09949185.1| DNA-(apurinic or apyrimidinic site) lyase [Halobiforma lacisalsi
AJ5]
gi|448697120|ref|ZP_21698276.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halobiforma
lacisalsi AJ5]
gi|445782249|gb|EMA33099.1| 8-oxoguanine DNA glycosylase domain-containing protein [Halobiforma
lacisalsi AJ5]
Length = 301
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 116/285 (40%), Gaps = 64/285 (22%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
PL L L LT +GQ++ W++ YTG + GD Y+ + +
Sbjct: 7 PLADLPGGLDLYLTLESGQSYLWRREDGKMYTGE-----------RPGDAWYY---TVVD 52
Query: 100 PAAKSALLDFLNMGISLGEL-WEGFSASDCRFAEL------------------------A 134
A+ D + + G L WE S +D A + A
Sbjct: 53 DASSGPEPDVVRVRTRDGRLEWE--STADDAEATVRRLLRLDDDLEAIVAAAPDDPLLTA 110
Query: 135 KYLA--GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPS 192
Y A G R++ CL+ F+CS+ ++RI MV L+ +G +H FP+
Sbjct: 111 AYEAHRGMRLVSDPAFGCLISFICSAQMRVSRIHSMVSALSREYGDPIEFDGRTYHAFPT 170
Query: 193 LERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
ERL+ +E ELR G GYR A Y+ T +++ + G
Sbjct: 171 PERLAAATEDELRELGLGYR-----------------APYVVRTAEMV----ADGEAHPE 209
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
R L+ + A D L GVG KVA C+ LFSL A+P+DT +
Sbjct: 210 DARDLEYEAARDYLGRFVGVGDKVADCVLLFSLGFDEAVPLDTWI 254
>gi|154150205|ref|YP_001403823.1| 8-oxoguanine DNA glycosylase [Methanoregula boonei 6A8]
gi|153998757|gb|ABS55180.1| 8-oxoguanine DNA glycosylase domain protein [Methanoregula boonei
6A8]
Length = 286
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 104/266 (39%), Gaps = 37/266 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
SL T GQ FRW + +TG +I + Q G T EP +
Sbjct: 12 FSLDQTLGCGQVFRWNHSPDGSWTGVASGKVIRCR--QKGRTLSF--TGVDEPFIRH--- 64
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + + L + E A + + G R++RQ P ECL ++C++N NI I +
Sbjct: 65 -YFSFDLDLLAILESVDRDPFIHAAIGQ-CRGLRLIRQPPWECLCSYICATNTNIPAIRR 122
Query: 168 MVDFLASLGSHLGNV--EGFE-FHEFPSLERLSLVSEVE-LRNAGFGYRSAPQSSLLFSV 223
V A+L G+ EG F FP R+S L G GYR
Sbjct: 123 RV---ATLAQQFGDAIQEGENTFFSFPDPSRISCTGTASALAECGLGYRE---------- 169
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
Y+ T + W ++ L +EA L G+GPK A C+ LF
Sbjct: 170 -------PYVFKTA----CDTTAHETWAENIAALPYEEARRELMKFSGIGPKAADCVLLF 218
Query: 284 SLDQHHAIPVDTHVWKIATRYLLPEL 309
S A PVD + +I + LPEL
Sbjct: 219 SFQTFEAFPVDVWIRRIMREHYLPEL 244
>gi|442803740|ref|YP_007371889.1| N-glycosylase/DNA lyase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
gi|442739590|gb|AGC67279.1| N-glycosylase/DNA lyase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
Length = 290
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 107/250 (42%), Gaps = 30/250 (12%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
TF GQ FRW + ++G + LK +G++C + E K +D+ ++
Sbjct: 24 TFLCGQCFRWNQEDEGTWSGVALGRFVRLKW--DGELCTFFDMTEKEFIEK--WVDYFDL 79
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
GE+ + + D E + G R+LRQD E +L F+ S NN I RI K+++
Sbjct: 80 NTDYGEIKKILAGKDEHLREAVSFGYGIRLLRQDLWEVMLSFIISQNNQIPRIKKIIE-- 137
Query: 173 ASLGSHLG-NVEGFE-FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+L H G + G + + FP L+ + +L+ GYR
Sbjct: 138 -TLSEHYGQKISGADNRYTFPDAFSLARANIEDLKICRPGYR------------------ 178
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
G + K S L L+ + EA L + G+G KVA CI L+S +A
Sbjct: 179 ---LGYIMSAAEKVSKSPSLLGELKSIPSAEARKILLSFTGIGEKVADCILLYSGLDRNA 235
Query: 291 IPVDTHVWKI 300
P+D V ++
Sbjct: 236 FPIDRWVKRV 245
>gi|451817311|ref|YP_007453512.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
gi|451783290|gb|AGF54258.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
Length = 308
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 113/252 (44%), Gaps = 14/252 (5%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FR+++ Y LI +K + Y + +E +
Sbjct: 21 FKLKHIFECGQIFRFEEIAEGHYIVIAFGKLIEVKEEGANIIIY----NATEEEVNEIWI 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ ++ E+ + S D + ++ G RVL QDP E L+ F+ S+ NNI I K
Sbjct: 77 KYFDLDRDYSEIKQELS-KDPLLKQSIEFGYGVRVLNQDPFEMLISFIISARNNIPSIKK 135
Query: 168 MVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
V+ +++ G + E ++ FPS+ + + E++ G +RS +L +++
Sbjct: 136 TVNKISNKWGKEIIYKEK-TYYAFPSINEIKDATLEEIQETGASFRS---KYILDTIKNV 191
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
+ K + + L K+S L ++ L E +AL GVG KVA CI LFS+
Sbjct: 192 YASDKVMDDS--NLDKKNSYLKYNLDYIKNLKDDECHEALQEFKGVGAKVADCIMLFSMG 249
Query: 287 QHHAIPVDTHVW 298
+ A PVD VW
Sbjct: 250 KTSAFPVD--VW 259
>gi|182625563|ref|ZP_02953334.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens D
str. JGS1721]
gi|177909251|gb|EDT71716.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens D
str. JGS1721]
Length = 312
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 108/255 (42%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEDHDVIIYNTNKEDFE----NIWI 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP+ E LS S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYKGNTYYSFPTPESLSKASIDDIRETGASFRSKYIWDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|348685486|gb|EGZ25301.1| hypothetical protein PHYSODRAFT_479372 [Phytophthora sojae]
Length = 116
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 62/102 (60%), Gaps = 6/102 (5%)
Query: 119 LWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGS 177
L+E ++ R E+ L G R++RQ+PVECL F+CSSNNNIARI MVD L A+ G
Sbjct: 6 LYERWTKPTDRMTEIITRLRGLRIVRQEPVECLFSFICSSNNNIARIQGMVDKLKAAYGD 65
Query: 178 HLGNVEGFE----FHEFPSLERLSL-VSEVELRNAGFGYRSA 214
+ E + F+ FPS++ L+ E LR GFGYR+A
Sbjct: 66 LIYEGEDKQEQQYFYAFPSVDTLAAKCEEATLRALGFGYRAA 107
>gi|290996368|ref|XP_002680754.1| predicted protein [Naegleria gruberi]
gi|284094376|gb|EFC48010.1| predicted protein [Naegleria gruberi]
Length = 222
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 84/173 (48%), Gaps = 35/173 (20%)
Query: 98 SEPAAKSALLDFLNM-GISLGELWEGFSASD-CRFAELAKYLAGARVLRQDPVECLLQFL 155
E + + D+ + + EL E F+ D F ++ KY G R+L+ +P+E L F+
Sbjct: 36 DEKEVEQLISDYFQIDNYDIEELHEHFAKVDPDLFGKIVKYYCGWRLLKMEPLETLFSFI 95
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLGN------VEGFEFHEFPSLERLSLVSEVEL---- 204
CSSNNN++RITKMV L G LG V F ++FP+LE+L +E EL
Sbjct: 96 CSSNNNVSRITKMVQVLCEEYGECLGEFTIDDGVTTFPMYKFPTLEQLEECTEKELKYVL 155
Query: 205 --RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
R FGYR AKYI TV+ L+ K ++L +LR
Sbjct: 156 NFRKHNFGYR-----------------AKYIVQTVEKLKEK---PEKYLYTLR 188
>gi|300711220|ref|YP_003737034.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Halalkalicoccus jeotgali B3]
gi|448296678|ref|ZP_21486732.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Halalkalicoccus jeotgali B3]
gi|299124903|gb|ADJ15242.1| 8-oxoguanine DNA glycosylase domain protein [Halalkalicoccus
jeotgali B3]
gi|445580971|gb|ELY35337.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Halalkalicoccus jeotgali B3]
Length = 297
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 117/272 (43%), Gaps = 34/272 (12%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGP--HLI---SLKHLQNGDVCYHIH 94
PL L L T +GQ++RW++ L Y G G H + +L ++ D
Sbjct: 7 PLETCPGGLDLRATLESGQSYRWRREDGLLYEGVPGGWYHTVLDGNLIRVRQTDRALEWE 66
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPV-ECLLQ 153
++ LL + ++ G D E G R++ DPV CL+
Sbjct: 67 STTDAVPYLRRLLRLDDDLDAI----VGTGPDDPLLCEAYAAHRGLRIV-NDPVFACLIS 121
Query: 154 FLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CS+ + RI M LA G+ L +V+G +H FP+ ++L+ S ELR+ GYR
Sbjct: 122 FICSAQMRVGRIHGMQTTLADRFGASL-DVDGRTYHAFPTPDQLASTSVEELRDCSLGYR 180
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
A Y+ T +++ +GG R LD + A + L GV
Sbjct: 181 -----------------APYVKRTAEMV----AGGEAHPAEARGLDYETAREFLTRFVGV 219
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
G KVA C+ LFSL A+P+DT + Y
Sbjct: 220 GNKVADCVLLFSLGYLEAVPLDTWIRSAIEEY 251
>gi|313115564|ref|ZP_07801023.1| 8-oxoguanine DNA-glycosylase [Faecalibacterium cf. prausnitzii
KLE1255]
gi|310622079|gb|EFQ05575.1| 8-oxoguanine DNA-glycosylase [Faecalibacterium cf. prausnitzii
KLE1255]
Length = 278
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 111/261 (42%), Gaps = 38/261 (14%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL-DFLNMGI 114
+GQ FR +K Y G L+ L+ L++G +T EP A + ++G
Sbjct: 25 SGQCFRPRKQAEGWYRFVSGRQLLYLRPLRSG-----TYTVRCEPGAWDTFWHGYFDLGR 79
Query: 115 SLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS 174
S L + D +Y G RVLRQD E L+ F+ S +I I + V+ L+
Sbjct: 80 SYAALRGKLDSRDDFLQRAMEYGRGIRVLRQDEWEMLVSFIISQRKSIPAIRRAVELLSE 139
Query: 175 -LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYI 233
G LG+ + FP+ E L E L+ G GYR+ +Y
Sbjct: 140 RFGERLGSDSEGPVYAFPTAEALCCAGEQALQECGLGYRT-----------------RY- 181
Query: 234 TGTVDVLQSKHSGGAEWLLSLRKLDL--QEAIDA-LCTLPGVGPKVAACIALFSLDQHHA 290
VL + AE L L+KL EA+ A L L GVG KVA C+ LF +
Sbjct: 182 -----VLHAAQQ-AAEGTLDLKKLASLPDEALFARLMELDGVGKKVANCVCLFGYGRVGR 235
Query: 291 IPVDTHVWKIATRYLLPELAG 311
+PVD VW R + E AG
Sbjct: 236 VPVD--VW--IERLIRDEFAG 252
>gi|18311275|ref|NP_563209.1| 8-oxoguanine DNA glycosylase [Clostridium perfringens str. 13]
gi|170764063|ref|ZP_02631128.2| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens E
str. JGS1987]
gi|18145958|dbj|BAB81999.1| probable 8-oxoguanine DNA glycosylase [Clostridium perfringens str.
13]
gi|170663251|gb|EDT15934.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens E
str. JGS1987]
Length = 312
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 114/258 (44%), Gaps = 22/258 (8%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEGHDVIIYNTNKEDFE----NIWV 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP+ E LS S ++R G +RS + + + +
Sbjct: 136 TIKKISERWGEPIEYKGNTYYSFPTPESLSKASIDDIRETGASFRS---KYIWDTTKNIY 192
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEA--IDA------LCTLPGVGPKVAAC 279
G +D + E + L K DL++ +DA L GVG KVA C
Sbjct: 193 NCELAKKGILDAPEE------EIVEILEKYDLEKITNMDADGCHKGLQEFMGVGAKVADC 246
Query: 280 IALFSLDQHHAIPVDTHV 297
I LFS+ + A PVD V
Sbjct: 247 IMLFSMKKSSAFPVDVWV 264
>gi|448361950|ref|ZP_21550563.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrialba
asiatica DSM 12278]
gi|445649630|gb|ELZ02567.1| 8-oxoguanine DNA glycosylase domain-containing protein [Natrialba
asiatica DSM 12278]
Length = 293
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 79/157 (50%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ P L+ F+CS+ ++RI MV LA + +G +H FP+ +L+
Sbjct: 109 GMRLVTDPPFGTLISFICSAQMRVSRIHSMVSTLAREYGDAVSFDGETYHAFPTPAQLAT 168
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ G GYR A Y+ T +++ + A+ R+L+
Sbjct: 169 ATESELRDLGLGYR-----------------APYVVRTAEMVADGEAHPAD----ARELE 207
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ A + LC GVG KVA C+ LF+L+ ++P+DT
Sbjct: 208 YEAAREFLCQFVGVGDKVADCVLLFALEFDESVPLDT 244
>gi|84488864|ref|YP_447096.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanosphaera stadtmanae DSM 3091]
gi|84372183|gb|ABC56453.1| putative 3-methyladenine DNA glycosylase/8-oxoguanine DNA
glycosylase [Methanosphaera stadtmanae DSM 3091]
Length = 311
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 127/275 (46%), Gaps = 35/275 (12%)
Query: 39 TPLNLTQSELSLPLTFPTGQTFR--WKKTGPLQYTGPIGPH---LISLKHLQNGD---VC 90
N + L LT +GQT + WK Y + + L+ + + D V
Sbjct: 12 VDFNEYVGDFDLILTMNSGQTSQPPWKMDDNNYYEIILIDNIDVLVKISQEKLNDPLIVK 71
Query: 91 YHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLR-QDPVE 149
Y + ++ L ++ ++ E+++ F + + +++ ++ G R+ + Q P E
Sbjct: 72 YFSKEEFNVEKLRTKLFYIFDLDYNINEVYD-FLEENSQLSDVYEFNRGLRLFKSQFPFE 130
Query: 150 CLLQFLCSSNNNIARITK-MVDFLASLGSHLGNVEGFE-FHEFPSLERLSLVSEVELRNA 207
C++ +CS+NN+I R TK + D G + V G + ++ FP E +S EL+N
Sbjct: 131 CIISSICSANNSIKRWTKSLYDIRRFCGRSV--VFGKDTYYVFPREEVFINMSLDELKNC 188
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR+ KY+ + + ++ ++ ++ KL ++A + +
Sbjct: 189 GVGYRN-----------------KYMLNSTEKIRD----SIDFHENIFKLSYKKAYNEII 227
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
L GVGPKVA CI L+ ++H A PVD + +I T
Sbjct: 228 KLEGVGPKVADCILLYGYNKHEAYPVDVWINRITT 262
>gi|225027447|ref|ZP_03716639.1| hypothetical protein EUBHAL_01703 [Eubacterium hallii DSM 3353]
gi|224955182|gb|EEG36391.1| 8-oxoguanine DNA-glycosylase (ogg) [Eubacterium hallii DSM 3353]
Length = 617
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 117/276 (42%), Gaps = 39/276 (14%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
+ + + + L +GQ FRWKK P +Y I + + + H
Sbjct: 3 YIDIFIKNDYIDLKQIAESGQCFRWKKICPGRYF-VISDGRTACFFQEKTGIRILCHEKD 61
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
E K +L++ ++ E D + AK G R+LRQD E ++ F+ S
Sbjct: 62 EEYFRK-----YLDLDTDYEKIIEQIDPEDRFLSGAAKMGKGIRILRQDLWEMIVSFIIS 116
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHE-------FPSLERLSLVSEVELRNAGFG 210
NNI RI K +D +L LG F + E FP E ++++ +L FG
Sbjct: 117 QRNNIPRIMKSID---ALCEKLGEKIVFNYEEEHLIGYSFPGPE---VLAKADLSEFKFG 170
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVD--VLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
YR +R++ + +TG + VLQ+ + GA +E + L
Sbjct: 171 YREK-------YIRQTAEDI--LTGKFELSVLQTAVAKGA---------SPEEGKEMLKK 212
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L GVG KVA+CI LF L Q P+DT + K+ Y
Sbjct: 213 LHGVGEKVASCIQLFGLHQLSLFPIDTWIAKVEKMY 248
>gi|187934120|ref|YP_001884614.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum B str. Eklund
17B]
gi|187722273|gb|ACD23494.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum B str. Eklund
17B]
Length = 308
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 115/252 (45%), Gaps = 14/252 (5%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FR+++ + LI L +N DV + + +E K L
Sbjct: 21 FKLKHIFECGQVFRFEEIEEDNFIIIAFGKLIELVQ-RNNDVFIY---NTTEKEFKEIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + E S D + ++ G RVL QDP E LL F+ S+ N+I I K
Sbjct: 77 NYFDLDRDYSIIKEELS-KDELLKQSIEFGCGVRVLNQDPFEMLLSFIISARNSIPSIRK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ ++ + +G ++ FP++E++ E++ G +RS + +V S
Sbjct: 136 TVNKISEKWGNKIEYKGKIYYAFPTIEQIKDAKLDEIKETGASFRSKYLLDTIENVYNS- 194
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLS-LRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
I +D +++ S + L+ ++ L E AL GVG KVA CI LFS+
Sbjct: 195 -----INNNLDSSENEKSLLKRYDLNYIKSLSDDECHVALQEFKGVGAKVADCIMLFSMG 249
Query: 287 QHHAIPVDTHVW 298
+ + PVD VW
Sbjct: 250 KTSSFPVD--VW 259
>gi|196233444|ref|ZP_03132287.1| 8-oxoguanine DNA glycosylase domain protein [Chthoniobacter flavus
Ellin428]
gi|196222440|gb|EDY16967.1| 8-oxoguanine DNA glycosylase domain protein [Chthoniobacter flavus
Ellin428]
Length = 282
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 110/266 (41%), Gaps = 31/266 (11%)
Query: 42 NLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPA 101
L + L LT GQ F W + G + G IG + ++ Q G+ + +E
Sbjct: 5 RLAAKDFDLTLTLECGQVFHWVREGA-GWLGVIGELPMYVE--QQGEEL--LIPRGTEEV 59
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNN 161
A+ D+ + L E+ F D ++ G R++RQ EC+ F+ SS
Sbjct: 60 AR----DYFALDHPLAEICASFP-EDPAMQAAREFCRGMRIIRQPAWECIATFITSSMKQ 114
Query: 162 IARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
+A I ++ L EG +P+ + L+ + E +LR GYR+ +LL
Sbjct: 115 VAHIAQISHTLRRRYGKKVAWEGHTLFAYPTPQALAQLEEEDLRACALGYRA---KNLLG 171
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
S R G VD L + +LD A + LC LPGVG KVA C
Sbjct: 172 SAR------MIADGEVD------------LEAFARLDDDAAREELCRLPGVGEKVANCAL 213
Query: 282 LFSLDQHHAIPVDTHVWKIATRYLLP 307
LF ++ A P+D + ++ P
Sbjct: 214 LFGFERVRAFPIDVWIERVLREIYFP 239
>gi|225575344|ref|ZP_03783954.1| hypothetical protein RUMHYD_03434 [Blautia hydrogenotrophica DSM
10507]
gi|225037441|gb|EEG47687.1| 8-oxoguanine DNA-glycosylase (ogg) [Blautia hydrogenotrophica DSM
10507]
Length = 276
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 85/172 (49%), Gaps = 22/172 (12%)
Query: 134 AKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPS 192
A++ G R+LRQD E ++ FL S NNI RI + ++ L S G G ++ FP
Sbjct: 90 AEFGIGIRILRQDLWEMIVSFLISQQNNIVRIRRCIENLCESYGEKKIGGGGGVYYAFPR 149
Query: 193 LERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLL 252
E L+ + E L+ GYRS + VR + ++G +D+ Q
Sbjct: 150 PEVLAGLQEDALKGCNLGYRSK------YVVRSA---QSVVSGEIDLNQ----------- 189
Query: 253 SLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+R + +A L L GVG KVA CI LF+L Q A P+DTH+ ++ R+
Sbjct: 190 -VRSMPYSKARTELQKLYGVGEKVADCICLFALHQLEAFPIDTHIHQVLERH 240
>gi|170764236|ref|ZP_02638829.2| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens CPE
str. F4969]
gi|170715272|gb|EDT27454.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens CPE
str. F4969]
Length = 312
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 107/255 (41%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEGHDVIIYNTNKEDFE----NIWV 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ G ++ FP+ E LS S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYNGNTYYSFPTPESLSKASIDDIRETGASFRSKYICDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|110799731|ref|YP_696967.1| 8-oxoguanine DNA glycosylase [Clostridium perfringens ATCC 13124]
gi|169347041|ref|ZP_02865983.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens C
str. JGS1495]
gi|170764010|ref|ZP_02637261.2| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens B
str. ATCC 3626]
gi|422347354|ref|ZP_16428266.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium perfringens
WAL-14572]
gi|110674378|gb|ABG83365.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens ATCC
13124]
gi|169296724|gb|EDS78853.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens C
str. JGS1495]
gi|170710391|gb|EDT22573.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens B
str. ATCC 3626]
gi|373224652|gb|EHP46989.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium perfringens
WAL-14572]
Length = 312
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 108/255 (42%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEGHDVIIYNTNKEDFE----NIWV 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP+ E LS S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYKGNTYYSFPTPESLSKASIDDIRETGASFRSKYIWDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|258645968|ref|ZP_05733437.1| putative 3-Methyladenine DNA glycosylase [Dialister invisus DSM
15470]
gi|260403339|gb|EEW96886.1| putative 3-Methyladenine DNA glycosylase [Dialister invisus DSM
15470]
Length = 271
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 82/196 (41%), Gaps = 21/196 (10%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ +S L A D A Y G R+LRQDP E L+ F+ S +I I
Sbjct: 66 YFDLSVSYKSLRRSIPAKDHFLRAAAGYGKGIRILRQDPFEMLITFIISQRKSIPAIRSS 125
Query: 169 VDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK 228
VD L G F+ FPS L +S EL+N GYR+A
Sbjct: 126 VDKLCLAAGKEITKNGRTFYTFPSPSALGKLSVNELKNCSLGYRAA-------------- 171
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
YI T ++ + E + L +E + L PG G KV CI+LF+ +
Sbjct: 172 ---YIHATARMIAREKISFTE----METLSDKELTEKLLLFPGAGIKVVNCISLFAYHRT 224
Query: 289 HAIPVDTHVWKIATRY 304
A PVD + ++ ++
Sbjct: 225 AAAPVDVWIGRVIQKH 240
>gi|435846714|ref|YP_007308964.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Natronococcus occultus SP4]
gi|433672982|gb|AGB37174.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Natronococcus occultus SP4]
Length = 293
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 102/251 (40%), Gaps = 24/251 (9%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L L T +GQT+ W++T Y G P + V E A A
Sbjct: 15 LDLYRTLESGQTYLWRRTDGAMYGGEPAPEAWYYTVVDGAVVRVRSEDGVLEWEASIAAE 74
Query: 108 DFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ + L + E A+ D E G R++ P L+ F+CS+ + R
Sbjct: 75 PLVRRLLRLDDDLEAIVAAAPDDPLLDEAYAAYRGMRLIDDPPFGTLISFICSAQMRVDR 134
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I MV LA +G +H FP+ ERL+ + ELR G GYR
Sbjct: 135 IHDMVSALAREYGEPIAFDGRTYHAFPTPERLAAATADELRELGLGYR------------ 182
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A Y+ T +++ + G R L+ ++A + L GVG KVA C+ LFS
Sbjct: 183 -----APYVVRTAEMV----ADGEAHPEVARDLEYEDAREYLKRFVGVGDKVADCVLLFS 233
Query: 285 LDQHHAIPVDT 295
L A+P+DT
Sbjct: 234 LGFDQAVPLDT 244
>gi|315651178|ref|ZP_07904209.1| 8-oxoguanine DNA glycosylase [Lachnoanaerobaculum saburreum DSM
3986]
gi|315486576|gb|EFU76927.1| 8-oxoguanine DNA glycosylase [Lachnoanaerobaculum saburreum DSM
3986]
Length = 284
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 37/272 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L + +GQ FR+ +Y+ G L+ +K +NG + S SE ++ +
Sbjct: 9 LDMKKIADSGQIFRFNVYND-EYSLVAGDKLLFIKEDKNGYIL-----SCSEFEFENFWM 62
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + + + A D A+Y G R+L QD E L+ F+ S +I I
Sbjct: 63 DYFDLRLDYSDFEKNIPADDLFLINAAEYSYGIRILNQDKWEMLISFIISQRKSIPAIKS 122
Query: 168 MVDFLAS---------LGSHLGNVE-GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
++ LA + + N++ EF FP+ + L+ S EL GYRS
Sbjct: 123 SIEKLAKRYGKKIDMKVPDFIKNIDKNSEFFAFPTPKVLANASVDELNACSLGYRSPYIE 182
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+ +V R G +D L +L LD E + AL +L GVG KVA
Sbjct: 183 ATAKTVYR---------GDID------------LEALSGLDDNELLAALMSLKGVGIKVA 221
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPEL 309
C+ALF + A P+D + ++ + E
Sbjct: 222 NCVALFGYHRIAAFPIDVWIKRMIDEHYAGEF 253
>gi|182420483|ref|ZP_02642367.2| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens NCTC
8239]
gi|182381269|gb|EDT78748.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens NCTC
8239]
Length = 312
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 107/255 (41%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEGHDVIIYNTNKEDFE----NIWV 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ G ++ FP+ E LS S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYNGNTYYSFPTPESLSKASIDDIRETGASFRSKYIWDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|422875238|ref|ZP_16921723.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens
F262]
gi|380303768|gb|EIA16064.1| putative 8-oxoguanine DNA glycosylase [Clostridium perfringens
F262]
Length = 312
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 108/255 (42%), Gaps = 16/255 (6%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ F GQ FRW++T Y G +I L + + Y+ + E + +
Sbjct: 21 FNIKQVFECGQCFRWERTESGSYIGVAFGKVIELAQEDHDVIIYNTNKEDFE----NIWV 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ ++ E S D ++ G R+L QDP E ++ F+ S+ N+I I+K
Sbjct: 77 DYFDLERDYSKVKEALSW-DETLKSAVEFGYGIRILNQDPFELVISFIISARNSIPVISK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ +G ++ FP+ E L+ S ++R G +RS ++
Sbjct: 136 TIKKISERWGEPIEYKGNTYYSFPTPESLAKASIDDIRETGASFRSKYIWDTTKNIYNCE 195
Query: 228 KQAKYI-----TGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
K I V++L+ L + +D L GVG KVA CI L
Sbjct: 196 LAKKGILDAPEEDIVEILEKYD------LEKITNMDADGCHKGLQEFMGVGAKVADCIML 249
Query: 283 FSLDQHHAIPVDTHV 297
FS+ + A PVD V
Sbjct: 250 FSMKKSSAFPVDVWV 264
>gi|448608898|ref|ZP_21660177.1| 8-oxoguanine DNA glycosylase [Haloferax mucosum ATCC BAA-1512]
gi|445747275|gb|ELZ98731.1| 8-oxoguanine DNA glycosylase [Haloferax mucosum ATCC BAA-1512]
Length = 304
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M L + +G + +P+ +RL+
Sbjct: 119 GMRLVRDPPFGCLISFICSAQMRVSRIFGMQSRLRETYGEVVEFDGETQYAYPTADRLAA 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + AE R D
Sbjct: 179 ATEDELRDLSLGYR-----------------APYVQRTAEMVATGEATPAE----ARGRD 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A D L GVG KVA CI LFSL A+P+DT +
Sbjct: 218 YEGARDYLTNFVGVGDKVADCILLFSLGYLEAVPLDTWI 256
>gi|289580206|ref|YP_003478672.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba magadii ATCC
43099]
gi|448281445|ref|ZP_21472751.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba magadii ATCC
43099]
gi|289529759|gb|ADD04110.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba magadii ATCC
43099]
gi|445578867|gb|ELY33267.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba magadii ATCC
43099]
Length = 293
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/253 (27%), Positives = 114/253 (45%), Gaps = 28/253 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH-----IHTSPSEPAA 102
L L T +GQT+ W++ Y+G P + + +GDV + S A
Sbjct: 15 LDLYRTLESGQTYLWRRADGEMYSGTPAPGEWYVT-VVDGDVIRARTVDGVLEWQSTTDA 73
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+S + L + L + + + D E + G R++ L+ F+CS+ +
Sbjct: 74 ESTIRRLLRLDDDLEAIVDA-APDDPLLDEAYEAHRGMRLVTDPSFGTLISFICSAQMRV 132
Query: 163 ARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
+RI MV LA + ++ G +H FP+ ++L+ +E ELR+ G GYR
Sbjct: 133 SRIHSMVSTLAREYGNEVSLNGDTYHAFPTPDQLASATEAELRDLGLGYR---------- 182
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
A Y+ T +++ S G R L+ + A + LC GVG KVA C+ L
Sbjct: 183 -------APYVVRTAEMVAS----GEAHPDDARPLEYEAAREHLCQFVGVGDKVADCVLL 231
Query: 283 FSLDQHHAIPVDT 295
F+L+ A+P+DT
Sbjct: 232 FALEFDEAVPLDT 244
>gi|375092083|ref|ZP_09738369.1| 8-oxoguanine DNA-glycosylase (ogg) [Helcococcus kunzii ATCC 51366]
gi|374562149|gb|EHR33483.1| 8-oxoguanine DNA-glycosylase (ogg) [Helcococcus kunzii ATCC 51366]
Length = 299
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 110/264 (41%), Gaps = 47/264 (17%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW+KT Y +I L + D+ I+ + E
Sbjct: 21 FDLTHIFECGQAFRWEKTVDNSYIIVAYGRVIELIKDASDDLI--IYNTNKE-------- 70
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYL------------AGARVLRQDPVECLLQFL 155
DF+N+ + +L + + A+ A Y G R+LRQ+ E ++ F+
Sbjct: 71 DFVNIWHNYFDLNRDYDSLKLELAKTAAYKLNTSLKEAIEFGYGIRILRQEEFEMIISFI 130
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
S+NN I RI + L+ + G + +G ++ FP+ E+L+ +E++
Sbjct: 131 ISANNQIPRIKNSIRLLSETYGEFIQEYKGQRYYSFPTPEKLASADPLEIKEI------- 183
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
R F+ + + + L++ + +LD Q D L LPGVGP
Sbjct: 184 --------CRVGFRNERIVLTSRLYLENPENFS-------NQLDDQTLGDNLLALPGVGP 228
Query: 275 KVAACIALFSLDQHHAIPVDTHVW 298
KV CI LF + + PVD VW
Sbjct: 229 KVRDCILLFGYARGNTFPVD--VW 250
>gi|227500180|ref|ZP_03930249.1| 8-oxoguanine DNA glycosylase [Anaerococcus tetradius ATCC 35098]
gi|227217702|gb|EEI83006.1| 8-oxoguanine DNA glycosylase [Anaerococcus tetradius ATCC 35098]
Length = 302
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 72/263 (27%), Positives = 106/263 (40%), Gaps = 35/263 (13%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEP 100
L L +S F GQ F + YT +I++ L++G + + S
Sbjct: 21 LILRESSFDPTHIFECGQCFNFHIEDDGSYTAVFMGKIINVLKLEDG---LSLIRNVSLC 77
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
D+ ++G ++ E A D + Y G R+L Q+ E + F+ S+NN
Sbjct: 78 DFYDIFYDYFDLGTDYEKIKEK-VAIDPIMQKATAYGRGIRILNQELFETTISFIISANN 136
Query: 161 NIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRS---AP 215
I RI K V L+ G ++G+ ++ FP E L V ELR A G+R
Sbjct: 137 QIPRIKKAVRILSERYGKYIGDYNNRSYYSFPRPEDLMNVDPDELREYARVGFRDKRIVQ 196
Query: 216 QSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
S +++ F K + D LQ K L LPG+GPK
Sbjct: 197 ASKMIYEGHLDFSNDKNLAS--DKLQKK----------------------LIDLPGIGPK 232
Query: 276 VAACIALFSLDQHHAIPVDTHVW 298
VA CI LF+ + PVD VW
Sbjct: 233 VADCIMLFAYHKRETFPVD--VW 253
>gi|291561112|emb|CBL39911.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[butyrate-producing bacterium SS3/4]
Length = 298
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 122/279 (43%), Gaps = 41/279 (14%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPH-LISLKHL----QNGD-VCYHIHTSPSEPAAKSALLD- 108
+GQ FR P P + +IS H QN D V +H P +SA D
Sbjct: 17 SGQCFRMIPCAP---NDPQTAYRVISSGHFLIVEQNRDEVIFHC------PDDESAFWDH 67
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++G G +D ++ A + +G R+L QD E ++ F+ S I I +
Sbjct: 68 YFDLGTDYGAYIRAVDPADEYLSKAAAFGSGIRILNQDLWEMIITFIISQQKTIPAIRAL 127
Query: 169 VDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ L+ G+ G+ ++ FP+ E L+ S +L GYR
Sbjct: 128 VEALSEKYGTRYELSSGY-YYAFPTPEELNHASLDDLLALKLGYR--------------- 171
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
AKYI T + + S G L LR L+ ++++AL + G+G KVA CI LF L
Sbjct: 172 --AKYIKRTCEDVCS----GKMDLDRLRGLNYSDSMEALLSCYGIGVKVANCICLFGLHH 225
Query: 288 HHAIPVDTHVWKIATRYLLPELAGVRLTP--KLCSRVAE 324
A PVDT + KI R P+ P +LC+ + E
Sbjct: 226 IGAFPVDTWIKKILLREYAPKSRCTSHVPESRLCAALIE 264
>gi|251778219|ref|ZP_04821139.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum E1 str. 'BoNT E
Beluga']
gi|243082534|gb|EES48424.1| 8-oxoguanine DNA glycosylase [Clostridium botulinum E1 str. 'BoNT E
Beluga']
Length = 308
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 118/266 (44%), Gaps = 42/266 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L F GQ FR+++ + LI L +Q D + +T+ E K L
Sbjct: 21 FNLKHIFECGQIFRFEEVEDNNFIVIAFGKLIEL--VQKKDDVFIYNTTEKE--FKEIWL 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + E S D + ++ +G RVL QDP E L+ F+ S+ N+I I K
Sbjct: 77 NYFDLDRDYSIIKEELS-KDELLKQSIEFGSGVRVLNQDPFEMLISFIISARNSIPSIRK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ ++ + +G ++ FP++E++ E++ G +RS
Sbjct: 136 TVNKISEQWGNKIEYKGKVYYAFPTIEQIKDAKLDEIKETGASFRS-------------- 181
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLL-------SLRKLDLQ--------EAIDALCTLPGV 272
KY+ VD +++ H+ + SL+K DL+ E AL GV
Sbjct: 182 ---KYL---VDTIENVHNSISNSSNLSEDEKESLKKYDLKYIKGLSDDECHAALQEFKGV 235
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVW 298
G KVA CI LFS+ + A PVD VW
Sbjct: 236 GAKVADCIMLFSMGKTSAFPVD--VW 259
>gi|300856867|ref|YP_003781851.1| 8-oxoguanine DNA glycosylase [Clostridium ljungdahlii DSM 13528]
gi|300436982|gb|ADK16749.1| putative 8-oxoguanine DNA glycosylase [Clostridium ljungdahlii DSM
13528]
Length = 302
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 116/270 (42%), Gaps = 29/270 (10%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L F GQ FRW + Y G +I ++ + Y+ S+ K
Sbjct: 21 FELAHIFECGQCFRWHREENGNYIGVSFGKVIEVEKKGEDVILYN----TSQQDFKEIWA 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ E+ + + D + ++ G R+L+QDP E ++ F+ S+NN I I +
Sbjct: 77 EYFDLYRDYNEI-KNILSKDPVLKKAIEFGEGIRLLKQDPFELIVSFIISANNRIPMIQR 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
+ ++ + ++ FP +++L + EL + G G+R+
Sbjct: 136 AIKNISEKWGEPLEYKNKIYYSFPKIDKLKEATLEELASCGTGFRN-------------- 181
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLD-LQEAIDALC-----TLPGVGPKVAACIA 281
KYI TV + + E ++ +++ D +C G+GPKV+ CI
Sbjct: 182 ---KYIKDTVSKIYYNGTKTKENYDEKYDINWIKDQEDEVCHKELQKFMGIGPKVSDCIM 238
Query: 282 LFSLDQHHAIPVDTHVWKIATR-YLLPELA 310
LFS+ ++ A PVD V + YL P+++
Sbjct: 239 LFSMQKYSAFPVDVWVKRAMNYFYLAPDVS 268
>gi|335047633|ref|ZP_08540654.1| 8-oxoguanine DNA-glycosylase (ogg) [Parvimonas sp. oral taxon 110
str. F0139]
gi|333761441|gb|EGL38996.1| 8-oxoguanine DNA-glycosylase (ogg) [Parvimonas sp. oral taxon 110
str. F0139]
Length = 291
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 119/271 (43%), Gaps = 28/271 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW + +T ++++ L D+ T+ E +D+ ++
Sbjct: 24 FTCGQAFRWYEESDGSFTTVHLGRVLNV--LNEKDMVVFKGTNLEE--FNEIWIDYFDLN 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
+ E+ + S ++ A+ +Y G R+L Q+ E L+ F+ S+NN I RI K ++ ++
Sbjct: 80 TNYKEIRKELSNNEI-LAKAMEYGKGIRILNQNHFEMLISFIISANNMIPRIKKSIEVIS 138
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAK 231
G + E +++ FP++E LS + +LR A G+R K
Sbjct: 139 MRYGKFICEDENRKYYSFPTVEELSKATVEDLREFAKVGFRD-----------------K 181
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I TV+++ ++ L + L+ + L GVG KVA CI LFS +
Sbjct: 182 RIFETVNMILNEKID----LDNFENLETDILREELLKFSGVGNKVADCIMLFSYKRGEVF 237
Query: 292 PVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
PVD + ++ + + V+ K R+
Sbjct: 238 PVDVWIKRVMEELFIKKETPVKKISKEADRI 268
>gi|21226573|ref|NP_632495.1| 8-oxoguanine DNA glycosylase [Methanosarcina mazei Go1]
gi|20904847|gb|AAM30167.1| 8-oxoguanine DNA glycosylase [Methanosarcina mazei Go1]
Length = 285
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 115/262 (43%), Gaps = 33/262 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L T GQ FRW+K G +TG +G ++I L ++ P E ++
Sbjct: 12 FNLDYTLDCGQVFRWEKNGDW-WTGVVGNNVIRLSQEEDSRELLIDSKLPPEFFSR---- 66
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + L ++E + D K G R++RQDP ECL+ ++ ++ ++I I K
Sbjct: 67 -YFRLDDDLPLIYESIN-RDLLIDRAIKRYKGLRLIRQDPWECLISYMLATASSIPTIQK 124
Query: 168 MVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L+ + G L + + FP E L+ L G+R+ R
Sbjct: 125 RICLLSRIFGQELED----GYFSFPDPETLANADMSMLDLCKLGFRAD---------RIK 171
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
+ +G +D +L +L+ + A + L L G+G KVA C+ LF+ +
Sbjct: 172 MAAEEVCSGELD------------FNTLFRLEYKYARERLMRLRGIGEKVADCVLLFAFE 219
Query: 287 QHHAIPVDTHVWKIATRYLLPE 308
+ + PVDTH+ +I Y + +
Sbjct: 220 KMESFPVDTHIRQIIQHYHIDD 241
>gi|281209193|gb|EFA83368.1| hypothetical protein PPL_04161 [Polysphondylium pallidum PN500]
Length = 756
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 67/127 (52%), Gaps = 20/127 (15%)
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L+L + +L GFGYRS K+I + Q K GG WL SLR
Sbjct: 248 LNLNLKEQLNTLGFGYRS-----------------KFIVKSAS--QVKEKGGEVWLESLR 288
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLT 315
K + A L +L GVG KVA C+ LFS+D+ +P+DTH++ I+ ++ +P LA LT
Sbjct: 289 KQSHENAHKELTSLMGVGQKVADCVCLFSMDKLDIVPIDTHIFTISKKH-MPSLAKKTLT 347
Query: 316 PKLCSRV 322
PK+ +
Sbjct: 348 PKIYQDI 354
>gi|404372496|ref|ZP_10977792.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium sp. 7_2_43FAA]
gi|226911366|gb|EEH96567.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium sp. 7_2_43FAA]
Length = 313
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 117/268 (43%), Gaps = 14/268 (5%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++ GQ FRW K Y +I + +QN D I + +E +
Sbjct: 21 FNIKQILECGQCFRWDKIEDNNYIVVAYGRVIEV--IQNDDEV--IIYNSNEEDFNNIWK 76
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
++ ++ + E+ + A D + ++ G R+L QDP E L+ F+ S+ N+I I K
Sbjct: 77 NYFDLERNYDEI-KDVLAKDEILRKSVEFGYGIRILNQDPFEILISFIISARNSIPSIKK 135
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V + +G E++ FP+ + + + E++ G +R S + +
Sbjct: 136 TVKKICERWGEKIEYKGNEYYTFPTPQAIKDATLEEIQETGASFR----SKYIVDTIKKV 191
Query: 228 KQAKYITGTVDVLQSKHSGGAEW----LLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
+A + + + SK+ E L ++ L E AL GVG KVA CI LF
Sbjct: 192 NEAIEVKEDIKLNPSKYDERPEILDFDLEYIKSLGDDECHIALQKFMGVGAKVADCIMLF 251
Query: 284 SLDQHHAIPVDTHVWKIATR-YLLPELA 310
S+ +H A PVD V + Y+ P+++
Sbjct: 252 SMTKHSAFPVDVWVKRAMIHFYVAPDVS 279
>gi|452209077|ref|YP_007489191.1| 8-oxoguanine DNA glycosylase [Methanosarcina mazei Tuc01]
gi|452098979|gb|AGF95919.1| 8-oxoguanine DNA glycosylase [Methanosarcina mazei Tuc01]
Length = 282
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 115/262 (43%), Gaps = 33/262 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+L T GQ FRW+K G +TG +G ++I L ++ P E ++
Sbjct: 9 FNLDYTLDCGQVFRWEKNGDW-WTGVVGNNVIRLSQEEDSRELLIDSKLPPEFFSR---- 63
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
+ + L ++E + D K G R++RQDP ECL+ ++ ++ ++I I K
Sbjct: 64 -YFRLDDDLPLIYESIN-RDLLIDRAIKRYKGLRLIRQDPWECLISYMLATASSIPTIQK 121
Query: 168 MVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ L+ + G L + + FP E L+ L G+R+ R
Sbjct: 122 RICLLSRIFGQELED----GYFSFPDPETLANADMSMLDLCKLGFRAD---------RIK 168
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
+ +G +D +L +L+ + A + L L G+G KVA C+ LF+ +
Sbjct: 169 MAAEEVCSGELD------------FNTLFRLEYKYARERLMRLRGIGEKVADCVLLFAFE 216
Query: 287 QHHAIPVDTHVWKIATRYLLPE 308
+ + PVDTH+ +I Y + +
Sbjct: 217 KMESFPVDTHIRQIIQHYHIDD 238
>gi|373495328|ref|ZP_09585914.1| hypothetical protein HMPREF0380_01552 [Eubacterium infirmum F0142]
gi|371965194|gb|EHO82695.1| hypothetical protein HMPREF0380_01552 [Eubacterium infirmum F0142]
Length = 295
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 118/282 (41%), Gaps = 50/282 (17%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL------KHLQNGDVCYHIH 94
++ + L F GQ FRW+K YTG ++++ K +G +
Sbjct: 4 FDIEVHDFDLDHIFDCGQCFRWRKQRDGSYTGIAYRRIVNMNFQPFEKSKFDGMLRITNL 63
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
+ SE + + +L++ G++ S +D + Y AG R+L+Q+ E ++ F
Sbjct: 64 GNYSEDDLEKIWIPYLDIDRDYGKIKLTLSQNDETIRDAISYGAGIRILKQELWETIVSF 123
Query: 155 LCSSNNNIARITKMVDFLASLGSHLGNVEG-------------FEFHEFPSLERLSLVSE 201
+ S NNNI RI ++ L+ NV+ E + PS E+L+ +++
Sbjct: 124 IISQNNNIPRIKGCIENLSREFGEKINVKSASWQSENTEKLTKLEPYTIPSAEKLASLTQ 183
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
+L + GYRS R + AK + +G L
Sbjct: 184 DDLASIKLGYRS----------RYLIESAKTVC---------ENG------------LPV 212
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+ L L GVGPKVA CI+LF ++ + P+D V ++ R
Sbjct: 213 NFEELSKLCGVGPKVANCISLFGMNDTESFPIDVWVKRVMAR 254
>gi|429963297|gb|ELA42841.1| hypothetical protein VICG_00156 [Vittaforma corneae ATCC 50505]
Length = 300
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 120/325 (36%), Gaps = 85/325 (26%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
W LN + + L T +GQ F ++ + G + L L Q Y + T
Sbjct: 2 WRELN-SHCFIDLKQTLFSGQIFHFEHANNTLFLGNVHNCLAILA--QYNRKVYFVETCD 58
Query: 98 SEPAAKSALLDFLNMGISL-GELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
+ AL F N+ + L L E G R L D + F+C
Sbjct: 59 Q---IEQALEKFFNLDVQLPSSLKED----------------GLRFLTNDINSAIFSFIC 99
Query: 157 SSNNNIARITKMVDFLASL--------------------GSHLGNV-----------EGF 185
SSNNNI RI+KMV F+ SL GSH+ +V
Sbjct: 100 SSNNNIKRISKMVKFIYSLGTPFEVEAMLSRYKQSGTINGSHVCSVLPGIDEISDAFRAL 159
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
+ +FP L+ + S + L+ FGYRS T +D +
Sbjct: 160 KVFKFPDLKTIE-KSLLNLQEQRFGYRS--------------------TFIIDAARFLQD 198
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
W + + A L + GVG KVA CI L SL H +P+DTH+ RY
Sbjct: 199 NNVNW----SAISFESARSILMNIKGVGRKVADCICLTSLRFFHVVPLDTHI----IRYS 250
Query: 306 LPE--LAGVRLTPKLCSRVAEAFCE 328
+ E L LT K+ S + + E
Sbjct: 251 IKEFNLKHKTLTDKMYSGIQRMWTE 275
>gi|335438684|ref|ZP_08561420.1| 8-oxoguanine DNA glycosylase domain protein [Halorhabdus tiamatea
SARL4B]
gi|334890806|gb|EGM29066.1| 8-oxoguanine DNA glycosylase domain protein [Halorhabdus tiamatea
SARL4B]
Length = 308
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 105/281 (37%), Gaps = 54/281 (19%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L T +GQTFRW + Y GD Y PA+ L
Sbjct: 14 FDLQSTVESGQTFRWTREDGRSY---------EESDAYGGDAWYATVVVGDGPASDERDL 64
Query: 108 DFLNMGISLGEL-WEG-FSA----------------------SDCRFAELAKYLAGARVL 143
D + + + L WE F A SD G R++
Sbjct: 65 DVVRVRQTDDRLEWEASFDAEATLRRRLRLDDDLPAIRDATPSDPLLETAFDRYWGLRLV 124
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVE 203
R P L+ F+CS+ + RI +M L +G E++ FP+ ERL+ +E E
Sbjct: 125 RDPPFATLIAFICSAQMRVERIHEMQRALEERFGESITFDGEEYYAFPTPERLAEATEAE 184
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI 263
LR+ G GYR A Y+ T +++ + E L ++A
Sbjct: 185 LRDLGLGYR-----------------APYVQRTAEMVAEGTAHPQE----ATDLAYEDAR 223
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+ L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 224 EYLAQFVGVGEKVADCVLLFSLGFIEAVPLDTWIRSTIEEY 264
>gi|373454907|ref|ZP_09546768.1| 8-oxoguanine DNA-glycosylase (ogg) [Dialister succinatiphilus YIT
11850]
gi|371935409|gb|EHO63157.1| 8-oxoguanine DNA-glycosylase (ogg) [Dialister succinatiphilus YIT
11850]
Length = 286
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 108/258 (41%), Gaps = 25/258 (9%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAA 102
+ E L +GQ FR + + G H++ ++H+Q + Y + SP
Sbjct: 19 FIRDEFDLYKIIHSGQCFRAVEINDGLFRFITGRHVLYIRHIQG--IRYDVSCSPY--LW 74
Query: 103 KSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNI 162
+ ++ + + S +D + A Y G R+LRQDP E L+ F+ S +I
Sbjct: 75 NHLWKPYFDLSENYEAIRNSISPNDHYLQKAADYSRGIRILRQDPWETLITFIISQRKSI 134
Query: 163 ARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFS 222
I V+ L + FP+ ++LS +S+ +L+ GYR
Sbjct: 135 PAIASAVEKLCQKAGEEIKTPLEILYTFPTPQKLSSLSQEDLKACSLGYR---------- 184
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
YI+ + ++++ L + D ++ ++AL LPGVG KVA C AL
Sbjct: 185 -------IPYISKSAEIIKK----SPHLLKDMATYDDEKLLNALMELPGVGIKVANCTAL 233
Query: 283 FSLDQHHAIPVDTHVWKI 300
F + PVD + ++
Sbjct: 234 FGFHRTALAPVDVWIQRV 251
>gi|424812423|ref|ZP_18237663.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Candidatus Nanosalinarum sp. J07AB56]
gi|339756645|gb|EGQ40228.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Candidatus Nanosalinarum sp. J07AB56]
Length = 301
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 53/288 (18%)
Query: 41 LNLTQSE--LSLPLTFPTGQTFRWKKTG--------------------PLQYTGPIGPHL 78
+NL+Q + L T +GQT+ W++ P T P +
Sbjct: 6 INLSQLDGGFDLQSTLESGQTYCWERLDGEMFEDDNLRGSEARYATVVPASRTDSGNPEI 65
Query: 79 ISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLA 138
I+++ +G + + T+ ++P L +FL +G L E+ D F E
Sbjct: 66 ITVRQT-DGYLDWE-STTDADPH----LREFLRLGDDLDEIVRQSPDLDV-FKEAYDRFR 118
Query: 139 GARVLRQDPV-ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLS 197
G R++ DPV CL+ F+CS+ + RI M + L +EG F+ FP+ E+L+
Sbjct: 119 GMRIV-NDPVFPCLVSFICSAQMRVERIQLMQENLRREFGTPVELEGDTFYAFPTPEQLA 177
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
SE ELR+ G GYR A Y+ T +++ S ++
Sbjct: 178 EASEEELRDCGLGYR-----------------APYVKSTAEMVVDGLSPS-----KAAEM 215
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
+ +A + L GVG KVA C+ LFSL +A+P+DT + Y
Sbjct: 216 EYYDARNHLKQFMGVGDKVADCVLLFSLGFLNAVPLDTWIRTTIDDYF 263
>gi|448321387|ref|ZP_21510867.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronococcus amylolyticus DSM 10524]
gi|445604247|gb|ELY58198.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronococcus amylolyticus DSM 10524]
Length = 293
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 103/251 (41%), Gaps = 24/251 (9%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L L T +GQT+ W++T Y G P + V E +
Sbjct: 15 LDLYRTLESGQTYLWRRTDGEMYGGEPAPDEWYYTVVDGAVVRVRSRDGVLEWESSIDAD 74
Query: 108 DFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+ + L + E A+ D E + G R++ L+ F+CS+ ++R
Sbjct: 75 PLVRRLLRLEDDLEAIVAAAPDDPLLDEAYEAHRGLRLVDDPSFGTLISFICSAQMRVSR 134
Query: 165 ITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
I MV LA +G +H FP+ ERL+ +E ELR G GYR
Sbjct: 135 IHGMVSTLAREYGEPIAFDGETYHAFPTPERLATATEDELRELGLGYR------------ 182
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
A Y+ T +++ + E R L + A ++L GVG KVA C+ LFS
Sbjct: 183 -----APYVVRTAEMVADGEAHPEE----ARDLAYEAARESLKRFVGVGDKVADCVLLFS 233
Query: 285 LDQHHAIPVDT 295
L+ A+P+DT
Sbjct: 234 LEFDQAVPLDT 244
>gi|399574245|ref|ZP_10768004.1| 3-methyladenine DNA glycosylase [Halogranum salarium B-1]
gi|399240077|gb|EJN61002.1| 3-methyladenine DNA glycosylase [Halogranum salarium B-1]
Length = 304
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 79/166 (47%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M LA + G E+H FP+ E+L+
Sbjct: 119 GMRLVRDPPFACLISFICSAQMRVSRIHGMQMALAREFGSTVEMNGEEYHAFPTPEQLAA 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + E L L
Sbjct: 179 RTEDELRDLKLGYR-----------------APYVQRTAEMVADGEAHPDEAL----GLA 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
++A ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 218 YEDARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIQSAIADY 263
>gi|448739370|ref|ZP_21721385.1| 3-methyladenine DNA glycosylase [Halococcus thailandensis JCM
13552]
gi|445799965|gb|EMA50334.1| 3-methyladenine DNA glycosylase [Halococcus thailandensis JCM
13552]
Length = 302
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 78/160 (48%), Gaps = 23/160 (14%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS 197
G RV+R P L+ F+CS+ ++RI M LA GS + +G +H FP+ +L+
Sbjct: 117 GLRVVRDPPFVSLVSFICSAQMRVSRIHGMQRTLAREFGSTI-EFDGETYHAFPTPTQLA 175
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
SE ELR GYR A Y+ T +++ S G S R +
Sbjct: 176 AASEAELRECSLGYR-----------------APYVERTAELVAS----GEAHPESARGM 214
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+EA + L GVG KVA C+ LF+LD A+P+DT +
Sbjct: 215 AFEEAHEYLKQFVGVGDKVADCVLLFALDYLEAVPLDTWI 254
>gi|448401929|ref|ZP_21571840.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Haloterrigena limicola JCM 13563]
gi|445665987|gb|ELZ18658.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Haloterrigena limicola JCM 13563]
Length = 299
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 116/271 (42%), Gaps = 33/271 (12%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPH-----LISLKHLQNGDVCYHIHTSPS--EP 100
L T +GQ++ W+++ Y G P ++ +GDV + T E
Sbjct: 15 FDLYRTLESGQSYLWQRSDGEMYGGTPAPDAWYTTVVGSDGTGDGDVI-RVRTRDGSLEW 73
Query: 101 AAKSALLDFLNMGISLGELWEGFSAS---DCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
+ + + + L + EG A+ D E + G R++ CL+ F+CS
Sbjct: 74 ESTTDAEPIVRRLLRLDDDLEGIVAAGPDDPLLREAYEAHRGMRLVDDPAFGCLISFICS 133
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
+ ++RI MV LA +G +H FP+ E+L+ +E +LR+ G GYR
Sbjct: 134 AQMRVSRIHTMVTTLAREYGDEVAFDGETYHAFPTPEQLATATEADLRDLGLGYR----- 188
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A Y+ T +++ + G + R L+ +A L GVG KVA
Sbjct: 189 ------------APYVVRTAEMVAN----GEAHPEAARDLEYGDARKYLTRFVGVGDKVA 232
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
C+ LFSL A+P+DT + K A PE
Sbjct: 233 DCVLLFSLGFDEAVPLDTWI-KSAIEEYYPE 262
>gi|163816518|ref|ZP_02207882.1| hypothetical protein COPEUT_02708 [Coprococcus eutactus ATCC 27759]
gi|158448218|gb|EDP25213.1| 8-oxoguanine DNA-glycosylase (ogg) [Coprococcus eutactus ATCC
27759]
Length = 297
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 106/254 (41%), Gaps = 28/254 (11%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
++ +L GQ F + K Y + ++ D I+ S E
Sbjct: 22 NDFTLSQILECGQCFHFDKLDEEVYEVVAFGRAVKMEQT---DKVLRIYGSSMEDYEGIW 78
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
L +L+M G + + +D +G +L QD E L+ F+ S N +I +I
Sbjct: 79 KL-YLDMDNDYGLIKQSVIKADGALKTAVDEKSGIHILNQDFFETLISFIVSQNKSIPQI 137
Query: 166 TKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVR 224
+ V ++ G + G F+ FP ++RL +E ELR G+R AP
Sbjct: 138 KQCVKNISHRFGDEVIGYNGEAFYVFPDVQRLHDATEEELRECKVGFR-AP--------- 187
Query: 225 RSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFS 284
YI + + S GA L +LD+ +A + L T+ GVG KVA C+ LF
Sbjct: 188 -------YIKNATEAVYS----GAVTKEKLDELDIAQARELLMTIKGVGEKVANCVLLFG 236
Query: 285 LDQHHAIPVDTHVW 298
L + A PVD VW
Sbjct: 237 LGRREAFPVD--VW 248
>gi|373470091|ref|ZP_09561236.1| 8-oxoguanine DNA-glycosylase [Lachnospiraceae bacterium oral taxon
082 str. F0431]
gi|371763059|gb|EHO51558.1| 8-oxoguanine DNA-glycosylase [Lachnospiraceae bacterium oral taxon
082 str. F0431]
Length = 284
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 111/261 (42%), Gaps = 39/261 (14%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L + +GQ FR+ +Y+ G L+ +K +G V S SE +
Sbjct: 11 LDMKKIAESGQIFRFNVYDD-EYSLVAGDKLLFIKEDDDGYVL-----SCSESEFEEFWT 64
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
D+ ++ + + + A D A+Y G R+L QD E L+ F+ S +I I
Sbjct: 65 DYFDLRLDYSDFEKNIPADDLFLTNAAEYSYGIRILNQDKWEMLISFIISQRKSIPAIKS 124
Query: 168 MVDFLA---------SLGSHLGNVE-GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
++ LA + + N++ EF FP+ + L+ S +L GYRS
Sbjct: 125 SIEKLARTYGKKIDMQVPDFIKNIDKNSEFFAFPTPKALADASLDDLNACSLGYRSPYIE 184
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+ +V R G +D L +L KLD E + AL +L GVG KVA
Sbjct: 185 ASAKAVYR---------GDID------------LEALSKLDDNELLAALMSLKGVGIKVA 223
Query: 278 ACIALFSLDQHHAIPVDTHVW 298
C+ALF + A P+D VW
Sbjct: 224 NCVALFGYHRIAAFPID--VW 242
>gi|448358675|ref|ZP_21547352.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba chahannaoensis
JCM 10990]
gi|445645257|gb|ELY98263.1| DNA-(apurinic or apyrimidinic site) lyase [Natrialba chahannaoensis
JCM 10990]
Length = 293
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/268 (26%), Positives = 113/268 (42%), Gaps = 58/268 (21%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTG--PIGPHLISLKHLQNGDVCYH---------IHTS 96
L L T +GQT+ W++ Y+G +G +++ +GDV T+
Sbjct: 15 LDLYRTLESGQTYLWRRADGEMYSGTPALGEWYVTVV---DGDVIRARTVDGVLEWQSTT 71
Query: 97 PSEPAAK---------SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
+EP + A++D L E +E G R++
Sbjct: 72 DAEPTVRRLLRLDDDLEAIVDAAPDDPLLDEAYEAHR--------------GMRLVTDPA 117
Query: 148 VECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNA 207
L+ F+CS+ ++RI MV LA ++ G +H FP+ +L+ +E ELR+
Sbjct: 118 FGTLVSFICSAQMRVSRIHSMVSTLAREYGDEVSLNGDTYHAFPTPAQLASATEAELRDL 177
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
G GYR A Y+ T +++ S G R L+ + A + LC
Sbjct: 178 GLGYR-----------------APYVVRTAEMVAS----GETHPDDARTLEYEAAREHLC 216
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDT 295
GVG KVA C+ LF+L+ A+P+DT
Sbjct: 217 QFVGVGDKVADCVLLFALEFDEAVPLDT 244
>gi|284165716|ref|YP_003403995.1| 8-oxoguanine DNA glycosylase [Haloterrigena turkmenica DSM 5511]
gi|284015371|gb|ADB61322.1| 8-oxoguanine DNA glycosylase domain protein [Haloterrigena
turkmenica DSM 5511]
Length = 309
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 75/157 (47%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ CL+ F+CS+ ++RI MV LA+ +G +H FP+ +L+
Sbjct: 125 GMRLVDDPAFGCLISFICSTQMRVSRIHAMVSTLAAEYGDAIAFDGETYHAFPTPAQLAA 184
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ G GYR A Y+ T +++ + E R L+
Sbjct: 185 ATESELRDLGLGYR-----------------APYVVRTAELVADGEAHPEE----ARDLE 223
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ A + L GVG KVA C+ LFSL A+P+DT
Sbjct: 224 YEAAREYLTRFVGVGDKVADCVLLFSLGFDEAVPLDT 260
>gi|448314357|ref|ZP_21504055.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronolimnobius innermongolicus JCM 12255]
gi|445595184|gb|ELY49296.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronolimnobius innermongolicus JCM 12255]
Length = 295
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 111/263 (42%), Gaps = 30/263 (11%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH------- 92
PL L L T +GQ++ W+++ Y+ P + +G+V
Sbjct: 7 PLEECVGGLDLYRTLESGQSYLWRRSDGEMYSDSPAPDAW-YSTVVDGEVVRARLDEGAG 65
Query: 93 IHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
+ S A + + + L + L E+ + D + + G R++ CL+
Sbjct: 66 VLEWASTTDADALVRELLCLEDDLEEIVAN-APDDPLLEDAYEAHRGMRLVDDPAFGCLI 124
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CS+ ++RI MV LA +G + FP+ E L+ +E +LR+ G GYR
Sbjct: 125 SFICSTQMRVSRIHTMVSTLAREYGTPIEFDGETYSAFPTPEELATATEAQLRDLGLGYR 184
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
A Y+ T +++ +GG R L+ + A D L GV
Sbjct: 185 -----------------APYVVRTAEMV----AGGEAHPEDARDLEYEAARDFLTRFVGV 223
Query: 273 GPKVAACIALFSLDQHHAIPVDT 295
G KVA C+ LFSL A+P+DT
Sbjct: 224 GDKVADCVLLFSLGFDEAVPLDT 246
>gi|448414459|ref|ZP_21577528.1| 3-methyladenine DNA glycosylase [Halosarcina pallida JCM 14848]
gi|445682025|gb|ELZ34449.1| 3-methyladenine DNA glycosylase [Halosarcina pallida JCM 14848]
Length = 310
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R CL+ F+CS+ ++RI M LA V+G FH FP+ ERL+
Sbjct: 125 GMRLVRDPAFPCLVSFICSAQMRVSRIHGMQMRLAREYGETVVVDGETFHAFPTPERLAA 184
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + E L
Sbjct: 185 RTEAELRDLSLGYR-----------------APYVQRTAEMVADGEAHPDE----AAGLP 223
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+EA ++L GVG KV+ C+ LFSL A+P+DT +
Sbjct: 224 YEEARESLTRFVGVGDKVSDCVLLFSLGYLEAVPLDTWI 262
>gi|404481939|ref|ZP_11017168.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridiales bacterium
OBRC5-5]
gi|404344909|gb|EJZ71264.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridiales bacterium
OBRC5-5]
Length = 282
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 108/253 (42%), Gaps = 39/253 (15%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR+ +++ G L+ +K +G + S SE LD+ ++ +
Sbjct: 17 SGQIFRFNIYED-EFSLVAGERLLFIKEDGDGYIL-----SCSEEEFNDFWLDYFDLRLD 70
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL 175
E + SD A+Y G R+L QD E L+ F+ S +I I V+ L+ +
Sbjct: 71 YKEYEKNIPKSDLFLINAAEYSYGIRILNQDKWEMLISFIISQRKSIPAIKSSVEKLSKV 130
Query: 176 G---------SHLGNVE-GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+ N++ EF+ FP+ E L+ S EL GYRS
Sbjct: 131 YGKKIDMQVPDFIKNIDPNTEFYSFPTPEALANASIDELNACSLGYRSP----------- 179
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
YI T + K + +L KL+ E + AL +L GVG KVA C+ALF
Sbjct: 180 ------YIEATAKAVFRKDIE----IEALAKLNDTELLAALMSLKGVGIKVANCVALFGY 229
Query: 286 DQHHAIPVDTHVW 298
+ A P+D VW
Sbjct: 230 HRIAAFPID--VW 240
>gi|448306114|ref|ZP_21496024.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum bangense JCM 10635]
gi|445598752|gb|ELY52804.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Natronorubrum bangense JCM 10635]
Length = 299
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 72/157 (45%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ CL+ F+CS+ + RI MV LA +G +H FP+ +L+
Sbjct: 115 GMRLVEDPSFACLISFICSAQMRVGRIHGMVSTLAREYGDPIEFDGDVYHAFPTPTQLAA 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR G GYR A Y+ T ++ + E R L+
Sbjct: 175 ATEAELRELGLGYR-----------------APYVVRTAKMVADGEAHPEE----ARDLE 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
++A + L GVG KVA C+ LFSL A+P+DT
Sbjct: 214 YEQAREYLTRFVGVGDKVADCVLLFSLGFDEAVPLDT 250
>gi|448391798|ref|ZP_21566893.1| 8-oxoguanine DNA glycosylase [Haloterrigena salina JCM 13891]
gi|445665210|gb|ELZ17888.1| 8-oxoguanine DNA glycosylase [Haloterrigena salina JCM 13891]
Length = 309
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 73/157 (46%), Gaps = 21/157 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ CL+ F+CS+ + RI MV LA + +G +H FP+ +L+
Sbjct: 125 GMRLVDDPAFGCLISFICSTQMRVGRIHTMVSTLAREYGDAISFDGETYHAFPTPAQLAA 184
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ G GYR A Y+ T +++ + E R L
Sbjct: 185 ATEAELRDLGLGYR-----------------APYVVRTAEMVADGEAHPEE----ARDLA 223
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ A + L GVG KVA C+ LFSL A+P+DT
Sbjct: 224 YEPAREYLTRFVGVGDKVADCVLLFSLGFDEAVPLDT 260
>gi|363897807|ref|ZP_09324345.1| hypothetical protein HMPREF9624_00907 [Oribacterium sp. ACB7]
gi|361958272|gb|EHL11574.1| hypothetical protein HMPREF9624_00907 [Oribacterium sp. ACB7]
Length = 296
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 114/275 (41%), Gaps = 37/275 (13%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
KW+ L + E+SL +GQ FR ++ P + ++ LK + ++ ++I+ +
Sbjct: 13 KWSELCFPE-EISLEKIVRSGQCFRPREIFPSCFRFITKDSILYLKQVSAKELAFYINIT 71
Query: 97 P---------SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
+E K L + + G++ E S D + Y G RVLRQDP
Sbjct: 72 EEGDGEAREITEKQWKKIWLPYFDGKRKYGKI-ERKSREDDYLQKCIAYGKGLRVLRQDP 130
Query: 148 VECLLQFLCSSNNNIARITKMVDFLASL--GSHLGNVEGFEFHEFPSLERLSLVSEVELR 205
E LL F+ S +I I V+ L VE E + FP E L + +
Sbjct: 131 FETLLTFILSQRKSIPAIRSSVEKLCERFGEKQYSKVEEKEVYLFPRAEALQ---DADFS 187
Query: 206 NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA 265
GYR A ++ V+ + K L +L K+ +E +
Sbjct: 188 ECSLGYR-----------------APFVRDAVERVLEKRLD----LYALDKVPTEELLSK 226
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L + GVG KVAAC+ALF+ + IP D + +I
Sbjct: 227 LMEVHGVGIKVAACVALFAYSRMDIIPEDVWMKRI 261
>gi|295101968|emb|CBK99513.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Faecalibacterium prausnitzii L2-6]
Length = 283
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/263 (27%), Positives = 109/263 (41%), Gaps = 40/263 (15%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
Q + L +GQ FR +K +Y G L+ L L +G + S E
Sbjct: 14 QDDFDLDKIIESGQCFRPQKLSDGRYRFLSGSALLYLTPLGDGQYDAAWYGSDRE----- 68
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYL-------AGARVLRQDPVECLLQFLCS 157
+D+ ++G ++A C A + YL G R+L QDP E L+ FL S
Sbjct: 69 YWVDYFDLG-------RNYAALRCSLAGQSSYLDKSLNFGQGIRILHQDPWEMLITFLIS 121
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
+I I V+ LA + EG E FP+ ++L +SE +L G GYR+
Sbjct: 122 QRKSIPAIRTAVEHLARCCGEPLSAEGDEVFLFPTPQQLCGLSEAQLMGCGLGYRTR--- 178
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
++ + QA +GT+D L +L L L L GVG KVA
Sbjct: 179 ----YIQNAAAQAS--SGTLD------------LGALAALPDDVLFSRLLELDGVGKKVA 220
Query: 278 ACIALFSLDQHHAIPVDTHVWKI 300
C+ LF + P+D + ++
Sbjct: 221 NCVCLFGYGRTAMAPIDVWIQRL 243
>gi|257051908|ref|YP_003129741.1| 8-oxoguanine DNA glycosylase domain protein [Halorhabdus utahensis
DSM 12940]
gi|256690671|gb|ACV11008.1| 8-oxoguanine DNA glycosylase domain protein [Halorhabdus utahensis
DSM 12940]
Length = 307
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 74/166 (44%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ I RI +M L +G E+H FP+ RL+
Sbjct: 119 GLRLVRDPPFATLIAFICSAQMRIGRIHEMQRALEERFGESVTFDGVEYHAFPTPARLAE 178
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E +LR+ G GYR A Y+ T +++ G L
Sbjct: 179 ATEADLRDLGLGYR-----------------APYVQRTAEMVAE----GTAHPQEATDLA 217
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
++A + L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 218 YEDAREYLTQFVGVGEKVADCVLLFSLGFVEAVPLDTWIRSTIEEY 263
>gi|322371679|ref|ZP_08046222.1| DNA N-glycosylase [Haladaptatus paucihalophilus DX253]
gi|320548564|gb|EFW90235.1| DNA N-glycosylase [Haladaptatus paucihalophilus DX253]
Length = 293
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R CL+ F+CS+ + RI M LA +G +HEFP+ +L+
Sbjct: 111 GMRIVRDPFFPCLVSFICSAQMRVRRIYDMQTALARRFGESVEFDGKTYHEFPTPTQLAA 170
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
E +LR GYR A Y+ + ++L S + +R L
Sbjct: 171 ADESDLRALKLGYR-----------------APYVRKSAELLASGEVTAED----VRGLP 209
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A DA+ GVG KVA C+ LF+L A+P+DT +
Sbjct: 210 YEDARDAMQAFVGVGDKVADCVLLFALGYLEAVPLDTWI 248
>gi|402312125|ref|ZP_10831055.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Lachnospiraceae bacterium ICM7]
gi|400370786|gb|EJP23768.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Lachnospiraceae bacterium ICM7]
Length = 282
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 108/253 (42%), Gaps = 39/253 (15%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR+ +++ G L+ +K +G + S SE + LD+ ++ +
Sbjct: 17 SGQIFRFNIYED-EFSLVAGDKLLFIKEDGDGYIL-----SCSEEEFNNFWLDYFDLRLD 70
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL 175
E + SD +Y G R+L QD E L+ F+ S +I I V+ L+ +
Sbjct: 71 YKEYEKNIPKSDLFLINATEYSYGIRILNQDKWEMLISFIISQRKSIPAIKSSVEKLSKV 130
Query: 176 G---------SHLGNVE-GFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
+ N++ EF+ FP+ E L+ S EL GYRS
Sbjct: 131 YGKKIDMQVPDFIKNIDPDTEFYSFPTPEALANASIDELNACSLGYRSP----------- 179
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
YI T + K L +L KL+ E + AL +L GVG KVA C+ALF
Sbjct: 180 ------YIESTAKAVFRKDIE----LEALAKLNDTELLAALMSLKGVGIKVANCVALFGY 229
Query: 286 DQHHAIPVDTHVW 298
+ A P+D VW
Sbjct: 230 HRIAAFPID--VW 240
>gi|257388451|ref|YP_003178224.1| 8-oxoguanine DNA glycosylase [Halomicrobium mukohataei DSM 12286]
gi|257170758|gb|ACV48517.1| 8-oxoguanine DNA glycosylase domain protein [Halomicrobium
mukohataei DSM 12286]
Length = 302
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 74/159 (46%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ + RI M L +G +H +P+ E+L+
Sbjct: 115 GMRLVRDPPFGSLISFICSAQMRVGRIHGMQQALRETYGDPVTFDGETYHAYPTPEQLAA 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T ++ G + R++
Sbjct: 175 TTEAALRDLGLGYR-----------------APYVLRTATMVAE----GKADPHAAREMV 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+EA D L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEEARDHLTQFVGVGDKVADCVLLFSLDFLQAVPLDTWI 252
>gi|313126280|ref|YP_004036550.1| 3-methyladenine DNA glycosylase [Halogeometricum borinquense DSM
11551]
gi|448286121|ref|ZP_21477356.1| 3-methyladenine DNA glycosylase [Halogeometricum borinquense DSM
11551]
gi|312292645|gb|ADQ67105.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Halogeometricum borinquense DSM 11551]
gi|445575172|gb|ELY29651.1| 3-methyladenine DNA glycosylase [Halogeometricum borinquense DSM
11551]
Length = 306
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 79/159 (49%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ ++RI M LA V G +H FP+ ++L++
Sbjct: 121 GMRLVRDPPFPCLISFICSAQMRVSRIHGMQMRLAREYGDTVTVAGETYHAFPTADQLAV 180
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + + + L
Sbjct: 181 RTEDELRDLSLGYR-----------------APYVQRTAEMVANGDAHPNDAV----GLP 219
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A D+L GVG KV+ C+ LFSL A+P+DT +
Sbjct: 220 YEDARDSLTQFVGVGDKVSDCVLLFSLGFLEAVPLDTWI 258
>gi|448727275|ref|ZP_21709641.1| 3-methyladenine DNA glycosylase [Halococcus morrhuae DSM 1307]
gi|445791489|gb|EMA42129.1| 3-methyladenine DNA glycosylase [Halococcus morrhuae DSM 1307]
Length = 302
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 109/275 (39%), Gaps = 46/275 (16%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGP-----------------IGPHLISLKH 83
L+ + L T +GQ++ W++ Y P G H +
Sbjct: 8 LDEVAGDFDLQATLESGQSYAWQRADGRMYDRPSAVGGSAWYETVLPATVTGEHEVIRTR 67
Query: 84 LQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVL 143
+G + + T +E L L + L + E A Y G RV+
Sbjct: 68 QIDGRLDWEATTDATE-----LLTHRLRLDDDLDAIREATPAEPLIERAFDAY-EGLRVV 121
Query: 144 RQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEV 202
R P L+ F+CS+ ++RI M LA GS + G +H FP+ +L+ SE
Sbjct: 122 RDPPFVSLVSFICSAQMRVSRIHGMQRTLAREFGSAI-EFNGETYHAFPTPVQLADASEA 180
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEA 262
ELR GYR A Y+ T +++ S G S + +EA
Sbjct: 181 ELRECSLGYR-----------------APYVERTAELVAS----GEAHPESASGMAFEEA 219
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ L GVG KVA C+ LF+LD A+P+DT +
Sbjct: 220 REYLKQFVGVGDKVADCVLLFALDYLEAVPLDTWI 254
>gi|160947243|ref|ZP_02094410.1| hypothetical protein PEPMIC_01176 [Parvimonas micra ATCC 33270]
gi|158446377|gb|EDP23372.1| 8-oxoguanine DNA-glycosylase (ogg) [Parvimonas micra ATCC 33270]
Length = 291
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 116/271 (42%), Gaps = 28/271 (10%)
Query: 54 FPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMG 113
F GQ FRW + +T ++++ L D T+ E +D+ ++
Sbjct: 24 FTCGQAFRWYEESDGSFTTVHLGRVLNV--LNEKDRVIFKGTNLKE--FDEIWMDYFDLN 79
Query: 114 ISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA 173
E+ + S ++ + +Y G R+L Q+ E L+ F+ S+NN I RI K ++ ++
Sbjct: 80 TDYKEIRKVLSNNEI-LPKAMEYGEGIRILNQNHFEMLISFIISANNMIPRIKKSIEVIS 138
Query: 174 -SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAK 231
G + E +++ FP++E LS + +LR A G+R K
Sbjct: 139 MRYGKFICEDENRKYYSFPTVEELSRATVEDLRKFAKVGFRD-----------------K 181
Query: 232 YITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAI 291
I TV+++ ++ L + L+ + L GVG KVA CI LFS +
Sbjct: 182 RIFDTVNMILNEKID----LDNFENLETDILREELLKFSGVGNKVADCIMLFSYKRGEVF 237
Query: 292 PVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
PVD + ++ + + V+ K R+
Sbjct: 238 PVDVWIKRVMEELFIKKETPVKKISKEADRI 268
>gi|302387985|ref|YP_003823807.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
saccharolyticum WM1]
gi|302198613|gb|ADL06184.1| DNA-(apurinic or apyrimidinic site) lyase [Clostridium
saccharolyticum WM1]
Length = 293
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 87/218 (39%), Gaps = 32/218 (14%)
Query: 96 SPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
S E + + ++ G + + D E ++ G R+LRQD E L+ FL
Sbjct: 54 SCGEKEFQDMWAGYFDLQTDYGGFKKRVDSEDSYLKEAMEWGWGVRILRQDLWEMLVTFL 113
Query: 156 CSSNNNIARIT------------KMVDFLASLGSHLGNVEGFE-FHEFPSLERLSLVSEV 202
S NNNI RIT K + ++ S G VE + + FP E LS
Sbjct: 114 ISQNNNITRITGSVKEICKRFGEKRIGLGLAVSSDGGWVERSKSYDAFPEPEDLSSAGLR 173
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEA 262
L G GYR KYI G EWLL L++ D A
Sbjct: 174 GLAGLGLGYRD-----------------KYILSIAKACSGPE--GREWLLRLKEADYTSA 214
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L G+G KVA CI LF L A PVDTHV +I
Sbjct: 215 HSLLMEQYGIGRKVADCICLFGLHHVGAFPVDTHVRQI 252
>gi|337743307|gb|AEI73150.1| 8-oxoG [Kryptolebias marmoratus]
Length = 167
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR-YL 305
G +WL LR++ EA DAL TLPGVG KVA C+ L LD+ A+PVDTHVW+IA R Y
Sbjct: 7 GLQWLEGLRRVSYLEARDALRTLPGVGTKVADCVCLMCLDKPDAVPVDTHVWQIAQRDYK 66
Query: 306 LPELAGVR-LTPKLCSRVAEAF 326
G + LT K+ + + F
Sbjct: 67 YAAEKGQKSLTEKVHRNIGDFF 88
>gi|358068543|ref|ZP_09155005.1| hypothetical protein HMPREF9333_01886 [Johnsonella ignava ATCC
51276]
gi|356693360|gb|EHI55039.1| hypothetical protein HMPREF9333_01886 [Johnsonella ignava ATCC
51276]
Length = 274
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 112/267 (41%), Gaps = 29/267 (10%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQN--GDVCYHIHTSPS 98
+ ++ + T +GQ FR+ + G ++ L+ +K ++N G + H S +
Sbjct: 2 FRIEAEKIDMNKTADSGQIFRFNRIGEDKFELIASDKLLYIKEIENDAGSLS-HYEISCT 60
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSS 158
E K+ ++ ++ + D + Y G +L QD E L+ F+ S
Sbjct: 61 EYEYKNYWENYFDLKTDYSAFIKDIPKDDIFLNKSVSYSNGIHILNQDKWEMLISFIISQ 120
Query: 159 NNNIARITKMVDFLASL-GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
+I+ I ++ L + G + + + FP+ E L+ + EL GYRS
Sbjct: 121 RKSISAIKTSIEKLCCMFGKQIADGR----YAFPTPEALACANLHELNACSLGYRS---- 172
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+YI ++ S G L L KL+ +E I++L GVG KVA
Sbjct: 173 -------------EYIYEAASLIAS----GKIVLDKLDKLNDKELIESLMRFKGVGIKVA 215
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRY 304
C+ LF+ + A PVD + ++ Y
Sbjct: 216 NCVVLFAYHRIGAFPVDVWIDRVIKTY 242
>gi|448474137|ref|ZP_21602105.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum aidingense
JCM 13560]
gi|445818417|gb|EMA68276.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum aidingense
JCM 13560]
Length = 308
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 76/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L ++G +H FP+ ++L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVELDGESYHAFPTPDQLAA 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S + AE L
Sbjct: 183 RTESELRDLALGYR-----------------APYVQRTAEMVASGEAHPAE----AADLQ 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+ A ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEAARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIQTTIEEY 267
>gi|222479938|ref|YP_002566175.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
lacusprofundi ATCC 49239]
gi|222452840|gb|ACM57105.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
lacusprofundi ATCC 49239]
Length = 308
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 76/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L V+G E+ FP+ ++L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVTVDGEEYRAFPTPDQLAS 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S + E L
Sbjct: 183 RTESELRDLSLGYR-----------------APYVQRTAEMIASGEADPRE----AANLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESLTKFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|448687954|ref|ZP_21693922.1| 8-oxoguanine DNA glycosylase [Haloarcula japonica DSM 6131]
gi|445779745|gb|EMA30661.1| 8-oxoguanine DNA glycosylase [Haloarcula japonica DSM 6131]
Length = 302
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L +G ++ +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGEEVEFDGRTYNAYPTPSALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + + E + LD
Sbjct: 175 TTEARLRDLGLGYR-----------------APYVQRTAEMVATGEANPGEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|448733609|ref|ZP_21715852.1| 3-methyladenine DNA glycosylase [Halococcus salifodinae DSM 8989]
gi|445802498|gb|EMA52803.1| 3-methyladenine DNA glycosylase [Halococcus salifodinae DSM 8989]
Length = 310
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ + RI M LA + +G +H FP+ E+L+
Sbjct: 120 GLRLVRDPPFATLIAFICSAQMRVGRIHGMQRALAREFGSVVEFDGETYHAFPTPEQLAD 179
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S G + R +
Sbjct: 180 ATEAELRDCSLGYR-----------------APYVERTAELVAS----GEVLPENARGRE 218
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A + L GVG KVA C+ LFSL A+P+DT +
Sbjct: 219 YEDAREHLKQFMGVGDKVADCVLLFSLGYLEAVPLDTWI 257
>gi|194333980|ref|YP_002015840.1| 8-oxoguanine DNA glycosylase [Prosthecochloris aestuarii DSM 271]
gi|194311798|gb|ACF46193.1| 8-oxoguanine DNA glycosylase domain protein [Prosthecochloris
aestuarii DSM 271]
Length = 312
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 126/287 (43%), Gaps = 35/287 (12%)
Query: 44 TQSELSLPLTFPTGQTFRWKK--TGPLQYTGPIGPHLISLKHLQNGDV--CYHIHTSPSE 99
T+ +++L T +GQTFRW++ Y IG I L+ + N + C +SP
Sbjct: 9 TKRQINLKETLFSGQTFRWEQLEIDSNTYISMIGNTHIQLQQISNNKIRLC---SSSPLI 65
Query: 100 PAAKSALLDFLNMGISLGE-----LWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQF 154
K + F N SL E F ++LA+ G RVLR + E L+ F
Sbjct: 66 DGQKPTV--FFNEYCSLDIDERLCFSEEFRRVYPVVSQLAEPYMGVRVLRLNAFETLITF 123
Query: 155 LCSSNNNIARITKMVDFLAS-LGS-HLGNVEGFEF--HEFPSLERLSLVSEVELRNAGFG 210
+C+ + I K + + + G H+ ++G + FPS E L+ S +LR
Sbjct: 124 MCAQAIGMNLIRKQIRTICNRFGERHMTEIDGNPLIQYSFPSPETLAAASPQDLRICT-- 181
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
+ ++S + S R+ + + +D L + +L L ++L
Sbjct: 182 NNNCERASNIISAARAVAEGRL---CMDELINN------------ELSLGSIRNSLTAYR 226
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
G+G K+A C+ LF L +H A P+DTHV + ++ E LTPK
Sbjct: 227 GIGLKIADCVMLFGLHRHDAFPIDTHVRQYLGKWFGLEKTQKALTPK 273
>gi|304315458|ref|YP_003850605.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanothermobacter marburgensis str. Marburg]
gi|302588917|gb|ADL59292.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanothermobacter marburgensis str. Marburg]
Length = 309
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 125/294 (42%), Gaps = 53/294 (18%)
Query: 41 LNLTQSELSLPLTFPTGQTFR--WKKT-GPLQYTGPIGPHLISLKHLQNGDVCYHIH--- 94
+++ E L LT +GQT + WK+ G + IG ++ ++ GD ++
Sbjct: 1 MDIPVREFDLELTQESGQTSQPPWKRVDGAFRELLIIGGVPCPVE-VRAGDEVLRVNPYS 59
Query: 95 TSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQ 153
P +P K F ++ + +L+ D ++ + G R+ L +DP EC++
Sbjct: 60 DVPRKPLKKKIEYIF-DLKFEIEDLYSFLE--DKGLSDTIQSSRGLRLFLAKDPFECIIS 116
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSE------------ 201
+ S+N +I R T+ VD + G F+ FPS L+ V E
Sbjct: 117 SIASANCSIKRWTRAVDDIKRGWGDCHLFRGERFYTFPSPATLAGVEEESLEDLQRREDK 176
Query: 202 -------VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
+LR+ G GYR A YI T +L + + +
Sbjct: 177 LPDDFRFTDLRSCGVGYR-----------------APYIRETSRILSEELD-----ISKI 214
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
++D Q+A + L LPGVGPKVA CI L+ + A PVD V +I +L P+
Sbjct: 215 HRMDYQDAREVLLELPGVGPKVADCILLYGFRKTEAFPVDVWVRRIMN-HLYPD 267
>gi|238569222|ref|XP_002386605.1| hypothetical protein MPER_15084 [Moniliophthora perniciosa FA553]
gi|215438996|gb|EEB87535.1| hypothetical protein MPER_15084 [Moniliophthora perniciosa FA553]
Length = 137
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 73/149 (48%), Gaps = 27/149 (18%)
Query: 110 LNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV 169
+ + + L +L++ +S D F + +G R+LRQDP E L+ F+CSSNNNIARI+KMV
Sbjct: 6 IKLDVDLLKLYDEWSDRDQVFLNVKDRFSGIRILRQDPWENLISFICSSNNNIARISKMV 65
Query: 170 DFLASLGS-------HLGNVEGFEFHEF--PSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
L S G + +H F P++ + VS + LR GFGYR
Sbjct: 66 QNLCKAFSPPLLTLQDPGAITLSNYHPFPPPAVLAKTEVSSI-LRGLGFGYR-------- 116
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
A++I T +L HS +E
Sbjct: 117 ---------AEFIQKTAQMLVDAHSDASE 136
>gi|408397957|gb|EKJ77094.1| hypothetical protein FPSE_02738 [Fusarium pseudograminearum CS3096]
Length = 265
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 67/145 (46%), Gaps = 36/145 (24%)
Query: 176 GSHLGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
G + +V FH+FPS + L+ S E LR GFGYR AKYI
Sbjct: 21 GPFIAHVGDEAFHDFPSPQALTGDSVEAHLRTLGFGYR-----------------AKYIA 63
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLD----------------LQEAIDALCTLPGVGPKVAA 278
T ++ ++ WL SLR + +EA L TL GVGPKVA
Sbjct: 64 ETARIVANEKP--ETWLESLRNPEHPGFNTTPVPREQHATYKEAHQQLLTLKGVGPKVAD 121
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
C+ L L A+PVDTHVW+IA R
Sbjct: 122 CVCLMGLGWGEAVPVDTHVWQIAQR 146
>gi|302672170|ref|YP_003832130.1| 8-oxoguanine DNA glycosylase [Butyrivibrio proteoclasticus B316]
gi|302396643|gb|ADL35548.1| 8-oxoguanine DNA glycosylase Ogg [Butyrivibrio proteoclasticus
B316]
Length = 297
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 116/278 (41%), Gaps = 45/278 (16%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE----- 99
Q + L +GQ FR+ K G Y+ G + +K + + DV I S+
Sbjct: 6 QDDFDLQKIADSGQCFRFNKCGE-GYSVVAGDKYLFIKEVYDSDVEPDIVVGESDNKDEK 64
Query: 100 --PAAKSAL-----------LDFLNMGISLGELWEGFSASDCRF-AELAKYLAGARVLRQ 145
P + L ++ ++ S ++ S+ + ++Y G R+LRQ
Sbjct: 65 NRPGGRYELSCNKKEFEAFWKNYFDLNSSYSDIRSLIDKSEDEYLYNASEYGQGIRILRQ 124
Query: 146 DPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNV--EGFEFHEFPSLERLSLVSEVE 203
DP E L+ F+ S NI I ++ + +L V EG + + FP+ E LS ++ +
Sbjct: 125 DPWEMLISFIISQRKNIPAIKASIEKICALAGRKIAVDPEGNDVYSFPTPEELSKLTVEQ 184
Query: 204 LRNAGFGYRSAPQSSLLFSVRRSFKQAKYI-TGTVDVLQSKHSGGAEWLLSLRKLDLQEA 262
L GYR KY+ +DV+ SG A+ L S +++D
Sbjct: 185 LSACSLGYRD-----------------KYVHQAALDVV----SGAAD-LDSWKEIDDDAL 222
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ L L GVG KVA C LF + A P D + K+
Sbjct: 223 MQKLLALYGVGVKVANCEILFGYHRLDAFPKDVWINKV 260
>gi|78186949|ref|YP_374992.1| hypothetical protein Plut_1087 [Chlorobium luteolum DSM 273]
gi|78166851|gb|ABB23949.1| HhH-GPD [Chlorobium luteolum DSM 273]
Length = 311
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 111/253 (43%), Gaps = 31/253 (12%)
Query: 56 TGQTFRWKKTGP--LQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDF---L 110
+GQ+F W+K G + I + +++ +G V +HT + +S + F
Sbjct: 20 SGQSFSWQKHGNDGRYVSAIINGSAVVIENTNDGGVV--LHTDGNTIGVESPQVWFRRYF 77
Query: 111 NMGISLGELW-EGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV 169
++ + L+ E F + A + G RVLRQDP E ++ F+C+ +A I + V
Sbjct: 78 SLDVDTETLFSEPFRNAHPELALQLERYRGLRVLRQDPYETMVTFMCAQGIGMALIRRQV 137
Query: 170 DFLAS-LGSHLG---NVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
LA G H+ N + FP+ RL +ELR A ++ L R
Sbjct: 138 SMLARRYGEHVPLSLNGCTINLYRFPTPSRLGAADPMELR--------ACTNNNLMRARN 189
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDA-LCTLPGVGPKVAACIALFS 284
++ +T G ++ K + QE I A L G+G K+A CIALF
Sbjct: 190 IISASQKVT----------EGCIDFKALASKKNTQEDIQAALSRCGGIGLKIADCIALFG 239
Query: 285 LDQHHAIPVDTHV 297
L + A P+DTHV
Sbjct: 240 LGRFDAFPIDTHV 252
>gi|344212039|ref|YP_004796359.1| 8-oxoguanine DNA glycosylase [Haloarcula hispanica ATCC 33960]
gi|343783394|gb|AEM57371.1| 8-oxoguanine DNA glycosylase [Haloarcula hispanica ATCC 33960]
Length = 302
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L +G ++ +P+ + L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGETVEFDGRTYNAYPTPDALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + E + LD
Sbjct: 175 TTEERLRDLGLGYR-----------------APYVQRTAEMVAGGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|168334188|ref|ZP_02692395.1| 8-oxoguanine DNA glycosylase domain protein [Epulopiscium sp. 'N.t.
morphotype B']
Length = 289
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/272 (22%), Positives = 111/272 (40%), Gaps = 34/272 (12%)
Query: 39 TPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPS 98
T N+TQ T GQ FR+ K P + + ++++ Q D + +T+
Sbjct: 6 TDFNITQ-------TLECGQIFRFYKIEPDNFNIIVQDKVVNIS--QREDQLFIENTTLD 56
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSS 158
E ++L++ + + + D ++ G R+L+QDP E L+ F+ S
Sbjct: 57 E--FNLFWKNYLDLDTNYKIIKDTLCEIDIHMNNAIRFGGGIRILKQDPFEMLISFIISQ 114
Query: 159 NNNIARITKMVDFLASLGSH--LGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQ 216
N I I + ++ +A + ++ FP+L +L + +L G+R+A
Sbjct: 115 NKAIPHIKQCINNIAERFGQPIFQEISSETYYAFPTLAQLQAATIDDLSECKVGFRAA-- 172
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
YI +D K S G L S+ L+ +A L + GVG K+
Sbjct: 173 ---------------YIKDAID----KLSSGEVDLTSIASLETADARKQLMKIKGVGKKI 213
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
A C+ LF+ + P D + ++ + +
Sbjct: 214 ADCVLLFAYYRTDVFPTDVWIKRVVEGFYFNQ 245
>gi|448666452|ref|ZP_21685097.1| 8-oxoguanine DNA glycosylase [Haloarcula amylolytica JCM 13557]
gi|445771583|gb|EMA22639.1| 8-oxoguanine DNA glycosylase [Haloarcula amylolytica JCM 13557]
Length = 302
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 75/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L +G ++ +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGETVEFDGRTYNAYPTPSALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + E + LD
Sbjct: 175 TTEERLRDLGLGYR-----------------APYVQRTAEMVAGGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|261368602|ref|ZP_05981485.1| 3-Methyladenine DNA glycosylase [Subdoligranulum variabile DSM
15176]
gi|282569326|gb|EFB74861.1| 8-oxoguanine DNA-glycosylase (ogg) [Subdoligranulum variabile DSM
15176]
Length = 272
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 101/248 (40%), Gaps = 26/248 (10%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR K+ + G ++ L+ G S E + + ++G +
Sbjct: 17 SGQCFRVKEFEDGSFRFVTGREVLYLRQSAGG-----FAVSCPEETWQRVWGPYFDLGRT 71
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL 175
+ + SD A+ G R+LRQDP E L+ F+ S +I I + V+ LA
Sbjct: 72 YSAVQKIIPESDTYMQLAAQEGRGIRILRQDPWEMLVTFIISQRKSIPAIQQAVELLAER 131
Query: 176 GSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ FP+ E+L+ EL GYR + AK ++G
Sbjct: 132 FGEAVTTPYETLYTFPTAEQLAGAQAGELAACKLGYRVPYIQDAV---------AKVLSG 182
Query: 236 TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+D L +L L E + L T+ GVG KVA CI LF+ + A P+DT
Sbjct: 183 QMD------------LAALATLPDAELFEKLKTIRGVGDKVANCICLFAYGRMGAAPIDT 230
Query: 296 HVWKIATR 303
+ KI R
Sbjct: 231 WIHKIIAR 238
>gi|448455070|ref|ZP_21594396.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum lipolyticum
DSM 21995]
gi|445814185|gb|EMA64153.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum lipolyticum
DSM 21995]
Length = 308
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 76/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L ++G E+ FP+ E+L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDALALDGEEYRAFPTPEQLAS 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S + E L
Sbjct: 183 RTEAELRDLSLGYR-----------------APYVQRTAEMIASGEAHPRE----AAGLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|355571306|ref|ZP_09042558.1| DNA-(apurinic or apyrimidinic site) lyase [Methanolinea tarda
NOBI-1]
gi|354825694|gb|EHF09916.1| DNA-(apurinic or apyrimidinic site) lyase [Methanolinea tarda
NOBI-1]
Length = 301
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 111/273 (40%), Gaps = 36/273 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
L L+ GQ F W++ + G +G +I + +Q GD S A + +
Sbjct: 12 FDLDLSLSCGQVFGWERE-ETAWRGIVGQKVIRI--VQKGD-------RLSFSGADTGFI 61
Query: 108 DF-LNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ + + L E+ D +G R++RQ P ECL F+C++ +NI I
Sbjct: 62 RYYFALDVDLPEILSSID-RDPVIHYAVSSCSGLRIIRQPPWECLASFICATYSNIPGIR 120
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR 225
K V L +LG+ L + E FPS ++ L GYR
Sbjct: 121 KRVHLLCQNLGAPL-SPEYPGLFSFPSPTAIAGAETCMLSGCSLGYR------------- 166
Query: 226 SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 285
A Y+ T VL + W + L +A L L GVG KVA C+ LF+
Sbjct: 167 ----AGYLRETARVLAADTG----WEERIDALSFPKARANLLRLKGVGKKVADCVLLFAF 218
Query: 286 DQHHAIPVDTHVWKIATRYLLPELAGVRLTPKL 318
+ + PVD + +I TR PE +R K+
Sbjct: 219 GKWESFPVDVWIERIMTR-CYPECRDLRTYDKI 250
>gi|355670657|ref|ZP_09057404.1| hypothetical protein HMPREF9469_00441 [Clostridium citroniae
WAL-17108]
gi|354816094|gb|EHF00683.1| hypothetical protein HMPREF9469_00441 [Clostridium citroniae
WAL-17108]
Length = 301
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 81/169 (47%), Gaps = 30/169 (17%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHL------GNVEGFEFHEFP 191
G R+LRQDP E ++ F+ S I +I ++V+ L A+ G L G++E H FP
Sbjct: 102 GIRILRQDPWEMIITFVISQQKTIPKIRELVEALSAAYGKALEGDPGCGDME--TVHAFP 159
Query: 192 SLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWL 251
+ LS + EL+ GYR AKYI + Q SG + L
Sbjct: 160 TPRELSRATLEELQALKLGYR-----------------AKYIH---RLCQDAVSGALD-L 198
Query: 252 LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L ++ +EA++ L G+G KVA C+ LF L A PVDT + KI
Sbjct: 199 KGLAQMGYEEAMEYLTGFYGIGKKVANCVCLFGLHHISAFPVDTWIEKI 247
>gi|407463159|ref|YP_006774476.1| HhH-GPD family protein [Candidatus Nitrosopumilus koreensis AR1]
gi|407046781|gb|AFS81534.1| HhH-GPD family protein [Candidatus Nitrosopumilus koreensis AR1]
Length = 287
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 109/257 (42%), Gaps = 35/257 (13%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+ + + +GQ F WKK Y NG + + K+
Sbjct: 14 IDVENSINSGQVFLWKKNDKYWYGI-------------NGQDVLQVDEYGKIKSLKNYKT 60
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITK 167
DF + ++ + S + KY G R+ +QDP +C++ F+ SSN+NI +I
Sbjct: 61 DFFRNKDNFDKIIKSISKDKIVKNAVKKY-PGLRITKQDPFQCIISFIISSNSNIQKIKN 119
Query: 168 MVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
++ ++ + EF FP+ LS S E++ G GYR
Sbjct: 120 NLENISKRFGERIEYKDQEFFLFPNARILSKASINEIKKCGVGYR--------------- 164
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
A++I ++ S+ + +L+ +A +C +PG+G KVA CI LFSLD+
Sbjct: 165 --AEFIKEAAKIISSEKFK----IENLKTDSYSDAKKKMCIIPGIGNKVADCILLFSLDK 218
Query: 288 HHAIPVDTHVWKIATRY 304
A P+D + +I +Y
Sbjct: 219 LEAFPLDRWMIRILEKY 235
>gi|291519138|emb|CBK74359.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Butyrivibrio fibrisolvens 16/4]
Length = 279
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 110/270 (40%), Gaps = 49/270 (18%)
Query: 43 LTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIG-PHLISLKHLQNGDVCYHIHTSPS--E 99
LT E + + +GQ FR KK + G G P S N D+ ++++T +
Sbjct: 7 LTAEEFNPEVIMNSGQVFRMKK----ETNGADGIPDTYSACSGDN-DIYFYLNTKNDTWD 61
Query: 100 PAAKSALLDFLNMGISLGELWEGFS-----ASDCRFAELAKYLAGARVLRQDPVECLLQF 154
+ DF + ++ +SD + K G R+LRQD E + +
Sbjct: 62 FVCEEDQWDFWQRYFDFDTDYVAYNNKIRKSSDKYLKDALKDSFGMRILRQDLWEVFISY 121
Query: 155 LCSSNNNIARITKMVDFLA---SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNA-GFG 210
+ S NNNI +I K + L S G H FP E L+ V E EL + G
Sbjct: 122 VISQNNNIPKIKKSIQILCERYSDGIH-----------FPKPEVLAFVPETELMDGTALG 170
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR A YI G + G + ++ +L ++A + L +
Sbjct: 171 YR-----------------ADYIIGISKAVME----GRLDIEAISQLPYEQAYEKLLEIK 209
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
G+GPKVA CI L+ + P+DT + KI
Sbjct: 210 GIGPKVANCIMLYGFHFMESYPIDTWMKKI 239
>gi|160932765|ref|ZP_02080154.1| hypothetical protein CLOLEP_01606 [Clostridium leptum DSM 753]
gi|156867839|gb|EDO61211.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium leptum DSM 753]
Length = 225
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 94/209 (44%), Gaps = 28/209 (13%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ + ++ + D E A + G R+L Q+P E L F+ S NNNI RI +
Sbjct: 23 YFDLDLDYNKIRGALANKDPILKEAAAFAPGIRILNQEPFETLCSFIISQNNNIPRIKGI 82
Query: 169 VDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK 228
++ L L E FP+ E L+ E +L G+R+A
Sbjct: 83 IERLCLLFER----ETGSADRFPTAEMLACRREEDLAPLRAGWRAA-------------- 124
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
Y+ +D Q K +GG L ++ L + EA L + GVGPKVA C+ L+ L +
Sbjct: 125 ---YL---LDAAQ-KVAGGEIDLERIKALPMPEARAELMRIKGVGPKVAECVLLYGLHRL 177
Query: 289 HAIPVDTHVW-KIATRYLLPELAGVRLTP 316
A P+D VW K A L PE + + P
Sbjct: 178 EAFPMD--VWMKRAMATLFPEKSVLDFGP 204
>gi|374630819|ref|ZP_09703204.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanoplanus limicola DSM 2279]
gi|373908932|gb|EHQ37036.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanoplanus limicola DSM 2279]
Length = 321
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 119/277 (42%), Gaps = 31/277 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHL-QNGDVCYHIHTSPSEPAAKSAL 106
+L T GQ FRW+K + +G G H+ QNG + + + + L
Sbjct: 14 FNLDGTLSCGQVFRWEK----RSSGWFGVAWGKALHVRQNGRIIEYSGCN------EKFL 63
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
D+ + ++L + + + D G R++ Q+P ECL+ + C+ N NI I+
Sbjct: 64 RDYFQLDLNLDRVLDSVN-KDEHIGTAISESYGLRLVSQNPWECLITYSCAQNANIPFIS 122
Query: 167 KMVDFLA-SLGSHLGNVEGFEFHEFPSLERLS------LVSEVE----------LRNAG- 208
+M++ L+ + G L + E + + P + S L + E + N+G
Sbjct: 123 RMLENLSFAYGDPLPSAEDYMDNYSPGADDNSENSAGILTDDSEGDVCHGADKRIDNSGR 182
Query: 209 -FGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALC 267
F Y SA SL + S Y +G + + + W + LD + A +
Sbjct: 183 FFSYPSAKALSLSCAADVSGCSTGYRSGNICDTAGRVTANPGWADEISALDYELARGKIM 242
Query: 268 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
GVGPKVA CI LF+ + + PVD + +I + +
Sbjct: 243 EFKGVGPKVADCILLFAFRKFESFPVDVWMRRIMSEF 279
>gi|266619149|ref|ZP_06112084.1| N-glycosylase/DNA lyase [Clostridium hathewayi DSM 13479]
gi|288869329|gb|EFD01628.1| N-glycosylase/DNA lyase [Clostridium hathewayi DSM 13479]
Length = 287
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 74/180 (41%), Gaps = 32/180 (17%)
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL--------ASLGSHLGNVEGFE--- 186
G R+LRQD E ++ FL S NNNI+RI VD L G L EG +
Sbjct: 98 GGVRILRQDLWEMIVTFLISQNNNISRIRNSVDALCEKFGTRKTGTGLVLDPNEGVKSVE 157
Query: 187 --FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
++ FP ++ L G GYR KYI L+
Sbjct: 158 RTYNAFPEAGAVAAGGPEGLGGLGLGYRD-----------------KYIWAMA--LKCSG 198
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
GA WL LR D A L G+G KVA C+ LF L A PVDTHV +I Y
Sbjct: 199 PDGAAWLDDLRAADYHTAHGMLTAEFGIGRKVADCVCLFGLHHVEAFPVDTHVKQIVNAY 258
>gi|156061665|ref|XP_001596755.1| hypothetical protein SS1G_02978 [Sclerotinia sclerotiorum 1980]
gi|154700379|gb|EDO00118.1| hypothetical protein SS1G_02978 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 264
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 66/143 (46%), Gaps = 38/143 (26%)
Query: 179 LGNVEGFEFHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
+G++ FH+FP E L+ E LR GFGYR AKYI T
Sbjct: 13 IGHIGDQPFHDFPEPEALTGSGVESHLRVLGFGYR-----------------AKYIAQTA 55
Query: 238 DVLQSKHSGGAEWLLSLRKLD-----------------LQEAIDALCTLPGVGPKVAACI 280
++ SK G WL +LR + ++A + L L GVGPKVA C+
Sbjct: 56 SIVASKPKG---WLENLRNQETFDIPFEGEIPAGGRPGYRKAHEELLELQGVGPKVADCV 112
Query: 281 ALFSLDQHHAIPVDTHVWKIATR 303
L L A+PVDTHVW+IA R
Sbjct: 113 CLMGLGWGEAVPVDTHVWQIAQR 135
>gi|189346725|ref|YP_001943254.1| 8-oxoguanine DNA glycosylase [Chlorobium limicola DSM 245]
gi|189340872|gb|ACD90275.1| 8-oxoguanine DNA glycosylase domain protein [Chlorobium limicola
DSM 245]
Length = 312
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 121/289 (41%), Gaps = 40/289 (13%)
Query: 38 WTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQ--YTGPIGPHLISLKHLQNGDVCYHIHT 95
+ L LT +++ + +GQ+F W + + I + + + N Y I+
Sbjct: 3 YQSLLLTDIPVNIEDSLFSGQSFSWNRLSFKDNFFISVINNVPVVINQINN----YFINI 58
Query: 96 SPSEPA-----AKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVE 149
+ AL + + I G+L++ F A L + G ++LRQDP E
Sbjct: 59 YTPDKFIGGIPVSEALSAYFTLDIDNGKLFDDHFIKRFPAIATLLQEYMGLKLLRQDPFE 118
Query: 150 CLLQFLCSSNNNIARITKMVDFLASLGSHLGNVE----GFEFHEFPSLERLSLVSEVELR 205
+ F+C+ +A I + + L +E FP E L+ S + L+
Sbjct: 119 TTITFMCAQGIGMALIRRQIGMLCEKYGTPCTIELMGQKHRIFRFPKPEMLAETSVLSLQ 178
Query: 206 N-AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
YR A ++RR A GT+D S G++ L LD A+
Sbjct: 179 ACTNNNYRRA------LNIRRVAAAAA--EGTLDFTIS----GSQSL----SLDRIRAM- 221
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 313
LC G+GPK+A CIALFSL + A PVDTHV R L E G+R
Sbjct: 222 -LCEYDGIGPKIADCIALFSLGRFDAFPVDTHV-----RQYLAEWFGIR 264
>gi|380496302|emb|CCF31808.1| N-glycosylase/DNA lyase [Colletotrichum higginsianum]
Length = 382
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 116/300 (38%), Gaps = 62/300 (20%)
Query: 30 LTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV 89
++ + ++W L L+ +EL + T GQ+FRW+K + +S +Q+ D
Sbjct: 1 MSVARVTEWRKLPLSLTELCIDTTLRCGQSFRWRKIKNEWFDD------LSYMRMQDCDT 54
Query: 90 CYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVE 149
+ + S D ++ + W + + L G D E
Sbjct: 55 EHEVRCCTLYGRILSLKQDPTHLHYKV--TWPAKPSYPLTPPVVDNELDG------DDTE 106
Query: 150 CLLQFLCSSNN--------NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVS- 200
LL+ S ++ K+ L H+G+ FH+FP E L+ S
Sbjct: 107 KLLRHYLSLKLDLKSLSLLTFVQVHKLCIHYGPLIGHIGDEP---FHDFPPPEALTGSSV 163
Query: 201 EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL------ 254
E LR GFGYR AKYI T ++ + G W L
Sbjct: 164 EAHLRELGFGYR-----------------AKYIAQTASIIVNDRPRG--WFEGLTNPENP 204
Query: 255 --RKL---------DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
RK +EA + L L GVGPKVA C+ L L A+PVDTHVW+IA R
Sbjct: 205 CFRKTPEGSRLPQSTYKEAHEKLLQLAGVGPKVADCVCLMGLGWGEAVPVDTHVWQIAQR 264
>gi|255513919|gb|EET90184.1| 3-Methyladenine DNA glycosylase [Candidatus Micrarchaeum
acidiphilum ARMAN-2]
Length = 303
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 106/254 (41%), Gaps = 28/254 (11%)
Query: 73 PIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAE 132
P +I+L+ L++G S S A + +G L ++ G ++D
Sbjct: 46 PSDGKIINLRELESGRGRRIGIASKSIDYAVYDVKRRFRLGDRLSSIY-GAISTDATMEG 104
Query: 133 LAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD-FLASLGSHL-GNVEGFEFHEF 190
L + +G R+ DP E + ++ S NNI RI + +A GS + G+ + F
Sbjct: 105 LIQNFSGMRITLNDPWETTMCYILSQYNNIPRIRGITKRMIARFGSDIFGDHDSVVGKAF 164
Query: 191 PSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEW 250
P ++ SE + G G+R+ + + A Y T +D+ +
Sbjct: 165 PKSHEIAAASEKSIVECGAGFRA----------KYLVEAADYCTNNIDMAR--------- 205
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
L KLD E D L + GVG KVA CIALF + A PVD + +I R
Sbjct: 206 ---LGKLDYPELKDELLQIKGVGDKVADCIALFGYGKLEAFPVDVWIKRIVERLYFR--- 259
Query: 311 GVRLTPKLCSRVAE 324
G + + K R AE
Sbjct: 260 GRKKSIKEIHRFAE 273
>gi|325957884|ref|YP_004289350.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
AL-21]
gi|325329316|gb|ADZ08378.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
AL-21]
Length = 328
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 95/209 (45%), Gaps = 16/209 (7%)
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLR-QDPVECLLQFLCSSN 159
+ ++ L+D ++ L + ++ F D + +G R+ + DP EC++ + S+N
Sbjct: 77 SIRTKLMDIFSLNDDLNDFYD-FLRKDSELKPTIDFCSGLRLFKAHDPFECIISSISSAN 135
Query: 160 NNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
+I R + + + S N ++ FPS + L V E +L + P
Sbjct: 136 CSIIRWNRSIRDIKSKWGEEYNSNSNSWYTFPSPDILKNVPEHDLEEMDRCETNLPDDH- 194
Query: 220 LFSVRRSFK------QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 273
S ++ K +AK++ T +++Q++ + A + K+ A + + LPGVG
Sbjct: 195 --SFEKNLKSCGVGYRAKFMIKTAEIVQNQINIDA-----IHKMSYNNAFETMLDLPGVG 247
Query: 274 PKVAACIALFSLDQHHAIPVDTHVWKIAT 302
PKVA CI + A PVD + +I +
Sbjct: 248 PKVADCILFYGFGFKEAFPVDVWIGRIVS 276
>gi|227874004|ref|ZP_03992218.1| 8-oxoguanine DNA glycosylase [Oribacterium sinus F0268]
gi|227840150|gb|EEJ50566.1| 8-oxoguanine DNA glycosylase [Oribacterium sinus F0268]
Length = 275
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 75/161 (46%), Gaps = 21/161 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+LRQDP E LL F+ S +I I K V+ L+ H E FPS E +
Sbjct: 98 GVRLLRQDPWEMLLTFIISQRKSIPAIIKSVEALSEKYGHDIVTEQERLKAFPSPEEMKE 157
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+ EL G GYR KYI +D +Q +SG L ++ KL
Sbjct: 158 ATAEELAACGLGYR-----------------VKYI---LDAIQKVNSGELN-LKAIAKLP 196
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
++ L + GVG KVA CIALF+ + +PVD +++
Sbjct: 197 DDVLLEKLQAVMGVGIKVANCIALFAYGRTACVPVDVWIFR 237
>gi|448503024|ref|ZP_21612888.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum coriense
DSM 10284]
gi|445693426|gb|ELZ45578.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum coriense
DSM 10284]
Length = 308
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 75/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L ++ G + FP+ E+LS
Sbjct: 123 GMRLTRDPVFPCLVSFICSAQMRVARIHGMQRRLREAYGDGVSLGGETYRAFPTPEQLSA 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S G L L
Sbjct: 183 RTEDELRDLSLGYR-----------------APYVQRTAEMVAS----GEADPLDAADLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTVEEY 267
>gi|225571487|ref|ZP_03780483.1| hypothetical protein CLOHYLEM_07585 [Clostridium hylemonae DSM
15053]
gi|225159963|gb|EEG72582.1| hypothetical protein CLOHYLEM_07585 [Clostridium hylemonae DSM
15053]
Length = 270
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 97/243 (39%), Gaps = 27/243 (11%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGIS 115
+GQ FR + Y+ G + + Q G C +E +S + ++
Sbjct: 17 SGQCFRMEPIDGSTYSVVAGGRYVEIT--QEGKSC---TFRCAEEEYESFWRHYFDLDED 71
Query: 116 LGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLAS 174
G + D A +G R+LRQD E FL S NNI RI + + + +
Sbjct: 72 YGAYIKNIDVKDDYLRSAALLGSGIRILRQDLWEMTASFLISQQNNITRIRRCISNICET 131
Query: 175 LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
G + G ++ FP E L+ + E L+ GYRS KY+
Sbjct: 132 YGEKRESASGRVYYTFPEPEALAELEEDALKECNLGYRS-----------------KYVV 174
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
T + + G L +R L EA L L GVG KVA CI LF L A PVD
Sbjct: 175 RTARSI----AYGDVNLDHIRSLSYPEAKAELLKLYGVGDKVADCICLFGLHHLDAFPVD 230
Query: 295 THV 297
TH+
Sbjct: 231 THI 233
>gi|350269719|ref|YP_004881027.1| putative N-glycosylase/DNA lyase [Oscillibacter valericigenes
Sjm18-20]
gi|348594561|dbj|BAK98521.1| putative N-glycosylase/DNA lyase [Oscillibacter valericigenes
Sjm18-20]
Length = 269
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 107/263 (40%), Gaps = 44/263 (16%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ + +L +GQ FR + P ++ L D C + A
Sbjct: 6 KQDFNLEKIADSGQCFRLNQLEPGRF------------RLAARDRCVELTERTDAWALDC 53
Query: 105 ALLDFLNMGISLGELWEGFSA-------SDCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
+ +F + S +L E ++A D A ++ G R+LRQDP E L+ F+ S
Sbjct: 54 SAAEFNALWRSYFDLDENYAAFRAAVPKKDMYLAAATEFGMGIRILRQDPWEMLVTFIIS 113
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
NI I V+ +L + G G E+ FPS LS E L GYR
Sbjct: 114 QRKNIPAIRACVE---TLCTRYGKSLGSEY-AFPSPAALSGAGEEGLHACALGYR----- 164
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A Y+ ++Q+ GA L +L L E ++AL +PGVG KVA
Sbjct: 165 ------------AGYVLAATRLVQT----GALDLEALASLGDGELMEALMAVPGVGVKVA 208
Query: 278 ACIALFSLDQHHAIPVDTHVWKI 300
C++LF + A P D + ++
Sbjct: 209 NCVSLFGYHRIAAFPRDVWINRV 231
>gi|295090684|emb|CBK76791.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium cf. saccharolyticum K10]
Length = 308
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 111/276 (40%), Gaps = 54/276 (19%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
S + L +GQ+FR K +++ G H + I P E A
Sbjct: 10 SYMDLRQIADSGQSFRMAKLQDRRFSVISGEHYLE------------IVQGPGEDHAAER 57
Query: 106 LLDF-------------LNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
F ++ G D ++ A+ +G R+L QDP E ++
Sbjct: 58 SFTFCCDEEELHFWEHYFDLEHDYGVYINSVRKRDSYLSQAARAGSGIRILNQDPWEMVI 117
Query: 153 QFLCSSNNNIARITKMVDFLA-SLGSH----LGNVEG---FEFHEFPSLERLSLVSEVEL 204
F+ S I +I + V+ L+ + G+ + + EG ++ FP+ +LS SE EL
Sbjct: 118 TFVISQQKTIPKIREAVEALSRNYGTRHETVVNDDEGEHTVVWYSFPTPAQLSAASETEL 177
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R GYR AKYI + Q G + L L++L EA+
Sbjct: 178 RELKLGYR-----------------AKYI---FRLCQDAVEGRLDLEL-LQRLSYGEAMT 216
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L + G+G KVA CI LF L A PVDT + +I
Sbjct: 217 YLGSFYGIGTKVANCICLFGLHHIEAFPVDTWIEQI 252
>gi|385803946|ref|YP_005840346.1| DNA N-glycosylase [Haloquadratum walsbyi C23]
gi|339729438|emb|CCC40695.1| DNA N-glycosylase [Haloquadratum walsbyi C23]
Length = 337
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ ++RI +M +A V G F FP+ +L+
Sbjct: 143 GLRLVRDPPFSTLISFICSAQMRVSRIHQMQLSMADAFGTTHTVGGESFAAFPTPSQLAA 202
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE LR+ GYR A Y+ T +++ +G A ++ +
Sbjct: 203 QSESALRDLSLGYR-----------------ASYVQRTAEMVS---TGEAHPAIAAERA- 241
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A + L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 242 YEDAREYLTQFVGVGNKVADCVLLFSLDYTEAVPLDTWI 280
>gi|283798664|ref|ZP_06347817.1| putative N-glycosylase/DNA lyase [Clostridium sp. M62/1]
gi|291073649|gb|EFE11013.1| 8-oxoguanine DNA-glycosylase (ogg) [Clostridium sp. M62/1]
Length = 308
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 111/276 (40%), Gaps = 54/276 (19%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
S + L +GQ+FR K +++ G H + I P E A
Sbjct: 10 SYMDLRQIADSGQSFRMAKLQDKRFSVISGEHYLE------------IVQGPGEDHAAER 57
Query: 106 LLDF-------------LNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
F ++ G D ++ A+ +G R+L QDP E ++
Sbjct: 58 SFTFCCDEEELHFWEHYFDLEHDYGVYINSVRKRDSYLSQAARAGSGIRILNQDPWEMVI 117
Query: 153 QFLCSSNNNIARITKMVDFLA-SLGSH----LGNVEG---FEFHEFPSLERLSLVSEVEL 204
F+ S I +I + V+ L+ + G+ + + EG ++ FP+ +LS SE EL
Sbjct: 118 TFVISQQKTIPKIREAVEALSRNYGTRHETVVNDDEGEHTVVWYSFPTPAQLSAASETEL 177
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R GYR AKYI + Q G + L L++L EA+
Sbjct: 178 RELKLGYR-----------------AKYI---FRLCQDAVEGRLDLEL-LQRLSYGEAMT 216
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L + G+G KVA CI LF L A PVDT + +I
Sbjct: 217 YLGSFYGIGTKVANCICLFGLHHIEAFPVDTWIEQI 252
>gi|15679342|ref|NP_276459.1| 8-oxoguanine DNA glycosylase [Methanothermobacter
thermautotrophicus str. Delta H]
gi|10720150|sp|O27397.1|OGG1_METTH RecName: Full=Probable N-glycosylase/DNA lyase; Includes: RecName:
Full=8-oxoguanine DNA glycosylase; Includes: RecName:
Full=DNA-(apurinic or apyrimidinic site) lyase; Short=AP
lyase
gi|2622449|gb|AAB85820.1| 8-oxoguanine DNA glycosylase [Methanothermobacter
thermautotrophicus str. Delta H]
Length = 312
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 83/175 (47%), Gaps = 13/175 (7%)
Query: 133 LAKYLAGARVLR----QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFH 188
L+ L +R LR +DP EC++ + S+N ++ R T+ ++ + L G FH
Sbjct: 92 LSYTLDSSRGLRLFLAKDPFECVISSIASANCSVVRWTRSIEDIRRLWGQANTFNGETFH 151
Query: 189 EFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK---QAKYITGTVDVLQSKHS 245
FPS L+ V+E L + + P S F+ RS +A YI T +L +
Sbjct: 152 TFPSPHVLTGVAEGSLEDLQRAEDNLP-SDFSFNDLRSCGVGYRAPYIRETSRILAEEMD 210
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ + +D +A + L L GVGPKVA CI L+ + A PVD + +I
Sbjct: 211 -----IRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>gi|291458678|ref|ZP_06598068.1| 8-oxoguanine-DNA-glycosylase [Oribacterium sp. oral taxon 078 str.
F0262]
gi|291419211|gb|EFE92930.1| 8-oxoguanine-DNA-glycosylase [Oribacterium sp. oral taxon 078 str.
F0262]
Length = 277
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 111/271 (40%), Gaps = 34/271 (12%)
Query: 44 TQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAK 103
T+ ELSL +GQ FR ++ Y G ++ ++ + G+ + + S + +
Sbjct: 6 TEEELSLDKIIDSGQCFRPRRLSSGLYRFIKGREILYIRP-ERGEREFRVSCSGEK--WE 62
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
++ ++ S +L G D + G RVLRQDP E L+ F+ S +I
Sbjct: 63 RIWAEYFDLRRSYRKLLRGIPEEDGYLRRAEAFGRGIRVLRQDPFEVLISFIISQRKSIP 122
Query: 164 RITKMVDFLASLGSHLGNVEGFEFHE-----FPSLERLSLVSEVELRNAGFGYRSAPQSS 218
I ++ L + G + E FPS ++ +SE ELR G GYR
Sbjct: 123 AIRGCIE---KLSARCGRSVRTPYEEEPVFLFPSPGSIAALSEEELRKLGLGYR------ 173
Query: 219 LLFSVRRSFKQAKYITGTVD-VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
A Y+ + L+ + S AE L + L + + L GVG KVA
Sbjct: 174 -----------APYLRDAAERTLRGELS--AERLSA---LPTERLCEELLRTRGVGRKVA 217
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
C+ LF + P+D + + Y L E
Sbjct: 218 DCVLLFGYGRTERAPIDVWIRRTIEEYYLGE 248
>gi|448627232|ref|ZP_21671845.1| 8-oxoguanine DNA glycosylase [Haloarcula vallismortis ATCC 29715]
gi|445759061|gb|EMA10348.1| 8-oxoguanine DNA glycosylase [Haloarcula vallismortis ATCC 29715]
Length = 302
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 75/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L G ++ +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGEEIECGGRTYNAYPTPAALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + + E + LD
Sbjct: 175 TTEGRLRDLGLGYR-----------------APYVQRTAEMVAAGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|448639286|ref|ZP_21676700.1| 8-oxoguanine DNA glycosylase [Haloarcula sinaiiensis ATCC 33800]
gi|445762873|gb|EMA14085.1| 8-oxoguanine DNA glycosylase [Haloarcula sinaiiensis ATCC 33800]
Length = 302
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 73/159 (45%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L G + +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGEEVEFGGRTYSAYPTPSALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + E + LD
Sbjct: 175 TTEERLRDLGLGYR-----------------APYVQRTAEMVAGGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|448677570|ref|ZP_21688760.1| 8-oxoguanine DNA glycosylase [Haloarcula argentinensis DSM 12282]
gi|445773245|gb|EMA24278.1| 8-oxoguanine DNA glycosylase [Haloarcula argentinensis DSM 12282]
Length = 302
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 75/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L + ++ +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGETVEFDERTYNAYPTPSALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + + E + LD
Sbjct: 175 TTEERLRDLGLGYR-----------------APYVQRTAEMVANGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|148508030|gb|ABQ75828.1| DNA N-glycosylase / DNA lyase [uncultured haloarchaeon]
Length = 321
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 31/177 (17%)
Query: 126 SDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGF 185
SD G R++R P L+ F+CS+ ++RI +M LA +G
Sbjct: 124 SDSLITNAIDAFEGMRLVRDPPFVSLISFICSAQMRVSRIHQMQKRLAETYGQTVTADGE 183
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS--- 242
+FH FP+ LS +E LR+ GYR A Y+ T +++
Sbjct: 184 QFHVFPTSSELSTATEESLRDLKLGYR-----------------APYVLDTAEMVARDEA 226
Query: 243 --KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+H+ + ++A L GVG KVA C+ LF+L A+P+DT +
Sbjct: 227 HPEHAATQPY---------EQARSYLTQFTGVGNKVADCVLLFALGYLEAVPLDTWI 274
>gi|55377973|ref|YP_135823.1| 8-oxoguanine DNA glycosylase [Haloarcula marismortui ATCC 43049]
gi|448658436|ref|ZP_21682836.1| 8-oxoguanine DNA glycosylase [Haloarcula californiae ATCC 33799]
gi|55230698|gb|AAV46117.1| 8-oxoguanine DNA glycosylase [Haloarcula marismortui ATCC 43049]
gi|445761161|gb|EMA12410.1| 8-oxoguanine DNA glycosylase [Haloarcula californiae ATCC 33799]
Length = 302
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 73/159 (45%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+++ P L+ F+CS+ +ARI M L G + +P+ L+
Sbjct: 115 GMRLVQDPPFGSLISFICSAQMRVARIHSMQQALRDAFGEEVEFGGRTYSAYPTPSALAE 174
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E LR+ G GYR A Y+ T +++ + E + LD
Sbjct: 175 TTEERLRDLGLGYR-----------------APYVQRTAEMVAGGEADPEEAV----GLD 213
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
++A ++L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 214 YEDARESLTRFVGVGDKVADCVLLFSLDYLEAVPLDTWI 252
>gi|452207593|ref|YP_007487715.1| DNA N-glycosylase [Natronomonas moolapensis 8.8.11]
gi|452083693|emb|CCQ37007.1| DNA N-glycosylase [Natronomonas moolapensis 8.8.11]
Length = 306
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 74/159 (46%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P CL+ F+CS+ +ARI M LA +G +H FP+ L+
Sbjct: 118 GMRLVRDPPFGCLISFICSAQMRVARIHGMQTALAETYGEAVTFDGETYHAFPAPSALAA 177
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR GYR A Y+ T +++ + E R L+
Sbjct: 178 ATEAELRELSLGYR-----------------APYVQRTAEMVAEGEARPDE----ARGLE 216
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+A ++L GVG KVA C+ LFSL A+P+DT +
Sbjct: 217 YAKARESLTRFVGVGEKVADCVLLFSLGYLEAVPLDTWI 255
>gi|239616443|ref|YP_002939765.1| 8-oxoguanine DNA glycosylase domain protein [Kosmotoga olearia TBF
19.5.1]
gi|239505274|gb|ACR78761.1| 8-oxoguanine DNA glycosylase domain protein [Kosmotoga olearia TBF
19.5.1]
Length = 297
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 119/281 (42%), Gaps = 41/281 (14%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAK--SA 105
L T GQTFRW+KT + G + ++ LK Q+G I + +
Sbjct: 12 FDLDTTLDCGQTFRWEKTDGW-WKGVVRDTVLFLK--QSGKKLDVIASRDTLLGMDIDEG 68
Query: 106 LLDFLNMGISLGELWEGFSASDCRFAELAKYLA--------GARVLRQDPVECLLQFLCS 157
L +L L E+ E K L+ G R+LRQD +E ++++ S
Sbjct: 69 LRKYLGFEDDLEEVHSTLEMIIRSLPERTKELSLNAVKQARGLRILRQDALEMTVEYIIS 128
Query: 158 SNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQ 216
+ N+I I K+ D L A + ++ F+ FPSLE+L + +L G+R
Sbjct: 129 TRNSIPTIRKISDLLSAKFPENRVEMDDEVFYTFPSLEQLKHLKLEDLLEIKLGFR---- 184
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ L+ + ++ + + D L E ++ L + G+G KV
Sbjct: 185 ADWLYELFQNLEDEMFFEELYDK------------------PLMEKLEELMKIKGIGYKV 226
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
+CI LF+ + ++ PVD VW + ++ +L GV + K
Sbjct: 227 GSCITLFAYAELNSFPVD--VW---IKRVMKDLFGVNGSTK 262
>gi|373106876|ref|ZP_09521176.1| 8-oxoguanine DNA-glycosylase (ogg) [Stomatobaculum longum]
gi|371651815|gb|EHO17241.1| 8-oxoguanine DNA-glycosylase (ogg) [Stomatobaculum longum]
Length = 299
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 113/268 (42%), Gaps = 28/268 (10%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
K+ + L ++ LSL +GQ FR + G + G ++ L+ Q GD + S
Sbjct: 3 KFRTIELREA-LSLRKIADSGQCFRPGEIGDGVFRFISGNEVLELR--QRGDCA--VEVS 57
Query: 97 PSEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQDPVECLLQFL 155
+ + ++ + F + D AE + G RVLRQ+P E L+ F+
Sbjct: 58 CDAESWDKVWRTYFDLERDYSAIQRDIFESGDTLLAEATVHGDGIRVLRQEPFEMLVSFV 117
Query: 156 CSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSA 214
S +I I V LA + G G FP E L+ SE LRN G GYR A
Sbjct: 118 LSQRKSIPAIRTAVRELAEAYGEATEAPSGETVFLFPKPEVLAGSSEESLRNLGLGYR-A 176
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
P +L + RR G +D L ++R L +E L ++ GVG
Sbjct: 177 P--YVLDAARR------VADGRLD------------LDAVRDLPDEELFAELQSVHGVGK 216
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIAT 302
KVA C+ LF + +P D + +I +
Sbjct: 217 KVANCVMLFGYGRTERVPSDVWMQRIVS 244
>gi|110668484|ref|YP_658295.1| DNA N-glycosylase / DNA lyase [Haloquadratum walsbyi DSM 16790]
gi|109626231|emb|CAJ52689.1| DNA N-glycosylase [Haloquadratum walsbyi DSM 16790]
Length = 337
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 75/159 (47%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ ++RI +M +A V G F FP+ +L+
Sbjct: 143 GLRLVRDPPFSTLISFICSAQMRVSRIHQMQLSMADAFGTTHTVGGESFAAFPTPSQLAA 202
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE LR+ GYR A Y+ T +++ +G A ++ +
Sbjct: 203 QSESALRDLSLGYR-----------------APYVQRTAEMVS---TGEAHPAIAAERA- 241
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A + L GVG KVA C+ LFSLD A+P+DT +
Sbjct: 242 YENAREYLTQFVGVGNKVADCVLLFSLDYTEAVPLDTWI 280
>gi|354611217|ref|ZP_09029173.1| DNA-(apurinic or apyrimidinic site) lyase [Halobacterium sp. DL1]
gi|353196037|gb|EHB61539.1| DNA-(apurinic or apyrimidinic site) lyase [Halobacterium sp. DL1]
Length = 296
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 32/263 (12%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQY--TGPIGPHLISLKHLQNGDVCYHIHTSP- 97
L+ + L T +GQT+ W + Y TG G + + +GDV + +P
Sbjct: 8 LDDVPGSVDLQSTLESGQTYLWWRPDGATYDTTGAYGGDAW-YRTVVDGDVV-EVRQTPD 65
Query: 98 -----SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLL 152
+ A + + L + L E+ + + A Y G R++R CL+
Sbjct: 66 AVEWRATTDADPLVRELLGLHDDLHEIRAAVTDDELLQAAWDTY-DGLRIVRDPFFGCLV 124
Query: 153 QFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
F+CS+ + RI M + L +G H FP L+ +E +LR+ GYR
Sbjct: 125 SFICSAQMRVERIFAMQEALRETYGDPIEYDGETVHGFPEPAALAAATEDDLRDLKLGYR 184
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
AP V+R+ + + T ++ ++ G A + A DAL GV
Sbjct: 185 -APY------VQRTAE----MVATGELTKADVEGRA----------YETARDALTGFVGV 223
Query: 273 GPKVAACIALFSLDQHHAIPVDT 295
G KVA C+ LFSL A+P+DT
Sbjct: 224 GDKVADCVLLFSLGYLEAVPLDT 246
>gi|110598335|ref|ZP_01386609.1| HhH-GPD [Chlorobium ferrooxidans DSM 13031]
gi|110340033|gb|EAT58534.1| HhH-GPD [Chlorobium ferrooxidans DSM 13031]
Length = 280
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 93/200 (46%), Gaps = 32/200 (16%)
Query: 108 DFLNMGISLGELW-EGFSASDCRFAELAKYLAGA---RVLRQDPVECLLQFLCSSNNNIA 163
D+L + I + L+ GFS + EL K L+G R+LRQD E L+ F+C+ +
Sbjct: 44 DYLTLDIDINRLFPSGFSE---HYPELWKLLSGYFSLRILRQDFFETLITFMCAQGIGMH 100
Query: 164 RITKMVDFLA-SLG---SHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
I K V L + G S L N + + FP+ L+ V L R +++
Sbjct: 101 LIRKQVTMLCHTFGEKRSILFNGKSITLYSFPTPLSLAEADIVMLSRCTNNNRI--RATN 158
Query: 220 LFSVRRSFKQAKYITGTVD--VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+ RSF + G +D +L++ H L L E LC PG+G K+A
Sbjct: 159 ISRAARSF-----MDGALDPELLRNPH------------LPLPELRSMLCRNPGIGYKIA 201
Query: 278 ACIALFSLDQHHAIPVDTHV 297
CIALF L + A PVDTHV
Sbjct: 202 DCIALFGLGRFDAFPVDTHV 221
>gi|148643425|ref|YP_001273938.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanobrevibacter smithii ATCC 35061]
gi|148552442|gb|ABQ87570.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methanobrevibacter smithii ATCC 35061]
Length = 311
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 120/287 (41%), Gaps = 53/287 (18%)
Query: 43 LTQSELSLPLTFPTGQTFR--WKKTGPLQYTGPIGPH---LISLKHLQNGDVCYHIHTSP 97
+ ++ + L LT +GQT + WK + + + L +K +N ++ ++
Sbjct: 2 IIKAPIDLELTQNSGQTSQPPWKLDNNIYSDVVVADNKAVLFQVKQKEN-NLDFNFIGDI 60
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQFLC 156
S A + ++ +L + ++ + + A++ + R+ L +D EC++ +C
Sbjct: 61 SNKEATKKIKTIFDLDFNLNKFYK-YLNNQPELADMTNFCRDLRLFLAKDKFECVISSVC 119
Query: 157 SSNNNIARITKMVDFLA-SLG----------------SHLGNVEGFEFHEFPSLERLSLV 199
S+NN+I R TK +D + S G S N+ + EF L+
Sbjct: 120 SANNSIVRWTKSIDDIKKSWGNKYTYSEKDYFTFPDVSDFKNIYCDDVEEFECCSNLNNP 179
Query: 200 SEV--ELRNAGFGYRS--APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
E L++ G GYR+ ++S +F++ L +
Sbjct: 180 GECINNLKSCGVGYRAPYMKKASEIFTLEMD------------------------LDDIS 215
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
K+ EA D + +PGVGPKVA CI L+ + A P D + +I +
Sbjct: 216 KMSYDEAFDTMLKVPGVGPKVADCILLYGFNFRQAFPTDVWIKRIVS 262
>gi|261349785|ref|ZP_05975202.1| 8-oxoguanine DNA glycosylase [Methanobrevibacter smithii DSM 2374]
gi|288860569|gb|EFC92867.1| 8-oxoguanine DNA glycosylase [Methanobrevibacter smithii DSM 2374]
Length = 311
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 120/287 (41%), Gaps = 53/287 (18%)
Query: 43 LTQSELSLPLTFPTGQTFR--WKKTGPLQYTGPIGPH---LISLKHLQNGDVCYHIHTSP 97
+ ++ + L LT +GQT + WK + + + L +K +N ++ ++
Sbjct: 2 IIKAPIDLELTQNSGQTSQPPWKLDNNIYSDVVVADNKAVLFQVKQKEN-NLDFNFIGDI 60
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQFLC 156
S A + ++ +L + ++ + + A++ + R+ L +D EC++ +C
Sbjct: 61 SNKEATKKIKTIFDLDFNLNKFYK-YLNNQPELADMTNFCRDLRLFLAKDKFECVISSVC 119
Query: 157 SSNNNIARITKMVDFLA-SLG----------------SHLGNVEGFEFHEFPSLERLSLV 199
S+NN+I R TK +D + S G S N+ + EF L+
Sbjct: 120 SANNSIVRWTKSIDDIKKSWGNKYTYSEKDYFTFPDVSDFKNIYCDDVEEFECCSNLNNP 179
Query: 200 SEV--ELRNAGFGYRS--APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
E L++ G GYR+ ++S +F++ L +
Sbjct: 180 GECINNLKSCGVGYRAPYMKKASEIFTLEMD------------------------LDDIS 215
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
K+ EA D + +PGVGPKVA CI L+ + A P D + +I +
Sbjct: 216 KMSYDEAFDTMLKVPGVGPKVADCILLYGFNFRQAFPTDVWIKRIVS 262
>gi|288559394|ref|YP_003422880.1| 8-oxoguanine DNA glycosylase Ogg [Methanobrevibacter ruminantium
M1]
gi|288542104|gb|ADC45988.1| 8-oxoguanine DNA glycosylase Ogg [Methanobrevibacter ruminantium
M1]
Length = 358
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 95/225 (42%), Gaps = 45/225 (20%)
Query: 99 EPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQFLCS 157
E S L + ++ L + +E F D + A + G R+ + +DP EC++ +CS
Sbjct: 109 EREIDSNLNNIYDLEFDLEKFYE-FLLEDEKLAPSVDFCKGLRLFIAKDPFECIISSICS 167
Query: 158 SNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEV-------------- 202
+NN+IAR T +D + + G + EG F+ FPS + E
Sbjct: 168 ANNSIARWTASIDKIKLNWGEKVEFDEGM-FYGFPSPKDFLDFYETPIEQSEADGRRYEV 226
Query: 203 -----ELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
L++ G GYR AP K ++ + +D + + K+
Sbjct: 227 DCYTKNLKSCGVGYR-APYMK---------KASQMLIDEID------------MNDVSKM 264
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
EA D + LPGVGPKVA CI L+ A P D + +I +
Sbjct: 265 AYDEAFDLILRLPGVGPKVADCILLYGFGFQEAFPSDVWIKRIVS 309
>gi|448470301|ref|ZP_21600444.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum kocurii JCM
14978]
gi|445808275|gb|EMA58348.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum kocurii JCM
14978]
Length = 308
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 75/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L + G E+ FP+ ++L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDTLTLGGEEYSVFPTPDQLAS 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ S + E L
Sbjct: 183 RTEGELRDLSLGYR-----------------APYVQRTAEMVASGEAHPRE----AAALP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA +++ GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESMTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTVEEY 267
>gi|222445667|ref|ZP_03608182.1| hypothetical protein METSMIALI_01308 [Methanobrevibacter smithii
DSM 2375]
gi|222435232|gb|EEE42397.1| 8-oxoguanine DNA-glycosylase (ogg) [Methanobrevibacter smithii DSM
2375]
Length = 311
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 120/287 (41%), Gaps = 53/287 (18%)
Query: 43 LTQSELSLPLTFPTGQTFR--WKKTGPLQYTGPIGPH---LISLKHLQNGDVCYHIHTSP 97
+ ++ + L LT +GQT + WK + + + L +K +N ++ ++
Sbjct: 2 IIKAPIDLELTQNSGQTSQPPWKLDNNIYSDVVVADNKAVLFQVKQKEN-NLDFNFIGDI 60
Query: 98 SEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARV-LRQDPVECLLQFLC 156
S A + ++ +L + ++ + + A++ + R+ L +D EC++ +C
Sbjct: 61 SNKEATQKIKAIFDLDFNLNKFYK-YLNNQPELADMTNFCRDLRLFLAKDKFECVISSVC 119
Query: 157 SSNNNIARITKMVDFLA-SLG----------------SHLGNVEGFEFHEFPSLERLSLV 199
S+NN+I R TK +D + S G S N+ + EF L+
Sbjct: 120 SANNSIVRWTKSIDDIKKSWGNKYTYSEKDYFTFPDVSDFKNIYCDDVEEFECCSNLNNP 179
Query: 200 SEV--ELRNAGFGYRS--APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
E L++ G GYR+ ++S +F++ L +
Sbjct: 180 GECINNLKSCGVGYRAPYMKKASEIFTLEMD------------------------LDDIS 215
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
K+ EA D + +PGVGPKVA CI L+ + A P D + +I +
Sbjct: 216 KMSYDEAFDTMLKVPGVGPKVADCILLYGFNFRQAFPTDVWIKRIVS 262
>gi|448492018|ref|ZP_21608686.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
californiensis DSM 19288]
gi|445692236|gb|ELZ44417.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
californiensis DSM 19288]
Length = 308
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/167 (31%), Positives = 76/167 (45%), Gaps = 23/167 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLS 197
G R+ R CL+ F+CS+ +ARI M L + G + +G + FPS E+L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVALGDG-TYRAFPSPEQLA 181
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
+E ELR+ GYR A Y+ T +++ S G L
Sbjct: 182 ARTEEELRDLSLGYR-----------------APYVQRTAEMVAS----GEADPFDAADL 220
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 221 PYEEARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|365759137|gb|EHN00944.1| Ogg1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 156
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 66/141 (46%), Gaps = 28/141 (19%)
Query: 174 SLGSHLGN----VEGFEFHEFPSLERL-SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK 228
SL S+ GN ++G +H FP+ E L S +E LR GFGYR
Sbjct: 4 SLSSNFGNLITTIDGVTYHSFPTSEELASRGTETSLRELGFGYR---------------- 47
Query: 229 QAKYITGTVDVLQ-----SKHSGGAEWLLSL-RKLDLQEAIDALCTLPGVGPKVAACIAL 282
AKYI T L+ S +G E+ + + ++ + L + GVGPKVA C+ L
Sbjct: 48 -AKYIIETAKKLKKDKADSNIAGDTEYFQHICKDAQYEDVREHLMSYNGVGPKVADCVCL 106
Query: 283 FSLDQHHAIPVDTHVWKIATR 303
L +PVD HV +IA R
Sbjct: 107 MGLHMDGIVPVDVHVSRIAKR 127
>gi|448436027|ref|ZP_21587043.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
tebenquichense DSM 14210]
gi|445683187|gb|ELZ35590.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
tebenquichense DSM 14210]
Length = 308
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 78/169 (46%), Gaps = 27/169 (15%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASL---GSHLGNVEGFEFHEFPSLER 195
G R++R CL+ F+CS+ +ARI M L G LG+ + FP+ E+
Sbjct: 123 GLRLIRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDGVALGDET---YRAFPTPEQ 179
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L+ +E ELR+ GYR A Y+ T +++ S + E +
Sbjct: 180 LAARTESELRDLSLGYR-----------------APYVQRTAEMVASGEADPREAV---- 218
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L ++A ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 219 DLPYEDARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|193212613|ref|YP_001998566.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Chlorobaculum parvum NCIB 8327]
gi|193086090|gb|ACF11366.1| 8-oxoguanine DNA glycosylase domain protein [Chlorobaculum parvum
NCIB 8327]
Length = 317
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 120/287 (41%), Gaps = 35/287 (12%)
Query: 44 TQSELSLPLTFPTGQTFRW--KKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPA 101
+ L T +GQ+FRW K+ Y+ I + L+ + + + I S S+ A
Sbjct: 14 SHRNFDLHATLFSGQSFRWSQKENFSGYYSTVIDNKIFLLRSIDS--TTFEIF-SQSKEA 70
Query: 102 AKSALLDFLNMGISLGELWEGFSASDCR--FAELAKYLA---GARVLRQDPVECLLQFLC 156
+ DF + +SD + + EL + + RV+RQ+P E ++ F+C
Sbjct: 71 FCLQISDFFRWYFAFDIDENSIFSSDFQRDYPELWRMVKPYRSVRVMRQEPFEIMVTFMC 130
Query: 157 SSNNNIARITKMVDFLASLGSH---LGNVEG-FEFHEFPSLERLSLV--SEVELRNAGFG 210
+ + I + V +A L EG F+ FP+ L+ SE+ L
Sbjct: 131 AQGIGMHLIRRQVSMIAERYGQKIVLETPEGEMVFYGFPTPSALASADPSELALCTNNNR 190
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
R+A + ++ RSF+ K V G E DL+ + LC
Sbjct: 191 IRAAN----IIAMARSFESGKLALACV--------GSGE-------CDLETLRETLCVHS 231
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
G+G K+A CIALF L + A P+DTHV + + E A LT K
Sbjct: 232 GIGLKIADCIALFGLGRFDAFPIDTHVKQYLWEWFGIEEARRSLTEK 278
>gi|225390500|ref|ZP_03760224.1| hypothetical protein CLOSTASPAR_04255 [Clostridium asparagiforme
DSM 15981]
gi|225043429|gb|EEG53675.1| hypothetical protein CLOSTASPAR_04255 [Clostridium asparagiforme
DSM 15981]
Length = 293
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 78/178 (43%), Gaps = 25/178 (14%)
Query: 143 LRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHL--GNVEGFEFHEFPSLERLSLV 199
LRQDP E ++ F+ S I I ++V D G + G+ + F + FP+ + L+
Sbjct: 99 LRQDPWEMIITFVISQQKTIPCIRRLVEDISRRWGQKIEDGDEKNFAVYSFPTPKELARA 158
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
S EL + GYR AKYI L + G L +L ++
Sbjct: 159 SLEELLDLKLGYR-----------------AKYIHR----LSQDAAAGILDLKNLETMEY 197
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPK 317
+A++ L G+G KVA C+ LF L A PVDT + KI R R TPK
Sbjct: 198 GQAMEYLTGFYGIGKKVANCVCLFGLHHIEAFPVDTWIEKILLREYFSA-KKYRRTPK 254
>gi|189500344|ref|YP_001959814.1| 8-oxoguanine DNA glycosylase [Chlorobium phaeobacteroides BS1]
gi|189495785|gb|ACE04333.1| 8-oxoguanine DNA glycosylase domain protein [Chlorobium
phaeobacteroides BS1]
Length = 313
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 109/256 (42%), Gaps = 31/256 (12%)
Query: 53 TFPTGQTFRWKKTG--PLQYTGPIGPHLISLKHLQNGDVCYHIHTS---PSEPAAKSALL 107
T +GQ FRWK T P +G IG + + ++ V Y I ++ + A
Sbjct: 19 TIFSGQCFRWKVTDNTPTVLSGVIGSEMFIID--RSNPVKYTISSTIKFRNINAFNEFNR 76
Query: 108 DFLNMGISLGELW-EGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ ++ + + L+ E F + + ++LRQDP E L+ F+C+ + I
Sbjct: 77 KYFSLDVDVNSLFPEDFRKRYPEVWDRIQPYTDIKILRQDPFETLITFMCAQGLGMHLIR 136
Query: 167 KMVDFLA-SLGSH----LGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
K V +LA G+ L +V + + FP+ E L+ ++P+S L
Sbjct: 137 KQVTYLAQEYGTRHTIRLNDVP-YTYFSFPTPEALA--------------STSPESLRLC 181
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
+ + I V+ K A L + L+ LC+ PG+G K+A C+
Sbjct: 182 TNNNCIRADNIIQAAQAVVSGKLDLQA---LKDPAMPLENVRKTLCSQPGIGFKIADCVM 238
Query: 282 LFSLDQHHAIPVDTHV 297
LF L + A P+D HV
Sbjct: 239 LFGLHRFAAFPIDRHV 254
>gi|194336401|ref|YP_002018195.1| 8-oxoguanine DNA glycosylase [Pelodictyon phaeoclathratiforme BU-1]
gi|194308878|gb|ACF43578.1| 8-oxoguanine DNA glycosylase domain protein [Pelodictyon
phaeoclathratiforme BU-1]
Length = 312
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 117/284 (41%), Gaps = 31/284 (10%)
Query: 44 TQSELSLPLTFPTGQTFRWK--KTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPA 101
T++ + T +GQ+F W K P Y I + +K + + ++ + + +
Sbjct: 9 TRAPFDIRQTLFSGQSFLWNINKDEPDFYAAIIKSKPLIIKQIADNELEVY---AEDKVI 65
Query: 102 AKSALLDFLN----MGISLGELW-EGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
L+DF++ I +++ + FS +L R++RQDP E ++ F+C
Sbjct: 66 NGVPLVDFISHYFTFDIDTEQIFPDNFSHLYPTLWQLLTDYFPVRIMRQDPFETMISFMC 125
Query: 157 SSNNNIARITKMVDFLASLGSHLGNV----EGFEFHEFPSLERLSLVSEVELRNAGFGYR 212
+ + I K V L + + H FPS ERL+ + + L
Sbjct: 126 AQGIGMPLIRKQVSMLLQNYGEKRTISYSGKEITLHHFPSPERLAAANPIALSTCT--NN 183
Query: 213 SAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
+ P++ + + + G +D+ LS L L E LC GV
Sbjct: 184 NHPRARNIVRIAKGVAD-----GKIDLDA----------LSDPLLPLSELRRTLCQNEGV 228
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTP 316
G K+A CIALF L + A P+DTHV + ++ A LTP
Sbjct: 229 GYKIADCIALFGLGRFDAFPIDTHVKQYLGQWFNSTTALQSLTP 272
>gi|374629535|ref|ZP_09701920.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanoplanus limicola DSM 2279]
gi|373907648|gb|EHQ35752.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanoplanus limicola DSM 2279]
Length = 292
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 113/258 (43%), Gaps = 32/258 (12%)
Query: 41 LNL-TQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSE 99
+NL +++ +L +T GQ RW+ + +TG + ++I ++ QN + +I +
Sbjct: 7 INLNSETPFNLDITLSCGQAPRWEYSDGW-WTGVVTENVIKIRQKQN--IIEYIGCN--- 60
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
+ + ++ + L E ++ F D + L G R++ QDP ECLL F + N
Sbjct: 61 ---EKFIFNYFCLDYDLNEFYKKFRG-DYLLKNSFEKLRGLRIINQDPWECLL-FQMTVN 115
Query: 160 NNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
R D + + +G FE ++ S+ R ++SE G R ++
Sbjct: 116 K--IRTRGDDDRITRIAKGIGKEITFEGRKYYSVPRPEIISES-------GLRILKTCNI 166
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
+ F A+ +++ K+ W + ++ +EA+ L GV VA
Sbjct: 167 GYYATNIFNTAE------KIIEDKN-----WADKVSAMNYEEAVAYLSEFKGVKRSVAEW 215
Query: 280 IALFSLDQHHAIPVDTHV 297
+ L SL ++ PVDTH+
Sbjct: 216 VLLLSLKRYDIFPVDTHI 233
>gi|448448378|ref|ZP_21591191.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum litoreum
JCM 13561]
gi|445814794|gb|EMA64752.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum litoreum
JCM 13561]
Length = 308
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 75/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L + + FP+ +L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVALGDETYQAFPTPGQLAA 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR AP VRR+ + +G D L++ L
Sbjct: 183 RTEDELRDLSLGYR-APY------VRRTAEMVA--SGEADPLEAA------------DLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTVEEY 267
>gi|145219662|ref|YP_001130371.1| 8-oxoguanine DNA glycosylase [Chlorobium phaeovibrioides DSM 265]
gi|145205826|gb|ABP36869.1| 8-oxoguanine DNA glycosylase domain protein [Chlorobium
phaeovibrioides DSM 265]
Length = 313
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 122/296 (41%), Gaps = 64/296 (21%)
Query: 38 WTPLNLTQSELSL-PLTFPTGQTFRWKKTGPL--QYTGPIGPHLISLKHLQNGDVCYHIH 94
+ P+ ++ L L T +GQ+F W ++ Y+ I + +K D Y +
Sbjct: 3 YQPIETNKNRLILVKETLFSGQSFMWNESDHECGIYSSVIDGSPVLIKQ----DNPYSLS 58
Query: 95 TSPSEPA-----AKSALLDFLNMGISLGELWEGFSASDCRFAELAKYL---AGARVLRQD 146
S + +S + ++ I+ L++ A+ R+ +LA L G RVLRQD
Sbjct: 59 VSVEKNELFGIPVQSFFEHYFSLDIATNSLFDDEFAA--RYPDLAFRLLKLQGLRVLRQD 116
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRN 206
P E L+ F+C+ ++ I + V+ L + GN EV + +
Sbjct: 117 PYETLVTFMCAQGIGMSIIRRQVNMLC---RYFGN-------------------EVRVAD 154
Query: 207 AGFG--YRSAPQSSLLFS-----VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
G S P S+L +RR G + G A LL+L +LDL
Sbjct: 155 GGRDTPLYSFPAPSVLADADPALLRRCCNNNSMRAGNI--------GEASRLLALGRLDL 206
Query: 260 QEAID----------ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
Q D L L G+G K+A CIALF L + A P+DTHV + + +
Sbjct: 207 QALSDPSLPLSEIRTELTALKGIGFKIADCIALFGLGRFDAFPIDTHVEQFLSSWF 262
>gi|448424044|ref|ZP_21582170.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum terrestre
JCM 10247]
gi|448478860|ref|ZP_21603941.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum arcis JCM
13916]
gi|448512190|ref|ZP_21616304.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum distributum
JCM 9100]
gi|448520813|ref|ZP_21618146.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum distributum
JCM 10118]
gi|445682709|gb|ELZ35122.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum terrestre
JCM 10247]
gi|445694510|gb|ELZ46635.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum distributum
JCM 9100]
gi|445702949|gb|ELZ54888.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum distributum
JCM 10118]
gi|445822812|gb|EMA72573.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum arcis JCM
13916]
Length = 308
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 75/166 (45%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L + + FP+ +L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVALGDETYQAFPTPGQLAA 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR AP VRR+ + +G D L++ L
Sbjct: 183 RTEDELRDLSLGYR-APY------VRRTAEMVA--SGEADPLEAA------------DLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARESLTRFVGVGDKVADCVLLFSLGFLEAVPLDTWIRTTVEEY 267
>gi|448730164|ref|ZP_21712474.1| 8-oxoguanine DNA glycosylase, partial [Halococcus saccharolyticus
DSM 5350]
gi|445793895|gb|EMA44460.1| 8-oxoguanine DNA glycosylase, partial [Halococcus saccharolyticus
DSM 5350]
Length = 183
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 70/147 (47%), Gaps = 23/147 (15%)
Query: 152 LQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFG 210
L F+CS+ + RI M LA GS + +G +H FP+ E+L+ +E ELRN G
Sbjct: 6 LSFICSAQMRVGRIHGMQRTLAREFGSEVA-FDGQTYHAFPTPEQLADATEAELRNCSLG 64
Query: 211 YRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
YR A Y+ T +++ + G + R + + A + L
Sbjct: 65 YR-----------------APYVERTAELV----ANGEVLPENARGREYENAREYLKQFM 103
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHV 297
GVG KVA C+ LFSL A+P+DT +
Sbjct: 104 GVGDKVADCVLLFSLGYLEAVPLDTWI 130
>gi|433439963|ref|ZP_20408489.1| DNA N-glycosylase [Haloferax sp. BAB2207]
gi|432188169|gb|ELK45386.1| DNA N-glycosylase [Haloferax sp. BAB2207]
Length = 224
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 21/137 (15%)
Query: 161 NIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLL 220
++RI M L +G + +P+ ERL+ +E ELR+ GYR
Sbjct: 2 RVSRIFGMQSRLRETYGERVEFDGETYFAYPTPERLAEATEDELRDLSLGYR-------- 53
Query: 221 FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
A Y+ T +++ + + AE R D ++A + L T GVG KVA C+
Sbjct: 54 ---------APYVRRTAEMVATGEATPAE----ARGRDYEDAREFLTTFVGVGDKVADCV 100
Query: 281 ALFSLDQHHAIPVDTHV 297
LFSLD A+P+DT +
Sbjct: 101 LLFSLDYLEAVPLDTWI 117
>gi|448439570|ref|ZP_21588134.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
saccharovorum DSM 1137]
gi|445691104|gb|ELZ43299.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum
saccharovorum DSM 1137]
Length = 308
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 73/166 (43%), Gaps = 21/166 (12%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R+ R CL+ F+CS+ +ARI M L ++ E FPS ++L+
Sbjct: 123 GMRLTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVTLDDEEHRAFPSPDQLAS 182
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
+E ELR+ GYR A Y+ T +++ + + E L
Sbjct: 183 RTEEELRDLSLGYR-----------------APYVQRTAEMIATGEAHPRE----AADLP 221
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+EA +L GVG KV+ C+ LFSL A+P+DT + Y
Sbjct: 222 YEEARKSLTRFVGVGDKVSDCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|315925453|ref|ZP_07921664.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC
23263]
gi|315621354|gb|EFV01324.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC
23263]
Length = 325
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 101/263 (38%), Gaps = 37/263 (14%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLI-----SLKHLQNGDVCYHIHTSPSEP 100
+ L +GQ FRW Y P G + I ++ Q G + + E
Sbjct: 47 DDFDLDKMAESGQCFRW-------YPLPDGGYRICAFRRAVTARQEG---HRLTLDCPEE 96
Query: 101 AAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNN 160
A +S + +M + + D A+ G R+LRQDP E L+ F+ S
Sbjct: 97 AYRSIWRQYFDMTTHYAAIRDRIDPGDAYLQAAARLGRGIRILRQDPWETLVTFIISQRK 156
Query: 161 NIARITKMVDFL-ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
+I I V+ L A+ G +G G + FPS + V ++ R G
Sbjct: 157 SIPAIRSCVEKLCAAAGDSIGAAAGEPLYAFPSPAQ---VMALDCRKTG----------- 202
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI-----DALCTLPGVGP 274
S R SF+ ++ A W+++ L AL + GVG
Sbjct: 203 --SDRCSFQDTGVGNCSLGYRMPYVRAAARWMIAHDAAALDALDDGSLETALMQIKGVGV 260
Query: 275 KVAACIALFSLDQHHAIPVDTHV 297
KVAAC+ LF + +A PVD V
Sbjct: 261 KVAACVMLFGFHRMNAFPVDVWV 283
>gi|209877000|ref|XP_002139942.1| 8-oxoguanine DNA glycosylase protein [Cryptosporidium muris RN66]
gi|209555548|gb|EEA05593.1| 8-oxoguanine DNA glycosylase protein, putative [Cryptosporidium
muris RN66]
Length = 449
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 90/211 (42%), Gaps = 43/211 (20%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFL-ASLGSHLGNVEGFE----------- 186
G R+ DPVE + ++NNNI RIT +++ + S G+++ +++ +
Sbjct: 235 GIRMFNIDPVEATFSAIITANNNIPRITSIIERIRISYGTYICSIDHNKLLEETNNLNFC 294
Query: 187 ---------FHEFPSLERLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
++ FP+ + + + +E + G GYR AK I
Sbjct: 295 NNKIPNIRHYYSFPTCDEIFKNASIEHLKSHCGTGYR-----------------AKSIIT 337
Query: 236 TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ ++ G ++ L+ L+ +A+ L L GVG KVA I L L A+P+DT
Sbjct: 338 SAKIIHD--IGEFKYFNYLKSLEYHDAVSELLKLHGVGRKVADFILLSGLGFTQAVPIDT 395
Query: 296 HVWKIATRYLLPELAGVRLTPKLCSRVAEAF 326
H+ KI RY + KLC A +
Sbjct: 396 HMLKILKRY-NHKFQNTSSLSKLCYEEASRY 425
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/141 (20%), Positives = 59/141 (41%), Gaps = 11/141 (7%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
++ L +T +EL + PTGQ+F W + G +G + +K L N + ++T+
Sbjct: 14 EYISLGITANELRISKCLPTGQSFSWYSIADGTFVGILGHRIFQMKELPNDTLYRCLYTA 73
Query: 97 PSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLC 156
S + ++ + +EG S +D KY+ +D + + +
Sbjct: 74 CSNDNSN----QYIQIKKRKMSTYEGSSTADVSPTNEVKYIK-----IKDSTDIYISDI- 123
Query: 157 SSNNNIARITKMVDFLASLGS 177
SNN ++ I + + + S
Sbjct: 124 -SNNQVSLIDGNISYYSDFNS 143
>gi|448528864|ref|ZP_21620244.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum hochstenium
ATCC 700873]
gi|445710312|gb|ELZ62131.1| 8-oxoguanine DNA glycosylase domain protein [Halorubrum hochstenium
ATCC 700873]
Length = 308
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 74/163 (45%), Gaps = 22/163 (13%)
Query: 143 LRQDPV-ECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSE 201
L +DPV CL+ F+CS+ +ARI M L + + FP+ E+L+ +E
Sbjct: 126 LTRDPVFPCLISFICSAQMRVARIHGMQRRLRETYGDAVALGDETYRAFPTPEQLAARTE 185
Query: 202 VELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
ELR+ GYR A Y+ T +++ S + E L ++
Sbjct: 186 GELRDLSLGYR-----------------APYVQRTAEMVASGEADPRE----AADLPYED 224
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
A ++L GVG KVA C+ LFSL A+P+DT + Y
Sbjct: 225 ARESLTRFVGVGEKVADCVLLFSLGFLEAVPLDTWIRTTIEEY 267
>gi|239626737|ref|ZP_04669768.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239516883|gb|EEQ56749.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 313
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 100/254 (39%), Gaps = 65/254 (25%)
Query: 107 LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
LD+ ++ + + + AE A+ G R+L+QDP E ++ F+ S I +I
Sbjct: 48 LDYFDIATDYQAMINCVAPGNTYLAEAAQAGKGIRILKQDPWEMIITFVISQQKTIPKIR 107
Query: 167 KMVDFL-ASLGSHLG----------------------------------NVEGFE----F 187
++V+ L AS G+ + +++G +
Sbjct: 108 QLVEALCASYGTPVNGALVNGAPVNGTPVNGAPVNGTPANGIPGKGPEVSIDGPDSSQRL 167
Query: 188 HEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
FPS E LS + EL+ GYR AKYI + Q SG
Sbjct: 168 FSFPSPEELSRATLDELKELKLGYR-----------------AKYI---YQLCQDAVSGA 207
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI--ATRYL 305
+ L L +D A+ L G+G KVA C+ LF L A PVDT + KI A Y
Sbjct: 208 LD-LAMLDTMDYDAALLYLTGFYGIGKKVANCVCLFGLHHIGAFPVDTWIEKILMAQYY- 265
Query: 306 LPELAGVRLTPKLC 319
+ R TPK C
Sbjct: 266 --DKRKYRRTPKTC 277
>gi|223928094|gb|ACN23809.1| 8-oxoguanine DNA glycosylase [Clostridium sp. enrichment culture
clone 7-14]
Length = 269
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 29/191 (15%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ G D A + G R+LRQDP E L+ F+ S NI I
Sbjct: 65 YFDLDTDYGAYRAAVPEKDVYLTAAADFGRGIRILRQDPWEILVTFIISQRKNIPAIRAC 124
Query: 169 VDFLAS-LGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSF 227
V+ L S G +G + FP+ L+ E LR GYR+
Sbjct: 125 VETLCSRYGEPIGPT-----YAFPTPAALAGADEEALRACALGYRA-------------- 165
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
G V G L +L L+ + ++L T+PGVG KVA C++LF +
Sbjct: 166 -------GYVLAAAQMADAGTLDLFALVSLEDDQLAESLMTVPGVGVKVANCVSLFGYHR 218
Query: 288 HHAIPVDTHVW 298
A P D VW
Sbjct: 219 IAAFPRD--VW 227
>gi|360042711|emb|CCD78121.1| putative 8-oxoguanine DNA glycosylase [Schistosoma mansoni]
Length = 281
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 74/144 (51%), Gaps = 10/144 (6%)
Query: 36 SKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPL-QYTGPIGPHLISLKHLQNGD-VCYHI 93
+ W + + + +L T +GQ FRW L ++ G I HL L+ +++ + V Y++
Sbjct: 5 NNWHSIPVDPQDFNLFSTLNSGQAFRWVFNSTLNEWHGVINGHLWRLRQMEDSNPVEYYL 64
Query: 94 HTSPSEPAAKS--ALLDFLNMGISLGELWEGFSASD----CRFAELAKY--LAGARVLRQ 145
+ L D+ + ++L +L + +S D RF++ + G R+LRQ
Sbjct: 65 EKNSKIKLNDVPFDLRDYFRLDMNLSDLVKEWSIKDKWFENRFSKNCQMDTARGLRLLRQ 124
Query: 146 DPVECLLQFLCSSNNNIARITKMV 169
DP E L F+ S+NNN+ RITK++
Sbjct: 125 DPEETLFSFITSANNNLTRITKLL 148
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%)
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+GG LL LR + EA L +PG+G KVA CI L SLD+ +P+D H+ ++A
Sbjct: 160 NGGRSRLLELRSVSHVEAQAFLLQIPGIGKKVADCICLMSLDKLDVVPIDVHMLRVAREK 219
Query: 305 LLPELAGVRLTPKLCSRVAEAFCE 328
+PE + +T K ++++ +
Sbjct: 220 GIPEASCKTMTTKSYDIISKSLSD 243
>gi|393796832|ref|ZP_10380196.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia BG20]
Length = 160
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 21/124 (16%)
Query: 186 EFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
EF+ FP ++++ S E++N G GYR AK+I +++S+
Sbjct: 11 EFYLFPEPKKIADASIQEIQNCGVGYR-----------------AKFIIDAAKMVESEQI 53
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
++K + Q+A + T PG+G KVA CI LFSLD+ A P+D + +I +Y
Sbjct: 54 D----FEYMKKSNYQDAKKVILTTPGIGNKVADCILLFSLDKLEAFPLDRWMIRILGKYY 109
Query: 306 LPEL 309
L +
Sbjct: 110 LEKF 113
>gi|390556756|ref|ZP_10243163.1| DNA-(Apurinic or apyrimidinic site) lyase [Nitrolancetus
hollandicus Lb]
gi|390174670|emb|CCF82450.1| DNA-(Apurinic or apyrimidinic site) lyase [Nitrolancetus
hollandicus Lb]
Length = 247
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 65/123 (52%), Gaps = 22/123 (17%)
Query: 190 FPSLE--RLSLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
+P+ E R + V++V +R+ G G R AP+ I +D + S H
Sbjct: 71 YPTWEAVRTAPVADVADAIRSGGLGARKAPR----------------IQAALDRILS-HG 113
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
EW +L+ L L EA L LPGVGPK AAC+ LF+ + A+PVDTHV++++ R
Sbjct: 114 TEDEWSRALKTLPLAEAKARLMALPGVGPKTAACVLLFACGR-PALPVDTHVYRVSKRLG 172
Query: 306 LPE 308
L E
Sbjct: 173 LIE 175
>gi|384915612|ref|ZP_10015824.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methylacidiphilum fumariolicum SolV]
gi|384526915|emb|CCG91695.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methylacidiphilum fumariolicum SolV]
Length = 298
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/256 (25%), Positives = 97/256 (37%), Gaps = 47/256 (18%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIH--------TSPSE 99
L+L T GQTF WK+ L G IG Q G + Y I SPS
Sbjct: 19 LNLDATLGCGQTFSWKR---LDNGGWIG---------QAGSIPYAIFPASDKLYIYSPS- 65
Query: 100 PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSN 159
+ ++ L +++ F + D + + R+L+QDP E +L FLCSS
Sbjct: 66 -GSLDGFQEYFQTTFDLEKVFNSFPSDDYILQQARLFSQKLRILKQDPWETILSFLCSSA 124
Query: 160 NNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSL 219
I +I K+ L G F FPS E + L G+RS
Sbjct: 125 KPIIQIRKVTQLLRRF---YGKEVWPGFFSFPSPEDIFDKGPSLLEQCKLGFRS------ 175
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAI-DALCTLPGVGPKVAA 278
Y+ +L ++ LS+ + + I L LPG+G K+A
Sbjct: 176 -----------DYVWKVASILAKENKK----FLSMLAIASTDTIRQTLLELPGIGRKIAD 220
Query: 279 CIALFSLDQHHAIPVD 294
C+ LF + P+D
Sbjct: 221 CVLLFGYARIEVFPID 236
>gi|409730109|ref|ZP_11271700.1| 3-methyladenine DNA glycosylase [Halococcus hamelinensis 100A6]
gi|448722753|ref|ZP_21705284.1| 3-methyladenine DNA glycosylase [Halococcus hamelinensis 100A6]
gi|445788890|gb|EMA39591.1| 3-methyladenine DNA glycosylase [Halococcus hamelinensis 100A6]
Length = 305
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 75/161 (46%), Gaps = 25/161 (15%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++R P L+ F+CS+ + RI M +A +G +H FP+ ++L+
Sbjct: 117 GLRIVRDPPFPTLITFICSAQMRVKRIHTMQRTMARRFGTAIEFDGETYHAFPTPDQLAA 176
Query: 199 VSEVELRNAGFGYRS--APQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+ ELR+ GYR+ +S+ L + G +D ++ G E+
Sbjct: 177 ATVDELRDCSLGYRAPYVAESARLVA-----------DGELDPTDAR---GREY------ 216
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A + L GVG KVA CI LFSL A+P+DT +
Sbjct: 217 ---EAAREFLKGFMGVGDKVADCILLFSLGYLEAVPLDTWI 254
>gi|67611060|ref|XP_667131.1| ENSANGP00000020191 [Cryptosporidium hominis TU502]
gi|54658229|gb|EAL36897.1| ENSANGP00000020191 [Cryptosporidium hominis]
Length = 388
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 89/211 (42%), Gaps = 40/211 (18%)
Query: 115 SLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLA 173
SL E + ++ +D + K G R+L DP+E L + ++NNNI+RIT +V +
Sbjct: 156 SLSEAYMNWNEADSCISSSHK---GIRLLNIDPIEALFVGVITANNNISRITSIVKNMRK 212
Query: 174 SLGSHLGNVEG-----------------FEFHEFP-SLERLSLVSEVELR-NAGFGYRSA 214
+LG+ L N+ + + FP SL+ + SE LR G GYR
Sbjct: 213 NLGTFLCNISDSEIVSCNETDESELDIDYHYFSFPTSLQIFNNASEEMLREKCGVGYR-- 270
Query: 215 PQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
AK I L K ++ ++ LD A L G+G
Sbjct: 271 ---------------AKSIISISKELNQKFESDRVFVDYIKSLDYSNATLYLQKFHGIGQ 315
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
KVA + L A+PVDTH+ K +++
Sbjct: 316 KVADFVLLSGFGFSAAVPVDTHILKYICKHV 346
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
K+ LN+ + EL + P GQ+F W+K + G +G + LK L+N + ++
Sbjct: 5 KFESLNVPREELRITKCLPAGQSFSWRKVSKDSFVGILGHRVFQLKELENDTLYRCLYDG 64
Query: 97 PSEPAAKSALLDFLNMGI 114
S+ L D+ + I
Sbjct: 65 CSKTIKDEELGDYDDQEI 82
>gi|339249321|ref|XP_003373648.1| putative G patch domain-containing protein 1-like protein
[Trichinella spiralis]
gi|316970197|gb|EFV54175.1| putative G patch domain-containing protein 1-like protein
[Trichinella spiralis]
Length = 1154
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 50/87 (57%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ L++ V +AKYI T ++LQS++ G D+ +++D LC LPGVGPK+
Sbjct: 1006 AELIYPVGFCKSKAKYIKKTTEILQSQYDG-----------DIPKSVDELCQLPGVGPKM 1054
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
A L + +Q I VDTHV +I+ R
Sbjct: 1055 ALLTMLTAWNQCEGIAVDTHVHRISNR 1081
>gi|218766464|gb|ACL11884.1| hypothetical protein, partial [Phytophthora cinnamomi]
Length = 140
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 41/64 (64%), Gaps = 1/64 (1%)
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRV 322
D L GVG KVA C+ALFSL++ AIPVDTHVW+IA R L L+ + +TP + V
Sbjct: 20 DELMVFAGVGRKVADCVALFSLEKLEAIPVDTHVWQIACRELDANLSDRKSITPTVYRMV 79
Query: 323 AEAF 326
+ F
Sbjct: 80 GDLF 83
>gi|78188984|ref|YP_379322.1| hypothetical protein Cag_1017 [Chlorobium chlorochromatii CaD3]
gi|78171183|gb|ABB28279.1| HhH-GPD [Chlorobium chlorochromatii CaD3]
Length = 312
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 120/280 (42%), Gaps = 29/280 (10%)
Query: 47 ELSLPLTFPTGQTFRWKK--TGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
EL + + +GQ+F WKK + + + L+ + L + H + S
Sbjct: 12 ELEIEKSLFSGQSFLWKKHQSNLDSFVTVMDKRLVIISQLSPYTIRVHCDSEVLYGQKIS 71
Query: 105 ALLD-FLNMGISLGELWEGFSASDCRFAELAKYLAGAR---VLRQDPVECLLQFLCSSNN 160
A + + + + +++ S+ ++E+ + L G + +LRQ P E L+ F+C+
Sbjct: 72 AFISHYFTLDVPFQKIFS--SSFKSNYSEVWRLLDGYKSIALLRQHPFETLISFMCAQGI 129
Query: 161 NIARITKMVDFLASLGSHL--GNVEG--FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQ 216
+ I + ++ L +EG F FP+ E+L+ ++ EL R +
Sbjct: 130 GMRLIRQQINRLCERYGEFYEAEMEGEMLCFSGFPAPEQLACLNAEELSYCTNNNR--ER 187
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
++ + +V R K + G +D+ LS + +E L G+G K+
Sbjct: 188 AANIIAVAR-----KVVEGRLDLSS----------LSYPNMAFEEVQARLTQERGIGLKI 232
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTP 316
A C+ALF L A P+DTHV + ++ A LTP
Sbjct: 233 ADCVALFGLGYFEAFPIDTHVHQFMAQWFKVPAASRSLTP 272
>gi|76802512|ref|YP_327520.1| DNA N-glycosylase / DNA lyase [Natronomonas pharaonis DSM 2160]
gi|76558377|emb|CAI49968.1| DNA N-glycosylase [Natronomonas pharaonis DSM 2160]
Length = 302
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 72/159 (45%), Gaps = 21/159 (13%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSL 198
G R++ P L+ F+CS+ ++RI M LA +G E+ FP+ L+
Sbjct: 118 GLRLVCDPPFGTLISFICSAQMRVSRIHDMQAALADAYGETVVFDGTEYTAFPTPSALAA 177
Query: 199 VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD 258
SE LR+ GYR A Y+ T +++ + + + + L
Sbjct: 178 ASEDALRDLSLGYR-----------------APYVQRTAEMVANGDAHPTDAV----GLA 216
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ A ++L GVG KVA C+ LFSL A+P+DT +
Sbjct: 217 YEAARESLTRFVGVGEKVADCVLLFSLGYLEAVPLDTWI 255
>gi|66359856|ref|XP_627106.1| OGG1 like 8-oxoguanine DNA glycosylase involved in DNA repair
[Cryptosporidium parvum Iowa II]
gi|46228820|gb|EAK89690.1| OGG1 like 8-oxoguanine DNA glycosylase involved in DNA repair
[Cryptosporidium parvum Iowa II]
Length = 390
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 79/187 (42%), Gaps = 37/187 (19%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLGNVEG------------- 184
G R+L DP+E L + ++NNNI+RIT +V + +LG+ L N+
Sbjct: 179 GIRLLNIDPIEALFVGVITANNNISRITSIVKNMRKNLGTFLCNISDSEIVSCNETDESE 238
Query: 185 ----FEFHEFP-SLERLSLVSEVELR-NAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVD 238
+ + FP SL+ + SE LR G GYR AK I
Sbjct: 239 LDIDYHYFSFPTSLQIFNNASEEMLREKCGVGYR-----------------AKSIISISK 281
Query: 239 VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
L K ++ ++ LD A L G+G KVA + L A+PVDTH+
Sbjct: 282 ELNQKFESDRVFVDYIKSLDYSNATLYLQKFHGIGQKVADFVLLSGFGFSAAVPVDTHIL 341
Query: 299 KIATRYL 305
K +++
Sbjct: 342 KYICKHV 348
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 37/78 (47%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTS 96
K+ LN+ + EL + P GQ+F W+K + G +G + LK L+N + ++
Sbjct: 7 KFESLNVPREELRISKCLPAGQSFSWRKVSKDSFVGILGHRVFQLKELENDTLYRCLYDG 66
Query: 97 PSEPAAKSALLDFLNMGI 114
S+ L D+ + I
Sbjct: 67 CSKVIKDEELDDYDDQEI 84
>gi|348685487|gb|EGZ25302.1| hypothetical protein PHYSODRAFT_479501 [Phytophthora sojae]
Length = 149
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 41/64 (64%), Gaps = 1/64 (1%)
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRV 322
D L GVG KVA C+ALFSL++ AIPVDTHVW+IA R L L+ + +TP + V
Sbjct: 29 DELMVFAGVGRKVADCVALFSLEKLEAIPVDTHVWQIACRELDASLSDRKSITPTVYRMV 88
Query: 323 AEAF 326
+ F
Sbjct: 89 GDLF 92
>gi|357055226|ref|ZP_09116300.1| hypothetical protein HMPREF9467_03272 [Clostridium clostridioforme
2_1_49FAA]
gi|355383182|gb|EHG30268.1| hypothetical protein HMPREF9467_03272 [Clostridium clostridioforme
2_1_49FAA]
Length = 327
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 97/253 (38%), Gaps = 54/253 (21%)
Query: 104 SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIA 163
S L +L+M + + + A AG R+LRQDP E ++ F+ S I
Sbjct: 68 SFWLSYLDMDRDYQAVIDSIDRENSYLTAAAMSGAGIRILRQDPWEMIITFVISQQKTIP 127
Query: 164 RITKMVDFLASL-------GSHLG---------NVEGFEFHE--------------FPSL 193
I ++V+ L+S G LG + E E FP+
Sbjct: 128 NIRQLVEALSSRYGTVLEDGQDLGKEDDRGKANDREKENDREKENDPGRCLPPACSFPAP 187
Query: 194 ERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
+L L S +L GYR AKYI + Q SG E L
Sbjct: 188 SQLCLASLDDLMGLKLGYR-----------------AKYIHR---LCQDAVSGRLE-LSH 226
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 313
L LD + A++ L G+G KVA C+ LF L A PVDT + KI + R
Sbjct: 227 LSTLDYEGAMEYLTGFYGIGKKVANCVCLFGLHHIDAFPVDTWIEKILMEQYF-DRKKYR 285
Query: 314 LTPK--LCSRVAE 324
PK LC + E
Sbjct: 286 CIPKNRLCETIVE 298
>gi|333988587|ref|YP_004521194.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
SWAN-1]
gi|333826731|gb|AEG19393.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
SWAN-1]
Length = 328
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 98/234 (41%), Gaps = 20/234 (8%)
Query: 80 SLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG 139
S K D+ + E K+ +++ + L +L+E + SD K+ G
Sbjct: 58 SCKKDSTMDIVVECPENLEEDVIKARIMEIFGLNDDLNDLYE-YLRSDPMLEPTIKFCNG 116
Query: 140 ARVLR-QDPVECLLQFLCSSNNNIARITKMV-DFLASLGSHLGNVEGFEFHEFPSLERLS 197
R+ + D ECL+ + S+N +I R K + D G G F+ FP E L+
Sbjct: 117 LRLFKAHDVFECLISSISSANCSIIRWNKAIKDIKMKWGDEYKFPSG-TFYTFPKPEILA 175
Query: 198 LVSEVELRNAGFGYRSAPQSSLLFSVRRSFK------QAKYITGTVDVLQSKHSGGAEWL 251
V E +L + P+ F + K +AKYI +++Q++ L
Sbjct: 176 NVPEHDLEEMQRCEDNLPED---FKFTNNLKSCGVGYRAKYIIKAAEMVQTEIE-----L 227
Query: 252 LSLRKLDLQEAIDALCTLPGVGPKVAACI--ALFSLDQHHAIPVDTHVWKIATR 303
L + + A + L +PGVGPKVA CI + A PVD + +I +
Sbjct: 228 DKLGNMSYENAFNTLLKIPGVGPKVADCILCYGYGYGYGKAFPVDVWIGRIVSE 281
>gi|363898510|ref|ZP_09325033.1| hypothetical protein HMPREF9625_00050 [Oribacterium sp. ACB1]
gi|395208074|ref|ZP_10397411.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Oribacterium sp. ACB8]
gi|361960977|gb|EHL14207.1| hypothetical protein HMPREF9625_00050 [Oribacterium sp. ACB1]
gi|394706282|gb|EJF13801.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Oribacterium sp. ACB8]
Length = 300
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 76/168 (45%), Gaps = 26/168 (15%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLAS-LGS-HLGNVEGFEFHEFPSLERL 196
G R+L+QDP E L+ F+ S +I I V+ L G VE + + FPS E L
Sbjct: 126 GLRILKQDPFETLITFILSQRKSIPAIRSSVEKLCEQFGEKRYSKVEEKDVYLFPSAEAL 185
Query: 197 SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+L N GYR + + + G +D L +L K
Sbjct: 186 Y---HADLSNCSLGYRVPFVQDAV---------ERIVEGRLD------------LSALEK 221
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
L +E ++AL + GVG KVA+C+ALF+ + IP D + +I ++
Sbjct: 222 LPTKELLEALMEVHGVGIKVASCVALFAYSRMDIIPEDVWMKRIWEKH 269
>gi|160937119|ref|ZP_02084482.1| hypothetical protein CLOBOL_02010 [Clostridium bolteae ATCC
BAA-613]
gi|158440020|gb|EDP17768.1| hypothetical protein CLOBOL_02010 [Clostridium bolteae ATCC
BAA-613]
Length = 318
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 86/217 (39%), Gaps = 52/217 (23%)
Query: 139 GARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHL---------GNV------- 182
G R+LRQDP E ++ F+ S I I ++V+ L+S L G V
Sbjct: 94 GIRILRQDPWEMIITFVISQQKTIPNIRQLVEALSSRYGTLLEDRQNRGSGEVREKDEVR 153
Query: 183 -------EGFEFHE-------FPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK 228
EG E FP+ +L L S +L GYR
Sbjct: 154 EKEGARREGAALEESLPPAFSFPAPSQLCLASLEDLMGLKLGYR---------------- 197
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
AKYI + Q SG + L L L+ +EA++ L G+G KVA C+ LF L
Sbjct: 198 -AKYIH---RLCQDAVSGRLD-LSHLAALNYEEAMEYLTGFYGIGKKVANCVCLFGLHHI 252
Query: 289 HAIPVDTHVWKIATRYLLPELAGVRL-TPKLCSRVAE 324
A PVDT + KI R+ +LC + E
Sbjct: 253 DAFPVDTWIEKILMEQYFDRKKYRRIPKNRLCETIVE 289
>gi|307352518|ref|YP_003893569.1| 8-oxoguanine DNA glycosylase domain-containing protein
[Methanoplanus petrolearius DSM 11571]
gi|307155751|gb|ADN35131.1| 8-oxoguanine DNA glycosylase domain protein [Methanoplanus
petrolearius DSM 11571]
Length = 284
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/265 (22%), Positives = 106/265 (40%), Gaps = 38/265 (14%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
S SL T +GQ RW+K Y G + ++I + L + I+ S ++
Sbjct: 12 DSPFSLDSTLASGQAPRWEKVNGWWY-GVVKDNVIKTRQLDD-----KIYFSG---CSER 62
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
++ ++ +L E ++ FS D + G R++ QDP ECL L N R
Sbjct: 63 YYREYFSLDYNLKEFYDSFS-DDMYLKSAIEANPGLRLVGQDPWECLCFQLTI---NKKR 118
Query: 165 ITKMVDFLASLGSHLGN---VEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLF 221
+ D + G+ ++G +H FP+ E L +L+ GY++
Sbjct: 119 TSPGEDCFTRISQKFGDEIELDGKIYHTFPTAETLVKEGLSKLKTCNLGYKA-------- 170
Query: 222 SVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 281
+ K + W + + ++AI + GV P VA I
Sbjct: 171 -------------NNIHSAAKKVAEDPLWSKKIESMKSEDAIKIISGFRGVKPTVAEWIL 217
Query: 282 LFSLDQHHAIPVDTHVW-KIATRYL 305
+F+ ++ PVD+H+ ++ +YL
Sbjct: 218 IFAFRRYELFPVDSHIRSRMVKKYL 242
>gi|332018909|gb|EGI59455.1| Mannose-6-phosphate isomerase [Acromyrmex echinatior]
Length = 532
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 3/120 (2%)
Query: 46 SELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSA 105
+EL+L LT GQ+FRW Y G + +L + Y + +
Sbjct: 11 TELNLALTLHGGQSFRWTACDN-GYKGIFNGCIWTLSQ-NKTHLSYTVQGHLKDSVNYDN 68
Query: 106 LL-DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
+L ++ + +SL E ++ ++ +D F + R+L+QD VE L F+CSSNNNI+R
Sbjct: 69 ILSEYFRLSVSLKEHYKQWAKADTHFQKCLDENNAVRILKQDVVENLFSFICSSNNNISR 128
>gi|189218958|ref|YP_001939599.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methylacidiphilum infernorum V4]
gi|189185816|gb|ACD83001.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Methylacidiphilum infernorum V4]
Length = 320
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 96/257 (37%), Gaps = 29/257 (11%)
Query: 53 TFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALLDFLNM 112
T +GQ F W++ + G +G +L + Y S + A ++
Sbjct: 39 TLGSGQAFSWQRIAKGSWIGQVGADPYALIPGEKNLNVY------SPRGSFCAFQEYFQT 92
Query: 113 GISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFL 172
L ++++ F D R+L+QDP E L+ FL SS I +I K+
Sbjct: 93 EFDLEKVFQSFPPGDLVLERARCSCPRLRILKQDPWETLVCFLSSSAKPIVQIRKIC--- 149
Query: 173 ASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKY 232
L + G F FPS E + + L+ A G+R A +
Sbjct: 150 GRLRAFYGKEIYPRFFSFPSAEDIIVKGPEGLKQARLGFR-----------------ANF 192
Query: 233 ITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIP 292
I +L LL L+ + L LPGVG K+A C+ LF + P
Sbjct: 193 IWKVSTILSKLKP---NLLLELKDAPTGDIRQILMELPGVGRKIADCVLLFGYGRLEVFP 249
Query: 293 VDTHVWKIATRYLLPEL 309
+D + ++ + P L
Sbjct: 250 IDRWMERVLRTFYSPLL 266
>gi|15790318|ref|NP_280142.1| 8-oxoguanine DNA glycosylase [Halobacterium sp. NRC-1]
gi|169236051|ref|YP_001689251.1| 8-oxoguanine DNA glycosylase [Halobacterium salinarum R1]
gi|10580792|gb|AAG19622.1| 8-oxoguanine DNA glycosylase [Halobacterium sp. NRC-1]
gi|167727117|emb|CAP13902.1| DNA N-glycosylase [Halobacterium salinarum R1]
Length = 301
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 104/271 (38%), Gaps = 46/271 (16%)
Query: 40 PLNLTQSELSLPLTFPTGQTFRWKK------------TGPLQYTGPIGPHLISLKHLQNG 87
PL+ + + T +GQT+ W + G Y I +I ++ Q
Sbjct: 7 PLDDVPGAVDVQATLESGQTYLWWRPDGDTYRTDGLSGGDAWYRTVIDDDVIDVR--QTA 64
Query: 88 DVCYHIHTSPSEPAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDP 147
T+ + P + AL + L + G + SD G R++R
Sbjct: 65 TAIEWRGTTDAAPIVRDAL----GLHDDLDAV-RGAARSDPLITAAWDAYDGLRIVRDPF 119
Query: 148 VECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNA 207
CL+ F+CS+ + RI +M + L +G + P L+ +E +LR+
Sbjct: 120 FGCLVSFICSAQMRVERIFEMQERLRREYGTPITFDGQTVYSVPEPSALAAATESDLRDL 179
Query: 208 GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD---LQEAID 264
GYR A Y+ T ++ S L+ R +D + A +
Sbjct: 180 KLGYR-----------------APYVQRTAAMVASGE-------LTKRDIDGRAYELARE 215
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+ GVG KVA C+ LFSL A+P+DT
Sbjct: 216 TMTDFVGVGNKVADCVLLFSLGYLAAVPLDT 246
>gi|342216305|ref|ZP_08708952.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Peptoniphilus sp. oral taxon 375 str. F0436]
gi|341587195|gb|EGS30595.1| 8-oxoguanine DNA glycosylase, N-terminal domain protein
[Peptoniphilus sp. oral taxon 375 str. F0436]
Length = 274
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 29/165 (17%)
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFE---FHEFPSLE 194
+G +LRQD E ++ F+ S+N+NI RI K L +L + G+ + + ++ FP +
Sbjct: 88 SGIHILRQDLWEVVVSFIISANSNIPRIRKS---LFALCQNYGDQKQDQWGTYYSFPRPQ 144
Query: 195 RLSLVSEVELRNA-GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLS 253
L+ V + R GYR + V+ + + + G VD+ E L
Sbjct: 145 DLAGVCREDFREKIKLGYRDQ------YLVKTT---QQVLEGRVDL---------EALKE 186
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
+ DL A+ L L GVGPKVA CI LF L PVD VW
Sbjct: 187 VADKDL--AMKELTRLSGVGPKVAHCILLFGLHNWQGFPVD--VW 227
>gi|355627153|ref|ZP_09049089.1| hypothetical protein HMPREF1020_03168 [Clostridium sp. 7_3_54FAA]
gi|354820398|gb|EHF04814.1| hypothetical protein HMPREF1020_03168 [Clostridium sp. 7_3_54FAA]
Length = 293
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 80/201 (39%), Gaps = 26/201 (12%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ G D A +G R+L QD E ++ F+ S I +I +
Sbjct: 64 YFDLNTDYGAFLSSVKKRDHYLQNAANAGSGIRILNQDAWEMIITFIISQQRTIPKIREA 123
Query: 169 VDFLASLGSHLGNV-----EGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
V+ L+ L + + + FP+ +L S +L++ GYR+ +
Sbjct: 124 VENLSRLYGEEKHCTFPGGKTVVYFSFPTPAQLKKASLEDLQSLKLGYRARYIHRICLDA 183
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
GT+D L L + ++A++ L G+G KVA CI LF
Sbjct: 184 D---------NGTLD------------LNRLSAMGYRDAMEYLSGFYGIGTKVANCICLF 222
Query: 284 SLDQHHAIPVDTHVWKIATRY 304
L A PVDT + +I T +
Sbjct: 223 GLHHIEAFPVDTWIQQILTNH 243
>gi|323691117|ref|ZP_08105397.1| hypothetical protein HMPREF9475_00258 [Clostridium symbiosum
WAL-14673]
gi|323504814|gb|EGB20596.1| hypothetical protein HMPREF9475_00258 [Clostridium symbiosum
WAL-14673]
Length = 293
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 80/201 (39%), Gaps = 26/201 (12%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ G D A +G R+L QD E ++ F+ S I +I +
Sbjct: 64 YFDLNTDYGAFLSSVKKRDHYLQNAANAGSGIRILNQDAWEMIITFIISQQRTIPKIREA 123
Query: 169 VDFLASLGSHLGNV-----EGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
V+ L+ L + + + FP+ +L S +L++ GYR+ +
Sbjct: 124 VENLSRLYGEEKHCTFPGGKTVVYFSFPTPAQLKKASLEDLQSLKLGYRARYIHRICLDA 183
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
GT+D L L + ++A++ L G+G KVA CI LF
Sbjct: 184 D---------NGTLD------------LNRLSAMGYRDAMEYLSGFYGIGTKVANCICLF 222
Query: 284 SLDQHHAIPVDTHVWKIATRY 304
L A PVDT + +I T +
Sbjct: 223 GLHHIEAFPVDTWIQQILTNH 243
>gi|156041124|ref|XP_001587548.1| hypothetical protein SS1G_11541 [Sclerotinia sclerotiorum 1980]
gi|154695924|gb|EDN95662.1| hypothetical protein SS1G_11541 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 347
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 45/87 (51%), Gaps = 10/87 (11%)
Query: 223 VRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIAL 282
+RR + + +D+L +H +R L EA D T PGVGPK AACI
Sbjct: 256 IRRQRGKVATNSRGIDLLSLEH---------MRSLSKDEAFDKFITFPGVGPKTAACIIS 306
Query: 283 FSLDQHHAIPVDTHVWKIATRYLLPEL 309
+ QH+A VDTHV++I+ R E+
Sbjct: 307 ICM-QHNAFAVDTHVYQISDRRQEAEV 332
>gi|323483384|ref|ZP_08088772.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium symbiosum WAL-14163]
gi|323403238|gb|EGA95548.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Clostridium symbiosum WAL-14163]
Length = 293
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 80/201 (39%), Gaps = 26/201 (12%)
Query: 109 FLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKM 168
+ ++ G D A +G R+L QD E ++ F+ S I +I +
Sbjct: 64 YFDLNTDYGAFLSSVKKRDHYLQNAANAGSGIRILNQDAWEMIITFIISQQRTIPKIREA 123
Query: 169 VDFLASLGSHLGNV-----EGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
V+ L+ L + + + FP+ +L S +L++ GYR+ +
Sbjct: 124 VENLSRLYGEEKHCTFPGGKTVVYFSFPTPAQLKKASLEDLQSLKLGYRARYIHRICLDA 183
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
GT+D L L + ++A++ L G+G KVA CI LF
Sbjct: 184 D---------NGTLD------------LNRLSAMGYRDAMEYLLGFYGIGTKVANCICLF 222
Query: 284 SLDQHHAIPVDTHVWKIATRY 304
L A PVDT + +I T +
Sbjct: 223 GLHHIEAFPVDTWIQQILTNH 243
>gi|393796833|ref|ZP_10380197.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia BG20]
Length = 125
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 14/118 (11%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
++L T +GQ F WKK Y NG I+ S +
Sbjct: 7 INLENTINSGQVFLWKKHKEFWYG-------------INGQDVLKINDSGKITTYSNKKY 53
Query: 108 DFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARI 165
DF G ++ ++ + S + KYL G R+LRQDP +C + F+ SSN+NI +I
Sbjct: 54 DFFRTGDNIEKIIKSISKDKTTKIAVKKYL-GLRLLRQDPFQCFISFIVSSNSNIQKI 110
>gi|341900542|gb|EGT56477.1| CBN-NTH-1 protein [Caenorhabditis brenneri]
Length = 298
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 44/76 (57%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L K+SG D+ +++D LC+LPGVGPK+A + + D+
Sbjct: 113 RKAIYLQQTARILTDKYSG-----------DIPDSLDGLCSLPGVGPKMANLVMQIAWDK 161
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 162 CEGIAVDTHVHRISNR 177
>gi|449015802|dbj|BAM79204.1| similar to endonuclease III [Cyanidioschyzon merolae strain 10D]
Length = 276
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 36/50 (72%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LR + + ++ L PGVGPK AAC+A+F+L++ + PVDTH++++A R
Sbjct: 157 LRNVPTAQVMEYLTRYPGVGPKTAACVAMFTLERDDSFPVDTHIFRLAKR 206
>gi|282165629|ref|YP_003358014.1| putative endonuclease III [Methanocella paludicola SANAE]
gi|282157943|dbj|BAI63031.1| putative endonuclease III [Methanocella paludicola SANAE]
Length = 293
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 4/75 (5%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A YI + + G EW+ +K+ +EA +AL +L GVGPK A C+ LFSL H
Sbjct: 152 KAAYIIECSRQIIERWGGSLEWM---KKVSTEEAREALLSLYGVGPKTADCVLLFSLG-H 207
Query: 289 HAIPVDTHVWKIATR 303
PVDTH+ +++ R
Sbjct: 208 SVTPVDTHICRVSQR 222
>gi|383318935|ref|YP_005379776.1| EndoIII-related endonuclease [Methanocella conradii HZ254]
gi|379320305|gb|AFC99257.1| putative EndoIII-related endonuclease [Methanocella conradii HZ254]
Length = 291
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 4/75 (5%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A Y+ ++ + G EW+ R+ EA AL +L GVGPK A C+ LFSL H
Sbjct: 150 KASYMIECSRQIEERWGGSLEWM---RRAPTDEARRALLSLHGVGPKTADCVLLFSLG-H 205
Query: 289 HAIPVDTHVWKIATR 303
+PVDTH+ +++ R
Sbjct: 206 SVVPVDTHICRVSQR 220
>gi|256084715|ref|XP_002578572.1| 8-oxoguanine DNA glycosylase [Schistosoma mansoni]
Length = 253
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 38/64 (59%)
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+GG LL LR + EA L +PG+G KVA CI L SLD+ +P+D H+ ++A
Sbjct: 132 NGGRSRLLELRSVSHVEAQAFLLQIPGIGKKVADCICLMSLDKLDVVPIDVHMHRVAREK 191
Query: 305 LLPE 308
+PE
Sbjct: 192 GIPE 195
>gi|119357080|ref|YP_911724.1| HhH-GPD family protein [Chlorobium phaeobacteroides DSM 266]
gi|119354429|gb|ABL65300.1| HhH-GPD family protein [Chlorobium phaeobacteroides DSM 266]
Length = 287
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 83/182 (45%), Gaps = 23/182 (12%)
Query: 141 RVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNV---EGFEFHEFPSLERL 196
++LRQDP ++ F+C+ + I + V+ +A + G + + H FP L
Sbjct: 85 KILRQDPFSTMITFMCAQGIGMQLIRRQVELIAKNYGEEITVAFSGKQITLHRFPDPITL 144
Query: 197 SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+ ELR R + +L S G VD+ +G L+ R
Sbjct: 145 ARADPEELRYCTNNNRVRAANIVLAS-------KAVAEGRVDL-----AG-----LTDRT 187
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK-IATRYLLPELAGVRLT 315
+ L + L + G+G K+A CIALF L + A PVD HV + +AT + +P+ A L+
Sbjct: 188 IPLALLQETLSGIQGIGYKIADCIALFGLGRFDAFPVDNHVRQYLATWFNMPQ-ASRELS 246
Query: 316 PK 317
P+
Sbjct: 247 PR 248
>gi|308481910|ref|XP_003103159.1| CRE-NTH-1 protein [Caenorhabditis remanei]
gi|308260264|gb|EFP04217.1| CRE-NTH-1 protein [Caenorhabditis remanei]
Length = 299
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L K+SG D+ +++D LC+LPGVGPK+A + + ++
Sbjct: 121 RKAVYLQQTAKILVDKYSG-----------DIPDSLDGLCSLPGVGPKMANLVMQIAWNK 169
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 170 CEGIAVDTHVHRISNR 185
>gi|405957861|gb|EKC24039.1| Endonuclease III-like protein 1 [Crassostrea gigas]
Length = 259
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 55/107 (51%), Gaps = 17/107 (15%)
Query: 203 ELRNAGFGYRSAPQSS------LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
+LR G + Q+S L++ V K+ +YI T +LQ +++G
Sbjct: 81 KLREHGCSIDNILQTSDKELGELIYPVGFWKKKVEYIKRTSKILQEEYNG---------- 130
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + ++ LC LPGVGPK+A + + +Q I VDTHV +I+ R
Sbjct: 131 -DIPDTVEKLCKLPGVGPKMAYLVMKCAWNQIVGIGVDTHVHRISNR 176
>gi|147919100|ref|YP_687169.1| endonuclease III [Methanocella arvoryzae MRE50]
gi|110622565|emb|CAJ37843.1| predicted endonuclease III [Methanocella arvoryzae MRE50]
Length = 243
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 58/102 (56%), Gaps = 8/102 (7%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
SAP+S++ +R ++AK I ++ ++ G +++ + D + A L T+
Sbjct: 67 SAPESAIADLIRIGGLHEQKAKLIKNISQLVIDEYDGTLDFVC---ETDPEVARKELLTI 123
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY-LLPELA 310
GVGPK A C+ LFS D+ IPVDTHV++I R ++PE A
Sbjct: 124 KGVGPKTADCVLLFSCDR-DVIPVDTHVFRITKRLGIVPEKA 164
>gi|421836986|ref|ZP_16271295.1| 8-oxoguanine DNA glycosylase, partial [Clostridium botulinum
CFSAN001627]
gi|409740983|gb|EKN41017.1| 8-oxoguanine DNA glycosylase, partial [Clostridium botulinum
CFSAN001627]
Length = 137
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 67/141 (47%), Gaps = 28/141 (19%)
Query: 177 SHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGT 236
+ GN+ ++ FP++E+L +E EL+ G+R A+YI T
Sbjct: 15 EYKGNI----YYSFPTVEQLKDATEDELKACSVGFR-----------------ARYIKDT 53
Query: 237 VD-VLQSKHSGGAEWLLSLRKLDLQEAIDALC-----TLPGVGPKVAACIALFSLDQHHA 290
V+ + Q+ ++ L ++ D +C G+G KVA C+ LFS++++ A
Sbjct: 54 VNKIYQNSIEECEQYEKEYDMLWIKNQQDDICHKVLQNYSGIGAKVADCVMLFSMEKYSA 113
Query: 291 IPVDTHVWK-IATRYLLPELA 310
PVD V + + YL P+++
Sbjct: 114 FPVDVWVKRAMQYFYLAPDVS 134
>gi|340374045|ref|XP_003385549.1| PREDICTED: endonuclease III-like protein 1-like [Amphimedon
queenslandica]
Length = 295
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L+F V ++A+Y+ T +L+ KH+G D+ E+++ L LPGVGPK+A
Sbjct: 157 LIFPVGFYRRKAQYLKKTAQILKEKHNG-----------DIPESLEGLMALPGVGPKMAH 205
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 206 LCMDIAWGNVTGIGVDTHVHRIANR 230
>gi|18408516|ref|NP_566893.1| HhH-GPD base excision DNA repair protein-related protein
[Arabidopsis thaliana]
gi|332644814|gb|AEE78335.1| HhH-GPD base excision DNA repair protein-related protein
[Arabidopsis thaliana]
Length = 293
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+A I ++ LQ++ G L LR L ++E L GVGPK +C+ +F+L Q
Sbjct: 151 KKAVCIKNILNRLQNER--GRLCLEYLRGLSVEEVKTELSHFKGVGPKTVSCVLMFNL-Q 207
Query: 288 HHAIPVDTHVWKIA 301
H+ PVDTHV++IA
Sbjct: 208 HNDFPVDTHVFEIA 221
>gi|116754221|ref|YP_843339.1| HhH-GPD family protein [Methanosaeta thermophila PT]
gi|116665672|gb|ABK14699.1| HhH-GPD family protein [Methanosaeta thermophila PT]
Length = 219
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 55/95 (57%), Gaps = 4/95 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A I G ++ L+ G+ L L+++ EA + L ++PG+GPK A+ + LF
Sbjct: 83 KAARIKGALERLRDDF--GSVDLSPLKRMSAVEARNYLKSIPGIGPKTASVLMLFGFGM- 139
Query: 289 HAIPVDTHVWKIATRY-LLPELAGVRLTPKLCSRV 322
A+PVDTHV++++ R L+PE A + T ++ +
Sbjct: 140 SAMPVDTHVYRVSRRMGLVPENASIEETQRILEEI 174
>gi|297815970|ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297321706|gb|EFH52127.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 294
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+A I ++ LQ++ G L LR L ++E L G+GPK +C+ +F+L Q
Sbjct: 151 KKAVCIKNILNRLQTER--GVLCLEYLRGLSVEEVKTELSHFKGIGPKTVSCVLMFNL-Q 207
Query: 288 HHAIPVDTHVWKIA 301
H+ PVDTHV++IA
Sbjct: 208 HNDFPVDTHVFEIA 221
>gi|399575316|ref|ZP_10769074.1| endoiii-related endonuclease [Halogranum salarium B-1]
gi|399239584|gb|EJN60510.1| endoiii-related endonuclease [Halogranum salarium B-1]
Length = 265
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
A QS L +++ + ++++ I G + + + G E+ +R+ D Q L +
Sbjct: 92 DAEQSELAETIQSAGLYNQKSEMIVGAAEEIVDEFGGADEFDEFVREEDPQTVRTRLLEI 151
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LFS + PVDTHV +IA R L
Sbjct: 152 HGVGPKTADCVLLFSGGRGGVFPVDTHVHRIARRMGL 188
>gi|440292608|gb|ELP85795.1| endonuclease III, putative [Entamoeba invadens IP1]
Length = 236
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 28/41 (68%)
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D LC LPGVG K+A+ I + D H AIPVDTHV+ I+ R
Sbjct: 129 FDELCALPGVGTKIASLILAIAFDCHVAIPVDTHVFTISNR 169
>gi|269837103|ref|YP_003319331.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerobacter
thermophilus DSM 20745]
gi|269786366|gb|ACZ38509.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerobacter
thermophilus DSM 20745]
Length = 247
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 41/63 (65%), Gaps = 1/63 (1%)
Query: 244 HSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+SG L SL L L EA L +LPG+GPK AAC+ LF+ + A+PVDTHV++++ R
Sbjct: 114 NSGEDPPLASLFTLPLPEAKRRLTSLPGIGPKTAACVLLFACGR-PALPVDTHVYRVSRR 172
Query: 304 YLL 306
L
Sbjct: 173 VGL 175
>gi|335045935|ref|ZP_08538958.1| hypothetical protein HMPREF9124_0883 [Oribacterium sp. oral taxon
108 str. F0425]
gi|333759721|gb|EGL37278.1| hypothetical protein HMPREF9124_0883 [Oribacterium sp. oral taxon
108 str. F0425]
Length = 172
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 67/164 (40%), Gaps = 32/164 (19%)
Query: 142 VLRQDPVECLLQFLCSSNNNIARITKMVDFLASL--GSHLGNVEGFEFHEFPSLERLSLV 199
+LRQDP E LL F+ + +I I V+ L VE E + FP E L
Sbjct: 1 MLRQDPFETLLTFILTQRKSIPAIRSSVEKLCERFGEKQYSKVEEKEVYLFPRAEALQ-- 58
Query: 200 SEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLD- 258
+ GYR A ++ V+ + K L LR LD
Sbjct: 59 -NADFSECSLGYR-----------------APFVRDAVERVLEKR-------LDLRALDK 93
Query: 259 --LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+E + L + GVG KVAAC+ALF+ + IP D + +I
Sbjct: 94 VPTEELLFKLMEVHGVGIKVAACVALFAYSRMDIIPEDVWMKRI 137
>gi|409082157|gb|EKM82515.1| hypothetical protein AGABI1DRAFT_117974 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 139
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL---LPELAGVRLTPKLCSRV 322
GVG KVA CI L SLD+ +PVDTHV +IA +Y P + +TPKL V
Sbjct: 2 GVGRKVADCILLMSLDKKEVVPVDTHVHQIAVKYYGMKRPGSGKMTMTPKLYEEV 56
>gi|426199984|gb|EKV49908.1| hypothetical protein AGABI2DRAFT_190340 [Agaricus bisporus var.
bisporus H97]
Length = 139
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL---LPELAGVRLTPKLCSRV 322
GVG KVA CI L SLD+ +PVDTHV +IA +Y P + +TPKL V
Sbjct: 2 GVGRKVADCILLMSLDKKEVVPVDTHVHQIAVKYYGMKRPGSGKMTMTPKLYEEV 56
>gi|448590303|ref|ZP_21650179.1| endonuclease III [Haloferax elongans ATCC BAA-1513]
gi|445734936|gb|ELZ86491.1| endonuclease III [Haloferax elongans ATCC BAA-1513]
Length = 267
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + +L+ ++R + +++K I G + + + A++ +R+ D + + L +
Sbjct: 93 AADREALVEAIRSAGLYNQKSKLIIGVAEAVLADFGSEADFDTYIREADPETVRERLLEM 152
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +I+ R L
Sbjct: 153 KGVGPKTADCVLLFAGGRDGVFPVDTHVHRISRRIGL 189
>gi|405957860|gb|EKC24038.1| Endonuclease III-like protein 1 [Crassostrea gigas]
Length = 372
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
L++ V K+ +YI T +LQ +++G D+ + ++ LC LPGVGPK+
Sbjct: 72 GELIYPVGFWKKKIEYIKRTSKILQEEYNG-----------DIPDTVEKLCKLPGVGPKM 120
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
A + +Q I VDTHV +I+ R
Sbjct: 121 AYLAMKCAWNQIVGIGVDTHVHRISNR 147
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 11/68 (16%)
Query: 236 TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
T +LQ +++G D+ + ++ LC LPGVGPK+A + +Q I VDT
Sbjct: 233 TSKILQEEYNG-----------DIPDTVEKLCKLPGVGPKMAYLAMKCAWNQIVGIGVDT 281
Query: 296 HVWKIATR 303
HV +I+ R
Sbjct: 282 HVHRISNR 289
>gi|147920603|ref|YP_685600.1| endonuclease III [Methanocella arvoryzae MRE50]
gi|110620996|emb|CAJ36274.1| predicted endonuclease III [Methanocella arvoryzae MRE50]
Length = 307
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 4/75 (5%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A +I + + G EW+ R QEA +AL +L GVGPK A C+ LF+L H
Sbjct: 162 KADFIIRCSQEIIDRWGGSLEWM---RTAPTQEAREALMSLYGVGPKTADCVLLFAL-GH 217
Query: 289 HAIPVDTHVWKIATR 303
+ VDTH+ +++ R
Sbjct: 218 SVVAVDTHICRVSER 232
>gi|310800884|gb|EFQ35777.1| base excision DNA repair protein [Glomerella graminicola M1.001]
Length = 389
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I G ++ ++ K+ G+ L L +EA+ L + GVGPK A+C+ LF L Q
Sbjct: 242 KSKVIIGILNQVKEKY--GSYTLDHLFNATNEEAMQELISFQGVGPKTASCVLLFCL-QR 298
Query: 289 HAIPVDTHVWKI 300
+ VDTHVW+I
Sbjct: 299 ESFAVDTHVWRI 310
>gi|449469507|ref|XP_004152461.1| PREDICTED: DEMETER-like protein 2-like [Cucumis sativus]
Length = 234
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 98/247 (39%), Gaps = 24/247 (9%)
Query: 56 TGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVC-----YHIHTSPSEPAAKSALLDFL 110
TG+ R +K P + GP S++ + NG C Y H+SP+ S D L
Sbjct: 5 TGKKRRLQKPEPPADSHSDGPQFKSIRTV-NGFNCSTIDPYPAHSSPTSDECLSVRDDLL 63
Query: 111 NMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVD 170
N+ E + + R +E + G R +D VE +F+ + + + +V
Sbjct: 64 NLHGFPREFLK-YRKERERLSECCSVVDGVRAEHRDNVES--EFV---DEKESVLDGLVK 117
Query: 171 FLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+ S + N E F SL+ E L A + L S + A
Sbjct: 118 TVLSQNTTEANSE----RAFASLKSAFAAWEDVLSAESKCIEDAIRCGGLASTK-----A 168
Query: 231 KYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHA 290
I + L + G L LR L + E L T G+GPK AC+ +F+L Q
Sbjct: 169 SCIKNILSSLSKRR--GKLCLEFLRDLSIDEIKAELSTFKGIGPKTVACVLMFNL-QKDD 225
Query: 291 IPVDTHV 297
PVDTHV
Sbjct: 226 FPVDTHV 232
>gi|392894668|ref|NP_001254908.1| Protein NTH-1, isoform b [Caenorhabditis elegans]
gi|293324783|emb|CBK55599.1| Protein NTH-1, isoform b [Caenorhabditis elegans]
Length = 224
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L+ SG D+ +++D LC LPGVGPK+A + + +
Sbjct: 48 RKAVYLQKTAKILKDDFSG-----------DIPDSLDGLCALPGVGPKMANLVMQIAWGE 96
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 97 CVGIAVDTHVHRISNR 112
>gi|436837874|ref|YP_007323090.1| HhH-GPD family protein [Fibrella aestuarina BUZ 2]
gi|384069287|emb|CCH02497.1| HhH-GPD family protein [Fibrella aestuarina BUZ 2]
Length = 236
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 52/95 (54%), Gaps = 6/95 (6%)
Query: 212 RSAPQSSLLFSVRRSFK---QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
R AP L+ +++ + +A YI + VL ++ G + LR+L +EA+ L
Sbjct: 65 RDAPLPDLIDAIQTANYPEIKAPYIQNVLRVLIAER--GTANIDFLRELSTEEAMQWLTN 122
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LPG+GPK A + LF+ Q +PVDTHV ++ R
Sbjct: 123 LPGIGPKTATLLLLFAF-QKPVLPVDTHVHRVTQR 156
>gi|392407188|ref|YP_006443796.1| endoIII-related endonuclease [Anaerobaculum mobile DSM 13181]
gi|390620324|gb|AFM21471.1| putative endoIII-related endonuclease [Anaerobaculum mobile DSM
13181]
Length = 253
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 72/174 (41%), Gaps = 29/174 (16%)
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHE 189
A L YL AR+ R++P++ L+ + S N N + L + V E
Sbjct: 39 LAGLWGYLRPARMTREEPLDGLILTVLSQNTNDKNRDRAYALLKARFPRWDEVLAATHEE 98
Query: 190 FPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
S+ +R AG S +A I + ++ + +
Sbjct: 99 IASV----------IRPAGL----------------SNVKAMRIKEILKIVSDRFGDCS- 131
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L LR++ + ++ L +LPGVGPK AC+ LF L A PVDTHV ++ R
Sbjct: 132 -LRDLRRMKKGDIVEFLSSLPGVGPKTVACVLLFDLGL-PAFPVDTHVGRLCKR 183
>gi|289523004|ref|ZP_06439858.1| endonuclease III [Anaerobaculum hydrogeniformans ATCC BAA-1850]
gi|289503547|gb|EFD24711.1| endonuclease III [Anaerobaculum hydrogeniformans ATCC BAA-1850]
Length = 255
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 76/174 (43%), Gaps = 29/174 (16%)
Query: 130 FAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHE 189
A L +L A+ +P++ L+ + S N N + +FL S
Sbjct: 39 LAGLWSHLRPAKTTSDEPLDGLILTILSQNTNDKNRDRAYEFLRS--------------R 84
Query: 190 FPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAE 249
P E + + +E EL + A S++ +A I + ++ + G+
Sbjct: 85 SPRWEDVLVTAETELAEV---IKPAGLSNI---------KASRIKSVLGLITERF--GSC 130
Query: 250 WLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L L+ + +E ID L +LPGVGPK AC+ LF L A PVDTHV ++ R
Sbjct: 131 SLKPLKGMKKEEIIDFLSSLPGVGPKTVACVLLFDLGI-PAFPVDTHVNRLCKR 183
>gi|453088940|gb|EMF16980.1| DNA glycosylase [Mycosphaerella populorum SO2202]
Length = 489
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 39/58 (67%), Gaps = 5/58 (8%)
Query: 251 LLSLRKLDL---QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYL 305
++SL L L +AID L T PG+GPK A+C+ALF L Q + VDTHV+++ +YL
Sbjct: 307 VISLDHLHLMSTNDAIDKLITYPGIGPKTASCVALFCL-QRPSFAVDTHVFRL-VQYL 362
>gi|357419910|ref|YP_004932902.1| HhH-GPD family protein [Thermovirga lienii DSM 17291]
gi|355397376|gb|AER66805.1| HhH-GPD family protein [Thermovirga lienii DSM 17291]
Length = 244
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 73/159 (45%), Gaps = 29/159 (18%)
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVEL 204
+DP++ L++ + S N N + L FPS ++ V + ++
Sbjct: 49 EDPIDGLIKTVLSQNTNDKNRDMAWESLKD--------------RFPSWQQ---VIDAKV 91
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
RSA +++ ++ K I TV+ + S L L++L+ +E I
Sbjct: 92 EEVADAIRSAGIANV------KAQRIKKILATVNGVWGSCS-----LKGLKRLEKEEIIS 140
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L +LPGVGPK AC+ LF L A PVDTHV +++ R
Sbjct: 141 FLSSLPGVGPKTIACVLLFDLGI-PAFPVDTHVARLSKR 178
>gi|448613277|ref|ZP_21663157.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mucosum ATCC BAA-1512]
gi|445740174|gb|ELZ91680.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mucosum ATCC BAA-1512]
Length = 268
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 3/94 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D D L +
Sbjct: 94 AADREGIIDAIRSGGLYNRKSKLIQGVAEDVLTNFGSEADFDRYVREEDPSTVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPK A C+ LF+ + PVDTHV +IA R
Sbjct: 154 NGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARR 187
>gi|392894664|ref|NP_001254906.1| Protein NTH-1, isoform d [Caenorhabditis elegans]
gi|341941198|sp|P54137.2|NTH1_CAEEL RecName: Full=Probable endonuclease III homolog; AltName:
Full=CeNTH; AltName: Full=DNA-(Apurinic or apyrimidinic
site) lyase
gi|256773103|dbj|BAI22676.1| homolog of human endonuclease III [Caenorhabditis elegans]
gi|257145792|emb|CAA90766.2| Protein NTH-1, isoform d [Caenorhabditis elegans]
Length = 298
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L+ SG D+ +++D LC LPGVGPK+A + + +
Sbjct: 122 RKAVYLQKTAKILKDDFSG-----------DIPDSLDGLCALPGVGPKMANLVMQIAWGE 170
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 171 CVGIAVDTHVHRISNR 186
>gi|392894666|ref|NP_001254907.1| Protein NTH-1, isoform a [Caenorhabditis elegans]
gi|293324782|emb|CBK55598.1| Protein NTH-1, isoform a [Caenorhabditis elegans]
Length = 293
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L+ SG D+ +++D LC LPGVGPK+A + + +
Sbjct: 117 RKAVYLQKTAKILKDDFSG-----------DIPDSLDGLCALPGVGPKMANLVMQIAWGE 165
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 166 CVGIAVDTHVHRISNR 181
>gi|10038661|dbj|BAB13294.1| OGG1 type 1e [Homo sapiens]
Length = 181
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 43/90 (47%), Gaps = 19/90 (21%)
Query: 187 FHEFPSLERLSLVS-EVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
+H FPSL+ L+ E LR G GYR A+Y++ + + +
Sbjct: 5 YHGFPSLQALAGPEVEAHLRKLGLGYR-----------------ARYVSASARAILEEQ- 46
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPK 275
GG WL LR+ +EA ALC LPGVG K
Sbjct: 47 GGLAWLQQLRESSYEEAHKALCILPGVGTK 76
>gi|367471322|ref|ZP_09470967.1| Endonuclease III [Patulibacter sp. I11]
gi|365813601|gb|EHN08854.1| Endonuclease III [Patulibacter sp. I11]
Length = 248
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 264 DALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
D LC LPGVG K AAC+ LF+ H +PVDTHV ++ TR L
Sbjct: 148 DFLCELPGVGRKTAACVLLFAYGLHD-VPVDTHVQRVGTRLAL 189
>gi|380494482|emb|CCF33119.1| base excision DNA repair protein [Colletotrichum higginsianum]
Length = 385
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I G ++ ++ K+ G+ L L ++A+ L + GVGPK A+C+ LF L Q
Sbjct: 238 KSKVIIGILNQVREKY--GSYSLDHLFNASNEDAMQELISFQGVGPKTASCVLLFCL-QR 294
Query: 289 HAIPVDTHVWKI 300
+ VDTHVW+I
Sbjct: 295 ESFAVDTHVWRI 306
>gi|402589187|gb|EJW83119.1| hypothetical protein WUBG_05972, partial [Wuchereria bancrofti]
Length = 130
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+A YI ++L+ ++ G D+ ++ LC+LPGVG K+A + DQ
Sbjct: 1 KKAVYIKKVAEILKERYDG-----------DIPNTVEGLCSLPGVGEKMAYLAMCTAWDQ 49
Query: 288 HHAIPVDTHVWKIATR 303
+ VDTHV +I+ R
Sbjct: 50 LEGLGVDTHVHRISNR 65
>gi|424814000|ref|ZP_18239178.1| putative EndoIII-related endonuclease [Candidatus Nanosalina sp.
J07AB43]
gi|339757616|gb|EGQ42873.1| putative EndoIII-related endonuclease [Candidatus Nanosalina sp.
J07AB43]
Length = 119
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 9/65 (13%)
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTP 316
+D++E + L +PG+GPK AA I F Q IPVDTHV +IA R+ L P
Sbjct: 1 MDVEEGKNWLTDIPGIGPKTAAVILCFHF-QKPVIPVDTHVHRIAQRF--------DLVP 51
Query: 317 KLCSR 321
+ CSR
Sbjct: 52 ENCSR 56
>gi|113205550|ref|NP_001037884.1| nth endonuclease III-like 1 [Xenopus (Silurana) tropicalis]
gi|89267879|emb|CAJ83279.1| nth endonuclease III-like 1 (E. coli) [Xenopus (Silurana)
tropicalis]
gi|166796488|gb|AAI59396.1| nth endonuclease III-like 1 (E. coli) [Xenopus (Silurana)
tropicalis]
Length = 300
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T ++LQ K+ G D+ + + L LPGVGPK+A
Sbjct: 160 LIYPVGFWKNKVKYIKQTTEILQEKYGG-----------DIPDNVTDLVKLPGVGPKMAH 208
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ + + I VDTHV +I+ R
Sbjct: 209 LVMDIAWNNVSGIGVDTHVHRISNR 233
>gi|312073211|ref|XP_003139418.1| hypothetical protein LOAG_03833 [Loa loa]
Length = 130
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+A YI +L+ ++ G D+ I+ LC+LPGVG K+A + DQ
Sbjct: 1 KKALYIKKVAKILKERYDG-----------DIPNTIEELCSLPGVGEKMAYLAMCNAWDQ 49
Query: 288 HHAIPVDTHVWKIATR 303
+ VDTHV +I+ R
Sbjct: 50 MKGLGVDTHVHRISNR 65
>gi|448574894|ref|ZP_21641417.1| endonuclease III [Haloferax larsenii JCM 13917]
gi|445732573|gb|ELZ84155.1| endonuclease III [Haloferax larsenii JCM 13917]
Length = 267
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
+++K I G + + S A++ +R+ D + + L + GVGPK A C+ LF+ +
Sbjct: 111 QKSKLIIGVAEAVLSDFGSEADFDQWVREGDPETVRERLLEMKGVGPKTADCVLLFAGGR 170
Query: 288 HHAIPVDTHVWKIATRYLL 306
PVDTHV +I+ R L
Sbjct: 171 DGVFPVDTHVHRISRRIGL 189
>gi|330508003|ref|YP_004384431.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosaeta concilii
GP6]
gi|328928811|gb|AEB68613.1| DNA-(apurinic or apyrimidinic site) lyase, putative [Methanosaeta
concilii GP6]
Length = 224
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 239 VLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
+L K GA + L +D EA+ L LPGVGPK A+ + LF+ +PVDTHV+
Sbjct: 95 LLSIKRDRGAMDIGFLEGMDKDEAMSYLLDLPGVGPKTASIVLLFAFGMPF-MPVDTHVF 153
Query: 299 KIATRY-LLPE 308
+++ R L+PE
Sbjct: 154 RVSQRLGLVPE 164
>gi|354611948|ref|ZP_09029900.1| HhH-GPD family protein [Halobacterium sp. DL1]
gi|353191526|gb|EHB57032.1| HhH-GPD family protein [Halobacterium sp. DL1]
Length = 269
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
++ G A + +R D DAL + GVGPK A C+ LFS + PVDTHV +IA
Sbjct: 128 EYGGEAGFDEFVRTEDPDAVRDALLDMKGVGPKTADCVLLFSGGRAGVFPVDTHVHRIAR 187
Query: 303 RYLL 306
R L
Sbjct: 188 RMGL 191
>gi|324511601|gb|ADY44825.1| Endonuclease III-like protein 1 [Ascaris suum]
Length = 266
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 23/112 (20%)
Query: 192 SLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWL 251
++E + +SE+EL++ LL V ++A Y+ D+L +K+ G
Sbjct: 113 TVEGIIEMSELELQD------------LLIPVGFYKRKAIYLKKVADILSNKYGG----- 155
Query: 252 LSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ LC+LPGVGPK+A + + + VDTHV +IA R
Sbjct: 156 ------DIPNTVEDLCSLPGVGPKMAHLAMQHAWGRIEGLGVDTHVHRIANR 201
>gi|237841919|ref|XP_002370257.1| N-glycosylase/DNA lyase, putative [Toxoplasma gondii ME49]
gi|211967921|gb|EEB03117.1| N-glycosylase/DNA lyase, putative [Toxoplasma gondii ME49]
gi|221482729|gb|EEE21067.1| 8-oxoguanine DNA glycosylase, putative [Toxoplasma gondii GT1]
Length = 752
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 19/127 (14%)
Query: 187 FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR------------SFKQAKYIT 234
+H FPS L+ E +L+ G GYR+ LL S + S ++A+
Sbjct: 495 WHSFPSSSALARAREEDLKKLGLGYRA----RLLHSAAKALDARGGDAFLLSLREARRRQ 550
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
G D + +E + + +++EA L G+G KVA C+AL+SL A PVD
Sbjct: 551 GHPDSKEGTVEKESEESVFQVEKEIREA---LLPFAGIGRKVADCVALYSLGCWAAWPVD 607
Query: 295 THVWKIA 301
TH+ + A
Sbjct: 608 THLLQHA 614
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
Query: 138 AGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERL 196
+ RVL+ VEC FLCSSNNNI RI +MV L + G L V G E + E+
Sbjct: 347 SAGRVLQLPVVECFFSFLCSSNNNIPRIMQMVRALRNAYGDFL--VRGAE----AAAEKR 400
Query: 197 SLVSEVELRNAGFG 210
+ +E + + AG G
Sbjct: 401 QMQAEPDRKPAGRG 414
>gi|448419884|ref|ZP_21580728.1| endoiii-related endonuclease [Halosarcina pallida JCM 14848]
gi|445674798|gb|ELZ27335.1| endoiii-related endonuclease [Halosarcina pallida JCM 14848]
Length = 283
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 73/167 (43%), Gaps = 14/167 (8%)
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEV-- 202
+D ECL++ + S N + +AS +H +E + E S SE
Sbjct: 47 RDAFECLVRTILSQNTSD---------VASQPAHDSLMERYSSGEARSASETRAGSEATR 97
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDL 259
E + +A QS L ++ + ++++ I G + + + G + + D
Sbjct: 98 ERSDLAESLAAAEQSELAETISSAGLYNQKSEMIIGAAERICEEFGGADGFDAFVTGGDP 157
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
E + L + GVGPK A C+ LFS + PVDTHV +IA R L
Sbjct: 158 GEVRERLLDIHGVGPKTADCVLLFSGGRGGVFPVDTHVHRIARRMGL 204
>gi|393906282|gb|EJD74237.1| hypothetical protein LOAG_18420 [Loa loa]
Length = 204
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+A YI +L+ ++ G D+ I+ LC+LPGVG K+A + DQ
Sbjct: 75 KKALYIKKVAKILKERYDG-----------DIPNTIEELCSLPGVGEKMAYLAMCNAWDQ 123
Query: 288 HHAIPVDTHVWKIATR 303
+ VDTHV +I+ R
Sbjct: 124 MKGLGVDTHVHRISNR 139
>gi|257389114|ref|YP_003178887.1| DNA-(apurinic or apyrimidinic site) lyase [Halomicrobium mukohataei
DSM 12286]
gi|257171421|gb|ACV49180.1| DNA-(apurinic or apyrimidinic site) lyase [Halomicrobium mukohataei
DSM 12286]
Length = 270
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%)
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GAE+ +R D + AL + GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 133 GAEFDAFVRGGDPGDVRSALLDMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGL 192
>gi|221503078|gb|EEE28784.1| 8-oxoguanine DNA glycosylase, putative [Toxoplasma gondii VEG]
Length = 351
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 58/128 (45%), Gaps = 21/128 (16%)
Query: 187 FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR------------SFKQAKYIT 234
+H FPS L+ E +L+ G GYR+ LL S + S ++A+
Sbjct: 94 WHSFPSSSALARAREEDLKKLGLGYRA----RLLHSAAKALDARGGDAFLLSLREARRRQ 149
Query: 235 GTVDVLQSK-HSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
G D + E + + K E +AL G+G KVA C+AL+SL A PV
Sbjct: 150 GHPDSKEGTVEKESEESVFQVEK----EIREALLPFAGIGRKVADCVALYSLGCWAAWPV 205
Query: 294 DTHVWKIA 301
DTH+ + A
Sbjct: 206 DTHLLQHA 213
>gi|448582052|ref|ZP_21645556.1| endonuclease III [Haloferax gibbonsii ATCC 33959]
gi|445731700|gb|ELZ83283.1| endonuclease III [Haloferax gibbonsii ATCC 33959]
Length = 268
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A + +R+ D D L +
Sbjct: 94 AADREGIVEAIRSGGLYNQKSKLIQGVAEDVLADFGSEAGFDRYVREEDPSTVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|452988619|gb|EME88374.1| hypothetical protein MYCFIDRAFT_25275 [Pseudocercospora fijiensis
CIRAD86]
Length = 470
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 1/44 (2%)
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L EAI+ + + PG+GPK A+C+ALF L Q + VDTHV+++
Sbjct: 314 LSTNEAIEKMLSFPGIGPKTASCVALFCL-QRPSFAVDTHVFRL 356
>gi|409074630|gb|EKM75023.1| hypothetical protein AGABI1DRAFT_24137, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 235
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 12/108 (11%)
Query: 205 RNAGFGYRSAPQSSLLFSVRR---SFKQAKYITGTVDVLQSKHSGGA--------EWLLS 253
RN +AP+ ++ ++R + K+A+ I G + ++ KH + E S
Sbjct: 80 RNNFEAIAAAPKQKVIDAIRSGGLANKKARMIQGLLQAVKEKHGSYSLQHLARKPESTSS 139
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
L E +D L + GVGPK A+C+ LF + + + PVDTHV +++
Sbjct: 140 APALGDDEIMDELLSYDGVGPKTASCVLLFCIGR-DSFPVDTHVLRLS 186
>gi|389846321|ref|YP_006348560.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|448616059|ref|ZP_21664769.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|388243627|gb|AFK18573.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|445750714|gb|EMA02151.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
Length = 268
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 48/94 (51%), Gaps = 3/94 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + ++ +R+ D D L +
Sbjct: 94 AADREGIIDAIRSGGLYNQKSKLIQGVAEEVLADFGSEDDFDRYVREADPSTVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPK A C+ LF+ + PVDTHV +IA R
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARR 187
>gi|357610921|gb|EHJ67214.1| hypothetical protein KGM_13809 [Danaus plexippus]
Length = 429
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 11/75 (14%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+ KYI T L+ ++ G D+ +++D LC L GVGPK+A + ++
Sbjct: 170 KVKYIKKTTQTLKDQYDG-----------DIPDSVDKLCKLTGVGPKMAHICMKVAWNKV 218
Query: 289 HAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 219 TGIGVDTHVHRISNR 233
>gi|95007107|emb|CAJ20327.1| hypothetical protein, expressed [Toxoplasma gondii RH]
Length = 378
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 58/128 (45%), Gaps = 21/128 (16%)
Query: 187 FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRR------------SFKQAKYIT 234
+H FPS L+ E +L+ G GYR+ LL S + S ++A+
Sbjct: 121 WHSFPSSSALARAREEDLKKLGLGYRA----RLLHSAAKALDARGGDAFLLSLREARRRQ 176
Query: 235 GTVDVLQSK-HSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPV 293
G D + E + + K E +AL G+G KVA C+AL+SL A PV
Sbjct: 177 GHPDSKEGTVEKESEESVFQVEK----EIREALLPFAGIGRKVADCVALYSLGCWAAWPV 232
Query: 294 DTHVWKIA 301
DTH+ + A
Sbjct: 233 DTHLLQHA 240
>gi|426195516|gb|EKV45446.1| hypothetical protein AGABI2DRAFT_44066, partial [Agaricus bisporus
var. bisporus H97]
Length = 234
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 12/108 (11%)
Query: 205 RNAGFGYRSAPQSSLLFSVRR---SFKQAKYITGTVDVLQSKHSGGA--------EWLLS 253
RN +AP+ ++ ++R + K+A+ I G + ++ KH + E S
Sbjct: 79 RNNFEAIAAAPKQKVIDAIRSGGLANKKARMIQGLLQAVKEKHGSYSLQHLARKPESTSS 138
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
L E +D L + GVGPK A+C+ LF + + + PVDTHV +++
Sbjct: 139 APALRDDEIMDELLSYDGVGPKTASCVLLFCIGR-DSFPVDTHVLRLS 185
>gi|170584526|ref|XP_001897050.1| Endonuclease III-like protein 1 [Brugia malayi]
gi|158595585|gb|EDP34128.1| Endonuclease III-like protein 1, putative [Brugia malayi]
Length = 261
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
LL V K+A YI ++L+ ++ G D+ ++ LC+LPGVG K+A
Sbjct: 123 LLIPVGFYKKKAVYIKKVAEILKERYDG-----------DIPNTVEGLCSLPGVGEKMAY 171
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ +Q + VDTHV +I+ R
Sbjct: 172 LTMCTAWNQLEGLGVDTHVHRISNR 196
>gi|452847106|gb|EME49038.1| hypothetical protein DOTSEDRAFT_49374 [Dothistroma septosporum
NZE10]
Length = 497
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 45/68 (66%), Gaps = 5/68 (7%)
Query: 241 QSKHSGGAEWLLSLRKLDL---QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
Q++ + + ++SL L L +AI+ + +LPG+GPK A+C+ALF L Q + VDTHV
Sbjct: 306 QNEVTKAKQNIISLDHLHLLSTDDAINKMLSLPGIGPKTASCVALFCL-QRPSFAVDTHV 364
Query: 298 WKIATRYL 305
+++ +YL
Sbjct: 365 FRLC-QYL 371
>gi|356565037|ref|XP_003550751.1| PREDICTED: DEMETER-like protein 2-like [Glycine max]
Length = 172
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 218 SLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGP 274
S LFS+R +A I + L+ + G L LR L + E L G+GP
Sbjct: 15 SFLFSIRCGGLAPTKASCIKNVLRCLRERR--GELCLEYLRDLSVDEVKAELSLFKGIGP 72
Query: 275 KVAACIALFSLDQHHAIPVDTHVWKIA 301
K AC+ +F+L Q PVDTH+++IA
Sbjct: 73 KTVACVLMFNL-QQDDFPVDTHIFEIA 98
>gi|385810254|ref|YP_005846650.1| endonuclease III [Ignavibacterium album JCM 16511]
gi|383802302|gb|AFH49382.1| Putative endonuclease III [Ignavibacterium album JCM 16511]
Length = 236
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++AK I + L+ H G L L+K E +D L G+G K A+C+ LF+ D+
Sbjct: 83 QKAKAIKNLLKNLKKYH--GKLSLTHLKKKSDDEVLDELILHKGIGVKTASCVLLFAFDR 140
Query: 288 HHAIPVDTHVWKIATRYLLPELAG 311
+ PVDTHV +I R + E +
Sbjct: 141 -NVCPVDTHVHRILNRVGVVETSN 163
>gi|406868323|gb|EKD21360.1| HhH-GPD superfamily base excision DNA repair protein [Marssonina
brunnea f. sp. 'multigermtubi' MB_m1]
Length = 441
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 11/84 (13%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ L+++V + KYI T ++L+ +++G D+ + I+ L +LPGVGPK+
Sbjct: 217 NELIWAVGFHNNKTKYIKATAEILRDQYNG-----------DIPDTIEGLMSLPGVGPKM 265
Query: 277 AACIALFSLDQHHAIPVDTHVWKI 300
+ +H I VD HV +I
Sbjct: 266 GYLCLSAAWGKHEGIGVDVHVHRI 289
>gi|448605866|ref|ZP_21658459.1| endonuclease III [Haloferax sulfurifontis ATCC BAA-897]
gi|445741189|gb|ELZ92693.1| endonuclease III [Haloferax sulfurifontis ATCC BAA-897]
Length = 268
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++ I G + + + A++ +R+ D D L +
Sbjct: 94 AADREGIVDAIRSGGLYNQKSNLIQGVAEEVLADFGSEADFDRYVREEDPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|427777899|gb|JAA54401.1| Putative endonuclease iii [Rhipicephalus pulchellus]
Length = 321
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V +AK++ T VL K+ G D+ ++++ LC+LPGVGPK+A
Sbjct: 180 LIYPVSFYKNKAKHLKQTAKVLLEKYDG-----------DIPDSVEGLCSLPGVGPKMAY 228
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +I+ R
Sbjct: 229 LAMSCGWGRTVGIGVDTHVHRISNR 253
>gi|187469057|gb|AAI66807.1| Ogg1 protein [Rattus norvegicus]
Length = 137
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/113 (22%), Positives = 52/113 (46%), Gaps = 5/113 (4%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L ++ C
Sbjct: 17 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEDQLYCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG 139
+ + L + + +SL +L+ +++ D F +A+ G
Sbjct: 77 VYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDSHFQSVAQKFQG 129
>gi|427783639|gb|JAA57271.1| Putative endonuclease iii [Rhipicephalus pulchellus]
Length = 306
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V +AK++ T VL K+ G D+ ++++ LC+LPGVGPK+A
Sbjct: 165 LIYPVSFYKNKAKHLKQTAKVLLEKYDG-----------DIPDSVEGLCSLPGVGPKMAY 213
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +I+ R
Sbjct: 214 LAMSCGWGRTVGIGVDTHVHRISNR 238
>gi|312880248|ref|ZP_07740048.1| DNA-(apurinic or apyrimidinic site) lyase [Aminomonas paucivorans
DSM 12260]
gi|310783539|gb|EFQ23937.1| DNA-(apurinic or apyrimidinic site) lyase [Aminomonas paucivorans
DSM 12260]
Length = 232
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
+ GA L+ LR A + L LPGVGPK AAC+ +F L A PVDTHV +I
Sbjct: 106 REDWGAPTLVPLRSWKPDRAREYLEALPGVGPKTAACVMVFDLG-FPAFPVDTHVARICR 164
Query: 303 R 303
R
Sbjct: 165 R 165
>gi|226356223|ref|YP_002785963.1| DNA-(apurinic or apyrimidinic site) lyase [Deinococcus deserti
VCD115]
gi|226318213|gb|ACO46209.1| putative DNA-(apurinic or apyrimidinic site) lyase (Endonuclease
III) [Deinococcus deserti VCD115]
Length = 237
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 53/100 (53%), Gaps = 6/100 (6%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A YI G + L + G L ++RKL+ QEA L LPGVG K A+ I LF L
Sbjct: 80 KAGYIHGILVHLDERL--GTLDLSAVRKLNDQEARTLLEGLPGVGMKTASLILLFDL-LR 136
Query: 289 HAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRVAEAFCE 328
A+PVDT++ +IA R EL R TP+ R +A
Sbjct: 137 PALPVDTNIERIAKRL---ELVPQRWTPEKVERWFDAVVR 173
>gi|159898923|ref|YP_001545170.1| DNA-(apurinic or apyrimidinic site) lyase [Herpetosiphon
aurantiacus DSM 785]
gi|159891962|gb|ABX05042.1| DNA-(apurinic or apyrimidinic site) lyase [Herpetosiphon
aurantiacus DSM 785]
Length = 222
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
LR+L L EA L LPG+GPK A C+ F+ +Q A+ VDTH+ ++A R
Sbjct: 108 LRELGLHEARAWLTALPGIGPKTAGCVLCFACNQ-PAMIVDTHIHRVAKR 156
>gi|409082857|gb|EKM83215.1| hypothetical protein AGABI1DRAFT_69456 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 383
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + +D LC+LPGVGPK+A + D +H I VD HV +I R
Sbjct: 208 DVPKTVDELCSLPGVGPKMAFLALQVAWDLNHGIGVDVHVHRITNR 253
>gi|242017392|ref|XP_002429173.1| endonuclease III, putative [Pediculus humanus corporis]
gi|212514051|gb|EEB16435.1| endonuclease III, putative [Pediculus humanus corporis]
Length = 292
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 18/108 (16%)
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L++ S +E+ + G L++ V ++ KY+ T VL+ K G
Sbjct: 138 LTIESIIEMSDDTLG-------ELIYPVGFWKQKTKYLKQTCQVLKEKFDG--------- 181
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ LC+LPGVG K+A + D I VDTHV +IA R
Sbjct: 182 --DIPNTVELLCSLPGVGLKMAHICMKTAWDVISGIGVDTHVHRIANR 227
>gi|383788033|ref|YP_005472601.1| putative DNA glycosylase [Caldisericum exile AZM16c01]
gi|381363669|dbj|BAL80498.1| putative DNA glycosylase [Caldisericum exile AZM16c01]
Length = 289
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 102/262 (38%), Gaps = 42/262 (16%)
Query: 47 ELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDV------CYHIHTSPSEP 100
+ SL T GQTFR+ G P G +I L+ N D+ Y + S E
Sbjct: 18 DFSLEKTLNCGQTFRFFNIGGYY-YYPYGDSVIKLQESVNNDITTLKVEIYGTYLSKDEF 76
Query: 101 AAKSAL---LDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCS 157
++ L +D++N I E ++ G R+++ P E + F+ S
Sbjct: 77 SSIFGLNHDIDYINKKII---------QITPILKEAIEFSKGIRLIQMPPYETTISFIFS 127
Query: 158 SNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQS 217
+ I IT ++ LA L + ++ +F+ FP E + + + + G+R
Sbjct: 128 IQSQIPVITSRLNKLAELTNRQIEIDNMKFYLFPKREDILELPIEAVHSLKLGFREKFFL 187
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L+ + E +LR +E + L ++ GVG KV+
Sbjct: 188 NLI-----------------------KNYTMEDFENLRDKPYEEKKNFLTSILGVGDKVS 224
Query: 278 ACIALFSLDQHHAIPVDTHVWK 299
CI LF PVDT + K
Sbjct: 225 ECIILFGYGDLTGFPVDTWIKK 246
>gi|426200726|gb|EKV50650.1| hypothetical protein AGABI2DRAFT_63644 [Agaricus bisporus var.
bisporus H97]
Length = 370
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + +D LC+LPGVGPK+A + D +H I VD HV +I R
Sbjct: 195 DVPKTVDELCSLPGVGPKMAFLALQVAWDLNHGIGVDVHVHRITNR 240
>gi|449273757|gb|EMC83166.1| Endonuclease III-like protein 1, partial [Columba livia]
Length = 256
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+++ V + KYI T +L+ K+ G D+ ++ L LPGVGPK+A
Sbjct: 118 QMIYPVGFWRNKVKYIKQTTAILKQKYGG-----------DIPRTVEELVQLPGVGPKMA 166
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ D I VDTHV +I R
Sbjct: 167 HLAMTIAWDSVSGIAVDTHVHRITNR 192
>gi|284042265|ref|YP_003392605.1| DNA-(apurinic or apyrimidinic site) lyase [Conexibacter woesei DSM
14684]
gi|283946486|gb|ADB49230.1| DNA-(apurinic or apyrimidinic site) lyase [Conexibacter woesei DSM
14684]
Length = 241
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I + + G L LR + + LC+LPGVG K AAC+ LF+
Sbjct: 104 KSKRIQQILRAIDDSPEGAGLDLSFLRDASVPDGQRYLCSLPGVGRKTAACVLLFAYGLR 163
Query: 289 HAIPVDTHVWKIATRYLL 306
+PVDTHV ++ R L
Sbjct: 164 D-VPVDTHVSRVGMRLRL 180
>gi|440791540|gb|ELR12778.1| base excision DNA repair protein, HhHGPD subfamily protein
[Acanthamoeba castellanii str. Neff]
Length = 494
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 212 RSAPQSSLLFSVRR---SFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
R+A S L +R + ++ I G ++ L + G L LR + +E AL
Sbjct: 233 RTAKPSQLADVIRTGGLAQRKGGIIQGLLNQLHEERDGELS-LEHLRDMPTEEVKHALTK 291
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G+GPK A+C+ +F++ + PVDTHV +++TR
Sbjct: 292 YKGIGPKTASCVLMFNM-KRADFPVDTHVHRVSTR 325
>gi|170093764|ref|XP_001878103.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164646557|gb|EDR10802.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 236
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + +D LC+LPGVGPK+A + D +H I VD HV +I R
Sbjct: 127 DVPKTVDELCSLPGVGPKMAFLALQVAWDLNHGIGVDVHVHRITNR 172
>gi|424811992|ref|ZP_18237232.1| putative EndoIII-related endonuclease [Candidatus Nanosalinarum sp.
J07AB56]
gi|339756214|gb|EGQ39797.1| putative EndoIII-related endonuclease [Candidatus Nanosalinarum sp.
J07AB56]
Length = 219
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 63/113 (55%), Gaps = 6/113 (5%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
AP+ L VR + +A+ I +++++ + +GG + L ++ ++A + L +L
Sbjct: 66 EAPEGELTDVVRVAGLGPTKAERIQKALELVREE-TGGDYSVDFLDEMSTEDAKEWLESL 124
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY-LLPELAGVRLTPKLCSR 321
PGVGPK AA + F + + PVDTHV +++ R+ ++PE + T ++ R
Sbjct: 125 PGVGPKTAAIVLCFHMGR-AVFPVDTHVHRLSKRWGIVPENTSRKKTHRIMER 176
>gi|156055880|ref|XP_001593864.1| hypothetical protein SS1G_05292 [Sclerotinia sclerotiorum 1980]
gi|154703076|gb|EDO02815.1| hypothetical protein SS1G_05292 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 551
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
+ LD A+D L LPG+G K AAC+ALF L + + VDTHVW+
Sbjct: 362 IHALDKPAAMDVLMDLPGIGVKTAACVALFCLGR-PSFAVDTHVWR 406
>gi|386002355|ref|YP_005920654.1| hypothetical protein Mhar_1672 [Methanosaeta harundinacea 6Ac]
gi|357210411|gb|AET65031.1| hypothetical protein Mhar_1672 [Methanosaeta harundinacea 6Ac]
Length = 133
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 3/65 (4%)
Query: 241 QSKHSGGAEW--LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVW 298
+ + +GG E L LR D +EA L +LPGVGPK AA + LF+ + +PVDTHV
Sbjct: 3 EREEAGGPERIDLEFLRGADAEEARQFLVSLPGVGPKTAAVVLLFAF-RMPLLPVDTHVN 61
Query: 299 KIATR 303
+++ R
Sbjct: 62 RLSRR 66
>gi|300120326|emb|CBK19880.2| unnamed protein product [Blastocystis hominis]
Length = 198
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 17/108 (15%)
Query: 202 VELRNAGFGYRSAPQSS------LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
++LR G + ++S L+ V K+ KYI T D+L K+ G
Sbjct: 16 LKLREYGLTPKHIQETSDEKLGELICKVGFWTKKVKYIKKTTDILLEKYDG--------- 66
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + I+ L LPGVGPK+ + ++ I VD HV +I+ R
Sbjct: 67 --DIPDTIEELVKLPGVGPKMGYLALKVAWNKIDGIGVDVHVHRISNR 112
>gi|448542485|ref|ZP_21624719.1| endonuclease III [Haloferax sp. ATCC BAA-646]
gi|448549955|ref|ZP_21628560.1| endonuclease III [Haloferax sp. ATCC BAA-645]
gi|445707085|gb|ELZ58948.1| endonuclease III [Haloferax sp. ATCC BAA-646]
gi|445713003|gb|ELZ64784.1| endonuclease III [Haloferax sp. ATCC BAA-645]
Length = 270
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIIEAIRSGGLYNQKSKLIQGVAEEVLADFGSEADFDRYVREEAPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGL 190
>gi|297621994|ref|YP_003710131.1| endonuclease III [Waddlia chondrophila WSU 86-1044]
gi|297377295|gb|ADI39125.1| endonuclease III [Waddlia chondrophila WSU 86-1044]
gi|337294254|emb|CCB92238.1| putative endonuclease III [Waddlia chondrophila 2032/99]
Length = 204
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 72/178 (40%), Gaps = 38/178 (21%)
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVEL 204
QDP L+ L S+ AR+ ++ L FH + +++++V E+
Sbjct: 25 QDPYTLLIAVLLSAQCTDARVNQITPIL--------------FHRADTPQQMAVVPVEEI 70
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
AP K+AK I G +L KH+G ++ + +
Sbjct: 71 EEIIRPCGLAP------------KKAKAIRGLSQILLDKHNG-----------NVPDTFE 107
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLTPKLCSRV 322
L LPGVG K A+ + + H A PVDTH+ + A R+ L V T K R+
Sbjct: 108 ELEALPGVGHKTAS-VVMSQAFHHPAFPVDTHIHRAAKRWGLSNGKSVEQTEKDLKRI 164
>gi|336375725|gb|EGO04061.1| hypothetical protein SERLA73DRAFT_175790 [Serpula lacrymans var.
lacrymans S7.3]
gi|336388834|gb|EGO29978.1| hypothetical protein SERLADRAFT_458392 [Serpula lacrymans var.
lacrymans S7.9]
Length = 366
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
K+AK I ++ ++ +H GA L L + +EA+ L + GVGPK AAC+ F L +
Sbjct: 194 KKAKTIQDILESVKQRH--GAYNLQHLADVPDEEAMRELVSYNGVGPKTAACVLSFCLGR 251
Query: 288 HHAIPVDTHVWKIA 301
+ VDTHV++++
Sbjct: 252 -ESFAVDTHVYRLS 264
>gi|325283930|ref|YP_004256471.1| HhH-GPD family protein [Deinococcus proteolyticus MRP]
gi|324315739|gb|ADY26854.1| HhH-GPD family protein [Deinococcus proteolyticus MRP]
Length = 248
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 51/94 (54%), Gaps = 6/94 (6%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+AP + ++RRS ++A I T+ ++ K G L L +L + EA+ L L
Sbjct: 85 AAPVEQVAHAIRRSNYPDQKAPRIQATLRAIREKRGGYD--LDFLAELPVAEALSWLTDL 142
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
PGVG K A+ + LF+ + PVDTHV +I TR
Sbjct: 143 PGVGVKTASLVLLFNYAR-PVFPVDTHVHRINTR 175
>gi|429858184|gb|ELA33014.1| base excision dna repair protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 309
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
++A+ L + GVGPK A+C+ LF L Q + VDTHVW+I
Sbjct: 223 EDAMQELISFQGVGPKTASCVLLFCL-QRESFAVDTHVWRI 262
>gi|384490585|gb|EIE81807.1| hypothetical protein RO3G_06512 [Rhizopus delemar RA 99-880]
Length = 322
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
L+ SV K+A YI T ++L+ ++ G D+ + I+ L +LPGVGPK+
Sbjct: 141 DQLIRSVGFHSKKAVYIKKTAEILRDQYDG-----------DIPDTIEGLTSLPGVGPKM 189
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
+ +++ I VD HV +I R
Sbjct: 190 GYLTLQVAWNKNLGIGVDVHVHRITNR 216
>gi|448681086|ref|ZP_21691232.1| endonuclease III [Haloarcula argentinensis DSM 12282]
gi|445768144|gb|EMA19231.1| endonuclease III [Haloarcula argentinensis DSM 12282]
Length = 277
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 31/53 (58%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
+R+ D +E L + GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 148 VRESDPEEVRSTLLDMNGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGL 200
>gi|356513571|ref|XP_003525486.1| PREDICTED: endonuclease III-like [Glycine max]
Length = 284
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 17/113 (15%)
Query: 190 FPSLERLSLVSEVELRNA-GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA 248
FPS E++ ++ NA G + ++S + +V R ++ + G
Sbjct: 114 FPSWEQVLWAESKDVENAIRCGGLAPTKASCIKNVLRCLRERR---------------GE 158
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
L LR L + E L G+GPK AC+ +F+L Q PVDTH+++IA
Sbjct: 159 LCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNL-QQDDFPVDTHIFEIA 210
>gi|397691245|ref|YP_006528499.1| HhH-GPD family protein [Melioribacter roseus P3M]
gi|395812737|gb|AFN75486.1| HhH-GPD family protein [Melioribacter roseus P3M]
Length = 221
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 6/78 (7%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL-----PE 308
++K+D +EA++ L + G+G K A+C+ LFSL + PVDTHV + R + PE
Sbjct: 95 IKKMDNKEALEYLTSFKGIGIKTASCVLLFSL-YRNVCPVDTHVHRTLNRIGIVNEKSPE 153
Query: 309 LAGVRLTPKLCSRVAEAF 326
+R+ L +A +
Sbjct: 154 KTFLRINENLPEGIAHSL 171
>gi|359417776|ref|ZP_09209845.1| HhH-GPD family protein [Candidatus Haloredivivus sp. G17]
gi|358031869|gb|EHK00704.1| HhH-GPD family protein [Candidatus Haloredivivus sp. G17]
Length = 215
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 58/108 (53%), Gaps = 7/108 (6%)
Query: 212 RSAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCT 268
R A + L+ ++R + +A+ I ++++ + G+ + L +L L E L
Sbjct: 66 RYADKEELIDTIRIAGLGPTKAERIQEALNIIYEEE--GSYSVEFLDELSLDEGKKWLTD 123
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY-LLPELAGVRLT 315
+PG+GPK A+ I F + IPVDTHV ++A RY L+PE + + T
Sbjct: 124 IPGIGPKTASVILGFHF-KKPTIPVDTHVHRLANRYGLVPEGSSRKRT 170
>gi|443717727|gb|ELU08655.1| hypothetical protein CAPTEDRAFT_25579, partial [Capitella teleta]
Length = 231
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++ +YI T +L+S S +D+ ++ LC+LPGVGPK+A + + +
Sbjct: 100 RKVEYIKKTSKILKSDFS-----------MDIPNTVEGLCSLPGVGPKMAHLVMKSAWGE 148
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I R
Sbjct: 149 ISGIGVDTHVHRICNR 164
>gi|242780797|ref|XP_002479670.1| HhH-GPD family base excision DNA repair protein [Talaromyces
stipitatus ATCC 10500]
gi|218719817|gb|EED19236.1| HhH-GPD family base excision DNA repair protein [Talaromyces
stipitatus ATCC 10500]
Length = 407
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ LD +EA+ L PG+GPK AAC+ LF L Q VDTH+++I
Sbjct: 309 IHALDKEEAMLELIKFPGIGPKTAACVVLFCL-QRPCFAVDTHIFRI 354
>gi|86609598|ref|YP_478360.1| base excision DNA repair protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558140|gb|ABD03097.1| base excision DNA repair protein, HhH-GPD family [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 212
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A I + +Q + G+ L LR LD A+D L +L GVG K A C+ LF L +
Sbjct: 79 KAARIQEILAAIQERQ--GSLSLDFLRDLDDAGALDFLLSLKGVGLKTATCVLLFGLGR- 135
Query: 289 HAIPVDTHVWKIATRYLL-----PELAGVRLTPKL 318
PVDTHV ++A R L P+ +L+P++
Sbjct: 136 DLCPVDTHVHRVANRLGLVRAKHPDDTFAQLSPQI 170
>gi|332028140|gb|EGI68191.1| Endonuclease III-like protein 1 [Acromyrmex echinatior]
Length = 341
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V K+ +YI T +L K+ G D+ + + LC LPGVGPK+A
Sbjct: 199 LIYPVGFWKKKVEYIKKTSVILLDKYGG-----------DIPKTVKELCELPGVGPKMAH 247
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ + I VDTHV +IA R
Sbjct: 248 LCMRTAWGEVSGIGVDTHVHRIANR 272
>gi|167043328|gb|ABZ08032.1| hypothetical protein ALOHA_HF4000ANIW141N1ctg1g7 [uncultured marine
crenarchaeote HF4000_ANIW141N1]
Length = 156
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 62/137 (45%), Gaps = 14/137 (10%)
Query: 45 QSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKS 104
+ +++ T +GQ F W+ + + G G +I +KH + I T EP
Sbjct: 19 HTNINIHETINSGQVFLWENHKNVWF-GIDGSDIIQVKHEP-----FEILTLSKEPK--- 69
Query: 105 ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIAR 164
DF + ++ + S D + K+ G R+ RQDP +C + F+ S+N+NI+
Sbjct: 70 ---DFFRNDDNYKKILKSIS-HDKIVKKAVKHYPGLRITRQDPFQCCISFIISANSNISN 125
Query: 165 I-TKMVDFLASLGSHLG 180
I T++ G+ +
Sbjct: 126 IKTRLQKLCRKFGTRVN 142
>gi|161527564|ref|YP_001581390.1| DNA-(apurinic or apyrimidinic site) lyase [Nitrosopumilus maritimus
SCM1]
gi|160338865|gb|ABX11952.1| DNA-(apurinic or apyrimidinic site) lyase [Nitrosopumilus maritimus
SCM1]
Length = 218
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 12/80 (15%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I ++ SK+ G + E +D L LPGVG K A C+ +++ ++
Sbjct: 91 KSKRIIEVAKIIDSKYKGK-----------VPEDLDTLVQLPGVGRKTANCVLVYAFEK- 138
Query: 289 HAIPVDTHVWKIATRYLLPE 308
AIPVD HV +I+ R L E
Sbjct: 139 PAIPVDIHVHRISNRLGLVE 158
>gi|288575002|ref|ZP_06393359.1| HhH-GPD family protein [Dethiosulfovibrio peptidovorans DSM 11002]
gi|288570743|gb|EFC92300.1| HhH-GPD family protein [Dethiosulfovibrio peptidovorans DSM 11002]
Length = 238
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 237 VDVLQSKHSGGAEWLLSLRKLDLQEAIDA-LCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+DVL+ H E+ L+ K + + A L LPGVGPK AAC+ +F +D A PVDT
Sbjct: 109 LDVLKIIHDELGEYGLTGLKYRDHDGVRAFLEGLPGVGPKTAACVLVFDMDI-PAFPVDT 167
Query: 296 HVWKIATR 303
HV + R
Sbjct: 168 HVARFCRR 175
>gi|255540783|ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis]
gi|223550571|gb|EEF52058.1| Endonuclease III, putative [Ricinus communis]
Length = 291
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 4/88 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+A I ++ L K G L LR + + E L GVGPK AC+ +F L Q
Sbjct: 142 KASCIKNILNCLLEKK--GKICLEYLRDMSVDEIKAELSQFKGVGPKTVACVLMFHL-QQ 198
Query: 289 HAIPVDTHVWKIATRY-LLPELAGVRLT 315
PVDTHV++IA +PE+A T
Sbjct: 199 EDFPVDTHVFEIAKALGWVPEVADRNKT 226
>gi|448625246|ref|ZP_21671013.1| endonuclease III [Haloferax denitrificans ATCC 35960]
gi|445749008|gb|EMA00454.1| endonuclease III [Haloferax denitrificans ATCC 35960]
Length = 258
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIVDAIRSGGLYNQKSKLIQGVAEEVLADFGSEADFDRYVREEAPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|448728587|ref|ZP_21710911.1| endonuclease III [Halococcus saccharolyticus DSM 5350]
gi|445796772|gb|EMA47269.1| endonuclease III [Halococcus saccharolyticus DSM 5350]
Length = 272
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 33/62 (53%)
Query: 242 SKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
++ AE+ + + D + DAL GVGPK A C+ LFS + PVDTHV +I
Sbjct: 129 DEYGSAAEFDAFVTEKDPETVRDALLDFSGVGPKTADCVLLFSGGREGVFPVDTHVHRIY 188
Query: 302 TR 303
R
Sbjct: 189 RR 190
>gi|380488747|emb|CCF37167.1| HhH-GPD superfamily base excision DNA repair protein
[Colletotrichum higginsianum]
Length = 439
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 11/84 (13%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ L+++V + KYI + ++L+ K G D+ + I+ L +LPGVGPK+
Sbjct: 263 NELIWAVGFHNNKTKYIKASAEILRDKFDG-----------DIPDTIEGLTSLPGVGPKM 311
Query: 277 AACIALFSLDQHHAIPVDTHVWKI 300
A + D+ I VD HV +I
Sbjct: 312 AYLCLSAAWDRTEGIGVDVHVHRI 335
>gi|288560832|ref|YP_003424318.1| endonuclease III Nth [Methanobrevibacter ruminantium M1]
gi|288543542|gb|ADC47426.1| endonuclease III Nth [Methanobrevibacter ruminantium M1]
Length = 215
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 304
+D + LPGVG K A C+ +F+ Q AIPVDTHV +I+ R+
Sbjct: 112 MDEMLKLPGVGRKTANCVIVFAF-QDAAIPVDTHVHRISNRW 152
>gi|256071646|ref|XP_002572150.1| endonuclease III [Schistosoma mansoni]
gi|353229188|emb|CCD75359.1| putative endonuclease III [Schistosoma mansoni]
Length = 260
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 212 RSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 271
++ L++ V +A I T ++L+ K++ D+ E ++ LCTLPG
Sbjct: 109 KTGELQDLIYPVGFYKTKALNIKKTCEILKEKYNS-----------DIPETVEELCTLPG 157
Query: 272 VGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
VGPK+A + + I VDTHV +I R
Sbjct: 158 VGPKMAYLAMQCAWKKVTGIGVDTHVHRIVNR 189
>gi|410917257|ref|XP_003972103.1| PREDICTED: endonuclease III-like protein 1-like [Takifugu rubripes]
Length = 357
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KY+ T +LQ + G D+ ++++ L LPGVGPK+A
Sbjct: 188 LIYPVGFWRNKVKYLKLTSAMLQKEFGG-----------DIPDSVEGLVRLPGVGPKMAH 236
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ DQ I VDTHV +I+ R
Sbjct: 237 LAMDIAWDQVSGIGVDTHVHRISNR 261
>gi|410985533|ref|XP_003999075.1| PREDICTED: endonuclease III-like protein 1 [Felis catus]
Length = 323
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ +++ L LPGVGPK+A
Sbjct: 185 TLIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPDSVAELVALPGVGPKMA 233
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 234 HLAMAVAWGTVSGIAVDTHVHRIANR 259
>gi|212526396|ref|XP_002143355.1| HhH-GPD family base excision DNA repair protein [Talaromyces
marneffei ATCC 18224]
gi|210072753|gb|EEA26840.1| HhH-GPD family base excision DNA repair protein [Talaromyces
marneffei ATCC 18224]
Length = 449
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ LD +EA+ L PG+GPK AAC+ LF L Q VDTH+++I
Sbjct: 323 IHALDKEEAMLELIKYPGIGPKTAACVVLFCL-QRPCFAVDTHIFRI 368
>gi|395217179|ref|ZP_10401538.1| HhH-GPD family protein [Pontibacter sp. BAB1700]
gi|394455098|gb|EJF09644.1| HhH-GPD family protein [Pontibacter sp. BAB1700]
Length = 235
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 73/155 (47%), Gaps = 18/155 (11%)
Query: 164 RITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSV 223
R + M + ++++ SH N + + LER G +AP+ L +V
Sbjct: 31 RRSHMHELISTMLSHRTNHADEQQAYYTMLERFGDWE---------GVLNAPEDELADAV 81
Query: 224 ---RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACI 280
R ++A I T+ ++ K G + L L + EA+ L LPGVGPK A +
Sbjct: 82 STTRYPLQKAPQIQQTLRMI--KEDRGEINIDFLADLPIDEAMAWLTRLPGVGPKTATLL 139
Query: 281 ALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRLT 315
LF+ + +PVDTHV++++ R L G ++T
Sbjct: 140 LLFNFKK-PVLPVDTHVFRVSQRV---GLIGAKIT 170
>gi|448637456|ref|ZP_21675694.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
gi|445764303|gb|EMA15458.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
Length = 278
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 35/64 (54%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
++ G A + +R D ++ L + GVGPK A C+ LF+ + PVDTHV +IA
Sbjct: 138 EYGGEAGFDAFVRDSDSEKVRSTLLDMNGVGPKTADCVLLFAGGRGGVFPVDTHVHRIAR 197
Query: 303 RYLL 306
R L
Sbjct: 198 RMGL 201
>gi|340716729|ref|XP_003396847.1| PREDICTED: endonuclease III-like protein 1-like [Bombus terrestris]
Length = 352
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V ++ +YI T +L K++G D+ ++ LC LPGVGPK+A
Sbjct: 209 LIYPVGFWKRKVQYIKKTSKILIEKYNG-----------DIPRTLEELCQLPGVGPKMAH 257
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ + I VDTHV +I R
Sbjct: 258 ICMQIAWGEVSGIGVDTHVHRICNR 282
>gi|343520720|ref|ZP_08757689.1| base excision DNA repair protein, HhH-GPD family [Parvimonas sp.
oral taxon 393 str. F0440]
gi|343397678|gb|EGV10212.1| base excision DNA repair protein, HhH-GPD family [Parvimonas sp.
oral taxon 393 str. F0440]
Length = 153
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 62/148 (41%), Gaps = 22/148 (14%)
Query: 176 GSHLGNVEGFEFHEFPSLERLSLVSEVELRN-AGFGYRSAPQSSLLFSVRRSFKQAKYIT 234
G ++ E +++ FP++E LS + +LR A G+R K I
Sbjct: 4 GKYICEDENRKYYSFPTVEELSKATVEDLREFAKVGFRD-----------------KRIF 46
Query: 235 GTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
TV+++ ++ L + L+ + L GVG KVA CI LFS + PVD
Sbjct: 47 DTVNMILNEKID----LDNFENLETDILREELLKFAGVGNKVADCIMLFSYKRGEVFPVD 102
Query: 295 THVWKIATRYLLPELAGVRLTPKLCSRV 322
+ ++ + E V+ K R+
Sbjct: 103 VWIKRVMEELFIKEETPVKKISKEADRI 130
>gi|86605550|ref|YP_474313.1| base excision DNA repair protein [Synechococcus sp. JA-3-3Ab]
gi|86554092|gb|ABC99050.1| base excision DNA repair protein, HhH-GPD family [Synechococcus sp.
JA-3-3Ab]
Length = 222
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL-----PE 308
LR LD +A+ L +L GVG K AAC+ LF L + PVDTHV ++A R L P+
Sbjct: 110 LRDLDDADALAYLLSLKGVGLKTAACVLLFGLGR-DLCPVDTHVHRVANRLGLVRARHPD 168
Query: 309 LAGVRLTPKL 318
+L+P++
Sbjct: 169 DTFAQLSPRI 178
>gi|448652011|ref|ZP_21681024.1| endonuclease III [Haloarcula californiae ATCC 33799]
gi|445769414|gb|EMA20488.1| endonuclease III [Haloarcula californiae ATCC 33799]
Length = 238
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 73/165 (44%), Gaps = 17/165 (10%)
Query: 145 QDPVECLLQFLCSSN-NNIARITKMVDFLASLGS-HLGNVEGFEFHEFPSLERLSLVSEV 202
+D ECL++ + S N ++ A + D +A G N EG + S + +++
Sbjct: 51 RDAFECLVRTILSQNTSDKASQSAHDDLMARYGGGEDANSEG----DIDSTDLARALADA 106
Query: 203 ELRNAGFGYRSAPQSSLLFSVRRSFKQ-AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQE 261
+ P+ + S + Q ++ I + ++ G A + +R D ++
Sbjct: 107 D----------QPELAETISSAGLYNQKSERIIALAQRICEEYGGEAGFDAFVRDSDSEK 156
Query: 262 AIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
L + GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 157 VRSTLLDMNGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 201
>gi|302410315|ref|XP_003002991.1| base excision DNA repair protein [Verticillium albo-atrum VaMs.102]
gi|261358015|gb|EEY20443.1| base excision DNA repair protein [Verticillium albo-atrum VaMs.102]
Length = 327
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I G ++ Q+K GA L + +EA+ L GVGPK A+C+ LF L +
Sbjct: 172 KSKVILGILE--QAKARYGAYSLDHMFDKTDEEAMRELIGFQGVGPKTASCVLLFCL-RR 228
Query: 289 HAIPVDTHVWKI 300
+ VDTHVW+I
Sbjct: 229 ESFAVDTHVWRI 240
>gi|346321129|gb|EGX90729.1| HhH-GPD family base excision DNA repair protein [Cordyceps
militaris CM01]
Length = 330
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 237 VDVLQSKHSG-GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
V++L + HS G L L EA++ L PGVGPK A+C+ LF L Q + VDT
Sbjct: 201 VNLLAAVHSKYGMYSLDHLFHASDTEAMEELLAFPGVGPKTASCVLLFCL-QRPSFAVDT 259
Query: 296 HVWKIA 301
HV++IA
Sbjct: 260 HVYRIA 265
>gi|291226292|ref|XP_002733128.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 318
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V ++ +YI T +L+S++ D+ I LC LPGVGPK+A
Sbjct: 172 LIYPVGFWKRKVEYIKKTSTLLESQYDN-----------DIPSTISELCQLPGVGPKMAY 220
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ Q I VDTHV +I+ R
Sbjct: 221 LCMNIAWHQTTGIGVDTHVHRISNR 245
>gi|402225782|gb|EJU05843.1| DNA glycosylase [Dacryopinax sp. DJM-731 SS1]
Length = 321
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++ +Y+ T++ L+ + G D+ + +D LC+LPGVGPK+A + D
Sbjct: 108 RKTQYLRQTMEKLRDEFDG-----------DVPKTVDELCSLPGVGPKMAFLCLQNAWDI 156
Query: 288 HHAIPVDTHVWKIATR 303
+ I VDTHV +I R
Sbjct: 157 NVGIGVDTHVHRITNR 172
>gi|164663005|ref|XP_001732624.1| hypothetical protein MGL_0399 [Malassezia globosa CBS 7966]
gi|159106527|gb|EDP45410.1| hypothetical protein MGL_0399 [Malassezia globosa CBS 7966]
Length = 663
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 12/76 (15%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++ Y+ +L+ KH G D+ + +D LC +PGVGPK+ A + + S+
Sbjct: 484 RKTDYLKTMTRILEEKHHG-----------DVPQTVDELCEIPGVGPKM-AFLQMQSMGL 531
Query: 288 HHAIPVDTHVWKIATR 303
+ I VDTHV +I+ R
Sbjct: 532 NVGIGVDTHVHRISNR 547
>gi|167997017|ref|XP_001751215.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697196|gb|EDQ83532.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 272
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 3/73 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+AK I +D + ++ G+ L +R +++ + L GVGPK AC+ +F L+Q+
Sbjct: 120 KAKRIINILDTIFTER--GSICLEYVRSMNVDQIKAELSRFKGVGPKTVACVLMFHLEQN 177
Query: 289 HAIPVDTHVWKIA 301
PVDTHV++++
Sbjct: 178 E-FPVDTHVFRLS 189
>gi|292655041|ref|YP_003534938.1| endonuclease III [Haloferax volcanii DS2]
gi|448292657|ref|ZP_21483063.1| endonuclease III [Haloferax volcanii DS2]
gi|291372380|gb|ADE04607.1| endonuclease III [Haloferax volcanii DS2]
gi|445572413|gb|ELY26953.1| endonuclease III [Haloferax volcanii DS2]
Length = 268
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIVEAIRSGGLYNQKSKLIQGVAEEVLADFGSEADFDRYVREEAPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|37521390|ref|NP_924767.1| endonuclease III [Gloeobacter violaceus PCC 7421]
gi|35212387|dbj|BAC89762.1| endonuclease III [Gloeobacter violaceus PCC 7421]
Length = 220
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 52/119 (43%), Gaps = 28/119 (23%)
Query: 187 FHEFPSLERLSLVSEVELRN--AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
F +P + L+ E EL AG YR A L+ +L K+
Sbjct: 60 FARYPDMASLAAADEKELLVLLAGSEYREAKGPRLI--------------AMATILLEKY 105
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G + + IDAL LPG+G K A C+ +++ ++ AI VDTH+ KIA R
Sbjct: 106 GG-----------RVPDDIDALLALPGIGRKTANCVLIYAFNR-EAICVDTHMHKIANR 152
>gi|408403428|ref|YP_006861411.1| endonuclease III [Candidatus Nitrososphaera gargensis Ga9.2]
gi|408364024|gb|AFU57754.1| putative endonuclease III [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 223
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ID+L LPGVG K A C+ +++ D+ AIPVD HV +I+ R
Sbjct: 113 IDSLLKLPGVGRKTANCVLVYAFDK-PAIPVDVHVHRISNR 152
>gi|15807425|ref|NP_296158.1| endonuclease III [Deinococcus radiodurans R1]
gi|6460251|gb|AAF11977.1|AE002073_7 endonuclease III [Deinococcus radiodurans R1]
Length = 259
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 52/91 (57%), Gaps = 2/91 (2%)
Query: 214 APQSSLLFSVRRS-FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGV 272
AP ++ ++RRS + ++K + + K + G L LR L +++A+ L LPGV
Sbjct: 86 APTDAVAHAIRRSNYPESKAPRIQETLRRIKAAPGGYDLDFLRDLPVKDALKWLTDLPGV 145
Query: 273 GPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G K A+ + LF+ + PVDTHV +++TR
Sbjct: 146 GVKTASLVLLFNYAR-PVFPVDTHVHRVSTR 175
>gi|340345847|ref|ZP_08668979.1| DNA-(Apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum koreensis MY1]
gi|339520988|gb|EGP94711.1| DNA-(Apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum koreensis MY1]
Length = 217
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 12/75 (16%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+++ I ++ SK+SG + + +D L LPGVG K A C+ +++ D+
Sbjct: 91 KSRRIIEVSKIIDSKYSGI-----------VPDTLDELIQLPGVGRKTANCVLVYAFDK- 138
Query: 289 HAIPVDTHVWKIATR 303
AIPVD HV +I+ R
Sbjct: 139 PAIPVDIHVHRISNR 153
>gi|347754956|ref|YP_004862520.1| 3-methyladenine DNA glycosylase [Candidatus Chloracidobacterium
thermophilum B]
gi|347587474|gb|AEP12004.1| 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[Candidatus Chloracidobacterium thermophilum B]
Length = 352
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 66/163 (40%), Gaps = 25/163 (15%)
Query: 136 YLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLER 195
+L G R+LRQ E ++ F+ SSNN++ RI +++ ++ H + F P+ E
Sbjct: 143 HLTGLRLLRQPWFEVMVSFVISSNNHLPRIRQIISIISRTFGHPITPTDYTF---PTPEA 199
Query: 196 LSLVSEVELRNA-GFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ LR+ GYR L + + + +H
Sbjct: 200 LAAACPGTLRHTCRVGYRDHALHQLATQIAHNLTFWETAAACPTPELRRH---------- 249
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
L LPGVGPKVA CI LF + A P+D V
Sbjct: 250 -----------LLALPGVGPKVAECILLFGFHRWEAFPIDVWV 281
>gi|378733828|gb|EHY60287.1| endonuclease III [Exophiala dermatitidis NIH/UT8656]
Length = 442
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I ++ + K+ G L L ++A+ + + GVGPK A+C+ LF L Q
Sbjct: 282 KSKVIISILNQVYEKY--GVYSLDHLHSASSEDAMREMLSFQGVGPKTASCVLLFCL-QR 338
Query: 289 HAIPVDTHVWKI 300
+ VDTHVW+I
Sbjct: 339 ESFAVDTHVWRI 350
>gi|448572049|ref|ZP_21640138.1| endonuclease III [Haloferax lucentense DSM 14919]
gi|448596871|ref|ZP_21654009.1| endonuclease III [Haloferax alexandrinus JCM 10717]
gi|445721082|gb|ELZ72751.1| endonuclease III [Haloferax lucentense DSM 14919]
gi|445740752|gb|ELZ92257.1| endonuclease III [Haloferax alexandrinus JCM 10717]
Length = 268
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIVEAIRSGGLYNQKSKLIRGVAEEVLADFGSEADFDRYVREEAPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|238588806|ref|XP_002391837.1| hypothetical protein MPER_08677 [Moniliophthora perniciosa FA553]
gi|215457043|gb|EEB92767.1| hypothetical protein MPER_08677 [Moniliophthora perniciosa FA553]
Length = 305
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + +D LC+LPGVGPK+A + + +H I VD HV +I R
Sbjct: 229 DVPKTVDELCSLPGVGPKMAFLALQVAWNLNHGIGVDVHVHRITNR 274
>gi|307196709|gb|EFN78168.1| Endonuclease III-like protein 1 [Harpegnathos saltator]
Length = 368
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V ++ +YI T +L +K++G D+ I LC LPGVGPK+A
Sbjct: 225 LIYPVGFWKRKVEYIKKTSVILLNKYNG-----------DIPRTIKELCDLPGVGPKMAH 273
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ + I VDTHV +I+ R
Sbjct: 274 ICMQIAWGEVSGIGVDTHVHRISNR 298
>gi|226357357|ref|YP_002787097.1| DNA-(apurinic or apyrimidinic site) lyase [Deinococcus deserti
VCD115]
gi|226319347|gb|ACO47343.1| putative DNA-(apurinic or apyrimidinic site) lyase (endonuclease
III) [Deinococcus deserti VCD115]
Length = 247
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 53/94 (56%), Gaps = 6/94 (6%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+AP ++ ++RRS +A I T+ + + + G L LR+L +++A+ L L
Sbjct: 78 AAPTEAVAHAIRRSNYPESKAPRIQATLRAI--RDAPGGYNLDFLRELPVKDALKWLTDL 135
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
PGVG K A+ + LF+ + PVDTHV ++ TR
Sbjct: 136 PGVGIKTASLVLLFNYAR-PVFPVDTHVHRVNTR 168
>gi|448733380|ref|ZP_21715625.1| HhH-GPD family protein [Halococcus salifodinae DSM 8989]
gi|445803114|gb|EMA53414.1| HhH-GPD family protein [Halococcus salifodinae DSM 8989]
Length = 279
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%)
Query: 240 LQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWK 299
+ ++ G E+ + + D + DAL GVGPK A C+ LFS + PVDTHV +
Sbjct: 134 IVDEYGGAEEFNGFVAEGDPETVRDALLDFSGVGPKTADCVLLFSGGRAGVFPVDTHVHR 193
Query: 300 IATR 303
I R
Sbjct: 194 IYRR 197
>gi|433424488|ref|ZP_20406484.1| endonuclease III [Haloferax sp. BAB2207]
gi|432198075|gb|ELK54399.1| endonuclease III [Haloferax sp. BAB2207]
Length = 268
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIVEAIRSGGLYNQKSKLIRGVAEEVLADFGSEADFDRYVREEAPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|398411994|ref|XP_003857329.1| hypothetical protein MYCGRDRAFT_34626 [Zymoseptoria tritici IPO323]
gi|339477214|gb|EGP92305.1| hypothetical protein MYCGRDRAFT_34626 [Zymoseptoria tritici IPO323]
Length = 407
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 11/86 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
++L+ V + KYI T ++L+ K +G D+ E I+ L +LPGVGPK+
Sbjct: 205 NNLIAKVGFHNNKTKYIKATAELLRDKWNG-----------DIPETIEGLVSLPGVGPKM 253
Query: 277 AACIALFSLDQHHAIPVDTHVWKIAT 302
A + + I VD HV +I+
Sbjct: 254 AYLTMSSAWGRDEGIGVDVHVHRISN 279
>gi|329766249|ref|ZP_08257807.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum limnia SFB1]
gi|329137308|gb|EGG41586.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum limnia SFB1]
Length = 170
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 18/99 (18%)
Query: 219 LLFSVRRSFKQAKYITG--TVDVLQSKHSGGAEWLLSLRKLDLQEAIDA----------- 265
+LFS +K AK + T DV + S G + S R +++ + ID+
Sbjct: 50 VLFS---KYKNAKELANAKTKDVEKIIKSIGFYHVKSKRIIEVAKIIDSKYKGKVPDNLE 106
Query: 266 -LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L LPGVG K A C+ +++ D+ AIPVD HV +I+ R
Sbjct: 107 KLVELPGVGRKTANCVLVYAFDK-PAIPVDIHVHRISNR 144
>gi|269792590|ref|YP_003317494.1| DNA-(apurinic or apyrimidinic site) lyase [Thermanaerovibrio
acidaminovorans DSM 6589]
gi|269100225|gb|ACZ19212.1| DNA-(apurinic or apyrimidinic site) lyase [Thermanaerovibrio
acidaminovorans DSM 6589]
Length = 232
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 23/118 (19%)
Query: 190 FPSLERL--SLVSEVE--LRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHS 245
FPS E + + V ++E +R AG G A R K+ Y + K +
Sbjct: 72 FPSWESVMEAPVEDLEGAIRVAGLGASKA----------RRIKEVLY--------KVKET 113
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
G L ++R E L TLPGVGPK AC+ +F L A PVDTHV +++ R
Sbjct: 114 LGTLSLGAMRSWRRDEVEAFLSTLPGVGPKTVACVLVFDLG-IPAFPVDTHVGRLSVR 170
>gi|121703826|ref|XP_001270177.1| helix-hairpin-helix motif protein [Aspergillus clavatus NRRL 1]
gi|119398321|gb|EAW08751.1| helix-hairpin-helix motif protein [Aspergillus clavatus NRRL 1]
Length = 465
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
L L L +EA+ L PG+GPK AAC+ LF L Q VDTH+++I+
Sbjct: 326 LNHLHHLSTEEAMTELVKYPGIGPKTAACVLLFCL-QRPCFAVDTHIFRIS 375
>gi|357454433|ref|XP_003597497.1| Endonuclease III-like protein [Medicago truncatula]
gi|355486545|gb|AES67748.1| Endonuclease III-like protein [Medicago truncatula]
Length = 331
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 11/84 (13%)
Query: 220 LFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAAC 279
++ V ++A + D+ +K+ G D+ +I+ L +LPGVGPK+A
Sbjct: 170 IYPVGFYIRKAGNLKKIADICLTKYDG-----------DIPNSIEELLSLPGVGPKIAHL 218
Query: 280 IALFSLDQHHAIPVDTHVWKIATR 303
+ + + + I VDTHV +I+ R
Sbjct: 219 VMIIAWNNVQGICVDTHVHRISNR 242
>gi|269986126|gb|EEZ92440.1| 8-oxoguanine DNA glycosylase domain protein [Candidatus
Parvarchaeum acidiphilum ARMAN-4]
Length = 289
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 103/256 (40%), Gaps = 40/256 (15%)
Query: 48 LSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYHIHTSPSEPAAKSALL 107
+SL T +GQ F + K Y+ P L+ Q G+ ++T E K
Sbjct: 23 VSLDRTLNSGQIFSFFKEDNKWYSFVDRP----LEIRQEGNNL--VYTGLDEMGVK---- 72
Query: 108 DFLNMGISLGELWEGFSASDCRFAELA-KYLAGARVLRQDPVECLLQFLCSSNNNIARIT 166
+ L + + ++ S D F E A Y RV++ L F+ S +N+ I
Sbjct: 73 ELLGLNDPINDIKAEISKDD--FIEKAINYSGSVRVVKSGIWPATLGFVLSIQSNVNLIL 130
Query: 167 KMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRS 226
+ ++ +++ G++ G FPS + + N GY Q +L F +
Sbjct: 131 RRINAMSNYYGKEGDINGKLLKSFPSFQEI-------YEN---GYNKLKQFNLGFRTKFV 180
Query: 227 FKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
F A+Y K+ +++ + ++ + L + GVG KV CI L+ L
Sbjct: 181 FSAAEYFY--------KNEINSDFTI-------EQIKENLTNIYGVGEKVLDCILLYGLH 225
Query: 287 QHHAIPVDTHVWKIAT 302
A P+D VW + T
Sbjct: 226 DLSAFPMD--VWILRT 239
>gi|195030160|ref|XP_001987936.1| GH10834 [Drosophila grimshawi]
gi|193903936|gb|EDW02803.1| GH10834 [Drosophila grimshawi]
Length = 373
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 77/175 (44%), Gaps = 26/175 (14%)
Query: 135 KYLAGARVLRQD---PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEF 190
++L R++R PV+ L C+ N A+ + +A L S + FE
Sbjct: 142 QHLENIRLMRSQKPAPVDTLGCHQCADINADAKTQRFHKLVALMLSSQTKDETTFE---- 197
Query: 191 PSLERLS--LVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGA 248
+++RL ++ +++ G +LL V +AKY+ T +L K++
Sbjct: 198 -AMKRLKAQTLTPASMQSMPVGVLE----NLLHPVSFYKNKAKYLKKTSQILVDKYNE-- 250
Query: 249 EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + I L LPGVGPK+A + +Q I VDTHV +IA R
Sbjct: 251 ---------DIPDNIPELLKLPGVGPKMAHICMATAWNQITGIGVDTHVHRIANR 296
>gi|344212676|ref|YP_004796996.1| endonuclease III [Haloarcula hispanica ATCC 33960]
gi|343784031|gb|AEM58008.1| endonuclease III [Haloarcula hispanica ATCC 33960]
Length = 277
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 41/79 (51%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++++ I + ++ G A + +R D ++ L + GVGPK A C+ LF+ +
Sbjct: 122 QKSERIIALAQRICEEYGGEAGFDEFVRDSDPEKVRSTLLDMNGVGPKTADCVLLFAGGR 181
Query: 288 HHAIPVDTHVWKIATRYLL 306
PVDTHV +IA R L
Sbjct: 182 GGVFPVDTHVHRIARRMGL 200
>gi|335437951|ref|ZP_08560708.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus tiamatea
SARL4B]
gi|334893555|gb|EGM31766.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus tiamatea
SARL4B]
Length = 281
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 41/76 (53%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
+++ I+ + ++ K+ G E+ +R+ + + L + GVGPK A C+ LF+ +
Sbjct: 124 QKSARISQIAERVREKYGGEDEFDAFVREEPAEAVRETLLAMTGVGPKTADCVLLFAGGR 183
Query: 288 HHAIPVDTHVWKIATR 303
PVDTHV +I R
Sbjct: 184 DGVFPVDTHVHRIYRR 199
>gi|170101314|ref|XP_001881874.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164643229|gb|EDR07482.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 308
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 15/108 (13%)
Query: 205 RNAGFGYRSAPQSSLLFSVRR---SFKQAKYITGTVDVLQSKHSGGAEWLL--------S 253
RN AP+ L+ ++R + K+A I + ++ KH E+ L S
Sbjct: 132 RNNFVAIAQAPRERLVDAIRSGGLANKKAATIQNLLHSIRGKH---GEYSLQHLAAAESS 188
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
R++ E + L + GVGPK A+C+ LF L + ++ VDTHV++++
Sbjct: 189 GRRMSDDEIMKELISYDGVGPKTASCVLLFCLGR-NSFAVDTHVFRLS 235
>gi|348536721|ref|XP_003455844.1| PREDICTED: endonuclease III-like protein 1-like [Oreochromis
niloticus]
Length = 419
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KY+ T +LQ + G D+ +++ L LPGVGPK+A
Sbjct: 222 LIYPVGFWRNKVKYLKLTSAMLQKEFGG-----------DIPNSVEGLVRLPGVGPKMAH 270
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ DQ I VDTHV +I+ R
Sbjct: 271 LAMDIAWDQVSGIGVDTHVHRISNR 295
>gi|448561313|ref|ZP_21634665.1| endonuclease III [Haloferax prahovense DSM 18310]
gi|445721545|gb|ELZ73213.1| endonuclease III [Haloferax prahovense DSM 18310]
Length = 268
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 213 SAPQSSLLFSVRRSF---KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
+A + ++ ++R +++K I G + + + A++ +R+ D L +
Sbjct: 94 AADREGIVEAIRSGGLYNQKSKLIQGVAEDVLADFGSEADFDRYVREEVPATVRDRLLEM 153
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
GVGPK A C+ LF+ + PVDTHV +IA R L
Sbjct: 154 KGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGL 190
>gi|328720736|ref|XP_001949525.2| PREDICTED: endonuclease III-like protein 1-like [Acyrthosiphon
pisum]
Length = 280
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 24/188 (12%)
Query: 121 EGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLG 180
E S S+ + K L R +R+D V + C ++ ++V + + L
Sbjct: 41 EITSDSNWKPPNWEKTLDNIRKMRKDIVAPVDNMGCDQAADLNEPPEVVRYHVLISLMLS 100
Query: 181 NVEGFEFHEFPSLERL-----SLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITG 235
+ E + F +++RL ++ + +E + G L++ V + +YI
Sbjct: 101 SQTKDEVN-FAAMQRLKQHGLTVDNILETSDDHLG-------KLIYPVGFWKTKVQYIKR 152
Query: 236 TVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
T +L+ ++G D+ I LC LPG+GPK+A + ++ I VDT
Sbjct: 153 TTRILKDTYNG-----------DIPNTIKDLCQLPGIGPKMAHLCMSCAWNEVTGIGVDT 201
Query: 296 HVWKIATR 303
HV +I+ R
Sbjct: 202 HVHRISNR 209
>gi|374850043|dbj|BAL53042.1| HhH-GPD family protein [uncultured Chloroflexi bacterium]
Length = 235
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 2/95 (2%)
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS-KHSGGAEWLLSLRKLDLQEAIDALCT 268
R AP ++ ++R + + VL+ GA L L L L+EA L
Sbjct: 72 AVRDAPLDEVIEAIRLAGLAQQKAPRLQQVLREITQQRGALELDFLADLPLEEARRWLMQ 131
Query: 269 LPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
GVGPK AA + LFSL + A PVDTH+++++ R
Sbjct: 132 FKGVGPKTAAIVLLFSLGR-PAFPVDTHIYRVSGR 165
>gi|281342124|gb|EFB17708.1| hypothetical protein PANDA_016228 [Ailuropoda melanoleuca]
Length = 266
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 42/93 (45%), Gaps = 11/93 (11%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ + L LPGVGPK+A
Sbjct: 128 TLIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPATVAELVALPGVGPKMA 176
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
+ I VDTHV +IA R E A
Sbjct: 177 HLAMAVAWGTVSGIAVDTHVHRIANRLGWTETA 209
>gi|219519404|gb|AAI45444.1| Nthl1 protein [Mus musculus]
Length = 280
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 143 LIYPVGFWRNKVKYIKQTTAILQQRYEG-----------DIPASVAELVALPGVGPKMAH 191
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 192 LAMAVAWGTISGIAVDTHVHRIANR 216
>gi|388855561|emb|CCF50784.1| related to NTG1-DNA repair protein [Ustilago hordei]
Length = 509
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
DL +D LC+LPGVGPK+ A +AL S+ I VDTHV ++ R
Sbjct: 322 DLPRTVDELCSLPGVGPKM-AFLALSSMGIQIGIGVDTHVHRLTNR 366
>gi|45593498|sp|O35980.1|NTHL1_MOUSE RecName: Full=Endonuclease III-like protein 1
gi|2351099|dbj|BAA22080.1| endonuclease III homologue [Mus musculus]
gi|2407946|emb|CAA70866.1| endonuclease III homologue 1 [Mus musculus]
gi|3219302|dbj|BAA28846.1| homologue of endonuclease III [Mus musculus]
gi|6688669|emb|CAB65239.1| Endonuclease III homologue 1 [Mus musculus]
gi|148690400|gb|EDL22347.1| nth (endonuclease III)-like 1 (E.coli), isoform CRA_b [Mus
musculus]
gi|187952063|gb|AAI38853.1| Nth (endonuclease III)-like 1 (E.coli) [Mus musculus]
gi|187954093|gb|AAI38856.1| Nth (endonuclease III)-like 1 (E.coli) [Mus musculus]
Length = 300
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 163 LIYPVGFWRNKVKYIKQTTAILQQRYEG-----------DIPASVAELVALPGVGPKMAH 211
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 212 LAMAVAWGTISGIAVDTHVHRIANR 236
>gi|170289841|ref|YP_001736657.1| EndoIII-related endonuclease [Candidatus Korarchaeum cryptofilum
OPF8]
gi|170173921|gb|ACB06974.1| Predicted EndoIII-related endonuclease [Candidatus Korarchaeum
cryptofilum OPF8]
Length = 223
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 228 KQAKYITGTVDVLQSKHSGGA-EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
++AKY+ ++ + SGGA E +LSL + +EA D L +PG+GPK A L SL
Sbjct: 89 QKAKYLK----LIAERLSGGALEEILSL---ETEEARDRLLEIPGIGPKTAD--VLLSLM 139
Query: 287 QHHAIPVDTHVWKIATR 303
I VD H+ ++++R
Sbjct: 140 GRETIGVDRHIARVSSR 156
>gi|297697807|ref|XP_002826033.1| PREDICTED: endonuclease III-like protein 1 [Pongo abelii]
Length = 312
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|357477261|ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
Length = 280
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 69/177 (38%), Gaps = 38/177 (21%)
Query: 132 ELAKYLAGARVLRQDPVE-------CLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEG 184
ELAKY + D VE L++ + S N A K L SL
Sbjct: 63 ELAKYRKSQQT--NDTVEPPETVLDGLVRTILSQNTTEANSNKAFASLKSL--------- 111
Query: 185 FEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKH 244
FP+ E + EL NA AP +AK I + L +
Sbjct: 112 -----FPTWEHVHGAESKELENAIRCGGLAPT------------KAKCIKNLLSCLLERK 154
Query: 245 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 301
G L LR L + E L G+GPK +C+ +F+L Q PVDTH+++IA
Sbjct: 155 --GKMCLEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNL-QLDDFPVDTHIFEIA 208
>gi|227908769|ref|NP_032769.2| endonuclease III-like protein 1 [Mus musculus]
Length = 300
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 163 LIYPVGFWRNKVKYIKQTTAILQQRYEG-----------DIPASVAELVALPGVGPKMAH 211
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 212 LAMAVAWGTISGIAVDTHVHRIANR 236
>gi|148690399|gb|EDL22346.1| nth (endonuclease III)-like 1 (E.coli), isoform CRA_a [Mus
musculus]
Length = 277
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 46/106 (43%), Gaps = 17/106 (16%)
Query: 204 LRNAGFGYRSAPQSS------LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKL 257
LR G S Q+ L++ V + KYI T +LQ ++ G
Sbjct: 119 LRARGLTVESILQTDDDTLGRLIYPVGFWRNKVKYIKQTTAILQQRYEG----------- 167
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ L LPGVGPK+A + I VDTHV +IA R
Sbjct: 168 DIPASVAELVALPGVGPKMAHLAMAVAWGTISGIAVDTHVHRIANR 213
>gi|301782301|ref|XP_002926577.1| PREDICTED: LOW QUALITY PROTEIN: endonuclease III-like protein
1-like [Ailuropoda melanoleuca]
Length = 316
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 42/93 (45%), Gaps = 11/93 (11%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ + L LPGVGPK+A
Sbjct: 175 TLIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPATVAELVALPGVGPKMA 223
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATRYLLPELA 310
+ I VDTHV +IA R E A
Sbjct: 224 HLAMAVAWGTVSGIAVDTHVHRIANRLGWTETA 256
>gi|390333159|ref|XP_001200032.2| PREDICTED: endonuclease III-like protein 1-like [Strongylocentrotus
purpuratus]
Length = 409
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 62/134 (46%), Gaps = 24/134 (17%)
Query: 183 EGFEFHEFPSL-------ERLSLVSEVELRNAGFGYRS------APQSSLLFSVRRSFKQ 229
E + +H SL ++++ + V+LR+ G + A L++ V ++
Sbjct: 215 EVYRYHVLLSLMLSSQTKDQVTSAAMVKLRSHGLTVDNILKTPEAKIGELIYPVGFWKRK 274
Query: 230 AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
A +I T +L+ ++ G D+ ++ L LPGVGPK+A + +Q
Sbjct: 275 ADFIKRTTQILKDQYQG-----------DIPPSLKELIQLPGVGPKMAHIVMDVGWNQIT 323
Query: 290 AIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 324 GIGVDTHVHRISNR 337
>gi|390369569|ref|XP_793669.3| PREDICTED: endonuclease III-like protein 1-like [Strongylocentrotus
purpuratus]
Length = 445
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 62/134 (46%), Gaps = 24/134 (17%)
Query: 183 EGFEFHEFPSL-------ERLSLVSEVELRNAGFGYRS------APQSSLLFSVRRSFKQ 229
E + +H SL ++++ + V+LR+ G + A L++ V ++
Sbjct: 239 EVYRYHVLLSLMLSSQTKDQVTSAAMVKLRSHGLTVDNILKTPEAKIGELIYPVGFWKRK 298
Query: 230 AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHH 289
A +I T +L+ ++ G D+ ++ L LPGVGPK+A + +Q
Sbjct: 299 ADFIKRTTQILKDQYQG-----------DIPPSLKELIQLPGVGPKMAHIVMDVGWNQIT 347
Query: 290 AIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 348 GIGVDTHVHRISNR 361
>gi|383413091|gb|AFH29759.1| endonuclease III-like protein 1 [Macaca mulatta]
Length = 312
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSTILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|109127234|ref|XP_001082772.1| PREDICTED: nth endonuclease III-like 1 [Macaca mulatta]
Length = 312
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSTILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|312136836|ref|YP_004004173.1| endonuclease iii ;DNA-(apurinic or apyrimidinic site) lyase
[Methanothermus fervidus DSM 2088]
gi|311224555|gb|ADP77411.1| endonuclease III ;DNA-(apurinic or apyrimidinic site) lyase
[Methanothermus fervidus DSM 2088]
Length = 209
Score = 45.1 bits (105), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 55/121 (45%), Gaps = 32/121 (26%)
Query: 187 FHEFPSLERLSLVSEVELRN----AGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQS 242
F ++ ++E+++ E +L GF YR +AK I +L +
Sbjct: 53 FSKYDTIEKIANAQEKDLEKLIKCVGF-YRV---------------KAKRIKKISKILIN 96
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
K+ G + + + L LPGVG K A C+ ++ ++ AIPVDTHV ++A
Sbjct: 97 KYDGK-----------VPKNLKELLKLPGVGRKTANCVLVYGFNED-AIPVDTHVHRVAN 144
Query: 303 R 303
R
Sbjct: 145 R 145
>gi|298252264|ref|ZP_06976067.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
gi|297546856|gb|EFH80724.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
Length = 242
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 175 LGSHLGNVE-GFEFHEFPSLERLSLVSEVE----------LRNAGFGYRSAPQSSLLFSV 223
L H ++ G +H+ +ER S EV +++ G AP+
Sbjct: 42 LSQHTSDINSGRAYHQL--IERFSTWEEVRDAPTHEVAEAIKSGGLANVKAPRIQSALHT 99
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
+++AK T ++ L+ L+EA L +PGVGPK AAC+ LF
Sbjct: 100 LSEWQRAKGDTRSLSAFLQD---------ELKGQPLEEAWRYLQQMPGVGPKTAACVLLF 150
Query: 284 SLDQHHAIPVDTHVWKIATR 303
++ + +P+DTH+ ++ R
Sbjct: 151 NMGR-PLMPIDTHLHRLTHR 169
>gi|380790641|gb|AFE67196.1| endonuclease III-like protein 1 [Macaca mulatta]
Length = 312
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSTILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|402907295|ref|XP_003916412.1| PREDICTED: endonuclease III-like protein 1 [Papio anubis]
Length = 312
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSTILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|444727310|gb|ELW67811.1| Endonuclease III-like protein 1 [Tupaia chinensis]
Length = 258
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 41/85 (48%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ +++G D+ ++ L LPGVGPK+A
Sbjct: 121 LIYPVGFWRSKVKYIKQTSAILQQRYAG-----------DIPASVAELVALPGVGPKMAH 169
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 170 LAMAVAWGTVSGIAVDTHVHRIANR 194
>gi|91088451|ref|XP_968911.1| PREDICTED: similar to predicted protein [Tribolium castaneum]
Length = 283
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + K+I ++L+++++G D+ ++ LC LPGVGPK+A
Sbjct: 142 LIYPVGFWKTKVKHIKKASEILKNEYNG-----------DIPRTVEDLCKLPGVGPKMAN 190
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ ++ I VDTHV +I+ R
Sbjct: 191 LCMKTAWNEVTGIGVDTHVHRISNR 215
>gi|387015722|gb|AFJ49980.1| Endonuclease III-like protein 1-like [Crotalus adamanteus]
Length = 282
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V +A+Y+ T +L+ ++G D+ + + L LPGVGPK+A
Sbjct: 145 LIYPVGFWRNKARYVQQTTAILKRDYAG-----------DIPQTVAELVKLPGVGPKMAH 193
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ + + I VDTHV +I R
Sbjct: 194 LVMDIAWKKASGIGVDTHVHRICNR 218
>gi|150391360|ref|YP_001321409.1| endonuclease III [Alkaliphilus metalliredigens QYMF]
gi|149951222|gb|ABR49750.1| endonuclease III [Alkaliphilus metalliredigens QYMF]
Length = 216
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 28/119 (23%)
Query: 187 FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSG 246
F ++P+ ER+ ++EVEL G +S ++ ++K I T +L KH G
Sbjct: 60 FEKYPTPERILTLTEVEL---GQWIKSCGFYNM---------KSKNILATCHLLMEKHGG 107
Query: 247 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA--LFSLDQHHAIPVDTHVWKIATR 303
++ E +AL LPGVG K A + +F D AI VDTHV++++ R
Sbjct: 108 -----------EVPEEREALMALPGVGRKTANVVISNVFGQD---AIAVDTHVFRVSNR 152
>gi|440913510|gb|ELR62959.1| Endonuclease III-like protein 1, partial [Bos grunniens mutus]
Length = 316
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 178 ALIYPVGFWRSKVKYIKQTSAILQQRYDG-----------DIPASVAELVALPGVGPKMA 226
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 227 HLAMAVAWGTVSGIAVDTHVHRIANR 252
>gi|432847899|ref|XP_004066205.1| PREDICTED: endonuclease III-like protein 1-like [Oryzias latipes]
Length = 416
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KY+ T +LQ + G D+ ++ L LPGVGPK+A
Sbjct: 227 TLIYPVGFWRNKVKYLKKTSAILQEQFGG-----------DIPSDVEGLVRLPGVGPKMA 275
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ +Q I VDTHV +I+ R
Sbjct: 276 HLAMDIAWNQVSGIGVDTHVHRISNR 301
>gi|167043370|gb|ABZ08073.1| putative HhH-GPD superfamily base excision DNA repair protein
[uncultured marine crenarchaeote HF4000_ANIW141O9]
Length = 217
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ID L +PGVG K A C+ +++ ++ AIPVDTHV +I+ R
Sbjct: 113 IDKLVEIPGVGRKTANCVLVYAFEK-PAIPVDTHVHRISNR 152
>gi|425773861|gb|EKV12186.1| HhH-GPD family base excision DNA repair protein [Penicillium
digitatum PHI26]
gi|425782437|gb|EKV20346.1| HhH-GPD family base excision DNA repair protein [Penicillium
digitatum Pd1]
Length = 512
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L L L +EA+ L PG+GPK AAC+ LF L Q VDTH++++
Sbjct: 344 LNHLHNLSTEEAMTNLIKYPGIGPKTAACVILFCL-QRPCFAVDTHIFRL 392
>gi|114051958|ref|NP_001039862.1| endonuclease III-like protein 1 [Bos taurus]
gi|109892805|sp|Q2KID2.1|NTHL1_BOVIN RecName: Full=Endonuclease III-like protein 1
gi|86826431|gb|AAI12682.1| Nth endonuclease III-like 1 (E. coli) [Bos taurus]
gi|296473490|tpg|DAA15605.1| TPA: nth endonuclease III-like 1 [Bos taurus]
Length = 305
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 167 ALIYPVGFWRSKVKYIKQTSAILQQRYDG-----------DIPASVAELVALPGVGPKMA 215
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 216 HLAMAVAWGTVSGIAVDTHVHRIANR 241
>gi|389861151|ref|YP_006363391.1| HhH-GPD family protein [Thermogladius cellulolyticus 1633]
gi|388526055|gb|AFK51253.1| HhH-GPD family protein [Thermogladius cellulolyticus 1633]
Length = 227
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 11/80 (13%)
Query: 224 RRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALF 283
RR + A Y+ ++ +S+ +R++D+ EA + L LPGVG K A + L
Sbjct: 92 RRIVELASYLRDRLEEFESE----------VRRMDVLEAREVLMNLPGVGDKTADVVLLV 141
Query: 284 SLDQHHAIPVDTHVWKIATR 303
+ A PVDTH+ +I TR
Sbjct: 142 YFGK-PAFPVDTHIKRITTR 160
>gi|358381882|gb|EHK19556.1| hypothetical protein TRIVIDRAFT_24735, partial [Trichoderma virens
Gv29-8]
Length = 274
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 237 VDVL-QSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDT 295
+++L Q+K G+ L L A+ L + GVGPK A+C+ LF L Q + PVDT
Sbjct: 138 INILRQTKAKYGSYSLDHLLHASNDSAMRELLSFQGVGPKTASCVLLFCL-QRDSFPVDT 196
Query: 296 HVWKI 300
HV++I
Sbjct: 197 HVYRI 201
>gi|357124677|ref|XP_003564024.1| PREDICTED: endonuclease III-like [Brachypodium distachyon]
Length = 277
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ GA L LR+L + E L G+GPK AC+ +F L Q PVDTHV +I
Sbjct: 145 RERRGAICLEYLRELSVDEVKRELSQFKGIGPKTMACVLMFYL-QKDDFPVDTHVLRI 201
>gi|194219367|ref|XP_001915373.1| PREDICTED: LOW QUALITY PROTEIN: endonuclease III-like protein
1-like [Equus caballus]
Length = 312
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRNKVKYIKQTSAILQQRYDG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|154302382|ref|XP_001551601.1| hypothetical protein BC1G_09975 [Botryotinia fuckeliana B05.10]
Length = 548
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 40/71 (56%), Gaps = 12/71 (16%)
Query: 239 VLQSKHSGGAEWLLSLRK----------LDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
VL++K AE + SLR+ LD A+ L LPG+G K AAC+ALF L +
Sbjct: 338 VLETKEQKEAE-IKSLRENMLSLDYVHTLDKPAAMRVLMDLPGIGVKTAACVALFCLGR- 395
Query: 289 HAIPVDTHVWK 299
+ VDTHVW+
Sbjct: 396 PSFAVDTHVWR 406
>gi|426380776|ref|XP_004057037.1| PREDICTED: endonuclease III-like protein 1 [Gorilla gorilla
gorilla]
Length = 312
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>gi|347828944|emb|CCD44641.1| hypothetical protein [Botryotinia fuckeliana]
Length = 548
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 40/71 (56%), Gaps = 12/71 (16%)
Query: 239 VLQSKHSGGAEWLLSLRK----------LDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
VL++K AE + SLR+ LD A+ L LPG+G K AAC+ALF L +
Sbjct: 338 VLETKEQKEAE-IKSLRENMLSLDYVHTLDKPAAMRVLMDLPGIGVKTAACVALFCLGR- 395
Query: 289 HAIPVDTHVWK 299
+ VDTHVW+
Sbjct: 396 PSFAVDTHVWR 406
>gi|400601349|gb|EJP68992.1| base excision DNA repair protein [Beauveria bassiana ARSEF 2860]
Length = 361
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 28/40 (70%), Gaps = 1/40 (2%)
Query: 261 EAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
EA++ L PGVGPK A+CI LF L Q + VDTHV+++
Sbjct: 260 EAMEELLAFPGVGPKTASCILLFCL-QRPSFAVDTHVYRL 298
>gi|311251810|ref|XP_003124777.1| PREDICTED: endonuclease III-like protein 1-like [Sus scrofa]
Length = 312
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+
Sbjct: 173 GTLIYPVGFWRSKVKYIKQTSAILQQRYGG-----------DIPASVPELVALPGVGPKM 221
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
A + I VDTHV +IA R
Sbjct: 222 AHLAMAVAWGTVSGIAVDTHVHRIAGR 248
>gi|358388627|gb|EHK26220.1| hypothetical protein TRIVIDRAFT_17977, partial [Trichoderma virens
Gv29-8]
Length = 249
Score = 45.1 bits (105), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 11/88 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ ++++V + KYI T ++L+ + +G D+ + ++ L +LPGVGPK+
Sbjct: 101 NEMIWAVGFHNNKTKYIKKTAEILRDEWNG-----------DIPDTVEGLTSLPGVGPKM 149
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATRY 304
A + ++ I VD HV +I +
Sbjct: 150 AYLCLSVAWNRTEGIGVDVHVHRITNMW 177
>gi|270011744|gb|EFA08192.1| hypothetical protein TcasGA2_TC005819 [Tribolium castaneum]
Length = 266
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
L++ V + K+I ++L+++++G D+ ++ LC LPGVGPK+A
Sbjct: 124 ELIYPVGFWKTKVKHIKKASEILKNEYNG-----------DIPRTVEDLCKLPGVGPKMA 172
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ ++ I VDTHV +I+ R
Sbjct: 173 NLCMKTAWNEVTGIGVDTHVHRISNR 198
>gi|308811190|ref|XP_003082903.1| putative endonuclease (ISS) [Ostreococcus tauri]
gi|116054781|emb|CAL56858.1| putative endonuclease (ISS), partial [Ostreococcus tauri]
Length = 820
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A+Y+ T +L ++ G D+ +++ LC LPGVGPK+A + +
Sbjct: 370 RKAQYLRATAKILLDEYDG-----------DIPPSVETLCALPGVGPKMAYLVMNVGWGE 418
Query: 288 HHAIPVDTHVWKIATR--YLLPELAGVRLTPK 317
I VD HV +I+ R ++ ++ G +P+
Sbjct: 419 PTGICVDVHVHRISERLGWVAKDVMGKNGSPR 450
>gi|76802666|ref|YP_330761.1| repair DNA N-glycosylase / DNA-(apurinic or apyrimidinic site)
lyase [Natronomonas pharaonis DSM 2160]
gi|76558531|emb|CAI50123.1| endonuclease III [Natronomonas pharaonis DSM 2160]
Length = 268
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 27/47 (57%)
Query: 260 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLL 306
E + L L GVGPK A C+ LFS + PVDTHV +IA R L
Sbjct: 142 DEVRETLLELNGVGPKTADCVLLFSGGRDGVFPVDTHVHRIARRMGL 188
>gi|342180669|emb|CCC90145.1| putative 8-oxoguanine DNA glycosylase [Trypanosoma congolense
IL3000]
Length = 267
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 16/82 (19%)
Query: 106 LLDFLNMGISLGELWEGFSASD-------------CRFAELAKYLAGARVLRQDPVECLL 152
L +L + I L ++W ++ + C ++L + R LRQD E LL
Sbjct: 120 LRHYLALDIDLEKMWRRWTKENPMGNHPLVRALISCGLSDLPVSI---RHLRQDLHETLL 176
Query: 153 QFLCSSNNNIARITKMVDFLAS 174
FLCS NNNI RIT +V+ LA+
Sbjct: 177 TFLCSQNNNIQRITGLVERLAA 198
>gi|358400221|gb|EHK49552.1| hypothetical protein TRIATDRAFT_26345, partial [Trichoderma
atroviride IMI 206040]
Length = 268
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 241 QSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
Q+K G L L EA+ L + GVGPK A+C+ LF L Q + VDTHV++I
Sbjct: 137 QTKDQYGTYSLNHLLNASNDEAMRELLSFQGVGPKTASCVLLFCL-QRDSFAVDTHVYRI 195
Query: 301 A 301
+
Sbjct: 196 S 196
>gi|340923756|gb|EGS18659.1| hypothetical protein CTHT_0052650 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 471
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 13/74 (17%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA-CI-ALFSLD 286
+ KY+ T ++L+ K G D+ ++ LC+LPGVGPK+A C+ A + +
Sbjct: 277 KTKYLKQTAEILRDKFGG-----------DIPPTVETLCSLPGVGPKMAHLCMSATYGWN 325
Query: 287 QHHAIPVDTHVWKI 300
+ I VD HV +I
Sbjct: 326 KVEGIGVDVHVHRI 339
>gi|145356922|ref|XP_001422672.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582915|gb|ABP00989.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 273
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A+Y+ T +L ++ G D+ +++ LC LPGVGPK+A + +
Sbjct: 114 RKAQYLRATAKILLDEYDG-----------DIPSSVETLCALPGVGPKMAYLVMNVGWQK 162
Query: 288 HHAIPVDTHVWKIATR 303
I VD HV +I R
Sbjct: 163 PTGICVDVHVHRITER 178
>gi|326510563|dbj|BAJ87498.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 282
Score = 44.7 bits (104), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 1/58 (1%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ GA L LR L + E L G+GPK AC+ +F L Q PVDTHV +I
Sbjct: 150 REKRGAICLEYLRDLSVDEVKRELSQFKGIGPKTVACVLMFYL-QKDDFPVDTHVLRI 206
>gi|301105551|ref|XP_002901859.1| endonuclease III, HhH-GPD superfamily base excision DNA repair,
putative [Phytophthora infestans T30-4]
gi|262099197|gb|EEY57249.1| endonuclease III, HhH-GPD superfamily base excision DNA repair,
putative [Phytophthora infestans T30-4]
Length = 287
Score = 44.7 bits (104), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++D L LPG+GPK+ I L + Q I VDTHV ++A R
Sbjct: 155 DIPRSLDELLQLPGIGPKIGRVITLLAWGQVDGIVVDTHVHRLAQR 200
>gi|221058032|ref|XP_002261524.1| endonuclease iii homologue [Plasmodium knowlesi strain H]
gi|194247529|emb|CAQ40929.1| endonuclease iii homologue, putative [Plasmodium knowlesi strain H]
Length = 396
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ L LPG+G KVA I +LD H I VD HV +I+ R
Sbjct: 279 DIPHTLEGLLELPGIGQKVAHLILQTALDTHEGIAVDIHVHRISNR 324
>gi|195351989|ref|XP_002042498.1| GM23290 [Drosophila sechellia]
gi|194124367|gb|EDW46410.1| GM23290 [Drosophila sechellia]
Length = 378
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+LL V +AKY+ TVD+L K+ D+ + + L LPGVGPK+
Sbjct: 230 ENLLHPVSFYKNKAKYLKQTVDILMDKYGS-----------DIPDNVKDLVALPGVGPKM 278
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
A + ++ I VD HV +++ R
Sbjct: 279 AHICMAVAWNKITGIGVDVHVHRLSNR 305
>gi|326436846|gb|EGD82416.1| hypothetical protein PTSG_03059 [Salpingoeca sp. ATCC 50818]
Length = 495
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 42/75 (56%), Gaps = 3/75 (4%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+++ I +D L ++H G L LR+L + L GVGPK +C+ LF++ Q
Sbjct: 295 KSRRIKSILDTLHAEH--GKLSLEYLRELPSHTVKEQLARFKGVGPKTISCLLLFAM-QR 351
Query: 289 HAIPVDTHVWKIATR 303
+ VDTHV+++A R
Sbjct: 352 PDMAVDTHVFRLAKR 366
>gi|195580555|ref|XP_002080101.1| GD21665 [Drosophila simulans]
gi|194192110|gb|EDX05686.1| GD21665 [Drosophila simulans]
Length = 388
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 11/87 (12%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+LL V +AKY+ TVD+L K+ D+ + + L LPGVGPK+
Sbjct: 240 ENLLHPVSFYKNKAKYLKQTVDILMDKYGS-----------DIPDNVKDLVALPGVGPKM 288
Query: 277 AACIALFSLDQHHAIPVDTHVWKIATR 303
A + ++ I VD HV +++ R
Sbjct: 289 AHICMAVAWNKITGIGVDVHVHRLSNR 315
>gi|55378639|ref|YP_136489.1| endonuclease III [Haloarcula marismortui ATCC 43049]
gi|55231364|gb|AAV46783.1| endonuclease III [Haloarcula marismortui ATCC 43049]
Length = 278
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%)
Query: 243 KHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
++ G A + +R D + L + GVGPK A C+ LF+ + PVDTHV +IA
Sbjct: 138 EYGGEAGFDAFVRDSDPEAVRSTLLDMNGVGPKTADCVLLFAGGRGGVFPVDTHVHRIAR 197
Query: 303 RYLL 306
R L
Sbjct: 198 RMGL 201
>gi|355756457|gb|EHH60065.1| Endonuclease III-like protein 1 [Macaca fascicularis]
Length = 245
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 108 LIYPVGFWRSKVKYIKQTSTILQQRYGG-----------DIPASVAELVALPGVGPKMAH 156
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 157 LAMAVAWGTVSGIAVDTHVHRIANR 181
>gi|156563964|dbj|BAF76070.1| Escherichia coli endonuclease III-like 1 [Gallus gallus]
Length = 281
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 18/108 (16%)
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L++ S +++ +A G +++ V + KYI T +L+ K+ G
Sbjct: 128 LTVDSILQMDDATLG-------QIIYPVGFWRNKVKYIKQTTAILKQKYGG--------- 171
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ L LPGVGPK+A + + I VDTHV +I R
Sbjct: 172 --DIPGTVEELVKLPGVGPKMAHLAMNIAWNSVSGIAVDTHVHRITNR 217
>gi|237807732|ref|YP_002892172.1| A/G-specific adenine glycosylase [Tolumonas auensis DSM 9187]
gi|237499993|gb|ACQ92586.1| A/G-specific adenine glycosylase [Tolumonas auensis DSM 9187]
Length = 363
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 14/98 (14%)
Query: 213 SAPQSSLL--FSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLP 270
APQ +L ++ + +A+ + V++ KH+G E D + LP
Sbjct: 80 DAPQDEVLHLWTGLGYYARARNLHKAAQVIRDKHNGS-----------FPETFDEVADLP 128
Query: 271 GVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPE 308
G+G A I SL QHHAI +D +V ++ TR+L E
Sbjct: 129 GIGRSTAGAILSLSLKQHHAI-LDGNVKRVLTRWLALE 165
>gi|403178239|ref|XP_003336685.2| hypothetical protein PGTG_18481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375164069|gb|EFP92266.2| hypothetical protein PGTG_18481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 424
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%), Gaps = 1/47 (2%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
LR L EA++ L + GVG K A+C+ LF L ++ PVDTHV++I
Sbjct: 187 LRYLPDSEAMETLTSFKGVGAKTASCVLLFCLGRNF-FPVDTHVFRI 232
>gi|448671097|ref|ZP_21687089.1| endonuclease III [Haloarcula amylolytica JCM 13557]
gi|445766179|gb|EMA17313.1| endonuclease III [Haloarcula amylolytica JCM 13557]
Length = 278
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 40/79 (50%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++++ I + ++ G A + +R D + L + GVGPK A C+ LF+ +
Sbjct: 123 QKSERIIALAQRICEEYGGEAGFDEFVRDSDPEAVRSTLLDMNGVGPKTADCVLLFAGGR 182
Query: 288 HHAIPVDTHVWKIATRYLL 306
PVDTHV +IA R L
Sbjct: 183 GGVFPVDTHVHRIARRMGL 201
>gi|310796892|gb|EFQ32353.1| HhH-GPD superfamily base excision DNA repair protein [Glomerella
graminicola M1.001]
Length = 469
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 11/84 (13%)
Query: 217 SSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 276
+ L+++V + KYI ++L+ + G D+ + I+ L +LPGVGPK+
Sbjct: 242 NELIWAVGFHNNKTKYIKAAAEILRDRFDG-----------DIPDTIEGLTSLPGVGPKM 290
Query: 277 AACIALFSLDQHHAIPVDTHVWKI 300
A + D+ I VD HV +I
Sbjct: 291 AYLCLSAAWDRTEGIGVDVHVHRI 314
>gi|393796716|ref|ZP_10380080.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum limnia BG20]
Length = 175
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 12/75 (16%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
++K I ++ SK+ G + + ++ L LPGVG K A C+ +++ D+
Sbjct: 48 KSKRIIEVAKIINSKYKGK-----------VPDNLEKLVELPGVGRKTANCVLVYAFDK- 95
Query: 289 HAIPVDTHVWKIATR 303
AIPVD HV +I+ R
Sbjct: 96 PAIPVDIHVHRISNR 110
>gi|118601744|ref|NP_001073043.1| endonuclease III-like protein 1 [Gallus gallus]
gi|118341820|dbj|BAF37123.1| Escherichia coli endonuclease III-like 1 [Gallus gallus]
Length = 281
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 18/108 (16%)
Query: 196 LSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLR 255
L++ S +++ +A G +++ V + KYI T +L+ K+ G
Sbjct: 128 LTVDSILQMDDATLG-------QIIYPVGFWRNKVKYIKQTTAILKQKYGG--------- 171
Query: 256 KLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ ++ L LPGVGPK+A + + I VDTHV +I R
Sbjct: 172 --DIPGTVEELVKLPGVGPKMAHLAMNIAWNSVSGIAVDTHVHRITNR 217
>gi|332240056|ref|XP_003269206.1| PREDICTED: endonuclease III-like protein 1 [Nomascus leucogenys]
Length = 312
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 229 QAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
+ KYI T +LQ ++ G D+ ++ L LPGVGPK+A +
Sbjct: 185 KVKYIKQTSAILQQRYGG-----------DIPASVAELVALPGVGPKMAHLAMAVAWGSV 233
Query: 289 HAIPVDTHVWKIATR 303
I VDTHV +IA R
Sbjct: 234 SGIAVDTHVHRIANR 248
>gi|70984374|ref|XP_747701.1| HhH-GPD family base excision DNA repair protein [Aspergillus
fumigatus Af293]
gi|66845328|gb|EAL85663.1| HhH-GPD family base excision DNA repair protein [Aspergillus
fumigatus Af293]
Length = 470
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L L L ++A+ L PG+GPK AAC+ LF L Q VDTH+++I
Sbjct: 320 LNHLHTLSTEDAMTELVKYPGIGPKTAACVILFCL-QRPCFAVDTHIFRI 368
>gi|195385699|ref|XP_002051542.1| GJ16118 [Drosophila virilis]
gi|194147999|gb|EDW63697.1| GJ16118 [Drosophila virilis]
Length = 353
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 77/176 (43%), Gaps = 28/176 (15%)
Query: 135 KYLAGARVLRQD---PVECLLQFLCS---SNNNIARITKMVDFLASLGSHLGNVEGFEFH 188
++L R++R + PV+ L C+ +N R K+V + L S + FE
Sbjct: 130 QHLENIRLMRCEKPAPVDTLGCHQCADQNANEKTQRFHKLVALM--LSSQTKDETTFE-- 185
Query: 189 EFPSLERLSLVSEVELRNAGF-GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGG 247
+++RL + L A G + LL V +AKY+ T +L K++
Sbjct: 186 ---AMKRLKAQT---LTPASIQGMPAVELERLLHPVSFYKNKAKYLKQTSQILVDKYNE- 238
Query: 248 AEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
D+ + I L LPGVGPK+A + ++ I VDTHV +IA R
Sbjct: 239 ----------DIPDNIQELLKLPGVGPKMAHICMATAWNKITGIGVDTHVHRIANR 284
>gi|159122487|gb|EDP47608.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 470
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 251 LLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L L L ++A+ L PG+GPK AAC+ LF L Q VDTH+++I
Sbjct: 320 LNHLHTLSTEDAMTELVKYPGIGPKTAACVILFCL-QRPCFAVDTHIFRI 368
>gi|449298566|gb|EMC94581.1| hypothetical protein BAUCODRAFT_73567 [Baudoinia compniacensis UAMH
10762]
Length = 452
Score = 44.7 bits (104), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 257 LDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
L AI+ + + PG+GPK A+C+ALF L Q + VDTHV+++
Sbjct: 283 LSTSAAIEKMLSYPGIGPKTASCVALFCL-QRPSFAVDTHVFRL 325
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,428,675,034
Number of Sequences: 23463169
Number of extensions: 230238087
Number of successful extensions: 1299383
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1027
Number of HSP's successfully gapped in prelim test: 655
Number of HSP's that attempted gapping in prelim test: 1295505
Number of HSP's gapped (non-prelim): 2378
length of query: 328
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 186
effective length of database: 9,027,425,369
effective search space: 1679101118634
effective search space used: 1679101118634
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)